Careful editing can make coding easier.
Codes are meant to represent the meaning in the data.
Assigning numerical symbols permits the transfer of data from questionnaires or interview
forms to a computer.
Codes often are, but not always, numerical symbols; however, they are more broadly defined
as rules for interpreting, classifying, and recording the data.
In qualitative research, numbers are seldom used for codes.
Coding Qualitative Responses
Unstructured Qualitative Responses (Long Interview)
Qualitative coding was introduced in Chapter 7 (i.e., hermeneutic unit, network, or
grounded theory).
The codes are usually words or phrases that represent themes.
Structured Qualitative Responses
Qualitative responses to structured questions (i.e., yes/no) can be stored in a data file
with letters (i.e., “Y” or “N”) or as numbers, but even though numbers may be used,
the variable is classificatory simply separating the positive from the negative
responses.
The researcher may consider adopting dummy coding for dichotomous responses
(i.e., yes/no) that assigns a “0” to one category and a “1” to the other.
Dummy coding provides the researcher with more flexibility in how structured,
qualitative responses are analyzed statistically.
Because a dummy variable can only represent two categories, multiple dummy
variables are needed to represent a single qualitative response that can take on more
than two categories.
The rule is that if k is the number of categories for a qualitative variable, k-1 dummy
variables are needed to represent the variable.
Data File Terminology
Most terminology describing files goes back to the early days of computers, which
produced results that were stored on actual computer cards.
Researchers organize coded data into fields, records, and files.
A field is a collection of characters (a character is a single number, letter, or special
symbol such as a question mark) that represents a single type of data, usually a
variable.
Text variables are represented by string characters which is computer terminology
for a series of alphabetic characters (nonnumeric characters) that may form a word.
String characters often contain long fields of 8 or more characters.
In contrast, a dummy variable is a numeric variable that needs only 1 character to
form a field.
A record is a collection of related fields, and was the way a single, complete
computer card was represented.
Researchers may use the term to refer to one respondent’s data.
A data file is a collection of related records that make up a data set.
Value labels are extremely useful and allow a word or short phrase to be associated
with numeric coding.