Table of Contents | Previous | Next |
Technical Report #1
Creation of the Data Set for Analysis
The basic data set used for analysis in this report has been the product of a series of critical decisions made and processes completed over the course of four years. Each step and decision was based on careful review of the data themselves as well as other available information, consideration of alternatives and implications, and often consultation with key advisors within ACYF and the National Transition Demonstration Study’s Technical Work Group.
Because the National Transition Demonstration Study incorporated data obtained from a variety of informants (children, families, teachers, principals, school records, national databases), the construction of the final data set is based on the concept of family units. Unique seven-digit family/child identification numbers were assigned to each child participating in the study, indicating: (10 the cohort (one of 2); (2) the site (one of 31); (3) Head Start affiliation (former Head Start or non-Head Start); (4) unique family affiliation (a 3-digit identifier); and (5) whether or not the child had a sibling in the study (a single digit identifier). All data that were associated with a given child (i.e., from the family interview, child assessments, child-specific teacher questionnaire, and information from school records) was linked to the child, on all relevant forms, via this family/child identification number. A family unit, then, was defined as a child/family unit (defined by the unique identification number) that had, in kindergarten (at baseline), some associated data -- either a family interview, a child assessment, a teacher questionnaire, or information from the school records. Any combination of the four data sources could exist for the family unit at any given time point throughout the four-year longitudinal follow-up period. In April 1995, at the midpoint of the study, a total of 12, 067 family units were involved in the National Transition Demonstration Study.
A series of data clarification processes were completed in the ensuing four years, each resulting in some modification to the number of observations within the data set. The steps of clarification and the consequences in terms of lost cases (totaling 1,238 cases) are outlined in Table 1 below. The deletion of one site from the analysis sample (because of failure to submit adequate documentation of random assignment of schools) led to the largest reduction in sample size (a total of 785 cases removed from the analysis sample). The additional steps together removed total of a 435 cases from the analysis sample. The final analysis set, then, includes 7,515 cases -- 90 percent of the original sample.
| Decision/activity |
Cases remaining |
|---|---|
| Began with: | 12,067 |
| Clarification of family/child ID numbers, consolidation
of records in baseline, correction of cohort, or correction of Head Start status: total loss = 248 cases (2% of family units) |
11,819 |
| Removal of family units whose initial school was not included among study schools (primarily one site carrying a third local cohort): total loss = 82 cases (0.8% of family units) | 11,737 |
| Removal of family units whose only source of information at baseline was the Information from School Records: total loss = 85 cases (0.8% of family units) | 11,652 |
| Removal of family units with no baseline information about treatment condition of school: total loss = 38 cases (0.3% of family units) | 11,614 |
| Removal of family units from one site that did not provide adequate documentation of random assignment of schools: total loss = 785 cases (468 cohort 1, 317 cohort 2) | 10,829 |
Of these 10,829 family units included in the final data analysis set, 7,515 were former Head Start participants and the remainder (3,314) did not participate in Head Start prior to enrolling in kindergarten. Thus, the final analysis set used as a basis for child and family analyses reported in this final report includes the 7,515 former Head Start children and families.
There was some variability in the completeness of data within and across rounds for a given family unit. For example, a family interview could be present but the child assessment and teacher questionnaire missing, or the family unit could have data available in kindergarten but not in the following year. Table 2 summarizes the availability of data forms of certain types across rounds of data collection, singly and in key combinations.
| Source of Data Record | Years in School | ||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 3 or 4 | |
| Child Assessment | 7325 | 6065 | 5788 | 5740 | 6193 |
| Family Interview | 7078 | 5493 | 5284 | 5196 | 5903 |
| Information from School Records | 6535 | 5675 | 5383 | 5390 | 5900 |
| Teacher, Part B | 6501 | 5331 | 4821 | 4572 | 5445 |
| Child Assessment and Family Interview | 6894 | 5289 | 5037 | 4907 | 4276 |
| Child Assessment and Information from School Records | 6435 | 5460 | 5153 | 5145 | 4606 |
| Child Assessment and Teacher B | 6412 | 5188 | 4714 | 4484 | 3854 |
| Family Interview and Information from School Records | 4945 | 4616 | 4583 | 3901 | 6253 |
| Family Interview and Teacher B | 6208 | 4688 | 4229 | 4015 | 3308 |
| Information from School Records and Teacher B | 6054 | 5037 | 4620 | 4438 | 3800 |
| Child Assessment, Family Interview and Information from School Records | 6154 | 4859 | 4539 | 4452 | 5200 |
| Child Assessment, Family Interview and Teacher B | 6124 | 4609 | 4177 | 3962 | 4886 |
| Child Assessment, Information from School Records, and Teacher B | 5991 | 4947 | 4560 | 4385 | 5206 |
| Family Interview, Information from School Records, and Teacher B | 5798 | 4492 | 4091 | 3929 | 4807 |
| Table of Contents | Previous | Next |

