Skip Navigation
Administration for Children and Families  
ACF
ACF Home   |   Services   |   Working with ACF   |   Policy/Planning   |   About ACF   |   ACF News   |   HHS Home

  Questions?  |  Privacy  |  Site Index  |  Contact Us  |  Download Reader™  |  Print      

Office of Planning, Research & Evaluation (OPRE) skip to primary page content
Advanced
Search

 Table of Contents | Previous | Next

Technical Report #1

Creation of the Data Set for Analysis

The basic data set used for analysis in this report has been the product of a series of critical decisions made and processes completed over the course of four years. Each step and decision was based on careful review of the data themselves as well as other available information, consideration of alternatives and implications, and often consultation with key advisors within ACYF and the National Transition Demonstration Study’s Technical Work Group.

Because the National Transition Demonstration Study incorporated data obtained from a variety of informants (children, families, teachers, principals, school records, national databases), the construction of the final data set is based on the concept of family units. Unique seven-digit family/child identification numbers were assigned to each child participating in the study, indicating: (10 the cohort (one of 2); (2) the site (one of 31); (3) Head Start affiliation (former Head Start or non-Head Start); (4) unique family affiliation (a 3-digit identifier); and (5) whether or not the child had a sibling in the study (a single digit identifier). All data that were associated with a given child (i.e., from the family interview, child assessments, child-specific teacher questionnaire, and information from school records) was linked to the child, on all relevant forms, via this family/child identification number. A family unit, then, was defined as a child/family unit (defined by the unique identification number) that had, in kindergarten (at baseline), some associated data -- either a family interview, a child assessment, a teacher questionnaire, or information from the school records. Any combination of the four data sources could exist for the family unit at any given time point throughout the four-year longitudinal follow-up period. In April 1995, at the midpoint of the study, a total of 12, 067 family units were involved in the National Transition Demonstration Study.

A series of data clarification processes were completed in the ensuing four years, each resulting in some modification to the number of observations within the data set. The steps of clarification and the consequences in terms of lost cases (totaling 1,238 cases) are outlined in Table 1 below. The deletion of one site from the analysis sample (because of failure to submit adequate documentation of random assignment of schools) led to the largest reduction in sample size (a total of 785 cases removed from the analysis sample). The additional steps together removed total of a 435 cases from the analysis sample. The final analysis set, then, includes 7,515 cases -- 90 percent of the original sample.

Table 1.
Steps taken in clarification of data set and resulting loss of cases available for analysis
Decision/activity
Cases remaining
Began with: 12,067
Clarification of family/child ID numbers, consolidation of records in baseline, correction of cohort, or correction of
Head Start status: total loss = 248 cases (2% of family units)
11,819
Removal of family units whose initial school was not included among study schools (primarily one site carrying a third local cohort): total loss = 82 cases (0.8% of family units) 11,737
Removal of family units whose only source of information at baseline was the Information from School Records: total loss = 85 cases (0.8% of family units) 11,652
Removal of family units with no baseline information about treatment condition of school: total loss = 38 cases (0.3% of family units) 11,614
Removal of family units from one site that did not provide adequate documentation of random assignment of schools: total loss = 785 cases (468 cohort 1, 317 cohort 2) 10,829

 

Of these 10,829 family units included in the final data analysis set, 7,515 were former Head Start participants and the remainder (3,314) did not participate in Head Start prior to enrolling in kindergarten. Thus, the final analysis set used as a basis for child and family analyses reported in this final report includes the 7,515 former Head Start children and families.

There was some variability in the completeness of data within and across rounds for a given family unit. For example, a family interview could be present but the child assessment and teacher questionnaire missing, or the family unit could have data available in kindergarten but not in the following year. Table 2 summarizes the availability of data forms of certain types across rounds of data collection, singly and in key combinations.

Table 2. Data available by type and by years (data collection points)
Source of Data Record Years in School
1 2 3 4 3 or 4
Child Assessment 7325 6065 5788 5740 6193
Family Interview 7078 5493 5284 5196 5903
Information from School Records 6535 5675 5383 5390 5900
Teacher, Part B 6501 5331 4821 4572 5445
Child Assessment and Family Interview 6894 5289 5037 4907 4276
Child Assessment and Information from School Records 6435 5460 5153 5145 4606
Child Assessment and Teacher B 6412 5188 4714 4484 3854
Family Interview and Information from School Records 4945 4616 4583 3901 6253
Family Interview and Teacher B 6208 4688 4229 4015 3308
Information from School Records and Teacher B 6054 5037 4620 4438 3800
Child Assessment, Family Interview and Information from School Records 6154 4859 4539 4452 5200
Child Assessment, Family Interview and Teacher B 6124 4609 4177 3962 4886
Child Assessment, Information from School Records, and Teacher B 5991 4947 4560 4385 5206
Family Interview, Information from School Records, and Teacher B 5798 4492 4091 3929 4807


 

 

 Table of Contents | Previous | Next