Skip Navigation
acfbanner  
ACF
Department of Health and Human Services 		  
		  Administration for Children and Families
          
ACF Home   |   Services   |   Working with ACF   |   Policy/Planning   |   About ACF   |   ACF News   |   HHS Home

  Questions?  |  Privacy  |  Site Index  |  Contact Us  |  Download Reader™Download Reader  |  Print Print      

Office of Planning, Research & Evaluation (OPRE) skip to primary page content
Advanced
Search

 Table of Contents | Previous | Next

D.1 COMPARING THE BASELINE CHARACTERISTICS OF PROGRAM AND CONTROL GROUP MEMBERS

In theory, randomized experimental designs ensure that differences in the average outcomes between program and control groups can be attributed to the intervention under investigation. This rigor is possible, however, only if the random assignment process generates program and control groups with similar characteristics, on average, at the time of random assignment. Thus, the benefits of the random assignment design can be realized only if random assignment is implemented correctly and produces equivalent research groups.

We believe that the process used in the Early Head Start study to randomly assign families to the program or control groups was implemented correctly. MPR staff controlled the process, random numbers generated from a computer were used to assign the families to a research status, and, to the best of our knowledge, local programs and research staff followed the specified procedures for obtaining applicants and notifying families of their group assignment.

In this appendix, we compare the characteristics of program and control group families to check that the random assignment process was implemented correctly. First, we discuss data sources and methods and then discuss analysis results.

1. Data Sources and Methods

We used data from the Head Start Family Information System (HSFIS) application and enrollment forms for the analysis. This information was collected prior to random assignment, so neither the quality of the data nor item response should differ by research status if random assignment was conducted properly. The HSFIS data contain demographic information on families, primary caregivers, and focus children.

We used standard statistical tests to assess the similarity of the two research groups, including univariate t-tests to compare variable means for binary and continuous variables and chi-square tests to compare distributions of categorical variables. In addition, we conducted a more formal multivariate analysis to test the hypothesis that variable means and distributions are jointly similar. For this analysis, we estimated logit regression models, where the probability that a family is in the program group was regressed on the HSFIS variables; we used chi-square tests to assess whether the coefficients on these explanatory variables were jointly significant. This multivariate procedure adjusts for the fact that univariate tests are expected to produce some significant test statistics by chance, even when the program and control groups are identical. For example, if the hypothesis tests are conducted at the 10 percent level of significance, then we would expect that 10 percent of independent tests would be falsely rejected. The multivariate procedure also accounts for correlations across measures, whereas the univariate procedure assumes that the measures are independent.

For several reasons, our main approach was to conduct the analysis using the sample pooled across all 17 research sites, rather than conduct separate analyses by site. First, pooling increases the power of the statistical tests. Second, it allows us to examine more HSFIS variables, because some variables vary little within sites. Finally, and most important, we used the same random assignment procedures for each site, so that we had no reason to believe that there would be differences in results across sites. However, we also conducted the analysis separately by site for selected HSFIS variables and display p-values for these tests.

2. Analysis Results

Table D.1A displays analysis results for the sample pooled across the 17 research sites. The table displays variable distributions for the program and control groups, as well as p-values for testing differences across the two groups. Table D.1B displays p-values by site for 12 selected variables.


TABLE D.1A

COMPARISON OF THE BASELINE CHARACTERISTICS OF ALL PROGRAM AND CONTROL GROUP MEMBERS
Variable Program
Group
Control
Group
P-Value for
Testing
Differences
Site Characteristics
Program Approach     0.813
Center-based 20.2 20.6  
Home-based 46.7 45.6  
Mixed 33 33.9  
Overall Implementation Pattern     0.957
Early implementers 34.5 34.8  
Later implementers 35 35.1  
Incomplete implementers 30.5 30  
Family and Parent Characteristics
Age of Mother at Birth of Focus Child     0.803
Younger than 20 39 39.5  
20 to 25 33.2 32  
25 or older 27.9 28.5  
Mother Was Younger than 19 at First Birth 42.9 41.2 0.336
Highest Grade Completed     0.175
Less than 12 47.7 47.8  
12 or earned a GED 27.3 29.8  
More than 12 24.9 22.4  
Race and Ethnicity     0.968
White non-Hispanic 37.3 37.1  
Black non-Hispanic 34.2 35  
Hispanic 23.8 23.4  
Other (Asian or Pacific Islander, American Indian, Eskimo, Aleut) 4.7 4.5  
Primary Occupation     0.826
Employed 22.9 23.8  
In school or a training program 22 21.4  
Other 55 54.7  
English Language Ability     0.485
Primary language is English 79.9 78.1  
Primary language is not English but the applicant speaks English well 9.6 10.3  
Primary language is not English and the applicant does not speak English well 10.5 11.6  
Living Arrangements     0.762
Living with a spouse 24.9 25.4  
Living with other adults 38.3 39.1  
Living with no other adults 36.8 35.5  
Adult Male Present in the Household 38.1 39.1 0.586
Number of Adults in the Household(a)     0.804
1 37.8 36.6  
2 49.8 50.8  
3 or more 12.4 12.6  
Number of Children Less than 5 Years      
Old in the Household Other than the     0.781
Focus Child      
0 64.3 65.1  
1 27 26.8  
2 or more 8.7 8.1  
Number of Children Between 6 and 17 in the Household     0.454
0 64.3 66.4  
1 23.1 21.3  
2 or more 12.6 12.3  
Number of Moves in the Past Year     0.884
0 49.5 49.8  
1 28.9 28.1  
2 or more 21.6 22.1  
Owns Home 11 11.1 0.907
      0.257
Household Income as a Percent of the      
Poverty Level (Percent)      
Less than 33 30.2 30  
33 to 67 32.5 29.2  
67 to 99 24 26.5  
100 or more 13.3 14.3  
Welfare Receipt      
AFDC/TANFa 35.6 34.7 0.627
Food Stamps 48 47.8 0.889
Medicaid 76.6 74.7 0.217
SSI 7 7 0.978
WIC 87.5 85.9 0.235
Public housing 9.5 8.9 0.565
Has Inadequate Resources      
Food 4.9 6.3 0.111
Housing 12.3 13.3 0.432
Money to buy necessities 20.8 21.7 0.588
Medical care 14 14.7 0.577
Transportation 20.9 22.4 0.334
Child care 34.4 34.6 0.913
Money for supplies 27.1 29.4 0.28
Support from friends 12.9 14 0.414
Parent information 12.5 16.3 .005*
Maternal Risk Indexc     0.469
0 or 1 (low risk) 18.8 17.3  
2 or 3 (moderate risk) 54.2 56.4  
4 or 5 (high risk) 27.1 26.3  
Random Assignment Date     0.808
Before 10/96 36 36.5  
10/96 to 6/97 30.2 30.8  
After 6/97 33.8 32.7  
Previously Enrolled in Head Start or Another Childhood Development Programb 12.8 13.4 0.628
Characteristics of Focus Child
Age (Months)     0.33
Unborn 24.2 26.5  
Less than 5 36.1 34.7  
5 or more 39.7 38.7  
Male 51.7 50.4 0.493
First Born 62.3 62.8 0.783
Birthweight Less than 2,500 Gramsb 9.9 8.4 0.237
Born more than 3 Weeks Earlyb 15.8 12 .014*
Stayed in Hospital After Birthb 18.3 16 0.178
People Concerned About the Child's Overall Health and Developmentb 13 13.3 0.87
Received an Evaluation Because of Concerns About the Child's Overall Health and Development or Because of Suspected Developmental Delayb 6 6.9 0.412
Risk Categories      
Has established risksb 11.6 10.6 0.444
Has biological or medical risksb 18.3 16.8 0.396
Has environmental risksb 32.5 36.4 .062*
Covered by Health Insuranceb 90.1 89.6 0.723
Sample Size 1,513 1,488  
SOURCE: HSFIS application and enrollment forms.

aThe primary caregiver is considered to be an adult regardless of her age.(back)

bThese variables pertain to families with focus children who were born at baseline.(back)

cThis index was constructed by summing the number of the following risk factors that the mother faced: (1) being a teenage mother; (2) having no high school credential; (3) receiving public assistance; (4) not being employed or in school or training, and (5) being a single mother.(back)

*Significantly different from zero at the .10 level, two-tailed test.


TABLE D.1B

P-VALUES FROM TESTS COMPARING THE BASELINE CHARACTERISTICS OF
PROGRAM AND CONTROL GROUP MEMBERS, BY SITE
Site Mother's
Age
Mother's
Education
Race and
Ethnicity
Mother's
Primary
Occupation
Living
Arrangements
Received
AFDC or
TANF
Received
Food
Stamps
Maternal
Risk Index
Random Assignment Date Child's
Age
Child's
Gender
1 .446 .903 .211 .976 .459 .82 .707 .809 .970 .576 .027*
2 .165 .482 .252 .948 .472 .7 .734 .82 .615 .4 .227
3 .927 .782 .795 .219 .073* .107 .041* .138 .981 .626 .896
4 .748 .496 .434 .722 .662 .682 .401 .131 1 .939 .951
5 .55 .158 .19 .559 .694 .361 .808 .84 .845 .464 .308
6 .863 .943 .505 .393 .598 .611 .757 .715 .666 .344 .952
7 .978 .084* .84 .071* .052* .147 .726 .893 .924 .541 .677
8 .824 .355 .683 .499 .773 .115 .858 .879 1 .749 .778
9 .97 .217 .579 .533 .401 .326 .791 .286 .985 .306 .362
10 .594 .786 .507 .619 .68 .225 .331 .185 .707 .592 .951
11 .749 .534 .405 .326 .755 .402 .075* .156 .454 .040* .215
12 .549 .716 .739 .411 .681 .2 .095* .083* .99 .967 .698
13 .003* .996 .824 .735 .367 .051* .92 .406 .67 .751 .347
14 .381 .54 .387 .884 .993 .984 .403 .417 .948 .417 .402
15 .744 .88 .395 .343 .766 .776 .934 .469 .924 .911 .453
16 .075* .622 .622 .464 .492 .142 .887 .244 .791 .242 .867
17 .733 .804 .367 .188 .358 .122 .895 .714 1 .457 .496
SOURCE: HSFIS application and enrollment forms.

*Statistically different from zero at the .10 level, two-tailed test.

The results indicate that random assignment produced program and control groups with equivalent characteristics. For the full sample, the program and control group differences are statistically significant at the 10 percent level for only 3 of the 47 univariate tests (which is less than the approximately 5 tests that would be expected by chance), and only 4 of the tests are statistically significant at the 15 percent level. Furthermore, the joint test from the multivariate regression model yields a p-value of .630. Finally, very few (15 of 207) univariate tests for 12 key variables are rejected at the 10 percent level across the sites, and the significant test statistics are scattered across sites and variables. We conclude that random assignment produced equivalent research groups.



 

 

 Table of Contents | Previous | Next