
SOURCE OF DATA
The data for this microdata file come from the November 1994 Current
Population Survey (CPS). This month's survey uses two sets of
questions, the basic CPS and the supplement. The Bureau of the
Census conducts the basic CPS every month and asks supplementary
questions during certain months.
Basic CPS. The basic CPS collects primarily labor
force data about the civilian noninstitutional population. Interviewers
ask questions concerning labor force participation about each
member 15 years old and over in every sample household.
November 1994 supplement. In addition to the basic
CPS questions, interviewers asked supplementary questions on computer
literacy and voting and registration.
Sample Design. The present CPS sample was selected
from the 1980 Decennial Census files with coverage in all 50 states
and the District of Columbia. The sample is continually updated
to account for new residential construction. The United States
was divided into 1,973 geographic areas. In most states, a geographic
area consisted of a county or several contiguous counties. In
some areas of New England and Hawaii, minor civil divisions are
used instead of counties. A total of 729 geographic areas were
selected for sample. About 60,000 occupied households are eligible
for interview every month. Interviewers are unable to obtain interviews
at about 2,600 of these units. This occurs when the occupants
are not found at home after repeated calls or are unavailable
for some other reason.
Since the introduction of the CPS, the Bureau of the Census has
redesigned the CPS sample several times. These redesigns have
improved the quality and accuracy of the data and have satisfied
changing data needs. A redesigned CPS sample based on the 1990
census is currently being phased-in. The phase-in procedure started
in April 1994 and will be completed in July 1995. In July 1995,
there will be 818 geographic areas in sample.
Estimation procedure. This survey's estimation procedure
inflates weighted sample results to independent estimates of the
civilian noninstitutional population of the United States by age,
sex, race, Hispanic/non-Hispanic origin, and state of residence.
This adjustment is called the post-stratification ratio estimate.
The independent estimates are calculated based on information
from four primary sources:
ACCURACY OF THE ESTIMATES
Since the CPS estimates come from a sample, they may differ from
figures from a complete census using the same questionnaires,
instructions, and enumerators. A sample survey estimate has two
possible types of error: sampling and nonsampling. The accuracy
of an estimate depends on both types of error, but the full extent
of the nonsampling error is unknown. Consequently, one should
be particularly careful when interpreting results based on a relatively
small number of cases or on small differences between estimates.
The standard errors for CPS estimates primarily indicate the magnitude
of sampling error. They also partially measure the effect of some
nonsampling errors in responses and enumeration, but do not measure
systematic biases in the data. (Bias is the average over all possible
samples of the differences between the sample estimates and the
desired value.)
Nonsampling variability. There are several sources
of nonsampling errors including the following:
CPS undercoverage results from missed housing units and missed
persons within sample households. Compared to the level of the
1990 Decennial Census, overall CPS undercoverage is about 8 percent.
CPS undercoverage varies with age, sex, and race. Generally, undercoverage
is larger for males than for females and larger for Blacks and
other races combined than for Whites. The post-stratification
ratio estimate described previously partially corrects for bias
due to undercoverage. However, biases exist in the estimates to
the extent that missed persons in missed households or missed
persons in interviewed households have different characteristics
from those of interviewed persons in the same agesexraceorigin-state
group.
A common measure of survey coverage is the coverage ratio, the
estimated population before the post-stratification ratio estimate
divided by the independent population control. Table A shows CPS
coverage ratios for age-sex-race groups for a typical month. The
CPS coverage ratios can exhibit some variability from month to
month. Other Census Bureau household surveys experience similar
coverage.
| Age | |||||||
| 014 | 0.929 | ||||||
| 15 | 0.933 | ||||||
| 16-19 | 0.881 | ||||||
| 2029 | 0.847 | ||||||
| 3039 | 0.904 | ||||||
| 4049 | 0.928 | ||||||
| 5059 | 0.953 | ||||||
| 6064 | 0.961 | ||||||
| 6569 | 0.919 | ||||||
| 70+ | 0.993 | ||||||
| 15+ | 0.914 | ||||||
| 0+ | 0.918 | ||||||
For additional information on nonsampling error including the
possible impact on CPS data when known, refer to Statistical Policy
Working Paper 3, An Error Profile: Employment as Measured by
the Current Population Survey, Office of Federal Statistical
Policy and Standards, U.S. Department of Commerce, 1978 and Technical
Paper 40, The Current Population Survey: Design and Methodology,
Bureau of the Census, U.S. Department of Commerce.
Comparability of data. Data obtained from the CPS
and other sources are not entirely comparable. This results from
differences in interviewer training and experience and in differing
survey processes. This is an example of nonsampling variability
not reflected in the standard errors. Use caution when comparing
results from different sources.
A number of changes were made in data collection and estimation
procedures beginning with the January 1994 CPS. The major change
was the use of a new questionnaire. The questionnaire was redesigned
to measure the official labor force concepts more precisely, to
expand the amount of data available, to implement several definitional
changes, and to adapt to a computer-assisted interviewing environment.
The supplemental questions were also modified for adaptation to
computer-assisted interviewing, although there were no changes
in definitions and concepts. Due to these and other changes, one
should use caution when comparing estimates from data collected
in 1994 with estimates from earlier years.
Caution should also be used when comparing estimates obtained
from this microdata file (which reflects 1990 censusbased
population controls) with estimates for 1993 and earlier years
(which reflect 1980 censusbased population controls). This
change in population controls had relatively little impact on
summary measures such as means, medians, and percentage distributions.
It did have a significant impact on levels. For example, use of
1990 based population controls results in about a 1percent
increase in the civilian noninstitutional population and in the
number of families and households. Thus, estimates of levels for
data collected in 1994 and later years will differ from those
for earlier years by more than what could be attributed to actual
changes in the population. These differences could be disproportionately
greater for certain subpopulation groups than for the total population.
Since no independent population control totals for persons of
Hispanic origin were used before 1985, compare Hispanic estimates
over time cautiously.
For more information on the introduction of the new questionnaire,
the modernized data collection methods, and the introduction of
new population controls based on the 1990 census, see "Revisions
in the Current Population Survey Effective January 1994"
in the February 1994 issue of Employment and Earnings published
by the Bureau of Labor Statistics.
Note when using small estimates. Because of the
large standard errors involved, summary measures (such as medians
and percent distributions) probably do not reveal useful information
when computed on a base smaller than 75,000. Take care in the
interpretation of small differences. For instance, even a small
amount of nonsampling error can cause a borderline difference
to appear significant or not, thus distorting a seemingly valid
hypothesis test.
Sampling variability. Sampling variability is variation
that occurred by chance because a sample was surveyed rather than
the entire population. Standard errors, as calculated below, are
primarily measures of sampling variability, but they may include
some nonsampling error.
Standard errors and their use. A number of approximations
are required to derive, at a moderate cost, standard errors applicable
to estimates from this microdata file. Instead of providing an
individual standard error for each estimate, two parameters, a
and b, are provided to calculate standard errors for each type
of characteristic. These parameters are in Tables B through J.
The sample estimate and its standard error enable one to construct
a confidence interval. A confidence interval is a range that would
include the average result of all possible samples with a known
probability. For example, if all possible samples were surveyed
under essentially the same general conditions and using the same
sample design, and if an estimate and its standard error were
calculated from each sample, then approximately 90 percent of
the intervals from 1.645 standard errors below the estimate to
1.645 standard errors above the estimate would include the average
result of all possible samples.
A particular confidence interval may or may not contain the average
estimate derived from all possible samples. However, one can say
with specified confidence that the interval includes the average
estimate calculated from all possible samples.
Standard errors may also be used to perform hypothesis testing.
This is a procedure for distinguishing between population parameters
using sample estimates. One common type of hypothesis is that
two population parameters are different. An example of this would
be comparing the number of men who were part-time workers with
the number of women who were part-time workers.
Tests may be performed at various levels of significance. A significance
level is the probability of concluding that the characteristics
are different when, in fact, they are the same. To conclude that
two parameters are different at the 0.10 level of significance,
for example, the absolute value of the estimated difference between
characteristics must be greater than or equal to 1.645 times the
standard error of the difference.
The Census Bureau uses 90percent confidence intervals and
0.10 levels of significance to determine statistical validity.
Consult standard statistical textbooks for alternative criteria.
For information on calculating standard errors for labor force
data from the CPS which involve quarterly or yearly averages,
changes in consecutive quarterly or yearly averages, consecutive
month-to-month changes in estimates, and consecutive year-to-year
changes in monthly estimates; see "Explanatory Notes and
Estimates of Error: Household Data" in the corresponding
Employment and Earnings published by the Bureau of Labor
Statistics.
Standard errors of estimated numbers. The approximate
standard error, sx, of an estimated number from this
microdata file can be obtained using this formula:
Here x is the size of the estimate and a and b are the parameters
in Tables B through J associated with the particular type of characteristic.
When calculating standard errors from crosstabulations involving
different characteristics, use the set of parameters for the characteristic
which will give the largest standard error.
Illustration
Suppose there were 6,000,000 unemployed men in the civilian labor
force. Use the appropriate parameters from Table B and formula
(1) to get
| Number, x | 6,000,000 |
| a parameter | -0.000016 |
| b parameter | 2,465 |
| Standard error | 119,000 |
| 90% conf. int. | 5,804,000 to 6,196,000 |
The standard error is calculated as
The 90-percent confidence interval is calculated as 6,000,000
± 1.645>×119,000.
A conclusion that the average estimate derived from all possible
samples lies within a range computed in this way would be correct
for roughly 90 percent of all possible samples.
Standard errors of estimated percentages. The reliability
of an estimated percentage, computed using sample data from both
numerator and denominator, depends on both the size of the percentage
and its base. Estimated percentages are relatively more reliable
than the corresponding estimates of the numerators of the percentages,
particularly if the percentages are 50 percent or more. When the
numerator and denominator of the percentage are in different categories,
use the parameter from one of the parameter tables (Tables B through
J) indicated by the numerator.
The approximate standard error, sx ,p, of an estimated percentage can be obtained by use of the formula
Here x is the total number of persons, families, households, or
unrelated individuals in the base of the percentage, p is the
percentage (0 <= p <=
100), and b is the parameter from the parameter table associated
with the characteristic in the numerator of the percentage.
Illustration - Computer Literacy
Suppose that of approximately 99,846,000 households, 25.5 percent
had a computer in the household. Use the appropriate parameter
from Table C and formula (2) to get
| Percentage, p | |
| Base, x | |
| b parameter | |
| Standard error | |
| 90% conf. int. |
The standard error is calculated as
The 90-percent confidence interval of the percentage of households
with computers is calculated as 25.5 ± 1.645×0.2.
Illustration - Voting and Registration
Suppose that of 15,391,000 people with an elementary school education,
35.1 percent reported voting. Use the appropriate parameter from
Table D and formula (2) to get
| Percentage, p | |
| Base, x | |
| b parameter | |
| Standard error | |
| 90% conf. int. |
The standard error is calculated as
the 90-percent confidence interval of the percentage of people with an elementary school education who reported voting is calculated as 35.1 ± 1.645×0.7.
Standard error of a difference. The standard error of the difference between two sample estimates is approximately equal to
where sx and sy are the standard errors
of the estimates, x and y. The estimates can be numbers, percentages,
ratios, etc. This will represent the actual standard error quite
accurately for the difference between estimates of the same characteristic
in two different areas, or for the difference between separate
and uncorrelated characteristics in the same area. However, if
there is a high positive (negative) correlation between the two
characteristics, the formula will overestimate (underestimate)
the true standard error.
Illustration
Suppose that of 6,285,000 employed men between 20-24 years of
age, 1,516,000 or 24.1 percent were part-time workers, and of
the 5,824,000 employed women between 20-24 years of age, 2,169,000
or 37.2 percent were part-time workers. Use the appropriate parameters
from Table B and formulas (2) and (3) to get
| Percentage, p | 24.1 | 37.2 | 13.1 |
| Number, x | 6,285,000 | 5,824,000 | - |
| b parameter | 2,301 | 2,112 | - |
| Standard error | 0.8 | 0.9 | 1.2 |
| 90% conf. int. | 22.8 to 25.4 | 35.7 to 38.7 | 11.1 to 15.1 |
The standard error of the difference is calculated as
The 90-percent confidence interval around the difference is calculated
as 13.1 ± 1.645×1.2. Since
this interval does not include zero, we can conclude with 90 percent
confidence that the percentage of part-time women workers between
20-24 years of age is greater than the percentage of part-time
men workers between 20-24 years of age.
Characteristics November 1994 | ||
| Characteristic | ||
| Labor Force and Not In Labor Force Data Other than Agricultural Employment and Unemployment | ||
| Total 1 | ||
| - Men 1 | ||
| - Women | ||
| - Both sexes, 16 to 19 years | ||
| White 1 | ||
| - Men | ||
| - Women | ||
| - Both sexes, 16 to 19 years | ||
| Black | ||
| - Men | ||
| - Women | ||
| - Both sexes, 16 to 19 years | ||
| Hispanic origin | ||
| Not In Labor Force (use only for Total, Total Men, and White) | ||
| Agricultural Employment | ||
| Total or White | ||
| - Men | ||
| - Women or Both sexes, 16 to 19 years | ||
| Black | ||
| Hispanic origin | ||
| - Total or Women | ||
| - Men or Both sexes, 16 to 19 years | ||
| Unemployment | ||
| Total or White | ||
| Black | ||
| Hispanic origin | ||
1 For not in labor force characteristics, use the Not
In Labor Force parameters.
November 1994 | |||||||||
| Characteristic | |||||||||
| PERSONS | |||||||||
| Educational Attainment | -0.000013 | 2,545 | -0.000160 | 3,448 | -0.000377 | 6.399 | -0.000478 | 3,758 | |
| Persons by Family Income | -0.000024 | 4,531 | -0.000240 | 5,188 | -0.000568 | 9,630 | -0.000719 | 5,655 | |
| Income | -0.000012 | 2,266 | -0.000120 | 2,594 | -0.000168 | 2,857 | -0.000360 | 2,827 | |
| Marital Status, Household & Family Characteristics |
-0.000025 | 4,811 | -0.000320 | 6,910 | -0.000756 | 12,826 | -0.000958 | 7,533 | |
| Poverty | -0.000051 | 9,551 | -0.000443 | 9,564 | -0.000624 | 10,534 | -0.001326 | 10,425 | |
| FAMILIES, HOUSEHOLDS, OR UNRELATED INDIVIDUALS | |||||||||
| Income | -0.000011 | 2,069 | -0.000105 | 2,258 | -0.000147 | 2,487 | -0.000313 | 2,461 | |
| Marital Status, Household & Family, Educational Attainment, Population by Age or Sex |
-0.000010 | 1,909 | -0.000080 | 1,727 | -0.000189 | 3,206 | -0.000239 | 1,883 | |
| Poverty | 0.000012 | 2,255 | 0.000105 | 2,258 | -0.000147 | 2,487 | -0.000313 | 2,461 | |
and Registration in November 1994: Total or White Persons | ||
| Voting, registration, reasons for not voting or registering (includes breakdowns by: Citizenship, Household relationship, Family heads by presence of children, Marital status, Duration of residence, Tenure, Education level, Family income of persons, Occupation group) | ||
| Characteristics of all persons, | ||
| Voting and nonvoting: | ||
| Marital status | ||
| Education of persons | ||
| Education of family head | ||
| Persons by family income | ||
| Duration of residence tenure | ||
| Household relationships, | ||
| Voting and nonvoting: | ||
| Head, spouse of head | ||
| Nonrelative or other relative of head | ||
and Registration in November 1994: Black Persons | ||
| Voting, registration, reasons for not voting or registering (includes breakdowns by: Citizenship, Household relationship, Family heads by presence of children, Marital status, Duration of residence, Tenure, Education level, Family income of persons, Occupation group) | ||
| Characteristics of all persons, | ||
| Voting and nonvoting: | ||
| Marital status | ||
| Education of persons | ||
| Education of family head | ||
| Persons by family income | ||
| Duration of residence tenure | ||
| Household relationships, | ||
| Voting and nonvoting: | ||
| Head, spouse of head | ||
| Nonrelative or other relative of head | ||
and Registration in November 1994: Hispanic Persons | ||
| Voting, registration, reasons for not voting or registering (includes breakdowns by: Citizenship, Household relationship, Family heads by presence of children, Marital status, Duration of residence, Tenure, Education level, Family income of persons, Occupation group) | ||
| Characteristics of all persons, | ||
| Voting and nonvoting: | ||
| Marital status | ||
| Education of persons | ||
| Education of family head | ||
| Persons by family income | ||
| Duration of residence tenure | ||
| Household relationships, | ||
| Voting and nonvoting: | ||
| Head, spouse of head | ||
| Nonrelative or other relative of head | ||
and Registration in November 1994: Asian and Pacific Islanders | ||
| Voting, registration, reasons for not voting or registering (includes breakdowns by: Citizenship, Household relationship, Family heads by presence of children, Marital status, Duration of residence, Tenure, Education level, Family income of persons, Occupation group) | ||
| Characteristics of all persons, | ||
| Voting and nonvoting: | ||
| Marital status | ||
| Education of persons | ||
| Education of family head | ||
| Persons by family income | ||
| Duration of residence tenure | ||
| Household relationships, | ||
| Voting and nonvoting: | ||
| Head, spouse of head | ||
| Nonrelative or other relative of head | ||
Table H. State Voting and Registration Parameters
| State | a | b |
| Alabama | 0.001083 | 3,481 |
| Alaska | 0.000963 | 394 |
| Arizona | 0.001069 | 3,209 |
| Arkansas | 0.001077 | 1,998 |
| California | 0.000162 | 3,784 |
| Colorado | 0.001176 | 3,209 |
| Connecticut | 0.001431 | 3,632 |
| Delaware | 0.001296 | 696 |
| Dist. of Col. | 0.001530 | 696 |
| Florida | 0.000253 | 2,724 |
| Georgia | 0.001028 | 5,388 |
| Hawaii | 0.001230 | 1,059 |
| Idaho | 0.000968 | 787 |
| Illinois | 0.000318 | 2,815 |
| Indiana | 0.001184 | 5,146 |
| Iowa | 0.001084 | 2,301 |
| Kansas | 0.001071 | 1,998 |
| Kentucky | 0.001107 | 3,209 |
| Louisiana | 0.001240 | 3,905 |
| Maine | 0.001176 | 1,120 |
| Maryland | 0.001206 | 4,601 |
| Massachusetts | 0.000318 | 1,483 |
| Michigan | 0.000309 | 2,210 |
| Minnesota | 0.001177 | 3,965 |
| Mississippi | 0.001021 | 1,998 |
| Missouri | 0.001205 | 4,752 |
| Montana | 0.001049 | 666 |
| Nebraska | 0.001044 | 1,241 |
| Nevada | 0.001004 | 1,090 |
| New Hampshire | 0.001438 | 1,241 |
| New Jersey | 0.000301 | 1,846 |
| New Mexico | 0.001034 | 1,241 |
| New York | 0.000172 | 2,422 |
| North Carolina | 0.000278 | 1,483 |
| North Dakota | 0.001043 | 484 |
| Ohio | 0.000298 | 2,512 |
| Oklahoma | 0.001104 | 2,664 |
| Oregon | 0.001222 | 2,876 |
| Pennsylvania | 0.000293 | 2,724 |
| Rhode Island | 0.001393 | 1,059 |
| South Carolina | 0.000890 | 2,452 |
| South Dakota | 0.000940 | 484 |
| Tennessee | 0.000980 | 3,875 |
| Texas | 0.000284 | 3,814 |
| Utah | 0.001082 | 1,392 |
| Vermont | 0.001369 | 605 |
| Virginia | 0.000858 | 4,208 |
| Washington | 0.001041 | 4,147 |
| West Virginia | 0.001094 | 1,574 |
| Wisconsin | 0.000982 | 3,723 |
| Wyoming | 0.001575 | 545 |
Table I. Census Division Voting and Registration Parameters
| Division | a | b |
| New England | 0.000187 | 1,907 |
| Middle Atlantic | 0.000081 | 2,391 |
| East North Central | 0.000093 | 3,027 |
| West North Central | 0.000234 | 3,148 |
| South Atlantic | 0.000091 | 3,209 |
| East South Central | 0.000274 | 3,299 |
| West South Central | 0.000168 | 3,511 |
| Mountain | 0.000196 | 2,179 |
| Pacific | 0.000117 | 3,632 |
Table J. Census Region Voting and Registration Parameters
| Region | a | b |
| Northeast | 0.000056 | 2,240 |
| Midwest | 0.000064 | 2,966 |
| South | 0.000046 | 3,148 |
| West | 0.000076 | 3,209 |
| All Except South | 0.000022 | 2,808 |
CPS Voting and Registration Supp - 1994 Methodology and Documentation Page
CPS Main Page