Skip to content

2017 Grant: Manipulation of the NLSY information to infer kinship relationships

License

Notifications You must be signed in to change notification settings

nlsy-links/nlsy-links-determination-2017

Repository files navigation

nlsy-links-determination-2017

Manipulation of the NLSY information to infer kinship relationships

This repository contains the products from our current grant (that started in 2017). Products from our 2011-2015 grant are contained in https://github.com/LiveOak/nlsy-links-determination-2011. See also the NlsyLinks R package.

Personnel

Database Structure

Tables related to an Item:

item

Tables related to an Marker:

marker

Tables related to an Relationship:

relationship

Tables related to an Subject:

subject

Additional Information:

  1. Some metadata sets are described in CSVs (i.e., 79 and 97). The metadata tables are uploaded to the database and logged.
  2. Other metadata sets that are hardcoded into the C#:
  3. Row & columns counts for each table are available and tracked over time.
  4. For the masochistically curious, the SQL scripts for the full 79 and 97 databases are available.

Grant Aims

This application requests funds to continue our past NLSY kinship linking efforts, and to use recently-developed kinship links in new and innovative empirical research. With previous support from a recent NIH grant, we updated kinship links for two National Longitudinal Survey of Youth datasets (NLSY79 and NLSY-Children/Young Adults), completed in 2013. The proposed research will identify and circulate links for a third NLSY dataset, the NLSY97, and will provide data for four proposed studies that use the kinship link structure from all three datasets.

Our specific aims emerge from the structure built into the three NLSY datasets. The NLSY79 data, first collected in 1979 through a household probability sample of 14-21 year old U.S. adolescents(aged 49-57 in the 2014 release), represents a longitudinal database collected annually (early) and biennially (more recently) for 35 years. The NLSYC contains longitudinal data from all biological children born to the NLSY79 females (for whom childbearing is now complete), collected biennially since 1986, with an age range in 2014 from infancy to the early 40’s. The NLSY97 is an 18-year approximate replication of the NLSY79, with annual surveys from 1997 to 2012, then biennially from 2012 and continuing. Each data source has thousands of outcome variables related to work, family, health, cognition, and education. These NLSY data structures allow researchers to study: 1) Longitudinal patterns within a dataset; 2) Cross-generational patterns across NLSY79 and NLSYC; 3) Cohort/period changes across NLSY79 and NLSY97; 4) Within-family patterns; 5) Cross-dataset outcomes using innovative research designs. Thousands of research teams from many disciplines have conducted research using the NLSY data. The data become even more valuable, and support more focused (behavior genetic) research, with their biometrical properties identified.

Through three previous funded projects, our research team has created and circulated high-quality kinship links (including validity analyses, and online software vignettes) for the NLSY79 and NLSYC, for thousands of respondents who share a household (the master combined dataset includes 42,774 kinship pairs). Around 60 research papers and dissertations/theses have been produced using NLSY79 and NLSYC links. However, kinship links using sophisticated and modern linking methods have never been developed for the NLSY97 (which has 1,862 households with multiple respondents, and 4,027 respondents in those households). Our first specific aim is to further expand family/kinship information in three NLSY datasets to support researchers (including our own research team) to take advantage of many valuable within-family features. We will create twin, sibling, and half-sibling links for the NLSY97, and archive those with existing NLSY79 and NLSY-Children links. We will validate the new links, and create matches from NLSY97 into the NLSY79 and NLSYC datasets (the matching links also will be publicly available to other research teams). Further, we will develop additional vignettes for illustration and software support, offered through a broad set of platforms. We will create minor updates for the previous NLSY79 and NLSYC links. And we will continue to run a User Service Office to support research use of the links.

Our second aim involves using the new links from the three datasets in two lines of biometrical NLSY research. The first is a set of two fertility studies that include: 1) A new “proxy mother” design, including “virtual siblings” (NLSY97 respondents matched to NLSYC/YA respondents using propensity score analysis) which leverages features of the connections across the three datasets, applied to fertility outcomes; 2) A design developed to biometrically decompose the relationship of childbearing to health outcomes, using fertility histories and the health modules collected at age 40 and 50. The second study uses cognitive outcomes in two studies: 1) Using six different cognitive measures and all 45,000+ kinship links across the three datasets, we will evaluate and elaborate the changing heritability patterns across age during childhood and adolescence; 2) Using the NLS79 and NLSYC, we will evaluate biometrically the link between SES and cognitive ability, with health outcomes used to moderate the relationship to health, using as health measures the health batteries noted above.

The NLSY kinship links have potential to provide research support for hundreds of researchers within an interdisciplinary research community studying behavior genetics, demography, developmental psychology, aging, economics, criminology, medical outcomes, and family studies. The general aims of our proposed research are to support this broad research community, and to contributes original research from our own research team.

Publication

Please see [research-publications](http://liveoak.github.io/NlsyLinks/research-publications.html for a list of the 70+ publications arising from the kinship links. An overview is available at:

Joseph Lee Rodgers,William H. Beasley, David E. Bard, Kelly M. Meredith, Michael D. Hunter, Amber B. Johnson, Maury Buster, Chengchang Li, Kim O. May, S. Mason Garrison, Warren B. Miller, Edwin van den Oord, and David C. Rowe (2016). Behavior Genetics, 46. https://doi.org/10.1007/s10519-016-9785-3.

The National Longitudinal Survey of Youth datasets (NLSY79; NLSY-Children/Young Adults; NLSY97) have extensive family pedigree information contained within them. These data sources are based on probability sampling, a longitudinal design, and a cross-generational and within-family data structure, with hundreds of phenotypes relevant to behavior genetic (BG) researchers, as well as to other developmental and family researchers. These datasets provide a unique and powerful source of information for BG researchers. But much of the information required for biometrical modeling has been hidden, and has required substantial programming effort to uncover—until recently. Our research team has spent over 20 years developing kinship links to genetically inform biometrical modeling. In the most recent release of kinship links from two of the NLSY datasets, the direct kinship indicators included in the 2006 surveys allowed successful and unambiguous linking of over 94 % of the potential pairs. In this paper, we provide details for research teams interested in using the NLSY data portfolio to conduct BG (and other family-oriented) research.

About

2017 Grant: Manipulation of the NLSY information to infer kinship relationships

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published