Screening autism-associated environmental factors in differentiating human neural progenitors with fractional factorial design-based transcriptomics
Abishek Arora (1,2), Martin Becker (1,2), Cátia Marques (3), Marika Oksanen (1,2), Danyang Li (1,2), Francesca Mastropasqua (1,2), Michelle Evelyn Watts (1,2), Manish Arora (4), Anna Falk (5,6), Carsten Oliver Daub (7,8), Ingela Lanekoff (3) and Kristiina Tammimies (1,2,*)
1. Center of Neurodevelopmental Disorders (KIND), Centre for Psychiatry Research, Department of Women's and Children's Health, Karolinska Institutet, and Child and Adolescent Psychiatry, Stockholm Health Care Services, Stockholm County Council, Stockholm, Sweden.
2. Astrid Lindgren Children’s Hospital, Karolinska University Hospital, Region Stockholm, Stockholm, Sweden.
3. Department of Chemistry - BMC, Uppsala University, Uppsala, Sweden.
4. Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, USA.
5. Department of Neuroscience, Karolinska Institutet, Stockholm, Sweden.
6. Lund Stem Cell Center, Division of Neurobiology, Department of Experimental Medical Science, Lund University, Lund, Sweden.
7. Department of Biosciences and Nutrition, Karolinska Institutet, Stockholm, Sweden.
8. Science for Life Laboratory, Stockholm, Sweden.
*Correspondence: [email protected]
Published in Scientific Reports (2023), DOI: 10.1038/s41598-023-37488-0 | PubMed: 37386098 | GEO: GSE229546
Thank you for your interest in our manuscript. This GitHub repository contains the source code and data sets required to replicate our figures and findings using R. The four cell lines included in our study are coded as follows in the R Markdown files:
- CTRLMale – CTRL9II (Neurotypical Male)
- CTRLFemale – AF22 (Neurotypical Female)
- ASDHNRNPU – ASD12BI (Male with ASD diagnosis and HNRNPU deletion)
- ASDCASK – ASD17AII (Male with ASD diagnosis and CASK splice site variant)
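For readers who want to translate between the codes and the original cell line names in their own scripts, the mapping above can be held in a named character vector. This is a minimal sketch: the object names `cell_line_codes` and `clinical_background` are illustrative and are not taken from the R Markdown files in this repository.

```r
# Illustrative lookup table built from the list above.
# The object names are assumptions, not identifiers from the repository.
cell_line_codes <- c(
  CTRLMale   = "CTRL9II",
  CTRLFemale = "AF22",
  ASDHNRNPU  = "ASD12BI",
  ASDCASK    = "ASD17AII"
)

# Code -> original cell line name
cell_line_codes[["ASDCASK"]]

# Original cell line name -> code
names(cell_line_codes)[match("AF22", cell_line_codes)]

# Derive a clinical background label (CTRL or ASD) from the code prefix
clinical_background <- ifelse(
  startsWith(names(cell_line_codes), "ASD"), "ASD", "CTRL"
)
names(clinical_background) <- names(cell_line_codes)
clinical_background
```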
The directory contains the R Markdown file and data sets required for differential gene expression analysis and gene-set enrichment analysis (GSEA) for independent effects of environmental factors, based on the fractional factorial experimental design. The analysis pipeline in this directory corresponds to the following stratified analysis approach as described in our manuscript: Global Effects (Level I), Clinical Background Effects (Level II) and Cell Lines Effects (Level III).
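To illustrate how a fractional factorial design allows independent (main) effects to be estimated from a reduced number of runs, the following base R sketch builds a two-level half-fraction design for three factors. The three factors, the four runs, the generator `BPA = Pb * FH` and the response values are all assumptions chosen for illustration; they are not the design or data used in the study, and the repository's analysis works on RNA-seq data rather than a single response.

```r
# Half-fraction 2^(3-1) design: two factors are crossed in full,
# the third is generated as their product (illustrative generator).
design <- expand.grid(Pb = c(-1, 1), FH = c(-1, 1))
design$BPA <- design$Pb * design$FH

# The factor columns are mutually orthogonal, so the main effects can be
# estimated independently of one another (off-diagonal entries are zero).
crossprod(as.matrix(design[, c("Pb", "FH", "BPA")]))

# Made-up, noise-free response with known main effects
design$y <- 2 + 1.5 * design$Pb - 0.5 * design$BPA

# A main-effects linear model recovers the effects from only four runs
fit <- lm(y ~ Pb + FH + BPA, data = design)
coef(fit)
```

In a half fraction like this one, each main effect is confounded with a two-way interaction of the other two factors, which is the trade-off made for running fewer conditions.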
The directory contains the R Markdown file and data sets required to perform differential gene expression analysis and GSEA for two-way interaction effects of environmental factors, based on the fractional factorial experimental design. The analysis pipeline in this directory corresponds to the following stratified analysis approach as described in our manuscript: Interaction Effects (Level IV).
The directory contains the R Markdown file and data sets to test for enrichment of significant differentially expressed genes in other gene lists. The differentially expressed genes were identified in the analysis of global effects (Level I) for exposure to lead (Pb) and fluoxetine hydrochloride (FH). Enrichment was tested in the publicly available gene lists described in the manuscript, namely SFARI, SFARI high confidence, Epilepsy Panel App, Intellectual Disability Panel App and a developmental gene list (DOI: 10.1002/aur.2666). The directory also includes the gene list enrichment analysis for genes with differential exon usage events following bisphenol A (BPA) exposure in the CTRLFemale (AF22) cell line.
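A common way to test whether a set of differentially expressed genes overlaps a reference gene list more than expected by chance is a one-sided Fisher's exact test on a 2x2 table. The sketch below shows that approach in base R under stated assumptions: the function name `test_gene_list_enrichment`, the gene identifiers and the background size are hypothetical, and the test used in the repository's R Markdown file may differ.

```r
# Hypothetical helper: one-sided Fisher's exact test for over-representation
# of a reference gene list among differentially expressed genes (DEGs).
test_gene_list_enrichment <- function(deg, gene_list, background) {
  # Restrict both sets to the background of tested genes
  deg <- intersect(deg, background)
  gene_list <- intersect(gene_list, background)

  in_deg  <- background %in% deg
  in_list <- background %in% gene_list

  # Build the 2x2 table from explicit counts so empty cells are kept
  counts <- matrix(
    c(sum(in_deg & in_list),  sum(in_deg & !in_list),
      sum(!in_deg & in_list), sum(!in_deg & !in_list)),
    nrow = 2, byrow = TRUE,
    dimnames = list(c("DEG", "not_DEG"), c("in_list", "not_in_list"))
  )

  test <- fisher.test(counts, alternative = "greater")
  list(overlap = counts["DEG", "in_list"], p_value = test$p.value)
}

# Made-up identifiers, for illustration only
background <- paste0("GENE", 1:100)
deg        <- paste0("GENE", 1:10)
gene_list  <- paste0("GENE", c(1:5, 50:54))

res <- test_gene_list_enrichment(deg, gene_list, background)
res$overlap
res$p_value
```

When several gene lists are tested, the resulting p-values would normally be corrected for multiple testing, for example with `p.adjust`.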
The directory contains the R script and data sets to check the enrichment of significant differentially expressed genes at certain timepoints in different regions of the developing human brain. For this, we used the publicly available BrainSpan Atlas that contains spatio-temporally stratified gene expression data pertaining to typical neurodevelopment.
The directory contains the R Markdown files and data sets to detect differential exon usage events as independent effects of environmental factors, based on the fractional factorial experimental design. The analysis pipeline in this directory corresponds to the following stratified analysis approach as described in our manuscript: Global Effects (Level I), Clinical Background Effects (Level II) and Cell Lines Effects (Level III).
The directory contains the R Markdown file and data sets needed to analyse the significance of changes in concentrations of metabolites detected in cell lines treated with FH, compared to non-treated controls. The analysis pipeline in this directory corresponds to the following stratified analysis approach as described in our manuscript: Global Effects (Level I), Clinical Background Effects (Level II) and Cell Lines Effects (Level III). Source code to produce the concentration plots for the different levels of analysis is also included.
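As a rough illustration of comparing metabolite concentrations between FH-treated and non-treated samples, the sketch below runs a Welch two-sample t-test per metabolite and adjusts the p-values with the Benjamini-Hochberg method. The choice of test, the metabolite names (`met_A`, `met_B`) and the concentration values are assumptions made for this example; see the R Markdown file in this directory for the analysis actually used.

```r
# Made-up concentrations for two hypothetical metabolites,
# four control and four FH-treated samples each.
metabolites <- data.frame(
  metabolite = rep(c("met_A", "met_B"), each = 8),
  treatment  = rep(rep(c("control", "FH"), each = 4), times = 2),
  conc       = c(1.00, 1.10, 0.90, 1.05, 1.50, 1.60, 1.40, 1.55,
                 2.00, 2.10, 1.90, 2.05, 2.02, 2.08, 1.95, 2.03)
)

# Welch t-test (the default in t.test) for each metabolite separately
p_values <- sapply(
  split(metabolites, metabolites$metabolite),
  function(d) t.test(conc ~ treatment, data = d)$p.value
)

# Benjamini-Hochberg correction across metabolites
p_adjusted <- p.adjust(p_values, method = "BH")
p_adjusted
```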