Skip to content

A pipeline for identifying aging-related biomarkers in mouse large intestine

Notifications You must be signed in to change notification settings

pocession/Tabula_muris

Repository files navigation

Tabula_muris_LargeIntestine: a project for identifying aging-related biomarkers in large intestine

Large intestine is the largest habitat for microbiota as well as the largest immune tissues. To help people understand how aging affect the large intestine physiology, I collect and visualize the Tabula muris data.

How I get and process the data

  1. The data was first downloaded from Tabula muris senis. A data set generated by 10X (droplet) and SMARTseq (facs) are hosted in the following path /Data-objects/tabula-muris-senis-droplet-processed-official-annotations.h5ad and /Data-objects/tabula-muris-senis-facs-processed-official-annotations.h5ad, respectively.
  2. The droplet dataset contains only 3' mRNA seqeunces but the facs data set contains full-length mRNA sequences.
  3. For the large intestine, only facs data set were used since droplet data set only contains one age group.
  4. The downloaded h5ad files were subset, filtered, and normalized by my Python script.
  5. To produce tables for Tableau, the data were further process by my R script.
  6. The main toolkit for pre-process and data subset is scanpy and pandas.
  7. The IDE is Visual Studio code.
  8. Visualization tools are Tableau Public and Biorender.
  9. The data visualization could be found on My Tableau page.

The files are organized as the following structure

github.com/pocession/Tabula_muris/
├── subset: csv files for subsetted data
│   ├── *.csv
├── figures : pdf or png files for output data
│   ├── *.pdf
|   ├── *.png
├── output : csv files for output data, used by Tableau
│   ├── *.csv
├── scRNA_differentialAnalysis.ipynb: the python script for processing the input data
├── SscRNA_subsetting.ipynb: the python script for subsetting the original data
├── readme.md
├── *.h5ad: original h5ad files

The processed data is provided in 'csv' formats and stored in the follwoing folder.

Useful links

Tableau pverview

About

A pipeline for identifying aging-related biomarkers in mouse large intestine

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published