Adaptive ImmunoSeq Repository

(a.k.a. AIRR-seq Adaptive Adapter)

Team Members (by last name)

Brian Corrie, Technical Director, iReceptor, Simon Fraser University
Michael R. Crusoe, VU Amsterdam, DTL Projects (ELIXIR-NL)
Laura Gutierrez Funderburk, Simon Fraser University, Department of Mathematics
Nicole Knoetze, University of British Columbia, Bioinformatics Department
Artem Kushner, University of British Columbia, Mathematics Department
Akiff Manji, University of British Columbia

Project video below.

About the Project

Microsoft and Adaptive Biotechnologies are exploring how the immune system responds to COVID-19. As part of this project, they are gathering data from the Adaptive Immune Receptor Repertoire (AIRR-seq data) from COVID-19 patients. The AIRR-seq data from this study will be stored in Adaptive's publicly available ImmunoSeq repository. In order to make this data more broadly accessible, there is a need to convert this data from the Adaptive metadata format to standard AIRR-seq data formats as created by the AIRR Community.

The goal of this project is to develop a tool (or set of tools) to query the Adaptive ImmunoSeq repository (using their Web APIs), download data, and convert it to the standard AIRR-seq data formats for Repertoires. Once we have a converter for the Adaptive metadata, it will be possible to load the COVID-19 data produced by the Microsoft/Adaptive project and load that data into an AIRR Compliant repository such as the iReceptor Turnkey repository. The iReceptor project will be operating a COVID-19 AIRR-seq repository, and will be working with Adaptive and Microsoft to curate this and other publicly available AIRR-seq data once it is made available. This effort is in collaboration with, and in response to, the AIRR Community’s call for sharing COVID-19 AIRR-seq data.

Once curated, this data will be a part of the AIRR Data Commons, making it accessible to the international research community in the fight against COVID-19. Not only that, but because the data is part of the AIRR Data Commons, it will be possible for researchers to compare this data with other AIRR-seq data sets from both healthy subjects and data from subjects with other diseases through searching and federating that data using the iReceptor Scientific Gateway. Sharing these data will be critical for developing diagnostics and therapeutics against cancer, infectious and autoimmune diseases.

The Challenge

Adaptive Biotech’s approach to supporting their customers in tracking metadata about projects and studies that are in the ImmunoSeq repository is to be as flexible as possible. Adaptive uses both controlled metadata for some important fields and an extensible key:value tagging mechanism for other metadata fields. This makes it flexible for the researcher to annotate their study with metadata that meets their needs. Unfortunately, this approach makes it challenging for that data to be FAIR (Findable, Accessible, Interoperable, and Reusable). In particular, a flexible and extensible tagging mechanism makes it difficult to make this data Interoperable and Reusable (the IR in FAIR) as there is no guarantee that any two studies will use the same field names, field definitions, or field types for any given field (consider the many ways that the age of a subject in a study can be represented). This is the bane of any data science project!

To solve this problem, the AIRR Community has established a set of standards for the curation and sharing of AIRR-seq data.

This includes the MiAIRR standard, which is a recommended minimal standard for the curation of study, sample, sample processing, cell processing, and nucleic acid processing metadata for studies that involve AIRR-seq data.

The challenge, if you choose to accept it, is to convert Adaptive’s sample metadata file format into files that adhere to the AIRR Repertoire file format specification. Note that this is a non-trivial challenge, as there is no well defined transformation for performing this conversion.

The Process

More info about iReceptor

For more information about iReceptor, checkout the iReceptor and iReceptor Plus websites.

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
Mapping_Files		Mapping_Files
adaptive		adaptive
cwl		cwl
scripts		scripts
5eaca36e-c0b9-4be5-aeaa-bc1d230af791.json		5eaca36e-c0b9-4be5-aeaa-bc1d230af791.json
Dockerfile		Dockerfile
LICENSE		LICENSE
LogosTo.png		LogosTo.png
README.cwl.md		README.cwl.md
README.md		README.md
Resources.md		Resources.md
Tasks.md		Tasks.md
airr-ir-irplus.png		airr-ir-irplus.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adaptive ImmunoSeq Repository

About the Project

The Challenge

The Process

More info about iReceptor

About

Releases

Packages

Contributors 5

Languages

License

sfu-ireceptor/AIRR-seqAA

Folders and files

Latest commit

History

Repository files navigation

Adaptive ImmunoSeq Repository

About the Project

The Challenge

The Process

More info about iReceptor

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages