Contributing to CG

👍🎉 First off, thanks for taking the time to contribute! 🎉👍

This is a guide for contributing to the CG package. Check here first if you want to set up a development environment, open an issue, suggest an enhancement, open a pull request, etc.

Code of Conduct

Communicating around code can be a sensitive thing, so please do your best to keep a positive tone. Remember that people put a significant amount of work into a PR or a review. Stay humble ⭐

Branch Model

CG uses the GitHub flow branching model, as described in our development manual.

How Can I Contribute?

Reporting Bugs

This section guides you through submitting a bug report to CG. Following these guidelines helps other developers and contributors understand your report 📝, reproduce the behavior 💻 and find related issues and reports 🔎

Before creating a bug report, please search the issues (open and closed) to see whether the problem has already been described; there might be no reason to create a new one. When creating a bug report, include as many details as possible.

Note: If you find a Closed issue that seems like it is the same thing that you're experiencing, open a new issue and include a link to the original issue in the body of your new one.

How Do I Submit A (Good) Bug Report?

Bugs are tracked as GitHub issues.

Explain the problem and include additional details to help maintainers reproduce the problem:

Use a clear and descriptive title for the issue to identify the problem.
Describe the exact steps which reproduce the problem in as much detail as possible. For example, start by explaining where CG was run and how it was used, i.e. exactly which command you used in the terminal.
Provide specific examples to demonstrate the steps. Include links to files or case IDs, or copy/pasteable snippets, which you use in those examples. If you're providing snippets in the issue, use Markdown code blocks.
Describe the behavior you observed after following the steps and point out what exactly is the problem with that behavior.
Explain which behavior you expected to see instead and why.

Provide more context by answering these questions:

Can you reproduce the problem?
Did the problem start happening recently (e.g. after updating to a new version of CG) or was this always a problem?
If the problem started happening recently, can you reproduce the problem in an older version of CG? What's the most recent version in which the problem doesn't happen? You can test and run older versions of CG in the stage environments by using the update-cg-stage.sh script.

Include details about your configuration and environment:

Which version of CG are you using? You can get the exact version by running cg --version in your terminal.
What's the name of the environment you're using?

Suggesting Enhancements

This section guides you through submitting an enhancement suggestion for CG, including completely new features and minor improvements to existing functionality. Following these guidelines helps maintainers and the community understand your suggestion 📝 and find related suggestions 🔎

How Do I Submit A (Good) Enhancement Suggestion?

Enhancement suggestions are tracked as GitHub issues. To suggest an enhancement, create an issue in the repository and provide the following information:

Use a clear and descriptive title for the issue to identify the suggestion.
Provide a step-by-step description of the suggested enhancement in as much detail as possible.
Provide specific examples to demonstrate the steps. Include copy/pasteable snippets which you use in those examples, as Markdown code blocks.
Describe the current behavior and explain which behavior you expected to see instead and why.
Explain why this enhancement would be useful.

Local Development

NEVER USE PREINSTALLED PYTHON

First of all, make sure that you manage the Python versions used on your machine yourself; never use the OS-native Python. Suggested ways to handle Python versions are Homebrew (macOS), pyenv or conda.

For local development, it is recommended to use Poetry. Ensure that you have Poetry installed and run

poetry install

On our servers, where the production and stage versions of CG are run, the packages are maintained using conda environments. For local development, we suggest following the Python packaging guidelines and managing your local Python environment with Poetry.

Pull Requests

The process described here has several goals:

Maintain CG's quality
Engage the developers in working toward the best possible CG
Enable a sustainable system for CG's maintainers to review contributions

Please follow these steps to have your contribution considered by the maintainers:

Follow all instructions in the template
Follow the styleguides
After you submit your pull request, verify that all status checks are passing
What if the status checks are failing?
If a status check is failing, and you believe that the failure is unrelated to your change, please leave a comment on the pull request explaining why you believe the failure is unrelated. A maintainer will re-run the status check for you.
Update CHANGELOG.md with relevant information

While the prerequisites above must be satisfied prior to having your pull request reviewed, the reviewer(s) may ask you to complete additional design work, tests, or other changes before your pull request can be ultimately accepted.

Styleguides

Git Commit Messages

Use the present tense ("Add feature" not "Added feature")
Limit the first line to 72 characters or fewer
Reference issues and pull requests liberally after the first line
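
If you want to check a message before committing, the first two rules above can be sketched as a small helper. This is only an illustration and not part of CG: `check_commit_message` and the short list of past-tense words are assumptions made for this example, and the tense check is a rough heuristic that only catches a few common verbs.

```python
MAX_SUBJECT_LENGTH = 72  # limit taken from the guideline above

# Hypothetical, deliberately short list of past-tense verbs to flag.
PAST_TENSE_WORDS = {"added", "fixed", "changed", "removed", "updated"}


def check_commit_message(message: str) -> list[str]:
    """Return a list of style problems found in a commit message."""
    problems = []
    lines = message.splitlines()
    subject = lines[0] if lines else ""
    if not subject.strip():
        problems.append("subject line is empty")
    if len(subject) > MAX_SUBJECT_LENGTH:
        problems.append(
            f"subject line is {len(subject)} characters, limit is {MAX_SUBJECT_LENGTH}"
        )
    words = subject.split()
    if words and words[0].lower() in PAST_TENSE_WORDS:
        problems.append(f"use the present tense instead of '{words[0]}'")
    return problems


if __name__ == "__main__":
    for problem in check_commit_message("Added feature"):
        print(problem)
```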

Python styleguide

We use black to format all files. This is done automatically with each push on GitHub, so don't forget to update your local branch with git pull after pushing to the origin. More details are described in the general development manual.

Design decisions

This package is a little special. Essentially it should include all the "Clinical"-specific code that has to be integrated across multiple tools such as LIMS, Trailblazer, Scout etc. However, we still aim to structure it in such a way as to make maintenance as smooth as possible!

Apps

This part of the package contains connectors to the various tools that we integrate with. An app interface can be a wrapper for an external tool like Trailblazer (tb) or be implemented completely in cg like lims. It's very important that the code stays confined to each individual tool. The Housekeeper connector cannot directly talk to Trailblazer for example - such communication has to go through a meta module.

We also try to group all app-related imports and functionality in these interfaces. You shouldn't import e.g. a function from Scout from any other place than its app interface. This way, it's easier to overview if an update to an external package will affect the rest of the system.
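
The separation described above can be sketched roughly as follows. All class and method names here (`FakeHousekeeperApp`, `FakeTrailblazerApp`, `StoreAnalysisMeta`) are hypothetical stand-ins invented for this example, not the real interfaces in cg. The point is only that each app knows about its own tool and a meta module coordinates them.

```python
class FakeHousekeeperApp:
    """Hypothetical app interface: only knows how to store files."""

    def __init__(self):
        self._files = {}

    def add_file(self, case_id: str, path: str) -> None:
        self._files.setdefault(case_id, []).append(path)

    def files(self, case_id: str) -> list[str]:
        return list(self._files.get(case_id, []))


class FakeTrailblazerApp:
    """Hypothetical app interface: only knows about analysis output."""

    def __init__(self):
        self._output = {}

    def mark_completed(self, case_id: str, output_files: list[str]) -> None:
        self._output[case_id] = list(output_files)

    def completed_output(self, case_id: str) -> list[str]:
        return list(self._output.get(case_id, []))


class StoreAnalysisMeta:
    """Hypothetical meta module: the only place where the two apps meet."""

    def __init__(self, hk_app: FakeHousekeeperApp, tb_app: FakeTrailblazerApp):
        self.hk_app = hk_app
        self.tb_app = tb_app

    def store_completed(self, case_id: str) -> int:
        """Copy file paths from a completed analysis into file storage."""
        paths = self.tb_app.completed_output(case_id)
        for path in paths:
            self.hk_app.add_file(case_id, path)
        return len(paths)
```

Neither app imports or calls the other, so a change in one external tool should only touch its own interface and the meta modules that use it.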

Coverage

Interface to Chanjo. It is used to load coverage information from Sambamba output.

Invoice

Internal app for working with invoices of groups of samples or pools.

Lims

Internal app for interfacing with the Clarity LIMS API. We use the genologics Python API as much as possible. Some actions are not supported, however, and then we fall back to using the official XML-based REST API directly.

We convert all the info that we get from LIMS/genologics to dictionaries before passing it along to other tools. We don't pass around objects that have some implicit connection to update things in LIMS - such actions need to go through the lims app interface explicitly.
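
A minimal sketch of that conversion, under stated assumptions: `FakeLimsSample` is a stand-in written for this example and not a genologics class, and the field and UDF names (`"Received at"`) are hypothetical.

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass
class FakeLimsSample:
    """Stand-in for an object that can write back to LIMS (hypothetical)."""

    id: str
    name: str
    udf: dict = field(default_factory=dict)

    def put(self) -> None:
        # A real LIMS object could push changes to the server here.
        raise RuntimeError("this would update LIMS")


def sample_to_dict(lims_sample: FakeLimsSample) -> dict:
    """Copy the values we need into a plain dict with no link back to LIMS."""
    return {
        "id": lims_sample.id,
        "name": lims_sample.name,
        "received": lims_sample.udf.get("Received at"),
    }


if __name__ == "__main__":
    sample = FakeLimsSample("ACC1", "sample1", {"Received at": date(2020, 1, 1)})
    print(sample_to_dict(sample))
```

The returned dictionary has no `put` method, so code outside the lims app cannot update LIMS by accident.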

Trailblazer (tb)

Interface to Trailblazer.

Monitor analysis workflow status

Genotype (gt)

Interface to Genotype. Used to upload genotype results from the workflow so that they can be compared to validate that no samples have been mixed up.

Housekeeper (hk)

Interface to Housekeeper. For storing files from analysis runs and FASTQ files from demultiplexing.

Loqus

Interface to LoqusDB. For loading observation counts from the analysis output.

Osticket

Internal app for opening tickets in SupportSystems. We use this mainly to link a ticket with the opening of an order for new samples/analyses.

Scout (scoutapi)

Interface to Scout. For uploading analysis results to Scout. It's also used to generate the gene panel files used in the analysis workflow.

Delivery report

Module to generate Delivery Reports. This module is designed to convey the results of genetic analysis to the customer. It includes information on sample characteristics, laboratory preparation, sequencing attributes, as well as data analysis performance and limitations.

Cli

The command line code is written in the Click framework.

Add

This set of commands lets you quite easily add things to the status database. For example, when a new customer is signed you could run:

cg add customer cust101 "Massachusetts Institute of Technology"

You can also accomplish similar tasks through the admin interface of the REST server.

Transfer

Lims

Includes: status, lims

Some info is primarily stored in LIMS and needs to be synchronized over to status. This is the case for both the date when a sample was received and the date when it was finally delivered. This interface is intended to run continuously as part of a crontab job.

cg transfer lims --status received

And similarly for filling in the delivery date:

cg transfer lims --status delivered

Flowcell

Includes: stats, hk, status

The API accepts the name of a flow cell which will be looked up in stats. For all samples on the flow cell it will:

Check if the quality (Q30) is good enough to include the sequencing results.
Update the number of reads that the sample has been sequenced for overall and match this with the requirement given by the application.
Accordingly, look up FASTQ files and store them using hk.
If a sample has a sufficient number of reads, fill in the sequenced_at date (status) according to the sequencing date of the most recent flow cell.
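
The per-sample logic in the steps above can be sketched like this. It is a simplified illustration, not the code in cg: the `Q30_THRESHOLD` value, the two dataclasses and `update_sample` are all assumptions made for the example, and the FASTQ lookup through hk is left out.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

Q30_THRESHOLD = 75.0  # hypothetical cutoff, the real value may differ


@dataclass
class FlowcellResult:
    """Sequencing result for one sample on one flow cell (hypothetical)."""

    flowcell: str
    reads: int
    q30: float
    sequenced_at: datetime


@dataclass
class SampleStatus:
    """The fields of a status sample record that this sketch touches."""

    expected_reads: int
    reads: int = 0
    sequenced_at: Optional[datetime] = None


def update_sample(sample: SampleStatus, results: list[FlowcellResult]) -> SampleStatus:
    # Step 1: only keep results whose quality is good enough.
    passing = [result for result in results if result.q30 >= Q30_THRESHOLD]
    # Step 2: update the total number of reads across all passing flow cells.
    sample.reads = sum(result.reads for result in passing)
    # Step 4: with enough reads, use the date of the most recent flow cell.
    if passing and sample.reads >= sample.expected_reads:
        sample.sequenced_at = max(result.sequenced_at for result in passing)
    return sample
```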

Server

The REST API server handles a number of actions. It's written in Flask and exposes an admin interface for quickly editing information in the backend MySQL database. The admin interface is currently served under a hidden route, but the plan is to protect it with Google OAuth.

The API is protected by JSON Web Tokens generated by Google OAuth. It authorizes access using the user table in the database.

Order endpoint

The /order/<type> endpoint accepts orders for new samples. If you supply a JSON document in the expected format, a new order is opened in status and LIMS.

Store

This really is the status app more or less. It's the interface to the central database that keeps track of samples and which state they are currently in. All records that enter the database go through this API. Simple updates to properties on records are handled directly on the model instances followed by a manual commit.

Misc.

There's one file for storing all constants like how priority levels are translated between the database representation and the human readable equivalent.
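
As an illustration of the kind of constant meant here, a priority translation could look like the sketch below. The names and the numeric values are assumptions for this example and may not match the actual constants file.

```python
# Hypothetical mapping: human readable name -> database representation.
PRIORITY_MAP = {"research": 0, "standard": 1, "priority": 2, "express": 3}

# Reverse mapping: database representation -> human readable name.
REV_PRIORITY_MAP = {value: key for key, value in PRIORITY_MAP.items()}


def priority_to_db(name: str) -> int:
    """Translate a human readable priority into its database value."""
    return PRIORITY_MAP[name]


def priority_to_human(value: int) -> str:
    """Translate a database value into its human readable priority."""
    return REV_PRIORITY_MAP[value]
```

Keeping both directions next to each other in one file means a new priority level only has to be added in one place.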

Another module /exc.py contains the custom Exception classes that are used across the package.
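
A sketch of how such a module is commonly laid out, with one shared base class so callers can catch every package-specific error at once. The class names below are hypothetical and chosen for this example; check exc.py for the ones that actually exist.

```python
class CgError(Exception):
    """Hypothetical base class for all errors raised by the package."""

    def __init__(self, message: str):
        super().__init__(message)
        self.message = message


class OrderError(CgError):
    """Hypothetical error for an order that cannot be accepted."""


class LimsDataError(CgError):
    """Hypothetical error for unexpected or missing data in LIMS."""


if __name__ == "__main__":
    try:
        raise OrderError("sample name already in use")
    except CgError as error:
        # Catching the base class also catches every subclass.
        print(f"order failed: {error.message}")
```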


