Skip to content

v1.2.0

Latest
Compare
Choose a tag to compare
@David-YuWei David-YuWei released this 04 May 16:45
· 242 commits to main since this release
801cdaf

Summary

The “Core Project Remodel” of the Index of NCI Studies (INS) is available now at studycatalog.cancer.gov. The
primary focus of this release is to reorganize the data and display of the INS to center around (core) projects
rather than grants, which addresses one of the major improvements identified in the initial release.
As before, we are only including outputs from projects funded at least in part by extramural grants associated
with the Moonshot and Childhood Cancer Data Initiative (CCDI). The task of identifying and defining
programs remains a challenge to be addressed in future releases of the INS.

KEY FEATURES

  1. Redesigned data architecture to organize grants (formerly known as projects within the INS) into groups of grants. Those groups are now called projects and become the focal point of the INS
    Redesigned the Project Details Page to focus on one project with all associated grants and
    outputs displayed. Project descriptions are pulled from the most recently awarded grant,
    except for the Award Amount Total fields, which display summed values from all associated
    grants.
    Added Project Details Page links to all tabs of the Explore Page to quickly find all information for
    a project from any grant or output of that project
    Updated Explore Page filters, Programs Page, Program Details Page, and Global Search to
    reflect the new roles of projects and grants
    Updated documentation pages (About, Glossary, Release Notes)

  2. Added animation and updated the Stats Bar Graph image on the Home Page

  3. Improved data gathering date filters to reduce false positive associations

  4. Added 3 new manually curated CCDI P30 supplement datasets and projects

Data Details

This release includes the following data totals:
• 2 Programs
• 350 Projects
• 1,211 Grants
• 4,748 Publications
• 1,388 Datasets
• 349 Clinical Trials
• 117 Patents

New in this release of INS are several datasets shared due to CCDI P30 grant supplements to the following
organizations:
• Memorial Sloan-Kettering Cancer Center (MSKCC): phs003111.v1.p1 | 3P30CA008748-54S3
Note: this is an additional dataset shared due to a CCDI P30 grant supplement already
associated with another dataset in the INS (phs002620.v1.p1)
• University of Kansas Medical Center: phs002529.v1.p1 | 3P30CA168524-09S3
• University of Southern California: phs002518.v1.p1 | 3P30CA014089-45S6

Note that due to the indirect nature of grant supplements, these 3 projects and their outputs were gathered
and added to the INS through manual curation rather than the standard automated process.
This release includes a new batch of data gathered using the automated processes. It includes improvements
in the preprocessing pipeline to filter out any outputs published before the earliest associated grant was
awarded. This helps to limit false positive associations.

Projects, grants, publications, datasets, and clinical trials included in this release were gathered on April 4,
2023. Due to instability in the external source from which we gather Patents (USPTO API) in April, patents
included in this release were gathered on March 2, 2023.