Skip to content

Find all PDBs associated with a UniProt entry.

Notifications You must be signed in to change notification settings

connyyu/find_PDBs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

Find all PDB structures associated with a UniProt entry.

This repository contains a Python script, written in a Jupyter Notebook, that retrieves all Protein Data Bank (PDB) structures associated with a UniProt entry.

The tool uses APIs from PDBe, UniProt, and RCSB PDB to gather structural data and bibliographic references (including preprints and PubMed IDs) linked to each PDB structure, including newly released structures not yet indexed in UniProt.

Features

  • PDBe API: Retrieve PDB structures associated with a UniProt entry, including recently released ones.
  • UniProt API: Find all PDBs listed in the UniProt entry.
  • RCSB PDB API: Gather references and PubMed IDs (PMIDs) for each structure.

Input

  • UniProt Accession Code (AC): e.g. P40967.

Output

  • List of all associated PDB entries.
  • References and PMIDs for each structure.

Example Usage

Below is an example output generated by the tool:

Example Screenshot

Prerequisites

  • Jupyter Notebook: You can install it via Anaconda or pip.
  • Python 3.x
  • Python libraries: requests, json, pandas, and numpy. Install them using:
    pip install requests pandas numpy

Author

Releases

No releases published

Packages

No packages published