-
Notifications
You must be signed in to change notification settings - Fork 0
ModelSearch
The bioCADDIE DDI is developing metadata models for the data and content being indexed. These metadata models are being incorporated within the structure of the Elasticsearch documents being indexed. The example below is from the PDB dataset 1K0S.
Inspecting the JSON document returned for the search above one can identify a number of sections within the returned _source data:
- dataItem - information about the data record itself
- citation - information about the primary citation(s) for the data record
- identifiers - the identifiers related to the data record
- materialEntity - information about material entities associated with the data record
- organism - information about organism associated with the data record
- dataResource - information about the data repository that houses this data record
The json representation of citation information is as follows:
"citation":
{
"DOI" : "doi:10.1038/nsb753",
"PMID" : "pmid:11799399",
"author" : {
"name" : [
"Griswold, I.J.",
"Zhou, H.",
"Matison, M.",
"Swanson, R.V.",
"McIntosh, L.P.",
"Simon, M.I.",
"Dahlquist, F.W."
]
},
"firstPage" : "121",
"journal" : "Nat.Struct.Biol.",
"journalISSN" : "1072-8368",
"lastPage" : "125",
"title" : "The solution structure and interactions of CheW from Thermotoga maritima.",
"year" : "2002"
}
In order to search specifically within the metadata model one needs to construct a search that targets the specific element to be searched.
- Title search: [Thermotoga maritima](https://data.biocaddie.org/biocaddie/pdb/_search?q=citation.title:"Thermotoga maritima") [https://data.biocaddie.org/biocaddie/pdb/_search?q=citation.title:"Thermotoga maritima"](https://data.biocaddie.org/biocaddie/pdb/_search?q=citation.title:"Thermotoga maritima")
- PMID search: pmid:11799399
bioCADDIE is supported by the National Institutes of Health through the NIH Big Data to Knowledge, Grant 1U24AI117966-01.