Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Create Code to Characterize a Given OMOP CDM Database #9

Open
12 of 24 tasks
TheCedarPrince opened this issue Oct 18, 2021 · 1 comment
Open
12 of 24 tasks
Assignees
Labels
enhancement New feature or request Medium Medium priority issue Study Related to the study component

Comments

@TheCedarPrince
Copy link
Collaborator

TheCedarPrince commented Oct 18, 2021

Is your feature request related to a problem? Please explain.

As we have defined our concept sets in #1 and are in the process of wrapping up the automation of creating these concept sets in #2, we are now well-positioned to start moving forward to characterizations of these populations.

Describe the solution you'd like

As a first pass, let's focus on some very basic characterizations of a given dataset.
The tasking can be broken up as follows over a few different population definitions - I also would suggest we do each population definition break down as its own separate PR:

  • Inpatient

    • Total unique patients
    • Total unique patients by gender (at admit)
    • Total unique patients by age (at admit)
    • Total unique patients by race
    • Total unique patients by location (i.e. state)
    • Total patients by care site type (any visit to capture trend of care site visits)
    • Total patients by condition (anyone with a diagnosis)
  • Outpatient

    • Total unique patients
    • Total unique patients by gender (at admit)
    • Total unique patients by age (at admit)
    • Total unique patients by race
    • Total unique patients by location (i.e. state)
    • Total patients by care site type (any visit to capture trend of care site visits)
    • Total patients by condition (anyone with a diagnosis)
  • Entire database

    • Total unique patients
    • Total unique patients by gender (at admit)
    • Total unique patients by age (at admit)
    • Total unique patients by race
    • Total unique patients by location (i.e. state)
    • Total patients by care site type (any visit to capture trend of care site visits)
    • Total patients by condition (anyone with a diagnosis)

Describe alternatives you've considered

Currently, I looked at the HADES package suite but no luck finding a package that does this sort of characterization. The one that may be useful in the future is this: https://ohdsi.github.io/FeatureExtraction/index.html

@TheCedarPrince TheCedarPrince added the enhancement New feature or request label Oct 18, 2021
@TheCedarPrince
Copy link
Collaborator Author

TheCedarPrince commented Oct 18, 2021

Note, we have access to a SynPUF database by which we can prototype queries against.
OHDSI provides the great package, DatabaseConnector that makes connecting to a DB easy.
Here is an example of how to connect to a database using DatabaseConnector:

connectionDetails <- createConnectionDetails(
  pathToDriver = "./R/utils",
  dbms = "postgresql",
  server = "server/host",
  user = "matcha",
  password = "lavendar",
  port = 1234
)

conn <- connect(connectionDetails)

sql <- "
   SELECT PERSON_ID, YEAR_OF_BIRTH, RACE_CONCEPT_ID
   FROM person
   LIMIT 1;
   "

querySql(conn, sql)

disconnect(conn)

That should help with creating prototype scripts and queries.

@TheCedarPrince TheCedarPrince added Study Related to the study component Medium Medium priority issue labels Nov 5, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request Medium Medium priority issue Study Related to the study component
Projects
None yet
Development

No branches or pull requests

2 participants