Update code to most recent iteration

Large Language Model Assertion Pipeline (updated - 2/25/24)

This code is for the large language model assertion pipeline. Detailed instructions coming soon!

Run run_umls_synonym_ner.py and run_dataset_ner.py to build NER datasets (recommend using targeted NER prompts instead of broad NER prompts for NER dataset pull)
(Optional - highly recommended) Run run_ner_cosine_similarity.py followed by run_llm_filter_cosine_sim_ner_output.py to filter NER outputs (filter NER outputs to remove the low-yield named entities --> also helpful to review filtered NER outputs and remove those that are not related to your target entity)
Run run_extraction.py to build target-matcher and extract high-yield text from clinical notes
Run run_llm_assertion.py to generate LLM assertions

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
helper_functions.py		helper_functions.py
run_dataset_ner.py		run_dataset_ner.py
run_extraction.py		run_extraction.py
run_finetune_classifier.py		run_finetune_classifier.py
run_llm_assertion.py		run_llm_assertion.py
run_llm_filter_cosine_sim_ner_output.py		run_llm_filter_cosine_sim_ner_output.py
run_ner_cosine_similarity.py		run_ner_cosine_similarity.py
run_umls_synonym_ner.py		run_umls_synonym_ner.py