entity-graph-covid19

This project's target is extracting entities and relations among them, then building a graph. We used COVID-19 medical literatures, which is from the COVID-19 Open Research Dataset Challenge (CORD-19).

methodology

During process of extracting entities and relations, our implementation took advantage of pre-trained model BioBERT，which a biomedical language representation model designed for biomedical text mining tasks such as biomedical named entity recognition, relation extraction, question answering, etc. We fininshed fine-tuning for 8 datasets(BC2GM, BC4CHEMD, BC5CDR-chem, BC5CDR-disease, JNLPBA, NCBI-disease, linnaeus, s800) on biomedical named entity recognition, and for 2 datasets(GAD, euadr) on biomedical relation extraction. Our code recalled the Huggingface Tranformers API to excute and finish those series of process. After, we used Spark GraphX to load the extracting results and build COVID-19 entity graph.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.ipynb_checkpoints		.ipynb_checkpoints
README.md		README.md
covid19-graph-build.ipynb		covid19-graph-build.ipynb
extract_ner.py		extract_ner.py
extract_re.py		extract_re.py
fine_tuning_ner.sh		fine_tuning_ner.sh
fine_tuning_re_all.sh		fine_tuning_re_all.sh
graph.json		graph.json
presenting-graph-from-json.ipynb		presenting-graph-from-json.ipynb
processing_data.sh		processing_data.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

entity-graph-covid19

methodology

About

Releases

Packages

Languages

leeivan/entity-graph-covid19

Folders and files

Latest commit

History

Repository files navigation

entity-graph-covid19

methodology

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages