From Text to Tables: A Local Privacy Preserving Large Language Model for Structured Information Retrieval from Medical Documents

Note: This documentation is currently under construction. Some sections may be updated or changed as development progresses.

General Setup Instructions

Before running the scripts, please ensure the following setup steps are completed:

Python Installation: Make sure Python is installed on your system. The scripts are compatible with Python 3.8.
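Dependency Installation: Install the required Python packages from the requirements file provided with the repository:
```
pip install -r requirements.txt
```
The repository's file listing names this file `requirement.txt` (singular); if that is the name in your checkout, pass that file name to `pip install -r` instead.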

Data Preparation

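Place your dataset files (the ground truth CSV and, for the confusion matrix script, the predictions JSONL file) in locations you can read from the directory where you run the scripts. Each script takes these paths as command-line arguments.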
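As a rough illustration of the two file types the usage sections refer to (a ground truth CSV and a predictions JSONL file), the following sketch reads both with the Python standard library. The column names (`report_id`, `symptom_x`) and the in-memory sample data are hypothetical; the actual files may be laid out differently.

```python
import csv
import io
import json


def read_ground_truth(handle):
    """Read a CSV file object into a list of dicts, one per row."""
    return list(csv.DictReader(handle))


def read_predictions(handle):
    """Read a JSONL file object: one JSON object per non-empty line."""
    return [json.loads(line) for line in handle if line.strip()]


# Hypothetical sample data standing in for real files on disk.
sample_csv = io.StringIO("report_id,symptom_x\n1,1\n2,0\n")
sample_jsonl = io.StringIO(
    '{"report_id": "1", "symptom_x": 1}\n'
    '{"report_id": "2", "symptom_x": 1}\n'
)

ground_truth = read_ground_truth(sample_csv)
predictions = read_predictions(sample_jsonl)
print(len(ground_truth), len(predictions))
```

With real data, the same functions can be given file objects from `open(path, newline="")` for the CSV and `open(path)` for the JSONL file. Note that `csv.DictReader` returns every value as a string, so labels may need converting before comparison.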

Script-Specific Instructions

MIMIC Features Extraction Script (`extract_mimic_features_from_report.py`)

This Python script extracts and analyzes specific medical features from patient reports using a predefined grammar and prompt.

Usage

Run the script from the command line by specifying the path to your MIMIC ground truth data:

python extract_mimic_features_from_report.py path/to/MIMIC_groundtruth.csv
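The script's internals are not documented here, so the following is only a minimal sketch of the general idea behind prompt-plus-grammar extraction: a prompt asks a model for a fixed set of features, and the output is held to a machine-readable shape so that each report becomes one table row. The feature names, the prompt wording, and the `fake_model` stand-in are all hypothetical; in practice a local language model, with its output constrained by a grammar, would take the place of `fake_model`.

```python
import json

# Hypothetical feature names; the script's actual feature list may differ.
FEATURES = ["feature_a", "feature_b"]

PROMPT_TEMPLATE = (
    "Read the report and answer with a JSON object containing the keys "
    "{keys}, each set to true or false.\n\nReport:\n{report}\n"
)


def build_prompt(report):
    """Fill the prompt template with the report text and the feature keys."""
    return PROMPT_TEMPLATE.format(keys=", ".join(FEATURES), report=report)


def parse_features(raw_output):
    """Parse model output and keep only the expected boolean features."""
    data = json.loads(raw_output)
    missing = [name for name in FEATURES if name not in data]
    if missing:
        raise ValueError(f"missing features: {missing}")
    return {name: bool(data[name]) for name in FEATURES}


def fake_model(prompt):
    """Stand-in for a local model call; always returns the same JSON."""
    return '{"feature_a": true, "feature_b": false}'


row = parse_features(fake_model(build_prompt("Example report text.")))
print(row)
```

Validating the parsed output against the expected keys keeps malformed or incomplete answers out of the resulting table, even when a grammar already restricts what the model can emit.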

Confusion Matrix Analysis Script (`confusionmatrix.py`)

This Python script generates confusion matrices that compare a classification model's predictions against a ground truth dataset, giving a visual summary of the model's performance.

Usage

Run the script from the command line by specifying the paths to your ground truth data (CSV) and your predictions (JSONL), in that order:

python confusionmatrix.py path/to/ground_truth.csv path/to/predictions.jsonl
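To make the comparison concrete, here is a minimal sketch of how the four cells of a binary confusion matrix can be counted for one feature. It assumes labels are encoded as 0 or 1 and already aligned by report; the function name and sample labels are hypothetical and are not taken from `confusionmatrix.py`.

```python
from collections import Counter


def confusion_counts(truth, predicted):
    """Count TP, FP, FN, TN for two equal-length lists of 0/1 labels."""
    if len(truth) != len(predicted):
        raise ValueError("truth and predicted must have the same length")
    counts = Counter()
    for t, p in zip(truth, predicted):
        if t == 1 and p == 1:
            counts["tp"] += 1  # true positive
        elif t == 0 and p == 1:
            counts["fp"] += 1  # false positive
        elif t == 1 and p == 0:
            counts["fn"] += 1  # false negative
        else:
            counts["tn"] += 1  # true negative (assumes 0/1 labels only)
    return {key: counts[key] for key in ("tp", "fp", "fn", "tn")}


# Hypothetical labels for one feature across five reports.
truth = [1, 0, 1, 1, 0]
predicted = [1, 1, 0, 1, 0]
matrix = confusion_counts(truth, predicted)
print(matrix)
```

The four counts can then be arranged as a 2x2 grid and passed to a plotting library for visualization.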

Accuracy Comparison Script (`accuracy_comparison.py`)

This Python script compares different machine learning models by calculating and visualizing each model's accuracy per symptom.

Usage

Run the script from the command line with the path to your ground truth data:

python accuracy_comparison.py path/to/ground_truth.csv
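As a minimal sketch of a per-symptom accuracy comparison, the following computes, for each model and each symptom, the fraction of reports where the prediction matches the ground truth. The symptom names, model names, and labels are hypothetical, and how `accuracy_comparison.py` loads each model's predictions is not shown here.

```python
def accuracy(truth, predicted):
    """Fraction of positions where the predicted label equals the truth."""
    if not truth or len(truth) != len(predicted):
        raise ValueError("need two non-empty lists of equal length")
    correct = sum(1 for t, p in zip(truth, predicted) if t == p)
    return correct / len(truth)


def compare_models(truth_by_symptom, predictions_by_model):
    """Return {model: {symptom: accuracy}} for every model and symptom."""
    return {
        model: {
            symptom: accuracy(truth, preds[symptom])
            for symptom, truth in truth_by_symptom.items()
        }
        for model, preds in predictions_by_model.items()
    }


# Hypothetical symptoms, models, and labels.
truth_by_symptom = {"symptom_x": [1, 0, 1, 0], "symptom_y": [0, 0, 1, 1]}
predictions_by_model = {
    "model_a": {"symptom_x": [1, 0, 1, 1], "symptom_y": [0, 0, 1, 1]},
    "model_b": {"symptom_x": [0, 0, 1, 0], "symptom_y": [1, 0, 1, 0]},
}
results = compare_models(truth_by_symptom, predictions_by_model)
print(results)
```

The nested dictionary maps directly onto a grouped bar chart, with one group per symptom and one bar per model.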

