Skip to content

AAVSO/observing-campaigns-migration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

observing-campaigns-migration

Overview

The American Association of Variable Star Observers regularly gets requests from researchers for data collection on stars or novae. Since the 1990s, we've conducted over 600 campaigns, all of which required migration to a new application. By using the Chat-GPT API, I saved our staff weeks of time. The API extracted key information from text documents such as “abstract”, “targets”, and “principal investigator” converting them to JSON format in order to populate our database.

migration_demo

Migration

A Python scraper fetched HTML pages from observing campaigns and converted them into markdown to minimize noise. The data was then processed into JSON format, marking uncertain fields as N/A, and sent to the Chat-GPT OpenAI API. To save time, this process was done in parallel, allowing over 600 campaigns to be converted within 3 hours. The resulting data was transferred to a CSV for review and cross-referenced with the VSX star database for linking star information, before being added to the application database.

Usage

Run pipenv run migration [start] [end] [out_file]

where start and end are alert notice ids and out_file is where to output data.

About

Using AI to migrate unstructured text documents to databases

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages