Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add User Resumes to Training Data for Improved Model Performance #3

Open
sriraamav opened this issue Oct 15, 2024 · 0 comments
Open
Labels

Comments

@sriraamav
Copy link
Collaborator

Description:

The resume parsing system can be enhanced by allowing users to submit their resumes for inclusion in the training data. By continuously updating the training dataset with real-world resumes, the system can improve its accuracy and relevance over time, making it better at identifying skills, categories, and parsing resumes from various industries.

Tips for the issue:

  • Implement an opt-in mechanism for users to consent to their resumes being used for training.
  • Add data anonymization to remove personal and sensitive information from resumes before storing them.
  • Update the pipeline to automatically include new resumes in the dataset used for training.
  • Consider a method for periodic retraining of the model as new resumes are collected.

To do:

  • Ask us to assign the issue.
  • Once assigned, you can start working on the task.
  • Create a pull request (PR).

Resource:

  • Explore techniques for updating machine learning models with new data.
  • Review best practices for data anonymization and privacy.

Notes:
The task is assigned on a first-come, first-serve basis, and the contributor must report progress every 3 days to ensure active development.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant