Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add notebook with example code for LLMMetadataExtractor #154

Open
julian-risch opened this issue Dec 10, 2024 · 1 comment
Open

Add notebook with example code for LLMMetadataExtractor #154

julian-risch opened this issue Dec 10, 2024 · 1 comment
Labels

Comments

@julian-risch
Copy link
Member

julian-risch commented Dec 10, 2024

We added LLMMetadataExtractor in #125 and recently updated it in #147

What's still missing is a notebook showing how to use it, which we need to link in the experiment catalogue in the repo's readme file. Currently it says coming soon.

Maybe something related to https://colab.research.google.com/github/deepset-ai/haystack-cookbook/blob/main/notebooks/metadata_enrichment.ipynb ?

@sjrl
Copy link
Collaborator

sjrl commented Dec 11, 2024

Here is an example yaml that could help you all get started.

It extracts the company name from text versions of Slide decks.

I had to change the file format to .txt even though it is yaml since GH doesn't support yaml files.

haystack-metadata-extraction-example.txt

@julian-risch julian-risch added P2 and removed P1 labels Dec 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants