
fix: Add Korean AutoRAGRetrieval #1388

Merged: 10 commits merged into embeddings-benchmark:main on Nov 11, 2024
Conversation

@yjoonjang (Contributor) commented on Nov 5, 2024

Adding datasets checklist

Reason for dataset addition:
This is a Korean Retrieval benchmark dataset covering 5 domains: Finance, Public, Medicine, Law, Commerce

  • I have run the following models on the task (adding the results to the PR). These can be run using the mteb -m {model_name} -t {task_name} command; a Python equivalent is sketched below the lint output.
    • sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
      • {"ndcg_at_1": 0.29825, "ndcg_at_5": 0.3973, "ndcg_at_10": 0.42296}
    • intfloat/multilingual-e5-small
      • {"ndcg_at_1": 0.59649, "ndcg_at_5": 0.73205, "ndcg_at_10": 0.75466}
  • I have checked that the performance is neither trivial (both models gain close to perfect scores) nor random (both models gain close to random scores).
  • If the dataset is too big (e.g. >2048 examples), consider using self.stratified_subsampling() under dataset_transform()
  • I have filled out the metadata object in the dataset file (find documentation on it here).
  • Run tests locally to make sure nothing is broken using make test.
  • Run the formatter to format the code using make lint.

ruff format . 			# running ruff formatting
716 files left unchanged
ruff check . --fix  	# running ruff linting
All checks passed!
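
For reference, here is a minimal Python sketch of how the two models in the checklist could be evaluated on this task through mteb's Python API (equivalent to the CLI command above; the output folder name is just a placeholder):

import mteb

# Load the task and the model through mteb so any model-specific prompts are applied.
tasks = mteb.get_tasks(tasks=["AutoRAGRetrieval"])
model = mteb.get_model("intfloat/multilingual-e5-small")

evaluation = mteb.MTEB(tasks=tasks)
evaluation.run(model, output_folder="results")  # writes a per-task JSON results file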
@yjoonjang (Contributor, Author) commented
Hello, I tried to contribute this AutoRAGRetrieval task, but I got some errors while testing. Can I please get some help?
My code is in mteb/tasks/Retrieval/kor/AutoRAGRetrieval.py

Thank you.

Comment on lines 22 to 30
date=None,
form=None,
domains=None,
task_subtypes=None,
license=None,
socioeconomic_status=None,
annotations_creators=None,
dialect=None,
text_creation=None,
@KennethEnevoldsen (Contributor) commented on Nov 5, 2024
If the metadata is not filled out you will get an error in the tests. Let me know if there are any problems with these.
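
For context, the fields listed in the snippet above belong to mteb's TaskMetadata object. Below is a hedged, purely illustrative sketch of how they might be filled in for a retrieval dataset like this one; the values are placeholders, not the ones used in this PR, and the allowed literals should be checked against mteb's TaskMetadata definition:

date=("2024-01-01", "2024-06-30"),           # placeholder collection period (start, end)
form=["written"],
domains=["Government", "Medical", "Legal"],  # placeholder subset of mteb's domain literals
task_subtypes=["Article retrieval"],
license="apache-2.0",                        # placeholder; use the dataset's actual license
socioeconomic_status="mixed",
annotations_creators="human-annotated",
dialect=[],
text_creation="found",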

mteb/tasks/Retrieval/kor/AutoRAGRetrieval.py (review thread resolved)
@KennethEnevoldsen changed the title from "Add AutoRAGRetrieval for Korean embedding model" to "fix: Add Korean AutoRAGRetrieval" on Nov 7, 2024
@KennethEnevoldsen (Contributor) commented
@yjoonjang I think we are almost there. Will you complete the checklist for adding a new dataset?

I also want to double-check that the annotations are human-written and not LLM-generated.

@yjoonjang (Contributor, Author) commented on Nov 10, 2024
Hello, @KennethEnevoldsen

  • I'm sorry, but when I try to test with the AutoRAGRetrieval dataset, I still get this error: "KeyError: 'AutoRAGRetrieval' not found. Did you mean: DuRetrieval?"

    • My command was: mteb run -m sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 -t AutoRAGRetrieval
    • I added my dataset name to Retrieval/__init__.py, but I don't know why I get this issue. Can I get some help, please? (A registration check is sketched after this list.)
  • And yes, this dataset is not LLM-generated.
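
A hedged sketch of how the task is typically made discoverable: the new task module has to be re-exported from the Retrieval package's __init__.py, and the copy of mteb that Python imports has to be the one containing the edit; an older installed copy shadowing the local checkout would produce exactly this KeyError (reinstalling in editable mode, pip install -e ., is one fix). The check below is illustrative:

# mteb/tasks/Retrieval/__init__.py (illustrative excerpt)
from .kor.AutoRAGRetrieval import *

# Quick sanity check from a Python shell; raises KeyError if the task is not registered.
import mteb
task = mteb.get_task("AutoRAGRetrieval")
print(type(task).__name__)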

@Samoed (Collaborator) commented on Nov 10, 2024
I tried to run your code, and maybe you need to reinstall your package to fix your problem, because I can run the task like this: mteb run -m intfloat/multilingual-e5-small -t AutoRAGRetrieval

@yjoonjang (Contributor, Author) commented
I tried to run your code, and maybe you need to reinstall your package to fix your problem, because I can run the task like this: mteb run -m intfloat/multilingual-e5-small -t AutoRAGRetrieval

I still got the same error with the CLI, but I ran the test via Python code instead.
I updated the scores in the checklist above!
It would be great if you could check them.

Thank you, @KennethEnevoldsen @Samoed

@yjoonjang (Contributor, Author) commented
I have deleted descriptive_stats from the TaskMetadata field and moved it to a JSON file by running calculate_metadata_metrics.
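
A rough sketch of the kind of call referenced above, assuming the helper is the task-level calculate_metadata_metrics method; the exact name and behavior should be verified against the current mteb API:

import mteb

# Hedged: recompute the task's descriptive statistics and write them out as JSON.
task = mteb.get_task("AutoRAGRetrieval")
task.load_data()
task.calculate_metadata_metrics()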

@yjoonjang (Contributor, Author) commented on Nov 11, 2024
I got these results: sentence-transformers__paraphrase-multilingual-MiniLM-L12-v2 AutoRAGRetrieval.json and intfloat__multilingual-e5-small AutoRAGRetrieval.json

These correspond to the results I attached in the checklist above!

@Samoed (Collaborator) commented on Nov 11, 2024
Results for the e5-small model are slightly different. I suspect this is because I overlooked that the model should be initialized with model = mteb.get_model(model_name) rather than directly with SentenceTransformer. Using SentenceTransformer directly would skip the prompts.
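
To illustrate the difference being described here (a sketch using the same model name as above):

import mteb
from sentence_transformers import SentenceTransformer

# Loaded through mteb: the wrapper knows about E5's prompt prefixes,
# so queries and passages get prefixed during evaluation.
model = mteb.get_model("intfloat/multilingual-e5-small")

# Loaded directly: a plain SentenceTransformer with no mteb wrapper,
# so the prefixes are skipped and the scores shift slightly.
raw_model = SentenceTransformer("intfloat/multilingual-e5-small")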

@yjoonjang (Contributor, Author) commented
Results for the e5-small model are slightly different. I suspect this is because I overlooked that the model should be initialized with model = mteb.get_model(model_name) rather than directly with SentenceTransformer. Using SentenceTransformer directly would skip the prompts.

Oh yes, I tested with model = mteb.get_model(model_name) and got the same result as yours.
I don't really know why, because e5-small doesn't need prompts.
Do you have any thoughts?
Also, is this the end of the contribution, or do I have to do anything else?

@KennethEnevoldsen (Contributor) commented
I don't really know why, because e5-small doesn't need prompts.

e5 does use prompts, though they are minimal (e.g. "query: ").
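
For example, E5's documented usage prepends these short prefixes to the raw text (the Korean strings below are made up for illustration):

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("intfloat/multilingual-e5-small")

# E5-style prefixes; minimal, but omitting them shifts retrieval scores slightly.
query_emb = model.encode(["query: 자동차 보험의 보장 범위는 어떻게 되나요?"])
passage_emb = model.encode(["passage: 자동차 보험 약관은 보장 범위를 다음과 같이 정의한다 ..."])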

@KennethEnevoldsen (Contributor) left a review comment

Things look good on my end. @Samoed, feel free to merge this in if you feel the same.

@Samoed merged commit f79d9ba into embeddings-benchmark:main on Nov 11, 2024
10 checks passed
@yjoonjang (Contributor, Author) commented
Thank you so much for your help.
Are my points automatically updated, or do I have to add my contribution to the points folder?

@KennethEnevoldsen (Contributor) commented
@yjoonjang we no longer take points for the MMTEB contribution (it has already been submitted). However, you will appear as a contributor on MTEB.
