Question regarding the German translation of SQuAD v1.1 mentioned for GELECTRA fine-tuning #2236
-
Hi, I'm currently trying to fine-tune a model for end-to-end question generation in German. Since I don't know how much training data fine-tuning requires, and most approaches use the English SQuAD dataset, I searched for an automatically translated SQuAD dataset and found one mentioned here: https://huggingface.co/deepset/gelectra-base-germanquad#performance. Could you share that automatic translation of SQuAD v1.1 or, if not, tell me which tools you used to produce it? Thanks in advance and best regards
-
Hey @TiloMichel, the German part of the MLQA dataset was used for the performance comparisons. Hope this helps :)
-
Coincidentally, we just released a blog post on NLP resources beyond English today 😁 https://www.deepset.ai/blog/nlp-resources-beyond-english