colloquial dialects #2

Open
maherr13 opened this issue Jul 28, 2024 · 1 comment
Comments

@maherr13

Hello, and thanks for your work.

You mentioned in the paper that the pretraining phase would help the model perform well on colloquial dialects.

I wanted to ask how the model would handle colloquial dialects, for example when fine-tuning the pretrained models on Egyptian dialects.

If it handles them well, how much data would I need to fine-tune the model and get good results in such a case?

@ahmedbr

ahmedbr commented Sep 5, 2024

Hi, I have a similar issue, actually.

I've just used the encoder-decoder pretrained model to train a diacritizer for a Gulf dialect. I didn't make major changes to the provided script; I only supplied the paths to the datasets and the pretrained model. But as training proceeded, both val_loss and val_der got steadily worse. Please have a look at the screenshot below:

[Screenshot: val_loss and val_der during training]

Did I do something wrong? Any explanation?
