GitHub

Transformer-based dialect identification

Description:

This is an end-to-end DID model based on the transformer neural network architecture.

All the experiences are carried out on the ADI17 dataset.(http://groups.csail.mit.edu/sls/downloads/adi17/)

All the results of this experience have been summited to IALP 2020 conference. (http://www.colips.org/conferences/ialp2020/wp/)

Wanqiu Lin, Maulik Madhavi, Rohan Kumar Das and Haizhou Li, "Transformer-based Arabic Dialect Identification," International Conference on Asian Language Processing (IALP), 4-6 Dec. 2020.

Install:

Python3 (recommend Anaconda)

PyTorch 0.4.1+

Kaldi (just for feature extraction)

Work flow:

step 1: run prep_data.sh(for prepare data and shuffle)

step 2: run extract_feat.sh(for extract acoustic features)

step 3:run run_train.sh(for training model)

step 4:run base_line.py(for test model)

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
conf		conf
scripts		scripts
README.md		README.md
attention.py		attention.py
base_line.py		base_line.py
encoder.py		encoder.py
extract_feat.sh		extract_feat.sh
language_id_initial		language_id_initial
m5data.py		m5data.py
module.py		module.py
path.sh		path.sh
prep_acc.sh		prep_acc.sh
prep_data.sh		prep_data.sh
run_train.sh		run_train.sh
train_model.py		train_model.py
transformer.py		transformer.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformer-based dialect identification

Description:

Install:

Work flow:

About

Releases

Packages

Languages

Guibeen/ADI17

Folders and files

Latest commit

History

Repository files navigation

Transformer-based dialect identification

Description:

Install:

Work flow:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages