Dependency-Based Self-Attention for Transformer NMT (Deguchi et al., 2019)

Installation

git clone https://github.com/de9uch1/dbsa.git
cd dbsa/
pip install ./

Training a Transformer + DBSA model on ASPEC-JE

1. Extract and preprocess the ASPEC-JE data (including dependency parsing)

export NUM_WORKERS=8  # specify the number of CPUs
bash scripts/prepare-aspec.sh

2. Preprocess the dataset

# binarize the dataset
fairseq-preprocess --source-lang ja --target-lang en \
    --trainpref aspec_ja_en/train \
    --validpref aspec_ja_en/valid \
    --testpref aspec_ja_en/test \
    --destdir data-bin/ \
    --workers $NUM_WORKERS

# deploy the dependency labels
for split in train valid test; do
    for l in ja en; do
        cp aspec_ja_en/$split.$l.dep data-bin/$split.dep.$l
    done
done

3. Train a model

fairseq-hydra-train \
    --config-dir dbsa/configs/ \
    --config-name transformer_dep \
    common.user_dir=dbsa/

4. Evaluate and generate the translations

fairseq-generate \
    data-bin --gen-subset test \
    --user-dir dbsa/ \
    --task translation_dep \
    --path checkpoints/checkpoint_last.pt \
    --max-len-a 1 --max-len-b 50 \
    --post-process \
    --beam 5 --nbest 1

You can also generate dependencies (BPE level)

fairseq-dbsa-generate \
    data-bin --gen-subset test \
    --user-dir dbsa/ \
    --task translation_dep \
    --path checkpoints/checkpoint_last.pt \
    --max-len-a 1 --max-len-b 50 \
    --print-dependency \
    --beam 5 --nbest 1

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
dbsa		dbsa
examples/ja2en		examples/ja2en
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.rst		README.rst
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dependency-Based Self-Attention for Transformer NMT (Deguchi et al., 2019)

Installation

Training a Transformer + DBSA model on ASPEC-JE

1. Extract and preprocess the ASPEC-JE data (including dependency parsing)

2. Preprocess the dataset

3. Train a model

4. Evaluate and generate the translations

About

Releases

Packages

Languages

License

de9uch1/dbsa

Folders and files

Latest commit

History

Repository files navigation

Dependency-Based Self-Attention for Transformer NMT (Deguchi et al., 2019)

Installation

Training a Transformer + DBSA model on ASPEC-JE

1. Extract and preprocess the ASPEC-JE data (including dependency parsing)

2. Preprocess the dataset

3. Train a model

4. Evaluate and generate the translations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages