This is the official implementation of the paper: Semi-Autoregressive Neural Machine Translation, which will be presented at EMNLP 2018.
Email: [email protected]
Clone this project
git clone https://github.com/chqiwang/sa-nmt.git
cd sa-nmt
Download data.zip from Google Drive, then decompress it. Make sure the resulting data folder is a subfolder of sa-nmt.
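A minimal sketch of the decompression step (assuming data.zip unpacks to a single data directory; verify the layout after extracting):

unzip data.zip -d .    # run from inside sa-nmt/
ls data                # the dataset files should appear here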
Train the Transformer
python train.py -c configs/transformer.yaml
Then average the last five checkpoints
python third_party/tensor2tensor/avg_checkpoints.py --prefix "model-transformer/model_step_" --checkpoints 96000,97000,98000,99000,100000 --output_path "model-transformer/model_avg"
Evaluate the model

python evaluate.py -c configs/transformer.yaml
(Please replace [K] in the following commands with 2, 4 or 6; a loop that runs all three settings is sketched after these steps.)
Copy the base model
cp -r model-transformer model-sat-[K]
Train the SAT
python train.py -c configs/sat-[K].yaml
Then average the last five checkpoints
python third_party/tensor2tensor/avg_checkpoints.py --prefix "model-sat-[K]/model_step_" --checkpoints 96000,97000,98000,99000,100000 --output_path "model-sat-[K]/model_avg"
Evaluate the model
python evaluate.py -c configs/sat-[K].yaml
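To run all three settings in sequence, here is a shell sketch built from the commands above (the loop wrapper is our addition; the commands themselves are unchanged):

for K in 2 4 6; do
    cp -r model-transformer model-sat-$K
    python train.py -c configs/sat-$K.yaml
    python third_party/tensor2tensor/avg_checkpoints.py --prefix "model-sat-$K/model_step_" --checkpoints 96000,97000,98000,99000,100000 --output_path "model-sat-$K/model_avg"
    python evaluate.py -c configs/sat-$K.yaml
done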
- Each step takes a long time (possibly around a day, depending on your GPU devices).
- By default, we use 8 GPU devices for training and prediction. If you have fewer than 8 GPUs, modify the YAML config files accordingly (num_gpus, batch_size and tokens_per_batch); see the sketch after this list.
- Raise an issue or email me if you run into problems.
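For example, to locate the GPU-related settings before editing them (the key names come from the note above; the replacement values below are illustrative, not recommendations):

grep -nE 'num_gpus|batch_size|tokens_per_batch' configs/*.yaml
# then edit the matching lines, e.g. for a 2-GPU machine:
#   num_gpus: 2
#   and scale batch_size / tokens_per_batch down proportionally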