This repo contains the code for the paper *Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors* by Zeyu Yun*, Yubei Chen*, Bruno A. Olshausen, and Yann LeCun (DeeLIO Workshop @ NAACL 2021).
The demo is available here: Demo
To visualize the hidden states in terms of transformer factors, we first train a dictionary and then infer the sparse codes using that dictionary.
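Concretely, each hidden state x is modeled as a sparse linear superposition of dictionary elements (transformer factors), x ≈ Φα with a sparse code α. A minimal sketch of this decomposition, using scikit-learn's `SparseCoder` as a stand-in for the repo's own solver, with hypothetical sizes:

```python
import numpy as np
from sklearn.decomposition import SparseCoder

# Hypothetical sizes; the repo learns Phi from real transformer hidden states.
rng = np.random.default_rng(0)
n_factors, hidden_dim = 128, 768
Phi = rng.standard_normal((n_factors, hidden_dim))
Phi /= np.linalg.norm(Phi, axis=1, keepdims=True)  # unit-norm factors

x = rng.standard_normal((1, hidden_dim))           # one hidden state
coder = SparseCoder(dictionary=Phi,
                    transform_algorithm="lasso_lars", transform_alpha=0.5)
alpha = coder.transform(x)                         # sparse code, shape (1, 128)
print("active factors:", np.count_nonzero(alpha))  # reconstruction: alpha @ Phi
```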
To train a dictionary, run:

```
python train.py
```
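For intuition, the training step can be sketched with scikit-learn's `MiniBatchDictionaryLearning`; this is a stand-in for what `train.py` does, not the repo's actual implementation, and `X` below is random placeholder data:

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

# X stands in for a matrix of transformer hidden states,
# shape (n_tokens, hidden_dim); sizes here are hypothetical.
X = np.random.default_rng(0).standard_normal((2000, 768))
learner = MiniBatchDictionaryLearning(n_components=128, alpha=1.0,
                                      batch_size=64, random_state=0)
learner.fit(X)
Phi = learner.components_        # (128, 768): one transformer factor per row
np.save("dictionary.npy", Phi)   # hypothetical save path
```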
To use your own data, put it in a Python list where each element is a string (one sentence), save the list as a `.npy` file, and run:

```
python train.py --training_data ./your_data.npy
```
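For example, a file in the expected format can be produced like this (`your_data.npy` matches the hypothetical name used above):

```python
import numpy as np

# Each list element is one sentence (a plain Python string).
sentences = [
    "The quick brown fox jumps over the lazy dog.",
    "Transformers build contextualized word embeddings.",
]
np.save("your_data.npy", np.array(sentences, dtype=object))
# Object arrays need allow_pickle=True when loaded back:
# np.load("your_data.npy", allow_pickle=True)
```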
To infer the sparse codes and save the top activated examples for each transformer factor, run:

```
python inference_efficient.py --dictionary_dir ./the_path_for_your_trained_dictionary
```
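Roughly, the "top activated examples" for a factor are the inputs whose sparse code assigns that factor the largest coefficient. A hedged sketch with random placeholder codes:

```python
import numpy as np

# alpha stands in for inferred sparse codes, shape (n_examples, n_factors).
alpha = np.abs(np.random.default_rng(0).standard_normal((1000, 128)))
k = 10
top_idx = np.argsort(-alpha, axis=0)[:k]  # top-k example indices per factor
print(top_idx[:, 0])                      # the 10 strongest examples for factor 0
```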
(Optional) To use LIME to generate the attribution (color) map, run:

```
python write_color.py --dictionary_dir ./the_path_for_your_trained_dictionary --example_dir ./the_path_of_your_top_activated_examples
```
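As a rough sketch of what this step does (assuming the `lime` package is installed; `predict_fn` below is a toy stand-in for scoring a sentence against one transformer factor, not the repo's actual scorer):

```python
import numpy as np
from lime.lime_text import LimeTextExplainer

def predict_fn(texts):
    # Toy scorer: "activation" is whether the sentence contains "fox";
    # the repo instead scores sentences via the learned sparse code.
    p = np.array([min(t.lower().count("fox"), 1) for t in texts], dtype=float)
    return np.column_stack([1.0 - p, p])  # (n, 2) pseudo-probabilities

explainer = LimeTextExplainer(class_names=["low activation", "high activation"])
exp = explainer.explain_instance(
    "The quick brown fox jumps over the lazy dog.", predict_fn, num_features=5)
print(exp.as_list())  # (word, weight) pairs -> the attribution (color) map
```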
If you find this repo useful, please consider citing our work:
```bibtex
@inproceedings{Yun2021TransformerVis,
    title = {Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors},
    author = {Yun, Zeyu and Chen, Yubei and Olshausen, Bruno A and LeCun, Yann},
    booktitle = {Proceedings of Deep Learning Inside Out (DeeLIO) NAACL: The Second Workshop on Knowledge Extraction and Integration for Deep Learning Architectures},
    year = {2021},
    publisher = {Association for Computational Linguistics},
}
```