Axis-Tour

Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings
Hiroaki Yamagiwa, Yusuke Takase, Hidetoshi Shimodaira
EMNLP 2024 Findings

Setup

This repository is intended to be run in a Docker environment. If you are not familiar with Docker, please install the packages listed in requirements.txt.

Docker build

Please create a Docker image as follows:

bash script/docker_build.sh

Environment variables

# Set OPENAI API key
export OPENAI_API_KEY="sk-***"

# Set the DOCKER_HOME to specify the path of the directory to be mounted as the home directory inside the Docker container
export DOCKER_HOME="path/to/your/docker_home"

Docker run

bash script/docker_run.sh

Preliminary

Access to the embeddings used in our paper

Instead of recomputing the embeddings, you can access the embeddings used in the paper through the following links. Note that sign flip was not applied to the ICA-transformed embeddings to ensure that the skewness of the axes remains positive.

Raw embeddings

GloVe (Google Drive)

Place the downloaded file under the directory output/raw_embeddings as shown below:

$ ls output/raw_embeddings/
raw_glove.pkl

PCA-transformed and ICA-transformed embeddings

GloVe (Google Drive)

Place the downloaded file under the directory output/pca_ica_embeddings/ as shown below:

$ ls output/pca_ica_embeddings/
pca_ica_glove.pkl

Axis Tour embedidngs

GloVe (Google Drive) for $k=100$

Place the downloaded file under the directory output/axistour_embeddings/ as shown below:

$ ls output/axistour_embeddings/
axistour_top100_glove.pkl

POLAR-applied GloVe embeddings (in necessary) [4]

POLAR for Random Selection, Variance Maximization, and Orthogonality Maximization (Google Drive)

Place the downloaded files under the directory output/polar_glove_embeddings/ as shown below:

$ ls output/polar_glove_embeddings/
orthogonal_antonymy_gl_500_StdNrml.bin rand_antonym_gl_500_StdNrml.bin variance_antonymy_gl_500_StdNrml.bin

Download GloVe for reproducibility experiments

Create the data/embeddings directory:

mkdir -p data/embeddings

Download GloVe embeddings as follows:

wget https://nlp.stanford.edu/data/glove.6B.zip
unzip glove.6B.zip -d data/embeddings/glove.6B

For more details, please refer to the original repository: stanfordnlp/GloVe.

LKH solver for TSP

Similar to [1], download LKH solver for TSP:

wget http://webhotel4.ruc.dk/~keld/research/LKH-3/LKH-3.0.6.tgz
tar xvfz LKH-3.0.6.tgz

word-embeddings-benchmarks

We have modified the repository word-embeddings-benchmarks [5]. To install it, use the following commands:

cd word-embeddings-benchmarks
pip install -e .
cd ..

Code

Save embeddings for reproducibility experiments

PCA and ICA

Similar to [2], calculate PCA-transformed and ICA-transformed embeddings as follows:

python save_pca_and_ica_embeddings.py --emb_type glove

Axis Tour

python save_axistour_embeddings.py --emb_type glove --topk 100

Scatterplots

This will generate the scatterplot shown above:

python make_scatterplots.py --emb_type glove --topk 100 --left_axis_index 86 --length 9

If you are not using adjustText==1.0.4, you may need to manually adjust the position of the text.

Save top words for Axis Tour embeddings

To save the top 5 words of the Axis Tour embeddings ($k=100$), run the following:

python save_axistour_top_words.py --emb_type glove --topk 100 --top_words 5

The output file will be as follows:

$ head -n 6 output/axistour_top_words/glove_top100-top5_words.csv
axis_idx,top1_word,top2_word,top3_word,top4_word,top5_word
0,phaen,sandretto,nakhchivan,burghardt,regno
1,region,goriška,languedoc,regions,saguenay-lac-saint-jean
2,mountain,mount,mountains,everest,peaks
3,stage,vinokourov,vuelta,stages,magicians
4,italy,italian,di,francesco,pietro

Histogram of cosine similarities

python make_cossim_histogram.py --emb_type glove --topk 100

Quantitative evaluation of semantic continuity by GPT models

python eval_continuity_by_OpenAI_API.py

Dimensionality reduction

Calculate POLAR-applied embeddings (if necessary)

git clone https://github.com/Sandipan99/POLAR.git

Then

cp polar_glove.ipynb POLAR/

Run POLAR/polar_glove.ipynb to generate embeddings saved under output/polar_glove_embeddings/ with the filename ${method}_gl_500_StdNrml.bin.

Evaluation

python make_dimred_figure.py --emb_type glove --fig_type main

Relation between skewness and the average of two cosine similarities

python make_relation_skewness_and_two_cossims.py --emb_type glove --topk 100

(a) Axis Tour

(b) Skewness Sort

Reference

[1] Sato. Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem. NAACL. 2022.

[2] Yamagiwa et al. Discovering Universal Geometry in Embeddings with ICA. EMNLP. 2023.

[3] Chelba et al. One billion word benchmark for measuring progress in statistical language modeling. INTER-SPEECH. 2014.

[4] Mathew, et al. The POLAR framework: Polar opposites enable interpretability of pretrained word embeddings. Web Conference. 2020.

[5] Jastrzebski et al. How to evaluate word embeddings? On importance of data efficiency and simple supervised tasks. arXiv. 2017.

Citation

If you find our code or model useful in your research, please cite our paper:

@inproceedings{yamagiwa-etal-2024-axis,
    title = "Axis Tour: Word Tour Determines the Order of Axes in {ICA}-transformed Embeddings",
    author = "Yamagiwa, Hiroaki  and
      Takase, Yusuke  and
      Shimodaira, Hidetoshi",
    editor = "Al-Onaizan, Yaser  and
      Bansal, Mohit  and
      Chen, Yun-Nung",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2024",
    month = nov,
    year = "2024",
    address = "Miami, Florida, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-emnlp.28",
    pages = "477--506",
    abstract = "Word embedding is one of the most important components in natural language processing, but interpreting high-dimensional embeddings remains a challenging problem. To address this problem, Independent Component Analysis (ICA) is identified as an effective solution. ICA-transformed word embeddings reveal interpretable semantic axes; however, the order of these axes are arbitrary. In this study, we focus on this property and propose a novel method, Axis Tour, which optimizes the order of the axes. Inspired by Word Tour, a one-dimensional word embedding method, we aim to improve the clarity of the word embedding space by maximizing the semantic continuity of the axes. Furthermore, we show through experiments on downstream tasks that Axis Tour yields better or comparable low-dimensional embeddings compared to both PCA and ICA.",
}

Note

Since the URLs of published embeddings may change, please refer to the GitHub repository URL instead of the direct URL when referencing in papers, etc.
This directory was created by Hiroaki Yamagiwa.
The code for TICA was created by Yusuke Takase.
See README.Appendix.md for the experiments in the Appendix.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
images_camera_ready		images_camera_ready
scripts		scripts
word-embeddings-benchmarks		word-embeddings-benchmarks
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE-web		LICENSE-web
LICENSE-wordtour		LICENSE-wordtour
README.Appendix.md		README.Appendix.md
README.md		README.md
eval_avg_d_I_and_avg_c_I.py		eval_avg_d_I_and_avg_c_I.py
eval_avg_d_I_and_avg_c_I_tica.py		eval_avg_d_I_and_avg_c_I_tica.py
eval_continuity_by_OpenAI_API.py		eval_continuity_by_OpenAI_API.py
eval_dimred_main.py		eval_dimred_main.py
eval_dimred_polar_glove.py		eval_dimred_polar_glove.py
eval_dimred_projection.py		eval_dimred_projection.py
eval_dimred_tica.py		eval_dimred_tica.py
eval_dimred_topk.py		eval_dimred_topk.py
make_3d_figure.py		make_3d_figure.py
make_LKH_file.cpp		make_LKH_file.cpp
make_comparing_k.py		make_comparing_k.py
make_cossim_histogram.py		make_cossim_histogram.py
make_cossim_histogram_and_scatterplot_tica.py		make_cossim_histogram_and_scatterplot_tica.py
make_dimred_figure.py		make_dimred_figure.py
make_higher_order_histogram.py		make_higher_order_histogram.py
make_relation_skewness_and_two_cossims.py		make_relation_skewness_and_two_cossims.py
make_scatterplots.py		make_scatterplots.py
make_scatterplots_tica.py		make_scatterplots_tica.py
makefile		makefile
numpy_tica.py		numpy_tica.py
polar_glove.ipynb		polar_glove.ipynb
requirements.txt		requirements.txt
ruff.toml		ruff.toml
save_axistour_embeddings.py		save_axistour_embeddings.py
save_axistour_top_words.py		save_axistour_top_words.py
save_pca_ica_embeddings.py		save_pca_ica_embeddings.py
save_raw_embeddings.py		save_raw_embeddings.py
save_tica_embeddings.py		save_tica_embeddings.py
tica.py		tica.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

Axis-Tour

Setup

Docker build

Environment variables

Docker run

Preliminary

Access to the embeddings used in our paper

Raw embeddings

PCA-transformed and ICA-transformed embeddings

Axis Tour embedidngs

POLAR-applied GloVe embeddings (in necessary) [4]

Download GloVe for reproducibility experiments

LKH solver for TSP

word-embeddings-benchmarks

Code

Save embeddings for reproducibility experiments

PCA and ICA

Axis Tour

Scatterplots

Save top words for Axis Tour embeddings

Histogram of cosine similarities

Quantitative evaluation of semantic continuity by GPT models

Dimensionality reduction

Calculate POLAR-applied embeddings (if necessary)

Evaluation

Relation between skewness and the average of two cosine similarities

Reference

Citation

Note

About

Licenses found

Releases

Packages

Languages

License

Licenses found

ymgw55/Axis-Tour

Folders and files

Latest commit

History

Repository files navigation

Axis-Tour

Setup

Docker build

Environment variables

Docker run

Preliminary

Access to the embeddings used in our paper

Raw embeddings

PCA-transformed and ICA-transformed embeddings

Axis Tour embedidngs

POLAR-applied GloVe embeddings (in necessary) [4]

Download GloVe for reproducibility experiments

LKH solver for TSP

word-embeddings-benchmarks

Code

Save embeddings for reproducibility experiments

PCA and ICA

Axis Tour

Scatterplots

Save top words for Axis Tour embeddings

Histogram of cosine similarities

Quantitative evaluation of semantic continuity by GPT models

Dimensionality reduction

Calculate POLAR-applied embeddings (if necessary)

Evaluation

Relation between skewness and the average of two cosine similarities

Reference

Citation

Note

About

Topics

Resources

License

Licenses found

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages