Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation (VWSD)

This is the source code of the EMNLP 2023 paper Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation [paper].

Install

git clone https://github.com/anastasiakrith/multimodal-retrieval-for-vwsd.git
cd multimodal-retrieval-for-vwsd

Setting up (virtualenv)

In the project folder, run the following commands:

$ virtualenv venv to create a virtual environment
$ source venv/bin/activate to activate the environment
$ pip install -r requirements.txt to install packages
Create a .env file with the environment variables. The project needs an OPENAI_API_KEY set to the API key of your OpenAI account, and optionally a DATASET_PATH set to the absolute path of the VWSD dataset.
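
As a rough illustration of the .env step, the sketch below parses simple KEY=VALUE lines and checks that the required key is present. It uses only the standard library; the helper names load_env_file and resolve_settings are hypothetical and are not taken from this repository, which may load its configuration differently (for example through a dotenv package).

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines from a .env file into a dict.

    Hypothetical helper for illustration; blank lines, comments, and
    lines without '=' are skipped, and surrounding quotes are removed.
    """
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def resolve_settings(file_values, environ=None):
    """Merge .env values with the process environment and validate them.

    Variables already set in the environment take precedence over the file.
    OPENAI_API_KEY is required; DATASET_PATH is optional and may be None.
    """
    environ = os.environ if environ is None else environ
    merged = {**file_values, **environ}
    api_key = merged.get("OPENAI_API_KEY")
    if not api_key:
        raise RuntimeError("OPENAI_API_KEY is not set in .env or the environment")
    return {
        "OPENAI_API_KEY": api_key,
        "DATASET_PATH": merged.get("DATASET_PATH"),
    }
```

Calling resolve_settings(load_env_file()) from the project folder would then fail early with a clear message if the API key is missing, rather than partway through an evaluation run.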

Running the project

VL Retrieval

python vl_retrieval_eval.py -llm "gpt-3.5" -vl "clip" -baseline -penalty

QA Retrieval

python qa_retrieval_eval.py -llm "gpt-3.5" -captioner "git" -strategy "greedy" -prompt "no_CoT" -zero_shot

Image-to-Image Retrieval

python image_retrieval_eval.py -vl "clip" -wiki "wikipedia" -metric "cosine"
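
The -metric "cosine" option suggests that candidates are ranked by cosine similarity between embedding vectors. The following is a minimal sketch of that idea in plain Python, not the repository's implementation: the function names, the candidate file names, and the toy two-dimensional vectors are all made up for illustration, and real embeddings would come from a model such as CLIP.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(query_embedding, candidate_embeddings):
    """Return (name, score) pairs sorted from most to least similar to the query."""
    scored = [
        (name, cosine_similarity(query_embedding, vector))
        for name, vector in candidate_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy example with hypothetical candidate images and 2-D embeddings.
query = [1.0, 0.0]
candidates = {
    "image_a.jpg": [1.0, 0.0],
    "image_b.jpg": [0.0, 1.0],
    "image_c.jpg": [1.0, 1.0],
}
ranking = rank_candidates(query, candidates)
best_name, best_score = ranking[0]
```

In this toy setup the top-ranked candidate is the one whose vector points in the same direction as the query; a VWSD prediction would correspond to picking that first entry.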

Text-to-Text Retrieval

python text_retrieval_eval.py -captioner "git" -strategy "greedy" -extractor "clip" -metric "cosine"

Acknowledgement

The implementation relies on resources from the OpenAI API and Hugging Face Transformers.
