An empirical demonstration of a discontinuous performance increase via learned symbol grounding
An RL agent is trained (via PPO) to avoid lava obstacles and reach a goal. The agent is then transferred to a new environment in which the locations of lava obstacles are indicated by colored arrows. A supervised translator is trained to predict lava locations from observations of the arrow environment. Performance is compared among three conditions: the frozen pretrained agent with access to the translator, the pretrained agent further trained on the new environment, and a randomly initialized agent learning the new environment from scratch.
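Conceptually, the translator is a small supervised network mapping arrow observations to predicted lava locations. The sketch below is illustrative only: it assumes flattened MiniGrid-style grid observations and a per-cell binary lava mask, and the layer sizes, names, and data format are assumptions rather than the repository's actual code.

# Minimal sketch of the translator idea (illustrative, not the repo's exact model).
import torch
import torch.nn as nn

OBS_DIM = 7 * 7 * 3   # assumed flattened MiniGrid observation size
N_CELLS = 7 * 7       # assumed number of grid cells to label as lava / not lava

class ArrowToLavaTranslator(nn.Module):
    """Maps an arrow-environment observation to per-cell lava logits."""
    def __init__(self, obs_dim=OBS_DIM, n_cells=N_CELLS):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 128),
            nn.ReLU(),
            nn.Linear(128, n_cells),  # one logit per grid cell
        )

    def forward(self, arrow_obs):
        return self.net(arrow_obs)

def train_translator(model, loader, epochs=10):
    """Supervised training over paired (arrow_obs, lava_mask) examples."""
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.BCEWithLogitsLoss()
    for _ in range(epochs):
        for arrow_obs, lava_mask in loader:
            loss = loss_fn(model(arrow_obs), lava_mask)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model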
Clone symbol-grounding anywhere you like, then install the dependencies:
pip3 install -r requirements.txt
cd gym-minigrid
pip3 install -e .
To replicate the experiment from the paper, run
python main.py
To experiment with different training durations, use the --pretrain_steps and --transfer_steps arguments:
python main.py --pretrain_steps 1000000 --transfer_steps 300000
To train the models individually (and analyze the results), use the following scripts; a sketch of how the translator's output could feed the frozen agent follows the list:
train_lava.py
train_arrows.py
transfer_arrows.py
arrow_to_lava/train_translator.py
translate_arrow_lava.py
plot_results.py
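For orientation, translate_arrow_lava.py corresponds to the condition where the frozen, lava-trained agent acts through the translator. A minimal sketch of that wiring is shown below; the policy.act interface, the observation handling, and the classic gym 4-tuple step signature are assumptions for illustration, not the repository's actual API.

# Illustrative sketch: frozen lava-trained policy evaluated in the arrow
# environment by translating each observation before the policy sees it.
import torch

def evaluate_frozen_agent_with_translator(policy, translator, env, episodes=10):
    """Return per-episode returns for the frozen policy acting via the translator."""
    returns = []
    for _ in range(episodes):
        obs = env.reset()
        done, episode_return = False, 0.0
        while not done:
            with torch.no_grad():
                arrow = torch.as_tensor(obs, dtype=torch.float32).flatten()
                lava_view = torch.sigmoid(translator(arrow))  # predicted lava occupancy
                action = policy.act(lava_view)  # hypothetical policy interface
            obs, reward, done, _ = env.step(action)
            episode_return += reward
        returns.append(episode_return)
    return returns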