This is the official code base for the paper Masked Trajectory Models for Prediction, Representation, and Control. Check out our website!
If you find our work useful, consider citing:
@inproceedings{wu2023mtm,
author = {Wu, Philipp and Majumdar, Arjun and Stone, Kevin and Lin, Yixin and Mordatch, Igor and Abbeel, Pieter and Rajeswaran, Aravind},
title = {Masked Trajectory Models for Prediction, Representation, and Control},
booktitle = {International Conference on Machine Learning},
year = {2023},
}
If you want to make an environment from scratch:
Make a new conda env:
conda create -n mtm python=3.10
conda activate mtm
Install PyTorch with GPU support (see https://pytorch.org/get-started/locally/), then run these commands to install all dependencies:
conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch
pip install -e .
Optionally install dev packages.
pip install -r requirements_dev.txt
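After installing, a quick sanity check is to confirm that PyTorch can see the GPU (assuming a CUDA-capable machine):

```python
# Minimal sanity check that the environment is set up.
import torch

print(torch.__version__)          # 1.12.1 if installed as above
print(torch.cuda.is_available())  # True if the GPU build is working
```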
Please refer to the Adroit Instructions for preparing the datasets and environment.
Example commands can be found in train_examples.sh
An example notebook, example_train_sinusoid.ipynb, shows a simple example of how MTM can be used for trajectory prediction on a sinusoid dataset.
The main code is located in the research/mtm folder. Here is how you can run some of the experiments:
- Simple sinusoidal test data: `python research/mtm/train.py +exp_mtm=sinusoid_cont`
- D4RL: `python research/mtm/train.py +exp_mtm=d4rl_cont`
- Adroit (requires the Adroit setup described above): `python research/mtm/train.py +exp_mtm=adroit_cont`
- The config file for MTM is located at `research/mtm/config.yaml`
- Some key parameters:
  - `traj_length`: The length of trajectory sub-segments.
  - `mask_ratios`: A list of mask ratios that are randomly sampled from.
  - `mask_patterns`: A list of masking patterns that are randomly sampled. See `MaskType` in `research/mtm/masks.py` for the supported options.
  - `mode_weights`: (Only applies for `AUTO_MASK`.) A list of weights that determines which mode is sampled to be the "autoregressive" one. For example, if the mode order is `states`, `returns`, `actions` and `mode_weights = [0.2, 0.1, 0.7]`, then with 0.7 probability the action token and all future tokens will be masked out.
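As a rough illustration of how `mode_weights` interacts with `AUTO_MASK`, the sketch below samples an "autoregressive" mode and builds per-mode visibility masks. The function name, the split-point sampling, and the mask layout are assumptions for illustration only; see `research/mtm/masks.py` for the actual implementation.

```python
# Hypothetical sketch of AUTO_MASK-style masking driven by mode_weights.
# This is NOT the repo's implementation; see research/mtm/masks.py for MaskType.
import numpy as np

def sample_auto_mask(traj_length, modes=("states", "returns", "actions"),
                     mode_weights=(0.2, 0.1, 0.7), rng=np.random):
    """Pick the 'autoregressive' mode, then hide that token and everything after it."""
    mode_idx = rng.choice(len(modes), p=mode_weights)  # e.g. 'actions' with p=0.7
    split = rng.randint(traj_length)                   # time step where prediction starts
    masks = {}
    for i, mode in enumerate(modes):
        visible = np.ones(traj_length, dtype=bool)     # True = token is visible
        visible[split + 1:] = False                    # mask all future time steps
        if i >= mode_idx:                              # also mask the chosen mode (and later
            visible[split] = False                     # modes) at the split step itself
        masks[mode] = visible
    return masks

print(sample_auto_mask(traj_length=4))
```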
Pre-commit hooks are great. They will automatically do some checking/formatting. To use the pre-commit hooks, run the following:
pip install pre-commit
pre-commit install
If you want to make a commit without running the pre-commit hooks, you can commit with the -n flag (i.e., `git commit -n ...`).
- All dataset code is located in the `research/mtm/datasets` folder. All a dataset has to do is return a PyTorch dataset that outputs a dict (a named set of trajectories).
- A dataset should follow the `DatasetProtocol` specified in `research/mtm/datasets/base.py`.
- Each dataset should also have a corresponding `get_datasets` function where all the dataset-specific construction logic happens. This function can take anything as input (as specified in the corresponding yaml config) and outputs the train and val torch `Dataset`s.
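For reference, here is a minimal, hypothetical dataset following this pattern. The actual `DatasetProtocol` lives in `research/mtm/datasets/base.py`; the mode names and `get_datasets` arguments below are illustrative assumptions, not the repo's API.

```python
# A toy dataset of random trajectories, returned as a dict of named modes.
# Field names ("states", "actions") and get_datasets arguments are assumptions.
import numpy as np
from torch.utils.data import Dataset

class RandomTrajectoryDataset(Dataset):
    def __init__(self, num_trajs: int, traj_length: int, obs_dim: int, act_dim: int):
        self.states = np.random.randn(num_trajs, traj_length, obs_dim).astype(np.float32)
        self.actions = np.random.randn(num_trajs, traj_length, act_dim).astype(np.float32)

    def __len__(self) -> int:
        return len(self.states)

    def __getitem__(self, idx: int) -> dict:
        # A named set of trajectory modes for one sub-segment.
        return {"states": self.states[idx], "actions": self.actions[idx]}

def get_datasets(num_trajs: int = 1000, traj_length: int = 32,
                 obs_dim: int = 11, act_dim: int = 3):
    """Dataset-specific construction logic; arguments would come from the yaml config."""
    train = RandomTrajectoryDataset(num_trajs, traj_length, obs_dim, act_dim)
    val = RandomTrajectoryDataset(max(num_trajs // 10, 1), traj_length, obs_dim, act_dim)
    return train, val
```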
- All tokenizer code is found in the `research/mtm/tokenizers` folder.
- Each tokenizer should inherit from the `Tokenizer` abstract class, found in `research/mtm/tokenizers/base.py`.
- Tokenizers must define a `create` method, which can handle dataset-specific construction logic.
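Below is a minimal sketch of what such a tokenizer could look like, assuming an encode/decode-style interface; the real abstract class and its required methods are defined in `research/mtm/tokenizers/base.py`.

```python
# Hypothetical identity tokenizer; in the repo it would inherit from the
# Tokenizer abstract class in research/mtm/tokenizers/base.py rather than nn.Module.
import torch
import torch.nn as nn

class IdentityTokenizer(nn.Module):
    """Passes continuous features through unchanged."""

    @classmethod
    def create(cls, key: str, train_dataset, **kwargs) -> "IdentityTokenizer":
        # Dataset-specific construction logic (e.g. computing normalization
        # statistics from train_dataset) would go here.
        return cls()

    def encode(self, trajectory: torch.Tensor) -> torch.Tensor:
        return trajectory

    def decode(self, embeddings: torch.Tensor) -> torch.Tensor:
        return embeddings
```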
This source code is licensed under the MIT license found in the LICENSE file in the root directory of this source tree. This is not an official Meta product.
This project builds on top of or utilizes the following third-party dependencies:
- FangchenLiu/MaskDP_public: Masked Decision Prediction, which this work builds upon
- ikostrikov/jaxrl: A fast Jax library for RL. We used its environment wrapping and data loading code for all D4RL experiments.
- denisyarats/exorl: ExORL provides datasets collected with unsupervised RL methods, which we use in the representation learning experiments.
- vikashplus/robohive: Provides the Adroit environment
- aravindr93/mjrl: Code for training the policy for generating data on Adroit
- brentyi/tyro: Argument parsing and configuration