Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation

Repository providing the source code for the paper

Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation Sai Prasanna, Daniel Honerkamp* Kshitij Sirohi*, Tim Welschehold, Wolfram Burgard and Abhinav Valada

Please cite the paper as follows:

@article{prasanna2024perception,
  title={Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation},
  author={Sai Prasanna and Daniel Honerkamp and Kshitij Sirohi and Tim Welschehold and Wolfram Burgard and Abhinav Valada},
  journal={Proceedings of the International Symposium on Robotics Research (ISRR)},
  year={2024}
}

Setup

You have to obtain the API user name and token for hm3d dataset from matterport by following their instructions. Set these as environment variables export USERNAME=<API_TOKEN_USER_ID> export PASSWORD=<API_TOKEN>.
Run the setup.sh to create the conda environment.
Download the EMSANet checkpoint from https://drive.google.com/uc?id=1LD4_g-jL4KJPRUmCGgXxx2xGQ7TNZ_o2 and extract it tar -xvf checkpoint.tar.gz -C ./third_party/trained_models/

Evaluating aggregation approaches with the Shortest path policy

To evaluate the aggregation approaches with the shortest path policy, run

./scripts/eval_sp_policy_emsanet.sh
./scripts/eval_sp_policy_maskrcnn.sh
./scripts/eval_sp_policy_segformer.sh

Training and evaluating RL Policy

To train the RL policy on ground truth semantics and evaluate it with different semantic models and aggregation approaches, run

./scripts/train_rl_policy.sh
./scripts/eval_rl_policy_emsanet.sh
./scripts/eval_rl_policy_maskrcnn.sh
./scripts/eval_rl_policy_segformer.sh

Misc

Calibrating the perception model

Collect the data for calibrating the perception model. Run

python -m sem_objnav.obj_nav.collect_seg_data --output_dir calibation_dataset

Check the notebooks sem_objnav/notebooks/emsanet_scaling_temp.ipynb and sem_objnav/notebooks/segformer_scaling_temp.ipynb for calibation.

Stubborn

To collect data and train the models used in stubborn, run ./scripts/train_stubborn.sh.

Hyperparameter optimization

To find optimal hyperparameters for the aggregation strategies, run ./scripts/htune.sh.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.vscode		.vscode
best_hparams		best_hparams
experiments/sp_policy		experiments/sp_policy
notebooks		notebooks
scripts		scripts
sem_objnav		sem_objnav
third_party		third_party
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
setup.py		setup.py
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation

Setup

Evaluating aggregation approaches with the Shortest path policy

Training and evaluating RL Policy

Misc

Calibrating the perception model

Stubborn

Hyperparameter optimization

About

Releases

Packages

Contributors 2

Languages

License

robot-learning-freiburg/Semantic-Search

Folders and files

Latest commit

History

Repository files navigation

Perception Matters: Enhancing Embodied AI with Uncertainty-Aware Semantic Segmentation

Setup

Evaluating aggregation approaches with the Shortest path policy

Training and evaluating RL Policy

Misc

Calibrating the perception model

Stubborn

Hyperparameter optimization

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages