ThrowBot

Learning to Throw With a Handful of Samples Using Decision Transformers

Paper | Video

Throwing objects by a robot extends its reach and has many industrial applications. While analytical models can provide efficient performance, they require accurate estimation of system parameters. Reinforcement Learning (RL) algorithms can provide an accurate throwing policy without prior knowledge. However, they require an extensive amount of real world samples which may be time consuming and, most importantly, pose danger. Training in simulation, on the other hand, would most likely result in poor performance on the real robot. In this letter, we explore the use of Decision Transformers (DT) and their ability to transfer from a simulation-based policy into the real-world. Contrary to RL, we re-frame the problem as sequence modelling and train a DT by supervised learning. The DT is trained off-line on data collected from a far-from-reality simulation through random actions without any prior knowledge on how to throw. Then, the DT is fine-tuned on an handful ( ∼5 ) of real throws. Results on various objects show accurate throws reaching an error of approximately 4 cm. Also, the DT can extrapolate and accurately throw to goals that are out-of-distribution to the training data. We additionally show that few expert throw samples, and no pre-training in simulation, are sufficient for training an accurate policy.

Installation

ROS Dependencies

Motoman fixed motoman point-streaming: https://github.com/MaxorPaxor/motoman_ps

Motoman original repo: https://github.com/ros-industrial/motoman/tree/kinetic-devel
Robotiq gripper - 2f_140_gripper: https://github.com/ros-industrial/robotiq

Note

Two URDF files must be changed for the gripper to work:
- robotiq_arg2f_transmission.xacro
- robotiq_arg2f_140_model_macro.xacro
must be replaced with the files located in robotiq_2f_140_gripper_changed_urdf folder.
Mimic joints plugin: https://github.com/roboticsgroup/roboticsgroup_upatras_gazebo_plugins
The-Gazebo-grasp-fix-plugin: The gripper has problems with picking objects. The problems are fixed with Gazebo grasping plugin https://github.com/JenniferBuehler/gazebo-pkgs/wiki/The-Gazebo-grasp-fix-plugin

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
config		config
gif		gif
launch		launch
robotiq_2f_140_gripper_changed_urdf		robotiq_2f_140_gripper_changed_urdf
src/scripts		src/scripts
urdf		urdf
world		world
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md
package.xml		package.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ThrowBot

Learning to Throw With a Handful of Samples Using Decision Transformers

Paper | Video

Installation

ROS Dependencies

About

Releases

Packages

Languages

MaxorPaxor/ThrowBot

Folders and files

Latest commit

History

Repository files navigation

ThrowBot

Learning to Throw With a Handful of Samples Using Decision Transformers

Paper | Video

Installation

ROS Dependencies

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages