- Hardware and Materials used in Demo:
- 6-DoF mobile robot arm with wrist-mounted F/T sensor.
- High-resolution RGB-D camera on a tripod.
- Three common household liquids: Peanut Oil, Soy Sauce, and Whiskey.
To install necessary dependencies, run:
git clone [email protected]:laiwenq/VLM_liquid_perception.git
pip install -r requiremetns.txt
We use OpenAI's gpt-4-vision-preview
as the backbone LVLM API. Feel free to change it to your own models.
We have put all codes needed for the evaluation in a Jupyter Notebook. We also provide the full prompts and evaluation data for you to have a try. Feel free to replace the actions with real robotic actions for an online evaluation!