o stands for Zero-Shot Autonomous Robots.
This repo uses model APIs to create a Zero-Shot Autonomous Robot. Individual robot behaviors are wrapped in asynchronous nodes (Python) which are launched via scripts (Bash). It's kind of like a minimalist, simpler ROS. Four main types of models are used:
- LLM (Large Language Model): a `text2text` model used for planning, reasoning, dialogue, and more!
- VLM (Vision Language Model): an `image2text` model used for scene understanding, object detection, and more!
- TTS (Text-to-Speech): a `text2audio` model used for speech synthesis so the robot can talk.
- STT (Speech-to-Text): an `audio2text` model used for speech recognition so the robot can listen.
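As a rough mental model (the class and method names below are illustrative, not this repo's actual API), the four model types boil down to four signatures:

```python
from typing import Protocol


class LLM(Protocol):
    """text2text: plan, reason, and hold a dialogue."""
    def __call__(self, prompt: str) -> str: ...


class VLM(Protocol):
    """image2text: describe a scene or find objects in an image."""
    def __call__(self, image: bytes, prompt: str) -> str: ...


class TTS(Protocol):
    """text2audio: synthesize speech so the robot can talk."""
    def __call__(self, text: str) -> bytes: ...


class STT(Protocol):
    """audio2text: transcribe speech so the robot can listen."""
    def __call__(self, audio: bytes) -> str: ...
```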
To get started, follow the setup guide.
The `models` module contains code for the different model APIs. For example, `models/rep.py` is for the open-source Replicate API, and `models/gpt.py` is for the OpenAI API. More info on models.
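As a sketch of what such a wrapper might look like (not the actual contents of `models/gpt.py`; assumes the official `openai` Python package with `OPENAI_API_KEY` set in the environment):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def gpt_text2text(prompt: str, model: str = "gpt-4") -> str:
    """Send a prompt to the OpenAI chat API and return the reply text."""
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content
```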
The `robots` module contains code for different robots. For example, `robots/nex.py` is for the HiWonder AiNex Humanoid. More info on robots.
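A robot module presumably wraps the hardware behind a small interface; here is a hypothetical minimal version (method names are illustrative, not the actual `robots/nex.py` API):

```python
class Robot:
    """Hypothetical base interface a robot module might expose."""

    def move(self, action: str) -> None:
        """Execute a named motion primitive (e.g. 'walk_forward')."""
        raise NotImplementedError

    def speak(self, audio: bytes) -> None:
        """Play synthesized speech through the robot's speaker."""
        raise NotImplementedError

    def capture_image(self) -> bytes:
        """Grab a frame from the robot's camera."""
        raise NotImplementedError
```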
The `nodes` module contains code for the different nodes. For example, `nodes/look.py` contains the loop used for vision with a Vision Language Model. More info on nodes.
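A vision loop along those lines could look like this sketch (illustrative only; `robot` and `vlm` stand in for a robot interface and a VLM wrapper like the ones above):

```python
import asyncio


async def look(robot, vlm, interval: float = 1.0) -> None:
    """Periodically describe what the robot sees with a VLM."""
    while True:
        image = robot.capture_image()
        # Run the blocking API call off the event loop
        description = await asyncio.to_thread(
            vlm, image, "Describe what you see."
        )
        print(f"[look] {description}")
        await asyncio.sleep(interval)
```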
The `params` module contains code for different parameters. For example, `params/default.sh` loads environment variables (params) that contain default values. More info on params.
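Nodes can then read those values with ordinary environment-variable lookups, for example (the variable names here are hypothetical, not ones defined by `params/default.sh`):

```python
import os

# Hypothetical params; a file like params/default.sh would `export` these
ROBOT_NAME = os.environ.get("O_ROBOT_NAME", "ainex")
LOOK_INTERVAL = float(os.environ.get("O_LOOK_INTERVAL", "1.0"))
```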
If you are interested in contributing, please read the contributing guide.
```bibtex
@misc{zero-shot-robot-2023,
  title={Zero-Shot Autonomous Robots},
  author={Hugo Ponte},
  year={2023},
  url={https://github.com/hu-po/o}
}
```