Locomotion code for the Upkie wheeled biped.
Test it straight from the command line on Linux, no installation required:
git clone https://github.com/tasts-robots/upkie_locomotion.git
cd upkie_locomotion
./tools/bazelisk run -c opt //agents/blue_balancer:bullet
Connect a USB controller to move the robot around. 🎮
There is no dependency to install thanks to Bazel, which builds everything locally. (Compilation will only take a while the first time.) The syntax is the same to deploy to the Raspberry Pi on Upkie with raspunzel
.
The code is organized into spines, which communicate with the simulator or actuators using the Vulp C++ library, and agents, the main programs that implement behaviors in Python. In the example above we ran the blue agent. We could also run the Bullet spine independently:
bazel run -c opt //spines:bullet -- --show
The -c opt
argument to Bazel makes sure we compile optimized code, while the --show
argument to the spine displays the Bullet visualization.
- Blue balancer
- A baseline agent designed to check out Upkie's physical capabilities. It balances the robot using PD feedback from the head's pitch and wheel odometry to wheel velocities, plus a feedforward non-minimum phase trick for smoother transitions from standing to rolling. An analytical inverse kinematics is plugged in for crouching and standing up (crouching is controlled from D-pad of the USB controller, if one is found).
- Pink balancer
- Same as the Blue balancer, but inverse kinematics is computed by Pink rather than with a model-specific analytical solution. This is the controller that runs in the first two videos of Upkie.
- PPO balancer
- An agent trained by reinforcement learning to balance with straight legs. Training uses the
UpkieWheelsEnv
gym environment and the PPO implementation from Stable Baselines3.
UpkieWheelsEnv
- Upkie with full observation but only wheel velocity actions. This environment is single-threaded rather than vectorized. In return, it runs as-is on the real robot.
- Floor contact
- Detect contact between the wheels and the floor. Both Blue and Pink agents use contact as a reset flag for their integrators, to avoid over-spinning the wheels while the robot is in the air.
- Wheel contact
- Detect contact between a given wheel and the floor.
- Wheel odometry
- Measure the relative motion of the floating base with respect to the floor. Wheel odometry is part of their secondary task (after keeping the head straight), which is to stay around the same spot on the floor.
- Bullet
- Spawn Upkie in a Bullet simulation. Resetting this spine moves the robot back to its initial configuration in this world.
- pi3hat
- Spine is made to be called from a Raspberry Pi with an onboard mjbots pi3hat. Servos are stopped when the spine is stopped, and switch to position mode (which is a position-velocity-torque controller) when the spine idles. Check out the spine state machine for details.