posted an update

Technical Update: Exploring UnifoLM-WMA-0 for G1 Alignment Testing

Background

While investigating robotics world models with extractable reasoning traces, I came across UnifoLM-WMA-0, which is Unitree’s open-source world model architecture for the G1 humanoid robot.

What is UnifoLM-WMA-0

UnifoLM-WMA-0 is a world-model-action framework that operates in two modes: 1. Decision-Making Mode: Predicts future physical interactions between the robot and environment. These predictions are then used to inform the action policy. 2. Simulation Mode: Generates environmental feedback based on robot actions, functioning as an interactive simulator. The architecture appears to separate world model predictions from action generation, which could allow for extracting intermediate reasoning states.

Potential Relevance to G1 Alignment Research

Since our current work tests LLM-based control under resource pressure scenarios, integrating UnifoLM-WMA-0 could provide: ∙ A comparison point between world-model-based and LLM-based decision-making under constraints ∙ Access to the model’s physical interaction predictions, which may reveal different failure modes than language-based reasoning ∙ A manufacturer-provided baseline for the G1 platform we’re already using Resources ∙ Code: https://github.com/unitreerobotics/unifolm-world-model-action ∙ Model weights: https://huggingface.co/unitreerobotics/UnifoLM-WMA-0-Base ∙ Released: September 15, 2025

Next Steps Under Consideration

If this direction seems worthwhile, the next phase would involve: 1. Setting up the model with our existing MuJoCo G1 simulation 2. Attempting to extract world model prediction traces 3. Running preliminary tests under the same battery pressure scenarios used for LLM testing 4. Evaluating whether the prediction traces provide meaningful insights into alignment behavior

This is exploratory at this stage. Feedback welcome if anyone has experience with this model or similar world-model architectures.

Log in or sign up for Devpost to join the conversation.