Towards the Era of Post-Training for Autonomous Driving

The missing infrastructure for Physical AI post-training in AD. Open-source. Production-validated.

Highlights

  • WorldEngine is a post-training framework for Physical AI that systematically addresses the long-tail safety-critical data scarcity problem in autonomous driving.
  • Data-driven long-tail discovery: Failure-prone scenarios are automatically identified from real-world driving logs by the pre-trained agent itself — no manual design, no synthetic perturbations.
  • Photorealistic interactive simulation via 3D Gaussian Splatting (3DGS): Each discovered scenario is reconstructed into a fully controllable, real-time-renderable simulation environment with independent dynamic agent manipulation.
  • Behavior-driven scenario generation: Leverages Behavior World Model (BWM) to generalize and synthesize diverse traffic variations from existing long-tail scenarios, expanding sparse safety-critical events into a dense, learnable distribution.
  • RL-based post-training on synthesized safety-critical rollouts is substantially more effective than scaling pre-training data alone: its gains are competitive with a ~10× increase in pre-training data.
  • Production-scale validation: Deployed on a mass-produced ADAS platform trained on 80,000+ hours of real-world driving logs, reducing simulated collision rate by up to 45.5% and achieving zero disengagements in a 200 km on-road test.

Benchmark

We compare different post-training paradigms on the nuPlan dataset, evaluating on both open-loop and closed-loop metrics across common and rare driving scenarios.

Metric notes (early stage; stable checkpoints and corresponding results are coming soon):

  • Open-loop PDMS is aligned with NAVSIM v1.1 PDM Score. Common denotes the standard navtest split; Rare denotes the navtest_failures subset — failure-prone rare-case scenarios extracted from navtest.
  • Closed-loop Success Rate is defined as the fraction of simulated driving episodes completed without collision or off-road failure.
  • Closed-loop Ego Progress (EP) measures the route progress made by the ego vehicle during SimEngine closed-loop testing, reflecting whether the agent makes meaningful forward progress rather than merely avoiding collision or off-road failure.
  • Closed-loop PDMS* is the PDM Score obtained via SimEngine closed-loop testing, where the planner interacts with reactive agents in simulation under real-time rendering.
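
The closed-loop metrics above can be illustrated with a minimal sketch. The `Episode` record and its field names are hypothetical and are not SimEngine's actual output format. The `pdm_score` helper follows the published NAVSIM v1 aggregation as we understand it (no-collision and drivable-area terms multiplied by a weighted average of ego progress, time-to-collision, and comfort); treat it as an approximation rather than the evaluation code used here.

```python
from dataclasses import dataclass


@dataclass
class Episode:
    """Outcome of one simulated episode (hypothetical record layout)."""
    collided: bool
    off_road: bool
    ego_progress: float  # route progress in [0, 1]


def success_rate(episodes):
    """Fraction of episodes completed without collision or off-road failure."""
    if not episodes:
        raise ValueError("need at least one episode")
    ok = sum(1 for e in episodes if not (e.collided or e.off_road))
    return ok / len(episodes)


def mean_ego_progress(episodes):
    """Average route progress over all episodes."""
    if not episodes:
        raise ValueError("need at least one episode")
    return sum(e.ego_progress for e in episodes) / len(episodes)


def pdm_score(nc, dac, ep, ttc, comfort):
    """NAVSIM v1-style aggregation (assumed): penalties times a weighted average."""
    return nc * dac * (5 * ep + 5 * ttc + 2 * comfort) / 12


# Made-up episodes for illustration only.
episodes = [
    Episode(collided=False, off_road=False, ego_progress=0.9),
    Episode(collided=True, off_road=False, ego_progress=0.4),
    Episode(collided=False, off_road=True, ego_progress=0.2),
    Episode(collided=False, off_road=False, ego_progress=0.7),
]
print(success_rate(episodes))  # 0.5
```

Two of the four made-up episodes fail, so the success rate is 0.5 even though every episode makes some forward progress. This is why SR and EP are reported separately.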

Training notes:

  • Rare logs are failure-prone scenarios automatically extracted from navtrain by the pre-trained agent itself (see Rare Case Extraction).
  • Common logs are the standard cases in navtrain.

| Method | Open-loop PDMS ↑ (common) | Open-loop PDMS ↑ (rare) | Closed-loop SR ↑ (rare) | Closed-loop EP ↑ (rare) | Closed-loop PDMS* ↑ (rare) |
| --- | --- | --- | --- | --- | --- |
| Base model | 85.64 | 47.14 | 73.66 | 46.71 | 60.98 |
| Supervised fine-tuning on rare logs | 87.50 | 52.55 | 74.51 | 47.59 | 61.87 |
| Post-training on common logs | 87.69 | 49.36 | 69.63 | 51.02 | 60.21 |
| Post-training on rare logs | 88.51 | 59.20 | 73.35 | 51.86 | 62.78 |
| Post-training on rare synthetic replays | 82.61 | 62.69 | 87.20 | 32.49 | 63.22 |
| Post-training on rare rollouts w/o Behaviour WM | 88.53 | 61.88 | 77.96 | 56.74 | 67.33 |
| Post-training with WorldEngine | 88.95 | 59.83 | 88.89 | 47.66 | 70.12 |

Key findings:

  • Post-training on rare logs substantially improves rare open-loop PDMS over supervised fine-tuning (59.20 vs. 52.55), but does not improve rare closed-loop SR, indicating that fixed rare logs alone are insufficient for robust interactive behaviour.
  • Post-training on common logs provides limited long-tail benefit and degrades rare closed-loop performance, reducing SR from 73.66% to 69.63% and PDMS* from 60.98 to 60.21, confirming the importance of long-tail event discovery.
  • The full WorldEngine pipeline achieves the best overall rare closed-loop performance, with the highest SR (88.89%) and PDMS* (70.12). It improves rare closed-loop SR by +15.23 percentage points and PDMS* by +9.14 over the base model, while maintaining strong common open-loop performance.
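
The deltas quoted in these findings can be re-derived from the benchmark table. The snippet below is only an arithmetic check; the dictionary layout and the `delta_vs_base` helper are ours, and the numbers are copied from the rare-split closed-loop columns.

```python
# Rare-split closed-loop results copied from the benchmark table.
results = {
    "Base model": {"sr": 73.66, "pdms_star": 60.98},
    "Post-training on common logs": {"sr": 69.63, "pdms_star": 60.21},
    "Post-training with WorldEngine": {"sr": 88.89, "pdms_star": 70.12},
}


def delta_vs_base(method, metric):
    """Difference between a method and the base model, rounded to 2 decimals."""
    return round(results[method][metric] - results["Base model"][metric], 2)


for method in ("Post-training on common logs", "Post-training with WorldEngine"):
    print(method, delta_vs_base(method, "sr"), delta_vs_base(method, "pdms_star"))
```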

Qualitative Results — Closed-Loop Simulation on nuPlan

Each pair shows the Base model vs WorldEngine post-trained model on the same rare-case scenario. Left: front-camera rendering; Right: BEV trajectory visualization.

On-Road Deployment — Night Urban Driving

Zero disengagements in 200 km on-road testing on a mass-produced ADAS platform.

System Architecture

WorldEngine consists of two tightly coupled subsystems:

| Module | Function | Core Technology |
| --- | --- | --- |
| SimEngine | Closed-loop simulation with ego & agents | Hydra, Ray, rendering |
| AlgEngine | End-to-end model training & evaluation | MMDetection3D, UniAD/VADv2/HydraMDP |

Roadmap

  • Core platform integration (SimEngine + AlgEngine)
  • Multi-GPU distributed simulation and training
  • Rare case extraction and fine-tuning pipeline
  • Comprehensive documentation and usage guides
  • Hugging Face / ModelScope dataset
  • Open-source release (code, data, early pre-trained models)
  • arXiv preprint
  • Behavior World Model integration
  • Stable pre-trained models

Getting Started

Documentation Overview

WorldEngine provides comprehensive guides for each stage of your workflow:

| Guide | Purpose | Key Topics |
| --- | --- | --- |
| Installation | Set up both conda environments | Two-environment setup (simengine + algengine), dependencies, troubleshooting |
| Data Organization | Prepare datasets and checkpoints | Data structure, Hugging Face/ModelScope downloads, symlinks |
| Quick Start | Run your first experiment in 5 min | Quick test tutorial, understanding results, complete pipeline |
| SimEngine Usage | Master closed-loop simulation | Rollout scripts, distributed testing, configuration, metrics |
| AlgEngine Usage | Train and fine-tune models | Training from scratch, evaluation, rare case extraction, RL fine-tuning |

Installation

WorldEngine requires two separate conda environments due to different Python requirements.

Full installation guide: docs/installation.md

Quick Test

Verify your installation with a pre-trained model:

    # Set up environment variable
    export WORLDENGINE_ROOT=$(pwd)

    # Option 1: Single GPU test
    bash scripts/closed_loop_test.sh

    # Option 2: Multi-GPU test (default: 8 GPUs)
    bash scripts/multigpu_closed_loop_test.sh

What this does:

  • Loads a pre-trained VADv2 model (50% training data, epoch 8)
  • Runs closed-loop simulation on 288 rare-case test scenarios
  • Evaluates with the NAVSIM v1 PDMS (collision avoidance, progress, comfort, etc.)
  • Saves results to experiments/closed_loop_exps/e2e_vadv2_50pct/navtest_failures_NR/
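
After the quick test finishes, one way to confirm that it produced output is to list whatever files ended up in the results directory. The sketch below assumes only the directory path given above and the `WORLDENGINE_ROOT` variable from the quick test. The helper name `list_result_files` is ours, and nothing is assumed about the names or format of the files the scripts write.

```python
import os
from pathlib import Path

# Results directory from the quick test, relative to the repository root.
RESULTS_SUBDIR = "experiments/closed_loop_exps/e2e_vadv2_50pct/navtest_failures_NR"


def list_result_files(root=None):
    """Return the sorted relative paths of all files under the results directory.

    `root` defaults to $WORLDENGINE_ROOT, or the current directory if unset.
    Returns an empty list when the results directory does not exist yet.
    """
    root = Path(root or os.environ.get("WORLDENGINE_ROOT", "."))
    results_dir = root / RESULTS_SUBDIR
    if not results_dir.is_dir():
        return []
    return sorted(
        p.relative_to(results_dir).as_posix()
        for p in results_dir.rglob("*")
        if p.is_file()
    )


if __name__ == "__main__":
    for name in list_result_files():
        print(name)
```

An empty listing usually means the test has not run to completion or `WORLDENGINE_ROOT` points somewhere else.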

Detailed quick start tutorial: docs/quick_start.md

Deep Dive by Module

After the quick test, explore each subsystem in detail:

SimEngine - Photorealistic Closed-Loop Simulation

Learn how to run simulations, generate rollouts, and test models:

  • Rollout scripts for data generation (no model required)
  • Testing scripts for model evaluation (single/multi-GPU)
  • Ray distributed simulation for large-scale testing
  • Reactive vs non-reactive agent modes
  • Configuration guide for all Hydra parameters

SimEngine Usage Guide

AlgEngine - End-to-End Model Training & Fine-Tuning

Learn how to train models, extract rare cases, and fine-tune:

  • Training from scratch
  • Open-loop evaluation on test sets
  • Rare case extraction from evaluation failures
  • RL-based fine-tuning on long-tail scenarios
  • Multi-GPU training with distributed data parallel
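
The idea behind rare case extraction from evaluation failures can be sketched as a filter over per-scenario scores. This is an illustration under stated assumptions, not AlgEngine's implementation: the scenario tokens, the score scale in [0, 1], and the 0.5 threshold are all hypothetical, and the selection criterion actually used is described in the AlgEngine guide.

```python
def extract_rare_cases(scores, threshold=0.5):
    """Return tokens of scenarios scoring below `threshold`, worst first.

    `scores` maps a scenario token to the agent's open-loop score on it.
    Both the mapping layout and the threshold are illustrative assumptions.
    """
    failures = [(score, token) for token, score in scores.items() if score < threshold]
    return [token for _, token in sorted(failures)]


# Made-up evaluation results for four scenarios.
scores = {"scene_a": 0.92, "scene_b": 0.0, "scene_c": 0.41, "scene_d": 0.77}
print(extract_rare_cases(scores))  # ['scene_b', 'scene_c']
```

Sorting worst-first makes it easy to cap the extracted set at a fixed budget of the most failure-prone scenarios.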

AlgEngine Usage Guide

Scene Reconstruction - 3D Gaussian Splatting-based method, MTGS

WorldEngine's simulation environments are powered by MTGS (Multi-Traversal Gaussian Splatting), a 3D Gaussian Splatting-based reconstruction method:

  • Multi-traversal reconstruction from nuPlan data
  • Photorealistic rendering for closed-loop simulation
  • Asset generation for SimEngine scenes

MTGS Repository

Citation

If any parts of our work help your research, please consider citing us and giving a star to our repository:

If you use the Render Assets (MTGS), please also cite:

    @article{li2025mtgs,
      title={MTGS: Multi-Traversal Gaussian Splatting},
      author={Li, Tianyu and Qiu, Yihang and Wu, Zhenhua and Lindstr{\"o}m, Carl and Su, Peng and Nie{\ss}ner, Matthias and Li, Hongyang},
      journal={arXiv preprint arXiv:2503.12552},
      year={2025}
    }

If you use the augmented scenarios data, please cite as well:

    @inproceedings{zhou2025nexus,
      title={Decoupled Diffusion Sparks Adaptive Scene Generation},
      author={Zhou, Yunsong and Ye, Naisheng and Ljungbergh, William and Li, Tianyu and Yang, Jiazhi and Yang, Zetong and Zhu, Hongzi and Petersson, Christoffer and Li, Hongyang},
      booktitle={ICCV},
      year={2025}
    }

    @article{li2025optimization,
      title={Optimization-Guided Diffusion for Interactive Scene Generation},
      author={Li, Shihao and Ye, Naisheng and Li, Tianyu and Chitta, Kashyap and An, Tuo and Su, Peng and Wang, Boyang and Liu, Haiou and Lv, Chen and Li, Hongyang},
      journal={arXiv preprint arXiv:2512.07661},
      year={2025}
    }

If you find AlgEngine useful, please also cite:

    @ARTICLE{11353028,
      author={Liu, Haochen and Li, Tianyu and Yang, Haohan and Chen, Li and Wang, Caojun and Guo, Ke and Tian, Haochen and Li, Hongchen and Li, Hongyang and Lv, Chen},
      journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
      title={Reinforced Refinement With Self-Aware Expansion for End-to-End Autonomous Driving},
      year={2026},
      volume={48},
      number={5},
      pages={5774-5792},
      keywords={Adaptation models;Self-aware;Autonomous vehicles;Pipelines;Planning;Training;Reinforcement learning;Uncertainty;Data models;Safety;End-to-end autonomous driving;reinforced finetuning;imitation learning;motion planning},
      doi={10.1109/TPAMI.2026.3653866}
    }

If you find the data-scaling results helpful, please also cite:

    @article{tian2025simscale,
      title={SimScale: Learning to Drive via Real-World Simulation at Scale},
      author={Haochen Tian and Tianyu Li and Haochen Liu and Jiazhi Yang and Yihang Qiu and Guang Li and Junli Wang and Yinfeng Gao and Zhang Zhang and Liang Wang and Hangjun Ye and Tieniu Tan and Long Chen and Hongyang Li},
      journal={arXiv preprint arXiv:2511.23369},
      year={2025}
    }

Contributing

We welcome contributions from the community. You can:

  • Report bugs - Open an Issue
  • Improve documentation - Submit a Pull Request
  • Contribute code - Fork, develop, and submit a PR

Please read our contributing guidelines before submitting PRs.

For questions:

  1. Check the documentation first
  2. Search existing Issues

License

All content in this repository is under the Apache-2.0 license.

The released data is based on nuPlan and is under the CC-BY-NC-SA 4.0 license.

Related Resources

We thank the open-source contributors of the following projects, which made this work possible:

| Project | Description |
| --- | --- |
| MTGS | Multi-traversal Gaussian Splatting for scene reconstruction |
| SimScale | Large-scale driving simulation |
| nerfstudio | Collaboration-friendly NeRF toolkit |
| MMDetection3D | 3D detection framework |
| UniAD | End-to-end autonomous driving framework |
| VADv2 | Vectorized autonomous driving framework |
| NAVSIM | Non-reactive autonomous vehicle simulation benchmark |
| nuPlan | Large-scale autonomous driving dataset |
| MetaDrive | Compositional driving simulation platform |
| Ray | Distributed execution framework |
| Hydra | Configuration management framework |

If you find WorldEngine useful, please consider giving us a star!

Quick Links: Documentation | Installation | Quick Start | Issues | Discussions

Contact: For research collaboration or questions, visit our Discussions
