Artificial Intelligence Researcher

上海未来不远机器人科技有限公司
Chicago, US
On-site

Job Description

Responsibilities

  • Track, reproduce, and evaluate the latest open-source World Models (e.g., DreamZero, LingBot-VA, UniSim, Cosmos); analyze the strengths and weaknesses of different World-Action Model architectures and produce technical selection reports
  • Fine-tune open-source World Model pre-trained weights to adapt to the team's proprietary robot platform and manipulation task scenarios, enabling the World Model to accurately predict future video frames and physical interaction outcomes of robot operations
  • Integrate fine-tuned World Models into the VLA Pipeline: explore practical applications such as World Model as Data Augmentation (generating synthetic training data) and Action Scoring (scoring and filtering candidate action sequences)
  • Participate in teleoperation data collection to accumulate high-quality robot manipulation video data for World Model training
  • Leverage simulation platforms such as Isaac Sim to generate large-scale, diverse synthetic video data to supplement real-world data
  • Build a World Model Evaluation Pipeline covering Prediction Quality metrics (FVD / SSIM / LPIPS) and performance gains on downstream VLA task success rates
  • Continuously track frontier advances in World Models (DreamZero WAM paradigm, LingBot-VA MoT architecture, Cosmos physics reasoning, etc.) and share findings with the team regularly

Requirements

  • Master's degree or above in Computer Science, Artificial Intelligence, or related fields
  • Deep understanding of Video Generation / Prediction; proficiency in at least one of: Diffusion Models, Autoregressive Models, or Flow Matching
  • Proficient in PyTorch with end-to-end project experience in video generation or visual model training / fine-tuning
  • Strong ability to read research papers and reproduce open-source code; capable of quickly running a new World Model and benchmarking its performance
  • Familiarity with fundamental Robotics concepts (State Space, Action Space, Reward); willingness to dive deep into Embodied AI
  • Solid mathematical foundation (probability theory, optimization, dynamical systems)
  • Highly self-motivated with the ability to maintain strong technical sensitivity in the rapidly evolving World Model landscape

Bonus Points

★ Publications related to Video Generation / Prediction at NeurIPS, ICML, ICLR, CVPR, or equivalent venues ★ Familiarity with Model-Based RL (MBRL) or Model Predictive Control (MPC) ★ Experience with large-scale distributed training (multi-node multi-GPU, DeepSpeed, FSDP) ★ Experience generating training data in simulation environments such as Isaac Sim / Habitat / AI2-THOR ★ Hands-on experience with video generation models such as Sora / SVD / CogVideo / Cosmos ★ Ability to independently design a joint training framework for World Model and VLA Policy, or propose an Imagination-Based Planning solution

Skills & Requirements

Technical Skills

PytorchDiffusion modelsAutoregressive modelsFlow matchingIsaac simLeadershipCommunicationRoboticsAiComputer science

Employment Type

FULL TIME

Level

senior

Posted

5/7/2026

Continue to LinkedIn

You will be redirected to the job posting on LinkedIn.

Sign in and we'll score your resume against this role.

Find Similar Jobs

Browse roles in the same category, level, and remote setup.

Sign in to open the target role workbench.