Xiaomi
At Xiaomi, we believe technology’s the true power lies in its ability to understand and enhance the human experience. This year, with the upgraded Xiaomi HyperOS 2 and the newly launched Xiaomi HyperAI, we are redefining connection.
Beijing, China
🇨🇳
Follow
Visit website
AI Native
No
Number of tools
0
Profitable
Yes
Valuation
$122.20BAI
Tools
No tools yet.
Models
-
MiMo-V2-TTS is Xiaomi’s large-scale speech synthesis model built for expressive agent voice, aiming for natural, emotionally aware speech.NewAudioReleased 5d ago
-
MiMo-V2-Omni is an omni foundation model that unifies multimodal understanding with agentic capability, built to see, hear, and act.NewMultimodalReleased 5d ago
-
MiMo-V2-Pro is Xiaomi’s flagship foundation model built for real-world agent workloads, designed to act as the “brain” of agent systems that orchestrate complex workflows and tool use.NewTextReleased 5d ago
-
Xiaomi-Robotics-0 is a 4.7B-parameter open Vision-Language-Action model that uses a Mixture-of-Transformers design, combining a Qwen3-based vision-language brain with a diffusion transformer controller for smooth, real-time robot manipulation on benchmarks and real robots.NewMultimodalReleased 1mo ago
-
I cannot find public technical documentation for a distinct “MiMo v2 Flash” model beyond Xiaomi’s MiMo-7B and MiMo-VL releases, so I cannot reliably describe that specific variant without guessing.TextReleased 3mo ago
-
Pixel-Perfect Depth is a monocular depth estimation model that uses pixel-space diffusion transformers to predict high-quality, flying-pixel-free depth maps for dense point clouds, accepted at NeurIPS 2025.ImageReleased 5mo ago
Robots
-
CyberDog 2Mobile · CN · Semi-autonomous · Commercially availableA compact, biomimetic quadruped robot designed to mimic real animal movement, equipped with advanced AI, sensors, and learning capabiliti... -
CyberDogMobile · CN · Semi-autonomous · In productionA quadruped bionic robot dog developed by Xiaomi that uses AI perception, cameras, and voice interaction to follow, interact with, and as... -
CyberOneHumanoid · CN · Semi-autonomous · In developmentCyberOne is a general-purpose humanoid robot developed by Xiaomi as part of its Cyber series. It is designed for industrial applications ...
Papers
-
Learning Diverse Skills for Behavior Models with Mixture of Experts1 author
-
Utonia: Toward One Encoder for All Point CloudsThe University of Hong KongPublished on: 2026-03-03 1 author
-
LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous DrivingPublished on: 2026-03-02 1 author
-
EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language ModelsWuhan UniversityPublished on: 2026-02-27 1 author
-
MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video UnderstandingTongji UniversityPublished on: 2026-02-26 1 author
-
ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance DecodingHuazhong University of Science and TechnologyPublished on: 2026-02-26 1 author
-
UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene ModelingUniversity of Illinois Urbana-ChampaignPublished on: 2026-02-24 1 author
-
From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint DetectionWuhan UniversityPublished on: 2026-02-24 1 author
-
VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous DrivingTianjin UniversityPublished on: 2026-02-24 1 author
-
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time ExecutionXiaomi RoboticsPublished on: 2026-02-13 1 author
-
HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World ModelTsinghua UniversityPublished on: 2026-02-12 1 author
-
Federated Balanced LearningPublished on: 2026-02-09 1 author
-
DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous DrivingPublished on: 2026-02-06 1 author
-
MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement LearningHuazhong University of Science and TechnologyPublished on: 2026-02-05 1 author
-
From Chains to Graphs: Self-Structured Reasoning for General-Domain LLMsUniversity of TokyoPublished on: 2026-01-20 1 author
-
Pixel-Perfect Visual Geometry EstimationPublished on: 2026-01-08 1 author
-
DriveLaW:Unifying Planning and Video Generation in a Latent Driving WorldHuazhong University of Science and TechnologyPublished on: 2025-12-31 1 author
-
Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-step High-Fidelity Audio GenerationPublished on: 2025-12-29 1 author
-
GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional EvaluationThe University of Hong KongPublished on: 2025-12-19 1 author
-
DVGT: Driving Visual Geometry TransformerTsinghua UniversityPublished on: 2025-12-18 1 author
Repositories
No repositories yet.
