Baidu | There's An AI For That

Accounting 4 Advertising 26 Aerospace Technology 1 AI services 4 Animation and Post-production 30 Apparel and Fashion 2 Appliances, Electrical, and Electronics Manufacturing 4 Architecture and Planning 2 Artificial Intelligence 70 Audio and Video Equipment Manufacturing 1 Automation Machinery Manufacturing 7 Biotechnology 6 Blockchain 2 Blogs 1 Book and Periodical Publishing 4 Broadcast Media Production and Distribution 1 Business Consulting 4 Business Content 4 Business Intelligence Platforms 12 Community 1 Computer Games 3 Computer Hardware Manufacturing 2 Construction 4 Consumer 5 Consumer Electronics 1 Cybersecurity 29 Data Infrastructure and Analytics 71 Desktop Computing Software Products 21 E-Learning 49 Education 45 Embedded Software Products 6 Engineering 1 Entertainment 12 Events 2 Financial 37 Food & Beverages 3 Fundraising 1 Graphic Design 54 Hospitality 2 Hospitals and Health Care 38 Human Resources 35 Individual and Family 2 Industrial Machinery Manufacturing 2 Information Technology and Services 75 Interior Design 4 Internet 1 Internet Marketplace Platforms 3 Internet News 2 Internet Publishing 15 IT and IT Consulting 129 Language Schools 1 Legal 19 Machinery Manufacturing 3 Manufacturing 2 Market Research 20 Marketing 61 Media & Telecommunications 6 Media Production 8 Medical Equipment Manufacturing 1 Mobile Computing Software Products 9 Mobile Food 1 Mobile Gaming Apps 2 Movies, Videos and Sound 3 Music 24 Non-profit Organizations 4 Online Audio and Video Media 21 Outsourcing and Offshoring Consulting 1 Personal Care 1 Pharmaceutical Manufacturing 1 Photography 13 Physical, Occupational and Speech Therapists 1 Professional Services 3 Professional Training and Coaching 7 Public Policy 1 Public Relations and Communications 3 Real Estate 7 Research 19 Retail 7 Robotics Engineering 18 Social Networking Platforms 10 Software Development 872 Staffing and Recruiting 4 Strategic Management 1 Technology, Information and Internet 490 Technology, Information and Media 61 Translation and Localization 10 Transportation/Trucking/Railroad 1 Travel & Tourism 10 Truck Transportation 1 Utilities 1 Venture Capital and Private Equity Principals 2 Wellness and Fitness 14 Writing and Editing 63

ByteDance

CapCut Online Creative Suite

Overall Rank#68

Baidu

Follow
Visit website

Baidu is a Chinese multinational technology company specializing in internet-related services, products, and artificial intelligence.

Beijing, China

🇨🇳

AI Native

Number of tools

Number of models

Number of employees

33.5k

Profitable

Yes

Valuation

$46B

Tools 0 Models 18 Papers 5 Repositories 10

Models

Gen 3

PP OCRv6

PP-OCRv6 is PaddlePaddle/Baidu’s lightweight universal OCR system for multilingual text detection and recognition across edge, mobile, desktop, and server deployments.

📜OCR 🔍Data extraction 📄Document analysis

NewMultimodal

Released 12d ago
Gen 7

ERNIE 5.1

ERNIE-5.1 is Baidu’s new preview flagship language model, built for stronger general text capability, better cost efficiency, and improved creative performance. It is positioned as the top-ranked Chinese model on the LMArena Text leaderboard and as a high-efficiency upgrade over ERNIE-5.0 with much lower training cost at its scale.

💬Chatting 💼Business 💻Coding 🤖Agents ⚖️Legal 🔢Math

NewText

Released 1mo ago
Gen 3

Qianfan-OCR

Qianfan-OCR is a 4B end-to-end document intelligence vision-language model that performs direct image-to-Markdown conversion and supports prompt-driven document tasks like table extraction, chart understanding, document QA, and key information extraction.

📜OCR 📄Document data extraction 📷Image text extraction 🖼️Image to markdown

Multimodal

Released 3mo ago
Gen 3 Paddle

PaddleOCR-VL 1.5

Production ready OCR and document AI toolkit that turns images and PDFs into structured data, with multilingual OCR, layout analysis and VLM based document parsing.

🏭Manufacturing

Text

Released 4mo ago
Gen 4 Z image

Z Image

Image

Released 7mo ago
Gen 7 ERNIE

ERNIE 4.5 VL 28B A3B Thinking

A multimodal MoE model that “looks, reads, and reasons” across images, video, and text. It adds tool use and a Thinking with Images mode, supports long context, and activates about 3B parameters per token for flagship-level VLM quality at practical latency.

📷Images 💻Coding 👤Avatars 🎥Videos

Text

Released 7mo ago
Gen 7 ERNIE

ERNIE 5

ERNIE 5 is Baidu’s next-gen general model for reasoning, coding, and multimodal understanding. It supports long context, tool and function calling, reliable JSON, streaming, and enterprise guardrails, making it a strong default for RAG, agents, and document or chart analysis.

📷Images 💻Coding 📝Writing

Text

Released 7mo ago
Gen 3 Paddle

PaddleOCR-VL

PaddleOCR-VL is a vision-language model built around PaddleOCR that reads documents, forms, tables, charts, and screenshots. It combines strong OCR with reasoning over layout and content, then answers in text or structured JSON for multimodal RAG and automation.

🏭Manufacturing 🎮Game creation

Multimodal

Released 8mo ago
Gen 3 Qianfan

Qianfan-VL-3B

Qianfan-VL-3B is Baidu’s lightweight VLM for cost-sensitive, real-time multimodal apps. It processes images plus text and returns grounded answers with basic OCR and layout understanding, long context, tool/function calling, and JSON outputs—optimized for speed and efficiency.

🏭Manufacturing 🖼️Image to text 🔍Image recognition

Text

Released 9mo ago
Gen 3 Qianfan

Qianfan-VL-8B

Qianfan-VL-8B is Baidu’s mid-size vision-language model. It reads images (docs, charts, screenshots, photos) alongside text and returns grounded answers with solid OCR, layout understanding, multi-image reasoning, long context, tool/function calling, and reliable JSON outputs—balanced for quality and latency.

🏭Manufacturing

Text

Released 9mo ago
Gen 3 Qianfan

Qianfan VL 70B

Qianfan-VL 70B is Baidu’s large vision-language model on the Qianfan platform. It ingests images (docs, charts, screenshots, photos) with text and produces grounded answers, featuring strong OCR and layout understanding, long context, tool/function calling, streaming, and reliable JSON outputs for multimodal RAG and enterprise apps.

📜OCR 🖼️3D image generation 🎬Video dubbing 🔍Image recognition

Text

Released 9mo ago
Gen 3 ERNIE

ERNIE 4.5 Turbo

ERNIE 4.5 Turbo is Baidu’s high-throughput, cost-optimized variant of ERNIE 4.5. It delivers strong reasoning and coding with long-context options, tool/function calling, JSON outputs, and streaming—ready for production via ERNIE Bot and the Qianfan API.

💻Coding 📚Summaries 🚀Productivity

Text

Released 9mo ago

Papers

DISPLAY: Directable Human-Object Interaction Video Generation via Sparse Motion Guidance and Multi-Task Auxiliary

Published on: 2026-03-10 10 authors
GarmentPainter: Efficient 3D Garment Texture Synthesis with Character-Guided Diffusion Model

Published on: 2026-03-09 5 authors
Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models

Chinese Academy of Sciences, Peking University, Sun Yat-sen University, University of Chinese Academy of Sciences

Published on: 2026-03-06 9 authors
GenHOI: Towards Object-Consistent Hand-Object Interaction with Temporally Balanced and Spatially Selective Object Injection

Northwestern Polytechnical University, Sun Yat-sen University

Published on: 2026-03-06 12 authors
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Published on: 2026-01-29 15 authors

Go to section

Search

Baidu Follow Visit website

Tools

Models

Papers

Repositories

Help

People also viewed

Feedback and Incident Report

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type:

Baidu

Follow
Visit website