Apple

NVIDIA Google Microsoft Amazon Meta Platforms Tesla OpenAI WordPress Tencent Amazon Web Services ByteDance Alibaba Anthropic Cisco Systems xAI Salesforce Shopify LinkedIn Adobe CrowdStrike

Meta Platforms Tencent Slack Technologies Strofe ESAI Sciox.ai FlexOS Global Tweetai Raven Tech PetPortrait.AI Deciphr Corporation Copylime Charisma Entertainment Useful collective Polaris Software Koolio.ai Eklipse Consensus NLP Flamel AI GPTZero

Overall Rank#2

Cupertino, California, United States

🇺🇸

Visit website

AI Native

Number of tools

Number of employees

12k

Profitable

Yes

Valuation

$4.02TAI

Tools

Apple Creator Studio

All yours for the making.

Multimedia

Open

12,529 www.apple.com

Yerena

Mar 2, 2026

@Apple Creator Studio

Has anyone made anything with these tools they could share? Curious about the AI capabilities.

Reply Share Edit Delete Report

Share

🇺🇸 United States
Released 1mo ago
No pricing

13,242
15
5.0

Models

Gen 3

Manzano

Manzano is Apple’s unified multimodal model that shares a hybrid vision tokenizer for both image understanding and text-to-image generation, using one autoregressive LLM plus a diffusion decoder to reach state-of-the-art unified performance.

🖼️Image generation 🔊Text to speech 🔍SEO content 🎮Game creation

NewMultimodal

Released 1mo ago
Gen 4

SHARP

SHARP is Apple’s monocular view-synthesis model that regresses a 3D Gaussian scene from one photo in under a second on a standard GPU, enabling real-time, photorealistic nearby views with metric camera motion.

🖼️Image generation 🔍Image recognition

Image

Released 3mo ago
Gen 3

FastVLM

FastVLM is Apple’s lightweight vision-language model built for real-time multimodal apps. It ingests images alongside text and returns grounded answers fast—OCR, charts/diagrams, screenshots, and general visual QA—while supporting long context, tool/function calling, and structured JSON outputs.

📜OCR 🔍SEO content 📞Customer support 📊Data analysis

Text

Released 7mo ago
Gen 4

Image Playground

Image Playground is Apple’s on-device generative image tool inside Apple Intelligence. It creates playful visuals in three styles—Animation, Illustration, and Sketch—right inside apps like Messages and Notes, and is also available to developers via the Image Playground API.

📷Images 🔊Text to speech 🖼️Image generation 🍸Cocktail recipes

Image

Released 1y ago
Gen 3

MM1.5

MM1.5 is Apple Research’s refinement of the MM1 multimodal recipe. It keeps the same encoder–decoder architecture but upgrades data curation, image resolution, and multi-image/document training, yielding stronger OCR, layout understanding, chart/diagram reasoning, and more grounded visual answers.

Text

Released 1y ago
Gen 3

MM1

MM1 is Apple Research’s multimodal LLM blueprint: a vision encoder feeding a text decoder via cross-attention, pretrained on a balanced mix of image–caption, interleaved image–text, and text-only data. It highlights how data quality, interleaving, and resolution—not just scale—drive strong OCR, document/chart reasoning, and grounded visual answers.

Text

Released 2y ago

Papers

Amortizing Maximum Inner Product Search with Learned Support Functions

Massachusetts Institute of Technology

Published on: 2026-03-09 4 authors
Expanding LLM Agent Boundaries with Strategy-Guided Exploratio

Published on: 2026-03-02 1 author
TrajTok: Learning Trajectory Tokens enables better Video Understanding

University of Washington

Published on: 2026-02-26 1 author
The Design Space of Tri-Modal Masked Diffusion Models

University of Cambridge

Published on: 2026-02-25 1 author
Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

Stanford University

Published on: 2026-02-23 1 author
ASTRA-bench: Evaluating Tool-Use Agent Reasoning and Action Planning with Personal User Context

Published on: 2026-02-20 1 author
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

UC Berkeley

Published on: 2026-02-14 1 author
DSO: Direct Steering Optimization for Bias Mitigation

Carnegie Mellon University

Published on: 2026-02-12 1 author
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

University of Edinburgh

Published on: 2026-02-09 1 author
MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

Published on: 2026-01-30 1 author
RayRoPE: Projective Ray Positional Encoding for Multi-view Attention

Carnegie Mellon University

Published on: 2026-01-21 1 author
GenCtrl -- A Formal Controllability Toolkit for Generative Models

Universitat Pompeu Fabra

Published on: 2026-01-09 1 author
NarrativeTrack: Evaluating Video Language Models Beyond the Frame

University of Illinois Urbana-Champaign

Published on: 2026-01-03 1 author
Delay-Tolerant Networking for Tsunami Evacuation on the Small Island of Hachijojima: A Study of Epidemic and Prophet Routing

The University of Nottingham

Published on: 2025-12-31 1 author
Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

Published on: 2025-12-26 1 author
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image Generation

Published on: 2025-12-16 1 author
Sharp Monocular View Synthesis in Less Than a Second

Published on: 2025-12-11 1 author
Chain-of-Image Generation: Toward Monitorable and Controllable Image Generation

Duke University

Published on: 2025-12-09 1 author
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Published on: 2025-11-20 1 author
Shielded Diffusion: Generating Novel and Diverse Images using Sparse Repellency

Published on: 2025-10-28 1 author
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping

Purdue University

Published on: 2025-10-25 1 author
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

Published on: 2025-10-22 8 authors
FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models

The Ohio State University

Published on: 2025-09-24 6 authors
EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

Hanyang University

Published on: 2025-09-22 5 authors
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Washington State University

Published on: 2025-08-27 1 author
Scaling Laws for Native Multimodal Models

Sorbonne University

Published on: 2025-08-09 1 author
Apple Intelligence Foundation Language Models: Tech Report 2025

Published on: 2025-07-17 1 author
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Swiss Federal Institute of Technology Lausanne

Published on: 2025-06-04 1 author
Scaling Diffusion Language Models via Adaptation from Autoregressive Models

The University of Hong Kong, University of Illinois at Urbana-Champaign

Published on: 2025-05-31 1 author
FastVLM: Efficient Vision Encoding for Vision Language Models

Published on: 2025-05-15 1 author
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing

Published on: 2025-04-22 1 author
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Published on: 2025-04-21 1 author
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

Published on: 2025-04-16 1 author
AToken: A Unified Tokenizer for Vision

Published on: 2025-02-19 1 author
pfl-research: simulation framework for accelerating research in Private Federated Learning

Published on: 2024-12-10 1 author
Controlling Language and Diffusion Models by Transporting Activations

Published on: 2024-11-22 1 author
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models

Published on: 2024-10-01 1 author
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Published on: 2024-07-19 1 author
Efficient Large Language Model Inference with Limited Memory

Published on: 2024-06-12 1 author
Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages

Published on: 2024-04-26 1 author
Diffusion Models Without Attention

Cornell University

Published on: 2023-10-30 1 author
Differentially Private Heavy Hitter Detection using Federated Analytics

Published on: 2023-07-21 1 author
Application-Agnostic Language Modeling for On-Device ASR

Published on: 2023-03-16 1 author
Stable Diffusion with Core ML on Apple Silicon

Published on: 2022-12-01 1 author
Training a Tokenizer for Free with Private Federated Learning

Cornell University

Published on: 2022-03-15 1 author
On-device Panoptic Segmentation for Camera Using Transformers

Published on: 2021-10-19 1 author
Federated Evaluation and Tuning for On-Device Personalization: System Design & Applications

Published on: 2021-02-18 1 author
Scalable Differential Privacy with Certified Robustness in Adversarial Learning

Published on: 2020-09-15 1 author
Overton: A Data System for Monitoring and Improving Machine-Learned Products

Published on: 2019-09-07 1 author
Evaluating Discourse Phenomena in Neural Machine Translation

University of Edinburgh

Published on: 2018-04-20 1 author

Funding rounds

View all

Debt Financing

Closed 2013-04-30

Round $17B Closed
IPO

Closed 1980-12-12

Round $101.20M Post-money $1.78B Closed
Series A

Closed 1978-01-15

Round $517.50K Post-money $5.20M 3 investors Closed
Seed

Announced 1977-01-15

Raised $250K Round $250K Pre-money $750K 1 investors Closed

Search

Apple

Tools

Models

Papers

Funding rounds

Repositories

Help

People also viewed

Feedback and Incident Report

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: