2,581
1,977
760
12,826
9,801
3,040
9,999
14,332
5,122
6,204
3,145
9,386
5,458
81,942
3,391
2,181
5,233
11,852
4,260
1,960
Apple
AI Native
No
Number of tools
1
Number of employees
12k
Profitable
Yes
Valuation
$4.02TAI
Most popular AI tool
Tools
-
12,486 www.apple.comShareYerenaMar 2, 2026@Apple Creator StudioHas anyone made anything with these tools they could share? Curious about the AI capabilities.Reply Share Edit Delete Report
Models
-
Manzano is Apple’s unified multimodal model that shares a hybrid vision tokenizer for both image understanding and text-to-image generation, using one autoregressive LLM plus a diffusion decoder to reach state-of-the-art unified performance.NewMultimodalReleased 1mo ago
-
SHARP is Apple’s monocular view-synthesis model that regresses a 3D Gaussian scene from one photo in under a second on a standard GPU, enabling real-time, photorealistic nearby views with metric camera motion.ImageReleased 3mo ago
-
FastVLM is Apple’s lightweight vision-language model built for real-time multimodal apps. It ingests images alongside text and returns grounded answers fast—OCR, charts/diagrams, screenshots, and general visual QA—while supporting long context, tool/function calling, and structured JSON outputs.TextReleased 7mo ago
-
Image Playground is Apple’s on-device generative image tool inside Apple Intelligence. It creates playful visuals in three styles—Animation, Illustration, and Sketch—right inside apps like Messages and Notes, and is also available to developers via the Image Playground API.ImageReleased 1y ago
-
MM1.5 is Apple Research’s refinement of the MM1 multimodal recipe. It keeps the same encoder–decoder architecture but upgrades data curation, image resolution, and multi-image/document training, yielding stronger OCR, layout understanding, chart/diagram reasoning, and more grounded visual answers.TextReleased 1y ago
-
MM1 is Apple Research’s multimodal LLM blueprint: a vision encoder feeding a text decoder via cross-attention, pretrained on a balanced mix of image–caption, interleaved image–text, and text-only data. It highlights how data quality, interleaving, and resolution—not just scale—drive strong OCR, document/chart reasoning, and grounded visual answers.TextReleased 2y ago
Papers
-
Amortizing Maximum Inner Product Search with Learned Support FunctionsMassachusetts Institute of TechnologyPublished on: 2026-03-09 4 authors
-
Expanding LLM Agent Boundaries with Strategy-Guided ExploratioPublished on: 2026-03-02 1 author
-
TrajTok: Learning Trajectory Tokens enables better Video UnderstandingUniversity of WashingtonPublished on: 2026-02-26 1 author
-
The Design Space of Tri-Modal Masked Diffusion ModelsUniversity of CambridgePublished on: 2026-02-25 1 author
-
Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM PretrainingStanford UniversityPublished on: 2026-02-23 1 author
-
ASTRA-bench: Evaluating Tool-Use Agent Reasoning and Action Planning with Personal User ContextPublished on: 2026-02-20 1 author
-
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective AlignmentUC BerkeleyPublished on: 2026-02-14 1 author
-
DSO: Direct Steering Optimization for Bias MitigationCarnegie Mellon UniversityPublished on: 2026-02-12 1 author
-
CLaRa: Bridging Retrieval and Generation with Continuous Latent ReasoningUniversity of EdinburghPublished on: 2026-02-09 1 author
-
MemoryLLM: Plug-n-Play Interpretable Feed-Forward Memory for TransformersPublished on: 2026-01-30 1 author
-
RayRoPE: Projective Ray Positional Encoding for Multi-view AttentionCarnegie Mellon UniversityPublished on: 2026-01-21 1 author
-
GenCtrl -- A Formal Controllability Toolkit for Generative ModelsUniversitat Pompeu FabraPublished on: 2026-01-09 1 author
-
NarrativeTrack: Evaluating Video Language Models Beyond the FrameUniversity of Illinois Urbana-ChampaignPublished on: 2026-01-03 1 author
-
Delay-Tolerant Networking for Tsunami Evacuation on the Small Island of Hachijojima: A Study of Epidemic and Prophet RoutingThe University of NottinghamPublished on: 2025-12-31 1 author
-
Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and DurationPublished on: 2025-12-26 1 author
-
One Layer Is Enough: Adapting Pretrained Visual Encoders for Image GenerationPublished on: 2025-12-16 1 author
-
Sharp Monocular View Synthesis in Less Than a SecondPublished on: 2025-12-11 1 author
-
Chain-of-Image Generation: Toward Monitorable and Controllable Image GenerationDuke UniversityPublished on: 2025-12-09 1 author
-
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem ComplexityPublished on: 2025-11-20 1 author
-
Shielded Diffusion: Generating Novel and Diverse Images using Sparse RepellencyPublished on: 2025-10-28 1 author
-
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient ClippingPurdue UniversityPublished on: 2025-10-25 1 author
-
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image EditingPublished on: 2025-10-22 8 authors
-
FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language ModelsThe Ohio State UniversityPublished on: 2025-09-24 6 authors
-
EpiCache: Episodic KV Cache Management for Long Conversational Question AnsweringHanyang UniversityPublished on: 2025-09-22 5 authors
-
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language ModelsWashington State UniversityPublished on: 2025-08-27 1 author
-
Scaling Laws for Native Multimodal ModelsSorbonne UniversityPublished on: 2025-08-09 1 author
-
Apple Intelligence Foundation Language Models: Tech Report 2025Published on: 2025-07-17 1 author
-
FlexTok: Resampling Images into 1D Token Sequences of Flexible LengthSwiss Federal Institute of Technology LausannePublished on: 2025-06-04 1 author
-
Scaling Diffusion Language Models via Adaptation from Autoregressive ModelsThe University of Hong Kong, University of Illinois at Urbana-ChampaignPublished on: 2025-05-31 1 author
-
FastVLM: Efficient Vision Encoding for Vision Language ModelsPublished on: 2025-05-15 1 author
-
UniVG: A Generalist Diffusion Model for Unified Image Generation and EditingPublished on: 2025-04-22 1 author
-
Depth Pro: Sharp Monocular Metric Depth in Less Than a SecondPublished on: 2025-04-21 1 author
-
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use CapabilitiesPublished on: 2025-04-16 1 author
-
AToken: A Unified Tokenizer for VisionPublished on: 2025-02-19 1 author
-
pfl-research: simulation framework for accelerating research in Private Federated LearningPublished on: 2024-12-10 1 author
-
Controlling Language and Diffusion Models by Transporting ActivationsPublished on: 2024-11-22 1 author
-
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language ModelsPublished on: 2024-10-01 1 author
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM InferencePublished on: 2024-07-19 1 author
-
Efficient Large Language Model Inference with Limited MemoryPublished on: 2024-06-12 1 author
-
Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many MessagesPublished on: 2024-04-26 1 author
-
Diffusion Models Without AttentionCornell UniversityPublished on: 2023-10-30 1 author
-
Differentially Private Heavy Hitter Detection using Federated AnalyticsPublished on: 2023-07-21 1 author
-
Application-Agnostic Language Modeling for On-Device ASRPublished on: 2023-03-16 1 author
-
Stable Diffusion with Core ML on Apple SiliconPublished on: 2022-12-01 1 author
-
Training a Tokenizer for Free with Private Federated LearningCornell UniversityPublished on: 2022-03-15 1 author
-
On-device Panoptic Segmentation for Camera Using TransformersPublished on: 2021-10-19 1 author
-
Federated Evaluation and Tuning for On-Device Personalization: System Design & ApplicationsPublished on: 2021-02-18 1 author
-
Scalable Differential Privacy with Certified Robustness in Adversarial LearningPublished on: 2020-09-15 1 author
-
Overton: A Data System for Monitoring and Improving Machine-Learned ProductsPublished on: 2019-09-07 1 author
-
Evaluating Discourse Phenomena in Neural Machine TranslationUniversity of EdinburghPublished on: 2018-04-20 1 author
Funding rounds
View all-
Debt FinancingClosed 2013-04-30Round $17B Closed
-
IPOClosed 1980-12-12Round $101.20M Post-money $1.78B Closed
-
Series AClosed 1978-01-15Round $517.50K Post-money $5.20M 3 investors Closed
-
SeedAnnounced 1977-01-15Raised $250K Round $250K Pre-money $750K 1 investors Closed
Repositories
No repositories yet.
