Papers
-
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
-
An AI System to Help Scientists Write Expert-Level Empirical Software
-
Measuring the environmental impact of delivering AI at Google Scale
-
Why do LLMs attend to the first token?
-
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
-
Scaling Data-Constrained Language Models
-
Non-preemptive Throughput Maximization under Time-varying Capacity
-
AlphaEvolve: A coding agent for scientific and algorithmic discovery
-
Gemini Robotics: Bringing AI into the Physical World
-
Lessons from Defending Gemini Against Indirect Prompt Injections
-
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use
-
I-Con: A Unifying Framework for Representation Learning
-
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
-
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
-
How new data permeates LLM knowledge and how to dilute it
-
Migrating Code At Scale With LLMs At Google
-
Gemini: A Family of Highly Capable Multimodal Models
-
Gemma 3 Technical Report
-
Gemini Embedding: Generalizable Embeddings from Gemini
-
EmbeddingGemma: Powerful and Lightweight Text Representations
-
Titans: Learning to Memorize at Test Time
-
VDB-GPDF: Online Gaussian Process Distance Field with VDB Structure
-
OpenVLA: An Open-Source Vision-Language-Action Model
-
Generative Image Dynamics
-
Gemma: Open Models Based on Gemini Research and Technology
-
VideoPoet: A Large Language Model for Zero-Shot Video Generation
-
Solving Olympiad Geometry without Human Demonstrations (AlphaGeometry)
-
PaLM 2 Technical Report
-
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
-
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models (Google / East China Normal University, Singapore Management University, Singapore University of Technology and Design, Southwest Jiaotong University)
-
Sequential Attention: Making AI Models Leaner and Faster Without Sacrificing Accuracy
-
Generative Agents: Interactive Simulacra of Human Behavior
-
ReAct: Synergizing Reasoning and Acting in Language Models
-
PaLM: Scaling Language Modeling with Pathways
-
AudioLM: A Language Modeling Approach to Audio Generation
-
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
-
Discovering faster matrix multiplication algorithms with reinforcement learning
-
CLIP-CLOP: CLIP-Guided Collage and Photomontage
-
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
-
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
-
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
-
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
-
Training Compute-Optimal Large Language Models
-
Competition-Level Code Generation with AlphaCode
-
Improving language models by retrieving from trillions of tokens
-
Highly accurate protein structure prediction with AlphaFold
-
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
-
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
-
Denoising Diffusion Probabilistic Models
-
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
