Papers

Filter by company

Splat and Replace: 3D Reconstruction with Repetitive Elements

Adobe

Published on: 2025-06-06 1 author
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Apple / Swiss Federal Institute of Technology Lausanne

Published on: 2025-06-04 1 author
HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases

Amazon / Carnegie Mellon University

Published on: 2025-06-02 1 author
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Snowflake

Published on: 2025-06-02 1 author
Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Tencent, Apple / The University of Hong Kong, University of Illinois at Urbana-Champaign

Published on: 2025-05-31 1 author
M+: Extending MemoryLLM with Scalable Long-Term Memory

Amazon

Published on: 2025-05-30 1 author
Skywork Open Reasoner 1 Technical Report

Published on: 2025-05-29 1 author
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

NVIDIA

Published on: 2025-05-27 1 author
More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

Moonshot AI / Renmin University of China

Published on: 2025-05-27 1 author
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach

Perplexity

Published on: 2025-05-27 1 author
Autoregressive Speech Synthesis without Vector Quantization

Microsoft / The Chinese University of Hong Kong

Published on: 2025-05-27 1 author
Vision as LoRA

ByteDance / University of Birmingham

Published on: 2025-05-26 8 authors
syftr: Pareto-Optimal Generative AI

DataRobot

Published on: 2025-05-26 1 author
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

MiniMax

Published on: 2025-05-26 1 author
Gemini Robotics: Bringing AI into the Physical World

Google

Published on: 2025-05-25 1 author
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

MiniMax / Fudan University

Published on: 2025-05-24 1 author
One RL to See Them All: Visual Triple Unified Reinforcement Learning

MiniMax

Published on: 2025-05-23 1 author
GiGL: Large-Scale Graph Neural Networks at Snapchat

Snap

Published on: 2025-05-23 1 author
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

ByteDance / Singapore University of Technology and Design

Published on: 2025-05-22 1 author
Model Merging in Pre-training of Large Language Models

ByteDance

Published on: 2025-05-22 1 author
DAPO: An Open-Source LLM Reinforcement Learning System at Scale

ByteDance / Tsinghua University

Published on: 2025-05-20 1 author
M-RewardBench: Evaluating Reward Models in Multilingual Settings

Cohere / Allen Institute for AI

Published on: 2025-05-20 1 author
Lessons from Defending Gemini Against Indirect Prompt Injections

Google

Published on: 2025-05-20 1 author
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

Moonshot AI

Published on: 2025-05-19 1 author
Progressive Autoregressive Video Diffusion Models

Adobe / Stony Brook University

Published on: 2025-05-18 1 author
FastVLM: Efficient Vision Encoding for Vision Language Models

Apple

Published on: 2025-05-15 1 author
VGGT: Visual Geometry Grounded Transformer

Meta Platforms / University of Oxford

Published on: 2025-05-14 6 authors
Qwen3 Technical Report

Alibaba

Published on: 2025-05-14 1 author
The Leaderboard Illusion

Cohere / Princeton University

Published on: 2025-05-12 1 author
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

MiniMax

Published on: 2025-05-12 1 author
LLMs Get Lost In Multi-Turn Conversation

Microsoft

Published on: 2025-05-09 1 author
A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well

Salesforce / City University of Hong Kong

Published on: 2025-05-04 1 author
Command A: An Enterprise-Ready Large Language Model

Cohere

Published on: 2025-05-01 1 author
InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features

Pinterest

Published on: 2025-05-01 1 author
Investigating the Overlooked Hessian Structure: From CNNs to LLMs

ByteDance

Published on: 2025-05-01 1 author
The Leaderboard Illusion

Cohere / Allen Institute for Artificial Intelligence, Massachusetts Institute of Technology, Princeton University, Stanford University, University of Washington, University of Waterloo

Published on: 2025-04-29 13 authors
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use

Google / Stanford University

Published on: 2025-04-28 1 author
Perception Encoder: The best visual embeddings are not at the output of the network

Meta Platforms / Fudan University

Published on: 2025-04-28 1 author
Kimi-Audio Technical Report

Moonshot AI

Published on: 2025-04-25 1 author
I-Con: A Unifying Framework for Representation Learning

Google, Microsoft / MIT

Published on: 2025-04-23 5 authors
Describe Anything: Detailed Localized Image and Video Captioning

NVIDIA / UC Berkeley

Published on: 2025-04-22 11 authors
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Google / JKU Linz

Published on: 2025-04-22 5 authors
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing

Apple

Published on: 2025-04-22 1 author
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Apple

Published on: 2025-04-21 1 author
How Does Critical Batch Size Scale in Pre-training?

Amazon / Harvard University

Published on: 2025-04-21 1 author
Representation Engineering for Large-Language Models: Survey and Research Challenges

Perplexity

Published on: 2025-04-21 1 author
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving

Perplexity

Published on: 2025-04-21 1 author
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

SenseTime / Fudan University, Nanjing University, Shanghai Jiao Tong University, The Chinese University of Hong Kong, Tsinghua University

Published on: 2025-04-19 1 author
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

Google

Published on: 2025-04-17 4 authors
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

ByteDance

Published on: 2025-04-17 1 author

Prev 43 44 45 46 47 48 49 50 51 52 53 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: