Papers

Filter by company

Agent Laboratory: Using LLM Agents as Research Assistants

AMD / ETH Zurich

Published on: 2025-06-17 1 author
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs

Published on: 2025-06-17 1 author
AlphaEvolve: A coding agent for scientific and algorithmic discovery

Google

Published on: 2025-06-16 1 author
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences

Snowflake

Published on: 2025-06-16 1 author
Transformers without Normalization

Meta Platforms / New York University

Published on: 2025-06-14 5 authors
T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scalin

Z.ai

Published on: 2025-06-13 1 author
Ministral 3

Mistral AI

Published on: 2025-06-13 1 author
The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs

Intuit

Published on: 2025-06-13 1 author
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Salesforce / University of Illinois Urbana-Champaign

Published on: 2025-06-12 10 authors
Magistral

Mistral AI

Published on: 2025-06-12 1 author
Adobe Researchers present a powerful, unified approach to generative video editing at CVPR 2025

Adobe

Published on: 2025-06-12 1 author
Multi-Token Attention

Meta Platforms

Published on: 2025-06-11 4 authors
Transaction Categorization with Relational Deep Learning in QuickBooks

Intuit / University of Notre Dame

Published on: 2025-06-10 1 author
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Published on: 2025-06-09 1 author
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Published on: 2025-06-09 1 author
Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation

Intuit / University of California

Published on: 2025-06-07 1 author
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists

Intuit / Virginia Tech

Published on: 2025-06-07 1 author
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Published on: 2025-06-06 1 author
Splat and Replace: 3D Reconstruction with Repetitive Elements

Adobe

Published on: 2025-06-06 1 author
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Apple / Swiss Federal Institute of Technology Lausanne

Published on: 2025-06-04 1 author
HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases

Amazon / Carnegie Mellon University

Published on: 2025-06-02 1 author
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Snowflake

Published on: 2025-06-02 1 author
Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Tencent, Apple / The University of Hong Kong, University of Illinois at Urbana-Champaign

Published on: 2025-05-31 1 author
M+: Extending MemoryLLM with Scalable Long-Term Memory

Amazon

Published on: 2025-05-30 1 author
Skywork Open Reasoner 1 Technical Report

Published on: 2025-05-29 1 author
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

NVIDIA

Published on: 2025-05-27 1 author
More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

Moonshot AI / Renmin University of China

Published on: 2025-05-27 1 author
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach

Perplexity

Published on: 2025-05-27 1 author
Autoregressive Speech Synthesis without Vector Quantization

Microsoft / The Chinese University of Hong Kong

Published on: 2025-05-27 1 author
Vision as LoRA

ByteDance / University of Birmingham

Published on: 2025-05-26 8 authors
syftr: Pareto-Optimal Generative AI

DataRobot

Published on: 2025-05-26 1 author
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

MiniMax

Published on: 2025-05-26 1 author
Gemini Robotics: Bringing AI into the Physical World

Google

Published on: 2025-05-25 1 author
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

MiniMax / Fudan University

Published on: 2025-05-24 1 author
One RL to See Them All: Visual Triple Unified Reinforcement Learning

MiniMax

Published on: 2025-05-23 1 author
GiGL: Large-Scale Graph Neural Networks at Snapchat

Snap

Published on: 2025-05-23 1 author
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

ByteDance / Singapore University of Technology and Design

Published on: 2025-05-22 1 author
Model Merging in Pre-training of Large Language Models

ByteDance

Published on: 2025-05-22 1 author
DAPO: An Open-Source LLM Reinforcement Learning System at Scale

ByteDance / Tsinghua University

Published on: 2025-05-20 1 author
M-RewardBench: Evaluating Reward Models in Multilingual Settings

Cohere / Allen Institute for AI

Published on: 2025-05-20 1 author
Lessons from Defending Gemini Against Indirect Prompt Injections

Google

Published on: 2025-05-20 1 author
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

Moonshot AI

Published on: 2025-05-19 1 author
Progressive Autoregressive Video Diffusion Models

Adobe / Stony Brook University

Published on: 2025-05-18 1 author
FastVLM: Efficient Vision Encoding for Vision Language Models

Apple

Published on: 2025-05-15 1 author
VGGT: Visual Geometry Grounded Transformer

Meta Platforms / University of Oxford

Published on: 2025-05-14 6 authors
Qwen3 Technical Report

Alibaba

Published on: 2025-05-14 1 author
The Leaderboard Illusion

Cohere / Princeton University

Published on: 2025-05-12 1 author
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

MiniMax

Published on: 2025-05-12 1 author
LLMs Get Lost In Multi-Turn Conversation

Microsoft

Published on: 2025-05-09 1 author
A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well

Salesforce / City University of Hong Kong

Published on: 2025-05-04 1 author

Prev 24 25 26 27 28 29 30 31 32 33 34 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: