Papers
-
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
-
Kimi K2: Open Agentic Intelligence
-
Scaling Data-Constrained Language Models
-
Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice
-
Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them
-
STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
-
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
-
Voxtral
-
Apple Intelligence Foundation Language Models: Tech Report 2025
-
Non-preemptive Throughput Maximization under Time-varying Capacity
-
MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning
-
A Survey of Automatic Prompt Optimization with Instruction-focused Heuristic-based Search Algorithm
-
SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization
-
Skywork-R1V3 Technical Report
-
SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents
-
Unconditional Diffusion for Generative Sequential Recommendation
-
Evaluating the Critical Risks of Amazon’s Nova Premier under the Frontier Model Safety Framework
-
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
-
'For Argument's Sake, Show Me How to Harm Myself!': Jailbreaking LLMs in Suicide and Self-Harm Contexts
-
UMA: A Family of Universal Models for Atoms
-
Hierarchical Reasoning Model
-
Steering Your Diffusion Policy with Latent Space Reinforcement Learning
-
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
-
Kimi-VL Technical Report
-
TransAct V2: Lifelong User Action Sequence Modeling on Pinterest Recommendation
-
Next-User Retrieval: Enhancing Cold-Start Recommendations via Generative Next-User Modeling
-
Agent Laboratory: Using LLM Agents as Research Assistants
-
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs
-
AlphaEvolve: A coding agent for scientific and algorithmic discovery
-
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences
-
Transformers without Normalization
-
T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
-
Ministral 3
-
The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs
-
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
-
Magistral
-
Adobe Researchers present a powerful, unified approach to generative video editing at CVPR 2025
-
Multi-Token Attention
-
Transaction Categorization with Relational Deep Learning in QuickBooks
-
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought
-
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning
-
Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation
-
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists
-
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning
-
Splat and Replace: 3D Reconstruction with Repetitive Elements
-
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
-
HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases
-
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
-
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
-
M+: Extending MemoryLLM with Scalable Long-Term Memory
