Papers

Filter by company

Evaluating the Critical Risks of Amazon’s Nova Premier under the Frontier Model Safety Framework

Amazon

Published on: 2025-07-07 1 author
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Published on: 2025-07-03 1 author
`For Argument's Sake, Show Me How to Harm Myself!': Jailbreaking LLMs in Suicide and Self-Harm Contexts

Northeastern University

Published on: 2025-07-01 2 authors
UMA: A Family of Universal Models for Atoms

Meta Platforms / Carnegie Mellon University

Published on: 2025-06-30 1 author
Hierarchical Reasoning Model

Sapient Intelligence / Tsinghua University

Published on: 2025-06-26 9 authors
Steering Your Diffusion Policy with Latent Space Reinforcement Learning

Amazon

Published on: 2025-06-25 1 author
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs

Published on: 2025-06-24 1 author
KIMI-VL TECHNICAL REPORT

Moonshot AI

Published on: 2025-06-23 1 author
TransAct V2: Lifelong User Action Sequence Modeling on Pinterest Recommendation

Pinterest

Published on: 2025-06-21 1 author
Next-User Retrieval: Enhancing Cold-Start Recommendations via Generative Next-User Modeling

Published on: 2025-06-18 1 author
Agent Laboratory: Using LLM Agents as Research Assistants

AMD / ETH Zurich

Published on: 2025-06-17 1 author
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs

Published on: 2025-06-17 1 author
AlphaEvolve: A coding agent for scientific and algorithmic discovery

Google

Published on: 2025-06-16 1 author
Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences

Snowflake

Published on: 2025-06-16 1 author
Transformers without Normalization

Meta Platforms / New York University

Published on: 2025-06-14 5 authors
T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scalin

Z.ai

Published on: 2025-06-13 1 author
Ministral 3

Mistral AI

Published on: 2025-06-13 1 author
The Behavior Gap: Evaluating Zero-shot LLM Agents in Complex Task-Oriented Dialogs

Intuit

Published on: 2025-06-13 1 author
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Salesforce / University of Illinois Urbana-Champaign

Published on: 2025-06-12 10 authors
Magistral

Mistral AI

Published on: 2025-06-12 1 author
Adobe Researchers present a powerful, unified approach to generative video editing at CVPR 2025

Adobe

Published on: 2025-06-12 1 author
Multi-Token Attention

Meta Platforms

Published on: 2025-06-11 4 authors
Transaction Categorization with Relational Deep Learning in QuickBooks

Intuit / University of Notre Dame

Published on: 2025-06-10 1 author
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Published on: 2025-06-09 1 author
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Published on: 2025-06-09 1 author
Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation

Intuit / University of California

Published on: 2025-06-07 1 author
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists

Intuit / Virginia Tech

Published on: 2025-06-07 1 author
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Published on: 2025-06-06 1 author
Splat and Replace: 3D Reconstruction with Repetitive Elements

Adobe

Published on: 2025-06-06 1 author
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Apple / Swiss Federal Institute of Technology Lausanne

Published on: 2025-06-04 1 author
HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases

Amazon / Carnegie Mellon University

Published on: 2025-06-02 1 author
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Snowflake

Published on: 2025-06-02 1 author
Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Tencent, Apple / The University of Hong Kong, University of Illinois at Urbana-Champaign

Published on: 2025-05-31 1 author
M+: Extending MemoryLLM with Scalable Long-Term Memory

Amazon

Published on: 2025-05-30 1 author
Skywork Open Reasoner 1 Technical Report

Published on: 2025-05-29 1 author
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

NVIDIA

Published on: 2025-05-27 1 author
More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

Moonshot AI / Renmin University of China

Published on: 2025-05-27 1 author
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach

Perplexity

Published on: 2025-05-27 1 author
Autoregressive Speech Synthesis without Vector Quantization

Microsoft / The Chinese University of Hong Kong

Published on: 2025-05-27 1 author
Vision as LoRA

ByteDance / University of Birmingham

Published on: 2025-05-26 8 authors
syftr: Pareto-Optimal Generative AI

DataRobot

Published on: 2025-05-26 1 author
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

MiniMax

Published on: 2025-05-26 1 author
Gemini Robotics: Bringing AI into the Physical World

Google

Published on: 2025-05-25 1 author
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

MiniMax / Fudan University

Published on: 2025-05-24 1 author
One RL to See Them All: Visual Triple Unified Reinforcement Learning

MiniMax

Published on: 2025-05-23 1 author
GiGL: Large-Scale Graph Neural Networks at Snapchat

Snap

Published on: 2025-05-23 1 author
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

ByteDance / Singapore University of Technology and Design

Published on: 2025-05-22 1 author
Model Merging in Pre-training of Large Language Models

ByteDance

Published on: 2025-05-22 1 author
DAPO: An Open-Source LLM Reinforcement Learning System at Scale

ByteDance / Tsinghua University

Published on: 2025-05-20 1 author
M-RewardBench: Evaluating Reward Models in Multilingual Settings

Cohere / Allen Institute for AI

Published on: 2025-05-20 1 author

Prev 62 63 64 65 66 67 68 69 70 71 72 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: