Papers
- SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning
- Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
- SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
- Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots
- Iterative Reranking as a Compute-Scaling Method for LLM-based Rankers
- KG-CRAFT: Knowledge graph-based contrastive reasoning with LLMs for enhancing automated fact-checking
- Pattern Discovery with Wide-Lens Analysis and Sharp-Focus Validation
- Autoregressive Image Generation with Masked Bit Modeling
- AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
- Interpretable Tabular Foundation Models via In-Context Kernel Regression
- RFS: Reinforcement Learning with Residual Flow Steering for Dexterous Manipulation
- Differentiable Semantic ID for Generative Recommendation
- AnyView: Synthesizing Any Novel View in Dynamic Scenes
- MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents
- Internal Representations as Indicators of Hallucinations in Agent Tool Selection
- ELLA: Efficient Lifelong Learning for Adapters
- Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes
- Journey Before Destination: On the importance of Visual Faithfulness in Slow Thinking
- Diffusion Language Model Inference with Monte Carlo Tree Search
- Amazon Ads Multi-Touch Attribution
- Evaluating the Critical Risks of Amazon’s Nova Premier under the Frontier Model Safety Framework
- The Amazon Nova Family of Models: Technical Report and Model Card
- Evaluating Nova 2.0 Lite model under Amazon’s Frontier Model Safety Framework
- Chronos: Learning the Language of Time Series
- AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2seq Model
- DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks
