TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Papers

  • Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA
    Published on: 2025-06-03 1 author
  • Test-Time Training with KV Binding Is Secretly Linear Attention
    Published on: 2026-02-24 1 author
  • Compositional Planning with Jumpy World Models
    Meta Platforms / McGill University
    Published on: 2026-02-23 1 author
  • SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning
    Amazon / The Pennsylvania State University
    Published on: 2026-02-23 1 author
  • Toward the Thermodynamic Limit: Neural Operators for Non-equilibrium Dynamics of Mott Insulators
    Published on: 2026-02-23 1 author
  • Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining
    Apple / Stanford University
    Published on: 2026-02-23 1 author
  • Haitao Lin
    Tencent / Fudan University
    Published on: 2026-02-23 1 author
  • How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1
    Published on: 2026-02-23 1 author
  • Wink: Recovering from Misbehaviors in Coding Agents
    Published on: 2026-02-20 1 author
  • Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space
    Published on: 2026-02-20 1 author
  • Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control
    Stanford University
    Published on: 2026-02-20
  • The Geometry of Noise: Why Diffusion Models Don't Need Noise Conditioning
    Published on: 2026-02-20 1 author
  • SARAH: Spatially Aware Real-time Agentic Humans
    Published on: 2026-02-20 1 author
  • MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
    1 author
  • El Agente Gráfico: Structured Execution Graphs for Scientific Agents
    NVIDIA / University of Toronto
    Published on: 2026-02-19 1 author
  • Unified Latents (UL): How to train your latents
    Google / Google DeepMind
    Published on: 2026-02-19 1 author
  • EVMbench: Evaluating AI Agents on Smart Contract Security
    Published on: 2026-02-18 1 author
  • EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data
    NVIDIA / University of California
    Published on: 2026-02-18 1 author
  • Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
    Amazon / UC Berkeley
    Published on: 2026-02-17 1 author
  • jina-embeddings-v5-text: Task-Targeted Embedding Distillation
    Published on: 2026-02-17 1 author
  • World Action Models are Zero-shot Policies
    Published on: 2026-02-17 1 author
  • On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
    Google / Northwestern University
    Published on: 2026-02-17 1 author
  • GLM-5: from Vibe Coding to Agentic Engineering
    Z.ai / Tsinghua University
    Published on: 2026-02-17
  • Image Generation with a Sphere Encoder
    Published on: 2026-02-16 1 author
  • BitDance: Scaling Autoregressive Generative Models with Binary Tokens
    ByteDance / The Chinese University of Hong Kong
    Published on: 2026-02-15 1 author
  • Experiential Reinforcement Learning
    Microsoft / University of Southern California
    Published on: 2026-02-15 1 author
  • Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
    OpenAI / Amazon, Anthropic, Meta
    Published on: 2026-02-15 1 author
  • Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment
    Apple / UC Berkeley
    Published on: 2026-02-14 1 author
  • Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning
    1 author
  • WizardLM: Empowering large pre-trained language models to follow complex instructions
    Microsoft / Peking University
    1 author
  • Florence: A New Foundation Model for Computer Vision
    1 author
  • SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
    Published on: 2026-02-13 1 author
  • Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots
    Published on: 2026-02-12 1 author
  • On-Policy Context Distillation for Language Models
    Microsoft / Microsoft Research
    Published on: 2026-02-12 1 author
  • Iterative Reranking as a Compute-Scaling Method for LLM-based Rankers
    Amazon / Sapienza University
    Published on: 2026-02-12 1 author
  • KG-CRAFT: Knowledge graph-based contrastive reasoning with LLMs for enhancing automated fact-checking
    Amazon / Universidade Federal Fluminense
    Published on: 2026-02-12 1 author
  • Pattern Discovery with Wide-Lens Analysis and Sharp-Focus Validation
    Amazon / UC Santa Cruz
    Published on: 2026-02-12 1 author
  • ChatLLM network: More brains, more intelligence
    Beijing Institute of Technology
    Published on: 2026-02-12 1 author
  • BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation
    ByteDance / Tsinghua University
    Published on: 2026-02-11 1 author
  • Voxtral Realtime
    Published on: 2026-02-11 1 author
  • Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
    Published on: 2026-02-11 2 authors
  • OmniSapiens: A Foundation Model for Social Behavior Processing via HARPO
    MIT, National University of Singapore
    Published on: 2026-02-11 1 author
  • GameDevBench Evaluating Agentic Capabilities Through Game Development Wayne Chi1 , Yixiong Fang1 , Arnav Yayavaram1 , Siddharth Yayavaram1 , Seth Karten2
    Carniege Mellon University, Princeton University
    Published on: 2026-02-11 1 author
  • EchoJEPA: A Latent Predictive Foundation Model for Echocardiography
    Published on: 2026-02-10 1 author
  • Autoregressive Image Generation with Masked Bit Modeling
    Published on: 2026-02-09 1 author
  • iGRPO: Self-Feedback–Driven LLM Reasonin
    Published on: 2026-02-09 1 author
  • ML-DCN: Masked Low-Rank Deep Crossing Network Towards Scalable Ads Click-through Rate Prediction at Pinterest
    Published on: 2026-02-09 1 author
  • Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset
    Anthropic / Northeastern University, China
    Published on: 2026-02-09 1 author
  • HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
    MiniMax / Fudan University
    Published on: 2026-02-08 1 author
  • Intelligence Explosion
    1 author
0 AIs selected
Clear selection
#
Name
Task