TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Papers

Filter by company
  • Splat and Replace: 3D Reconstruction with Repetitive Elements
    Published on: 2025-06-06 1 author
  • FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
    Apple / Swiss Federal Institute of Technology Lausanne
    Published on: 2025-06-04 1 author
  • HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases
    Amazon / Carnegie Mellon University
    Published on: 2025-06-02 1 author
  • SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
    Published on: 2025-06-02 1 author
  • Scaling Diffusion Language Models via Adaptation from Autoregressive Models
    Tencent, Apple / The University of Hong Kong, University of Illinois at Urbana-Champaign
    Published on: 2025-05-31 1 author
  • M+: Extending MemoryLLM with Scalable Long-Term Memory
    Published on: 2025-05-30 1 author
  • Skywork Open Reasoner 1 Technical Report
    Published on: 2025-05-29 1 author
  • GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
    Published on: 2025-05-27 1 author
  • More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives
    Moonshot AI / Renmin University of China
    Published on: 2025-05-27 1 author
  • Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach
    Published on: 2025-05-27 1 author
  • Autoregressive Speech Synthesis without Vector Quantization
    Microsoft / The Chinese University of Hong Kong
    Published on: 2025-05-27 1 author
  • Vision as LoRA
    ByteDance / University of Birmingham
    Published on: 2025-05-26 8 authors
  • syftr: Pareto-Optimal Generative AI
    Published on: 2025-05-26 1 author
  • SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
    Published on: 2025-05-26 1 author
  • Gemini Robotics: Bringing AI into the Physical World
    Published on: 2025-05-25 1 author
  • OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks
    MiniMax / Fudan University
    Published on: 2025-05-24 1 author
  • One RL to See Them All: Visual Triple Unified Reinforcement Learning
    Published on: 2025-05-23 1 author
  • GiGL: Large-Scale Graph Neural Networks at Snapchat
    Published on: 2025-05-23 1 author
  • From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition
    ByteDance / Singapore University of Technology and Design
    Published on: 2025-05-22 1 author
  • Model Merging in Pre-training of Large Language Models
    Published on: 2025-05-22 1 author
  • DAPO: An Open-Source LLM Reinforcement Learning System at Scale
    ByteDance / Tsinghua University
    Published on: 2025-05-20 1 author
  • M-RewardBench: Evaluating Reward Models in Multilingual Settings
    Cohere / Allen Institute for AI
    Published on: 2025-05-20 1 author
  • Lessons from Defending Gemini Against Indirect Prompt Injections
    Published on: 2025-05-20 1 author
  • G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
    Published on: 2025-05-19 1 author
  • Progressive Autoregressive Video Diffusion Models
    Adobe / Stony Brook University
    Published on: 2025-05-18 1 author
  • FastVLM: Efficient Vision Encoding for Vision Language Models
    Published on: 2025-05-15 1 author
  • VGGT: Visual Geometry Grounded Transformer
    Meta Platforms / University of Oxford
    Published on: 2025-05-14 6 authors
  • Qwen3 Technical Report
    Published on: 2025-05-14 1 author
  • The Leaderboard Illusion
    Cohere / Princeton University
    Published on: 2025-05-12 1 author
  • MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder
    Published on: 2025-05-12 1 author
  • LLMs Get Lost In Multi-Turn Conversation
    Published on: 2025-05-09 1 author
  • A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well
    Salesforce / City University of Hong Kong
    Published on: 2025-05-04 1 author
  • Command A: An Enterprise-Ready Large Language Model
    Published on: 2025-05-01 1 author
  • InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features
    Published on: 2025-05-01 1 author
  • Investigating the Overlooked Hessian Structure: From CNNs to LLMs
    Published on: 2025-05-01 1 author
  • The Leaderboard Illusion
    Cohere / Allen Institute for Artificial Intelligence, Massachusetts Institute of Technology, Princeton University, Stanford University, University of Washington, University of Waterloo
    Published on: 2025-04-29 13 authors
  • Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use
    Google / Stanford University
    Published on: 2025-04-28 1 author
  • Perception Encoder: The best visual embeddings are not at the output of the network
    Meta Platforms / Fudan University
    Published on: 2025-04-28 1 author
  • Kimi-Audio Technical Report
    Published on: 2025-04-25 1 author
  • I-Con: A Unifying Framework for Representation Learning
    Google, Microsoft / MIT
    Published on: 2025-04-23 5 authors
  • Describe Anything: Detailed Localized Image and Video Captioning
    NVIDIA / UC Berkeley
    Published on: 2025-04-22 11 authors
  • LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
    Google / JKU Linz
    Published on: 2025-04-22 5 authors
  • UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
    Published on: 2025-04-22 1 author
  • Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
    Published on: 2025-04-21 1 author
  • How Does Critical Batch Size Scale in Pre-training?
    Amazon / Harvard University
    Published on: 2025-04-21 1 author
  • Representation Engineering for Large-Language Models: Survey and Research Challenges
    Published on: 2025-04-21 1 author
  • FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
    Published on: 2025-04-21 1 author
  • InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
    SenseTime / Fudan University, Nanjing University, Shanghai Jiao Tong University, The Chinese University of Hong Kong, Tsinghua University
    Published on: 2025-04-19 1 author
  • It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
    Published on: 2025-04-17 4 authors
  • ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
    Published on: 2025-04-17 1 author
0 AIs selected
Clear selection
#
Name
Task