TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Papers

Filter by company
  • Command A: An Enterprise-Ready Large Language Model
    Published on: 2025-05-01 1 author
  • InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features
    Published on: 2025-05-01 1 author
  • Investigating the Overlooked Hessian Structure: From CNNs to LLMs
    Published on: 2025-05-01 1 author
  • The Leaderboard Illusion
    Published on: 2025-04-29 13 authors
  • Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use
    Google / Stanford University
    Published on: 2025-04-28 1 author
  • Perception Encoder: The best visual embeddings are not at the output of the network
    Meta Platforms / Fudan University
    Published on: 2025-04-28 1 author
  • Kimi-Audio Technical Report
    Published on: 2025-04-25 1 author
  • I-Con: A Unifying Framework for Representation Learning
    Google, Microsoft / MIT
    Published on: 2025-04-23 5 authors
  • Describe Anything: Detailed Localized Image and Video Captioning
    NVIDIA / UC Berkeley
    Published on: 2025-04-22 11 authors
  • LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
    Google / JKU Linz
    Published on: 2025-04-22 5 authors
  • UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
    Published on: 2025-04-22 1 author
  • Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
    Published on: 2025-04-21 1 author
  • How Does Critical Batch Size Scale in Pre-training?
    Amazon / Harvard University
    Published on: 2025-04-21 1 author
  • Representation Engineering for Large-Language Models: Survey and Research Challenges
    Published on: 2025-04-21 1 author
  • FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
    Published on: 2025-04-21 1 author
  • InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
    SenseTime / Fudan University, Nanjing University, Shanghai Jiao Tong University, The Chinese University of Hong Kong, Tsinghua University
    Published on: 2025-04-19 1 author
  • It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
    Published on: 2025-04-17 4 authors
  • ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
    Published on: 2025-04-17 1 author
  • ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
    Published on: 2025-04-16 1 author
  • How new data permeates LLM knowledge and how to dilute it
    Published on: 2025-04-13 1 author
  • Migrating Code At Scale With LLMs At Google
    Published on: 2025-04-13 1 author
  • VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
    Published on: 2025-04-11 1 author
  • Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
    Published on: 2025-04-11 1 author
  • PixelFlow: Pixel-Space Generative Models with Flow
    Adobe / The University of Hong Kong
    Published on: 2025-04-10 5 authors
  • Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
    Published on: 2025-04-10 1 author
  • SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills
    Cisco Systems / The Ohio State University
    Published on: 2025-04-09 11 authors
  • Gemini: A Family of Highly Capable Multimodal Models
    Google / Google DeepMind
    Published on: 2025-04-09 1 author
  • SmolVLM: Redefining small and efficient multimodal models
    Hugging Face / Stanford University
    Published on: 2025-04-07 1 author
  • One-Minute Video Generation with Test-Time Training
    NVIDIA / Stanford University
    Published on: 2025-04-07 1 author
  • Data Scaling Laws for End-to-End Autonomous Driving
    NVIDIA / New York University, Stanford University, University of Toronto
    Published on: 2025-04-06 10 authors
  • Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
    MiniMax / Shanghai Jiao Tong University
    Published on: 2025-04-04 1 author
  • MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding
    Together AI / Carnegie Mellon University
    Published on: 2025-04-02 1 author
  • A Systematic Survey of Automatic Prompt Optimization Techniques
    Published on: 2025-04-02 1 author
  • Scaling Language-Free Visual Representation Learning
    Meta Platforms / New York University, Princeton University
    Published on: 2025-04-01 11 authors
  • Large Language Models Pass the Turing Test
    Published on: 2025-03-31 2 authors
  • XAMBA: SSMs on Edge NPUs
    Intel / Purdue University
    Published on: 2025-03-31 1 author
  • On the Biology of a Large Language Model
    Published on: 2025-03-27 1 author
  • Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration
    Published on: 2025-03-26 1 author
  • Qwen2.5-Omni Technical Report
    Published on: 2025-03-26 1 author
  • Neuromorphic Principles for Efficient Large Language Models on Intel Loihi 2
    Intel / University of California
    Published on: 2025-03-25 1 author
  • ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback
    Published on: 2025-03-25 1 author
  • Gemma 3 Technical Report
    Published on: 2025-03-25 1 author
  • Debunking the CUDA Myth Towards GPU-based AI Systems
    Intel / Korea Advanced Institute of Science & Technology
    Published on: 2025-03-22
  • The Amazon Nova Family of Models: Technical Report and Model Card
    Published on: 2025-03-17 1 author
  • StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error
    Z.ai / Tsinghua University
    Published on: 2025-03-13 1 author
  • Long Context Tuning for Video Generation
    ByteDance / The Chinese University of Hong Kong
    Published on: 2025-03-13 1 author
  • Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
    Published on: 2025-03-12 1 author
  • HunyuanVideo: A Systematic Framework For Large Video Generative Models
    Published on: 2025-03-11 1 author
  • Learning to Search Effective Example Sequences for In-Context Learning
    Published on: 2025-03-11 1 author
  • Gemini Embedding: Generalizable Embeddings from Gemini
    Published on: 2025-03-10 1 author
0 AIs selected
Clear selection
#
Name
Task