TAAFT

Papers

  • Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning
    Microsoft / MIT
    Published on: 2025-09-14 1 author
  • Towards an AI-Augmented Textbook
    Published on: 2025-09-13 37 authors
  • Steering MoE LLMs via Expert (De)Activation
    Published on: 2025-09-11 1 author
  • Robix: A Unified Model for Robot Interaction, Reasoning and Planning
    Published on: 2025-09-11 1 author
  • AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
    Published on: 2025-09-08 1 author
  • An AI System to Help Scientists Write Expert-Level Empirical Software
    Google / MIT
    Published on: 2025-09-08 1 author
  • Why Language Models Hallucinate
    Published on: 2025-09-04 4 authors
  • Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving
    Moonshot AI / Tsinghua University
    Published on: 2025-09-03 1 author
  • GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
    Apple / Washington State University
    Published on: 2025-08-27 1 author
  • Measuring the environmental impact of delivering AI at Google Scale
    Published on: 2025-08-21
  • 3D-GENERALIST: Vision-Language-Action Models for Crafting 3D Worlds
    NVIDIA / Stanford University
    Published on: 2025-08-19 1 author
  • X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms
    DeepSeek / University of Illinois Urbana-Champaign
    Published on: 2025-08-18 1 author
  • NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
    Mila / University of Oxford
    Published on: 2025-08-17 3 authors
  • Matrix-3D: Omnidirectional Explorable 3D World Generation
    Published on: 2025-08-11
  • Amazon Ads Multi-Touch Attribution
    Amazon / Northwestern University
    Published on: 2025-08-11 1 author
  • Scaling Laws for Native Multimodal Models
    Apple / Sorbonne University
    Published on: 2025-08-09 1 author
  • Devstral: Fine-tuning Language Models for Coding Agent Applications
    Published on: 2025-08-08 1 author
  • Establishing Best Practices for Building Rigorous Agentic Benchmarks
    Amazon / Stanford University
    Published on: 2025-08-07 1 author
  • No LLM Solved Yu Tsumura's 554th Problem
    Published on: 2025-08-05 2 authors
  • Why do LLMs attend to the first token
    Google / University of Oxford
    Published on: 2025-08-05 7 authors
  • Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction
    Amazon / Princeton University
    Published on: 2025-08-05 1 author
  • Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
    Published on: 2025-08-05 1 author
  • Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference
    ByteDance / Tsinghua University
    Published on: 2025-08-04 1 author
  • Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
    MetaGPT, Google, Microsoft / Nanyang Technological University, Université de Montréal, University of Illinois at Urbana-Champaign
    Published on: 2025-08-02 1 author
  • Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks
    AMD
    Published on: 2025-07-31 1 author
  • Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
    Published on: 2025-07-31 1 author
  • Kimi K2: Open Agentic Intelligence
    Published on: 2025-07-28 1 author
  • Scaling Data-Constrained Language Models
    Google / Harvard University
    Published on: 2025-07-28 1 author
  • Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice
    Published on: 2025-07-27 1 author
  • Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them
    Published on: 2025-07-25 1 author
  • STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
    Snowflake / Seoul National University
    Published on: 2025-07-21 1 author
  • DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
    Published on: 2025-07-18 1 author
  • Voxtral
    Published on: 2025-07-17
  • Apple Intelligence Foundation Language Models: Tech Report 2025
    Published on: 2025-07-17 1 author
  • Non-preemptive Throughput Maximization under Time-varying Capacity
    Google / University of Illinois Urbana-Champaign
    Published on: 2025-07-16 1 author
  • MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning
    Amazon / MIT
    Published on: 2025-07-12 1 author
  • A Survey of Automatic Prompt Optimization with Instruction-focused Heuristic-based Search Algorithm
    Intuit / Vanderbilt University
    Published on: 2025-07-12 1 author
  • SEE: Strategic Exploration and Exploitation for Cohesive In-Context Prompt Optimization
    Intuit / Vanderbilt University
    Published on: 2025-07-12 1 author
  • Skywork-R1V3 Technical Report
    Published on: 2025-07-10 1 author
  • SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Age
    Anthropic / Redwood Research
    Published on: 2025-07-08 1 author
  • Unconditional Diffusion for Generative Sequential Recommendation
    ByteDance / University of Science and Technology of China
    Published on: 2025-07-08 1 author
  • Evaluating the Critical Risks of Amazon’s Nova Premier under the Frontier Model Safety Framework
    Published on: 2025-07-07 1 author
  • Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
    Published on: 2025-07-03 1 author
  • "For Argument's Sake, Show Me How to Harm Myself!": Jailbreaking LLMs in Suicide and Self-Harm Contexts
    Published on: 2025-07-01 2 authors
  • UMA: A Family of Universal Models for Atoms
    Meta Platforms / Carnegie Mellon University
    Published on: 2025-06-30 1 author
  • Hierarchical Reasoning Model
    Published on: 2025-06-26 9 authors
  • Steering Your Diffusion Policy with Latent Space Reinforcement Learning
    Published on: 2025-06-25 1 author
  • Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
    Published on: 2025-06-24 1 author
  • Kimi-VL Technical Report
    Published on: 2025-06-23 1 author
  • TransAct V2: Lifelong User Action Sequence Modeling on Pinterest Recommendation
    Published on: 2025-06-21 1 author