TAAFT

Papers

  • Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation
    Microsoft / National University of Singapore
    Published on: 2026-01-05 1 author
  • NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation
    ByteDance / Tsinghua University
    Published on: 2026-01-05 1 author
  • ELLA: Efficient Lifelong Learning for Adapters
    Amazon / Purdue University
    Published on: 2026-01-05 1 author
  • Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes
    Published on: 2026-01-05 1 author
  • mHC: Manifold-Constrained Hyper-Connections
    Published on: 2026-01-05 1 author
  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
    Published on: 2026-01-04 1 author
  • AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting
    Snap / Tübingen AI Center, University of Tübingen
    Published on: 2026-01-04 1 author
  • Animated 3DGS Avatars in Diverse Scenes with Consistent Lighting and Shadows
    Snap / Tübingen AI Center, University of Tübingen
    Published on: 2026-01-04 1 author
  • NarrativeTrack: Evaluating Video Language Models Beyond the Frame
    Apple / University of Illinois Urbana-Champaign
    Published on: 2026-01-03 1 author
  • DriveLaW: Unifying Planning and Video Generation in a Latent Driving World
    Xiaomi / Huazhong University of Science and Technology
    Published on: 2025-12-31 1 author
  • Delay-Tolerant Networking for Tsunami Evacuation on the Small Island of Hachijojima: A Study of Epidemic and Prophet Routing
    Apple / The University of Nottingham
    Published on: 2025-12-31 1 author
  • YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection
    Tencent / Singapore Management University
    Published on: 2025-12-30 1 author
  • HY-MT1.5 Technical Report
    Published on: 2025-12-30 1 author
  • GARDO: Reinforcing Diffusion Models without Reward Hacking
    Kuaishou Technology / Hong Kong University of Science and Technology
    Published on: 2025-12-30 1 author
  • Flow2GAN: Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-step High-Fidelity Audio Generation
    Published on: 2025-12-29 1 author
  • ThinkGen: Generalized Thinking for Visual Generation
    ByteDance / Beijing Jiaotong University
    Published on: 2025-12-29 1 author
  • D2Pruner: Debiased Importance and Structural Diversity for MLLM Token Pruning
    Tencent / Shanghai Jiao Tong University
    Published on: 2025-12-26 1 author
  • Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration
    Published on: 2025-12-26 1 author
  • DiverseGRPO: Mitigating Mode Collapse in Image Generation via Diversity-Aware GRPO
    Published on: 2025-12-25 1 author
  • SemanticGen: Video Generation in Semantic Space
    Kuaishou Technology / Zhejiang University
    Published on: 2025-12-25 1 author
  • Streaming Video Instruction Tuning
    Tencent / Hong Kong Baptist University
    Published on: 2025-12-24 1 author
  • Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures
    Published on: 2025-12-23 1 author
  • FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
    Snap / Sun Yat-sen University
    Published on: 2025-12-23 1 author
  • GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation
    Published on: 2025-12-23 1 author
  • COBRA: Catastrophic Bit-flip Reliability Analysis of State-Space Models
    Published on: 2025-12-22 1 author
  • From Word to World: Can Large Language Models be Implicit Text-based World Models?
    Microsoft / Southern University of Science and Technology
    Published on: 2025-12-21 1 author
  • Secret mixtures of experts inside your LLM
    Published on: 2025-12-20 1 author
  • Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience
    Published on: 2025-12-19 22 authors
  • GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation
    Xiaomi / The University of Hong Kong
    Published on: 2025-12-19 1 author
  • Diffusion Forcing for Multi-Agent Interaction Sequence Modeling
    Published on: 2025-12-19 1 author
  • Sigma-MoE-Tiny Technical Report
    Microsoft / Microsoft Research
    Published on: 2025-12-19 1 author
  • Journey Before Destination: On the importance of Visual Faithfulness in Slow Thinking
    Amazon / University of Wisconsin-Madison
    Published on: 2025-12-19 1 author
  • Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
    Published on: 2025-12-19 1 author
  • Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
    Published on: 2025-12-19
  • DVGT: Driving Visual Geometry Transformer
    Xiaomi / Tsinghua University
    Published on: 2025-12-18 1 author
  • RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing
    Tencent / The Chinese University of Hong Kong
    Published on: 2025-12-18 1 author
  • N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
    Tencent / Hong Kong University of Science and Technology
    Published on: 2025-12-18 1 author
  • Kling-Omni Technical Report
    Published on: 2025-12-18 1 author
  • EasyV2V: A High-quality Instruction-based Video Editing Framework
    Published on: 2025-12-18 1 author
  • FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction
    Microsoft / Fudan University
    Published on: 2025-12-18 1 author
  • GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
    Published on: 2025-12-18 1 author
  • Addendum to GPT-5.2 System Card: GPT-5.2-Codex
    Published on: 2025-12-18 1 author
  • Monitoring Monitorability
    Published on: 2025-12-18 1 author
  • Spatia: Video Generation with Updatable Spatial Memory
    Microsoft / The University of Sydney
    Published on: 2025-12-17 1 author
  • Prompt Repetition Improves Non-Reasoning LLMs
    Published on: 2025-12-17 1 author
  • Towards a Science of Scaling Agent Systems
    Google / MIT
    Published on: 2025-12-17 1 author
  • Fast and Accurate Causal Parallel Decoding using Jacobi Forcing
    Snowflake / UC San Diego
    Published on: 2025-12-16 1 author
  • TalkVerse: Democratizing Minute-Long Audio-Driven Video Generation
    Snap / The Chinese University of Hong Kong
    Published on: 2025-12-16 1 author
  • GLM-TTS Technical Report
    Z.ai / Tsinghua University
    Published on: 2025-12-16 1 author
  • Native and Compact Structured Latents for 3D Generation
    Microsoft / Tsinghua University
    Published on: 2025-12-16 1 author