Papers

A Systematic Survey of Automatic Prompt Optimization Techniques

Amazon

Published on: 2025-04-02 1 author
Scaling Language-Free Visual Representation Learning

Meta Platforms / New York University, Princeton University

Published on: 2025-04-01 11 authors
XAMBA: SSMs on Edge NPUs

Intel / Purdue University

Published on: 2025-03-31 1 author
On the Biology of a Large Language Model

Anthropic

Published on: 2025-03-27 1 author
Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration

Alibaba

Published on: 2025-03-26 1 author
Qwen2.5-Omni Technical Report

Alibaba

Published on: 2025-03-26 1 author
Neuromorphic Principles for Efficient Large Language Models on Intel Loihi 2

Intel / University of California

Published on: 2025-03-25 1 author
ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Snowflake

Published on: 2025-03-25 1 author
Gemma 3 Technical Report

Google

Published on: 2025-03-25 1 author
Debunking the CUDA Myth Towards GPU-based AI Systems

Intel / Korea Advanced Institute of Science & Technology

Published on: 2025-03-22
The Amazon Nova Family of Models: Technical Report and Model Card

Amazon

Published on: 2025-03-17 1 author
StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error

Z.ai / Tsinghua University

Published on: 2025-03-13 1 author
Long Context Tuning for Video Generation

ByteDance / The Chinese University of Hong Kong

Published on: 2025-03-13 1 author
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Cohere

Published on: 2025-03-12 1 author
HunyuanVideo: A Systematic Framework For Large Video Generative Models

Tencent

Published on: 2025-03-11 1 author
Learning to Search Effective Example Sequences for In-Context Learning

Intuit

Published on: 2025-03-11 1 author
Gemini Embedding: Generalizable Embeddings from Gemini

Google

Published on: 2025-03-10 1 author
Aya Vision: Expanding the worlds AI can see

Cohere

Published on: 2025-03-04 1 author
OWLViz: An Open-World Benchmark for Visual Question Answering

Adobe

Published on: 2025-03-04 1 author
Towards Statistical Factuality Guarantee for Large Vision-Language Models

Intuit / Vanderbilt University

Published on: 2025-02-27 1 author
Evaluating Nova 2.0 Lite model under Amazon’s Frontier Model Safety Framework

Amazon

Published on: 2025-02-27 1 author
AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools

Microsoft

Published on: 2025-02-26 1 author
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Amazon

Published on: 2025-02-25 1 author
Muon is Scalable for LLM Training

Moonshot AI

Published on: 2025-02-24 1 author
MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Meta Platforms / University of California

Published on: 2025-02-20 1 author
AToken: A Unified Tokenizer for Vision

Apple

Published on: 2025-02-19 1 author
Qwen2.5-VL Technical Report

Alibaba

Published on: 2025-02-19 1 author
MoBA: Mixture of Block Attention for Long-Context LLMs

Moonshot AI / Tsinghua University

Published on: 2025-02-18 1 author
GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units

Intel

Published on: 2025-02-13 1 author
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment

ByteDance

Published on: 2025-02-04 1 author
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Anthropic / Safeguards Research Team

Published on: 2025-01-31 1 author
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

DeepSeek

Published on: 2025-01-29 1 author
EmbeddingGemma: Powerful and Lightweight Text Representations

Google

Published on: 2025-01-24 1 author
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Amazon

Published on: 2025-01-19 1 author
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Z.ai / Tsinghua University

Published on: 2025-01-17 1 author
MiniMax-01: Scaling Foundation Models with Lightning Attention

MiniMax

Published on: 2025-01-14 1 author
PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Z.ai / Central South University, Tsinghua University

Published on: 2025-01-13 1 author
Retrieval-Augmented Generation with Graphs (GraphRAG)

Amazon / Michigan State University

Published on: 2025-01-08 1 author
Cosmos World Foundation Model Platform for Physical AI

NVIDIA

Published on: 2025-01-07 1 author
Titans: Learning to Memorize at Test Time

Google

Published on: 2024-12-31 1 author
Generative Video Propagation

Adobe / The Chinese University of Hong Kong

Published on: 2024-12-27 1 author
In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Snowflake

Published on: 2024-12-23 1 author
Qwen2.5 Technical Report

Alibaba

Published on: 2024-12-19 1 author
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Workday / Queen’s University

Published on: 2024-12-18 1 author
Alignment faking in large language models

Anthropic / New York University

Published on: 2024-12-18 1 author
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

DeepSeek

Published on: 2024-12-13 1 author
VDB-GPDF: Online Gaussian Process Distance Field with VDB Structure

Google

Published on: 2024-12-12 1 author
pfl-research: simulation framework for accelerating research in Private Federated Learning

Apple

Published on: 2024-12-10 1 author
