Papers

Filter by company

Cosmos World Foundation Model Platform for Physical AI

NVIDIA

Published on: 2025-01-07 1 author
Titans: Learning to Memorize at Test Time

Google

Published on: 2024-12-31 1 author
Generative Video Propagation

Adobe / The Chinese University of Hong Kong

Published on: 2024-12-27 1 author
In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Snowflake

Published on: 2024-12-23 1 author
Qwen2.5 Technical Report

Alibaba

Published on: 2024-12-19 1 author
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Workday / Queen’s University

Published on: 2024-12-18 1 author
Alignment faking in large language models

Anthropic / New York University

Published on: 2024-12-18 1 author
How Often are Fingerprints Repeated in the Population? Expanding on Evidence from AI With the Birthday Paradox

Published on: 2024-12-17 2 authors
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

DeepSeek

Published on: 2024-12-13 1 author
VDB-GPDF: Online Gaussian Process Distance Field with VDB Structure

Google

Published on: 2024-12-12 1 author
pfl-research: simulation framework for accelerating research in Private Federated Learning

Apple

Published on: 2024-12-10 1 author
Frontier AI systems have surpassed the self-replicating red line

Published on: 2024-12-09 4 authors
InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention

Snap / University of California

Published on: 2024-12-09 1 author
Best-of-N Jailbreaking

Published on: 2024-12-04 10 authors
Creating realistic 3D shapes using generative AI

Massachusetts Institute of Technology

Published on: 2024-12-04 1 author
Commit0: Library Generation from Scratch

Cohere / Cornell University

Published on: 2024-12-02 1 author
ARTIST: Improving the Generation of Text-rich Images with Disentangled Diffusion Models and Large Language Models

Adobe / Duke University

Published on: 2024-12-02
Controlling Language and Diffusion Models by Transporting Activations

Apple

Published on: 2024-11-22 1 author
The Rise and Potential of Large Language Model Based Agents: A Survey

MIT

Published on: 2024-11-11 5 authors
Evaluating Cultural and Social Awareness of LLM Web Agents

Salesforce

Published on: 2024-10-30 1 author
SF-V: Single Forward Video Generation Model

Snap / Rutgers University

Published on: 2024-10-24 1 author
The Llama 3 Herd of Model

Meta Platforms

Published on: 2024-10-23 1 author
Improving Pinterest Search Relevance Using Large Language Models

Pinterest

Published on: 2024-10-22 1 author
NVLM: Open Frontier-Class Multimodal LLMs

NVIDIA

Published on: 2024-10-22 1 author
HyQE: Ranking Contexts with Hypothetical Query Embeddings

Intuit / Boston University

Published on: 2024-10-20 1 author
RedPajama: an Open Dataset for Training Large Language Models

Together AI, EleutherAI / Stanford University, The Ohio State University

Published on: 2024-10-19 1 author
Understanding Chain-of-Thought in LLMs through Information Theory

ByteDance

Published on: 2024-10-18 1 author
Survival of the Safest: Towards Secure Prompt Optimization through Interleaved Multi-Objective Evolution

Intuit

Published on: 2024-10-12 1 author
Nemotron-4-340B-Instruct

NVIDIA

Published on: 2024-10-12 1 author
Pixtral 12B

Mistral AI

Published on: 2024-10-10 1 author
Data-Driven Discovery of Conservation Laws from Trajectories via Neural Deflation

Intuit / University of Massachusetts Amherst

Published on: 2024-10-07 1 author
Chronos: Learning the Language of Time Series

Amazon / AWS AI Labs

Published on: 2024-10-04 1 author
Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution

Alibaba

Published on: 2024-10-03 1 author
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models

Apple

Published on: 2024-10-01 1 author
HM3: Heterogeneous Multi-Class Model Merging

DataRobot

Published on: 2024-09-27 1 author
arsier: Recipes for Training and Evaluating Large Video Description Models

ByteDance

Published on: 2024-09-24 1 author
OpenVLA: An Open-Source Vision-Language-Action Model

Google / UC Berkeley

Published on: 2024-09-05 1 author
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Tencent

Published on: 2024-09-03 1 author
Arctic-TILT. Business Document Understanding at Sub-Billion Scale

Snowflake / Adam Mickiewicz University

Published on: 2024-08-08 1 author
General-Purpose User Modeling with Behavioral Logs

Snap / Utrecht University

Published on: 2024-07-25 1 author
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Apple

Published on: 2024-07-19 1 author
Qwen2-Audio Technical Report

Alibaba

Published on: 2024-07-15 1 author
Qwen2 Technical Report

Alibaba

Published on: 2024-07-15 1 author
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Anthropic / University of Oxford

Published on: 2024-06-29 1 author
Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments

Meituan

Published on: 2024-06-20 1 author
Claude 3.5 Sonnet Model Card Addendum

Anthropic

Published on: 2024-06-20 1 author
Abliteration

Hugging Face

Published on: 2024-06-13 1 author
Multi-Agent Software Development through Cross-Team Collaboration

Published on: 2024-06-13 Venue: Findings of ACL 2025 11 authors
Efficient Large Language Model Inference with Limited Memory

Apple

Published on: 2024-06-12 1 author
AgentBoard: An Evaluation Platform for LLM-Based Autonomous Agents

Published on: 2024-06-11 7 authors

Prev 36 37 38 39 40 41 42 43 44 45 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: