Papers

Commit0: Library Generation from Scratch

Cohere / Cornell University

Published on: 2024-12-02 1 author
ARTIST: Improving the Generation of Text-rich Images with Disentangled Diffusion Models and Large Language Models

Adobe / Duke University

Published on: 2024-12-02
The Llama 3 Herd of Models

Meta Platforms

Published on: 2024-11-23 1 author
Controlling Language and Diffusion Models by Transporting Activations

Apple

Published on: 2024-11-22 1 author
The Rise and Potential of Large Language Model Based Agents: A Survey

MIT

Published on: 2024-11-11 5 authors
Evaluating Cultural and Social Awareness of LLM Web Agents

Salesforce

Published on: 2024-10-30 1 author
SF-V: Single Forward Video Generation Model

Snap / Rutgers University

Published on: 2024-10-24 1 author
The Llama 3 Herd of Models

Meta Platforms

Published on: 2024-10-23 1 author
Improving Pinterest Search Relevance Using Large Language Models

Pinterest

Published on: 2024-10-22 1 author
NVLM: Open Frontier-Class Multimodal LLMs

NVIDIA

Published on: 2024-10-22 1 author
HyQE: Ranking Contexts with Hypothetical Query Embeddings

Intuit / Boston University

Published on: 2024-10-20 1 author
RedPajama: an Open Dataset for Training Large Language Models

EleutherAI / The Ohio State University

Published on: 2024-10-19 1 author
RedPajama: an Open Dataset for Training Large Language Models

Together AI / Stanford University

Published on: 2024-10-19 1 author
Understanding Chain-of-Thought in LLMs through Information Theory

ByteDance

Published on: 2024-10-18 1 author
Survival of the Safest: Towards Secure Prompt Optimization through Interleaved Multi-Objective Evolution

Intuit

Published on: 2024-10-12 1 author
Nemotron-4-340B-Instruct

NVIDIA

Published on: 2024-10-12 1 author
Pixtral 12B

Mistral AI

Published on: 2024-10-10 1 author
Data-Driven Discovery of Conservation Laws from Trajectories via Neural Deflation

Intuit / University of Massachusetts Amherst

Published on: 2024-10-07 1 author
Chronos: Learning the Language of Time Series

Amazon / AWS AI Labs

Published on: 2024-10-04 1 author
Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution

Alibaba

Published on: 2024-10-03 1 author
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models

Apple

Published on: 2024-10-01 1 author
HM3: Heterogeneous Multi-Class Model Merging

DataRobot

Published on: 2024-09-27 1 author
Tarsier: Recipes for Training and Evaluating Large Video Description Models

ByteDance

Published on: 2024-09-24 1 author
OpenVLA: An Open-Source Vision-Language-Action Model

Google / UC Berkeley

Published on: 2024-09-05 1 author
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Tencent

Published on: 2024-09-03 1 author
Arctic-TILT. Business Document Understanding at Sub-Billion Scale

Snowflake / Adam Mickiewicz University

Published on: 2024-08-08 1 author
General-Purpose User Modeling with Behavioral Logs

Snap / Utrecht University

Published on: 2024-07-25 1 author
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Apple

Published on: 2024-07-19 1 author
Qwen2-Audio Technical Report

Alibaba

Published on: 2024-07-15 1 author
Qwen2 Technical Report

Alibaba

Published on: 2024-07-15 1 author
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Anthropic / University of Oxford

Published on: 2024-06-29 1 author
Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments

Meituan

Published on: 2024-06-20 1 author
Claude 3.5 Sonnet Model Card Addendum

Anthropic

Published on: 2024-06-20 1 author
Abliteration

Hugging Face

Published on: 2024-06-13 1 author
Multi-Agent Software Development through Cross-Team Collaboration

Published on: 2024-06-13 Venue: Findings of ACL 2025 11 authors
Efficient Large Language Model Inference with Limited Memory

Apple

Published on: 2024-06-12 1 author
AgentBoard: An Evaluation Platform for LLM-Based Autonomous Agents

Published on: 2024-06-11 7 authors
Creative Text-to-Audio Generation via Synthesizer Programming

Adobe

Published on: 2024-06-01 1 author
Retrieval Augmented Generation for Domain-specific Question Answering

Adobe

Published on: 2024-05-29 1 author
Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

Anthropic

Published on: 2024-05-21 1 author
Multimodal Chain-of-Thought Reasoning in Language Models

Amazon / Shanghai Jiao Tong University

Published on: 2024-05-20 1 author
Magic-Me: Identity-Specific Video Customized Diffusion

ByteDance

Published on: 2024-05-20 1 author
Generative Image Dynamics

Google

Published on: 2024-05-14 1 author
AI at Work Is Here. Now Comes the Hard Part

Microsoft

Published on: 2024-05-08 Venue: Microsoft Work Trend Index
Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages

Apple

Published on: 2024-04-26 1 author
OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

Pinterest

Published on: 2024-04-25 1 author
Distinguishing homolytic versus heterolytic bond dissociation of phenyl sulfonium cations with localized active space methods

Adobe / University of Chicago

Published on: 2024-04-22 1 author
More, better or different? Trade-offs between group size and competence development in jury theorems

Institute for Futures Studies, Umeå University

Published on: 2024-04-18 2 authors
Mixtral 8x22B (Cheaper, Better, Faster, Stronger)

Mistral AI

Published on: 2024-04-17 1 author
Gemma: Open Models Based on Gemini Research and Technology

Google / Google DeepMind

Published on: 2024-04-16 1 author
