Papers
-
Commit0: Library Generation from Scratch
-
ARTIST: Improving the Generation of Text-rich Images with Disentangled Diffusion Models and Large Language Models
-
The Llama 3 Herd of Models
-
Controlling Language and Diffusion Models by Transporting Activations
-
The Rise and Potential of Large Language Model Based Agents: A SurveyMIT
-
Evaluating Cultural and Social Awareness of LLM Web Agents
-
SF-V: Single Forward Video Generation Model
-
The Llama 3 Herd of Model
-
Improving Pinterest Search Relevance Using Large Language Models
-
NVLM: Open Frontier-Class Multimodal LLMs
-
HyQE: Ranking Contexts with Hypothetical Query Embeddings
-
RedPajama: an Open Dataset for Training Large Language Models
-
RedPajama: an Open Dataset for Training Large Language Models
-
Understanding Chain-of-Thought in LLMs through Information Theory
-
Survival of the Safest: Towards Secure Prompt Optimization through Interleaved Multi-Objective Evolution
-
Nemotron-4-340B-Instruct
-
Pixtral 12B
-
Data-Driven Discovery of Conservation Laws from Trajectories via Neural Deflation
-
Chronos: Learning the Language of Time Series
-
Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution
-
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models
-
HM3: Heterogeneous Multi-Class Model Merging
-
arsier: Recipes for Training and Evaluating Large Video Description Models
-
OpenVLA: An Open-Source Vision-Language-Action Model
-
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
-
Arctic-TILT. Business Document Understanding at Sub-Billion Scale
-
General-Purpose User Modeling with Behavioral Logs
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
-
Qwen2-Audio Technical Report
-
Qwen2 Technical Report
-
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models
-
Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments
-
Claude 3.5 Sonnet Model Card Addendum
-
Abliteration
-
Multi-Agent Software Development through Cross-Team Collaboration
-
Efficient Large Language Model Inference with Limited Memory
-
AgentBoard: An Evaluation Platform for LLM-Based Autonomous Agents
-
Creative Text-to-Audio Generation via Synthesizer Programming
-
Retrieval Augmented Generation for Domain-specific Question Answering
-
Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet
-
Multimodal Chain-of-Thought Reasoning in Language Models
-
Magic-Me: Identity-Specific Video Customized Diffusion
-
Generative Image Dynamics
-
AI at Work Is Here. Now Comes the Hard Part
-
Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages
-
OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search
-
Distinguishing homolytic versus heterolytic bond dissociation of phenyl sulfonium cations with localized active space methods
-
More, better or different? Trade-offs between group size and competence development in jury theoremsInstitute for Futures Studies, Umeå University
-
Mixtral 8x22B (Cheaper, Better, Faster, Stronger)
-
Gemma: Open Models Based on Gemini Research and Technology
