Papers
- SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering (Princeton University)
- PaLM 2 Technical Report
- Phi-2: A Small Language Model with Reasoning Capability
- Efficient Memory Management for Large Language Model Serving with PagedAttention (Stanford University, University of California, Berkeley)
- Textbooks Are All You Need
- Graph of Thoughts: Solving Elaborate Problems with Large Language Models (Eidgenössische Technische Hochschule Zürich)
- 3D Gaussian Splatting for Real-Time Radiance Field Rendering (INRIA, Université Côte d’Azur)
- MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework
- RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
- Differentially Private Heavy Hitter Detection using Federated Analytics
- Llama 2: Open Foundation and Fine-Tuned Chat Models
- BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
- Neuralangelo: High-Fidelity Neural Surface Reconstruction
- Orca: Progressive Learning from Complex Explanation Traces of GPT-4
- AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration (Massachusetts Institute of Technology)
- TransAct: Transformer-based Realtime User Action Model for Recommendation at Pinterest
- IMAGEBIND: One Embedding Space To Bind Them All
- ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models
- Voyager: An Open-Ended Embodied Agent with Large Language Models
- QLoRA: Efficient Finetuning of Quantized LLMs (University of Washington)
- Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
- Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
- Segment Anything
- Sequential Attention: Making AI Models Leaner and Faster Without Sacrificing Accuracy
- DAMO-YOLO: A Report on Real-Time Object Detection Design
- VECO 2.0: Cross-lingual Language Model Pre-training with Multi-granularity Contrastive Learning
- Sparks of Artificial General Intelligence: Early experiments with GPT-4
- Generative Agents: Interactive Simulacra of Human Behavior
- Regression Transformer enables concurrent sequence regression and generation for molecular language modelling
- CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society
- Reflexion: Language Agents with Verbal Reinforcement Learning
- Application-Agnostic Language Modeling for On-Device ASR
- Language Is Not All You Need: Aligning Perception with Language Models
- LLaMA: Open and Efficient Foundation Language Models
- Adding Conditional Control to Text-to-Image Diffusion Models (Stanford University)
- Toolformer: Language Models Can Teach Themselves to Use Tools
- Flow Matching for Generative Modeling
- Why we built an AI supercomputer in the cloud
- mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
- Decision-Making Context Interaction Network for Click-Through Rate Prediction
- Large language models generate functional protein sequences across diverse families
- Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
- Constitutional AI: Harmlessness from AI Feedback
- Robust Speech Recognition via Large-Scale Weak Supervision
- Stable Diffusion with Core ML on Apple Silicon
- Fast Inference from Transformers via Speculative Decoding (Google Research)
- Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
- GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers (Eidgenössische Technische Hochschule Zürich, Institute of Science and Technology Austria)
- wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
