Papers
-
VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM
-
The Art of Scaling Test-Time Compute for Large Language Models
-
ThetaEvolve: Test-time Learning on Open ProblemsMicrosoft / Carnegie Mellon University, University of California, University of Washington, University of Wisconsin-Madison
-
LatBot: Distilling Universal Latent Actions for Vision-Language-Action Models
-
SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling
-
Shifting Work Patterns with Generative AI
-
LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs
-
Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning
-
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
-
Autoregressive Speech Synthesis without Vector Quantization
-
LLMs Get Lost In Multi-Turn Conversation
-
I-Con: A Unifying Framework for Representation Learning
-
AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools
-
AI at Work Is Here. Now Comes the Hard Part
-
Towards Making the Most of ChatGPT for Machine Translation
-
BitNet: Scaling 1-bit Transformers for Large Language Models
-
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
-
Phi-2: A Small Language Model with Reasoning Capability
-
Textbooks Are All You Need
-
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
-
ReWOO: Decoupling Reasoning from Observation for Efficient LLM Reasoning
-
Sparks of Artificial General Intelligence: Early experiments with GPT-4
-
Language Is Not All You Need: Aligning Perception with Language Models
-
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
-
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
-
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
-
LoRA: Low-Rank Adaptation of Large Language Models
-
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
-
D2-Net: A Trainable CNN for Joint Description and Detection of Local FeaturesMicrosoft / Chalmers University of Technology, Czech Institute of Informatics, Robotics, and Cybernetics, Czech Technical University, Département d'Informatique de l'ENS, ETH Zurich, Institute of Science Tokyo, National Institute for Research in Digital Science and Technology
-
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
-
Deep Residual Learning for Image Recognition
MongoDB - Build AI That Scales
