
Papers

  • FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
    Massachusetts Institute of Technology, Stanford University
    Published on: 2022-05-27 5 authors
  • mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections
    Published on: 2022-05-25 1 author
  • ItemSage: Learning Product Embeddings for Shopping Recommendations at Pinterest
    Published on: 2022-05-24 1 author
  • Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
    Google / Google Research
    Published on: 2022-05-23 1 author
  • MultiBiSage: A Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest
    Pinterest / The Ohio State University
    Published on: 2022-05-21 1 author
  • M6-Rec: Generative Pretrained Language Models are Open-Ended Recommender Systems
    Published on: 2022-05-19 1 author
  • Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
    Published on: 2022-05-19 6 authors
  • PinnerFormer: Sequence Modeling for User Representation at Pinterest
    Published on: 2022-05-09 1 author
  • Hierarchical Text-Conditional Image Generation with CLIP Latents
    Published on: 2022-04-13 1 author
  • Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
    Published on: 2022-04-12 1 author
  • Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
    Published on: 2022-04-04 1 author
  • Training Compute-Optimal Large Language Models
    Published on: 2022-03-29 1 author
  • CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis
    Published on: 2022-03-25 1 author
  • Learning When to Translate for Streaming Speech
    ByteDance / University of California, Berkeley
    Published on: 2022-03-22 1 author
  • Training a Tokenizer for Free with Private Federated Learning
    Apple / Cornell University
    Published on: 2022-03-15 1 author
  • Training Language Models to Follow Instructions with Human Feedback
    Published on: 2022-03-04 1 author
  • FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators
    NVIDIA / University of Michigan
    Published on: 2022-02-22 1 author
  • Competition-Level Code Generation with AlphaCode
    Google / Google DeepMind
    Published on: 2022-02-08 1 author
  • Low-Overhead Fault-Tolerant Quantum Error Correction with the Surface-GKP Code
    Published on: 2022-01-28 1 author
  • Instant Neural Graphics Primitives with a Multiresolution Hash Encoding
    Published on: 2022-01-16 1 author
  • ML-Decoder: Scalable and Versatile Classification Head
    Published on: 2021-12-31 1 author
  • A Mathematical Framework for Transformer Circuits
    Published on: 2021-12-22 1 author
  • Training Verifiers to Solve Math Word Problems
    Published on: 2021-12-18 1 author
  • Improving language models by retrieving from trillions of tokens
    Published on: 2021-12-08 1 author
  • On-device Panoptic Segmentation for Camera Using Transformers
    Published on: 2021-10-19 1 author
  • Merlion: A Machine Learning Library for Time Series
    Published on: 2021-09-20 1 author
  • Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Microsoft / Microsoft Research Asia
    Published on: 2021-08-17 1 author
  • AGENT: A Benchmark for Core Psychological Reasoning
    IBM / Harvard University
    Published on: 2021-07-18 1 author
  • Highly accurate protein structure prediction with AlphaFold
    Published on: 2021-07-15 1 author
  • LoRA: Low-Rank Adaptation of Large Language Models
    Published on: 2021-06-17 Venue: ICLR 2022 7 authors
  • An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
    Published on: 2021-06-03 1 author
  • M6: A Chinese Multimodal Pretrainer
    Published on: 2021-05-29 1 author
  • Holographic dynamics simulations with a trapped ion quantum computer
    Honeywell International / University of Illinois Urbana-Champaign
    Published on: 2021-05-19 1 author
  • An autonomous debating system (Project Debater)
    IBM
    Published on: 2021-03-17 1 author
  • Learning Transferable Visual Models From Natural Language Supervision
    Published on: 2021-02-26 Venue: ICML 2021 12 authors
  • Federated Evaluation and Tuning for On-Device Personalization: System Design & Applications
    Published on: 2021-02-18 1 author
  • Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
    Published on: 2020-09-18 1 author
  • PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest
    Published on: 2020-07-07 1 author
  • Denoising Diffusion Probabilistic Models
    Google / University of California, Berkeley
    Published on: 2020-06-19 Venue: NeurIPS 2020 3 authors
  • ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
    Published on: 2020-05-13 1 author
  • NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
    Google / Google Research, UC Berkeley, UC San Diego
    Published on: 2020-03-30 6 authors
  • Scaling Laws for Neural Language Models
    OpenAI / Johns Hopkins University
    Published on: 2020-01-23 1 author
  • Dota 2 with Large Scale Deep Reinforcement Learning
    Published on: 2019-12-13 1 author
  • PyTorch: An Imperative Style, High-Performance Deep Learning Library
    Meta Platforms / University of Warsaw
    Published on: 2019-12-03 1 author
  • Overton: A Data System for Monitoring and Improving Machine-Learned Products
    Published on: 2019-09-07 1 author
  • StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding
    Published on: 2019-08-13 1 author
  • RoBERTa: A Robustly Optimized BERT Pretraining Approach
    Meta Platforms / Facebook AI Research
    Published on: 2019-07-26 1 author
  • D2-Net: A Trainable CNN for Joint Description and Detection of Local Features
    Microsoft / ETH Zurich
    Published on: 2019-05-09 1 author