Robust Post-Training for Generative Recommenders: Why Exponential Reward-Weighted SFT Outperforms RLHF

Meta Platforms, Netflix / Stanford University

Published on: 2026-03-10 (5 authors)
