Return to Article Details Reward-Guided Fine-Tuning of Language Models with Social Feedback Download Download PDF