🪴 alan's notes

RLHF

Aug 31, 2024

Reinforcement Learning from Human Feedback
