H
Hacker News
New
Best
Show
Jobs
RLHF from Scratch
(github.com)
61 points
by onurkanbkrc
Feb 10, 2026
2 Comments
alansaber
•
Feb 10, 2026
Looks good. I am a big advocate for these hands on demos as being the best way for beginners to learn ML
fauria
•
Feb 10, 2026
RLHF: Reinforcement learning from human feedback -
https://en.wikipedia.org/wiki/Reinforcement_learning_from_hu...
2 Comments