Sun Feb 2, 2025 8:25pm PST
Ask HN: Is there a primer on RL applied to LLMs?
I'd like to read more about how exactly the new thinking models are trained, and whether some older RL techniques are now being applied again to LLMs.