hckrnws
back
2 weeks ago
Sun Feb 2, 2025 8:25pm PST
Ask HN: Is there a primer on RL applied to LLMs?
@eamag
Want to read more on how exactly new thinking models are trained and if some old RL techniques are now applied again to LLMs
comments:
add comment
loading comments...