hckrnws
back
3 months ago
Mon Nov 3, 2025 12:29pm PST
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
@Danau5tin
read article
comments:
add comment
loading comments...