hckrnws
back
2 months ago
Mon Mar 10, 2025 5:06pm PST
Detecting misbehavior in frontier reasoning models
@meetpateltech
read article
comments:
add comment
loading comments...