hckrnws
lieret
Thurs Jul 24, 2025 11:47pm PST
Karma: 5
submitted
Wed Aug 20, 2025 3:09pm PST
Show HN: Randomly switching between LMs at every step boosts SWE-bench score
@lieret
1
1
5
Fri Aug 8, 2025 4:29pm PST
GPT-5 on SWE-bench: Cost and performance deep-dive
@lieret
1
3
4
Thurs Jul 31, 2025 2:30pm PST
Show HN: New SWE-bench leaderboard compares LMs without fancy agent scaffolds
@lieret
2
Fri Jul 25, 2025 1:27pm PST
Show HN: Mini-swe-agent achieves 65% on SWE-bench in 100 lines of python
@lieret
2
4
7