5 months ago
Fri Feb 28, 2025 3:38pm PST
Ask HN: Are there any objective measurements for AI model coding performance?
Not sure if this is even possible, but is there any site or benchmark for testing which AI model is best for the task of coding?

Like Claude 3.5 vs GPT 4o vs Gemini 2 etc

What exists beyond our opinions to more objectively measure the quality of code output on these models?

comments:
add comment
loading comments...