2 years ago
Wed Jun 14, 2023 9:35pm PST
Ask HN: What are you using for LLM response testing and benchmarking?
What are you using to test your LLM responses, benchmark them, maybe compare different versions?

I've seen a few YC startups focusing on this but I haven't decided yet if we should build this internally or use an external tool.

comments:
add comment
loading comments...