hckrnws
back
1 year ago
Tues Nov 19, 2024 12:15am PST
Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
@benchmarkist
read article
comments:
add comment
loading comments...