4 days ago
Tues Nov 19, 2024 12:15am PST
Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
read article
comments:
add comment
loading comments...