Cerebras gives waferscale chips inferencing twist, claims 1,800-token-per-sec generation rates
Hot Chips: Faster than you can read? More like blink and you'll miss the hallucination
Systems | 27 Aug 2024