1-bit Bonsai 4B
Overview
1-bit Bonsai 4B is PrismML’s mid-size, ultra-efficient model designed for fast local inference. It uses just 0.57 GB of memory, and PrismML reports throughput of 132 tokens per second on an M4 Pro. PrismML positions it as a balance of speed, accuracy, and energy efficiency for workloads that need both responsiveness and capable language performance.
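To put the reported figures in context, here is a rough back-of-the-envelope sketch. It assumes that "4B" means about 4 billion parameters and that "GB" means decimal gigabytes (1e9 bytes); neither is stated in the listing, and the helper names are hypothetical. Under those assumptions, pure 1-bit weights would take about 0.5 GB, so the reported 0.57 GB works out to roughly 1.14 bits per parameter. The listing does not say what accounts for the difference.

```python
# Back-of-the-envelope estimates only. The parameter count and the
# decimal-GB convention are assumptions, not figures from PrismML.

ASSUMED_PARAMS = 4e9   # assumption: "4B" read as 4 billion parameters
REPORTED_GB = 0.57     # memory figure from the listing
REPORTED_TPS = 132     # tokens per second reported by PrismML on an M4 Pro


def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


def effective_bits_per_param(memory_gb: float, num_params: float) -> float:
    """Bits of storage per parameter implied by a total memory figure."""
    return memory_gb * 1e9 * 8 / num_params


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Time to generate num_tokens at a steady decoding rate."""
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    ideal = weight_memory_gb(ASSUMED_PARAMS, 1)    # pure 1-bit weights
    fp16 = weight_memory_gb(ASSUMED_PARAMS, 16)    # 16-bit baseline for comparison
    bits = effective_bits_per_param(REPORTED_GB, ASSUMED_PARAMS)
    secs = generation_seconds(500, REPORTED_TPS)   # a 500-token reply

    print(f"ideal 1-bit weights: {ideal:.2f} GB")
    print(f"16-bit baseline:     {fp16:.2f} GB")
    print(f"implied bits/param:  {bits:.2f}")
    print(f"500 tokens at {REPORTED_TPS} tok/s: {secs:.1f} s")
```

These numbers cover weight storage and steady-state decoding only. Actual memory use and latency also depend on context length, runtime overhead, and prompt processing, none of which the listing specifies.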
