Scaling Large ML Models to Small Devices with Atila Orhon
The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops. Argmax is a startup