non-dedicated hardware

Scaling Large ML Models to Small Devices with Atila Orhon

ML models are growing to many billions of parameters, which makes it challenging to run inference on non-dedicated hardware such as phones and laptops. Argmax is a startup