Deep learning models have achieved striking performance across vision, language and time-series tasks, yet their growing depth and parameter counts impose substantial computational and memory demands.
Vienna startup Ora Computing raised €3.5M and proved a 70-billion-parameter large language model can be compressed for under ...