Tech startup proposes a novel way to tackle massive LLMs using the fastest memory available to mankind
GPU-like PCIe card offers 10PFLOPs FP4 compute power and 2GB of SRAM SRAM is usually used in small amounts as cache in processors (L1 to L3) It also uses LPDDR5…