23 out Tensormesh raises $4.5M to squeeze more inference out of AI server loads
Posted at 13:00h
in Tech-En
Tensormesh uses an expanded form of KV Caching to make inference loads as much as ten times more efficient.
Tensormesh uses an expanded form of KV Caching to make inference loads as much as ten times more efficient.