What's the deal with LlamaCPP and caching?

j4k3@lemmy.world · 1 year ago

What's the deal with LlamaCPP and caching?

webghost0101@sopuli.xyz · 1 year ago

Thats a juicy amount of memmory for just a laptop.

Interesting, the fosai site made it appear like 70B models are near impossible to run requiring 40B gb of VRam but i suppose it can work with less But slower.

The vram of your gpu seems to be the biggest factor. A reason why while my current gpu is dying i cant get myself to spend on a mere 12 gb 4070ti