Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity

submitted by

www.tomshardware.com/pc-components/gpus/edward-…

112
288

Back to main discussion

Parent comment

Ty. I'll try ollama with the Q-4-M quantization. I wouldn't expect to see a difference between ollama and SGlang.


Insert image