Looks like it now has Docling Content Extraction Support for RAG. Has anyone used Docling much?
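For anyone who hasn't tried it, Docling's basic flow is pretty simple. Here's a minimal sketch based on its published quickstart (the filename is just a placeholder, and the exact API may have shifted since):

```python
# Minimal sketch of Docling usage, based on its quickstart.
# Assumes the `docling` package is installed (pip install docling).
from docling.document_converter import DocumentConverter

converter = DocumentConverter()
# Convert a document (local path or URL) into Docling's document model.
# "report.pdf" is a hypothetical filename for illustration.
result = converter.convert("report.pdf")
# Export to Markdown, a common intermediate format for RAG chunking
print(result.document.export_to_markdown())
```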
Oh, and I typically get 16-20 tok/s running a 32b model on Ollama via Open WebUI. Also, I've run into issues with 4-bit quantization of the K/V cache on some models, so just FYI.
It really depends on how you quantize the model and the K/V cache as well. This is a useful calculator: https://smcleod.net/vram-estimator/ I can comfortably fit most 32b models quantized to 4-bit (usually Q4_K_M or IQ4_XS) on my 3090's 24 GB of VRAM with a reasonable context size. If you're going to need a much larger context window to feed in large documents etc., then you'd need to go smaller on the model size (14b, 27b, etc.), get a multi-GPU setup, or go with something with unified memory and a lot of RAM (like the Mac Minis others are mentioning).
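If you want a rough back-of-the-envelope version of what that calculator is doing, the sketch below estimates weights plus K/V cache. The architecture numbers (layers, KV heads, head dim) are assumptions loosely modeled on a typical 32b model like Qwen2.5-32B, not exact figures for any specific one:

```python
# Rough VRAM estimate for a quantized model plus its K/V cache.
# All architecture numbers below are assumptions for a typical 32b
# model (roughly Qwen2.5-32B-like); swap in real values for yours.

def weights_gb(params_b: float, bits_per_weight: float) -> float:
    """Model weights: parameter count * average bits per weight."""
    return params_b * 1e9 * bits_per_weight / 8 / 1e9

def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                context: int, bytes_per_elem: float) -> float:
    """K/V cache: 2 (K and V) * layers * kv_heads * head_dim * tokens."""
    return 2 * layers * kv_heads * head_dim * context * bytes_per_elem / 1e9

# ~32b model at Q4_K_M (averages roughly 4.85 bits per weight)
w = weights_gb(32, 4.85)                      # ~19.4 GB
# assumed: 64 layers, 8 KV heads (GQA), head dim 128, 8k context
kv_f16 = kv_cache_gb(64, 8, 128, 8192, 2.0)   # fp16 cache, ~2.1 GB
kv_q4 = kv_cache_gb(64, 8, 128, 8192, 0.5)    # 4-bit cache, ~0.5 GB

print(f"weights ~{w:.1f} GB, kv fp16 ~{kv_f16:.1f} GB, kv q4 ~{kv_q4:.1f} GB")
# On a 24 GB 3090 that leaves only modest headroom, which is why
# bigger contexts push you to a smaller model or more memory.
```

For reference, Ollama exposes the K/V cache precision via the OLLAMA_KV_CACHE_TYPE environment variable (f16, q8_0, or q4_0), which requires flash attention to be enabled.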
FrankLaskey@lemmy.ml to
Technology@lemmy.ml • Nvidia teams up with DeepSeek for R1 optimizations on Blackwell, boosting revenue by 25x (English)
31 · 10 months ago
Hopefully these improvements will also become available to other Nvidia GPU architectures like Ada and Ampere in the future.
FrankLaskey@lemmy.ml to
Technology@beehaw.org • Apple Maps May Soon Feature Ads, But Not Everyone's Onboard - gHacks Tech News (English)
1 · 10 months ago
Is it possible to use StreetComplete on iOS?

Interesting project. Is it actually possible to track workouts with your phone or smartwatch without needing proprietary third-party apps like Strava or Garmin Connect, though?