I wonder if a 1B model could be close to free to host. That's an eventuality, but I wonder how long it'll take for that to be real.
giantrobot · 7m ago
A 1B model at 2-bit quantization is about the size of the average web page anymore. With some WebGPU support you could run such a model in a browser.
I'm half joking. Web pages are ludicrously fat these days.
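For a rough sense of scale, here's a back-of-the-envelope sketch. It assumes weight storage is simply parameter count times bits per weight; real quantized formats add scales and metadata on top, and `quantized_size_mb` is just a name made up for illustration.

```python
def quantized_size_mb(n_params: int, bits_per_weight: float) -> float:
    """Approximate weight storage in megabytes (10^6 bytes).

    Assumption: size = parameters * bits / 8, ignoring quantization
    scales, metadata, and any layers kept at higher precision.
    """
    return n_params * bits_per_weight / 8 / 1e6


if __name__ == "__main__":
    one_billion = 1_000_000_000
    # Compare a few common precisions for a 1B-parameter model.
    for bits in (16, 8, 4, 2):
        print(f"{bits:>2}-bit: {quantized_size_mb(one_billion, bits):,.0f} MB")
```

Under that assumption a 1B model at 2 bits comes to roughly 250 MB of weights, so the web page comparison really is only half a joke.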
WhatsName · 1h ago
I find it absolutely staggering that in this day and age people can still own two-letter domains. Based on rarity, those should be worth millions, I guess?
nadermx · 2m ago
I mean, ch.at is an incredible domain hack, but I'm not sure it's worth millions. If it were ch.com, it could get mid six figures and up. Either way, an absolutely amazing domain.
Perhaps via an RNN like in https://huggingface.co/spaces/BlinkDL/RWKV-Gradio-2
Or even just leverage Hugging Face Gradio Spaces? (Most are Gradio apps that expose APIs: https://www.gradio.app/guides/view-api-page)