What AI services are you selfhosting? Or, have tested and passed on

kiol@lemmy.world · 5 days ago

What AI services are you selfhosting? Or, have tested and passed on

L_Acacia@lemmy.ml · 16 hours ago

Well they are fully closed source except for the open source project they are a wrapper on. The open source part is llama.cpp

ikidd@lemmy.world · 16 hours ago

Fair enough, but it’s damn handy and simple to use. And I don’t know how to do speculative decoding with ollama, which massively speeds up the models for me.

L_Acacia@lemmy.ml · 15 hours ago

Their software is pretty nice. That’s what I’d recommand to someone who doesn’t want to tinker. It’s just a shame they don’t want to open source their software and we have to reinvent the wheel 10 times. If you are willing to tinker a bit koboldcpp + openewebui/librechat is a pretty nice combo.

ikidd@lemmy.world · 12 hours ago

That koboldcpp is pretty interesting. Looks like I can load a draft model for spec decode as well as a pile of other things.

What local models have you been using for coding? I’ve been disappointed with things like deepseek-coder and the qwen-coder, it’s not even a patch on Claude, but that damn cost for anthropic has been killing me.

What AI services are you selfhosting? Or, have tested and passed on

What AI services are you selfhosting? Or, have tested and passed on

Testing Indiedroid Nova w/ 16gb ram - Learning Together