I use Kobold as a backend for the FluentRead browser plugin, so I can do local language translation.
LocalLLaMA
Welcome to LocalLLaMA! Here we discuss running and developing machine learning models at home. Lets explore cutting edge open source neural network technology together.
Get support from the community! Ask questions, share prompts, discuss benchmarks, get hyped at the latest and greatest model releases! Enjoy talking about our awesome hobby.
As ambassadors of the self-hosting machine learning community, we strive to support each other and share our enthusiasm in a positive constructive way.
Rules:
Rule 1 - No harassment or personal character attacks of community members. I.E no namecalling, no generalizing entire groups of people that make up our community, no baseless personal insults.
Rule 2 - No comparing artificial intelligence/machine learning models to cryptocurrency. I.E no comparing the usefulness of models to that of NFTs, no comparing the resource usage required to train a model is anything close to maintaining a blockchain/ mining for crypto, no implying its just a fad/bubble that will leave people with nothing of value when it burst.
Rule 3 - No comparing artificial intelligence/machine learning to simple text prediction algorithms. I.E statements such as "llms are basically just simple text predictions like what your phone keyboard autocorrect uses, and they're still using the same algorithms since <over 10 years ago>.
Rule 4 - No implying that models are devoid of purpose or potential for enriching peoples lives.
VSCode + roo plugin seems to be all the hotness for coders leveraging 'agenic teams' so I spent a bit playing around with it. Most local models dont do tool calling very well I need to see if devstral works better without giving errors. I hear real professionals use claude API for that kind of stuff.
Im only vaguely familiar with getting computers to send, recieve, and manipulate data with eachother on a local network so got a very basic python script going pointed at kobold cpps openai-compatable API to send prompts and recieve repliesinstead of the default webui app just to learn how it works under the hood.
One of my next projects will be creating a extremely simple web based UI for my ereaders basic web browser to connect to. kobold has something similar with the /noscript subpage but even that is too much for my kobo reader. I intend to somehow leverage a gemtext to html proxy like ducking or newswaffle to make the page rendering output dead simple.
One of these days im going to get a pi zero and attach it to a relay and see if I can get a model to send a signal to turn a light on and off. Those home automation people with the smart houses that integrate llms into things look soo cool
I looked into roo, and was able to get it to interact with ollama but not actually work. From looking into roo, I found Cline which works a lot better. I would like to figure out a way to get it to work with the authenticated proxy api hosted to openwebui so I can access models externally but it is still pretty cool.
If your running into the issue of an app wanting an api key for your local ollamas openai-compatable web interface API and refuses to work without one, I found that any random characters work. If you port forward your host computer you should be able to access the webui interface on an external network using the public IP.
Heres the dead simple python program I used to send and recieve text to kobold.cpp engine through the web API. Not sure how similar ollama but afaik openai-compatable API means it all should works close to the same for compatibility(I think? lol!) if you give it a shot Make sure to set the .py file you make as executable and run it from a terminal doing ./filename.py to see the output in real time. It should make a log text file in same dir as the program too. Just use your host computers local ip if the python script pc is on same network.
spoiler
import requests
# Configuration
API_URL = "http://10.0.0.xx:5001/api/v1/generate"
PROMPT = "Tell me a short story about a robot learning to dance."
OUTPUT_FILE = "output.txt"
# Define the API request data
data = {
"prompt": PROMPT,
"max_length": 200, # Adjust response length
"temperature": 0.7, # Control randomness (0=deterministic, 1=creative)
"top_p": 0.9, # Focus on high-probability tokens
}
# Send the request to kobold.cpp
response = requests.post(API_URL, json=data)
if response.status_code == 200:
# Extract the generated text
result = response.json()
generated_text = result["results"][0]["text"]
# Save to a text file
with open(OUTPUT_FILE, "w") as f:
f.write(generated_text)
print(f"Response saved to {OUTPUT_FILE}!")
else:
print(f"Error: {response.status_code} - {response.text}")
Right now I have Ollama / Open-WebUI, Kokoro FastAPI, ComfyUI, Wan2GP, and FramePack Studio set up. I recently (as in yesterday) configured an API key middleware with Traefik and placed it in front of Ollama and Comfy, but currently nothing is using them yet.
I’ll probably try out Devstral with one of the agentic coding frameworks, like Void or Anon Kode. I may also try out one of the FOSS writing studios (like Plot Bunni) and connect my own Ollama instance. I could use NovelCrafter but paying a subscription fee to use my own server for the compute intensive part feels silly to me.
I tried to use Open Notebook (basically a replacement for NotebookLM) with Ollama and Kokoro, with Kokoro FastAPI as my OpenAI endpoint, but turns out it only supported, and required, text embeddings from OpenAI, so I couldn’t do that fully on my local. At some point, if they don’t fix that, I’m planning to either add support myself or set up some routes with Traefik where the ones OpenNotebook uses point to the service I want to use.
ETA: n8n is one of the services I plan to set up next, and I’ll likely end up integrating both Ollama and Comfy workflows into it.
I connected Karakeep to ollama for automatic tagging.
good question honestly
I didn't do dedicated but i have nice hardware for the ollama / OpenWebUI stack, and I've done some experimenting. Certain models have limited uses for me and I'm at least glad I'm able to eff around with this securely/locally. So far I've found Qwen3 and Deepseekr1 to be kinda handy sometimes.
I'd love to find more applications so i guess this reply is more of a follow