Maroon@lemmy.world to

Selfhosted@lemmy.worldEnglish · 1 day ago

Please suggest some good self-hostable RAG for my LLM.

29

Please suggest some good self-hostable RAG for my LLM.

Maroon@lemmy.world to

Selfhosted@lemmy.worldEnglish · 1 day ago

A while ago, I had requested help with using LLMs to manage all my teaching notes. I have since installed Ollama and been playing with it to get a feel for the setup.

I was also suggested the use of RAG (Retrieval Augmented Generation ) and CA (cognitive architecture). However, I am unclear on good self hosted options for these two tasks. Could you please suggest a few?

For example, I tried ragflow.io and installed it on my system, but it seems I need to setup an account with a username and password to use it. It remains unclear if I can use the system offline like the base ollama model, and that information won’t be sent from my computer system.

Chat

BaroqueInMind@lemmy.one
link
fedilink
English
arrow-up
1·
edit-2
7 hours ago
Why not use this and select whatever LLM to leverage as a RAG? It literally allows you to self host the model and select any model for both chat and RAG analysis. I have it set to Hermes3 8B for chat and a 1.3B Llama3 as the RAG.

Selfhosted@lemmy.world

selfhosted@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !selfhosted@lemmy.world

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

275 users / day
2.06K users / week
4.66K users / month
12.3K users / 6 months
2 local subscribers
39.5K subscribers
2.87K Posts
56.4K Comments
Modlog