1
0
mirror of https://github.com/containers/ramalama.git synced 2026-02-05 06:46:39 +01:00
Files
ramalama/inference-spec
Mike Bonnet 162cc543f1 serve --rag: run rag_framework in a separate container
rag_framework is now a proxy that enriches requests to the LLM with RAG context. Run it in a separate
container and send requests from the chat interface to the RAG proxy.

Generate the rag_framework command using CommandFactory.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2025-10-23 22:02:26 -07:00
..