mirror of
https://github.com/containers/ramalama.git
synced 2026-02-06 09:47:54 +01:00
Relates to: https://github.com/containers/ramalama/pull/1982 Previously, the --max-tokens param was integrated in the daemon internal command factory. With the introduction of the spec, this command factory has now been replaced by the spec and the --max-tokens option added to the llama.cpp one. Signed-off-by: Michael Engel <mengel@redhat.com>