'make install-requirements' currently assumes 'pipx'
is installed in your env, but this may not be the case
add an explict install/upgrade command via pip
Signed-off-by: Nathan Weinberg <nweinber@redhat.com>
This PR just installs the python requirements needed to play with the
rag_framework.py file.
I have not added the docling support yet, since that would swell the
size of the images. Will add that in a separate PR.
Also remove pragmatic and begin conversion to new rag tooling.
Signed-off-by: Brian <bmahabir@bu.edu>
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
We are encountering issues where newer python3 features are
breaking systems with older versions of python3, such as macOS,
this should ensure we validate this in CI.
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
Usually, the chat templates for gguf models are written as jinja templates.
Ollama, however, uses Go Templates specific to ollama. In order to use the
proper templates for models pulled from ollama, the chat templates are
converted to jinja ones and passed to llama-run.
Signed-off-by: Michael Engel <mengel@redhat.com>
ramalama-serve-core is intended to act as a proxy and implement
multiple-models. ramalama-client-core in intended to act as a OpenAI
client. ramalama-run-core is intended to act as ramalama-serve-core +
ramalama-client-core, both processes will die on completion of
ramalama-run-core.
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
This is probably a consequence of my slow network, but I switched
to smollm:135m, it's easier for demos. tiny was taking too long
to download.
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
Split make target lint into lint, format and check-format
and updated the CI steps accordingly. Also moved configuration
of black to pyproject.toml and flake8 to .flake8 file.
Signed-off-by: Michael Engel <mengel@redhat.com>
By pinning the version of the development tools, the risk of
accidental upgrades and breaking changes leading are mitigated.
Signed-off-by: Michael Engel <mengel@redhat.com>
Building Pragmatic into a container image is fairly easy.
podman build --build-arg IMAGE=quay.io/ramalama/rocm -t quay.io/ramalama/rocm-rag container-images/pragmatic
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
Also Simplify Spec File
Fedora 39 is no longer supported, so remove checks in spec file.
More Podman to Recommends, You can run RamaLama with no container engine or with Docker.
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
Update CUDA base image from version to 12.6.3 for both builder
and runtime stages. Update UBI base image to 9.5. Update ROCm
version in to 6.3.1.
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
- Modified `Makefile` to pass the 'build' command to `container_build.sh`.
- Refactored `container_build.sh` to:
- Use `case` statements for command handling.
- Add `determine_platform` function to determine the platform.
- Add `parse_arguments` function for better argument parsing.
- Add `process_all_targets` function to handle all container images.
- Add `print_usage` function to display usage information.
- Enhance the `main` function to use the new functions and handle errors.
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
NO need to leak ramalama storage into container any longer.
Since we are disabling SELinux don't add Z.
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
I found that the fix in the Makefile still broke on f39, and having
the line just ignore if it fails worked.
Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>