1
0
mirror of https://github.com/containers/ramalama.git synced 2026-02-05 06:46:39 +01:00
Commit Graph

86 Commits

Author SHA1 Message Date
Ian Eaves
f65529bda7 adds dev dependency groups
Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>
2025-06-06 18:12:33 -05:00
Daniel J Walsh
a9f5238082 Merge pull request #1340 from ieaves/feat/standardized-build
Remove hardcoded /usr/local site-packages injection to fix sys.path pollution
2025-05-09 10:55:44 -04:00
Sergio Arroutbi
1de0c27534 Groom coverage rules, genreate xml/lcov reports
Signed-off-by: Sergio Arroutbi <sarroutb@redhat.com>
2025-05-08 11:36:42 +02:00
Ian Eaves
785c66184b updated build to remove setup.py dependency to fix cli entrypoint
Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

removed uv.lock

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

reverts uv-install.sh, bin/ramalama, and flat cli hierarchy

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

packit version extraction from pyproject.toml

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

pyproject.toml references license file

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

fixed completion directory location

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

fixed format and check-format. There is no longer a root .py file to check

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

newline at end of install-uv.sh

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

remove *.py from make lint flake8 command

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

added import for ModelStoreImport to main

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

attempt to consolidate main functions

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

lint

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

Make bin/ramalama executable

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>

typo

Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>
2025-05-07 10:57:31 -05:00
Sergio Arroutbi
70806fa8ab Add coverage tools, run them via specific rules
Added new rules to install/run specific coverity tools:
* install-detailed-cov-requirements: Install basic coverage tools
* install-cov-requirements: Install extended coverage tools
* cov-tests: Execute basic coverage tools
* detailed-cov-tests: Execute extended coverage tools

Signed-off-by: Sergio Arroutbi <sarroutb@redhat.com>
2025-05-07 11:44:59 +02:00
Daniel J Walsh
715dffbb53 Make --no-cache optional for make build
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-28 06:59:13 -04:00
Sergio Arroutbi
837b0c99c2 Fix minor issues
- Include test rule in global Makefile
- Use GO variable in doc/Makefile
- Ramalama->RamaLama

Signed-off-by: Sergio Arroutbi <sarroutb@redhat.com>
2025-04-25 16:53:03 +02:00
Daniel J Walsh
c66e931b7a Allow building older versions of cuda
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-21 13:51:10 -04:00
Nathan Weinberg
082ed827f1 fix: add 'pipx' install to 'make install-requirements'
'make install-requirements' currently assumes 'pipx'
is installed in your env, but this may not be the case

add an explict install/upgrade command via pip

Signed-off-by: Nathan Weinberg <nweinber@redhat.com>
2025-04-11 10:04:33 -04:00
Eric Curtin
73c54bf34c Combine Vulkan, Kompute and CPU inferencing into one image
Less images to maintain, Vulkan is more mature and more widely
used than Kompute.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-03-27 15:17:44 +00:00
Eric Curtin
6416c9e3e9 Add ramalama client
Once we achieve feature parity with llama-run, we will more
tightly integrate this into RamaLama.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-03-20 17:30:38 +00:00
Brian
57f4a6097b Add software to support using rag in RamaLama
This PR just installs the python requirements needed to play with the
rag_framework.py file.

I have not added the docling support yet, since that would swell the
size of the images.  Will add that in a separate PR.

Also remove pragmatic and begin conversion to new rag tooling.

Signed-off-by: Brian <bmahabir@bu.edu>
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-15 07:28:47 -04:00
Eric Curtin
f6eaeb6b49 python3 validator
We are encountering issues where newer python3 features are
breaking systems with older versions of python3, such as macOS,
this should ensure we validate this in CI.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-03-13 15:13:51 +00:00
Michael Engel
f55475e36d Added converting go templates to jinja templates
Usually, the chat templates for gguf models are written as jinja templates.
Ollama, however, uses Go Templates specific to ollama. In order to use the
proper templates for models pulled from ollama, the chat templates are
converted to jinja ones and passed to llama-run.

Signed-off-by: Michael Engel <mengel@redhat.com>
2025-03-11 15:41:55 +01:00
Eric Curtin
2769347597 Extend make validate check to do more
It also does check-format now.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-03-07 12:24:38 +00:00
Daniel J Walsh
adc53fea4e Merge pull request #911 from leo-pony/main
Add support for llama.cpp engine to use ascend NPU device
2025-03-06 12:11:30 -05:00
leo-pony
93c023d4a7 Add code for ascend npu supporting for llama.cpp engine
Signed-off-by: leo-pony <nengjunma@outlook.com>
2025-03-06 17:25:46 +08:00
Eric Curtin
4b1b4f4bc4 Add new ramalama-*-core executables
ramalama-serve-core is intended to act as a proxy and implement
multiple-models. ramalama-client-core in intended to act as a OpenAI
client. ramalama-run-core is intended to act as ramalama-serve-core +
ramalama-client-core, both processes will die on completion of
ramalama-run-core.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-03-04 14:00:50 +00:00
Eric Curtin
b027740e42 Switch from tiny to smollm:135m
This is probably a consequence of my slow network, but I switched
to smollm:135m, it's easier for demos. tiny was taking too long
to download.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-02-26 19:43:34 +00:00
Michael Engel
3622340635 Apply formatting and linting to unit tests
Signed-off-by: Michael Engel <mengel@redhat.com>
2025-02-26 17:42:30 +01:00
Michael Engel
149086e043 Added unit tests for new model factory
Signed-off-by: Michael Engel <mengel@redhat.com>
2025-02-24 10:11:12 +01:00
Eric Curtin
bfe91e3c2d Make CI build all images
To ensure they all continue to build and remain of reasonable size.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-02-22 13:48:50 +00:00
Michael Engel
c21764e818 Introduced isort in the format make targets
Signed-off-by: Michael Engel <mengel@redhat.com>
2025-02-15 15:00:56 +01:00
Michael Engel
73a74bea19 Refined formatting and linting
Split make target lint into lint, format and check-format
and updated the CI steps accordingly. Also moved configuration
of black to pyproject.toml and flake8 to .flake8 file.

Signed-off-by: Michael Engel <mengel@redhat.com>
2025-02-15 15:00:56 +01:00
Michael Engel
b31482cc8d Pin versions of the development tools to major version
By pinning the version of the development tools, the risk of
accidental upgrades and breaking changes leading are mitigated.

Signed-off-by: Michael Engel <mengel@redhat.com>
2025-02-15 14:56:13 +01:00
Daniel J Walsh
7f1e680b09 Attempt to use build_llama_and_whisper.sh
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-02-12 13:03:23 -05:00
Daniel J Walsh
4e4e6708bf Add ramalama rag command
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-02-12 11:43:09 -05:00
Daniel J Walsh
c166c0a007 Begin process of packaging PRAGmatic
Building Pragmatic into a container image is fairly easy.

podman build --build-arg IMAGE=quay.io/ramalama/rocm -t quay.io/ramalama/rocm-rag container-images/pragmatic

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-02-05 14:07:56 -05:00
Charro Gruver
7ba4c953d1 Fix syntax error in Makefile
Signed-off-by: Charro Gruver <cgruver@redhat.com>
2025-02-04 19:47:47 +00:00
Charro Gruver
e7495266ec Modify Makefile to support multi-arch builds
Signed-off-by: Charro Gruver <cgruver@redhat.com>
2025-02-04 19:44:27 +00:00
Daniel J Walsh
6f1c21c3c4 Remove omlmd from OCI calls
Also Simplify Spec File

Fedora 39 is no longer supported, so remove checks in spec file.

More Podman to Recommends, You can run RamaLama with no container engine or with Docker.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-01-15 14:46:51 -05:00
Daniel J Walsh
9d9d07451d container_build.sh works on MAC
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-01-14 17:12:31 -05:00
Eric Curtin
08ae68241f Merge pull request #544 from rhatdan/run
whisper.cpp now installs properly
2025-01-07 15:20:19 +00:00
Daniel J Walsh
600c9b7676 whisper.cpp now installs properly
These changes are required for building images.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-01-07 10:16:54 -05:00
Eric Curtin
dd38466f6f Build to build_rm
This should be build_rm

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-01-07 14:46:24 +00:00
Eric Curtin
b3c1c0bc3e Update container images to use latest
Update CUDA base image from version to 12.6.3 for both builder
and runtime stages. Update UBI base image to 9.5. Update ROCm
version in to 6.3.1.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-01-05 18:46:38 +00:00
Eric Curtin
7572236fb4 Improved command handling and refactoring
- Modified `Makefile` to pass the 'build' command to `container_build.sh`.
- Refactored `container_build.sh` to:
  - Use `case` statements for command handling.
  - Add `determine_platform` function to determine the platform.
  - Add `parse_arguments` function for better argument parsing.
  - Add `process_all_targets` function to handle all container images.
  - Add `print_usage` function to display usage information.
  - Enhance the `main` function to use the new functions and handle errors.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-01-03 17:29:02 +00:00
swarajpande5
19e38583c5 Skip codespell of venv by adding --skip in the command
Signed-off-by: swarajpande5 <swarajpande5@gmail.com>
2024-12-30 18:12:36 +05:30
swarajpande5
87dcf7517f Remove tqdm dependency
Signed-off-by: swarajpande5 <swarajpande5@gmail.com>
2024-12-22 18:45:03 +05:30
Daniel J Walsh
9d8d4548a6 Make it easier to test-run manually
NO need to leak ramalama storage into container any longer.

Since we are disabling SELinux don't add Z.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2024-11-08 16:20:08 -05:00
Daniel J Walsh
7c2e8c35b3 Add man page information for ramalama.conf
Add missing transport directive in ramalama.conf
Add default ramalama.conf file.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2024-10-30 10:22:07 -04:00
Stephen Smoogen
cd2ee69a69 Fix merge branch 'main' into fix_spec_v0.0.20
I found that the fix in the Makefile still broke on f39, and having
the line just ignore if it fails worked.

Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2024-10-24 13:15:50 +00:00
Stephen Smoogen
24b3b813b3 Fix Makefile to be less stringent on failues of zsh
I am making some lines optional in the Makefile and added fixes from
the RPM review.

Signed-off-by: Stephen Smoogen <ssmoogen@redhat.com>
2024-10-24 12:57:03 +00:00
Eric Curtin
b9ed1eaf1d Added a check to the zsh completions generation step
Ensuring that zsh is actually supported

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2024-10-23 18:23:59 +01:00
Robert Sturla
4c85d7d710 fix: update Makefile and CI to run individual container image builds
Signed-off-by: Robert Sturla <robertsturla@outlook.com>
2024-10-21 17:55:18 +01:00
Daniel J Walsh
333ec5969d split build from validate in Makefile
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2024-10-17 16:52:04 -04:00
Eric Curtin
5ca5a5a6a3 Merge pull request #320 from rhatdan/docker
Add support for testing with docker
2024-10-16 18:02:42 +01:00
Daniel J Walsh
77c149f9f7 Add support for testing with docker
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2024-10-16 11:19:15 -04:00
Daniel J Walsh
dd1d1ef6df Fix handling of completions on MacOS
Currently we are not installing command completions via PyPi.
For some reason python is ignoring the build directory.
Moving the completions from build to completions fixes the issue.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2024-10-16 09:58:50 -04:00
Daniel J Walsh
75dfd10d69 Add zsh completions
Fixes: https://github.com/containers/ramalama/issues/302

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2024-10-15 17:42:49 -04:00