mirror of https://github.com/containers/ramalama.git synced 2026-02-05 15:47:26 +01:00

60 Commits

Author SHA1 Message Date
Mike Bonnet
d06f6d3519 remove whisper.cpp from all images
Remove build and installation of whisper.cpp, and installation of ffmpeg.
Rename build_llama_and_whisper.sh to build_llama.sh.
Update Containerfiles to reference new script name.
Consolidate management of cmake args in build_llama.sh.
Remove references to whisper-server in various locations.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-27 16:34:24 -08:00
Mike Bonnet
14a3108872 stop building and releasing the entrypoint images
The entrypoint images are used by AI Lab, which pins to older digests and
has no need to update. Stop building and releasing new, unused versions.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-22 09:44:41 -08:00
Mike Bonnet
b036161e41 don't install ramalama (or anything python-related) into the inference engine images
Since RAG has been moved into its own images, no python runtime environment is
required in the inference engine images.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2025-11-18 06:47:16 -08:00
Kevin Pouget
8a8ce838d0 container_build.sh: pass environment variables to the image builder
Signed-off-by: Kevin Pouget <kpouget@redhat.com>
2025-08-19 15:29:58 +02:00
Jeff Ligon
678092638f building stable diffusion.
assisted by cursor/claude.

Signed-off-by: Jeff Ligon <jligon@redhat.com>

going with gemini's python_check(with the double equal) for this one.

Signed-off-by: Jeff Ligon <jligon@redhat.com>

changing to `MYPIP ?= pipx`

Signed-off-by: Jeff Ligon <jligon@redhat.com>

fixing uv pipx

Signed-off-by: Jeff Ligon <jligon@redhat.com>

taking it back to pip to fix ci

Signed-off-by: Jeff Ligon <jligon@redhat.com>
2025-08-01 12:06:47 -04:00
Mike Bonnet
2d3f8dfe28 fix GPU selection and pytorch URL when building rag images
A previous commit changed the second argument to add_rag() from the image name to the
full repo path. Update the case statement accordingly, so the "GPU" variable is set correctly.

The "cuda" directory is no longer available on download.pytorch.org. When building for cuda,
pull wheels from the "cu128" directory, which contains binaries built for CUDA 12.8.

When building rocm* images, download binaries from the "rocm6.3" directory, which are built
for ROCm 6.3.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2025-07-16 23:27:54 -07:00
Mike Bonnet
46c0154d2a build layered images from Containerfiles
Move the Containerfiles for the entrypoint and rag images out of container_build.sh and into their
own files. This is necessary so they can be built with Konflux.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2025-07-01 15:14:26 -07:00
Daniel J Walsh
0f90023a52 Merge pull request #1609 from rhatdan/build
Separate build image into its own VM
2025-06-26 09:36:59 -04:00
Daniel J Walsh
de46cd16c7 Separate build image into its own VM
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-06-26 09:12:27 -04:00
Mike Bonnet
a651be7832 add support for running bats in a container
Add a new "bats" container which is configured to run the bats tests.
The container supports running the standard bats test suite
(container-in-container) as well as the "--nocontainer" tests.

Add two new Makefile targets for running the bats container via podman.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2025-06-25 21:45:12 -07:00
Daniel J Walsh
3f87444b6f Merge pull request #1101 from lsm5/tmt-gpu
TMT: run tests with GPUs
2025-06-20 06:40:38 -04:00
Lokesh Mandvekar
a53c42723a TMT: run tests with GPUs
This commit adds TMT test jobs, triggered via Packit, that fetch an
instance with an NVIDIA GPU, as specified in `plans/no-rpm.fmf`. The GPU
can be verified in the gpu_info test result.

In addition, system tests (nocontainer), validate, and unit tests are
also triggered via TMT.

Fixes: #1054

TODO:
1. Enable bats-docker tests
2. Resolve f41 validate test failures

Signed-off-by: Lokesh Mandvekar <lsm5@fedoraproject.org>
2025-06-19 15:32:31 -04:00
Mike Bonnet
ae114e45af install ramalama into containers from the current checkout
Copy the current checkout of the ramalama repo into the containers and use that for installation.
This removes the need for an extra checkout of the ramalama repo, and is consistent with the build
process used by container_build.sh (which used a bind-mount rather than a copy).

This keeps the version of ramalama in sync with the Containerfiles, and makes testing and CI more
useful.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2025-06-19 08:37:53 -07:00
Aaron Teo
bf0bfe0761 docs: update container_build.sh help information
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>

fix: remove -v from print information

Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
2025-06-03 23:49:37 +08:00
Xiaodong Ye
b68c6b4c45 Address review comments
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-05-15 19:20:31 +08:00
Xiaodong Ye
80f2393283 Support Moore Threads GPU
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-05-15 19:20:31 +08:00
Xiaodong Ye
1801378950 Fix #1382
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-05-12 10:32:23 +08:00
Xiaodong Ye
d32d6ed6dd Support older version of Docker
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
2025-05-10 20:51:28 +08:00
Daniel J Walsh
5260ad701a Make version optional in build
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-05-09 10:14:04 -04:00
Daniel J Walsh
715dffbb53 Make --no-cache optional for make build
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-28 06:59:13 -04:00
Daniel J Walsh
6c59fd7fd5 Use current ramalama directory rather than main from repo
This allows users to experiment with content and get it into
container image.

Fixes: https://github.com/containers/ramalama/issues/1274

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-26 07:04:59 -05:00
Daniel J Walsh
c66e931b7a Allow building older versions of cuda
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-21 13:51:10 -04:00
Brian
cb8630e9aa add openvino model server image to quay.io/ramalama
Signed-off-by: Brian <bmahabir@bu.edu>
2025-04-16 18:25:31 -04:00
Daniel J Walsh
b996ae315d Build images for llama-stack
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-10 17:54:02 -04:00
Daniel J Walsh
275fe11a23 Mv ramalama-ci to ramalama-cli
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-01 18:24:32 -04:00
Daniel J Walsh
cf0b842959 Merge pull request #1090 from rhatdan/rag
Move all RAG support to the -rag images
2025-04-01 17:37:39 -04:00
Daniel J Walsh
ef9e3fb377 Add ramalama-ci image
This image will just run ramalama inside of a container and
requires the user to leak the podman-socket into the container.

It will use Podman-remote for all of its actions.

Requested by the Podman Desktop team.

Fixes: https://github.com/containers/ramalama/issues/837

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-01 15:59:45 -04:00
Daniel J Walsh
ca23630470 Move all RAG support to the -rag images
Images have grown considerably with RAG support.
Do not force users who do not use rag to pay the
penalty.

Helps revert some growth complained about here:
https://github.com/containers/ramalama/issues/838

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-04-01 14:50:49 -04:00
Eric Curtin
83d9ac3966 Explain dryrun option better in container_build.sh
It was given some generic explanation

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-03-26 12:47:59 +00:00
Daniel J Walsh
ff0e5223d0 More updates for builds
Fix doc2rag to handle load properly

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-25 16:30:19 -04:00
Daniel J Walsh
d216212207 Run build_rag.sh as root
This fixes the build on intel-gpu container.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-24 13:36:55 -04:00
Daniel J Walsh
27ca51d87a Add docling support version 2
Remove pragmatic, and move to using local implementation
until llama-stack version is ready.

 python3 container-images/scripts/doc2rag.py --help
usage: docling [-h] target source [source ...]

process source files into RAG vector database

positional arguments:
  target
  source

options:
  -h, --help  show this help message and exit

ramalama rag should be using accelerated images.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-19 14:47:46 -04:00
Daniel J Walsh
18d90bb1ed Fix container_build.sh to build all images
Fixes: https://github.com/containers/ramalama/issues/987

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-19 05:04:38 -04:00
Daniel J Walsh
064b28d10f Fix handling of whisper-server and llama-server entrypoints
Entrypoint tests are blowing up so remove for now.

Fixes: https://github.com/containers/ramalama/issues/977

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-17 11:07:09 -04:00
Brian
57f4a6097b Add software to support using rag in RamaLama
This PR just installs the python requirements needed to play with the
rag_framework.py file.

I have not added the docling support yet, since that would swell the
size of the images.  Will add that in a separate PR.

Also remove pragmatic and begin conversion to new rag tooling.

Signed-off-by: Brian <bmahabir@bu.edu>
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-15 07:28:47 -04:00
Daniel J Walsh
bff0b2de0b Fix ENTRYPOINTS of whisper-server and llama-server
Fixes: https://github.com/containers/ramalama/issues/964

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-03-14 10:08:03 -04:00
Eric Curtin
bfe91e3c2d Make CI build all images
To ensure they all continue to build and remain of reasonable size.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-02-22 13:48:50 +00:00
Daniel J Walsh
4d209b92f4 Add entrypoint container images
Install podman-remote and ramalama so we can use ramalama
from within a container.

$ podman run --env XDG_RUNTIME_DIR -v $HOME:$HOME -v /run/user:/run/user --userns=keep-id -ti --privileged quay.io/ramalama/ramalama ramalama run granite

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-02-14 14:33:32 -05:00
Daniel J Walsh
10321b0ee0 Merge pull request #736 from cgruver/podman-farm-script
modify container_build.sh to add capability to use podman farm for multi-arch images
2025-02-04 16:48:52 -05:00
Charro Gruver
8e7dc3150c update logic to exclude certain images from multi-arch builds
Signed-off-by: Charro Gruver <cgruver@redhat.com>
2025-02-04 21:18:43 +00:00
Charro Gruver
8a187c65d2 fix syntax error
Signed-off-by: Charro Gruver <cgruver@redhat.com>
2025-02-04 20:25:09 +00:00
Charro Gruver
cdb7130f7f simplify script for multi-arch support
Signed-off-by: Charro Gruver <cgruver@redhat.com>
2025-02-04 20:19:32 +00:00
Charro Gruver
2193333029 modify container_build.sh to add capability to use podman farm for multi-arch images
Signed-off-by: Charro Gruver <cgruver@redhat.com>
2025-02-04 19:36:30 +00:00
Jakub Čajka
42f6cd9cf6 ramalama container: Make it possible to build basic container on all
RHEL architectures

Signed-off-by: Jakub Čajka <jcajka@redhat.com>
2025-02-04 14:49:52 +01:00
Eric Curtin
96542a67f2 This should save some space
All builds inherit just from the OS image. Helps save space as the
derived images just overwrite llama.cpp and whisper.cpp which is
wasteful.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-01-08 12:58:24 +00:00
Daniel J Walsh
600c9b7676 whisper.cpp now installs properly
These changes are required for building images.

Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
2025-01-07 10:16:54 -05:00
Eric Curtin
b3c1c0bc3e Update container images to use latest
Update the CUDA base image to version 12.6.3 for both builder
and runtime stages. Update the UBI base image to 9.5. Update the ROCm
version to 6.3.1.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-01-05 18:46:38 +00:00
Eric Curtin
7572236fb4 Improved command handling and refactoring
- Modified `Makefile` to pass the 'build' command to `container_build.sh`.
- Refactored `container_build.sh` to:
  - Use `case` statements for command handling.
  - Add `determine_platform` function to determine the platform.
  - Add `parse_arguments` function for better argument parsing.
  - Add `process_all_targets` function to handle all container images.
  - Add `print_usage` function to display usage information.
  - Enhance the `main` function to use the new functions and handle errors.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2025-01-03 17:29:02 +00:00
Eric Curtin
7d657f8759 Consistency changes
Ensure llama.cpp version is the same across containers. Removing some duplicate
actions, etc.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2024-11-05 03:48:05 +00:00
Eric Curtin
4e2fcbe1e6 Add vulkan image and show size
Add this image, we also want to see the size of this image in the
build.

Co-authored-by: Steffen Roecker <sroecker@redhat.com>
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
2024-10-23 18:16:27 +01:00