Remove build and installation of whisper.cpp, and installation of ffmpeg.
Rename build_llama_and_whisper.sh to build_llama.sh.
Update Containerfiles to reference new script name.
Consolidate management of cmake args in build_llama.sh.
Remove references to whisper-server in various locations.
Signed-off-by: Mike Bonnet <mikeb@redhat.com>
The entrypoint images are used by AI Lab, which pins to older digests and
has no need to update. Stop building and releasing new, unused versions.
Signed-off-by: Mike Bonnet <mikeb@redhat.com>
Since RAG has been moved into its own images, no Python runtime environment is
required in the inference engine images.
Signed-off-by: Mike Bonnet <mikeb@redhat.com>
Assisted by Cursor/Claude.
Signed-off-by: Jeff Ligon <jligon@redhat.com>
Going with Gemini's python_check (with the double equals) for this one.
Signed-off-by: Jeff Ligon <jligon@redhat.com>
Changing to `MYPIP ?= pipx`.
Signed-off-by: Jeff Ligon <jligon@redhat.com>
Fixing uv pipx.
Signed-off-by: Jeff Ligon <jligon@redhat.com>
Taking it back to pip to fix CI.
Signed-off-by: Jeff Ligon <jligon@redhat.com>
A previous commit changed the second argument to add_rag() from the image name to the
full repo path. Update the case statement accordingly, so the "GPU" variable is set correctly.
The "cuda" directory is no longer available on download.pytorch.org. When building for cuda,
pull wheels from the "cu128" directory, which contains binaries built for CUDA 12.8.
When building rocm* images, download binaries from the "rocm6.3" directory, which are built
for ROCm 6.3.
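The shape of the fix can be sketched in shell; the function and variable names here are illustrative, not the actual add_rag() code:

```shell
# Hypothetical sketch: match on the full repo path (not just the image
# name) to set GPU and pick the matching PyTorch wheel index directory.
select_pytorch_index() {
    repo_path=$1    # e.g. quay.io/ramalama/cuda, not just "cuda"
    case $repo_path in
    *cuda*)
        GPU=cuda
        whl_dir=cu128      # wheels built for CUDA 12.8
        ;;
    *rocm*)
        GPU=rocm
        whl_dir=rocm6.3    # wheels built for ROCm 6.3
        ;;
    *)
        GPU=none
        whl_dir=cpu
        ;;
    esac
    echo "https://download.pytorch.org/whl/$whl_dir"
}

select_pytorch_index quay.io/ramalama/cuda
# prints https://download.pytorch.org/whl/cu128
```

The wildcard patterns keep the match working whether the second argument is the bare image name or the full repo path.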
Signed-off-by: Mike Bonnet <mikeb@redhat.com>
Move the Containerfiles for the entrypoint and rag images out of container_build.sh and into their
own files. This is necessary so they can be built with Konflux.
Signed-off-by: Mike Bonnet <mikeb@redhat.com>
Add a new "bats" container that is configured to run the bats tests.
The container supports running the standard bats test suite
(container-in-container) as well as the "--nocontainer" tests.
Add two new Makefile targets for running the bats container via podman.
Signed-off-by: Mike Bonnet <mikeb@redhat.com>
This commit adds TMT test jobs, triggered via Packit, that fetch an
instance with an NVIDIA GPU, as specified in `plans/no-rpm.fmf`; the GPU
can be verified in the gpu_info test result.
In addition, the system (nocontainer), validate, and unit tests are
also triggered via TMT.
Fixes: #1054
TODO:
1. Enable bats-docker tests
2. Resolve f41 validate test failures
Signed-off-by: Lokesh Mandvekar <lsm5@fedoraproject.org>
Copy the current checkout of the ramalama repo into the containers and use that for installation.
This removes the need for an extra checkout of the ramalama repo, and is consistent with the build
process used by container_build.sh (which used a bind-mount rather than a copy).
This keeps the version of ramalama in sync with the Containerfiles, and makes testing and CI more
useful.
Signed-off-by: Mike Bonnet <mikeb@redhat.com>
This image just runs ramalama inside a container and
requires the user to leak the Podman socket into the container.
It uses podman-remote for all of its actions.
Requested by the Podman Desktop team.
Fixes: https://github.com/containers/ramalama/issues/837
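A minimal sketch of such an invocation, assuming a rootless socket under $XDG_RUNTIME_DIR/podman/podman.sock and the quay.io/ramalama/ramalama image (both assumptions; the command is built as a string and printed rather than executed, so the sketch stays side-effect free):

```shell
# Assumed rootless socket path; rootful podman uses /run/podman/podman.sock.
sock=${XDG_RUNTIME_DIR:-/run/user/$(id -u)}/podman/podman.sock

# Leak the host socket into the container and point podman-remote at it
# via CONTAINER_HOST.
cmd="podman run --rm -ti \
  -v $sock:/run/podman/podman.sock \
  -e CONTAINER_HOST=unix:///run/podman/podman.sock \
  quay.io/ramalama/ramalama ramalama run granite"
echo "$cmd"
```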
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
Remove pragmatic and move to a local implementation
until the llama-stack version is ready.
python3 container-images/scripts/doc2rag.py --help
usage: docling [-h] target source [source ...]

process source files into RAG vector database

positional arguments:
  target
  source

options:
  -h, --help  show this help message and exit
ramalama rag should be using accelerated images.
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
This PR just installs the Python requirements needed to play with the
rag_framework.py file.
I have not added the docling support yet, since that would swell the
size of the images. Will add that in a separate PR.
Also remove pragmatic and begin the conversion to the new RAG tooling.
Signed-off-by: Brian <bmahabir@bu.edu>
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
Install podman-remote and ramalama so we can use ramalama
from within a container.
$ podman run --env XDG_RUNTIME_DIR -v $HOME:$HOME -v /run/user:/run/user --userns=keep-id -ti --privileged quay.io/ramalama/ramalama ramalama run granite
Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>
All builds now inherit directly from the OS image. This helps save
space: the derived images previously just overwrote llama.cpp and
whisper.cpp, which was wasteful.
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
Update the CUDA base image to 12.6.3 for both the builder
and runtime stages. Update the UBI base image to 9.5. Update the
ROCm version to 6.3.1.
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
- Modified `Makefile` to pass the 'build' command to `container_build.sh`.
- Refactored `container_build.sh` to:
- Use `case` statements for command handling.
- Add `determine_platform` function to determine the platform.
- Add `parse_arguments` function for better argument parsing.
- Add `process_all_targets` function to handle all container images.
- Add `print_usage` function to display usage information.
- Enhance the `main` function to use the new functions and handle errors.
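The dispatch described above can be sketched as follows; the function bodies are placeholders, only the case-based structure mirrors the refactor:

```shell
# Placeholder implementations; the real functions live in container_build.sh.
determine_platform() {
    # derive a container platform string from the host architecture
    echo "linux/$(uname -m | sed -e 's/x86_64/amd64/' -e 's/aarch64/arm64/')"
}

print_usage() {
    echo "Usage: container_build.sh [build]"
}

main() {
    case $1 in
    build)
        echo "building for platform: $(determine_platform)"  # e.g. linux/amd64
        ;;
    *)
        print_usage
        ;;
    esac
}

main build
```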
Signed-off-by: Eric Curtin <ecurtin@redhat.com>
Add this image; we also want to see its size in the
build.
Co-authored-by: Steffen Roecker <sroecker@redhat.com>
Signed-off-by: Eric Curtin <ecurtin@redhat.com>