mirror of https://github.com/containers/ramalama.git synced 2026-02-05 06:46:39 +01:00

3795 Commits

Author SHA1 Message Date
Mike Bonnet
ec166b6aba Merge pull request #2393 from containers/renovate/huggingface-hub-1.x
Update dependency huggingface-hub to ~=1.4.0
2026-02-04 12:04:28 -08:00
Mike Bonnet
d4646306cc Merge pull request #2366 from olliewalsh/version_0_17_0
Bump to v0.17.0
v0.17.0
2026-02-04 08:46:04 -08:00
renovate[bot]
39e1f1f9bb Update dependency huggingface-hub to ~=1.4.0
Signed-off-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
2026-02-04 15:10:26 +00:00
Oliver Walsh
74d664eb49 Add generated docs to gitignore
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-02-04 11:14:46 +00:00
Oliver Walsh
1377d5401f Bump to v0.17.0
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-02-04 11:14:33 +00:00
Mike Bonnet
d6e37782c6 Merge pull request #2370 from mikebonnet/rm-bats-image
stop building the bats image
2026-02-02 10:33:49 -08:00
Mike Bonnet
1fb1af8adf Merge pull request #2369 from mikebonnet/rag-deps-2026-01
Update all dependencies in the -rag images to their latest versions
2026-02-02 10:33:37 -08:00
Mike Bonnet
f483c043d1 Merge pull request #2368 from mikebonnet/multi-stage-everywhere
use multi-stage builds for all images
2026-02-02 10:33:19 -08:00
Mike Bonnet
f171a0c452 Merge pull request #2388 from containers/konflux/mintmaker/main/docsite-lockfile-maintenance
Lock file maintenance
2026-02-02 06:16:00 -08:00
Mike Bonnet
e0fb735f11 Merge pull request #2386 from containers/konflux/references/main
Update Konflux references
2026-02-02 06:15:11 -08:00
red-hat-konflux-kflux-prd-rh03[bot]
ca5bb2b3b2 Lock file maintenance
Signed-off-by: red-hat-konflux-kflux-prd-rh03 <206760901+red-hat-konflux-kflux-prd-rh03[bot]@users.noreply.github.com>
2026-02-02 01:16:49 +00:00
red-hat-konflux-kflux-prd-rh03[bot]
d80c22ff5b Update Konflux references
Signed-off-by: red-hat-konflux-kflux-prd-rh03 <206760901+red-hat-konflux-kflux-prd-rh03[bot]@users.noreply.github.com>
2026-01-31 05:12:56 +00:00
Mike Bonnet
9bc3a704b5 Merge pull request #2372 from jwieleRH/docsite
Remove generated doc files.
2026-01-30 13:51:38 -08:00
Mike Bonnet
1d3d44f4ec Merge pull request #2371 from ramalama-labs/chore/cli-comment-restore
Restores comment from #2309
2026-01-30 13:37:45 -08:00
Mike Bonnet
cbad31ad14 Update all dependencies in the -rag images to their latest versions
Run build_rag.sh from the bind-mounted context dir, rather than copying
the script and requirements files into the image.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-30 07:23:47 -08:00
John Wiele
6ffa25ff89 Remove generated doc files.
Clean the docsite directory as part of "make clean". Remove generated
doc files from the repository; their presence creates a risk of
publishing incorrect documentation.

Signed-off-by: John Wiele <jwiele@redhat.com>
2026-01-30 08:34:24 -05:00
Mike Bonnet
88d597e4d9 use multi-stage builds for all images
Standardize on multi-stage builds for all images, which avoids including
development tools and libraries in the final images, reducing image size.
Install all llama.cpp binaries and shared libraries for consistency with
upstream images. Avoid installing unnecessary (and large) .a files from
the installation directory.
Call build_llama.sh to install runtime dependencies in the final images
so package versions can be kept consistent between build and final images.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-29 19:17:37 -08:00
Ian Eaves
969156cd5f moved comment
Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>
2026-01-29 20:54:49 -06:00
Ian Eaves
e7ebd2ed4c restore comment
Signed-off-by: Ian Eaves <ian.k.eaves@gmail.com>
2026-01-29 20:49:19 -06:00
Mike Bonnet
5b78694273 stop building the bats image
Remove the Tekton pipelines for building the bats image, and the bats-integration
test. Run tests as part of the ramalama image build using the e2e image.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-29 18:17:44 -08:00
Mike Bonnet
acbb271e1a intel-gpu: improvements to entrypoint.sh
Redirect the output from setvars.sh so commands that parse output (like
"ramalama bench") aren't confused.
If run with a tty and no args, start an interactive shell for convenience and
consistency with other images.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-29 12:51:50 -08:00
Mike Bonnet
eea5c4f35f Merge pull request #2365 from olliewalsh/bump_llamacpp
Bump llama.cpp version
2026-01-29 12:35:24 -08:00
Mike Bonnet
5c3d3781ac Merge pull request #2359 from olliewalsh/flash_attn
2026-01-29 09:37:30 -08:00
Oliver Walsh
6e897fdb9e Merge pull request #2364 from olliewalsh/fix_unit_tests
[trivial] correctly omit test_serve_api
2026-01-29 17:06:15 +00:00
Oliver Walsh
fa89144e42 [trivial] correctly omit test_serve_api
Restore the e2e mark, otherwise unit-tests will run it

Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-29 16:50:17 +00:00
Oliver Walsh
c30e455613 Bump llama.cpp version
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-29 16:29:59 +00:00
Oliver Walsh
e956d11d70 Use default (auto) value for llama.cpp flash-attn
Also fix the uses_nvidia logic which was inverted.

Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-29 12:21:11 +00:00
Oliver Walsh
2fdd68abd9 Revert "trial fix for bats-nocontainer"
This reverts commit 82843f94f0.

Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-29 12:21:11 +00:00
Oliver Walsh
97b49459d9 Merge pull request #2224 from jwieleRH/hf2
Download safetensors models from huggingface.co with https.
2026-01-29 11:25:21 +00:00
Oliver Walsh
f4df84d94a Merge pull request #2363 from olliewalsh/ci_fixes
Fix slow test_run_model_with_prompt on windows
2026-01-29 11:12:26 +00:00
Ian Eaves
fc802f3c3b Merge pull request #2309 from ramalama-labs/imp/command-load-time
Improving cold start time on cli invocation.
2026-01-29 03:29:12 -06:00
John Wiele
22232231e5 Download safetensors models from huggingface.co with https.
Refactor the fetch_metadata method to attempt to fetch GGUF metadata
from a manifest, and then to attempt to fetch safetensors metadata
from the repo tree, including potentially paginated lists, although
there does not seem to be a huggingface model that is big enough to
require pagination yet.

To fetch a safetensors model, download the set of files from the repo
with https requests. Add an "other_files" category for safetensors
files that are neither JSON config files nor .safetensors model files,
such as the tokenizer.model file.

Remove code to download models from huggingface.co with the
huggingface cli. It didn't work correctly anyway. Stub out the
_collect_cli_files and in_existing_cache methods for the huggingface
transport.

Enable bats test of downloading safetensors models from
huggingface. Remove bats test and e2e test of downloading models with
the huggingface cli. Adjust test expectation for split safetensors
model.

Fixes: #1493

Signed-off-by: John Wiele <jwiele@redhat.com>
2026-01-28 23:35:04 -05:00
Oliver Walsh
a096a0593d Fix slow test_run_model_with_prompt on windows
Limit the number of output tokens on macos/windows to work around the
model rambling.

Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-29 02:33:17 +00:00
Mike Bonnet
de06c29cde Merge pull request #2358 from olliewalsh/ci_fixes
Reduce CI load and fix unreliable tests
2026-01-28 17:49:04 -08:00
Oliver Walsh
894adfe113 Drop build-image job from ci
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 21:15:40 +00:00
Oliver Walsh
70a3be3d4c Drop test_serve_api from e2e tests
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 16:52:43 +00:00
Oliver Walsh
d1fb7d35c8 Serialize ci jobs for each branch and cancel running ci if PR is updated
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 16:52:43 +00:00
Oliver Walsh
8efa71623d Skip e2e tests if lint or unit-tests fail
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 16:52:43 +00:00
Oliver Walsh
6887dbab78 Remove bats tests from github ci workflow
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 16:52:43 +00:00
Oliver Walsh
0a9095df83 Enable pytest duration reporting
Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 16:52:43 +00:00
Oliver Walsh
49ccb6b344 Fix unreliable assert in test_pull_using_ollama_cache
Just check the command output reports that it's pulling from cache.

Pull/pull-from-cache timing will vary depending on network/disk, so not safe
to assume pulling from disk will be twice as fast as network.

Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 16:52:43 +00:00
Oliver Walsh
22efcb53cb Speed up quadlet/kube generate bats/e2e tests
Disable image pulling when using HIP_VISIBLE_DEVICES, otherwise the large
rocm image will be pulled.

Skip all of the generate tests when --nocontainer is true.

Signed-off-by: Oliver Walsh <owalsh@redhat.com>
2026-01-28 16:52:43 +00:00
Oliver Walsh
b3f09536e0 Merge pull request #2357 from mikebonnet/rm-whisper
remove whisper.cpp from all images
2026-01-28 09:16:58 +00:00
Mike Bonnet
828d11832f Merge pull request #2349 from containers/konflux/mintmaker/main/registry.access.redhat.com-ubi9-ubi-9.x
2026-01-27 22:53:01 -08:00
Mike Bonnet
8d325b06e9 Merge pull request #2354 from mikebonnet/fix-docs-escape
2026-01-27 22:50:12 -08:00
Mike Bonnet
d06f6d3519 remove whisper.cpp from all images
Remove build and installation of whisper.cpp, and installation of ffmpeg.
Rename build_llama_and_whisper.sh to build_llama.sh.
Update Containerfiles to reference new script name.
Consolidate management of cmake args in build_llama.sh.
Remove references to whisper-server in various locations.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-27 16:34:24 -08:00
Mike Bonnet
34d89cee90 rocm-fedora: delete Containerfile
This image is no longer built. The "rocm" image is based on Fedora.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-27 12:52:39 -08:00
Oliver Walsh
e6af4d3f14 Merge pull request #2335 from mikebonnet/black-26
update to black 26.1 and fix formatting
2026-01-27 18:41:08 +00:00
Mike Bonnet
9b3155db60 docs: fix docsite build by escaping angle bracket and curly brackets
Required for successful mdx processing.

Signed-off-by: Mike Bonnet <mikeb@redhat.com>
2026-01-27 10:13:01 -08:00
red-hat-konflux-kflux-prd-rh03[bot]
a5192eae41 Update registry.access.redhat.com/ubi9/ubi Docker tag to v9.7-1769417801
Signed-off-by: red-hat-konflux-kflux-prd-rh03 <206760901+red-hat-konflux-kflux-prd-rh03[bot]@users.noreply.github.com>
2026-01-27 16:52:08 +00:00