chore: add CLAUDE.md

A committed CLAUDE.md can ensure more consistent behavior for contributors using Claude Code or other supported agents for RamaLama development.

Signed-off-by: Nathan Weinberg <nweinber@redhat.com>
CLAUDE.md (new file, 107 lines):
# CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

## Project Overview

RamaLama is a CLI tool for managing and serving AI models using containers. It provides a container-centric approach to AI model management, supporting multiple model registries (HuggingFace, Ollama, OCI registries) and automatic GPU detection with appropriate container image selection.
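
For orientation, a minimal workflow looks roughly like this (the model reference is illustrative; check `ramalama --help` for the authoritative command list):

```bash
# Pull a model from a registry; the URL prefix selects the transport
ramalama pull ollama://tinyllama

# Chat with the model interactively
ramalama run ollama://tinyllama

# Serve the model over a REST API
ramalama serve ollama://tinyllama
```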

## Build and Development Commands

### Setup

```bash
make install-requirements  # Install dev dependencies via pip
```
### Testing

```bash
# Unit tests (pytest via tox)
make unit-tests          # Run unit tests
make unit-tests-verbose  # Run with full trace output
tox                      # Direct tox invocation

# E2E tests (pytest)
make e2e-tests              # Run with Podman (default)
make e2e-tests-docker       # Run with Docker
make e2e-tests-nocontainer  # Run without container engine

# System tests (BATS)
make bats              # Run BATS system tests
make bats-nocontainer  # Run in nocontainer mode
make bats-docker       # Run with Docker

# All tests
make tests  # Run unit + end-to-end tests
```
### Running a single test

```bash
# Unit test
tox -- test/unit/test_cli.py::test_function_name -vvv

# E2E test
tox -e e2e -- test/e2e/test_basic.py::test_function_name -vvv

# Single BATS file
RAMALAMA=$(pwd)/bin/ramalama bats -T test/system/030-run.bats
```
### Code Quality

```bash
make validate      # Run all validation (codespell, lint, format check, type check)
make lint          # Run flake8 + shellcheck
make check-format  # Check black + isort formatting
make format        # Auto-format with black + isort
make type-check    # Run mypy type checking
make codespell     # Check spelling
```
### Documentation

```bash
make docs  # Build manpages and docsite
```

## Architecture
### Source Structure (`ramalama/`)

- `cli.py` - Main CLI entry point, argparse setup, subcommand dispatch
- `config.py` - Configuration constants and loading
- `common.py` - Shared utility functions, GPU detection (`get_accel()`)
- `engine.py` - Container engine abstraction (Podman/Docker)
### Transport System (`ramalama/transports/`)

Handles pulling/pushing models from different registries:

- `base.py` - Base `Transport` class defining the interface
- `transport_factory.py` - `New()` and `TransportFactory` for creating transports
- `huggingface.py`, `ollama.py`, `oci.py`, `modelscope.py`, `rlcr.py` - Registry-specific implementations
- Transports are selected via URL scheme prefixes: `huggingface://`, `ollama://`, `oci://`, etc. (see the example after this list)
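
Roughly, the scheme prefix maps onto a transport module (the model references below are illustrative, not guaranteed to exist):

```bash
ramalama pull huggingface://some-org/some-repo/model.gguf  # -> huggingface.py
ramalama pull ollama://tinyllama                           # -> ollama.py
ramalama pull oci://quay.io/example/model:latest           # -> oci.py
```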
### Model Store (`ramalama/model_store/`)

Manages local model storage (CLI entry points are sketched after this list):

- `global_store.py` - `GlobalModelStore` for model management
- `store.py` - Low-level storage operations
- `reffile.py` - Reference file handling for tracking model origins
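
The store is easiest to exercise through the CLI (a sketch; confirm subcommand names against `ramalama --help`, and treat the on-disk layout as an implementation detail):

```bash
ramalama ls                     # List models in the local store
ramalama inspect tinyllama      # Show metadata for a pulled model
ramalama rm ollama://tinyllama  # Remove a model and its references
```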
### Command System (`ramalama/command/`)

- `factory.py` - `assemble_command()` builds runtime commands (llama.cpp, vllm, mlx); see the example after this list
- `context.py` - Command execution context
- `schema.py` - Inference spec schema handling
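
The runtime selected on the command line is what `assemble_command()` builds a command for; a hedged sketch (confirm flag placement against `ramalama --help`):

```bash
# llama.cpp is the default runtime
ramalama run ollama://tinyllama

# Ask for a different inference engine
ramalama --runtime vllm serve ollama://tinyllama
```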
### Key Patterns

- **GPU Detection**: `get_accel()` in `common.py` detects the GPU type (cuda, rocm, vulkan, etc.) and selects the appropriate container image; see the sketch after this list
- **Container Images**: GPU-specific images at `quay.io/ramalama/{ramalama,cuda,rocm,intel-gpu,...}`
- **Inference Engines**: llama.cpp (default), vllm, mlx (macOS only) - configured via YAML specs in `inference-spec/engines/`
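
To see what detection concluded on a given machine, or to bypass it (a sketch; confirm both against `ramalama --help`):

```bash
# Report the detected accelerator and the image RamaLama would run
ramalama info

# Override the auto-selected container image for one invocation
ramalama --image quay.io/ramalama/rocm run ollama://tinyllama
```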
## Test Structure

- `test/unit/` - pytest unit tests (fast, no external dependencies)
- `test/e2e/` - pytest end-to-end tests (marked with `@pytest.mark.e2e`)
- `test/system/` - BATS shell tests for full CLI integration testing
## Code Style

- Python 3.10+ required
- Line length: 120 characters
- Formatting: black + isort
- Type hints encouraged (mypy checked)
- Commits require DCO sign-off (`git commit -s`); a pre-push flow is sketched below
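
A plausible pre-push sequence combining the targets above (the commit message is only an example):

```bash
make validate    # codespell + lint + format check + type check
make unit-tests  # fast feedback before the heavier suites
git commit -s -m "fix: example change"  # -s adds the DCO Signed-off-by trailer
```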