# RamaLama
The RamaLama project's goal is to make working with AI boring
through the use of OCI containers.
On first run RamaLama inspects your system for GPU support, falling back to CPU
support if no GPUs are present. It then uses container engines like Podman or
Docker to pull the appropriate OCI image with all of the software necessary to
run an AI Model for your system's setup. This eliminates the need for the user
to configure the system for AI themselves. After the initialization, RamaLama
will run the AI Models within a container based on the OCI image.
RamaLama supports multiple AI model registry types, called transports.
Supported transports:
## TRANSPORTS
| Transports | Web Site |
| ------------- | --------------------------------------------------- |
| HuggingFace | [`huggingface.co` ](https://www.huggingface.co ) |
| Ollama | [`ollama.com` ](https://www.ollama.com ) |
| OCI Container Registries | [`opencontainers.org` ](https://opencontainers.org )|
||Examples: [`quay.io` ](https://quay.io ), [`Docker Hub` ](https://docker.io ), and [`Artifactory` ](https://artifactory.com )|
RamaLama uses the Ollama registry transport by default. Use the RAMALAMA_TRANSPORT environment variable to modify the default. For example, `export RAMALAMA_TRANSPORT=huggingface` changes RamaLama to use the Hugging Face transport.
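For example, assuming the variable is exported in your shell, a pull with no prefix will then use the Hugging Face transport (this reuses the model shown in the prefixed example below):

```
export RAMALAMA_TRANSPORT=huggingface
ramalama pull afrideva/Tiny-Vicuna-1B-GGUF/tiny-vicuna-1b.q2_k.gguf
```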
Individual model transports can be modified when specifying a model via the `huggingface://`, `oci://`, or `ollama://` prefix, for example:
`ramalama pull huggingface://afrideva/Tiny-Vicuna-1B-GGUF/tiny-vicuna-1b.q2_k.gguf`
To make it easier for users, RamaLama uses shortname files, which contain
alias names for fully specified AI Models, allowing users to specify the shorter
names when referring to models. RamaLama reads shortnames.conf files if they
exist. These files contain a list of name-value pairs that specify the model.
The following table lists the order in which RamaLama reads the files.
Any duplicate names that exist override previously defined shortnames.
| Shortnames type | Path |
| --------------- | ---------------------------------------- |
| Distribution | /usr/share/ramalama/shortnames.conf |
| Administrators  | /etc/ramalama/shortnames.conf            |
| Users | $HOME/.config/ramalama/shortnames.conf |
```code
$ cat /usr/share/ramalama/shortnames.conf
[shortnames]
"tiny" = "ollama://tinyllama"
"granite" = "huggingface://instructlab/granite-7b-lab-GGUF/granite-7b-lab-Q4_K_M.gguf"
"granite:7b" = "huggingface://instructlab/granite-7b-lab-GGUF/granite-7b-lab-Q4_K_M.gguf"
"ibm/granite" = "huggingface://instructlab/granite-7b-lab-GGUF/granite-7b-lab-Q4_K_M.gguf"
"merlinite" = "huggingface://instructlab/merlinite-7b-lab-GGUF/merlinite-7b-lab-Q4_K_M.gguf"
"merlinite:7b" = "huggingface://instructlab/merlinite-7b-lab-GGUF/merlinite-7b-lab-Q4_K_M.gguf"
...
```
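You can also define your own shortnames by creating a user-level file in the same format. A minimal sketch (the `tinyvicuna` alias is just an illustrative name):

```
$ mkdir -p $HOME/.config/ramalama
$ cat > $HOME/.config/ramalama/shortnames.conf <<EOF
[shortnames]
"tinyvicuna" = "huggingface://afrideva/Tiny-Vicuna-1B-GGUF/tiny-vicuna-1b.q2_k.gguf"
EOF
$ ramalama run tinyvicuna
```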
## Install
### Install via PyPI
RamaLama is available via PyPI at [https://pypi.org/project/ramalama](https://pypi.org/project/ramalama).
```
pipx install ramalama
```
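Once installed, you can check that the CLI is available:

```
ramalama version
```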
### Install by script
Install RamaLama by running this one-liner (on macOS run without sudo):
Linux:
```
curl -fsSL https://raw.githubusercontent.com/containers/ramalama/s/install.sh | sudo sh
```
macOS:
```
curl -fsSL https://raw.githubusercontent.com/containers/ramalama/s/install.sh | sh
```
## Hardware Support
| Hardware | Enabled |
| ---------------------------------- | ------- |
| CPU | :white_check_mark: |
| Apple Silicon GPU (macOS) | :white_check_mark: |
| Apple Silicon GPU (podman-machine) | :x: |
| Nvidia GPU (cuda) | :x: |
| AMD GPU (rocm) | :white_check_mark: |
## COMMANDS
| Command | Description |
| ------------------------------------------------------ | ---------------------------------------------------------- |
| [ramalama(1) ](docs/ramalama.1.md ) | primary RamaLama man page |
| [ramalama-containers(1) ](docs/ramalama-containers.1.md )| list all RamaLama containers |
| [ramalama-list(1) ](docs/ramalama-list.1.md ) | list all downloaded AI Models |
| [ramalama-login(1) ](docs/ramalama-login.1.md ) | login to remote registry |
| [ramalama-logout(1) ](docs/ramalama-logout.1.md ) | logout from remote registry |
| [ramalama-pull(1) ](docs/ramalama-pull.1.md ) | pull AI Model from Model registry to local storage |
| [ramalama-push(1) ](docs/ramalama-push.1.md ) | push AI Model from local storage to remote registry |
| [ramalama-rm(1) ](docs/ramalama-rm.1.md ) | remove AI Model from local storage |
| [ramalama-run(1) ](docs/ramalama-run.1.md ) | run specified AI Model as a chatbot |
| [ramalama-serve(1) ](docs/ramalama-serve.1.md ) | serve REST API on specified AI Model |
| [ramalama-stop(1) ](docs/ramalama-stop.1.md ) | stop named container that is running AI Model |
| [ramalama-version(1) ](docs/ramalama-version.1.md )    | display version of RamaLama |
## Usage
### Running Models
You can `run` a chatbot on a model using the `run` command. By default, it pulls from the ollama registry.
Note: RamaLama will inspect your machine for native GPU support and then will
use a container engine like Podman to pull an OCI container image with the
appropriate code and libraries to run the AI Model. This can take a long time to set up, but only on the first run.
```
$ ramalama run instructlab/merlinite-7b-lab
Copying blob 5448ec8c0696 [--------------------------------------] 0.0b / 63.6MiB (skipped: 0.0b = 0.00%)
Copying blob cbd7e392a514 [--------------------------------------] 0.0b / 65.3MiB (skipped: 0.0b = 0.00%)
Copying blob 5d6c72bcd967 done 208.5MiB / 208.5MiB (skipped: 0.0b = 0.00%)
Copying blob 9ccfa45da380 [--------------------------------------] 0.0b / 7.6MiB (skipped: 0.0b = 0.00%)
Copying blob 4472627772b1 [--------------------------------------] 0.0b / 120.0b (skipped: 0.0b = 0.00%)
>
```
After the initial container image has been downloaded, you can interact with
different models using the container image.
```
$ ramalama run granite-code
> Write a hello world application in python
print("Hello World")
```
In a different terminal window, you can see the running Podman container.
```
$ podman ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
91df4a39a360 quay.io/ramalama/ramalama:latest /home/dwalsh/rama... 4 minutes ago Up 4 minutes gifted_volhard
```
### Listing Models
You can `list` all models pulled into local storage.
```
$ ramalama list
NAME MODIFIED SIZE
ollama://tiny-llm:latest 16 hours ago 5.5M
huggingface://afrideva/Tiny-Vicuna-1B-GGUF/tiny-vicuna-1b.q2_k.gguf 14 hours ago 460M
ollama://granite-code:3b 5 days ago 1.9G
ollama://granite-code:latest 1 day ago 1.9G
ollama://moondream:latest 6 days ago 791M
```
### Pulling Models
You can `pull` a model using the `pull` command. By default, it pulls from the ollama registry.
```
$ ramalama pull granite-code
################################################### 32.5%
```
### Serving Models
You can `serve` multiple models using the `serve` command. By default, it pulls from the ollama registry.
```
$ ramalama serve --name mylama llama3
```
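The serve command exposes a REST API for the model. As a rough sketch, assuming the default llama.cpp backend and its OpenAI-compatible endpoint listening on the default port 8080 (adjust the port if you configured a different one):

```
curl http://127.0.0.1:8080/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{"messages": [{"role": "user", "content": "Write a haiku about llamas"}]}'
```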
### Stopping servers
You can stop a running model if it is running in a container.
```
$ ramalama stop mylama
```
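To confirm that the container is gone, list the RamaLama containers:

```
$ ramalama containers
```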
## Diagram
```
+---------------------------+
| |
| ramalama run granite-code |
| |
+-------+-------------------+
|
|
| +------------------+
| | Pull model layer |
+----------------------------------------->| granite-code |
+------------------+
| Repo options: |
+-+-------+------+-+
| | |
v v v
+---------+ +------+ +----------+
| Hugging | | quay | | Ollama |
| Face | | | | Registry |
+-------+-+ +---+--+ +-+--------+
| | |
v v v
+------------------+
| Start with |
| llama.cpp and |
| granite-code |
| model |
+------------------+
```
## In development
Regard this as alpha software: everything is under development, so expect breaking changes. Luckily, it's easy to reset everything and reinstall:
```
rm -rf /var/lib/ramalama # only required if running as root user
rm -rf $HOME/.local/share/ramalama
```
and install again.
## Credit where credit is due
This project wouldn't be possible without the help of other projects like:
- llama.cpp
- whisper.cpp
- vllm
- podman
- omlmd
- huggingface
So if you like this tool, give some of these repos a :star:, and hey, give us a :star: too while you are at it.
## Community
[`Matrix` ](https://matrix.to/#/#ramalama:fedoraproject.org )
## Contributors
Open to contributors.
<a href="https://github.com/containers/ramalama/graphs/contributors">
<img src="https://contrib.rocks/image?repo=containers/ramalama" />
</a>