
oshepherd

The Oshepherd guiding the Ollama(s) inference orchestration.

A centralized FastAPI service that uses Celery and Redis to orchestrate multiple Ollama servers as workers.

Install

pip install oshepherd

Usage

  1. Setup Redis:

    Celery uses Redis as its message broker and result backend. You'll need a Redis instance, which you can provision for free at redislabs.com.

  2. Setup FastAPI Server:

    # define the configuration env file,
    # using your Redis credentials for the broker and backend
    cp .api.env.template .api.env
    
    # start api
    oshepherd start-api --env-file .api.env
  3. Setup Celery/Ollama Worker(s):

    # install ollama https://ollama.com/download
    # optionally pull the model
    ollama pull mistral
    
    # define the configuration env file,
    # using your Redis credentials for the broker and backend
    cp .worker.env.template .worker.env
    
    # start worker
    oshepherd start-worker --env-file .worker.env
  4. Now you're ready to execute Ollama completions remotely. Point your Ollama client to your oshepherd API server by setting the host, and it will return the requested completions from any of the workers (see the response-handling note after these examples):

    • Python (ollama-python):
    import ollama
    
    client = ollama.Client(host="http://127.0.0.1:5001")
    ollama_response = client.generate(model="mistral", prompt="Why is the sky blue?")
    • JavaScript (ollama-js):
    import { Ollama } from "ollama/browser";
    
    const ollama = new Ollama({ host: "http://127.0.0.1:5001" });
    const ollamaResponse = await ollama.generate({
        model: "mistral",
        prompt: "Why is the sky blue?",
    });
    • Raw HTTP request:
    curl -X POST -H "Content-Type: application/json" -L http://127.0.0.1:5001/api/generate/ -d '{
        "model": "mistral",
        "prompt":"Why is the sky blue?"
    }'
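
In each case the response mirrors the standard Ollama payload. As a minimal sketch for the Python example above (assuming the worker returns Ollama's regular generate response format, where the completion text lives in the "response" field):

    print(ollama_response["model"])     # "mistral"
    print(ollama_response["response"])  # the completion text
    print(ollama_response["done"])      # True once the completion has finished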

Disclaimers 🚨

This package is in alpha; its architecture and API may change in the near future. It is currently being tested in a controlled environment by real users, but it has not been audited or thoroughly tested. Use it at your own risk.

As this is an alpha version, support and responses might be limited. We'll do our best to address questions and issues as quickly as possible.

API server parity

  • Generate a completion: POST /api/generate
  • Generate a chat completion: POST /api/chat
  • Generate Embeddings: POST /api/embeddings
  • List Local Models: GET /api/tags (pending)
  • Show Model Information: POST /api/show (pending)
  • List Running Models: GET /api/ps (pending)

The oshepherd API server has been designed to maintain compatibility with the endpoints defined by Ollama, ensuring that any official client (e.g., ollama-python, ollama-js) can use this server as its host and receive the expected responses. For the full API specification, refer to the official Ollama API documentation.
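
For example, since POST /api/chat and POST /api/embeddings are exposed with the same contract, the official Python client should work unchanged once pointed at the oshepherd host. A minimal sketch, assuming the host and model from the usage example above:

    import ollama
    
    # point the official client at the oshepherd API server instead of a local Ollama
    client = ollama.Client(host="http://127.0.0.1:5001")
    
    # chat completion, routed to one of the Ollama workers
    chat_response = client.chat(
        model="mistral",
        messages=[{"role": "user", "content": "Why is the sky blue?"}],
    )
    print(chat_response["message"]["content"])
    
    # embeddings follow the same pattern
    embeddings_response = client.embeddings(model="mistral", prompt="Why is the sky blue?")
    print(len(embeddings_response["embedding"]))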

Contribution guidelines

We welcome contributions! If you find a bug or have suggestions for improvements, please open an issue or submit a pull request targeting the development branch. Before creating a new issue or pull request, take a moment to search the existing ones to avoid duplicates.

Conda Support

To run and build locally you can use conda:

conda create -n oshepherd python=3.8
conda activate oshepherd
pip install -r requirements.txt

# install oshepherd
pip install -e .

Tests

Follow the usage instructions to start the API server and a Celery worker using a local Ollama, then run the tests:

pytest -s tests/

Author

This is a project developed and maintained by mnemonica.ai.

License

MIT