The most versatile feature of Ollama is its REST API endpoints. For example, a chat completion can be requested with a single curl call:
curl http://localhost:11434/api/chat -d '{
  "model": "llama2",
  "messages": [
    {
      "role": "user",
      "content": "why is the sky blue?"
    }
  ]
}'
However, these endpoints require a running Ollama instance with the models already pulled, which is difficult to set up in Colab. One approach would be to pull the model through an API call, but that request will time out:
curl http://localhost:11434/api/pull -d '{
  "name": "llama2"
}'
This repo adds a background task that starts Ollama and pulls the intended model, so the server is ready before any API calls are made.
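As a rough illustration (not the repo's actual code), the background task boils down to something like the following shell steps, assuming they are run from a Colab cell; the log path and sleep duration are placeholders:

# start the Ollama server in the background so the cell can return
nohup ollama serve > ollama.log 2>&1 &
# give the server a moment to bind to localhost:11434
sleep 5
# pull the model so later /api/chat calls don't block or time out
ollama pull llama2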
This repo improves on a repo by marco that uses Colab as a GPU host for Ollama. The setup used by marco required the local machine to also have Ollama installed, i.e. Ollama on Colab and Ollama on the local machine. This implementation only requires the Colab notebook to have Ollama, which simplifies integration into existing workflows.
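Once the notebook is running and the ngrok tunnel is up, the local machine only needs curl (or any HTTP client) to reach the Colab-hosted Ollama. The URL below is a placeholder for whatever address ngrok assigns:

curl https://<your-ngrok-url>/api/chat -d '{
  "model": "llama2",
  "messages": [
    {
      "role": "user",
      "content": "why is the sky blue?"
    }
  ]
}'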
special thanks: Ollama on colab by marco
Replace add_your_auth_token_here in the .env file with your ngrok auth token.
Ensure the pulled model is the same model used in the API call.