Gaoqi Zheng / llama-cpp-python
Commit c14201dc, authored 2 years ago by Niek van der Maas
Add Dockerfile + build workflow
parent 9f1e5655
branch main
No related merge requests found

Showing 3 changed files with 48 additions and 1 deletion:
.github/workflows/publish.yaml (+30 -1)
Dockerfile (+10 -0)
README.md (+8 -0)

.github/workflows/publish.yaml (+30 -1)
@@ -28,4 +28,33 @@ jobs:
       # if: startsWith(github.ref, 'refs/tags')
       uses: pypa/gh-action-pypi-publish@release/v1
       with:
-        password: ${{ secrets.PYPI_API_TOKEN }}
\ No newline at end of file
+        password: ${{ secrets.PYPI_API_TOKEN }}
+  docker:
+    name: Build and push Docker image
+    runs-on: ubuntu-latest
+    needs: build-n-publish
+    steps:
+    - name: Checkout
+      uses: actions/checkout@v3
+    - name: Set up QEMU
+      uses: docker/setup-qemu-action@v2
+    - name: Set up Docker Buildx
+      uses: docker/setup-buildx-action@v2
+    - name: Login to GitHub Container Registry
+      uses: docker/login-action@v2
+      with:
+        registry: ghcr.io
+        username: ${{ github.repository_owner }}
+        password: ${{ secrets.GITHUB_TOKEN }}
+    - name: Build and push
+      uses: docker/build-push-action@v4
+      with:
+        push: true # push to registry
+        pull: true # always fetch the latest base images
+        platforms: linux/amd64,linux/arm64 # build for both amd64 and arm64
+        tags: ghcr.io/abetlen/llama-cpp-python:latest
\ No newline at end of file

Dockerfile (new file, mode 0 → 100644, +10 -0)
+FROM python:3-buster
+
+# We need to set the host to 0.0.0.0 to allow outside access
+ENV HOST 0.0.0.0
+
+# Install the package
+RUN pip install llama-cpp-python[server]
+
+# Run the server
+CMD python3 -m llama_cpp.server
\ No newline at end of file
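
The Dockerfile comment says the host must be 0.0.0.0 to allow outside access. The following is a minimal sketch of why that matters, not the server's actual startup code: a socket bound to a loopback address only accepts connections from inside the container, while 0.0.0.0 listens on every interface, which is what lets a published port reach the process. The helper name `bind_from_env` and the `127.0.0.1` fallback are assumptions made for illustration; the diff does not show how `llama_cpp.server` reads `HOST`.

```python
import os
import socket


def bind_from_env(env=None, default_host="127.0.0.1"):
    """Bind a TCP socket to the address named by HOST and return (socket, address).

    Hypothetical helper: the loopback default is an assumption for this sketch.
    """
    env = os.environ if env is None else env
    host = env.get("HOST", default_host)
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.bind((host, 0))  # port 0 lets the OS pick any free port
    return sock, sock.getsockname()


if __name__ == "__main__":
    # Compare the unset case with the value the Dockerfile sets.
    for env in ({}, {"HOST": "0.0.0.0"}):
        sock, (host, port) = bind_from_env(env)
        print(f"HOST={env.get('HOST', '(unset)')}: bound to {host}:{port}")
        sock.close()
```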

README.md (+8 -0)
@@ -66,6 +66,14 @@ python3 -m llama_cpp.server

 Navigate to [http://localhost:8000/docs](http://localhost:8000/docs) to see the OpenAPI documentation.

+## Docker image
+
+A Docker image is available on [GHCR](https://ghcr.io/abetlen/llama-cpp-python). To run the server:
+
+```bash
+docker run --rm -it -p8000:8000 -v /path/to/models:/models -eMODEL=/models/ggml-vicuna-7b-4bit.bin ghcr.io/abetlen/llama-cpp-python:latest
+```
+
 ## Low-level API

 The low-level API is a direct `ctypes` binding to the C API provided by `llama.cpp`.
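
The `docker run` line added to the README combines three settings: a published port, a bind mount for the model directory, and a `MODEL` environment variable pointing at a file inside that mount. As a rough sketch of how the pieces relate, the hypothetical helper below (`docker_run_command` is not part of the repository) assembles the same arguments from parameters. It writes the flags in the spaced form (`-p 8000:8000`, `-e MODEL=...`), which Docker treats the same as the compact form used in the README.

```python
import shlex
from pathlib import PurePosixPath


def docker_run_command(models_dir, model_file, port=8000,
                       image="ghcr.io/abetlen/llama-cpp-python:latest"):
    """Return the argv list for running the server image (illustrative helper)."""
    container_dir = PurePosixPath("/models")  # mount point used in the README
    return [
        "docker", "run", "--rm", "-it",
        "-p", f"{port}:{port}",                       # host port : container port
        "-v", f"{models_dir}:{container_dir}",        # host dir : container dir
        "-e", f"MODEL={container_dir / model_file}",  # model path inside the container
        image,
    ]


if __name__ == "__main__":
    cmd = docker_run_command("/path/to/models", "ggml-vicuna-7b-4bit.bin")
    print(shlex.join(cmd))
```

Keeping the mount point and the `MODEL` path derived from one `container_dir` value avoids the common mistake of passing a host path in `MODEL`, which the process inside the container cannot see.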