Skip to content

Commit 69e9dd6

Browse files
committed
feat: Modify dockerfile to get INFERENCE_SERVER from env variables
vidy-br: modify-dockerfile
1 parent 80bac3c commit 69e9dd6

File tree

2 files changed

+21
-7
lines changed

2 files changed

+21
-7
lines changed

README.md

Lines changed: 13 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -44,13 +44,24 @@ python scripts/example_batch_request.py
4444
docker build -t llm-inference-server .
4545
```
4646

47+
### Run the S3 inference server
48+
4749
```bash
4850
docker run -d \
49-
-e HUGGING_FACE_TOKEN=${HUGGING_FACE_TOKEN} \
5051
-e S3_ACCESS_KEY_ID=${S3_ACCESS_KEY_ID} \
5152
-e S3_SECRET_ACCESS_KEY=${S3_SECRET_ACCESS_KEY} \
5253
-e S3_ENDPOINT_URL=${S3_ENDPOINT_URL} \
53-
-v my_docker_volume:/app/models \
54+
-e INFERENCE_SERVER=llm_inference.s3_inference_server \
55+
-p 8000:8000 \
56+
llm-inference-server
57+
```
58+
59+
### Run the HuggingFace inference server
60+
61+
```bash
62+
docker run -d \
63+
-e HUGGING_FACE_TOKEN=${HUGGING_FACE_TOKEN} \
64+
-e INFERENCE_SERVER=llm_inference.hf_inference_server \
5465
-p 8000:8000 \
5566
llm-inference-server
5667
```

dockerfile

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ ENV PYTHONUNBUFFERED=1 \
66
POETRY_HOME="/opt/poetry" \
77
POETRY_VENV="/opt/poetry-venv" \
88
POETRY_CACHE_DIR="/opt/.cache" \
9-
PYTHONPATH="/app:$PYTHONPATH"
9+
PYTHONPATH="/app"
1010

1111
# Install system dependencies
1212
RUN apt-get update && apt-get install -y \
@@ -34,11 +34,14 @@ RUN poetry config virtualenvs.create false \
3434
COPY llm_inference /app/llm_inference
3535
COPY scripts /app/scripts
3636

37-
# Create a volume for the models
38-
VOLUME /app/models
39-
4037
# Expose port 8000 to the host
4138
EXPOSE 8000
4239

40+
# Define a build-time argument with a default value
41+
ARG INFERENCE_SERVER=llm_inference.s3_inference_server
42+
43+
# Set an environment variable using the argument
44+
ENV INFERENCE_SERVER=${INFERENCE_SERVER}
45+
4346
# Run the application
44-
CMD ["poetry", "run", "python", "-m", "llm_inference.s3_inference_server"]
47+
CMD ["sh", "-c", "poetry run python -m ${INFERENCE_SERVER}"]

0 commit comments

Comments
 (0)