LUMI-127 adding inference API #328

aittalam · 2024-10-30T21:17:11Z

What's changing

Added support for inference in our API:

new schema for input parameters and update job route
created ad-hoc inference configuration template and updated job service code to populate it
Updated inference job to support the new configuration:
defined an InferenceJobConfig pydantic class to validate input dictionary
updated inference code to make use of it
added unit tests to validate minimal / full configurations

How to test it

unit tests on backend and SDK with make test
unit tests on inference job: cd lumigator/python/mzai/jobs/inference; uv run --with pytest --with-requirements requirements.txt --isolated pytest
manual test: test_dataset_upload && test_inference_mistral

test_dataset_upload:

#!/bin/bash
if [ "$#" -gt 0 ]; then
    DATA_CSV_PATH="$1"
else
    DATA_CSV_PATH="$HOME/Downloads/thunderbird_gt_bart.csv"
fi

if [[ -z "${BACKEND_URL}" ]]; then
  BACKEND_URL=http://localhost:8000
fi

echo Connecting to $BACKEND_URL...

curl -s $BACKEND_URL/api/v1/datasets/ \
  -H 'Accept: application/json' \
  -H 'Content-Type: multipart/form-data' \
  -F 'dataset=@'"$DATA_CSV_PATH"';type=text/csv' \
  -F 'format=job' | jq

test_inference_mistral:

#!/bin/bash

if [[ -z "${BACKEND_URL}" ]]; then
  BACKEND_URL=http://localhost:8000
fi

DATASET_ID=$(curl -s $BACKEND_URL/api/v1/datasets/ | jq -r '.items |sort_by(.created_at) | reverse | .[0].id')

EVAL_NAME="test_run_inference"
EVAL_DESC="Test run for inference with Mistral"
EVAL_MODEL="mistral://open-mistral-7b"
EVAL_DATASET=$DATASET_ID
EVAL_MAX_SAMPLES="0"

JSON_STRING=$(jq -n \
                --arg name "$EVAL_NAME" \
                --arg desc "$EVAL_DESC" \
                --arg model "$EVAL_MODEL" \
                --arg dataset_id "$EVAL_DATASET" \
                --arg max_samples "$EVAL_MAX_SAMPLES" \
                '{name: $name, description: $desc, model: $model, dataset: $dataset_id, max_samples: $max_samples}' )

echo Connecting to $BACKEND_URL...

curl -s $BACKEND_URL/api/v1/jobs/inference/ \
  -H 'Accept: application/json' \
  -H 'Content-Type: application/json' \
  -d "$JSON_STRING" | jq

Additional notes for reviewers

I already...

added some tests for any new functionality
updated the documentation: NO (the current job did not exist before and I wanted to complete the sequence of PRs before having full documentation for this, please LMK if that's ok otherwise I am happy to add some)
checked if a (backend) DB migration step was required and included it if required: no DB migration step was required (this PR does not change the tables schema)

.vscode/settings.json

lumigator/python/mzai/backend/backend/services/jobs.py

lumigator/python/mzai/jobs/inference/inference_config.py

aittalam · 2024-10-30T21:37:59Z

@javiermtorres I added you as a reviewer because I saw the SDK integration test failed, but I had no issues testing locally with make test. lmk how I can help fix that (or do schemas just need to be rebuilt?)

javiermtorres · 2024-10-31T08:36:38Z

@aittalam I will include the SDK integration tests in the test target (maybe they're not already included).

The issue is the split of JobCreate, so it is not found by the integration tests. Let me provide a branch that can be merged into yours to solve this issue.

lumigator/python/mzai/schemas/schemas/jobs.py

veekaybee · 2024-10-31T18:47:39Z

For reference, complementary job in progress: #333

lumigator/python/mzai/backend/backend/api/routes/jobs.py

lumigator/python/mzai/jobs/inference/inference.py

lumigator/python/mzai/jobs/inference/tests/conftest.py

ividal

Thanks, @aittalam !

I'd be happy to ship this and start using it if we can remove part of the dupe code create_inference and create_eval have 😄 Just cleaning up one corner at a time.

.vscode/settings.json

lumigator/python/mzai/backend/backend/services/jobs.py

lumigator/python/mzai/jobs/inference/inference_config.py

Includes SDK code, and mocked and integration tests. Co-authored-by: Javier Torres <[email protected]>

* Support Alembic in tests --------- Signed-off-by: Peter Wilson <[email protected]>

* Changed local development favoring docker-compose watch&sync instead of volume mounting to avoid .venv conflicts --------- Signed-off-by: Alejandro Gonzalez <[email protected]> Co-authored-by: Alejandro Gonzalez <[email protected]>

* Update README.md Signed-off-by: Nate Brake <[email protected]> * Update README.md Ooh good idea, thanks! Co-authored-by: Vicki Boykis <[email protected]> Signed-off-by: Nate Brake <[email protected]> --------- Signed-off-by: Nate Brake <[email protected]> Co-authored-by: Vicki Boykis <[email protected]>

Signed-off-by: Davide Eynard <[email protected]>

aittalam · 2024-11-14T14:34:49Z

Ok all, here are my updates:

addressed reviews
rebased / merged code with current main
re-tested with pytest
re-tested with local tests specifically on the inference job
re-tested with API calls by manually submitting new inference jobs (which btw are not captured by integration tests yet, I'll add an issue for the next PR)

LMK if anything else should be done or if this is good to go

lumigator/python/mzai/backend/backend/config_templates.py

lumigator/python/mzai/backend/backend/services/datasets.py

lumigator/python/mzai/sdk/tests/test_jobs.py

veekaybee · 2024-11-14T17:07:32Z

LGTM after those few changes, thanks for all your patience on this! 🚀

lumigator/python/mzai/backend/backend/services/jobs.py

lumigator/python/mzai/jobs/inference/tests/test_configs.py

Co-authored-by: Peter Wilson <[email protected]> Signed-off-by: Davide Eynard <[email protected]>

aittalam added 2 commits October 29, 2024 19:21

First pass at inference API

7348558

Added tests

edc774e

aittalam requested review from veekaybee, ividal and dpoulopoulos October 30, 2024 21:17

aittalam commented Oct 30, 2024

View reviewed changes

.vscode/settings.json Show resolved Hide resolved

aittalam commented Oct 30, 2024

View reviewed changes

lumigator/python/mzai/backend/backend/services/jobs.py Outdated Show resolved Hide resolved

aittalam commented Oct 30, 2024

View reviewed changes

lumigator/python/mzai/backend/backend/services/jobs.py Show resolved Hide resolved

aittalam commented Oct 30, 2024

View reviewed changes

lumigator/python/mzai/jobs/inference/inference_config.py Show resolved Hide resolved

aittalam requested a review from javiermtorres October 30, 2024 21:36

javiermtorres reviewed Oct 31, 2024

View reviewed changes

lumigator/python/mzai/schemas/schemas/jobs.py Outdated Show resolved Hide resolved