add image support to NVIDIA inference provider #907

mattf · 2025-01-30T19:24:46Z

What does this PR do?

add support to the NVIDIA Inference provider for image inputs

Test Plan

Run local Llama 3.2 11b vision instruct NIM
Start a stack, e.g. llama stack run llama_stack/templates/nvidia/run.yaml --env NVIDIA_BASE_URL=http://localhost:8000
Run image tests, e.g. LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_inference.py --vision-inference-model meta-llama/Llama-3.2-11B-Vision-Instruct -k image

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Ran pre-commit to handle lint / formatting issues.
Read the contributor guideline,
Pull Request section?
Updated relevant documentation.
Wrote necessary unit or integration tests.

mattf · 2025-01-31T16:34:20Z

@ashwinb @yanxi0830 ptal

llama_stack/providers/remote/inference/nvidia/openai_utils.py

ashwinb

Cool looks good!

# What does this PR do? add support to the NVIDIA Inference provider for image inputs ## Test Plan 1. Run local [Llama 3.2 11b vision instruct](https://build.nvidia.com/meta/llama-3.2-11b-vision-instruct?snippet_tab=Docker) NIM 2. Start a stack, e.g. `llama stack run llama_stack/templates/nvidia/run.yaml --env NVIDIA_BASE_URL=http://localhost:8000` 3. Run image tests, e.g. `LLAMA_STACK_BASE_URL=http://localhost:8321 pytest -v tests/client-sdk/inference/test_inference.py --vision-inference-model meta-llama/Llama-3.2-11B-Vision-Instruct -k image` ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [x] Ran pre-commit to handle lint / formatting issues. - [x] Read the [contributor guideline](https://github.com/meta-llama/llama-stack/blob/main/CONTRIBUTING.md), Pull Request section? - [ ] Updated relevant documentation. - [x] Wrote necessary unit or integration tests.

add image support to NVIDIA inference provider

bcd14cc

mattf requested review from ashwinb, yanxi0830, hardikjshah, dltn, raghotham, dineshyv, vladimirivic and sixianyi0721 as code owners January 30, 2025 19:24

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 30, 2025

mattf added 4 commits January 31, 2025 08:59

Merge branch 'main' into add-image-support

a8027d2

update to new data type

0d38ba7

detect image.data mime type with pillow

cef35bb

use convert_image_content_to_url

bb21436

ashwinb reviewed Jan 31, 2025

View reviewed changes

llama_stack/providers/remote/inference/nvidia/openai_utils.py Show resolved Hide resolved

ashwinb approved these changes Jan 31, 2025

View reviewed changes

mattf requested a review from ashwinb January 31, 2025 22:58

ashwinb merged commit e21c8b6 into meta-llama:main Feb 1, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add image support to NVIDIA inference provider #907

add image support to NVIDIA inference provider #907

mattf commented Jan 30, 2025 •

edited

Loading

mattf commented Jan 31, 2025

ashwinb left a comment

add image support to NVIDIA inference provider #907

add image support to NVIDIA inference provider #907

Conversation

mattf commented Jan 30, 2025 • edited Loading

What does this PR do?

Test Plan

Before submitting

mattf commented Jan 31, 2025

ashwinb left a comment

Choose a reason for hiding this comment

mattf commented Jan 30, 2025 •

edited

Loading