[Bug] Fixed the yaml of deployments in heterogenous GPU settings to make KPA scaling work as expected. #513

Merged 1 commit on Dec 10, 2024

zhangjyr
Collaborator

Pull Request Description

The root cause of #507 is that the spec.selector in the mock app deployment configuration matched only the model label and was missing a label that identifies the individual deployment. With the mock app deployment fixed, KPA scaling now works as expected.
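
To illustrate why a model-only selector is ambiguous, here is a minimal Python sketch of equality-based label matching, where a pod matches when every key/value pair in `matchLabels` is present on it. The label keys (`model`, `deployment`), their values, and the pod names are hypothetical and are not taken from the aibrix manifests. How KPA consumes the selector is not shown here.

```python
def selector_matches(match_labels, pod_labels):
    """Equality-based matching: every selector key/value must be on the pod."""
    return all(pod_labels.get(k) == v for k, v in match_labels.items())


# Hypothetical pods from two deployments that serve the same model
# on different GPU types. Names and labels are for illustration only.
pods = {
    "mock-a100-pod": {"model": "mock-llm", "deployment": "mock-a100"},
    "mock-a40-pod": {"model": "mock-llm", "deployment": "mock-a40"},
}


def matched(selector):
    """Return the sorted names of pods that the selector matches."""
    return sorted(
        name for name, labels in pods.items()
        if selector_matches(selector, labels)
    )


# Selector that identifies the model only: it matches pods of both deployments.
before = matched({"model": "mock-llm"})

# Selector that also identifies the deployment: it matches only its own pods.
after = matched({"model": "mock-llm", "deployment": "mock-a100"})

print(before)
print(after)
```

In this sketch the model-only selector returns pods from both deployments, while the selector with the extra deployment label returns only the pod that belongs to it.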

Related Issues

Resolves: #507

Important: Before submitting, please complete the description above and review the checklist below.


Contribution Guidelines (Expand for Details)

We appreciate your contribution to aibrix! To ensure a smooth review process and maintain high code quality, please adhere to the following guidelines:

Pull Request Title Format

Your PR title should start with one of these prefixes to indicate the nature of the change:

  • [Bug]: Corrections to existing functionality
  • [CI]: Changes to build process or CI pipeline
  • [Docs]: Updates or additions to documentation
  • [API]: Modifications to aibrix's API or interface
  • [CLI]: Changes or additions to the Command Line Interface
  • [Misc]: For changes not covered above (use sparingly)

Note: For changes spanning multiple categories, use multiple prefixes in order of importance.

Submission Checklist

  • PR title includes appropriate prefix(es)
  • Changes are clearly explained in the PR description
  • New and existing tests pass successfully
  • Code adheres to project style and best practices
  • Documentation updated to reflect changes (if applicable)
  • Thorough testing completed, no regressions introduced

By submitting this PR, you confirm that you've read these guidelines and your changes align with the project's contribution standards.

@zhangjyr zhangjyr requested review from Jeffwan and nwangfw December 10, 2024 07:57
@Jeffwan
Collaborator

Jeffwan commented Dec 10, 2024

Using a different label to distinguish the app makes sense to me.

@Jeffwan Jeffwan merged commit cc526f7 into main Dec 10, 2024
2 checks passed
@Jeffwan Jeffwan deleted the issues/507_KPA_scaling_results_in_heterogeneous_GPU branch December 10, 2024 18:35
gangmuk pushed a commit that referenced this pull request Jan 25, 2025
…ake KPA scaling work as expected. (#513)

Make spec.selector.matchLabels combination unique for deployments in the model.

Co-authored-by: Jingyuan Zhang <[email protected]>
Successfully merging this pull request may close these issues.

KPA scaling results in heterogeneous GPU