[Inference] FP8 gemm auto-tune #9094

Merged: 4 commits merged into PaddlePaddle:develop from fp8_cutlass_tune on Sep 11, 2024

Conversation

@ckl117 (Contributor) commented Sep 6, 2024

PR types

New features

PR changes

Others

Description

FP8 GEMM auto-tuning.


paddle-bot bot commented Sep 6, 2024

Thanks for your contribution!


CLAassistant commented Sep 6, 2024

CLA assistant check
All committers have signed the CLA.

@ckl117 changed the title from "Fp8 cutlass tune" to "[Inference] FP8 gemm auto-tune" on Sep 6, 2024
@@ -107,6 +117,12 @@ def get_gencode_flags():
os.makedirs(cutlass_dir)
clone_git_repo("v3.5.0", "https://github.com/NVIDIA/cutlass.git", cutlass_dir)

json_dir = "third_party/nlohmann_json"
Collaborator


Move cutlass into third_party as well.

Contributor Author


done
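
For illustration, the review suggestion above (keeping cutlass alongside nlohmann_json under third_party) could look roughly like the sketch below. The `THIRD_PARTY_ROOT` constant and the `dep_dir` and `needs_clone` helpers are hypothetical names introduced here, not code from this PR; the script's own `clone_git_repo` helper is not defined in the hunk shown, so it appears only in a comment.

```python
import os

# Hypothetical constant (not from the PR): shared root for fetched dependencies.
THIRD_PARTY_ROOT = "third_party"


def dep_dir(name, root=THIRD_PARTY_ROOT):
    """Return the checkout path for a dependency under the shared root."""
    return os.path.join(root, name)


def needs_clone(path):
    """Return True when the directory is missing or empty (nothing checked out yet)."""
    return not os.path.isdir(path) or not os.listdir(path)


cutlass_dir = dep_dir("cutlass")
json_dir = dep_dir("nlohmann_json")

# In the setup script, a missing checkout would then be created and fetched,
# assuming clone_git_repo keeps the (tag, url, target_dir) call shape seen in the diff:
#   if needs_clone(cutlass_dir):
#       os.makedirs(cutlass_dir, exist_ok=True)
#       clone_git_repo("v3.5.0", "https://github.com/NVIDIA/cutlass.git", cutlass_dir)
```

Routing both paths through one helper keeps the layout consistent, so adding another dependency later only needs a new `dep_dir(...)` call.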


codecov bot commented Sep 9, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.34%. Comparing base (2f31866) to head (0453e8e).
Report is 227 commits behind head on develop.

Additional details and impacted files
@@           Coverage Diff            @@
##           develop    #9094   +/-   ##
========================================
  Coverage    53.34%   53.34%           
========================================
  Files          652      652           
  Lines       105401   105401           
========================================
  Hits         56222    56222           
  Misses       49179    49179           


@ckl117 force-pushed the fp8_cutlass_tune branch 2 times, most recently from 00d2585 to 81971ff, on September 9, 2024 08:26
yuanlehome previously approved these changes Sep 9, 2024
@qingqing01 merged commit 3675ea2 into PaddlePaddle:develop on Sep 11, 2024
6 of 7 checks passed