Fix the memory overflow bug of the tune_cublaslt_gemm operator #9076

Hanyonggong · 2024-09-04T01:51:56Z

PR types

Bug fixes

PR changes

Others

Description

使用cudaMalloc/cudaFree代替paddle::Tensor，用以解决传入M过大会out of memory问题。由于将把显存交给paddle去管理，而paddle的显存管理策略会导致长时间运行之后（反反复复申请又释放），最终导致oom。

paddle-bot · 2024-09-04T01:52:11Z

Thanks for your contribution!

codecov · 2024-09-04T04:49:00Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.50%. Comparing base (9939f84) to head (4245557).
Report is 219 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9076      +/-   ##
===========================================
+ Coverage    53.44%   53.50%   +0.05%     
===========================================
  Files          652      652              
  Lines       105187   105187              
===========================================
+ Hits         56214    56277      +63     
+ Misses       48973    48910      -63

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

DesmonDay

LGTM

…ePaddle#9076) * fix bug * add cudacheck

fix bug

d10209b

Hanyonggong changed the title ~~修复tune_cublaslt_gemm算子M过大会内存溢出问题~~ Fix the memory overflow issue of the tune_cublaslt_gemm operator Sep 4, 2024

Hanyonggong changed the title ~~Fix the memory overflow issue of the tune_cublaslt_gemm operator~~ Fix the memory overflow bug of the tune_cublaslt_gemm operator Sep 4, 2024

add cudacheck

4245557

yuanlehome approved these changes Sep 4, 2024

View reviewed changes

DesmonDay approved these changes Sep 4, 2024

View reviewed changes

DesmonDay merged commit fbbc0a2 into PaddlePaddle:develop Sep 4, 2024
11 of 12 checks passed

ckl117 pushed a commit to ckl117/PaddleNLP that referenced this pull request Sep 9, 2024

Fix the memory overflow bug of the tune_cublaslt_gemm operator (Paddl…

7640f29

…ePaddle#9076) * fix bug * add cudacheck

Mangodadada pushed a commit to Mangodadada/PaddleNLP that referenced this pull request Sep 10, 2024

Fix the memory overflow bug of the tune_cublaslt_gemm operator (Paddl…

6969b80

…ePaddle#9076) * fix bug * add cudacheck

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the memory overflow bug of the tune_cublaslt_gemm operator #9076

Fix the memory overflow bug of the tune_cublaslt_gemm operator #9076

Hanyonggong commented Sep 4, 2024 •

edited by DesmonDay

Loading

paddle-bot bot commented Sep 4, 2024

codecov bot commented Sep 4, 2024 •

edited

Loading

DesmonDay left a comment

Fix the memory overflow bug of the tune_cublaslt_gemm operator #9076

Fix the memory overflow bug of the tune_cublaslt_gemm operator #9076

Conversation

Hanyonggong commented Sep 4, 2024 • edited by DesmonDay Loading

PR types

PR changes

Description

paddle-bot bot commented Sep 4, 2024

codecov bot commented Sep 4, 2024 • edited Loading

Codecov Report

DesmonDay left a comment

Choose a reason for hiding this comment

Hanyonggong commented Sep 4, 2024 •

edited by DesmonDay

Loading

codecov bot commented Sep 4, 2024 •

edited

Loading