TFactor: use optimised kernel #1256
152f8e7 to 0d2bd4f
Looks good.
I am requesting changes only to prevent a merge before the base branch is updated.
175fc6d to aaba87f
0d2def8 to c306078
efed0f0 to 8b66843
most trivial solution, at least as a starting point
8b66843 to 1017eab
cscs-ci run
For what concerns the |
Codecov Report
All modified and coverable lines are covered by tests ✅

Coverage Diff
              master    #1256   +/-
Coverage      95.07%   95.07%
Files            141      141
Lines           8655     8655
Branches        1110     1110
Hits            8229     8229
Misses           239      239
Partials         187      187
As part of a joint effort to optimise TFactor, #1219 introduced a faster kernel for LARFT. This PR applies that kernel on top of #1214, which parallelises it.
TODO:
think about the best way to either copy taus or provide them to (EDIT: tracked by "TFactor: Store taus on GPU for reduction to band and its back transformation", #1282) _fixtau
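For readers unfamiliar with LARFT, the following is a minimal reference sketch of what the operation computes in the real, forward, columnwise case as defined by LAPACK: given k Householder vectors stored as the columns of V and their scalar factors taus, it builds the upper-triangular T such that H_0 H_1 ... H_{k-1} = I - V T V^T. It is not the optimised kernel from #1219 nor the parallel scheme from #1214; the function name and the list-of-rows matrix layout are assumptions made only for illustration.

```python
def larft_forward_columnwise(V, taus):
    """Reference T factor (real case, forward direction, columnwise storage).

    V    : n x k matrix as a list of rows; column j holds Householder vector v_j.
    taus : k scalar factors, so that H_j = I - taus[j] * v_j * v_j^T.
    Returns the k x k upper-triangular T with H_0 ... H_{k-1} = I - V T V^T.
    """
    n, k = len(V), len(taus)
    T = [[0.0] * k for _ in range(k)]
    for j in range(k):
        tau = taus[j]
        T[j][j] = tau
        if j == 0 or tau == 0.0:
            # First column has only the diagonal; tau == 0 means H_j = I.
            continue
        # w = V[:, :j]^T * V[:, j]
        w = [sum(V[r][i] * V[r][j] for r in range(n)) for i in range(j)]
        # T[:j, j] = -tau * T[:j, :j] * w, using that T[:j, :j] is upper triangular.
        for i in range(j):
            T[i][j] = -tau * sum(T[i][l] * w[l] for l in range(i, j))
    return T


# Hypothetical input: two reflectors acting on vectors of length 3.
V = [[1.0, 0.0],
     [0.5, 1.0],
     [0.25, 0.5]]
taus = [1.2, 0.8]
T = larft_forward_columnwise(V, taus)
```

The sketch makes visible why the taus matter for the TODO above: every column of T is scaled by the corresponding tau, so whichever device computes T needs the taus available there.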