forked from CEED/libCEED
-
Notifications
You must be signed in to change notification settings - Fork 0
Convert basic parallel_for kernel launches to nd_range #46
Copy link
Copy link
Open
Labels
Milestone
Description
Details
The work group size chosen by the SYCL runtime for the basic parallel_for kernel launches is suboptimal in some cases. These should be converted to the nd_range version and an appropriate default work group size should be found in each case.
Note: basic parallel_for usage is limited to the sycl-ref backend.
Todo
- Get baseline performance data for relevant kernels
- Evaluate performance using
nd_rangekernel launch
Vector
- Norm (x3)
- Reciprocal
- Scale
- AXPY
- PointwiseMult
ElementRestriction
Basis
QFunction
Operator
Reactions are currently unavailable