binary_algebra_v0.2.0-gpu
Pre-release
Pre-release
First fully working and well optimized GPU version.
I will still add more functionality (such as slices to then support MPI), and I will implement a system to switch between GPU, CPU and non-Openmp CPU depending on the workload.