-
Updated
Nov 11, 2021 - C++
sycl
Here are 68 public repositories matching this topic...
Bug summary
There is evidence that sub_group::get_group_id() does not return the same value as threadIdx.x / warpSize (assuming 1D kernel), as expected on CUDA. We should check the implementation of this function. Our implementation of this function performs bit manipulation magic, presumably the optimization went to far...
To Reproduce
Compare sub_group{}.get_group_id() or `sub
The problem is that the OpenCL types in https://github.com/triSYCL/triSYCL/blob/master/include/triSYCL/opencl_types.hpp are defined on the host according to the x86-64 Linux ABI which depends on the CPU & OS instead of using the description from https://www.khronos.org/registry/OpenCL/specs/2.2/html/OpenCL_C.html#built-in-scalar-data-types
Note that the system-wide cl_size_t has been removed
-
Updated
Nov 5, 2021 - C
The cl_ types are not needed in general, however they should still be kept in the OpenCL C interop sample code.
-
Updated
Nov 3, 2021 - HTML
-
Updated
Oct 8, 2021 - C++
-
Updated
Nov 11, 2021 - Python
-
Updated
Oct 21, 2021 - C++
-
Updated
Oct 28, 2021 - C++
-
Updated
Nov 8, 2021 - C++
-
Updated
Dec 17, 2020 - C++
-
Updated
Mar 31, 2020 - C++
Things like XILINX_XRT which are set by the user and others like XILINX_DEVICE_ONLY that are set by the compiler but affect the code users compile may be useful to add.
DEBUG variable should be removed from numba_dppy.compiler.
It will require replace callers code.
It is for separate PR.
Originally posted by @PokhodenkoSA in IntelPython/numba-dppy#354 (comment)
-
Updated
Feb 19, 2019 - C++
Some of the gt-blas calls require temporary space, particularly the SYCL backend. Having a managed storage helper class would be useful for implementing this, in a nice RIAA way.
-
Updated
Jul 5, 2021 - C++
Improve this page
Add a description, image, and links to the sycl topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the sycl topic, visit your repo's landing page and select "manage topics."
This patch adds template argument to the
sycl::buffer_allocatorclass (according to SYCL 2020 spec,sycl::buffer_allocatoris templated on the data type.) The newsycl::buffer_allocatoris defined as a class to let C++17's class template argument deduction feature work. So, the old code that uses `sycl::buffer_al