


#OPENCL BENCHMARK FOR 2016 AMD GPUS MAC OS X#
nVidia GeForce GT 650M GPU in a host with Mac OS X 10.9.2 installed.nVidia GeForce GTX 980 GPU in a host with Ubuntu 14.04 64-bit Linux installed.Intel Xeon Kinghts Landing Phi (KNL) 7210 in a host with CentOS 7.2 64-bit Linux installed.Intel Xeon Knights Landing Phi (KNL) AVX-512 version Currently, only 64-bit double precision SpMV is supported.Intel Xeon E5-2667 v3 dual-socket CPUs with Redhat 6.5 64-bit Linux installed.Intel Core i7-4770R CPU with Ubuntu 14.04 64-bit Linux installed.For example, use source /opt/intel/composer_xe_2015.1.133/bin/compilervars.sh intel64, Set environments for the Intel C/C++ Compilers.(Jul 2016, avx2): Fixed a bug in processing small matrices.
#OPENCL BENCHMARK FOR 2016 AMD GPUS DRIVERS#
(Jul 2016, avx2): Improved performance of y-vector update. Drivers We will concentrate here on AMD GPUs as they offer the best hashing performance for the money when it comes to hashing power, the cost of the GPU. (Jul 2016, phi): fixed the same two issues in the original AVX2 version. The Radeon GPU Profiler is a performance tool that can be used by developers to optimize DirectX12, Vulkan and OpenCL applications for AMD RDNA and GCN hardware. (Jan 2017, avx512 and opencl): added two versions: AVX-512 for Knights Landing Phi (KNL) and OpenCL for nVidia GPUs. In Proceedings of the 29th ACM international conference on Supercomputing (ICS '15), pp.339-350, 2015.Ĭontact: Weifeng Liu and Brian Vinter (vinter at nbi.ku.dk). Weifeng Liu and Brian Vinter, "CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication".
