[clfft] 92/128: Double Precision pragma enable in Square Transpose generator

Ghislain Vaillant ghisvail-guest at moszumanska.debian.org
Thu Oct 22 14:54:43 UTC 2015


This is an automated email from the git hooks/post-receive script.

ghisvail-guest pushed a commit to branch master
in repository clfft.

commit da3965cc928bc8b962b0294d8f60b5062b441f83
Author: Pradeep <pradeep.rao at amd.com>
Date:   Thu Oct 1 12:05:51 2015 +0530

    Double Precision pragma enable in Square Transpose generator
---
 src/library/generator.transpose.square.cpp | 8 ++++++++
 1 file changed, 8 insertions(+)

diff --git a/src/library/generator.transpose.square.cpp b/src/library/generator.transpose.square.cpp
index 357dfad..230c32d 100644
--- a/src/library/generator.transpose.square.cpp
+++ b/src/library/generator.transpose.square.cpp
@@ -278,6 +278,14 @@ static clfftStatus genTransposeKernel( const FFTGeneratedTransposeSquareAction::
     case CLFFT_DOUBLE_FAST:
         dtPlanar = "double";
         dtComplex = "double2";
+
+		// Emit code that enables double precision in the kernel
+        clKernWrite( transKernel, 0 ) << "#ifdef cl_khr_fp64" << std::endl;
+        clKernWrite( transKernel, 3 ) << "#pragma OPENCL EXTENSION cl_khr_fp64 : enable" << std::endl;
+        clKernWrite( transKernel, 0 ) << "#else" << std::endl;
+        clKernWrite( transKernel, 3 ) <<  "#pragma OPENCL EXTENSION cl_amd_fp64 : enable" << std::endl;
+        clKernWrite( transKernel, 0 ) << "#endif\n" << std::endl;
+
         break;
     default:
         return CLFFT_TRANSPOSED_NOTIMPLEMENTED;

-- 
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/debian-science/packages/clfft.git



More information about the debian-science-commits mailing list