[arrayfire] branch debian/experimental updated (7803782 -> fc8d0ad)
Ghislain Vaillant
ghisvail-guest at moszumanska.debian.org
Sun Feb 7 18:59:12 UTC 2016
This is an automated email from the git hooks/post-receive script.
ghisvail-guest pushed a change to branch debian/experimental
in repository arrayfire.
from 7803782 d/changelog: update with latest changes
adds b2ef2a3 d/rules: add upstream changelog
adds 7579e5f d/changelog: update current entry
adds bf6d865 d/changelog: release to unstable
adds fa8b9a2 d/gbp.conf: now strictly follows DEP-14
adds 36708b0 Merge pull request #815 from arrayfire/hotfixes-3.0.1
adds 3053edd Implement CUDA complex dot product
adds ab35ad1 Add complex dot product for CPU backend
adds 0372645 Add complex dot product for OpenCL backend
adds 0bd1110 Add forgotten instatiations
adds 075f48d Merge pull request #817 from FilipeMaia/complex_dot_product
adds f00a396 FEAT: Difference of Gaussians
adds b037a8c Merge pull request #835 from 9prady9/dog
adds b0e4992 FEAT Add CPU backend for unwrap function
adds 73d9038 Added CUDA backend for Unwrap
adds 669b4b2 Added OpenCL backend for Unwrap
adds 4badf65 Bugfixes, comments
adds 9b13e8a Adding padding for strides > 1
adds 9281f4e Added test for unwrap, updated data submodule
adds ec5c0d1 Added documentation for unwrap
adds 2816c01 Changing behavior of unwrap using padding
adds b3d5bba Changed unwrap tests to new behavior
adds dafe47a Updated documentation for unwrap
adds dc18c34 Unwrap: Remove duplicate asserts, add intl, uintl to test
adds ce0d03f Merge branch 'unwrap' of shehzan10/arrayfire into devel
adds 967ffb0 Make dim4 a POD object
adds 4c9962b Make ArrayInfo a POD object
adds 8204912 Add static checks for POD for dim4 and ArrayInfo
adds 91f7431 Make Array<T> a standard layout type
adds f037380 Ensure Array<T> is standard layout using static_asserts
adds 409b295 Revert "Make dim4 a POD object"
adds 67d0f60 Reduce size of Array<T> by rearranging mem vars
adds 0492397 Merge pull request #842 from umar456/pod
adds ee72370 Add static checks to make sure ArrayInfo is the first mem var
adds 19b7067 Merge pull request #843 from umar456/offset_check
adds 152fccf FEAT: Implementing array::lock() and array::unlock()
adds 691f0d1 FEAT / TEST: Adding af::copy()
adds 94909e1 Merge pull request #848 from pavanky/new_additions
adds 6bd71f0 DOCS: Fixing documentation for lock and unlock
adds 2175ccb Added AF_INTERP_LOWER and implementation for resize
adds d6bf32e Merge pull request #852 from shehzan10/resize_lower
adds 470cf99 FEAT: intl/uintl for random
adds 7883d32 FEAT: intl/uintl support for all reduce functions
adds 16b4c0a TEST: Refactor reduce tests. Test intl/uintl
adds cf58d66 BUILD: Fix redefinition warning in blas.
adds 29d0df7 Revert "BUILD: Fix redefinition warning in blas."
adds 6f3e9f8 Merge pull request #855 from umar456/long_long
adds e2ee96e FEAT Added nearest neighbour with SSD, SAD and SHD
adds 9f46f19 Added tests for nearest neighbour
adds f3d4660 Added documentation for nearest neighbour
adds 7f9b423 Merge remote-tracking branch 'upstream/devel' into nearestNeighbour
adds f999b05 Fix double compilation
adds 90a9ffb Remove redefition warning for blas
adds b626967 Added options for dotc and dotu to dot function
adds 354a40b Bug fixes for nearest neighbour and hamming
adds 8540fa0 Added tests for dot
adds 87ca992 Merge pull request #853 from shehzan10/nearestNeighbour
adds fd8d75d Merge remote-tracking branch 'upstream/master' into devel
adds 74c2b05 Compilation and warning fixes
adds 2ae60bb Merge branch 'master' into devel
adds 8aae20b BUFIX: Remove memory leak in af::copy()
adds 1161ff5 Updating forge tag to fix build issuess with ninja
adds 8ac5cb9 Merge pull request #863 from pavanky/bugfix
adds 0672f56 FEAT: Adding function to get use_count of shared pointers
adds 51b90c7 PERF: Do not make copies if the number of references is only 1
adds 44d857d Merge pull request #870 from pavanky/refs
adds a815801 Don't link the installed version of forge on Linux/OSX
adds 7b657e4 Merge pull request #866 from JuliaComputing/kf/forgerpath
adds 91cea3b Add support for BLAS symbol renaming
adds 319753f Merge pull request #867 from JuliaComputing/kf/blasrename
adds 28d9e71 Add lower interpolation to rotate and transform
adds 99a9234 Allow users to set precision when using print
adds c62a250 Changed af_print macro, added documentation
adds e12dceb Merge pull request #873 from shehzan10/more-lower
adds 4dc9836 Allow building against 64bit index OpenBlas
adds c9fbfc5 Also use the void* interface for MKL on windows
adds 1b315f9 STYLE: Remove macros; Simplify templates;
adds b774d57 Merge pull request #1 from umar456/JuliaComputing-kf/openblas64
adds 895d6b2 Merge pull request #865 from JuliaComputing/kf/openblas64
adds 8f37d8c Fix print macro
adds 998a0b8 Change build labels to be for devel branch
adds 9d09831 Added noDoubleTest for nearest neighbour and dot test
adds 3ebc19e BUGFIX: Ignoring NaN values in min and max for all backends
adds 3b2ce09 FEAT: Added support to substitute nan values for sum and product
adds 9036d73 Adding missing instantiations for compat functions
adds 248de68 TEST: Adding tests for reductions when using NaNs
adds 93a581d Merge pull request #875 from pavanky/reduce-nan
adds 7c41de9 Added C API for Harris corner detector
adds 4049083 Added C++ API for Harris corner detector
adds ea3c150 Added CPU backend for Harris corner detector
adds e9e8474 Added CUDA backend for Harris corner detector
adds d183732 Added OpenCL backend for Harris corner detector
adds cef733b Added Harris corner detector unit tests
adds cfb97ca Added documentation and function definitions for Harris
adds 2b6712b Fixed FAST unit tests
adds 311b2c5 Updated data
adds 7d1da23 Removed unused variables from OpenCL's sort_index
adds 0cf4d80 Merge remote-tracking branch 'upstream/devel' into harris
adds a257ef4 Merge pull request #874 from pentschev/harris
adds bd21a82 BUGFIX: Fixing casting to and from complex numbers in CPU backend
adds 56a000d Adding more operator overloading for af::cfloat and af::cdouble
adds bb7bb77 BUGFIX: Making sure c32/c64 imitate f32/f64 when operating with scalars
adds 2ee28e7 TEST: Add mixed type tests with complex inputs
adds 5ef10b3 Merge pull request #877 from pavanky/cplx_fixes
adds f5ac99e Fixed bug affecting Harris on AMD GPUs
adds d656a20 Renaming enums for convolve batch modes
adds d08400d PERF: Improvements for non linear JIT kernels in OpenCL backend
adds 9ee690c TEST: Adding batched mode tests
adds f8e9398 PERF: Speeding up JIT for 3D arrays in OpenCL backends
adds 5b49f07 BUGFIX: median of all elements is now fixed
adds 05de8bf PERF: Improvements to tile when tiling along singleton dimensions
adds ecdab67 PERF: Improvements to CUDA JIT for non linear 3D and 4D arrays
adds f8f072c FEAT: Adding support for non overlapping batched convolution
adds 2ca72cb TEST: Adding tests for non overlapping convolves
adds 73dbeb9 DOCS: Updating the documentation for convolution
adds c4b6ace Merge pull request #889 from pavanky/features-3.1
adds a8c6ae9 Adding missing license for a few files
adds 218043d Updating version to 3.1.0
adds 3a163f5 Changes in examples to reflect asset modifications
adds 210978f Merge pull request #891 from 9prady9/assets_changes
adds 4ff4de3 turned off clFFT examples in external project build
adds eb9367a fix forge dependency target errors in cpu, opencl backends
adds 79fbe5d Merge pull request #892 from 9prady9/cmake_fixes
adds fb17b53 Added AF_API_VERSION
adds 724450f Merge pull request #893 from shehzan10/devel
adds 30cbbc7 Fix double free issue.
adds 9cbc641 Add test for index copy assignment.
adds 4d23d41 Add move constructor and move assignment op
adds 4cc3a9b Merge pull request #895 from FilipeMaia/devel
adds a5efdea BUGFIX: Check for NULL values when allocating memory on CPU backend
adds 981cb4a Ensure CUDA and OpenCL return proper errors when out of memory
adds 171d8dd TEST: Adding test to trash the memory manager and see if it recovers
adds d22c0bb FEAT,TEST: Adding sigmoid function for all backends
adds bc2b8f5 Adding the option to remove tests from ctest
adds 1a2b405 Merge pull request #897 from pavanky/minor_changes
adds 8253205 BUGFIX/TEST: Fixing bug in rank. Added appropriate tests
adds e7d75be BUGFIX/TEST: Fixing not for C API. Added relevant tests.
adds 1bde871 BUGFIX: Fixing a bug in randn for CPU backend
adds 05509c9 BUGFIX: Fixing setSeed for randu
adds b8cfd77 TEST: Updating and fixing the randu/randn tests
adds bd39083 TEST: Updating random tests to properly reset seeds
adds cd78911 TEST: Fixing out of bounds access in fft tests
adds 3fb199d BUGFIX in randn for apple systems
adds 214acf1 Renaming rank test to rank_dense
adds ed3a830 Merge pull request #906 from pavanky/bugfixes
adds c69ca0a Added 64-bit integer type support for functions
adds 426dc0b Added print errors to documentation
adds a86171a Merge pull request #910 from shehzan10/devel
adds 89d7e65 Automatically enable and build CUDA and OpenCL backends by default.
adds 5ad9ada Suppress FIND output for non-essential libraries.
adds 88a2c26 Merge pull request #909 from bkloppenborg/autobuild-backends
adds 5df0554 BUGFIX: Fixed issues with mixed real and complex types
adds 9564317 Fixing the checks for skew
adds fb24097 BUGFIX: conjg no longer errors out for real inputs
adds d4cd847 af_scale now checks for default parameters properly
adds 3b19720 Merge pull request #915 from pavanky/fixes
adds f3df808 matchTemplate example
adds aca80f8 matchTemplate fix in opencl to support indexed template images
adds d322941 Removed uncessary copy in matchTemplate example
adds 314bc9e Added heat colormap to display disparity values for matching
adds 51d441c Merge pull request #922 from 9prady9/match_template_example
adds 4c34bf2 Made cpu::Array constructor consistent with CUDA & OpenCL
adds 3499b98 Fixed filenames for the files used in computer vision examples
adds de9ba74 SUSAN Corner Detector
adds 1a8892b Added error checks non-array parameters of susan API
adds 2c3e0d8 CUDA backend for SUSAN dectector
adds e57ed91 OpenCL backend for SUSAN dectector
adds 220f027 fixes: typos; specific to windows; additional unit test
adds 307040b Corrected memory allocation bug in cpu backend for orb, where
adds 2f5c672 Removed uncessary corner sorting for SUSAN
adds e9535b9 Changed CUDA/OpenCL kernels to use zero leading dimension
adds 57ffa26 typo fix in cuda SUSAN kernel
adds 26df8b3 SUSAN CUDA/OpenCL: Added bound checks to load shared/local Memory
adds a9fd10c Replaced static shared memory with dynamic in SUSAN CUDA kernel
adds 3d405b9 Merge pull request #921 from 9prady9/susan
adds 61ea091 FEAT: Summed Area Tables (sat, af_sat) a.k.a integral images
adds 8f250e7 Added check for skipping double type test when not supported
adds d8760f6 Added uintl, intl support for jit operations in cuda backend
adds 27f3385 Merge pull request #931 from 9prady9/sat
adds 3ede60b Changed default cuda stream to be non-zero
adds 1363441 Wrapped cuda kernel launches with CUDA_LAUNCH macro
adds d46b15b Added stream parameter for upstream{thrust, cufft, cublas} calls
adds 60c2be9 thrust fixes for cuda stream selection on cuda < 7.0
adds f19ecf8 Merge pull request #929 from 9prady9/cuda_default_stream
adds 16d8e27 Extended support for interleaved convolution
adds 8aaf299 Updating the COPYRIGHT.md document
adds ddd704b Updating the language in COPYRIGHT document
adds 3cedee7 Added new example: heston_model in financial
adds 4358056 Moving af_array info gathering functions from util.h to array.h
adds 75c81f1 Merge pull request #928 from pavanky/minor_changes
adds 499dc09 shared/local memory loading fix
adds 8086d34 Merge pull request #933 from 9prady9/ker_fixes
adds 60c4979 namespace fix for POST_LAUNCH_CHECK macro: cuda backend
adds 6cf4a5a Replaced cuda Memcopy/Memset with async versions
adds e2994d9 Merge pull request #935 from 9prady9/cudaMem_stream
adds f816f41 Making fft_inplace consistent across all backends
adds 61968d6 FEAT / TEST: Adding support for inplace fft
adds 822fd5a Moving general fft implementation to src/api/c
adds b92e629 Removing consts from the fftInplace API
adds 155293f FEAT/TEST: Adding R2C and C2R FFT transforms for all backends
adds 91f6e35 DOCS: Adding documentation for real to complex transforms
adds 7900fca Fixing a minor issue in ArrayFireConfigVersion.cmake file
adds d65d048 Fixing issue in documentation
adds 0efb69a Cleaning up multiply_inplace in cpu backend
adds 0a1adee Enabling memory manager back in cuda backend
adds d2e182f Merge pull request #936 from pavanky/fft
adds 3b4d983 typo fix in opencl morph kernel
adds dc38b24 graphics window set size functions
adds 6efd999 style fixes in graphics examples
adds b47dc5b Merge pull request #937 from 9prady9/wind_resize
adds 910feb7 FEAT: Adding select for CPU backend
adds 97c916a FEAT: Select added for CUDA backend
adds 773c02d FEAT: Select for opencl backend
adds 981c5e6 FEAT: replace for all backends
adds 0472884 TEST: Adding tests for select and replace
adds 5df1b8c Merge pull request #941 from pavanky/new_funcs_31
adds b259259 YCbCr <-> RGB conversion functions
adds 4c876fc unit tests for YCbCr <-> RGB conversion
adds ce88eec Documentation for ycbcr_rgb conversion functions
adds 587aa2d Modified colorspace function to handle new colorspace - YCbCr
adds b55a459 Modifed colorspace function wrapper code for efficiency
adds 384cbc7 Replaced padArray calls with Jit operations in ycbcr conversions
adds f4cc97f Updated forge tag
adds 0da5291 Merge pull request #942 from 9prady9/ycbcr
adds 1e60132 BUGFIX: Fixing offset issue with CPU element wise operations
adds 26a2e40 PERF: improvements to element wise operations in CPU backend
adds 9e20a23 Merge pull request #945 from pavanky/jit_fixes
adds 77c31e6 FEAT: adding complex support for exp
adds 345c579 Binary operations with floating point scalars default to single precision
adds cbdd5be Merge pull request #944 from pavanky/minor
adds 3e3be90 Fix type in linux doc
adds 62b0360 Fix signed-usigned comp warnings
adds 07ffda7 Fixes to print functions
adds 50699cc FEAT Added saveArray and readArray functions for file read/write
adds 2a9e855 Removed af_print_array_c. af::print now calls af_print_array_p
adds 57449b0 saveArray returns index of array
adds e59c953 Rename af_print_array_p to af_print_array_gen
adds ac92dda FEAT added to string function
adds dc6b11c Merge pull request #943 from shehzan10/stream
adds 30dd47a modified cpu::getInfo to display CPU information
adds 4cd73be Fixes for cpu backend getInfo on Tegra platform
adds 2fa808b Fixed missing header error for windows platform
adds 791dfcc Merge pull request #952 from 9prady9/cpuinfo
adds 817ccc3 FEAT Added image IO using memory functions and tests
adds c43af47 Changed saveImageMem API. Added image format enum
adds 05c3d62 Add typedef af_image_format af::imageFormat
adds 0681ddd Merge pull request #954 from shehzan10/imageio_mem
adds 78b9b99 Style changes to code in unwrap
adds 2d60aed FEAT: Adding support to unwrap along rows as well as columns
adds a958628 BUGFIX: Fixed a bug for unwrap in all backends
adds 7df4dcd FEAT/TEST/DOC: Adding wrap for CPU backend
adds 1909fa5 FEAT: wrap for CUDA backend
adds 1fab923 Adding atomics.hpp file for CUDA that can be used in the future
adds 365afb4 Moving the kernel cache map to a centralized location
adds 2e05dd4 FEAT: wrap for OpenCL backend
adds a8113b0 Removing faulty test
adds aca30fa Cleaning up unwrap code in OpenCL by using cache store
adds 08b6ecf Fixing the compile error on windows
adds f89e37b Merge pull request #955 from pavanky/wrap
adds cd097af Fix conjugate transpose for vectors
adds 5867404 Merge pull request #958 from shehzan10/ctrans_fix
adds 1580652 fix #957: remove unused dtype trait for size_t
adds a8d8837 Merge pull request #959 from ghisvail/bugfix/remove-unused-dtype-traits
adds 7f06c0d update cl.hpp to upstream version 1.2.7
adds 30f9e3c fix instantiation of Platform objects
adds 162e258 fix instantiation of cl::Platform objects
adds 7b759ce adding GL headers in platform.cpp
adds af598d1 Moved GL headers in platform.cpp inside WITH_GRAPHICS block
adds 9dbf2df Merge pull request #961 from 9prady9/cl_hpp_fixes
adds 3121ca1 Create instances for const index member functions
adds cc0f401 BUGFIX: in array_proxy::get() const
adds 1ba33a6 Merge pull request #953 from umar456/fix_951
adds b6d8e2d Updated boost compute version tags
adds 0ce0df2 Merge pull request #962 from shehzan10/update-deps
adds 028960f Added SIFT prototypes and parameter documentation
adds 60546a1 Added C API for SIFT
adds efe1331 Added C++ API for SIFT
adds 3ef9fdf Added CUDA backend for SIFT
adds b87ba86 Added OpenCL backend for SIFT
adds 25344cb Added CPU backend for SIFT
adds 48e909a Added AF_ERR_NONFREE to defines.h
adds 1ee6b70 Added BUILD_NONFREE option to CMake
adds b177218 Added SIFT Copyright information
adds b69215c Added OpenSIFT License
adds afe1e46 Added SIFT documentation
adds d1d122e Updated test data
adds ca57067 Added SIFT unit tests
adds 43e5fe1 Made SIFT image indexing more readable in CPU backend
adds a6ae78b SIFT fix for CUDA on Windows, made it more readable
adds 9d09e4d Made SIFT image indexing more readable in OpenCL backend
adds cce0e73 Templated SIFT gaussianElimination() in CPU and CUDA backends
adds a5e8a5e Added missing CUDA_LAUNCH and THRUST_SELECT to SIFT
adds bda81b0 Improved CUDA SIFT coalescing and performance
adds dd97ce5 Improved OpenCL SIFT coalescing and performance
adds a9a8140 Passing shared size memory dynamically to CUDA SIFT
adds 3ada93a Moved OpenCL's conv2Helper to kernel directory
adds b5cf8d5 Improved SIFT OpenCL code
adds 079f4df Using pre-defined constants for workgroup sizes in CUDA SIFT
adds c17da09 Using 3D arrays for Gaussian/DoG pyramids in CUDA SIFT
adds 9bd1693 Using cudaMemsetAsync for SIFT
adds 966a130 Fixed OpenCL SIFT bug causing segmentation faults on Intel
adds 45a4966 Added missing buffer freeing call to OpenCL SIFT
adds 298c4d8 Fixed CUDA SIFT on unused memory buffer
adds b563615 Changed SIFT unit test to use std::stable_sort()
adds faefa30 Moved syncthreads/barriers out of thread conditionals
adds ee4d0bd Moving CUDA SIFT syncthreads calls out of thread conditionals
adds c88929e Merge pull request #948 from pentschev/sift
adds b372e49 Reorganizing non free build process.
adds b9fac73 Changing build flags to build non free algorithms
adds 5e0ceac Templated options are now runtime compile options for opencl reductions
adds 8bf6589 Templated options are now runtime compile options for opencl convolutions
adds bf69efc Templated options are now runtime compile options for opencl indexed min/max
adds 5c0da49 Templated options are now runtime compile options for opencl scan
adds 39f9977 Templated options are now runtime compile options for opencl nearest neighbor
adds d03c3c0 Removing unnecessary switch case from opencl ireduce
adds 2c00e64 Templated options are now runtime compile options for opencl FAST
adds 72f7ff5 Splitting up opencl sort_by_key files to compile in parallel
adds 398b55c Splitting sort_by_key across too many files slows down compile times
adds 98f6f7c Fixing a bug introduced a couple of commits ago in OpenCL SIFT
adds 8bb6ad7 Merge pull request #966 from pavanky/compile_fixes
adds 0183b6a Fixing the output of af::info() for OpenCL backend
adds f34693e Updating documentation and adding version guards for for 3.1
adds c071d86 Merge pull request #964 from pavanky/fixes_310
adds 7fb28f6 Merge branch 'devel' into nonfree_fixes
adds af7681f Merge pull request #965 from pavanky/nonfree_fixes
adds 71c12e8 Remove set_scalar(x, 0) instructions
adds d0e1e7b Remove unused opencl/kernel/set files
adds f52389e Merge pull request #968 from shehzan10/opencl_fixes
adds d1b1a0e Fix typo
adds 12e4f9d Add SVD API
adds 369013c Add SVD Cuda backend
adds 2969c73 Add SVD CPU Backend
adds 82d039f Add SVD OpenCL Stump
adds 864c51d Add SVD example
adds 71da964 Merge pull request #882 from klemmster/cusolver_svd
adds 3794c80 Merge branch 'devel' into svd
adds bdbf30e Changes to style and fix compile errors
adds 20b5f51 Cleaning up cpu blas / lapack in OpenCL backend
adds e7e38b5 Fixes to supress annoying compiler warnings in OpenCL backend
adds fde6380 Adding functions from clMagma necessary for OpenCL SVD:
adds 73b8081 Initial support for SVD in OpenCL backend
adds 163ab37 Adding proper error checking in magma
adds eb66094 Fixing svd params to reflect clmagma
adds c265948 Merge remote-tracking branch 'upstream/devel' into svd
adds 549f6b5 Work around for issues in OpenCL svd
adds 5fc32dc API clean up and adding support for complex numbers for SVD
adds 4118733 Fixing various typos and bug fixes for SVD in CUDA and OpenCL
adds b3c5f0f TEST: for SVD
adds 974856b DOCS: Updating the documentation for SVD
adds 9ef664c Adding version guards for svd
adds 73717c1 Adding more pragma directives to supress GCC warnings
adds 083f6b8 TEST: updating SVD tests to contain all four floating point data types
adds 25975bb Fixing svd example to reflect the change in API
adds 0b76aa3 Revert "Updated boost compute version tags"
adds ed9e1be Compilation fixes for OSX
adds 8bcbf98 Use xGESVD instead of xGESDD for ARM platforms
adds 80593d5 Merge pull request #969 from pavanky/svd
adds 8755004 Added missing 3.1 version gaurds
adds 40912e5 BUGFIX SVD use gesdd only with MKL, use gesvd with atlas
adds 52923a1 Updated release notes for v3.1.0
adds 7f8799d Version guards for complex operators
adds ff39491 Added SIFT license info to release notes
adds e1bee7a SVD using gesdd on Apple
adds 2c7044b Merge pull request #970 from shehzan10/rel_31
adds aa8bc95 FEAT: Adding support for linear assignment in C API
adds 654dffb Restore original shape after flattening input for linear indexing
adds 5b671c2 Merge pull request #971 from pavanky/assign
adds 4b5725b FEAT: Adding support for linear indexing in C API
adds d2741e0 OpenCL backend af_info function string fixes
adds 52f45e9 Documentation for CUDA backend specific API
adds 41d517e Documentation fixes for 3.1 release
adds d27f91b Updated forge upstream tag for 3.1 release
adds 2712182 Documentation for missing index header functions
adds 164b39e Added missing docs for complex and opencl backend specific fns
adds 4f60872 Added missing docs for operator%, array::H and array::T
adds 1a52ee1 Merge pull request #973 from 9prady9/upstream_updates
adds e6acad6 Merge pull request #974 from arrayfire/devel
adds 5474921 dfsg clean
adds 4d7e369 Increment version to 3.1.1
adds e0b8b5b Fixes for snprintf on windows
adds 095f29e BUGFIX: For calculating number of elements for a buffer in CUDA backend
adds 619ecf7 BUGFIX: For calculating number of elements for a buffer in OpenCL backend
adds bce6967 TEST: Adding tests for indexed reductions
adds 704c0dc Merge pull request #983 from pavanky/indexed_reduce_fixes
adds 64a0dbf DOC fix for AF_PATH rendering missing %
adds e87a003 Read me fixes
adds 644d074 Try PkgConf first to find CBLAS
adds 7788c3b Merge pull request #990 from marbre/hotfixes-3.1.1-cblas
adds fc0ed41 Try PkgConf first to find LAPACKE
adds a083294 Merge pull request #991 from marbre/hotfixes-3.1.1-lapacke
adds 887ae72 Changes to Heston model to remove c++11 dependencies
adds 714cdc0 BUGFIX: seq.begin can now use negative offsets just like seq.end
adds 01af962 Updating Release notes for 3.1.1
adds eb9fde9 BUG: Fixing seq when passing af::end to af::seq
adds 5e91f81 Merge pull request #996 from pavanky/hotfixes-3.1.1
adds 3d7f26b bug fix in image_editing example
adds f2474cf Updated forge tag for 3.1 release
adds 90b61ce Merge pull request #997 from 9prady9/gfx_examples_fixes
adds 169af41 Updating release notes
adds 92e08e9 Merge pull request #998 from arrayfire/hotfixes-3.1.1
adds 9208992 Merge tag 'upstream/v3.1.1' into dfsg
adds 31e7cca Merge tag 'upstream/v3.1.1+dfsg1' into debian/sid
adds edc40ff d/changelog: new upstream release, switch to unreleased
adds 08dadf9 d/watch: add dversionmangling due to dfsg repack
adds 88da7eb d/p: refresh doxygen patch
adds 93ad33f d/rules: meta-commit
adds 211c64e d/p: add upstream fix for memory corruption in assign
adds ae6a9aa d/copyright: update copyright information
adds 5ca3a04 d/control: format now inline with debmake template
adds c762772 d/control: update short and long descriptions
adds 9c2a440 add README.* files
adds 48e129a d/changelog: add release info
adds 64b4553 release to unstable
adds ba19ee5 d/gbp.conf: switch upstream branch to dfsg-clean
adds 2b81c04 d/p: add patch fixing FTBFS due to usage of assembly code
adds 0538a8a d/changelog: add release information
adds 09a7181 release to unstable
adds 4698f12 BUGFIX in assign
adds b40e017 Add missing compute2cores versions
adds f5e0a33 Increment version to 3.1.2
adds 1311e81 BUGFIX convolve frequency condition is now based on kernel size
adds a9e33bc Add missing AFAPIs
adds 32cf297 BUGFIX: Fix indexed reductions with complex types in OpenCL backend
adds 0886f63 BUGFIX: Fix kernel name generation in ireduce for OpenCL backend
adds 139de04 BUFIX: Converting non-linear indices to linear indices in ireduce
adds 9b4a8a8 TEST: Adding tests for bugs in indexed reductions
adds c865ca4 Merge pull request #1009 from pavanky/reduce_fixes
adds b0c7c7b Reduction fixes for smaller arrays (<4096 elements)
adds 40f8cd1 Fixed histogram cuda/opencl kernels for indexed arrays
adds 2f56b39 Removed unncessary memory overhead in histogram cuda/opencl kernels
adds 392d26c type cast fix in histogram unit test
adds 60b39e8 Merge pull request #1012 from 9prady9/histogram_fixes
adds e5b8788 Restricts cpuid usage to only 64 & 32 bit architectures
adds 61643e0 Merge pull request #1016 from 9prady9/cpuid_fixes
adds bf53528 Add missing examples to documentation
adds d99397f Fix quoting text in readme docs
adds 5fd5463 Minor fixes in documentation. Fix cmake command for docs
adds e1196f8 Add -L to lib path in using on pages
adds 0da2226 Merge pull request #1017 from shehzan10/hotfixes-3.1.2
adds 8ed7ae4 Removed __LP64__ macro from checks related to valid cpuid usability
adds e6d2ae7 Forge tag update for ArrayFire 3.1.2
adds f2f1d16 Merge pull request #1023 from 9prady9/cpuid_lp64_marco_fix
adds fb9f225 Fix tests for 32-bit systems
adds d639289 Added enviornment variable to disable multi gpu tests
adds fa26517 Merge pull request #1018 from shehzan10/hotfixes-3.1.2
adds 3a92a6f Fix median test (again)
adds de4edd4 Send err messages to file for OSX installer
adds 9907d46 Merge branch 'osx_inst_fix' of git://github.com/umar456/arrayfire into hotfixes-3.1.2
adds 381e4ad Updated release notes
adds 77128bb Merge pull request #1026 from arrayfire/hotfixes-3.1.2
adds 6f09650 Merge tag 'upstream/v3.1.2' into dfsg-clean
adds cdecad9 Merge tag 'upstream/v3.1.2+dfsg1' into debian/sid
adds 2857f91 refresh patch bugfix-in-assign.patch
adds 1bfa3a9 remove patch bugfix-cpuid-usage.patch, applied upstream
adds 850aa24 split -arch and -indep targets
adds bc1f70c d/rules: drop restriction of test suite to 64-bit architectures only
adds 9056c48 d/changelog: add release information
adds afcc437 release to unstable
adds db0fef7 d/control: replace OpenBLAS build-dep by generic BLAS / LAPACK
adds 8828f85 d/p: add patch fixing detection of BLAS in upstream find module
adds de16fd1 remove source override on docs/highlight.pack.js
adds 0eea6d9 d/changelog: add release information
adds b821b3a release to unstable
adds 4ad8ad0 Increment version to 3.1.3
adds 5028db2 Removing ARCH_32 and ARCH_64 flags
adds 3fe2b08 Fixing missing symbol issues when freeimage is not found
adds 7335f49 Merge pull request #1041 from pavanky/fixes
adds 254458f Return CUDA Driver version on windows too
adds 9f15ec2 Add Paths to FindOpenCL for linux
adds 6b6ce72 Improved SIFT descriptor scaling
adds f10dddc Update test data
adds bf7e205 Merge branch 'sift_scale' of git://github.com/pentschev/arrayfire into hotfixes-3.1.3
adds 5eaa630 Fix memory leak in median
adds 8856529 Fix windows builds when not using MKL
adds 2ca9bf4 Merge pull request #1048 from shehzan10/hotfixes-3.1.3
adds 507ea36 Fixed SIFT on CPU backend when double_input is false
adds 43810d5 Fixed several memory leaks in CUDA and OpenCL SIFT
adds 5d4e868 Fixed min/max values of sigma in SIFT scale levels
adds b931609 Added several SIFT unit tests
adds 8d0ac0a Merge remote-tracking branch 'upstream/hotfixes-3.1.3' into sift_tests
adds ce43ad2 Updated test data
adds fcaa3a8 Merge pull request #1050 from pentschev/sift_tests
adds 191e1fa Add paths to examples FindOpenCL.cmake file
adds 2c3b475 Fix documentation groups for select and replace
adds f944c75 Fixes for building without lapack
adds d9c793c Fix doc for af_isnan
adds 17ff045 Updated release notes for 3.1.3
adds 2dfad04 Adding missing offsets for various OpenCL kernels
adds 71e02b8 Merge pull request #1061 from pavanky/bugfixes-3.1.3
adds a422f8a Add return type to cuda blas (for windows)
adds ac81157 Add change to release notes for 3.1.3
adds 35c89f5 Merge pull request #1059 from arrayfire/hotfixes-3.1.3
adds eac9359 Merge tag 'upstream/v3.1.3' into dfsg-clean
adds 8b9539e Merge tag 'upstream/v3.1.3+dfsg1' into debian/sid
adds c628642 add upstream metadata
adds 1ac46b8 d/changelog: add release information
adds db666a0 release to unstable
adds 06a9cd6 Added GLOH function prototypes
adds 037d2f7 Added C API for GLOH
adds 746a40f Added C++ API for GLOH
adds 97dae2e Added CPU implementation of GLOH
adds 337fcec Added CUDA implementation of GLOH
adds de6d401 Added OpenCL implementation of GLOH
adds 5f18136 Added GLOH documentation
adds eaa14d5 Added GLOH unit tests
adds 88680b5 Additional operator* overloads for cfloat, cdouble
adds b3e65dc Added mean<T> instantiations for int64 and uint64 in C++ API
adds 034e905 Clean up mean helper functions & typo fix in af_mean_all_weighted
adds 8cd7964 Additional unit tests for mean
adds d30462c Updating assets commit tag
adds a736988 Merge branch 'devel' into stats_tests
adds 7f422f5 Merge branch 'devel' into stats_tests
adds 8b11400 Corrected path typo in mean tests
adds d026e5f Merge branch 'devel' into stats_tests
adds 66de62b Merge branch 'devel' into stats_tests
adds 45d804b Merge branch 'devel' into stats_tests
adds 857a6b4 Initial commit for heterogenous api for ArrayFire backends
adds b70c2c4 Windows specific changes to HAPI Symbol Manager
adds 280af8d Cleaned up function call in hapi functions
adds 131de34 Heterogeneous API for arith and algorithm header functions
adds 7a462eb Merge branch 'devel' into heterogeneous_api
adds 79c90ac Updated copyright year in hapi source files
adds f9ebb38 backend-independent api wrapper for image & vision headers
adds ceabe02 backend-independent wrapper for arrayfire funtions
adds d12d141 Renamed cmake file hapi build-identifier
adds 8d73db4 Cleaned up symbol manager class in HAPI wrapper
adds cc26908 Changed default backend enum to point to zero
adds a83b095 Fixed typo in data, device & index wrapper source files
adds 92d0ec1 Documentation for runtime backend selection functions
adds 7463601 Wrapper work around for af_make_seq function in hapi
adds a9cb8fe set_backend and get_backend_count functions
adds 53d327e Moved HAPI examples into standard examples location
adds 7b17460 Corrected BUILD_ALL cmake macro arguments
adds 9f36a1d fix: opencl backend alone build fails due to this missing header
adds 8b94ac1 FEAT Added batch support for approx1 and approx2
adds c954729 Changing int to dim_t in approx kernels
adds 871e114 Added any dimension batching and gfor support for approx1 and approx2
adds 15c3cb0 Change condition structure in approx
adds 5db4b02 Merge pull request #979 from shehzan10/approx-batch
adds be34055 Merge branch 'devel' into heterogeneous_api
adds 6ca7247 Utility functions for generating af_index_t array objects
adds 11fbdfb Moved indexing utility functions to common location
adds 15b0221 Added missing functions hapi wrapper
adds d6c13c2 Renaming src/api/hapi to src/api/unified
adds ec94996 Fixing CMakeFiles for unified backend
adds 4cb64cb Changes required to make unified library build the cpp bindings
adds e35b5ec Changes to examples and test CMakeLists to build *_unified binaries
adds d042df9 changed unified api to load libraries using prioritized list of paths
adds 7e46c54 Fixed CMake source bugs for windows platform in unified api sources
adds aa5b8ae Removed AFAPI attribute declaration where no needed for func definitions
adds 59484b0 Another cmake fix for windows platform in unified api project
adds 859112e Fixed cmake bug in examples also
adds a884c96 Added error display strings for unified api error codes
adds a24a264 fix in unified api for af_save_image
adds 6c0c1b8 Disabled Sort1000 & SortMed tests for sort_by_key and sort_index
adds 63436aa Merge pull request #977 from arrayfire/heterogeneous_api
adds e737b52 use RAII to avoid freeimage bitmap resource leaks
adds 11a6b07 Merge pull request #994 from vakopian/fi-leak-fix
adds 2d75672 Merge branch 'arrayfire/master' into 'arrayfire/devel'
adds bd5ab15 Merge branch 'devel' into stats_tests
adds f26cc0d typo fixes in mean unit test
adds cb54732 Merge pull request #1001 from arrayfire/hotfixes-3.1.2
adds 1b52681 Fix sizes for approx batch tests
adds fc7630f Use af_print_array_gen in unified basic example
adds 4a0f164 Change unified backend priority. Add af/backend.h to arrayfire.h
adds c5f8b3c Changed unified/basic.cpp to use C++ api
adds 94e8dbf Add unified backend details to using on pages and cmake.in file
adds 4923e58 Documentation for unified backend
adds cb68a30 Reduced size of approx1 batched linear test
adds fe1f289 Merge pull request #1007 from shehzan10/unified_doc
adds 4693a58 Change output of DOG to floating type
adds 4439521 Increment version to 3.2.0
adds 8d68ba5 Added AF_MSG macro
adds ebfe9e5 Added short (s16) and ushort (u16) types for CPU
adds 7f3ff10 Added short, ushort support for CUDA backend
adds 5e88e4a Fix memory alloc for fast opencl
adds 50582e1 Added short and ushort support for CUDA backend
adds 4a2b3bc Remove ushort redifinition from imageio
adds 984525c Change ushort to unsigned short in cpp
adds 4286b86 Add typedef for ushort in tests
adds 5b82ff1 Corrections in unified backend doc
adds f606a91 Add 16-bit enums to docs
adds 14b0a4d Merge pull request #1014 from shehzan10/16bit
adds 31d2475 Merge branch 'master' into devel
adds 4b478af Merge branch 'devel' into stats_tests
adds 01bd5d1 Corrected a typo in statistics functions documentation
adds d7ce9fa basic unit tests for `af::cov` and `af_cov`
adds dd0a803 unit tests for standard deviation function
adds d004ca8 unit tests for correlation coefficient function
adds 6934df6 Enabled integral types to float/double reduction
adds c19058f statistics functions fixes
adds 2876c22 Fix dlopen string for OSX
adds c32937d Documentation fixes
adds 064be97 Merge pull request #1031 from shehzan10/unified_fixes
adds 645903e Merge branch 'devel' into stats_tests
adds 8dd257b function to set active cuda device using native id
adds 9d85fb5 function to set active opencl device using cl_device_d
adds ff0cbf8 Fixed template specilization for MSVC compiler in mean function
adds 581ea3d Added check to verify f64 support in covariance unit test
adds 5c012ec Specilization for Binary functor for cdouble type in cpu backend
adds 63807f4 Merge pull request #1034 from 9prady9/set_native_device
adds f9f3e7d Merge remote-tracking branch 'upstream/hotfixes-3.1.3' into devel
adds e7facc8 Merge branch 'devel' into gloh
adds 653f789 Added missing 'AFAPI' to C++ API
adds ef53a67 Added unified API for GLOH
adds 73bcb3b Merge pull request #1052 from pentschev/gloh
adds d3f29d5 Updated test data
adds 0ca7aeb Added CPU fallback for CUDA LU when CUDA older than 7
adds 590da11 Added CPU fallback for CUDA QR when CUDA older than 7
adds 37e0658 Added CPU fallback for CUDA QR when CUDA older than 7
adds 876494d Added CPU fallback for CUDA Solve when CUDA older than 7
adds adbd62b Added CPU fallback for CUDA Inverse when CUDA older than 7
adds 55fdae4 Added CPU fallback for CUDA SVD when CUDA older than 7
adds c166c8f Call deviceGC before solve tests to minimize memory (tegra)
adds 6223be6 Default CPU fallback for CUDA LAPACK to OFF. Use CUDA_LAPACK_CPU_FALLBACK=ON
adds 0759aea Merge pull request #1053 from shehzan10/cudalapack
adds 9191492 Change condition when nonfree are removed from ctest
adds 5abb983 Fix comparison warnings
adds 66453c4 Call submodule update if submodules are missing
adds ee04f37 Changed std::sort to std::stable_sort in CPU SIFT
adds 08e6b65 Updated SIFT/GLOH test thresholds
adds bb93ee4 Merge remote-tracking branch 'upstream/devel' into sift_fixes
adds 9b47179 Merge pull request #7 from pentschev/sift_fixes
adds a67ea40 Updated SIFT/GLOH test thresholds
adds f8462bc Merge pull request #1056 from shehzan10/devel
adds 7fc3856 Fixed out-of-bounds memory access in CUDA/OpenCL SIFT
adds c607f62 Merge pull request #1062 from arrayfire/master
adds c23e49b Port shallow water eq example from 2.1
adds 380b7b0 Merge pull request #1065 from umar456/swe
adds a4733f5 Update installation documentation to match current methods.
adds ae5d7a5 Include special instructions for Windows.
adds 7437975 Merge pull request #1070 from bkloppenborg/devel
adds eb0f5b6 Add unified backend binaries to the OSX installer
adds 95de3c5 Merge pull request #1069 from umar456/devel
adds c0b9c80 Provide option for MKL use for CUDA lapack cpu fallback
adds 52b63cf Fix compilation fixes for VS2015
adds ded5320 Add return type docs for functions with varying return type
adds ca1e922 Fix warnings
adds 5a938e2 Change clBLAS tag to the corrected commit
adds 0c90cce Added function to get available backends
adds 218d2b1 Optimizations to backends available computation
adds 97bed15 Merge pull request #1064 from shehzan10/devel
adds 2b4ed25 3D line plot feature
adds 2a1d63d 3d surface rendering features
adds 35579b4 Merge branch 'surface_plot' of git://github.com/syurkevi/arrayfire into syurkevi-surface_plot
adds 9436f10 Fix for cuda backend surface rendering function
adds 33de00a Merge branch 'syurkevi-surface_plot' into devel
adds 8f54598 Replaced deviceSychronize calls with async versions
adds ff4812c Removed uncessary stream synchronizes in device pointer functions
adds 0b90a21 Style fixes
adds ab2e222 Merge pull request #1076 from 9prady9/cuda_memcpy_stream_fixes
adds aa75b14 Optimization for JPEG, cleanup
adds 551cd56 Moved common functions from imageio into header file
adds 61226f3 FEAT add loadImageT and saveImageT. Provides loading in different types
adds 662efff Change loop in surface example
adds ff8369b Fix enum value conversion in image
adds 6a21345 Fix imageio load order in case of bitmap and not bitmap
adds 86457a9 Add s16 and u16 types to image (graphics)
adds ec6d55d Add s16 and u16 types to surface (graphics)
adds f3e7308 Add s16 and u16 types to histogram (graphics)
adds 2f209d4 Add s16 and u16 types to plot (graphics)
adds 4dd618c Add s16 and u16 types to plot3 (graphics)
adds 26399fc Update forge build tag
adds f3dbb85 Add load_image_t and save_image_t to unified
adds 35111ba Doc for loadImageT and saveImageT
adds 75f228b Fixes for ushort on windows
adds 61d0d54 Update test data
adds 805176e Merge pull request #1077 from shehzan10/imageio
adds 778b13f Add intl/uintl to sort, sort_index, sort_by_key
adds d60ae28 use cl_long and cl_ulong in sort functions
adds c0cb5cd Add intl/uintl to lookup
adds 5ff4efc Add intl/uintl to histogram and histeq
adds 401f130 Add intl/uintl to convolve and fftconvolve
adds 2e36523 Add intl/uintl to set functions
adds a479c25 Add intl/uintl to meanshift
adds 944159c Fix cuda shared memory instantiation for s64 and u64
adds 804651a Fix comparison warning
adds 04153f9 Compilation fix for non-imageio builds
adds 6601bef API Change loadImageT -> loadImageNative
adds dc1bea3 Add support for c32/c64 for isInf, isNaN, iszero
adds 888cf7c Merge pull request #1089 from shehzan10/devel2
adds 452f51f Add OpenGL requirements for Forge.
adds ec4adea Update documentation.
adds c19e636 Add Ubuntu 14.04 installation quirk
adds f819023 Update INSTALL.md
adds e0dcaa6 Update CMake and Make examples.
adds 674b9df Update section titles, helloworld exe.
adds e3b7a1c Add install page to layout
adds 69b9ace More documentation updates for tutorials
adds 4bcf103 Update links
adds b9495c7 Merge pull request #1087 from shehzan10/devel
adds 2a21ddc Memory leak fix in SUSAN feature detector
adds 42d3301 specilizations for abs math function for int & char
adds dd3f023 Indexing test for out of bounds access
adds 68df69a Merge pull request #1096 from 9prady9/susan_fixes
adds 7ad7ce0 Encode backend info into ArrayInfo::devId
adds efd5c02 Added array/backend checks to unified backend
adds da4d2cc Add getBackendId function to get backend info of an array
adds e0233fc Update unified api docs
adds 7aa5911 CHECK_ARRAYS lets C-API return errors in case of arr = 0
adds 9067eb8 Update link to example projects.
adds ba5ca79 Add XCode instructions, assets.
adds 75891bb Add link to ArrayFire project template repo.
adds 49a41d4 Add PPA for glfw3 on Ubuntu 14.04
adds bd3e37d Update INSTALL.md
adds 6e19054 Merge branch 'doc-updates' of https://github.com/bkloppenborg/arrayfire into unified_checks
adds 6975da8 Fix iota dims check
adds 7ad94e5 Fix af_device_array dims check
adds ca9ca6f Typo AFF_ERR_NONFREE -> AF_ERR_NONFREE
adds 15411eb Add version guards for v3.2
adds 4242378 Merge branch 'devel' of https://github.com/shehzan10/arrayfire into unified_checks
adds cf0ab91 Added version checks for getBackendId
adds 144a2db Fix triangle test failures
adds 53d77a7 Added homography function prototype and API
adds 80869d9 Added CPU backend for homography
adds 693397d Added CUDA backend for homography
adds 5ca352a Added OpenCL backend for homography
adds 008a6d9 Added homography documentation
adds b514aab Added homography unit tests
adds e5e954e Updated test data
adds 80d49eb Merge remote-tracking branch 'upstream/devel' into homography
adds 33d4ead Fixed homography for Intel OpenCL
adds cb13531 Disabled homography LMedS unit tests
adds b3da23b Split vision.h prototypes into multiple lines
adds d7abcf2 Fixed __syncthreads() calls in homography
adds 3e0abfa Added AF_HOMOGRAPHY prefix to af_homography_t enum
adds c55cae4 Fixed homography documentation
adds 0d8f647 Merge branch 'homography' of https://github.com/pentschev/arrayfire into unified_checks
adds 31761d2 Removed unnecessary __syncthreads() on homography
adds 1fd4511 Removed unnecessary barrier from homography
adds 1bf915a Merge branch 'homography' of https://github.com/pentschev/arrayfire into unified_checks
adds 463f044 Moved det to rank test file. Removed rank and det from missing test
adds 1b8c231 Merge pull request #1097 from shehzan10/unified_checks
adds ddc179e updates matrix manipulation documentation
adds bbd3d8a Merge pull request #1099 from syurkevi/maniparr_docupdate
adds ace69d2 Removed gfor unsupported functionality
adds 70de4fe Added new examples
adds 1830993 Added release notes for 3.2.0
adds 140c340 Update forge tag for af3.2.0
adds a4ffcf5 Add Tegra X1 badges to readme
adds 8bbb6eb Merge pull request #1100 from shehzan10/docs-3.2
adds 2a45792 Transpose build table in readme
adds 46a45b5 Added groups for graphics func documentation
adds c9b22d5 Merge pull request #1102 from arrayfire/devel
adds 62696f1 Merge tag 'upstream/v3.2.0' into dfsg-clean
adds d214169 Merge tag 'upstream/v3.2.0+dfsg1' into debian/sid
adds 5a97942 d/changelog: new upstream release.
adds 2ca08a7 d/gbp.conf: use submodules with buildpackage.
adds a55764d d/gbp.conf: disable patch numbering with pq.
adds e699af3 Refresh patch queue: - Drop bugfix-in-assign.patch, applied upstream. - Rename remaining patches to their corresponding commit message, following switch to gbp-pq.
adds 1361c08 d/control: add gtest to build dependencies.
adds f3f7b60 Add patch allowing build of testsuite with Debian gtest.
adds 66afa8e Disable build of unified backend.
adds 69c0851 d/{*.install,rules}: use more specific regexes. Reason: in anticipation of other backends being packaged.
adds cd3ced2 d/rules: move nocheck filter inside autotest-arch target.
adds 21ff9b6 d/copyright: add missing copyright information.
adds 5a5aeab d/changelog: update release information.
adds 7073489 Release to unstable.
adds f4d9708 d/rules: simplify targets.
adds 1d3c0b1 d/rules: disable build of testsuite if no check requested.
adds 828f9dd d/changelog: add release information.
adds 636b0da Release to unstable.
adds 603c34e Remove jquery symlink in HTML documentation.
adds eb5474a d/rules: set build directory explicitly.
adds 9fc36cb d/rules: use correct cmake option for include install path.
adds af5ab8e Add example source code to documentation package.
adds 6cdad09 d/rules: exclude examples and documentation from compression.
adds 43ba61a Add patch enabling usage of custom compile flags in examples.
adds f360dc3 Add autopkgtest testsuite.
adds 3ff6278 Add patch fixing build of examples.
adds de0b4b7 Use cmake options to inject documentation and example install paths.
adds b5c1de4 d/control: cme fixed, wrap ans sort.
adds 4bf8635 d/changelog: add release information.
adds 00d77c5 Release to unstable.
adds 8d887c3 Fixup autopkgtest testsuite.
adds 3bfed6c Remove unnecessary CMAKE_BUILD_TYPE override.
adds b819ffa Add release information.
adds 7f43dbd Release to unstable.
adds ca0c7cc Fixes for examples when used with installer
adds f10075b Fixed typos in documentation.
adds f11a990 Merge branch 'patch-2' of https://github.com/mlloreda/arrayfire into hotfixes-3.2.1
adds 1b0ef66 Add missing linkage with libdl
adds 565c9af Merge branch 'fix/missing-libdl-linkage' of https://github.com/ghisvail/arrayfire into hotfixes-3.2.1
adds ef2e7d7 Fix documentation when using older doxygen
adds b0a8e7e Merge branch 'minor-docs' of https://github.com/shehzan10/arrayfire into hotfixes-3.2.1
adds 3281b50 Added missing symbol export for af_draw_surface
adds 0075fdf Merge branch 'gfx_surface_fix' of https://github.com/9prady9/arrayfire into hotfixes-3.2.1
adds 86dd6c7 Build and install documentation in a separate output folder.
adds b747245 Merge branch 'enh/docs-target-settings' of https://github.com/ghisvail/arrayfire into hotfixes-3.2.1
adds 54ad0b3 Fix examples target.
adds 87bfd99 Merge branch 'fix/examples-target' of https://github.com/ghisvail/arrayfire into hotfixes-3.2.1
adds d39f9e8 Fix type in documentation
adds 59faf59 Merge pull request #1115 from shehzan10/hotfixes-3.2.1
adds 0d5913b Fixes for examples cmakelists for dl lib
adds 86c1d7d Tests are now available as standalone
adds bdee78a Fix examples/cmakelist arguments for osx and windows
adds 7fcf9bb Add enable_testing to test/CMakeLists.txt
adds 3fdef4a Merge pull request #1120 from shehzan10/tests-standalone
adds 75c2172 BUGFIX: GFOR assignment when other dimensions have step indices
adds 7d06e9f BUGFIX: Issue with vector indexing when using spans
adds 05e00d5 Do not perform copies in moddims if memory is contiguous
adds d7b9f9a TEST: Adding test for GFOR assign bug
adds 37b3c8c BUGFIX: Getting the device pointer performs memory copy when needed
adds 967545d TEST: Adding tests to verify unnecessary copies aren't being done
adds 4d75c78 Compile fixes for older compilers
adds 57a89ba Merge pull request #1130 from pavanky/bugfixes-3.2.1
adds 1c673f9 Fixed and improved OpenCL's homography
adds 9b00511 Fixed and improved CUDA's homography
adds d4c5ed6 Merge pull request #1136 from pentschev/homography_fixes
adds ee8a1ee Documentation for seq class
adds 71442bb Fix possible divide by zero case in cpu info
adds 3adec6f Merge pull request #1132 from shehzan10/seq_docs
adds 99fe1ac Fix examples installation directory
adds d3f3080 Use folders (VS sln) for examples/tests when built out of source
adds 92599d7 Install examples source irrespective of value of BUILD_EXAMPLES
adds 8136f21 Updated forge tag
adds 0dce77c CMake generates the list of examples
adds d0732f1 Generate examples as dir/filename.cpp
adds c0aba7f Update examples refs to match updated example style
adds 6978c90 Updated release notes for 3.2.1
adds 121caef Fix typo
adds 9fb64de Merge pull request #1137 from shehzan10/hotfixes-3.2.1
adds eb0e0a5 fix code formatting in doxygen
adds 5955b14 initial vectorization tutorial
adds aa076e9 forge visualization tutorial
adds 1cca0de initial opencl, cuda interop tutorials
adds b6e7542 initial interop tutorials
adds 1ada68e doxygen formatting and reference fixes
adds 516c36e interop tweaks temporarily remove external opencl context from interop tutorial. add cuda stream topic
adds 72fe0ae interop formatting tweaks
adds ffd5941 additional vectorization content
adds 384ce6d remove extra information from vectorization
adds 8ad12b4 Merge pull request #1125 from syurkevi/tutorials
adds 0836463 DOC Add background and bold to inline code tags
adds 1183029 DOC corrections, proper linking and syntaxes
adds 7749f4d Merge pull request #1138 from shehzan10/hotfixes-3.2.1
adds 4d6eaba Merge pull request #1139 from arrayfire/hotfixes-3.2.1
adds f263db0 Increment version to 3.2.1
adds 6072652 Merge tag 'upstream/v3.2.1' into dfsg-clean
adds 84ea70a Merge tag 'upstream/v3.2.1+dfsg1' into debian/sid
adds 2c831d9 Update patch queue: - Drop Fix-documentation-target.patch, Fix-examples-target.patch and Fix-missing-cstdio-include.patch, applied upstream. - Refresh Use-custom-cflags-in-examples.patch and Use-system-gtest.patch. - Refresh Fix-CBLAS-detection.patch, use upstream fix. - Add Fix-missing-includes-in-testsuite.patch, fix FTBFS in testsuite on Ubuntu 14.04.
adds a786c99 Update installation of examples.
adds 90c8338 Update autopkgtest testsuite: add script building and running the upstream testsuite against the installed packages.
adds 1af1d3b Update changelog.
adds 17f300b Release to unstable.
adds 8f1d859 Install common development files to new libarrayfire-dev package.
adds 7bdb6f9 Install OpenCL backend to new libarrayfire-opencl packages.
adds c03f6ca Install unified backend to new libarrayfire-unified packages.
adds 57e387d Add patch disabling calls to git when building the documentation. File: Disable-git-usage-in-Doxygen-settings.patch
adds fc89db6 Update changelog.
adds 035ad29 Release to unstable.
adds b440613 Add upstream patch fixing the testsuite for i686 architecture.
adds c19ba29 Add missing copyright information.
adds a2bf2cf Build without AltiVec support on ppc64el, fixes FTBFS.
adds 8a87657 Update changelog.
adds 58d4595 Release to unstable.
adds 37f14f5 d/rules: fix missing dpkg query for DEB_HOST_ARCH.
adds eda4626 Release to unstable.
adds 362b08a Add missing maintainer CXXFLAGS.
adds 8f24455 Release to unstable.
adds 7e2ecb4 Increment version to 3.2.2
adds edee05b Remove unused variable warning in homography cuda kernel
adds ba19743 Fix memory leak in cuda random. Additionally allow seeds per device
adds c13227e Use cudaMalloc/Free for memory ops in curand
adds 6a18064 Use custom cflags in examples.
adds b61bb42 Merge branch 'fix/examples-use-custom-cflags' of https://github.com/ghisvail/arrayfire into hotfixes-3.2.2
adds 20ba4b9 Fix missing includes in testsuite.
adds b1ee11c Merge branch 'fix/testsuite-missing-includes' of https://github.com/ghisvail/arrayfire into hotfixes-3.2.2
adds 2287c5c Move AF_REVISION and AF_COMPILER_STR to backend/version.hpp
adds eb3b242 Merge pull request #1147 from shehzan10/hotfixes-3.2.2
adds cac82df Minor cleanup to FindCBLAS.cmake
adds a1d6213 Merge pull request #1146 from pavanky/build_fixes
adds 09f0367 Replaced cudaMemcpy with async version calls in homography
adds 0b10b3a Merge pull request #1153 from 9prady9/perf_fix_homography
adds 9459c62 Fix bug in identity cuda plaguing compute 5.2
adds 1f3e7b3 Merge pull request #1154 from shehzan10/hotfixes-3.2.2
adds 8405d5d Add multiprocess compilation flags for Visual Studio
adds cda0923 Add MSVC flag around example build flags
adds 1af10e9 Merge pull request #1156 from shehzan10/MSVC_MP
adds dc1e53f Adding default parameter for surface
adds 687167b Cleaning up graphics examples
adds b21a838 Update number of iterations in black scholes example
adds cbe4af5 Cleaning up examples
adds 52a8409 Fixing select and replace tests
adds 57779c2 Fixing surface.cpp example to work on windows
adds 59da65f Removing unnecessary af/macros.h from all examples
adds a3a19b5 Merge pull request #1158 from pavanky/example_fixes
adds 59c9820 Add function to check Image IO availability
adds 5bd550f Added noImageIOTests() to relevant tests
adds 4145a04 Add function to check LAPACK availability
adds 1237820 Added noLAPACKTests() to relevant tests
adds 9d218a5 Hide scrollbars appearing for quoted/pre text
adds 20e1a18 Merge remote-tracking branch 'origin/scrollbar-fix' into no-freeimage-disable-tests
adds e94f037 Handle printing empty Arrays
adds fea38e4 Remove macros.h and MSG from defines.hpp
adds 8c945f3 Added MEMINFO macro to print memory stats
adds 6bf306e CPU: Using mt19973 as generator instead of default_random_engine
adds 0b11c6e MEMINFO -> AF_MEM_INFO. Add documentation for the macro
adds fda0a80 Merge pull request #1169 from shehzan10/no-freeimage-disable-tests
adds 35f0fc2 Change clBLAS/FFT external projects to clBLAS/FFT-ext
adds ed0373f Creating streams for devices only when device is active
adds bf9d70a Check the stream before returning in getStream
adds 6dbd4cc Merge pull request #1180 from shehzan10/hotfixes-3.2.2
adds 36ed9a4 bug fix in plot3 graphics example
adds 301e388 Merge pull request #1176 from 9prady9/example_fixes
adds ac09f91 Fixing documentation for replace function
adds eed0651 Adding documentation for `device` that it locks the memory.
adds cd26c4e Updating forge tag to reduce the path lengths
adds c171395 Merge pull request #1183 from pavanky/docs-fixes
adds 1c584de Add extra paths to check for unified on Linux/OSX
adds dc4c479 Added documentation link when library fails to load in unified
adds 1ed27ff Add detail for AF_ERR_ARR_BKND_MISMATCH
adds 7a93d9a Using vector<string> in unified for extra paths
adds e60fa94 Using auto to iterated extra paths in unified
adds 1e0ab50 Fix colorspace c functions to use exceptions properly
adds affc59b Adding __AF_FILENAME__ to give the just filename without the path
adds ca6ed2b Add AF_RETURN_ERROR macro to return errors and print msg
adds 7cb790e Use AF_RETURN_ERROR when graphics is not configured
adds e26c341 Use AF_RETURN_ERROR when Image IO is not configured
adds 2717d42 Add function name to exceptions in internal error classes
adds c074de7 Add missing vector include
adds 301b21d Using AF_THROW_MSG instead of THROW in cpp/array.cpp
adds ff83267 __AF_FILENAME__ returns path from src
adds 194de52 Formatting the exception string
adds 4e06483 DOC Add code sample to convert available backends to bool
adds a75590b TESTS Removed typed_test from info.cpp
adds 05471d1 TEST add a test for unified api
adds 9604fcc Change AF_THROW_MSG to AF_THROW_ERR - Does not check for AF_SUCCESS
adds 50bccdd Change CHECK_ARRAYS to be used like a function
adds 5df0b29 TEST Rename test/unified.cpp to test/backend.cpp
adds db3626f Build only info and backend test for unified
adds 862b523 Fixes for exceptions in minimal builds
adds 613557c Changes to opencl backend when building with openblas
adds 0c77893 Changes to find OpenBLAS as lapack and lapacke alternative
adds 0d270ad Merge pull request #1197 from pavanky/openblas_fixes
adds 091cdf9 Adding a Fast configuration
adds 40f2cbe Test reordering: Sort by backend, then alphabetically
adds 235728a Revert "Adding a Fast configuration"
adds 4243ffc BUILD Adding option MIN_BUILD_TIME to CMake. Options sets O0 for fast compile
adds d308ae1 DOC Fix typo in af_div
adds 7c96fd4 Update release notes for v3.2.2
adds 59dcacd Update forge tag for release v3.2.2
adds 005894e Merge pull request #1195 from shehzan10/fastconfig
adds 7507b61 Merge pull request #1200 from arrayfire/hotfixes-3.2.2
adds 25a2a75 Merge tag 'upstream/v3.2.2' into dfsg-clean
adds eab3b7b Merge tag 'upstream/v3.2.2+dfsg1' into debian/sid
adds 544eca1 d/control: add missing break / replace relationship.
adds 0881fd3 Refresh patch queue: - Drop Fix-CBLAS-detection.patch, fixed upstream. - Drop Fix-missing-includes-in-testsuite.patch, applied upstream. - Drop Use-custom-cflags-in-examples.patch, applied upstream.
adds 2d932c8 d/control: cme fixed.
adds 9fd4828 d/copyright: cme fixed.
adds 1b10bf6 d/rules: disable build of examples if nocheck requested.
adds e4f8163 Update changelog.
adds f3078e6 Release to unstable.
adds a84107e Refresh patch queue: - Delete Fixing-select-and-replace-tests.patch, already dropped. - Refresh Use-system-gtest.patch and Use-system-mathjax.patch. - Add cherry-picked upstream fix for ArrayFire image i/o on selected architectures. File: Always-use-freeimage-flags-instead-of-hardcoded-offsets.patch
adds 5e0d56a Update changelog.
adds 3cafa9b Release to unstable.
new be0ff22 Setup async project; Setup basic queues;
new b94c3df Convert CPU blas to use async queues
new 3188bdf Async CPU approx1 and approx2
new f797314 Async CPU Assign
new c8ecdb9 Async CPU Bilateral
new 759b506 Async CPU Convolve
new c399e75 Async CPU diff1 and diff2
new 80903d0 Avoid sending references to queued lambdas
new 96c5602 Async CPU Copy, Assign, and Index
new 46beebc Async CPU diagonal
new b7c83e8 Async FFT for the CPU backend
new 413eea8 Add eval to copyData
new 49f0cce Async random on CPU backend
new fada883 Async where on the CPU backe
new 1a0802f Async CPU reduce and ireduce
new 1842bcf Async CPU Transpose. Fix bug in eval
new 5d428df Merge branch 'devel' into async
new 4a8e723 Merge additional changes for async CPU.
new 91f7a1f async cpu::index function
new 2796770 Fixed asynchronous indexing & assignment in cpu backend
new 0aeed42 converted cpu tile to asychronous call
new 330ae1c converted sort_index cpu function to asynchronous call
new 5eea071 converted triangle fn in cpu backend to async call
new 551433e converted lu & cholesky decomposition functions to async calls
new ed6d26d svd cpu backend is async now
new e0d7c12 converted qr & solve cpu functions to async calls
new d0223f9 Converted wrap & unwrap cpu fns to async calls
new 32a65d8 converted transform to async call
new 743fb4a converted susan fn in cpu backend to asynchronous call
new 840af46 convert sort & sort_by_key cpu fns to async calls
new e005557 sobel cpu fn is async fn after this change
new 14e9d31 Converted resize & shift cpu fns to async calls
new 3c2bc65 convert select & rotate cpu fns to async calls
new 258d573 Convert morph & range cpu fns to async calls
new 9510fcb Converted cpu scan function to async call
new 6fc636f fix for async sift cpu function
new 48a9e58 converted matchTemplate, meanShift & medfilt to async calls
new b813fd4 nearest neighbour cpu func is asyn call now
new 35a462c conversion of listed functions to async calls
new e5ab671 converted join cpu func to async call
new 0c72451 converted cpu regions function to asynchronous call
new ed730cf adds scatter function
new 0a78b60 update plot2d example to include scatter plot
new 4ec314a cleanup and scatter example update
new 6f52c36 adds scatter3
new 6515414 Merge pull request #1116 from syurkevi/forge_scatter
new b267ffd Increment version for devel to 3.3
new 65c7a23 Fixes for scatter
new 323aa77 Merge pull request #1133 from shehzan10/devel
new 53de790 Removed dead code from opencl::DeviceManager class
new ce2d6a6 Enables use of ArrayFire with external context & command queue
new f65ee89 cpp wrappers for opencl external context related fns
new 2bcc6de unit tests for afcl::{pushDevice, setDevice, popDevice} fns
new 964b1ea Style changes in opencl header
new d41839f api name change for afcl external context functionality
new 227377d Added OpenCL include dir for unit tests
new 67ef051 additional style changes
new 72b71e8 Merge pull request #1140 from 9prady9/ext_ocl_context
new b9e113b Merge remote-tracking branch 'upstream/master' into devel
new 365dc94 DOC resolve markerType enum in graphics
new b878711 Remove unused variable warning in homography cuda kernel
new ebe6cac Merge branch 'devel' into async
new df2c091 Converted iir, fir, fftconvolve to async calls
new c06f24d Change to gfx to handle Arrays created by async calls
new 21f74eb Fixed harris & homography cpu fns to work with async fns
new 2217014 Fix in Array::device method
new 919333e Fix for getDeviceMemInfo function in cpu
new 7dac34a Fixes for asynchronous cpu copy && set functions
new 6b9c157 FEAT added allocHost and freeHost functions
new 3c9d69d FEAT added infoString function to return info as string
new e73ee6d Merge remote-tracking branch 'upstream/hotfixes-3.2.2' into devel
new 1de97de Using af_alloc_host when allocating user-return string
new f628fbe toString now uses af_alloc_host to allocate memory
new 7206028 Add overload of toString that returns a string
new 06d4bef FEAT add af_get_revision to get commit instead of AF_REVISION
new b3c28b6 Using c/version.cpp in unified
new 34f3569 Merge pull request #1163 from shehzan10/devel
new ab11b6b Merge branch 'devel' into async
new 3ba9633 Added missing eval for input Array's in cpu backend fns
new 8cc9c9c threads library is now a submodule in cpu backend
new abce1e8 Moved fns enqueued onto async queue to separate folder
new 5f2f155 Removed obselete fn of evalArray from all backends
new e651cad cpu::Array::eval queue work moved to kerenel namespace
new 3cddae2 moved assign cpu async fn to kernel space
new 0e563ba moved kernel namespace in cpu backend inside cpu namespace
new d03bb75 moved bilateral, convolve, fftconvolve to cpu kernel namespace
new e8f0242 moved copy queue fns from cpu backend to kernel namespace
new 913850d Moved diagonal cpu implementation to kernel namespace
new 71298c6 moved diff, fast, gradient, harris, histogram to kernel namespace
new cc7d665 moved rgb_hsv & identity fns to kernel namespace
new 696657c moved indexing & assignment fns to kernel namespace
new f2b84dd template parameters style fixes in cpu kernel namespace fns
new bb0a22c Moved more cpu fns implementations to kernel namespace
new 7d7f32f moved the left over fns to cpu kernel namespace
new 1313f98 Fixed the bug in cpu ireduce kernel function
new b684b06 Fixed orb async cpu fn
new a0f17b6 cmake fix to check for threads submodule
new c539f1d moved fft cpu fns implementations to kernel namespace
new 4831215 moved dot cpu implementation to kernel namespace
new d1089f8 moved fftconvolve reorder helper fn to kernel namespace
new 4f8b3fa fixed cpu::kernel::dot fn signature
new 95d9346 Added ENQUEUE macro in cpu backend
new 7dad2ef Removed obsolete queue sync in cpu::padArray fn
new 90611a2 Fixed cmake condition for threads submodule check
new 1cbffbb Synchronize when AF_SYNCHRONOUS_CALLS is set to 1
new 6058dd2 Revert "Added ENQUEUE macro in cpu backend"
new edda52a Update README.md with updated status badges
new 29cfed5 Merge pull request #1178 from shehzan10/readme-badges
new cad4c2c initial gravity example
new 5507717 add collisions, split vectors into components for performance
new 5c01608 remove state globals
new b1aa875 remove windows pause ifdef
new c62eea0 Merge pull request #1152 from syurkevi/gravity_example
new 5ecdc54 Added API support for perspective transform
new 2a43871 Added perspective transform to CPU backend
new 7fdfe3e Added perspective transform to CUDA backend
new 15b9ad6 Added perspective transform to OpenCL backend
new 81dca06 Updated transform documentation
new 8e4e766 Added perspective transform unit tests
new 7327fb2 Updated test data
new 27aeed0 Fixed wrong data type in OpenCL transform
new bdc31d0 Simplified test for perspective transform in API
new 951533c Merge pull request #1190 from pentschev/perspective_transform
new 2ae8273 Merge remote-tracking branch 'arrayfire/master' into devel
new f0b9691 Merge branch 'devel' into async
new 861808e Merge pull request #1174 from umar456/async
new c2d7e42 Fix clang warnings (std::abs, pragma ignores)
new fa765a6 Merge pull request #1208 from shehzan10/clang-warnings
new 1dd2195 Cleanup util.cpp
new e19a6be Using getEnvVar instead of getenv
new b260abf Cleanup/improve backend test
new de4851d Not building info for unified. backend does the same as info
new 4d06c74 FEAT Added isImageIOAvailable function to check support
new 1b85d6d FEAT Added isLAPACKAvailable function to check support
new 7747ee6 Use isImageIOAvailable in testHelper
new fe3fa66 Use isLAPACKAvailable in testHelper
new b89ab5d Add missing af_err to string
new 8813a2e af_get_last_error supports NULL as valid argument for len
new b7af25a Improvements to af::exception messages
new 7b6eee1 Add version guards around allocHost and freeHost
new 9605740 Deprecate af_(lock/unlock)_device_ptr. Use af_(lock/unlock)_array
new d02636a Add memFreeUnlinked to free locked device ptrs
new 330f4f8 FEAT Add printMemInfo to print memory information
new ed5556c Renamed is_free -> mngr_lock and is_unlinked -> user_lock in cpu memory mngr
new aa25b17 Renamed is_free -> mngr_lock and is_unlinked -> user_lock in cuda memory mngr
new cef8559 Renamed is_free -> mngr_lock and is_unlinked -> user_lock in opencl memory mngr
new dbe861e Reverse conditions for freeing in memory managers
new 33fbf33 Renamed internal memFree functions
new 8cb21a4 Fix AF_DISABLE_GRAPHICS condition (Fixes e19a6be)
new b0c7c9a Merge pull request #1207 from shehzan10/memory
new 3c91935 Replaced ssh based url with http url for threads submodule
new 7f7ce20 Merge pull request #1216 from 9prady9/threads_uri_fix
new e5bb334 Add missing isLAPACKAvailable implementation in CPU backend
new 775747e Set revision to "default" when git is not available
new 84dccc8 Documentation fixes
new 60a722a Merge pull request #1215 from shehzan10/devel
new 6988950 Build fix for CUDA backend when using boost 1.60
new 09bc060 Merge pull request #1211 from pavanky/compile_fixes
new 5940d4b Always use freeimage flags instead of hardcoded offsets
new a6ae9bd Merge pull request #1218 from pavanky/freeimage_fix
new 3047acd Add the ability to sort OpenCL devices Using the following criterion
new 8873ed2 Using proper offsets for loadImageNative and saveImageNative
new 14230d2 Adding environment variables to choose OpenCL device
new 2f16824 Cleaning up exception handling in src/api/c
new b42cbeb Updating docs for new AF_OPENCL_*_TYPE environment variables
new 17b2600 Freeimage only requires the flags for 24 / 32 bit images
new bb29d92 Merge pull request #1222 from pavanky/opencl_device_sort
new 685dccd Update boost compute release tag
new 6b7b1ce Update clFFT release tag
new 5866b17 Merge pull request #1202 from shehzan10/clupdates
new 777abcb Moving dispatch.hpp / dispatch.cpp to src/backend/
new 828138c Renaming a few internal functions
new c8cd29b Adding a unified memory manager for all backends
new a175432 Remove unnecessary line from CMakeLists
new 43d030d Cleaning up error messages in loading and saving files
new 73b7cac Fixing CUDA platform manager to sort devices in a more saner manner.
new d75b899 Adding lock to memory allocated using af_alloc_device / af::alloc
new d5f3bf1 Adding documentation for AF_MEM_DEBUG
new f9a8336 Additional sanitizing for mutex locks
new 0638f3f Removing unnecessary returns from void functions
new 1520dc3 Fixing issue where garbageCollect was only called on current device
new aaf554e BUGFIX: Initialize buffer counts to 0
new 9d0c159 af_set_device now only warns when device > 0 on CPU
new 3009e8f Merge pull request #1225 from pavanky/memory_manager
new db14451 Re-enable disabled sort tests from issue #995
new 6da71e5 BUGFIX Handle 16-bit data in saveImage
new b14ae20 Fix saveImageNative for 1-channel images
new a6a4cdb Update test data submodule commit
new 88cf471 Add tests for 16-bit images for ImageIO+Native
new 968ae4e Handle CUDA devices locked in exclusive mode
new cc9018e Add try/catch around cuda::setDevice in Pinned Memory Manager
new 904d3e0 Using device independent vector for cuda Pinned Memory Manager
new bb479bf Merge pull request #1229 from shehzan10/imageio16-test
new d9e5288 Fix NONFREE Build CMake Options
new 5be5511 Handle compute_53 (tegra x1) for cuda lapack
new cc00f35 Add definition for each compute type
new a1823b3 Added helper functions for device type and unified mem in OpenCL
new 4275f5f Added getMappedPtr and unmapPtr functions in opencl memory
new 3c1ab9f Added matmul offloading to CPU
new f9819f7 Fix blas header types in cpu
new d5077ec Fix bug in OpenCL JIT when calling functions that return same value
new ac25f5b Fix bug in CUDA JIT when calling functions that return same value
new 507ec92 dot in CUDA/OpenCL now uses mul followed by reduction
new 7eafd44 Remove cpu dot fallback. Slower than opencl dot always
new 45abbc3 Add OpenCL-CPU fallback for LU
new 88e910d Add OpenCL-CPU fallback for Cholesky
new 872acfb Add OpenCL-CPU fallback for QR
new 59a9df0 Add OpenCL-CPU fallback for SVD
new ffb191c Add OpenCL-CPU fallback for solve
new 4e2d46c Add OpenCL-CPU fallback for inverse
new 210a64c Clean up header files in opencl/cpu/
new e08d41b Update environment variables doc
new b36d003 Move MappedPtr into the Array class
new 56f9140 FEAT Add getActiveBackend function
new b39b60d Merge remote-tracking branch 'upstream/devel' into fallback-opts
new a8b831b Cleanup for opencl-cpu offload
new f6e309b Clean up cusolver finding in cmake
new 9962b87 Merge pull request #1221 from shehzan10/fallback-opts
new 323bf75 Added tranform coordinates functionality
new ba483f1 Added transform coordinates to unified backend
new 3522f80 Added transform coordinates documentation
new 7f3e215 Added transform coordinates unit tests
new 230c603 Updated test data
new ffc6e7f Putting transform coordinates within version guards
new 3941550 Move asserts inside try/catch in indexer functions in util
new 735b66b Fix OpenCL-CPU offload when OpenCL is built without lapack
new 8a68775 Merge branch 'devel' of shehzan10/arrayfire into devel
new 3389940 Fix resize unit test.
new 6a34bee Compile fixes for gcc 5.3
new 46042fe Fixing compiler warnings
new 5fba37c Functions to get opencl device type and platforms
new 34c8c97 Work around for a bug in AMD's clBuildProgram
new 4579762 Merge branch 'compile_fixes' of pavanky/arrayfire into devel
new e7e6080 Update clBLAS release tag
new 845d3b3 Fixes in magma potrf (opencl cholesky)
new 3ce49a5 BUGFIX Fix how streams are created in setActiveDevice (CUDA)
new 76e9af4 Merge pull request #1220 from shehzan10/clBLAS-updates
new cc2dda0 Fixes to build with MKL when INTEL_MKL_ROOT is exported
new 58fc4c8 Fixes to getMappedPtr in OpenCL backend
new d3d2996 Add getHostMemorySize and getDeviceMemorySize functions
new 6c30652 Updates to Memory Manager and Garbage Collection Heuristics
new 043739f Move ArrayFireConfig, CPack (as CPackConfig) into CMakeModules
new cfd60f1 Move /bigobj definitions into main CMakeList (windows)
new aba1851 BUGFIX Add/remove entries for platform when adding external device/context
new 163b5fb BUGFIX Fix CUDA device management and free at destructor
new 805dc5b Fix Tests: ORB, Meanshift, basic_c, solve
new 7eb905f Add documentation for deviceInfo
new 78f157c Merge branch 'memory' of shehzan10/arrayfire into devel
new 96041b5 BUGFIX: incorrect index for 3rd dimension in select / replace
new f228de3 TEST: Adding tests for 3D and 4D select and replace
new f8b0fb1 Merge pull request #1251 from pavanky/bug_fixes
new 0039cdb Proper exception handling for memory manager
new 91bed33 Removing unneeded cudaDeviceSynchronize()
new 519d3bb Adding compute 37 to list of accepted CUDA computes
new 41bad15 Changes required to build tests in a single file
new f94f522 Merge pull request #1250 from pavanky/misc_fixes
new 3242618 Documentation fix in matchTemplate function
new 209643b syntax+typo fix in opencl backend
new 9bf1455 Updated test data for meanshift, bilateral & morph
new 64ef789 Merge pull request #1252 from 9prady9/docs_fixes
new 3fc6939 Fixes issues when compiling with icc
new 8648066 Merge branch 'fixes_to_compile_with_icc' of mchandra/arrayfire into devel
new ee7fa33 Removing unnecessary option "BUILD_GTEST"
new 653416d Updating release notes for 3.3 pre-release
new fc7553d BUGFIX: max_bytes were being set incorrectly in MemoryManager
new 5183a35 Cleaning up internal API for memory functions
new 82e6558 JIT evaluation can now be tweaked by environment variables
new a1b7f8c Changes to internal memory manager
new a938500 Fixes to random.hpp to work in multi-threaded environment
new 95aaf72 OpenCL JIT now launches more threads per work group for CPU devices
new 90ab8cd Merge pull request #1262 from pavanky/memory_manager_fixes
new f674cda BUGFIX: Fixing error in where for OpenCL backend
new 98ff1c0 Merge pull request #1260 from pavanky/where_fix
new fb7d11e Merge tag 'v3.3.alpha' into dfsg-clean
new cb95bda Clean source tree from non-dfsg components.
new 8eb294f Merge tag 'upstream/3.3_alpha+dfsg1' into debian/experimental
new 829a42e d/changelog: new upstream release.
new a0b8a53 d/gbp.conf: use recommended DEP-14 format for upstream tags.
new d82420f d/copyright: update list of files excluded after dfsg-cleaning.
new 13993e8 Drop Always-use-freeimage-flags-instead-of-hardcoded-offsets.patch, fixed upstream.
new 8dfa073 d/rules: remove usage of deprecated BUILD_GTEST option.
new 2772653 cme fix d/control: - use secure VCS-Git URI. - remove unnecessary versioned depends. - sort install dependencies.
new c72f6ff Update changelog.
new fc8d0ad Release to experimental.
The 284 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "adds" were already present in the repository and have only
been added to this reference.
Summary of changes:
.gitignore | 3 +
.gitmodules | 3 +
ArrayFireConfig.cmake.in | 64 -
ArrayFireConfigVersion.cmake.in | 73 -
CMakeLists.txt | 88 +-
CMakeModules/AFInstallDirs.cmake | 2 +-
CMakeModules/ArrayFireConfig.cmake.in | 72 +
CMakeModules/ArrayFireConfigVersion.cmake.in | 73 +
CMakeModules/CPackConfig.cmake | 96 +
CMakeModules/FindCBLAS.cmake | 139 +-
CMakeModules/FindLAPACKE.cmake | 172 +-
CMakeModules/FindOpenCL.cmake | 34 +-
CMakeModules/MinBuildTime.cmake | 93 +
CMakeModules/Version.cmake | 33 +-
CMakeModules/build_boost_compute.cmake | 9 +-
CMakeModules/build_clBLAS.cmake | 8 +-
CMakeModules/build_clFFT.cmake | 9 +-
CMakeModules/build_forge.cmake | 18 +-
CMakeModules/build_gtest.cmake | 99 -
CMakeModules/examples.dox.in | 3 +
CMakeModules/osx_install/OSXInstaller.cmake | 16 +-
CMakeModules/osx_install/cpu_scripts/postinstall | 6 +-
CMakeModules/osx_install/cuda_scripts/postinstall | 4 +-
CMakeModules/osx_install/distribution.dist | 35 +-
.../osx_install/opencl_scripts/postinstall | 4 +-
CMakeModules/version.h.in | 2 +-
CMakeModules/version.hpp.in | 13 +
COPYRIGHT.md | 40 +-
CPack.cmake | 96 -
LICENSES/OpenSIFT License.txt | 57 +
README.md | 14 +-
assets | 2 +-
debian/README.Debian | 19 +
debian/README.source | 14 +
debian/changelog | 203 +-
debian/control | 204 +-
debian/copyright | 977 +-
debian/gbp.conf | 10 +-
debian/libarrayfire-cpu-dev.install | 5 +-
debian/libarrayfire-cpu3.install | 2 +-
debian/libarrayfire-dev.install | 2 +
debian/libarrayfire-doc.install | 3 +-
debian/libarrayfire-doc.links | 1 -
debian/libarrayfire-opencl-dev.install | 2 +
debian/libarrayfire-opencl3.install | 1 +
debian/libarrayfire-opencl3.lintian-overrides | 1 +
debian/libarrayfire-unified-dev.install | 2 +
debian/libarrayfire-unified3.install | 1 +
debian/libarrayfire-unified3.lintian-overrides | 1 +
.../Disable-git-usage-in-Doxygen-settings.patch | 21 +
.../Disable-privacy-breaching-settings.patch | 67 +
debian/patches/Use-system-gtest.patch | 21 +
debian/patches/Use-system-mathjax.patch | 21 +
debian/patches/disable-privacy-breach-doc.patch | 62 -
debian/patches/fix-doxygen-settings.patch | 16 -
debian/patches/series | 6 +-
debian/rules | 101 +-
debian/source/lintian-overrides | 2 -
debian/tests/build-examples | 25 +
debian/tests/build-testsuite | 26 +
debian/tests/control | 15 +
debian/upstream/metadata | 12 +
debian/watch | 3 +-
docs/CMakeLists.txt | 36 +-
docs/arrayfire.css | 11 +
docs/details/algorithm.dox | 38 +-
docs/details/array.dox | 34 +-
docs/details/backend.dox | 84 +
docs/details/data.dox | 44 +-
docs/details/device.dox | 59 +-
docs/details/image.dox | 291 +-
docs/details/index.dox | 25 +
docs/details/lapack.dox | 30 +
docs/details/signal.dox | 65 +-
docs/details/util.dox | 148 +
docs/details/vision.dox | 148 +-
docs/doxygen.mk | 10 +-
docs/header.htm | 3 -
docs/highlight.pack.js | 1 -
docs/highlight_js_doxygen.css | 93 -
docs/layout.xml | 6 +
docs/pages/INSTALL.md | 168 +-
docs/pages/README.md | 46 +-
docs/pages/configuring_arrayfire_environment.md | 119 +-
docs/pages/forge_visualization.md | 160 +
docs/pages/getting_started.md | 6 +-
docs/pages/gfor.md | 32 -
docs/pages/interop_cuda.md | 163 +
docs/pages/interop_opencl.md | 189 +
docs/pages/matrix_manipulation.md | 336 +-
docs/pages/release_notes.md | 507 +-
docs/pages/timing.md | 6 +-
docs/pages/unified_backend.md | 214 +
docs/pages/using_on_linux.md | 200 +-
docs/pages/using_on_osx.md | 233 +-
docs/pages/using_on_windows.md | 304 +-
docs/pages/vectorization.md | 212 +
examples/CMakeLists.txt | 131 +-
examples/CMakeModules/FindOpenCL.cmake | 34 +-
examples/benchmarks/blas.cpp | 13 +-
examples/benchmarks/fft.cpp | 7 -
examples/benchmarks/pi.cpp | 7 -
examples/common/progress.h | 2 +-
examples/computer_vision/fast.cpp | 2 +-
examples/computer_vision/harris.cpp | 2 +-
examples/computer_vision/matching.cpp | 120 +
examples/computer_vision/susan.cpp | 87 +
examples/financial/black_scholes_options.cpp | 9 +-
examples/financial/heston_model.cpp | 116 +
examples/getting_started/convolve.cpp | 7 -
examples/getting_started/integer.cpp | 7 -
examples/getting_started/rainfall.cpp | 7 -
examples/getting_started/vectorize.cpp | 8 +-
examples/graphics/conway.cpp | 9 -
examples/graphics/conway_pretty.cpp | 11 +-
examples/graphics/fractal.cpp | 11 +-
examples/graphics/gravity_sim.cpp | 140 +
examples/graphics/histogram.cpp | 13 +-
examples/graphics/plot2d.cpp | 20 +-
examples/graphics/plot3.cpp | 48 +
examples/graphics/surface.cpp | 44 +
examples/helloworld/helloworld.cpp | 8 +-
.../image_processing/adaptive_thresholding.cpp | 11 +-
examples/image_processing/binary_thresholding.cpp | 8 -
examples/image_processing/brain_segmentation.cpp | 11 +-
examples/image_processing/edge.cpp | 7 +-
examples/image_processing/filters.cpp | 32 +-
examples/image_processing/image_demo.cpp | 5 +-
examples/image_processing/image_editing.cpp | 12 +-
examples/image_processing/morphing.cpp | 7 +-
examples/image_processing/optical_flow.cpp | 7 -
examples/image_processing/pyramids.cpp | 5 +-
examples/lin_algebra/cholesky.cpp | 7 -
examples/lin_algebra/lu.cpp | 7 -
examples/lin_algebra/qr.cpp | 7 -
examples/lin_algebra/svd.cpp | 48 +
examples/machine_learning/bagging.cpp | 1 +
examples/machine_learning/deep_belief_net.cpp | 7 +-
examples/machine_learning/kmeans.cpp | 2 +
examples/machine_learning/knn.cpp | 1 +
examples/machine_learning/logistic_regression.cpp | 7 +-
examples/machine_learning/naive_bayes.cpp | 1 +
examples/machine_learning/neural_network.cpp | 7 +-
examples/machine_learning/perceptron.cpp | 7 +-
examples/machine_learning/rbm.cpp | 7 +-
examples/machine_learning/softmax_regression.cpp | 1 +
examples/pde/swe.cpp | 87 +
examples/unified/basic.cpp | 71 +
include/af/algorithm.h | 143 +
include/af/arith.h | 48 +-
include/af/array.h | 417 +-
include/af/backend.h | 125 +
include/af/blas.h | 20 +-
include/af/complex.h | 64 +-
include/af/constants.h | 8 +
include/af/cuda.h | 121 +
include/af/data.h | 145 +-
include/af/defines.h | 130 +-
include/af/device.h | 213 +-
include/af/dim4.hpp | 6 +-
include/af/exception.h | 3 +
include/af/graphics.h | 218 +-
include/af/image.h | 502 +-
include/af/index.h | 132 +-
include/af/lapack.h | 80 +
include/af/macros.h | 84 +
include/af/opencl.h | 601 +-
include/af/seq.h | 170 +-
include/af/signal.h | 540 +-
include/af/statistics.h | 28 +-
include/af/traits.hpp | 24 +
include/af/util.h | 286 +-
include/af/vision.h | 444 +-
include/arrayfire.h | 56 +-
src/api/c/approx.cpp | 30 +-
src/api/c/assign.cpp | 102 +-
src/api/c/bilateral.cpp | 2 +
src/api/c/binary.cpp | 8 +
src/api/c/blas.cpp | 16 +-
src/api/c/cast.cpp | 2 +
src/api/c/colorspace.cpp | 69 +-
src/api/c/complex.cpp | 2 +-
src/api/c/convolve.cpp | 74 +-
src/api/c/convolve_common.hpp | 11 +-
src/api/c/corrcoef.cpp | 2 +
src/api/c/covariance.cpp | 12 +-
src/api/c/data.cpp | 89 +-
src/api/c/device.cpp | 167 +-
src/api/c/diff.cpp | 8 +
src/api/c/dog.cpp | 72 +
src/api/c/err_common.cpp | 145 +-
src/api/c/err_common.hpp | 122 +-
src/api/c/fast.cpp | 2 +
src/api/c/features.hpp | 8 +
src/api/c/fft.cpp | 204 +-
src/api/c/fft_common.hpp | 78 +
src/api/c/fftconvolve.cpp | 119 +-
src/api/c/filters.cpp | 2 +
src/api/c/flip.cpp | 6 +-
src/api/c/graphics_common.cpp | 87 +-
src/api/c/graphics_common.hpp | 25 +-
src/api/c/hamming.cpp | 54 +-
src/api/c/handle.hpp | 3 +
src/api/c/harris.cpp | 84 +
src/api/c/hist.cpp | 7 +-
src/api/c/histeq.cpp | 4 +
src/api/c/histogram.cpp | 24 +-
src/api/c/homography.cpp | 88 +
src/api/c/image.cpp | 59 +-
src/api/c/imageio.cpp | 541 +-
src/api/c/imageio2.cpp | 403 +
src/api/c/imageio_helper.h | 102 +
src/api/c/implicit.cpp | 6 +
src/api/c/index.cpp | 48 +-
src/api/c/join.cpp | 6 +
src/api/c/lu.cpp | 10 +
src/api/c/match_template.cpp | 2 +
src/api/c/mean.cpp | 131 +-
src/api/c/meanshift.cpp | 4 +
src/api/c/median.cpp | 16 +-
src/api/c/memory.cpp | 263 +
src/api/c/moddims.cpp | 20 +-
src/api/c/morph.cpp | 4 +
src/api/c/nearest_neighbour.cpp | 93 +
src/api/c/ops.hpp | 21 +
src/api/c/optypes.hpp | 2 +
src/api/c/plot.cpp | 39 +-
src/api/c/plot3.cpp | 129 +
src/api/c/print.cpp | 137 +-
src/api/c/rank.cpp | 14 +-
src/api/c/reduce.cpp | 95 +-
src/api/c/regions.cpp | 2 +
src/api/c/reorder.cpp | 2 +
src/api/c/replace.cpp | 113 +
src/api/c/resize.cpp | 7 +-
src/api/c/rgb_gray.cpp | 2 +
src/api/c/rotate.cpp | 7 +-
src/api/c/sat.cpp | 59 +
src/api/c/scan.cpp | 4 +
src/api/c/select.cpp | 162 +
src/api/c/set.cpp | 12 +
src/api/c/shift.cpp | 4 +
src/api/c/sift.cpp | 132 +
src/api/c/sobel.cpp | 2 +
src/api/c/sort.cpp | 16 +
src/api/c/stats.h | 45 +-
src/api/c/stdev.cpp | 24 +-
src/api/c/stream.cpp | 374 +
src/api/c/surface.cpp | 135 +
src/api/c/susan.cpp | 78 +
src/api/c/svd.cpp | 128 +
src/api/c/tile.cpp | 32 +-
src/api/c/transform.cpp | 50 +-
src/api/c/transform_coordinates.cpp | 96 +
src/api/c/transpose.cpp | 20 +-
src/api/c/type_util.cpp | 10 +-
src/api/c/unary.cpp | 92 +-
src/api/c/unwrap.cpp | 66 +
src/api/c/util.cpp | 81 +
src/api/c/var.cpp | 18 +-
src/api/c/version.cpp | 16 +
src/api/c/where.cpp | 2 +
src/api/c/wrap.cpp | 80 +
src/api/c/ycbcr_rgb.cpp | 160 +
src/api/cpp/array.cpp | 144 +-
src/api/cpp/complex.cpp | 61 +-
src/api/cpp/constants.cpp | 8 +
src/api/cpp/corrcoef.cpp | 4 +
src/api/cpp/data.cpp | 34 +
src/api/cpp/device.cpp | 88 +-
src/api/cpp/dog.cpp | 24 +
src/api/cpp/error.hpp | 18 +-
src/api/cpp/exception.cpp | 16 +-
src/api/cpp/features.cpp | 1 -
src/api/cpp/fft.cpp | 137 +
src/api/cpp/gfor.cpp | 2 +-
src/api/cpp/graphics.cpp | 37 +
src/api/cpp/hamming.cpp | 2 +-
src/api/cpp/harris.cpp | 27 +
src/api/cpp/homography.cpp | 32 +
src/api/cpp/imageio.cpp | 38 +
src/api/cpp/index.cpp | 43 +
src/api/cpp/lapack.cpp | 25 +
src/api/cpp/mean.cpp | 4 +
src/api/cpp/median.cpp | 6 +-
src/api/cpp/nearest_neighbour.cpp | 29 +
src/api/cpp/reduce.cpp | 90 +-
src/api/cpp/sat.cpp | 24 +
src/api/cpp/seq.cpp | 14 +-
src/api/cpp/sift.cpp | 51 +
src/api/cpp/stdev.cpp | 4 +
src/api/cpp/susan.cpp | 25 +
src/api/cpp/timing.cpp | 8 +-
src/api/cpp/transform_coordinates.cpp | 24 +
src/api/cpp/unary.cpp | 1 +
src/api/cpp/unwrap.cpp | 23 +
src/api/cpp/util.cpp | 50 +-
src/api/cpp/var.cpp | 2 +
src/api/cpp/where.cpp | 2 +-
src/api/cpp/wrap.cpp | 27 +
src/api/cpp/ycbcr_rgb.cpp | 31 +
src/api/unified/CMakeLists.txt | 72 +
src/api/unified/algorithm.cpp | 148 +
src/api/unified/arith.cpp | 102 +
src/api/unified/array.cpp | 108 +
src/api/unified/blas.cpp | 40 +
src/api/unified/data.cpp | 180 +
src/api/unified/device.cpp | 178 +
src/api/unified/features.cpp | 44 +
src/api/unified/graphics.cpp | 95 +
src/api/unified/image.cpp | 264 +
src/api/unified/index.cpp | 54 +
src/api/unified/lapack.cpp | 103 +
src/api/unified/signal.cpp | 143 +
src/api/unified/statistics.cpp | 96 +
src/api/unified/symbol_manager.cpp | 241 +
src/api/unified/symbol_manager.hpp | 110 +
src/api/unified/util.cpp | 63 +
src/api/unified/vision.cpp | 86 +
src/backend/ArrayInfo.cpp | 82 +
src/backend/ArrayInfo.hpp | 42 +-
src/backend/MemoryManager.cpp | 308 +
src/backend/MemoryManager.hpp | 119 +
src/backend/cblas.cpp | 105 +-
src/backend/cpu/Array.cpp | 377 +-
src/backend/cpu/Array.hpp | 97 +-
src/backend/cpu/CMakeLists.txt | 75 +-
src/backend/cpu/TNJ/BinaryNode.hpp | 19 +-
src/backend/cpu/TNJ/BufferNode.hpp | 45 +-
src/backend/cpu/TNJ/Node.hpp | 10 +-
src/backend/cpu/TNJ/ScalarNode.hpp | 9 +-
src/backend/cpu/TNJ/UnaryNode.hpp | 15 +-
src/backend/cpu/approx.cpp | 380 +-
src/backend/cpu/assign.cpp | 91 +-
src/backend/cpu/bilateral.cpp | 76 +-
src/backend/cpu/blas.cpp | 270 +-
src/backend/cpu/blas.hpp | 13 +
src/backend/cpu/cast.hpp | 17 +
src/backend/cpu/cholesky.cpp | 16 +-
src/backend/cpu/convolve.cpp | 259 +-
src/backend/cpu/copy.cpp | 255 +-
src/backend/cpu/copy.hpp | 3 +
src/backend/cpu/diagonal.cpp | 92 +-
src/backend/cpu/diff.cpp | 127 +-
src/backend/cpu/err_cpu.hpp | 5 +-
src/backend/cpu/exampleFunction.cpp | 7 +
src/backend/cpu/fast.cpp | 236 +-
src/backend/cpu/fft.cpp | 157 +-
src/backend/cpu/fft.hpp | 10 +-
src/backend/cpu/fftconvolve.cpp | 426 +-
src/backend/cpu/gradient.cpp | 86 +-
src/backend/cpu/hamming.cpp | 103 -
src/backend/cpu/hamming.hpp | 20 -
src/backend/cpu/harris.cpp | 145 +
src/backend/cpu/harris.hpp | 23 +
src/backend/cpu/hist_graphics.cpp | 6 +
src/backend/cpu/histogram.cpp | 40 +-
src/backend/cpu/histogram.hpp | 2 +-
src/backend/cpu/homography.cpp | 389 +
src/backend/cpu/homography.hpp | 22 +
src/backend/cpu/hsv_rgb.cpp | 102 +-
src/backend/cpu/identity.cpp | 55 +-
src/backend/cpu/iir.cpp | 86 +-
src/backend/cpu/image.cpp | 56 +-
src/backend/cpu/index.cpp | 80 +-
src/backend/cpu/inverse.cpp | 13 +-
src/backend/cpu/iota.cpp | 65 +-
src/backend/cpu/ireduce.cpp | 228 +-
src/backend/cpu/join.cpp | 313 +-
src/backend/cpu/kernel/Array.hpp | 58 +
src/backend/cpu/kernel/approx1.hpp | 144 +
src/backend/cpu/kernel/approx2.hpp | 170 +
src/backend/cpu/kernel/assign.hpp | 80 +
src/backend/cpu/kernel/bilateral.hpp | 81 +
src/backend/cpu/kernel/convolve.hpp | 267 +
src/backend/cpu/kernel/copy.hpp | 90 +
src/backend/cpu/kernel/diagonal.hpp | 67 +
src/backend/cpu/kernel/diff.hpp | 86 +
src/backend/cpu/kernel/dot.hpp | 46 +
src/backend/cpu/kernel/fast.hpp | 224 +
src/backend/cpu/kernel/fft.hpp | 192 +
src/backend/cpu/kernel/fftconvolve.hpp | 256 +
src/backend/cpu/kernel/gradient.hpp | 89 +
src/backend/cpu/kernel/harris.hpp | 124 +
src/backend/cpu/kernel/histogram.hpp | 49 +
src/backend/cpu/kernel/hsv_rgb.hpp | 124 +
src/backend/cpu/kernel/identity.hpp | 37 +
src/backend/cpu/kernel/iir.hpp | 61 +
src/backend/cpu/kernel/index.hpp | 71 +
src/backend/cpu/kernel/iota.hpp | 45 +
src/backend/cpu/kernel/ireduce.hpp | 108 +
src/backend/cpu/kernel/join.hpp | 144 +
src/backend/cpu/kernel/lookup.hpp | 62 +
src/backend/cpu/kernel/lu.hpp | 80 +
src/backend/cpu/kernel/match_template.hpp | 141 +
src/backend/cpu/kernel/meanshift.hpp | 135 +
src/backend/cpu/kernel/medfilt.hpp | 135 +
src/backend/cpu/kernel/morph.hpp | 140 +
src/backend/cpu/kernel/nearest_neighbour.hpp | 143 +
src/backend/cpu/kernel/orb.hpp | 509 +
src/backend/cpu/kernel/random.hpp | 200 +
src/backend/cpu/kernel/range.hpp | 52 +
src/backend/cpu/kernel/reduce.hpp | 71 +
src/backend/cpu/kernel/regions.hpp | 194 +
src/backend/cpu/kernel/reorder.hpp | 55 +
src/backend/cpu/kernel/resize.hpp | 177 +
src/backend/cpu/kernel/rotate.hpp | 84 +
src/backend/cpu/kernel/scan.hpp | 72 +
src/backend/cpu/kernel/select.hpp | 124 +
src/backend/cpu/kernel/shift.hpp | 69 +
src/backend/cpu/kernel/sobel.hpp | 86 +
src/backend/cpu/kernel/sort.hpp | 51 +
src/backend/cpu/kernel/sort_by_key.hpp | 85 +
src/backend/cpu/kernel/sort_index.hpp | 70 +
src/backend/cpu/kernel/susan.hpp | 99 +
src/backend/cpu/kernel/tile.hpp | 55 +
src/backend/cpu/kernel/transform.hpp | 131 +
src/backend/cpu/kernel/transpose.hpp | 122 +
src/backend/cpu/kernel/triangle.hpp | 61 +
src/backend/cpu/kernel/unwrap.hpp | 81 +
src/backend/cpu/kernel/wrap.hpp | 80 +
src/backend/cpu/lookup.cpp | 60 +-
src/backend/cpu/lu.cpp | 138 +-
src/backend/cpu/lu.hpp | 2 +
src/backend/cpu/match_template.cpp | 128 +-
src/backend/cpu/math.cpp | 75 +-
src/backend/cpu/math.hpp | 3 -
src/backend/cpu/meanshift.cpp | 123 +-
src/backend/cpu/medfilt.cpp | 116 +-
src/backend/cpu/memory.cpp | 328 +-
src/backend/cpu/memory.hpp | 17 +-
src/backend/cpu/morph.cpp | 132 +-
src/backend/cpu/nearest_neighbour.cpp | 77 +
src/backend/cpu/nearest_neighbour.hpp | 21 +
src/backend/cpu/orb.cpp | 543 +-
src/backend/cpu/platform.cpp | 257 +-
src/backend/cpu/platform.hpp | 14 +
src/backend/cpu/plot.cpp | 51 +-
src/backend/cpu/plot3.cpp | 51 +
src/backend/cpu/plot3.hpp | 22 +
src/backend/cpu/qr.cpp | 60 +-
src/backend/cpu/queue.hpp | 46 +
src/backend/cpu/random.cpp | 144 +-
src/backend/cpu/range.cpp | 88 +-
src/backend/cpu/reduce.cpp | 318 +-
src/backend/cpu/reduce.hpp | 6 +-
src/backend/cpu/regions.cpp | 178 +-
src/backend/cpu/reorder.cpp | 79 +-
src/backend/cpu/resize.cpp | 202 +-
src/backend/cpu/rotate.cpp | 123 +-
src/backend/cpu/scan.cpp | 109 +-
src/backend/cpu/select.cpp | 66 +
src/backend/cpu/select.hpp | 19 +
src/backend/cpu/set.cpp | 166 +-
src/backend/cpu/shift.cpp | 84 +-
src/backend/cpu/sift.cpp | 67 +
src/backend/cpu/sift.hpp | 27 +
src/backend/cpu/sobel.cpp | 76 +-
src/backend/cpu/solve.cpp | 136 +-
src/backend/cpu/sort.cpp | 83 +-
src/backend/cpu/sort_by_key.cpp | 134 +-
src/backend/cpu/sort_index.cpp | 106 +-
src/backend/cpu/surface.cpp | 51 +
src/backend/cpu/surface.hpp | 22 +
src/backend/cpu/susan.cpp | 80 +
src/backend/cpu/susan.hpp | 24 +
src/backend/cpu/svd.cpp | 135 +
src/backend/cpu/svd.hpp | 20 +
src/backend/cpu/threads | 1 +
src/backend/cpu/tile.cpp | 76 +-
src/backend/cpu/transform.cpp | 156 +-
src/backend/cpu/transform.hpp | 2 +-
src/backend/cpu/transform_interp.hpp | 102 +-
src/backend/cpu/transpose.cpp | 121 +-
src/backend/cpu/triangle.cpp | 63 +-
src/backend/cpu/types.hpp | 1 +
src/backend/cpu/unary.hpp | 7 +
src/backend/cpu/unwrap.cpp | 67 +
src/backend/cpu/unwrap.hpp | 17 +
src/backend/cpu/utility.hpp | 63 +
src/backend/cpu/where.cpp | 81 +-
src/backend/cpu/wrap.cpp | 67 +
src/backend/cpu/wrap.hpp | 21 +
src/backend/cuda/Array.cpp | 70 +-
src/backend/cuda/Array.hpp | 74 +-
src/backend/cuda/CMakeLists.txt | 96 +-
src/backend/cuda/JIT/arith.cu | 2 +
src/backend/cuda/JIT/cast.cu | 22 +-
src/backend/cuda/JIT/exp.cu | 19 +
src/backend/cuda/JIT/hyper.cu | 4 +
src/backend/cuda/JIT/logic.cu | 10 +
src/backend/cuda/JIT/numeric.cu | 63 +-
src/backend/cuda/JIT/trig.cu | 8 +
src/backend/cuda/JIT/types.h | 1 +
src/backend/cuda/all.cu | 4 +
src/backend/cuda/any.cu | 4 +
src/backend/cuda/assign.cu | 8 +-
src/backend/cuda/bilateral.cu | 2 +
src/backend/cuda/blas.cpp | 82 +-
src/backend/cuda/cholesky.cu | 28 +
src/backend/cuda/complex.hpp | 22 +-
src/backend/cuda/convolve.cpp | 8 +-
src/backend/cuda/copy.cu | 53 +-
src/backend/cuda/copy.hpp | 3 +
src/backend/cuda/count.cu | 4 +
src/backend/cuda/cpu_lapack/cpu_cholesky.cpp | 109 +
src/backend/cuda/cpu_lapack/cpu_cholesky.hpp | 22 +
src/backend/cuda/cpu_lapack/cpu_inverse.cpp | 92 +
src/backend/cuda/cpu_lapack/cpu_inverse.hpp | 19 +
src/backend/cuda/cpu_lapack/cpu_lu.cpp | 197 +
src/backend/cuda/cpu_lapack/cpu_lu.hpp | 22 +
src/backend/cuda/cpu_lapack/cpu_qr.cpp | 160 +
src/backend/cuda/cpu_lapack/cpu_qr.hpp | 22 +
src/backend/cuda/cpu_lapack/cpu_solve.cpp | 206 +
src/backend/cuda/cpu_lapack/cpu_solve.hpp | 23 +
src/backend/cuda/cpu_lapack/cpu_svd.cpp | 153 +
src/backend/cuda/cpu_lapack/cpu_svd.hpp | 22 +
src/backend/cuda/cpu_lapack/cpu_triangle.hpp | 52 +
src/backend/cuda/cpu_lapack/lapack_helper.hpp | 35 +
src/backend/cuda/cublasManager.cpp | 1 +
src/backend/cuda/debug_cuda.hpp | 48 +-
src/backend/cuda/diagonal.cu | 2 +
src/backend/cuda/diff.cu | 4 +
src/backend/cuda/dilate.cu | 2 +
src/backend/cuda/dilate3d.cu | 2 +
src/backend/cuda/driver.cpp | 2 +-
src/backend/cuda/erode.cu | 2 +
src/backend/cuda/erode3d.cu | 2 +
src/backend/cuda/err_cuda.hpp | 37 +-
src/backend/cuda/fast.cu | 2 +
src/backend/cuda/fast_pyramid.cu | 2 +
src/backend/cuda/fft.cpp | 155 +-
src/backend/cuda/fft.hpp | 12 +-
src/backend/cuda/fftconvolve.cu | 18 +-
src/backend/cuda/hamming.cu | 62 -
src/backend/cuda/hamming.hpp | 22 -
src/backend/cuda/harris.cu | 59 +
src/backend/cuda/harris.hpp | 23 +
src/backend/cuda/hist_graphics.cu | 5 +-
src/backend/cuda/histogram.cu | 23 +-
src/backend/cuda/histogram.hpp | 2 +-
src/backend/cuda/homography.cu | 77 +
src/backend/cuda/homography.hpp | 22 +
src/backend/cuda/identity.cu | 2 +
src/backend/cuda/iir.cu | 4 +-
src/backend/cuda/image.cu | 5 +-
src/backend/cuda/index.cu | 6 +-
src/backend/cuda/interopManager.cu | 43 +-
src/backend/cuda/interopManager.hpp | 2 +
src/backend/cuda/inverse.cu | 22 +
src/backend/cuda/iota.cu | 4 +
src/backend/cuda/ireduce.cu | 8 +
src/backend/cuda/jit.cpp | 70 +-
src/backend/cuda/join.cu | 22 +-
src/backend/cuda/kernel/approx.hpp | 125 +-
src/backend/cuda/kernel/assign.hpp | 2 +-
src/backend/cuda/kernel/atomics.hpp | 59 +
src/backend/cuda/kernel/bilateral.hpp | 31 +-
src/backend/cuda/kernel/convolve.cu | 53 +-
src/backend/cuda/kernel/convolve_separable.cu | 17 +-
src/backend/cuda/kernel/diagonal.hpp | 4 +-
src/backend/cuda/kernel/diff.hpp | 4 +-
src/backend/cuda/kernel/exampleFunction.hpp | 11 +-
src/backend/cuda/kernel/fast.hpp | 118 +-
src/backend/cuda/kernel/fast_pyramid.hpp | 6 +-
src/backend/cuda/kernel/fftconvolve.hpp | 58 +-
src/backend/cuda/kernel/gradient.hpp | 3 +-
src/backend/cuda/kernel/hamming.hpp | 478 -
src/backend/cuda/kernel/harris.hpp | 385 +
src/backend/cuda/kernel/histogram.hpp | 34 +-
src/backend/cuda/kernel/homography.hpp | 721 ++
src/backend/cuda/kernel/hsv_rgb.hpp | 2 +-
src/backend/cuda/kernel/identity.hpp | 19 +-
src/backend/cuda/kernel/iir.hpp | 2 +-
src/backend/cuda/kernel/index.hpp | 2 +-
src/backend/cuda/kernel/iota.hpp | 5 +-
src/backend/cuda/kernel/ireduce.hpp | 40 +-
src/backend/cuda/kernel/join.hpp | 6 +-
src/backend/cuda/kernel/lookup.hpp | 4 +-
src/backend/cuda/kernel/lu_split.hpp | 4 +-
src/backend/cuda/kernel/match_template.hpp | 3 +-
src/backend/cuda/kernel/meanshift.hpp | 40 +-
src/backend/cuda/kernel/medfilt.hpp | 49 +-
src/backend/cuda/kernel/memcopy.hpp | 14 +-
src/backend/cuda/kernel/morph.hpp | 157 +-
src/backend/cuda/kernel/nearest_neighbour.hpp | 547 +
src/backend/cuda/kernel/orb.hpp | 91 +-
src/backend/cuda/kernel/random.hpp | 117 +-
src/backend/cuda/kernel/range.hpp | 2 +-
src/backend/cuda/kernel/reduce.hpp | 125 +-
src/backend/cuda/kernel/regions.hpp | 36 +-
src/backend/cuda/kernel/reorder.hpp | 5 +-
src/backend/cuda/kernel/resize.hpp | 28 +-
src/backend/cuda/kernel/rotate.hpp | 10 +-
src/backend/cuda/kernel/scan_dim.hpp | 10 +-
src/backend/cuda/kernel/scan_first.hpp | 11 +-
src/backend/cuda/kernel/select.hpp | 155 +
src/backend/cuda/kernel/shared.hpp | 4 +
src/backend/cuda/kernel/shift.hpp | 5 +-
src/backend/cuda/kernel/sobel.hpp | 36 +-
src/backend/cuda/kernel/sort.hpp | 5 +-
src/backend/cuda/kernel/sort_by_key.hpp | 6 +-
src/backend/cuda/kernel/sort_index.hpp | 12 +-
src/backend/cuda/kernel/susan.hpp | 182 +
src/backend/cuda/kernel/tile.hpp | 2 +-
src/backend/cuda/kernel/transform.hpp | 84 +-
src/backend/cuda/kernel/transform_interp.hpp | 101 +-
src/backend/cuda/kernel/transpose.hpp | 4 +-
src/backend/cuda/kernel/transpose_inplace.hpp | 4 +-
src/backend/cuda/kernel/triangle.hpp | 11 +-
src/backend/cuda/kernel/unwrap.hpp | 144 +
src/backend/cuda/kernel/where.hpp | 9 +-
src/backend/cuda/kernel/wrap.hpp | 113 +
src/backend/cuda/lookup.cu | 6 +
src/backend/cuda/lu.cu | 45 +
src/backend/cuda/lu.hpp | 2 +
src/backend/cuda/match_template.cu | 2 +
src/backend/cuda/math.hpp | 12 +
src/backend/cuda/max.cu | 4 +
src/backend/cuda/meanshift.cu | 4 +
src/backend/cuda/medfilt.cu | 2 +
src/backend/cuda/memory.cpp | 469 +-
src/backend/cuda/memory.hpp | 17 +-
src/backend/cuda/min.cu | 4 +
src/backend/cuda/morph3d_impl.hpp | 5 +-
src/backend/cuda/morph_impl.hpp | 5 +-
src/backend/cuda/nearest_neighbour.cu | 81 +
src/backend/cuda/nearest_neighbour.hpp | 23 +
src/backend/cuda/platform.cpp | 211 +-
src/backend/cuda/platform.hpp | 18 +
src/backend/cuda/plot.cu | 5 +-
src/backend/cuda/plot3.cu | 61 +
src/backend/cuda/plot3.hpp | 22 +
src/backend/cuda/product.cu | 6 +-
src/backend/cuda/qr.cu | 29 +
src/backend/cuda/random.cu | 14 +-
src/backend/cuda/range.cu | 4 +
src/backend/cuda/reduce.hpp | 6 +-
src/backend/cuda/reduce_impl.hpp | 13 +-
src/backend/cuda/regions.cu | 2 +
src/backend/cuda/reorder.cu | 2 +
src/backend/cuda/resize.cu | 5 +
src/backend/cuda/rotate.cu | 11 +-
src/backend/cuda/scan.cu | 4 +
src/backend/cuda/select.cu | 53 +
src/backend/cuda/select.hpp | 19 +
src/backend/cuda/set.cu | 21 +-
src/backend/cuda/shift.cu | 4 +
src/backend/cuda/sift.cu | 95 +
src/backend/cuda/sift.hpp | 27 +
src/backend/cuda/sobel.cu | 2 +
src/backend/cuda/solve.cu | 31 +
src/backend/cuda/sort.cu | 4 +
src/backend/cuda/sort_by_key/ascd_s16.cu | 15 +
src/backend/cuda/sort_by_key/ascd_s64.cu | 15 +
src/backend/cuda/sort_by_key/ascd_u16.cu | 15 +
src/backend/cuda/sort_by_key/ascd_u64.cu | 15 +
src/backend/cuda/sort_by_key/desc_s16.cu | 15 +
src/backend/cuda/sort_by_key/desc_s64.cu | 15 +
src/backend/cuda/sort_by_key/desc_u16.cu | 15 +
src/backend/cuda/sort_by_key/desc_u64.cu | 15 +
src/backend/cuda/sort_by_key_impl.hpp | 6 +-
src/backend/cuda/sort_index.cu | 4 +
src/backend/cuda/sum.cu | 12 +
src/backend/cuda/surface.cu | 61 +
src/backend/cuda/surface.hpp | 22 +
src/backend/cuda/susan.cu | 75 +
src/backend/cuda/susan.hpp | 24 +
src/backend/cuda/svd.cu | 183 +
src/backend/cuda/svd.hpp | 20 +
src/backend/cuda/tile.cu | 4 +
src/backend/cuda/transform.cu | 20 +-
src/backend/cuda/transform.hpp | 3 +-
src/backend/cuda/transpose.cu | 2 +
src/backend/cuda/transpose_inplace.cu | 2 +
src/backend/cuda/triangle.cu | 2 +
src/backend/cuda/types.cpp | 8 +
src/backend/cuda/types.hpp | 3 +-
src/backend/cuda/unary.hpp | 1 +
src/backend/cuda/unwrap.cu | 59 +
src/backend/cuda/unwrap.hpp | 17 +
src/backend/cuda/where.cu | 2 +
src/backend/cuda/wrap.cu | 59 +
src/backend/cuda/wrap.hpp | 21 +
src/backend/defines.hpp | 24 +-
src/backend/dim4.cpp | 67 +-
src/{api/c => backend}/dispatch.cpp | 0
src/{api/c => backend}/dispatch.hpp | 0
src/backend/host_memory.cpp | 113 +
src/backend/host_memory.hpp | 18 +
src/backend/lapacke.cpp | 168 +
src/backend/lapacke.hpp | 93 +-
src/backend/opencl/Array.cpp | 68 +-
src/backend/opencl/Array.hpp | 104 +-
src/backend/opencl/CMakeLists.txt | 157 +-
src/backend/opencl/all.cpp | 4 +
src/backend/opencl/any.cpp | 4 +
src/backend/opencl/assign.cpp | 6 +-
src/backend/opencl/bilateral.cpp | 2 +
src/backend/opencl/binary.hpp | 2 +-
src/backend/opencl/blas.cpp | 89 +-
src/backend/opencl/cache.hpp | 27 +
src/backend/opencl/cholesky.cpp | 13 +-
src/backend/opencl/cl.hpp | 10803 ++++++++++---------
src/backend/opencl/convolve.cpp | 8 +-
src/backend/opencl/convolve_separable.cpp | 48 +-
src/backend/opencl/copy.cpp | 31 +-
src/backend/opencl/copy.hpp | 3 +
src/backend/opencl/count.cpp | 4 +
src/backend/opencl/cpu/cpu_blas.cpp | 210 +
src/backend/opencl/cpu/cpu_blas.hpp | 20 +
src/backend/opencl/cpu/cpu_cholesky.cpp | 84 +
src/backend/opencl/cpu/cpu_cholesky.hpp | 22 +
src/backend/opencl/cpu/cpu_helper.hpp | 74 +
src/backend/opencl/cpu/cpu_inverse.cpp | 76 +
src/backend/opencl/cpu/cpu_inverse.hpp | 19 +
src/backend/opencl/cpu/cpu_lu.cpp | 178 +
src/backend/opencl/cpu/cpu_lu.hpp | 22 +
src/backend/opencl/cpu/cpu_qr.cpp | 118 +
src/backend/opencl/cpu/cpu_qr.hpp | 22 +
src/backend/opencl/cpu/cpu_solve.cpp | 176 +
src/backend/opencl/cpu/cpu_solve.hpp | 23 +
src/backend/opencl/cpu/cpu_svd.cpp | 112 +
src/backend/opencl/cpu/cpu_svd.hpp | 22 +
src/backend/opencl/cpu/cpu_triangle.hpp | 57 +
src/backend/opencl/debug_opencl.hpp | 7 +-
src/backend/opencl/diagonal.cpp | 2 +
src/backend/opencl/diff.cpp | 4 +
src/backend/opencl/dilate.cpp | 2 +
src/backend/opencl/dilate3d.cpp | 2 +
src/backend/opencl/erode.cpp | 2 +
src/backend/opencl/erode3d.cpp | 2 +
src/backend/opencl/err_opencl.hpp | 27 +-
src/backend/opencl/fast.cpp | 2 +
src/backend/opencl/fft.cpp | 225 +-
src/backend/opencl/fft.hpp | 12 +-
src/backend/opencl/fftconvolve.cpp | 16 +-
src/backend/opencl/hamming.cpp | 143 -
src/backend/opencl/hamming.hpp | 22 -
src/backend/opencl/harris.cpp | 57 +
src/backend/opencl/harris.hpp | 23 +
src/backend/opencl/hist_graphics.cpp | 2 +
src/backend/opencl/histogram.cpp | 23 +-
src/backend/opencl/histogram.hpp | 2 +-
src/backend/opencl/homography.cpp | 94 +
src/backend/opencl/homography.hpp | 22 +
src/backend/opencl/identity.cpp | 2 +
src/backend/opencl/iir.cpp | 4 +-
src/backend/opencl/image.cpp | 6 +-
src/backend/opencl/index.cpp | 6 +-
src/backend/opencl/interopManager.cpp | 24 +
src/backend/opencl/interopManager.hpp | 2 +
src/backend/opencl/inverse.cpp | 6 +
src/backend/opencl/iota.cpp | 4 +
src/backend/opencl/ireduce.cpp | 8 +
src/backend/opencl/jit.cpp | 54 +-
src/backend/opencl/join.cpp | 4 +
src/backend/opencl/kernel/approx.hpp | 30 +-
src/backend/opencl/kernel/approx1.cl | 51 +-
src/backend/opencl/kernel/approx2.cl | 69 +-
src/backend/opencl/kernel/assign.cl | 5 +-
src/backend/opencl/kernel/bilateral.cl | 32 +-
src/backend/opencl/kernel/convolve.hpp | 7 +-
src/backend/opencl/kernel/convolve/conv1.cpp | 4 +
src/backend/opencl/kernel/convolve/conv2_impl.hpp | 117 +-
src/backend/opencl/kernel/convolve/conv2_s16.cpp | 23 +
src/backend/opencl/kernel/convolve/conv2_s64.cpp | 23 +
src/backend/opencl/kernel/convolve/conv2_u16.cpp | 23 +
src/backend/opencl/kernel/convolve/conv2_u64.cpp | 23 +
src/backend/opencl/kernel/convolve/conv3.cpp | 4 +
src/backend/opencl/kernel/convolve_separable.cpp | 135 +
src/backend/opencl/kernel/convolve_separable.hpp | 79 +-
src/backend/opencl/kernel/fast.cl | 2 +-
src/backend/opencl/kernel/fast.hpp | 100 +-
src/backend/opencl/kernel/fftconvolve.hpp | 24 +-
src/backend/opencl/kernel/fftconvolve_multiply.cl | 6 +-
src/backend/opencl/kernel/fftconvolve_reorder.cl | 15 +-
src/backend/opencl/kernel/gradient.cl | 3 +-
src/backend/opencl/kernel/gradient.hpp | 7 +-
src/backend/opencl/kernel/hamming.cl | 360 -
src/backend/opencl/kernel/hamming.hpp | 138 -
src/backend/opencl/kernel/harris.cl | 117 +
src/backend/opencl/kernel/harris.hpp | 343 +
src/backend/opencl/kernel/histogram.cl | 24 +-
src/backend/opencl/kernel/histogram.hpp | 15 +-
src/backend/opencl/kernel/homography.cl | 519 +
src/backend/opencl/kernel/homography.hpp | 257 +
src/backend/opencl/kernel/hsv_rgb.cl | 2 +-
src/backend/opencl/kernel/iir.cl | 4 +-
src/backend/opencl/kernel/index.cl | 5 +-
src/backend/opencl/kernel/iops.cl | 24 +-
src/backend/opencl/kernel/ireduce.hpp | 210 +-
src/backend/opencl/kernel/jit.cl | 2 +
src/backend/opencl/kernel/lookup.cl | 5 +-
src/backend/opencl/kernel/matchTemplate.cl | 2 +-
src/backend/opencl/kernel/meanshift.cl | 36 +-
src/backend/opencl/kernel/medfilt.cl | 37 +-
src/backend/opencl/kernel/morph.cl | 128 +-
src/backend/opencl/kernel/nearest_neighbour.cl | 384 +
src/backend/opencl/kernel/nearest_neighbour.hpp | 173 +
src/backend/opencl/kernel/ops.cl | 8 +-
src/backend/opencl/kernel/orb.hpp | 44 +-
src/backend/opencl/kernel/random.cl | 45 +-
src/backend/opencl/kernel/random.hpp | 4 +
src/backend/opencl/kernel/reduce.hpp | 262 +-
src/backend/opencl/kernel/reduce_dim.cl | 4 +-
src/backend/opencl/kernel/reduce_first.cl | 4 +-
src/backend/opencl/kernel/regions.cl | 6 +-
src/backend/opencl/kernel/regions.hpp | 5 +
src/backend/opencl/kernel/resize.cl | 21 +
src/backend/opencl/kernel/resize.hpp | 3 +-
src/backend/opencl/kernel/rotate.cl | 1 +
src/backend/opencl/kernel/rotate.hpp | 11 +-
src/backend/opencl/kernel/scan_dim.cl | 2 +
src/backend/opencl/kernel/scan_dim.hpp | 207 +-
src/backend/opencl/kernel/scan_first.hpp | 196 +-
src/backend/opencl/kernel/select.cl | 97 +
src/backend/opencl/kernel/select.hpp | 175 +
src/backend/opencl/kernel/set.cl | 20 -
src/backend/opencl/kernel/set.hpp | 67 -
src/backend/opencl/kernel/sobel.cl | 30 +-
src/backend/opencl/kernel/sort.hpp | 30 +-
src/backend/opencl/kernel/sort_by_key.hpp | 26 +-
src/backend/opencl/kernel/sort_index.hpp | 30 +-
src/backend/opencl/kernel/susan.cl | 116 +
src/backend/opencl/kernel/susan.hpp | 145 +
src/backend/opencl/kernel/transform.cl | 33 +-
src/backend/opencl/kernel/transform.hpp | 23 +-
src/backend/opencl/kernel/transform_interp.cl | 91 +-
src/backend/opencl/kernel/unwrap.cl | 77 +
src/backend/opencl/kernel/unwrap.hpp | 117 +
src/backend/opencl/kernel/where.cl | 2 +-
src/backend/opencl/kernel/where.hpp | 11 +-
src/backend/opencl/kernel/wrap.cl | 74 +
src/backend/opencl/kernel/wrap.hpp | 112 +
src/backend/opencl/lookup.cpp | 6 +
src/backend/opencl/lu.cpp | 21 +-
src/backend/opencl/lu.hpp | 2 +
src/backend/opencl/magma/gebrd.cpp | 368 +
src/backend/opencl/magma/geqrf2.cpp | 19 +-
src/backend/opencl/magma/geqrf3.cpp | 19 +-
src/backend/opencl/magma/getrf.cpp | 130 +-
src/backend/opencl/magma/getrs.cpp | 34 +-
src/backend/opencl/magma/labrd.cpp | 675 ++
src/backend/opencl/magma/larfb.cpp | 125 +-
src/backend/opencl/magma/laset.cpp | 44 +
src/backend/opencl/magma/laset_band.cpp | 44 +
src/backend/opencl/magma/laswp.cpp | 44 +
src/backend/opencl/magma/magma.h | 21 +
src/backend/opencl/magma/magma_blas.h | 57 +-
src/backend/opencl/magma/magma_cpu_blas.h | 104 +
src/backend/opencl/magma/magma_cpu_lapack.h | 143 +-
src/backend/opencl/magma/magma_helper.cpp | 35 +
src/backend/opencl/magma/magma_helper.h | 3 +
src/backend/opencl/magma/potrf.cpp | 134 +-
src/backend/opencl/magma/transpose.cpp | 44 +
src/backend/opencl/magma/transpose_inplace.cpp | 44 +
src/backend/opencl/magma/ungqr.cpp | 11 +-
src/backend/opencl/magma/unmqr.cpp | 11 +-
src/backend/opencl/magma/unmqr2.cpp | 10 +-
src/backend/opencl/match_template.cpp | 2 +
src/backend/opencl/math.hpp | 15 +
src/backend/opencl/max.cpp | 4 +
src/backend/opencl/meanshift.cpp | 4 +
src/backend/opencl/medfilt.cpp | 2 +
src/backend/opencl/memory.cpp | 478 +-
src/backend/opencl/memory.hpp | 21 +-
src/backend/opencl/min.cpp | 4 +
src/backend/opencl/nearest_neighbour.cpp | 114 +
src/backend/opencl/nearest_neighbour.hpp | 23 +
src/backend/opencl/platform.cpp | 543 +-
src/backend/opencl/platform.hpp | 52 +-
src/backend/opencl/plot.cpp | 2 +
src/backend/opencl/plot.hpp | 1 -
src/backend/opencl/plot3.cpp | 72 +
src/backend/opencl/plot3.hpp | 22 +
src/backend/opencl/product.cpp | 4 +
src/backend/opencl/program.hpp | 5 +-
src/backend/opencl/qr.cpp | 19 +-
src/backend/opencl/random.cpp | 4 +
src/backend/opencl/range.cpp | 4 +
src/backend/opencl/reduce.hpp | 5 +-
src/backend/opencl/reduce_impl.hpp | 13 +-
src/backend/opencl/regions.cpp | 2 +
src/backend/opencl/reorder.cpp | 2 +
src/backend/opencl/resize.cpp | 5 +
src/backend/opencl/rotate.cpp | 14 +-
src/backend/opencl/scan.cpp | 16 +-
src/backend/opencl/select.cpp | 54 +
src/backend/opencl/select.hpp | 19 +
src/backend/opencl/set.cpp | 48 +-
src/backend/opencl/shift.cpp | 4 +
src/backend/opencl/sift.cpp | 87 +
src/backend/opencl/sift.hpp | 27 +
src/backend/opencl/sobel.cpp | 2 +
src/backend/opencl/solve.cpp | 99 +-
src/backend/opencl/sort.cpp | 4 +
src/backend/opencl/sort_by_key.cpp | 61 -
src/backend/opencl/sort_by_key/b8.cpp | 16 +
src/backend/opencl/sort_by_key/f32.cpp | 16 +
src/backend/opencl/sort_by_key/f64.cpp | 16 +
src/backend/opencl/sort_by_key/impl.hpp | 57 +
src/backend/opencl/sort_by_key/s16.cpp | 16 +
src/backend/opencl/sort_by_key/s32.cpp | 16 +
src/backend/opencl/sort_by_key/s64.cpp | 16 +
src/backend/opencl/sort_by_key/u16.cpp | 16 +
src/backend/opencl/sort_by_key/u32.cpp | 16 +
src/backend/opencl/sort_by_key/u64.cpp | 16 +
src/backend/opencl/sort_by_key/u8.cpp | 16 +
src/backend/opencl/sort_index.cpp | 4 +
src/backend/opencl/sum.cpp | 12 +
src/backend/opencl/surface.cpp | 75 +
src/backend/opencl/surface.hpp | 23 +
src/backend/opencl/susan.cpp | 84 +
src/backend/opencl/susan.hpp | 24 +
src/backend/opencl/svd.cpp | 270 +
src/backend/opencl/svd.hpp | 19 +
src/backend/opencl/tile.cpp | 4 +
src/backend/opencl/traits.hpp | 10 -
src/backend/opencl/transform.cpp | 111 +-
src/backend/opencl/transform.hpp | 2 +-
src/backend/opencl/transpose.cpp | 2 +
src/backend/opencl/transpose_inplace.cpp | 2 +
src/backend/opencl/triangle.cpp | 2 +
src/backend/opencl/types.cpp | 2 +
src/backend/opencl/types.hpp | 1 +
src/backend/opencl/unary.hpp | 3 +-
src/backend/opencl/unwrap.cpp | 58 +
src/backend/opencl/unwrap.hpp | 17 +
src/backend/opencl/where.cpp | 2 +
src/backend/opencl/wrap.cpp | 59 +
src/backend/opencl/wrap.hpp | 22 +
src/backend/util.cpp | 37 +
src/backend/util.hpp | 16 +
test/CMakeLists.txt | 268 +-
test/CMakeModules/FindOpenCL.cmake | 190 +
test/CMakeModules/build_gtest.cmake | 100 +
test/approx1.cpp | 47 +
test/approx2.cpp | 53 +
test/array.cpp | 91 +-
test/assign.cpp | 195 +-
test/backend.cpp | 92 +
test/basic_c.c | 4 +-
test/bilateral.cpp | 4 +-
test/binary.cpp | 50 +-
test/blas.cpp | 37 +-
test/cholesky_dense.cpp | 2 +
test/complex.cpp | 18 +-
test/constant.cpp | 2 +-
test/convolve.cpp | 3 +-
test/corrcoef.cpp | 94 +
test/covariance.cpp | 135 +
test/data | 2 +-
test/diagonal.cpp | 9 +
test/diff1.cpp | 2 +-
test/diff2.cpp | 2 +-
test/dog.cpp | 76 +
test/dot.cpp | 188 +
test/fast.cpp | 114 +-
test/fft.cpp | 107 +-
test/fft_real.cpp | 121 +
test/fftconvolve.cpp | 42 +-
test/getting_started.cpp | 9 +-
test/gfor.cpp | 281 +
test/hamming.cpp | 4 +-
test/harris.cpp | 215 +
test/histogram.cpp | 22 +-
test/homography.cpp | 281 +
test/imageio.cpp | 226 +-
test/index.cpp | 234 +-
test/info.cpp | 51 +-
test/inverse_dense.cpp | 2 +
test/iota.cpp | 2 +-
test/ireduce.cpp | 80 +-
test/join.cpp | 2 +-
test/lu_dense.cpp | 4 +
test/main.cpp | 6 +
test/manual_memory_test.cpp | 69 +
test/match_template.cpp | 2 +-
test/math.cpp | 23 +
test/mean.cpp | 206 +-
test/meanshift.cpp | 20 +-
test/medfilt.cpp | 4 +-
test/median.cpp | 27 +-
test/memory.cpp | 261 +-
test/memory_lock.cpp | 95 +
test/missing.cpp | 2 -
test/moddims.cpp | 2 +-
test/morph.cpp | 5 +-
test/nearest_neighbour.cpp | 230 +
test/ocl_ext_context.cpp | 131 +
test/orb.cpp | 15 +-
test/qr_dense.cpp | 3 +
test/random.cpp | 90 +-
test/range.cpp | 2 +-
test/rank_dense.cpp | 126 +
test/reduce.cpp | 284 +-
test/regions.cpp | 2 +-
test/reorder.cpp | 2 +-
test/replace.cpp | 175 +
test/resize.cpp | 5 +-
test/rotate.cpp | 3 +-
test/rotate_linear.cpp | 17 +-
test/sat.cpp | 45 +
test/scan.cpp | 20 +-
test/select.cpp | 178 +
test/set.cpp | 8 +
test/shift.cpp | 2 +-
test/sobel.cpp | 2 +-
test/solve_dense.cpp | 18 +-
test/sort.cpp | 2 +-
test/sort_by_key.cpp | 17 +-
test/sort_index.cpp | 16 +-
test/stdev.cpp | 207 +
test/susan.cpp | 183 +
test/svd_dense.cpp | 101 +
test/testHelpers.hpp | 71 +-
test/tile.cpp | 2 +-
test/transform.cpp | 268 +
test/transform_coordinates.cpp | 118 +
test/translate.cpp | 3 +-
test/transpose.cpp | 21 +-
test/transpose_inplace.cpp | 2 +-
test/triangle.cpp | 3 +-
test/unwrap.cpp | 177 +
test/var.cpp | 16 +-
test/where.cpp | 20 +-
test/wrap.cpp | 180 +
test/write.cpp | 2 +-
test/ycbcr_rgb.cpp | 84 +
1030 files changed, 60073 insertions(+), 19986 deletions(-)
delete mode 100644 ArrayFireConfig.cmake.in
delete mode 100644 ArrayFireConfigVersion.cmake.in
create mode 100644 CMakeModules/ArrayFireConfig.cmake.in
create mode 100644 CMakeModules/ArrayFireConfigVersion.cmake.in
create mode 100644 CMakeModules/CPackConfig.cmake
create mode 100644 CMakeModules/MinBuildTime.cmake
delete mode 100644 CMakeModules/build_gtest.cmake
create mode 100644 CMakeModules/examples.dox.in
create mode 100644 CMakeModules/version.hpp.in
delete mode 100644 CPack.cmake
create mode 100644 LICENSES/OpenSIFT License.txt
create mode 100644 debian/README.Debian
create mode 100644 debian/README.source
create mode 100644 debian/libarrayfire-dev.install
delete mode 100644 debian/libarrayfire-doc.links
create mode 100644 debian/libarrayfire-opencl-dev.install
create mode 100644 debian/libarrayfire-opencl3.install
create mode 100644 debian/libarrayfire-opencl3.lintian-overrides
create mode 100644 debian/libarrayfire-unified-dev.install
create mode 100644 debian/libarrayfire-unified3.install
create mode 100644 debian/libarrayfire-unified3.lintian-overrides
create mode 100644 debian/patches/Disable-git-usage-in-Doxygen-settings.patch
create mode 100644 debian/patches/Disable-privacy-breaching-settings.patch
create mode 100644 debian/patches/Use-system-gtest.patch
create mode 100644 debian/patches/Use-system-mathjax.patch
delete mode 100644 debian/patches/disable-privacy-breach-doc.patch
delete mode 100644 debian/patches/fix-doxygen-settings.patch
delete mode 100644 debian/source/lintian-overrides
create mode 100755 debian/tests/build-examples
create mode 100755 debian/tests/build-testsuite
create mode 100644 debian/tests/control
create mode 100644 debian/upstream/metadata
create mode 100644 docs/details/backend.dox
create mode 100644 docs/details/index.dox
create mode 100644 docs/details/util.dox
delete mode 100644 docs/highlight.pack.js
delete mode 100644 docs/highlight_js_doxygen.css
create mode 100644 docs/pages/forge_visualization.md
create mode 100644 docs/pages/interop_cuda.md
create mode 100644 docs/pages/interop_opencl.md
create mode 100644 docs/pages/unified_backend.md
create mode 100644 docs/pages/vectorization.md
create mode 100644 examples/computer_vision/matching.cpp
create mode 100644 examples/computer_vision/susan.cpp
create mode 100644 examples/financial/heston_model.cpp
create mode 100644 examples/graphics/gravity_sim.cpp
create mode 100644 examples/graphics/plot3.cpp
create mode 100644 examples/graphics/surface.cpp
create mode 100644 examples/lin_algebra/svd.cpp
create mode 100644 examples/pde/swe.cpp
create mode 100644 examples/unified/basic.cpp
create mode 100644 include/af/backend.h
create mode 100644 include/af/cuda.h
create mode 100644 include/af/macros.h
create mode 100644 src/api/c/dog.cpp
create mode 100644 src/api/c/fft_common.hpp
create mode 100644 src/api/c/harris.cpp
create mode 100644 src/api/c/homography.cpp
create mode 100644 src/api/c/imageio2.cpp
create mode 100644 src/api/c/imageio_helper.h
create mode 100644 src/api/c/memory.cpp
create mode 100644 src/api/c/nearest_neighbour.cpp
create mode 100644 src/api/c/plot3.cpp
create mode 100644 src/api/c/replace.cpp
create mode 100644 src/api/c/sat.cpp
create mode 100644 src/api/c/select.cpp
create mode 100644 src/api/c/sift.cpp
create mode 100644 src/api/c/stream.cpp
create mode 100644 src/api/c/surface.cpp
create mode 100644 src/api/c/susan.cpp
create mode 100644 src/api/c/svd.cpp
create mode 100644 src/api/c/transform_coordinates.cpp
create mode 100644 src/api/c/unwrap.cpp
create mode 100644 src/api/c/util.cpp
create mode 100644 src/api/c/version.cpp
create mode 100644 src/api/c/wrap.cpp
create mode 100644 src/api/c/ycbcr_rgb.cpp
create mode 100644 src/api/cpp/dog.cpp
create mode 100644 src/api/cpp/harris.cpp
create mode 100644 src/api/cpp/homography.cpp
create mode 100644 src/api/cpp/nearest_neighbour.cpp
create mode 100644 src/api/cpp/sat.cpp
create mode 100644 src/api/cpp/sift.cpp
create mode 100644 src/api/cpp/susan.cpp
create mode 100644 src/api/cpp/transform_coordinates.cpp
create mode 100644 src/api/cpp/unwrap.cpp
create mode 100644 src/api/cpp/wrap.cpp
create mode 100644 src/api/cpp/ycbcr_rgb.cpp
create mode 100644 src/api/unified/CMakeLists.txt
create mode 100644 src/api/unified/algorithm.cpp
create mode 100644 src/api/unified/arith.cpp
create mode 100644 src/api/unified/array.cpp
create mode 100644 src/api/unified/blas.cpp
create mode 100644 src/api/unified/data.cpp
create mode 100644 src/api/unified/device.cpp
create mode 100644 src/api/unified/features.cpp
create mode 100644 src/api/unified/graphics.cpp
create mode 100644 src/api/unified/image.cpp
create mode 100644 src/api/unified/index.cpp
create mode 100644 src/api/unified/lapack.cpp
create mode 100644 src/api/unified/signal.cpp
create mode 100644 src/api/unified/statistics.cpp
create mode 100644 src/api/unified/symbol_manager.cpp
create mode 100644 src/api/unified/symbol_manager.hpp
create mode 100644 src/api/unified/util.cpp
create mode 100644 src/api/unified/vision.cpp
create mode 100644 src/backend/MemoryManager.cpp
create mode 100644 src/backend/MemoryManager.hpp
delete mode 100644 src/backend/cpu/hamming.cpp
delete mode 100644 src/backend/cpu/hamming.hpp
create mode 100644 src/backend/cpu/harris.cpp
create mode 100644 src/backend/cpu/harris.hpp
create mode 100644 src/backend/cpu/homography.cpp
create mode 100644 src/backend/cpu/homography.hpp
create mode 100644 src/backend/cpu/kernel/Array.hpp
create mode 100644 src/backend/cpu/kernel/approx1.hpp
create mode 100644 src/backend/cpu/kernel/approx2.hpp
create mode 100644 src/backend/cpu/kernel/assign.hpp
create mode 100644 src/backend/cpu/kernel/bilateral.hpp
create mode 100644 src/backend/cpu/kernel/convolve.hpp
create mode 100644 src/backend/cpu/kernel/copy.hpp
create mode 100644 src/backend/cpu/kernel/diagonal.hpp
create mode 100644 src/backend/cpu/kernel/diff.hpp
create mode 100644 src/backend/cpu/kernel/dot.hpp
create mode 100644 src/backend/cpu/kernel/fast.hpp
create mode 100644 src/backend/cpu/kernel/fft.hpp
create mode 100644 src/backend/cpu/kernel/fftconvolve.hpp
create mode 100644 src/backend/cpu/kernel/gradient.hpp
create mode 100644 src/backend/cpu/kernel/harris.hpp
create mode 100644 src/backend/cpu/kernel/histogram.hpp
create mode 100644 src/backend/cpu/kernel/hsv_rgb.hpp
create mode 100644 src/backend/cpu/kernel/identity.hpp
create mode 100644 src/backend/cpu/kernel/iir.hpp
create mode 100644 src/backend/cpu/kernel/index.hpp
create mode 100644 src/backend/cpu/kernel/iota.hpp
create mode 100644 src/backend/cpu/kernel/ireduce.hpp
create mode 100644 src/backend/cpu/kernel/join.hpp
create mode 100644 src/backend/cpu/kernel/lookup.hpp
create mode 100644 src/backend/cpu/kernel/lu.hpp
create mode 100644 src/backend/cpu/kernel/match_template.hpp
create mode 100644 src/backend/cpu/kernel/meanshift.hpp
create mode 100644 src/backend/cpu/kernel/medfilt.hpp
create mode 100644 src/backend/cpu/kernel/morph.hpp
create mode 100644 src/backend/cpu/kernel/nearest_neighbour.hpp
create mode 100644 src/backend/cpu/kernel/orb.hpp
create mode 100644 src/backend/cpu/kernel/random.hpp
create mode 100644 src/backend/cpu/kernel/range.hpp
create mode 100644 src/backend/cpu/kernel/reduce.hpp
create mode 100644 src/backend/cpu/kernel/regions.hpp
create mode 100644 src/backend/cpu/kernel/reorder.hpp
create mode 100644 src/backend/cpu/kernel/resize.hpp
create mode 100644 src/backend/cpu/kernel/rotate.hpp
create mode 100644 src/backend/cpu/kernel/scan.hpp
create mode 100644 src/backend/cpu/kernel/select.hpp
create mode 100644 src/backend/cpu/kernel/shift.hpp
create mode 100644 src/backend/cpu/kernel/sobel.hpp
create mode 100644 src/backend/cpu/kernel/sort.hpp
create mode 100644 src/backend/cpu/kernel/sort_by_key.hpp
create mode 100644 src/backend/cpu/kernel/sort_index.hpp
create mode 100644 src/backend/cpu/kernel/susan.hpp
create mode 100644 src/backend/cpu/kernel/tile.hpp
create mode 100644 src/backend/cpu/kernel/transform.hpp
create mode 100644 src/backend/cpu/kernel/transpose.hpp
create mode 100644 src/backend/cpu/kernel/triangle.hpp
create mode 100644 src/backend/cpu/kernel/unwrap.hpp
create mode 100644 src/backend/cpu/kernel/wrap.hpp
create mode 100644 src/backend/cpu/nearest_neighbour.cpp
create mode 100644 src/backend/cpu/nearest_neighbour.hpp
create mode 100644 src/backend/cpu/plot3.cpp
create mode 100644 src/backend/cpu/plot3.hpp
create mode 100644 src/backend/cpu/queue.hpp
create mode 100644 src/backend/cpu/select.cpp
create mode 100644 src/backend/cpu/select.hpp
create mode 100644 src/backend/cpu/sift.cpp
create mode 100644 src/backend/cpu/sift.hpp
create mode 100644 src/backend/cpu/surface.cpp
create mode 100644 src/backend/cpu/surface.hpp
create mode 100644 src/backend/cpu/susan.cpp
create mode 100644 src/backend/cpu/susan.hpp
create mode 100644 src/backend/cpu/svd.cpp
create mode 100644 src/backend/cpu/svd.hpp
create mode 160000 src/backend/cpu/threads
create mode 100644 src/backend/cpu/unwrap.cpp
create mode 100644 src/backend/cpu/unwrap.hpp
create mode 100644 src/backend/cpu/utility.hpp
create mode 100644 src/backend/cpu/wrap.cpp
create mode 100644 src/backend/cpu/wrap.hpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_cholesky.cpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_cholesky.hpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_inverse.cpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_inverse.hpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_lu.cpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_lu.hpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_qr.cpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_qr.hpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_solve.cpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_solve.hpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_svd.cpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_svd.hpp
create mode 100644 src/backend/cuda/cpu_lapack/cpu_triangle.hpp
create mode 100644 src/backend/cuda/cpu_lapack/lapack_helper.hpp
delete mode 100644 src/backend/cuda/hamming.cu
delete mode 100644 src/backend/cuda/hamming.hpp
create mode 100644 src/backend/cuda/harris.cu
create mode 100644 src/backend/cuda/harris.hpp
create mode 100644 src/backend/cuda/homography.cu
create mode 100644 src/backend/cuda/homography.hpp
create mode 100644 src/backend/cuda/kernel/atomics.hpp
delete mode 100644 src/backend/cuda/kernel/hamming.hpp
create mode 100644 src/backend/cuda/kernel/harris.hpp
create mode 100644 src/backend/cuda/kernel/homography.hpp
create mode 100644 src/backend/cuda/kernel/nearest_neighbour.hpp
create mode 100644 src/backend/cuda/kernel/select.hpp
create mode 100644 src/backend/cuda/kernel/susan.hpp
create mode 100644 src/backend/cuda/kernel/unwrap.hpp
create mode 100644 src/backend/cuda/kernel/wrap.hpp
create mode 100644 src/backend/cuda/nearest_neighbour.cu
create mode 100644 src/backend/cuda/nearest_neighbour.hpp
create mode 100644 src/backend/cuda/plot3.cu
create mode 100644 src/backend/cuda/plot3.hpp
create mode 100644 src/backend/cuda/select.cu
create mode 100644 src/backend/cuda/select.hpp
create mode 100644 src/backend/cuda/sift.cu
create mode 100644 src/backend/cuda/sift.hpp
create mode 100644 src/backend/cuda/sort_by_key/ascd_s16.cu
create mode 100644 src/backend/cuda/sort_by_key/ascd_s64.cu
create mode 100644 src/backend/cuda/sort_by_key/ascd_u16.cu
create mode 100644 src/backend/cuda/sort_by_key/ascd_u64.cu
create mode 100644 src/backend/cuda/sort_by_key/desc_s16.cu
create mode 100644 src/backend/cuda/sort_by_key/desc_s64.cu
create mode 100644 src/backend/cuda/sort_by_key/desc_u16.cu
create mode 100644 src/backend/cuda/sort_by_key/desc_u64.cu
create mode 100644 src/backend/cuda/surface.cu
create mode 100644 src/backend/cuda/surface.hpp
create mode 100644 src/backend/cuda/susan.cu
create mode 100644 src/backend/cuda/susan.hpp
create mode 100644 src/backend/cuda/svd.cu
create mode 100644 src/backend/cuda/svd.hpp
create mode 100644 src/backend/cuda/unwrap.cu
create mode 100644 src/backend/cuda/unwrap.hpp
create mode 100644 src/backend/cuda/wrap.cu
create mode 100644 src/backend/cuda/wrap.hpp
rename src/{api/c => backend}/dispatch.cpp (100%)
rename src/{api/c => backend}/dispatch.hpp (100%)
create mode 100644 src/backend/host_memory.cpp
create mode 100644 src/backend/host_memory.hpp
create mode 100644 src/backend/opencl/cache.hpp
create mode 100644 src/backend/opencl/cpu/cpu_blas.cpp
create mode 100644 src/backend/opencl/cpu/cpu_blas.hpp
create mode 100644 src/backend/opencl/cpu/cpu_cholesky.cpp
create mode 100644 src/backend/opencl/cpu/cpu_cholesky.hpp
create mode 100644 src/backend/opencl/cpu/cpu_helper.hpp
create mode 100644 src/backend/opencl/cpu/cpu_inverse.cpp
create mode 100644 src/backend/opencl/cpu/cpu_inverse.hpp
create mode 100644 src/backend/opencl/cpu/cpu_lu.cpp
create mode 100644 src/backend/opencl/cpu/cpu_lu.hpp
create mode 100644 src/backend/opencl/cpu/cpu_qr.cpp
create mode 100644 src/backend/opencl/cpu/cpu_qr.hpp
create mode 100644 src/backend/opencl/cpu/cpu_solve.cpp
create mode 100644 src/backend/opencl/cpu/cpu_solve.hpp
create mode 100644 src/backend/opencl/cpu/cpu_svd.cpp
create mode 100644 src/backend/opencl/cpu/cpu_svd.hpp
create mode 100644 src/backend/opencl/cpu/cpu_triangle.hpp
delete mode 100644 src/backend/opencl/hamming.cpp
delete mode 100644 src/backend/opencl/hamming.hpp
create mode 100644 src/backend/opencl/harris.cpp
create mode 100644 src/backend/opencl/harris.hpp
create mode 100644 src/backend/opencl/homography.cpp
create mode 100644 src/backend/opencl/homography.hpp
create mode 100644 src/backend/opencl/kernel/convolve/conv2_s16.cpp
create mode 100644 src/backend/opencl/kernel/convolve/conv2_s64.cpp
create mode 100644 src/backend/opencl/kernel/convolve/conv2_u16.cpp
create mode 100644 src/backend/opencl/kernel/convolve/conv2_u64.cpp
create mode 100644 src/backend/opencl/kernel/convolve_separable.cpp
delete mode 100644 src/backend/opencl/kernel/hamming.cl
delete mode 100644 src/backend/opencl/kernel/hamming.hpp
create mode 100644 src/backend/opencl/kernel/harris.cl
create mode 100644 src/backend/opencl/kernel/harris.hpp
create mode 100644 src/backend/opencl/kernel/homography.cl
create mode 100644 src/backend/opencl/kernel/homography.hpp
create mode 100644 src/backend/opencl/kernel/nearest_neighbour.cl
create mode 100644 src/backend/opencl/kernel/nearest_neighbour.hpp
create mode 100644 src/backend/opencl/kernel/select.cl
create mode 100644 src/backend/opencl/kernel/select.hpp
delete mode 100644 src/backend/opencl/kernel/set.cl
delete mode 100644 src/backend/opencl/kernel/set.hpp
create mode 100644 src/backend/opencl/kernel/susan.cl
create mode 100644 src/backend/opencl/kernel/susan.hpp
create mode 100644 src/backend/opencl/kernel/unwrap.cl
create mode 100644 src/backend/opencl/kernel/unwrap.hpp
create mode 100644 src/backend/opencl/kernel/wrap.cl
create mode 100644 src/backend/opencl/kernel/wrap.hpp
create mode 100644 src/backend/opencl/magma/gebrd.cpp
create mode 100644 src/backend/opencl/magma/labrd.cpp
create mode 100644 src/backend/opencl/magma/magma_cpu_blas.h
create mode 100644 src/backend/opencl/nearest_neighbour.cpp
create mode 100644 src/backend/opencl/nearest_neighbour.hpp
create mode 100644 src/backend/opencl/plot3.cpp
create mode 100644 src/backend/opencl/plot3.hpp
create mode 100644 src/backend/opencl/select.cpp
create mode 100644 src/backend/opencl/select.hpp
create mode 100644 src/backend/opencl/sift.cpp
create mode 100644 src/backend/opencl/sift.hpp
delete mode 100644 src/backend/opencl/sort_by_key.cpp
create mode 100644 src/backend/opencl/sort_by_key/b8.cpp
create mode 100644 src/backend/opencl/sort_by_key/f32.cpp
create mode 100644 src/backend/opencl/sort_by_key/f64.cpp
create mode 100644 src/backend/opencl/sort_by_key/impl.hpp
create mode 100644 src/backend/opencl/sort_by_key/s16.cpp
create mode 100644 src/backend/opencl/sort_by_key/s32.cpp
create mode 100644 src/backend/opencl/sort_by_key/s64.cpp
create mode 100644 src/backend/opencl/sort_by_key/u16.cpp
create mode 100644 src/backend/opencl/sort_by_key/u32.cpp
create mode 100644 src/backend/opencl/sort_by_key/u64.cpp
create mode 100644 src/backend/opencl/sort_by_key/u8.cpp
create mode 100644 src/backend/opencl/surface.cpp
create mode 100644 src/backend/opencl/surface.hpp
create mode 100644 src/backend/opencl/susan.cpp
create mode 100644 src/backend/opencl/susan.hpp
create mode 100644 src/backend/opencl/svd.cpp
create mode 100644 src/backend/opencl/svd.hpp
create mode 100644 src/backend/opencl/unwrap.cpp
create mode 100644 src/backend/opencl/unwrap.hpp
create mode 100644 src/backend/opencl/wrap.cpp
create mode 100644 src/backend/opencl/wrap.hpp
create mode 100644 src/backend/util.cpp
create mode 100644 src/backend/util.hpp
create mode 100644 test/CMakeModules/FindOpenCL.cmake
create mode 100644 test/CMakeModules/build_gtest.cmake
create mode 100644 test/backend.cpp
create mode 100644 test/corrcoef.cpp
create mode 100644 test/covariance.cpp
create mode 100644 test/dog.cpp
create mode 100644 test/dot.cpp
create mode 100644 test/fft_real.cpp
create mode 100644 test/harris.cpp
create mode 100644 test/homography.cpp
create mode 100644 test/main.cpp
create mode 100644 test/manual_memory_test.cpp
create mode 100644 test/memory_lock.cpp
create mode 100644 test/nearest_neighbour.cpp
create mode 100644 test/ocl_ext_context.cpp
create mode 100644 test/rank_dense.cpp
create mode 100644 test/replace.cpp
create mode 100644 test/sat.cpp
create mode 100644 test/select.cpp
create mode 100644 test/stdev.cpp
create mode 100644 test/susan.cpp
create mode 100644 test/svd_dense.cpp
create mode 100644 test/transform.cpp
create mode 100644 test/transform_coordinates.cpp
create mode 100644 test/unwrap.cpp
create mode 100644 test/wrap.cpp
create mode 100644 test/ycbcr_rgb.cpp
--
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/debian-science/packages/arrayfire.git