[arrayfire] branch master updated (7527e5b -> 92e08e9)
Ghislain Vaillant
ghisvail-guest at moszumanska.debian.org
Mon Sep 21 19:20:35 UTC 2015
This is an automated email from the git hooks/post-receive script.
ghisvail-guest pushed a change to branch master
in repository arrayfire.
omits 7527e5b Merge remote-tracking branch 'upstream/master'
omits d86f4f5 Merge remote-tracking branch 'upstream/master'
adds 36708b0 Merge pull request #815 from arrayfire/hotfixes-3.0.1
adds 3053edd Implement CUDA complex dot product
adds ab35ad1 Add complex dot product for CPU backend
adds 0372645 Add complex dot product for OpenCL backend
adds 0bd1110 Add forgotten instatiations
adds 075f48d Merge pull request #817 from FilipeMaia/complex_dot_product
adds f00a396 FEAT: Difference of Gaussians
adds b037a8c Merge pull request #835 from 9prady9/dog
adds b0e4992 FEAT Add CPU backend for unwrap function
adds 73d9038 Added CUDA backend for Unwrap
adds 669b4b2 Added OpenCL backend for Unwrap
adds 4badf65 Bugfixes, comments
adds 9b13e8a Adding padding for strides > 1
adds 9281f4e Added test for unwrap, updated data submodule
adds ec5c0d1 Added documentation for unwrap
adds 2816c01 Changing behavior of unwrap using padding
adds b3d5bba Changed unwrap tests to new behavior
adds dafe47a Updated documentation for unwrap
adds dc18c34 Unwrap: Remove duplicate asserts, add intl, uintl to test
adds ce0d03f Merge branch 'unwrap' of shehzan10/arrayfire into devel
adds 967ffb0 Make dim4 a POD object
adds 4c9962b Make ArrayInfo a POD object
adds 8204912 Add static checks for POD for dim4 and ArrayInfo
adds 91f7431 Make Array<T> a standard layout type
adds f037380 Ensure Array<T> is standard layout using static_asserts
adds 409b295 Revert "Make dim4 a POD object"
adds 67d0f60 Reduce size of Array<T> by rearranging mem vars
adds 0492397 Merge pull request #842 from umar456/pod
adds ee72370 Add static checks to make sure ArrayInfo is the first mem var
adds 19b7067 Merge pull request #843 from umar456/offset_check
adds 152fccf FEAT: Implementing array::lock() and array::unlock()
adds 691f0d1 FEAT / TEST: Adding af::copy()
adds 94909e1 Merge pull request #848 from pavanky/new_additions
adds 6bd71f0 DOCS: Fixing documentation for lock and unlock
adds 2175ccb Added AF_INTERP_LOWER and implementation for resize
adds d6bf32e Merge pull request #852 from shehzan10/resize_lower
adds 470cf99 FEAT: intl/uintl for random
adds 7883d32 FEAT: intl/uintl support for all reduce functions
adds 16b4c0a TEST: Refactor reduce tests. Test intl/uintl
adds cf58d66 BUILD: Fix redefinition warning in blas.
adds 29d0df7 Revert "BUILD: Fix redefinition warning in blas."
adds 6f3e9f8 Merge pull request #855 from umar456/long_long
adds e2ee96e FEAT Added nearest neighbour with SSD, SAD and SHD
adds 9f46f19 Added tests for nearest neighbour
adds f3d4660 Added documentation for nearest neighbour
adds 7f9b423 Merge remote-tracking branch 'upstream/devel' into nearestNeighbour
adds f999b05 Fix double compilation
adds 90a9ffb Remove redefition warning for blas
adds b626967 Added options for dotc and dotu to dot function
adds 354a40b Bug fixes for nearest neighbour and hamming
adds 8540fa0 Added tests for dot
adds 87ca992 Merge pull request #853 from shehzan10/nearestNeighbour
adds fd8d75d Merge remote-tracking branch 'upstream/master' into devel
adds 74c2b05 Compilation and warning fixes
adds 2ae60bb Merge branch 'master' into devel
adds 8aae20b BUFIX: Remove memory leak in af::copy()
adds 1161ff5 Updating forge tag to fix build issuess with ninja
adds 8ac5cb9 Merge pull request #863 from pavanky/bugfix
adds 0672f56 FEAT: Adding function to get use_count of shared pointers
adds 51b90c7 PERF: Do not make copies if the number of references is only 1
adds 44d857d Merge pull request #870 from pavanky/refs
adds a815801 Don't link the installed version of forge on Linux/OSX
adds 7b657e4 Merge pull request #866 from JuliaComputing/kf/forgerpath
adds 91cea3b Add support for BLAS symbol renaming
adds 319753f Merge pull request #867 from JuliaComputing/kf/blasrename
adds 28d9e71 Add lower interpolation to rotate and transform
adds 99a9234 Allow users to set precision when using print
adds c62a250 Changed af_print macro, added documentation
adds e12dceb Merge pull request #873 from shehzan10/more-lower
adds 4dc9836 Allow building against 64bit index OpenBlas
adds c9fbfc5 Also use the void* interface for MKL on windows
adds 1b315f9 STYLE: Remove macros; Simplify templates;
adds b774d57 Merge pull request #1 from umar456/JuliaComputing-kf/openblas64
adds 895d6b2 Merge pull request #865 from JuliaComputing/kf/openblas64
adds 8f37d8c Fix print macro
adds 998a0b8 Change build labels to be for devel branch
adds 9d09831 Added noDoubleTest for nearest neighbour and dot test
adds 3ebc19e BUGFIX: Ignoring NaN values in min and max for all backends
adds 3b2ce09 FEAT: Added support to substitute nan values for sum and product
adds 9036d73 Adding missing instantiations for compat functions
adds 248de68 TEST: Adding tests for reductions when using NaNs
adds 93a581d Merge pull request #875 from pavanky/reduce-nan
adds 7c41de9 Added C API for Harris corner detector
adds 4049083 Added C++ API for Harris corner detector
adds ea3c150 Added CPU backend for Harris corner detector
adds e9e8474 Added CUDA backend for Harris corner detector
adds d183732 Added OpenCL backend for Harris corner detector
adds cef733b Added Harris corner detector unit tests
adds cfb97ca Added documentation and function definitions for Harris
adds 2b6712b Fixed FAST unit tests
adds 311b2c5 Updated data
adds 7d1da23 Removed unused variables from OpenCL's sort_index
adds 0cf4d80 Merge remote-tracking branch 'upstream/devel' into harris
adds a257ef4 Merge pull request #874 from pentschev/harris
adds bd21a82 BUGFIX: Fixing casting to and from complex numbers in CPU backend
adds 56a000d Adding more operator overloading for af::cfloat and af::cdouble
adds bb7bb77 BUGFIX: Making sure c32/c64 imitate f32/f64 when operating with scalars
adds 2ee28e7 TEST: Add mixed type tests with complex inputs
adds 5ef10b3 Merge pull request #877 from pavanky/cplx_fixes
adds f5ac99e Fixed bug affecting Harris on AMD GPUs
adds d656a20 Renaming enums for convolve batch modes
adds d08400d PERF: Improvements for non linear JIT kernels in OpenCL backend
adds 9ee690c TEST: Adding batched mode tests
adds f8e9398 PERF: Speeding up JIT for 3D arrays in OpenCL backends
adds 5b49f07 BUGFIX: median of all elements is now fixed
adds 05de8bf PERF: Improvements to tile when tiling along singleton dimensions
adds ecdab67 PERF: Improvements to CUDA JIT for non linear 3D and 4D arrays
adds f8f072c FEAT: Adding support for non overlapping batched convolution
adds 2ca72cb TEST: Adding tests for non overlapping convolves
adds 73dbeb9 DOCS: Updating the documentation for convolution
adds c4b6ace Merge pull request #889 from pavanky/features-3.1
adds a8c6ae9 Adding missing license for a few files
adds 218043d Updating version to 3.1.0
adds 3a163f5 Changes in examples to reflect asset modifications
adds 210978f Merge pull request #891 from 9prady9/assets_changes
adds 4ff4de3 turned off clFFT examples in external project build
adds eb9367a fix forge dependency target errors in cpu, opencl backends
adds 79fbe5d Merge pull request #892 from 9prady9/cmake_fixes
adds fb17b53 Added AF_API_VERSION
adds 724450f Merge pull request #893 from shehzan10/devel
adds 30cbbc7 Fix double free issue.
adds 9cbc641 Add test for index copy assignment.
adds 4d23d41 Add move constructor and move assignment op
adds 4cc3a9b Merge pull request #895 from FilipeMaia/devel
adds a5efdea BUGFIX: Check for NULL values when allocating memory on CPU backend
adds 981cb4a Ensure CUDA and OpenCL return proper errors when out of memory
adds 171d8dd TEST: Adding test to trash the memory manager and see if it recovers
adds d22c0bb FEAT,TEST: Adding sigmoid function for all backends
adds bc2b8f5 Adding the option to remove tests from ctest
adds 1a2b405 Merge pull request #897 from pavanky/minor_changes
adds 8253205 BUGFIX/TEST: Fixing bug in rank. Added appropriate tests
adds e7d75be BUGFIX/TEST: Fixing not for C API. Added relevant tests.
adds 1bde871 BUGFIX: Fixing a bug in randn for CPU backend
adds 05509c9 BUGFIX: Fixing setSeed for randu
adds b8cfd77 TEST: Updating and fixing the randu/randn tests
adds bd39083 TEST: Updating random tests to properly reset seeds
adds cd78911 TEST: Fixing out of bounds access in fft tests
adds 3fb199d BUGFIX in randn for apple systems
adds 214acf1 Renaming rank test to rank_dense
adds ed3a830 Merge pull request #906 from pavanky/bugfixes
adds c69ca0a Added 64-bit integer type support for functions
adds 426dc0b Added print errors to documentation
adds a86171a Merge pull request #910 from shehzan10/devel
adds 89d7e65 Automatically enable and build CUDA and OpenCL backends by default.
adds 5ad9ada Suppress FIND output for non-essential libraries.
adds 88a2c26 Merge pull request #909 from bkloppenborg/autobuild-backends
adds 5df0554 BUGFIX: Fixed issues with mixed real and complex types
adds 9564317 Fixing the checks for skew
adds fb24097 BUGFIX: conjg no longer errors out for real inputs
adds d4cd847 af_scale now checks for default parameters properly
adds 3b19720 Merge pull request #915 from pavanky/fixes
adds f3df808 matchTemplate example
adds aca80f8 matchTemplate fix in opencl to support indexed template images
adds d322941 Removed uncessary copy in matchTemplate example
adds 314bc9e Added heat colormap to display disparity values for matching
adds 51d441c Merge pull request #922 from 9prady9/match_template_example
adds 4c34bf2 Made cpu::Array constructor consistent with CUDA & OpenCL
adds 3499b98 Fixed filenames for the files used in computer vision examples
adds de9ba74 SUSAN Corner Detector
adds 1a8892b Added error checks non-array parameters of susan API
adds 2c3e0d8 CUDA backend for SUSAN dectector
adds e57ed91 OpenCL backend for SUSAN dectector
adds 220f027 fixes: typos; specific to windows; additional unit test
adds 307040b Corrected memory allocation bug in cpu backend for orb, where
adds 2f5c672 Removed uncessary corner sorting for SUSAN
adds e9535b9 Changed CUDA/OpenCL kernels to use zero leading dimension
adds 57ffa26 typo fix in cuda SUSAN kernel
adds 26df8b3 SUSAN CUDA/OpenCL: Added bound checks to load shared/local Memory
adds a9fd10c Replaced static shared memory with dynamic in SUSAN CUDA kernel
adds 3d405b9 Merge pull request #921 from 9prady9/susan
adds 61ea091 FEAT: Summed Area Tables (sat, af_sat) a.k.a integral images
adds 8f250e7 Added check for skipping double type test when not supported
adds d8760f6 Added uintl, intl support for jit operations in cuda backend
adds 27f3385 Merge pull request #931 from 9prady9/sat
adds 3ede60b Changed default cuda stream to be non-zero
adds 1363441 Wrapped cuda kernel launches with CUDA_LAUNCH macro
adds d46b15b Added stream parameter for upstream{thrust, cufft, cublas} calls
adds 60c2be9 thrust fixes for cuda stream selection on cuda < 7.0
adds f19ecf8 Merge pull request #929 from 9prady9/cuda_default_stream
adds 16d8e27 Extended support for interleaved convolution
adds 8aaf299 Updating the COPYRIGHT.md document
adds ddd704b Updating the language in COPYRIGHT document
adds 3cedee7 Added new example: heston_model in financial
adds 4358056 Moving af_array info gathering functions from util.h to array.h
adds 75c81f1 Merge pull request #928 from pavanky/minor_changes
adds 499dc09 shared/local memory loading fix
adds 8086d34 Merge pull request #933 from 9prady9/ker_fixes
adds 60c4979 namespace fix for POST_LAUNCH_CHECK macro: cuda backend
adds 6cf4a5a Replaced cuda Memcopy/Memset with async versions
adds e2994d9 Merge pull request #935 from 9prady9/cudaMem_stream
adds f816f41 Making fft_inplace consistent across all backends
adds 61968d6 FEAT / TEST: Adding support for inplace fft
adds 822fd5a Moving general fft implementation to src/api/c
adds b92e629 Removing consts from the fftInplace API
adds 155293f FEAT/TEST: Adding R2C and C2R FFT transforms for all backends
adds 91f6e35 DOCS: Adding documentation for real to complex transforms
adds 7900fca Fixing a minor issue in ArrayFireConfigVersion.cmake file
adds d65d048 Fixing issue in documentation
adds 0efb69a Cleaning up multiply_inplace in cpu backend
adds 0a1adee Enabling memory manager back in cuda backend
adds d2e182f Merge pull request #936 from pavanky/fft
adds 3b4d983 typo fix in opencl morph kernel
adds dc38b24 graphics window set size functions
adds 6efd999 style fixes in graphics examples
adds b47dc5b Merge pull request #937 from 9prady9/wind_resize
adds 910feb7 FEAT: Adding select for CPU backend
adds 97c916a FEAT: Select added for CUDA backend
adds 773c02d FEAT: Select for opencl backend
adds 981c5e6 FEAT: replace for all backends
adds 0472884 TEST: Adding tests for select and replace
adds 5df1b8c Merge pull request #941 from pavanky/new_funcs_31
adds b259259 YCbCr <-> RGB conversion functions
adds 4c876fc unit tests for YCbCr <-> RGB conversion
adds ce88eec Documentation for ycbcr_rgb conversion functions
adds 587aa2d Modified colorspace function to handle new colorspace - YCbCr
adds b55a459 Modifed colorspace function wrapper code for efficiency
adds 384cbc7 Replaced padArray calls with Jit operations in ycbcr conversions
adds f4cc97f Updated forge tag
adds 0da5291 Merge pull request #942 from 9prady9/ycbcr
adds 1e60132 BUGFIX: Fixing offset issue with CPU element wise operations
adds 26a2e40 PERF: improvements to element wise operations in CPU backend
adds 9e20a23 Merge pull request #945 from pavanky/jit_fixes
adds 77c31e6 FEAT: adding complex support for exp
adds 345c579 Binary operations with floating point scalars default to single precision
adds cbdd5be Merge pull request #944 from pavanky/minor
adds 3e3be90 Fix type in linux doc
adds 62b0360 Fix signed-usigned comp warnings
adds 07ffda7 Fixes to print functions
adds 50699cc FEAT Added saveArray and readArray functions for file read/write
adds 2a9e855 Removed af_print_array_c. af::print now calls af_print_array_p
adds 57449b0 saveArray returns index of array
adds e59c953 Rename af_print_array_p to af_print_array_gen
adds ac92dda FEAT added to string function
adds dc6b11c Merge pull request #943 from shehzan10/stream
adds 30dd47a modified cpu::getInfo to display CPU information
adds 4cd73be Fixes for cpu backend getInfo on Tegra platform
adds 2fa808b Fixed missing header error for windows platform
adds 791dfcc Merge pull request #952 from 9prady9/cpuinfo
adds 817ccc3 FEAT Added image IO using memory functions and tests
adds c43af47 Changed saveImageMem API. Added image format enum
adds 05c3d62 Add typedef af_image_format af::imageFormat
adds 0681ddd Merge pull request #954 from shehzan10/imageio_mem
adds 78b9b99 Style changes to code in unwrap
adds 2d60aed FEAT: Adding support to unwrap along rows as well as columns
adds a958628 BUGFIX: Fixed a bug for unwrap in all backends
adds 7df4dcd FEAT/TEST/DOC: Adding wrap for CPU backend
adds 1909fa5 FEAT: wrap for CUDA backend
adds 1fab923 Adding atomics.hpp file for CUDA that can be used in the future
adds 365afb4 Moving the kernel cache map to a centralized location
adds 2e05dd4 FEAT: wrap for OpenCL backend
adds a8113b0 Removing faulty test
adds aca30fa Cleaning up unwrap code in OpenCL by using cache store
adds 08b6ecf Fixing the compile error on windows
adds f89e37b Merge pull request #955 from pavanky/wrap
adds cd097af Fix conjugate transpose for vectors
adds 5867404 Merge pull request #958 from shehzan10/ctrans_fix
adds 1580652 fix #957: remove unused dtype trait for size_t
adds a8d8837 Merge pull request #959 from ghisvail/bugfix/remove-unused-dtype-traits
adds 7f06c0d update cl.hpp to upstream version 1.2.7
adds 30f9e3c fix instantiation of Platform objects
adds 162e258 fix instantiation of cl::Platform objects
adds 7b759ce adding GL headers in platform.cpp
adds af598d1 Moved GL headers in platform.cpp inside WITH_GRAPHICS block
adds 9dbf2df Merge pull request #961 from 9prady9/cl_hpp_fixes
adds 3121ca1 Create instances for const index member functions
adds cc0f401 BUGFIX: in array_proxy::get() const
adds 1ba33a6 Merge pull request #953 from umar456/fix_951
adds b6d8e2d Updated boost compute version tags
adds 0ce0df2 Merge pull request #962 from shehzan10/update-deps
adds 028960f Added SIFT prototypes and parameter documentation
adds 60546a1 Added C API for SIFT
adds efe1331 Added C++ API for SIFT
adds 3ef9fdf Added CUDA backend for SIFT
adds b87ba86 Added OpenCL backend for SIFT
adds 25344cb Added CPU backend for SIFT
adds 48e909a Added AF_ERR_NONFREE to defines.h
adds 1ee6b70 Added BUILD_NONFREE option to CMake
adds b177218 Added SIFT Copyright information
adds b69215c Added OpenSIFT License
adds afe1e46 Added SIFT documentation
adds d1d122e Updated test data
adds ca57067 Added SIFT unit tests
adds 43e5fe1 Made SIFT image indexing more readable in CPU backend
adds a6ae78b SIFT fix for CUDA on Windows, made it more readable
adds 9d09e4d Made SIFT image indexing more readable in OpenCL backend
adds cce0e73 Templated SIFT gaussianElimination() in CPU and CUDA backends
adds a5e8a5e Added missing CUDA_LAUNCH and THRUST_SELECT to SIFT
adds bda81b0 Improved CUDA SIFT coalescing and performance
adds dd97ce5 Improved OpenCL SIFT coalescing and performance
adds a9a8140 Passing shared size memory dynamically to CUDA SIFT
adds 3ada93a Moved OpenCL's conv2Helper to kernel directory
adds b5cf8d5 Improved SIFT OpenCL code
adds 079f4df Using pre-defined constants for workgroup sizes in CUDA SIFT
adds c17da09 Using 3D arrays for Gaussian/DoG pyramids in CUDA SIFT
adds 9bd1693 Using cudaMemsetAsync for SIFT
adds 966a130 Fixed OpenCL SIFT bug causing segmentation faults on Intel
adds 45a4966 Added missing buffer freeing call to OpenCL SIFT
adds 298c4d8 Fixed CUDA SIFT on unused memory buffer
adds b563615 Changed SIFT unit test to use std::stable_sort()
adds faefa30 Moved syncthreads/barriers out of thread conditionals
adds ee4d0bd Moving CUDA SIFT syncthreads calls out of thread conditionals
adds c88929e Merge pull request #948 from pentschev/sift
adds b372e49 Reorganizing non free build process.
adds b9fac73 Changing build flags to build non free algorithms
adds 5e0ceac Templated options are now runtime compile options for opencl reductions
adds 8bf6589 Templated options are now runtime compile options for opencl convolutions
adds bf69efc Templated options are now runtime compile options for opencl indexed min/max
adds 5c0da49 Templated options are now runtime compile options for opencl scan
adds 39f9977 Templated options are now runtime compile options for opencl nearest neighbor
adds d03c3c0 Removing unnecessary switch case from opencl ireduce
adds 2c00e64 Templated options are now runtime compile options for opencl FAST
adds 72f7ff5 Splitting up opencl sort_by_key files to compile in parallel
adds 398b55c Splitting sort_by_key across too many files slows down compile times
adds 98f6f7c Fixing a bug introduced a couple of commits ago in OpenCL SIFT
adds 8bb6ad7 Merge pull request #966 from pavanky/compile_fixes
adds 0183b6a Fixing the output of af::info() for OpenCL backend
adds f34693e Updating documentation and adding version guards for for 3.1
adds c071d86 Merge pull request #964 from pavanky/fixes_310
adds 7fb28f6 Merge branch 'devel' into nonfree_fixes
adds af7681f Merge pull request #965 from pavanky/nonfree_fixes
adds 71c12e8 Remove set_scalar(x, 0) instructions
adds d0e1e7b Remove unused opencl/kernel/set files
adds f52389e Merge pull request #968 from shehzan10/opencl_fixes
adds d1b1a0e Fix typo
adds 12e4f9d Add SVD API
adds 369013c Add SVD Cuda backend
adds 2969c73 Add SVD CPU Backend
adds 82d039f Add SVD OpenCL Stump
adds 864c51d Add SVD example
adds 71da964 Merge pull request #882 from klemmster/cusolver_svd
adds 3794c80 Merge branch 'devel' into svd
adds bdbf30e Changes to style and fix compile errors
adds 20b5f51 Cleaning up cpu blas / lapack in OpenCL backend
adds e7e38b5 Fixes to supress annoying compiler warnings in OpenCL backend
adds fde6380 Adding functions from clMagma necessary for OpenCL SVD:
adds 73b8081 Initial support for SVD in OpenCL backend
adds 163ab37 Adding proper error checking in magma
adds eb66094 Fixing svd params to reflect clmagma
adds c265948 Merge remote-tracking branch 'upstream/devel' into svd
adds 549f6b5 Work around for issues in OpenCL svd
adds 5fc32dc API clean up and adding support for complex numbers for SVD
adds 4118733 Fixing various typos and bug fixes for SVD in CUDA and OpenCL
adds b3c5f0f TEST: for SVD
adds 974856b DOCS: Updating the documentation for SVD
adds 9ef664c Adding version guards for svd
adds 73717c1 Adding more pragma directives to supress GCC warnings
adds 083f6b8 TEST: updating SVD tests to contain all four floating point data types
adds 25975bb Fixing svd example to reflect the change in API
adds 0b76aa3 Revert "Updated boost compute version tags"
adds ed9e1be Compilation fixes for OSX
adds 8bcbf98 Use xGESVD instead of xGESDD for ARM platforms
adds 80593d5 Merge pull request #969 from pavanky/svd
adds 8755004 Added missing 3.1 version gaurds
adds 40912e5 BUGFIX SVD use gesdd only with MKL, use gesvd with atlas
adds 52923a1 Updated release notes for v3.1.0
adds 7f8799d Version guards for complex operators
adds ff39491 Added SIFT license info to release notes
adds e1bee7a SVD using gesdd on Apple
adds 2c7044b Merge pull request #970 from shehzan10/rel_31
adds aa8bc95 FEAT: Adding support for linear assignment in C API
adds 654dffb Restore original shape after flattening input for linear indexing
adds 5b671c2 Merge pull request #971 from pavanky/assign
adds 4b5725b FEAT: Adding support for linear indexing in C API
adds d2741e0 OpenCL backend af_info function string fixes
adds 52f45e9 Documentation for CUDA backend specific API
adds 41d517e Documentation fixes for 3.1 release
adds d27f91b Updated forge upstream tag for 3.1 release
adds 2712182 Documentation for missing index header functions
adds 164b39e Added missing docs for complex and opencl backend specific fns
adds 4f60872 Added missing docs for operator%, array::H and array::T
adds 1a52ee1 Merge pull request #973 from 9prady9/upstream_updates
adds e6acad6 Merge pull request #974 from arrayfire/devel
adds 4d7e369 Increment version to 3.1.1
adds e0b8b5b Fixes for snprintf on windows
adds 095f29e BUGFIX: For calculating number of elements for a buffer in CUDA backend
adds 619ecf7 BUGFIX: For calculating number of elements for a buffer in OpenCL backend
adds bce6967 TEST: Adding tests for indexed reductions
adds 704c0dc Merge pull request #983 from pavanky/indexed_reduce_fixes
adds 64a0dbf DOC fix for AF_PATH rendering missing %
adds e87a003 Read me fixes
adds 644d074 Try PkgConf first to find CBLAS
adds 7788c3b Merge pull request #990 from marbre/hotfixes-3.1.1-cblas
adds fc0ed41 Try PkgConf first to find LAPACKE
adds a083294 Merge pull request #991 from marbre/hotfixes-3.1.1-lapacke
adds 887ae72 Changes to Heston model to remove c++11 dependencies
adds 714cdc0 BUGFIX: seq.begin can now use negative offsets just like seq.end
adds 01af962 Updating Release notes for 3.1.1
adds eb9fde9 BUG: Fixing seq when passing af::end to af::seq
adds 5e91f81 Merge pull request #996 from pavanky/hotfixes-3.1.1
adds 3d7f26b bug fix in image_editing example
adds f2474cf Updated forge tag for 3.1 release
adds 90b61ce Merge pull request #997 from 9prady9/gfx_examples_fixes
adds 169af41 Updating release notes
adds 92e08e9 Merge pull request #998 from arrayfire/hotfixes-3.1.1
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (7527e5b)
\
N -- N -- N refs/heads/master (92e08e9)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omits" are not gone; other references still
refer to them. Any revisions marked "discards" are gone forever.
No new revisions were added by this update.
Summary of changes:
ArrayFireConfigVersion.cmake.in | 6 +-
CMakeLists.txt | 44 +-
CMakeModules/FindCBLAS.cmake | 25 +
CMakeModules/FindLAPACKE.cmake | 144 +-
CMakeModules/Version.cmake | 23 +-
CMakeModules/build_clFFT.cmake | 1 +
CMakeModules/build_forge.cmake | 18 +-
CMakeModules/version.h.in | 2 +
COPYRIGHT.md | 40 +-
LICENSES/OpenSIFT License.txt | 57 +
README.md | 12 +-
assets | 2 +-
docs/details/data.dox | 19 +-
docs/details/image.dox | 223 +
docs/details/index.dox | 19 +
docs/details/lapack.dox | 22 +
docs/details/signal.dox | 65 +-
docs/details/util.dox | 148 +
docs/details/vision.dox | 92 +
docs/doxygen.mk | 6 +-
docs/header.htm | 3 -
docs/highlight.pack.js | 1 -
docs/highlight_js_doxygen.css | 93 -
docs/pages/README.md | 6 +-
docs/pages/configuring_arrayfire_environment.md | 10 +
docs/pages/release_notes.md | 172 +-
docs/pages/using_on_linux.md | 2 +-
docs/pages/using_on_windows.md | 8 +-
examples/computer_vision/fast.cpp | 2 +-
examples/computer_vision/harris.cpp | 2 +-
examples/computer_vision/matching.cpp | 120 +
examples/computer_vision/{fast.cpp => susan.cpp} | 26 +-
examples/financial/heston_model.cpp | 115 +
examples/graphics/histogram.cpp | 4 +-
examples/image_processing/edge.cpp | 2 +-
examples/image_processing/filters.cpp | 8 +-
examples/image_processing/image_editing.cpp | 4 +-
examples/image_processing/morphing.cpp | 2 +-
examples/lin_algebra/{qr.cpp => svd.cpp} | 37 +-
examples/machine_learning/deep_belief_net.cpp | 6 -
examples/machine_learning/logistic_regression.cpp | 6 -
examples/machine_learning/neural_network.cpp | 6 -
examples/machine_learning/perceptron.cpp | 6 -
examples/machine_learning/rbm.cpp | 6 -
include/af/algorithm.h | 143 +
include/af/arith.h | 25 +
include/af/array.h | 273 +-
include/af/blas.h | 20 +-
include/af/complex.h | 64 +-
include/af/constants.h | 8 +
include/af/cuda.h | 93 +
include/af/data.h | 145 +-
include/af/defines.h | 46 +-
include/af/device.h | 34 +-
include/af/graphics.h | 26 +
include/af/image.h | 338 +-
include/af/index.h | 60 +-
include/af/lapack.h | 55 +
include/af/opencl.h | 31 +
include/af/signal.h | 540 +-
include/af/util.h | 261 +-
include/af/vision.h | 273 +
include/arrayfire.h | 15 +-
src/api/c/assign.cpp | 89 +-
src/api/c/blas.cpp | 16 +-
src/api/c/colorspace.cpp | 60 +-
src/api/c/complex.cpp | 2 +-
src/api/c/convolve.cpp | 22 +-
src/api/c/convolve_common.hpp | 11 +-
src/api/c/data.cpp | 34 +
src/api/c/device.cpp | 73 +
src/api/c/diff.cpp | 4 +
src/api/c/dog.cpp | 70 +
src/api/c/features.hpp | 8 +
src/api/c/fft.cpp | 204 +-
src/api/c/fft_common.hpp | 78 +
src/api/c/fftconvolve.cpp | 115 +-
src/api/c/flip.cpp | 2 +
src/api/c/hamming.cpp | 54 +-
src/api/c/harris.cpp | 84 +
src/api/c/image.cpp | 20 +-
src/api/c/imageio.cpp | 336 +-
src/api/c/index.cpp | 24 +-
src/api/c/join.cpp | 2 +
src/api/c/median.cpp | 5 +-
src/api/c/nearest_neighbour.cpp | 90 +
src/api/c/ops.hpp | 21 +
src/api/c/optypes.hpp | 2 +
src/api/c/print.cpp | 117 +-
src/api/c/rank.cpp | 14 +-
src/api/c/reduce.cpp | 79 +-
src/api/c/replace.cpp | 109 +
src/api/c/resize.cpp | 5 +-
src/api/c/rotate.cpp | 5 +-
src/api/c/sat.cpp | 57 +
src/api/c/scan.cpp | 2 +
src/api/c/select.cpp | 156 +
src/api/c/shift.cpp | 2 +
src/api/c/sift.cpp | 90 +
src/api/c/stream.cpp | 360 +
src/api/c/susan.cpp | 79 +
src/api/c/svd.cpp | 128 +
src/api/c/tile.cpp | 30 +-
src/api/c/transform.cpp | 20 +-
src/api/c/transpose.cpp | 16 +-
src/api/c/unary.cpp | 51 +-
src/api/c/unwrap.cpp | 64 +
src/api/c/wrap.cpp | 78 +
src/api/c/ycbcr_rgb.cpp | 160 +
src/api/cpp/array.cpp | 83 +-
src/api/cpp/complex.cpp | 61 +-
src/api/cpp/constants.cpp | 8 +
src/api/cpp/data.cpp | 32 +
src/api/cpp/{covariance.cpp => dog.cpp} | 8 +-
src/api/cpp/fft.cpp | 137 +
src/api/cpp/graphics.cpp | 5 +
src/api/cpp/hamming.cpp | 2 +-
src/api/cpp/{fast.cpp => harris.cpp} | 12 +-
src/api/cpp/imageio.cpp | 19 +
src/api/cpp/index.cpp | 43 +
src/api/cpp/lapack.cpp | 18 +
src/api/cpp/{hamming.cpp => nearest_neighbour.cpp} | 9 +-
src/api/cpp/reduce.cpp | 84 +-
src/api/cpp/{rotate.cpp => sat.cpp} | 4 +-
src/api/cpp/seq.cpp | 6 +-
src/api/cpp/{orb.cpp => sift.cpp} | 17 +-
src/api/cpp/{fast.cpp => susan.cpp} | 10 +-
src/api/cpp/unary.cpp | 1 +
src/api/cpp/{scan.cpp => unwrap.cpp} | 7 +-
src/api/cpp/util.cpp | 44 +-
src/api/cpp/{diff.cpp => wrap.cpp} | 20 +-
src/api/cpp/{regions.cpp => ycbcr_rgb.cpp} | 11 +-
src/backend/ArrayInfo.hpp | 5 +-
src/backend/cblas.cpp | 74 +-
src/backend/cpu/Array.cpp | 93 +-
src/backend/cpu/Array.hpp | 61 +-
src/backend/cpu/CMakeLists.txt | 57 +-
src/backend/cpu/TNJ/BinaryNode.hpp | 19 +-
src/backend/cpu/TNJ/BufferNode.hpp | 45 +-
src/backend/cpu/TNJ/Node.hpp | 10 +-
src/backend/cpu/TNJ/ScalarNode.hpp | 9 +-
src/backend/cpu/TNJ/UnaryNode.hpp | 15 +-
src/backend/cpu/blas.cpp | 205 +-
src/backend/cpu/blas.hpp | 12 +
src/backend/cpu/cast.hpp | 17 +
src/backend/cpu/convolve.cpp | 12 +-
src/backend/cpu/copy.cpp | 6 +
src/backend/cpu/copy.hpp | 3 +
src/backend/cpu/diff.cpp | 2 +
src/backend/cpu/fft.cpp | 178 +-
src/backend/cpu/fft.hpp | 10 +-
src/backend/cpu/fftconvolve.cpp | 14 +-
src/backend/cpu/hamming.cpp | 103 -
src/backend/cpu/harris.cpp | 265 +
src/backend/cpu/{fast.hpp => harris.hpp} | 11 +-
src/backend/cpu/iir.cpp | 4 +-
src/backend/cpu/iota.cpp | 2 +
src/backend/cpu/ireduce.cpp | 4 +
src/backend/cpu/memory.cpp | 46 +-
src/backend/cpu/memory.hpp | 3 +-
src/backend/cpu/nearest_neighbour.cpp | 175 +
.../cpu/{hamming.hpp => nearest_neighbour.hpp} | 9 +-
src/backend/cpu/orb.cpp | 9 +-
src/backend/cpu/platform.cpp | 177 +-
src/backend/cpu/random.cpp | 9 +-
src/backend/cpu/range.cpp | 2 +
src/backend/cpu/reduce.cpp | 49 +-
src/backend/cpu/reduce.hpp | 6 +-
src/backend/cpu/resize.cpp | 30 +
src/backend/cpu/rotate.cpp | 9 +-
src/backend/cpu/scan.cpp | 2 +
src/backend/cpu/select.cpp | 143 +
src/backend/cpu/{iota.hpp => select.hpp} | 10 +-
src/backend/cpu/shift.cpp | 2 +
src/backend/cpu/sift.cpp | 62 +
src/backend/cpu/{orb.hpp => sift.hpp} | 15 +-
src/backend/cpu/sift_nonfree.hpp | 1033 ++
src/backend/cpu/susan.cpp | 136 +
src/backend/cpu/susan.hpp | 24 +
src/backend/cpu/svd.cpp | 121 +
src/backend/cpu/{exampleFunction.hpp => svd.hpp} | 8 +-
src/backend/cpu/tile.cpp | 2 +
src/backend/cpu/transform.cpp | 16 +-
src/backend/cpu/transform_interp.hpp | 36 +
src/backend/cpu/unary.hpp | 7 +
src/backend/cpu/unwrap.cpp | 122 +
src/backend/cpu/{diff.hpp => unwrap.hpp} | 7 +-
src/backend/cpu/where.cpp | 3 +-
src/backend/cpu/wrap.cpp | 122 +
src/backend/cpu/{fftconvolve.hpp => wrap.hpp} | 12 +-
src/backend/cuda/Array.cpp | 33 +-
src/backend/cuda/Array.hpp | 52 +-
src/backend/cuda/JIT/cast.cu | 2 +
src/backend/cuda/JIT/exp.cu | 15 +
src/backend/cuda/JIT/hyper.cu | 2 +
src/backend/cuda/JIT/logic.cu | 2 +
src/backend/cuda/JIT/numeric.cu | 16 +
src/backend/cuda/JIT/trig.cu | 4 +
src/backend/cuda/all.cu | 2 +
src/backend/cuda/any.cu | 2 +
src/backend/cuda/blas.cpp | 102 +-
src/backend/cuda/convolve.cpp | 4 +-
src/backend/cuda/copy.cu | 14 +-
src/backend/cuda/copy.hpp | 3 +
src/backend/cuda/count.cu | 2 +
src/backend/cuda/cublasManager.cpp | 1 +
src/backend/cuda/debug_cuda.hpp | 38 +-
src/backend/cuda/diff.cu | 2 +
src/backend/cuda/err_cuda.hpp | 33 +-
src/backend/cuda/fft.cpp | 155 +-
src/backend/cuda/fft.hpp | 12 +-
src/backend/cuda/fftconvolve.cu | 14 +-
src/backend/cuda/hamming.cu | 62 -
src/backend/cuda/harris.cu | 59 +
src/backend/cuda/{fast.hpp => harris.hpp} | 10 +-
src/backend/cuda/hist_graphics.cu | 3 +-
src/backend/cuda/iir.cu | 4 +-
src/backend/cuda/image.cu | 3 +-
src/backend/cuda/iota.cu | 2 +
src/backend/cuda/ireduce.cu | 4 +
src/backend/cuda/jit.cpp | 68 +-
src/backend/cuda/kernel/approx.hpp | 8 +-
src/backend/cuda/kernel/assign.hpp | 2 +-
src/backend/cuda/kernel/atomics.hpp | 59 +
src/backend/cuda/kernel/bilateral.hpp | 31 +-
src/backend/cuda/kernel/convolve.cu | 37 +-
src/backend/cuda/kernel/convolve_separable.cu | 5 +-
src/backend/cuda/kernel/diagonal.hpp | 4 +-
src/backend/cuda/kernel/diff.hpp | 4 +-
src/backend/cuda/kernel/exampleFunction.hpp | 11 +-
src/backend/cuda/kernel/fast.hpp | 94 +-
src/backend/cuda/kernel/fftconvolve.hpp | 58 +-
src/backend/cuda/kernel/gradient.hpp | 3 +-
src/backend/cuda/kernel/harris.hpp | 382 +
src/backend/cuda/kernel/histogram.hpp | 5 +-
src/backend/cuda/kernel/hsv_rgb.hpp | 2 +-
src/backend/cuda/kernel/identity.hpp | 2 +-
src/backend/cuda/kernel/iir.hpp | 2 +-
src/backend/cuda/kernel/index.hpp | 2 +-
src/backend/cuda/kernel/iota.hpp | 5 +-
src/backend/cuda/kernel/ireduce.hpp | 18 +-
src/backend/cuda/kernel/join.hpp | 6 +-
src/backend/cuda/kernel/lookup.hpp | 4 +-
src/backend/cuda/kernel/lu_split.hpp | 4 +-
src/backend/cuda/kernel/match_template.hpp | 3 +-
src/backend/cuda/kernel/meanshift.hpp | 40 +-
src/backend/cuda/kernel/medfilt.hpp | 49 +-
src/backend/cuda/kernel/memcopy.hpp | 12 +-
src/backend/cuda/kernel/morph.hpp | 157 +-
.../kernel/{hamming.hpp => nearest_neighbour.hpp} | 194 +-
src/backend/cuda/kernel/orb.hpp | 84 +-
src/backend/cuda/kernel/random.hpp | 8 +-
src/backend/cuda/kernel/range.hpp | 2 +-
src/backend/cuda/kernel/reduce.hpp | 106 +-
src/backend/cuda/kernel/regions.hpp | 25 +-
src/backend/cuda/kernel/reorder.hpp | 5 +-
src/backend/cuda/kernel/resize.hpp | 28 +-
src/backend/cuda/kernel/rotate.hpp | 6 +-
src/backend/cuda/kernel/scan_dim.hpp | 10 +-
src/backend/cuda/kernel/scan_first.hpp | 11 +-
src/backend/cuda/kernel/select.hpp | 155 +
src/backend/cuda/kernel/shift.hpp | 5 +-
src/backend/cuda/kernel/sift_nonfree.hpp | 1370 +++
src/backend/cuda/kernel/sobel.hpp | 36 +-
src/backend/cuda/kernel/sort.hpp | 5 +-
src/backend/cuda/kernel/sort_by_key.hpp | 6 +-
src/backend/cuda/kernel/sort_index.hpp | 12 +-
src/backend/cuda/kernel/susan.hpp | 180 +
src/backend/cuda/kernel/tile.hpp | 2 +-
src/backend/cuda/kernel/transform.hpp | 15 +-
src/backend/cuda/kernel/transform_interp.hpp | 42 +-
src/backend/cuda/kernel/transpose.hpp | 4 +-
src/backend/cuda/kernel/transpose_inplace.hpp | 4 +-
src/backend/cuda/kernel/triangle.hpp | 3 +-
src/backend/cuda/kernel/unwrap.hpp | 144 +
src/backend/cuda/kernel/where.hpp | 3 +-
src/backend/cuda/kernel/wrap.hpp | 113 +
src/backend/cuda/math.hpp | 3 +
src/backend/cuda/max.cu | 2 +
src/backend/cuda/memory.cpp | 62 +-
src/backend/cuda/memory.hpp | 3 +-
src/backend/cuda/min.cu | 2 +
src/backend/cuda/morph3d_impl.hpp | 5 +-
src/backend/cuda/morph_impl.hpp | 5 +-
src/backend/cuda/nearest_neighbour.cu | 79 +
.../cuda/{hamming.hpp => nearest_neighbour.hpp} | 9 +-
src/backend/cuda/platform.cpp | 25 +-
src/backend/cuda/platform.hpp | 5 +
src/backend/cuda/plot.cu | 3 +-
src/backend/cuda/product.cu | 2 +
src/backend/cuda/random.cu | 2 +
src/backend/cuda/range.cu | 2 +
src/backend/cuda/reduce.hpp | 6 +-
src/backend/cuda/reduce_impl.hpp | 13 +-
src/backend/cuda/resize.cu | 3 +
src/backend/cuda/rotate.cu | 9 +-
src/backend/cuda/scan.cu | 2 +
src/backend/cuda/select.cu | 51 +
src/backend/cuda/{iota.hpp => select.hpp} | 11 +-
src/backend/cuda/set.cu | 17 +-
src/backend/cuda/shift.cu | 2 +
src/backend/cuda/sift.cu | 89 +
src/backend/cuda/{orb.hpp => sift.hpp} | 15 +-
src/backend/cuda/sum.cu | 2 +
src/backend/cuda/susan.cu | 67 +
src/backend/cuda/susan.hpp | 24 +
src/backend/cuda/svd.cu | 155 +
src/backend/cuda/{cholesky.hpp => svd.hpp} | 8 +-
src/backend/cuda/tile.cu | 2 +
src/backend/cuda/transform.cu | 12 +-
src/backend/cuda/unary.hpp | 1 +
src/backend/cuda/unwrap.cu | 57 +
src/backend/cuda/{identity.hpp => unwrap.hpp} | 4 +-
src/backend/cuda/wrap.cu | 57 +
src/backend/cuda/{exampleFunction.hpp => wrap.hpp} | 9 +-
src/backend/dim4.cpp | 25 +-
src/backend/lapacke.cpp | 168 +
src/backend/lapacke.hpp | 93 +-
src/backend/opencl/Array.cpp | 33 +-
src/backend/opencl/Array.hpp | 51 +-
src/backend/opencl/CMakeLists.txt | 111 +-
src/backend/opencl/all.cpp | 2 +
src/backend/opencl/any.cpp | 2 +
src/backend/opencl/blas.cpp | 86 +-
src/backend/opencl/{fftconvolve.hpp => cache.hpp} | 16 +-
src/backend/opencl/cl.hpp | 10803 ++++++++++---------
src/backend/opencl/convolve.cpp | 4 +-
src/backend/opencl/convolve_separable.cpp | 44 +-
src/backend/opencl/copy.cpp | 7 +
src/backend/opencl/copy.hpp | 3 +
src/backend/opencl/count.cpp | 2 +
src/backend/opencl/diff.cpp | 2 +
src/backend/opencl/err_opencl.hpp | 22 +-
src/backend/opencl/fft.cpp | 225 +-
src/backend/opencl/fft.hpp | 12 +-
src/backend/opencl/fftconvolve.cpp | 12 +-
src/backend/opencl/hamming.cpp | 143 -
src/backend/opencl/harris.cpp | 57 +
src/backend/opencl/{fast.hpp => harris.hpp} | 10 +-
src/backend/opencl/iir.cpp | 4 +-
src/backend/opencl/iota.cpp | 2 +
src/backend/opencl/ireduce.cpp | 4 +
src/backend/opencl/jit.cpp | 48 +-
src/backend/opencl/kernel/approx.hpp | 11 +-
src/backend/opencl/kernel/approx1.cl | 9 +-
src/backend/opencl/kernel/approx2.cl | 10 +-
src/backend/opencl/kernel/bilateral.cl | 32 +-
src/backend/opencl/kernel/convolve.hpp | 6 +-
src/backend/opencl/kernel/convolve/conv2_impl.hpp | 117 +-
src/backend/opencl/kernel/convolve_separable.cpp | 131 +
src/backend/opencl/kernel/convolve_separable.hpp | 79 +-
src/backend/opencl/kernel/fast.hpp | 98 +-
src/backend/opencl/kernel/fftconvolve.hpp | 24 +-
src/backend/opencl/kernel/fftconvolve_multiply.cl | 6 +-
src/backend/opencl/kernel/fftconvolve_reorder.cl | 15 +-
src/backend/opencl/kernel/gradient.cl | 3 +-
src/backend/opencl/kernel/gradient.hpp | 7 +-
src/backend/opencl/kernel/harris.cl | 117 +
src/backend/opencl/kernel/harris.hpp | 343 +
src/backend/opencl/kernel/ireduce.hpp | 183 +-
src/backend/opencl/kernel/jit.cl | 1 +
src/backend/opencl/kernel/matchTemplate.cl | 2 +-
src/backend/opencl/kernel/meanshift.cl | 36 +-
src/backend/opencl/kernel/medfilt.cl | 37 +-
src/backend/opencl/kernel/morph.cl | 128 +-
.../kernel/{hamming.cl => nearest_neighbour.cl} | 66 +-
.../kernel/{hamming.hpp => nearest_neighbour.hpp} | 98 +-
src/backend/opencl/kernel/ops.cl | 8 +-
src/backend/opencl/kernel/orb.hpp | 12 +-
src/backend/opencl/kernel/random.cl | 45 +-
src/backend/opencl/kernel/random.hpp | 4 +
src/backend/opencl/kernel/reduce.hpp | 250 +-
src/backend/opencl/kernel/reduce_dim.cl | 4 +-
src/backend/opencl/kernel/reduce_first.cl | 4 +-
src/backend/opencl/kernel/regions.hpp | 5 +
src/backend/opencl/kernel/resize.cl | 21 +
src/backend/opencl/kernel/resize.hpp | 3 +-
src/backend/opencl/kernel/rotate.cl | 1 +
src/backend/opencl/kernel/rotate.hpp | 11 +-
src/backend/opencl/kernel/scan_dim.hpp | 207 +-
src/backend/opencl/kernel/scan_first.hpp | 196 +-
src/backend/opencl/kernel/select.cl | 90 +
src/backend/opencl/kernel/select.hpp | 175 +
src/backend/opencl/kernel/set.cl | 20 -
src/backend/opencl/kernel/set.hpp | 67 -
src/backend/opencl/kernel/sift_nonfree.cl | 806 ++
src/backend/opencl/kernel/sift_nonfree.hpp | 784 ++
src/backend/opencl/kernel/sobel.cl | 30 +-
src/backend/opencl/kernel/sort.hpp | 6 +
src/backend/opencl/kernel/sort_by_key.hpp | 6 +
src/backend/opencl/kernel/sort_index.hpp | 10 +-
src/backend/opencl/kernel/susan.cl | 113 +
src/backend/opencl/kernel/susan.hpp | 143 +
src/backend/opencl/kernel/transform.cl | 1 +
src/backend/opencl/kernel/transform.hpp | 13 +-
src/backend/opencl/kernel/transform_interp.cl | 40 +-
src/backend/opencl/kernel/unwrap.cl | 77 +
src/backend/opencl/kernel/unwrap.hpp | 117 +
src/backend/opencl/kernel/where.hpp | 7 +-
src/backend/opencl/kernel/wrap.cl | 74 +
src/backend/opencl/kernel/wrap.hpp | 112 +
src/backend/opencl/magma/gebrd.cpp | 368 +
src/backend/opencl/magma/geqrf2.cpp | 19 +-
src/backend/opencl/magma/geqrf3.cpp | 19 +-
src/backend/opencl/magma/getrf.cpp | 130 +-
src/backend/opencl/magma/getrs.cpp | 30 +-
src/backend/opencl/magma/labrd.cpp | 668 ++
src/backend/opencl/magma/larfb.cpp | 125 +-
src/backend/opencl/magma/laset.cpp | 44 +
src/backend/opencl/magma/laset_band.cpp | 44 +
src/backend/opencl/magma/laswp.cpp | 44 +
src/backend/opencl/magma/magma.h | 21 +
src/backend/opencl/magma/magma_blas.h | 57 +-
src/backend/opencl/magma/magma_cpu_blas.h | 91 +
src/backend/opencl/magma/magma_cpu_lapack.h | 143 +-
src/backend/opencl/magma/magma_helper.cpp | 20 +
src/backend/opencl/magma/magma_helper.h | 3 +
src/backend/opencl/magma/potrf.cpp | 130 +-
src/backend/opencl/magma/transpose.cpp | 44 +
src/backend/opencl/magma/transpose_inplace.cpp | 44 +
src/backend/opencl/magma/ungqr.cpp | 11 +-
src/backend/opencl/magma/unmqr.cpp | 11 +-
src/backend/opencl/magma/unmqr2.cpp | 10 +-
src/backend/opencl/max.cpp | 2 +
src/backend/opencl/memory.cpp | 46 +-
src/backend/opencl/memory.hpp | 4 +-
src/backend/opencl/min.cpp | 2 +
src/backend/opencl/nearest_neighbour.cpp | 115 +
.../opencl/{hamming.hpp => nearest_neighbour.hpp} | 9 +-
src/backend/opencl/platform.cpp | 33 +-
src/backend/opencl/platform.hpp | 1 +
src/backend/opencl/product.cpp | 2 +
src/backend/opencl/random.cpp | 2 +
src/backend/opencl/range.cpp | 2 +
src/backend/opencl/reduce.hpp | 5 +-
src/backend/opencl/reduce_impl.hpp | 13 +-
src/backend/opencl/resize.cpp | 3 +
src/backend/opencl/rotate.cpp | 12 +-
src/backend/opencl/scan.cpp | 14 +-
src/backend/opencl/select.cpp | 52 +
src/backend/opencl/{iota.hpp => select.hpp} | 11 +-
src/backend/opencl/set.cpp | 6 +
src/backend/opencl/shift.cpp | 2 +
src/backend/opencl/sift.cpp | 82 +
src/backend/opencl/{orb.hpp => sift.hpp} | 15 +-
src/backend/opencl/solve.cpp | 80 +-
.../convolve/conv2_s32.cpp => sort_by_key/b8.cpp} | 12 +-
.../convolve/conv2_s32.cpp => sort_by_key/f32.cpp} | 12 +-
.../convolve/conv2_s32.cpp => sort_by_key/f64.cpp} | 12 +-
.../{sort_by_key.cpp => sort_by_key/impl.hpp} | 34 +-
.../convolve/conv2_s32.cpp => sort_by_key/s32.cpp} | 12 +-
.../convolve/conv2_s32.cpp => sort_by_key/u32.cpp} | 12 +-
.../convolve/conv2_s32.cpp => sort_by_key/u8.cpp} | 12 +-
src/backend/opencl/sum.cpp | 2 +
src/backend/opencl/susan.cpp | 76 +
src/backend/opencl/susan.hpp | 24 +
src/backend/opencl/svd.cpp | 244 +
src/backend/opencl/{assign.hpp => svd.hpp} | 7 +-
src/backend/opencl/tile.cpp | 2 +
src/backend/opencl/traits.hpp | 10 -
src/backend/opencl/transform.cpp | 17 +-
src/backend/opencl/unary.hpp | 1 +
src/backend/opencl/unwrap.cpp | 56 +
src/backend/opencl/{identity.hpp => unwrap.hpp} | 4 +-
src/backend/opencl/wrap.cpp | 57 +
src/backend/opencl/{fftconvolve.hpp => wrap.hpp} | 11 +-
test/CMakeLists.txt | 7 +-
test/array.cpp | 24 +
test/assign.cpp | 193 +-
test/binary.cpp | 49 +-
test/complex.cpp | 18 +-
test/data | 2 +-
test/diagonal.cpp | 8 +
test/diff1.cpp | 2 +-
test/diff2.cpp | 2 +-
test/dog.cpp | 76 +
test/dot.cpp | 187 +
test/fast.cpp | 105 +-
test/fft.cpp | 106 +-
test/fft_real.cpp | 120 +
test/fftconvolve.cpp | 39 +
test/getting_started.cpp | 8 +-
test/gfor.cpp | 247 +
test/{fast.cpp => harris.cpp} | 152 +-
test/imageio.cpp | 64 +
test/index.cpp | 118 +
test/iota.cpp | 2 +-
test/ireduce.cpp | 45 +
test/manual_memory_test.cpp | 69 +
test/math.cpp | 22 +
test/memory.cpp | 261 +-
test/memory_lock.cpp | 95 +
test/nearest_neighbour.cpp | 218 +
test/random.cpp | 84 +-
test/range.cpp | 2 +-
test/rank_dense.cpp | 88 +
test/reduce.cpp | 247 +-
test/replace.cpp | 120 +
test/sat.cpp | 45 +
test/scan.cpp | 6 +-
test/select.cpp | 128 +
test/shift.cpp | 2 +-
test/{orb.cpp => sift_nonfree.cpp} | 136 +-
test/susan.cpp | 181 +
test/svd_dense.cpp | 99 +
test/tile.cpp | 2 +-
test/transpose.cpp | 18 +-
test/unwrap.cpp | 177 +
test/wrap.cpp | 179 +
test/{hsv_rgb.cpp => ycbcr_rgb.cpp} | 14 +-
510 files changed, 30587 insertions(+), 9932 deletions(-)
create mode 100644 LICENSES/OpenSIFT License.txt
create mode 100644 docs/details/index.dox
create mode 100644 docs/details/util.dox
delete mode 100644 docs/highlight.pack.js
delete mode 100644 docs/highlight_js_doxygen.css
create mode 100644 examples/computer_vision/matching.cpp
copy examples/computer_vision/{fast.cpp => susan.cpp} (73%)
create mode 100644 examples/financial/heston_model.cpp
copy examples/lin_algebra/{qr.cpp => svd.cpp} (61%)
create mode 100644 include/af/cuda.h
create mode 100644 src/api/c/dog.cpp
create mode 100644 src/api/c/fft_common.hpp
create mode 100644 src/api/c/harris.cpp
create mode 100644 src/api/c/nearest_neighbour.cpp
create mode 100644 src/api/c/replace.cpp
create mode 100644 src/api/c/sat.cpp
create mode 100644 src/api/c/select.cpp
create mode 100644 src/api/c/sift.cpp
create mode 100644 src/api/c/stream.cpp
create mode 100644 src/api/c/susan.cpp
create mode 100644 src/api/c/svd.cpp
create mode 100644 src/api/c/unwrap.cpp
create mode 100644 src/api/c/wrap.cpp
create mode 100644 src/api/c/ycbcr_rgb.cpp
copy src/api/cpp/{covariance.cpp => dog.cpp} (69%)
copy src/api/cpp/{fast.cpp => harris.cpp} (56%)
copy src/api/cpp/{hamming.cpp => nearest_neighbour.cpp} (60%)
copy src/api/cpp/{rotate.cpp => sat.cpp} (75%)
copy src/api/cpp/{orb.cpp => sift.cpp} (50%)
copy src/api/cpp/{fast.cpp => susan.cpp} (58%)
copy src/api/cpp/{scan.cpp => unwrap.cpp} (62%)
copy src/api/cpp/{diff.cpp => wrap.cpp} (51%)
copy src/api/cpp/{regions.cpp => ycbcr_rgb.cpp} (63%)
delete mode 100644 src/backend/cpu/hamming.cpp
create mode 100644 src/backend/cpu/harris.cpp
copy src/backend/cpu/{fast.hpp => harris.hpp} (54%)
create mode 100644 src/backend/cpu/nearest_neighbour.cpp
rename src/backend/cpu/{hamming.hpp => nearest_neighbour.hpp} (56%)
create mode 100644 src/backend/cpu/select.cpp
copy src/backend/cpu/{iota.hpp => select.hpp} (58%)
create mode 100644 src/backend/cpu/sift.cpp
copy src/backend/cpu/{orb.hpp => sift.hpp} (50%)
create mode 100644 src/backend/cpu/sift_nonfree.hpp
create mode 100644 src/backend/cpu/susan.cpp
create mode 100644 src/backend/cpu/susan.hpp
create mode 100644 src/backend/cpu/svd.cpp
copy src/backend/cpu/{exampleFunction.hpp => svd.hpp} (63%)
create mode 100644 src/backend/cpu/unwrap.cpp
copy src/backend/cpu/{diff.hpp => unwrap.hpp} (68%)
create mode 100644 src/backend/cpu/wrap.cpp
copy src/backend/cpu/{fftconvolve.hpp => wrap.hpp} (55%)
delete mode 100644 src/backend/cuda/hamming.cu
create mode 100644 src/backend/cuda/harris.cu
copy src/backend/cuda/{fast.hpp => harris.hpp} (54%)
create mode 100644 src/backend/cuda/kernel/atomics.hpp
create mode 100644 src/backend/cuda/kernel/harris.hpp
rename src/backend/cuda/kernel/{hamming.hpp => nearest_neighbour.hpp} (67%)
create mode 100644 src/backend/cuda/kernel/select.hpp
create mode 100644 src/backend/cuda/kernel/sift_nonfree.hpp
create mode 100644 src/backend/cuda/kernel/susan.hpp
create mode 100644 src/backend/cuda/kernel/unwrap.hpp
create mode 100644 src/backend/cuda/kernel/wrap.hpp
create mode 100644 src/backend/cuda/nearest_neighbour.cu
rename src/backend/cuda/{hamming.hpp => nearest_neighbour.hpp} (57%)
create mode 100644 src/backend/cuda/select.cu
copy src/backend/cuda/{iota.hpp => select.hpp} (58%)
create mode 100644 src/backend/cuda/sift.cu
copy src/backend/cuda/{orb.hpp => sift.hpp} (50%)
create mode 100644 src/backend/cuda/susan.cu
create mode 100644 src/backend/cuda/susan.hpp
create mode 100644 src/backend/cuda/svd.cu
copy src/backend/cuda/{cholesky.hpp => svd.hpp} (63%)
create mode 100644 src/backend/cuda/unwrap.cu
copy src/backend/cuda/{identity.hpp => unwrap.hpp} (69%)
create mode 100644 src/backend/cuda/wrap.cu
copy src/backend/cuda/{exampleFunction.hpp => wrap.hpp} (53%)
copy src/backend/opencl/{fftconvolve.hpp => cache.hpp} (54%)
delete mode 100644 src/backend/opencl/hamming.cpp
create mode 100644 src/backend/opencl/harris.cpp
copy src/backend/opencl/{fast.hpp => harris.hpp} (54%)
create mode 100644 src/backend/opencl/kernel/convolve_separable.cpp
create mode 100644 src/backend/opencl/kernel/harris.cl
create mode 100644 src/backend/opencl/kernel/harris.hpp
rename src/backend/opencl/kernel/{hamming.cl => nearest_neighbour.cl} (89%)
rename src/backend/opencl/kernel/{hamming.hpp => nearest_neighbour.hpp} (55%)
create mode 100644 src/backend/opencl/kernel/select.cl
create mode 100644 src/backend/opencl/kernel/select.hpp
delete mode 100644 src/backend/opencl/kernel/set.cl
delete mode 100644 src/backend/opencl/kernel/set.hpp
create mode 100644 src/backend/opencl/kernel/sift_nonfree.cl
create mode 100644 src/backend/opencl/kernel/sift_nonfree.hpp
create mode 100644 src/backend/opencl/kernel/susan.cl
create mode 100644 src/backend/opencl/kernel/susan.hpp
create mode 100644 src/backend/opencl/kernel/unwrap.cl
create mode 100644 src/backend/opencl/kernel/unwrap.hpp
create mode 100644 src/backend/opencl/kernel/wrap.cl
create mode 100644 src/backend/opencl/kernel/wrap.hpp
create mode 100644 src/backend/opencl/magma/gebrd.cpp
create mode 100644 src/backend/opencl/magma/labrd.cpp
create mode 100644 src/backend/opencl/magma/magma_cpu_blas.h
create mode 100644 src/backend/opencl/nearest_neighbour.cpp
rename src/backend/opencl/{hamming.hpp => nearest_neighbour.hpp} (58%)
create mode 100644 src/backend/opencl/select.cpp
copy src/backend/opencl/{iota.hpp => select.hpp} (59%)
create mode 100644 src/backend/opencl/sift.cpp
copy src/backend/opencl/{orb.hpp => sift.hpp} (50%)
copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/b8.cpp} (79%)
copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/f32.cpp} (79%)
copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/f64.cpp} (79%)
rename src/backend/opencl/{sort_by_key.cpp => sort_by_key/impl.hpp} (57%)
copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/s32.cpp} (79%)
copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/u32.cpp} (79%)
copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/u8.cpp} (79%)
create mode 100644 src/backend/opencl/susan.cpp
create mode 100644 src/backend/opencl/susan.hpp
create mode 100644 src/backend/opencl/svd.cpp
copy src/backend/opencl/{assign.hpp => svd.hpp} (62%)
create mode 100644 src/backend/opencl/unwrap.cpp
copy src/backend/opencl/{identity.hpp => unwrap.hpp} (69%)
create mode 100644 src/backend/opencl/wrap.cpp
copy src/backend/opencl/{fftconvolve.hpp => wrap.hpp} (55%)
create mode 100644 test/dog.cpp
create mode 100644 test/dot.cpp
create mode 100644 test/fft_real.cpp
copy test/{fast.cpp => harris.cpp} (56%)
create mode 100644 test/manual_memory_test.cpp
create mode 100644 test/memory_lock.cpp
create mode 100644 test/nearest_neighbour.cpp
create mode 100644 test/rank_dense.cpp
create mode 100644 test/replace.cpp
create mode 100644 test/sat.cpp
create mode 100644 test/select.cpp
copy test/{orb.cpp => sift_nonfree.cpp} (66%)
create mode 100644 test/susan.cpp
create mode 100644 test/svd_dense.cpp
create mode 100644 test/unwrap.cpp
create mode 100644 test/wrap.cpp
copy test/{hsv_rgb.cpp => ycbcr_rgb.cpp} (83%)
--
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/debian-science/packages/arrayfire.git
More information about the debian-science-commits
mailing list