[arrayfire] branch master updated (7527e5b -> 92e08e9)

Ghislain Vaillant ghisvail-guest at moszumanska.debian.org
Mon Sep 21 19:20:35 UTC 2015


This is an automated email from the git hooks/post-receive script.

ghisvail-guest pushed a change to branch master
in repository arrayfire.

     omits  7527e5b   Merge remote-tracking branch 'upstream/master'
     omits  d86f4f5   Merge remote-tracking branch 'upstream/master'
      adds  36708b0   Merge pull request #815 from arrayfire/hotfixes-3.0.1
      adds  3053edd   Implement CUDA complex dot product
      adds  ab35ad1   Add complex dot product for CPU backend
      adds  0372645   Add complex dot product for OpenCL backend
      adds  0bd1110   Add forgotten instatiations
      adds  075f48d   Merge pull request #817 from FilipeMaia/complex_dot_product
      adds  f00a396   FEAT: Difference of Gaussians
      adds  b037a8c   Merge pull request #835 from 9prady9/dog
      adds  b0e4992   FEAT Add CPU backend for unwrap function
      adds  73d9038   Added CUDA backend for Unwrap
      adds  669b4b2   Added OpenCL backend for Unwrap
      adds  4badf65   Bugfixes, comments
      adds  9b13e8a   Adding padding for strides > 1
      adds  9281f4e   Added test for unwrap, updated data submodule
      adds  ec5c0d1   Added documentation for unwrap
      adds  2816c01   Changing behavior of unwrap using padding
      adds  b3d5bba   Changed unwrap tests to new behavior
      adds  dafe47a   Updated documentation for unwrap
      adds  dc18c34   Unwrap: Remove duplicate asserts, add intl, uintl to test
      adds  ce0d03f   Merge branch 'unwrap' of shehzan10/arrayfire into devel
      adds  967ffb0   Make dim4 a POD object
      adds  4c9962b   Make ArrayInfo a POD object
      adds  8204912   Add static checks for POD for dim4 and ArrayInfo
      adds  91f7431   Make Array<T> a standard layout type
      adds  f037380   Ensure Array<T> is standard layout using static_asserts
      adds  409b295   Revert "Make dim4 a POD object"
      adds  67d0f60   Reduce size of Array<T> by rearranging mem vars
      adds  0492397   Merge pull request #842 from umar456/pod
      adds  ee72370   Add static checks to make sure ArrayInfo is the first mem var
      adds  19b7067   Merge pull request #843 from umar456/offset_check
      adds  152fccf   FEAT: Implementing array::lock() and array::unlock()
      adds  691f0d1   FEAT / TEST: Adding af::copy()
      adds  94909e1   Merge pull request #848 from pavanky/new_additions
      adds  6bd71f0   DOCS: Fixing documentation for lock and unlock
      adds  2175ccb   Added AF_INTERP_LOWER and implementation for resize
      adds  d6bf32e   Merge pull request #852 from shehzan10/resize_lower
      adds  470cf99   FEAT: intl/uintl for random
      adds  7883d32   FEAT: intl/uintl support for all reduce functions
      adds  16b4c0a   TEST: Refactor reduce tests. Test intl/uintl
      adds  cf58d66   BUILD: Fix redefinition warning in blas.
      adds  29d0df7   Revert "BUILD: Fix redefinition warning in blas."
      adds  6f3e9f8   Merge pull request #855 from umar456/long_long
      adds  e2ee96e   FEAT Added nearest neighbour with SSD, SAD and SHD
      adds  9f46f19   Added tests for nearest neighbour
      adds  f3d4660   Added documentation for nearest neighbour
      adds  7f9b423   Merge remote-tracking branch 'upstream/devel' into nearestNeighbour
      adds  f999b05   Fix double compilation
      adds  90a9ffb   Remove redefition warning for blas
      adds  b626967   Added options for dotc and dotu to dot function
      adds  354a40b   Bug fixes for nearest neighbour and hamming
      adds  8540fa0   Added tests for dot
      adds  87ca992   Merge pull request #853 from shehzan10/nearestNeighbour
      adds  fd8d75d   Merge remote-tracking branch 'upstream/master' into devel
      adds  74c2b05   Compilation and warning fixes
      adds  2ae60bb   Merge branch 'master' into devel
      adds  8aae20b   BUFIX: Remove memory leak in af::copy()
      adds  1161ff5   Updating forge tag to fix build issuess with ninja
      adds  8ac5cb9   Merge pull request #863 from pavanky/bugfix
      adds  0672f56   FEAT: Adding function to get use_count of shared pointers
      adds  51b90c7   PERF: Do not make copies if the number of references is only 1
      adds  44d857d   Merge pull request #870 from pavanky/refs
      adds  a815801   Don't link the installed version of forge on Linux/OSX
      adds  7b657e4   Merge pull request #866 from JuliaComputing/kf/forgerpath
      adds  91cea3b   Add support for BLAS symbol renaming
      adds  319753f   Merge pull request #867 from JuliaComputing/kf/blasrename
      adds  28d9e71   Add lower interpolation to rotate and transform
      adds  99a9234   Allow users to set precision when using print
      adds  c62a250   Changed af_print macro, added documentation
      adds  e12dceb   Merge pull request #873 from shehzan10/more-lower
      adds  4dc9836   Allow building against 64bit index OpenBlas
      adds  c9fbfc5   Also use the void* interface for MKL on windows
      adds  1b315f9   STYLE: Remove macros; Simplify templates;
      adds  b774d57   Merge pull request #1 from umar456/JuliaComputing-kf/openblas64
      adds  895d6b2   Merge pull request #865 from JuliaComputing/kf/openblas64
      adds  8f37d8c   Fix print macro
      adds  998a0b8   Change build labels to be for devel branch
      adds  9d09831   Added noDoubleTest for nearest neighbour and dot test
      adds  3ebc19e   BUGFIX: Ignoring NaN values in min and max for all backends
      adds  3b2ce09   FEAT: Added support to substitute nan values for sum and product
      adds  9036d73   Adding missing instantiations for compat functions
      adds  248de68   TEST: Adding tests for reductions when using NaNs
      adds  93a581d   Merge pull request #875 from pavanky/reduce-nan
      adds  7c41de9   Added C API for Harris corner detector
      adds  4049083   Added C++ API for Harris corner detector
      adds  ea3c150   Added CPU backend for Harris corner detector
      adds  e9e8474   Added CUDA backend for Harris corner detector
      adds  d183732   Added OpenCL backend for Harris corner detector
      adds  cef733b   Added Harris corner detector unit tests
      adds  cfb97ca   Added documentation and function definitions for Harris
      adds  2b6712b   Fixed FAST unit tests
      adds  311b2c5   Updated data
      adds  7d1da23   Removed unused variables from OpenCL's sort_index
      adds  0cf4d80   Merge remote-tracking branch 'upstream/devel' into harris
      adds  a257ef4   Merge pull request #874 from pentschev/harris
      adds  bd21a82   BUGFIX: Fixing casting to and from complex numbers in CPU backend
      adds  56a000d   Adding more operator overloading for af::cfloat and af::cdouble
      adds  bb7bb77   BUGFIX: Making sure c32/c64 imitate f32/f64 when operating with scalars
      adds  2ee28e7   TEST: Add mixed type tests with complex inputs
      adds  5ef10b3   Merge pull request #877 from pavanky/cplx_fixes
      adds  f5ac99e   Fixed bug affecting Harris on AMD GPUs
      adds  d656a20   Renaming enums for convolve batch modes
      adds  d08400d   PERF: Improvements for non linear JIT kernels in OpenCL backend
      adds  9ee690c   TEST: Adding batched mode tests
      adds  f8e9398   PERF: Speeding up JIT for 3D arrays in OpenCL backends
      adds  5b49f07   BUGFIX: median of all elements is now fixed
      adds  05de8bf   PERF: Improvements to tile when tiling along singleton dimensions
      adds  ecdab67   PERF: Improvements to CUDA JIT for non linear 3D and 4D arrays
      adds  f8f072c   FEAT: Adding support for non overlapping batched convolution
      adds  2ca72cb   TEST: Adding tests for non overlapping convolves
      adds  73dbeb9   DOCS: Updating the documentation for convolution
      adds  c4b6ace   Merge pull request #889 from pavanky/features-3.1
      adds  a8c6ae9   Adding missing license for a few files
      adds  218043d   Updating version to 3.1.0
      adds  3a163f5   Changes in examples to reflect asset modifications
      adds  210978f   Merge pull request #891 from 9prady9/assets_changes
      adds  4ff4de3   turned off clFFT examples in external project build
      adds  eb9367a   fix forge dependency target errors in cpu, opencl backends
      adds  79fbe5d   Merge pull request #892 from 9prady9/cmake_fixes
      adds  fb17b53   Added AF_API_VERSION
      adds  724450f   Merge pull request #893 from shehzan10/devel
      adds  30cbbc7   Fix double free issue.
      adds  9cbc641   Add test for index copy assignment.
      adds  4d23d41   Add move constructor and move assignment op
      adds  4cc3a9b   Merge pull request #895 from FilipeMaia/devel
      adds  a5efdea   BUGFIX: Check for NULL values when allocating memory on CPU backend
      adds  981cb4a   Ensure CUDA and OpenCL return proper errors when out of memory
      adds  171d8dd   TEST: Adding test to trash the memory manager and see if it recovers
      adds  d22c0bb   FEAT,TEST: Adding sigmoid function for all backends
      adds  bc2b8f5   Adding the option to remove tests from ctest
      adds  1a2b405   Merge pull request #897 from pavanky/minor_changes
      adds  8253205   BUGFIX/TEST: Fixing bug in rank. Added appropriate tests
      adds  e7d75be   BUGFIX/TEST: Fixing not for C API. Added relevant tests.
      adds  1bde871   BUGFIX: Fixing a bug in randn for CPU backend
      adds  05509c9   BUGFIX: Fixing setSeed for randu
      adds  b8cfd77   TEST: Updating and fixing the randu/randn tests
      adds  bd39083   TEST: Updating random tests to properly reset seeds
      adds  cd78911   TEST: Fixing out of bounds access in fft tests
      adds  3fb199d   BUGFIX in randn for apple systems
      adds  214acf1   Renaming rank test to rank_dense
      adds  ed3a830   Merge pull request #906 from pavanky/bugfixes
      adds  c69ca0a   Added 64-bit integer type support for functions
      adds  426dc0b   Added print errors to documentation
      adds  a86171a   Merge pull request #910 from shehzan10/devel
      adds  89d7e65   Automatically enable and build CUDA and OpenCL backends by default.
      adds  5ad9ada   Suppress FIND output for non-essential libraries.
      adds  88a2c26   Merge pull request #909 from bkloppenborg/autobuild-backends
      adds  5df0554   BUGFIX: Fixed issues with mixed real and complex types
      adds  9564317   Fixing the checks for skew
      adds  fb24097   BUGFIX: conjg no longer errors out for real inputs
      adds  d4cd847   af_scale now checks for default parameters properly
      adds  3b19720   Merge pull request #915 from pavanky/fixes
      adds  f3df808   matchTemplate example
      adds  aca80f8   matchTemplate fix in opencl to support indexed template images
      adds  d322941   Removed uncessary copy in matchTemplate example
      adds  314bc9e   Added heat colormap to display disparity values for matching
      adds  51d441c   Merge pull request #922 from 9prady9/match_template_example
      adds  4c34bf2   Made cpu::Array constructor consistent with CUDA & OpenCL
      adds  3499b98   Fixed filenames for the files used in computer vision examples
      adds  de9ba74   SUSAN Corner Detector
      adds  1a8892b   Added error checks non-array parameters of susan API
      adds  2c3e0d8   CUDA backend for SUSAN dectector
      adds  e57ed91   OpenCL backend for SUSAN dectector
      adds  220f027   fixes: typos; specific to windows; additional unit test
      adds  307040b   Corrected memory allocation bug in cpu backend for orb, where
      adds  2f5c672   Removed uncessary corner sorting for SUSAN
      adds  e9535b9   Changed CUDA/OpenCL kernels to use zero leading dimension
      adds  57ffa26   typo fix in cuda SUSAN kernel
      adds  26df8b3   SUSAN CUDA/OpenCL: Added bound checks to load shared/local Memory
      adds  a9fd10c   Replaced static shared memory with dynamic in SUSAN CUDA kernel
      adds  3d405b9   Merge pull request #921 from 9prady9/susan
      adds  61ea091   FEAT: Summed Area Tables (sat, af_sat) a.k.a integral images
      adds  8f250e7   Added check for skipping double type test when not supported
      adds  d8760f6   Added uintl, intl support for jit operations in cuda backend
      adds  27f3385   Merge pull request #931 from 9prady9/sat
      adds  3ede60b   Changed default cuda stream to be non-zero
      adds  1363441   Wrapped cuda kernel launches with CUDA_LAUNCH macro
      adds  d46b15b   Added stream parameter for upstream{thrust, cufft, cublas} calls
      adds  60c2be9   thrust fixes for cuda stream selection on cuda < 7.0
      adds  f19ecf8   Merge pull request #929 from 9prady9/cuda_default_stream
      adds  16d8e27   Extended support for interleaved convolution
      adds  8aaf299   Updating the COPYRIGHT.md document
      adds  ddd704b   Updating the language in COPYRIGHT document
      adds  3cedee7   Added new example: heston_model in financial
      adds  4358056   Moving af_array info gathering functions from util.h to array.h
      adds  75c81f1   Merge pull request #928 from pavanky/minor_changes
      adds  499dc09   shared/local memory loading fix
      adds  8086d34   Merge pull request #933 from 9prady9/ker_fixes
      adds  60c4979   namespace fix for POST_LAUNCH_CHECK macro: cuda backend
      adds  6cf4a5a   Replaced cuda Memcopy/Memset with async versions
      adds  e2994d9   Merge pull request #935 from 9prady9/cudaMem_stream
      adds  f816f41   Making fft_inplace consistent across all backends
      adds  61968d6   FEAT / TEST: Adding support for inplace fft
      adds  822fd5a   Moving general fft implementation to src/api/c
      adds  b92e629   Removing consts from the fftInplace API
      adds  155293f   FEAT/TEST: Adding R2C and C2R FFT transforms for all backends
      adds  91f6e35   DOCS: Adding documentation for real to complex transforms
      adds  7900fca   Fixing a minor issue in ArrayFireConfigVersion.cmake file
      adds  d65d048   Fixing issue in documentation
      adds  0efb69a   Cleaning up multiply_inplace in cpu backend
      adds  0a1adee   Enabling memory manager back in cuda backend
      adds  d2e182f   Merge pull request #936 from pavanky/fft
      adds  3b4d983   typo fix in opencl morph kernel
      adds  dc38b24   graphics window set size functions
      adds  6efd999   style fixes in graphics examples
      adds  b47dc5b   Merge pull request #937 from 9prady9/wind_resize
      adds  910feb7   FEAT: Adding select for CPU backend
      adds  97c916a   FEAT: Select added for CUDA backend
      adds  773c02d   FEAT: Select for opencl backend
      adds  981c5e6   FEAT: replace for all backends
      adds  0472884   TEST: Adding tests for select and replace
      adds  5df1b8c   Merge pull request #941 from pavanky/new_funcs_31
      adds  b259259   YCbCr <-> RGB conversion functions
      adds  4c876fc   unit tests for YCbCr <-> RGB conversion
      adds  ce88eec   Documentation for ycbcr_rgb conversion functions
      adds  587aa2d   Modified colorspace function to handle new colorspace - YCbCr
      adds  b55a459   Modifed colorspace function wrapper code for efficiency
      adds  384cbc7   Replaced padArray calls with Jit operations in ycbcr conversions
      adds  f4cc97f   Updated forge tag
      adds  0da5291   Merge pull request #942 from 9prady9/ycbcr
      adds  1e60132   BUGFIX: Fixing offset issue with CPU element wise operations
      adds  26a2e40   PERF: improvements to element wise operations in CPU backend
      adds  9e20a23   Merge pull request #945 from pavanky/jit_fixes
      adds  77c31e6   FEAT: adding complex support for exp
      adds  345c579   Binary operations with floating point scalars default to single precision
      adds  cbdd5be   Merge pull request #944 from pavanky/minor
      adds  3e3be90   Fix type in linux doc
      adds  62b0360   Fix signed-usigned comp warnings
      adds  07ffda7   Fixes to print functions
      adds  50699cc   FEAT Added saveArray and readArray functions for file read/write
      adds  2a9e855   Removed af_print_array_c. af::print now calls af_print_array_p
      adds  57449b0   saveArray returns index of array
      adds  e59c953   Rename af_print_array_p to af_print_array_gen
      adds  ac92dda   FEAT added to string function
      adds  dc6b11c   Merge pull request #943 from shehzan10/stream
      adds  30dd47a   modified cpu::getInfo to display CPU information
      adds  4cd73be   Fixes for cpu backend getInfo on Tegra platform
      adds  2fa808b   Fixed missing header error for windows platform
      adds  791dfcc   Merge pull request #952 from 9prady9/cpuinfo
      adds  817ccc3   FEAT Added image IO using memory functions and tests
      adds  c43af47   Changed saveImageMem API. Added image format enum
      adds  05c3d62   Add typedef af_image_format af::imageFormat
      adds  0681ddd   Merge pull request #954 from shehzan10/imageio_mem
      adds  78b9b99   Style changes to code in unwrap
      adds  2d60aed   FEAT: Adding support to unwrap along rows as well as columns
      adds  a958628   BUGFIX: Fixed a bug for unwrap in all backends
      adds  7df4dcd   FEAT/TEST/DOC: Adding wrap for CPU backend
      adds  1909fa5   FEAT: wrap for CUDA backend
      adds  1fab923   Adding atomics.hpp file for CUDA that can be used in the future
      adds  365afb4   Moving the kernel cache map to a centralized location
      adds  2e05dd4   FEAT: wrap for OpenCL backend
      adds  a8113b0   Removing faulty test
      adds  aca30fa   Cleaning up unwrap code in OpenCL by using cache store
      adds  08b6ecf   Fixing the compile error on windows
      adds  f89e37b   Merge pull request #955 from pavanky/wrap
      adds  cd097af   Fix conjugate transpose for vectors
      adds  5867404   Merge pull request #958 from shehzan10/ctrans_fix
      adds  1580652   fix #957: remove unused dtype trait for size_t
      adds  a8d8837   Merge pull request #959 from ghisvail/bugfix/remove-unused-dtype-traits
      adds  7f06c0d   update cl.hpp to upstream version 1.2.7
      adds  30f9e3c   fix instantiation of Platform objects
      adds  162e258   fix instantiation of cl::Platform objects
      adds  7b759ce   adding GL headers in platform.cpp
      adds  af598d1   Moved GL headers in platform.cpp inside WITH_GRAPHICS block
      adds  9dbf2df   Merge pull request #961 from 9prady9/cl_hpp_fixes
      adds  3121ca1   Create instances for const index member functions
      adds  cc0f401   BUGFIX: in array_proxy::get() const
      adds  1ba33a6   Merge pull request #953 from umar456/fix_951
      adds  b6d8e2d   Updated boost compute version tags
      adds  0ce0df2   Merge pull request #962 from shehzan10/update-deps
      adds  028960f   Added SIFT prototypes and parameter documentation
      adds  60546a1   Added C API for SIFT
      adds  efe1331   Added C++ API for SIFT
      adds  3ef9fdf   Added CUDA backend for SIFT
      adds  b87ba86   Added OpenCL backend for SIFT
      adds  25344cb   Added CPU backend for SIFT
      adds  48e909a   Added AF_ERR_NONFREE to defines.h
      adds  1ee6b70   Added BUILD_NONFREE option to CMake
      adds  b177218   Added SIFT Copyright information
      adds  b69215c   Added OpenSIFT License
      adds  afe1e46   Added SIFT documentation
      adds  d1d122e   Updated test data
      adds  ca57067   Added SIFT unit tests
      adds  43e5fe1   Made SIFT image indexing more readable in CPU backend
      adds  a6ae78b   SIFT fix for CUDA on Windows, made it more readable
      adds  9d09e4d   Made SIFT image indexing more readable in OpenCL backend
      adds  cce0e73   Templated SIFT gaussianElimination() in CPU and CUDA backends
      adds  a5e8a5e   Added missing CUDA_LAUNCH and THRUST_SELECT to SIFT
      adds  bda81b0   Improved CUDA SIFT coalescing and performance
      adds  dd97ce5   Improved OpenCL SIFT coalescing and performance
      adds  a9a8140   Passing shared size memory dynamically to CUDA SIFT
      adds  3ada93a   Moved OpenCL's conv2Helper to kernel directory
      adds  b5cf8d5   Improved SIFT OpenCL code
      adds  079f4df   Using pre-defined constants for workgroup sizes in CUDA SIFT
      adds  c17da09   Using 3D arrays for Gaussian/DoG pyramids in CUDA SIFT
      adds  9bd1693   Using cudaMemsetAsync for SIFT
      adds  966a130   Fixed OpenCL SIFT bug causing segmentation faults on Intel
      adds  45a4966   Added missing buffer freeing call to OpenCL SIFT
      adds  298c4d8   Fixed CUDA SIFT on unused memory buffer
      adds  b563615   Changed SIFT unit test to use std::stable_sort()
      adds  faefa30   Moved syncthreads/barriers out of thread conditionals
      adds  ee4d0bd   Moving CUDA SIFT syncthreads calls out of thread conditionals
      adds  c88929e   Merge pull request #948 from pentschev/sift
      adds  b372e49   Reorganizing non free build process.
      adds  b9fac73   Changing build flags to build non free algorithms
      adds  5e0ceac   Templated options are now runtime compile options for opencl reductions
      adds  8bf6589   Templated options are now runtime compile options for opencl convolutions
      adds  bf69efc   Templated options are now runtime compile options for opencl indexed min/max
      adds  5c0da49   Templated options are now runtime compile options for opencl scan
      adds  39f9977   Templated options are now runtime compile options for opencl nearest neighbor
      adds  d03c3c0   Removing unnecessary switch case from opencl ireduce
      adds  2c00e64   Templated options are now runtime compile options for opencl FAST
      adds  72f7ff5   Splitting up opencl sort_by_key files to compile in parallel
      adds  398b55c   Splitting sort_by_key across too many files slows down compile times
      adds  98f6f7c   Fixing a bug introduced a couple of commits ago in OpenCL SIFT
      adds  8bb6ad7   Merge pull request #966 from pavanky/compile_fixes
      adds  0183b6a   Fixing the output of af::info() for OpenCL backend
      adds  f34693e   Updating documentation and adding version guards for for 3.1
      adds  c071d86   Merge pull request #964 from pavanky/fixes_310
      adds  7fb28f6   Merge branch 'devel' into nonfree_fixes
      adds  af7681f   Merge pull request #965 from pavanky/nonfree_fixes
      adds  71c12e8   Remove set_scalar(x, 0) instructions
      adds  d0e1e7b   Remove unused opencl/kernel/set files
      adds  f52389e   Merge pull request #968 from shehzan10/opencl_fixes
      adds  d1b1a0e   Fix typo
      adds  12e4f9d   Add SVD API
      adds  369013c   Add SVD Cuda backend
      adds  2969c73   Add SVD CPU Backend
      adds  82d039f   Add SVD OpenCL Stump
      adds  864c51d   Add SVD example
      adds  71da964   Merge pull request #882 from klemmster/cusolver_svd
      adds  3794c80   Merge branch 'devel' into svd
      adds  bdbf30e   Changes to style and fix compile errors
      adds  20b5f51   Cleaning up cpu blas / lapack in OpenCL backend
      adds  e7e38b5   Fixes to supress annoying compiler warnings in OpenCL backend
      adds  fde6380   Adding functions from clMagma necessary for OpenCL SVD:
      adds  73b8081   Initial support for SVD in OpenCL backend
      adds  163ab37   Adding proper error checking in magma
      adds  eb66094   Fixing svd params to reflect clmagma
      adds  c265948   Merge remote-tracking branch 'upstream/devel' into svd
      adds  549f6b5   Work around for issues in OpenCL svd
      adds  5fc32dc   API clean up and adding support for complex numbers for SVD
      adds  4118733   Fixing various typos and bug fixes for SVD in CUDA and OpenCL
      adds  b3c5f0f   TEST: for SVD
      adds  974856b   DOCS: Updating the documentation for SVD
      adds  9ef664c   Adding version guards for svd
      adds  73717c1   Adding more pragma directives to supress GCC warnings
      adds  083f6b8   TEST: updating SVD tests to contain all four floating point data types
      adds  25975bb   Fixing svd example to reflect the change in API
      adds  0b76aa3   Revert "Updated boost compute version tags"
      adds  ed9e1be   Compilation fixes for OSX
      adds  8bcbf98   Use xGESVD instead of xGESDD for ARM platforms
      adds  80593d5   Merge pull request #969 from pavanky/svd
      adds  8755004   Added missing 3.1 version gaurds
      adds  40912e5   BUGFIX SVD use gesdd only with MKL, use gesvd with atlas
      adds  52923a1   Updated release notes for v3.1.0
      adds  7f8799d   Version guards for complex operators
      adds  ff39491   Added SIFT license info to release notes
      adds  e1bee7a   SVD using gesdd on Apple
      adds  2c7044b   Merge pull request #970 from shehzan10/rel_31
      adds  aa8bc95   FEAT: Adding support for linear assignment in C API
      adds  654dffb   Restore original shape after flattening input for linear indexing
      adds  5b671c2   Merge pull request #971 from pavanky/assign
      adds  4b5725b   FEAT: Adding support for linear indexing in C API
      adds  d2741e0   OpenCL backend af_info function string fixes
      adds  52f45e9   Documentation for CUDA backend specific API
      adds  41d517e   Documentation fixes for 3.1 release
      adds  d27f91b   Updated forge upstream tag for 3.1 release
      adds  2712182   Documentation for missing index header functions
      adds  164b39e   Added missing docs for complex and opencl backend specific fns
      adds  4f60872   Added missing docs for operator%, array::H and array::T
      adds  1a52ee1   Merge pull request #973 from 9prady9/upstream_updates
      adds  e6acad6   Merge pull request #974 from arrayfire/devel
      adds  4d7e369   Increment version to 3.1.1
      adds  e0b8b5b   Fixes for snprintf on windows
      adds  095f29e   BUGFIX: For calculating number of elements for a buffer in CUDA backend
      adds  619ecf7   BUGFIX: For calculating number of elements for a buffer in OpenCL backend
      adds  bce6967   TEST: Adding tests for indexed reductions
      adds  704c0dc   Merge pull request #983 from pavanky/indexed_reduce_fixes
      adds  64a0dbf   DOC fix for AF_PATH rendering missing %
      adds  e87a003   Read me fixes
      adds  644d074   Try PkgConf first to find CBLAS
      adds  7788c3b   Merge pull request #990 from marbre/hotfixes-3.1.1-cblas
      adds  fc0ed41   Try PkgConf first to find LAPACKE
      adds  a083294   Merge pull request #991 from marbre/hotfixes-3.1.1-lapacke
      adds  887ae72   Changes to Heston model to remove c++11 dependencies
      adds  714cdc0   BUGFIX: seq.begin can now use negative offsets just like seq.end
      adds  01af962   Updating Release notes for 3.1.1
      adds  eb9fde9   BUG: Fixing seq when passing af::end to af::seq
      adds  5e91f81   Merge pull request #996 from pavanky/hotfixes-3.1.1
      adds  3d7f26b   bug fix in image_editing example
      adds  f2474cf   Updated forge tag for 3.1 release
      adds  90b61ce   Merge pull request #997 from 9prady9/gfx_examples_fixes
      adds  169af41   Updating release notes
      adds  92e08e9   Merge pull request #998 from arrayfire/hotfixes-3.1.1

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (7527e5b)
            \
             N -- N -- N   refs/heads/master (92e08e9)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omits" are not gone; other references still
refer to them.  Any revisions marked "discards" are gone forever.
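
As a rough illustration of the classification described above, the following Python sketch models the diagram as a toy commit graph and derives the "adds", "omits" and "discards" sets from reachability. It is not the hook's actual implementation. The commit names (A, B, O1..O3, N1..N3), the helper reachable() and the other_ref_tips list are hypothetical and exist only for this example.

```python
# Toy commit graph: each commit maps to the list of its parents.
# Names mirror the diagram above and are illustrative only.
parents = {
    "A": [],
    "B": ["A"],
    "O1": ["B"], "O2": ["O1"], "O3": ["O2"],  # old branch tip is O3
    "N1": ["B"], "N2": ["N1"], "N3": ["N2"],  # new branch tip is N3
}


def reachable(tip):
    """Return the set of commits reachable from tip by following parents."""
    seen, stack = set(), [tip]
    while stack:
        commit = stack.pop()
        if commit not in seen:
            seen.add(commit)
            stack.extend(parents[commit])
    return seen


old, new = reachable("O3"), reachable("N3")
removed = old - new  # the O revisions, no longer on the branch
adds = new - old     # the N revisions, new to the branch

# Assumption for this example: one other reference still points at O3.
other_ref_tips = ["O3"]
still_referenced = set().union(*(reachable(t) for t in other_ref_tips))

omits = removed & still_referenced     # still reachable from another ref
discards = removed - still_referenced  # reachable from no ref at all

print("adds:", sorted(adds))
print("omits:", sorted(omits))
print("discards:", sorted(discards))
```

With other_ref_tips left empty, the same three O revisions would fall into "discards" instead of "omits", which is the distinction the paragraph above draws.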

No new revisions were added by this update.

Summary of changes:
 ArrayFireConfigVersion.cmake.in                    |     6 +-
 CMakeLists.txt                                     |    44 +-
 CMakeModules/FindCBLAS.cmake                       |    25 +
 CMakeModules/FindLAPACKE.cmake                     |   144 +-
 CMakeModules/Version.cmake                         |    23 +-
 CMakeModules/build_clFFT.cmake                     |     1 +
 CMakeModules/build_forge.cmake                     |    18 +-
 CMakeModules/version.h.in                          |     2 +
 COPYRIGHT.md                                       |    40 +-
 LICENSES/OpenSIFT License.txt                      |    57 +
 README.md                                          |    12 +-
 assets                                             |     2 +-
 docs/details/data.dox                              |    19 +-
 docs/details/image.dox                             |   223 +
 docs/details/index.dox                             |    19 +
 docs/details/lapack.dox                            |    22 +
 docs/details/signal.dox                            |    65 +-
 docs/details/util.dox                              |   148 +
 docs/details/vision.dox                            |    92 +
 docs/doxygen.mk                                    |     6 +-
 docs/header.htm                                    |     3 -
 docs/highlight.pack.js                             |     1 -
 docs/highlight_js_doxygen.css                      |    93 -
 docs/pages/README.md                               |     6 +-
 docs/pages/configuring_arrayfire_environment.md    |    10 +
 docs/pages/release_notes.md                        |   172 +-
 docs/pages/using_on_linux.md                       |     2 +-
 docs/pages/using_on_windows.md                     |     8 +-
 examples/computer_vision/fast.cpp                  |     2 +-
 examples/computer_vision/harris.cpp                |     2 +-
 examples/computer_vision/matching.cpp              |   120 +
 examples/computer_vision/{fast.cpp => susan.cpp}   |    26 +-
 examples/financial/heston_model.cpp                |   115 +
 examples/graphics/histogram.cpp                    |     4 +-
 examples/image_processing/edge.cpp                 |     2 +-
 examples/image_processing/filters.cpp              |     8 +-
 examples/image_processing/image_editing.cpp        |     4 +-
 examples/image_processing/morphing.cpp             |     2 +-
 examples/lin_algebra/{qr.cpp => svd.cpp}           |    37 +-
 examples/machine_learning/deep_belief_net.cpp      |     6 -
 examples/machine_learning/logistic_regression.cpp  |     6 -
 examples/machine_learning/neural_network.cpp       |     6 -
 examples/machine_learning/perceptron.cpp           |     6 -
 examples/machine_learning/rbm.cpp                  |     6 -
 include/af/algorithm.h                             |   143 +
 include/af/arith.h                                 |    25 +
 include/af/array.h                                 |   273 +-
 include/af/blas.h                                  |    20 +-
 include/af/complex.h                               |    64 +-
 include/af/constants.h                             |     8 +
 include/af/cuda.h                                  |    93 +
 include/af/data.h                                  |   145 +-
 include/af/defines.h                               |    46 +-
 include/af/device.h                                |    34 +-
 include/af/graphics.h                              |    26 +
 include/af/image.h                                 |   338 +-
 include/af/index.h                                 |    60 +-
 include/af/lapack.h                                |    55 +
 include/af/opencl.h                                |    31 +
 include/af/signal.h                                |   540 +-
 include/af/util.h                                  |   261 +-
 include/af/vision.h                                |   273 +
 include/arrayfire.h                                |    15 +-
 src/api/c/assign.cpp                               |    89 +-
 src/api/c/blas.cpp                                 |    16 +-
 src/api/c/colorspace.cpp                           |    60 +-
 src/api/c/complex.cpp                              |     2 +-
 src/api/c/convolve.cpp                             |    22 +-
 src/api/c/convolve_common.hpp                      |    11 +-
 src/api/c/data.cpp                                 |    34 +
 src/api/c/device.cpp                               |    73 +
 src/api/c/diff.cpp                                 |     4 +
 src/api/c/dog.cpp                                  |    70 +
 src/api/c/features.hpp                             |     8 +
 src/api/c/fft.cpp                                  |   204 +-
 src/api/c/fft_common.hpp                           |    78 +
 src/api/c/fftconvolve.cpp                          |   115 +-
 src/api/c/flip.cpp                                 |     2 +
 src/api/c/hamming.cpp                              |    54 +-
 src/api/c/harris.cpp                               |    84 +
 src/api/c/image.cpp                                |    20 +-
 src/api/c/imageio.cpp                              |   336 +-
 src/api/c/index.cpp                                |    24 +-
 src/api/c/join.cpp                                 |     2 +
 src/api/c/median.cpp                               |     5 +-
 src/api/c/nearest_neighbour.cpp                    |    90 +
 src/api/c/ops.hpp                                  |    21 +
 src/api/c/optypes.hpp                              |     2 +
 src/api/c/print.cpp                                |   117 +-
 src/api/c/rank.cpp                                 |    14 +-
 src/api/c/reduce.cpp                               |    79 +-
 src/api/c/replace.cpp                              |   109 +
 src/api/c/resize.cpp                               |     5 +-
 src/api/c/rotate.cpp                               |     5 +-
 src/api/c/sat.cpp                                  |    57 +
 src/api/c/scan.cpp                                 |     2 +
 src/api/c/select.cpp                               |   156 +
 src/api/c/shift.cpp                                |     2 +
 src/api/c/sift.cpp                                 |    90 +
 src/api/c/stream.cpp                               |   360 +
 src/api/c/susan.cpp                                |    79 +
 src/api/c/svd.cpp                                  |   128 +
 src/api/c/tile.cpp                                 |    30 +-
 src/api/c/transform.cpp                            |    20 +-
 src/api/c/transpose.cpp                            |    16 +-
 src/api/c/unary.cpp                                |    51 +-
 src/api/c/unwrap.cpp                               |    64 +
 src/api/c/wrap.cpp                                 |    78 +
 src/api/c/ycbcr_rgb.cpp                            |   160 +
 src/api/cpp/array.cpp                              |    83 +-
 src/api/cpp/complex.cpp                            |    61 +-
 src/api/cpp/constants.cpp                          |     8 +
 src/api/cpp/data.cpp                               |    32 +
 src/api/cpp/{covariance.cpp => dog.cpp}            |     8 +-
 src/api/cpp/fft.cpp                                |   137 +
 src/api/cpp/graphics.cpp                           |     5 +
 src/api/cpp/hamming.cpp                            |     2 +-
 src/api/cpp/{fast.cpp => harris.cpp}               |    12 +-
 src/api/cpp/imageio.cpp                            |    19 +
 src/api/cpp/index.cpp                              |    43 +
 src/api/cpp/lapack.cpp                             |    18 +
 src/api/cpp/{hamming.cpp => nearest_neighbour.cpp} |     9 +-
 src/api/cpp/reduce.cpp                             |    84 +-
 src/api/cpp/{rotate.cpp => sat.cpp}                |     4 +-
 src/api/cpp/seq.cpp                                |     6 +-
 src/api/cpp/{orb.cpp => sift.cpp}                  |    17 +-
 src/api/cpp/{fast.cpp => susan.cpp}                |    10 +-
 src/api/cpp/unary.cpp                              |     1 +
 src/api/cpp/{scan.cpp => unwrap.cpp}               |     7 +-
 src/api/cpp/util.cpp                               |    44 +-
 src/api/cpp/{diff.cpp => wrap.cpp}                 |    20 +-
 src/api/cpp/{regions.cpp => ycbcr_rgb.cpp}         |    11 +-
 src/backend/ArrayInfo.hpp                          |     5 +-
 src/backend/cblas.cpp                              |    74 +-
 src/backend/cpu/Array.cpp                          |    93 +-
 src/backend/cpu/Array.hpp                          |    61 +-
 src/backend/cpu/CMakeLists.txt                     |    57 +-
 src/backend/cpu/TNJ/BinaryNode.hpp                 |    19 +-
 src/backend/cpu/TNJ/BufferNode.hpp                 |    45 +-
 src/backend/cpu/TNJ/Node.hpp                       |    10 +-
 src/backend/cpu/TNJ/ScalarNode.hpp                 |     9 +-
 src/backend/cpu/TNJ/UnaryNode.hpp                  |    15 +-
 src/backend/cpu/blas.cpp                           |   205 +-
 src/backend/cpu/blas.hpp                           |    12 +
 src/backend/cpu/cast.hpp                           |    17 +
 src/backend/cpu/convolve.cpp                       |    12 +-
 src/backend/cpu/copy.cpp                           |     6 +
 src/backend/cpu/copy.hpp                           |     3 +
 src/backend/cpu/diff.cpp                           |     2 +
 src/backend/cpu/fft.cpp                            |   178 +-
 src/backend/cpu/fft.hpp                            |    10 +-
 src/backend/cpu/fftconvolve.cpp                    |    14 +-
 src/backend/cpu/hamming.cpp                        |   103 -
 src/backend/cpu/harris.cpp                         |   265 +
 src/backend/cpu/{fast.hpp => harris.hpp}           |    11 +-
 src/backend/cpu/iir.cpp                            |     4 +-
 src/backend/cpu/iota.cpp                           |     2 +
 src/backend/cpu/ireduce.cpp                        |     4 +
 src/backend/cpu/memory.cpp                         |    46 +-
 src/backend/cpu/memory.hpp                         |     3 +-
 src/backend/cpu/nearest_neighbour.cpp              |   175 +
 .../cpu/{hamming.hpp => nearest_neighbour.hpp}     |     9 +-
 src/backend/cpu/orb.cpp                            |     9 +-
 src/backend/cpu/platform.cpp                       |   177 +-
 src/backend/cpu/random.cpp                         |     9 +-
 src/backend/cpu/range.cpp                          |     2 +
 src/backend/cpu/reduce.cpp                         |    49 +-
 src/backend/cpu/reduce.hpp                         |     6 +-
 src/backend/cpu/resize.cpp                         |    30 +
 src/backend/cpu/rotate.cpp                         |     9 +-
 src/backend/cpu/scan.cpp                           |     2 +
 src/backend/cpu/select.cpp                         |   143 +
 src/backend/cpu/{iota.hpp => select.hpp}           |    10 +-
 src/backend/cpu/shift.cpp                          |     2 +
 src/backend/cpu/sift.cpp                           |    62 +
 src/backend/cpu/{orb.hpp => sift.hpp}              |    15 +-
 src/backend/cpu/sift_nonfree.hpp                   |  1033 ++
 src/backend/cpu/susan.cpp                          |   136 +
 src/backend/cpu/susan.hpp                          |    24 +
 src/backend/cpu/svd.cpp                            |   121 +
 src/backend/cpu/{exampleFunction.hpp => svd.hpp}   |     8 +-
 src/backend/cpu/tile.cpp                           |     2 +
 src/backend/cpu/transform.cpp                      |    16 +-
 src/backend/cpu/transform_interp.hpp               |    36 +
 src/backend/cpu/unary.hpp                          |     7 +
 src/backend/cpu/unwrap.cpp                         |   122 +
 src/backend/cpu/{diff.hpp => unwrap.hpp}           |     7 +-
 src/backend/cpu/where.cpp                          |     3 +-
 src/backend/cpu/wrap.cpp                           |   122 +
 src/backend/cpu/{fftconvolve.hpp => wrap.hpp}      |    12 +-
 src/backend/cuda/Array.cpp                         |    33 +-
 src/backend/cuda/Array.hpp                         |    52 +-
 src/backend/cuda/JIT/cast.cu                       |     2 +
 src/backend/cuda/JIT/exp.cu                        |    15 +
 src/backend/cuda/JIT/hyper.cu                      |     2 +
 src/backend/cuda/JIT/logic.cu                      |     2 +
 src/backend/cuda/JIT/numeric.cu                    |    16 +
 src/backend/cuda/JIT/trig.cu                       |     4 +
 src/backend/cuda/all.cu                            |     2 +
 src/backend/cuda/any.cu                            |     2 +
 src/backend/cuda/blas.cpp                          |   102 +-
 src/backend/cuda/convolve.cpp                      |     4 +-
 src/backend/cuda/copy.cu                           |    14 +-
 src/backend/cuda/copy.hpp                          |     3 +
 src/backend/cuda/count.cu                          |     2 +
 src/backend/cuda/cublasManager.cpp                 |     1 +
 src/backend/cuda/debug_cuda.hpp                    |    38 +-
 src/backend/cuda/diff.cu                           |     2 +
 src/backend/cuda/err_cuda.hpp                      |    33 +-
 src/backend/cuda/fft.cpp                           |   155 +-
 src/backend/cuda/fft.hpp                           |    12 +-
 src/backend/cuda/fftconvolve.cu                    |    14 +-
 src/backend/cuda/hamming.cu                        |    62 -
 src/backend/cuda/harris.cu                         |    59 +
 src/backend/cuda/{fast.hpp => harris.hpp}          |    10 +-
 src/backend/cuda/hist_graphics.cu                  |     3 +-
 src/backend/cuda/iir.cu                            |     4 +-
 src/backend/cuda/image.cu                          |     3 +-
 src/backend/cuda/iota.cu                           |     2 +
 src/backend/cuda/ireduce.cu                        |     4 +
 src/backend/cuda/jit.cpp                           |    68 +-
 src/backend/cuda/kernel/approx.hpp                 |     8 +-
 src/backend/cuda/kernel/assign.hpp                 |     2 +-
 src/backend/cuda/kernel/atomics.hpp                |    59 +
 src/backend/cuda/kernel/bilateral.hpp              |    31 +-
 src/backend/cuda/kernel/convolve.cu                |    37 +-
 src/backend/cuda/kernel/convolve_separable.cu      |     5 +-
 src/backend/cuda/kernel/diagonal.hpp               |     4 +-
 src/backend/cuda/kernel/diff.hpp                   |     4 +-
 src/backend/cuda/kernel/exampleFunction.hpp        |    11 +-
 src/backend/cuda/kernel/fast.hpp                   |    94 +-
 src/backend/cuda/kernel/fftconvolve.hpp            |    58 +-
 src/backend/cuda/kernel/gradient.hpp               |     3 +-
 src/backend/cuda/kernel/harris.hpp                 |   382 +
 src/backend/cuda/kernel/histogram.hpp              |     5 +-
 src/backend/cuda/kernel/hsv_rgb.hpp                |     2 +-
 src/backend/cuda/kernel/identity.hpp               |     2 +-
 src/backend/cuda/kernel/iir.hpp                    |     2 +-
 src/backend/cuda/kernel/index.hpp                  |     2 +-
 src/backend/cuda/kernel/iota.hpp                   |     5 +-
 src/backend/cuda/kernel/ireduce.hpp                |    18 +-
 src/backend/cuda/kernel/join.hpp                   |     6 +-
 src/backend/cuda/kernel/lookup.hpp                 |     4 +-
 src/backend/cuda/kernel/lu_split.hpp               |     4 +-
 src/backend/cuda/kernel/match_template.hpp         |     3 +-
 src/backend/cuda/kernel/meanshift.hpp              |    40 +-
 src/backend/cuda/kernel/medfilt.hpp                |    49 +-
 src/backend/cuda/kernel/memcopy.hpp                |    12 +-
 src/backend/cuda/kernel/morph.hpp                  |   157 +-
 .../kernel/{hamming.hpp => nearest_neighbour.hpp}  |   194 +-
 src/backend/cuda/kernel/orb.hpp                    |    84 +-
 src/backend/cuda/kernel/random.hpp                 |     8 +-
 src/backend/cuda/kernel/range.hpp                  |     2 +-
 src/backend/cuda/kernel/reduce.hpp                 |   106 +-
 src/backend/cuda/kernel/regions.hpp                |    25 +-
 src/backend/cuda/kernel/reorder.hpp                |     5 +-
 src/backend/cuda/kernel/resize.hpp                 |    28 +-
 src/backend/cuda/kernel/rotate.hpp                 |     6 +-
 src/backend/cuda/kernel/scan_dim.hpp               |    10 +-
 src/backend/cuda/kernel/scan_first.hpp             |    11 +-
 src/backend/cuda/kernel/select.hpp                 |   155 +
 src/backend/cuda/kernel/shift.hpp                  |     5 +-
 src/backend/cuda/kernel/sift_nonfree.hpp           |  1370 +++
 src/backend/cuda/kernel/sobel.hpp                  |    36 +-
 src/backend/cuda/kernel/sort.hpp                   |     5 +-
 src/backend/cuda/kernel/sort_by_key.hpp            |     6 +-
 src/backend/cuda/kernel/sort_index.hpp             |    12 +-
 src/backend/cuda/kernel/susan.hpp                  |   180 +
 src/backend/cuda/kernel/tile.hpp                   |     2 +-
 src/backend/cuda/kernel/transform.hpp              |    15 +-
 src/backend/cuda/kernel/transform_interp.hpp       |    42 +-
 src/backend/cuda/kernel/transpose.hpp              |     4 +-
 src/backend/cuda/kernel/transpose_inplace.hpp      |     4 +-
 src/backend/cuda/kernel/triangle.hpp               |     3 +-
 src/backend/cuda/kernel/unwrap.hpp                 |   144 +
 src/backend/cuda/kernel/where.hpp                  |     3 +-
 src/backend/cuda/kernel/wrap.hpp                   |   113 +
 src/backend/cuda/math.hpp                          |     3 +
 src/backend/cuda/max.cu                            |     2 +
 src/backend/cuda/memory.cpp                        |    62 +-
 src/backend/cuda/memory.hpp                        |     3 +-
 src/backend/cuda/min.cu                            |     2 +
 src/backend/cuda/morph3d_impl.hpp                  |     5 +-
 src/backend/cuda/morph_impl.hpp                    |     5 +-
 src/backend/cuda/nearest_neighbour.cu              |    79 +
 .../cuda/{hamming.hpp => nearest_neighbour.hpp}    |     9 +-
 src/backend/cuda/platform.cpp                      |    25 +-
 src/backend/cuda/platform.hpp                      |     5 +
 src/backend/cuda/plot.cu                           |     3 +-
 src/backend/cuda/product.cu                        |     2 +
 src/backend/cuda/random.cu                         |     2 +
 src/backend/cuda/range.cu                          |     2 +
 src/backend/cuda/reduce.hpp                        |     6 +-
 src/backend/cuda/reduce_impl.hpp                   |    13 +-
 src/backend/cuda/resize.cu                         |     3 +
 src/backend/cuda/rotate.cu                         |     9 +-
 src/backend/cuda/scan.cu                           |     2 +
 src/backend/cuda/select.cu                         |    51 +
 src/backend/cuda/{iota.hpp => select.hpp}          |    11 +-
 src/backend/cuda/set.cu                            |    17 +-
 src/backend/cuda/shift.cu                          |     2 +
 src/backend/cuda/sift.cu                           |    89 +
 src/backend/cuda/{orb.hpp => sift.hpp}             |    15 +-
 src/backend/cuda/sum.cu                            |     2 +
 src/backend/cuda/susan.cu                          |    67 +
 src/backend/cuda/susan.hpp                         |    24 +
 src/backend/cuda/svd.cu                            |   155 +
 src/backend/cuda/{cholesky.hpp => svd.hpp}         |     8 +-
 src/backend/cuda/tile.cu                           |     2 +
 src/backend/cuda/transform.cu                      |    12 +-
 src/backend/cuda/unary.hpp                         |     1 +
 src/backend/cuda/unwrap.cu                         |    57 +
 src/backend/cuda/{identity.hpp => unwrap.hpp}      |     4 +-
 src/backend/cuda/wrap.cu                           |    57 +
 src/backend/cuda/{exampleFunction.hpp => wrap.hpp} |     9 +-
 src/backend/dim4.cpp                               |    25 +-
 src/backend/lapacke.cpp                            |   168 +
 src/backend/lapacke.hpp                            |    93 +-
 src/backend/opencl/Array.cpp                       |    33 +-
 src/backend/opencl/Array.hpp                       |    51 +-
 src/backend/opencl/CMakeLists.txt                  |   111 +-
 src/backend/opencl/all.cpp                         |     2 +
 src/backend/opencl/any.cpp                         |     2 +
 src/backend/opencl/blas.cpp                        |    86 +-
 src/backend/opencl/{fftconvolve.hpp => cache.hpp}  |    16 +-
 src/backend/opencl/cl.hpp                          | 10803 ++++++++++---------
 src/backend/opencl/convolve.cpp                    |     4 +-
 src/backend/opencl/convolve_separable.cpp          |    44 +-
 src/backend/opencl/copy.cpp                        |     7 +
 src/backend/opencl/copy.hpp                        |     3 +
 src/backend/opencl/count.cpp                       |     2 +
 src/backend/opencl/diff.cpp                        |     2 +
 src/backend/opencl/err_opencl.hpp                  |    22 +-
 src/backend/opencl/fft.cpp                         |   225 +-
 src/backend/opencl/fft.hpp                         |    12 +-
 src/backend/opencl/fftconvolve.cpp                 |    12 +-
 src/backend/opencl/hamming.cpp                     |   143 -
 src/backend/opencl/harris.cpp                      |    57 +
 src/backend/opencl/{fast.hpp => harris.hpp}        |    10 +-
 src/backend/opencl/iir.cpp                         |     4 +-
 src/backend/opencl/iota.cpp                        |     2 +
 src/backend/opencl/ireduce.cpp                     |     4 +
 src/backend/opencl/jit.cpp                         |    48 +-
 src/backend/opencl/kernel/approx.hpp               |    11 +-
 src/backend/opencl/kernel/approx1.cl               |     9 +-
 src/backend/opencl/kernel/approx2.cl               |    10 +-
 src/backend/opencl/kernel/bilateral.cl             |    32 +-
 src/backend/opencl/kernel/convolve.hpp             |     6 +-
 src/backend/opencl/kernel/convolve/conv2_impl.hpp  |   117 +-
 src/backend/opencl/kernel/convolve_separable.cpp   |   131 +
 src/backend/opencl/kernel/convolve_separable.hpp   |    79 +-
 src/backend/opencl/kernel/fast.hpp                 |    98 +-
 src/backend/opencl/kernel/fftconvolve.hpp          |    24 +-
 src/backend/opencl/kernel/fftconvolve_multiply.cl  |     6 +-
 src/backend/opencl/kernel/fftconvolve_reorder.cl   |    15 +-
 src/backend/opencl/kernel/gradient.cl              |     3 +-
 src/backend/opencl/kernel/gradient.hpp             |     7 +-
 src/backend/opencl/kernel/harris.cl                |   117 +
 src/backend/opencl/kernel/harris.hpp               |   343 +
 src/backend/opencl/kernel/ireduce.hpp              |   183 +-
 src/backend/opencl/kernel/jit.cl                   |     1 +
 src/backend/opencl/kernel/matchTemplate.cl         |     2 +-
 src/backend/opencl/kernel/meanshift.cl             |    36 +-
 src/backend/opencl/kernel/medfilt.cl               |    37 +-
 src/backend/opencl/kernel/morph.cl                 |   128 +-
 .../kernel/{hamming.cl => nearest_neighbour.cl}    |    66 +-
 .../kernel/{hamming.hpp => nearest_neighbour.hpp}  |    98 +-
 src/backend/opencl/kernel/ops.cl                   |     8 +-
 src/backend/opencl/kernel/orb.hpp                  |    12 +-
 src/backend/opencl/kernel/random.cl                |    45 +-
 src/backend/opencl/kernel/random.hpp               |     4 +
 src/backend/opencl/kernel/reduce.hpp               |   250 +-
 src/backend/opencl/kernel/reduce_dim.cl            |     4 +-
 src/backend/opencl/kernel/reduce_first.cl          |     4 +-
 src/backend/opencl/kernel/regions.hpp              |     5 +
 src/backend/opencl/kernel/resize.cl                |    21 +
 src/backend/opencl/kernel/resize.hpp               |     3 +-
 src/backend/opencl/kernel/rotate.cl                |     1 +
 src/backend/opencl/kernel/rotate.hpp               |    11 +-
 src/backend/opencl/kernel/scan_dim.hpp             |   207 +-
 src/backend/opencl/kernel/scan_first.hpp           |   196 +-
 src/backend/opencl/kernel/select.cl                |    90 +
 src/backend/opencl/kernel/select.hpp               |   175 +
 src/backend/opencl/kernel/set.cl                   |    20 -
 src/backend/opencl/kernel/set.hpp                  |    67 -
 src/backend/opencl/kernel/sift_nonfree.cl          |   806 ++
 src/backend/opencl/kernel/sift_nonfree.hpp         |   784 ++
 src/backend/opencl/kernel/sobel.cl                 |    30 +-
 src/backend/opencl/kernel/sort.hpp                 |     6 +
 src/backend/opencl/kernel/sort_by_key.hpp          |     6 +
 src/backend/opencl/kernel/sort_index.hpp           |    10 +-
 src/backend/opencl/kernel/susan.cl                 |   113 +
 src/backend/opencl/kernel/susan.hpp                |   143 +
 src/backend/opencl/kernel/transform.cl             |     1 +
 src/backend/opencl/kernel/transform.hpp            |    13 +-
 src/backend/opencl/kernel/transform_interp.cl      |    40 +-
 src/backend/opencl/kernel/unwrap.cl                |    77 +
 src/backend/opencl/kernel/unwrap.hpp               |   117 +
 src/backend/opencl/kernel/where.hpp                |     7 +-
 src/backend/opencl/kernel/wrap.cl                  |    74 +
 src/backend/opencl/kernel/wrap.hpp                 |   112 +
 src/backend/opencl/magma/gebrd.cpp                 |   368 +
 src/backend/opencl/magma/geqrf2.cpp                |    19 +-
 src/backend/opencl/magma/geqrf3.cpp                |    19 +-
 src/backend/opencl/magma/getrf.cpp                 |   130 +-
 src/backend/opencl/magma/getrs.cpp                 |    30 +-
 src/backend/opencl/magma/labrd.cpp                 |   668 ++
 src/backend/opencl/magma/larfb.cpp                 |   125 +-
 src/backend/opencl/magma/laset.cpp                 |    44 +
 src/backend/opencl/magma/laset_band.cpp            |    44 +
 src/backend/opencl/magma/laswp.cpp                 |    44 +
 src/backend/opencl/magma/magma.h                   |    21 +
 src/backend/opencl/magma/magma_blas.h              |    57 +-
 src/backend/opencl/magma/magma_cpu_blas.h          |    91 +
 src/backend/opencl/magma/magma_cpu_lapack.h        |   143 +-
 src/backend/opencl/magma/magma_helper.cpp          |    20 +
 src/backend/opencl/magma/magma_helper.h            |     3 +
 src/backend/opencl/magma/potrf.cpp                 |   130 +-
 src/backend/opencl/magma/transpose.cpp             |    44 +
 src/backend/opencl/magma/transpose_inplace.cpp     |    44 +
 src/backend/opencl/magma/ungqr.cpp                 |    11 +-
 src/backend/opencl/magma/unmqr.cpp                 |    11 +-
 src/backend/opencl/magma/unmqr2.cpp                |    10 +-
 src/backend/opencl/max.cpp                         |     2 +
 src/backend/opencl/memory.cpp                      |    46 +-
 src/backend/opencl/memory.hpp                      |     4 +-
 src/backend/opencl/min.cpp                         |     2 +
 src/backend/opencl/nearest_neighbour.cpp           |   115 +
 .../opencl/{hamming.hpp => nearest_neighbour.hpp}  |     9 +-
 src/backend/opencl/platform.cpp                    |    33 +-
 src/backend/opencl/platform.hpp                    |     1 +
 src/backend/opencl/product.cpp                     |     2 +
 src/backend/opencl/random.cpp                      |     2 +
 src/backend/opencl/range.cpp                       |     2 +
 src/backend/opencl/reduce.hpp                      |     5 +-
 src/backend/opencl/reduce_impl.hpp                 |    13 +-
 src/backend/opencl/resize.cpp                      |     3 +
 src/backend/opencl/rotate.cpp                      |    12 +-
 src/backend/opencl/scan.cpp                        |    14 +-
 src/backend/opencl/select.cpp                      |    52 +
 src/backend/opencl/{iota.hpp => select.hpp}        |    11 +-
 src/backend/opencl/set.cpp                         |     6 +
 src/backend/opencl/shift.cpp                       |     2 +
 src/backend/opencl/sift.cpp                        |    82 +
 src/backend/opencl/{orb.hpp => sift.hpp}           |    15 +-
 src/backend/opencl/solve.cpp                       |    80 +-
 .../convolve/conv2_s32.cpp => sort_by_key/b8.cpp}  |    12 +-
 .../convolve/conv2_s32.cpp => sort_by_key/f32.cpp} |    12 +-
 .../convolve/conv2_s32.cpp => sort_by_key/f64.cpp} |    12 +-
 .../{sort_by_key.cpp => sort_by_key/impl.hpp}      |    34 +-
 .../convolve/conv2_s32.cpp => sort_by_key/s32.cpp} |    12 +-
 .../convolve/conv2_s32.cpp => sort_by_key/u32.cpp} |    12 +-
 .../convolve/conv2_s32.cpp => sort_by_key/u8.cpp}  |    12 +-
 src/backend/opencl/sum.cpp                         |     2 +
 src/backend/opencl/susan.cpp                       |    76 +
 src/backend/opencl/susan.hpp                       |    24 +
 src/backend/opencl/svd.cpp                         |   244 +
 src/backend/opencl/{assign.hpp => svd.hpp}         |     7 +-
 src/backend/opencl/tile.cpp                        |     2 +
 src/backend/opencl/traits.hpp                      |    10 -
 src/backend/opencl/transform.cpp                   |    17 +-
 src/backend/opencl/unary.hpp                       |     1 +
 src/backend/opencl/unwrap.cpp                      |    56 +
 src/backend/opencl/{identity.hpp => unwrap.hpp}    |     4 +-
 src/backend/opencl/wrap.cpp                        |    57 +
 src/backend/opencl/{fftconvolve.hpp => wrap.hpp}   |    11 +-
 test/CMakeLists.txt                                |     7 +-
 test/array.cpp                                     |    24 +
 test/assign.cpp                                    |   193 +-
 test/binary.cpp                                    |    49 +-
 test/complex.cpp                                   |    18 +-
 test/data                                          |     2 +-
 test/diagonal.cpp                                  |     8 +
 test/diff1.cpp                                     |     2 +-
 test/diff2.cpp                                     |     2 +-
 test/dog.cpp                                       |    76 +
 test/dot.cpp                                       |   187 +
 test/fast.cpp                                      |   105 +-
 test/fft.cpp                                       |   106 +-
 test/fft_real.cpp                                  |   120 +
 test/fftconvolve.cpp                               |    39 +
 test/getting_started.cpp                           |     8 +-
 test/gfor.cpp                                      |   247 +
 test/{fast.cpp => harris.cpp}                      |   152 +-
 test/imageio.cpp                                   |    64 +
 test/index.cpp                                     |   118 +
 test/iota.cpp                                      |     2 +-
 test/ireduce.cpp                                   |    45 +
 test/manual_memory_test.cpp                        |    69 +
 test/math.cpp                                      |    22 +
 test/memory.cpp                                    |   261 +-
 test/memory_lock.cpp                               |    95 +
 test/nearest_neighbour.cpp                         |   218 +
 test/random.cpp                                    |    84 +-
 test/range.cpp                                     |     2 +-
 test/rank_dense.cpp                                |    88 +
 test/reduce.cpp                                    |   247 +-
 test/replace.cpp                                   |   120 +
 test/sat.cpp                                       |    45 +
 test/scan.cpp                                      |     6 +-
 test/select.cpp                                    |   128 +
 test/shift.cpp                                     |     2 +-
 test/{orb.cpp => sift_nonfree.cpp}                 |   136 +-
 test/susan.cpp                                     |   181 +
 test/svd_dense.cpp                                 |    99 +
 test/tile.cpp                                      |     2 +-
 test/transpose.cpp                                 |    18 +-
 test/unwrap.cpp                                    |   177 +
 test/wrap.cpp                                      |   179 +
 test/{hsv_rgb.cpp => ycbcr_rgb.cpp}                |    14 +-
 510 files changed, 30587 insertions(+), 9932 deletions(-)
 create mode 100644 LICENSES/OpenSIFT License.txt
 create mode 100644 docs/details/index.dox
 create mode 100644 docs/details/util.dox
 delete mode 100644 docs/highlight.pack.js
 delete mode 100644 docs/highlight_js_doxygen.css
 create mode 100644 examples/computer_vision/matching.cpp
 copy examples/computer_vision/{fast.cpp => susan.cpp} (73%)
 create mode 100644 examples/financial/heston_model.cpp
 copy examples/lin_algebra/{qr.cpp => svd.cpp} (61%)
 create mode 100644 include/af/cuda.h
 create mode 100644 src/api/c/dog.cpp
 create mode 100644 src/api/c/fft_common.hpp
 create mode 100644 src/api/c/harris.cpp
 create mode 100644 src/api/c/nearest_neighbour.cpp
 create mode 100644 src/api/c/replace.cpp
 create mode 100644 src/api/c/sat.cpp
 create mode 100644 src/api/c/select.cpp
 create mode 100644 src/api/c/sift.cpp
 create mode 100644 src/api/c/stream.cpp
 create mode 100644 src/api/c/susan.cpp
 create mode 100644 src/api/c/svd.cpp
 create mode 100644 src/api/c/unwrap.cpp
 create mode 100644 src/api/c/wrap.cpp
 create mode 100644 src/api/c/ycbcr_rgb.cpp
 copy src/api/cpp/{covariance.cpp => dog.cpp} (69%)
 copy src/api/cpp/{fast.cpp => harris.cpp} (56%)
 copy src/api/cpp/{hamming.cpp => nearest_neighbour.cpp} (60%)
 copy src/api/cpp/{rotate.cpp => sat.cpp} (75%)
 copy src/api/cpp/{orb.cpp => sift.cpp} (50%)
 copy src/api/cpp/{fast.cpp => susan.cpp} (58%)
 copy src/api/cpp/{scan.cpp => unwrap.cpp} (62%)
 copy src/api/cpp/{diff.cpp => wrap.cpp} (51%)
 copy src/api/cpp/{regions.cpp => ycbcr_rgb.cpp} (63%)
 delete mode 100644 src/backend/cpu/hamming.cpp
 create mode 100644 src/backend/cpu/harris.cpp
 copy src/backend/cpu/{fast.hpp => harris.hpp} (54%)
 create mode 100644 src/backend/cpu/nearest_neighbour.cpp
 rename src/backend/cpu/{hamming.hpp => nearest_neighbour.hpp} (56%)
 create mode 100644 src/backend/cpu/select.cpp
 copy src/backend/cpu/{iota.hpp => select.hpp} (58%)
 create mode 100644 src/backend/cpu/sift.cpp
 copy src/backend/cpu/{orb.hpp => sift.hpp} (50%)
 create mode 100644 src/backend/cpu/sift_nonfree.hpp
 create mode 100644 src/backend/cpu/susan.cpp
 create mode 100644 src/backend/cpu/susan.hpp
 create mode 100644 src/backend/cpu/svd.cpp
 copy src/backend/cpu/{exampleFunction.hpp => svd.hpp} (63%)
 create mode 100644 src/backend/cpu/unwrap.cpp
 copy src/backend/cpu/{diff.hpp => unwrap.hpp} (68%)
 create mode 100644 src/backend/cpu/wrap.cpp
 copy src/backend/cpu/{fftconvolve.hpp => wrap.hpp} (55%)
 delete mode 100644 src/backend/cuda/hamming.cu
 create mode 100644 src/backend/cuda/harris.cu
 copy src/backend/cuda/{fast.hpp => harris.hpp} (54%)
 create mode 100644 src/backend/cuda/kernel/atomics.hpp
 create mode 100644 src/backend/cuda/kernel/harris.hpp
 rename src/backend/cuda/kernel/{hamming.hpp => nearest_neighbour.hpp} (67%)
 create mode 100644 src/backend/cuda/kernel/select.hpp
 create mode 100644 src/backend/cuda/kernel/sift_nonfree.hpp
 create mode 100644 src/backend/cuda/kernel/susan.hpp
 create mode 100644 src/backend/cuda/kernel/unwrap.hpp
 create mode 100644 src/backend/cuda/kernel/wrap.hpp
 create mode 100644 src/backend/cuda/nearest_neighbour.cu
 rename src/backend/cuda/{hamming.hpp => nearest_neighbour.hpp} (57%)
 create mode 100644 src/backend/cuda/select.cu
 copy src/backend/cuda/{iota.hpp => select.hpp} (58%)
 create mode 100644 src/backend/cuda/sift.cu
 copy src/backend/cuda/{orb.hpp => sift.hpp} (50%)
 create mode 100644 src/backend/cuda/susan.cu
 create mode 100644 src/backend/cuda/susan.hpp
 create mode 100644 src/backend/cuda/svd.cu
 copy src/backend/cuda/{cholesky.hpp => svd.hpp} (63%)
 create mode 100644 src/backend/cuda/unwrap.cu
 copy src/backend/cuda/{identity.hpp => unwrap.hpp} (69%)
 create mode 100644 src/backend/cuda/wrap.cu
 copy src/backend/cuda/{exampleFunction.hpp => wrap.hpp} (53%)
 copy src/backend/opencl/{fftconvolve.hpp => cache.hpp} (54%)
 delete mode 100644 src/backend/opencl/hamming.cpp
 create mode 100644 src/backend/opencl/harris.cpp
 copy src/backend/opencl/{fast.hpp => harris.hpp} (54%)
 create mode 100644 src/backend/opencl/kernel/convolve_separable.cpp
 create mode 100644 src/backend/opencl/kernel/harris.cl
 create mode 100644 src/backend/opencl/kernel/harris.hpp
 rename src/backend/opencl/kernel/{hamming.cl => nearest_neighbour.cl} (89%)
 rename src/backend/opencl/kernel/{hamming.hpp => nearest_neighbour.hpp} (55%)
 create mode 100644 src/backend/opencl/kernel/select.cl
 create mode 100644 src/backend/opencl/kernel/select.hpp
 delete mode 100644 src/backend/opencl/kernel/set.cl
 delete mode 100644 src/backend/opencl/kernel/set.hpp
 create mode 100644 src/backend/opencl/kernel/sift_nonfree.cl
 create mode 100644 src/backend/opencl/kernel/sift_nonfree.hpp
 create mode 100644 src/backend/opencl/kernel/susan.cl
 create mode 100644 src/backend/opencl/kernel/susan.hpp
 create mode 100644 src/backend/opencl/kernel/unwrap.cl
 create mode 100644 src/backend/opencl/kernel/unwrap.hpp
 create mode 100644 src/backend/opencl/kernel/wrap.cl
 create mode 100644 src/backend/opencl/kernel/wrap.hpp
 create mode 100644 src/backend/opencl/magma/gebrd.cpp
 create mode 100644 src/backend/opencl/magma/labrd.cpp
 create mode 100644 src/backend/opencl/magma/magma_cpu_blas.h
 create mode 100644 src/backend/opencl/nearest_neighbour.cpp
 rename src/backend/opencl/{hamming.hpp => nearest_neighbour.hpp} (58%)
 create mode 100644 src/backend/opencl/select.cpp
 copy src/backend/opencl/{iota.hpp => select.hpp} (59%)
 create mode 100644 src/backend/opencl/sift.cpp
 copy src/backend/opencl/{orb.hpp => sift.hpp} (50%)
 copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/b8.cpp} (79%)
 copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/f32.cpp} (79%)
 copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/f64.cpp} (79%)
 rename src/backend/opencl/{sort_by_key.cpp => sort_by_key/impl.hpp} (57%)
 copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/s32.cpp} (79%)
 copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/u32.cpp} (79%)
 copy src/backend/opencl/{kernel/convolve/conv2_s32.cpp => sort_by_key/u8.cpp} (79%)
 create mode 100644 src/backend/opencl/susan.cpp
 create mode 100644 src/backend/opencl/susan.hpp
 create mode 100644 src/backend/opencl/svd.cpp
 copy src/backend/opencl/{assign.hpp => svd.hpp} (62%)
 create mode 100644 src/backend/opencl/unwrap.cpp
 copy src/backend/opencl/{identity.hpp => unwrap.hpp} (69%)
 create mode 100644 src/backend/opencl/wrap.cpp
 copy src/backend/opencl/{fftconvolve.hpp => wrap.hpp} (55%)
 create mode 100644 test/dog.cpp
 create mode 100644 test/dot.cpp
 create mode 100644 test/fft_real.cpp
 copy test/{fast.cpp => harris.cpp} (56%)
 create mode 100644 test/manual_memory_test.cpp
 create mode 100644 test/memory_lock.cpp
 create mode 100644 test/nearest_neighbour.cpp
 create mode 100644 test/rank_dense.cpp
 create mode 100644 test/replace.cpp
 create mode 100644 test/sat.cpp
 create mode 100644 test/select.cpp
 copy test/{orb.cpp => sift_nonfree.cpp} (66%)
 create mode 100644 test/susan.cpp
 create mode 100644 test/svd_dense.cpp
 create mode 100644 test/unwrap.cpp
 create mode 100644 test/wrap.cpp
 copy test/{hsv_rgb.cpp => ycbcr_rgb.cpp} (83%)

-- 
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/debian-science/packages/arrayfire.git


