[arrayfire] branch master updated (7507b61 -> 98ff1c0)

Ghislain Vaillant ghisvail-guest at moszumanska.debian.org
Sun Feb 7 18:59:44 UTC 2016


This is an automated email from the git hooks/post-receive script.

ghisvail-guest pushed a change to branch master
in repository arrayfire.

      from  7507b61   Merge pull request #1200 from arrayfire/hotfixes-3.2.2
       new  be0ff22   Setup async project; Setup basic queues;
       new  b94c3df   Convert CPU blas to use async queues
       new  3188bdf   Async CPU approx1 and approx2
       new  f797314   Async CPU Assign
       new  c8ecdb9   Async CPU Bilateral
       new  759b506   Async CPU Convolve
       new  c399e75   Async CPU diff1 and diff2
       new  80903d0   Avoid sending references to queued lambdas
       new  96c5602   Async CPU Copy, Assign, and Index
       new  46beebc   Async CPU diagonal
       new  b7c83e8   Async FFT for the CPU backend
       new  413eea8   Add eval to copyData
       new  49f0cce   Async random on CPU backend
       new  fada883   Async where on the CPU backe
       new  1a0802f   Async CPU reduce and ireduce
       new  1842bcf   Async CPU Transpose. Fix bug in eval
       new  5d428df   Merge branch 'devel' into async
       new  4a8e723   Merge additional changes for async CPU.
       new  91f7a1f   async cpu::index function
       new  2796770   Fixed asynchronous indexing & assignment in cpu backend
       new  0aeed42   converted cpu tile to asychronous call
       new  330ae1c   converted sort_index cpu function to asynchronous call
       new  5eea071   converted triangle fn in cpu backend to async call
       new  551433e   converted lu & cholesky decomposition functions to async calls
       new  ed6d26d   svd cpu backend is async now
       new  e0d7c12   converted qr & solve cpu functions to async calls
       new  d0223f9   Converted wrap & unwrap cpu fns to async calls
       new  32a65d8   converted transform to async call
       new  743fb4a   converted susan fn in cpu backend to asynchronous call
       new  840af46   convert sort & sort_by_key cpu fns to async calls
       new  e005557   sobel cpu fn is async fn after this change
       new  14e9d31   Converted resize & shift cpu fns to async calls
       new  3c2bc65   convert select & rotate cpu fns to async calls
       new  258d573   Convert morph & range cpu fns to async calls
       new  9510fcb   Converted cpu scan function to async call
       new  6fc636f   fix for async sift cpu function
       new  48a9e58   converted matchTemplate, meanShift & medfilt to async calls
       new  b813fd4   nearest neighbour cpu func is asyn call now
       new  35a462c   conversion of listed functions to async calls
       new  e5ab671   converted join cpu func to async call
       new  0c72451   converted cpu regions function to asynchronous call
       new  ed730cf   adds scatter function
       new  0a78b60   update plot2d example to include scatter plot
       new  4ec314a   cleanup and scatter example update
       new  6f52c36   adds scatter3
       new  6515414   Merge pull request #1116 from syurkevi/forge_scatter
       new  b267ffd   Increment version for devel to 3.3
       new  65c7a23   Fixes for scatter
       new  323aa77   Merge pull request #1133 from shehzan10/devel
       new  53de790   Removed dead code from opencl::DeviceManager class
       new  ce2d6a6   Enables use of ArrayFire with external context & command queue
       new  f65ee89   cpp wrappers for opencl external context related fns
       new  2bcc6de   unit tests for afcl::{pushDevice, setDevice, popDevice} fns
       new  964b1ea   Style changes in opencl header
       new  d41839f   api name change for afcl external context functionality
       new  227377d   Added OpenCL include dir for unit tests
       new  67ef051   additional style changes
       new  72b71e8   Merge pull request #1140 from 9prady9/ext_ocl_context
       new  b9e113b   Merge remote-tracking branch 'upstream/master' into devel
       new  365dc94   DOC resolve markerType enum in graphics
       new  b878711   Remove unused variable warning in homography cuda kernel
       new  ebe6cac   Merge branch 'devel' into async
       new  df2c091   Converted iir, fir, fftconvolve to async calls
       new  c06f24d   Change to gfx to handle Arrays created by async calls
       new  21f74eb   Fixed harris & homography cpu fns to work with async fns
       new  2217014   Fix in Array::device method
       new  919333e   Fix for getDeviceMemInfo function in cpu
       new  7dac34a   Fixes for asynchronous cpu copy && set functions
       new  6b9c157   FEAT added allocHost and freeHost functions
       new  3c9d69d   FEAT added infoString function to return info as string
       new  e73ee6d   Merge remote-tracking branch 'upstream/hotfixes-3.2.2' into devel
       new  1de97de   Using af_alloc_host when allocating user-return string
       new  f628fbe   toString now uses af_alloc_host to allocate memory
       new  7206028   Add overload of toString that returns a string
       new  06d4bef   FEAT add af_get_revision to get commit instead of AF_REVISION
       new  b3c28b6   Using c/version.cpp in unified
       new  34f3569   Merge pull request #1163 from shehzan10/devel
       new  ab11b6b   Merge branch 'devel' into async
       new  3ba9633   Added missing eval for input Array's in cpu backend fns
       new  8cc9c9c   threads library is now a submodule in cpu backend
       new  abce1e8   Moved fns enqueued onto async queue to separate folder
       new  5f2f155   Removed obselete fn of evalArray from all backends
       new  e651cad   cpu::Array::eval queue work moved to kerenel namespace
       new  3cddae2   moved assign cpu async fn to kernel space
       new  0e563ba   moved kernel namespace in cpu backend inside cpu namespace
       new  d03bb75   moved bilateral, convolve, fftconvolve to cpu kernel namespace
       new  e8f0242   moved copy queue fns from cpu backend to kernel namespace
       new  913850d   Moved diagonal cpu implementation to kernel namespace
       new  71298c6   moved diff, fast, gradient, harris, histogram to kernel namespace
       new  cc7d665   moved rgb_hsv & identity fns to kernel namespace
       new  696657c   moved indexing & assignment fns to kernel namespace
       new  f2b84dd   template parameters style fixes in cpu kernel namespace fns
       new  bb0a22c   Moved more cpu fns implementations to kernel namespace
       new  7d7f32f   moved the left over fns to cpu kernel namespace
       new  1313f98   Fixed the bug in cpu ireduce kernel function
       new  b684b06   Fixed orb async cpu fn
       new  a0f17b6   cmake fix to check for threads submodule
       new  c539f1d   moved fft cpu fns implementations to kernel namespace
       new  4831215   moved dot cpu implementation to kernel namespace
       new  d1089f8   moved fftconvolve reorder helper fn to kernel namespace
       new  4f8b3fa   fixed cpu::kernel::dot fn signature
       new  95d9346   Added ENQUEUE macro in cpu backend
       new  7dad2ef   Removed obsolete queue sync in cpu::padArray fn
       new  90611a2   Fixed cmake condition for threads submodule check
       new  1cbffbb   Synchronize when AF_SYNCHRONOUS_CALLS is set to 1
       new  6058dd2   Revert "Added ENQUEUE macro in cpu backend"
       new  edda52a   Update README.md with updated status badges
       new  29cfed5   Merge pull request #1178 from shehzan10/readme-badges
       new  cad4c2c   initial gravity example
       new  5507717   add collisions, split vectors into components for performance
       new  5c01608   remove state globals
       new  b1aa875   remove windows pause ifdef
       new  c62eea0   Merge pull request #1152 from syurkevi/gravity_example
       new  5ecdc54   Added API support for perspective transform
       new  2a43871   Added perspective transform to CPU backend
       new  7fdfe3e   Added perspective transform to CUDA backend
       new  15b9ad6   Added perspective transform to OpenCL backend
       new  81dca06   Updated transform documentation
       new  8e4e766   Added perspective transform unit tests
       new  7327fb2   Updated test data
       new  27aeed0   Fixed wrong data type in OpenCL transform
       new  bdc31d0   Simplified test for perspective transform in API
       new  951533c   Merge pull request #1190 from pentschev/perspective_transform
       new  2ae8273   Merge remote-tracking branch 'arrayfire/master' into devel
       new  f0b9691   Merge branch 'devel' into async
       new  861808e   Merge pull request #1174 from umar456/async
       new  c2d7e42   Fix clang warnings (std::abs, pragma ignores)
       new  fa765a6   Merge pull request #1208 from shehzan10/clang-warnings
       new  1dd2195   Cleanup util.cpp
       new  e19a6be   Using getEnvVar instead of getenv
       new  b260abf   Cleanup/improve backend test
       new  de4851d   Not building info for unified. backend does the same as info
       new  4d06c74   FEAT Added isImageIOAvailable function to check support
       new  1b85d6d   FEAT Added isLAPACKAvailable function to check support
       new  7747ee6   Use isImageIOAvailable in testHelper
       new  fe3fa66   Use isLAPACKAvailable in testHelper
       new  b89ab5d   Add missing af_err to string
       new  8813a2e   af_get_last_error supports NULL as valid argument for len
       new  b7af25a   Improvements to af::exception messages
       new  7b6eee1   Add version guards around allocHost and freeHost
       new  9605740   Deprecate af_(lock/unlock)_device_ptr. Use af_(lock/unlock)_array
       new  d02636a   Add memFreeUnlinked to free locked device ptrs
       new  330f4f8   FEAT Add printMemInfo to print memory information
       new  ed5556c   Renamed is_free -> mngr_lock and is_unlinked -> user_lock in cpu memory mngr
       new  aa25b17   Renamed is_free -> mngr_lock and is_unlinked -> user_lock in cuda memory mngr
       new  cef8559   Renamed is_free -> mngr_lock and is_unlinked -> user_lock in opencl memory mngr
       new  dbe861e   Reverse conditions for freeing in memory managers
       new  33fbf33   Renamed internal memFree functions
       new  8cb21a4   Fix AF_DISABLE_GRAPHICS condition (Fixes e19a6be)
       new  b0c7c9a   Merge pull request #1207 from shehzan10/memory
       new  3c91935   Replaced ssh based url with http url for threads submodule
       new  7f7ce20   Merge pull request #1216 from 9prady9/threads_uri_fix
       new  e5bb334   Add missing isLAPACKAvailable implementation in CPU backend
       new  775747e   Set revision to "default" when git is not available
       new  84dccc8   Documentation fixes
       new  60a722a   Merge pull request #1215 from shehzan10/devel
       new  6988950   Build fix for CUDA backend when using boost 1.60
       new  09bc060   Merge pull request #1211 from pavanky/compile_fixes
       new  5940d4b   Always use freeimage flags instead of hardcoded offsets
       new  a6ae9bd   Merge pull request #1218 from pavanky/freeimage_fix
       new  3047acd   Add the ability to sort OpenCL devices Using the following criterion
       new  8873ed2   Using proper offsets for loadImageNative and saveImageNative
       new  14230d2   Adding environment variables to choose OpenCL device
       new  2f16824   Cleaning up exception handling in src/api/c
       new  b42cbeb   Updating docs for new AF_OPENCL_*_TYPE environment variables
       new  17b2600   Freeimage only requires the flags for 24 / 32 bit images
       new  bb29d92   Merge pull request #1222 from pavanky/opencl_device_sort
       new  685dccd   Update boost compute release tag
       new  6b7b1ce   Update clFFT release tag
       new  5866b17   Merge pull request #1202 from shehzan10/clupdates
       new  777abcb   Moving dispatch.hpp / dispatch.cpp to src/backend/
       new  828138c   Renaming a few internal functions
       new  c8cd29b   Adding a unified memory manager for all backends
       new  a175432   Remove unnecessary line from CMakeLists
       new  43d030d   Cleaning up error messages in loading and saving files
       new  73b7cac   Fixing CUDA platform manager to sort devices in a more saner manner.
       new  d75b899   Adding lock to memory allocated using af_alloc_device / af::alloc
       new  d5f3bf1   Adding documentation for AF_MEM_DEBUG
       new  f9a8336   Additional sanitizing for mutex locks
       new  0638f3f   Removing unnecessary returns from void functions
       new  1520dc3   Fixing issue where garbageCollect was only called on current device
       new  aaf554e   BUGFIX: Initialize buffer counts to 0
       new  9d0c159   af_set_device now only warns when device > 0 on CPU
       new  3009e8f   Merge pull request #1225 from pavanky/memory_manager
       new  db14451   Re-enable disabled sort tests from issue #995
       new  6da71e5   BUGFIX Handle 16-bit data in saveImage
       new  b14ae20   Fix saveImageNative for 1-channel images
       new  a6a4cdb   Update test data submodule commit
       new  88cf471   Add tests for 16-bit images for ImageIO+Native
       new  968ae4e   Handle CUDA devices locked in exclusive mode
       new  cc9018e   Add try/catch around cuda::setDevice in Pinned Memory Manager
       new  904d3e0   Using device independent vector for cuda Pinned Memory Manager
       new  bb479bf   Merge pull request #1229 from shehzan10/imageio16-test
       new  d9e5288   Fix NONFREE Build CMake Options
       new  5be5511   Handle compute_53 (tegra x1) for cuda lapack
       new  cc00f35   Add definition for each compute type
       new  a1823b3   Added helper functions for device type and unified mem in OpenCL
       new  4275f5f   Added getMappedPtr and unmapPtr functions in opencl memory
       new  3c1ab9f   Added matmul offloading to CPU
       new  f9819f7   Fix blas header types in cpu
       new  d5077ec   Fix bug in OpenCL JIT when calling functions that return same value
       new  ac25f5b   Fix bug in CUDA JIT when calling functions that return same value
       new  507ec92   dot in CUDA/OpenCL now uses mul followed by reduction
       new  7eafd44   Remove cpu dot fallback. Slower than opencl dot always
       new  45abbc3   Add OpenCL-CPU fallback for LU
       new  88e910d   Add OpenCL-CPU fallback for Cholesky
       new  872acfb   Add OpenCL-CPU fallback for QR
       new  59a9df0   Add OpenCL-CPU fallback for SVD
       new  ffb191c   Add OpenCL-CPU fallback for solve
       new  4e2d46c   Add OpenCL-CPU fallback for inverse
       new  210a64c   Clean up header files in opencl/cpu/
       new  e08d41b   Update environment variables doc
       new  b36d003   Move MappedPtr into the Array class
       new  56f9140   FEAT Add getActiveBackend function
       new  b39b60d   Merge remote-tracking branch 'upstream/devel' into fallback-opts
       new  a8b831b   Cleanup for opencl-cpu offload
       new  f6e309b   Clean up cusolver finding in cmake
       new  9962b87   Merge pull request #1221 from shehzan10/fallback-opts
       new  323bf75   Added tranform coordinates functionality
       new  ba483f1   Added transform coordinates to unified backend
       new  3522f80   Added transform coordinates documentation
       new  7f3e215   Added transform coordinates unit tests
       new  230c603   Updated test data
       new  ffc6e7f   Putting transform coordinates within version guards
       new  3941550   Move asserts inside try/catch in indexer functions in util
       new  735b66b   Fix OpenCL-CPU offload when OpenCL is built without lapack
       new  8a68775   Merge branch 'devel' of shehzan10/arrayfire into devel
       new  3389940   Fix resize unit test.
       new  6a34bee   Compile fixes for gcc 5.3
       new  46042fe   Fixing compiler warnings
       new  5fba37c   Functions to get opencl device type and platforms
       new  34c8c97   Work around for a bug in AMD's clBuildProgram
       new  4579762   Merge branch 'compile_fixes' of pavanky/arrayfire into devel
       new  e7e6080   Update clBLAS release tag
       new  845d3b3   Fixes in magma potrf (opencl cholesky)
       new  3ce49a5   BUGFIX Fix how streams are created in setActiveDevice (CUDA)
       new  76e9af4   Merge pull request #1220 from shehzan10/clBLAS-updates
       new  cc2dda0   Fixes to build with MKL when INTEL_MKL_ROOT is exported
       new  58fc4c8   Fixes to getMappedPtr in OpenCL backend
       new  d3d2996   Add getHostMemorySize and getDeviceMemorySize functions
       new  6c30652   Updates to Memory Manager and Garbage Collection Heuristics
       new  043739f   Move ArrayFireConfig, CPack (as CPackConfig) into CMakeModules
       new  cfd60f1   Move /bigobj definitions into main CMakeList (windows)
       new  aba1851   BUGFIX Add/remove entries for platform when adding external device/context
       new  163b5fb   BUGFIX Fix CUDA device management and free at destructor
       new  805dc5b   Fix Tests: ORB, Meanshift, basic_c, solve
       new  7eb905f   Add documentation for deviceInfo
       new  78f157c   Merge branch 'memory' of shehzan10/arrayfire into devel
       new  96041b5   BUGFIX: incorrect index for 3rd dimension in select / replace
       new  f228de3   TEST: Adding tests for 3D and 4D select and replace
       new  f8b0fb1   Merge pull request #1251 from pavanky/bug_fixes
       new  0039cdb   Proper exception handling for memory manager
       new  91bed33   Removing unneeded cudaDeviceSynchronize()
       new  519d3bb   Adding compute 37 to list of accepted CUDA computes
       new  41bad15   Changes required to build tests in a single file
       new  f94f522   Merge pull request #1250 from pavanky/misc_fixes
       new  3242618   Documentation fix in matchTemplate function
       new  209643b   syntax+typo fix in opencl backend
       new  9bf1455   Updated test data for meanshift, bilateral & morph
       new  64ef789   Merge pull request #1252 from 9prady9/docs_fixes
       new  3fc6939   Fixes issues when compiling with icc
       new  8648066   Merge branch 'fixes_to_compile_with_icc' of mchandra/arrayfire into devel
       new  ee7fa33   Removing unnecessary option "BUILD_GTEST"
       new  653416d   Updating release notes for 3.3 pre-release
       new  fc7553d   BUGFIX: max_bytes were being set incorrectly in MemoryManager
       new  5183a35   Cleaning up internal API for memory functions
       new  82e6558   JIT evaluation can now be tweaked by environment variables
       new  a1b7f8c   Changes to internal memory manager
       new  a938500   Fixes to random.hpp to work in multi-threaded environment
       new  95aaf72   OpenCL JIT now launches more threads per work group for CPU devices
       new  90ab8cd   Merge pull request #1262 from pavanky/memory_manager_fixes
       new  f674cda   BUGFIX: Fixing error in where for OpenCL backend
       new  98ff1c0   Merge pull request #1260 from pavanky/where_fix

The 273 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "adds" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .gitignore                                         |    1 +
 .gitmodules                                        |    3 +
 CMakeLists.txt                                     |   37 +-
 .../ArrayFireConfig.cmake.in                       |    0
 .../ArrayFireConfigVersion.cmake.in                |    0
 CPack.cmake => CMakeModules/CPackConfig.cmake      |    2 +-
 CMakeModules/FindCBLAS.cmake                       |   64 +-
 CMakeModules/FindLAPACKE.cmake                     |   56 +-
 CMakeModules/Version.cmake                         |    9 +-
 CMakeModules/build_boost_compute.cmake             |    9 +-
 CMakeModules/build_clBLAS.cmake                    |    2 +-
 CMakeModules/build_clFFT.cmake                     |    2 +-
 README.md                                          |    4 +-
 docs/details/backend.dox                           |    9 +
 docs/details/device.dox                            |   59 +-
 docs/details/image.dox                             |   65 +-
 docs/details/lapack.dox                            |    8 +
 docs/details/vision.dox                            |    9 +-
 docs/pages/README.md                               |    6 +-
 docs/pages/configuring_arrayfire_environment.md    |  117 +-
 docs/pages/release_notes.md                        |   70 ++
 examples/graphics/fractal.cpp                      |    3 +-
 examples/graphics/gravity_sim.cpp                  |  140 +++
 examples/graphics/plot2d.cpp                       |   11 +-
 .../image_processing/adaptive_thresholding.cpp     |    1 +
 examples/image_processing/brain_segmentation.cpp   |   10 +-
 examples/image_processing/filters.cpp              |    2 +-
 include/af/array.h                                 |    4 +-
 include/af/backend.h                               |   20 +
 include/af/defines.h                               |   16 +
 include/af/device.h                                |  183 ++-
 include/af/exception.h                             |    3 +
 include/af/graphics.h                              |   73 +-
 include/af/image.h                                 |   53 +
 include/af/lapack.h                                |   25 +
 include/af/opencl.h                                |  183 ++-
 include/af/util.h                                  |   33 +-
 include/arrayfire.h                                |    2 +
 src/api/c/assign.cpp                               |    9 +-
 src/api/c/data.cpp                                 |    2 +-
 src/api/c/device.cpp                               |  226 +---
 src/api/c/err_common.cpp                           |   65 +-
 src/api/c/flip.cpp                                 |    2 +-
 src/api/c/graphics_common.cpp                      |   35 +-
 src/api/c/graphics_common.hpp                      |    5 +-
 src/api/c/image.cpp                                |    2 +-
 src/api/c/imageio.cpp                              |   39 +-
 src/api/c/imageio2.cpp                             |   24 +-
 src/api/c/index.cpp                                |    8 +-
 src/api/c/lu.cpp                                   |   10 +
 src/api/c/{device.cpp => memory.cpp}               |  222 ++--
 src/api/c/moddims.cpp                              |   15 +-
 src/api/c/plot.cpp                                 |   38 +-
 src/api/c/plot3.cpp                                |   38 +-
 src/api/c/print.cpp                                |    4 +-
 src/api/c/sift.cpp                                 |    4 +-
 src/api/c/stream.cpp                               |   22 +-
 src/api/c/transform.cpp                            |   32 +-
 src/api/c/transform_coordinates.cpp                |   96 ++
 src/api/c/util.cpp                                 |   22 +-
 .../sort_by_key/ascd_f32.cu => api/c/version.cpp}  |    7 +-
 src/api/cpp/array.cpp                              |    4 +-
 src/api/cpp/device.cpp                             |   56 +-
 src/api/cpp/error.hpp                              |   12 +-
 src/api/cpp/exception.cpp                          |   16 +-
 src/api/cpp/graphics.cpp                           |   13 +-
 src/api/cpp/imageio.cpp                            |    7 +
 src/api/cpp/lapack.cpp                             |    7 +
 src/api/cpp/{sat.cpp => transform_coordinates.cpp} |    4 +-
 src/api/cpp/util.cpp                               |    6 +
 src/api/unified/CMakeLists.txt                     |    6 +-
 src/api/unified/device.cpp                         |   38 +
 src/api/unified/graphics.cpp                       |   12 +
 src/api/unified/image.cpp                          |   12 +
 src/api/unified/lapack.cpp                         |    5 +
 src/api/unified/symbol_manager.cpp                 |   19 -
 src/api/unified/symbol_manager.hpp                 |    2 +
 src/backend/MemoryManager.cpp                      |  308 +++++
 src/backend/MemoryManager.hpp                      |  119 ++
 src/backend/cpu/Array.cpp                          |  394 +++----
 src/backend/cpu/Array.hpp                          |   22 +-
 src/backend/cpu/CMakeLists.txt                     |   15 +
 src/backend/cpu/approx.cpp                         |  380 +------
 src/backend/cpu/assign.cpp                         |   89 +-
 src/backend/cpu/bilateral.cpp                      |   74 +-
 src/backend/cpu/blas.cpp                           |  112 +-
 src/backend/cpu/blas.hpp                           |    1 +
 src/backend/cpu/cholesky.cpp                       |   16 +-
 src/backend/cpu/convolve.cpp                       |  249 +---
 src/backend/cpu/copy.cpp                           |  239 ++--
 src/backend/cpu/diagonal.cpp                       |   94 +-
 src/backend/cpu/diff.cpp                           |  131 +--
 src/backend/cpu/exampleFunction.cpp                |    7 +
 src/backend/cpu/fast.cpp                           |  234 +---
 src/backend/cpu/fft.cpp                            |  179 +--
 src/backend/cpu/fftconvolve.cpp                    |  418 ++-----
 src/backend/cpu/gradient.cpp                       |   86 +-
 src/backend/cpu/harris.cpp                         |  200 +---
 src/backend/cpu/hist_graphics.cpp                  |    4 +
 src/backend/cpu/histogram.cpp                      |   30 +-
 src/backend/cpu/homography.cpp                     |   32 +-
 src/backend/cpu/hsv_rgb.cpp                        |  102 +-
 src/backend/cpu/identity.cpp                       |   57 +-
 src/backend/cpu/iir.cpp                            |   86 +-
 src/backend/cpu/image.cpp                          |   58 +-
 src/backend/cpu/index.cpp                          |   78 +-
 src/backend/cpu/inverse.cpp                        |   13 +-
 src/backend/cpu/iota.cpp                           |   69 +-
 src/backend/cpu/ireduce.cpp                        |  236 ++--
 src/backend/cpu/join.cpp                           |  317 ++----
 src/backend/cpu/kernel/Array.hpp                   |   58 +
 src/backend/cpu/kernel/approx1.hpp                 |  144 +++
 src/backend/cpu/kernel/approx2.hpp                 |  170 +++
 src/backend/cpu/kernel/assign.hpp                  |   80 ++
 src/backend/cpu/kernel/bilateral.hpp               |   81 ++
 src/backend/cpu/kernel/convolve.hpp                |  267 +++++
 src/backend/cpu/kernel/copy.hpp                    |   90 ++
 src/backend/cpu/kernel/diagonal.hpp                |   67 ++
 src/backend/cpu/kernel/diff.hpp                    |   86 ++
 src/backend/cpu/kernel/dot.hpp                     |   46 +
 src/backend/cpu/{fast.cpp => kernel/fast.hpp}      |  154 +--
 src/backend/cpu/{fft.cpp => kernel/fft.hpp}        |  115 +-
 src/backend/cpu/kernel/fftconvolve.hpp             |  256 +++++
 src/backend/cpu/kernel/gradient.hpp                |   89 ++
 src/backend/cpu/kernel/harris.hpp                  |  124 ++
 src/backend/cpu/kernel/histogram.hpp               |   49 +
 .../cpu/{hsv_rgb.cpp => kernel/hsv_rgb.hpp}        |   40 +-
 src/backend/cpu/kernel/identity.hpp                |   37 +
 src/backend/cpu/kernel/iir.hpp                     |   61 +
 src/backend/cpu/kernel/index.hpp                   |   71 ++
 src/backend/cpu/kernel/iota.hpp                    |   45 +
 src/backend/cpu/kernel/ireduce.hpp                 |  108 ++
 src/backend/cpu/kernel/join.hpp                    |  144 +++
 src/backend/cpu/kernel/lookup.hpp                  |   62 +
 src/backend/cpu/kernel/lu.hpp                      |   80 ++
 src/backend/cpu/kernel/match_template.hpp          |  141 +++
 .../cpu/{meanshift.cpp => kernel/meanshift.hpp}    |   62 +-
 .../cpu/{medfilt.cpp => kernel/medfilt.hpp}        |   44 +-
 src/backend/cpu/{morph.cpp => kernel/morph.hpp}    |  100 +-
 .../nearest_neighbour.hpp}                         |   57 +-
 src/backend/cpu/{orb.cpp => kernel/orb.hpp}        |  296 +----
 src/backend/cpu/kernel/random.hpp                  |  200 ++++
 src/backend/cpu/kernel/range.hpp                   |   52 +
 src/backend/cpu/kernel/reduce.hpp                  |   71 ++
 .../cpu/{regions.cpp => kernel/regions.hpp}        |   44 +-
 src/backend/cpu/kernel/reorder.hpp                 |   55 +
 src/backend/cpu/kernel/resize.hpp                  |  177 +++
 src/backend/cpu/kernel/rotate.hpp                  |   84 ++
 src/backend/cpu/kernel/scan.hpp                    |   72 ++
 src/backend/cpu/kernel/select.hpp                  |  124 ++
 src/backend/cpu/kernel/shift.hpp                   |   69 ++
 src/backend/cpu/kernel/sift_nonfree.hpp            | 1195 ++++++++++++++++++++
 src/backend/cpu/{sobel.cpp => kernel/sobel.hpp}    |   46 +-
 src/backend/cpu/kernel/sort.hpp                    |   51 +
 src/backend/cpu/kernel/sort_by_key.hpp             |   85 ++
 src/backend/cpu/kernel/sort_index.hpp              |   70 ++
 src/backend/cpu/kernel/susan.hpp                   |   99 ++
 src/backend/cpu/kernel/tile.hpp                    |   55 +
 src/backend/cpu/kernel/transform.hpp               |  131 +++
 .../cpu/{transpose.cpp => kernel/transpose.hpp}    |   92 +-
 .../cpu_triangle.hpp => cpu/kernel/triangle.hpp}   |   25 +-
 src/backend/cpu/kernel/unwrap.hpp                  |   81 ++
 src/backend/cpu/kernel/wrap.hpp                    |   80 ++
 src/backend/cpu/lookup.cpp                         |   54 +-
 src/backend/cpu/lu.cpp                             |  138 +--
 src/backend/cpu/lu.hpp                             |    2 +
 src/backend/cpu/match_template.cpp                 |  126 +--
 src/backend/cpu/math.cpp                           |   72 +-
 src/backend/cpu/meanshift.cpp                      |  119 +-
 src/backend/cpu/medfilt.cpp                        |  114 +-
 src/backend/cpu/memory.cpp                         |  342 +++---
 src/backend/cpu/memory.hpp                         |   18 +-
 src/backend/cpu/morph.cpp                          |  130 +--
 src/backend/cpu/nearest_neighbour.cpp              |  155 +--
 src/backend/cpu/orb.cpp                            |  534 +--------
 src/backend/cpu/platform.cpp                       |   64 +-
 src/backend/cpu/platform.hpp                       |   12 +
 src/backend/cpu/plot.cpp                           |   53 +-
 src/backend/cpu/plot3.cpp                          |   53 +-
 src/backend/cpu/qr.cpp                             |   60 +-
 src/backend/cpu/queue.hpp                          |   46 +
 src/backend/cpu/random.cpp                         |  145 +--
 src/backend/cpu/range.cpp                          |   92 +-
 src/backend/cpu/reduce.cpp                         |  341 +++---
 src/backend/cpu/regions.cpp                        |  176 +--
 src/backend/cpu/reorder.cpp                        |   81 +-
 src/backend/cpu/resize.cpp                         |  234 +---
 src/backend/cpu/rotate.cpp                         |  130 +--
 src/backend/cpu/scan.cpp                           |  113 +-
 src/backend/cpu/select.cpp                         |  147 +--
 src/backend/cpu/set.cpp                            |  170 +--
 src/backend/cpu/shift.cpp                          |   88 +-
 src/backend/cpu/sift.cpp                           |    6 +-
 src/backend/cpu/sift_nonfree.hpp                   | 1193 -------------------
 src/backend/cpu/sobel.cpp                          |   74 +-
 src/backend/cpu/solve.cpp                          |  136 ++-
 src/backend/cpu/sort.cpp                           |   87 +-
 src/backend/cpu/sort_by_key.cpp                    |  133 +--
 src/backend/cpu/sort_index.cpp                     |  110 +-
 src/backend/cpu/surface.cpp                        |   53 +-
 src/backend/cpu/susan.cpp                          |  115 +-
 src/backend/cpu/svd.cpp                            |  132 ++-
 src/backend/cpu/threads                            |    1 +
 src/backend/cpu/tile.cpp                           |   80 +-
 src/backend/cpu/transform.cpp                      |  166 +--
 src/backend/cpu/transform.hpp                      |    2 +-
 src/backend/cpu/transform_interp.hpp               |   80 +-
 src/backend/cpu/transpose.cpp                      |  119 +-
 src/backend/cpu/triangle.cpp                       |   65 +-
 src/backend/cpu/unwrap.cpp                         |  127 +--
 src/backend/cpu/utility.hpp                        |   63 ++
 src/backend/cpu/where.cpp                          |   82 +-
 src/backend/cpu/wrap.cpp                           |  125 +-
 src/backend/cuda/Array.cpp                         |   15 +-
 src/backend/cuda/Array.hpp                         |    6 +-
 src/backend/cuda/CMakeLists.txt                    |   67 +-
 src/backend/cuda/JIT/numeric.cu                    |   13 +
 src/backend/cuda/blas.cpp                          |   38 +-
 src/backend/cuda/complex.hpp                       |   22 +-
 src/backend/cuda/copy.cu                           |    3 +-
 src/backend/cuda/debug_cuda.hpp                    |   10 +-
 src/backend/cuda/err_cuda.hpp                      |   37 +-
 src/backend/cuda/interopManager.cu                 |    9 +-
 src/backend/cuda/kernel/fast_pyramid.hpp           |    6 +-
 src/backend/cuda/kernel/random.hpp                 |   15 +-
 src/backend/cuda/kernel/rotate.hpp                 |    6 +-
 src/backend/cuda/kernel/select.hpp                 |    4 +-
 src/backend/cuda/kernel/transform.hpp              |   79 +-
 src/backend/cuda/kernel/transform_interp.hpp       |   65 +-
 src/backend/cuda/lu.cu                             |   15 +
 src/backend/cuda/lu.hpp                            |    2 +
 src/backend/cuda/memory.cpp                        |  495 +++-----
 src/backend/cuda/memory.hpp                        |   18 +-
 src/backend/cuda/platform.cpp                      |  136 ++-
 src/backend/cuda/platform.hpp                      |    9 +
 src/backend/cuda/sift.cu                           |    4 +-
 src/backend/cuda/transform.cu                      |   10 +-
 src/backend/cuda/transform.hpp                     |    3 +-
 src/{api/c => backend}/dispatch.cpp                |    0
 src/{api/c => backend}/dispatch.hpp                |    0
 src/backend/host_memory.cpp                        |  113 ++
 .../c/dispatch.hpp => backend/host_memory.hpp}     |    8 +-
 src/backend/opencl/Array.cpp                       |   19 +-
 src/backend/opencl/Array.hpp                       |   36 +-
 src/backend/opencl/CMakeLists.txt                  |   20 +-
 src/backend/opencl/binary.hpp                      |    2 +-
 src/backend/opencl/blas.cpp                        |   53 +-
 src/backend/opencl/cholesky.cpp                    |   13 +-
 .../{cpu/blas.cpp => opencl/cpu/cpu_blas.cpp}      |   99 +-
 .../cpu_inverse.hpp => opencl/cpu/cpu_blas.hpp}    |    5 +-
 .../cpu_lapack => opencl/cpu}/cpu_cholesky.cpp     |   49 +-
 .../cpu_lapack => opencl/cpu}/cpu_cholesky.hpp     |    2 +-
 src/backend/opencl/cpu/cpu_helper.hpp              |   74 ++
 .../cpu_lapack => opencl/cpu}/cpu_inverse.cpp      |   50 +-
 .../cpu_lapack => opencl/cpu}/cpu_inverse.hpp      |    2 +-
 src/backend/{cpu/lu.cpp => opencl/cpu/cpu_lu.cpp}  |   83 +-
 .../{cuda/cpu_lapack => opencl/cpu}/cpu_lu.hpp     |    2 +-
 src/backend/{cpu/qr.cpp => opencl/cpu/cpu_qr.cpp}  |   68 +-
 .../{cuda/cpu_lapack => opencl/cpu}/cpu_qr.hpp     |    2 +-
 .../{cuda/cpu_lapack => opencl/cpu}/cpu_solve.cpp  |   88 +-
 .../{cuda/cpu_lapack => opencl/cpu}/cpu_solve.hpp  |    2 +-
 .../{cpu/svd.cpp => opencl/cpu/cpu_svd.cpp}        |   59 +-
 .../{cuda/cpu_lapack => opencl/cpu}/cpu_svd.hpp    |    2 +-
 .../cpu_lapack => opencl/cpu}/cpu_triangle.hpp     |    7 +-
 src/backend/opencl/debug_opencl.hpp                |    7 +-
 src/backend/opencl/err_opencl.hpp                  |    4 +-
 src/backend/opencl/inverse.cpp                     |    6 +
 src/backend/opencl/jit.cpp                         |    8 +-
 src/backend/opencl/kernel/convolve.hpp             |    1 +
 src/backend/opencl/kernel/ireduce.hpp              |   14 +
 src/backend/opencl/kernel/jit.cl                   |    1 +
 src/backend/opencl/kernel/orb.hpp                  |   37 +-
 src/backend/opencl/kernel/select.cl                |    4 +-
 src/backend/opencl/kernel/transform.cl             |   32 +-
 src/backend/opencl/kernel/transform.hpp            |   14 +-
 src/backend/opencl/kernel/transform_interp.cl      |   69 +-
 src/backend/opencl/kernel/where.hpp                |    4 +-
 src/backend/opencl/lu.cpp                          |   21 +-
 src/backend/opencl/lu.hpp                          |    2 +
 src/backend/opencl/magma/getrs.cpp                 |    4 +-
 src/backend/opencl/magma/magma_helper.cpp          |   15 +
 src/backend/opencl/magma/potrf.cpp                 |    4 +-
 src/backend/opencl/math.hpp                        |   15 +
 src/backend/opencl/memory.cpp                      |  488 ++++----
 src/backend/opencl/memory.hpp                      |   21 +-
 src/backend/opencl/platform.cpp                    |  498 ++++++--
 src/backend/opencl/platform.hpp                    |   47 +-
 src/backend/opencl/program.hpp                     |    5 +-
 src/backend/opencl/qr.cpp                          |   19 +-
 src/backend/opencl/sift.cpp                        |    4 +-
 src/backend/opencl/solve.cpp                       |   19 +-
 src/backend/opencl/svd.cpp                         |   10 +
 src/backend/opencl/transform.cpp                   |  110 +-
 src/backend/opencl/transform.hpp                   |    2 +-
 src/backend/opencl/unary.hpp                       |    2 +-
 src/backend/util.cpp                               |   37 +
 src/backend/{opencl/errorcodes.hpp => util.hpp}    |    8 +-
 test/CMakeLists.txt                                |   73 +-
 test/approx1.cpp                                   |    1 +
 test/approx2.cpp                                   |    1 +
 test/backend.cpp                                   |   26 +
 test/basic_c.c                                     |    4 +-
 test/bilateral.cpp                                 |    1 +
 test/binary.cpp                                    |    1 +
 test/cholesky_dense.cpp                            |    1 +
 test/convolve.cpp                                  |    1 +
 test/data                                          |    2 +-
 test/diagonal.cpp                                  |    1 +
 test/dot.cpp                                       |    1 +
 test/fast.cpp                                      |    5 +-
 test/fft.cpp                                       |    1 +
 test/fft_real.cpp                                  |    1 +
 test/fftconvolve.cpp                               |    1 +
 test/getting_started.cpp                           |    1 +
 test/gloh_nonfree.cpp                              |   21 +-
 test/harris.cpp                                    |    5 +-
 test/histogram.cpp                                 |    1 +
 test/homography.cpp                                |    1 +
 test/imageio.cpp                                   |  140 ++-
 test/inverse_dense.cpp                             |    1 +
 test/lu_dense.cpp                                  |    1 +
 test/main.cpp                                      |    6 +
 test/math.cpp                                      |    1 +
 test/meanshift.cpp                                 |   16 +-
 test/medfilt.cpp                                   |    1 +
 test/morph.cpp                                     |    1 +
 test/ocl_ext_context.cpp                           |  131 +++
 test/orb.cpp                                       |   13 +-
 test/qr_dense.cpp                                  |    1 +
 test/random.cpp                                    |    4 +
 test/rank_dense.cpp                                |    1 +
 test/reduce.cpp                                    |   10 -
 test/replace.cpp                                   |   43 +
 test/resize.cpp                                    |    3 +-
 test/rotate.cpp                                    |    1 +
 test/rotate_linear.cpp                             |   15 +-
 test/scan.cpp                                      |   10 -
 test/select.cpp                                    |   40 +
 test/sift_nonfree.cpp                              |   22 +-
 test/solve_dense.cpp                               |    6 +-
 test/sort_by_key.cpp                               |   18 +-
 test/sort_index.cpp                                |   17 +-
 test/susan.cpp                                     |    5 +-
 test/svd_dense.cpp                                 |    5 +-
 test/testHelpers.hpp                               |   61 +-
 test/transform.cpp                                 |  268 +++++
 test/transform_coordinates.cpp                     |  118 ++
 test/translate.cpp                                 |    1 +
 test/transpose.cpp                                 |    1 +
 test/triangle.cpp                                  |    1 +
 test/where.cpp                                     |   18 +-
 test/wrap.cpp                                      |   11 +-
 352 files changed, 13326 insertions(+), 11121 deletions(-)
 rename ArrayFireConfig.cmake.in => CMakeModules/ArrayFireConfig.cmake.in (100%)
 rename ArrayFireConfigVersion.cmake.in => CMakeModules/ArrayFireConfigVersion.cmake.in (100%)
 rename CPack.cmake => CMakeModules/CPackConfig.cmake (98%)
 create mode 100644 examples/graphics/gravity_sim.cpp
 copy src/api/c/{device.cpp => memory.cpp} (59%)
 create mode 100644 src/api/c/transform_coordinates.cpp
 copy src/{backend/cuda/sort_by_key/ascd_f32.cu => api/c/version.cpp} (77%)
 copy src/api/cpp/{sat.cpp => transform_coordinates.cpp} (76%)
 create mode 100644 src/backend/MemoryManager.cpp
 create mode 100644 src/backend/MemoryManager.hpp
 create mode 100644 src/backend/cpu/kernel/Array.hpp
 create mode 100644 src/backend/cpu/kernel/approx1.hpp
 create mode 100644 src/backend/cpu/kernel/approx2.hpp
 create mode 100644 src/backend/cpu/kernel/assign.hpp
 create mode 100644 src/backend/cpu/kernel/bilateral.hpp
 create mode 100644 src/backend/cpu/kernel/convolve.hpp
 create mode 100644 src/backend/cpu/kernel/copy.hpp
 create mode 100644 src/backend/cpu/kernel/diagonal.hpp
 create mode 100644 src/backend/cpu/kernel/diff.hpp
 create mode 100644 src/backend/cpu/kernel/dot.hpp
 copy src/backend/cpu/{fast.cpp => kernel/fast.hpp} (62%)
 copy src/backend/cpu/{fft.cpp => kernel/fft.hpp} (74%)
 create mode 100644 src/backend/cpu/kernel/fftconvolve.hpp
 create mode 100644 src/backend/cpu/kernel/gradient.hpp
 create mode 100644 src/backend/cpu/kernel/harris.hpp
 create mode 100644 src/backend/cpu/kernel/histogram.hpp
 copy src/backend/cpu/{hsv_rgb.cpp => kernel/hsv_rgb.hpp} (81%)
 create mode 100644 src/backend/cpu/kernel/identity.hpp
 create mode 100644 src/backend/cpu/kernel/iir.hpp
 create mode 100644 src/backend/cpu/kernel/index.hpp
 create mode 100644 src/backend/cpu/kernel/iota.hpp
 create mode 100644 src/backend/cpu/kernel/ireduce.hpp
 create mode 100644 src/backend/cpu/kernel/join.hpp
 create mode 100644 src/backend/cpu/kernel/lookup.hpp
 create mode 100644 src/backend/cpu/kernel/lu.hpp
 create mode 100644 src/backend/cpu/kernel/match_template.hpp
 copy src/backend/cpu/{meanshift.cpp => kernel/meanshift.hpp} (75%)
 copy src/backend/cpu/{medfilt.cpp => kernel/medfilt.hpp} (82%)
 copy src/backend/cpu/{morph.cpp => kernel/morph.hpp} (65%)
 copy src/backend/cpu/{nearest_neighbour.cpp => kernel/nearest_neighbour.hpp} (60%)
 copy src/backend/cpu/{orb.cpp => kernel/orb.hpp} (53%)
 create mode 100644 src/backend/cpu/kernel/random.hpp
 create mode 100644 src/backend/cpu/kernel/range.hpp
 create mode 100644 src/backend/cpu/kernel/reduce.hpp
 copy src/backend/cpu/{regions.cpp => kernel/regions.hpp} (85%)
 create mode 100644 src/backend/cpu/kernel/reorder.hpp
 create mode 100644 src/backend/cpu/kernel/resize.hpp
 create mode 100644 src/backend/cpu/kernel/rotate.hpp
 create mode 100644 src/backend/cpu/kernel/scan.hpp
 create mode 100644 src/backend/cpu/kernel/select.hpp
 create mode 100644 src/backend/cpu/kernel/shift.hpp
 create mode 100644 src/backend/cpu/kernel/sift_nonfree.hpp
 copy src/backend/cpu/{sobel.cpp => kernel/sobel.hpp} (66%)
 create mode 100644 src/backend/cpu/kernel/sort.hpp
 create mode 100644 src/backend/cpu/kernel/sort_by_key.hpp
 create mode 100644 src/backend/cpu/kernel/sort_index.hpp
 create mode 100644 src/backend/cpu/kernel/susan.hpp
 create mode 100644 src/backend/cpu/kernel/tile.hpp
 create mode 100644 src/backend/cpu/kernel/transform.hpp
 copy src/backend/cpu/{transpose.cpp => kernel/transpose.hpp} (57%)
 copy src/backend/{cuda/cpu_lapack/cpu_triangle.hpp => cpu/kernel/triangle.hpp} (81%)
 create mode 100644 src/backend/cpu/kernel/unwrap.hpp
 create mode 100644 src/backend/cpu/kernel/wrap.hpp
 create mode 100644 src/backend/cpu/queue.hpp
 delete mode 100644 src/backend/cpu/sift_nonfree.hpp
 create mode 160000 src/backend/cpu/threads
 create mode 100644 src/backend/cpu/utility.hpp
 rename src/{api/c => backend}/dispatch.cpp (100%)
 copy src/{api/c => backend}/dispatch.hpp (100%)
 create mode 100644 src/backend/host_memory.cpp
 rename src/{api/c/dispatch.hpp => backend/host_memory.hpp} (83%)
 copy src/backend/{cpu/blas.cpp => opencl/cpu/cpu_blas.cpp} (73%)
 copy src/backend/{cuda/cpu_lapack/cpu_inverse.hpp => opencl/cpu/cpu_blas.hpp} (74%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_cholesky.cpp (61%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_cholesky.hpp (97%)
 create mode 100644 src/backend/opencl/cpu/cpu_helper.hpp
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_inverse.cpp (63%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_inverse.hpp (96%)
 copy src/backend/{cpu/lu.cpp => opencl/cpu/cpu_lu.cpp} (77%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_lu.hpp (97%)
 copy src/backend/{cpu/qr.cpp => opencl/cpu/cpu_qr.cpp} (68%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_qr.hpp (96%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_solve.cpp (70%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_solve.hpp (97%)
 copy src/backend/{cpu/svd.cpp => opencl/cpu/cpu_svd.cpp} (75%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_svd.hpp (97%)
 copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_triangle.hpp (94%)
 create mode 100644 src/backend/util.cpp
 copy src/backend/{opencl/errorcodes.hpp => util.hpp} (69%)
 create mode 100644 test/main.cpp
 create mode 100644 test/ocl_ext_context.cpp
 create mode 100644 test/transform.cpp
 create mode 100644 test/transform_coordinates.cpp

-- 
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/debian-science/packages/arrayfire.git



More information about the debian-science-commits mailing list