[arrayfire] branch dfsg-clean updated (25a2a75 -> cb95bda)
Ghislain Vaillant
ghisvail-guest at moszumanska.debian.org
Sun Feb 7 18:59:43 UTC 2016
This is an automated email from the git hooks/post-receive script.
ghisvail-guest pushed a change to branch dfsg-clean
in repository arrayfire.
from 25a2a75 Merge tag 'upstream/v3.2.2' into dfsg-clean
new be0ff22 Setup async project; Setup basic queues;
new b94c3df Convert CPU blas to use async queues
new 3188bdf Async CPU approx1 and approx2
new f797314 Async CPU Assign
new c8ecdb9 Async CPU Bilateral
new 759b506 Async CPU Convolve
new c399e75 Async CPU diff1 and diff2
new 80903d0 Avoid sending references to queued lambdas
new 96c5602 Async CPU Copy, Assign, and Index
new 46beebc Async CPU diagonal
new b7c83e8 Async FFT for the CPU backend
new 413eea8 Add eval to copyData
new 49f0cce Async random on CPU backend
new fada883 Async where on the CPU backe
new 1a0802f Async CPU reduce and ireduce
new 1842bcf Async CPU Transpose. Fix bug in eval
new 5d428df Merge branch 'devel' into async
new 4a8e723 Merge additional changes for async CPU.
new 91f7a1f async cpu::index function
new 2796770 Fixed asynchronous indexing & assignment in cpu backend
new 0aeed42 converted cpu tile to asychronous call
new 330ae1c converted sort_index cpu function to asynchronous call
new 5eea071 converted triangle fn in cpu backend to async call
new 551433e converted lu & cholesky decomposition functions to async calls
new ed6d26d svd cpu backend is async now
new e0d7c12 converted qr & solve cpu functions to async calls
new d0223f9 Converted wrap & unwrap cpu fns to async calls
new 32a65d8 converted transform to async call
new 743fb4a converted susan fn in cpu backend to asynchronous call
new 840af46 convert sort & sort_by_key cpu fns to async calls
new e005557 sobel cpu fn is async fn after this change
new 14e9d31 Converted resize & shift cpu fns to async calls
new 3c2bc65 convert select & rotate cpu fns to async calls
new 258d573 Convert morph & range cpu fns to async calls
new 9510fcb Converted cpu scan function to async call
new 6fc636f fix for async sift cpu function
new 48a9e58 converted matchTemplate, meanShift & medfilt to async calls
new b813fd4 nearest neighbour cpu func is asyn call now
new 35a462c conversion of listed functions to async calls
new e5ab671 converted join cpu func to async call
new 0c72451 converted cpu regions function to asynchronous call
new ed730cf adds scatter function
new 0a78b60 update plot2d example to include scatter plot
new 4ec314a cleanup and scatter example update
new 6f52c36 adds scatter3
new 6515414 Merge pull request #1116 from syurkevi/forge_scatter
new b267ffd Increment version for devel to 3.3
new 65c7a23 Fixes for scatter
new 323aa77 Merge pull request #1133 from shehzan10/devel
new 53de790 Removed dead code from opencl::DeviceManager class
new ce2d6a6 Enables use of ArrayFire with external context & command queue
new f65ee89 cpp wrappers for opencl external context related fns
new 2bcc6de unit tests for afcl::{pushDevice, setDevice, popDevice} fns
new 964b1ea Style changes in opencl header
new d41839f api name change for afcl external context functionality
new 227377d Added OpenCL include dir for unit tests
new 67ef051 additional style changes
new 72b71e8 Merge pull request #1140 from 9prady9/ext_ocl_context
new b9e113b Merge remote-tracking branch 'upstream/master' into devel
new 365dc94 DOC resolve markerType enum in graphics
new b878711 Remove unused variable warning in homography cuda kernel
new ebe6cac Merge branch 'devel' into async
new df2c091 Converted iir, fir, fftconvolve to async calls
new c06f24d Change to gfx to handle Arrays created by async calls
new 21f74eb Fixed harris & homography cpu fns to work with async fns
new 2217014 Fix in Array::device method
new 919333e Fix for getDeviceMemInfo function in cpu
new 7dac34a Fixes for asynchronous cpu copy && set functions
new 6b9c157 FEAT added allocHost and freeHost functions
new 3c9d69d FEAT added infoString function to return info as string
new e73ee6d Merge remote-tracking branch 'upstream/hotfixes-3.2.2' into devel
new 1de97de Using af_alloc_host when allocating user-return string
new f628fbe toString now uses af_alloc_host to allocate memory
new 7206028 Add overload of toString that returns a string
new 06d4bef FEAT add af_get_revision to get commit instead of AF_REVISION
new b3c28b6 Using c/version.cpp in unified
new 34f3569 Merge pull request #1163 from shehzan10/devel
new ab11b6b Merge branch 'devel' into async
new 3ba9633 Added missing eval for input Array's in cpu backend fns
new 8cc9c9c threads library is now a submodule in cpu backend
new abce1e8 Moved fns enqueued onto async queue to separate folder
new 5f2f155 Removed obselete fn of evalArray from all backends
new e651cad cpu::Array::eval queue work moved to kerenel namespace
new 3cddae2 moved assign cpu async fn to kernel space
new 0e563ba moved kernel namespace in cpu backend inside cpu namespace
new d03bb75 moved bilateral, convolve, fftconvolve to cpu kernel namespace
new e8f0242 moved copy queue fns from cpu backend to kernel namespace
new 913850d Moved diagonal cpu implementation to kernel namespace
new 71298c6 moved diff, fast, gradient, harris, histogram to kernel namespace
new cc7d665 moved rgb_hsv & identity fns to kernel namespace
new 696657c moved indexing & assignment fns to kernel namespace
new f2b84dd template parameters style fixes in cpu kernel namespace fns
new bb0a22c Moved more cpu fns implementations to kernel namespace
new 7d7f32f moved the left over fns to cpu kernel namespace
new 1313f98 Fixed the bug in cpu ireduce kernel function
new b684b06 Fixed orb async cpu fn
new a0f17b6 cmake fix to check for threads submodule
new c539f1d moved fft cpu fns implementations to kernel namespace
new 4831215 moved dot cpu implementation to kernel namespace
new d1089f8 moved fftconvolve reorder helper fn to kernel namespace
new 4f8b3fa fixed cpu::kernel::dot fn signature
new 95d9346 Added ENQUEUE macro in cpu backend
new 7dad2ef Removed obsolete queue sync in cpu::padArray fn
new 90611a2 Fixed cmake condition for threads submodule check
new 1cbffbb Synchronize when AF_SYNCHRONOUS_CALLS is set to 1
new 6058dd2 Revert "Added ENQUEUE macro in cpu backend"
new edda52a Update README.md with updated status badges
new 29cfed5 Merge pull request #1178 from shehzan10/readme-badges
new cad4c2c initial gravity example
new 5507717 add collisions, split vectors into components for performance
new 5c01608 remove state globals
new b1aa875 remove windows pause ifdef
new c62eea0 Merge pull request #1152 from syurkevi/gravity_example
new 5ecdc54 Added API support for perspective transform
new 2a43871 Added perspective transform to CPU backend
new 7fdfe3e Added perspective transform to CUDA backend
new 15b9ad6 Added perspective transform to OpenCL backend
new 81dca06 Updated transform documentation
new 8e4e766 Added perspective transform unit tests
new 7327fb2 Updated test data
new 27aeed0 Fixed wrong data type in OpenCL transform
new bdc31d0 Simplified test for perspective transform in API
new 951533c Merge pull request #1190 from pentschev/perspective_transform
new 2ae8273 Merge remote-tracking branch 'arrayfire/master' into devel
new f0b9691 Merge branch 'devel' into async
new 861808e Merge pull request #1174 from umar456/async
new c2d7e42 Fix clang warnings (std::abs, pragma ignores)
new fa765a6 Merge pull request #1208 from shehzan10/clang-warnings
new 1dd2195 Cleanup util.cpp
new e19a6be Using getEnvVar instead of getenv
new b260abf Cleanup/improve backend test
new de4851d Not building info for unified. backend does the same as info
new 4d06c74 FEAT Added isImageIOAvailable function to check support
new 1b85d6d FEAT Added isLAPACKAvailable function to check support
new 7747ee6 Use isImageIOAvailable in testHelper
new fe3fa66 Use isLAPACKAvailable in testHelper
new b89ab5d Add missing af_err to string
new 8813a2e af_get_last_error supports NULL as valid argument for len
new b7af25a Improvements to af::exception messages
new 7b6eee1 Add version guards around allocHost and freeHost
new 9605740 Deprecate af_(lock/unlock)_device_ptr. Use af_(lock/unlock)_array
new d02636a Add memFreeUnlinked to free locked device ptrs
new 330f4f8 FEAT Add printMemInfo to print memory information
new ed5556c Renamed is_free -> mngr_lock and is_unlinked -> user_lock in cpu memory mngr
new aa25b17 Renamed is_free -> mngr_lock and is_unlinked -> user_lock in cuda memory mngr
new cef8559 Renamed is_free -> mngr_lock and is_unlinked -> user_lock in opencl memory mngr
new dbe861e Reverse conditions for freeing in memory managers
new 33fbf33 Renamed internal memFree functions
new 8cb21a4 Fix AF_DISABLE_GRAPHICS condition (Fixes e19a6be)
new b0c7c9a Merge pull request #1207 from shehzan10/memory
new 3c91935 Replaced ssh based url with http url for threads submodule
new 7f7ce20 Merge pull request #1216 from 9prady9/threads_uri_fix
new e5bb334 Add missing isLAPACKAvailable implementation in CPU backend
new 775747e Set revision to "default" when git is not available
new 84dccc8 Documentation fixes
new 60a722a Merge pull request #1215 from shehzan10/devel
new 6988950 Build fix for CUDA backend when using boost 1.60
new 09bc060 Merge pull request #1211 from pavanky/compile_fixes
new 5940d4b Always use freeimage flags instead of hardcoded offsets
new a6ae9bd Merge pull request #1218 from pavanky/freeimage_fix
new 3047acd Add the ability to sort OpenCL devices Using the following criterion
new 8873ed2 Using proper offsets for loadImageNative and saveImageNative
new 14230d2 Adding environment variables to choose OpenCL device
new 2f16824 Cleaning up exception handling in src/api/c
new b42cbeb Updating docs for new AF_OPENCL_*_TYPE environment variables
new 17b2600 Freeimage only requires the flags for 24 / 32 bit images
new bb29d92 Merge pull request #1222 from pavanky/opencl_device_sort
new 685dccd Update boost compute release tag
new 6b7b1ce Update clFFT release tag
new 5866b17 Merge pull request #1202 from shehzan10/clupdates
new 777abcb Moving dispatch.hpp / dispatch.cpp to src/backend/
new 828138c Renaming a few internal functions
new c8cd29b Adding a unified memory manager for all backends
new a175432 Remove unnecessary line from CMakeLists
new 43d030d Cleaning up error messages in loading and saving files
new 73b7cac Fixing CUDA platform manager to sort devices in a more saner manner.
new d75b899 Adding lock to memory allocated using af_alloc_device / af::alloc
new d5f3bf1 Adding documentation for AF_MEM_DEBUG
new f9a8336 Additional sanitizing for mutex locks
new 0638f3f Removing unnecessary returns from void functions
new 1520dc3 Fixing issue where garbageCollect was only called on current device
new aaf554e BUGFIX: Initialize buffer counts to 0
new 9d0c159 af_set_device now only warns when device > 0 on CPU
new 3009e8f Merge pull request #1225 from pavanky/memory_manager
new db14451 Re-enable disabled sort tests from issue #995
new 6da71e5 BUGFIX Handle 16-bit data in saveImage
new b14ae20 Fix saveImageNative for 1-channel images
new a6a4cdb Update test data submodule commit
new 88cf471 Add tests for 16-bit images for ImageIO+Native
new 968ae4e Handle CUDA devices locked in exclusive mode
new cc9018e Add try/catch around cuda::setDevice in Pinned Memory Manager
new 904d3e0 Using device independent vector for cuda Pinned Memory Manager
new bb479bf Merge pull request #1229 from shehzan10/imageio16-test
new d9e5288 Fix NONFREE Build CMake Options
new 5be5511 Handle compute_53 (tegra x1) for cuda lapack
new cc00f35 Add definition for each compute type
new a1823b3 Added helper functions for device type and unified mem in OpenCL
new 4275f5f Added getMappedPtr and unmapPtr functions in opencl memory
new 3c1ab9f Added matmul offloading to CPU
new f9819f7 Fix blas header types in cpu
new d5077ec Fix bug in OpenCL JIT when calling functions that return same value
new ac25f5b Fix bug in CUDA JIT when calling functions that return same value
new 507ec92 dot in CUDA/OpenCL now uses mul followed by reduction
new 7eafd44 Remove cpu dot fallback. Slower than opencl dot always
new 45abbc3 Add OpenCL-CPU fallback for LU
new 88e910d Add OpenCL-CPU fallback for Cholesky
new 872acfb Add OpenCL-CPU fallback for QR
new 59a9df0 Add OpenCL-CPU fallback for SVD
new ffb191c Add OpenCL-CPU fallback for solve
new 4e2d46c Add OpenCL-CPU fallback for inverse
new 210a64c Clean up header files in opencl/cpu/
new e08d41b Update environment variables doc
new b36d003 Move MappedPtr into the Array class
new 56f9140 FEAT Add getActiveBackend function
new b39b60d Merge remote-tracking branch 'upstream/devel' into fallback-opts
new a8b831b Cleanup for opencl-cpu offload
new f6e309b Clean up cusolver finding in cmake
new 9962b87 Merge pull request #1221 from shehzan10/fallback-opts
new 323bf75 Added tranform coordinates functionality
new ba483f1 Added transform coordinates to unified backend
new 3522f80 Added transform coordinates documentation
new 7f3e215 Added transform coordinates unit tests
new 230c603 Updated test data
new ffc6e7f Putting transform coordinates within version guards
new 3941550 Move asserts inside try/catch in indexer functions in util
new 735b66b Fix OpenCL-CPU offload when OpenCL is built without lapack
new 8a68775 Merge branch 'devel' of shehzan10/arrayfire into devel
new 3389940 Fix resize unit test.
new 6a34bee Compile fixes for gcc 5.3
new 46042fe Fixing compiler warnings
new 5fba37c Functions to get opencl device type and platforms
new 34c8c97 Work around for a bug in AMD's clBuildProgram
new 4579762 Merge branch 'compile_fixes' of pavanky/arrayfire into devel
new e7e6080 Update clBLAS release tag
new 845d3b3 Fixes in magma potrf (opencl cholesky)
new 3ce49a5 BUGFIX Fix how streams are created in setActiveDevice (CUDA)
new 76e9af4 Merge pull request #1220 from shehzan10/clBLAS-updates
new cc2dda0 Fixes to build with MKL when INTEL_MKL_ROOT is exported
new 58fc4c8 Fixes to getMappedPtr in OpenCL backend
new d3d2996 Add getHostMemorySize and getDeviceMemorySize functions
new 6c30652 Updates to Memory Manager and Garbage Collection Heuristics
new 043739f Move ArrayFireConfig, CPack (as CPackConfig) into CMakeModules
new cfd60f1 Move /bigobj definitions into main CMakeList (windows)
new aba1851 BUGFIX Add/remove entries for platform when adding external device/context
new 163b5fb BUGFIX Fix CUDA device management and free at destructor
new 805dc5b Fix Tests: ORB, Meanshift, basic_c, solve
new 7eb905f Add documentation for deviceInfo
new 78f157c Merge branch 'memory' of shehzan10/arrayfire into devel
new 96041b5 BUGFIX: incorrect index for 3rd dimension in select / replace
new f228de3 TEST: Adding tests for 3D and 4D select and replace
new f8b0fb1 Merge pull request #1251 from pavanky/bug_fixes
new 0039cdb Proper exception handling for memory manager
new 91bed33 Removing unneeded cudaDeviceSynchronize()
new 519d3bb Adding compute 37 to list of accepted CUDA computes
new 41bad15 Changes required to build tests in a single file
new f94f522 Merge pull request #1250 from pavanky/misc_fixes
new 3242618 Documentation fix in matchTemplate function
new 209643b syntax+typo fix in opencl backend
new 9bf1455 Updated test data for meanshift, bilateral & morph
new 64ef789 Merge pull request #1252 from 9prady9/docs_fixes
new 3fc6939 Fixes issues when compiling with icc
new 8648066 Merge branch 'fixes_to_compile_with_icc' of mchandra/arrayfire into devel
new ee7fa33 Removing unnecessary option "BUILD_GTEST"
new 653416d Updating release notes for 3.3 pre-release
new fc7553d BUGFIX: max_bytes were being set incorrectly in MemoryManager
new 5183a35 Cleaning up internal API for memory functions
new 82e6558 JIT evaluation can now be tweaked by environment variables
new a1b7f8c Changes to internal memory manager
new a938500 Fixes to random.hpp to work in multi-threaded environment
new 95aaf72 OpenCL JIT now launches more threads per work group for CPU devices
new 90ab8cd Merge pull request #1262 from pavanky/memory_manager_fixes
new f674cda BUGFIX: Fixing error in where for OpenCL backend
new 98ff1c0 Merge pull request #1260 from pavanky/where_fix
new fb7d11e Merge tag 'v3.3.alpha' into dfsg-clean
new cb95bda Clean source tree from non-dfsg components.
The 275 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "adds" were already present in the repository and have only
been added to this reference.
Summary of changes:
.gitignore | 1 +
.gitmodules | 3 +
CMakeLists.txt | 37 +-
.../ArrayFireConfig.cmake.in | 0
.../ArrayFireConfigVersion.cmake.in | 0
CPack.cmake => CMakeModules/CPackConfig.cmake | 2 +-
CMakeModules/FindCBLAS.cmake | 64 ++-
CMakeModules/FindLAPACKE.cmake | 56 ++-
CMakeModules/Version.cmake | 9 +-
CMakeModules/build_boost_compute.cmake | 9 +-
CMakeModules/build_clBLAS.cmake | 2 +-
CMakeModules/build_clFFT.cmake | 2 +-
README.md | 4 +-
docs/details/backend.dox | 9 +
docs/details/device.dox | 59 ++-
docs/details/image.dox | 65 ++-
docs/details/lapack.dox | 8 +
docs/details/vision.dox | 9 +-
docs/pages/README.md | 6 +-
docs/pages/configuring_arrayfire_environment.md | 117 ++++-
docs/pages/release_notes.md | 70 +++
examples/graphics/fractal.cpp | 3 +-
examples/graphics/gravity_sim.cpp | 140 ++++++
examples/graphics/plot2d.cpp | 11 +-
.../image_processing/adaptive_thresholding.cpp | 1 +
examples/image_processing/brain_segmentation.cpp | 10 +-
examples/image_processing/filters.cpp | 2 +-
include/af/array.h | 4 +-
include/af/backend.h | 20 +
include/af/defines.h | 16 +
include/af/device.h | 183 ++++++-
include/af/exception.h | 3 +
include/af/graphics.h | 73 ++-
include/af/image.h | 53 ++
include/af/lapack.h | 25 +
include/af/opencl.h | 183 ++++++-
include/af/util.h | 33 +-
include/arrayfire.h | 2 +
src/api/c/assign.cpp | 9 +-
src/api/c/data.cpp | 2 +-
src/api/c/device.cpp | 226 ++-------
src/api/c/err_common.cpp | 65 +--
src/api/c/flip.cpp | 2 +-
src/api/c/graphics_common.cpp | 35 +-
src/api/c/graphics_common.hpp | 5 +-
src/api/c/image.cpp | 2 +-
src/api/c/imageio.cpp | 39 +-
src/api/c/imageio2.cpp | 24 +-
src/api/c/index.cpp | 8 +-
src/api/c/lu.cpp | 10 +
src/api/c/{device.cpp => memory.cpp} | 222 +++------
src/api/c/moddims.cpp | 15 +-
src/api/c/plot.cpp | 38 +-
src/api/c/plot3.cpp | 38 +-
src/api/c/print.cpp | 4 +-
src/api/c/sift.cpp | 4 +-
src/api/c/stream.cpp | 22 +-
src/api/c/transform.cpp | 32 +-
src/api/c/transform_coordinates.cpp | 96 ++++
src/api/c/util.cpp | 22 +-
.../sort_by_key/ascd_f32.cu => api/c/version.cpp} | 7 +-
src/api/cpp/array.cpp | 4 +-
src/api/cpp/device.cpp | 56 ++-
src/api/cpp/error.hpp | 12 +-
src/api/cpp/exception.cpp | 16 +-
src/api/cpp/graphics.cpp | 13 +-
src/api/cpp/imageio.cpp | 7 +
src/api/cpp/lapack.cpp | 7 +
src/api/cpp/{sat.cpp => transform_coordinates.cpp} | 4 +-
src/api/cpp/util.cpp | 6 +
src/api/unified/CMakeLists.txt | 6 +-
src/api/unified/device.cpp | 38 ++
src/api/unified/graphics.cpp | 12 +
src/api/unified/image.cpp | 12 +
src/api/unified/lapack.cpp | 5 +
src/api/unified/symbol_manager.cpp | 19 -
src/api/unified/symbol_manager.hpp | 2 +
src/backend/MemoryManager.cpp | 308 ++++++++++++
src/backend/MemoryManager.hpp | 119 +++++
src/backend/cpu/Array.cpp | 394 +++++++--------
src/backend/cpu/Array.hpp | 22 +-
src/backend/cpu/CMakeLists.txt | 15 +
src/backend/cpu/approx.cpp | 380 +++------------
src/backend/cpu/assign.cpp | 89 +---
src/backend/cpu/bilateral.cpp | 74 +--
src/backend/cpu/blas.cpp | 112 ++---
src/backend/cpu/blas.hpp | 1 +
src/backend/cpu/cholesky.cpp | 16 +-
src/backend/cpu/convolve.cpp | 249 +---------
src/backend/cpu/copy.cpp | 239 ++++-----
src/backend/cpu/diagonal.cpp | 94 ++--
src/backend/cpu/diff.cpp | 131 ++---
src/backend/cpu/exampleFunction.cpp | 7 +
src/backend/cpu/fast.cpp | 234 +--------
src/backend/cpu/fft.cpp | 179 +------
src/backend/cpu/fftconvolve.cpp | 418 +++++-----------
src/backend/cpu/gradient.cpp | 86 +---
src/backend/cpu/harris.cpp | 200 ++------
src/backend/cpu/hist_graphics.cpp | 4 +
src/backend/cpu/histogram.cpp | 30 +-
src/backend/cpu/homography.cpp | 32 +-
src/backend/cpu/hsv_rgb.cpp | 102 +---
src/backend/cpu/identity.cpp | 57 +--
src/backend/cpu/iir.cpp | 86 ++--
src/backend/cpu/image.cpp | 58 +--
src/backend/cpu/index.cpp | 78 +--
src/backend/cpu/inverse.cpp | 13 +-
src/backend/cpu/iota.cpp | 69 +--
src/backend/cpu/ireduce.cpp | 236 +++------
src/backend/cpu/join.cpp | 317 ++++--------
src/backend/cpu/kernel/Array.hpp | 58 +++
src/backend/cpu/kernel/approx1.hpp | 144 ++++++
src/backend/cpu/kernel/approx2.hpp | 170 +++++++
src/backend/cpu/kernel/assign.hpp | 80 +++
src/backend/cpu/kernel/bilateral.hpp | 81 ++++
src/backend/cpu/kernel/convolve.hpp | 267 +++++++++++
src/backend/cpu/kernel/copy.hpp | 90 ++++
src/backend/cpu/kernel/diagonal.hpp | 67 +++
src/backend/cpu/kernel/diff.hpp | 86 ++++
src/backend/cpu/kernel/dot.hpp | 46 ++
src/backend/cpu/{fast.cpp => kernel/fast.hpp} | 154 +-----
src/backend/cpu/{fft.cpp => kernel/fft.hpp} | 115 ++---
src/backend/cpu/kernel/fftconvolve.hpp | 256 ++++++++++
src/backend/cpu/kernel/gradient.hpp | 89 ++++
src/backend/cpu/kernel/harris.hpp | 124 +++++
src/backend/cpu/kernel/histogram.hpp | 49 ++
.../cpu/{hsv_rgb.cpp => kernel/hsv_rgb.hpp} | 40 +-
src/backend/cpu/kernel/identity.hpp | 37 ++
src/backend/cpu/kernel/iir.hpp | 61 +++
src/backend/cpu/kernel/index.hpp | 71 +++
src/backend/cpu/kernel/iota.hpp | 45 ++
src/backend/cpu/kernel/ireduce.hpp | 108 +++++
src/backend/cpu/kernel/join.hpp | 144 ++++++
src/backend/cpu/kernel/lookup.hpp | 62 +++
src/backend/cpu/kernel/lu.hpp | 80 +++
src/backend/cpu/kernel/match_template.hpp | 141 ++++++
.../cpu/{meanshift.cpp => kernel/meanshift.hpp} | 62 +--
.../cpu/{medfilt.cpp => kernel/medfilt.hpp} | 44 +-
src/backend/cpu/{morph.cpp => kernel/morph.hpp} | 100 ++--
.../nearest_neighbour.hpp} | 57 +--
src/backend/cpu/{orb.cpp => kernel/orb.hpp} | 296 +-----------
src/backend/cpu/kernel/random.hpp | 200 ++++++++
src/backend/cpu/kernel/range.hpp | 52 ++
src/backend/cpu/kernel/reduce.hpp | 71 +++
.../cpu/{regions.cpp => kernel/regions.hpp} | 44 +-
src/backend/cpu/kernel/reorder.hpp | 55 +++
src/backend/cpu/kernel/resize.hpp | 177 +++++++
src/backend/cpu/kernel/rotate.hpp | 84 ++++
src/backend/cpu/kernel/scan.hpp | 72 +++
src/backend/cpu/kernel/select.hpp | 124 +++++
src/backend/cpu/kernel/shift.hpp | 69 +++
src/backend/cpu/{sobel.cpp => kernel/sobel.hpp} | 46 +-
src/backend/cpu/kernel/sort.hpp | 51 ++
src/backend/cpu/kernel/sort_by_key.hpp | 85 ++++
src/backend/cpu/kernel/sort_index.hpp | 70 +++
src/backend/cpu/kernel/susan.hpp | 99 ++++
src/backend/cpu/kernel/tile.hpp | 55 +++
src/backend/cpu/kernel/transform.hpp | 131 +++++
.../cpu/{transpose.cpp => kernel/transpose.hpp} | 92 +---
.../cpu_triangle.hpp => cpu/kernel/triangle.hpp} | 25 +-
src/backend/cpu/kernel/unwrap.hpp | 81 ++++
src/backend/cpu/kernel/wrap.hpp | 80 +++
src/backend/cpu/lookup.cpp | 54 +--
src/backend/cpu/lu.cpp | 138 ++----
src/backend/cpu/lu.hpp | 2 +
src/backend/cpu/match_template.cpp | 126 +----
src/backend/cpu/math.cpp | 72 +--
src/backend/cpu/meanshift.cpp | 119 +----
src/backend/cpu/medfilt.cpp | 114 +----
src/backend/cpu/memory.cpp | 342 ++++++-------
src/backend/cpu/memory.hpp | 18 +-
src/backend/cpu/morph.cpp | 130 +----
src/backend/cpu/nearest_neighbour.cpp | 155 +-----
src/backend/cpu/orb.cpp | 534 +--------------------
src/backend/cpu/platform.cpp | 64 ++-
src/backend/cpu/platform.hpp | 12 +
src/backend/cpu/plot.cpp | 53 +-
src/backend/cpu/plot3.cpp | 53 +-
src/backend/cpu/qr.cpp | 60 ++-
src/backend/cpu/queue.hpp | 46 ++
src/backend/cpu/random.cpp | 145 +-----
src/backend/cpu/range.cpp | 92 ++--
src/backend/cpu/reduce.cpp | 341 ++++++-------
src/backend/cpu/regions.cpp | 176 +------
src/backend/cpu/reorder.cpp | 81 ++--
src/backend/cpu/resize.cpp | 234 ++-------
src/backend/cpu/rotate.cpp | 130 ++---
src/backend/cpu/scan.cpp | 113 ++---
src/backend/cpu/select.cpp | 147 ++----
src/backend/cpu/set.cpp | 170 ++++---
src/backend/cpu/shift.cpp | 88 +---
src/backend/cpu/sift.cpp | 6 +-
src/backend/cpu/sobel.cpp | 74 +--
src/backend/cpu/solve.cpp | 136 +++---
src/backend/cpu/sort.cpp | 87 +---
src/backend/cpu/sort_by_key.cpp | 133 ++---
src/backend/cpu/sort_index.cpp | 110 +----
src/backend/cpu/surface.cpp | 53 +-
src/backend/cpu/susan.cpp | 115 +----
src/backend/cpu/svd.cpp | 132 ++---
src/backend/cpu/threads | 1 +
src/backend/cpu/tile.cpp | 80 ++-
src/backend/cpu/transform.cpp | 166 ++-----
src/backend/cpu/transform.hpp | 2 +-
src/backend/cpu/transform_interp.hpp | 80 ++-
src/backend/cpu/transpose.cpp | 119 +----
src/backend/cpu/triangle.cpp | 65 +--
src/backend/cpu/unwrap.cpp | 127 ++---
src/backend/cpu/utility.hpp | 63 +++
src/backend/cpu/where.cpp | 82 ++--
src/backend/cpu/wrap.cpp | 125 ++---
src/backend/cuda/Array.cpp | 15 +-
src/backend/cuda/Array.hpp | 6 +-
src/backend/cuda/CMakeLists.txt | 67 +--
src/backend/cuda/JIT/numeric.cu | 13 +
src/backend/cuda/blas.cpp | 38 +-
src/backend/cuda/complex.hpp | 22 +-
src/backend/cuda/copy.cu | 3 +-
src/backend/cuda/debug_cuda.hpp | 10 +-
src/backend/cuda/err_cuda.hpp | 37 +-
src/backend/cuda/interopManager.cu | 9 +-
src/backend/cuda/kernel/fast_pyramid.hpp | 6 +-
src/backend/cuda/kernel/random.hpp | 15 +-
src/backend/cuda/kernel/rotate.hpp | 6 +-
src/backend/cuda/kernel/select.hpp | 4 +-
src/backend/cuda/kernel/transform.hpp | 79 ++-
src/backend/cuda/kernel/transform_interp.hpp | 65 ++-
src/backend/cuda/lu.cu | 15 +
src/backend/cuda/lu.hpp | 2 +
src/backend/cuda/memory.cpp | 495 +++++++------------
src/backend/cuda/memory.hpp | 18 +-
src/backend/cuda/platform.cpp | 136 ++++--
src/backend/cuda/platform.hpp | 9 +
src/backend/cuda/sift.cu | 4 +-
src/backend/cuda/transform.cu | 10 +-
src/backend/cuda/transform.hpp | 3 +-
src/{api/c => backend}/dispatch.cpp | 0
src/{api/c => backend}/dispatch.hpp | 0
src/backend/host_memory.cpp | 113 +++++
.../c/dispatch.hpp => backend/host_memory.hpp} | 8 +-
src/backend/opencl/Array.cpp | 19 +-
src/backend/opencl/Array.hpp | 36 +-
src/backend/opencl/CMakeLists.txt | 20 +-
src/backend/opencl/binary.hpp | 2 +-
src/backend/opencl/blas.cpp | 53 +-
src/backend/opencl/cholesky.cpp | 13 +-
.../{cpu/blas.cpp => opencl/cpu/cpu_blas.cpp} | 99 ++--
.../cpu_inverse.hpp => opencl/cpu/cpu_blas.hpp} | 5 +-
.../cpu_lapack => opencl/cpu}/cpu_cholesky.cpp | 49 +-
.../cpu_lapack => opencl/cpu}/cpu_cholesky.hpp | 2 +-
src/backend/opencl/cpu/cpu_helper.hpp | 74 +++
.../cpu_lapack => opencl/cpu}/cpu_inverse.cpp | 50 +-
.../cpu_lapack => opencl/cpu}/cpu_inverse.hpp | 2 +-
src/backend/{cpu/lu.cpp => opencl/cpu/cpu_lu.cpp} | 83 ++--
.../{cuda/cpu_lapack => opencl/cpu}/cpu_lu.hpp | 2 +-
src/backend/{cpu/qr.cpp => opencl/cpu/cpu_qr.cpp} | 68 +--
.../{cuda/cpu_lapack => opencl/cpu}/cpu_qr.hpp | 2 +-
.../{cuda/cpu_lapack => opencl/cpu}/cpu_solve.cpp | 88 ++--
.../{cuda/cpu_lapack => opencl/cpu}/cpu_solve.hpp | 2 +-
.../{cpu/svd.cpp => opencl/cpu/cpu_svd.cpp} | 59 +--
.../{cuda/cpu_lapack => opencl/cpu}/cpu_svd.hpp | 2 +-
.../cpu_lapack => opencl/cpu}/cpu_triangle.hpp | 7 +-
src/backend/opencl/debug_opencl.hpp | 7 +-
src/backend/opencl/err_opencl.hpp | 4 +-
src/backend/opencl/inverse.cpp | 6 +
src/backend/opencl/jit.cpp | 8 +-
src/backend/opencl/kernel/convolve.hpp | 1 +
src/backend/opencl/kernel/ireduce.hpp | 14 +
src/backend/opencl/kernel/jit.cl | 1 +
src/backend/opencl/kernel/orb.hpp | 37 +-
src/backend/opencl/kernel/select.cl | 4 +-
src/backend/opencl/kernel/transform.cl | 32 +-
src/backend/opencl/kernel/transform.hpp | 14 +-
src/backend/opencl/kernel/transform_interp.cl | 69 ++-
src/backend/opencl/kernel/where.hpp | 4 +-
src/backend/opencl/lu.cpp | 21 +-
src/backend/opencl/lu.hpp | 2 +
src/backend/opencl/magma/getrs.cpp | 4 +-
src/backend/opencl/magma/magma_helper.cpp | 15 +
src/backend/opencl/magma/potrf.cpp | 4 +-
src/backend/opencl/math.hpp | 15 +
src/backend/opencl/memory.cpp | 488 ++++++++-----------
src/backend/opencl/memory.hpp | 21 +-
src/backend/opencl/platform.cpp | 498 ++++++++++++++++---
src/backend/opencl/platform.hpp | 47 +-
src/backend/opencl/program.hpp | 5 +-
src/backend/opencl/qr.cpp | 19 +-
src/backend/opencl/sift.cpp | 4 +-
src/backend/opencl/solve.cpp | 19 +-
src/backend/opencl/svd.cpp | 10 +
src/backend/opencl/transform.cpp | 110 +++--
src/backend/opencl/transform.hpp | 2 +-
src/backend/opencl/unary.hpp | 2 +-
src/backend/util.cpp | 37 ++
src/backend/{opencl/errorcodes.hpp => util.hpp} | 8 +-
test/CMakeLists.txt | 73 ++-
test/approx1.cpp | 1 +
test/approx2.cpp | 1 +
test/backend.cpp | 26 +
test/basic_c.c | 4 +-
test/bilateral.cpp | 1 +
test/binary.cpp | 1 +
test/cholesky_dense.cpp | 1 +
test/convolve.cpp | 1 +
test/data | 2 +-
test/diagonal.cpp | 1 +
test/dot.cpp | 1 +
test/fast.cpp | 5 +-
test/fft.cpp | 1 +
test/fft_real.cpp | 1 +
test/fftconvolve.cpp | 1 +
test/getting_started.cpp | 1 +
test/harris.cpp | 5 +-
test/histogram.cpp | 1 +
test/homography.cpp | 1 +
test/imageio.cpp | 140 +++++-
test/inverse_dense.cpp | 1 +
test/lu_dense.cpp | 1 +
test/main.cpp | 6 +
test/math.cpp | 1 +
test/meanshift.cpp | 16 +-
test/medfilt.cpp | 1 +
test/morph.cpp | 1 +
test/ocl_ext_context.cpp | 131 +++++
test/orb.cpp | 13 +-
test/qr_dense.cpp | 1 +
test/random.cpp | 4 +
test/rank_dense.cpp | 1 +
test/reduce.cpp | 10 -
test/replace.cpp | 43 ++
test/resize.cpp | 3 +-
test/rotate.cpp | 1 +
test/rotate_linear.cpp | 15 +-
test/scan.cpp | 10 -
test/select.cpp | 40 ++
test/solve_dense.cpp | 6 +-
test/sort_by_key.cpp | 18 +-
test/sort_index.cpp | 17 +-
test/susan.cpp | 5 +-
test/svd_dense.cpp | 5 +-
test/testHelpers.hpp | 61 +--
test/transform.cpp | 268 +++++++++++
test/transform_coordinates.cpp | 118 +++++
test/translate.cpp | 1 +
test/transpose.cpp | 1 +
test/triangle.cpp | 1 +
test/where.cpp | 18 +-
test/wrap.cpp | 11 +-
348 files changed, 12107 insertions(+), 9909 deletions(-)
rename ArrayFireConfig.cmake.in => CMakeModules/ArrayFireConfig.cmake.in (100%)
rename ArrayFireConfigVersion.cmake.in => CMakeModules/ArrayFireConfigVersion.cmake.in (100%)
rename CPack.cmake => CMakeModules/CPackConfig.cmake (98%)
create mode 100644 examples/graphics/gravity_sim.cpp
copy src/api/c/{device.cpp => memory.cpp} (59%)
create mode 100644 src/api/c/transform_coordinates.cpp
copy src/{backend/cuda/sort_by_key/ascd_f32.cu => api/c/version.cpp} (77%)
copy src/api/cpp/{sat.cpp => transform_coordinates.cpp} (76%)
create mode 100644 src/backend/MemoryManager.cpp
create mode 100644 src/backend/MemoryManager.hpp
create mode 100644 src/backend/cpu/kernel/Array.hpp
create mode 100644 src/backend/cpu/kernel/approx1.hpp
create mode 100644 src/backend/cpu/kernel/approx2.hpp
create mode 100644 src/backend/cpu/kernel/assign.hpp
create mode 100644 src/backend/cpu/kernel/bilateral.hpp
create mode 100644 src/backend/cpu/kernel/convolve.hpp
create mode 100644 src/backend/cpu/kernel/copy.hpp
create mode 100644 src/backend/cpu/kernel/diagonal.hpp
create mode 100644 src/backend/cpu/kernel/diff.hpp
create mode 100644 src/backend/cpu/kernel/dot.hpp
copy src/backend/cpu/{fast.cpp => kernel/fast.hpp} (62%)
copy src/backend/cpu/{fft.cpp => kernel/fft.hpp} (74%)
create mode 100644 src/backend/cpu/kernel/fftconvolve.hpp
create mode 100644 src/backend/cpu/kernel/gradient.hpp
create mode 100644 src/backend/cpu/kernel/harris.hpp
create mode 100644 src/backend/cpu/kernel/histogram.hpp
copy src/backend/cpu/{hsv_rgb.cpp => kernel/hsv_rgb.hpp} (81%)
create mode 100644 src/backend/cpu/kernel/identity.hpp
create mode 100644 src/backend/cpu/kernel/iir.hpp
create mode 100644 src/backend/cpu/kernel/index.hpp
create mode 100644 src/backend/cpu/kernel/iota.hpp
create mode 100644 src/backend/cpu/kernel/ireduce.hpp
create mode 100644 src/backend/cpu/kernel/join.hpp
create mode 100644 src/backend/cpu/kernel/lookup.hpp
create mode 100644 src/backend/cpu/kernel/lu.hpp
create mode 100644 src/backend/cpu/kernel/match_template.hpp
copy src/backend/cpu/{meanshift.cpp => kernel/meanshift.hpp} (75%)
copy src/backend/cpu/{medfilt.cpp => kernel/medfilt.hpp} (82%)
copy src/backend/cpu/{morph.cpp => kernel/morph.hpp} (65%)
copy src/backend/cpu/{nearest_neighbour.cpp => kernel/nearest_neighbour.hpp} (60%)
copy src/backend/cpu/{orb.cpp => kernel/orb.hpp} (53%)
create mode 100644 src/backend/cpu/kernel/random.hpp
create mode 100644 src/backend/cpu/kernel/range.hpp
create mode 100644 src/backend/cpu/kernel/reduce.hpp
copy src/backend/cpu/{regions.cpp => kernel/regions.hpp} (85%)
create mode 100644 src/backend/cpu/kernel/reorder.hpp
create mode 100644 src/backend/cpu/kernel/resize.hpp
create mode 100644 src/backend/cpu/kernel/rotate.hpp
create mode 100644 src/backend/cpu/kernel/scan.hpp
create mode 100644 src/backend/cpu/kernel/select.hpp
create mode 100644 src/backend/cpu/kernel/shift.hpp
copy src/backend/cpu/{sobel.cpp => kernel/sobel.hpp} (66%)
create mode 100644 src/backend/cpu/kernel/sort.hpp
create mode 100644 src/backend/cpu/kernel/sort_by_key.hpp
create mode 100644 src/backend/cpu/kernel/sort_index.hpp
create mode 100644 src/backend/cpu/kernel/susan.hpp
create mode 100644 src/backend/cpu/kernel/tile.hpp
create mode 100644 src/backend/cpu/kernel/transform.hpp
copy src/backend/cpu/{transpose.cpp => kernel/transpose.hpp} (57%)
copy src/backend/{cuda/cpu_lapack/cpu_triangle.hpp => cpu/kernel/triangle.hpp} (81%)
create mode 100644 src/backend/cpu/kernel/unwrap.hpp
create mode 100644 src/backend/cpu/kernel/wrap.hpp
create mode 100644 src/backend/cpu/queue.hpp
create mode 160000 src/backend/cpu/threads
create mode 100644 src/backend/cpu/utility.hpp
rename src/{api/c => backend}/dispatch.cpp (100%)
copy src/{api/c => backend}/dispatch.hpp (100%)
create mode 100644 src/backend/host_memory.cpp
rename src/{api/c/dispatch.hpp => backend/host_memory.hpp} (83%)
copy src/backend/{cpu/blas.cpp => opencl/cpu/cpu_blas.cpp} (73%)
copy src/backend/{cuda/cpu_lapack/cpu_inverse.hpp => opencl/cpu/cpu_blas.hpp} (74%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_cholesky.cpp (61%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_cholesky.hpp (97%)
create mode 100644 src/backend/opencl/cpu/cpu_helper.hpp
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_inverse.cpp (63%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_inverse.hpp (96%)
copy src/backend/{cpu/lu.cpp => opencl/cpu/cpu_lu.cpp} (77%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_lu.hpp (97%)
copy src/backend/{cpu/qr.cpp => opencl/cpu/cpu_qr.cpp} (68%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_qr.hpp (96%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_solve.cpp (70%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_solve.hpp (97%)
copy src/backend/{cpu/svd.cpp => opencl/cpu/cpu_svd.cpp} (75%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_svd.hpp (97%)
copy src/backend/{cuda/cpu_lapack => opencl/cpu}/cpu_triangle.hpp (94%)
create mode 100644 src/backend/util.cpp
copy src/backend/{opencl/errorcodes.hpp => util.hpp} (69%)
create mode 100644 test/main.cpp
create mode 100644 test/ocl_ext_context.cpp
create mode 100644 test/transform.cpp
create mode 100644 test/transform_coordinates.cpp
--
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/debian-science/packages/arrayfire.git