[SCM] Fast arithmetic with dense matrices over F_{2^e} annotated tag, debian/20130416-1, created. debian/20130416-1

Mon Jun 17 20:28:44 UTC 2013

The annotated tag, debian/20130416-1 has been created
        at  452349b9dd3951e4ee608391bdbaf38fbfd72fb3 (tag)
   tagging  3c3431bfac6afd5943e5e502c5bbdcb659720597 (commit)
 tagged by  Cédric Boutillier
        on  Mon Jun 17 21:51:38 2013 +0200

- Shortlog ------------------------------------------------------------
libm4rie Debian release 20130416-1
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.12 (GNU/Linux)

iQIcBAABCAAGBQJRv2jKAAoJENpJWPYR4UnparcP/1Cx34Is/HHoKzFuzz2qu3/V
fxNIv0rb0zZhWYHAUwi6FLm7lJiPBJvygEn36XVRKVL3j4UNHgeafD/J+6zmu6t+
ZXi4a1SYfC4wJ1Hgju8L/xMNTXQv+khnHXoDkC3cC7ciVYJhaxUcmq1eTEgNUdKh
xvgrJnIw1OfxgodCy9rRKhCmuTrWCzkGSX54xlqXBE+bL/MEXj2bcwBhbj5khyMT
isyGZJYCImFClIEV0R/jIyKRhvjOYIhJ6D0YZp0u/iEAiGZMc5XaG9BPXdgJdV8O
xUHK3bNaCi2mssE9hAtyCrq7tUKNoMG+9NOZk7HzL/Z27dU3E5qtAbAaGfOGjU0e
b6HdfJRrKxIioA6zAdq4LcN3RS7bRTkowGmu8Wuo0ydHu2YC+R+jKLljflnAaZYF
FkSZktIiMSNkyJqcmkFqhtcIcSKwLhK685F7IXNdKA50KKI5wjRQom03slR/Uwau
MLOuG+R9/vh67dTbPO0AcmZrwylPoC156adA/AmVXhiVHvn1zlBPdeThbf6jWJXf
8MGnrX8GKJ4bek2t2Sk995f/THg9SjXStQeYjaXDW5BZsfxcfCP4G9kf6icMu+h3
sSp8HAX4aYFYV4hZ15u79FZhtQyIkixDN+m4eVZBtIXVUHSuI8qOR+892AbZWbGT
zCYIV3gjqsA2E0ANJU/5
=UhtO
-----END PGP SIGNATURE-----

Cédric Boutillier (31):
      Add unapply-patches local option
      Use multiarch path in *.install; do not ship .la file
      Add libgivaro-dev and libgmp-dev to build-deps
      Add dpkg-dev (>= 1.16.1~) to build-deps
      enable multiarch support
      enable hardening
      Add Debian version to changelog entry
      Add debian/watch file
      Add strict version dependency on shared lib for -dev package
      Bump Standards-Version: to 3.9.3
      Merge tag 'release-20120613'
      prepare packaging for 0.0.20120613
      convert debian/copyright file to copyright-format/1.0
      use same version scheme as upstream
      fix version number for dependency on libm4ri-dev, to reflect new convention
      use upstream convention for upstream tag
      change source name to libm4rie
      ignore modifications of INSTALL file
      point to /usr/share/common-licenses/GPL-3 for full text of GPL-3+
      Add myself to Uploaders:
      update descriptions
      override lintian messages about absence of upstream changelog
      remove export of DEB_BUILD_MAINT_OPTIONS (not needed)
      fix capitalization of M4RIE
      close ITP bug
      Add VCS-* fields; Bump Standards-Version: to 3.9.4 (no changes needed)
      Merge upstream release-20130416
      prepare for 20130416 version
      fix Vcs-Git to use more canonical url
      create a -dbg package
      update changelog

Felix Salfelder (5):
      initial debian commit
      Merge commit 'release-20120415'
      depend on m4ri
      typo fix in libm4rie-dev.install
      add a note about the upstream (mercurial) repo

Martin Albrecht (189):
      starting M4RIE library for linear algebra over GF(2^n) for small n.
      added stubs for functions that need to be implemented
      implemented two tables in mzed_echelonize_travolta()
      implemented 6 grease tables for elimination
      added top level echelonize command which picks an algorithm
      adding fast code for add_multiple_of_row for 16-bit wide words
      adding configure in order to be able to test on systems with old autoconf
      same for install-sh
      do not enforce autoconf version
      using M4RI tuning parameters in mzed_echelonize_travolta
      implemented most of the  trivial functions
      adding Makefile, ignoring configure
      using four travolta tables in multiplication now
      naive multiplication for comparison
      added test code for multiplication, implemented more small functions
      implemented add_multiple_of_row for width == 2
      implemented cache optimisation and disabled it in most cases since it is slower in general
      peform slightly less work in mzed_make_table (avoid 0 and 1)
      fixing a bug in r16 where L wasn't updated
      fixing memory leaks in test code
      adding libm4ri as a dependency for libm4rie
      Cygwin requires no-undefined (and we have no undefined symbols)
      implemented strassen multiplication
      adding tuning parameters for strassen
      more robust cache tuning by increasing the number of trials
      adapted Strassen cutoff to use cache sizes
      implemented Karatsuba multiplication for bitsliced representations of dense matrices over GF(2^2)
      some refactoring and making functions more readable (hopefully)
      performance improvement: don't slice/cling words which are zero
      use Karatsuba for GF(2^2) it is always better
      improved speed of cache tuning seems to give good results on prai243,road,eno,iras,redhawk,t2
      cleaned up build targets (especially bench/)
      new release
      Added tag release-20100817 for changeset 27f53a7570aa
      fixing two issues with the building of bench_elimination
      disable default building of benchmarking binaries, they cause problems
      adapted to work with recent M4RI bit order reversal
      adapted to work with hg tip of M4RI
      imported updated cache tuning from M4RI
      remove cache size detection and SSE detection and rely on M4RI
      Added tag release-20110715 for changeset 05db9d68bf61
      Added tag release-20110601 for changeset f2b76762cd70
      mzd_slice_t datatype and first utility functions
      mzd_slice4 for slicing degree 3 and 4
      revised configure & makefiles to allow --with-m4ri and --with-givaro (needs more testing)
      revised benchmarketing code to be easier to build
      adding bench_smallops (for now: slicing and clinging)
      force rpath to the right M4RIE
      10% or so faster slicing
      slicing & clinging for k=2,3,4
      cosmetic stuff
      more cosmetic stuff
      and yet more whitespace stuff
      improved memory consumption of karatsuba over GF(4)
      adapted hierarchy of karatsuba functions in preparation for karatsuba for GF(4) and GF(8)
      karatsuba for GF(8)
      switching size_t to rci_t
      fixed addmul!
      better tests + bugfixes for multiplication
      new function mzed_mul_scalar which performs scalar scaling of a matrix (fixes #9)
      faster smallops tests
      Added tag release-20110821 for changeset fe6dab8a9bb4
      Removed tag release-20110821
      fix a typo mzed_mul_scalar (fixes #10)
      Added tag release-20110821 for changeset dd06c345824f
      fixed compiler warnings about wrong types
      karatsuba multiplication for 2^4 and less memory consumption for 2^3
      improved memory consumption for GF(2^4) Karatsuba from 9 to 7 temporaries
      better benchmarketing
      TRSM lower left for mzed_t: X = ~L * B
      trsm lower_left templated (for mzd_slice_t and mzed_t)
      added mzed_add to bench_smallops
      better benchmarkteting for trsm
      refactored templating stuff for easier re-use
      fixed but in add multiple of rows
      more fixes to mzed_add_multiple_of_row()
      better test_elimination
      implemented naive PLE decomposition
      documentation updated
      adding canary (cf., stack protection) to detect buffer overfows (i.e., algorithms writing past ncols)
      asymptotically fast PLE decomposition + travolta PLE
      faster mzed_rescale_row
      better PLE tests
      merge
      documentation update
      bench_ple added
      better help for bench_ple/bench_trsm
      better complexity feedback + fixed bug in _mzed_mul_karatsuba
      gf2e_t16_init()/free() for construction/destruction of 16-bit multiplication tables
      TRSM upper left
      PLUQ, better support for offsets, much faster TRSM
      faster trsm upper left + better PLE crossovers
      slightly faster mzed_make_table
      performance enhancements for ple
      fixed ple cutoff
      slightly more sensible benchmarking Makefile
      use m4ri_mm_malloc/free instead of malloc/free
      TRSM lower left with Travolta tables
      bugfix in mzed_add_multiple_of_row
      mzed_echelonize_ple() implemented and passes tests
      travolta table creation in k MUL + 2^k ADD instead of 2^k MUL
      use ple in echelonform
      don't use bitmask begin in clinging/slicing: it didn't work properly, we don't need it, it costs cycles
      ignore more stuff like Makefiles
      slicing that isn't totally naive (clinging yet to do)
      matrix_trsm_upper_left_travolta
      allow type in bench_elimination
      define mzd_slice_echelonize (alias for mzd_slice_echelonize_ple)
      allow type argument in bench_multiplication
      removed old cold & renamed some internal functions for slicing
      clinging (2-4) that doesn't totally suck
      DNDEBUG in bench
      whitespace changes for more readability
      more compact slicing/clinging
      copy less in echelonform.c
      pass M4RI's SIMD and OpenMP flags to compiler
      add a test for one particular if branch + asserts
      changing version number for upcoming release
      simple re-arrangement
      allow benching for trsm_upper_left
      played with the cutoffs for nicer scaling
      Added tag release-20111004 for changeset d1d724d9545a
      table creation got cheaper and hence blocks make sense now
      slicing/clinging for e<=8
      refactored: clinging/slicing in conversion
      naive polynomial multiplication for bitsliced matrices
      Karatsuba for e=5
      Karatsuba for deg 8
      unified strassen cutoff
      use rpath for test_* to make sure the right version of the library is used
      Karatsuba for GF(2^6)
      saving memory in karatsuba multiplication
      Karatsuba for GF(2^7), i.e. for all e <= 8
      using less temporary memory for Karatsuba for GF(2^4)
      added more documentation and first steps toward Travolta => Newton-John
      wrote a lot of documentation
      mzd_slice_mul_scalar
      adding a softlink m4rie -> src
      remove -lntl, it's not needed
      replaced nonstandard __STRING(x) with #x
      degrees are small, hence ints
      avoid new/delete in testing code (caused problems on cicero)
      fixed memleak in test_smallops
      bitmasks should be unsigned
      fixed a memleak
      make sure the canary does not set bits which are expected to be zero (e.g. in lockup tables)
      this prevents a segfault on cicero with GCC 4.6.1 (and I'm too lazy right now to properly debug this)
      changed library version to 20111203
      Added tag release-20111203 for changeset e6bd45516c69
      refactoring: renamed files to match names of main structs in them
      more compact docstrings & fixed single include directives
      refactored: mzd_poly added (for now internal-ish stuff)
      fixed includes to work with development version of M4RI
      use unsigned ints for degrees everywhere
      removed a bit of dead code scan-build reported
      Added tag release-20120415 for changeset 8245cd046d86
      reserve enough pointers for finite fields in testsuite
      Removed tag release-20120415
      Added tag release-20120415 for changeset 83b1a73e827a
      Removed tag release-20120415
      Added tag release-20120415 for changeset 4c2e04ec11a4
      don't divide by arbitrary constants in cycle count
      be more indifferent about Givaro versions
      changed all includes to <m4rie/blah.h> so that -I$INCLUDEDIR is enough
      continuation of previous patch (forgot to save files)
      library version updated to 20120415
      I am stupid and I don't close by brackets
      preparation for next (bugfix) release
      use autotools' release mechanisms to avoid release fuck-up's
      don't use rpath in TESTS on Makefile.am, Solaris chokes on this.
      "make dist" should produce something that works now
      "make check" now works with "make dist" created tarballs
      added gf2e_init(minpoly)
      small bugfixes
      removed dependency on Givaro for finite field creation
      dropping inversion tables from gf2e
      added gf2e_mul() and started moving functions over
      moved to ff->mul from explicit multiplication tables
      added first tests for degrees > 10
      more tests for degrees > 10 and <= 16
      avoid mzd_set_ui() in table creation
      conversion mzed_t <=> mzd_slice_t for up to degree 16
      fixed typos
      fixed typo in comment
      added naive polynomial multiplication
      updating library version 20121224
      Added tag release-20121224 for changeset 621cc82cee2a
      split conversion into several files to make it easier to compile M4RIE on machines with low RAM
      preparing for upcoming release

-----------------------------------------------------------------------

-- 
Fast arithmetic with dense matrices over F_{2^e}