Skip to content

Releases: CNugteren/CLTune

Version 1.6.4

28 May 12:41
Compare
Choose a tag to compare

Version 1.6.4

  • Reduced the requirements from GCC 4.8.0 to 4.7.0
  • Fixes various warnings on Clang

Version 1.6.3

26 May 14:33
Compare
Choose a tag to compare

Version 1.6.3

  • Reduced the requirements from GCC 4.9.0 to 4.8.0
  • Minor updates to the CMake file

Version 1.6.2

25 May 12:15
Compare
Choose a tag to compare

Version 1.6.2

  • Fixed another exception-related bug
  • Further improved reporting of failed runs
  • Updated C++11 OpenCL API

Version 1.6.1

22 May 12:23
Compare
Choose a tag to compare

Version 1.6.1

  • Fixed a couple of issues related to exceptions
  • Improved reporting of failed runs

Version 1.6.0

21 May 15:31
Compare
Choose a tag to compare

Version 1.6.0

  • Much cleaner API due to Pimpl idiom: only cltune.h header is now required
  • Replaced Khronos' cl.hpp with a custom C++11 version tailored for CLTune
  • Code clean-up / reorganisation
  • Added an option to add fixed defines to reference kernels
  • Added an option to load a kernel from string instead of from file
  • Added support for size_t OpenCL buffers

Version 1.5.1

19 May 11:12
Compare
Choose a tag to compare

Version 1.5.1

  • Improved the GEMM example to support the Intel MIC (Xeon Phi) accelerators
  • Updated compiler check and compiler flags
  • Adds support for multiple OpenCL kernel files at once (e.g. when wanting to include a header file)
  • Adds support for the std::complex data-types
  • Fixed some compilation warnings regarding size_t conversions
  • Updated the FindOpenCL.cmake file

Version 1.5.0

30 Apr 12:06
Compare
Choose a tag to compare

Version 1.5.0

  • OpenCL local work size and memory size constraints are now automatically handled
  • Greatly improved the new 2D convolution example:
    • Filter coefficients are now dynamic
    • Added support for local memory padding
    • In-lined the convolution header into the kernels and host code
    • Fixed various bugs
  • Moved the examples to separate subfolders
  • Uses chrono timers as seed in favor of random device
  • Bugfix for simulated annealing when 2 variables can only change together.

Version 1.4.1

19 Apr 13:13
Compare
Choose a tag to compare

Version 1.4.1

  • Added 2D convolution as an example
  • Added command-line arguments to the GEMM search-method sample
  • Fixed a CUDA 7 related bug in the GEMM kernel
  • Fixed a logging bug in the PSO search technique

Version 1.4.0

  • Added the particle swarm optimisation (PSO) search technique
  • Updated the example GEMM kernel

Version 1.3.2

12 Apr 17:12
Compare
Choose a tag to compare

Version 1.3.2

  • Now prints OpenCL version when running on a device
  • Added install targets to CMake
  • Moved header files around and renamed the main include to "cltune.h"
  • Catches OpenCL exceptions and skips those configurations

Version 1.3.1

03 Apr 07:58
Compare
Choose a tag to compare

Version 1.3.1

  • Fixed simulated annealing's random number generation
  • Added new FindOpenCL CMake script
  • Added option to print database-formatted output of best results