Skip to content

Releases: intel/yask

Version 2.4

01 Apr 00:24
2d6d6c3
Compare
Choose a tag to compare

Kernel: Add ability to use temporal wave-front blocking across MPI ranks.
Compiler: Add ability to specify equation grouping using regular expressions.

Now requires g++ 4.9 or later.

Version 2.3

26 Mar 21:20
cd15da7
Compare
Choose a tag to compare

Added scratch-grids feature.

Version 2.2

08 Mar 00:32
1ffd414
Compare
Choose a tag to compare

API functions now throw C++ and Python exceptions instead of just terminating the program.

Version 2.1.5

23 Dec 21:38
d412e53
Compare
Choose a tag to compare

Improves performance when domain sizes are not multiples of vector-cluster sizes.
Several other small fixes and improvements since v2.1.0.

Version 2.1

01 Dec 17:48
10bf767
Compare
Choose a tag to compare

Added some code to src/stencils/Iso3dfdStencil.hpp that illustrates applying a simple Cerjan ABC via three 1D arrays. It adds about 11% runtime to this "iso3dfd' acoustic stencil using a 1024^3 problem size on KNL and only about 0.0001% more memory.

Version 2.0

28 Nov 20:27
f739bc5
Compare
Choose a tag to compare

Significant new features including arbitrary grid dimensions and APIs for C++ and Python.

Version 1.0

24 Aug 21:37
Compare
Choose a tag to compare
v1.0.0

Update README.md

Binary reorg

22 Apr 00:36
Compare
Choose a tag to compare
  • Scripts and binaries have been moved into the 'bin' directory, and several have been renamed to start with 'yask' instead of 'stencil'. Similar changes will occur in the 'src' dir later.
  • Turned off streaming stores by default on all architectures.
  • Updated the docs to reflect the above changes.
  • Run script (now called 'yask.sh') now creates a log file in the 'logs' dir. Useful for keeping a record of configs and perf.

Xeon tuning update

17 Apr 17:02
Compare
Choose a tag to compare

Update tuning on iso3dfd and awp stencils for Xeon platforms (BDW, etc.).

Sub-blocking

12 Apr 23:32
Compare
Choose a tag to compare
  • Major new features include addition of sub-block hierarchy between blocks and vector-clusters.
  • Added ability to provide default cmd-line options in the Makefile. This is much more expressive and flexible than the old way of setting a few macros.
  • Updated the example recipes and results for KNL, BDW, and HSW architectures.
  • Make it easier to copy settings from the stencil-tuner back into the Makefile.