Changelog

All notable changes to this project will be documented in this file.

[v2.0] - 2024-08-19

Human-readable output now uses YAML format
Renamed parameter NUM_TD_PER_THREAD to PPWI for all implementations
Consolidated builds to use a shared CMake script; Makefiles removed
All implementations now share a common C++ driver, with device selection based on index or name substrings
More robust input deck/parameter validation (bad parameters now give a reason at launch instead of a SEGV mid-benchmark)
Optimised deck IO for faster loading of poses into memory
OpenCL implementation now embeds the kernel directly in the executable
OpenCL implementation now uses the official OpenCL C++ Bindings
Kokkos implementation now supports team policies
SYCL implementation now uses discard_write instead of read_write for storing energy results
HIP implementation is now standalone and no longer derived from the CUDA implementation via hipify
Fused the OpenMP and OpenMP target implementations via macros
OpenMP/OpenMP target now supports the team directive as wgsize

Initial public release.