Skip to content

Commit

Permalink
v0.4: update NEWS, regenerate changelog
Browse files Browse the repository at this point in the history
  • Loading branch information
kspalaiologos committed Oct 16, 2024
1 parent a9f0403 commit a20f4ff
Show file tree
Hide file tree
Showing 3 changed files with 37 additions and 1 deletion.
26 changes: 26 additions & 0 deletions ChangeLog
Original file line number Diff line number Diff line change
@@ -1,5 +1,31 @@
2024-10-16 Kamila Szewczyk <[email protected]>

TODO: update tasks.

mention interlacing parallelism

-j

parallel decode

CI: Stop providing Linux binaries.

CI: Install OMP

speed up interlacing modes -i2 -i3 with OpenMP

fix overflowing shifts

xpar-x86_64: missing align

fix off-by-1 in asm code

xpar-x86_64.asm: clean up the source, add 3-way saturating CRC32C.

smode: restrict on vector-scalar product

v0.3

to-do: strike down some performance-related goals

avx512 version in xpar-x86_64.asm
Expand Down
10 changes: 10 additions & 0 deletions NEWS
Original file line number Diff line number Diff line change
@@ -1,6 +1,16 @@
This file contains release notes for major and minor releases of xpar.
For a complete list of source-level changes, consult the ChangeLog file.

===============================================================================
v0.4 (16-10-2024)
- x86_64 static Linux binaries are no longer provided.
- OpenMP support has been added to improve encoding and decoding performance
in joint mode with high interlacing factors on multi-core machines.
- 3-way saturating CRC32C implementation has been added to improve performance
on x86_64 machines that support SSE4.2.
- Slightly improve the performance of the sharded mode.
- Fix undefined behaviour in sharded mode regarding int shifts.

===============================================================================
v0.3 (16-10-2024)
- Improve joint encoding performance on x86_64 machines.
Expand Down
2 changes: 1 addition & 1 deletion configure.ac
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
m4_define([xpar_version_major], [0])
m4_define([xpar_version_minor], [3])
m4_define([xpar_version_minor], [4])
m4_define([xpar_version], [xpar_version_major.xpar_version_minor])

AC_PREREQ([2.69])
Expand Down

0 comments on commit a20f4ff

Please sign in to comment.