Skip to content

Latest commit

 

History

History
314 lines (277 loc) · 13.6 KB

CHANGELOG.rst

File metadata and controls

314 lines (277 loc) · 13.6 KB

Changelog

version develop

  • Use upstream ISA-L version 2.31.1 which includes patches to make installation possible.

version 1.7.1

  • Fix a bug where flushing files when writing in threaded mode did not work properly.
  • Prevent threaded opening from blocking python exit when an error is thrown in the calling thread.

version 1.7.0

  • Include a patched ISA-L version 2.31. The applied patches make compilation and wheelbuilding on MacOS ARM64 possible.
  • Fix a bug where READ and WRITE in isal.igzip were inconsistent with the values in gzip on Python 3.13
  • Small simplifications to the igzip.compress function, which should lead to less overhead.

version 1.6.1

  • Fix a bug where streams that were passed to igzip_threaded.open where closed.

version 1.6.0

  • Fix a bug where compression levels for IGzipFile where checked in read mode.
  • Update statically linked ISA-L release to 2.31.0
  • Fix an error that occurred in the __close__ function when a threaded writer was initialized with incorrect parameters.

version 1.5.3

  • Fix a bug where append mode would not work when using igzip_threaded.open.

version 1.5.2

  • Fix a bug where a filehandle remained opened when igzip_threaded.open was used for writing with a wrong compression level.
  • Fix a memory leak that occurred when an error was thrown for a gzip header with the wrong magic numbers.
  • Fix a memory leak that occurred when isal_zlib.decompressobj was given a wrong wbits value.

version 1.5.1

  • Fix a memory leak in the GzipReader.readall implementation.

version 1.5.0

  • Make a special case for threads==1 in igzip_threaded.open for writing files. This now combines the writing and compression thread for less overhead.
  • Maximize time spent outside the GIL for igzip_threaded.open writing. This has decreased wallclock time significantly.

version 1.4.1

  • Fix several errors related to unclosed files and buffers.

version 1.4.0

  • Drop support for python 3.7 and PyPy 3.8 as these are no longer supported. Add testing and support for python 3.12 and PyPy 3.10.
  • Added an experimental isal.igzip_threaded module which has an open function. This can be used to read and write large files in a streaming fashion while escaping the GIL.
  • The internal igzip._IGzipReader has been rewritten in C. As a result the overhead of decompressing files has significantly been reduced and python -m isal.igzip is now very close to the C igzip application.
  • The igzip._IGZipReader in C is now used in igzip.decompress. The _GzipReader also can read from objects that support the buffer protocol. This has reduced overhead significantly.

version 1.3.0

  • Gzip headers are now actively checked for a BGZF extra field. If found the block size is taken into account when decompressing. This has further improved bgzf decompression speed by 5% on some files compared to the more generic solution of 1.2.0.
  • Integrated CPython 3.11 code for reading gzip headers. This leads to more commonality between the python-isal code and the upstream gzip.py code. This has enabled the change above. It comes at the cost of a slight increase in overhead at the gzip.decompress function.

version 1.2.0

  • Bgzip files are now detected and a smaller reading buffer is used to accomodate the fact that bgzip blocks are typically less than 64K. (Unlike normal gzip files that consist of one block that spans the entire file.) This has reduced decompression time for bgzip files by roughly 12%.
  • Speed-up source build by using ISA-L Unix-specific makefile rather than the autotools build.
  • Simplify build setup. ISA-L release flags are now used and not overwritten with python release flags when building the included static library.
  • Fix bug where zdict's could not be set for isal_zlib.decompressobj and igzip_lib.IgzipDecompressor.
  • Escape GIL when calling inflate, deflate, crc32 and adler32 functions just like in CPython. This allows for utilising more CPU cores in combination with the threading module. This comes with a very slight cost in efficiency for strict single-threaded applications.

version 1.1.0

  • Added tests and support for Python 3.11.

version 1.0.1

  • Fixed failing tests and wheel builds for PyPy.

version 1.0.0

Python-isal has been rewritten as a C-extension (first implementation was in Cython). This has made the library faster in many key areas.

  • Since the module now mostly contains code copied from CPython and then modified to work with ISA-L the license has been changed to the Python Software Foundation License version 2.
  • Python versions lower than 3.7 are no longer supported. Python 3.6 is out of support since December 2021.
  • Stub files with type information have now been updated to correctly display positional-only arguments.
  • Expose READ and WRITE constants on the igzip module. These are also present in Python's stdlib gzip module and exposing them allows for better drop-in capability of igzip. Thanks to @alexander-beedie in #115.
  • A --no-name flag has been added to python -m isal.igzip.
  • Reduced wheel size by not including debug symbols in the binary. Thanks to @marcelm in #108.
  • Cython is no longer required as a build dependency.
  • isal_zlib.compressobj and isal_zlib.decompressobj are now about six times faster.
  • igzip.decompress has 30% less overhead when called.
  • Error structure has been simplified. There is only IsalError which has Exception as baseclass instead of OSError. isal_zlib.IsalError, igzip_lib.IsalError, isal_zlib.error and igzip_lib.error are all aliases of the same error class.
  • GzipReader now uses larger input and output buffers (128k) by default and IgzipDecompressor.decompress has been updated to allocate maxsize buffers when these are of reasonable size, instead of growing the buffer to maxsize on every call. This has improved gzip decompression speeds by 7%.
  • Patch statically linked included library (ISA-L 2.30.0) to fix the following:
    • ISA-L library version variables are now available on windows as well, for the statically linked version available on PyPI.
    • Wheels are now always build with nasm for the x86 architecture. Previously yasm was used for Linux and MacOS due to build issues.
    • Fixed a bug upstream in ISA-L were zlib headers would be created with an incorrect wbits value.
  • Python-isal shows up in Python profiler reports.
  • Support and tests for Python 3.10 were added.
  • Due to a change in the deployment process wheels should work for older versions of pip.
  • Added a crc property to the IgzipDecompressor class. Depending on the decompression flag chosen, this will update with an adler32 or crc32 checksum.
  • All the decompression NO_HDR flags on igzip_lib were incorrectly documented. This is now fixed.

version 0.11.1

  • Fixed an issue which occurred rarely that caused IgzipDecompressor's unused_data to report back incorrectly. This caused checksum errors when reading gzip files. The issue was more likely to trigger in multi-member gzip files.

version 0.11.0

In this release the python -m isal.igzip relatively slow decompression rate has been improved in both speed and usability. Previously it was 19% slower than igzip when used with the -d flag for decompressing, now it is just 8% slower. Also some extra flags were added to make it easier to select the output file.

  • Prompt when an output file is overwritten with the python -m isal.igzip command line utility and provide the -f or --force flags to force overwriting.
  • Added -o and --output flags to the python -m isal.igzip command line utility to allow the user to select the destination of the output file.
  • Reverse a bug in the build system which caused some docstring and parameter information on igzip_lib and isal_zlib to disappear in the documentation and the REPL.
  • Increase the buffer size for python -m isal.igzip so it is now closer to speeds reached with igzip.
  • Add a READ_BUFFER_SIZE attribute to igzip which allows setting the amount of raw data that is read at once.
  • Add an igzip_lib.IgzipDecompressor object which can decompress without using an unconsumed_tail and is therefore more efficient.

version 0.10.0

  • Added an igzip_lib module which allows more direct access to ISA-L's igzip_lib API. This allows features such as headerless compression and decompression, as well as setting the memory levels manually.
  • Added more extensive documentation.

version 0.9.0

  • Fix a bug where a AttributeError was triggered when zlib.Z_RLE or zlib.Z_FIXED were not present.
  • Add support for Linux aarch64 builds.
  • Add support for pypy by adding pypy tests to the CI and setting up wheel building support.

version 0.8.1

  • Fix a bug where multi-member gzip files where read incorrectly due to an offset error. This was caused by ISA-L's decompressobj having a small bitbuffer which was not taken properly into account in some circumstances.

version 0.8.0

  • Speed up igzip.compress and igzip.decompress by improving the implementation.
  • Make sure compiler arguments are passed to ISA-L compilation step. Previously ISA-L was compiled without optimisation steps, causing the statically linked library to be significantly slower.
  • A unused constant from the isal_zlib library was removed: ISAL_DEFAULT_HIST_BITS.
  • Refactor isal_zlib.pyx to work almost the same as zlibmodule.c. This has made the code look cleaner and has reduced some overhead.

version 0.7.0

  • Remove workarounds in the igzip module for the unconsumed_tail and unused_data bugs. igzip._IGzipReader now functions the same as gzip._GzipReader with only a few calls replaced with isal_zlib calls for speed.
  • Correctly implement unused_data and unconsumed_tail on isal_zlib.Decompress objects. It works the same as in CPython's zlib now.
  • Correctly implement flush implementation on isal_zlib.Compress and isal_zlib.Decompress objects. It works the same as in CPython's zlib now.

version 0.6.1

  • Fix a crash that occurs when opening a file that did not end in .gz while outputting to stdout using python -m isal.igzip.

version 0.6.0

  • python -m gzip's behaviour has been changed since fixing bug: bpo-43316. This bug was not present in python -m isal.igzip but it handled the error differently than the solution in CPython. This is now corrected and python -m isal.igzip handles the error the same as the fixed python -m gzip.
  • Installation on Windows is now supported. Wheels are provided for Windows as well.

version 0.5.0

  • Fix a bug where negative integers were not allowed for the adler32 and crc32 functions in isal_zlib.
  • Provided stubs (type-hint files) for isal_zlib and _isal modules. Package is now tested with mypy to ensure correct type information.
  • The command-line interface now reads in blocks of 32K instead of 8K. This improves performance by about 6% when compressing and 11% when decompressing. A hidden -b flag was added to adjust the buffer size for benchmarks.
  • A -c or --stdout flag was added to the CLI interface of isal.igzip. This allows it to behave more like the gzip or pigz command line interfaces.

version 0.4.0

  • Move wheel building to cibuildwheel on github actions CI. Wheels are now provided for Mac OS as well.
  • Make a tiny change in setup.py so python-isal can be build on Mac OS X.

version 0.3.0

  • Set included ISA-L library at version 2.30.0.
  • Python-isal now comes with a source distribution of ISA-L in its source distribution against which python-isal is linked statically upon installation by default. Dynamic linking against system libraries is now optional. Wheels with the statically linked ISA-L are now provided on PyPI.

version 0.2.0

  • Fixed a bug where writing of the gzip header would crash if an older version of Python 3.7 was used such as on Debian or Ubuntu. This is due to differences between point releases because of a backported feature. The code now checks if the backported feature is present.
  • Added Python 3.9 to the testing.
  • Fixed setup.py to list setuptools as a requirement.
  • Changed homepage to reflect move to pycompression organization.

version 0.1.0

  • Publish API documentation on readthedocs.
  • Add API documentation.
  • Ensure the igzip module is fully compatible with the gzip stdlib module.
  • Add compliance tests from CPython to ensure isal_zlib and igzip are validated to the same standards as the zlib and gzip modules.
  • Added a working gzip app using python -m isal.igzip
  • Add test suite that tests all possible settings for functions on the isal_zlib module.
  • Create igzip module which implements all gzip functions and methods.
  • Create isal_zlib module which implements all zlib functions and methods.