Skip to content

pangeo-data/rechunker

Folders and files

NameName
Last commit message
Last commit date
Mar 1, 2025
Mar 1, 2025
Mar 1, 2025
Mar 1, 2025
Jul 16, 2020
Nov 29, 2021
Mar 31, 2023
Mar 1, 2025
Jul 25, 2020
Nov 27, 2020
Jul 16, 2020
Jul 8, 2022
Mar 1, 2025
Jan 27, 2021
Jul 13, 2020

Repository files navigation

Rechunker

Documentation Status Tests Coverage

Rechunker is a Python package which enables efficient and scalable manipulation of the chunk structure of chunked array formats such as Zarr and TileDB. Rechunker takes an input array (or group of arrays) stored in a persistent storage device (such as a filesystem or a cloud storage bucket) and writes out an array (or group of arrays) with the same data, but different chunking scheme, to a new location.

Rechunker is designed to be used within a parallel execution framework such as Dask.

See the documentation for more.