#blosc2

Blosc Development TeamBlosc2@fosstodon.org
2025-11-29

SSDs, but specially RAM prices are skyrocketing: arstechnica.com/gadgets/2025/1

These are excellent times for using compression to reduce your storage pressure.

blosc.org/posts/roofline-analy

#Blosc2 #Compression #HPC

2025-11-20

Interact with your vasts remote datasets right in your phone! πŸ“±

I've built a demo Jupyter notebook that connects to a Cat2Cloud server from an Android phone and slices into an 8 TB dataset, downloading a 1 MB chunk in under 100 milliseconds. ⚑

The 8 TB dataset is from the Gaia DR3 catalogue. As it turns out, there are ~1000 stars in a cube of 100 light-years in our vicinity; the space is mostly empty.πŸ”­ 🌌

Try this out by visiting: cat2.cloud/demo/roots/@public/

#Cat2Cloud #Blosc2 #BigData

Blosc Development TeamBlosc2@fosstodon.org
2025-11-07

Our @EuroSciPy 2025 tutorial on modern Blosc2 features is now online! πŸš€

For the first time, we present our holistic view on how compression can revolutionize data handling, sharing, and computing.

Learn how Blosc2 boosts performance for large datasets, how to serve data online with Caterva2, and compute directly in the cloud.

Watch now! πŸ‘‰ youtube.com/watch?v=BdpTtzX2cuk

#Blosc2 #EuroSciPy #Python #DataScience #BigData #HPC #OpenSource

luke.shaw@ironarray.ioluke_shaw_ironarray
2025-10-28

πŸ’ŠIronPill 2πŸ’Š
In the second of our series of short videos ("ironPills") showcasing ironArray's work, we see how Blosc2 can be used to power heavy-duty linear algebra (100GB!) workflows
⚑1.5-2x faster than PyTorch + h5py!
🧱 automated chunking optimised for your machine's cache hierarchy
🐍 simple one-line syntax πš‹πš•πš˜πšœπšŒπŸΈ.πš–πšŠπšπš–πšžπš•(𝙰, 𝙱, πšžπš›πš•πš™πšŠπšπš‘='𝚘𝚞𝚝.πš‹πŸΈπš—πš')

See blog here: ironarray.io/blog/la-blosc




luke.shaw@ironarray.ioluke_shaw_ironarray
2025-10-20

πŸ’Š IronPill 1πŸ’Š
In the first of a series of short videos ("ironPills") showcasing ironArray's work, we see how Blosc2 can be used to calculate Fourier approximations:
⚑5x faster than NumPy
πŸ‘£ fraction of the memory footprint
🐍 pythonic one-line syntax πšœπšžπš–(πšŠβ€‚*β€‚πšŒπš˜πšœ(𝚝) +β€‚πš‹β€‚*β€‚πšœπš’πš—(𝚝),β€‚πšŠπš‘πš’πšœ=𝟷)

See full notebook here: github.com/Blosc/python-blosc2

(inspired by this blog post: towardsdatascience.com/numexpr)



luke.shaw@ironarray.ioluke_shaw_ironarray
2025-09-12

πŸ—£οΈ Announcing Python-Blosc2 3.8.0 πŸš€

A step closer to compliance with the array-api standard: data-apis.org/array-api!
This is an effort across all array-based libraries so that your code works (e.g. for both blosc2 and NumPy) by simply changing the import statement below!

Highlights:
βœ… C-Blosc2 updated to latest 2.21.2
βœ… Incorporate isnan, isfinite, isinf
βœ… Better indexing coverage
βœ… linspace and arange functions more numerically stable
βœ… Improved array-api compliance

ironArray SLUironArray
2025-09-08

Struggling to get performant code from LLMs? πŸ€” They can't do the empirical, target-specific optimization needed for modern CPUs.

We can help! πŸ’ͺ We've spent countless hours profiling and micro-benchmarking Blosc2 for you. Use our C/Python implementations as building blocks for your high-performance apps.

Ready to dive in? πŸš€
πŸ”Ή EuroSciPy Talk: blosc.org/docs/2025-EuroSciPy-
πŸ”Ή Tutorial: github.com/Blosc/EuroSciPy2025

Blosc Development TeamBlosc2@fosstodon.org
2025-09-01

πŸ“’ Great to see the community building powerful tools on Blosc2! πŸš€

Check out compress-image: a new C++/Python library for working with compressed images directly in memory.

It allows you to keep lots of images in RAM while minimizing I/O and memory footprint.

Kudos to Emil Dohne for this fantastic work!

Project here: github.com/EmilDohne/compresse

#Blosc2 #FOSS #ImageProcessing #DataCompression #Cpp #Python

Blosc Development TeamBlosc2@fosstodon.org
2025-08-20

πŸš€ We are thrilled to announce *TreeStore*, a new class in Python-Blosc2! It lets you give your datasets a hierarchical structure while keeping the speed and efficiency of `NDArray` instances. ⚑️

πŸ“ We've blogged all about it here:
blosc.org/posts/new-treestore-

It's in beta, and you can start using it now in the latest Python-Blosc2 v3.7.2. Enjoy!

#Python #Blosc2 #TreeStore #DataScience #OpenSource

Diagram showing several arrays structured on a hierarchy via blosc2.TreeStore.  Also, some benchmarks showing that it is fast and efficient.
ironArray SLUironArray
2025-08-07

πŸš€ Caterva2 2025.8.7 is out!

This release features a major refactoring for a simpler, more robust system. Best of all, client APIs are unaffected. βœ…

🌟 Highlights:

The new cat2agent πŸ€–: A CLI client to watch a directory and auto-sync it with a Caterva2 server.
New stack and concat commands in the web UI for easier data manipulation.
Full release notes: github.com/ironArray/Caterva2/

Learn more about Caterva2: ironarray.io/caterva2

Blosc Development TeamBlosc2@fosstodon.org
2025-07-18

πŸ—£οΈ Announcing Python-Blosc2 3.6.1

!Unlock new levels of data manipulation with Blosc2! πŸš€

We've introduced a major improvement: powerful fancy indexing and orthogonal indexing for Blosc2 arrays.

We've tamed the complexity of fancy indexing to make it intuitive, efficient, and consistent with NumPy's behavior. πŸ’ͺ

Read all about it on our blog! πŸ“ blosc.org/posts/blosc2-fancy-i

Compress Better, Compute Bigger!

#Blosc2 #Python #DataScience #BigData #NumPy #Performance #HPC

Chart comparing performance of fancy indexing of NumPy, Blosc2, Zarr and HDF5.  Blosc2 performs quite well overal.
ironArray SLUironArray
2025-07-09

Thanks to the advanced double partitioning techniques in , our package can serve small slices of big datasets (3.8 GB) through internet in less than the blink of an eye.

See how you can do that with the help of in using two different techniques:

1) Plain Python-Blosc2 library for quick and dirty access
2) Caterva2 Python client for a more heavy-duty and flexible operation

Try it out! πŸ‘‰ cat2.cloud/demo/roots/@public/

Downloading a slice of a large dataset in the blink of an eye
2025-07-08

#Blosc2 now runs directly in your browser! Leveraging the power of #WASM, #Pyodide, and #JupyterLite, you can harness efficient, adaptable compression through the web's universal interface. Experience the future of large-scale data processing without leaving your browser window.

Compress Better, Compute Bigger, Share Faster

#WebAssembly #DataCompression #WebDevelopment #DataScience

Blosc2 can run in the browser!
2025-06-23

πŸ“’ We are pleased to announce the integration of a new stack feature in #Blosc2 πŸš€, which allows for stacking large arrays along a new axis.

Performance benchmarks show that while aligned chunks yield the best results, #Blosc2 with unaligned chunks can still outperform #NumPyβ€”a welcome discovery! πŸŽ‰

Many thanks to Luke Shaw for his excellent work on this new functionality. πŸ™

We've updated our recent blog post:
Check it out! πŸ”— blosc.org/posts/blosc2-new-con

#Python #DataScience #Performance #OpenSource

Blosc Development TeamBlosc2@fosstodon.org
2025-06-17

πŸ“’ Blosc2 just launched a super-efficient array concatenation feature! πŸŽ‰

Combine massive arrays quickly and with minimal memory. If your array chunks are aligned, it's even faster – no need to decompress first! Perfect for big data tasks.

Check out our blog post: blosc.org/posts/blosc2-new-con

Compress Better, Compute Bigger

#DataScience #Blosc2 #DataStorage #Performance #MachineLearning

Performance of blosc2.concatenate() compared with NumPy
Blosc Development TeamBlosc2@fosstodon.org
2025-06-13

πŸš€ C-Blosc2 2.18.0 is out now!

✨ What's new:

* Introducing b2nd_concatenate() - now you can easily join b2nd arrays together!

* Fixed mmap files to flush modified pages only in write mode (thanks Jan Sellner!)

Get the full details: github.com/Blosc/c-blosc2/blob

#Blosc2 #DataCompression #OpenSource #CLibrary #Performance

2025-06-06

πŸš€ Excited to share more about Caterva2, your ultimate gateway to Blosc2/HDF5 repositories! πŸš€

Caterva2 is designed to redefine how you interact with large datasets.

Want to see it in action? πŸ€” We've just released a new introductory video showcasing Caterva2's main functionalities! 🎬

πŸ‘‰ ironarray.io/caterva2

#Caterva2 #Blosc2 #HDF5 #BigData #DataManagement #FreeSoftware #Python #DataScience #Tech

Screenshot the web graphical user interface of Caterva2
Blosc Development TeamBlosc2@fosstodon.org
2025-06-05

#Python-Blosc2 is hitting 1 million weekly downloads on PyPI! πŸŽ‰ pypacktrends.com/?packages=blo

Users are rapidly adopting #Blosc2, which now accounts for over 95% of downloads compared to Blosc1. πŸ“ˆ This success is thanks to our amazing users and community contributors. πŸ™ We're dedicated to making Python-Blosc2 even better. πŸš€

Our motto: Compress Better, Compute Bigger! πŸ’ͺ

#Milestone #CommunitySupport #DataCompression

Graphic showing how Blosc2 is becoming rapidly adopted by community
2025-05-26

Now it's @FrancescAlted to introduce the #Blosc2 #compression algorithm to reduce #HDF5 file size.

Blosc Development TeamBlosc2@fosstodon.org
2025-05-20

πŸ’‘ Did you know you can supercharge your #HDF5 datasets with #Blosc2? πŸš€

Leverage hdf5plugin (hdf5plugin.readthedocs.io) to integrate Blosc2 as a filter within HDF5. Create, write, and read data using popular Python wrappers like h5py or PyTables, while achieving excellent performance! πŸ’¨

More speed?

* h5py users: b2h5py offers optimized reads for n-dim slices.
* PyTables users: Optimized support is already built-in.

Learn more: blosc.org/posts/pytables-b2nd-

Compress Better, Compute Bigger :-)

Throughput of slice reads in HDF5 with the Blosc2 plugin

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst