Comparative Insights: Fuzzy Clustering Versus Archetypal Analysis in Vector Quantisation for Blosc2

ABSTRACT Blosc2 is a high‐performance compression library and data format designed for binary data such as numerical arrays, tensors, and other structured types. In this proposal, we aim to develop two new codecs for Blosc2 by leveraging its extensible plugin‐based codec framework. Specifically, we propose integrating archetypal analysis (AA) and fuzzy clustering as novel codecs within Blosc2. Our proposal has been assessed by using the Olivetti faces dataset and results have demonstrated that A