Divon Lan, July 20, 2021
    Genozip is a universal compressor for genomic files – it is optimized to compress FASTQ, SAM/BAM/CRAM, VCF/BCF, FASTA, GVF, PHYLIP, Chain, Kraken and 23andMe files, but it can also compress any other file (including non-genomic files).

    Typically, a 2X-5X improvement over the existing compression is achieved when compressing already-compressed files such as .fastq.gz, .bam and .vcf.gz, and up to 200X for a high-sample-count VCF file.
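
    To make the "2X-5X" figure concrete, here is a small arithmetic sketch in Python. It assumes the factor means the already-compressed size divided by the size after Genozip; the function names and the 10 GB example are illustrative only, not measurements and not part of Genozip.

```python
def improvement_factor(size_before: float, size_after: float) -> float:
    """Ratio of the existing compressed size to the new compressed size."""
    if size_after <= 0:
        raise ValueError("size_after must be positive")
    return size_before / size_after


def expected_size(size_before: float, factor: float) -> float:
    """Size you would expect after an improvement of `factor` (e.g. 2 for 2X)."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return size_before / factor


# Hypothetical 10 GB .fastq.gz file at the two ends of the quoted range.
print(expected_size(10, 2))       # 5.0 (GB at 2X)
print(expected_size(10, 5))       # 2.0 (GB at 5X)
print(improvement_factor(10, 2))  # 5.0
```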

    Yes, Genozip can compress already-compressed files (.gz .bz2 .xz .bam .cram).

    The compression is lossless – the decompressed file is 100% identical to the original file.
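
    If you want to check a lossless claim like this yourself, one common approach is to compare checksums of the original and the decompressed data. The Python sketch below illustrates the idea using the standard library's lzma module as a stand-in compressor; it does not call Genozip, and the function names and sample data are made up for the example.

```python
import hashlib
import lzma


def sha256_hex(data: bytes) -> str:
    """Hex SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def is_lossless_round_trip(original: bytes) -> bool:
    """Compress, decompress, and compare digests of input and output."""
    compressed = lzma.compress(original)    # stand-in compressor, not Genozip
    restored = lzma.decompress(compressed)
    return sha256_hex(restored) == sha256_hex(original)


# Made-up FASTQ-like records, repeated to give the compressor something to do.
sample = b"@read1\nGATTACAGATTACA\n+\nIIIIIIIIIIIIII\n" * 1000
print(is_lossless_round_trip(sample))  # True
```

    With real files, the same comparison applies to the digest of the original file and the digest of the file you get back after decompression.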

    Details: https://genozip.com. Available on conda (conda-forge channel) and https://github.com/divonlan/genozip

    Reference:
    Lan, D., et al. (2021) Genozip: a universal extensible genomic data compressor. Bioinformatics, btab102, https://doi.org/10.1093/bioinformatics/btab102

