Genozip is a universal compressor for genomic files – it is optimized to compress FASTQ, SAM/BAM/CRAM, VCF/BCF, FASTA, GVF, PHYLIP, Chain, Kraken and 23andMe files, but it can also compress any other file (including non-genomic files).
Typically, compressing already-compressed files such as .fastq.gz, .bam or .vcf.gz yields a 2X-5X improvement over the existing compression, and the improvement can reach up to 200X for a high-sample-count VCF file.
Yes, Genozip can compress already-compressed files (.gz .bz2 .xz .bam .cram).
The compression is lossless – the decompressed file is 100% identical to the original file.
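The lossless claim can be checked independently after a compress/decompress round trip. The following is a minimal sketch, not part of Genozip: it assumes you have already restored the file to a separate path, and the function names `sha256_of` and `is_lossless_round_trip` are hypothetical helpers introduced here for illustration.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large genomic files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_lossless_round_trip(original, restored):
    """Return True when both files have the same size and SHA-256 digest."""
    original, restored = Path(original), Path(restored)
    # Cheap size check first; hashing is only needed when the sizes match.
    if original.stat().st_size != restored.stat().st_size:
        return False
    return sha256_of(original) == sha256_of(restored)
```

For example, `is_lossless_round_trip("sample.vcf", "restored/sample.vcf")` (both paths are placeholders) would return `True` only if the restored file is byte-for-byte identical to the original.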
Details: https://genozip.com. Available on conda (conda-forge channel) and https://github.com/divonlan/genozip
Reference:
Lan, D., et al. (2021) Genozip: a universal extensible genomic data compressor. Bioinformatics, btab102, https://doi.org/10.1093/bioinformatics/btab102