Bioinformatics.org
|
|
Research
|
Online databases
Online analysis tools
Online education tools
|
Development
|
![[?]](https://www.bioinformatics.org/images/icons/info.png)
|
Forums
|
News & Commentary
Jobs Forum (Career Center)
|
|
News & Commentary - Message forums
|
|
|
|
Software: Genozip: A universal compressor for genomic files
Submitted by Divon Lan; posted on Tuesday, July 20, 2021
Genozip is a universal compressor for genomic files – it is optimized to compress FASTQ, SAM/BAM/CRAM, VCF/BCF, FASTA, GVF, PHYLIP, Chain, Kraken and 23andMe files, but it can also compress any other file (including non-genomic files).
Typically, a 2X-5X improvement over the existing compression is achieved when compressing already-compressed files like .fastq.gz .bam vcf.gz, and up to 200X for a high-sample-count VCF file.
Yes, Genozip can compress already-compressed files (.gz .bz2 .xz .bam .cram).
The compression is lossless – the decompressed file is 100% identical to the original file.
Details: https://genozip.com. Available on conda (conda-forge channel) and https://github.com/divonlan/genozip
Reference:
Lan, D., et al. (2021) Genozip: a universal extensible genomic data compressor Bioinformatics, btab102, https://doi.org/10.1093/bioinformatics/btab102
|
|
Expanded view | Monitor forum | Save place
Start a new thread:
You have to be to post a reply.
|
|