Bioinformatics.org
Not logged in
  • Log in
  • Bioinformatics.org
    Membership (42689+) Group hosting [?] Wiki
    Franklin Award
    Sponsorships

    Careers
    About bioinformatics
    Bioinformatics training
    Bioinformatics jobs

    Research
    All information groups
    Online databases Online analysis tools Online education tools More tools

    Development
    All software groups
    FTP repository
    SVN & CVS repositories [?]
    Mailing lists

    Forums
    News & Commentary
  • Submit
  • Archives
  • Subscribe

  • Jobs Forum
    (Career Center)
  • Submit
  • Archives
  • Subscribe
  • News & Commentary - Message forums

    Software: Genozip: A universal compressor for genomic files
    Submitted by Divon Lan; posted on Tuesday, July 20, 2021

    Genozip is a universal compressor for genomic files – it is optimized to compress FASTQ, SAM/BAM/CRAM, VCF/BCF, FASTA, GVF, PHYLIP, Chain, Kraken and 23andMe files, but it can also compress any other file (including non-genomic files).

    Typically, a 2X-5X improvement over the existing compression is achieved when compressing already-compressed files like .fastq.gz .bam vcf.gz, and up to 200X for a high-sample-count VCF file.

    Yes, Genozip can compress already-compressed files (.gz .bz2 .xz .bam .cram).

    The compression is lossless – the decompressed file is 100% identical to the original file.

    Details: genozip.com. Available on conda (conda-forge channel) and github.com/divonlan/genozip

    Reference:
    Lan, D., et al. (2021) Genozip: a universal extensible genomic data compressor Bioinformatics, btab102, doi.org/10.1[...]ab102

    Expanded view | Monitor forum | Save place

    Start a new thread:
    You have to be logged in to post a reply.

     

    Copyright © 2021 Scilico, LLC · Privacy Policy