Simplify Genomic Data Management
Raw genomic datasets are huge, and scientists and bioinformaticians have long sought ways to reduce the size of the datasets they work with by using a combination of data compression and reduction techniques.
Raw genomic datasets are huge, and scientists and bioinformaticians have long sought ways to reduce the size of the datasets they work with by using a combination of data compression and reduction techniques. When genomic sequencing was in its infancy, raw sequencer output, around 2TB, was often stored for extended periods while bioinformaticians carried out the complex tasks in assembling and aligning of the sequencing data. With these steps complete, the data could be used in variant calling and interpretation, which are vital steps in understanding gene expression and disease.
Today this process is highly automated and has been greatly accelerated through a combination of parallel processing and the availability of reference genomes. Work that previously took months or years can now be turned around in little more than a day, and a number of compressed genomic file formats are available that reduce the size of an individually stored genome down to a few tens of gigabytes. This reduction has greatly improved the bioinformatician's ability to work with and transfer data to clinicians quickly and efficiently.
Features and Benefits
The capabilities that set PetaGene PetaSuite with NetApp Data Fabric apart.
Key Benefits
Increase collaborative efficiency
Use less storage capacity and lower costs
Leverage the flexibility of the cloud
Maintain interoperability with existing workflows and formats
Solution Capabilities
The Advent of Precision or Personalised Medicine
Beyond Storage Efficiency
ONTAP Cloud and OnCommand Cloud Manager
To simplify the management experience, NetApp also offers OnCommand® Cloud Manager software, a centralised management environment for ONTAP Cloud software that fully supports hybrid storage environments.
Improve Analysis Speed
Storage Tiering
Thrive with expert-led storage guidance
Get tailored advice on how PetaGene PetaSuite with NetApp Data Fabric fits your environment — from sizing and deployment to long-term optimization.
Technical Specifications
Exhaustive hardware and software metrics extracted directly from official documentation.
-
PetaGene PetaSuite Typical Full Genome File Size16GiB
-
FASTQ.GZ and BAM Formats File Size65GiB to 85GiB
-
Raw Sequencer OutputAround 2TB
-
Lossless Cost Reduction vs BAM or FASTQ.GZUp to 6:1
-
Data Reduction vs Raw FASTQ.GZ96%
-
File ServicesNFS, CIFS
-
Block ServicesiSCSI
-
Data ReplicationSnapMirror®
-
Storage Tiering IntegrationHybrid FAS Solution and NetApp FabricPool
-
ManagementOnCommand® Cloud Manager
-
File Access SystemPetaView command line
-
DecompressionOn-the-fly random-access client-side
Compare PetaGene PetaSuite with NetApp Data Fabric Series
Select the right scale for your workload demands.
Request a custom quote
Build a configuration with a Genomic Data Management specialist.
Request a quoteLearn more
Explore resources
Datasheets, whitepapers, case studies, and technical documentation.
Explore resources