Leading Edge Genomic Services & Solutions

Whole Genome Sequencing

Service Overview
Novogene Data
Contact Us

Whole Genome Sequencing ServiceHuman whole genome sequencing enables researchers to catalog the genetic constitution of individuals and capture all the variants present in a single assay. It is applied to the study of cancer and a variety of diseases, as well as human population evolution studies and pharmacogenomics.

Equipped with 30 powerful Illumina HiSeq X and 25 NovaSeq 6000 systems, Novogene is capable of sequencing up to 280,000 human genomes per year at the lowest cost per genome. Being one of the first few companies to adopt HiSeq X Ten since early 2014, we have extensive experience providing whole genome sequencing services, having successfully sequenced hundreds of thousands of genomes with high quality results. With world’s largest sequencing capacity, deep experience in whole genome sequencing, and advanced bioinformatics capabilities, Novogene is able to expertly meet customer needs for executing large projects with timely turn-around and the highest quality results.

“We did whole genome sequencing plus advanced bioinformatics analysis using Novogene earlier this year. I am very satisfied with their professional, skillful and high-quality services. In particular, the bioinformatics knowledge and expertise in the project team were very impressive. They provided great support, rapid turn-around, and pricing that enables us to do more science with our limited budget. I really appreciate their willingness and ability to explore more advanced bioinformatics analysis to meet the specific requirements of my projects. With these in mind, I have initiated two additional RNA-Seq projects with them and recommended two collaborators to use Novogene’s sequencing service.”

Wenhui Hu, M.D., Ph.D.
Associate Professor, Department of Neuroscience Temple University School of Medicine, Philadelphia, USA

“I am extremely satisfied with the quality and turn-around of the WGS results Novogene delivered. They have outstanding informatics/analysis, highly responsive and effective support, advanced Illumina technology (such as the X Ten), all at highly competitive prices.”

Justin Loe
CEO, Full Genomes Corporation, Maryland USA

The Novogene Advantage

  • State-of-the-art NGS technologies: Novogene is a world leader in sequencing capacity using state-of-the-art technology, including 30 Illumina HiSeq X and 25 NovaSeq 6000 Systems.
  • Highest data quality: We guarantee a Q30 score ≥ 80%, exceeding Illumina’s official guarantee of ≥75%. See our data example.
  • Extraordinary informatics expertise: Novogene uses its cutting-edge bioinformatics pipeline and internationally recognized best-in-class software to provide customers highly reliable “publication-ready data”.

Project Workflow

Human Whole Genome Sequencing Service Workflow

Sequencing Strategy

  • 350 bp insert DNA library
  • HiSeq X platform or NovaSeq 6000 platform, paired-end 150 bp

Data Quality Guarantee

  • We guarantee that ≥ 80% of bases have a sequencing quality score ≥ Q30, which exceeds Illumina’s official guarantee of ≥ 75%.

Sample Requirements

  • Input DNA:
    • For fresh samples: ≥ 1.0 μg
    • For FFPE samples: ≥ 1.5 μg
  • DNA concentration: ≥ 20 ng/μl
  • DNA volume: ≥ 20 μl
  • Purity: No degradation or RNA contamination; fragments should be longer than 1,500 bp for FFPE samples

Turnaround Time

  • 15 working days after verification of sample quality (without data analysis)
  • Additional 8 working days for data analysis

Recommended Sequencing Depth

  • For tumor tissues: 50×, adjacent normal tissues and blood 30×
  • For rare diseases: 30~50×

Analysis Pipeline

 Human Whole Genome Sequencing Analysis Pipeline

Bioinformatics Analysis includes:

  • Data quality control: filtering out reads containing adapters or with low quality
  • Alignment with reference genome, statistics of sequencing depth and coverage
  • SNP/InDel/SV/CNV calling, annotation and statistics
  • Somatic SNP/InDel/SV/CNV calling, annotation and statistics (paired tumor samples)

Advanced Analysis

Monogenic disorders

1. Variant filtering
2. Analysis under dominant/recessive model (Pedigree information is needed)
2.1 Analysis under dominant model
2.2 Analysis under recessive model
3. Functional annotation of candidate genes
4. Pathway enrichment analysis of candidate genes
5. Linkage analysis
6. Regions of homozygosity (ROH) analysis

Complex/multifactorial disorders

1. Variant filtering
2. Analysis under dominant/recessive model (Pedigree information is needed)
2.1 Analysis under dominant model
2.2 Analysis under recessive model
3. Functional annotation of candidate genes
4. Pathway enrichment analysis of candidate genes
5. De novo mutation analysis (Trio/Quartet)
5.1 De novo SNP/InDel detection
5.2 Calculation of de novo mutation rates
5.3 De novo CNV/SV and De novo SV/CNV detection
6. Protein-protein interaction (PPI) analysis
7. Association analysis of candidate genes (at least 20 trios or case/control pairs)

Cancer (for tumor-normal pair samples)

1. Screening for predisposing genes
2. Mutation spectrum & mutation signature analyses
3. Screening for known driver genes
4. Analyses of tumor significantly mutated genes
5. Analysis of copy number variations (CNV)
5.1. Analysis of CNV distribution
5.2.Analysis of CNV recurrence
6. Fusion gene detection (for WGS porject only)
7. Purity & ploidy analyses of tumor samples
8. Tumor heterogeneity analyses
9. Tumor evolution analysis
10. Display of genomic variants with Circos

Table. Representative data of Novogene's human whole genome sequencing service.

 1  Original sequencing data (in gigabases).
 2  Percentage of clean reads from all raw reads.
 3  Average error rate of all bases in read1 and read2.
 4  Percentage of reads with an average quality greater than Q20.
 5  Percentage of reads with an average quality greater than Q30.
 6  Percentage of G and C bases from total bases.
 7  Percentage of total reads that mapped to the reference genome (UCSC hg19).
 8  Average sequencing depth.
 9  Percentage of genome covered by sequencing.
10  Percentage of bases in genome with a sequencing depth ≥ 4x.
11  Percentage of bases in genome with a sequencing depth ≥ 10x.
12  Percentage of bases in genome with a sequencing depth ≥ 20x.

Project Example

The following studies utilized Novogene's expertise in whole genome sequencing on HiSeq X Ten.

Genetic alterations in esophageal tissues from squamous dysplasia to carcinoma
Gastroenterology 153:166-177 (2017)

Esophageal cancer is a major worldwide health threat, and intraepithelial neoplasia (IEN) is considered to be a precancerous lesion of esophageal squamous cell carcinoma (ESCC), one subtype of esophageal cancer. This study incorporated Novogene’s whole genome sequencing, together with whole exome sequencing and targeted sequencing, to track genetic changes in patients during development of ESCC. The study revealed significant similarities in the types and frequency of mutations between IEN and ESCC, including similarity in the DNA damage mutation signature. Mutations in the CCND1, CDKN2A and FGFR1 genes were also revealed as the early driver events from phylogenetic and clonal analysis. However, the number of non-overlapping SNVs in tissues taken from the same individuals indicated that various lesions formed independently and that there was independent clonal expansion of mutations. As shown in this study, using multiple NGS applications provides novel approaches for exploring early diagnostics and treatments for cancer.

Figure. The mutation variation landscape of ESCC, IEN, and simple hyperplasia (ESSH) from whole genome sequencing and whole exome sequencing

Loss of BRCA1 or BRCA2 markedly increases the rate of base substitution mutagenesis and has distinct effects on genomic deletions
Oncogene, 36: 746-755 (2017)

Genome instability, caused by DNA repair failure and DNA response system damage, is a hallmark of cancer. BRCA1 and BRCA2 play important roles in homologous recombination repair. Here Novogene’s HiSeq X Ten was incorporated on chicken DT40 cell clones whole genome sequencing to compare the consequences of loss of BRCA1/2 on genomic mutagenesis, under normal cell culture or under a MMS treatment that designed to accelerate one class of endogenous mutagenic processes. Loss of BRCA1 or BRCA2 increased seven- to eightfold higher level of spontaneous base mutation rate, and this increased mutation is strongly correlated with a BRCA1/2 mutant signature. Loss of BRCA1 or BRCA2 also induced more insertion/deletion mutations and large rearrangements under the endogenous damage induction condition. The high rate of base substitution mutagenesis demonstrated in this study indicates significant oncogenic effect of the inactivation of BRCA1/2, with distinct roles of BRCA1/2 in the DNA lesions processing.

Figure. Number and spectrum of SNVs

  Whole Genome Sequencing on HiSeq X (Human/ Animal/ Plant)
  Whole Exome Sequencing
  LncRNA Sequencing
  Small RNA Sequencing
  Whole Genome Bisulfite Sequencing
  Animal & Plant Re-Sequencing
  de novo Sequencing
  Pan-genome Sequencing
  Metagenomic Sequencing
  Single-cell DNA Sequencing
  Single-cell RNA Sequencing
  16S/18S/ITS Amplicon
  HiSeq Lane Sequencing
  NovaSeq Flowcell Sequencing
  Others- please specify