서비스

식물 및 동물 전장 유전체 시퀀싱 (PAWGS)

동물 및 식물 전장유전체 시퀀싱 (PAWGS) 은 동물과 식물의 전체 유전체를 분석하는 핵심 기술로, SNP, InDel, CNV, SV 등의 유전체 변이를 식별하는 데 활용됩니다. 전장유전체 시퀀싱은 단일 염기 수준에서 유전 정보를 완전히 해독할 수 있는 최적의 방법입니다.

Novogene는 초고속 분석 속도, 고품질 시퀀싱 데이터, 신뢰할 수 있는 결과를 제공하는 동물 및 식물 전장유전체 시퀀싱 서비스를 제공합니다. 해당 기술은 집단 유전학 연구, 전장유전체 연관 분석 (GWAS), 농업 육종 프로그램 등 다양한 연구 분야에서 폭넓게 활용되고 있습니다.

응용 분야 (Applications)

노보진 서비스의 장점 및 특징

빠르고 신뢰할 수 있는 프로젝트 수행 : 엄격한 SOP(Standard Operating Procedure) 준수를 통해 효율적이고 신속한 시퀀싱 결과를 보장합니다.
고품질 데이터 제공: Q30 ≥ 85% 이상의 탁월한 시퀀싱 정확도로 신뢰할 수 있는 연구 데이터를 제공합니다.
맞춤형 및 고급 솔루션 제공: 집단 연구 및 전장 유전체 연관 분석(GWAS)에 최적화된 PAWGS 시퀀싱 전략을 제공합니다.
다양한 생물 종 시퀀싱 경험 보유: 돼지, 생쥐, 호랑이, 벌 등 폭넓은 식물 및 동물 유전체 시퀀싱 경험을 갖추고 있습니다.

사양: DNA 샘플 요구사항

Platform Type	Sample Type	Amount (Qubit®)	Purity
Illumina NovaSeq System	Genomic DNA	≥ 300 ng	A260/280=1.8-2.0; no degradation, no contamination
Illumina NovaSeq System	Genomic DNA (PCR free library)	≥ 1.2 μg	A260/280=1.8-2.0; no degradation, no contamination
PacBio Revio System	HMW Genomic DNA (HiFi SMRTbell® library)	≥ 5 μg	A260/280=1.8-2.0; A260/230=1.5-2.6; *NC/QC=0.95-3.00 Fragments should be ≥ 30 kb
Nanopore PromethION	HMW Genomic DNA	≥ 8 μg	A260/280=1.8-2.0; A260/230=1.5-2.6; Fragments should be ≥ 30 kb

*NC/QC: NanoDrop concentration/Qubit concentration

사양: 시퀀싱 및 분석

Platform Type	Illumina NovaSeq System	PacBio Revio System	Nanopore PromethION
Read Length	Paired-end 150 bp	Average > 15 kb	Average > 17 kb
Recommended Sequencing Depth	For SNP/InDel detection: ≥ 10×	For SV detection: ≥ 20×
	For SV/CNV detection: ≥ 20×
Content of Analysis	Standard Analysis	Standard Analysis
	• Data quality control • Sequencing error rate • Filtering reads containing adapter or with low quality • Alignment with reference genome • Statistics of mapping, sequencing depth and coverage • SNP calling, annotation and statistics	• Data quality control • Sequencing alignment • Structural Variant (SV) detection
	Advanced Analysis
	• SV calling, annotation and statistics • CNV calling, annotation and statistics

프로젝트 진행 과정

노보진은 프로젝트의 모든 과정에서 최고 수준의 품질과 전문적인 서비스를 제공하여 연구자들이 신뢰할 수 있는 결과를 얻을 수 있도록 지원합니다. 모든 단계는 엄격한 과학적 기준에 따라 설계되고 수행되며, 시퀀싱 데이터의 정확성과 신뢰성을 보장하기 위해 철저한 품질 관리(QC) 절차가 적용됩니다.
전체 워크플로우는 샘플 준비 및 정량화, DNA 절단 및 라이브러리 제작, 라이브러리 품질 검증, 시퀀싱, 생물정보 분석 등의 주요 단계로 이루어져 있습니다.

[1] https://www.genome.gov/about-genomics/fact-sheets/Sequencing-Human-Genome-cost

식물 및 동물 전장 유전체 시퀀싱 (PAWGS) 주요 논문

Resequencing of 1,143 indica rice accessions reveals important genetic variations and different heterosis patterns

Journal: Nature Communications Date: 22 September, 2020IF: 12.121 DOI:https://www.nature.com/articles/s41467-020-18608-0

Reference information

Lv Q, Li W, Sun Z, Ouyang N, Jing X, He Q, Wu J, Zheng J, Zheng J, Tang S, Zhu R, Tian Y, Duan M, Tan Y, Yu D, Sheng X, Sun X, Jia G, Gao H, Zeng Q, Li Y, Tang L, Xu Q, Zhao B, Huang Z, Lu H, Li N, Zhao J, Zhu L, Li D, Yuan L, Yuan D. Resequencing of 1,143 indica rice accessions reveals important genetic variations and different heterosis patterns. Nat Commun. 2020 Sep 22;11(1):4778. doi: 10.1038/s41467-020-18608-0. PMID: 32963241; PMCID: PMC7508829.

Genomic analyses of diverse wild and cultivated accessions provide insights into the evolutionary history of jujube

Journal:Plant Biotechnology JournalIssue date: 18 September, 2020IF: 8.154 DOI: https://onlinelibrary.wiley.com/doi/10.1111/pbi.13480

Reference information

Guo M, Zhang Z, Li S, Lian Q, Fu P, He Y, Qiao J, Xu K, Liu L, Wu M, Du Z, Li S, Wang J, Shao P, Yu Q, Xu G, Li D, Wang Y, Tian S, Zhao J, Feng X, Li R, Jiang W, Zhao X. Genomic analyses of diverse wild and cultivated accessions provide insights into the evolutionary history of jujube. Plant Biotechnol J. 2021 Mar;19(3):517-531. doi: 10.1111/pbi.13480. Epub 2020 Sep 30. PMID: 32946650; PMCID: PMC7955879.

Resequencing of 145 Landmark Cultivars Reveals Asymmetric Sub-genome Selection and Strong Founder Genotype Effects on Wheat Breeding in China

Journal: Molecular Plant Date: 5 September, 2020IF:12.084 DOI: https://www.cell.com/molecular-plant/fulltext/S1674-2052(20)30296-3?_returnURL=https%3A%2F%2Flinkinghub.elsevier.com%2Fretrieve%2Fpii%2FS1674205220302963%3Fshowall%3Dtrue

Reference information

Hao C, Jiao C, Hou J, Li T, Liu H, Wang Y, Zheng J, Liu H, Bi Z, Xu F, Zhao J, Ma L, Wang Y, Majeed U, Liu X, Appels R, Maccaferri M, Tuberosa R, Lu H, Zhang X. Resequencing of 145 Landmark Cultivars Reveals Asymmetric Sub-genome Selection and Strong Founder Genotype Effects on Wheat Breeding in China. Mol Plant. 2020 Dec 7;13(12):1733-1751. doi: 10.1016/j.molp.2020.09.001. Epub 2020 Sep 5. PMID: 32896642.

Resequencing core collection of upland cotton (Gossypium hirsutum) identifies genomic variation and loci influencing quality and yield of fiber

Journal:Nature GeneticsIssue Date: 2018IF: 27.959 DOI: https://www.nature.com/articles/s41588-018-0119-7

Reference Information

Ma Z, He S, Wang X, Sun J, Zhang Y, Zhang G, Wu L, Li Z, Liu Z, Sun G, Yan Y, Jia Y, Yang J, Pan Z, Gu Q, Li X, Sun Z, Dai P, Liu Z, Gong W, Wu J, Wang M, Liu H, Feng K, Ke H, Wang J, Lan H, Wang G, Peng J, Wang N, Wang L, Pang B, Peng Z, Li R, Tian S, Du X. Resequencing a core collection of upland cotton identifies genomic variation and loci influencing fiber quality and yield. Nat Genet. 2018 Jun;50(6):803-813. doi: 10.1038/s41588-018-0119-7. Epub 2018 May 7. PMID: 29736016.

Genome re-sequencing reveals the evolutionary history of peach fruit edibility

Journal: Nature CommunicationsIssue Date: 2018IF:12.353 DOI:https://www.nature.com/articles/s41467-018-07744-3

Reference Information

Yu Y, Fu J, Xu Y, Zhang J, Ren F, Zhao H, Tian S, Guo W, Tu X, Zhao J, Jiang D, Zhao J, Wu W, Wang G, Ma R, Jiang Q, Wei J, Xie H. Genome re-sequencing reveals the evolutionary history of peach fruit edibility. Nat Commun. 2018 Dec 20;9(1):5404. doi: 10.1038/s41467-018-07744-3. PMID: 30573726; PMCID: PMC6302090.

Sequencing Quality Distribution

시퀀싱 품질 분포는 전체 서열 길이에 걸쳐 분석되며, 오류 염기가 비정상적으로 높은 빈도로 포함되는 영역을 검출하기 위해 평가됩니다. 자세한 시퀀싱 품질 분포는 다음과 같습니다.

Figure 1. Distribution of sequencing quality

참고: x축은 시퀀싱 리드 내 염기 위치를 나타내며, y축은 각 위치에서 모든 리드의 평균 Phred 점수를 나타냅니다.

(페어엔드 시퀀싱 데이터는 함께 플로팅되며, 첫 번째 PE150 bp는 Read 1을, 이어지는 PE150 bp는 Read 2를 나타냅니다.)

Sequencing Error Rate

시퀀싱 오류율은 딥 시퀀싱을 통해 저빈도 변이를 정확하게 검출하는 데 있어 주요 혼란 요인입니다. 이는 시퀀싱 데이터의 품질을 결정하는 중요한 요소입니다. 시퀀싱 오류율은 시퀀싱 사이클과 밀접한 관련이 있으며, 화학 시약의 소모로 인해 각 리드의 말미로 갈수록 증가하는 경향이 있습니다. 이러한 현상은 Illumina 고처리량 시퀀싱 플랫폼의 일반적인 특징입니다.

Figure 2. Distribution of sequencing error

참고: X축은 각 시퀀싱 리드의 염기 위치를 나타내며, Y축은 염기 오류율을 나타냅니다.
(페어엔드 시퀀싱 데이터는 함께 표시되며, 첫 번째 PE150 bp는 Read 1을 나타내고, 이후 PE150 bp는 Read 2를 나

Filtering reads containing adapter or with low quality

시퀀싱 원본 리드는 종종 저품질 리드 또는 어댑터 서열을 포함하고 있어 후속 분석의 품질에 영향을 미칠 수 있습니다. 이를 방지하기 위해 원본 리드를 필터링하여 클린 리드를 확보하는 과정이 필요합니다. 원본 리드 필터링은 다음 조건에 따라 수행해야 합니다:

(1) Remove the paired reads when either read contains adapter contamination;
(2) Remove the paired reads when uncertain nucleotides (N) constitute more than 10 percent of either read;
(3) Remove the paired reads when low quality nucleotides (base quality less than 5, Q ≤ 5) constitute more than 50 percent of either read.

Figure 3. Classification of the sequenced reads (1) Adapter related: (reads containing adapter) / (total raw reads). (2) Containing N: (reads with more than 10% N) / (total raw reads). (3) Low quality: (reads of low quality) / (total raw reads). (4) Clean reads: (clean reads) / (total raw reads).

Alignment with reference genome

Mapping Statistics with Reference Genome

Figure 4. The mean depth of each chromosome.

참고: x축은 chromosome을 나타내며, y축은 mean depth를 나타냅니다.

Standard Analysis

SNP Detection & Annotation

ANNOVAR는 최신 정보를 활용하여 유전체에서 발견된 유전 변이를 효율적으로 주석(annotation)하는 소프트웨어입니다. 참조 뉴클레오타이드(reference nucleotide), 시작 위치(start position), 종료 위치(end position), 관찰된 뉴클레오타이드(observed nucleotides), 그리고 chromosome이 포함된 변이 목록을 제공하면, ANNOVAR는 유전자 기반(gene-based) 주석, 영역 기반(region-based) 주석, 필터 기반(filter-based) 주석을 비롯한 다양한 기능을 수행할 수 있습니다.

Figure 5: Distribution and Annotation of SNPs in the Genome

참고: 이 그래프들은 SNP의 품질 분포(SNP quality distribution), SNP 지원 리드 수 분포(distribution of SNP support reads number), 인접한 SNP 간 거리 분포(distribution of distances between adjacent SNPs), 그리고 SNP 품질의 누적 분포(cumulative distribution of SNP quality) 를 나타냅니다.

SNP Quality Distribution

검출된 SNP의 신뢰도를 평가하기 위해, 우리는 **지원 리드 수(support reads number), SNP 품질(SNP quality), 인접한 SNP 간 거리(distance between adjacent SNPs)**의 분포를 확인하였습니다. 결과는 다음과 같습니다.

Figure 6: Cumulative distribution of SNP quality.

참고: 이 그래프들은 SNP의 품질 분포 (SNP quality distribution), SNP 지원 리드 수 분포(distribution of SNP support reads number), 인접한 SNP 간 거리 분포 (distribution of distances between adjacent SNPs), 그리고 SNP 품질의 누적 분포 (cumulative distribution of SNP quality) 를 나타냅니다.

SNP Mutation Frequency

T:A>C:G 변이를 예로 들면, 이 범주에는 T에서 C로의 변이(T>C)와 A에서 G로의 변이 (A>G) 가 포함됩니다. 이중 가닥 (DNA double-strand) 중 어느 한 가닥에서 T>C 변이가 발생하면, 동일한 위치의 반대 가닥에서는 A>G 변이가 관찰됩니다. 따라서 T>C 변이와 A>G 변이는 동일한 범주로 분류됩니다. 이와 같은 원리에 따라, 전장유전체 (whole-genome) SNP 변이는 총 6가지 범주로 분류될 수 있습니다.

Figure 7. SNP Mutation type distibution

Figure 8: Frequency of SNP mutations

참고: x축은 SNP의 개수를 나타내며, y축은 돌연변이 유형 (mutation types)을 나타냅니다.

CNV Detection & Annotation

Copy-number variation (CNV)**는 **구조적 변이(structural variation)**의 한 유형으로, 특정 DNA 조각(fragment)이 **참조 유전체(reference genome)**와 비교하여 가변적인 복제 수(copy number)로 존재할 때 발생합니다. CNV는 유전체 내 **결실(deletions)과 중복(duplications)**을 식별하는 데 중요한 역할을 합니다.

참조 유전체의 **리드 깊이(reads depth)**를 기반으로, CNVnator는 -call 100 파라미터를 사용하여 잠재적인 결실 및 중복 변이를 검출하는 데 활용될 수 있습니다. 검출된 CNV는 이후 ANNOVAR를 이용하여 추가로 주석(annotation)됩니다.

Figure 10. CNV annotation

Figure 11: Ann Variation type statistics distribution of CNVs

참고: x축은 **샘플(samples)**을 나타내며, y축은 각 **CNV 유형의 비율(proportion of each type of CNVs)**을 나타냅니다.

동물 및 식물 De Novo
시퀀싱

추가 서비스

아이소폼 시퀀싱 (Iso-Seq)

ChIP-Seq

전장유전체 바이설파이트
시퀀싱(WGBS)

아래에서 지역별 정보를 확인하려면 위치를 선택하세요.

Americas (North America, South America, and the Caribbean)

Asia-Pacific (Including Hong Kong SAR, Macau SAR, and Taiwan)

Europe

China Mainland

Middle-East & Africa

Japan

서비스

식물 및 동물 전장 유전체 시퀀싱 (PAWGS)

응용 분야 (Applications)

노보진 서비스의 장점 및 특징

사양: DNA 샘플 요구사항

사양: 시퀀싱 및 분석

프로젝트 진행 과정

식물 및 동물 전장 유전체 시퀀싱 (PAWGS) 주요 논문

Resequencing of 1,143 indica rice accessions reveals important genetic variations and different heterosis patterns

Genomic analyses of diverse wild and cultivated accessions provide insights into the evolutionary history of jujube

Resequencing of 145 Landmark Cultivars Reveals Asymmetric Sub-genome Selection and Strong Founder Genotype Effects on Wheat Breeding in China

Resequencing core collection of upland cotton (Gossypium hirsutum) identifies genomic variation and loci influencing quality and yield of fiber

Genome re-sequencing reveals the evolutionary history of peach fruit edibility

Sequencing Quality Distribution

Sequencing Error Rate

Filtering reads containing adapter or with low quality

Alignment with reference genome

Mapping Statistics with Reference Genome

Standard Analysis

SNP Detection & Annotation

SNP Quality Distribution

SNP Mutation Frequency

CNV Detection & Annotation

추가 서비스

Like this:

귀하의 다음 프로젝트에 대해 소개해주세요.

최신 소식 구독

Connect with us

* Mandatory fields.

아래에서 지역별 정보를 확인하려면 위치를 선택하세요.

서비스

식물 및 동물 전장 유전체 시퀀싱 (PAWGS)

응용 분야 (Applications)

노보진 서비스의 장점 및 특징

사양: DNA 샘플 요구사항

사양: 시퀀싱 및 분석

프로젝트 진행 과정

식물 및 동물 전장 유전체 시퀀싱 (PAWGS) 주요 논문

Sequencing Quality Distribution

Sequencing Error Rate

Filtering reads containing adapter or with low quality

Alignment with reference genome

Standard Analysis

추가 서비스

Share this:

Like this:

DOWNLOAD NOW!

* Mandatory fields.