Skip to content

Rarevariantburden #1636

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
22 changes: 16 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,26 +11,36 @@ This branch contains test data to be used for automated testing with the [nf-cor

`case/samples.txt`: One column text file containing list of samples, one sample ID per line. Here we used 25 test samples from 1000 Genomes Project (build GRCh37) as case data. This is the input of pipeline parameter `caseSample`

`case/21.annotated.vcf.gz`: Pre-annotated case VCF file for chr 21, we used the pre-annotated file to skip the annotation steps in the pipelie for testing purpose. This is the input of pipeline parameter `caseAnnotatedVCFPrefix` and `caseAnnotatedVCFSuffix`
`case/annotationFiles.csv`: List of annotated files for test profile. This is the input of pipeline parameter `caseAnnotatedVCFFileList`

`case/annotationGDSFiles.csv`: List of annotated GDS files for test profile. This is the input of pipeline parameter `caseAnnotationGDSFileList`

`case/genotypeGDSFiles.csv`: List of genotype GDS files for test profile. This is the input of pipeline parameter `caseGenotypeGDSFileList`

`case/21.annotated.vcf.gz`: Pre-annotated case VCF file for chr 21, we used the pre-annotated file to skip the annotation steps in the pipelie for testing purpose.

`case/21.annotated.vcf.gz.tbi`: The tabix index file for the pre-annotated case VCF file for chr 21

`case/21.annotated.vcf.gz.gds`: The GDS format file for the pre-annotated case VCF file for chr 21, we used the GDS file to skip the VCF to GDS conversion steps in the pipelie for testing purpose. This is the input of pipeline parameter `caseAnnotationGDSPrefix` and `caseAnnotationGDSSuffix`
`case/21.annotated.vcf.gz.gds`: The GDS format file for the pre-annotated case VCF file for chr 21, we used the GDS file to skip the VCF to GDS conversion steps in the pipelie for testing purpose.

`case/22.annotated.vcf.gz`: Pre-annotated case VCF file for chr 22, we used the pre-annotated file to skip the annotation steps in the pipelie for testing purpose. This is the input of pipeline parameter `caseAnnotatedVCFPrefix` and `caseAnnotatedVCFSuffix`
`case/22.annotated.vcf.gz`: Pre-annotated case VCF file for chr 22, we used the pre-annotated file to skip the annotation steps in the pipelie for testing purpose.

`case/22.annotated.vcf.gz.tbi`: The tabix index file for the pre-annotated case VCF file for chr 22

`case/22.annotated.vcf.gz.gds`: The GDS format file for the pre-annotated case VCF file for chr 22, we used the GDS file to skip the VCF to GDS conversion steps in the pipelie for testing purpose. This is the input of pipeline parameter `caseAnnotationGDSPrefix` and `caseAnnotationGDSSuffix`
`case/22.annotated.vcf.gz.gds`: The GDS format file for the pre-annotated case VCF file for chr 22, we used the GDS file to skip the VCF to GDS conversion steps in the pipelie for testing purpose.

`case/21.biallelic.leftnorm.ABCheck.vcf.gz.gds`: The GDS format for the left normalized case VCF file for chr 21, we used this to skip the normalization and convert nomalized VCF file to GDS format steps in the pipeline. This is the input of pipeline parameter `caseGenotypeGDSPrefix` and `caseGenotypeGDSSuffix`
`case/21.biallelic.leftnorm.ABCheck.vcf.gz.gds`: The GDS format for the left normalized case VCF file for chr 21, we used this to skip the normalization and convert nomalized VCF file to GDS format steps in the pipeline.

`case/22.biallelic.leftnorm.ABCheck.vcf.gz.gds`: The GDS format for the left normalized case VCF file for chr 22, we used this to skip the normalization and convert nomalized VCF file to GDS format steps in the pipeline. This is the input of pipeline parameter `caseGenotypeGDSPrefix` and `caseGenotypeGDSSuffix`
`case/22.biallelic.leftnorm.ABCheck.vcf.gz.gds`: The GDS format for the left normalized case VCF file for chr 22, we used this to skip the normalization and convert nomalized VCF file to GDS format steps in the pipeline.

`case/casePopulation.txt`: The predicted ancestry for each sample, if not specified pipeline will estimate the ancestry/ethnicity of each sample using gnomAD classifier. This is the input of pipeline parameter `casePopulation`

`control/`: Input files needed for the pipeline from the control dataset (here we used gnomAD v2 exome data as control dataset)

`control/controlAnnotationGDS.csv`: List of pre-annotated and GDS converted VCF file from gnomAD v2 exome.

`control/controlGenotypeGDS.csv`: List of normalized and GDS converted VCF file from gnomAD v2 exome.

`control/annotation/chr21.annovar.vep.vcf.gz.gds`: The pre-annotated and GDS converted VCF file from gnomAD v2 exome. Here we used a small exome region for chr 21 as the control data.

`control/annotation/chr22.annovar.vep.vcf.gz.gds`: The pre-annotated and GDS converted VCF file from gnomAD v2 exome. Here we used a small exome region for chr 22 as the control data.
Expand Down
3 changes: 3 additions & 0 deletions case/annotationFiles.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
chr,vcf,index
21,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/21.annotated.vcf.gz,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/21.annotated.vcf.gz.tbi
22,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/22.annotated.vcf.gz,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/22.annotated.vcf.gz.tbi
3 changes: 3 additions & 0 deletions case/annotationGDSFiles.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
chr,gds
21,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/21.annotated.vcf.gz.gds
22,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/22.annotated.vcf.gz.gds
3 changes: 3 additions & 0 deletions case/genotypeGDSFiles.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
chr,gds
21,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/21.biallelic.leftnorm.ABCheck.vcf.gz.gds
22,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/case/22.biallelic.leftnorm.ABCheck.vcf.gz.gds
3 changes: 3 additions & 0 deletions control/controlAnnotationGDS.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
chr,gds
21,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/control/annotation/chr21.annovar.vep.vcf.gz.gds
22,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/control/annotation/chr22.annovar.vep.vcf.gz.gds
3 changes: 3 additions & 0 deletions control/controlGenotypeGDS.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
chr,gds
21,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/control/genotypeCount/gnomad.21.vcf.bgz.gds
22,https://raw.githubusercontent.com/nf-core/test-datasets/rarevariantburden/control/genotypeCount/gnomad.22.vcf.bgz.gds