Skip to content

Fixes to Consensus Calling and Updated Depth Calculation #5

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 42 commits into
base: master
Choose a base branch
from

Conversation

zacherydickson
Copy link
Collaborator

The following changes have been tested on two analysis runs.

Fixed a bug where called variants which are incompatible (eg. overlapping indel and snp) would lead to truncated consensus sequences.
The Bug was fixed by introducing a step where such incompatible variants are filtered out.

As samtools mileup is now no longer supported, consensus calling has been updated to use bcftools mpileup.

Depth of coverage calculation has been updated to use samtools bedcov, this allows ignoring excessively covered regions and automatically ignores secondary and supplementary reads which artificially inflate the depth.

Made a few changes to streamline and generalize code:

  • The output files for variant calls do not included sorted/filtered/etc so that the intermediate steps may be changed without affecting the output name.
  • Use of variable so file names needn't be changed in multiple places

Zachery Dickson added 30 commits September 19, 2024 00:23
- Added script fo filter incompataible variants from VCF file
- Modified consensus calling scripts to use this filtering
Fixed Bug where TRACESPipe tried to save incorrectly named VCF
Restructured VCF naming during consensus calling
Also modified handling of sensitivity to reduce duplication of code
Added git ignore for swp files
These changes moved to a branch

This reverts commit 8d962be.
Changed VDB.fa in code to a variable for future maintainability
all branches of TRACESPipe (will use updated depth calcs)
    virus REFTYPE swapped with viral
    depth calculation no longer inverted
    This effectively converts all depth and breadth calculations to the
    non-supp/sec+maxDepth method
Fixed typos/language in some comments
Zachery Dickson added 12 commits June 12, 2025 16:47
Bring up typo fixed made in parent
Internal Viral Names System file created to track default supported viruses
Viral Names in src dir now contains the set of viruses used in the analysis
Removed all hardcoded viral lists
Removed blank line from internal viral names which leads to a fatal
 check for empty virus filtering scripts
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant