Skip to content

Commit 28f53bb

Browse files
authored
Merge pull request #124 from assemblerflow/dev
1.2.2 version release
2 parents d58caf5 + ec17346 commit 28f53bb

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

59 files changed

+2107
-209
lines changed

.gitignore

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
# ignore ide folders
2+
.idea/
3+
.vscode/
4+
5+
# ignore python generated files
6+
*.pyc

changelog.md

Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,54 @@
22

33
## Upcoming release (`dev` branch)
44

5+
### Components changes
6+
7+
- `mapping_patlas`: refactored to remove temporary files used to create
8+
sam and bam files and added data to .report.json. Updated databases to pATLAS
9+
version 1.5.2.
10+
- `mash_screen` and `mash_dist`: added data to .report.json. Updated databases
11+
to pATLAS version 1.5.2.
12+
13+
### New components
14+
15+
- Added component `fasterq_dump`
16+
- Added component `mash_sketch_fasta`
17+
- Added component `mash_sketch_fastq`
18+
- Added component `downsample_fastq` for FastQ read sub sampling using seqtk
19+
- Added component `momps` for typing of Legionella pneumophila
20+
- Added component `split_assembly`
21+
- Added component `mafft`
22+
- Added component `raxml`
23+
- Added component `viral_assembly`
24+
- Added component `progressive_mauve`
25+
- Added component `dengue_typing`
26+
27+
### Minor/Other changes
28+
29+
- Added check for `params.accessions` that enables to report a proper
30+
error when it is set to `null`.
31+
- Added `build` option to export component parameters information in JSON format.
32+
- Fixed minor issue preventing the `maxbin2` and `split_assembly` components
33+
from being used multiples times in a pipeline
34+
- Added a catch to the `filter_poly` process for cases where the input file is empty.
35+
- spades template now reports the exit code of spades' execution
36+
37+
### Bug fixes
38+
39+
- Removed the need for the nf process templates to have an empty line
40+
at the beginning of the template files.
41+
- Fixed issue when the `inspect` mode was executed on a pipeline directory
42+
with failed processes but with the work directory removed (the log files
43+
where no longer available).
44+
- Fixed issue when the `inspect` mode was executed on a pipeline without the
45+
memory directory defined.
46+
- Fixed issue in the `inspect` mode, where there is a rare race condition between
47+
tags in the log and trace files.
48+
- Fixed bug on `midas_species` process where the output file was not being
49+
linked correctly, causing the process to fail
50+
- Fixed bug on `bowtie` where the reference parameter was missing the pid
51+
- Fixed bug on `filter_poly` where the tag was missing
52+
553
## 1.2.1
654

755
### Improvements

docs/user/available_components.rst

Lines changed: 17 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,9 @@ Download
1414
- :doc:`components/reads_download`: Downloads reads from the SRA/ENA public
1515
databases from a list of accessions.
1616

17+
- :doc:`components/fasterq_dump`: Downloads reads from the SRA public databases
18+
from a list of accessions, using ``fasterq-dump``.
19+
1720
Reads Quality Control
1821
--------------------
1922

@@ -35,6 +38,9 @@ Reads Quality Control
3538

3639
- :doc:`components/trimmomatic`: Runs Trimmomatic on paired-end FastQ files.
3740

41+
- :doc:`components/sample_fastq`: Subsamples fastq files up to a target coverage
42+
depth.
43+
3844

3945
Assembly
4046
--------
@@ -86,14 +92,18 @@ Distance Estimation
8692
- :doc:`components/mash_dist`: Executes mash distance against a reference index
8793
plasmid database and generates a `JSON` for pATLAS. This component calculates
8894
pairwise distances between sequences (one from the database and the query
89-
sequence). However if a
90-
different database is provided it can use mash dist for other purposes.
95+
sequence). However if a different database is provided it can use mash dist
96+
for other purposes.
9197

9298
- :doc:`components/mash_screen`: Performs mash screen against a reference index
9399
plasmid database and generates a JSON input file for pATLAS. This component
94100
searches for containment of a given sequence in read sequencing data.
95-
However if a different
96-
database is provided it can use mash screen for other purporses.
101+
However if a different database is provided it can use mash screen for other
102+
purporses.
103+
104+
- :doc:`components/mash_sketch_fasta`: Performs mash sketch for fasta files.
105+
106+
- :doc:`components/mash_sketch_fastq`: Performes mash sketch for fastq files.
97107

98108
Mapping
99109
-------
@@ -142,3 +152,6 @@ Typing
142152

143153
- :doc:`components/sistr`: Serovar predictions from whole-genome sequence assemblies
144154
by determination of antigen gene and cgMLST gene alleles.
155+
156+
- :doc:`components/momps`: Multi-locus sequence typing for Legionella pneumophila
157+
from assemblies and reads.

docs/user/components/fasterq_dump.rst

Lines changed: 50 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,50 @@
1+
fasterq_dump
2+
============
3+
4+
Purpose
5+
-------
6+
7+
This component downloads reads from the SRA public databases from a
8+
list of accessions. This component uses ``fasterq-dump`` from
9+
`NCBI sra-tools <https://github.com/ncbi/sra-tools>`_. ``fasterq-dump``
10+
increases the download speed in comparison from ``fastq-dump`` by
11+
**multi-threading** the extraction of FASTQ from SRA-accessions.
12+
The reads for each accession are then emitted through
13+
the main output of this component to any other component (or components) that
14+
receive FastQ data.
15+
16+
Input/Output type
17+
------------------
18+
19+
- Input type: ``accessions``
20+
- Output type: ``fastq``
21+
22+
.. note::
23+
The default input parameter for Accessions data is ``--accessions``.
24+
25+
Parameters
26+
----------
27+
28+
- ``option_file``: This options enables the *option-file* parameter of
29+
``fasterq-dump``, allowing parameters to be passed.
30+
- ``compress_fastq``: This options allows users to disable the compression of
31+
the fastq files resulting from this component. The default (``true``) behavior
32+
compresses the fastq files to *fastq.gz*.
33+
34+
Published results
35+
-----------------
36+
37+
- ``reads/<accession>``: Stores the reads for each provided accession.
38+
39+
Published reports
40+
-----------------
41+
42+
None.
43+
44+
Default directives
45+
------------------
46+
47+
- ``cpus``: 1
48+
- ``memory``: 1GB
49+
- ``container``: flowcraft/sra-tools
50+
- ``version``: 2.9.1-1

docs/user/components/mapping_patlas.rst

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,8 @@ can be imported into `pATLAS <http://www.patlas.site/>`_.
1717
Input/Output type
1818
------------------
1919

20-
- Input type: ``FastQ``
21-
- Output type: ``JSON``
20+
- Input type: ``fastq``
21+
- Output type: ``json``
2222

2323

2424
Parameters
@@ -59,11 +59,11 @@ Default directives
5959
------------------
6060

6161
- ``mappingBowtie``:
62-
- ``container``: tiagofilipe12/patlasflow_mapping
63-
- ``version``: 1.1.2
62+
- ``container``: flowcraft/mapping-patlas
63+
- ``version``: 1.4.1
6464
- ``samtoolsView``:
65-
- ``container``: tiagofilipe12/patlasflow_mapping
66-
- ``version``: 1.1.2
65+
- ``container``: flowcraft/mapping-patlas
66+
- ``version``: 1.4.1
6767
- ``jsonDumpingMapping``:
68-
- ``container``: tiagofilipe12/patlasflow_mapping
69-
- ``version``: 1.1.2
68+
- ``container``: flowcraft/mapping-patlas
69+
- ``version``: 1.4.1

docs/user/components/mash_dist.rst

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -21,8 +21,8 @@ database than the default that is intended for plasmid searches.
2121
Input/Output type
2222
------------------
2323

24-
- Input type: ``Fasta``
25-
- Output type: ``JSON``
24+
- Input type: ``fasta``
25+
- Output type: ``json``
2626

2727

2828
Parameters
@@ -64,8 +64,8 @@ Default directives
6464
------------------
6565

6666
- ``runMashDist``:
67-
- ``container``: tiagofilipe12/patlasflow_mash_screen
68-
- ``version``: 1.1
67+
- ``container``: flowcraft/mash-patlas
68+
- ``version``: 1.4.1
6969
- ``mashDistOutputJson``:
70-
- ``container``: tiagofilipe12/patlasflow_mash_screen
71-
- ``version``: 1.1
70+
- ``container``: flowcraft/mash-patlas
71+
- ``version``: 1.4.1

docs/user/components/mash_screen.rst

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -21,8 +21,8 @@ database than the default that is intended for plasmid searches.
2121
Input/Output type
2222
------------------
2323

24-
- Input type: ``FastQ``
25-
- Output type: ``JSON``
24+
- Input type: ``fastq``
25+
- Output type: ``json``
2626

2727

2828
Parameters
@@ -59,8 +59,8 @@ Default directives
5959
------------------
6060

6161
- ``mashScreen``:
62-
- ``container``: tiagofilipe12/patlasflow_mash_screen
63-
- ``version``: 1.1
62+
- ``container``: flowcraft/mash-patlas
63+
- ``version``: 1.4.1
6464
- ``mashOutputJson``:
65-
- ``container``: tiagofilipe12/patlasflow_mash_screen
66-
- ``version``: 1.1
65+
- ``container``: flowcraft/mash-patlas
66+
- ``version``: 1.4.1
Lines changed: 47 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,47 @@
1+
mash_sketch_fasta
2+
=================
3+
4+
Purpose
5+
-------
6+
7+
This component performs mash sketch for fasta input files.
8+
9+
.. note::
10+
- MASH documentation can be found `here <https://mash.readthedocs.io/en/latest/>`_.
11+
12+
13+
Input/Output type
14+
------------------
15+
16+
- Input type: ``fasta``
17+
- Output type: ``msh``
18+
19+
20+
Parameters
21+
----------
22+
23+
- ``kmerSize``: Parameter to set the kmer size for hashing. Default: 21.
24+
Default: false.
25+
26+
- ``sketchSize``: Parameter to set the number of hashes per sketch.
27+
Default: 1000.
28+
29+
30+
Published results
31+
-----------------
32+
33+
None.
34+
35+
36+
Published reports
37+
-----------------
38+
39+
None.
40+
41+
42+
Default directives
43+
------------------
44+
45+
- ``mashSketchFasta``:
46+
- ``container``: flowcraft/mash-patlas
47+
- ``version``: 1.4.1
Lines changed: 55 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,55 @@
1+
mash_sketch_fastq
2+
=================
3+
4+
Purpose
5+
-------
6+
7+
This component performs mash sketch for fastq input files.
8+
9+
.. note::
10+
- MASH documentation can be found `here <https://mash.readthedocs.io/en/latest/>`_.
11+
12+
13+
Input/Output type
14+
------------------
15+
16+
- Input type: ``fastq``
17+
- Output type: ``msh``
18+
19+
20+
Parameters
21+
----------
22+
23+
- ``kmerSize``: Parameter to set the kmer size for hashing. Default: 21.
24+
Default: false.
25+
26+
- ``sketchSize``: Parameter to set the number of hashes per sketch.
27+
Default: 1000.
28+
29+
- ``minKmer``: Minimum copies of each k-mer required to pass noise filter for
30+
reads. Default: 1.
31+
32+
- ``genomeSize``: Genome size (raw bases or with K/M/G/T). If specified, will
33+
be used for p-value calculation instead of an estimated size from k-mer
34+
content. Default: *false*, meaning that it won't be used. If you want to use
35+
it pass a number to this parameter.
36+
37+
38+
Published results
39+
-----------------
40+
41+
None.
42+
43+
44+
Published reports
45+
-----------------
46+
47+
None.
48+
49+
50+
Default directives
51+
------------------
52+
53+
- ``mashSketchFastq``:
54+
- ``container``: flowcraft/mash-patlas
55+
- ``version``: 1.4.1

docs/user/components/momps.rst

Lines changed: 53 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,53 @@
1+
momps
2+
========
3+
4+
Purpose
5+
-------
6+
7+
This component performs Multi-Locus Sequence Typing (MLST) on Legionella pneumophila
8+
from reads and assemblies.
9+
10+
.. note::
11+
Software page: https://github.com/bioinfo-core-BGU/mompS
12+
13+
Input/Output type
14+
------------------
15+
16+
- Input type: ``fasta``
17+
- Output type: None
18+
19+
.. note::
20+
The default input parameter for fasta data is ``--fasta``. This process
21+
also requires FastQ reads provided via the ``--fastq`` parameter.
22+
23+
Parameters
24+
----------
25+
26+
None.
27+
28+
Published results
29+
-----------------
30+
31+
- ``results/typing/momps``: Stores TSV files with the ST and allelic profiles
32+
for each strain.
33+
34+
Published reports
35+
-----------------
36+
37+
None.
38+
39+
Default directives
40+
------------------
41+
42+
- ``momps``:
43+
- ``container``: flowcraft/momps
44+
- ``version``: 0.1.0-4
45+
46+
Advanced
47+
--------
48+
49+
Reports JSON
50+
^^^^^^^^^^^^
51+
52+
``typing``:
53+
- ``momps``: <typing result>

0 commit comments

Comments
 (0)