Publish operator #1540

szymonwieloch · 2020-03-23T09:42:47Z

New feature

Publish operator

Usage scenario

The current publishDir attribute of processes is not flexible enough. Please consider the following example:

process test {
input:
 file('in.txt') from inputChannel
script:
"""
somecmd in.txt -o out1.txt -p out2.txt -r out3.txt
"""
}

Now, what if I want to publish out1.txt using move mode, out2.txt using copy mode and output (but not pubish) out3.txt? The current approach is not flexible enough and it becomes complex. I think it would be good t replace the publishDir parameter with a publish operator,like this:

output: 
file('out1.txt') into out1
file('out2.txt') into out2
file('out3.txt') into out3

And then:

out1.publish('.', mode:'move')
out2.publish('output', mode: 'copy')

The text was updated successfully, but these errors were encountered:

tamuanand · 2020-04-01T21:32:06Z

I think you can mv and cp commands after your 1st line in script

Something like;

publishDir "${params.outdir}/my_wanted_folder", mode:'copy'

input:
 file('in.txt') from inputChannel

ouptut:
 file(*.txt) 

"""
somecmd in.txt -o out1.txt -p out2.txt -r out3.txt
mv out1.txt my_move_out1,txt
cp out2.txt my_copy_out2.txt
"""

DaGaMs · 2020-04-15T13:57:08Z

I think publish should indeed be an operator. That seems like the conceptually most "correct" way to handle this.

Puumanamana · 2020-08-04T18:58:53Z

I also feel like a publish operator could be useful. For example, with the DSL2 syntax, I could not find an easy way to run twice the same process (within 2 different workflows) and publish it, e.g. something like this:

nextflow.enable.dsl = 2

process fastqc {
    publishDir 'QC'

    input: file(fastq)
    output: file("*.{html,zip}")
    script: "fastqc $fastq"
}

process multiqc {
    publishDir 'QC'

    input: file(fqc_outputs)
    output: file("*.html")
    script: "multiqc ."
}

process fastp {
// snip
}

workflow qc {
    take: reads
    main: reads | fastqc | collect | multiqc
}

workflow trimming {
    take: reads
    main: reads | fastp | qc
}

workflow {
    Channel.fromPath(params.reads) | (qc & trimming)
}

In the above example, all files generated by multiqc will be published in the same folder (for fastqc as well, but they will be named differently at least). Since the multiqc process cannot distinguish between the files that arrived directly from the ones that arrive after trimming, it cannot name the summary files differently. I could use an extra parameter to track the workflow execution, but it makes the code less clean.

EDIT: I just found out the variable task.process keeps track of the workflow being called, so it can be used in my example, something like this:

publishDir "QC/${task.process.replaceAll(':', '-')}"

I could not find it in the documentation though

stale · 2021-01-07T17:58:45Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

DaGaMs referenced this issue Apr 25, 2020

Remove deprecated workflow publish [DSL2]

1bd7b4a

stale bot added the stale label Jan 7, 2021

cjw85 mentioned this issue Jan 25, 2021

How to use publishDir on a workflow output? #1636

Closed

stale bot closed this as completed Mar 9, 2021

bentsherman added the lang/operators label Jul 19, 2022

bentsherman mentioned this issue Mar 6, 2023

Add publish operator #3724

Closed

bentsherman mentioned this issue Mar 22, 2024

Workflow output definitions and schema #4670

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Publish operator #1540

Publish operator #1540

szymonwieloch commented Mar 23, 2020

tamuanand commented Apr 1, 2020

DaGaMs commented Apr 15, 2020

Puumanamana commented Aug 4, 2020 •

edited

Loading

stale bot commented Jan 7, 2021

Publish operator #1540

Publish operator #1540

Comments

szymonwieloch commented Mar 23, 2020

New feature

Usage scenario

tamuanand commented Apr 1, 2020

DaGaMs commented Apr 15, 2020

Puumanamana commented Aug 4, 2020 • edited Loading

stale bot commented Jan 7, 2021

Puumanamana commented Aug 4, 2020 •

edited

Loading