MultiQC | Aggregate results from bioinformatics analyses | Genomics library
kandi X-RAY | MultiQC Summary
kandi X-RAY | MultiQC Summary
Aggregate results from bioinformatics analyses across many samples into a single report.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Parse bcftools stats
- Clean s_name
- Add a section to the report
- Add a data source
- Return a dictionary of custom module classes
- Add a section to the plot
- Write data to a file
- Parse the file header
- Parse qualimap reports
- Return a list of filenames
- Find bamPE fragment size table
- Chart the quality of the contigs
- Plot data
- Parse the qc3 log file
- Add statistics table
- Take the parsed stats from the somalier stats
- Parse samtools stats
- Parse cutadapt log file
- Finds and parses the data
- Take the verifyBAM id and add it to the header
- Find samtools idxstats
- Parse the fastp log file
- Parse bowtie2 output
- Parse the input file as a dictionary
- Return a parsed sample stats table
- Take the parsed stats from the Odg stats table
MultiQC Key Features
MultiQC Examples and Code Snippets
Community Discussions
Trending Discussions on MultiQC
QUESTION
I have a rule in my snakemake pipeline to run multiqc :
...ANSWER
Answered 2022-Apr-11 at 10:56You most likely want to install python as well, since according to docs it's not recommended to use the system-wide python:
QUESTION
This is an extension of a question I asked yesterday. I have looked all over StackOverflow and have not found an instance of this specific NameError:
...ANSWER
Answered 2022-Jan-06 at 06:30I think it's due to you used expand
function in a wrong way, expand
only accepts two positional arguments, where the first one is pattern and the second one is function (optional). If you want to supply multiple patterns you should wrap these patterns in list.
After some studying on source code of snakemake, it turns out expand
function doesn't check if user provides < 3 positional arguments, there is a variable combinator
in if-else
that would only be created when there are 1 or 2 positional arguments, the massive amount of positional arguments you provide skip this part and lead to the error when it tries to use combinator
later.
Source code: https://snakemake.readthedocs.io/en/v6.5.4/_modules/snakemake/io.html
QUESTION
I am trying to do QC on RNAseq data that is tarballed. I am using Snakemake as a workflow manager and am aware that Snakemake does not like one-to-many rules. I defining a checkpoint would fix the problem but when I run the script I get this this error message with rule fastqc.
...ANSWER
Answered 2022-Jan-05 at 06:18First, glob_wildcards(INPUTDIR + "{basenames}_R1.fastq.gz")
returns a Wildcards object that contains the key:value pair for each wildcard. If you want to get the basenames, it should be glob_wildcards(INPUTDIR + "{basenames}_R1.fastq.gz").basenames
Second, I assume all fastq.gz files are generated by decompress_h1n1
checkpoint, since you already include the fastqc output in aggregate_decompress_h1n1
function. You shouldn't include those outputs again in rule all
, it leads to snakemake try to do fastqc before checkpoint got executed.
Third, you should also put your trim_qc
outputs in the aggregate_decompress_h1n1
function, basically it's the same issue as fastqc, probably the same for salmon
related rules too.
There might be a potential issue, I noticed you use wrapper for fastqc, I remember that official fastqc wrapper requires the output must have html
and zip
, while you have a raw
prefix. But I didn't see a 0.80.3
release in official document, not sure if you are using some other repository to access wrapper
QUESTION
My question is very similar to this one.
I am writing a snakemake pipeline, and it does a lot pre- and post-alignment quality control. At the end of the pipeline, I run multiQC on those QC results.
Basically, the workflow is: preprocessing -> fastqc -> alignment -> post-alignment QCs such as picard, qualimap, and preseq -> peak calling -> motif analysis -> multiQC.
MultiQC should generate a report on all those outputs as long as multiQC support them.
One way to force multiqc to run at the very end is to include all the output files from the above rules in the input directive of multiqc rule, as below:
...ANSWER
Answered 2021-Jul-03 at 16:38From your comments I gather that what you really want to do is run a flexibly configured number of QC methods and then summarise them in the end. The summary should only run, once all the QC methods you want to run have completed.
Rather than forcing the MultiQC rule to be executed in the end, manually, you can set up the MultiQC rule in such a way that it automatically gets executed in the end - by requiring the QC method's output as input.
Your goal of flexibly configuring which QC rules to run can be easily achieved by passing the names of the QC rules through a config file, or even easier as a command line argument.
Here is a minimal working example for you to extend:
QUESTION
I'm using Ubuntu to learn basic bioinformatics. I just downloaded multiqc using conda, but when I want to run multiqc, it returns: enter image description here
I am very naive and don't know how to solve this problem. ANY help would be highly appreciated. Thanks
...ANSWER
Answered 2021-Apr-19 at 16:26I think gcd was moved to the math pack in 3.9. See https://docs.python.org/3/library/fractions.html
Changed in version 3.9: The math.gcd() function is now used to normalize the numerator and denominator. math.gcd() always return a int type. Previously, the GCD type depended on numerator and denominator.
I suggest you create a virtual environment with 3.8 and try that. There are tons of tutorials on how to do this.
QUESTION
I am trying to create a pipeline that will take a user-configured directory in config.yml
(where they have downloaded a project directory of .fastq.gz files from BaseSpace), to run fastqc on sequence files. I already have the downstream steps of merging the fastqs by lane and running fastqc on the merged files.
However, the wildcards are giving me problems running fastqc on the original basespace files. The following is my error when I try running snakemake.
...ANSWER
Answered 2021-Jan-29 at 13:18In your rule all
you have:
QUESTION
I am currently writing a Snakefile, which does a lot of post-alignment quality control (CollectInsertSizeMetics, CollectAlignmentSummaryMetrics, CollectGcBiasMetrics
, ...).
At the very end of the Snakefile, I am running multiQC to combine all the metrics in one html report.
I know that if I use the output of rule A as input of rule B, rule B will only be executed after rule A is finished.
The problem in my case is that the input of multiQC is a directory, which exists right from the start. Inside of this directory, multiQC will search for certain files and then create the report.
If I am currently executing my Snakemake file, multiQC will be executed before all quality controls will be performed (e.g. fastqc
takes quite some time), thus these are missing in the final report.
So my question is, if there is an option, that specifies that a certain rule is executed last.
I know that I could use --wait-for-files
to wait for a certain fastqc
report, but that seems very inflexible.
The last rule currently looks like this:
...ANSWER
Answered 2020-Sep-03 at 15:38You could give to the input of multiqc
rule the files produced by the individual QC rules. In this way, multiqc will start once all those files are available. E.g.:
QUESTION
I know this is a common error and I already checked other posts but it didn't resolved my issue. I would like to use the name of the database I use for SortMeRNA
rule (rRNAdb=config["rRNA_database"]
) the same way I use version=config["genome_version"]
. But obviously, I can't.
ANSWER
Answered 2020-Jan-21 at 16:26All output
files need to have same wildcards, or else it would cause conflict in resolving job dependencies. Not all files in output:
have {rRNAdb}
wildcard, which is causing this problem. For example, if you have two {rRNAdb}
values, both would write to file "{OUTDIR}/temp/{sample}.fastq"
, which snakemake correctly doesn't allow.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install MultiQC
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page