FastQC | quality control analysis tool for high throughput | Genomics library

by s-andrews Java Version: v0.12.1 License: GPL-3.0

X-Ray Key Features Code Snippets Community Discussions(10)Vulnerabilities Install Support

kandi X-RAY | FastQC Summary

FastQC is a Java library typically used in Artificial Intelligence, Genomics applications. FastQC has no bugs, it has no vulnerabilities, it has a Strong Copyleft License and it has low support. However FastQC build file is not available. You can download it from GitHub.

FastQC is a program designed to spot potential problems in high througput sequencing datasets. It runs a set of analyses on one or more raw sequence files in fastq or bam format and produces a report which summarises the results. FastQC will highlight any areas where this library looks unusual and where you should take a closer look. The program is not tied to any specific type of sequencing technique and can be used to look at libraries coming from a large number of different experiment types (Genomic Sequencing, ChIP-Seq, RNA-Seq, BS-Seq etc etc). This project page contains the source code for the application and is only really useful only to people wanting to develop new functionality or trace bugs in FastQC. If you just want to run the program then you want to go to the project web page where you can download the compiled pacakges for Windows, OSX and Linux.

Support

Quality

Security

License

Reuse

Support

FastQC has a low active ecosystem.

It has 307 star(s) with 74 fork(s). There are 11 watchers for this library.

It had no major release in the last 12 months.

There are 10 open issues and 93 have been closed. On average issues are closed in 275 days. There are 4 open pull requests and 0 closed requests.

It has a neutral sentiment in the developer community.

The latest version of FastQC is v0.12.1

Quality

FastQC has 0 bugs and 0 code smells.

Security

FastQC has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

FastQC code analysis shows 0 unresolved vulnerabilities.

There are 0 security hotspots that need review.

License

FastQC is licensed under the GPL-3.0 License. This license is Strong Copyleft.

Strong Copyleft licenses enforce sharing, and you can use them when creating open source projects.

Reuse

FastQC releases are available to install and integrate.

FastQC has no build file. You will be need to create the build yourself to build the component from source.

FastQC saves you 10732 person hours of effort in developing the same functionality from scratch.

It has 21781 lines of code, 1181 functions and 155 files.

It has high code complexity. Code complexity directly impacts maintainability of the code.

Top functions reviewed by kandi - BETA

kandi has reviewed FastQC and discovered the below as its top functions. This is intended to give you an instant insight into FastQC implemented functionality, and help decide if they suit your requirements.

Builds the sine and cosine tables
Compute cosine
Compute the sine
Compute the reciprocal of a c
Start the document
Loads an icon from a path
Draw the background
Gets the RGB value of the RGB color
Computes the hyperbolic tangent of a number
Internal helper method
Returns the root to lie on the specified side
Runs the analysis
Compute an exponential value for a given integer p
Returns the inverse probability of the principal p
Processes the sequence
Compute the product of the polynomial
Compute the hyperbolic cosine of a number
Calculate the tile sequence
Compute the angle between two points
Paint the background
Compute the hyperbolic sine of a number
Read the parameters from the configuration file
Draw the label
Compute the cubic root of a number
Compute the arc cosine of a number
Compute the arc sine of a number

Get all kandi verified functions for this library.

FastQC Key Features

No Key Features are available at this moment for FastQC.

FastQC Examples and Code Snippets

No Code Snippets are available at this moment for FastQC.

Community Discussions

Trending Discussions on FastQC

Snakemake with integrated conda env not properly installed

can't exec " java " no such file or directory at ./fastqc line 307

Weird NameError with python function in Snakemake script

Missing input files after defining them in function

MissingRule Exception in Snakemake

Combine outputs of mutually exclusive processes in a Nextflow (DSL2) pipeline

Force a certain rule to execute at the end

Snakemake --use-conda with --cluster and NFS4 storage

Error in Snakemake 'Unexpected keyword expand in rule definition'

snakemake - Missing input files for rule all

QUESTION

Snakemake with integrated conda env not properly installed

Asked 2022-Apr-11 at 11:47

I have a rule in my snakemake pipeline to run multiqc :

...

ANSWER

Answered 2022-Apr-11 at 10:56

You most likely want to install python as well, since according to docs it's not recommended to use the system-wide python:

Source https://stackoverflow.com/questions/71825092

QUESTION

can't exec " java " no such file or directory at ./fastqc line 307

Asked 2022-Mar-09 at 20:04

When I want to install fastqc file in Ubuntu, I face this error: can't exec " java " no such file or directory at ./fastqc line 307 I tried to install Java with this command " sudo apt install java-common" but it doesn't work I would be happy if you help me with this problem.

Thank you

...

ANSWER

Answered 2022-Mar-09 at 20:04

You need to install JRE or JDK. Check this page.

Source https://stackoverflow.com/questions/71414447

QUESTION

Weird NameError with python function in Snakemake script

Asked 2022-Jan-06 at 06:30

This is an extension of a question I asked yesterday. I have looked all over StackOverflow and have not found an instance of this specific NameError:

...

ANSWER

Answered 2022-Jan-06 at 06:30

I think it's due to you used expand function in a wrong way, expand only accepts two positional arguments, where the first one is pattern and the second one is function (optional). If you want to supply multiple patterns you should wrap these patterns in list.

After some studying on source code of snakemake, it turns out expand function doesn't check if user provides < 3 positional arguments, there is a variable combinator in if-else that would only be created when there are 1 or 2 positional arguments, the massive amount of positional arguments you provide skip this part and lead to the error when it tries to use combinator later.

Source code: https://snakemake.readthedocs.io/en/v6.5.4/_modules/snakemake/io.html

Source https://stackoverflow.com/questions/70600649

QUESTION

Missing input files after defining them in function

Asked 2022-Jan-05 at 06:18

I am trying to do QC on RNAseq data that is tarballed. I am using Snakemake as a workflow manager and am aware that Snakemake does not like one-to-many rules. I defining a checkpoint would fix the problem but when I run the script I get this this error message with rule fastqc.

...

ANSWER

Answered 2022-Jan-05 at 06:18

First, glob_wildcards(INPUTDIR + "{basenames}_R1.fastq.gz") returns a Wildcards object that contains the key:value pair for each wildcard. If you want to get the basenames, it should be glob_wildcards(INPUTDIR + "{basenames}_R1.fastq.gz").basenames

Second, I assume all fastq.gz files are generated by decompress_h1n1 checkpoint, since you already include the fastqc output in aggregate_decompress_h1n1 function. You shouldn't include those outputs again in rule all, it leads to snakemake try to do fastqc before checkpoint got executed.

Third, you should also put your trim_qc outputs in the aggregate_decompress_h1n1 function, basically it's the same issue as fastqc, probably the same for salmon related rules too.

There might be a potential issue, I noticed you use wrapper for fastqc, I remember that official fastqc wrapper requires the output must have html and zip, while you have a raw prefix. But I didn't see a 0.80.3 release in official document, not sure if you are using some other repository to access wrapper

Source https://stackoverflow.com/questions/70587426

QUESTION

MissingRule Exception in Snakemake

Asked 2021-Oct-06 at 01:17

I am new to snakemake workflow management and I'm struggling to grasp how the wildcards input works. I tried to do QC of some SRR data but the snakemake is giving the "MissingRuleException error".

my config file(config.yaml) contain the content:

samples: sample.csv

path: /Users/path/Bioinformatics/srr_practice

sample.csv is

...

ANSWER

Answered 2021-Sep-27 at 12:16

By issues command snakemake -np Snakefile you are asking snakemake to produce Snakefile, and it doesn't know how to do it.

If your file is named Snakefile, there is no need to specify its name, if it has a different name then you can specify it using -s option. So right now, running snakemake -n should be sufficient to show you what snakemake would run.

Source https://stackoverflow.com/questions/69346283

QUESTION

Combine outputs of mutually exclusive processes in a Nextflow (DSL2) pipeline

Asked 2021-Aug-09 at 12:48

I have a DSL2 workflow in Nextflow set up like this:

...

ANSWER

Answered 2021-Aug-09 at 12:48

I was able to figure this out, with a lot of trial and error.

Assigning a variable to a process output acts like the .out property of said process. So I set the same variable for the two exclusive processes, set the same outputs (as seen in the question) and then accessed them directly without using .out:

Source https://stackoverflow.com/questions/68710124

QUESTION

Force a certain rule to execute at the end

Asked 2021-Jul-13 at 19:12

My question is very similar to this one.

I am writing a snakemake pipeline, and it does a lot pre- and post-alignment quality control. At the end of the pipeline, I run multiQC on those QC results.

Basically, the workflow is: preprocessing -> fastqc -> alignment -> post-alignment QCs such as picard, qualimap, and preseq -> peak calling -> motif analysis -> multiQC.

MultiQC should generate a report on all those outputs as long as multiQC support them.

One way to force multiqc to run at the very end is to include all the output files from the above rules in the input directive of multiqc rule, as below:

...

ANSWER

Answered 2021-Jul-03 at 16:38

From your comments I gather that what you really want to do is run a flexibly configured number of QC methods and then summarise them in the end. The summary should only run, once all the QC methods you want to run have completed.

Rather than forcing the MultiQC rule to be executed in the end, manually, you can set up the MultiQC rule in such a way that it automatically gets executed in the end - by requiring the QC method's output as input.

Your goal of flexibly configuring which QC rules to run can be easily achieved by passing the names of the QC rules through a config file, or even easier as a command line argument.

Here is a minimal working example for you to extend:

Source https://stackoverflow.com/questions/68118582

QUESTION

Snakemake --use-conda with --cluster and NFS4 storage

Asked 2021-Apr-01 at 11:12

I am using snakemake in cluster mode to submit a simple one rule workflow to the HPCC, which runs Torque with several compute nodes. The NFSv4 storage is mounted on /data. There is a link /PROJECT_DIR -> /data/PROJECT_DIR/

I submit the job using:

...

ANSWER

Answered 2021-Apr-01 at 11:12

Solved by providing a jobscript (--jobscript SCRIPT) with:

Source https://stackoverflow.com/questions/66868935

QUESTION

Error in Snakemake 'Unexpected keyword expand in rule definition'

Asked 2021-Feb-09 at 05:19

As in the title, my Snakefile is giving me a SyntaxError for the expand function in the all rule. I am aware that this is typically caused by whitespace/indentation errors HOWEVER I have confirmed that there are no tabs in the file. I've gone through an deleted every whitespace as well as searched the file with grep. I appreciate any advice.
Error Message:

SyntaxError in line 14 of /PATH/to/Snakefile:
Unexpected keyword expand in rule definition (Snakefile, line 14)

Code:

...

ANSWER

Answered 2021-Feb-09 at 05:19

This is an error from python as the rule all has two functions separated by a comma. In this case the second expand call is causing the error. You could replace the , with a + to resolve the error like given below.

Source https://stackoverflow.com/questions/66112918

QUESTION

snakemake - Missing input files for rule all

Asked 2021-Feb-03 at 19:16

I am trying to create a pipeline that will take a user-configured directory in config.yml (where they have downloaded a project directory of .fastq.gz files from BaseSpace), to run fastqc on sequence files. I already have the downstream steps of merging the fastqs by lane and running fastqc on the merged files.

However, the wildcards are giving me problems running fastqc on the original basespace files. The following is my error when I try running snakemake.

...

ANSWER

Answered 2021-Jan-29 at 13:18

In your rule all you have:

Source https://stackoverflow.com/questions/65948047

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install FastQC

You can download it from GitHub.
You can use FastQC like any standard Java library. Please include the the jar files in your classpath. You can also use any IDE and you can run and debug the FastQC component as you would do with any other Java program. Best practice is to use a build tool that supports dependency management such as Maven or Gradle. For Maven installation, please refer maven.apache.org. For Gradle installation, please refer gradle.org .

Support

For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .

Find more information at: