bowtie2 | A fast and sensitive gapped read aligner | Genomics library
kandi X-RAY | bowtie2 Summary
Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes. Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome, its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes.
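For orientation, a typical invocation might look like the sketch below (file names are invented; flags as documented in the Bowtie 2 manual; this fragment is not runnable without Bowtie 2 installed):

```
bowtie2-build reference.fa ref_index                          # build the FM Index once
bowtie2 -x ref_index -1 r1.fastq -2 r2.fastq -S out.sam       # paired-end, end-to-end mode
bowtie2 --local -x ref_index -U reads.fastq -S out_local.sam  # local alignment mode
```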
Community Discussions
Trending Discussions on bowtie2
QUESTION
I have recently downloaded Bowtie2-2.3.4 via the precompiled binaries and added the file location to my PATH. I have also installed Strawberry Perl, and added that location to my PATH as well.
When I try to determine whether bowtie2 is correctly installed by typing bowtie2, bowtie2 --version, or bowtie2-inspect, I get the message: "Can't open perl script "C:\Users\lberd\Documents\Python": No such file or directory". If I create an empty directory called Python in my documents folder, it gives me "Can't open perl script "C:\Users\lberd\Documents\Python": Permission denied".
I have Python 3.9.5 installed, and its location is in my PATH as well.
What is bowtie2 looking for in the python folder?
...ANSWER
Answered 2021-Jun-02 at 16:08 Found the problem: the bowtie2 binaries were saved in a folder called "python scripts". Bowtie 2 did not know how to handle the space in the folder name, which is why it stopped at "python".
QUESTION
Hello, I just want to apply a loop to a set of files, but instead of running it on all my files I want to apply the loop only to certain files in a directory.
Here is the command I use; it is a bowtie2-based alignment of genomic sequences:
...ANSWER
Answered 2020-Nov-06 at 22:04 Create 2 files, each with 1 basename per line: (1) your inputs, here the read-1 FASTQ base file names, and (2) your existing outputs, here the BAM base file names. Sort the files and use comm -23 file1 file2 > file3 to select only the basenames that have not been mapped yet. Then loop over those, saved in file3.
Quick and dirty solution (assuming the filenames do not have whitespace):
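A self-contained sketch of that approach, with invented sample file names (replace the echo with the real bowtie2 command line):

```shell
cd "$(mktemp -d)"
touch sampleA_R1.fastq sampleB_R1.fastq sampleA.bam   # sampleA is already mapped

ls *_R1.fastq | sed 's/_R1\.fastq$//' | sort > inputs.txt   # all read-1 basenames
ls *.bam      | sed 's/\.bam$//'      | sort > done.txt     # already-mapped basenames
comm -23 inputs.txt done.txt > todo.txt                     # in inputs but not in done

while read -r base; do
    # replace echo with the real bowtie2 invocation
    echo bowtie2 -x index -1 "${base}_R1.fastq" -2 "${base}_R2.fastq" -S "${base}.sam"
done < todo.txt
```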
QUESTION
I have a pipeline which uses a global singularity image and rule-based conda wrappers.
However, some of the tools don't have wrappers (i.e. htslib's bgzip and tabix).
Now I need to learn how to run jobs in containers.
In the official documentation link it says: "Allowed image urls entail everything supported by singularity (e.g., shub:// and docker://)."
Now I've tried the following image from Singularity Hub, but I get an error.
Minimal reproducible example: config.yaml
...ANSWER
Answered 2020-Sep-24 at 17:38 Using another container solves the issue; however, the fact that I'm getting errors from biocontainers is troubling, given that these are both very common and used as examples in the literature, so I will award the top answer to whomever can solve that specific issue.
As it were, the use of stackleader/bgzip-utility solves the issue of actually running this rule in a container.
QUESTION
I'm trying to build a bacterial genome database from all the published sequences, so that I can calculate the coverage of my reads against this database using bowtie2 for mapping. To do that, I merged all the genome sequences I downloaded from NCBI into one FASTA library (I merged 74 files into one FASTA file). The problem is that this FASTA file (the library I created) contains a lot of duplicated sequences, and that affected the coverage in a big way. So I'm asking whether there is any way to eliminate the duplication in my library file, or to merge the sequences without introducing duplication, or any other way to calculate the coverage of my reads against reference sequences.
I hope I'm clear enough, please tell me if there's anything not clear.
...ANSWER
Answered 2020-Apr-23 at 21:56 If you have control over your setup, then you could install seqkit and run the following on your FASTA file:
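If installing seqkit is not an option, a minimal coreutils fallback is sketched below. Note it deduplicates only records with identical header lines, which is weaker than seqkit's sequence-based rmdup; the toy FASTA data is invented for illustration:

```shell
cd "$(mktemp -d)"
# Toy FASTA with one duplicated record
printf '>seq1\nACGT\n>seq2\nGGCC\n>seq1\nACGT\n' > library.fasta

# Keep the first record for each header; drop later records with the same header.
awk '/^>/{keep = !seen[$0]++} keep' library.fasta > dedup.fasta
```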
QUESTION
I'm running the following command using subprocess.check_call
['/home/user/anaconda3/envs/hum2/bin/bowtie2-build', '-f', '/media/user/extra/tmp/subhm/sub_humann2_temp/sub_custom_chocophlan_database.ffn', '/media/user/extra/tmp/subhm/sub_humann2_temp/sub_bowtie2_index', ' --threads 8']
But for some reason, it ignores the --threads argument and runs on one thread only. I've checked outside of python with the same command that the threads are launched. This only happens when calling from subprocess, any idea on how to fix this?
thanks
...ANSWER
Answered 2020-Feb-23 at 01:45 You are passing '--threads 8' and not '--threads', '8'. It could also be '--threads=8', but I don't know the command.
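The point is about argv boundaries: each element of the Python list becomes exactly one argument, so the single string '--threads 8' (with the embedded space) is not the same as the two arguments '--threads' and '8'. A small shell illustration, using printf with one %s per argument to make the boundaries visible:

```shell
printf '[%s]\n' '--threads' '8'    # two argv elements: correct
printf '[%s]\n' '--threads 8'      # one argv element containing a space: wrong
```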
QUESTION
I know this is a common error and I already checked other posts, but it didn't resolve my issue. I would like to use the name of the database I use for the SortMeRNA rule (rRNAdb=config["rRNA_database"]) the same way I use version=config["genome_version"]. But obviously, I can't.
ANSWER
Answered 2020-Jan-21 at 16:26 All output files need to have the same wildcards, or else it would cause a conflict in resolving job dependencies. Not all files in output: have the {rRNAdb} wildcard, which is causing this problem. For example, if you have two {rRNAdb} values, both would write to the file "{OUTDIR}/temp/{sample}.fastq", which snakemake correctly doesn't allow.
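A hypothetical sketch of the fix, carrying the {rRNAdb} wildcard through every output so that different database values cannot collide on one file name (rule and file names are invented):

```
rule sortmerna:
    output:
        fastq = "{OUTDIR}/temp/{sample}.{rRNAdb}.fastq",
        log   = "{OUTDIR}/logs/{sample}.{rRNAdb}.log"
```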
QUESTION
I have files with names like:
...ANSWER
Answered 2020-Jan-16 at 20:31 If you loop over all the R1 and R2 files, you'll run bowtie for all possible pairs of data files. If I understand correctly, that's not what you want - you only want to process the corresponding pairs.
To do that, loop over R1 files only, and try to find the corresponding R2 file for each:
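A self-contained sketch of that pairing loop, with invented file names (replace the echo with the real bowtie2 command; note the unmatched C_R1.fastq is skipped):

```shell
cd "$(mktemp -d)"
touch A_R1.fastq A_R2.fastq B_R1.fastq B_R2.fastq C_R1.fastq   # C has no mate

for r1 in *_R1.fastq; do
    # derive the corresponding R2 file name from the R1 name
    r2=$(printf '%s\n' "$r1" | sed 's/_R1\.fastq$/_R2.fastq/')
    if [ -e "$r2" ]; then
        # replace echo with the real bowtie2 invocation
        echo bowtie2 -x index -1 "$r1" -2 "$r2" -S "${r1%_R1.fastq}.sam"
    fi
done
```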
QUESTION
I am running a snakemake pipeline on a Slurm HPC. Occasionally, jobs will fail due to exceeded wall time or memory. Such failed jobs do not create log files, or their log files are deleted as part of Snakemake's automatic removal of files associated with failed jobs. It would be convenient to get the logging information for failed jobs so that I could more easily understand why a job failed.
I currently have log params set for each job, and the cluster.json file then references those logs for each job specifically. A general rule, its cluster.json entry, and my snakemake call are shown below.
...ANSWER
Answered 2019-Oct-29 at 09:25 You need to redirect the output of the command to the log file.
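A minimal sketch of that redirection, capturing both stdout and stderr (file name invented; in a snakemake rule the equivalent would be shell: "bowtie2 ... > {log} 2>&1"):

```shell
cd "$(mktemp -d)"
log=run.log
# Redirect stdout AND stderr of the command into the log file
{ echo "normal output"; echo "error output" >&2; } > "$log" 2>&1
```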
QUESTION
I have written some code in a bash shell script (so I can submit it to my university's supercomputer) to edit out contaminant sequences from a batch of DNA extracts I have. Essentially, what this code does is take the sequences from the negative extraction blank I did (A1-BLANK) and subtract them from all of the other samples.
I have figured out how to get this to work with individual samples, but I'm attempting to write a for loop so that the little chunks of code are repeated for each sample, the outcome being a .sam file with a unique name for each sample, in which both the forward and reverse reads for the sample are merged and edited for contamination. I have checked Stack Overflow extensively for help with this specific problem, but haven't been able to apply related answered questions to my code.
Here's an example of part of what I'm trying to do for an individual sample, named F10-61C-3-V4_S78_L001_R1_001.fastq:
ANSWER
Answered 2019-Aug-07 at 19:33 Here's a first try. Note that I assume the entire fragment between do and done is one command, and therefore needs continuation markers (\).
Also note that in my example "$file" occurs twice. I feel a bit uneasy about this, but you seem to explicitly need this in your described example.
And finally, note that I am giving the .sam file just a numeric name, because I don't really know what you would like that name to be.
I hope this provides enough information to get you started.
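A self-contained sketch of the loop shape described above (sample file names invented; replace the echo with the real command): the fragment between do and done is one command split with \ continuation markers, "$file" occurs twice, and the .sam output gets a numeric name.

```shell
cd "$(mktemp -d)"
touch S1_R1.fastq S2_R1.fastq    # invented sample files
i=0
for file in *_R1.fastq; do
    i=$((i + 1))
    # one long command split over lines with continuation markers;
    # replace echo with the real merge/alignment tool
    echo bowtie2 -x blank_index \
        -1 "$file" \
        -2 "$file" \
        -S "$i.sam"
done
```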
QUESTION
I am running a very common bioinformatics tool/command, bowtie2-build. It can use multiple threads on a single node (it is not an MPI-type job). I have the following sbatch script (basically):
ANSWER
Answered 2018-Dec-03 at 08:31 For same-node computations (multithreaded, etc.), srun is not mandatory, but using it offers better control and better feedback from Slurm.
If your program is started with srun, it will be easier for Slurm to manage it (send UNIX signals, kill it if it uses more resources than requested, etc.), and the sstat command will be able to provide you with near-real-time memory usage, CPU efficiency, etc.
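A minimal sbatch sketch along these lines (job name, resource numbers, and file names are invented; only meaningful on a Slurm cluster):

```
#!/bin/bash
#SBATCH --job-name=bt2-build
#SBATCH --cpus-per-task=8
#SBATCH --mem=16G

# Launching through srun lets Slurm track the step, signal it,
# and report near-real-time usage via sstat.
srun bowtie2-build --threads "$SLURM_CPUS_PER_TASK" reference.fa ref_index
```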
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities: No vulnerabilities reported