Official git repository for Biopython (originally converted from CVS)
Support
Quality
Security
License
Reuse
d
data-science-at-the-command-lineby jeroenjanssens
HTML 
3499
Version:Current
License: Proprietary (Proprietary)
Data Science at the Command Line
Support
Quality
Security
License
Reuse
A simple and efficient tool to parallelize Pandas operations on all available CPUs
Support
Quality
Security
License
Reuse
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Support
Quality
Security
License
Reuse
R toolkit for single cell genomics
Support
Quality
Security
License
Reuse
Repository for the main Dockerfile with the Openworm software stack and project-wide issues
Support
Quality
Security
License
Reuse
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
Support
Quality
Security
License
Reuse
RNA-seq aligner
Support
Quality
Security
License
Reuse
Single-cell analysis in Python. Scales to >1M cells.
Support
Quality
Security
License
Reuse
Conda recipes for the bioconda channel.
Support
Quality
Security
License
Reuse
A versatile pairwise aligner for genomic and spliced nucleotide sequences
Support
Quality
Security
License
Reuse
Official code repository for GATK versions 4 and up
Support
Quality
Security
License
Reuse
Tools (written in C using htslib) for manipulating next-generation sequencing data
Support
Quality
Security
License
Reuse
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
Support
Quality
Security
License
Reuse
gentle forced aligner
Support
Quality
Security
License
Reuse
Informatics for RNA-seq: A web resource for analysis on the cloud. Educational tutorials and working pipelines for RNA-seq analysis including an introduction to: cloud computing, critical file formats, reference genomes, gene annotation, expression, differential expression, alternative splicing, data visualization, and interpretation.
Support
Quality
Security
License
Reuse
Toolkit for processing sequences in FASTA/Q formats
Support
Quality
Security
License
Reuse
Data intensive science for everyone.
Support
Quality
Security
License
Reuse
s
single-cell-tutorialby theislab
Jupyter Notebook 
1076
Version:Current
License: No License (No License)
Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"
Support
Quality
Security
License
Reuse
A cross-platform and ultrafast toolkit for FASTA/Q file manipulation
Support
Quality
Security
License
Reuse
Aggregate results from bioinformatics analyses across many samples into a single report.
Support
Quality
Security
License
Reuse
MMseqs2: ultra fast and sensitive search and clustering suite
Support
Quality
Security
License
Reuse
Deep probabilistic analysis of single-cell omics data
Support
Quality
Security
License
Reuse
tools for working with genome variation graphs
Support
Quality
Security
License
Reuse
Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis
Support
Quality
Security
License
Reuse
SRA Tools
Support
Quality
Security
License
Reuse
Cloud-native genomic dataframes and batch computing
Support
Quality
Security
License
Reuse
g
getting-started-with-genomics-tools-and-resourcesby crazyhottommy
Shell 
878
Version:Current
License: No License (No License)
Unix, R and python tools for genomics and data science
Support
Quality
Security
License
Reuse
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Support
Quality
Security
License
Reuse
bedtools - the swiss army knife for genome arithmetic
Support
Quality
Security
License
Reuse
Accelerated BLAST compatible local sequence aligner.
Support
Quality
Security
License
Reuse
:bar_chart: A universal enrichment tool for interpreting omics data
Support
Quality
Security
License
Reuse
A collection of links and notes on forced alignment tools
Support
Quality
Security
License
Reuse
Scripts to download genomes from the NCBI FTP servers
Support
Quality
Security
License
Reuse
RNAseq analysis notes from Ming Tang
Support
Quality
Security
License
Reuse
🧬 gget enables efficient querying of genomic reference databases
Support
Quality
Security
License
Reuse
Trinity RNA-Seq de novo transcriptome assembly
Support
Quality
Security
License
Reuse
Python and C++ code for reading and writing genomics data.
Support
Quality
Security
License
Reuse
A scalable pipeline for designing reconfigurable organisms
Support
Quality
Security
License
Reuse
A Java Virtual Machine and Bytecode-to-IL Converter for .NET
Support
Quality
Security
License
Reuse
C library for high-throughput sequencing data formats
Support
Quality
Security
License
Reuse
Python package for building, comparing, annotating, manipulating and visualising trees. It provides a comprehensive API and a collection of command line tools, including utilities to work with the NCBI taxonomy tree.
Support
Quality
Security
License
Reuse
Bayesian haplotype-based genetic polymorphism discovery and genotyping.
Support
Quality
Security
License
Reuse
GUI/CommandLine Tool Box for biologistists to utilize NGS data.
Support
Quality
Security
License
Reuse
A high-performance, Pythonic language for bioinformatics
Support
Quality
Security
License
Reuse
Tools for mapping a sentence with arbitrary length to vector space
Support
Quality
Security
License
Reuse
:zap: :aquarius: Rapid prokaryotic genome annotation
Support
Quality
Security
License
Reuse
Deep probabilistic analysis of single-cell omics data
Support
Quality
Security
License
Reuse
Pysam is a Python module for reading and manipulating SAM/BAM/VCF/BCF files. It's a lightweight wrapper of the htslib C-API, the same one that powers samtools, bcftools, and tabix.
Support
Quality
Security
License
Reuse
In-memory nucleotide sequence k-mer counting, filtering, graph traversal and more
Support
Quality
Security
License
Reuse
b
biopythonby biopython
Official git repository for Biopython (originally converted from CVS)
Python
3633
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
d
data-science-at-the-command-lineby jeroenjanssens
Data Science at the Command Line
HTML
3499
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
p
pandarallelby nalepae
A simple and efficient tool to parallelize Pandas operations on all available CPUs
Python
3108
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
d
deepvariantby google
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Python
2819
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
s
seuratby satijalab
R toolkit for single cell genomics
R
1834
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
O
OpenWormby openworm
Repository for the main Dockerfile with the Openworm software stack and project-wide issues
Python
1762
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
f
fastpby OpenGene
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
C++
1525
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
S
Support
Quality
Security
License
Reuse
s
scanpyby scverse
Single-cell analysis in Python. Scales to >1M cells.
Python
1497
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
b
bioconda-recipesby bioconda
Conda recipes for the bioconda channel.
Shell
1491
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
minimap2by lh3
A versatile pairwise aligner for genomic and spliced nucleotide sequences
C
1468
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
g
gatkby broadinstitute
Official code repository for GATK versions 4 and up
Java
1450
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
s
samtoolsby samtools
Tools (written in C using htslib) for manipulating next-generation sequencing data
C
1358
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
b
bwaby lh3
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
C
1308
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
g
Support
Quality
Security
License
Reuse
r
rnaseq_tutorialby griffithlab
Informatics for RNA-seq: A web resource for analysis on the cloud. Educational tutorials and working pipelines for RNA-seq analysis including an introduction to: cloud computing, critical file formats, reference genomes, gene annotation, expression, differential expression, alternative splicing, data visualization, and interpretation.
R
1248
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
s
seqtkby lh3
Toolkit for processing sequences in FASTA/Q formats
C
1154
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
galaxyby galaxyproject
Data intensive science for everyone.
Python
1116
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
s
single-cell-tutorialby theislab
Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"
Jupyter Notebook
1076
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
s
seqkitby shenwei356
A cross-platform and ultrafast toolkit for FASTA/Q file manipulation
Go
1026
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
M
MultiQCby ewels
Aggregate results from bioinformatics analyses across many samples into a single report.
Python
1019
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
M
MMseqs2by soedinglab
MMseqs2: ultra fast and sensitive search and clustering suite
C
970
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
s
scvi-toolsby scverse
Deep probabilistic analysis of single-cell omics data
Python
937
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
v
vgby vgteam
tools for working with genome variation graphs
C++
931
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
b
bcbio-nextgenby bcbio
Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis
Python
923
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
Support
Quality
Security
License
Reuse
h
hailby hail-is
Cloud-native genomic dataframes and batch computing
Python
881
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
getting-started-with-genomics-tools-and-resourcesby crazyhottommy
Unix, R and python tools for genomics and data science
Shell
878
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
p
picardby broadinstitute
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Java
873
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
b
bedtools2by arq5x
bedtools - the swiss army knife for genome arithmetic
C
825
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
d
diamondby bbuchfink
Accelerated BLAST compatible local sequence aligner.
C++
821
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
c
clusterProfilerby YuLab-SMU
:bar_chart: A universal enrichment tool for interpreting omics data
R
779
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
f
forced-alignment-toolsby pettarin
A collection of links and notes on forced alignment tools
Python
767
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
n
ncbi-genome-downloadby kblin
Scripts to download genomes from the NCBI FTP servers
Python
759
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
R
RNA-seq-analysisby crazyhottommy
RNAseq analysis notes from Ming Tang
Python
757
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
ggetby pachterlab
🧬 gget enables efficient querying of genomic reference databases
Python
751
Updated: 2 y ago
License: Permissive (BSD-2-Clause)
Support
Quality
Security
License
Reuse
t
trinityrnaseqby trinityrnaseq
Trinity RNA-Seq de novo transcriptome assembly
Perl
735
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
n
nucleusby google
Python and C++ code for reading and writing genomics data.
C++
734
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
r
reconfigurable_organismsby skriegman
A scalable pipeline for designing reconfigurable organisms
C++
719
Updated: 4 y ago
License: Permissive (CC0-1.0)
Support
Quality
Security
License
Reuse
i
ikvmby ikvm-revived
A Java Virtual Machine and Bytecode-to-IL Converter for .NET
C#
717
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
h
htslibby samtools
C library for high-throughput sequencing data formats
C
696
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
e
eteby etetoolkit
Python package for building, comparing, annotating, manipulating and visualising trees. It provides a comprehensive API and a collection of command line tools, including utilities to work with the NCBI taxonomy tree.
Python
694
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
f
freebayesby freebayes
Bayesian haplotype-based genetic polymorphism discovery and genotyping.
C++
685
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
T
TBtoolsby CJ-Chen
GUI/CommandLine Tool Box for biologistists to utilize NGS data.
Shell
683
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
s
seqby seq-lang
A high-performance, Pythonic language for bioinformatics
C++
678
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
s
sentence2vecby klb3713
Tools for mapping a sentence with arbitrary length to vector space
Python
671
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
p
prokkaby tseemann
:zap: :aquarius: Rapid prokaryotic genome annotation
Perl
666
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
s
scvi-toolsby YosefLab
Deep probabilistic analysis of single-cell omics data
Python
659
Updated: 3 y ago
License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
p
pysamby pysam-developers
Pysam is a Python module for reading and manipulating SAM/BAM/VCF/BCF files. It's a lightweight wrapper of the htslib C-API, the same one that powers samtools, bcftools, and tabix.
C
658
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
k
khmerby dib-lab
In-memory nucleotide sequence k-mer counting, filtering, graph traversal and more
Python
647
Updated: 4 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse