bioawk | BWK awk modified for biological data | Genomics library

 by   lh3 C Version: v1.0 License: No License

kandi X-RAY | bioawk Summary

kandi X-RAY | bioawk Summary

bioawk is a C library typically used in Artificial Intelligence, Genomics applications. bioawk has no bugs, it has no vulnerabilities and it has low support. You can download it from GitHub.

Bioawk is an extension to Brian Kernighan's awk, adding the support of several common biological data formats, including optionally gzip'ed BED, GFF, SAM, VCF, FASTA/Q and TAB-delimited formats with column names. It also adds a few built-in functions and an command line option to use TAB as the input/output delimiter. When the new functionality is not used, bioawk is intended to behave exactly the same as the original BWK awk. The original awk requires a YACC-compatible parser generator (e.g. Byacc or Bison). Bioawk further depends on zlib so as to work with gzip'd files.

            kandi-support Support

              bioawk has a low active ecosystem.
              It has 476 star(s) with 104 fork(s). There are 35 watchers for this library.
              It had no major release in the last 12 months.
              There are 17 open issues and 12 have been closed. On average issues are closed in 212 days. There are 3 open pull requests and 0 closed requests.
              It has a neutral sentiment in the developer community.
              The latest version of bioawk is v1.0

            kandi-Quality Quality

              bioawk has 0 bugs and 0 code smells.

            kandi-Security Security

              bioawk has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.
              bioawk code analysis shows 0 unresolved vulnerabilities.
              There are 0 security hotspots that need review.

            kandi-License License

              bioawk does not have a standard license declared.
              Check the repository for any license declaration and review the terms closely.
              Without a license, all rights are reserved, and you cannot use the library in your applications.

            kandi-Reuse Reuse

              bioawk releases are available to install and integrate.
              Installation instructions are not available. Examples and code snippets are available.

            Top functions reviewed by kandi - BETA

            kandi's functional review helps you automatically verify the functionalities of the libraries and avoid rework.
            Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of bioawk
            Get all kandi verified functions for this library.

            bioawk Key Features

            No Key Features are available at this moment for bioawk.

            bioawk Examples and Code Snippets

            No Code Snippets are available at this moment for bioawk.

            Community Discussions


            Select sequences in a fasta file with more than 300 aa and "C" occurs at least 4 times
            Asked 2018-Oct-08 at 14:59

            I have a fasta file which contains protein sequences. I'd like to select sequences with more than 300 amino acids and Cysteine (C) amino acid appears more than 4 times.

            I've used this command to select sequences with more than 300 aa:



            Answered 2018-Oct-08 at 14:59

            I do not know bioawk but I assume it is identical to awk with some initial parsing and constant definitions.

            I would proceed as follows. Assuming you want the find the strings with more then 4 times the letter C in and a length of more than 300, then you could do :


            Community Discussions, Code Snippets contain sources that include Stack Exchange Network


            No vulnerabilities reported

            Install bioawk

            You can download it from GitHub.


            For any new features, suggestions and bugs create an issue on GitHub. If you have any questions check and ask questions on community page Stack Overflow .
            Find more information at:

            Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items

            Find more libraries
          • HTTPS


          • CLI

            gh repo clone lh3/bioawk

          • sshUrl


          • Stay Updated

            Subscribe to our newsletter for trending solutions and developer bootcamps

            Agree to Sign up and Terms & Conditions

            Share this Page

            share link