datamash | Send questions

by agordon | Language: C | Version: v1.8 | License: GPL-3.0


kandi X-RAY | datamash Summary

datamash is a command-line program written in C. It has no reported bugs, no reported vulnerabilities, a Strong Copyleft license, and low support. You can download it from GitHub.

GNU Datamash is a command-line program which performs basic numeric, textual and statistical operations on input textual data files. It is designed to be portable and reliable, and to help researchers easily automate analysis pipelines without writing code or even short scripts.

Support

datamash has a low-activity ecosystem.

It has 67 stars and 3 forks. There are 5 watchers for this library.

It has had no major release in the last 6 months.

datamash has no issues reported. There are no pull requests.

It has a neutral sentiment in the developer community.

The latest version of datamash is v1.8.

Quality

datamash has no bugs reported.

Security

datamash has no vulnerabilities reported, and its dependent libraries have no vulnerabilities reported.

License

datamash is licensed under the GPL-3.0 License. This license is Strong Copyleft.

Strong Copyleft licenses enforce sharing, and you can use them when creating open source projects.

Reuse

datamash releases are not available. You will need to build from source code and install.

Installation instructions, examples and code snippets are available.

Community Discussions

Trending Discussions on datamash

datamash tool : transposition rows->columns and inversely - Shortcut with vim

Datamash 1.7 outputs zero on floating point values binning

Transposing text files and adding columns

Bash or Awk Script to Merge Lines Where X Number of Fields Match While Creating Ranges in Fields that Don't Match

Calculate median of a sliding window with awk

Convert multiple rows into a single column

How to fast sum values in directed graph in shell?

Count combinations of packages (row values) per user (unique value)

Transpose a file and fill missing fields

how to escape the < <() idiom in perl

QUESTION

datamash tool : transposition rows->columns and inversely - Shortcut with vim

Asked 2020-Sep-04 at 21:10

I have just discovered the tool datamash, which can transpose rows into columns and columns into rows.

How can I define a shortcut in ~/.vimrc that applies the transposition in both directions?

I tried the following mapping (the delimiter is whitespace): noremap :% !datamash transpose -W

I don't know how to specify the currently open file on which I want to toggle between rows/columns and, inversely, columns/rows.

Can anyone see how to perform this transposition with a simple shortcut in vim?

EDIT: everything works fine with the following setting in ~/.vimrc:

...

ANSWER

Answered 2020-Sep-04 at 21:10

Try including by the end of the noremap line.

Source https://stackoverflow.com/questions/63433346
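
For readers unfamiliar with the operation discussed in this question, here is a minimal Python sketch of whitespace-delimited transposition, the kind of reshaping that datamash transpose -W is used for. It is an illustration only, not datamash's implementation: the function name transpose_text and the sample input are hypothetical, the sketch assumes every row has the same number of fields, and it joins output fields with tabs as an arbitrary choice.

```python
def transpose_text(text: str) -> str:
    """Swap rows and columns of whitespace-delimited text."""
    rows = [line.split() for line in text.splitlines() if line.strip()]
    if len({len(row) for row in rows}) > 1:
        raise ValueError("all rows must have the same number of fields")
    # zip(*rows) yields the columns of the input, which become the output rows.
    return "\n".join("\t".join(column) for column in zip(*rows))


# Hypothetical two-row input.
sample = "a b c\n1 2 3"
print(transpose_text(sample))
# Transposing twice restores the original layout (with tab separators).
print(transpose_text(transpose_text(sample)))
```

Because applying the function twice gives back the original rows, a single shortcut bound to the transposition is enough to toggle between the two layouts.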

QUESTION

Datamash 1.7 outputs zero on floating point values binning

Asked 2020-Jul-18 at 21:21

I am using Datamash 1.7 on a CentOS 7.7 Linux x86_64 machine to sort and bin data which is 24 GB in size. The input data looks as follows (only the first 50 samples):

...

ANSWER

Answered 2020-Jul-18 at 21:21

Looking at the source (since, unfortunately, binning isn't described very well in the documentation), numeric binning is done by this code:

Source https://stackoverflow.com/questions/62970836

QUESTION

Transposing text files and adding columns

Asked 2020-Apr-28 at 20:13

I have a large text file with 100000 rows and columns, like this:

...

ANSWER

Answered 2020-Apr-28 at 20:05

Please use df.melt and drop the variable column

Source https://stackoverflow.com/questions/61488332

QUESTION

Bash or Awk Script to Merge Lines Where X Number of Fields Match While Creating Ranges in Fields that Don't Match

Asked 2020-Apr-01 at 02:42

NEED: I have a file containing data like the sample below. I need to:

Merge all lines into one line when X number of fields match
Create range of values in X number of fields when the values vary

In this case: Merge all lines into one where fields $1 through $6 and $8 match, create a range of the associated values in field $7

Merge all lines into one where fields $1 through $7 match, create a range of the associated values in field $8

The solution has to be something native to Linux (like a bash or awk script) that doesn't require installation of additional software (e.g. datamash).

@rtx13's solution below using TCL does work (thanks again). I'm just not sure if I can install TCL in my live environment, so I hope an AWK/Bash/etc. solution can also be proposed.

Original data:

...

ANSWER

Answered 2020-Mar-31 at 03:14

The following works on one column at a time.

mergecolumn script:

Source https://stackoverflow.com/questions/60942362

QUESTION

Calculate median of a sliding window with awk

Asked 2020-Mar-24 at 23:58

I need to produce a sliding window of millions of lines and to calculate the median of column 3. My data looks like this with column 1 always being the same, column 2 equaling the line number and column 3 being the information that I need the median for:

...

ANSWER

Answered 2020-Mar-24 at 14:16

The following script with GNU awk seems to generate the output you presented:

Source https://stackoverflow.com/questions/60832286
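
The sliding-window median described in this question can be sketched in Python as follows. This is not the GNU awk script from the answer, which is not reproduced here; the window size, the function name sliding_medians, and the sample values are assumptions made for illustration.

```python
from collections import deque
from statistics import median


def sliding_medians(values, window):
    """Yield the median of each full window of `window` consecutive values."""
    buf = deque(maxlen=window)  # the oldest value drops out automatically
    for value in values:
        buf.append(value)
        if len(buf) == window:
            yield median(buf)


# Hypothetical column-3 values; a real run would read them from the file.
column3 = [5, 1, 4, 2, 3]
print(list(sliding_medians(column3, window=3)))
```

The sketch re-sorts each window when it computes the median, which is simple but costs more as the window grows. For millions of lines and a large window, a structure that updates the median incrementally would likely be faster.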

QUESTION

Convert multiple rows into a single column

Asked 2020-Jan-22 at 14:29

I'm trying to "convert" the following file from multiple rows into a single column.

classr#94 mesur#237 high#228 cash#232
classr#118 mesur#332 high#430 cash#421 Sar#380 classr#57 mesur#89 hight#65

My desired output:

classr#94
mesur#237
high#228
cash#232
classr#118
mesur#332
high#430
cash#421
Sar#380 classr#57
mesur#89
hight#65

I tried datamash -t: transpose < Filename but it converted my file in a very "weird" way.

I also tried grep -o # File_name but I got only the #.

I think that in the grep case, if I find a way to get the entire word, I will obtain the desired output.

...

ANSWER

Answered 2020-Jan-22 at 14:29

cat filetoconvert | tr " " "\n"

Source https://stackoverflow.com/questions/59861551
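
For comparison, here is a small Python sketch of the same idea as the tr command in the answer: every space-separated token goes on its own line. The function name one_token_per_line is hypothetical, the sample is abridged from the question's input, and unlike tr, str.split() also collapses runs of whitespace and existing newlines.

```python
def one_token_per_line(text: str) -> str:
    """Put every whitespace-separated token on its own line."""
    return "\n".join(text.split())


# Abridged from the input shown in the question.
sample = "classr#94 mesur#237 high#228 cash#232\nclassr#118 mesur#332"
print(one_token_per_line(sample))
```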

QUESTION

How to fast sum values in directed graph in shell?

Asked 2019-Aug-28 at 19:23

I have a directed graph with about 2000 nodes stored in a file. Each line represents an edge from the node in the first column to the node in the second column, so the data is easy to visualize, for example with dot(1). Columns are separated by tabs, rows are separated by newlines, and node names consist of the characters a-zA-Z0-9_. The graph can have multiple roots, and it may contain cycles, which should be ignored: I don't care about cycles, they are redundant, but they can occur in the input. Below is an example of the graph, using tr to convert spaces to tabs and a here-document to make the input file easy to reproduce:

...

ANSWER

Answered 2019-Aug-28 at 19:23

Since no one has posted an answer yet, here is an awk solution as a starting point:

Source https://stackoverflow.com/questions/57559614

QUESTION

Count combinations of packages (row values) per user (unique value)

Asked 2017-Nov-25 at 00:19

Consider the following:

...

ANSWER

Answered 2017-Nov-25 at 00:19

You need to concatenate the packages into a single string (ordered by package) for each customer, then you can count by that concatenated string:

For Postgres:

Source https://stackoverflow.com/questions/47480966
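
The approach in this answer (build one ordered, concatenated package string per customer, then count identical strings) can be sketched in Python. This is not the answer's SQL; the sample rows, the name count_package_combinations, and the comma separator are assumptions for illustration, and the sketch assumes a repeated (user, package) row should count only once.

```python
from collections import Counter, defaultdict


def count_package_combinations(rows):
    """Count how many users share each combination of packages.

    `rows` is an iterable of (user, package) pairs.
    """
    packages_by_user = defaultdict(set)
    for user, package in rows:
        packages_by_user[user].add(package)
    # Sorting makes the concatenated key independent of row order.
    return Counter(",".join(sorted(pkgs)) for pkgs in packages_by_user.values())


# Hypothetical data: users 1 and 3 hold the same combination.
rows = [(1, "b"), (1, "a"), (2, "a"), (3, "a"), (3, "b")]
print(count_package_combinations(rows))
```

Sorting before joining is the important step: without it, the same set of packages could produce different keys depending on the order of the input rows.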

QUESTION

Transpose a file and fill missing fields

Asked 2017-Nov-07 at 00:45

I have tried several awk and sed commands and GNU datamash to change the format of this data file and code the missing fields as "??", with no success. I have a file with a format that looks like the following:

...

ANSWER

Answered 2017-Nov-07 at 00:41

awk to the rescue!

with true multidimensional arrays it would be easier, but this works for most awks

Source https://stackoverflow.com/questions/47146615

QUESTION

how to escape the < <() idiom in perl

Asked 2017-Aug-04 at 16:49

I have a system command which I am trying to execute, but it gives me the error "Syntax error: redirection unexpected"

Trying command:

...

ANSWER

Answered 2017-Aug-04 at 16:49

Backticks aka readpipe expect a command passed to sh (or cmd in Windows). You appear to have a bash command rather than a sh command. Fixed:

Source https://stackoverflow.com/questions/45493781

Community Discussions, Code Snippets contain sources that include Stack Exchange Network

Vulnerabilities

No vulnerabilities reported

Install datamash

Download the latest source code at https://www.gnu.org/software/datamash . See platform/OS-specific download instructions at https://www.gnu.org/software/datamash/download/.

To build from the latest git sources, see the HACKING.md file. This file is available when cloning from git, but is not distributed in the tar archive. To clone the git repository, run:

git clone git://git.savannah.gnu.org/datamash.git

HACKING.md is also available online at https://git.savannah.gnu.org/cgit/datamash.git/tree/HACKING.md.

Support

Please send questions and bug reports to bug-datamash@gnu.org . Searchable archive at https://lists.gnu.org/archive/html/bug-datamash . Subscribe at https://lists.gnu.org/mailman/listinfo/bug-datamash .
