Select columns by name #25

Llammissar · 2017-02-02T17:37:39Z

Another day, another feature request. ;)

I was doing some ad hoc data mangling late yesterday and kept thinking it would be very useful to be able to refer to columns by name. Part of it was the constant checking of which data had which column number. Especially once I started using multiple tools in pipelines, it would have saved time to be able to just call things by name. A simple example:
tsv-join -k 1,2 -f transaction-rate.tsv -a 3 average-crossover-times.tsv | tsv-select -f 1,2,7 --rest last
I'd find it more readable (and thus better for use in larger scripts) to be able to do something like this:
tsv-join -k cores,threads -f transaction-rate.tsv -a 't/m' average-crossover-times.tsv | tsv-select -f cores,threads,'t/m' --rest last

The text was updated successfully, but these errors were encountered:

jondegenhardt · 2017-02-03T04:45:59Z

Yup, agreed. It's on the list.

nickray · 2017-07-27T11:02:15Z

Is there any progress on this issue? Otherwise, I would like to start implementing towards a pull request. Are there any code guidlines besides https://ebay.github.io/tsv-utils-dlang/docs/AboutTheCode.html?

jondegenhardt · 2017-07-27T15:59:10Z

@nickray No, I haven't had time recently to do this. It'd be great if you wanted to do it. Code guidelines are the same as Phobos (see D Style). The other key things are consistency between all the tools, primarily command line argument consistency, and unit tests. It won't be hard, but several of the tools will require some restructuring at the top-level, tsv-summarize especially. Helper functions/classes would probably make this simpler. Still, it won't be a trivial amount of work.

If you take this on, I suggest starting with a one or two tools and submit a pull request before proceeding. That'll give me a chance to provide feedback.

wavefancy · 2018-02-26T17:21:26Z

Is there any progress on this issue? Very useful feature. :-)

jondegenhardt · 2018-02-26T19:26:02Z

@wavefancy I haven't done any further work with specifying columns via names. Not sure when I'll get to it, I don't have time for it currently. Though not hard, it's a reasonable amount of work because at present all the tools assume the columns numbers are known at the conclusion of command line argument processing. There's a bit of design work involved to change this so it works smoothly and consistently across all the tools.

jondegenhardt · 2019-11-05T03:17:37Z

Did a review of this, and some of the key internal building blocks are now available in the code. In particular, the "fields list" processing code and BufferedByLine in utils.d provide some the key abstractions. It'll still a reasonable bit of work, don't know when I'll get to it, but it's perhaps not as far off as I once thought.

jondegenhardt · 2020-07-11T04:53:00Z

Enhancement complete as part of release v2.0.0. Yah!

jondegenhardt added the enhancement label Feb 3, 2017

jammur added the enhancement label Jun 14, 2018

jondegenhardt mentioned this issue Apr 5, 2020

InputSourceRange #281

Merged

jondegenhardt mentioned this issue May 26, 2020

Experimental named field support in tsv-select #284

Merged

This was referenced Jun 2, 2020

Experimental named field support in tsv-filter #285

Merged

Experimental named field support in tsv-summarize #286

Merged

Experimental named field support in tsv-join #288

Merged

This was referenced Jun 17, 2020

Fieldlists: Refactor command line arg processing, part 2 #294

Merged

Fieldlist help #295

Merged

jondegenhardt mentioned this issue Jul 5, 2020

Fieldlist documentation, part 1 #297

Merged

jondegenhardt closed this as completed Jul 11, 2020

jondegenhardt added the fixed/done label Jul 11, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Select columns by name #25

Select columns by name #25

Llammissar commented Feb 2, 2017

jondegenhardt commented Feb 3, 2017

nickray commented Jul 27, 2017

jondegenhardt commented Jul 27, 2017

wavefancy commented Feb 26, 2018

jondegenhardt commented Feb 26, 2018

jondegenhardt commented Nov 5, 2019

jondegenhardt commented Jul 11, 2020 •

edited

Loading

Select columns by name #25

Select columns by name #25

Comments

Llammissar commented Feb 2, 2017

jondegenhardt commented Feb 3, 2017

nickray commented Jul 27, 2017

jondegenhardt commented Jul 27, 2017

wavefancy commented Feb 26, 2018

jondegenhardt commented Feb 26, 2018

jondegenhardt commented Nov 5, 2019

jondegenhardt commented Jul 11, 2020 • edited Loading

jondegenhardt commented Jul 11, 2020 •

edited

Loading