Go Makefile Shell
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
bench
cmd
doc
man/md
test
.editorconfig
.gitignore
.goreleaser.yml
.travis.yml
CHANGELOG.md
Gopkg.lock
Gopkg.toml
LICENSE
Makefile
README.md
doc.go
interval.go
list.go
logo.png
main.go
math.go
math_test.go
pair.go
pipe.go
string.go
util.go

README.md

datautils logo

datautils

The best toolbox for processing textual data.

Release License Travis Coverage Status Go Report Card


Introduction & Rationale

The Data Utilities are a collection of handy text manipulation tools. These tools are supposed to make a data wrangler’s life on the command-line easier.

Much of the functionality can be solved with standard command-line tools (awk, sed, cut, sort, uniq, …), but that would often become tedious. Zealots of the Unix philosophy will probably not use these tools and create a set of sophisticated aliases instead.

On the other hand, some of the tools fix actual problems. The tools use UTF-8 by default. As a consequence, one does not have to deal with the quirks of sort and uniq w.r.t. non-ASCII input.

Tool Overview

These tools are part of the collection:

  • count
  • norm
  • rows
  • text
  • trim

Usage Examples

count

norm

$ echo "¹²³" | norm --nfc
¹²³

$ echo "¹²³" | norm --nfkc
123

rows

text

trim

$ echo "   abc" | trim -l
abc

Installation

Debian & Ubuntu

snap

sudo apt-get install snapd
sudo snap install --channel=candidate datautils
sudo snap alias datautils.norm count
sudo snap alias datautils.norm norm
sudo snap alias datautils.norm rows
sudo snap alias datautils.norm text
sudo snap alias datautils.trim trim

apt

sudo add-apt-repository ppa:sfischer13/datautils
sudo apt-get update
sudo apt-get install datautils

Developers

go get

go get github.com/sfischer13/datautils/...

go dep

go get -u github.com/golang/dep/cmd/dep
git clone https://github.com/sfischer13/datautils.git
cd datautils
dep ensure
go install

Credits

This project is authored and maintained by Stefan Fischer.
The source code is available under the MIT License.
See LICENSE for further details.