This is a set of command line utilities for manipulating large tabular data files. Files of numeric and text data commonly found in machine learning and data mining environments. Filtering, sampling, statistics, joins, and more. These tools are especially useful when working with large data sets. They run faster than other tools providing similar functionality, often by significant margins. See Pe
{{#tags}}- {{label}}
{{/tags}}