Updated miller to 5.2.0.
This release contains mostly feature requests.

Features:

    The stats1 verb now lets you use regular expressions to specify
    which field names to compute statistics on, and/or which to
    group by. Full details are in the stats1 documentation.

    The min and max DSL functions, and the min/max/percentile
    aggregators for the stats1 and merge-fields verbs, now support
    string as well as numeric field values. (For mixed string/numeric
    fields, numbers compare before strings.) This means in particular
    that order statistics -- min, max, and non-interpolated percentiles
    -- as well as mode, antimode, and count are now possible on
    string-only (or mixed) fields. (Of course, any operations
    requiring arithmetic on values, such as computing sums, averages,
    or interpolated percentiles, yield an error on string-valued
    input.)

    There is a new DSL function mapexcept which returns a copy of
    the argument with specified key(s), if any, unset. The motivating
    use-case is to split records into multiple files depending
    on a particular field's value, which is omitted from the output:
    mlr --from f.dat put 'tee > "/tmp/data-".$a, mapexcept($*, "a")'
    Likewise, mapselect returns a copy of the argument with only
    specified key(s), if any, set. This resolves #137.

    A new -u option for count-distinct allows unlashed counts for
    multiple field names. For example, with -f a,b and without -u,
    count-distinct computes counts for distinct pairs of a and b
    field values. With -f a,b and with -u, it computes counts for
    distinct a field values and counts for distinct b field values
    separately.

    If you build from source, you can now do ./configure without
    first doing autoreconf -fiv. This resolves #131.

    The UTF-8 BOM sequence 0xef 0xbb 0xbf is now automatically
    skipped at the start of CSV files. (The same is already done
    for JSON files.) This resolves #138.

    For put and filter with -S, program literals such as the 6 in
    $x = 6 were being parsed as strings. This is not sensible, since
    the -S option for put and filter is intended to suppress numeric
    conversion of record data, not program literals. To get string
    6 one may use $x = "6".
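
As an illustration of the count-distinct -u item above, the difference between lashed and unlashed counts can be sketched in Python. This is not Miller's implementation or its output format; the sample records and the helper names lashed_counts and unlashed_counts are hypothetical, chosen only to show the two counting modes.

```python
from collections import Counter

# Hypothetical sample records; the field names a and b follow the
# -f a,b example in the release notes.
records = [
    {"a": "pan", "b": "wye"},
    {"a": "pan", "b": "zee"},
    {"a": "eks", "b": "wye"},
    {"a": "pan", "b": "wye"},
]


def lashed_counts(records, fields):
    """Count distinct tuples of the given fields (the mode without -u)."""
    return Counter(tuple(r[f] for f in fields) for r in records)


def unlashed_counts(records, fields):
    """Count distinct values of each field separately (the mode with -u)."""
    return {f: Counter(r[f] for r in records) for f in fields}


lashed = lashed_counts(records, ["a", "b"])
unlashed = unlashed_counts(records, ["a", "b"])
```

With these four sample records, the lashed count for the pair (pan, wye) is 2, while the unlashed counts are 3 for pan in field a and 3 for wye in field b.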

Documentation:

    A new cookbook example shows how to compute differences between
    successive queries, e.g. to find out what changed in time-varying
    data when you run and rerun a SQL query.

    Another new cookbook example shows how to compute interquartile
    ranges.

    A third new cookbook example shows how to compute weighted
    means.

Bugfixes:

    CRLF line-endings were not being correctly autodetected when
    I/O formats were specified using --c2j et al.

    Integer division by zero was causing a fatal runtime exception,
    rather than computing inf or nan as in the floating-point case.
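
The integer division fix above can be pictured with a small Python sketch. It assumes that "the floating-point case" follows the IEEE 754 convention (a nonzero value divided by zero gives a signed infinity, and 0/0 gives NaN); the function name int_divide is hypothetical, and this is not Miller's code.

```python
import math


def int_divide(x: int, y: int) -> float:
    """Divide two integers without raising on a zero divisor.

    Assumed convention (IEEE 754 style): nonzero/0 is a signed
    infinity and 0/0 is NaN.
    """
    if y != 0:
        return x / y
    if x == 0:
        return math.nan
    return math.inf if x > 0 else -math.inf
```

Plain Python raises ZeroDivisionError for 1 / 0, which is comparable to the fatal runtime exception the release notes describe; the explicit zero check is what avoids it in this sketch.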
wiz committed Jun 19, 2017
1 parent 9de296e commit 29c812c
Showing 2 changed files with 7 additions and 7 deletions.
4 changes: 2 additions & 2 deletions textproc/miller/Makefile
@@ -1,6 +1,6 @@
-# $NetBSD: Makefile,v 1.11 2017/04/19 13:39:38 wiz Exp $
+# $NetBSD: Makefile,v 1.12 2017/06/19 20:28:49 wiz Exp $

-DISTNAME= mlr-5.1.0
+DISTNAME= mlr-5.2.0
 PKGNAME= ${DISTNAME:S/mlr/miller/}
 CATEGORIES= devel
 MASTER_SITES= ${MASTER_SITE_GITHUB:=johnkerl/}
10 changes: 5 additions & 5 deletions textproc/miller/distinfo
@@ -1,6 +1,6 @@
-$NetBSD: distinfo,v 1.12 2017/04/19 13:39:38 wiz Exp $
+$NetBSD: distinfo,v 1.13 2017/06/19 20:28:49 wiz Exp $

-SHA1 (mlr-5.1.0.tar.gz) = 7647bfe39b9dcff7c5319acb59ae64a761c48547
-RMD160 (mlr-5.1.0.tar.gz) = 29907d15d970d1f2965252cbcd7403d30c9d9f6a
-SHA512 (mlr-5.1.0.tar.gz) = 2cc16e920dfee0715679141359b71851e5b461ce58bd70817550fb4b2f667d73cec6a824cfa6c199e27836184171c58f80cbce386cd6839fb7755bff5b4df370
-Size (mlr-5.1.0.tar.gz) = 1189051 bytes
+SHA1 (mlr-5.2.0.tar.gz) = f33f07902578f1f4338a2b40ac52ceee9d25434d
+RMD160 (mlr-5.2.0.tar.gz) = 8c53517710ec3b3f68c80ef0e376cd7ce67e343f
+SHA512 (mlr-5.2.0.tar.gz) = b202ee5007c720336e0fddfbfe3629f51d0eee3d9a7b0e4bc54b157398182572549a40c869da2c2205770f49ee548eca7f8080d4e46e1c804275a2e76c1dbe7e
+Size (mlr-5.2.0.tar.gz) = 1188564 bytes
