From 29c812cd891e1c8e8fecefb900f629c0c6956373 Mon Sep 17 00:00:00 2001
From: wiz
Date: Mon, 19 Jun 2017 20:28:49 +0000
Subject: [PATCH] Updated miller to 5.2.0.

This release contains mostly feature requests.

Features:

The stats1 verb now lets you use regular expressions to specify which
field names to compute statistics on, and/or which to group by. Full
details are here.

The min and max DSL functions, and the min/max/percentile aggregators
for the stats1 and merge-fields verbs, now support numeric as well as
string field values. (For mixed string/numeric fields, numbers compare
before strings.) This means in particular that order statistics -- min,
max, and non-interpolated percentiles -- as well as mode, antimode, and
count are now possible on string-only (or mixed) fields. (Of course,
any operations requiring arithmetic on values, such as computing sums,
averages, or interpolated percentiles, yield an error on string-valued
input.)

There is a new DSL function mapexcept which returns a copy of the
argument with specified key(s), if any, unset. The motivating use-case
is to split records to multiple filenames depending on particular field
value, which is omitted from the output:

  mlr --from f.dat put 'tee > "/tmp/data-".$a, mapexcept($*, "a")'

Likewise, mapselect returns a copy of the argument with only specified
key(s), if any, set. This resolves #137.

A new -u option for count-distinct allows unlashed counts for multiple
field names. For example, with -f a,b and without -u, count-distinct
computes counts for distinct pairs of a and b field values. With -f a,b
and with -u, it computes counts for distinct a field values and counts
for distinct b field values separately.

If you build from source, you can now do ./configure without first
doing autoreconf -fiv. This resolves #131.

The UTF-8 BOM sequence 0xef 0xbb 0xbf is now automatically ignored from
the start of CSV files. (The same is already done for JSON files.) This
resolves #138.

For put and filter with -S, program literals such as the 6 in $x = 6
were being parsed as strings. This is not sensible, since the -S option
for put and filter is intended to suppress numeric conversion of record
data, not program literals. To get string 6 one may use $x = "6".

Documentation:

A new cookbook example shows how to compute differences between
successive queries, e.g. to find out what changed in time-varying data
when you run and rerun a SQL query.

Another new cookbook example shows how to compute interquartile ranges.

A third new cookbook example shows how to compute weighted means.

Bugfixes:

CRLF line-endings were not being correctly autodetected when I/O
formats were specified using --c2j et al.

Integer division by zero was causing a fatal runtime exception, rather
than computing inf or nan as in the floating-point case.
---
 textproc/miller/Makefile |  4 ++--
 textproc/miller/distinfo | 10 +++++-----
 2 files changed, 7 insertions(+), 7 deletions(-)

diff --git a/textproc/miller/Makefile b/textproc/miller/Makefile
index d7fbc5e1c42bb..88b05fe78e081 100644
--- a/textproc/miller/Makefile
+++ b/textproc/miller/Makefile
@@ -1,6 +1,6 @@
-# $NetBSD: Makefile,v 1.11 2017/04/19 13:39:38 wiz Exp $
+# $NetBSD: Makefile,v 1.12 2017/06/19 20:28:49 wiz Exp $
 
-DISTNAME= mlr-5.1.0
+DISTNAME= mlr-5.2.0
 PKGNAME= ${DISTNAME:S/mlr/miller/}
 CATEGORIES= devel
 MASTER_SITES= ${MASTER_SITE_GITHUB:=johnkerl/}
diff --git a/textproc/miller/distinfo b/textproc/miller/distinfo
index b781bdca96b88..12c0610336cb3 100644
--- a/textproc/miller/distinfo
+++ b/textproc/miller/distinfo
@@ -1,6 +1,6 @@
-$NetBSD: distinfo,v 1.12 2017/04/19 13:39:38 wiz Exp $
+$NetBSD: distinfo,v 1.13 2017/06/19 20:28:49 wiz Exp $
 
-SHA1 (mlr-5.1.0.tar.gz) = 7647bfe39b9dcff7c5319acb59ae64a761c48547
-RMD160 (mlr-5.1.0.tar.gz) = 29907d15d970d1f2965252cbcd7403d30c9d9f6a
-SHA512 (mlr-5.1.0.tar.gz) = 2cc16e920dfee0715679141359b71851e5b461ce58bd70817550fb4b2f667d73cec6a824cfa6c199e27836184171c58f80cbce386cd6839fb7755bff5b4df370
-Size (mlr-5.1.0.tar.gz) = 1189051 bytes
+SHA1 (mlr-5.2.0.tar.gz) = f33f07902578f1f4338a2b40ac52ceee9d25434d
+RMD160 (mlr-5.2.0.tar.gz) = 8c53517710ec3b3f68c80ef0e376cd7ce67e343f
+SHA512 (mlr-5.2.0.tar.gz) = b202ee5007c720336e0fddfbfe3629f51d0eee3d9a7b0e4bc54b157398182572549a40c869da2c2205770f49ee548eca7f8080d4e46e1c804275a2e76c1dbe7e
+Size (mlr-5.2.0.tar.gz) = 1188564 bytes