Releases: typelevel/frameless
Releases · typelevel/frameless
v0.8.0
v0.6.1
v0.5.2
v0.6.0
v0.5.1
Bug fixes and enhancements:
- Fixed null handling for nested classes (by @kmate)
- Better handling of case classes with fields of type Unit (by @kmate)
- TypedDatasets can now be created from Dataframes with different column ordering (by @mfelsche)
- Implicit derivation for Orderable types (by @martin1keogh)
- Generated bytecode classes now correctly target JVM 1.8 (by @imarios)
New method additions:
cube
androllup
aggregation operators (by @Avasil)size
for Map andisin
for values (by @ayoub-benali)- Trigonometric methods:
cos
,cosh
,sin
,sinh
,tan
,tanh
(by @Avasil) between
method for orderable values (by @Crossy147)substr
(by @bhop)
v0.5.0
Notable additions/changes:
- Great improvements in joins (theta join condition supported)
- Extended functionality for frameless-ml (details follow)
- Unifying projected and aggregated columns
- Injection for ordered columns
- Fixed multiplication for BigDecimal
- Adding a lot of missing operators, such as sort(), union(), drop(), when/otherwise (details follow)
- More documentation examples
Frameless-ml
- TypedTransformer and TypedEstimator
- TypedRandomForrestRegressor, TypedRandomForrestClassifier
- TypedIndexToString, TypedStringIndexer, TypedVectorAssembler
Encoders:
- java.math.BigDecimal
Upgrades:
- Spark to 2.2.1
- Scala to 2.11.12
Operators:
- union()
- sort()
- asCol to project entire dataset into a single column
- drop(), dropTupled()
- getOrElse()
- when()/otherwise()
- withColumn(), withColumnReplaced()
- head(n), headOption
- litAggr literal for aggregated columns
Column methods:
- abs, acts, add_months
- bin, bitwiseNot
- arrayContains
- inputFileName, monotonicallyIncreasingId
- ascii, asin, atan, atan2, base64
v0.4.1
v0.4.0
Notable additions/changes:
- support for Spark 2.2.0
- added Encoders for:
UDT
,Array
,Map
- added
explode()
on TypedColumns with types Vector/Array - added bitwise
and/or/xor
operators on TypedColumns - added
withColumn()
operator on TypedDataset - added
pivot()
aggregation - added statistical methods:
corr()
,skewness()
,kurtosis()
,cover_sample()
- migrated from
SparkContext
toSparkSessions
throughout - created the
frameless-ml
project - parameterize Spark actions over the effect type used (a much more powerful
Job[_]
) - [Internal] Improved test template
- [Internal] Moved methods that are not optimizable by Catalyst to a new package (map(), flatMap(), etc.)
- [bug] Fixed bug in computing equality of nullable types (Options)
- [bug] Fixed big-decimal devision incorrectly returning Double
v0.3.0
Notable additions/changes:
UDFs
now support columns with custom encoders (usingInjection
)map
andflatMap
onJob[A]
- more aggregation functions:
countDistinct
,approxCountDistinct
,collectList
,collectSet
,sumDistinct
- support for cats v0.9
createUnsafe
to instantiate aTypedDataset
from a SparkDataFrame
- whole dataset aggregation functions moved from
select
to an explicitagg
onTypedDataset
- bug fixes on joins, UDFs, and vector encoders