Releases: dataflint/spark
Version 0.2.6
- NVIDIA RAPIDS support
- Improved support for proxy servers in front of the Spark/History Server
- bug fixes
Version 0.2.5
Fix "Return to History Server" button when history server is behind a proxy
Full Changelog: v0.2.4...v0.2.5
Version 0.2.4
- Added driver memory usage visibility & alerts
- Update README
- bug fixes
Version 0.2.3
- New alert - Large Data Broadcast, for broadcasting large datasets with the broadcast() function
- New alert - Large Filter Conditions, for writing long filter conditions instead of using join logic
- UI Improvements
Version 0.2.2
Support Spark 2.4 logs in a History Server running version 3.2 or later
A limited feature set is available, because Spark 2.4 events contain less data than Spark 3.0 and up
Version 0.2.1
- Better Databricks stage-to-node support
- Support spark.dataflint.runId in custom history server providers when appId is not the spark appId
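Assuming spark.dataflint.runId is a regular Spark configuration key, a custom run ID could presumably be passed like any other conf (the ID value and application file below are hypothetical placeholders):

```shell
# Sketch: supply a custom run ID for a history server provider whose appId
# differs from the Spark appId ("my-run-123" and my_app.py are hypothetical)
spark-submit \
  --conf spark.dataflint.runId=my-run-123 \
  my_app.py
```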
Version 0.2.0
- Better support for Databricks Photon plans
- Input nodes show partition filters and push-down filters
- Stage Breakdown - press the blue down arrow on a SQL node to see stage information
- New alert - large number of small tasks
Version 0.1.7
- Apache Iceberg alert improvements
- Add average file size in read/write metrics
- More information when hovering over a stage
Version 0.1.6
Apache Iceberg support:
- Better node naming
- Read metrics and reading small files alerts
- Write metrics and overwriting most of table alerts
Write metrics require enabling an Iceberg metrics reporter. This can be done automatically by setting spark.dataflint.iceberg.autoCatalogDiscovery to true, or by setting the Iceberg metrics reporter manually for each catalog, for example:
spark.sql.catalog.[catalog name].metrics-reporter-impl org.apache.spark.dataflint.iceberg.DataflintIcebergMetricsReporter
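The two configuration options above could be passed at submit time roughly as follows (a sketch: the catalog name "my_catalog" and the application file are hypothetical placeholders, not part of the release notes):

```shell
# Option 1 (sketch): let DataFlint discover Iceberg catalogs automatically
spark-submit \
  --conf spark.dataflint.iceberg.autoCatalogDiscovery=true \
  my_app.py

# Option 2 (sketch): set the metrics reporter per catalog
# ("my_catalog" is a hypothetical catalog name)
spark-submit \
  --conf spark.sql.catalog.my_catalog.metrics-reporter-impl=org.apache.spark.dataflint.iceberg.DataflintIcebergMetricsReporter \
  my_app.py
```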
Version 0.1.5
- Add support for history server with cluster-mode jobs (i.e. with attempt number)
- Fix "wasted cores" calculation
- Fix flickering in the status tab when a SQL query contains subqueries