
[SPARK-12507][Streaming][Document]Expose closeFileAfterWrite and allowBatching configurations for Streaming #10453

Closed

zsxwing wants to merge 5 commits into apache:master from zsxwing:streaming-conf

Conversation

@zsxwing (Member) commented Dec 23, 2015

@SparkQA commented Dec 23, 2015

Test build #48251 has finished for PR 10453 at commit 4295137.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen (Member) commented Dec 29, 2015

Let's improve the title of items like this. "Update x" is never descriptive.

@zsxwing changed the title from "[SPARK-12507][Streaming][Document]Update Streaming configurations for 1.6" to "[SPARK-12507][Streaming][Document]Expose closeFileAfterWrite and allowBatching configurations for Streaming" on Dec 29, 2015
```html
<td><code>spark.streaming.driver.writeAheadLog.closeFileAfterWrite</code></td>
<td>false</td>
<td>
Whether to close the file after writing a write ahead log record in driver. Because S3 doesn't
```
Inline review comment from a Contributor:

I'd say on the driver instead of in driver.

@BenFradet (Contributor) commented:

I have a few comments on phrasing, but otherwise it LGTM.

```html
</tr>
<tr>
<td><code>spark.streaming.driver.writeAheadLog.allowBatching</code></td>
<td>false</td>
```
Inline review comment from @zsxwing (Member Author):

for me: the default value is true.

That's why I want to expose this one since the behavior is different from 1.5.0.
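To make the thread concrete, here is a minimal Scala sketch of how the two driver-side WAL settings documented in this PR would be applied through `SparkConf`. The app name and master are placeholders, not from this PR; per the discussion above, `closeFileAfterWrite` defaults to false while `allowBatching` defaults to true in 1.6.

```scala
import org.apache.spark.SparkConf

// Hypothetical app name/master, used only to make the sketch self-contained.
val conf = new SparkConf()
  .setMaster("local[2]")
  .setAppName("WALConfigSketch")
  // Close the WAL file after every record written on the driver
  // (needed for stores like S3 that do not support flushing).
  .set("spark.streaming.driver.writeAheadLog.closeFileAfterWrite", "true")
  // Batch WAL writes on the driver; defaults to true in 1.6 per the
  // discussion above, set explicitly here for illustration.
  .set("spark.streaming.driver.writeAheadLog.allowBatching", "true")
```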

@zsxwing (Member Author) commented Dec 29, 2015

@BenFradet Addressed. Thanks for your review.

@SparkQA commented Dec 30, 2015

Test build #48436 has finished for PR 10453 at commit bce7a29.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@brkyvz (Contributor) commented Dec 30, 2015

LGTM

@brkyvz (Contributor) commented Dec 30, 2015

Maybe we can also mention that allowBatching is not just helpful when closeFileAfterWrite is enabled, but is also very helpful for scaling to a large number of receivers (50+, for example).

@zsxwing (Member Author) commented Dec 30, 2015

> Maybe we can also mention that allowBatching is not just helpful when closeFileAfterWrite is enabled, but is also very helpful for scaling to a large number of receivers (50+, for example).

I guess "enabled" should be "disabled"? Oops, ... misunderstood...

@SparkQA commented Dec 30, 2015

Test build #48516 has finished for PR 10453 at commit 7d9b038.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@brkyvz (Contributor) commented Dec 31, 2015

@zsxwing Thanks! LGTM

```html
<td><code>spark.streaming.receiver.writeAheadLog.closeFileAfterWrite</code></td>
<td>false</td>
<td>
Whether to close the file after writing a write ahead log record on the receivers. Because S3
```
Inline review comment from a Contributor:

"Because S3 .... " --> Set this to 'true' when you want to use S3 (or any file system that does not support flushing) for the metadata WAL at the driver.

@SparkQA commented Jan 7, 2016

Test build #48971 has finished for PR 10453 at commit 4d55b03.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

```diff
@@ -1985,7 +1985,11 @@ To run a Spark Streaming applications, you need to have the following.
 to increase aggregate throughput. Additionally, it is recommended that the replication of the
 received data within Spark be disabled when the write ahead log is enabled as the log is already
 stored in a replicated storage system. This can be done by setting the storage level for the
-input stream to `StorageLevel.MEMORY_AND_DISK_SER`.
+input stream to `StorageLevel.MEMORY_AND_DISK_SER`. While using S3 (or any file system that
+does not support flushing) for Write Ahead Logs, please remember to enable
```
Inline review comment from a Contributor:

nit: "write ahead logs" is not capitalized elsewhere in this text, so please be consistent.
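To illustrate the recommendation in the changed paragraph, here is a small sketch (host, port, app name, and batch interval are placeholders) of a receiver stream whose storage level drops in-Spark replication because the WAL already stores the data in replicated storage:

```scala
import org.apache.spark.SparkConf
import org.apache.spark.storage.StorageLevel
import org.apache.spark.streaming.{Seconds, StreamingContext}

val conf = new SparkConf()
  .setAppName("WALStorageLevelSketch") // placeholder name
  .set("spark.streaming.receiver.writeAheadLog.enable", "true")
val ssc = new StreamingContext(conf, Seconds(1))

// With the WAL enabled, the log already lives in replicated storage, so use
// the single-copy MEMORY_AND_DISK_SER instead of the default replicated
// MEMORY_AND_DISK_SER_2.
val lines = ssc.socketTextStream("localhost", 9999, StorageLevel.MEMORY_AND_DISK_SER)
```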

@tdas (Contributor) commented Jan 7, 2016

Just one more comment, then LGTM.

@SparkQA commented Jan 7, 2016

Test build #48980 has finished for PR 10453 at commit 28a750d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@tdas (Contributor) commented Jan 8, 2016

LGTM. Merging this to master and 1.6. Thanks!

asfgit pushed a commit that referenced this pull request Jan 8, 2016
[SPARK-12507][Streaming][Document]Expose closeFileAfterWrite and allowBatching configurations for Streaming

/cc tdas brkyvz

Author: Shixiong Zhu <[email protected]>

Closes #10453 from zsxwing/streaming-conf.

(cherry picked from commit c94199e)
Signed-off-by: Tathagata Das <[email protected]>
@asfgit closed this in c94199e on Jan 8, 2016
@zsxwing deleted the streaming-conf branch on January 8, 2016 01:39