
SPARK-19794 Release HDFS Client after read/write checkpoint #17135

Closed
darionyaphet wants to merge 1 commit

Conversation

darionyaphet

What changes were proposed in this pull request?

Close the HDFS client and streams after reading from and writing to HDFS.

How was this patch tested?

(Please explain how this patch was tested. E.g. unit tests, integration tests, manual tests)
(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

Please review http://spark.apache.org/contributing.html before opening a pull request.

@srowen
Member

srowen commented Mar 2, 2017

I get the idea, but I'm not sure any of these are valid.

logInfo(s"Final output path $finalOutputPath already exists; not overwriting it")
if (!fs.delete(tempOutputPath, false)) {
logWarning(s"Error deleting ${tempOutputPath}")
try {
Member

Given that this doesn't encompass the span of usage for fs -- better to just call fs.close() at the end and not worry about manually closing in an error case? or expand the try-finally?

Actually, I am not sure we are supposed to call FileSystem.close() because they are shared instances, cached and reused across the whole application.

Contributor

Agreed with @srowen. FileSystem is a cached object; closing it removes it from the cache. I don't think we need to call this explicitly, because by default it is designed to be shared.
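The cache semantics the reviewers describe can be sketched with a minimal stand-in for Hadoop's internal FileSystem cache. Note that `SharedFs` and `FsCache` below are made-up names for illustration only; the real cache in `org.apache.hadoop.fs.FileSystem` is keyed on scheme, authority, and user, but the hazard is the same: `get` hands every caller the same instance, so one caller's `close()` breaks all the others.

```scala
import scala.collection.mutable

// Hypothetical stand-in for a cached, shared FileSystem client.
class SharedFs(val uri: String) {
  private var closed = false
  def read(path: String): String = {
    if (closed) throw new java.io.IOException("Filesystem closed")
    s"contents of $path"
  }
  def close(): Unit = closed = true
}

// Hypothetical stand-in for the FileSystem cache: get() returns the
// same instance for the same URI, so close() affects every holder.
object FsCache {
  private val cache = mutable.Map.empty[String, SharedFs]
  def get(uri: String): SharedFs =
    cache.getOrElseUpdate(uri, new SharedFs(uri))
}

object CacheDemo {
  def main(args: Array[String]): Unit = {
    val fs1 = FsCache.get("hdfs://nn:8020")
    val fs2 = FsCache.get("hdfs://nn:8020")
    assert(fs1 eq fs2)  // same cached instance, not a copy
    fs1.close()         // "our" code is done with it...
    // ...but any other task still holding fs2 is now broken:
    try { fs2.read("/checkpoint/part-0"); assert(false, "expected IOException") }
    catch { case _: java.io.IOException => () }
  }
}
```

This is why the reviewers object: in Spark, many tasks in the same JVM share one cached `FileSystem`, so an explicit `close()` in checkpoint code can fail unrelated reads elsewhere.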

@@ -216,6 +221,8 @@ private[spark] object ReliableCheckpointRDD extends Logging {
serializeStream.writeObject(partitioner)
} {
serializeStream.close()
fileOutputStream.close()
Member

Ditto, this is OK if serializeStream.close() doesn't actually close the underlying stream (?) but not sure about the next line.
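For reference, the JDK's own serialization stream does close the stream it wraps, which is what makes an extra `close()` on the underlying stream redundant at best. (Whether Spark's `SerializationStream` behaves the same depends on the serializer in use; the `TrackingOut` class below is made up purely to observe the behavior.)

```scala
import java.io.{ByteArrayOutputStream, ObjectOutputStream, OutputStream}

// Made-up wrapper that records whether close() reached the underlying stream.
class TrackingOut(under: OutputStream) extends OutputStream {
  var closed = false
  override def write(b: Int): Unit = under.write(b)
  override def close(): Unit = { closed = true; under.close() }
}

object CloseDemo {
  def main(args: Array[String]): Unit = {
    val tracking = new TrackingOut(new ByteArrayOutputStream())
    val oos = new ObjectOutputStream(tracking)
    oos.writeObject("partitioner")
    oos.close()             // closing the wrapper...
    assert(tracking.closed) // ...also closed the underlying stream
  }
}
```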

@@ -279,8 +287,13 @@ private[spark] object ReliableCheckpointRDD extends Logging {

// Register an on-task-completion callback to close the input stream.
context.addTaskCompletionListener(context => deserializeStream.close())

deserializeStream.asIterator.asInstanceOf[Iterator[T]]
Utils.tryWithSafeFinally {
Member

I don't think you can close it here, right? you're returning an iterator on the stream

Contributor

This code will introduce an issue: deserializeStream should only be closed after iteration has finished, but the code here closes the stream prematurely.

Also please look at L289; it already takes care of closing the stream after the task is finished.
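The pattern the existing code relies on, deferring `close()` until the consumer of the iterator is done rather than closing before returning, can be sketched without Spark. The `CompletionIterator` below is a simplified stand-in for Spark's `org.apache.spark.util.CompletionIterator`, and `FakeStream` is invented for the demo:

```scala
import java.io.Closeable

// Simplified stand-in for Spark's CompletionIterator: runs a completion
// callback exactly once, after the wrapped iterator is exhausted.
class CompletionIterator[A](sub: Iterator[A], completion: () => Unit)
    extends Iterator[A] {
  private var completed = false
  def hasNext: Boolean = {
    val more = sub.hasNext
    if (!more && !completed) { completed = true; completion() }
    more
  }
  def next(): A = sub.next()
}

object DeferredCloseDemo {
  class FakeStream extends Closeable {
    var closed = false
    def records: Iterator[Int] = Iterator(1, 2, 3)
    def close(): Unit = closed = true
  }

  def main(args: Array[String]): Unit = {
    val stream = new FakeStream
    // Close when the consumer finishes, not when we return the iterator.
    val it = new CompletionIterator[Int](stream.records, () => stream.close())
    assert(!stream.closed) // still open while the task is reading
    assert(it.sum == 6)    // consuming the iterator drains the records
    assert(stream.closed)  // closed only after iteration completed
  }
}
```

Closing eagerly inside a `tryWithSafeFinally`, as the patch does, would shut the stream before the returned iterator is ever consumed; deferring the close to a completion callback (or a task-completion listener, as the existing code does) avoids that.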

@SparkQA

SparkQA commented Mar 2, 2017

Test build #3592 has finished for PR 17135 at commit 60754bd.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@zsxwing
Member

zsxwing commented Mar 3, 2017

I remember FileSystem is cached internally by default. Closing it will probably introduce a performance regression. Did you see any case where the FileSystem cache doesn't work properly?

@srowen
Member

srowen commented Mar 3, 2017

Yes this is substantially not something we can merge, so let's close this.

@vanzin
Contributor

vanzin commented Mar 3, 2017

It's not just a matter of a performance regression: it will break any other code that holds a reference to the file system being closed. -1.

srowen added a commit to srowen/spark that referenced this pull request Mar 22, 2017
@srowen srowen mentioned this pull request Mar 22, 2017
@asfgit asfgit closed this in b70c03a Mar 23, 2017
6 participants