release 0.188-tw-0.43 #123

Yaliang · 2017-11-08T20:47:19Z

No description provided.

processBatch() is responsible for telling the caller whether it has produced a page, yielded due to time, or needs to shrink the batch size. Currently a local variable and a global variable are used to pass this information back to the caller, which is not elegant. Unify the interface to return a class that informs the caller of the state returned by processBatch().
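
To illustrate the idea of a single return type, here is a minimal sketch. The names `ProcessBatchResult`, `State`, and the factory methods are hypothetical and are not taken from the Presto source; a `String` stands in for a real page type.

```java
import java.util.Optional;

// Hypothetical sketch: one result object replaces the separate local and global flags.
final class ProcessBatchResult
{
    enum State { PAGE_PRODUCED, YIELDED, BATCH_TOO_LARGE }

    private final State state;
    private final Optional<String> page; // String is a placeholder for a real Page type

    private ProcessBatchResult(State state, Optional<String> page)
    {
        this.state = state;
        this.page = page;
    }

    static ProcessBatchResult pageProduced(String page)
    {
        return new ProcessBatchResult(State.PAGE_PRODUCED, Optional.of(page));
    }

    static ProcessBatchResult yielded()
    {
        return new ProcessBatchResult(State.YIELDED, Optional.empty());
    }

    static ProcessBatchResult batchTooLarge()
    {
        return new ProcessBatchResult(State.BATCH_TOO_LARGE, Optional.empty());
    }

    State getState()
    {
        return state;
    }

    // Present only when the state is PAGE_PRODUCED.
    Optional<String> getPage()
    {
        return page;
    }
}
```

With a type like this, the caller can switch on `getState()` instead of reading side-channel variables.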

Add a map in SliceBigArray to track the underlying data of the slices it stores. The map aims to avoid under- or over-counting the memory usage. In production, we found the original approximation can be off by 2x.
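
The counting problem can be sketched as follows, assuming that several slices may share one backing array. `SharedArrayTracker` is a hypothetical name and this is not the actual SliceBigArray or ReferenceCountMap code; it only shows why identity-based reference counts keep the retained size accurate.

```java
import java.util.IdentityHashMap;
import java.util.Map;

// Hypothetical sketch: each shared backing array is counted once,
// no matter how many slices point at it.
final class SharedArrayTracker
{
    private final Map<byte[], Integer> referenceCounts = new IdentityHashMap<>();
    private long retainedBytes;

    void add(byte[] base)
    {
        int count = referenceCounts.merge(base, 1, Integer::sum);
        if (count == 1) {
            retainedBytes += base.length; // first reference: start counting this array
        }
    }

    void remove(byte[] base)
    {
        Integer count = referenceCounts.get(base);
        if (count == null) {
            throw new IllegalStateException("array is not tracked");
        }
        if (count == 1) {
            referenceCounts.remove(base);
            retainedBytes -= base.length; // last reference gone: stop counting this array
        }
        else {
            referenceCounts.put(base, count - 1);
        }
    }

    long getRetainedBytes()
    {
        return retainedBytes;
    }
}
```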

`fieldMappings` is an `ImmutableList`, so it cannot contain nulls.

We have observed repeated compilation of MethodHandles that leads to full GCs, and we noticed that flushing the specialized function cache mitigates the problem. We suspect a JVM bug related to stale or corrupted profiling data associated with generated classes and/or dynamically created MethodHandles. Flushing might also mitigate problems such as deoptimization storms or unintended interpreted execution.

Reverse the sign of the offset when the TZ is defined in Etc/GMT+/-<offset> format
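
For context, zone IDs of the form Etc/GMT+H use the POSIX sign convention, so the sign in the name is the opposite of the UTC offset. The small check below uses only `java.time`; the class name `EtcGmtSign` is made up for this illustration and is unrelated to the Presto change itself.

```java
import java.time.Instant;
import java.time.ZoneId;
import java.time.ZoneOffset;

final class EtcGmtSign
{
    private EtcGmtSign() {}

    // Returns the UTC offset that the JDK time zone data assigns to a zone ID.
    static ZoneOffset offsetOf(String zoneId)
    {
        return ZoneId.of(zoneId).getRules().getOffset(Instant.EPOCH);
    }
}
```

For example, `EtcGmtSign.offsetOf("Etc/GMT+5")` yields an offset of -05:00, and `EtcGmtSign.offsetOf("Etc/GMT-3")` yields +03:00.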

When iterating over the drivers, a snapshot is taken to guarantee a consistent view, so iterating directly on the drivers should be safe.

When `SpillAwareLookupSourceProvider.close` runs concurrently with `PartitionedLookupSourceFactory.closeCachedLookupSources`, `LookupSource.close` can be invoked twice concurrently on a single `LookupSource` instance. This is a bug, since `LookupSource` doesn't need to be thread-safe.

With the current db-backed resource manager, supporting multiple clusters requires multiple databases (one database per cluster). With this change, the schema of the db-backed resource manager supports multiple clusters through a new environment column, and the resource manager loads the right configuration based on the configured environment.

Flattened plans contain textual representations of every fragment, the plan tree, and a flattened list of nodes. The flattened list is constructed using a visitor with a custom serializer which produces only ids and types for children. This format was selected to simplify post-hoc plan analysis.

Since `PartitionedLookupSource` implements `Closeable`, the `close()` method should be idempotent.

Since `PartitionedLookupSource` encapsulates an array of `LookupSource` instances, `PartitionedLookupSource.close` should close all the partitions too. Currently the `LookupSource` implementations that have a no-op `close` are `OuterLookupSource` (used for RIGHT OUTER JOIN when there would be only a single partition) and `IndexLookupSource` (used instead of `PartitionedLookupSource` when JOIN is optimized as index-join), so this commit effectively does not change anything, but it makes the code more future-proof.
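
The two properties described above (an idempotent `close()` that also closes every partition) can be sketched like this. `PartitionedResource` is a hypothetical stand-in, not the Presto class, and the use of `AtomicBoolean` is one possible way to guard against a second call.

```java
import java.io.Closeable;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical sketch: close() does its work at most once and closes all partitions.
final class PartitionedResource
        implements Closeable
{
    private final Closeable[] partitions;
    private final AtomicBoolean closed = new AtomicBoolean();

    PartitionedResource(Closeable... partitions)
    {
        this.partitions = partitions.clone();
    }

    @Override
    public void close()
    {
        if (!closed.compareAndSet(false, true)) {
            return; // already closed: later calls have no effect
        }
        // Simplification: the first failing partition aborts the loop.
        for (Closeable partition : partitions) {
            try {
                partition.close();
            }
            catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
    }
}
```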

In production, we found that users may flush a single cell with a size of 1GB. This can cause Hive writers to use a huge amount of memory. Add stats to monitor the distribution of pages flushed.

We have observed deoptimization storms that lead to slowness. We suspect a JVM bug related to stale or corrupted profiling data associated with generated classes.

This particular test takes 2 seconds on an idle Mac. The original 4-second timeout is too close, especially on a shared server.

The transaction was marked as cleared on the statement immediately following the START TRANSACTION statement.

This change fixes a corner case where we can pass -1 as offset to Block::getRegion(). This happens when fromIndex = -N, length = N, and array size is (N - 1).
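
The arithmetic behind this corner case can be worked through with a small sketch. It assumes 1-based indexing where a negative fromIndex counts back from the end of the array; `SliceBounds.startOffset` is a hypothetical helper, not the actual ArraySliceFunction code.

```java
import java.util.OptionalInt;

final class SliceBounds
{
    private SliceBounds() {}

    // Hypothetical helper: maps a 1-based, possibly negative fromIndex to a zero-based offset.
    // Returns empty when the start position lies outside the array.
    static OptionalInt startOffset(int arraySize, int fromIndex)
    {
        if (fromIndex == 0) {
            throw new IllegalArgumentException("fromIndex must not be 0 with 1-based indexing");
        }
        // A negative index counts back from the end: -1 is the last element.
        int oneBased = fromIndex < 0 ? arraySize + fromIndex + 1 : fromIndex;
        // With fromIndex = -N and arraySize = N - 1, oneBased is 0; without this
        // check the zero-based offset would be -1.
        if (oneBased < 1 || oneBased > arraySize) {
            return OptionalInt.empty();
        }
        return OptionalInt.of(oneBased - 1);
    }
}
```

Taking N = 3, an array of size 2 with fromIndex = -3 gives a 1-based position of 2 - 3 + 1 = 0, which would become offset -1 if it were not rejected.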

When block.copyRegion() is used to copy the whole block and the block is already in a compact representation, the original block can be reused and no memory allocation is needed.
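
A minimal sketch of this shortcut, assuming an immutable block backed by a window over a `long[]`. `LongWindow` and its methods are hypothetical and much simpler than Presto's Block implementations.

```java
import java.util.Arrays;

// Hypothetical stand-in for an immutable block that views part of a long[].
final class LongWindow
{
    private final long[] values;
    private final int offset;
    private final int length;

    LongWindow(long[] values, int offset, int length)
    {
        if (offset < 0 || length < 0 || offset + length > values.length) {
            throw new IndexOutOfBoundsException();
        }
        this.values = values;
        this.offset = offset;
        this.length = length;
    }

    // Compact means the window covers the entire backing array.
    boolean isCompact()
    {
        return offset == 0 && length == values.length;
    }

    long get(int position)
    {
        return values[offset + position];
    }

    LongWindow copyRegion(int position, int count)
    {
        if (position < 0 || count < 0 || position + count > length) {
            throw new IndexOutOfBoundsException();
        }
        if (position == 0 && count == length && isCompact()) {
            return this; // whole compact block: safe to reuse because it is immutable
        }
        long[] copy = Arrays.copyOfRange(values, offset + position, offset + position + count);
        return new LongWindow(copy, 0, count);
    }
}
```

Returning `this` is only safe here because the sketch treats the block as immutable; a mutable structure would still need a real copy.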

If a slice is already in compact representation, the original slice can be used instead of copying it.

The following heuristic is used in PagesIndex.compact() and Page.compact() to decide whether a block should be compacted to save memory: `if (block.getSizeInBytes() < block.getRetainedSizeInBytes()) { ...... }`. The purpose is to avoid unnecessarily compacting a block that is already in a compact representation. However, the retained size is always larger than the logical size, because the retained size includes the block instance size, so the condition always holds. We have optimized Block.copyRegion() so that the original block is returned when copying a whole compact block. Thus, this heuristic is no longer necessary.

Fixes prestodb#9025

nezihyigitbasi and others added 30 commits October 19, 2017 12:40

[maven-release-plugin] prepare release 0.187

4e7977f

[maven-release-plugin] prepare for next development iteration

c8ca80a

Update ReferenceCountMap method names to be more accurate

ef54384

Add reference counting to SliceBigArray

c6f1833

Add a map in SliceBigArray to track the underlying data of the slices it stores. The map aims to avoid under- or over-counting the memory usage. In production, we found the original approximation can be off by 2x.

Fix code style in ExpressionAnalyzer

7c61d92

Fix map access with wrong key generic type

d652085

Remove unused method

9355c91

Fail when no symbol for field is found

cf0d418

Remove redundant null check

b08d438

`fieldMappings` is an `ImmutableList`, so it cannot contain nulls.

Improve performance of Analysis.getType(Expression)

26b8167

Fix formatting

168703a

Add warning message to 0.186 release notes

8bcd7cd

Minor fixes for Bing tiles documentation

cd858e0

Fix SHOW FUNCTIONS documentation for stddev

4640232

Fix typo in method name

6d2dab7

Fix inverted sign for time zones Etc/GMT(+/-)H[H]

1101b47

Reverse the sign of the offset when the TZ is defined in Etc/GMT+/-<offset> format

Add current memory to QueryProgressStats

b580545

Remove copy of drivers when getting pipeline status

669c041

When iterating over the drivers, a snapshot is taken to guarantee a consistent view, so iterating directly on the drivers should be safe.

Build immutable map for prepared statements

fe9611d

Fix product test for stddev functions

2560456

Move S3ConfigurationUpdater to S3 package

6ed99fa

Add support for using EMRFS with Hive connector

609882f

Fix merging for JoinOperatorInfo

13a595a

Fix typo in exception message

45f5fe8

Fix unused import and update TestJoinOperator

bcee065

findepi and others added 29 commits October 29, 2017 23:03

Make PartitionedLookupSource.close idempotent

043ae8b

Since `PartitionedLookupSource` implements `Closeable`, the `close()` method should be idempotent.

Add page size stats to HivePageSink

130a525

In production, we found that users may flush a single cell with a size of 1GB. This can cause Hive writers to use a huge amount of memory. Add stats to monitor the distribution of pages flushed.

Expire projection and filter cache entries after one hour

4920ba4

We have observed deoptimization storms that lead to slowness. We suspect a JVM bug related to stale or corrupted profiling data associated with generated classes.

Create runner in test method and run as single threaded

1127ad0

Close runner properly in TestTpchDistributedStats

c2a4c81

Update to airlift 0.155

b98b483

Increase test timeout in TestPrestoDriver jdbc test

aaf7807

This particular test takes 2 seconds on an idle Mac. The original 4-second timeout is too close, especially on a shared server.

Run CLI table name completion in separate transaction

f85ff2f

Fix transaction support in client and CLI

0c8cfb6

The transaction was marked as cleared on the statement immediately following the START TRANSACTION statement.

Support client tags in resource group selector

dc872ba

Fix bound check in ArraySliceFunction

c16af0e

This change fixes a corner case where we can pass -1 as offset to Block::getRegion(). This happens when fromIndex = -N, length = N, and array size is (N - 1).

Upgrade to slice 0.32

9eb8617

Minor style and variable name fix for AbstractMapBlock

2c7f833

Avoid memory allocation when copying a whole compact block

f91d56d

When block.copyRegion() is used to copy the whole block and the block is already in a compact representation, the original block can be reused and no memory allocation is needed.

Fix VariableWidthBlockBuilder.copyRegion to return a compact block

97897c0

Improve SliceArrayBlock.copyRegion perf by skipping compacted slice

b9bceeb

If a slice is already in compact representation, the original slice can be used instead of copying it.

Move ExpressionAnalyzer's Scope to Context

a91f0d1

Use Scope to resolve lambda arguments in ExpressionAnalyzer

6d55dd2

Support lambda captures using dereference expressions

ac827ea

Introduce Analysis.isColumnReference shorthand

e09344a

Fix planning when lambda argument shadows relation column

9721731

Fixes prestodb#9025

Revert 5 commits that fix analysis/planning for lambda arguments

9750c2e

Add release notes for 0.188

e4cd095

[maven-release-plugin] prepare release 0.188

98ee712

Switch the version number to oss base version 0.188

c1ed0b8

Merge 0.188 version

e9305f6

Bump up version to 0.188-tw-0.43

0360705

Yaliang merged commit 0239495 into twitter-forks:twitter-master Nov 8, 2017
