Upgrade to 2.2.0 #18

JonathonO · 2024-02-08T12:50:53Z

Conflict files:

.github/workflows/IntegrationTest.yml
.github/workflows/PerfTest.yml
pom.xml
pom_confluent.xml
src/main/java/com/snowflake/kafka/connector/Utils.java
src/main/java/com/snowflake/kafka/connector/internal/InternalUtils.java
src/main/java/com/snowflake/kafka/connector/internal/SnowflakeConnectionService.java
src/main/java/com/snowflake/kafka/connector/internal/streaming/SchematizationUtils.java
src/main/java/com/snowflake/kafka/connector/records/RecordService.java

)

…snowflakedb#688)

…edb#695)

…atization (snowflakedb#693) To Handle Reserved Keywords and Special Characters in Schematization, we will convert all names to uppercase and then add double quotes, except for the names that have double quotes already. By doing this, we tell compiler to bypass the reserved keyword or special characters check, and we’re also preserving the old behavior for customers Examples: start -> “START” a@b -> “A@B” c1/C1 -> c1, C1, “C1” would all work “ab” -> “ab”

…nowflakedb#696)

…t/Usage telemetry for Streaming (snowflakedb#694)

…2e tests (snowflakedb#709)

…nowflakedb#713) Co-authored-by: snyk-bot <[email protected]>

…kedb#718)

…db#728)

…t the behavior: (snowflakedb#732)

snowflakedb#736)

…r Schematization (snowflakedb#730) Before this change, every element in the array will be added as a STRING, this change preserves the old data type in the source, for example when the input is [1, 2], the ingested value will be [1, 2] now instead of ["1", "2"] Forked from snowflakedb#727, with additional tests

There're 10 AVRO logical types listed in https://avro.apache.org/docs/1.11.0/spec.html#Logical+Types and we're able to support 4 of them, the mapping is as below: date -> DATE time-mills -> TIME(6) timestamp-mills -> TIMESTAMP_NTZ(6) decimal -> VARCHAR (We can't do NUMBER because we could have precision bigger than 36) We can't do for the rest of 6 types because that's not supported by ConnectSchema, see code for more detail. We need to find another way to support other logical types or any sources (like Debezium).

…be reset (snowflakedb#729)

…old channel (V1) - Push to Main (snowflakedb#751)

…d of one client for multiple connectors configurations (snowflakedb#744)

…hner

…hner (snowflakedb#753) Co-authored-by: Stefan Wehner <[email protected]>

git cherry-pick d4d0e87

…owflakedb#758)

java.lang.NoClassDefFoundError: Could not initialize class net.snowflake.client.jdbc.internal.apache.arrow.memory.RootAllocator occurs when the JDBC driver tries to process the result of the offset migration query. This is a known long-standing issue in the JDBC driver so a workaround is introduced with this fix.

…ause non-exactly once delivery (snowflakedb#775) Fix two exactly once delivery behavior issues with Snowpipe Streaming: - When there're gaps between offsets due to records being put into the DLQ or NULL records being skipped, the current logic doesn't work after a channel reopening event (e.g. schema evolution) since it expects continuous offsets in order to guarantee exactly once delivery, causing ingestion to stop. - When a flush is triggered by size or rowcount threshold, it's possible that only partial rows in the buffer are flushed and with a channel reopening event (e.g. schema evolution), currently the leftover rows in the buffer are still being considered which is causing the offset to be advanced and skip some offsets. The fix in this PR makes sure we won't flush any leftover rows in the buffer, they will all be skipped and discarded. The next batch should start from the expected offset after the offset reset as part of the channel reopening event.

)

Conflicts: .github/workflows/IntegrationTest.yml .github/workflows/PerfTest.yml pom.xml pom_confluent.xml src/main/java/com/snowflake/kafka/connector/Utils.java src/main/java/com/snowflake/kafka/connector/internal/InternalUtils.java src/main/java/com/snowflake/kafka/connector/internal/streaming/SchematizationUtils.java src/main/java/com/snowflake/kafka/connector/records/RecordService.java

JonathonO · 2024-02-08T12:54:33Z

@acristu will have 2.2.0 code changes merged shortly. Can you please review this though and the features we've added?

For reference, here's the conflict file list.

We should have column ordering, debezium type handling and auto schematization changes in some of these classes.

Worth nothing Snowflake introduced special char/reserved keyword handling in this release and a new Utils method for quoting column names. Not sure if it affects your column ordering code?

The pom*.xmls also had some version changes (backwards in some cases) and a change in junit dependency. Not sure if that's an issue?

Conflicts:
.github/workflows/IntegrationTest.yml
.github/workflows/PerfTest.yml
pom.xml
pom_confluent.xml
src/main/java/com/snowflake/kafka/connector/Utils.java
src/main/java/com/snowflake/kafka/connector/internal/InternalUtils.java
src/main/java/com/snowflake/kafka/connector/internal/streaming/SchematizationUtils.java
src/main/java/com/snowflake/kafka/connector/records/RecordService.java

Conflicts: .github/workflows/snyk-issue.yml .github/workflows/snyk-pr.yml src/main/java/com/snowflake/kafka/connector/internal/SnowflakeConnectionService.java src/main/java/com/snowflake/kafka/connector/internal/streaming/SchematizationUtils.java src/main/java/com/snowflake/kafka/connector/records/RecordService.java

acristu · 2024-02-08T14:17:41Z

src/main/java/com/snowflake/kafka/connector/records/RecordService.java

@@ -413,8 +443,9 @@ public static JsonNode convertToJson(Schema schema, Object logicalValue) {
                ISO_DATE_TIME_FORMAT.get().format((java.util.Date) value));
          }
          if (schema != null && Time.LOGICAL_NAME.equals(schema.name())) {
-            return JsonNodeFactory.instance.textNode(
-                TIME_FORMAT.get().format((java.util.Date) value));
+            ThreadLocal<SimpleDateFormat> format =


@JonathonO this looks bad to me ... why instantiate a ThreadLocal as a local variable inside a function ? but I don't think it does any harm, just wasteful ... assume this comes from upstream ...

@acristu yes, this comes from upstream.

Perhaps raise an issue ticket to discuss?

JonathonO · 2024-02-12T09:07:09Z

@acristu any issue with merging this now?

sfc-gh-alhuang and others added 30 commits August 18, 2023 12:59

SNOW-189106 Kafka Connector to Support External OAuth (snowflakedb#671)

13cd421

Release v2.0.1 (snowflakedb#692)

1bb7bd2

[SNOW-870373] Enable JMX metrics for Snowpipe Streaming (snowflakedb#674

ef0c907

)

[SNOW-851840] Enable tombstone record ingestion in Snowpipe Streaming (…

e5c0e0c

…snowflakedb#688)

SNOW-903979 Set default to true for one client optimization (snowflak…

3526454

…edb#695)

[Streaming Telemetry 1] Prepares TopicPartitionChannel for telemetry (s…

a87e8b5

…nowflakedb#696)

Update ingest SDK version (snowflakedb#699)

bc7af5e

[Streaming Telemetry 2][SNOW-899866] Enables reportKafkaPartitionStar…

edb3eaf

…t/Usage telemetry for Streaming (snowflakedb#694)

[NO-SNOW] Prefix with channel name with connector name (snowflakedb#703)

3bf9106

Fix tombstone ingestion with schematization (snowflakedb#700)

f0e2db6

GH actions tests upgrade python from 3.6 to 3.9 (snowflakedb#708)

03bcba2

Bump KC version for new 2.1.0 release (snowflakedb#707)

33eb93e

[SNOW-916052] Use python None instead of empty string for tombstone e…

501ced5

…2e tests (snowflakedb#709)

[Snyk] Security upgrade org.apache.avro:avro from 1.11.1 to 1.11.3 (s…

3021c2b

…nowflakedb#713) Co-authored-by: snyk-bot <[email protected]>

[SNOW-934984] Update Avro version for security vulnerability (snowfla…

5201533

…kedb#718)

PRODSEC-3611 fix GHA parsing and pre-commit-config version (snowflake…

055dfca

…db#728)

PROD-39429 Add parameter for connector name in channel name and rever…

feaa491

…t the behavior: (snowflakedb#732)

SNOW-947864 Release version 2.1.1 (snowflakedb#733)

3f99a07

Revert "PROD-39429 Add parameter for connector name in channel name a… (

706a93d

snowflakedb#736)

NO-SNOW Make schema evolution add columns idempotent (snowflakedb#734)

d805bf3

[SNOW-943288] Do not skip records when we're expecting the offset to …

c9a3b2c

…be reset (snowflakedb#729)

SNOW-913746 upgrade jdbc 3.14.3 (snowflakedb#745)

afcb116

PROD-39429 Implement migrate sys func from new channel(Format V2) to …

6edd211

…old channel (V1) - Push to Main (snowflakedb#751)

[SNOW-954150] Use map of clients with different configurations instea…

869a90b

…d of one client for multiple connectors configurations (snowflakedb#744)

[external contributor] Support config providers in validation by @swe…

388b1fe

…hner (snowflakedb#753) Co-authored-by: Stefan Wehner <[email protected]>

v2.1.2 release change in master (snowflakedb#757)

9d08d2b

git cherry-pick d4d0e87

NO-SNOW: Expose the Ingest SDK MAX_CLIENT_LAG configuration in KC (sn…

7804e37

…owflakedb#758)

sfc-gh-rcheng and others added 9 commits December 7, 2023 15:29

Add authentication for max lag parameter and fix IT (snowflakedb#763)

2e71470

[SNOW-1015644] Update ingest sdk to 2.0.5 (snowflakedb#778)

b58af05

resolves issue with generating and building java libary (snowflakedb#780

1282b12

)

[SNOW-88848] Update readme with Snowflake documentation (snowflakedb#781

81df1bd

)

Update version to 2.2.0 for release (snowflakedb#779)

6f1e339

Downgrade JDBC to 3.13.30 (snowflakedb#783)

85b692c

JonathonO self-assigned this Feb 8, 2024

acristu reviewed Feb 8, 2024

View reviewed changes

acristu added 3 commits February 8, 2024 16:39

fixes for junit

2ba225f

fixes for junit

a0c4de8

bump multi-conveter to 0.0.8

8f25a09

JonathonO mentioned this pull request Feb 8, 2024

Snoflake 2.1.2 upgrade #17

Closed

fix pom_confluent

314c5e6

JonathonO merged commit 92b4b8f into streamkap-main Feb 19, 2024
0 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upgrade to 2.2.0 #18

Upgrade to 2.2.0 #18

JonathonO commented Feb 8, 2024 •

edited

Loading

JonathonO commented Feb 8, 2024

acristu Feb 8, 2024 •

edited

Loading

JonathonO Feb 8, 2024

JonathonO Feb 8, 2024

JonathonO commented Feb 12, 2024

Upgrade to 2.2.0 #18

Upgrade to 2.2.0 #18

Conversation

JonathonO commented Feb 8, 2024 • edited Loading

JonathonO commented Feb 8, 2024

acristu Feb 8, 2024 • edited Loading

Choose a reason for hiding this comment

JonathonO Feb 8, 2024

Choose a reason for hiding this comment

JonathonO Feb 8, 2024

Choose a reason for hiding this comment

JonathonO commented Feb 12, 2024

JonathonO commented Feb 8, 2024 •

edited

Loading

acristu Feb 8, 2024 •

edited

Loading