Adds new blog post announcing opensearch hadoop #1650

harshavamsi · 2023-06-02T22:18:54Z

Description

Adds a new blog post announcing the availability of the hadoop client

Issues Resolved

[List any issues this PR will resolve]

Check List

Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the BSD-3-Clause License.

Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

vagimeli

See editorial review comments and changes. Please reach out with any questions.

_posts/2023-06-05-opensearch-hadoop-launch.markdown

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

nknize

This is a good start. I like having the compatibility matrices. Might be good, though, to also add a simple "Getting Started Examples"?

Maybe an example on how to write to a dataframe in scala?

e.g.,

val spark = SparkSession.builder().master("local[*]")
    .config("opensearch.nodes", "127.0.0.1").config("opensearch.net.http.auth.user", "admin").config("opensearch.net.http.auth.pass", "admin").config("opensearch.net.ssl", "true")
    .config("opensearch.batch.size.bytes", "1kb").config("opensearch.net.ssl.cert.allow.self.signed", "true")
    .getOrCreate()

or how to use it with pyspark like I demonstrate in my comment on #153.

I'm happy to add if you'd like.

_posts/2023-06-05-opensearch-hadoop-launch.markdown

hdhalter · 2023-06-06T19:46:47Z

_posts/2023-06-05-opensearch-hadoop-launch.markdown

+
+We are excited to announce the release of the new OpenSearch-Hadoop connector. This tool enables efficient interaction between your Hadoop-based Big Data operations and OpenSearch clusters, supporting all versions of OpenSearch.
+
+## OpenSearch Hadoop connector features:


Can we add an opening paragraph here? For example, "The OpenSearch-Hadoop connector includes the following features:" (and remove the colon from the heading)

harshavamsi · 2023-06-06T20:14:58Z

This would be awesome to have!

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Co-authored-by: Heather Halter <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

_posts/2023-06-05-opensearch-hadoop-launch.markdown

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

wbeckler · 2023-06-07T20:39:44Z

This is a good start. I like having the compatibility matrices. Might be good, though, to also add a simple "Getting Started Examples"?

Maybe an example on how to write to a dataframe in scala?

e.g.,
val spark = SparkSession.builder().master("local[*]")
    .config("opensearch.nodes", "127.0.0.1").config("opensearch.net.http.auth.user", "admin").config("opensearch.net.http.auth.pass", "admin").config("opensearch.net.ssl", "true")
    .config("opensearch.batch.size.bytes", "1kb").config("opensearch.net.ssl.cert.allow.self.signed", "true")
    .getOrCreate()
or how to use it with pyspark like I demonstrate in my comment on #153.

I'm happy to add if you'd like.

Go for it!

hdhalter · 2023-06-14T20:05:50Z

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

nknize · 2023-06-15T01:41:26Z

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

Yes. I'll put the example in tomorrow.

wbeckler · 2023-06-26T15:51:15Z

Is anyone making any updates to this (@nknize )? We are targeting next week to publish it. Thanks!

Yes. I'll put the example in tomorrow.

Hi Nick, this is still awaiting your input. Thank you!!

pajuric · 2023-06-27T18:17:24Z

_posts/2023-06-05-opensearch-hadoop-launch.markdown

+categories:
+  - releases
+meta_keywords: opensearch hadoop, apache spark, apache hive, apache hadoop, openseearch, mapreduce, hdfs
+meta_description: OpenSearch Hadoop is now generally available with support for multiple versions of OpenSearch to run on Spark and Hive.


Please update the meta with the following:

Meta_keywords: OpenSearch Hadoop, Apache Hadoop, OpenSearch Hadoop client
Meta_description: The OpenSearch Hadoop connector is now generally available with support for multiple versions of OpenSearch running on Spark and Hive.

pajuric · 2023-06-29T02:39:27Z

@nknize @mnkugler @wbeckler - If you can make the final edits, update he blog date, and let @krisfreedain know when it's ready to go, we can get this posted to the blog tomorrow. Otherwise, we'll need to hold this until next Wednesday.

nknize · 2023-06-29T02:51:44Z

Otherwise, we'll need to hold this until next Wednesday.

Let's hold to Wednesday. I was working up the example with the published artifacts and noticed they don't support Spark 3. We may want to republish the Spark 3 artifacts before publishing the blog.

pajuric · 2023-07-07T15:27:14Z

@mnkugler and @wbeckler - Are we good to publish this today?

wbeckler · 2023-07-07T15:32:48Z

Still waiting on @nknize's changes.

nknize · 2023-07-07T18:45:04Z

@pajuric The blocker right now is that the released OpenSearched-Hadoop artifacts are not compatible with Spark 3. Thus the compatibility matrix in this blog post is not correct and the example code I'm providing will not work for the users / readers running Spark 3:

e.g.,

[error] Modules were resolved with conflicting cross-version suffixes in ProjectRef(uri("file:/...
[error]    org.apache.spark:spark-core _2.13, _2.11

From example build.sbt

ThisBuild / scalaVersion := "2.13.0"

lazy val root = (project in file("."))
  .settings(
    name := "opensearch-spark-example"
  )

libraryDependencies ++= Seq(
  "org.apache.spark" %% "spark-core" % "3.2.4" exclude("javax", "servlet") exclude("org.apache", "hadoop"),
  "org.opensearch.client" % "opensearch-hadoop" % "1.0.1",
  "org.antlr" % "antlr4-runtime" % "4.8",
  "org.codehaus.janino" % "commons-compiler" % "3.0.8",
  "org.codehaus.janino" % "janino" % "3.0.8"
)

We need to publish the Spark 3 compatible version which is built and packaged with the artifacts from the spark/sql-30 module

nknize · 2023-07-07T18:56:24Z

I opened an issue to move this forward: opensearch-project/opensearch-hadoop#304

pajuric · 2023-08-21T16:05:24Z

@vagimeli @nknize - Just checking the status on this blog to see if there are any updates?

vagimeli · 2023-08-21T16:59:25Z

@vagimeli @nknize - Just checking the status on this blog to see if there are any updates?

@pajuric I've not heard from the authors in a while. I'm adding them to this comment, as they need to provide the update.

@nknize @harshavamsi Please update on the status of this blog. Is the text final and ready for an editorial review?

pajuric · 2023-11-02T15:57:33Z

@wbeckler @Xtansia - Please provide an update on the blog, as I understand it has been transferred over to you both.

pajuric · 2024-07-25T15:46:11Z

@wbeckler - Are we OK to close this blog?

Adds new blog post announcing opensearch hadoop

66f5e9f

Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

harshavamsi requested review from elfisher, AMoo-Miki, nknize, krisfreedain, peterzhuamazon, CEHENKLE, dtaivpp, kolchfa-aws and nateynateynate as code owners June 2, 2023 22:18

Fix vale grammar errors

b28eb91

Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

harshavamsi force-pushed the opensearch_hadoop_blog branch from 4a471a9 to b28eb91 Compare June 6, 2023 16:01

vagimeli reviewed Jun 6, 2023

View reviewed changes

harshavamsi and others added 8 commits June 6, 2023 10:55

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

bb8a3f7

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

8d90bbc

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

615e2d3

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

b738d94

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

b9d86ea

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

14d65f7

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

ecbce98

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

71dd439

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

nknize requested changes Jun 6, 2023

View reviewed changes

hdhalter reviewed Jun 6, 2023

View reviewed changes

_posts/2023-06-05-opensearch-hadoop-launch.markdown Outdated Show resolved Hide resolved

vagimeli reviewed Jun 6, 2023

View reviewed changes

_posts/2023-06-05-opensearch-hadoop-launch.markdown Outdated Show resolved Hide resolved

hdhalter reviewed Jun 6, 2023

View reviewed changes

harshavamsi and others added 2 commits June 6, 2023 13:17

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

15cf7b5

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

8d2f092

Co-authored-by: Heather Halter <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

vagimeli reviewed Jun 6, 2023

View reviewed changes

_posts/2023-06-05-opensearch-hadoop-launch.markdown Outdated Show resolved Hide resolved

vagimeli reviewed Jun 6, 2023

View reviewed changes

_posts/2023-06-05-opensearch-hadoop-launch.markdown Outdated Show resolved Hide resolved

vagimeli reviewed Jun 6, 2023

View reviewed changes

_posts/2023-06-05-opensearch-hadoop-launch.markdown Outdated Show resolved Hide resolved

vagimeli reviewed Jun 6, 2023

View reviewed changes

_posts/2023-06-05-opensearch-hadoop-launch.markdown Outdated Show resolved Hide resolved

vagimeli reviewed Jun 6, 2023

View reviewed changes

_posts/2023-06-05-opensearch-hadoop-launch.markdown Show resolved Hide resolved

harshavamsi and others added 4 commits June 6, 2023 15:30

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

8b1b822

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

6c2f17d

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Update _posts/2023-06-05-opensearch-hadoop-launch.markdown

1d24543

Co-authored-by: Melissa Vagi <[email protected]> Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

Add download links

34d6892

Signed-off-by: Harsha Vamsi Kalluri <[email protected]>

pajuric reviewed Jun 27, 2023

View reviewed changes

krisfreedain added awaiting-response blog-under-review labels Jul 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds new blog post announcing opensearch hadoop #1650

Adds new blog post announcing opensearch hadoop #1650

harshavamsi commented Jun 2, 2023

vagimeli left a comment

nknize left a comment

hdhalter Jun 6, 2023

harshavamsi commented Jun 6, 2023

wbeckler commented Jun 7, 2023

hdhalter commented Jun 14, 2023

nknize commented Jun 15, 2023

wbeckler commented Jun 26, 2023

pajuric Jun 27, 2023

pajuric commented Jun 29, 2023

nknize commented Jun 29, 2023

pajuric commented Jul 7, 2023

wbeckler commented Jul 7, 2023

nknize commented Jul 7, 2023 •

edited

Loading

nknize commented Jul 7, 2023

pajuric commented Aug 21, 2023

vagimeli commented Aug 21, 2023

pajuric commented Nov 2, 2023

pajuric commented Jul 25, 2024


		We are excited to announce the release of the new OpenSearch-Hadoop connector. This tool enables efficient interaction between your Hadoop-based Big Data operations and OpenSearch clusters, supporting all versions of OpenSearch.

		## OpenSearch Hadoop connector features:

Adds new blog post announcing opensearch hadoop #1650

Are you sure you want to change the base?

Adds new blog post announcing opensearch hadoop #1650

Conversation

harshavamsi commented Jun 2, 2023

Description

Issues Resolved

Check List

vagimeli left a comment

Choose a reason for hiding this comment

nknize left a comment

Choose a reason for hiding this comment

hdhalter Jun 6, 2023

Choose a reason for hiding this comment

harshavamsi commented Jun 6, 2023

wbeckler commented Jun 7, 2023

hdhalter commented Jun 14, 2023

nknize commented Jun 15, 2023

wbeckler commented Jun 26, 2023

pajuric Jun 27, 2023

Choose a reason for hiding this comment

pajuric commented Jun 29, 2023

nknize commented Jun 29, 2023

pajuric commented Jul 7, 2023

wbeckler commented Jul 7, 2023

nknize commented Jul 7, 2023 • edited Loading

nknize commented Jul 7, 2023

pajuric commented Aug 21, 2023

vagimeli commented Aug 21, 2023

pajuric commented Nov 2, 2023

pajuric commented Jul 25, 2024

nknize commented Jul 7, 2023 •

edited

Loading