
Field '__messageProperties' not found; typically this occurs with arrays which are not mapped as single value. #292

Open
akshayjain3450 opened this issue Jun 15, 2023 · 5 comments
Labels
bug Something isn't working

Comments

akshayjain3450 commented Jun 15, 2023

What is the bug?

Field '__messageProperties' not found; typically this occurs with arrays which are not mapped as single value. However, this field actually exists in the data being read.

How can one reproduce the bug?

This is my sample Scala program using Spark:

def read(): Unit = {
  val df = spark.read.format("opensearch")
    .option("opensearch.nodes", "host")
    .option("opensearch.port", "9200")
    .option("opensearch.nodes.wan.only", "true")
    .option("opensearch.resource", "index")
    .option("opensearch.net.http.auth.user", "admin")
    .option("opensearch.net.http.auth.pass", "admin")
    .load()
  df.printSchema()
  df.show(10)
}

What is your host/environment?

Spark: 3.3.1, Opensearch-Hadoop 1.1.0

Do you have any additional context?

The Spark Schema:

root
|-- __eventTime: long (nullable = true)
|-- __messageId: string (nullable = true)
|-- __messageProperties: struct (nullable = true)
|-- __metadata: struct (nullable = true)
| |-- _dataos_run_mapper_id: string (nullable = true)
| |-- id: string (nullable = true)
|-- __publishTime: long (nullable = true)
|-- __topic: string (nullable = true)
|-- age: string (nullable = true)
|-- city: string (nullable = true)
|-- country: string (nullable = true)
|-- email: string (nullable = true)
|-- first_name: string (nullable = true)
|-- gender: string (nullable = true)
|-- id: string (nullable = true)
|-- last_name: string (nullable = true)
|-- phone: string (nullable = true)
|-- postcode: string (nullable = true)
|-- state: string (nullable = true)
|-- title: string (nullable = true)

Note that the schema has two struct fields: __metadata is mapped properly, while __messageProperties triggers this error. I am looking for a solution to this. Do let me know if you need any more details.

@akshayjain3450 akshayjain3450 added bug Something isn't working untriaged labels Jun 15, 2023
@akshayjain3450 akshayjain3450 changed the title [BUG]: Field '__messageProperties' not found; typically this occurs with arrays which are not mapped as single value Field '__messageProperties' not found; typically this occurs with arrays which are not mapped as single value. Jun 15, 2023
@wbeckler

Would you be up for trying to add a breaking test to the client?

@akshayjain3450

Sure, I would love to contribute to that. I would need some direction and guidance on where to start.

@akshayjain3450

Hi, @wbeckler any update on this?

@harshavamsi

Hi @akshayjain3450, you would need to set the option opensearch.read.field.as.array.include. This tells OpenSearch Hadoop how to map array-like data. Can you set .option("opensearch.read.field.as.array.include", "__messageProperties") and see what happens?
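For reference, the suggestion above applied to the reader from the original report would look like this (a sketch; the host, port, index, and credential values are the placeholders from the reporter's snippet):

```scala
// Same reader as in the report, with the array-include hint added.
// The extra option tells OpenSearch Hadoop to parse __messageProperties
// as an array rather than a single value during scroll reading.
val df = spark.read.format("opensearch")
  .option("opensearch.nodes", "host")
  .option("opensearch.port", "9200")
  .option("opensearch.nodes.wan.only", "true")
  .option("opensearch.resource", "index")
  .option("opensearch.net.http.auth.user", "admin")
  .option("opensearch.net.http.auth.pass", "admin")
  .option("opensearch.read.field.as.array.include", "__messageProperties")
  .load()
```

The option takes a comma-separated list, so several fields can be marked as arrays at once if needed.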

@akshayjain3450

Hi @harshavamsi, I did try this option and still face the same issue.

Complete Stack Trace:
Exception in thread "main" org.apache.spark.SparkException: Job aborted due to stage failure: Task 2 in stage 1.0 failed 1 times, most recent failure: Lost task 2.0 in stage 1.0 (TID 3) (192.168.68.68 executor driver): org.opensearch.hadoop.rest.OpenSearchHadoopParsingException: org.opensearch.hadoop.OpenSearchHadoopIllegalStateException: Field '__messageProperties' not found; typically this occurs with arrays which are not mapped as single value
at org.opensearch.hadoop.serialization.ScrollReader.readHit(ScrollReader.java:528)
at org.opensearch.hadoop.serialization.ScrollReader.read(ScrollReader.java:306)
at org.opensearch.hadoop.serialization.ScrollReader.read(ScrollReader.java:270)
at org.opensearch.hadoop.rest.RestRepository.scroll(RestRepository.java:326)
at org.opensearch.hadoop.rest.ScrollQuery.hasNext(ScrollQuery.java:104)
at org.opensearch.spark.rdd.AbstractOpenSearchRDDIterator.hasNext(AbstractOpenSearchRDDIterator.scala:75)
at scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:460)
at org.apache.spark.sql.catalyst.expressions.GeneratedClass$GeneratedIteratorForCodegenStage1.processNext(Unknown Source)
at org.apache.spark.sql.execution.BufferedRowIterator.hasNext(BufferedRowIterator.java:43)
at org.apache.spark.sql.execution.WholeStageCodegenExec$$anon$1.hasNext(WholeStageCodegenExec.scala:760)
at org.apache.spark.sql.execution.SparkPlan.$anonfun$getByteArrayRdd$1(SparkPlan.scala:364)
at org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2(RDD.scala:890)
at org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2$adapted(RDD.scala:890)
at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:365)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:329)
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:90)
at org.apache.spark.scheduler.Task.run(Task.scala:136)
at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$3(Executor.scala:548)
at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1504)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:551)
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
at java.base/java.lang.Thread.run(Thread.java:829)
Caused by: org.opensearch.hadoop.OpenSearchHadoopIllegalStateException: Field '__messageProperties' not found; typically this occurs with arrays which are not mapped as single value
at org.opensearch.spark.sql.RowValueReader.rowColumns(RowValueReader.scala:60)
at org.opensearch.spark.sql.RowValueReader.rowColumns$(RowValueReader.scala:57)
at org.opensearch.spark.sql.ScalaRowValueReader.rowColumns(ScalaOpenSearchRowValueReader.scala:41)
at org.opensearch.spark.sql.ScalaRowValueReader.createMap(ScalaOpenSearchRowValueReader.scala:78)
at org.opensearch.hadoop.serialization.ScrollReader.map(ScrollReader.java:1030)
at org.opensearch.hadoop.serialization.ScrollReader.read(ScrollReader.java:901)
at org.opensearch.hadoop.serialization.ScrollReader.map(ScrollReader.java:1066)
at org.opensearch.hadoop.serialization.ScrollReader.read(ScrollReader.java:903)
at org.opensearch.hadoop.serialization.ScrollReader.readHitAsMap(ScrollReader.java:616)
at org.opensearch.hadoop.serialization.ScrollReader.readHit(ScrollReader.java:440)
... 23 more
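If mapping the field correctly keeps failing, one possible workaround is to exclude the problematic field at read time. This is a sketch under the assumption that opensearch-hadoop carries over elasticsearch-hadoop's es.read.field.exclude behaviour under the opensearch.read.field.exclude name; verify the option name against the connector's configuration reference before relying on it:

```scala
// Hypothetical workaround: skip the problematic field entirely.
// opensearch.read.field.exclude is assumed here to mirror
// elasticsearch-hadoop's es.read.field.exclude option.
val df = spark.read.format("opensearch")
  .option("opensearch.nodes", "host")
  .option("opensearch.port", "9200")
  .option("opensearch.nodes.wan.only", "true")
  .option("opensearch.resource", "index")
  .option("opensearch.net.http.auth.user", "admin")
  .option("opensearch.net.http.auth.pass", "admin")
  .option("opensearch.read.field.exclude", "__messageProperties")
  .load()
```

This only sidesteps the error; the field's contents would no longer appear in the DataFrame.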
