-
Notifications
You must be signed in to change notification settings - Fork 998
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Batch Ingestion Job rewritten on Spark #1020
Conversation
ingestion-spark/src/main/scala/feast/ingestion/FeatureTable.scala
Outdated
Show resolved
Hide resolved
implicit val modesRead: scopt.Read[Modes.Value] = scopt.Read.reads(Modes withName _.capitalize) | ||
|
||
val parser = new scopt.OptionParser[IngestionJobConfig]("IngestionJon") { | ||
head("feast.ingestion.IngestionJob", "0.8") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can the version come from build-info?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not solved yet. Added ToDo
ingestion-spark/src/main/scala/feast/ingestion/OfflinePipeline.scala
Outdated
Show resolved
Hide resolved
spark/ingestion/src/test/scala/feast/ingestion/BatchPipelineIT.scala
Outdated
Show resolved
Hide resolved
58adb85
to
f92c4f0
Compare
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
Signed-off-by: Oleksii Moskalenko <[email protected]>
2a6ec88
to
f8772d1
Compare
Signed-off-by: Oleksii Moskalenko <[email protected]>
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: pyalex, woop The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
As part of simplification of ingestion flow this PR proposes to create separate job per each FeatureSet per Batch / Streaming mode.
Spark implementation of Batch Ingestion has next flow
What's not implemented (in comparison to Beam job):
New spark jobs utilize spark's standard metrics reporting. However, I had to fork StatsD Reporter to support metrics with tags (which is extensions of standard protocol (see https://github.com/prometheus/statsd_exporter#tagging-extensions )).
What this PR does / why we need it:
Which issue(s) this PR fixes:
Fixes https://github.com/feast-dev/feast/projects/9#card-45746519
Does this PR introduce a user-facing change?: