Introduce Default Replication Worker Performance Test Harness #20956
workerThread.start();
workerThread.join();
Is this test running in the same process, just on different threads? If so, it also bypasses the stdio streams, which present their own bottlenecks (cc @colesnodgrass)
Agree. Let me see if I can quickly work something up.
It does show that the core serialising isn't the bottleneck like we thought.
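To make the stdio concern above concrete, here is a minimal sketch of what an in-process test skips. It is not code from this PR: the class and method names are hypothetical, and `PipedOutputStream`/`PipedInputStream` only stand in for the stdout/stdin of a separate connector process. Each record is written as a line of text on one thread and read back on another, which is roughly the extra hop a purely thread-based handoff avoids.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.PipedInputStream;
import java.io.PipedOutputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of the harness in this PR.
final class PipedTransferSketch {

  /**
   * Sends each record through a byte pipe as one line of UTF-8 text and
   * returns the lines read on the other side. Assumes records contain no
   * line breaks.
   */
  static List<String> transferThroughPipe(List<String> records) {
    try {
      PipedOutputStream out = new PipedOutputStream();
      PipedInputStream in = new PipedInputStream(out, 64 * 1024);

      // Writer thread plays the role of a source writing to stdout.
      Thread writer = new Thread(() -> {
        try (BufferedWriter w =
            new BufferedWriter(new OutputStreamWriter(out, StandardCharsets.UTF_8))) {
          for (String record : records) {
            w.write(record);
            w.newLine();
          }
        } catch (IOException e) {
          throw new UncheckedIOException(e);
        }
      });
      writer.start();

      // Calling thread plays the role of the worker reading the stream.
      List<String> received = new ArrayList<>();
      try (BufferedReader r =
          new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))) {
        String line;
        while ((line = r.readLine()) != null) {
          received.add(line);
        }
      }
      writer.join();
      return received;
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException("Interrupted while waiting for the writer", e);
    }
  }

  public static void main(String[] args) {
    System.out.println(transferThroughPipe(List.of("{\"id\":1}", "{\"id\":2}")));
  }
}
```

Timing a transfer like this next to a direct in-memory handoff would give a rough idea of how much the stream layer costs, though a real process boundary may behave differently.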
airbyte-commons-worker/src/test/java/io/airbyte/workers/general/StubAirbyteMapper.java
airbyte-commons-worker/src/test/java/io/airbyte/workers/general/LimitedAirbyteSource.java
…tructured-concurrency
@Benchmark
// SampleTime = the time taken to run the benchmarked method. Use this because we only care about
// the time taken to sync the entire dataset.
@BenchmarkMode(Mode.SampleTime)
@colesnodgrass unfortunately, because these annotations don't declare the annotation-type target, we cannot fold them into a meta-annotation. I think this is fine for now.
fyi @cgardens
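For anyone curious about the meta-annotation limitation mentioned above, a small reflection sketch can show whether the Java compiler would even allow an annotation to be placed on another annotation declaration. The class name is hypothetical and the examples use only JDK annotations; it says nothing about whether a given annotation processor would honor such a meta-annotation, which is a separate requirement.

```java
import java.lang.annotation.Annotation;
import java.lang.annotation.Documented;
import java.lang.annotation.ElementType;
import java.lang.annotation.Target;
import java.util.Arrays;

// Hypothetical helper for illustration; not part of this PR.
final class MetaAnnotationCheck {

  /**
   * Returns true if the annotation may legally be written on an annotation
   * declaration, i.e. it has no @Target at all, or its @Target lists
   * ANNOTATION_TYPE or TYPE (TYPE also covers annotation declarations).
   */
  static boolean canAnnotateAnnotationTypes(Class<? extends Annotation> annotation) {
    Target target = annotation.getAnnotation(Target.class);
    if (target == null) {
      return true; // no @Target restriction declared
    }
    return Arrays.stream(target.value())
        .anyMatch(t -> t == ElementType.ANNOTATION_TYPE || t == ElementType.TYPE);
  }

  public static void main(String[] args) {
    // @Override is restricted to methods; @Documented targets annotation types.
    System.out.println("Override:   " + canAnnotateAnnotationTypes(Override.class));
    System.out.println("Documented: " + canAnnotateAnnotationTypes(Documented.class));
  }
}
```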
Sample logging output:
PTAL @colesnodgrass @jdpgrailsdev @evantahler @pmossman Cole/Jonathan, one approval from either of you is enough. Mainly tagging for awareness. Parker, same thing here: I want to make sure both platform teams are aware this is available. Evan, you are already on the PR; I tagged you since you are also interested in this work!
What
Introduce a performance test harness for the default replication worker so that devs can easily test the effect of changes on platform throughput.
The current setup is designed to be run manually. In the future, we can look into integrating this report into our build pipelines. For now, this is good enough, as I wanted to start somewhere.
How
The general idea is to use JMH to run the test n times (currently 4). The dev can then look at the logs to see the throughput and how it varies between runs.
As of this PR, we see general platform throughput of ~20-25 MB/s.
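As a rough illustration of how a MB/s figure like the one above can be derived from repeated runs, here is a plain-Java sketch. It does not use JMH and is not the harness from this PR: the class, the method names, and the stand-in workload are hypothetical, and it assumes 1 MB = 1024 * 1024 bytes, which may differ from what the harness logs.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.LongSupplier;

// Hypothetical illustration only; the real harness relies on JMH for timing.
final class ThroughputSketch {

  /** Converts bytes moved and elapsed nanoseconds into MB/s (assumes 1 MB = 1024 * 1024 bytes). */
  static double megabytesPerSecond(long bytes, long elapsedNanos) {
    if (elapsedNanos <= 0) {
      throw new IllegalArgumentException("elapsedNanos must be positive");
    }
    double megabytes = bytes / (1024.0 * 1024.0);
    double seconds = elapsedNanos / 1_000_000_000.0;
    return megabytes / seconds;
  }

  /** Runs the workload the given number of times and records the throughput of each run. */
  static List<Double> measure(int iterations, LongSupplier syncReturningBytes) {
    List<Double> results = new ArrayList<>();
    for (int i = 0; i < iterations; i++) {
      long start = System.nanoTime();
      long bytes = syncReturningBytes.getAsLong();
      // Guard against a zero reading on very fast workloads.
      long elapsed = Math.max(1, System.nanoTime() - start);
      results.add(megabytesPerSecond(bytes, elapsed));
    }
    return results;
  }

  public static void main(String[] args) {
    // Stand-in for a sync: fill an 8 MB buffer and report how many bytes were "moved".
    LongSupplier fakeSync = () -> {
      byte[] buffer = new byte[8 * 1024 * 1024];
      Arrays.fill(buffer, (byte) 1);
      return buffer.length;
    };
    for (double mbPerSec : measure(4, fakeSync)) {
      System.out.printf("throughput: %.1f MB/s%n", mbPerSec);
    }
  }
}
```

Printing every run rather than only an average keeps the run-to-run variation visible, which is the part the description asks devs to look at.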
Recommended reading order
ReplicationWorkerPerformanceTest.java
🚨 User Impact 🚨
Are there any breaking changes? What is the end result perceived by the user? If yes, please merge this PR with the 🚨🚨 emoji so changelog authors can further highlight this if needed.