Integration testing on master #568

foxish · 2017-11-30T18:00:14Z

Elaborating on #545 (comment)
We need to separate our integration testing out - so we can make it run any arbitrary branch - including master.

cc @apache-spark-on-k8s/contributors @ifilonenko @erikerlandson @ssuchter @kimoonkim

erikerlandson · 2017-11-30T18:02:46Z

are you thinking of factoring the IT into a separate repo?

foxish · 2017-11-30T18:04:04Z

I think that might be one way. If we just have the integration tests be built separately, as a project by itself, and then invoke it against master. It might not be the most expedient - open to other suggestions as well.

erikerlandson · 2017-11-30T18:05:19Z

One advantage of that would be that it would be convenient to make separate branches, having different IT subsets targeted to various upstreaming stages of functionality

erikerlandson · 2017-11-30T18:07:16Z

Would we then have to re-integrate them into the upstream so it has the correct IT?

foxish · 2017-11-30T18:08:41Z

It might be desirable for upstream also to have the same decoupled integration testing, similar to YARN/Mesos.

erikerlandson · 2017-11-30T18:09:45Z

+1, that makes it a two-fer

erikerlandson · 2017-11-30T18:13:50Z

maybe even a 3-fer, it keeps more code out of the main upstream PRs

erikerlandson · 2017-11-30T18:17:13Z

cc @felixcheung

tnachen · 2017-11-30T18:19:32Z

Having it outside of the main repo is easier and faster to iterate, which is why we did it also for Mesos..

erikerlandson · 2017-11-30T18:21:49Z

I added a tracking issue for jenkins-infra: https://github.com/ucbrise/jenkins-infra/issues/86

felixcheung · 2017-12-01T06:29:23Z

when you say decoupled testing, same as YARN/mesos, do you mean the test and source will be completely outside of the Apache Spark git repo?

(because as far as I can see, there is no open source YARN integration tests for Spark, for example)

ssuchter · 2017-12-01T07:24:58Z

But I’d think that not having OSS YARN integration tests is a bad thing. I think having some would have been good. In the current state, it’s hard for contributors that aren’t at a company that has their own private test suite to be confident of their changes, especially significant ones. I think we can have some for K8s, it’s a good thing. As mentioned in the thread, it doesn’t have to be in the same repo. The disadvantage of this, though, is that they are harder to keep in sync - if we change Spark in some way that enables/requires an integration test change, it becomes harder for every user of the integration tests. Personally, I think the advantages of a separate repo outweigh the disadvantage I listed above. I think we should go for a separate repo. Sean On November 30, 2017 at 10:29:25 PM, Felix Cheung ([email protected]) wrote: when you say decoupled testing, same as YARN/mesos, do you mean the test and source will be completely outside of the Apache Spark git repo? (because as far as I can see, there is no open source YARN integration tests for Spark, for example) — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#568 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AB2E24mGsg57hAvHIJkJR80AQlhGgm8Lks5s75zEgaJpZM4Qw9mJ> .

ifilonenko · 2017-12-01T17:08:50Z

I agree with the separate repo. However, this will require a group of separate group to maintain and monitor this repo as well.

felixcheung · 2017-12-01T17:22:15Z

If it's about fast iteration, I'd say go for it.
At some point though, I'd agree with @ssuchter OSS (or even, under ASF) and same branch/keep in-sync is very important.

foxish · 2017-12-01T18:05:02Z

I agree - we shouldn't look at it as a separate thing long term. It's expedient for now to separate it out - to get some confidence in upstream by running tests against it. Eventually, it should live alongside the rest of the code upstream, but I do think it needs some hardening before that, and we can't hit that for 2.3

foxish · 2017-12-01T18:06:59Z

Created https://github.com/apache-spark-on-k8s/spark-integration.

mccheah · 2017-12-01T22:07:46Z

We probably want to use the testing version provided by #521. There's one more change required to make this pass on Amplab Jenkins. I'll patch this branch and bring it up to speed with our mainline, then we should think about how we're going to move the code over.

Automation is something to consider as well. We probably want to run these tests in the separate repository on every commit to master that affects Kubernetes. We could run the suite on every commit to master in general as well.

foxish · 2017-12-06T00:49:15Z

I think we can break the integration testing out further after #521 is merged.

The immediate next steps I see are:

Separate integration testing code into https://github.com/apache-spark-on-k8s/spark-integration.
Enable generating a JAR from it, which can take a parameter for a runnable spark distribution to test.
Have Jenkins build a spark runnable distribution, and then invoke our integration test jar to test it.

For now, it would still depend on minikube. In the future, we can do even better.

Separate out the image building to use a registry. In minikube, we'll build and push to the minikube local registry which can be enabled as an addon.
Enable the integration-testing JAR to take additional parameters for K8s cluster, docker registry, and runnable spark distro.

This would enable running on other k8s clusters, enabling for example, stress testing on GKE if we want to do that, and also let other users test the release on their own clusters using our integration tests.

cc @ifilonenko @mccheah

mccheah · 2017-12-06T01:22:32Z

Are we still planning to run integration tests in the Jenkins PR builder on master? Or was the plan to run the tests in our own automated system?

foxish · 2017-12-06T15:05:53Z

Not sure if we'll get that done in time.
With this split, even if we don't get the integration tests checked in for Spark 2.3 or the spark prb changes, we'd still have the option to run it ourselves.

mccheah · 2017-12-06T18:35:25Z

@foxish I was thinking about this a little more and am concerned that this setup isn't too intuitive. (edit: the proposed setup being #568 (comment))

What these semantics are basically saying is that the main repository depends on the tests. This communicates that the main repository is checking the correctness of the tests, not the other way around.

To illustrate this point, consider the situation where a developer writes a new integration test for a feature they just merged into master. Consider the workflow:

Developer writes unit tests and main feature in apache/master. They don't update the integration test version because integration tests aren't published yet. The change is merged.
Developer writes and publishes integration tests that are broken for the feature.
Developer updates main repository's integration test dependency to try the new tests.
Developer finds the tests are wrong, so they have to publish a new integration test version.

This sequence of events makes it clear that the main repository is determining whether or not the integration tests are correct. But the more intuitive mental model is that the integration tests are validating the correctness of a given Spark artifact.

Consider also the situation when the main repository is changed such that the assumptions made by the given integration test version are invalid. In this workflow, the developer is blocked from merging into master before the new integration test version is published. But in between the time that the new integration tests are published and the main change merges, that new integration test version is only applicable for a version of the main repository that doesn't exist yet.

So I think we want the integration tests to pull down and test the repository under test, not the other way around.

ssuchter · 2017-12-06T18:40:42Z

So I think we want the integration tests to pull down and test the repository under test, not the other way around. I definitely agree. I think this direction of dependency (repo under test is unaware of what will test it, repo containing tests is aware of what it will be testing) allows you to more easily make advanced changes to the main repo and integration tests in synchronization. Sean On December 6, 2017 at 10:35:27 AM, mccheah ([email protected]) wrote: I was thinking about this a little more and am concerned that this setup isn't too intuitive. What these semantics are basically saying is that the main repository depends on the tests. This communicates that the main repository is checking the correctness of the tests, not the other way around. To illustrate this point, consider the situation where a developer writes a new integration test for a feature they just merged into master. Consider the workflow: 1. Developer writes unit tests and main feature in apache/master. They don't update the integration test version because integration tests aren't published yet. The change is merged. 2. Developer writes and publishes integration tests that are broken for the feature. 3. Developer updates main repository's integration test dependency to try the new tests. 4. Developer finds the tests are wrong, so they have to publish a new integration test version. This sequence of events makes it clear that the main repository is determining whether or not the integration tests are correct. Consider also the situation when the main repository is changed such that the assumptions made by the given integration test version are invalid. In this workflow, the developer is blocked from merging into master before the new integration test version is published. But in between the time that the new integration tests are published and the main change merges, that new integration test version is only applicable for a version of the main repository that doesn't exist yet. So I think we want the integration tests to pull down and test the repository under test, not the other way around. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#568 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AB2E28lxPQj4SQIZe8Cw4SjqVlWyjqsXks5s9t5vgaJpZM4Qw9mJ> .

foxish · 2017-12-06T19:22:54Z

So I think we want the integration tests to pull down and test the
repository under test, not the other way around.

I agree with that @mccheah. I wasn't proposing the opposite but I guess it got confusing since I went into a bit of the implementation as well in my comment. I don't think we want to "publish" the integration test jar as a separate entity. My point was that jenkins could take care of building the latest integration tests (from wherever they live, which may be for now - https://github.com/apache-spark-on-k8s/spark-integration), and the distro (from a particular PR), and then have the integration tests run on the distro. The point about jars was just to make the separation point and isn't really a necessity.

I don't mind having the integration tests actually pull down the repo and build it for now if it helps save time. I think it's the same whether jenkins does, or if we have the test logic itself do it.

mccheah · 2017-12-06T19:46:34Z

Ah, sorry for the misunderstanding there. I think we want the integration tests to target a git hash on either a remote repository to clone or a local checkout for development iteration. But maybe we should support testing against Spark distribution tarballs also because we can save build time by using a pre-published Spark distribution instead of building it ourselves in the integration test CI.

foxish mentioned this issue Nov 30, 2017

Spark on Kubernetes - basic submission client #545

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integration testing on master #568

Integration testing on master #568

foxish commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

foxish commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

foxish commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

tnachen commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

felixcheung commented Dec 1, 2017

ssuchter commented Dec 1, 2017 via email

ifilonenko commented Dec 1, 2017

felixcheung commented Dec 1, 2017 •

edited

Loading

foxish commented Dec 1, 2017 •

edited

Loading

foxish commented Dec 1, 2017

mccheah commented Dec 1, 2017

foxish commented Dec 6, 2017 •

edited

Loading

mccheah commented Dec 6, 2017

foxish commented Dec 6, 2017 •

edited

Loading

mccheah commented Dec 6, 2017 •

edited

Loading

ssuchter commented Dec 6, 2017 via email

foxish commented Dec 6, 2017

mccheah commented Dec 6, 2017

Integration testing on master #568

Integration testing on master #568

Comments

foxish commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

foxish commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

foxish commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

tnachen commented Nov 30, 2017

erikerlandson commented Nov 30, 2017

felixcheung commented Dec 1, 2017

ssuchter commented Dec 1, 2017 via email

ifilonenko commented Dec 1, 2017

felixcheung commented Dec 1, 2017 • edited Loading

foxish commented Dec 1, 2017 • edited Loading

foxish commented Dec 1, 2017

mccheah commented Dec 1, 2017

foxish commented Dec 6, 2017 • edited Loading

mccheah commented Dec 6, 2017

foxish commented Dec 6, 2017 • edited Loading

mccheah commented Dec 6, 2017 • edited Loading

ssuchter commented Dec 6, 2017 via email

foxish commented Dec 6, 2017

mccheah commented Dec 6, 2017

felixcheung commented Dec 1, 2017 •

edited

Loading

foxish commented Dec 1, 2017 •

edited

Loading

foxish commented Dec 6, 2017 •

edited

Loading

foxish commented Dec 6, 2017 •

edited

Loading

mccheah commented Dec 6, 2017 •

edited

Loading