Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

guide: Modeling Pipelines #2883

Closed
5 of 18 tasks
jorgeorpinel opened this issue Oct 1, 2021 · 5 comments
Closed
5 of 18 tasks

guide: Modeling Pipelines #2883

jorgeorpinel opened this issue Oct 1, 2021 · 5 comments
Labels
A: docs Area: user documentation (gatsby-theme-iterative) C: guide Content of /doc/user-guide ✨ epic Placeholder ticket for multi-sprint direction, use story, improvement p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately.

Comments

@jorgeorpinel
Copy link
Contributor

jorgeorpinel commented Oct 1, 2021

From #144 (comment)

Structure:

See #2857 for the original structure ideas.

  • Defining Pipelines
    • Focus on the pipelining process and codification link to dvc.yaml ref. for formalities
    • Defining stages (extract from run and stage refs.)
      • Dependencies and (general) parameters
      • Outputs and metrics/plots
    • Should provide some actual dvc.yaml example/templates to copy&paste.
    • Multiple dvc.yaml files vs. multiple pipelines (repro --recursive)
    • Describing formally outs, deps, stage doesn't make sense here
    • Mention exp init? (Link to appropriate page/section) Maybe
  • Pipeline reproduction
  • Experimental pipelines (discussion)
  • Operationalizing pipelines?

Other tasks

@jorgeorpinel jorgeorpinel added 🐛 type: bug Something isn't working. p1-important Active priorities to deal within next sprints ✨ epic Placeholder ticket for multi-sprint direction, use story, improvement C: guide Content of /doc/user-guide labels Oct 1, 2021
@jorgeorpinel jorgeorpinel added A: docs Area: user documentation (gatsby-theme-iterative) and removed 🐛 type: bug Something isn't working. labels Jan 14, 2022
@jorgeorpinel jorgeorpinel added p1-current-quarter and removed p1-important Active priorities to deal within next sprints labels Apr 27, 2022
@jorgeorpinel jorgeorpinel added p1-important Active priorities to deal within next sprints and removed p1-current-quarter labels Jul 9, 2022
@jorgeorpinel
Copy link
Contributor Author

Updated plan for this epic per #3414 (comment). PTAL @dberenbaum @shcheklein

@dberenbaum
Copy link
Collaborator

dberenbaum commented Sep 8, 2022

Thanks @jorgeorpinel!

My specific feedback on #3414 (comment):

We should probably talk more about dvc exp init here? (since it helps to bootstrap the dvc.yaml after all)?

We can mention dvc exp init, but I think it's still immature and we are discussing how to improve this onboarding experience now, so it might end up being very short-lived.

we should provide some example - actual pipepline files?

Yes, I think having some templates that people can copy and paste would help people get started as much as dvc exp init.

mention VS Code as an editor that supports schema definition

Yes, I think we could mention VS Code as one example of an editor and show how these editors can be used to write/modify/validate dvc.yaml files.

Include things like Jupyter notebooks - how to make a pipeline out of it ... etc

Yeah, definitely useful although probably a fairly large project on its own. I know @RCdeWit and @alex000kim are working on something related, so it would be good to sync with them.

@shcheklein
Copy link
Member

I think this item Defining stages (extract from run and stage refs.) doesn't make sense anymore.

Operationalizing pipelines
Experimental pipelines

I'm not sure what it means

Pipeline reproduction

Running pipeline? - we can include both exp and repro I guess. No need to complicate

so it might end up being very short-lived.

even so, it's fine to mention, i think

@jorgeorpinel
Copy link
Contributor Author

jorgeorpinel commented Sep 13, 2022

having some templates that people can copy and paste
mention VS Code

Added to description.

@jorgeorpinel
Copy link
Contributor Author

jorgeorpinel commented Sep 13, 2022

Defining stages (extract from run and stage refs.) doesn't make sense anymore.

Scratched out now (I left it since it's been done in the current doc).

Experimental pipelines

Interaction between exp and dvc.yaml features. The task overlaps with #2768 in that "removing the pipeline and stages concept would be ideal," (from Running Exps).

If it's not a lot it could be a section in Pipeline reproduction.

Operationalizing pipelines

Using pipelines "in production" or some other application. Maybe CML? Probably the last topic to worry about or maybe not even for DVC docs. Thoughts?

@jorgeorpinel jorgeorpinel self-assigned this Oct 13, 2022
@jorgeorpinel jorgeorpinel removed their assignment Feb 17, 2023
@dberenbaum dberenbaum added p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately. and removed p1-important Active priorities to deal within next sprints labels Feb 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A: docs Area: user documentation (gatsby-theme-iterative) C: guide Content of /doc/user-guide ✨ epic Placeholder ticket for multi-sprint direction, use story, improvement p2-nice-to-have Less of a priority at the moment. We don't usually deal with this immediately.
Projects
None yet
Development

No branches or pull requests

3 participants