Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Standardize environment.yml #81

Open
wants to merge 5 commits into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
134 changes: 134 additions & 0 deletions cep-??.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,134 @@
<table>
<tr><td> Title </td><td> Specification of <code>environment.yml</code> input files </td>
<tr><td> Status </td><td> Draft </td></tr>
<tr><td> Author(s) </td><td> Jaime Rodríguez-Guerra &lt;[email protected]&gt;</td></tr>
<tr><td> Created </td><td> June 4, 2024 </td></tr>
<tr><td> Updated </td><td> June 4, 2024 </td></tr>
<tr><td> Discussion </td><td> https://github.com/conda/ceps/pull/81 </td></tr>
<tr><td> Implementation </td><td> NA </td></tr>
</table>

## Abstract

This CEP standardizes the format of `environment.yml` input files for conda clients.

## Motivation

The motivation of this CEP is merely informative. It describes the details of an existing file format.

## Nomenclature

The `environment.yml` file format was introduced by `conda env`. It has been traditionally referred to as "the YAML input file", but other YAML files exist in the ecosystem now. To avoid confusion, in this CEP we will only use the term `environment.yml`. Do note that the actual filename of an input file using this format might be a different one (e.g. `env.yml`).

## Specification

`environment.yml` files are YAML documents that encode the necessary information to create a new conda environment. They can contain up to five top-level keys:

- `name`: The preferred name for the environment.
- `prefix`: The preferred full path to the environment. Often ignored.
- `dependencies`: List of package specifications that MUST be installed in the new environment.
- `channels`: conda channels that will be used to resolve the dependencies.
- `variables`: Environment variables that SHOULD be added to the `conda-meta/state` file in the resulting environment.

Additional sections can be present, but they MUST be ignored and the user SHOULD receive an informative warning about them.

The file name is not standardized, but the extension MUST be `yml` or `yaml`.

### `name` and `prefix`

Optional, `str`.

Both fields refer to the _preferred_ name or path for the newly created environment. Tools SHOULD allow these suggestions to be overridden by the user with additional CLI flags or equivalent. If the proposed environment path exists, tools MUST NOT overwrite silently by default.

Special names `base` and `root` SHOULD NOT be accepted. Prefixes targetting protected system paths SHOULD be rejected. Paths can contain tildes (`~`) and environment variables, and they MUST be expanded when present.

The name of the environment (and the last component of the expanded `prefix` path) MUST NOT contain any of these characters: `/`, ` `, `:`, `#`.

### `dependencies`

Required, `list[str | dict[str, Any]]`.

The simplest form for this section is a list of `str` encoding `MatchSpec`-compatible requirements. These items MUST be processed as conda requirements and submitted to the solver (along with `channels`) to obtain the contents for the resulting conda environment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should platform selectors be mentioned here or is that implied with MatchSpec-compatible?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

AFAIK, environment.yml doesn't support platform selectors like meta.yaml, doesn't it?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're right, my bad. Would be a nice feature though.


This section can also contain "subsection" dictionaries that map `str` to arbitrary values. Each key refers to a non-conda installer tool that will process the associated contents as necessary. The additional subsections MUST be processed after the conda requirements. They SHOULD only add new contents. These keys SHOULD NOT overwrite existing contents.

Currently known subsections include `pip`, the contents of which encode a list of `str` referring to PyPI requirements. How this list is processed is left as an implementation detail, but common approaches involve invoking the `pip` command-line directly.

Additional subsections are allowed. The conda client MUST error out if it cannot process unknown sections.

### `channels`

Optional, `list[str]`.

These are the conda channels that will be queried to solve the requirements added in `dependencies`. They can be expressed with names, URLs and local paths.

If not specified, the conda client MUST use the default configuration. When specified, these channels MUST be used to populate the channel list passed to the solver. Then default channel configuration MUST be appended, unless the special name `nodefaults` is present. When this is the case, the default configuration MUST no be appended. The `nodefaults` name MUST NOT be passed to the solver.

### `variables`

Optional, `dict[str, str]`.

Keys MUST be valid environment variable names for the target operating system. We recommend sticking to names compatible with both Windows and POSIX standards. Values SHOULD be `str`. If they are not, they MUST be converted to `str`.

These contents SHOULD be dumped into the `conda-meta/state` file of the target environment, or equivalent, so they can be set upon environment activation.

## Examples

Simplest possible `environment.yml`:

```yaml
dependencies:
- numpy
```

With a name:

```yaml
name: test
dependencies:
- numpy >=1.10

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What about conditions if a dependency is used on different platforms, python versions, implementations, etc.?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In this CEP I only want to document the current state, mostly as available in the original implementation in conda.

There are a number of extensions in community projects that are out of scope (e.g. I think micromamba has a sel(xxxx): numpy syntax for selectors).

```

With channels:

```yaml
name: test
channels:
- conda-forge
dependencies:
- numpy
```

With a `pip` section:

```yaml
name: test
channels:
- conda-forge
dependencies:
- numpy
- pip:
- scipy
```

With variables:

```yaml
name: test
channels:
- conda-forge
dependencies:
- numpy
variables:
MY_ENV_VAR: "My Value"
```

## Reference

- [Allowed extensions](https://github.com/conda/conda/blob/24.5.0/conda/env/specs/yaml_file.py)
- [Allowed keys](https://github.com/conda/conda/blob/24.5.0/conda/env/env.py#L25)
- [Extra subsections in `dependencies`](https://github.com/conda/conda/blob/841d9d57fd96ad27cda4b7c43549104a96f961ce/conda/cli/main_env_create.py#L167-L185)

## Copyright

All CEPs are explicitly [CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/).