[TYPO] #3635

njzjz · 2024-04-02T03:55:10Z

No description provided.

Set the default `save_ckpt` to `model.ckpt` as the prefix. When saving checkpoints, `model.ckpt-100.pt` will be saved, and `model.ckpt.pt` will be symlinked to `model.ckpt-100.pt`. A `checkpoint` file will be dedicated to record `model.ckpt-100.pt`. This keeps the same behavior as the TF backend. One can do the below using the PT backend just like the TF backend: ```sh dp --pt train input.json # one can cancel the training before it finishes dp --pt freeze ``` --------- Signed-off-by: Jinzhe Zeng <[email protected]>

…eling#3195) Fix https://github.com/deepmodeling/deepmd-kit/security/code-scanning/2096 --------- Signed-off-by: Jinzhe Zeng <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

per discussion. Signed-off-by: Jinzhe Zeng <[email protected]> Co-authored-by: Han Wang <[email protected]>

``` - source - tests - common - tf - pt ``` --------- Signed-off-by: Jinzhe Zeng <[email protected]>

Co-authored-by: Han Wang <[email protected]>

Fix deepmodeling#3121. There are TODOs: (1) PyTorch-backend specific features and arguments; (2) Python interface installation. Currently, the TensorFlow backend is always installed, and I am considering rewriting the logic; (3) Unsupported features - write docs when implemented. --------- Signed-off-by: Jinzhe Zeng <[email protected]>

The default one from PyPI is for CU12. --------- Signed-off-by: Jinzhe Zeng <[email protected]>

…pmodeling#3201) If so, throw the following error: ``` -- PyTorch CXX11 ABI: 0 CMake Error at CMakeLists.txt:162 (message): PyTorch CXX11 ABI mismatch TensorFlow: 0 != 1 ``` Signed-off-by: Jinzhe Zeng <[email protected]>

…deling#3200) Fix deepmodeling#3120. One can disable building the TensorFlow backend during `pip install` by setting `DP_ENABLE_TENSORFLOW=0`. --------- Signed-off-by: Jinzhe Zeng <[email protected]>

…g net. (deepmodeling#3199) - add dp model format (backend independent definition) for the fitting - refactor torch support, compatible with dp model format - fix mlp issue: the idt should only be used when a skip connection is available. - add tools `to_numpy_array` and `to_torch_tensor`. --------- Co-authored-by: Han Wang <[email protected]>

Co-authored-by: Han Wang <[email protected]>

This PR fixes GPU UTs; Delete the PREPROCESS_DEVICE in torch data preprocess and use training DEVICE instead, which will be removed after the dataset is refomated. --------- Signed-off-by: Jinzhe Zeng <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jinzhe Zeng <[email protected]> Co-authored-by: Han Wang <[email protected]> Co-authored-by: Han Wang <[email protected]>

Today [GitHub introduced the new M1 runners](https://github.blog/changelog/2024-01-30-github-actions-introducing-the-new-m1-macos-runner-available-to-open-source/), making it possible to build macos-arm64 wheels without cross-building. Remove old hacked codes for cross-building.

Signed-off-by: Jinzhe Zeng <[email protected]>

Migrated from this [PR](dptech-corp/deepmd-pytorch#174). This is to reimplement the PairTab Model in Pytorch. Notes: 1. Different from the tensorflow version, the pytorch version abstracts away all the post energy conversion operations (force, virial). 2. Added extrapolation when `rcut` > `rmax`. The pytorch version overwrite energy beyond extrapolation endpoint to `0`. These features are not available in the tensorflow version. The extrapolation uses a cubic spline form, the 1st order derivation for the starting point is estimated using the last two rows in the user defined table. See example below: ![img_v3_027k_b50c690d-dc2d-4803-bd2c-2e73aa3c73fg](https://github.com/deepmodeling/deepmd-kit/assets/137014849/f3efa4d3-795e-4ff8-acdc-642227f0e19c) ![img_v3_027k_8de38597-ef4e-4e5b-989e-dbd13cc93fag](https://github.com/deepmodeling/deepmd-kit/assets/137014849/493da26d-f01d-4dd0-8520-ea2d84e7b548) ![img_v3_027k_f8268564-3f5d-49e6-91d6-169a61d9347g](https://github.com/deepmodeling/deepmd-kit/assets/137014849/b8ad4d4d-a4a4-40f0-94d1-810006e7175b) ![img_v3_027k_3966ef67-dd5e-4f48-992e-c2763311451g](https://github.com/deepmodeling/deepmd-kit/assets/137014849/27f31e79-13c8-4ce8-9911-b4cc0ac8188c) --------- Co-authored-by: Anyang Peng <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Signed-off-by: Jinzhe Zeng <[email protected]>

Use the same arguments as TF. [PyTorch on Tensorboard](https://pytorch.org/docs/stable/tensorboard.html): ![1706608497314](https://github.com/deepmodeling/deepmd-kit/assets/9496702/9d747ee2-2e76-43d3-8252-7dbd0cea6768) [PyTorch Profiler on Tensorboard](https://pytorch.org/tutorials/intermediate/tensorboard_profiler_tutorial.html): ![image](https://github.com/deepmodeling/deepmd-kit/assets/9496702/929d69b7-a696-45b1-8e9b-2b491177ad95) --------- Signed-off-by: Jinzhe Zeng <[email protected]>

…ng#3208) Features: - abstract base classes for atomic model, fitting and descriptor. - dp model format for atomic models - dp model format for models. - torch support for atomic model format. - torch support `fparam` and `aparam`. This pr also introduces the following updates: - support region and nlist in numpy code. - class decorator like `fitting_check_output` gives human readable class names. - support int types in precision dict. - fix descriptor interfaces. - refactor torch atomic model impl. introduces dirty hacks to be fixed. - provide `format_nlist` that format the nlist in forward_lower method. Known limitations: - torch atomic model has dirty hacks - interfaces for descriptor, fitting and model statistics was not considered, should be fixed in future PRs. Will be fixed - [x] dp model module path is a mess to be refactorized. - [x] nlist consistency should be checked. if not format nlist. - [x] doc strings. - [x] `fparam` and `aparam` support. --------- Co-authored-by: Han Wang <[email protected]>

Fix deepmodeling#3214. In the gmx patch file, `${TENSORFLOW_ROOT}` is used other than `${TensorFlow_LIBRARY_PATH}$` or `${TENSORFLOW_INCLUDE_DIRS}`, so the fastest workaround is to set `${TENSORFLOW_ROOT}`. https://github.com/deepmodeling/deepmd-kit/blob/eb9b2efedf4efc946894800a0d7abf5056f4bb7a/source/gmx/patches/2020.2/CMakeLists.txt.patch.in#L14-L18 Signed-off-by: Jinzhe Zeng <[email protected]>

…ng#3219) Co-authored-by: Han Wang <[email protected]>

Fix deepmodeling#3214. Signed-off-by: Jinzhe Zeng <[email protected]>

Fix deepmodeling#3045. All memory leaks have been fixed! --------- Signed-off-by: Jinzhe Zeng <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

This PR is to provide backend independent implementation of PairTabModel in `numpy`. Also the cross framework `serialization` and `deserialization` are added. --------- Co-authored-by: Anyang Peng <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

add shape hit to doc string. Co-authored-by: Han Wang <[email protected]>

…ling#3226) To be consistent with TF, as discussed in deepmodeling#3213 (comment). Old PT models are expected to be incompatible. Signed-off-by: Jinzhe Zeng <[email protected]>

We can change the format of the global logger in the future if the additional information is helpful (e.g., time, path, etc). --------- Signed-off-by: Jinzhe Zeng <[email protected]>

Signed-off-by: Jinzhe Zeng <[email protected]>

Avoid installing tensorflow as build requires when `DP_ENABLE_TENSORFLOW` is `0`. Signed-off-by: Jinzhe Zeng <[email protected]>

- mv all dpmodel related UTs to a separate directory - split the large test_model_format_utils.py as tests for different modules. --------- Co-authored-by: Han Wang <[email protected]>

…modeling#3235) Co-authored-by: Han Wang <[email protected]>

Co-authored-by: Han Wang <[email protected]>

…eepmodeling#3618) Signed-off-by: Jinzhe Zeng <[email protected]>

deepmodeling#3578 deepmodeling#3579 --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

deepmodeling#3475 --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Signed-off-by: Jinzhe Zeng <[email protected]>

Only conduct file renaming in this PR. --------- Signed-off-by: Jinzhe Zeng <[email protected]>

The wrong type hint appeared in f5c67af, very old... Signed-off-by: Jinzhe Zeng <[email protected]>

Signed-off-by: Jinzhe Zeng <[email protected]>

…ng#3626) - also add missing UT for execute_all --------- Co-authored-by: Han Wang <[email protected]>

…modeling#3631) Signed-off-by: Jinzhe Zeng <[email protected]>

- add UT for it. - at the moment, only energy is supported in the `base_atomic_model`. handling of multiple output stat will be implemented in a future PR. --------- Co-authored-by: Han Wang <[email protected]>

Signed-off-by: Jinzhe Zeng <[email protected]>

njzjz and others added 30 commits January 28, 2024 16:38

drop tqdm (deepmodeling#3194)

0bb44f3

per discussion. Signed-off-by: Jinzhe Zeng <[email protected]> Co-authored-by: Han Wang <[email protected]>

reorganize tests directory (deepmodeling#3196)

8900561

``` - source - tests - common - tf - pt ``` --------- Signed-off-by: Jinzhe Zeng <[email protected]>

breaking: pt: unify the output of descriptors. (deepmodeling#3190)

1e51a88

Co-authored-by: Han Wang <[email protected]>

fix: install CU11 PyTorch in the CU11 docker image (deepmodeling#3198)

4a29c8c

The default one from PyPI is for CU12. --------- Signed-off-by: Jinzhe Zeng <[email protected]>

allow disabling TensorFlow backend during Python installation (deepmo…

de18f78

…deling#3200) Fix deepmodeling#3120. One can disable building the TensorFlow backend during `pip install` by setting `DP_ENABLE_TENSORFLOW=0`. --------- Signed-off-by: Jinzhe Zeng <[email protected]>

remove duplicated fitting output check. fix codeql (deepmodeling#3202)

b800043

Co-authored-by: Han Wang <[email protected]>

fix GPU test OOM problem (deepmodeling#3207)

d2edb77

Signed-off-by: Jinzhe Zeng <[email protected]>

pt: set nthreads from env (deepmodeling#3205)

19a8dfb

Signed-off-by: Jinzhe Zeng <[email protected]>

fix bug of output def: the reduced virial is not defined. (deepmodeli…

677d936

…ng#3219) Co-authored-by: Han Wang <[email protected]>

gmx: fix include directive (deepmodeling#3221)

412c812

Fix deepmodeling#3214. Signed-off-by: Jinzhe Zeng <[email protected]>

c: fix all memory leaks; add sanitizer checks (deepmodeling#3223)

ab2c551

Fix deepmodeling#3045. All memory leaks have been fixed! --------- Signed-off-by: Jinzhe Zeng <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

fix model doc str: add shape hint to doc string. (deepmodeling#3225)

cd77429

add shape hit to doc string. Co-authored-by: Han Wang <[email protected]>

pt: rename atomic_virial to atom_virial in the model output (deepmode…

17f2c35

…ling#3226) To be consistent with TF, as discussed in deepmodeling#3213 (comment). Old PT models are expected to be incompatible. Signed-off-by: Jinzhe Zeng <[email protected]>

pt: apply global logger to pt (deepmodeling#3222)

22197f5

We can change the format of the global logger in the future if the additional information is helpful (e.g., time, path, etc). --------- Signed-off-by: Jinzhe Zeng <[email protected]>

pt: apply global user set precision to pt (deepmodeling#3220)

7f5d67c

Signed-off-by: Jinzhe Zeng <[email protected]>

fix DP_ENABLE_TENSORFLOW support (deepmodeling#3229)

f5bb131

Avoid installing tensorflow as build requires when `DP_ENABLE_TENSORFLOW` is `0`. Signed-off-by: Jinzhe Zeng <[email protected]>

test: better structure for the dpmodel uts (deepmodeling#3232)

18c43f6

- mv all dpmodel related UTs to a separate directory - split the large test_model_format_utils.py as tests for different modules. --------- Co-authored-by: Han Wang <[email protected]>

fix: pt: energy model forward lower is not tested and has bugs. (deep…

13a781f

…modeling#3235) Co-authored-by: Han Wang <[email protected]>

wanghan-iapcm and others added 14 commits March 27, 2024 12:50

implement electronic configuration embedding (deepmodeling#3609)

f16d543

Co-authored-by: Han Wang <[email protected]>

Fix fine-tuning entries bug when doing restart. (deepmodeling#3616)

6da8eef

chore(build): move static part of dynamic metadata to pyproject.toml (d…

571bd52

…eepmodeling#3618) Signed-off-by: Jinzhe Zeng <[email protected]>

fix c++ interface bug (deepmodeling#3613)

c2371cd

deepmodeling#3578 deepmodeling#3579 --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

pt: support list format batch size (deepmodeling#3614)

7933c5e

deepmodeling#3475 --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

pt: use unified activation (deepmodeling#3619)

23f67a1

feat(pt): add op library (deepmodeling#3620)

386f3b2

Signed-off-by: Jinzhe Zeng <[email protected]>

chore: move source/op to source/op/tf (deepmodeling#3621)

cc30f27

Only conduct file renaming in this PR. --------- Signed-off-by: Jinzhe Zeng <[email protected]>

fix: fix type hint of sel (deepmodeling#3624)

d0d5a94

The wrong type hint appeared in f5c67af, very old... Signed-off-by: Jinzhe Zeng <[email protected]>

feat: consistent type embedding (deepmodeling#3617)

0be9714

Signed-off-by: Jinzhe Zeng <[email protected]>

feat: auto batch size supports methods that return a dict (deepmodeli…

2e6ab1b

…ng#3626) - also add missing UT for execute_all --------- Co-authored-by: Han Wang <[email protected]>

fix: fix DPOSPath.save_numpy, DPH5Path.is_file, DPH5Path.is_dir (deep…

15e4926

…modeling#3631) Signed-off-by: Jinzhe Zeng <[email protected]>

feat: compute output stat for a dict of labels. (deepmodeling#3628)

8e0cc90

- add UT for it. - at the moment, only energy is supported in the `base_atomic_model`. handling of multiple output stat will be implemented in a future PR. --------- Co-authored-by: Han Wang <[email protected]>

fix(pt): add libtorch directory to rpath

d42bd8e

Signed-off-by: Jinzhe Zeng <[email protected]>

njzjz closed this Apr 2, 2024

github-actions bot added Python Core CUDA ROCM OP C++ LAMMPS Gromacs Docs Examples i-PI C labels Apr 2, 2024

njzjz reopened this Apr 2, 2024

njzjz closed this Apr 2, 2024

njzjz changed the title ~~pt rpath~~ [TYPO] Apr 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TYPO] #3635

[TYPO] #3635

njzjz commented Apr 2, 2024 •

edited

Loading

[TYPO] #3635

[TYPO] #3635

Conversation

njzjz commented Apr 2, 2024 • edited Loading

njzjz commented Apr 2, 2024 •

edited

Loading