Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Add GCE variances to Data tests #34105

Merged
merged 35 commits into from
Apr 17, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
35 commits
Select commit Hold shift + click to select a range
5984f1e
Rename `"model_state_dict"` to `"model"`
bveeramani Nov 22, 2022
8f58490
Revert "Rename `"model_state_dict"` to `"model"`"
bveeramani Nov 22, 2022
63432eb
Merge remote-tracking branch 'upstream/master'
bveeramani Dec 5, 2022
2c33947
Merge remote-tracking branch 'upstream/master'
bveeramani Dec 6, 2022
89694a0
Merge remote-tracking branch 'upstream/master'
bveeramani Dec 19, 2022
fe60ca3
Merge remote-tracking branch 'upstream/master'
bveeramani Dec 27, 2022
d45ae9a
Merge remote-tracking branch 'upstream/master'
bveeramani Jan 2, 2023
c703dfc
Merge remote-tracking branch 'upstream/master'
bveeramani Jan 6, 2023
81dd25c
Merge remote-tracking branch 'upstream/master'
bveeramani Jan 19, 2023
fba788e
Merge remote-tracking branch 'upstream/master'
bveeramani Jan 24, 2023
de05655
Update annotations.py
bveeramani Jan 26, 2023
fd2ff91
Revert "Update annotations.py"
bveeramani Jan 26, 2023
7c3ac36
Merge remote-tracking branch 'upstream/master'
bveeramani Feb 14, 2023
9c7b546
Merge remote-tracking branch 'upstream/master'
bveeramani Feb 28, 2023
9d7a15f
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 2, 2023
4f02efe
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 2, 2023
46f43d9
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 6, 2023
a644307
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 10, 2023
e5d0bf1
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 20, 2023
fe7286c
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 21, 2023
51b3a09
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 24, 2023
79f8be9
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 28, 2023
53bede6
Merge remote-tracking branch 'upstream/master'
bveeramani Mar 28, 2023
8bc2420
Initial commit
bveeramani Apr 3, 2023
6cb7bb0
Add other tests
bveeramani Apr 3, 2023
121603a
Merge remote-tracking branch 'upstream/master' into gce-data
bveeramani Apr 5, 2023
23b9110
Update files
bveeramani Apr 5, 2023
952d010
Fix tests
bveeramani Apr 7, 2023
7cec6e3
Merge remote-tracking branch 'upstream/master'
bveeramani Apr 10, 2023
a28edab
Update files
bveeramani Apr 11, 2023
7581573
Update inference_gce.yaml
bveeramani Apr 11, 2023
39d692c
Merge branch 'master' into gce-data
bveeramani Apr 11, 2023
fcad4f3
Update config
bveeramani Apr 11, 2023
0f442e3
Update pipelined_training_compute_gce.yaml
bveeramani Apr 11, 2023
d641ec4
Remove some tests
bveeramani Apr 14, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west1
allowed_azs:
allowed_azs:
- us-west1-c

max_workers: 19
Expand Down
28 changes: 28 additions & 0 deletions release/nightly_tests/dataset/inference_gce.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west1
allowed_azs:
- us-west1-b

max_workers: 999

head_node_type:
name: head_node
instance_type: n1-standard-32-nvidia-tesla-t4-1


worker_node_types:
- name: worker_node
instance_type: n2-standard-32 # aws m5.8xlarge
min_workers: 0
max_workers: 0
use_spot: false
resources:
cpu: 32
- name: gpu_node
instance_type: n1-standard-32-nvidia-tesla-t4-1 # aws g4dn.16xlarge
min_workers: 1
max_workers: 1
use_spot: false
resources:
cpu: 64
gpu: 1
23 changes: 23 additions & 0 deletions release/nightly_tests/dataset/pipelined_training_compute_gce.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west1
allowed_azs:
- us-west1-b


max_workers: 999

head_node_type:
name: head_node
instance_type: n2-highmem-16 # i3.8xlarge

worker_node_types:
- name: memory_node
instance_type: n2-highmem-16 # i3.8xlarge
min_workers: 10
max_workers: 10
use_spot: false
- name: gpu_node
instance_type: n1-highmem-32-nvidia-tesla-v100-4 # p3.8xlarge
min_workers: 4
max_workers: 4
use_spot: false
22 changes: 22 additions & 0 deletions release/nightly_tests/dataset/shuffle_compute_gce.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west1
allowed_azs:
- us-west1-b

max_workers: 999

head_node_type:
name: head_node
instance_type: n1-standard-16-nvidia-tesla-t4-1 # g3.4xlarge

worker_node_types:
- name: worker_node
instance_type: n1-standard-16-nvidia-tesla-t4-1 # g3.4xlarge
min_workers: 4
max_workers: 4
use_spot: false
- name: worker_node_2
instance_type: c2-standard-30 # c5.9xlarge
min_workers: 2
max_workers: 2
use_spot: false
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west1
allowed_azs:
- us-west1-c

max_workers: 0

head_node_type:
name: head_node
instance_type: n2-standard-16 # m5.4xlarge

worker_node_types:
- name: worker_node
instance_type: n2-standard-16 # m5.4xlarge
max_workers: 0
min_workers: 0
use_spot: false
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west1
allowed_azs:
allowed_azs:
- us-west1-c

gcp_advanced_configurations_json:
Expand Down
Original file line number Diff line number Diff line change
@@ -1,14 +1,15 @@
cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west1
allowed_azs:
allowed_azs:
- us-west1-c

#aws:
# BlockDeviceMappings:
# - DeviceName: /dev/sda1
# Ebs:
# DeleteOnTermination: true
# VolumeSize: 1000
gcp_advanced_configurations_json:
instance_properties:
disks:
- boot: true
auto_delete: true
initialize_params:
disk_size_gb: 1000

head_node_type:
name: head_node
Expand Down
Loading