bulkwriter sample read csv #1673
Conversation
Signed-off-by: yhmo <[email protected]>
fields = [
    FieldSchema(name="id", dtype=DataType.INT64, is_primary=True, auto_id=True),
    FieldSchema(name="path", dtype=DataType.VARCHAR, max_length=512),
Can this field be a primary key?
with LocalBulkWriter(
    schema=schema,
    local_path="/tmp/bulk_writer",
    segment_size=4*1024*1024,
Any reason to hardcode 4 MB here?
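One way to make the hardcoded value self-documenting (a sketch; `MB` and `DEFAULT_SEGMENT_SIZE` are hypothetical names, not part of the sample):

```python
# Hypothetical refactor: name the unit so the 4 MB segment size reads as
# a deliberate choice instead of a bare 4*1024*1024 literal.
MB = 1024 * 1024
DEFAULT_SEGMENT_SIZE = 4 * MB  # kept small for the sample; tune for real workloads

print(DEFAULT_SEGMENT_SIZE)  # 4194304
```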
    local_path="/tmp/bulk_writer",
    segment_size=4*1024*1024,
) as local_writer:
    read_sample_data("./data/train_embeddings.csv", local_writer)
Is it possible that the CSV is too big (e.g. 100 GB) to be loaded into memory for processing?
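A CSV far larger than memory can be streamed in fixed-size batches instead of loaded whole. A minimal stdlib sketch, independent of the sample's `read_sample_data` helper (the function and batch size here are illustrative assumptions):

```python
import csv
import os
import tempfile

def iter_csv_batches(path, batch_size=1000):
    """Yield lists of up to batch_size rows without reading the whole file."""
    with open(path, newline="") as f:
        reader = csv.DictReader(f)
        batch = []
        for row in reader:
            batch.append(row)
            if len(batch) == batch_size:
                yield batch
                batch = []
        if batch:
            yield batch  # final partial batch

# Demo on a small temporary file; a 100 GB file streams the same way.
tmp = tempfile.NamedTemporaryFile("w", suffix=".csv", delete=False, newline="")
tmp.write("id,path\n" + "".join(f"{i},img{i}.jpg\n" for i in range(25)))
tmp.close()
sizes = [len(b) for b in iter_csv_batches(tmp.name, batch_size=10)]
print(sizes)  # [10, 10, 5]
os.unlink(tmp.name)
```

Each batch could then be appended to the writer before the next one is read, keeping memory usage bounded by the batch size.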
threads = []
thread_count = 100
-rows_per_thread = 1000
+rows_per_thread = 100
Is there any limit on the size per row here?
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED. This pull request has been approved by: xiaofan-luan, XuanYang-cn, yhmo. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing /approve in a comment.
@@ -161,11 +307,11 @@ def test_cloud_bulkinsert():
    access_key=object_url_access_key,
    secret_key=object_url_secret_key,
    cluster_id=cluster_id,
-   collection_name=COLLECTION_NAME,
+   collection_name=CSV_COLLECTION_NAME,
bulk_import is missing the api_key parameter.
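For illustration, one way the missing api_key could be threaded through to the HTTP call (a sketch only; the actual `bulk_import` signature, header name, and payload field names in pymilvus may differ and are assumptions here):

```python
def build_bulk_import_request(url, api_key, cluster_id, collection_name,
                              object_url, access_key, secret_key):
    """Assemble request pieces; the api_key rides in the Authorization header."""
    headers = {"Authorization": f"Bearer {api_key}"}  # assumed bearer-token scheme
    payload = {
        "clusterId": cluster_id,
        "collectionName": collection_name,
        "objectUrl": object_url,
        "accessKey": access_key,
        "secretKey": secret_key,
    }
    return url, headers, payload

# Hypothetical values for demonstration only.
_, headers, payload = build_bulk_import_request(
    "https://controller.api.example.com/v1/vector/collections/import",
    api_key="my-api-key", cluster_id="cluster-1", collection_name="demo",
    object_url="s3://bucket/data/", access_key="ak", secret_key="sk")
print("Authorization" in headers)  # True
```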
)
print(resp)

-print(f"===================== get import job progress ====================")
+print(f"\n===================== get import job progress ====================")
job_id = resp['data']['jobId']
json.loads(resp.text)['data']['jobId']
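The suggestion above reflects that the cloud endpoint returns a `requests`-style response object, so the JSON body must be decoded before indexing into it. A small sketch with a stand-in response object:

```python
import json

class FakeResponse:
    """Stand-in for a requests.Response: .text holds the raw JSON body."""
    def __init__(self, text):
        self.text = text

resp = FakeResponse('{"code": 200, "data": {"jobId": "job-123"}}')

# resp['data']['jobId'] would fail (the response is not a dict);
# decode the body first, as the reviewer suggests:
job_id = json.loads(resp.text)['data']['jobId']
print(job_id)  # job-123
```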
)
print(resp)

-print(f"===================== get import job progress ====================")
+print(f"\n===================== get import job progress ====================")
job_id = resp['data']['jobId']
resp = get_import_progress(
Missing the api_key parameter.
@@ -174,7 +320,7 @@ def test_cloud_bulkinsert():
)
print(resp)

-print(f"===================== list import jobs ====================")
+print(f"\n===================== list import jobs ====================")
resp = list_import_jobs(
Missing the api_key parameter.
No description provided.