
STORAGE: Does a connection need to be bound to anything other than a Batch or Iterator? #728

Closed
dhermes opened this issue Mar 13, 2015 · 5 comments
Labels
api: storage Issues related to the Cloud Storage API.

Comments

dhermes (Contributor) commented Mar 13, 2015

Run

$ git grep -n '\.connection' -- gcloud/datastore/ | egrep -v test | egrep -v 'datastore\.connection'
$ git grep -n '\.connection' -- gcloud/storage/ | egrep -v test | egrep -v 'storage\.connection'

to compare uses.

@dhermes dhermes added the api: storage Issues related to the Cloud Storage API. label Mar 13, 2015
@dhermes dhermes added this to the Storage Stable milestone Mar 13, 2015
@dhermes dhermes self-assigned this Mar 14, 2015
dhermes (Contributor, Author) commented Mar 14, 2015

I'm going to start this work as a series of small changes (I'm guessing there will be a lot of test refactoring).

Please protest if there are any issues.

UPDATE: After starting this, it seems quite daunting.

After checking blob.py and bucket.py, it seems the following methods all use a connection.

From git grep -e '^ def' -- gcloud/storage/blob.py (edited by me):

def generate_signed_url(self, expiration, method='GET'):
def exists(self):
def rename(self, new_name):
def delete(self):
def download_to_file(self, file_obj):
def download_to_filename(self, filename):
def download_as_string(self):
def upload_from_file(self, file_obj, rewind=False, ...
def upload_from_filename(self, filename, ...
def upload_from_string(self, data, ...
def make_public(self):  # Maybe not?
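One way the blob methods above could be decoupled from a bound connection is to accept an explicit connection argument, falling back to a module-level default. This is a hypothetical sketch only (the names `_require_connection`, `_DEFAULT_CONNECTION`, and `api_request` are assumptions, not the library's actual API):

```python
# Hypothetical sketch: methods take an optional explicit connection
# instead of relying on one bound to the object.

_DEFAULT_CONNECTION = None  # would be configured at package level


def _require_connection(connection=None):
    """Return the passed connection, or fall back to the module default."""
    if connection is None:
        if _DEFAULT_CONNECTION is None:
            raise ValueError('No connection available.')
        connection = _DEFAULT_CONNECTION
    return connection


class Blob(object):
    def __init__(self, name, bucket_name):
        self.name = name
        self.bucket_name = bucket_name

    def exists(self, connection=None):
        # The connection is scoped to this single call, not the object.
        connection = _require_connection(connection)
        return connection.api_request(
            method='GET',
            path='/b/%s/o/%s' % (self.bucket_name, self.name))
```

Under this pattern, only the call sites that actually talk to the API need a connection in scope; constructing `Blob` objects stays purely local.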

From git grep -e '^ def' -- gcloud/storage/bucket.py (edited by me):

def __iter__(self):
def __contains__(self, blob_name):
def exists(self):
def create(self, project=None):
def get_blob(self, blob_name):
def get_all_blobs(self):
def iterator(self, prefix=None, delimiter=None, max_results=None,
def delete(self, force=False):
def delete_blob(self, blob_name):
def delete_blobs(self, blobs, on_error=None):
def copy_blob(self, blob, destination_bucket, new_name=None):
def upload_file(self, filename, blob_name=None):
def upload_file_object(self, file_obj, blob_name=None):
def make_public(self, recursive=False, future=False):

(Updated March 31, 2015)
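On the bucket side, the same explicit-connection idea could thread through to the blobs a bucket returns. Again a hypothetical sketch, assuming an `api_request` method on the connection; none of this is the library's actual API:

```python
# Hypothetical sketch: a Bucket holds no connection of its own; the
# caller supplies one per call, used only for that single request.


class Blob(object):
    def __init__(self, name, bucket_name):
        self.name = name
        self.bucket_name = bucket_name


class Bucket(object):
    def __init__(self, name):
        self.name = name

    def get_blob(self, blob_name, connection):
        response = connection.api_request(
            method='GET', path='/b/%s/o/%s' % (self.name, blob_name))
        if response is None:
            return None  # blob does not exist
        return Blob(response['name'], self.name)
```

The returned `Blob` carries no connection either, so the caller decides at each subsequent operation which connection to use.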

dhermes (Contributor, Author) commented Mar 23, 2015

@tseaver We have some methods which make HTTP requests for historical reasons only:

Also, Blob.make_public doesn't really fit the rest of the methods, but it makes an HTTP request via the ObjectACL. It seems hairier to unwind this.

tseaver (Contributor) commented Mar 26, 2015

I think we could easily change the metadata-updating methods to drop the call (and require calling patch()). Maybe ACL updates should also require an explicit patch or save call?
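That declarative pattern could look something like the following sketch, where metadata edits accumulate locally and nothing is sent until an explicit patch() call. The names `update_metadata` and `api_request` are assumptions for illustration, not the library's actual API:

```python
# Hypothetical sketch of the declarative pattern: local mutation,
# explicit flush via patch().


class Bucket(object):
    def __init__(self, name):
        self.name = name
        self._properties = {}
        self._changes = {}

    def update_metadata(self, **kwargs):
        # Record the change locally; no HTTP request happens here.
        self._changes.update(kwargs)

    def patch(self, connection):
        # One explicit request sends all pending changes at once.
        connection.api_request(method='PATCH',
                               path='/b/%s' % self.name,
                               data=dict(self._changes))
        self._properties.update(self._changes)
        self._changes.clear()
```

A nice side effect is that several metadata edits batch into a single PATCH request, instead of one request per setter.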

dhermes (Contributor, Author) commented Mar 27, 2015

@tseaver I agree. Being more declarative was the original goal of #545. I tried to sketch a way (in #761) to make these properties, but some of them are a bit involved (i.e., they have multiple associated values).

We could look into using apitools generated clients at some point?

dhermes (Contributor, Author) commented Mar 31, 2015

@tseaver Keeping __iter__ and __contains__ is impossible if no Connection is bound to a Bucket, since those protocol methods can't accept a connection argument. As I mention in #761, using __iter__ is pretty scary for large applications / buckets.
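To see why, iterating a bucket can be sketched as a paging generator: it needs a connection for the entire lifetime of the loop, and it silently issues one request per page, which is exactly what makes __iter__ over a huge bucket worrying. A hypothetical sketch (the function name, `api_request`, and the paging field names are assumptions):

```python
# Hypothetical sketch: lazily iterate blob names, one API request per
# page of results.


def iter_blobs(connection, bucket_name, page_size=1000):
    page_token = None
    while True:
        params = {'maxResults': page_size}
        if page_token is not None:
            params['pageToken'] = page_token
        response = connection.api_request(
            method='GET', path='/b/%s/o' % bucket_name, params=params)
        for item in response.get('items', ()):
            yield item['name']
        page_token = response.get('nextPageToken')
        if page_token is None:
            break  # no more pages
```

A bucket with a million objects would quietly make a thousand requests behind a plain `for blob in bucket:` loop, with no connection visible at the call site.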
