Initial attempt at enabling reading the columns from the datasource #45

graysonarts · 2016-06-30T16:39:43Z

Addresses issue #42 . This enables the ability to read columns. I want to do more extensive testing, and I still need to add documentation into the code (pydoc stuff), but this is the initial attempt for feedback on the approach.

benlower · 2016-06-30T17:30:47Z

Nice. Some questions:

what is the thinking with the version of "version='0.1.0-dev0'"? how is that being used and when will we increment the patch number?
i don't think this enables writing columns, correct? what would that entail?

graysonarts · 2016-06-30T17:38:30Z

@benlower the version number was changed to appease setuptools. That's the preferred format for in-development versions. I'd honestly, leave it at 0 unless we need to push a development version to pypi for some reason (let's try to not do that). We'll increment the minor for the next release of features, but that'll be done right before we merge into master.

This does not address writing columns at all. We'll need more logic, but I wanted to unblock @t8y8 for reading columns as quickly as possible, so focused on reading first, and then I'll add writing.

t8y8 · 2016-06-30T19:22:31Z

Will this work for TWBs? The XPath queries look like they shouldn't care, but workbooks might handle local/remote fields slightly differently?

t8y8 · 2016-06-30T19:23:44Z

tableaudocumentapi/column.py

@@ -0,0 +1,62 @@
+import functools
+
+_ATTRIBUTES = [


Can you comment how each of these fields relates to the UI, it's not intuitive that caption is what most people would call the 'name' in the UI, etc.

Sure, I can do that.

graysonarts · 2016-06-30T20:11:27Z

@t8y8 For a workbook, you'd need to iterate through all of the datasources and get the columns from the datasource. Since workbook creates a Datasource object for each <datasource> tag passing only that tag into the object, it should "just work". I need to write test cases to ensure that it's true, though.

Do you want a method on workbook that gives you everything by combining datasource name and column name in the standard way: [datasource].[column]

t8y8 · 2016-06-30T20:45:22Z

That makes sense, iterating through the datasources to get the fields works for me!

A convenience method "get_all_fields_with_datasource" or something is a nice addition for the "get a list of things to audit" use case.

graysonarts · 2016-06-30T21:26:52Z

whew. I had to rebase that with the feature @t8y8 just merged in, and I was scared force pushing to github on my feature branch was going to do messy things, but yay! Must just be perforce withdrawals

t8y8 · 2016-06-30T23:31:23Z

Running this through some real work workbooks and I noticed a few things:

When it's a alias, vs caption, vs 'name' matters in terms of format, sometimes you get []'s and sometimes you don't.
I triggered an exception, I'll open an issue with the stacktrace. It looks like bad escaping when building the xpath query
It's tough to know when to use alias vs name vs caption, I have a gnarly
[print(fields[i].alias or fields[i].caption or fields[i].name) for i in fields]`

graysonarts · 2016-07-01T16:30:19Z

@t8y8 Suggested change to the API of this change

Rename name to id
Add a property name that resolves the most common name
Create an extended ordereddict that will attempt to resolve name for look up to match either id, caption, or alias

My understanding of how we display names in the UI is:

If it has an alias, it's the alias
If it has a caption, it's the caption
Otherwise, it's the name (though I think we never use the name because caption always exists)

t8y8 · 2016-07-01T17:27:48Z

I like the API proposal, in the common case folks are trying to get a list of fields and matching Desktop is an intuitive default. For folks who need remote name they can go for 'id'. It remvoes the need for this function in my script too:

def pretty_name(field): if not fields[field].alias: pretty = fields[field].caption or fields[field].name else: pretty = field.alias return pretty

The ordereddict is probably only with it if it matches document order though, if it doesn't I wouldn't continue the trouble of the overhead (though py35 gets a c-based ordereddict) -- is it verified that it matches doc order?

graysonarts · 2016-07-01T17:32:01Z

I haven't verified that it matches the order in the UI, but I'll do that before implementing the more complex look up.

graysonarts · 2016-07-01T17:45:54Z

Okay playing around in the UI, it looks like fields are not related to document order. We display them in alphabetic order in the measures and dimensions panes. I think I'm going to ignore order for this PR and if we decide we want to be specific about order, I'll do it as a separate PR.

t8y8 · 2016-07-01T17:55:28Z

Sounds good to me

graysonarts · 2016-07-01T20:47:34Z

tableaudocumentapi/datasource.py

@@ -10,6 +10,7 @@

 from tableaudocumentapi import Connection, xfile
 from tableaudocumentapi import Field
+from tableaudocumentapi.multilookup_dict import MultiLookupDict


This doesn't address the renaming, I'm working on that next.

t8y8 · 2016-07-01T22:17:44Z

I'm writing against the new api changes in a local branch, and it feels pretty good!

graysonarts · 2016-07-01T22:18:17Z

yay! is that a "LGTM" for merging?

t8y8 · 2016-07-01T22:47:28Z

Yup. LGTM to merge.

Just wrote some random lines against it, and it's feeling pretty good.

Since it's ultimately a dict, the iteration is over the keys by default, that does mean the code has to do something like:
for field in fields:\n print(fields[field].calculation)

But that's easily fixed with a for field in fields.values(): ... and I think ultimately the current design is the most flexible. Just a thing to note in our samples.

I'll share the script I was able to generate when combining this with the REST API! IT TOTALLY DOES WHAT I NEED NOW.

graysonarts · 2016-07-01T23:33:46Z

@t8y8 Awesome, merging in just a moment.
When iterating over dictionaries in python the usual idiom is:

for k, v in datasources.items():
    # k is the key in the dictionary (in this case the id)
    # v is the value (the field object)

t8y8 reviewed Jun 30, 2016
View reviewed changes

Russell Hay added 5 commits June 30, 2016 14:20

Initial attempt at enabling reading the columns from the datasource

d3a120b

Fixing pep8 errors for EOFEOL

b86e869

Changing to OrderedDict for getting columns

a4cf3b3

Add documentation for the various column attributes

99d6ccd

rename column to field

bf284d4

t8y8 mentioned this pull request Jun 30, 2016

Exception when building xpath query #46

Closed

Fixed #46 encode apostrophes in field names

d80696c

Enable multilook up for Fields

2de54b1

graysonarts reviewed Jul 1, 2016
View reviewed changes

Rename properties on the field based on feedback given in #45

2deb58d

graysonarts self-assigned this Jul 1, 2016

graysonarts merged commit 481f38c into tableau:development Jul 1, 2016

graysonarts deleted the feature-get-fields branch July 1, 2016 23:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial attempt at enabling reading the columns from the datasource #45

Initial attempt at enabling reading the columns from the datasource #45

graysonarts commented Jun 30, 2016

benlower commented Jun 30, 2016

graysonarts commented Jun 30, 2016

t8y8 commented Jun 30, 2016

t8y8 Jun 30, 2016

graysonarts Jun 30, 2016

graysonarts commented Jun 30, 2016

t8y8 commented Jun 30, 2016

graysonarts commented Jun 30, 2016

t8y8 commented Jun 30, 2016

graysonarts commented Jul 1, 2016

t8y8 commented Jul 1, 2016

graysonarts commented Jul 1, 2016

graysonarts commented Jul 1, 2016

t8y8 commented Jul 1, 2016

graysonarts Jul 1, 2016

t8y8 commented Jul 1, 2016

graysonarts commented Jul 1, 2016

t8y8 commented Jul 1, 2016 •

edited

Loading

graysonarts commented Jul 1, 2016

Initial attempt at enabling reading the columns from the datasource #45

Initial attempt at enabling reading the columns from the datasource #45

Conversation

graysonarts commented Jun 30, 2016

benlower commented Jun 30, 2016

graysonarts commented Jun 30, 2016

t8y8 commented Jun 30, 2016

t8y8 Jun 30, 2016

Choose a reason for hiding this comment

graysonarts Jun 30, 2016

Choose a reason for hiding this comment

graysonarts commented Jun 30, 2016

t8y8 commented Jun 30, 2016

graysonarts commented Jun 30, 2016

t8y8 commented Jun 30, 2016

graysonarts commented Jul 1, 2016

t8y8 commented Jul 1, 2016

graysonarts commented Jul 1, 2016

graysonarts commented Jul 1, 2016

t8y8 commented Jul 1, 2016

graysonarts Jul 1, 2016

Choose a reason for hiding this comment

t8y8 commented Jul 1, 2016

graysonarts commented Jul 1, 2016

t8y8 commented Jul 1, 2016 • edited Loading

graysonarts commented Jul 1, 2016

t8y8 commented Jul 1, 2016 •

edited

Loading