feat: Update the AccessEntry class with a new condition attribute and unit tests #2163

chalmerlowe · 2025-04-16T19:49:26Z

Updates the AccessEntry class with a new condition attribute and unit tests.

Specifically:

Adds a condition attribute to the AccessEntry class
Updates the pre-existing Condition class with several dunder methods (eq, hash, etc)
Adds unit tests to put the condition setters, getters, etc through their paces
Adds unit tests to exercise the new Condition dunder methods

This work is in support of internal bug: b/330869964

…-policy-version

google/cloud/bigquery/dataset.py

tswast · 2025-04-18T18:40:10Z

google/cloud/bigquery/dataset.py

+        # The api_repr for an AccessEntry object is expected to be a dict with
+        # only a few keys. Two keys that may be present are role and condition.
+        # Any additional key is going to have one of ~eight different names:
+        #   userByEmail, groupByEmail, domain, dataset, specialGroup, view,
+        #   routine, iamMember


I'm a bit confused by this. Why are we leaving out some data?

Couldn't we do:

access_entry = cls() access_entry._properties = resource.copy() return access_entry

@tswast

Thanks for this question. I am open to other approaches.

This approach is only slightly modified from the existing code. The existing code attempted to account for a peculiarity in how we assign values to the "possible" attributes that an AccessEntry "needs to have" and "might have" based on the way we have defined it.

I tweaked the existing code to maintain backwards compatibility and avoid rewrites in things like existing unittests, where possible.

The api_repr for an AccessEntry object is composed of the following keys (internal link):

ROLE

ONE OF THE FOLLOWING (and as I understand it, only one) key:value pairs (as listed in the proto above):
a. userByEmail: some_value
b. groupByEmail: some_value
c. domain: some_value
d. specialGroup: some_value
e. IamMember: some_value
f. view: some_value
g. routine: some_value
h. dataset: some_value

CONDITION

Each of the three items above is stored in the _properties dict (in one form or another, see below).

Depending on how the AccessEntry object gets configured, each of a-h above MAY also be associated with a corresponding attribute (ie access_entry.domain, access_entry.user_by_email) but this is not enforced.

Which element a through h will be provided as part of the api_repr is an unknown.

Despite the presence of one of a through h, to instantiate the AccessEntry class users would need to provide role, entity_type, and entity_id or rely on the defaults. Thus since we received a key a through h there must be a way to translate the key to entity_type and to translate from the value associated with that key to entity_id.

Once the AccessEntry object is created users also have the ability to populate other attributes that align on a one-to-one basis with item a through h by using a setter. (This appears to be included as a convenience feature, does NOT happen automagically during initialization, and is not a strict requirement).

If we receive an api_repr resource we use the logic in .from_api_repr to try and extract the a through h key (without which one we got) to assign it to entity_type and then assign the value to entity_id.

We first pop the "role" if it exists and we pop the "condition" if it exists and the only thing that should be left in the dict is the single element a or b or c, etc. We have no expectation that there will be an additional key:value pair in the dictionary. Thus we do not expect to be dropping any values.

If my interpretation of how this works is wrong, please let me know.

An alternative approach could be to do a key lookup for any key in a set of eight keys, but that seems less resilient than the pop method (ie if a ninth key gets added to the API definition, the current PR still works).

The current approach is still quite brittle. If/when the backend adds additional properties, similar to how condition is being added now, the user will have no way of accessing those, as _properties isn't being saved. This can especially be a problem when doing a round trip of downloading access entries, modifying them and re-uploading them. Losing a property like condition (or whatever new thing comes in future) can have security consequences.

For these reasons, I strongly encourage changing from_api_repr to my proposal

access_entry = cls() access_entry._properties = resource.copy() return access_entry

and modify entity_type to look at _properties for keys other than condition and role. In the case of two or more remaining keys, we can perhaps use an allow list of the entity types we know about to account for some new condition-like property in future.

@tswast PTAL and approve, as appropriate.

I updated the _from_api_repr() method.
And fixed a number of downstream elements that relied upon the output of the original version of that method.

tests/unit/test_dataset.py

tswast · 2025-04-28T21:40:44Z

google/cloud/bigquery/dataset.py

+        return (
+            self.expression == other.expression
+            and self.title == other.title
+            and self.description == other.description
+        )


Nit: Could do the _key() trick here too and in __hash__ to avoid a bit of duplication / potential errors if any new properties are added.

tswast · 2025-04-28T21:46:10Z

tests/unit/test_dataset.py

+        exp_resource = {
+            "role": None,
+            "dataset": {
+                "dataset": DatasetReference("my-project", "my_dataset"),


This seems not great. What happens when we convert this resource to JSON and send it to the REST API? I would expect that json.dumps or the equivalent would fail on such on object.

Are we missing something in the @dataset.setter? Looks like we need a clause like

if isinstance(value, DatasetReference): value = value.to_api_repr()

tswast

Thanks so much Chalmer. This leaves the code base in much better shape than it started. 🏕️

tswast · 2025-04-29T13:08:52Z

google/cloud/bigquery/dataset.py

+        if self.entity_type:
+            entity_type = self.entity_type
+        else:


Optional: This could use the "walrus operator" to save a call to the entity_type property.

Suggested change

if self.entity_type:

entity_type = self.entity_type

else:

if not (entity_type := self.entity_type):

chalmerlowe added 4 commits April 10, 2025 16:23

feat: adds condition class and assoc. unit tests

4e20423

Merge branch 'main' into feat-b330869964-add-dataset-condition-access…

bbfb818

…-policy-version

Updates AccessEntry with condition setter/getter

04fdc8e

Adds condition attr to AccessEntry and unit tests

f41df12

chalmerlowe requested review from a team as code owners April 16, 2025 19:49

chalmerlowe requested a review from suzmue April 16, 2025 19:49

product-auto-label bot added the size: l Pull request size is large. label Apr 16, 2025

blunderbuss-gcf bot assigned suzmue Apr 16, 2025

product-auto-label bot added the api: bigquery Issues related to the googleapis/python-bigquery API. label Apr 16, 2025

chalmerlowe added 2 commits April 16, 2025 15:51

Merge branch 'main' into feat-b330869964-update-accessentry-class

137e0a9

adds tests for Condition dunder methods to ensure coverage

a129e33

tswast self-requested a review April 18, 2025 18:36

tswast reviewed Apr 18, 2025

View reviewed changes

chalmerlowe added 9 commits April 25, 2025 06:57

Merge branch 'main' into feat-b330869964-update-accessentry-class

ca8d734

moves the entity_type logic out of _from_api_repr to entity_type setter

152133e

Updates logic in entity_type getter

5451c58

updates several AccessEntry related tests

a323c9d

Updates AccessEntry condition setter test to use a dict

6d0d1d1

udpates entity_id handling

9340c00

Updates _entity_type access

ae2cb44

tweaks type hinting

447472a

Merge branch 'main' into feat-b330869964-update-accessentry-class

dc83110

chalmerlowe commented Apr 28, 2025

View reviewed changes

tests/unit/test_dataset.py Outdated Show resolved Hide resolved

Update tests/unit/test_dataset.py

5190018

chalmerlowe commented Apr 28, 2025

View reviewed changes

tests/unit/test_dataset.py Outdated Show resolved Hide resolved

Update tests/unit/test_dataset.py

9a6f0b6

tswast reviewed Apr 28, 2025

View reviewed changes

chalmerlowe added 2 commits April 29, 2025 11:10

Updates DatasetReference in test and __eq__ check

635a1f4

remove debug print statement

4c060a4

tswast self-requested a review April 29, 2025 13:03

tswast approved these changes Apr 29, 2025

View reviewed changes

chalmerlowe merged commit 7301667 into main Apr 29, 2025
18 checks passed

chalmerlowe deleted the feat-b330869964-update-accessentry-class branch April 29, 2025 13:16

release-please bot mentioned this pull request Apr 25, 2025

chore(main): release 3.32.0 #2152

Merged

This was referenced May 20, 2025

May 19, 2025 kitta65/bq-extension-vscode#764

Open

May 19, 2025 kitta65/prettier-plugin-bq#656

Open

May 19, 2025 kitta65/bq2cst#515

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Update the AccessEntry class with a new condition attribute and unit tests #2163

feat: Update the AccessEntry class with a new condition attribute and unit tests #2163

Uh oh!

chalmerlowe commented Apr 16, 2025 •

edited

Loading

Uh oh!

Uh oh!

tswast Apr 18, 2025

Uh oh!

chalmerlowe Apr 21, 2025

Uh oh!

tswast Apr 21, 2025 •

edited

Loading

Uh oh!

chalmerlowe Apr 28, 2025

Uh oh!

Uh oh!

Uh oh!

tswast Apr 28, 2025

Uh oh!

tswast Apr 28, 2025

Uh oh!

tswast left a comment

Uh oh!

tswast Apr 29, 2025

Uh oh!

Uh oh!

Uh oh!

feat: Update the AccessEntry class with a new condition attribute and unit tests #2163

feat: Update the AccessEntry class with a new condition attribute and unit tests #2163

Uh oh!

Conversation

chalmerlowe commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

tswast Apr 18, 2025

Choose a reason for hiding this comment

Uh oh!

chalmerlowe Apr 21, 2025

Choose a reason for hiding this comment

Uh oh!

tswast Apr 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

chalmerlowe Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

tswast Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

tswast Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

tswast left a comment

Choose a reason for hiding this comment

Uh oh!

tswast Apr 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

chalmerlowe commented Apr 16, 2025 •

edited

Loading

tswast Apr 21, 2025 •

edited

Loading