to_dict on empty nested types #200

kilimnik · 2021-01-19T10:49:12Z

The issue is described here #199.

It was a very simple fix. I don't think a test case is neccesary for this bug. If you think otherwise I can add one.

kilimnik · 2021-01-19T21:35:58Z

I analyzed the code a bit more. The problem why so many test fail is that currently the library doesn't know the difference between an uninitialized field and a field which is initialized with an empty object.

Gobot1234 · 2021-01-20T18:02:12Z

src/betterproto/__init__.py

@@ -875,6 +874,10 @@ def parse(self: T, data: bytes) -> T:
                )

            current = getattr(self, field_name)


This should be moved below that if statement

The line 879 uses the current attribute, so putting that below doesn't make any sense

Gobot1234 · 2021-01-20T18:02:54Z

src/betterproto/__init__.py

@@ -875,6 +874,10 @@ def parse(self: T, data: bytes) -> T:
                )

            current = getattr(self, field_name)
+
+            if self.__raw_get(field_name) == PLACEHOLDER:


This should use is not ==

And maybe there should be a return on this

Why should there be a return here? The for loop has to continue

Sorry I meant continue

Gobot1234 · 2021-01-20T18:03:11Z

src/betterproto/__init__.py

@@ -972,6 +975,8 @@ def to_dict(
                    )
                ):
                    output[cased_name] = value.to_dict(casing, include_default_values)
+                elif self.__raw_get(field_name) != PLACEHOLDER:


Similar thing here, should be is not

I was thinking about that but I saw line 540 doing the same comparison

Gobot1234 · 2021-01-20T18:22:47Z

tests/test_features.py

@@ -17,7 +17,7 @@ class Foo(betterproto.Message):
    assert betterproto.serialized_on_wire(foo.bar) is False

    # Serialized after setting something
-    foo.bar.baz = 1
+    foo.bar = Bar(baz=1)


Changing all this is a regression if this PR breaks this

I know that isn't great. My PR would break lazy initialization. If you want to keep that, I have to think of another implementation. But lazy initialization also isn't in the google implementation, so I thought it wouldn't be important.

Well breaking things isn't good. I know how this can be fixed it's been discussed before but it involves every field having a serialized attribute or similar.

nat-n · 2021-01-24T21:17:11Z

Hi @dk99, thanks for working on this.

I analyzed the code a bit more. The problem why so many test fail is that currently the library doesn't know the difference between an uninitialized field and a field which is initialized with an empty object.

I believe this is by design in order to be in line with proto3 semantics, whereby primitive type fields are interpreted as their zero value if unset, and vice versa.

More generally I think the fix for this bug should be isolated to the to_dict structuring logic and shouldn't have any implications for the parsing or internal representation of object. Unless I'm missing something?

I know that isn't great. My PR would break lazy initialization. If you want to keep that, I have to think of another implementation. But lazy initialization also isn't in the google implementation, so I thought it wouldn't be important.

As I mentioned in slack channel, as I recall lazy initialisation is required for recursive message types to work, so I suspect the google implementation actually does support it.

As for testing, ideally we should have a test case for this in line with the pattern documented here: https://github.com/danielgtaylor/python-betterproto/tree/master/tests Let me know if this doesn't turn out to be trivial (the test setup needs some work which has been done in other pending PRs).

to_dict on empty nested types (danielgtaylor#199)

6ad9bab

kilimnik added 3 commits January 20, 2021 18:54

Removed lazy initialization

cef8f71

removed lazy initialization from tests

9b7dd26

Fixed formatting

d213abc

Gobot1234 reviewed Jan 20, 2021

View reviewed changes

nat-n mentioned this pull request Jan 25, 2021

Release v2.0.0b3 #182

Merged

nat-n added the bug Something isn't working label Apr 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

to_dict on empty nested types #200

to_dict on empty nested types #200

kilimnik commented Jan 19, 2021

kilimnik commented Jan 19, 2021

Gobot1234 Jan 20, 2021

kilimnik Jan 20, 2021

Gobot1234 Jan 20, 2021

Gobot1234 Jan 20, 2021

kilimnik Jan 20, 2021

Gobot1234 Jan 20, 2021

Gobot1234 Jan 20, 2021

kilimnik Jan 20, 2021

Gobot1234 Jan 20, 2021

kilimnik Jan 20, 2021

Gobot1234 Jan 20, 2021

nat-n commented Jan 24, 2021

		@@ -875,6 +874,10 @@ def parse(self: T, data: bytes) -> T:
		)

		current = getattr(self, field_name)

to_dict on empty nested types #200

Are you sure you want to change the base?

to_dict on empty nested types #200

Conversation

kilimnik commented Jan 19, 2021

kilimnik commented Jan 19, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nat-n commented Jan 24, 2021