streaming to struct type column results in nulls #255
Comments
I have found what the problem is: HiveStreamingDataWriter uses StrictDelimitedInputWriter and does not account for Struct types.
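To make the diagnosis above concrete, here is a toy sketch of why a flat, single-delimiter encoding tends to lose a struct column while a JSON encoding keeps it. This is not the connector's or Hive's code: the row, the column names, and both helper functions are hypothetical, and the comma delimiter is an assumption chosen only for illustration.

```python
import json

# Hypothetical row with one scalar column and one struct column.
row = {"id": 1, "address": {"city": "Oslo", "zip": "0150"}}

def delimited_round_trip(row, sep=","):
    # Flat encoding: every top-level value is stringified and joined.
    line = sep.join(str(value) for value in row.values())
    # Decoding splits on the same delimiter and maps pieces to columns by position.
    parts = line.split(sep)
    return dict(zip(row.keys(), parts))

def json_round_trip(row):
    # JSON keeps the nesting, so the struct survives encoding and decoding.
    return json.loads(json.dumps(row))

flat = delimited_round_trip(row)
nested = json_round_trip(row)

# The struct comes back as a truncated string after the flat round trip.
print(type(flat["address"]).__name__)
# The JSON round trip still exposes the nested field.
print(nested["address"]["city"])
```

In the flat case the reader has no way to tell which delimiters belong to the struct, so the nested fields cannot be recovered. That is the general kind of failure a JSON-based writer avoids.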
I managed to get the master branch to build.
The fix for this issue is included in HIVE-20059, so I will try to include it in Hive 3.1.1 and build locally to see whether it resolves the problem.
Created pull request #258.
So the suggested PR above uses the following format to use StrictJsonWriter:
I have the same problem, but in Structured Streaming, and I am using
@massoudm's changes worked for me in Structured Streaming. What I did:
I'm trying to write to a table with a Struct type column, and all properties in that column are null after saving.
HDP 3.0.1.0-187
hive-warehouse-connector_2.11-1.0.0.3.0.1.0-187.jar
Here is code to reproduce:
Spark:
But when I query the table using Hive:
in hive
I get:
Since I have not seen any code example that streams to Struct columns, I believe this may be an undetected bug.
P.S.: The same code works fine if I use HIVE_WAREHOUSE_CONNECTOR instead of DATAFRAME_TO_STREAM.