In v2.0.0, we noticed that the application stops processing data at some point when oversized events are sent. I was able to reproduce this problem by sending events of ~500 kb.
The first suspect is the elastic4s version bump in v2.0.0. We haven't yet found the actual reason why it blocks the Kinesis consumer; when we do, we can try to find a better solution. However, as a quick fix, we can detect oversized events before sending them to Elasticsearch and create bad rows from them. We could even keep this fix permanently, because prior versions of ES Loader had a similar performance problem with oversized events: they didn't halt completely, but they processed oversized events really slowly.
So, the point open to discussion is how we decide whether an event is oversized. My suggestion is to do it empirically: test events of different sizes, find the size at which the problem starts, and set the maximum size accordingly.
Another potential solution is truncating the oversized fields in the document sent to Elasticsearch. I don't think this is a good solution, because we would be manipulating incoming data without letting the user know there is a problem with it. It would be better to explicitly create bad rows from these events instead of silently truncating them.
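A minimal sketch of that quick fix, assuming a hypothetical BadRow case class and a maxBytes threshold to be chosen empirically (the real loader would use its own bad row format):

final case class BadRow(payload: String, reason: String)

def partitionBySize(events: List[String], maxBytes: Int): (List[String], List[BadRow]) = {
  // Measure the serialized UTF-8 size of each event, not its character count
  val (ok, oversized) = events.partition(_.getBytes("UTF-8").length <= maxBytes)
  (ok, oversized.map(e => BadRow(e, s"Event exceeds $maxBytes bytes")))
}

Events in the first list would go to Elasticsearch as before; the second list would be written out as bad rows.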
It is specified here that Lucene's term byte-length limit is 32766 bytes. Since we are most probably hitting this limit, it might be a good value to use as the field size limit.
I agree we should be guided by the Lucene byte-length limit. However, be careful about using string length:
scala> "😊".length
res0: Int = 2
scala> "😊".getBytes("UTF-8").length
res1: Int = 4
We should impose a byte limit of 32766; if we check String length instead, the conservative equivalent is 10922 characters, since a single UTF-16 char can take up to 3 bytes in UTF-8 (the surrogate-pair example above works out to 2 bytes per char, but e.g. "中" is 1 char and 3 bytes).
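For instance, a byte-based check could look like this (LuceneMaxTermBytes and exceedsLimit are illustrative names, not existing loader code):

import java.nio.charset.StandardCharsets

val LuceneMaxTermBytes = 32766

// Count UTF-8 bytes rather than chars, so multi-byte characters are handled correctly
def exceedsLimit(field: String): Boolean =
  field.getBytes(StandardCharsets.UTF_8).length > LuceneMaxTermBytes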
For now, I agree that if an event violates the length limit then we should send it to bad, because that matches the previous behaviour of the app: in old versions we would try to load the event, it would fail, and then we would send it to bad.
In future, we might choose to implement a feature where we truncate the fields before inserting. But that is a new feature, which goes beyond what we need to do here to fix this bug.
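If that truncation feature is ever implemented, note that cutting at a raw byte offset can split a multi-byte character; one safe approach, as a purely illustrative sketch:

import java.nio.charset.StandardCharsets

def truncateUtf8(s: String, maxBytes: Int): String = {
  val bytes = s.getBytes(StandardCharsets.UTF_8)
  if (bytes.length <= maxBytes) s
  else {
    // Step back past UTF-8 continuation bytes (10xxxxxx) so we never
    // cut in the middle of a multi-byte character
    var end = maxBytes
    while (end > 0 && (bytes(end) & 0xC0) == 0x80) end -= 1
    new String(bytes, 0, end, StandardCharsets.UTF_8)
  }
}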