Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Process SkinData asynchronously #6589

Closed
wants to merge 8 commits into from

Conversation

Blackjack200
Copy link

@Blackjack200 Blackjack200 commented Dec 28, 2024

Summary

if($geometryData !== ""){
try{
$decodedGeometry = (new CommentedJsonDecoder())->decode($geometryData);
}catch(\RuntimeException $e){
throw new InvalidSkinException("Invalid geometry data: " . $e->getMessage(), 0, $e);
}

This pull request optimizes the processing of player geometry data by offloading skin decoding to a ProcessSkinTask. This change addresses potential server freezes caused by the decoding of overly complex JSON structures.

Existing Problems

Decoding geometry data directly in the main thread was a time-intensive operation, especially when the JSON data was highly complex, leading to server performance issues or even temporary freezes.

Changes

  • Offloaded the skin geometry decoding to an asynchronous task (ProcessSkinTask) to prevent the main thread from being froze.

Behavioural Changes

  • Players' skin geometry data will now be processed asynchronously.
  • Reduced risk of server performance degradation during skin decoding.

Backwards Compatibility

This change is fully backward compatible, as it does not modify the API or the format of geometry data. Existing player skins and data handling will continue to work without modification.

Follow-up

  • Monitor server performance post-deployment to ensure stability.

Tests

  • Verified that skin geometry data is correctly decoded when processed via ProcessSkinTask.
  • Tested gameplay to confirm no adverse effects on player skins or entity rendering.
  • Compared server performance before and after the patch, as reflected in the timings reports.

@Blackjack200 Blackjack200 requested a review from a team as a code owner December 28, 2024 08:34
Copy link
Member

@dktapps dktapps left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is probably a good idea. Only other way we could bring down this cost is if there was some faster way to strip and validate the JSON, which as far as I'm aware there isn't.

However, there's some potential issues that need to be addressed.

src/network/mcpe/handler/InGamePacketHandler.php Outdated Show resolved Hide resolved
@dktapps dktapps added Category: Network Related to the internal network architecture Type: Enhancement Contributes features or other improvements to PocketMine-MP Performance labels Dec 28, 2024
dktapps
dktapps previously approved these changes Dec 28, 2024
Copy link
Member

@dktapps dktapps left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@pmmp/server-developers anyone else want to cast an eye over this?

src/network/mcpe/handler/LoginPacketHandler.php Outdated Show resolved Hide resolved
src/network/mcpe/ProcessSkinTask.php Outdated Show resolved Hide resolved
throw PacketHandlingException::wrap($e, "Invalid skin in PlayerSkinPacket");
}
return $this->player->changeSkin($skin, $packet->newSkinName, $packet->oldSkinName);
$this->player->getServer()->getAsyncPool()->submitTask(new ProcessSkinTask(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This would probably make the server to freeze / crash if a malicious client floods tons of skin change requests per tick.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would the existing $this->gamePacketLimiter be sufficient for skin change packets, or would it be better to implement a separate rate limiter specifically for PlayerSkinPacket to prevent potential flooding?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If a separate rate limiter is needed, how should we set the values for INCOMING_SKIN_PACKET_PER_TICK = 1 and INCOMING_SKIN_PACKET_BUFFER_TICKS = 100?
Would these settings be appropriate, or should we adjust them further?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would the existing $this->gamePacketLimiter be sufficient for skin change packets, or would it be better to implement a separate rate limiter specifically for PlayerSkinPacket to prevent potential flooding?

Probably not. It's supposed to limit all packets, so it has quite generous limits.

More so the issue here is memory overload if the client spams a lot of skin change packets (it could cause a large backlog in the worker pool & hog a lot of memory).

My thought was to limit the number of concurrent ProcessSkinTasks per session, but I'm not sure if this is the best way.

Copy link
Author

@Blackjack200 Blackjack200 Dec 29, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Limiting the rate of PlayerSkinPacket packets seems easier to implement and would have a similar effect to limiting the number of concurrent ProcessSkinTask.

In my preliminary testing, setting INCOMING_SKIN_PACKET_PER_TICK = 0.1 and INCOMING_SKIN_PACKET_BUFFER_TICKS = 20 seems promising (though it still needs sufficient testing). However, this would require changing the $averagePerTick in PacketLimiter from an int to a float or both. Would that be acceptable?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, I don't think so. The number of packets isn't the only factor we need to consider. We also have to be aware of how long each packet takes to process.

Under normal conditions the packets might take 10 ms each to process, but an attacker could craft packets that take 10-100x longer to process and still be subject to the same packets-per-tick limits (getting more lag per packet). That's why I said we probably need to limit the number of tasks, rather than the number of packets.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, I don't think so. The number of packets isn't the only factor we need to consider. We also have to be aware of how long each packet takes to process.

Under normal conditions the packets might take 10 ms each to process, but an attacker could craft packets that take 10-100x longer to process and still be subject to the same packets-per-tick limits (getting more lag per packet). That's why I said we probably need to limit the number of tasks, rather than the number of packets.

Aside from the better methods for now, we could consider handling these potential time-bomb data by spawning new processes, similar to ConsoleReaderChildProcess. This would allow the main thread to control their runtime, using OS-level process killing (with proper abstraction for generalization—I’ve already implemented this in my private project, leveraging opis/closure). However, this approach does come with challenging compatibility issues.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That sounds overengineered to me.

@CicciogamerRrr

This comment was marked as spam.

@dktapps
Copy link
Member

dktapps commented Dec 29, 2024

#4981 would also enable skipping some validation, e.g. if we had a map<rawData, validatedOptimizedData>
However this would only be helpful for dealing with common geometries

@dktapps
Copy link
Member

dktapps commented Dec 29, 2024

This PR would be better if we could capture PlayerSkinPacket immediately after DataPacketDecodeEvent and have it handed off to an async task in its plain buffer form.

This would:

  • eliminate main thread skin decoding performance cost entirely
  • avoid unnecessary serialize cost of SkinData (since it would never exist on the main thread)

However, it would break plugins that use DataPacketReceiveEvent to capture the packet.

On an unrelated note, worth mentioning (this only just occurred to me this second) is that this change would be disruptive for any plugins which set custom SkinAdapters in TypeConverter, as these won't be configured on the other threads.

@Blackjack200
Copy link
Author

This PR would be better if we could capture PlayerSkinPacket immediately after DataPacketDecodeEvent and have it handed off to an async task in its plain buffer form.

This would:

  • eliminate main thread skin decoding performance cost entirely
  • avoid unnecessary serialize cost of SkinData (since it would never exist on the main thread)

However, it would break plugins that use DataPacketReceiveEvent to capture the packet.

On an unrelated note, worth mentioning (this only just occurred to me this second) is that this change would be disruptive for any plugins which set custom SkinAdapters in TypeConverter, as these won't be configured on the other threads.

Therefore, we could preprocess the data packets in other threads first and then pass them to the main thread for processing. This seems to be a more common approach, rather than focusing solely on skin processing, we could also implement asynchronous encryption and asynchronous packets decoding/encoding(assuming the cost of serialize/unserialize is lower than encode/decking them) on the basis.

Also packets’ life cycle is clear so will there a possible way to transfer the ownership of objects between threads? This could improve this process significantly.

@dktapps
Copy link
Member

dktapps commented Jan 9, 2025

Don't overengineer it. That's well outside the scope of the PR and not practical anyway.

@Blackjack200
Copy link
Author

Blackjack200 commented Jan 10, 2025

I have no idea on a proper implementation approach. I think this should be implemented by a plugin.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Category: Network Related to the internal network architecture Performance Status: Waiting on Author Type: Enhancement Contributes features or other improvements to PocketMine-MP
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants