Optimized pillow-simd needed to leverage new GPU instances #212
Comments
@jph00 This is great - thank you for the suggestion! After a quick glance at the step-by-step instructions, I believe there are some potential PATH conflicts with our current setup. I'll work with it and keep you updated!
Terrific! @crawforc3, in the meantime, any chance you could update Kaggle kernels to use the latest version of the fastai library (0.7)? (I don't see any info about how often pip packages are updated, or if anything needs to be done to make that happen.) Once you've done that, I can start making some fastai examples available as kaggle kernels.
@jph00 As long as fastai is up-to-date in pip you are good to go. When we build our docker image, it pulls the most recent packages, and we try to update our docker images at least nightly. Sadly, the current docker image in production is about two weeks old. We ran into some problems with "updated" packages that have been preventing a stable build. However, I am delighted to say that I think we have a stable build this morning. After some testing I will try to get it pushed into production.
@jph00 fastai 0.7.0 is now live in kernels. Please let us know if you run into any issues.
Many thanks! :)
@crawforc3 Followup re fastai: GPU kernels are currently pytorch 0.3.1 + fastai 0.6. Does the cuda branch need a pull request on this? (Should I open a new issue?) Thanks for your help!
Closing this because we are working on merging the GPU and non-GPU images, which will solve this problem.
@crawforc3 this issue is about pillow-simd, which isn't directly related to that. Perhaps reopen so we can track that issue?
That is a valid point! Re-opened :)
Thank you |
@rosbo @crawforc3 I wonder if you'd consider reopening this? pillow-simd is now at version 9. It's hard and slow to install via pip - it requires custom flags to compile with AVX2 extensions, and care must be taken to ensure it uses libjpeg-turbo (see the link I provided in the top post). In code competitions internet access isn't available, which makes this even more complicated.
cc @sebbov |
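For anyone checking an existing kernel image, here is a minimal sketch (not from this thread) of how one might confirm whether the SIMD build and libjpeg-turbo are actually in use. It assumes Pillow or pillow-simd 8.0+, where the `libjpeg_turbo` feature flag is available, and that pillow-simd releases carry their usual `.postN` version suffix:

```python
# Minimal sketch: check whether the installed Pillow is the SIMD build and
# whether its JPEG plugin links against libjpeg-turbo.
# Assumes Pillow or pillow-simd >= 8.0 (where the "libjpeg_turbo" feature exists).
import PIL
from PIL import features

# pillow-simd releases typically carry a ".postN" suffix, e.g. "9.0.0.post1".
print("Pillow version:", PIL.__version__)
print("Looks like pillow-simd:", ".post" in PIL.__version__)

# True only if the JPEG plugin was compiled against libjpeg-turbo.
print("libjpeg-turbo:", features.check("libjpeg_turbo"))
```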
Hi @jph00,
I looked at our dependencies and they only require Pillow 8+. I am not a PIL expert and use it only for basic use cases. @jph00, do you see any changes for > 9.0.0 that are worthwhile and we would be missing? https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst Thank you
There's nothing I see there that's likely to be important for many Kagglers. Lots of folks are still using 9.0 -- in fact, plenty still use v8!
Many thanks for your reply.
(Copied from https://www.kaggle.com/dansbecker/running-kaggle-kernels-with-a-gpu/code#328307 as requested by @sebbov.)
With the limited CPU available, being able to quickly read and manipulate images is important. Can I suggest you install pillow-simd with an optimized jpeg library? It will reduce CPU use by around 2-4x when training models, which will dramatically increase the utility of these new instances. The step-by-step instructions for installing it have been kindly provided by Soumith Chintala here (note that without these exact steps pillow-simd will not use libjpeg-turbo, even if you have it installed as your system libjpeg):
https://gist.github.com/soumith/01da3874bf014d8a8c53406c2b95d56b
I'm not familiar enough with docker to know what PR to send in exactly - hopefully this is enough info for the in-house experts to set it up.
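As a rough way to sanity-check the 2-4x claim on a kernel, here is a sketch of a simple timing loop; the image path and iteration count are hypothetical, and the idea is just to run it once under stock Pillow and once under pillow-simd built against libjpeg-turbo and compare the numbers:

```python
# Rough sketch: time JPEG decode + resize to compare stock Pillow against
# pillow-simd + libjpeg-turbo. "sample.jpg" and N are hypothetical.
import time
from PIL import Image

PATH = "sample.jpg"   # hypothetical local JPEG
N = 200               # number of decode+resize iterations

start = time.perf_counter()
for _ in range(N):
    with Image.open(PATH) as im:
        # Typical training-style preprocessing: decode, convert, resize.
        im.convert("RGB").resize((224, 224))
elapsed = time.perf_counter() - start
print(f"{N} iterations in {elapsed:.2f}s ({N / elapsed:.1f} images/s)")
```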