Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Similar project #2

Open
botev opened this issue Feb 12, 2017 · 2 comments
Open

Similar project #2

botev opened this issue Feb 12, 2017 · 2 comments

Comments

@botev
Copy link

botev commented Feb 12, 2017

Hi,

So we had similar start of a similar project here: https://github.com/Metadiff/gir. We occasionally chat in gitter on https://gitter.im/rust-ml/ if by any chance you are interested in potentially putting efforts together or just general discussion about ML and graph computation optimizers please feel free to come and have a chat.

@ethanabrooks
Copy link
Owner

ethanabrooks commented Feb 13, 2017 via email

@botev
Copy link
Author

botev commented Feb 13, 2017

So in general, I think putting efforts together is really good. However, I know that it is possible that for instance maybe you do not agree with my approach etc.. which why the discussion is good as we can talk about good and bad sides.

Generally speaking, I think the main difference is that I do not build the graph in memory (e.g. as enum tree), but it is a fully dynamic structure. I think that is better as it allows for easier access to both parents and child, and also for later down the pipeline inserting auxiliary nodes (for instance fork and join or barriers). It also I think makes it easier to modify the graph during optimization procedure. Additionally, modeling it as enum prevents downstream backends to create their own ops, for instance a cuda + cudnn might want to make its adhoc cudnn_conv2d operator which to be inserted instead of the general conv2d etc...

Second, I explicitly model the shapes of the tensors and their data types, to allow shape verification for dynamic shapes even at graph construction type and other such checks. This really gives significantly better debugging messages than anything else out there. It also requires a bit more verbosity, but it fits well I think in the Rust spirit.

Another main goal, at least for me, is to make the graph engine, totally serialiazable and separate from the backends. This roughly follows the LLVM paradigm of allowing an independent IR, which can be reused, and which later each backend can execute as it pleases.

At the moment, I have a very limited example, with backward differentiation implemented. Currently, I'm plugging in Arrayfire as the easiest initial backend to use (its the least work of the rest). I also want to focus a lot on OpenCL as well as CUDA, but my experience there is not that great. In general I would like to use template engines like tera to be able to generate ad-hoc kernels based on the graph.

Currently, any help in any of the departments is more than welcome. Any suggestions, critiques etc of design choices are also most welcome and any discussion on those. I mostly tried to pick the best things from Theano, Tf and the other frameworks and try to make something like an LLVM for this. Main point being that a lot of the optimization can happen in the IR, such as memory allocation and scheduling, and this can be reused by anyone using it, independent of the backend.

I do not know what are your goals/aims out of this as well as what are your main strengths in this kind of area. Again feel free to drop to the gitter channel.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants