
Creating embeddings instead of output prediction #86

Open
JorritWillaert opened this issue Nov 20, 2022 · 1 comment

Comments

@JorritWillaert

Hi! I was wondering whether a GPT Code Clippy model could generate embeddings instead of output predictions?
The purpose is to embed code in a semantic space, so that the embedding can be used as a feature for another neural network. I have done the same with BERT (more as a baseline, since that model is not trained on code) and with the OpenAI Codex model (via a paid API), and would therefore love to use one of your models as well.

Thank you!
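For readers with the same goal: a common recipe (a sketch of the general technique, not a documented Code Clippy API) is to run the model with hidden-state output enabled and mean-pool the last hidden layer over the non-padding tokens. The pooling step itself is simple; here it is in plain Python, where `hidden` stands in for one sequence's per-token vectors and `mask` is the attention mask:

```python
# Mean-pool per-token vectors into a single fixed-size embedding,
# ignoring padding positions (mask == 0).
def mean_pool(hidden, mask):
    dim = len(hidden[0])
    total = [0.0] * dim
    count = 0
    for vec, m in zip(hidden, mask):
        if m:
            count += 1
            for i, v in enumerate(vec):
                total[i] += v
    return [t / count for t in total]

# Two real tokens and one padding token.
hidden = [[1.0, 2.0], [3.0, 4.0], [9.0, 9.0]]
mask = [1, 1, 0]
print(mean_pool(hidden, mask))  # [2.0, 3.0]
```

In practice `hidden` would come from the model (e.g. the last hidden state when calling it with `output_hidden_states=True` in Hugging Face Transformers), and the pooled vector is what you would feed to the downstream network.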

ncoop57 (Collaborator) commented Nov 26, 2022

Hi @JorritWillaert !

You could use our models for embeddings, but I would not recommend it. I'd suggest checking out GraphCodeBERT from Microsoft instead; it achieves good performance across a wide range of code-related tasks: https://huggingface.co/microsoft/graphcodebert-base
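Following that suggestion, a minimal sketch of extracting a code embedding from the linked GraphCodeBERT checkpoint with Hugging Face Transformers might look like this (mean-pooling the encoder's last hidden state; the model download requires network access):

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Load the recommended GraphCodeBERT encoder and its tokenizer.
tok = AutoTokenizer.from_pretrained("microsoft/graphcodebert-base")
model = AutoModel.from_pretrained("microsoft/graphcodebert-base")

code = "def add(a, b):\n    return a + b"
inputs = tok(code, return_tensors="pt")
with torch.no_grad():
    out = model(**inputs)

# Mean-pool over the token dimension; with a single unpadded sequence
# every position is a real token, so a plain mean is fine.
embedding = out.last_hidden_state.mean(dim=1).squeeze(0)
print(embedding.shape)  # 768-dimensional (RoBERTa-base hidden size)
```

For batches of sequences you would weight the mean by the attention mask so padding tokens do not dilute the embedding.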
