Getting FIM right with StarCoder2 #263
-
Hey hivemind. I use VSCode and Twinny with Ollama, and for the most part things work okay. However, I have issues with FIM: sometimes exactly the right code completion or comment is provided, but most of the time I get a bunch of extra stuff appended to the completion. Generally this is in the form of

I use deepseek-coder:6.7b-instruct for chat and starcoder2-3b-q4_0 for FIM. My provider setup is as follows:

Label: DeepSeek 6.7B Chat
Label: StarCoder2 3B FIM

The Twinny config is default for everything except:

The templates are all default. Any thoughts as to what I need to tweak or fix to ensure that the completions are just completions and don't contain all the extraneous stuff? TIA.
Replies: 2 comments 4 replies
-
Hello. AFAIK the FIM training for StarCoder2 was slightly off, so this is a flaw in the model itself: it doesn't stop at the right place. I've noticed it too, and unfortunately the prefix it adds after the completion is often too random to strip out.
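For anyone who wants to experiment outside Twinny, here is a rough sketch of what can still be tried: sending a raw FIM prompt to Ollama with explicit stop sequences, then trimming the result. It assumes the StarCoder-family sentinel tokens (`<fim_prefix>`, `<fim_suffix>`, `<fim_middle>`) and a local Ollama server on its default port. The helper names (`build_fim_prompt`, `build_payload`, `trim_completion`, `request_completion`) and the option values are made up for illustration and are not Twinny's implementation. Given the model flaw described above, this will likely reduce the trailing junk rather than eliminate it.

```python
import json
import urllib.request

# Assumption: StarCoder-family FIM sentinel tokens.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"
STOP_MARKERS = [FIM_PREFIX, FIM_SUFFIX, FIM_MIDDLE, "<|endoftext|>", "<file_sep>"]


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Wrap the code before and after the cursor in FIM sentinel tokens."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


def build_payload(model: str, prefix: str, suffix: str, max_tokens: int = 64) -> dict:
    """Build a request body for Ollama's /api/generate endpoint."""
    return {
        "model": model,
        "prompt": build_fim_prompt(prefix, suffix),
        "raw": True,      # bypass the model's prompt template
        "stream": False,  # return one JSON object instead of a stream
        "options": {
            "stop": STOP_MARKERS,       # halt if a sentinel token is emitted
            "num_predict": max_tokens,  # hard cap on runaway completions
            "temperature": 0.1,         # illustrative value, tune to taste
        },
    }


def trim_completion(completion: str, suffix: str, min_match: int = 8) -> str:
    """Heuristically cut off text the model appended past the real completion."""
    # Cut at the earliest stop marker that leaked into the text.
    cut = len(completion)
    for marker in STOP_MARKERS:
        idx = completion.find(marker)
        if idx != -1:
            cut = min(cut, idx)
    text = completion[:cut]

    # If the model starts re-emitting the first non-blank suffix line, cut there.
    # Very short lines such as ")" are skipped to avoid over-eager trimming.
    first_line = next((ln.strip() for ln in suffix.splitlines() if ln.strip()), "")
    if len(first_line) >= min_match:
        idx = text.find(first_line)
        if idx != -1:
            text = text[:idx]
    return text.rstrip()  # note: also drops trailing whitespace and newlines


def request_completion(payload: dict, url: str = "http://localhost:11434/api/generate") -> str:
    """POST the payload to a local Ollama server and return the generated text."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

To use it, pass whatever model tag you have pulled to `build_payload`, send the payload with `request_completion`, and run the returned text through `trim_completion` with the same suffix. The `num_predict` cap is the bluntest tool here but probably the most reliable one, since it bounds how much extra text can be appended regardless of what that text looks like.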
-
I like codellama-code or deepseek base models for FIM and Codestral for chat.