I got this idea from EvalPlus and BigCodeBench. Sometimes it would be good to make apples-to-apples comparisons between models. If most of the top models are large or proprietary, the results do not mean much to the many people who cannot access them (besides the problem that large models are scalable models). Newer versions of Qwen2 and Phi-3 come to mind.