Will M2-GPT be open-sourced? #7

Open
yangsp5 opened this issue Oct 30, 2023 · 13 comments

Comments

@yangsp5

yangsp5 commented Oct 30, 2023

Will M2-GPT be open-sourced?
It seems interesting

@DanFu09
Collaborator

DanFu09 commented Oct 30, 2023 via email

@LSinev

LSinev commented Dec 5, 2023

Thank you for your great work! Has the open-sourcing of M2-GPT been postponed?

@avesus

avesus commented Jan 25, 2024

GPT code, or it didn't happen. Extraordinary claims require extraordinary proof. The paper is very convincing and INCREDIBLY well written, but is the causal model as good as you claimed in the paper? The best test would be to release the training code in the style of Andrej Karpathy's minGPT/nanoGPT/llama2.c.

@lhallee

lhallee commented Jan 29, 2024

@DanFu09 any update on this? I can't seem to find the checkpoints. At a minimum, I would love to see the YAML configs so I can experiment locally. Great work putting models out with Together AI, btw!

@redbrain

Do you plan on releasing the weights of the causal M2 models, or just the code?

@DanFu09
Collaborator

DanFu09 commented Feb 17, 2024 via email

@redbrain

Hello, it's been a couple of weeks, so I just wanted to check on the status of the M2-GPT implementation release.

@DanFu09
Collaborator

DanFu09 commented Apr 13, 2024

First thing on my list once the faculty interviews finish up! (One more week I promise 🤞)

(It's mostly done and sitting on a private branch; I just need to fix up a few more configs and merge things.)

@redbrain

Checking in one more time, since it's been another two weeks! Is it possible to get an ETA on the M2-GPT release?
(Sorry for the persistent reminders, I understand you're busy and just want to make sure this doesn't get buried under everything else.)

@DanFu09
Collaborator

DanFu09 commented May 11, 2024

I'm very hopeful that I'll be able to put it out this week 🤞

@redbrain

Here's another two-week check-in, hopefully the last one :) How is it looking right now?

@sanjayss34

I'm also interested in this. Would you be able to release the code?

@DanFu09
Collaborator

DanFu09 commented Jun 13, 2024

Hi :)

I uploaded a new config and some code changes to a branch of safari: https://github.com/HazyResearch/safari/tree/flashfftconv.

Please see these instructions and let me know how they work: https://github.com/HazyResearch/safari/blob/flashfftconv/experiments.md#m2-gpt . You'll have to use the old fused_fft CUDA kernel in that repo (hopefully a refactor of FlashFFTConv comes soon to make it all play nice).

If it goes well, I'll start the more involved surgery to get the two repos working together (maybe just an update of the other one and a link for now).


7 participants