Add a pip-installable, simple implementation of MeZO (along with a distributed impl. and some tests) #26

lebrice · 2023-12-20T21:05:10Z

Hello there!

I was very interested in your work after seeing it a NeurIPS. I'd like to play around with it a bit in the future. In order to do so, I felt it might be useful to add a simple, standalone implementation of your algorithm to your codebase, so people can more easily import it into their codebase and use it.

Here are my contributions, if you're interested:

Add a simple, readable, standalone implementation of the MeZO update in a new mezo package
- mezo.update: Perform a single MeZO update given the model, loss function, inputs, random seed, epsilon and learning rate.
  - NOTE: This also implements a minor improvement w.r.t. to the original algorithm: We can split up the update into smaller chunks whenever a weight matrix is too large. This makes it so the maximum additional VRAM used during a mezo update can be selected apriori, instead of being the size of the largest weight (e.g. the embedding matrix in LLMs).
- mezo.reconstruct_updates: Reconstructs a sequence of MeZO updates given the model, random seeds and projected gradients of each step
- mezo.average_of_updates: Performs the average of multiple MeZO updates given the model, random seeds and projected gradients of each step or worker
- mezo.distributed_update: Distributed update, each worker communicates the projected grads (and random seed implicitly) to all other workers. Each worker ends up reconstructing the average update from all workers.
Add a distributed MeZO update in mezo.distributed
Make this installation pip-installable and add small install instructions in the README.md
Add unit tests for every added major function (mezo.update, mezo.reconstruct_updates, mezo.average_of_updates, mezo.distributed_update)

If you'd like to add these changes to your repo, could you please just make sure that I didn't miss anything in my re-implementation of the algorithm (perhaps by reading through the mezo.update and mezo.distributed_update functions, if possible).

Thanks and congratulations on this great work!

Signed-off-by: Fabrice Normandin <[email protected]>

lebrice · 2023-12-20T21:08:27Z

README.md

+
+
+```bash
+pip install git+https://www.github.com/lebrice/MeZO


Note: If you're interested in merging this PR, then simply accept this change so people pip-install your repo instead of my fork.

Suggested change

pip install git+https://www.github.com/lebrice/MeZO

pip install git+https://www.github.com/princeton-nlp/MeZO

Signed-off-by: Fabrice Normandin <[email protected]>

lebrice · 2024-03-01T17:16:20Z

Hey @gaotianyu1350 @sadhikamalladi @eltociear @danqi, would you be interested in reviewing this contribution to your repo?

Alignment-Lab-AI · 2024-03-27T17:43:58Z

@lebrice appreciate the effort youve gone through, this is adding productively to some experiments were o at the moment!

lebrice · 2024-06-03T17:53:21Z

@gaotianyu1350 @sadhikamalladi @eltociear @danqi

lebrice and others added 9 commits December 20, 2023 15:42

Simple algo implementation

5e0d3b1

Signed-off-by: Fabrice Normandin <[email protected]>

Adding a distributed version

cbd3735

Signed-off-by: Fabrice Normandin <[email protected]>

Add a pip-installable simplified implementation

2f00755

Signed-off-by: Fabrice Normandin <[email protected]>

Add tests for simple distributed version

f665b0b

Signed-off-by: Fabrice Normandin <[email protected]>

Fix test for distributed update, add conftest.py

15e5ed1

Signed-off-by: Fabrice Normandin <[email protected]>

Improve logging messages, remove comment

afcad4b

Signed-off-by: Fabrice Normandin <[email protected]>

Rename mezo_update_step to mezo_update

6d5926f

Signed-off-by: Fabrice Normandin <[email protected]>

Remove redundant 'mezo' from fn names

9274e44

Signed-off-by: Fabrice Normandin <[email protected]>

Remove comment blocks from .pre-commit-config.yaml

08b7421

Signed-off-by: Fabrice Normandin <[email protected]>

lebrice commented Dec 20, 2023

View reviewed changes

Tweak interface, fix multi-gpu bugs

e94265a

Signed-off-by: Fabrice Normandin <[email protected]>

This was referenced Jun 3, 2024

Maybe need a requirement.txt file to facilitate environment preparation？ #33

Open

In which file is the code implemented by the algorithm？ #32

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a pip-installable, simple implementation of MeZO (along with a distributed impl. and some tests) #26

Add a pip-installable, simple implementation of MeZO (along with a distributed impl. and some tests) #26

lebrice commented Dec 20, 2023

lebrice Dec 20, 2023

lebrice commented Mar 1, 2024

Alignment-Lab-AI commented Mar 27, 2024

lebrice commented Jun 3, 2024

	pip install git+https://www.github.com/lebrice/MeZO
	pip install git+https://www.github.com/princeton-nlp/MeZO

Add a pip-installable, simple implementation of MeZO (along with a distributed impl. and some tests) #26

Are you sure you want to change the base?

Add a pip-installable, simple implementation of MeZO (along with a distributed impl. and some tests) #26

Conversation

lebrice commented Dec 20, 2023

lebrice Dec 20, 2023

Choose a reason for hiding this comment

lebrice commented Mar 1, 2024

Alignment-Lab-AI commented Mar 27, 2024

lebrice commented Jun 3, 2024