Hello, I am trying to use the large ESM-2 models to create embeddings. I first tried the 15B model, then the 3B one, and got OOM errors every time. The 15B model wouldn't even load onto the GPUs (it never reached step 2)! I tried multi-GPU on a single node, activation checkpointing, and CPU sharding, but even the 3B model could not process a single sequence (around 300 amino acids long). I am using modest hardware: 128 cores, 1 TB of RAM, and 4x NVIDIA A100 (80 GB VRAM). So, my question: how much hardware is required to run the 3B model?
Replies: 1 comment 1 reply
I use an A40 with 46 GB of GPU memory, and the 3B model works fine for me. In my tests, a sequence of around 1800 aa is about where it hits the ceiling and gives an OOM error.
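
For a rough sense of scale, here is a minimal back-of-the-envelope sketch of how much GPU memory the weights alone would take. It assumes the nominal parameter counts implied by the model names (3e9 and 15e9, which are approximations, not exact counts) and the standard 4 bytes per fp32 parameter and 2 bytes per fp16 parameter. The function and dictionary names are made up for illustration. Activations, which grow with sequence length, come on top of this and are not estimated here.

```python
# Rough estimate of the memory needed just to hold model weights.
# Parameter counts are nominal values read off the model names (assumption).

NOMINAL_PARAMS = {
    "ESM-2 3B": 3e9,
    "ESM-2 15B": 15e9,
}

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2}


def weight_memory_gib(n_params: float, bytes_per_param: int) -> float:
    """Return the size of the weights in GiB (2**30 bytes)."""
    return n_params * bytes_per_param / 2**30


def estimate_all() -> dict:
    """Return {(model, dtype): GiB} for every model/dtype combination."""
    return {
        (name, dtype): weight_memory_gib(n, nbytes)
        for name, n in NOMINAL_PARAMS.items()
        for dtype, nbytes in BYTES_PER_PARAM.items()
    }


if __name__ == "__main__":
    for (name, dtype), gib in estimate_all().items():
        print(f"{name} weights in {dtype}: about {gib:.1f} GiB")
```

By this estimate the 3B weights fit comfortably on a single 46 GB or 80 GB card in either precision, which matches the experience above. If the 3B model runs out of memory on a 300 aa sequence, the cause is probably something other than weight size, for example gradients being tracked during inference or the sharding setup.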