cuEquivariance library has only a slight performance improvement for the inference of MACE models #740
Replies: 2 comments 25 replies
-
I notice a similar result, about 10-20% improvement in speed (for a 25x25x25 angstrom water box). I do notice that the amount of memory used is about half, which is extremely convenient as it allows to use larger systems on the same GPU. |
Beta Was this translation helpful? Give feedback.
-
Hello, You should see up to x5 time speed up for the large model on A100 and H100, I confirmed this during MD. Be careful that the actual speed up that you will see is dependent on many things, small models have less acceleration than large. Changing GPUs will also affect the speed up. It is very delicale to time things, so be careful. What GPU are you using? |
Beta Was this translation helpful? Give feedback.
-
I tried to accelerate inference with cuEquivariance library according to the User Guide (https://mace-docs.readthedocs.io/en/latest/guide/cuda_acceleration.html) of MACE.
But adding the enable_cueq=True only has a slight performance improvement, which is significantly different from the several fold performance improvement mentioned in the blog (https://developer.nvidia.com/blog/accelerate-drug-and-material-discovery-with-new-math-library-nvidia-cuequivariance/).
`atoms = build.molecule('H2O')
macemp = mace_mp(model="MACE_MPtrj_2022.9.model", enable_cueq=True) # Return ASE calculator
descriptors_mp = macemp.get_descriptors(atoms)
start = time.time()
loop = 5
for i in range(loop):
descriptors_mp = macemp.get_descriptors(atoms)
print((time.time() - start) / loop)`
Excuse me, is the case I constructed correct? Can you provide test cases?
Thanks.
Beta Was this translation helpful? Give feedback.
All reactions