Optimize the fit gradients #729

HanatoK · 2024-10-15T00:42:49Z

Save the unrotated positions. This can avoid rotating the positions again in calc_fit_forces_impl;
Determine dl, dq or ds by compile-time template options;
Only call calc_fit_gradients when atom groups have explicit gradients;
Simplify and inline cvm::quaternion::position_derivative_inner.

This PR should be only marked as ready after #713 is merged.

giacomofiorin · 2024-10-16T15:03:30Z

@HanatoK We don't currently have a CI job with Intel, but that compiler uses restrict (as well as PGI pre-NVIDIA). See the following from the LAMMPS headers:
https://github.com/lammps/lammps/blob/59bbc5bcc1104bdb4fb45107cd65b5d4d76dbc00/src/lmptype.h#L342-L348

HanatoK · 2024-10-16T15:11:15Z

@HanatoK We don't currently have a CI job with Intel, but that compiler uses restrict (as well as PGI pre-NVIDIA). See the following from the LAMMPS headers: https://github.com/lammps/lammps/blob/59bbc5bcc1104bdb4fb45107cd65b5d4d76dbc00/src/lmptype.h#L342-L348

Thanks! I will try using the same way to apply __restrict.

giacomofiorin · 2024-10-16T17:29:28Z

@HanatoK We don't currently have a CI job with Intel, but that compiler uses restrict (as well as PGI pre-NVIDIA). See the following from the LAMMPS headers: https://github.com/lammps/lammps/blob/59bbc5bcc1104bdb4fb45107cd65b5d4d76dbc00/src/lmptype.h#L342-L348

Thanks! I will try using the same way to apply __restrict.

Awesome! I added an Intel oneAPI CI job in the meantime, which is overkill for just this PR but definitely was a missing feature otherwise.

Wikipedia says "__restrict" is recognized by all three compilers. Let me try it.

See https://github.com/lammps/lammps/blob/59bbc5bcc1104bdb4fb45107cd65b5d4d76dbc00/src/lmptype.h#L342-L348

HanatoK · 2024-10-17T22:11:27Z

I think it is better to document how the fit gradients are computed so I tried my best to improve the doxygen documentation in 2d353eb. I know the SI of the new Colvars paper includes the computation but that is too simplified and not quite informative.

HanatoK force-pushed the optimize_fit_gradients branch 2 times, most recently from 09def4b to 2da467b Compare October 16, 2024 14:09

HanatoK added 7 commits October 16, 2024 13:54

Optimize the fit-gradient calculation by saving the unrotated frame

d49de7a

Minor optimizations

e9d7bba

Determine dl, dq or ds by compile-time template options

aa1618f

Only call calc_fit_gradients when atom groups have explicit gradients

f8fb91f

Workaround for the MSVC compiler

e894682

refactor: use __restrict instead of __restrict__

6d0ca12

Wikipedia says "__restrict" is recognized by all three compilers. Let me try it.

Use the same trick for __restrict as LAMMPS

3de535c

See https://github.com/lammps/lammps/blob/59bbc5bcc1104bdb4fb45107cd65b5d4d76dbc00/src/lmptype.h#L342-L348

HanatoK force-pushed the optimize_fit_gradients branch from 643ed9e to 3de535c Compare October 16, 2024 18:54

chore: improve the docs of the calculation of fit gradients

2d353eb

HanatoK marked this pull request as ready for review October 17, 2024 22:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize the fit gradients #729

Optimize the fit gradients #729

HanatoK commented Oct 15, 2024

giacomofiorin commented Oct 16, 2024

HanatoK commented Oct 16, 2024

giacomofiorin commented Oct 16, 2024

HanatoK commented Oct 17, 2024

Optimize the fit gradients #729

Are you sure you want to change the base?

Optimize the fit gradients #729

Conversation

HanatoK commented Oct 15, 2024

giacomofiorin commented Oct 16, 2024

HanatoK commented Oct 16, 2024

giacomofiorin commented Oct 16, 2024

HanatoK commented Oct 17, 2024