Gradient check example #1497
base: develop
Conversation
Codecov Report: all modified and coverable lines are covered by tests ✅

@@            Coverage Diff             @@
##           develop    #1497     +/-  ##
===========================================
- Coverage    84.37%   84.34%   -0.03%
===========================================
  Files          163      163
  Lines        14037    14037
===========================================
- Hits         11844    11840       -4
- Misses        2193     2197       +4

View full report in Codecov by Sentry.
Fixed two small typos.
Thanks. I think I would just skip over check_grad and directly introduce check_grad_multi_eps. The latter performs much better.
Co-authored-by: Daniel Weindl [email protected]
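The multi-eps idea can be sketched in plain NumPy, independent of pypesto's actual API (the function name and signature below are illustrative, not the library's): for each parameter, compare the analytical gradient against central finite differences for several step sizes and keep the step size that agrees best.

```python
import numpy as np

def check_grad_multi_eps(fun, grad, x, eps_values=(1e-3, 1e-5, 1e-7)):
    """Illustrative sketch: for each parameter, compare the analytical
    gradient against central finite differences for several step sizes
    and report the smallest absolute error over all step sizes."""
    g = grad(x)
    best_err = np.full_like(g, np.inf)
    for eps in eps_values:
        for i in range(len(x)):
            e = np.zeros_like(x)
            e[i] = eps
            fd_c = (fun(x + e) - fun(x - e)) / (2 * eps)
            best_err[i] = min(best_err[i], abs(fd_c - g[i]))
    return best_err

# Rosenbrock-style test function with a known analytical gradient
fun = lambda x: (1 - x[0]) ** 2 + 100 * (x[1] - x[0] ** 2) ** 2
grad = lambda x: np.array([
    -2 * (1 - x[0]) - 400 * x[0] * (x[1] - x[0] ** 2),
    200 * (x[1] - x[0] ** 2),
])

err = check_grad_multi_eps(fun, grad, np.array([0.5, -0.3]))
```

Trying several eps values guards against a single unlucky step size: too large a step suffers truncation error, too small a step suffers cancellation, and the best of several is usually informative.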
I agree that
Looks good to me, thanks for this.
"source": [ | ||
"# Gradient checks\n", | ||
"\n", | ||
"It is best practice to do gradient checks before and after gradient-based optimization.\n", |
I think it would be good to include some rationale for why to check it afterwards and what to look for. I.e. except for parameters with active bounds, the values should be close to 0. At the same time, this might make it difficult to get good FD approximations.
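As a hedged sketch of that post-optimization check (the function name, tolerances, and bound test below are illustrative, not pypesto API): flag gradient entries that are far from zero while their parameter is not sitting on an active bound.

```python
import numpy as np

def check_gradient_at_optimum(grad, x, lb, ub, tol=1e-6, bound_tol=1e-10):
    """Illustrative sketch: after optimization, gradient entries should be
    close to 0 except for parameters on an active bound, where the gradient
    may legitimately point outward. Returns a mask of suspicious entries."""
    g = np.asarray(grad)
    at_bound = (np.abs(x - lb) < bound_tol) | (np.abs(x - ub) < bound_tol)
    return (~at_bound) & (np.abs(g) > tol)

# First parameter: interior, gradient ~0 (fine).
# Second parameter: on its lower bound, gradient nonzero (also fine).
suspicious = check_gradient_at_optimum(
    grad=[1e-9, -0.5],
    x=np.array([0.3, 0.0]),
    lb=np.array([0.0, 0.0]),
    ub=np.array([1.0, 1.0]),
)
```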
You mean, a 0 gradient makes the FD approximation difficult?
Yes, that, and one might miss gradient entries that are (incorrectly) always zero.
Shouldn't these already show up before optimisation?
Yes, they should.
Added some comments, more in the direction of how one would interpret the results of gradient checks. Sometimes we get high absolute errors while the relative errors are still OK, so is this an issue or not? Some guidance for users would help.
Not sure if we want to add that type of content to this notebook though, or just leave it as: "This can be done, and it's done using these functions."
"- `fd_f`: FD forward difference\n", | ||
"- `fd_b`: FD backward difference\n", | ||
"- `fd_c`: Approximation of FD central difference (reusing the information from `fd_f` and `fd_b`)\n", | ||
"- `fd_err`: Deviation between forward and backward differences `fd_f`, `fd_b`\n", |
It might be good to add what this represents and why it's here. The name (error) hints at it, but it also shows how much the gradient changes in the local neighbourhood of the point being checked. So it indicates, in a way, whether the step size is small enough, at least when the function is smooth enough.
However, that's not completely true either: at an optimum the forward difference will be positive and the backward difference negative, possibly with large magnitudes, so fd_err will be very high for almost any choice of eps.
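This effect shows up already on the simplest possible example, f(x) = x² at its minimum x = 0, where the true gradient is exactly zero:

```python
f = lambda x: x**2  # minimum at x = 0, true gradient there is 0
x, eps = 0.0, 1e-3

fd_f = (f(x + eps) - f(x)) / eps   # forward difference:  ~ +eps
fd_b = (f(x) - f(x - eps)) / eps   # backward difference: ~ -eps
fd_c = (fd_f + fd_b) / 2           # central difference:  ~ 0
fd_err = abs(fd_f - fd_b)          # ~ 2*eps, huge relative to the gradient 0
```

Shrinking eps shrinks fd_err here, but only proportionally, so the forward/backward deviation never looks small relative to the (zero) gradient.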
"text/plain": [ | ||
" grad fd_f fd_b fd_c \\\n", | ||
"Epo_degradation_BaF3 2.899805e+10 2.898349e+10 2.899516e+10 2.898933e+10 \n", | ||
"k_exp_hetero -1.822477e+03 8.247990e+07 -1.185836e+08 -1.805185e+07 \n", | ||
"k_exp_homo 1.940159e+06 -7.634094e+06 2.620560e+07 9.285754e+06 \n", | ||
"k_imp_hetero 1.324222e+09 9.636109e+08 1.401833e+09 1.182722e+09 \n", | ||
"k_imp_homo 2.759777e+09 2.689595e+09 2.810697e+09 2.750146e+09 \n", | ||
"k_phos -3.183894e+10 -3.189761e+10 -3.181251e+10 -3.185506e+10 \n", | ||
"sd_pSTAT5A_rel -4.106435e+12 -4.106388e+12 -4.106482e+12 -4.106435e+12 \n", | ||
"sd_pSTAT5B_rel -2.467665e+11 -2.468085e+11 -2.467245e+11 -2.467665e+11 \n", | ||
"sd_rSTAT5A_rel 3.684015e+01 -4.769135e+07 4.769149e+07 7.324219e+01 \n", | ||
"\n", | ||
" fd_err abs_err rel_err \n", | ||
"Epo_degradation_BaF3 1.166055e+07 8.729236e+06 3.011190e-04 \n", | ||
"k_exp_hetero 2.010635e+08 1.805003e+07 9.998990e-01 \n", | ||
"k_exp_homo 3.383970e+07 7.345596e+06 7.910607e-01 \n", | ||
"k_imp_hetero 4.382216e+08 1.415004e+08 1.196396e-01 \n", | ||
"k_imp_homo 1.211021e+08 9.630702e+06 3.501887e-03 \n", | ||
"k_phos 8.509938e+07 1.611509e+07 5.058878e-04 \n", | ||
"sd_pSTAT5A_rel 9.372550e+07 6.663599e+02 1.622721e-10 \n", | ||
"sd_pSTAT5B_rel 8.401874e+07 3.362799e+01 1.362745e-10 \n", | ||
"sd_rSTAT5A_rel 9.538284e+07 3.640203e+01 4.970090e-01 " | ||
] | ||
}, |
Do you think we should add some conclusion on this or one of the gradient checks? In the sense of: what would we conclude if we saw this gradient check? The absolute errors are rather high, but that can be expected with finite differences at random points. We see this in fd_err as well, so changing the eps might make sense.
What is reassuring is that the relative error is not too high in most cases.
Would redoing the gradient check with another eps make sense to show how it can affect the check?
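For illustration, the distinction between the two error measures can be reproduced from the grad and fd_c values of the Epo_degradation_BaF3 row in the output above:

```python
# Values from the Epo_degradation_BaF3 row of the gradient check output
grad, fd_c = 2.899805e+10, 2.898933e+10

abs_err = abs(grad - fd_c)     # ~8.7e6: looks alarming in isolation
rel_err = abs_err / abs(fd_c)  # ~3e-4: gradient and FD agree to ~4 digits
```

With gradient entries on the order of 1e10, an absolute error of a few million still means agreement to roughly four significant digits, which is why the relative error is usually the more meaningful diagnostic.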
AmiciObjective.check_gradients_match_finite_differences
#1494
I'd be happy about suggestions on the "Best practices" and "How to fix my gradients".