-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Think through challenge design #12
Comments
Hi David, I have seen the pKa data (by the way the file WP6_pKa_report.pdf does Concerning the possibility of organizing a new SAMPLing Piero |
Hi, sorry for the very long delay on this. I'm in a bit of a rush but I'll try and take a couple of your questions:
The hard part of this is, essentially, identifying a system we think people can converge with somewhat reasonable efficiency and then getting enough folks to agree to do it, as well as finding an approach to serve as a reference method and getting someone to run that. It's a major community initiative/a large cat-herding exercise, and I haven't had bandwidth to lead this. One route forward might be if you wanted to put together a draft of a proposed plan for such a challenge, such as in an editable Google Doc, I could circulate it to our e-mail list for feedback and then depending on the feedback we'd know how hard it would be to push this forward/herd the cats. The hosts you mention sound interesting and like they share some similarity with the CBclip/TrimerTrip type systems that have appeared in past challenges, in that they are significantly flexible. It's interesting that the timescales are already known. Are you proposing those as the focus of a SAMPLing challenge? At some level I thought that the rihgt level of difficulty would be do essentially revisit something like the systems which were in the SAMPL6 SAMPLing challenge, as at that point the field still did not quite reach complete agreement as to the right answer for calculated binding free energies even with equivalent methods. Originally the plan had been to compare method efficiency by checking which converged to the known answer most rapidly, but in the end we were unable to do this since they didn't all converge to the same answer. Before going to systems with more difficult sampling I'd hope we could do an efficiency comparison in a case where we do all get the same answer. |
Migrating this comment rom @procacci in #9 here for better tracking:
I'll try and take this point-by-point for the record:
Honestly, the SAMPL challenges are not staged with a particular purpose in mind other than to (a) allow fair assessment of the state of the art in prospective challenges, and (b) to drive progress in the field. Whether that progress comes from force fields or from sampling methods is dependent on participation/innovation, not on the structure of the challenges necessarily.
That said, the SAMPL6 "SAMPLing" challenge did have a clear focus specifically on sampling methods. One thing on my list has been to stage another "SAMPLing" challenge as I see it important to focus some amount of attention directly on sampling issues (as opposed to overall accuracy, which can be the result of many factors) but I haven't had the bandwidth to orchestrate another such challenge recently, as it requires a great deal of engagement with the community. Are you interested in helping out with organizing one?
This seems like it's an argument for NOT using WP6 for a SAMPLing challenge, but not necessarily for not including WP6 in SAMPL, unless I'm missing something.
I did just add some experimental data on WP6 protonation states in #11
Completely isolating FF from sampling is tricky, in my experience, since even "simple" systems often have some amount of sampling problems. However, as noted, one can pose a "SAMPLing" challenge (as in SAMPL6/as you suggest) to specifically focus on sampling issues. Note, however, such challenges do not necessarily require new data as in such cases the goal is to get the "right" answer (the force field gold standard) efficiently rather than to get the most accurate result.
It is probably time for another such challenge; let me know if you're interested in lending a hand with organization.
In general I agree with you that one needs to address both sampling and force field quality and if sampling is the limiting factor, force field quality won't fix it. That said, my role in running these challenge is... to make them available to the community and let them use the challenges to drive progress, not to decide who gets to use which methods and how. I would note that over the years, far fewer pure QM methods have participated in SAMPL host-guest challenges and far more methods have begun incorporating extensive sampling, as this has proven critical for decent performance. Yes, we still see more expensive force fields used -- but there are far fewer methods now which just do expensive QM calculations on host-guest complexes.
Anyway, let me know if you're interested in helping orchestrate another round of SAMPLing challenges, and what system(s) you think would be best suited for those.
The text was updated successfully, but these errors were encountered: