Client for FAH resources, reference implementation for compute service #7

Merged
merged 79 commits into from
Jul 23, 2024

Conversation

dotsdl
Member

@dotsdl dotsdl commented Sep 27, 2023

Closes #1, #3.

Uses a process pool for CPU-bound units, async/await for FAH units
…onds to a FAH RUN

Building out API points in FahAdaptiveSamplingClient to support this
unit, since it is largely responsible for managing its own state on the
work server.

Considering ways of making the work server support partial execution, or
at least not compute the same FahOpenMMSimulationUnit from the
same Task twice.
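
As a rough illustration of the split named in the first commit message above (a process pool for CPU-bound units, async/await for FAH units), here is a minimal sketch. The names `cpu_bound_unit`, `fah_unit`, `run_units`, and `main` are hypothetical stand-ins and are not taken from this PR; the sleep is only a placeholder for waiting on a work server.

```python
import asyncio
from concurrent.futures import Executor, ProcessPoolExecutor


def cpu_bound_unit(n: int) -> int:
    """Hypothetical CPU-bound unit: pure computation, suited to a worker process."""
    return sum(i * i for i in range(n))


async def fah_unit(label: str, poll_interval: float = 0.01) -> str:
    """Hypothetical FAH unit: mostly waits, so it is written as a coroutine."""
    # Placeholder for polling a remote work server until results arrive.
    await asyncio.sleep(poll_interval)
    return f"{label}: done"


async def run_units(executor: Executor, n: int = 1000) -> list:
    """Run one CPU-bound unit in the executor alongside two awaiting FAH units."""
    loop = asyncio.get_running_loop()
    # Offload the CPU-bound work so it does not block the event loop.
    cpu_future = loop.run_in_executor(executor, cpu_bound_unit, n)
    # gather preserves argument order in its result list.
    return await asyncio.gather(cpu_future, fah_unit("unit-a"), fah_unit("unit-b"))


def main() -> None:
    """Assumed entry point: call this under an `if __name__ == "__main__":` guard."""
    with ProcessPoolExecutor(max_workers=2) as pool:
        print(asyncio.run(run_units(pool)))
```

Passing the executor in as an argument keeps the coroutine independent of how the pool is created; whether the compute service in this PR is structured this way is an assumption.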
@dotsdl
Member Author

dotsdl commented Jul 9, 2024

Also, during my deployment testing I realized we've made an oversight in the design of how we interface ProtocolDAG execution with the FAH PRC(G) system. Rather than mapping a Task to a CLONE, we should map each ProtocolUnit to a CLONE, since a given Task's ProtocolDAG may in principle feature any number of ProtocolUnits that are FahSimulationUnits. This means a single Task may require multiple FAH CLONEs.

Fixing this isn't too difficult given how we've laid things out, but making this adjustment will require changes in a few places.

@dotsdl
Member Author

dotsdl commented Jul 15, 2024

Addressed this today; we now use Task-ProtocolUnit pairs to map into FAH CLONEs instead of just Tasks.
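
To make the Task-ProtocolUnit pairing concrete, here is a minimal sketch of an index keyed on the pair rather than on the Task alone. `CloneIndex` and `clone_for` are hypothetical names, and handing out sequential CLONE numbers is an assumption made for illustration; the PR does not state how CLONE numbers are actually allocated.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple


@dataclass
class CloneIndex:
    """Hypothetical helper: one CLONE number per (task, protocol unit) pair."""

    _clones: Dict[Tuple[str, str], int] = field(default_factory=dict)

    def clone_for(self, task_key: str, unit_key: str) -> int:
        """Return the CLONE for this pair, allocating the next number if unseen."""
        pair = (task_key, unit_key)
        if pair not in self._clones:
            # Assumption: CLONE numbers are simply handed out sequentially.
            self._clones[pair] = len(self._clones)
        return self._clones[pair]


index = CloneIndex()
first = index.clone_for("task-1", "unit-a")
second = index.clone_for("task-1", "unit-b")  # same Task, different unit
again = index.clone_for("task-1", "unit-a")   # repeated lookup, no new CLONE
```

Because the lookup is idempotent, asking twice for the same pair returns the same CLONE, which is one way to avoid dispatching the same unit of a Task twice.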

Member Author

@dotsdl dotsdl left a comment

Addressed all comments! Waiting for CI, then merging!

@hmacdope
Collaborator

Congrats @dotsdl

@dotsdl dotsdl merged commit d9971ee into main Jul 23, 2024
3 checks passed
@dotsdl dotsdl deleted the fahcompute branch July 23, 2024 15:39
Development

Successfully merging this pull request may close these issues.

FAHComputeService for executing FAH-based Protocols via a Folding@Home work server
4 participants