Currently engrossed in the field of RL < Mainly interested in discovering the proper mathematical definition of Intelligence < ML Researcher
-
University of Alberta
- Canada, Toronto
- https://www.linkedin.com/in/erfan-miahi-8637a1130/
- https://orcid.org/0000-0001-7510-083X
- @erfan_mhi
- erfan_mhi
- in/erfan-miahi-8637a1130
Pinned Loading
-
distributed_training
distributed_training PublicImplementation of DDP and FSDP in PyTorch from Scratch using torch primitives
Python
-
Deep-Reinforcement-Learning-CS285-Pytorch
Deep-Reinforcement-Learning-CS285-Pytorch PublicSolutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
-
base_reinforcement_learning
base_reinforcement_learning PublicThis is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experimentation and analysis.
-
-
RLR
RLR PublicA Deep Reinforcement Learning Framework for Post-training LLMs on Reasoning Tasks
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.