erfanMhi

Follow

Erfan Miahi erfanMhi

Follow

Currently engrossed in the field of RL < Mainly interested in discovering the proper mathematical definition of Intelligence < ML Researcher

119 followers · 76 following

University of Alberta
Canada, Toronto
https://www.linkedin.com/in/erfan-miahi-8637a1130/
https://orcid.org/0000-0001-7510-083X
@erfan_mhi
erfan_mhi
in/erfan-miahi-8637a1130

Achievements

Achievements

Organizations

Pinned Loading

distributed_training distributed_training Public

Implementation of DDP and FSDP in PyTorch from Scratch using torch primitives

Python
Deep-Reinforcement-Learning-CS285-Pytorch Deep-Reinforcement-Learning-CS285-Pytorch Public

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Python 134 11
base_reinforcement_learning base_reinforcement_learning Public

This is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experimentation and analysis.

Python 12 1
intractai/IntractCodeAPI intractai/IntractCodeAPI Public

Python 9 2
flypi flypi Public

Circuit Analysis for Extracting Components and Connections for XR (Toronto Meta Llama Hackathon)

Python 5
RLR RLR Public

A Deep Reinforcement Learning Framework for Post-training LLMs on Reasoning Tasks

Jupyter Notebook