🏠
Working from home
Pinned
-
LLMs-Distillation-Quantification (Public)
Repo of "Distillation Quantification for Large Language Models"
-
xJailbreak (Public)
Code of paper "xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"
Python 1
-
CEA (Public)
Code of paper "Counterfactual Experience Augmented Off-policy Reinforcement Learning"
Python
-
HdGkde (Public)
A Maximum Entropy Sampling Method Based on High-Dimensional Gaussian Kernel Density Estimation.
Jupyter Notebook