An awesome repository and a comprehensive survey on the interpretability of LLM attention heads.
Updated Oct 29, 2024 - TeX
Explore consciousness and self-awareness as they pertain to AI systems.
Repository for the LWDA'24 presentation on 'Psychometric Profiling of GPT Models for Bias Exploration', featuring conference materials including the poster, paper, slides, and references.