jasonlim131
  • Philadelphia

Pinned

  1. ARENA_Replication Public

    Forked from callummcdougall/ARENA_2.0

    Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

    HTML

  2. maze_rl_peek Public

    Jupyter Notebook

  3. sparse_AE Public

    Forked from ai-safety-foundation/sparse_autoencoder

    Sparse Autoencoder for Mechanistic Interpretability for Practice

    Python

  4. Contrastive-SAE-Cluster-Steering Public

    Jupyter Notebook

  5. TransformerLensOrg/TransformerLens Public

    A library for mechanistic interpretability of GPT-style language models

    Python · 1.6k stars · 304 forks

  6. KanishkT123/Clustered_SAE_Steering Public

    Clustered SAE Steering Code and Experiments

    Python · 1 star