Skip to content
Change the repository type filter

All

    Repositories list

    • gpt-neox

      Public
      An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
      Python
      Apache License 2.0
      1k6.9k6031Updated Nov 5, 2024Nov 5, 2024
    • Closed-form polynomial approximations to neural networks
      Python
      MIT License
      0100Updated Nov 5, 2024Nov 5, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      1.8k6.9k30993Updated Nov 5, 2024Nov 5, 2024
    • Jupyter Notebook
      MIT License
      0200Updated Nov 5, 2024Nov 5, 2024
    • Experiments in transformer knowledge and reasoning
      Jupyter Notebook
      MIT License
      12200Updated Nov 5, 2024Nov 5, 2024
    • elk

      Public
      Keeping language models honest by directly eliciting knowledge encoded in their activations.
      Python
      MIT License
      331861510Updated Nov 4, 2024Nov 4, 2024
    • Jupyter Notebook
      Apache License 2.0
      119710Updated Nov 4, 2024Nov 4, 2024
    • monkfish

      Public
      Python
      MIT License
      1400Updated Nov 1, 2024Nov 1, 2024
    • pythia

      Public
      The hub for EleutherAI's work on interpretability and learning dynamics
      Jupyter Notebook
      Apache License 2.0
      1702.3k253Updated Nov 1, 2024Nov 1, 2024
    • website

      Public
      New website for EleutherAI based on Hugo static site generator
      HTML
      6402Updated Oct 31, 2024Oct 31, 2024
    • mdl

      Public
      Minimum Description Length probing for neural network representations
      Python
      MIT License
      21602Updated Oct 31, 2024Oct 31, 2024
    • Precisely estimating the volume of basins in neural net parameter space corresponding to interpretable behaviors
      Jupyter Notebook
      Apache License 2.0
      0000Updated Oct 30, 2024Oct 30, 2024
    • The simplest, fastest repository for training/finetuning medium-sized GPTs.
      Python
      MIT License
      5.9k8001Updated Oct 30, 2024Oct 30, 2024
    • Understanding how features learned by neural networks evolve throughout training
      Python
      MIT License
      13100Updated Oct 24, 2024Oct 24, 2024
    • The code used in "Balancing Label Quantity and Quality for Scalable Elicitation"
      Jupyter Notebook
      MIT License
      2200Updated Oct 22, 2024Oct 22, 2024
    • sae

      Public
      Sparse autoencoders
      Python
      MIT License
      4633331Updated Oct 22, 2024Oct 22, 2024
    • Erasing concepts from neural representations with provable guarantees
      Python
      MIT License
      1520821Updated Oct 15, 2024Oct 15, 2024
    • aria-amt

      Public
      Efficient and robust implementation of seq-to-seq automatic piano transcription.
      Python
      Apache License 2.0
      71800Updated Oct 9, 2024Oct 9, 2024
    • aria

      Public
      Python
      Apache License 2.0
      114000Updated Oct 9, 2024Oct 9, 2024
    • Efficiently computing & storing token n-grams from large corpora
      Rust
      MIT License
      31500Updated Oct 6, 2024Oct 6, 2024
    • Adds GaLore style projection wrappers to optax optimizers
      Python
      MIT License
      0300Updated Oct 3, 2024Oct 3, 2024
    • Equinox implementation of llama3 and llama3.1
      Python
      MIT License
      0610Updated Oct 3, 2024Oct 3, 2024
    • cupbearer

      Public
      A library for mechanistic anomaly detection
      Python
      MIT License
      9500Updated Oct 3, 2024Oct 3, 2024
    • Jupyter Notebook
      54515Updated Oct 1, 2024Oct 1, 2024
    • cookbook

      Public
      Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
      Python
      Apache License 2.0
      3670680Updated Sep 24, 2024Sep 24, 2024
    • w2s

      Public
      Python
      MIT License
      21810Updated Sep 24, 2024Sep 24, 2024
    • ccs

      Public
      Python
      MIT License
      6533Updated Sep 24, 2024Sep 24, 2024
    • A library for efficient patching and automatic circuit discovery.
      Python
      12000Updated Sep 16, 2024Sep 16, 2024
    • Python
      0200Updated Aug 2, 2024Aug 2, 2024
    • Utilities to use the Hugging Face Hub API
      TypeScript
      MIT License
      224200Updated Jul 31, 2024Jul 31, 2024