This release brings two new features:
Top-K Transcoders
- Top-K transcoders are now supported - thanks to @zsquaredz for helping with this! This means that these Llama-3 8B Instruct transcoders are now usable.
- In the
config.yaml, specifyactivation: topkto mark the transcoders as top-k transcoders, and usek: 128to indicate the value ofk(e.g., 128). The weight files for these transcoders should be the same as e.g. relu transcoders. load_relu_transcoderis nowload_transcoder, and serves as a general function for loading per-layer transcoders. It acts the same as the oldload_relu_transcoder, except that you can now pass in theactivation_fnthat you want used.
Local Features
- Thanks to @s-ewbank, there is now a
features_dirargument forserveandcircuit-tracer start-serverthat allows you to specify a local directory where locally-computed features live! This is helpful if you've trained your own transcoders / computed your own features, and don't wish to upload them to Huggingface.