inline relevant fields of HookedTransformer rather than nesting it in config #53

JasonGross · 2024-02-19T20:34:44Z

I think we made the wrong decision (/ I gave bad design advice) when making HookedTransformerConfig a field of each of the experiments. I think on reflection the downsides outweigh the upsides.

Upsides:

uniform way of adding CLI arguments (the utility functions can be replaced by a lookup table of the argparse arugments corresponding to each HookedTransformerConfig field)
enforced uniform naming scheme

Downsides:

upgrading HookedTransformer invalidates all of our config hashes / wandb models (see also More compatibility in config #52 and Update dependency transformer-lens to v1.14.0 #40 (comment))
we have to introduce kludges when we want more control over renaming and defaulting arguments, e.g., when we want to be based on sequence length rather than context window size, or use a prime p rather than d_vocab_out, etc.

The text was updated successfully, but these errors were encountered:

JasonGross added engineering priority: medium-low Non-blocking but somewhat time-sensitive, e.g., adds overhead or friction labels Feb 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

inline relevant fields of HookedTransformer rather than nesting it in config #53

inline relevant fields of HookedTransformer rather than nesting it in config #53

JasonGross commented Feb 19, 2024

inline relevant fields of HookedTransformer rather than nesting it in config #53

inline relevant fields of HookedTransformer rather than nesting it in config #53

Comments

JasonGross commented Feb 19, 2024