Interested in interpretable ML, particularly for LLMs?
E.g., "causal" interpretability, as in the "OthelloGPT" paper [1]?
Let's connect!
[1] https://arxiv.org/abs/2210.13382
#ai #machinelearning #interpretability #interpretableml #mechanisticinterpretability