krackers 10 days ago

>I would be interested in reading a paper that does a good job of explaining what a parameter ends up representing in an LLM model.

https://distill.pub/2020/circuits/ https://transformer-circuits.pub/2025/attribution-graphs/bio...

1
ChuckMcM 9 days ago

That's an interesting paper and worth reading. Not sure it has answered my question but I did learn some things from it that I had not considered.

This was the quote I resonated with :-)

"... the discoveries we highlight here only capture a small fraction of the mechanisms of the model."

It sometimes feels a bit like papers on cellular biology with DNA discussions in which descriptions of the enzymes and proteins involved are insightful but the mechanism that operates the reaction remains opaque.