Icon Icon Icon Icon Icon

Geometrical structures in the self-attention matrices of Transformer models



  • we are preparing the manuscript, work in progress :)







refs: