@mikiobraun
Mikio Braun
Replying to @paul_rietschka
Yeah, it's wholly impractical. I was reading papers (yeah… I know) and stumbled upon Linformer… it uses low-rank matrix approximation on the attention matrices… whatever happened to that?