ML and AI expert

Thread

2 tweets

Yeah it‘s wholly impractical. I was reading papers (yeah… I know) and stumbled upon linformer… it uses low rank matrix approximation on the attention matrices… whatever happened to that?

May 23, 2023 · 05:34

@paul_rietschka I stumbled upon that because LoRA kept talking how very low rank changes can make significant improvements and so on (not surprised about that tbh) So yeah, always seeing the best in people I suspect they kept increasing model sizes for marketing purposes.

May 23, 2023 · 05:37