Back to 2023
@mikiobraun
Mikio Braun
@mikiobraun
RT @swyx: LLM datasets be like: • First you start with CommonCrawl • Then you add C4, which is just CommonCrawl again, but dont worry abo…