Mikio Braun
@mikiobraun
Part of a thread
GPT-4 and professional benchmarks: the wrong answer to the wrong question
OpenAI may have tested on the training data. Besides, human benchmarks are meaningless for bots.
aisnakeoil.substack.com