Back to archive

Thread

4 tweets

2
Now having worked with ML based systems for the past decade or so, I‘m directly wondering how to evaluate this properly and make sure it works as intended.
3
Because if I learned one thing it‘s that eye balling a solution doesn‘t work. In „classical“ code you can test edge cases and you‘ll know „it works“, but with ML you really need good coverage and out of sample testing.
4
So anyone who worked with chatGPT, are they providing services for that? Because from the docs it looks like „it just works“ and I find it hard to believe that.