I can evaluate this because it’s easy for me to count. But how can I evaluate something else, how can I know whether the LLM ist good at it or not?
I can evaluate this because it’s easy for me to count. But how can I evaluate something else, how can I know whether the LLM ist good at it or not?
Race, gender and whatever are “people that are not like me”. Jokes about old people are jokes about myself, because I (hopefully) will get old myself one day.
Thank God there’s a standard for USB. And another one. And another one. And another one. And another one. And another one. And another one. And another one. And another one. And another one. And another one. And another one…