I'm continuously surprised that some people get good results out of GPT models. They sort of fail on my personal benchmarks for me.
Maybe GPT needs a different approach to prompting? (as compared to eg Claude, Gemini, or Kimi)
replies(1):
Maybe GPT needs a different approach to prompting? (as compared to eg Claude, Gemini, or Kimi)