
Researchers compare the performance on a neurosurgery board oral examination question bank of three large-language models (LLMs). The study revealed that GPT-4 scored the
highest at 82.6%. It outperformed ChatGPT and Google Bard.
Continue reading…
