RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
دراسة بحثية تقيس أداء نماذج #الذكاء_الاصطناعي التوليدي في أسئلة الاختبارات الشفهية لجراحة المخ والأعصاب! الاختبار مكون من 149 سؤال https://t.co/cvuKAcBRXf صحيح أن التخصص دقيق ومخيف، لكن النماذج حققت بعض الإنجازات 👇🧵
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
@currencyat Ahh it’s ok was just a hallucination
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
No one in healthcare has any clue on what’s coming. The next 5 years will change things more than the last 50. Putting this excellent study here as a reminder.
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @emollick: GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
GLT-4 as brain surgeon. GPT-4 scored 83% on neurosurgery board exams, GPT-3.5 got 62%, Bard, 44% Even more intriguing, the paper measured hallucinations. Bard had a hallucination rate of 57% while GPT-4 was just 2% That suggests a potential for real pro
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @RohaidAliMD: [1/5] The oral boards: a challenging final hurdle for neurosurgeons seeking full board certification. How prepared is AI…
RT @RohaidAliMD: [1/5] The oral boards: a challenging final hurdle for neurosurgeons seeking full board certification. How prepared is AI…
RT @IDavidConnolly: 1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the firs…
RT @RohaidAliMD: [1/5] The oral boards: a challenging final hurdle for neurosurgeons seeking full board certification. How prepared is AI…
1 / In our new paper, we challenged LLMs on an oral boards q-bank with mostly higher order questions! Also for the first time, we quantify “hallucinations” — a puzzling phenomenon where LLMs will confidently incorporate falsehoods into responses. https://
RT @RohaidAliMD: [1/5] The oral boards: a challenging final hurdle for neurosurgeons seeking full board certification. How prepared is AI…
[1/5] The oral boards: a challenging final hurdle for neurosurgeons seeking full board certification. How prepared is AI for this demanding exam? We tested Google Bard, ChatGPT, and its successor, GPT-4. Thread: https://t.co/YbmOohhxzI
Performance of ChatGPT, GPT-4, and Google Bard on a Neurosurgery Oral Boards Preparation Question Bank https://t.co/QqP3wFOhGY #medRxiv