This pilot study evaluated the influence of medical background on the diagnostic quality of ChatGPT-4's responses in Internal Medicine. Third-year students, residents and specialists summarised five complex NEJM clinical cases before querying ChatGPT-4. Diagnostic ranking, assessed by independent experts, revealed that residents significantly outperformed students (OR 2.33, p = .007); though overall performance was low. These findings indicate that user expertise and concise case summaries are critical for optimising AI diagnostics, highlighting the need for enhanced AI training and user interaction strategies.
Influence of medical educational background on the diagnostic quality of ChatGPT-4 responses in internal medicine: A pilot study
Gilardi, Nicolò;Ferrando, Lorenzo;Stabile, Mario;Bellodi, Andrea;Cigolini, Benedetta;Genova, Carlo;Carbone, Federico;Montecucco, Fabrizio;Ballestrero, Alberto;Zoppoli, Gabriele
2025-01-01
Abstract
This pilot study evaluated the influence of medical background on the diagnostic quality of ChatGPT-4's responses in Internal Medicine. Third-year students, residents and specialists summarised five complex NEJM clinical cases before querying ChatGPT-4. Diagnostic ranking, assessed by independent experts, revealed that residents significantly outperformed students (OR 2.33, p = .007); though overall performance was low. These findings indicate that user expertise and concise case summaries are critical for optimising AI diagnostics, highlighting the need for enhanced AI training and user interaction strategies.File in questo prodotto:
| File | Dimensione | Formato | |
|---|---|---|---|
|
Eur J Clin Investigation - 2025 - Gilardi - Influence of medical educational background on the diagnostic quality of.pdf
accesso aperto
Tipologia:
Documento in versione editoriale
Dimensione
414.87 kB
Formato
Adobe PDF
|
414.87 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



