AI、緊急医療の初期トリアージ診断で医師を上回る -ハーバード大学の研究
ハーバード大学の研究で、大規模言語モデル(LLM)を活用したAIシステムが、緊急医療の初期トリアージ診断において人間の医師を上回る成果を上げたことが判明しました。
AIは電子カルテに基づいた診断において67%の精度を記録し、医師の50〜55%を上回りました。
特に情報が限られた状況での迅速な判断においてAIの優位性が際立っています。
今後は、医師とAIが協働する新しい医療体制が構築されると予想されていますが、AIの誤りや責任に関する課題も残されています。
ハーバード大学による画期的な研究で、AIシステムが緊急医療のトリアージ(患者の重症度判定)において、人間の医師を上回る診断精度を示したとのことだ。この研究は、生命に関わる緊迫した状況下で、AIがどれほど臨床的な判断を下せるのかを検証したものであり、医療の未来に大きな変革をもたらす可能性が注目されている。
緊急トリアージにおけるAIの優位性
ボストンの病院で行われた実験では、AIと人間の医師のペアが同じ電子カルテ(バイタルサインや病歴など)を読み込み、76人の患者の診断を行っている。その結果、AIは67%のケースで正確な、または非常に近い診断を提示し、人間の医師が達成した50〜55%の精度を上回った。特に、情報が限られた状況での迅速な判断が求められるトリアージの場面で、AIの優位性が際立ったと報告されている。
治療計画立案能力の高さ
さらに、AIは単なる診断だけでなく、長期的な治療計画の立案においても高い能力を示した。AIと46人の医師が5つの臨床事例を検討した際、AIは抗生物質の投与計画や終末期ケアの計画などにおいて、従来の検索エンジンなどのリソースを用いた人間の医師(34%)と比較して、89%という有意に優れた計画を提示した。これは、AIが単なるデータ処理を超えた、高度な臨床的推論を行っていることを示唆している。
AIと医師の協働モデルへの移行
研究チームは、AIが医師を完全に置き換えるものではないと強調している。AIはあくまで「セカンドオピニオン」を提供するツールとして機能していると説明されている。今後は、医師、患者、AIシステムという三者の関与する「トライアドケアモデル」が医療現場に導入されると見られている。ただし、AIの判断ミス時の責任の所在や、患者の感情的なサポートといった人間特有の要素は、今後議論されるべき課題だ。
まとめ
この研究は、AIが医療の診断支援において極めて強力なツールとなり得ることを証明した。今後、AIと医師がどのように連携し、より質の高い医療を提供していくのか、その具体的な導入方法と倫理的な枠組みの構築が急務となっている。
原文の冒頭を表示(英語・3段落のみ)
From George Clooney in ER to Noah Wyle in The Pitt, emergency department doctors have long been popular heroes. But will it soon be time to hang up the scrubs?A groundbreaking Harvard study has found that AI systems outperformed human doctors in high-pressure emergency medicine triage, diagnosing more accurately in the potentially life and death moments when people are first rushed to hospital.The results were described by independent experts as showing “a genuine step forward” in the clinical reasoning of AIs and came as part of trials that tested the responses of hundreds of doctors against an AI.The authors said the results, published in the journal Science, showed large language models (LLMs) “have eclipsed most benchmarks of clinical reasoning”.One experiment focused on 76 patients who arrived at the emergency room of a Boston hospital. An AI and a pair of human doctors were each given the same standard electronic health record to read – typically including vital sign data, demographic information and a few sentences from a nurse about why the patient was there. The AI identified the exact or very close diagnosis in 67% of cases, beating the human doctors, who were right only 50%-55% of the time.It showed the AIs’ advantage was particularly pronounced in triage circumstances requiring rapid decisions with minimal information. The diagnosis accuracy of the AI – OpenAI’s o1 reasoning model – rose to 82% when more detail was available, compared with the 70-79% accuracy achieved by the expert humans, though this difference was not statistically significant.It also outperformed a larger cohort of human doctors when asked to provide longer term treatment plans, such as providing antibiotics regimes or planning end-of-life processes. The AI and 46 doctors were asked to examine five clinical case studies and the computer made significantly better plans, scoring 89% compared with 34% for humans using conventional resources, such as search engines.But it is not curtains for emergency doctors yet, the researchers said. The study only tested humans against AIs looking at patient data that can be communicated via text. The AI’s reading of signals, such as the patient’s level of distress and their visual appearance, were not tested. That means the AI was performing more like a clinician producing a second opinion based on paperwork.“I don’t think our findings mean that AI replaces doctors,” said Arjun Manrai, one of the lead authors of the study who heads an AI lab at Harvard Medical School. “I think it does mean that we’re witnessing a really profound change in technology that will reshape medicine.”Dr Adam Rodman, another lead author and a doctor at Boston’s Beth Israel Deaconess medical centre where the study took place, said AI LLMs were among “the most impactful technologies in decades”. Over the next decade, he said, AI would not replace physicians but join them in a new “triadic care model … the doctor, the patient, and an artificial intelligence system”.In one case in the Harvard study, a patient presented with a blood clot to the lungs and worsening symptoms. Human doctors thought the anti-coagulants were failing, but the AI noticed something the humans did not: the patient’s history of lupus meant this might be causing the inflammation of the lungs. The AI was proved correct.Nearly one in five US physicians are already using AI to assist diagnosis, according to research published last month. In the UK, 16% of doctors are using the tech daily and a further 15% weekly, with “clinical decision-making” being one of the most common uses, according to a recent Royal College of Physicians survey.The UK doctors’ biggest concerns were AI error and liability risks. Billions are being invested in AI healthcare companies, but questions remain about the consequences of AI error.“There is not a formal framework right now for accountability,” said Rodman, who also stressed patients ultimately “want humans to guide them through life or death decisions [and] to guide them through challenging treatment decisions”.Prof Ewen Harrison, co-director of the University of Edinburgh’s centre for medical informatics, said the study was important and showed that “these systems are no longer just passing medical exams or solving artificial test cases. They are starting to look like useful second-opinion tools for clinicians, particularly when it is important to consider a wider range of possible diagnoses and avoid missing something important.”Dr Wei Xing, an assistant professor at the University of Sheffield’s school of mathematical and physical sciences, said some of the other findings suggested doctors may unconsciously defer to the AI’s answer rather than thinking independently.“This tendency could grow more significant as AI becomes more routinely used in clinical settings,” he said. He also highlighted the lack of information about which patients the AI was worse at diagnosing and whether it struggled more with elderly patients or non-English speakers.He said: “It does not demonstrate that AI is safe for routine clinical use, nor that the public should turn to freely available AI tools as a substitute for medical advice.”
※ 著作権に配慮し、引用は冒頭3段落までです。続きは元記事をご覧ください。