Techniques for Collaborative Development of AI Models in the Age of COVID

Ittai Dayan1, Holger R Roth2, Aoxiao Zhong3,4, Ahmed Harouni2, Amilcare Gentili5, Anas Z Abidin2, Andrew Liu2, Anthony Beardsworth Costa6, Bradford J Wood7,8, Chien-Sung Tsai9, Chih-Hung Wang10,11, Chun-Nan Hsu12, C K Lee2, Peiying Ruan2, Daguang Xu2, Dufan Wu3, Eddie Huang2, Felipe Campos Kitamura13, Griffin Lacey2, Gustavo César de Antônio Corradi13, Gustavo Nino14, Hao-Hsin Shin15, Hirofumi Obinata16, Hui Ren3, Jason C Crane17, Jesse Tetreault2, Jiahui Guan2, John W Garrett18, Joshua D Kaggie19, Jung Gil Park20, Keith Dreyer1,21, Krishna Juluru15, Kristopher Kersten2, Marcio Aloisio Bezerra Cavalcanti Rockenbach21, Marius George Linguraru22,23, Masoom A Haider24,25, Meena AbdelMaseeh25, Nicola Rieke2, Pablo F Damasceno17, Pedro Mario Cruz E Silva2, Pochuan Wang26,27, Sheng Xu7,8, Shuichi Kawano16, Sira Sriswasdi28,29, Soo Young Park30, Thomas M Grist31, Varun Buch21, Watsamon Jantarabenjakul32,33, Weichung Wang26,27, Won Young Tak30, Xiang Li3, Xihong Lin34, Young Joon Kwon6, Abood Quraini2, Andrew Feng2, Andrew N Priest35, Baris Turkbey8,36, Benjamin Glicksberg37, Bernardo Bizzo21, Byung Seok Kim38, Carlos Tor-Díez22, Chia-Cheng Lee39, Chia-Jung Hsu39, Chin Lin40,41,42, Chiu-Ling Lai43, Christopher P Hess17, Colin Compas2, Deepeksha Bhatia2, Eric K Oermann44, Evan Leibovitz21, Hisashi Sasaki16, Hitoshi Mori16, Isaac Yang2, Jae Ho Sohn17, Krishna Nand Keshava Murthy15, Li-Chen Fu45, Matheus Ribeiro Furtado de Mendonça13, Mike Fralick46, Min Kyu Kang20, Mohammad Adil2, Natalie Gangai15, Peerapon Vateekul47, Pierre Elnajjar15, Sarah Hickman19, Sharmila Majumdar17, Shelley L McLeod48,49, Sheridan Reed7,8, Stefan Gräf50, Stephanie Harmon8,51, Tatsuya Kodama16, Thanyawee Puthanakit32,33, Tony Mazzulli52,53,54, Vitor Lima de Lavor13, Yothin Rakvongthai55, Yu Rim Lee30, Yuhong Wen2, Fiona J Gilbert19, Mona G Flores56, Quanzheng Li3

  1. MGH Radiology and Harvard Medical School, Boston, MA, USA.
  2. NVIDIA, Santa Clara, CA, USA.
  3. Center for Advanced Medical Computing and Analysis, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
  4. School of Engineering and Applied Sciences, Harvard University, Boston, MA, USA.
  5. San Diego VA Health Care System, San Diego, CA, USA.
  6. Department of Neurosurgery, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
  7. Radiology & Imaging Sciences/Clinical Center, National Institutes of Health, Bethesda, MD, USA.
  8. National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
  9. Division of Cardiovascular Surgery, Department of Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan.
  10. Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan.
  11. Graduate Institute of Medical Sciences, National Defense Medical Center, Taipei, Taiwan.
  12. Center for Research in Biological Systems, University of California, San Diego, CA, USA.
  13. DasaInova, Diagnósticos da América SA, Barueri, Brazil.
  14. Division of Pediatric Pulmonary and Sleep Medicine, Children's National Hospital, Washington, DC, USA.
  15. Memorial Sloan Kettering Cancer Center, New York, NY, USA.
  16. Self-Defense Forces Central Hospital, Tokyo, Japan.
  17. Center for Intelligent Imaging, Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, USA.
  18. Departments of Radiology and Medical Physics, The University of Wisconsin-Madison School of Medicine and Public Health, Madison, WI, USA.
  19. Department of Radiology, NIHR Cambridge Biomedical Resource Centre, University of Cambridge, Cambridge, UK.
  20. Department of Internal Medicine, Yeungnam University College of Medicine, Daegu, South Korea.
  21. Center for Clinical Data Science, Massachusetts General Brigham, Boston, MA, USA.
  22. Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC, USA.
  23. Departments of Radiology and Pediatrics, The George Washington University School of Medicine and Health Sciences, Washington, DC, USA.
  24. Joint Dept. of Medical Imaging, Sinai Health System, University of Toronto, Toronto, Ontario, Canada.
  25. Lunenfeld-Tanenbaum Research Institute, Toronto, Ontario, Canada.
  26. MeDA Lab Institute of Applied Mathematical Sciences, National Taiwan University, Taipei, Taiwan.
  27. Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan.
  28. Research Affairs, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
  29. Center for Artificial Intelligence in Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
  30. Department of Internal Medicine, School of Medicine, Kyungpook National University, Daegu, South Korea.
  31. Departments of Radiology, Medical Physics, and Biomedical Engineering, The University of Wisconsin-Madison School of Medicine and Public Health, Madison, WI, USA.
  32. Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
  33. Thai Red Cross Emerging Infectious Diseases Clinical Center, King Chulalongkorn Memorial Hospital, Bangkok, Thailand.
  34. Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA.
  35. Department of Radiology, NIHR Cambridge Biomedical Resource Centre, Cambridge University Hospital, Cambridge, UK.
  36. Department of Radiology and Imaging Sciences, National Institutes of Health, Bethesda, MD, USA.
  37. Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
  38. Department of Internal Medicine, Catholic University of Daegu School of Medicine, Daegu, South Korea.
  39. Planning and Management Office, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan.
  40. School of Medicine, National Defense Medical Center, Taipei, Taiwan.
  41. School of Public Health, National Defense Medical Center, Taipei, Taiwan.
  42. Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan.
  43. Medical Review and Pharmaceutical Benefits Division, National Health Insurance Administration, Taipei, Taiwan.
  44. Department of Neurosurgery, NYU Grossman School of Medicine, New York, NY, USA.
  45. MOST/NTU All Vista Healthcare Center, Center for Artificial Intelligence and Advanced Robotics, National Taiwan University, Taipei, Taiwan.
  46. Division of General Internal Medicine and Geriatrics (Fralick), Sinai Health System, Toronto, Ontario, Canada.
  47. Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, Thailand.
  48. Schwartz/Reisman Emergency Medicine Institute, Sinai Health, Toronto, Ontario, Canada.
  49. Department of Family and Community Medicine, University of Toronto, Toronto, Ontario, Canada.
  50. Department of Medicine and NIHR BioResource for Translational Research, NIHR Cambridge Biomedical Research Centre, University of Cambridge, Cambridge, UK.
  51. Clinical Research Directorate, Frederick National Laboratory for Cancer, National Cancer Institute, Frederick, MD, USA.
  52. Department of Microbiology, Sinai Health/University Health Network, Toronto, Ontario, Canada.
  53. Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada.
  54. Public Health Ontario Laboratories, Toronto, Ontario, Canada.
  55. Chulalongkorn University Biomedical Imaging Group and Division of Nuclear Medicine, Department of Radiology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand.
  56. NVIDIA, Santa Clara, CA, USA. mflores@nvidia.com.

Abstract

Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC) >0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site's data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare.

Presented By Holger Roth | ORCID iD