New era after ChatGPT in ophthalmology: advances from data-based decision support to patient-centered generative artificial intelligence
Editorial

New era after ChatGPT in ophthalmology: advances from data-based decision support to patient-centered generative artificial intelligence

Joon Yul Choi1, Tae Keun Yoo2,3^

1Department of Biomedical Engineering, Yonsei University, Wonju, South Korea; 2B&VIIT Eye Center, Seoul, South Korea; 3VISUWORKS, Seoul, South Korea

^ORCID: 0000-0003-0890-8614.

Correspondence to: Tae Keun Yoo, MD. B&VIIT Eye Center, B2 GT Tower, 1317-23 Seocho-Dong, Seocho-Gu, Seoul, South Korea; VISUWORKS, Seoul, South Korea. Email: eyetaekeunyoo@gmail.com; fawoo2@yonsei.ac.kr.

Comment on: Jiang X, Xie M, Ma L, et al. International publication trends in the application of artificial intelligence in ophthalmology research: an updated bibliometric analysis. Ann Transl Med 2023;11:219.


Keywords: ChatGPT; generative artificial intelligence (generative AI); ophthalmology; AI


Submitted May 07, 2023. Accepted for publication Jun 28, 2023. Published online Jun 30, 2023.

doi: 10.21037/atm-23-1598


Artificial intelligence (AI) has permeated the medical field to enhance the experiences of clinicians and patients. Jiang et al. summarized the major publications and authors working on AI development in the ophthalmology field so far (1). To date, most studies have focused on designing models that help doctors make fast screenings or arrive at accurate medical decisions by learning from existing clinical data based on machine learning and convolutional neural networks (CNN) (2). Therefore, institutions that have extensive data on existing research medical have quickly developed AI models and have emerged as leaders in academia in this field. Most AI studies have targeted diabetic retinopathy, age-related macular degeneration, glaucoma, and cataracts based on epidemiological research data combined with large-scale fundus photographs and optical coherence tomography (OCT) (3,4). Based on the advances of deep learning technology, AI-based systems such as IDx-DR (Digital Diagnostics, Coralville, IA, USA) and EyeArt (Eyenuk, Los Angeles, CA, USA) have been approved for diabetic retinopathy detection by the US Food and Drug Administration (5). Several clinical trials have demonstrated that these AI systems can serve as low-cost point-of-care diabetic retinopathy detection tools (6). AI has been used in various fields, such as refractive correction surgery (7), ocular disease treatment (8), and oculomics (9). The AI models developed so far have focused on decision support systems for doctors.


Future perspectives on generative AI

In addition to existing data analyses based on the aforementioned technologies, AI technology that generates synthetic data has recently been introduced (Figure 1) (10). In the future, generative AI systems that promote patient convenience rather than doctors will be actively used in the new era of AI. Since the advent of ChatGPT, the public has been able to easily access and interact with AI on chat-like interfaces, which have rapidly accelerated its progress in all areas, including medicine (11). ChatGPT was developed based on GPT-3, a large language model trained to generate realistic texts. Based on its high level of natural language processing ability, it shows the ability to understand unstructured data, new content generation ability, and high versatility. Eventually, it is expected to quickly penetrate medical AI fields related to ophthalmology and expand its scope from simple questions and answers (Q&As) to diagnostic care assistance.

Figure 1 History of development of generative AI. AI, artificial intelligence; VAE, variational autoencoder; GAN, generative adversarial network; DCGAN, deep convolutional generative adversarial network; VQ, vector quantized; ViT, vision transformer; DDPM, denoising diffusion probabilistic model; DDIM, denoising diffusion implicit model; GLIDE, guided language-to-image diffusion for generation and editing; GRU, gated recurrent unit; GPT, generative pre-trained transformer; BERT, bidirectional encoder representations from transformers; LaMDA, Language Model for Dialogue Applications; LLaMA; Large Language Model Meta AI.

Wang et al. conducted several studies based on a basic transformer architecture to analyze electronic medical records (12,13). Medical image generation using generative adversarial networks (GAN) has been conducted for ophthalmology imaging (14). Until now, these language processing and generative technologies have been used by ophthalmologists to solve detailed tasks; however, it is expected that patient-centered AI will be developed in the future, starting with ChatGPT. First, there is an era of searching for and inquiring about patients’ ophthalmic knowledge based on chatbots (15). The future advanced chatbot model does not simply refer to medical text data but also to the analysis of medical records, including ophthalmology images, and notification of analysis results. Secondly, patients can predict the course of future diseases; therefore, they can use various ophthalmological imaging domains based on generative AI (16). This can convey the characteristics and prognosis of the disease to patients more effectively than simple language delivery. For example, patients can understand how the shape of their macula changes after retinal surgery and decide on the surgery (17). The development of diffusion models beyond GAN will make this possible in the future (18,19). Third, smoother communication in telemedicine is possible through new interactive tools based on generated AI, such as virtual humans (digital humans) (20). Virtual doctors can be shown to patients naturally through generative AI, without being constrained by language, using text-to-speech technology. In areas where doctors are scarce, patients can easily understand various types of medical information through virtual humans.


Limitations of generative AI

However, generative AI currently has certain disadvantages. Currently, conventional CNN models can operate on personal computers with graphics cards; however, most large generative models, including ChatGPT, require a large amount of computation and can operate only in large computational environments. This limits the dissemination of new technologies. In addition, shortcomings that must be addressed include difficulties in providing the latest information, frequent errors, and insufficient performance. Owing to the difficulty of learning large models, the latest medical knowledge, especially recent research results, is likely to have difficulty providing information. The provision of inaccurate information in the medical field can cause fatal problems. Although diffusion models continue to appear in the field of image generation following GAN, they still exhibit insufficient performance for generating realistic images. Though so far, the importance of training data has been reported, but the algorithms have not overcome the limitations of small data. Therefore, it is necessary to continue development through continuous data collection and advanced algorithm development.


Conclusions

The article by Jiang et al. summarized the flow of international AI research in ophthalmology over the last decade (1). Just in time, AI in academia and industries are also facing a new phase. AI development in the field of ophthalmology is also expected to enter a new era that will not simply help doctors improve screening accuracy. Patients are increasingly able to use AI and enhance their lives. In the future, research groups that use large generative models effectively and are competent at fine-tuning to adapt large foundation models to their specific tasks will emerge as new leaders in the field of ophthalmology and AI.


Acknowledgments

We would like to thank the Editage for English language editing.

Funding: None.


Footnote

Provenance and Peer Review: This article was commissioned by the editorial office, Annals of Translational Medicine. The article did not undergo external peer review.

Conflicts of Interest: Both authors have completed the ICMJE uniform disclosure form (available at https://atm.amegroups.com/article/view/10.21037/atm-23-1598/coif). TKY reported that he served as a speaker for an academic lecture in VUNO and Hangil Eye Hospital, and served as a lecturer for a commercial conference held by the Korea Association of Intelligence Wellcare Industries (KIWI). TKY is an employee of B&VIIT Eye Center and VISUWORKS. He received a salary as part of the standard compensation package. He also received research grants for refractive surgery from Carl Zeiss Meditec AG. The research grants did not affect this manuscript. The other author has no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.


References

  1. Jiang X, Xie M, Ma L, et al. International publication trends in the application of artificial intelligence in ophthalmology research: an updated bibliometric analysis. Ann Transl Med 2023;11:219. [Crossref] [PubMed]
  2. Jin K, Ye J. Artificial intelligence and deep learning in ophthalmology: current status and future perspectives. Adv Ophthalmol Pract Res 2022;2:100078. [Crossref]
  3. Gulshan V, Peng L, Coram M, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA 2016;316:2402-10. [Crossref] [PubMed]
  4. Keenan TDL, Chen Q, Agrón E, et al. DeepLensNet: Deep Learning Automated Diagnosis and Quantitative Classification of Cataract Type and Severity. Ophthalmology 2022;129:571-84. [Crossref] [PubMed]
  5. Abràmoff MD, Lavin PT, Birch M, et al. Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices. NPJ Digit Med 2018;1:39. [Crossref] [PubMed]
  6. Lim JI, Regillo CD, Sadda SR, et al. Artificial intelligence detection of diabetic retinopathy: subgroup comparison of the EyeArt system with ophthalmologists’ dilated examinations. Ophthalmol Sci 2023;3:100228. [Crossref] [PubMed]
  7. Yoo TK, Ryu IH, Lee G, et al. Adopting machine learning to automatically identify candidate patients for corneal refractive surgery. NPJ Digit Med 2019;2:59. [Crossref] [PubMed]
  8. Yoo TK, Kim SH, Kim M, et al. DeepPDT-Net: predicting the outcome of photodynamic therapy for chronic central serous chorioretinopathy using two-stage multimodal transfer learning. Sci Rep 2022;12:18689. [Crossref] [PubMed]
  9. Wagner SK, Fu DJ, Faes L, et al. Insights into Systemic Disease through Retinal Imaging-Based Oculomics. Transl Vis Sci Technol 2020;9:6. [Crossref] [PubMed]
  10. Celard P, Iglesias EL, Sorribes-Fdez JM, et al. A survey on deep learning applied to medical images: from simple artificial neural networks to generative models. Neural Comput Appl 2023;35:2291-323. [Crossref] [PubMed]
  11. Vaishya R, Misra A, Vaish A. ChatGPT: Is this version good for healthcare and research? Diabetes Metab Syndr 2023;17:102744. [Crossref] [PubMed]
  12. Wang SY, Tseng B, Hernandez-Boussard T. Deep Learning Approaches for Predicting Glaucoma Progression Using Electronic Health Records and Natural Language Processing. Ophthalmol Sci 2022;2:100127. [Crossref] [PubMed]
  13. Hu W, Wang SY. Predicting Glaucoma Progression Requiring Surgery Using Clinical Free-Text Notes and Transfer Learning With Transformers. Transl Vis Sci Technol 2022;11:37. [Crossref] [PubMed]
  14. You A, Kim JK, Ryu IH, et al. Application of generative adversarial networks (GAN) for ophthalmology image domains: a survey. Eye Vis (Lond) 2022;9:6. [Crossref] [PubMed]
  15. Mihalache A, Popovic MM, Muni RH. Performance of an Artificial Intelligence Chatbot in Ophthalmic Knowledge Assessment. JAMA Ophthalmol 2023;141:589-97. [Crossref] [PubMed]
  16. Yoo TK, Choi JY, Kim HK. A generative adversarial network approach to predicting postoperative appearance after orbital decompression surgery for thyroid eye disease. Comput Biol Med 2020;118:103628. [Crossref] [PubMed]
  17. Lee H, Kim S, Kim MA, et al. Post-treatment prediction of optical coherence tomography using a conditional generative adversarial network in age-related macular degeneration. Retina 2021;41:572-80. [Crossref] [PubMed]
  18. KimHKRyuIHChoiJYEarly experience of adopting a generative diffusion model for the synthesis of fundus photographs.Research Square [Preprint]. 2022. Available online: 10.21203/rs.3.rs-2183608/v2
  19. Müller-Franzes G, Niehues JM, Khader F, et al. Diffusion probabilistic models beat GANs on medical images. arXiv:2212.07501 [Preprint]. 2022. Available online: http://arxiv.org/abs/2212.07501
  20. Li J, Zhang Y. Construction of smart medical assurance system based on virtual reality and GANs image recognition. Int J Syst Assur Eng Manag 2022;13:2517-30. [Crossref]
Cite this article as: Choi JY, Yoo TK. New era after ChatGPT in ophthalmology: advances from data-based decision support to patient-centered generative artificial intelligence. Ann Transl Med 2023;11(10):337. doi: 10.21037/atm-23-1598

Download Citation