Human evaluation of large language models in healthcare: gaps, challenges, and the need for standardization
| Title: | Human evaluation of large language models in healthcare: gaps, challenges, and the need for standardization |
|---|---|
| Authors: | Awasthi, RaghavAff1, Aff2; Bhattad, AtharvaAff1; Ramachandran, Sai PrasadAff1; Mishra, ShreyaAff1, Aff3; Khanna, Ashish K.Aff1; Cywinski, Jacek B.Aff1, Aff3; Maheshwari, KamalAff1; Mahapatra, DwarikanathAff1; DiRosa, IzabellaAff1; Cohen, AnabelleAff1; Arshad, HajraAff1; Atreja, AaritAff1; Alshukaili, AsmaAff1; Vohra, AryanAff1; Singh, NishantAff1; Papay, Francis A.Aff1, Aff2, Aff3; Atreja, AshishAff1; Kashyap, RahulAff1, Aff4; Mathur, PiyushAff1, Aff2, Aff3, IDs44401025000432_cor1 |
| Source: | npj Health Systems. 2(1) |
| Database: | Springer Nature Journals |