AI showdown: info accuracy on protein quality content in foods from ChatGPT 3.5, ChatGPT 4, bard AI and bing chat

BAYRAM, HATİCE; ÖZTÜRKCAN, SEYFETTİN

doi:10.1108/bfj-02-2024-0158

BAYRAM H. M., ÖZTÜRKCAN S. A.

British Food Journal, vol.126, no.9, pp.3335-3346, 2024 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 126 Issue: 9
Publication Date: 2024
DOI: 10.1108/bfj-02-2024-0158
Journal Name: British Food Journal
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, ABI/INFORM, Aerospace Database, Agricultural & Environmental Science Database, CAB Abstracts, Communication Abstracts, Food Science & Technology Abstracts, Hospitality & Tourism Complete, Hospitality & Tourism Index, Index Islamicus, INSPEC, Metadex, Veterinary Science Database, Civil Engineering Abstracts
Pages: pp.3335-3346
Keywords: AI models, Artificial intelligence, Bard AI, Bing chat, ChatGPT, Food assessment, Sustainability, Sustainable diet
Affiliated with İstanbul Gelişim Üniversitesi: Yes

Abstract

Purpose: This study assesses how accurately four artificial intelligence (AI) models (ChatGPT 3.5, ChatGPT 4, Bard AI and Bing Chat) aggregate information about the protein quality (PQ) content of food items.

Design/methodology/approach: A total of 22 food items, curated from a report by the Food and Agriculture Organisation (FAO) of the United Nations (UN), were input into each model. These items were characterised by their PQ content according to the Digestible Indispensable Amino Acid Score (DIAAS).

Findings: Bing Chat was the most accurate AI assistant, with a mean accuracy rate of 63.6% across all analyses, followed by ChatGPT 4 with 60.6%. ChatGPT 4 (Cohen's kappa: 0.718, p < 0.001) and ChatGPT 3.5 (Cohen's kappa: 0.636, p = 0.002) showed substantial agreement between the baseline and 2nd analysis, but only moderate agreement between the baseline and 3rd analysis (Cohen's kappa: 0.538, p = 0.011 for ChatGPT 4; Cohen's kappa: 0.455, p = 0.030 for ChatGPT 3.5).

Originality/value: This study provides an initial insight into how emerging AI models assess and classify nutrient content pertinent to nutritional knowledge. Further research into the real-world implementation of AI for nutritional advice is essential as the technology develops.
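The agreement figures in the Findings are Cohen's kappa values, which correct the raw agreement between two rounds of answers for the agreement expected by chance. The Python sketch below is illustrative only and is not the authors' analysis code. The function names `cohen_kappa` and `landis_koch` are hypothetical, the example labels are made up, and the verbal bands follow the commonly cited Landis and Koch scale, which the abstract's wording ("substantial", "moderate") appears consistent with but does not name.

```python
from collections import Counter


def cohen_kappa(first, second):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(first)
    # Observed agreement: share of items given the same label in both runs.
    p_o = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement, computed from each run's marginal label counts.
    c1, c2 = Counter(first), Counter(second)
    p_e = sum(c1[label] * c2[label] for label in c1.keys() | c2.keys()) / (n * n)
    if p_e == 1.0:
        return 1.0  # both runs used one identical label throughout
    return (p_o - p_e) / (1 - p_e)


def landis_koch(kappa):
    """Verbal band for a kappa value (Landis and Koch scale, assumed here)."""
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Made-up labels for eight items in two runs; NOT data from the study.
baseline = ["high", "high", "high", "high", "low", "low", "low", "low"]
repeat = ["low", "high", "high", "high", "low", "low", "low", "low"]

kappa = cohen_kappa(baseline, repeat)
print(f"kappa = {kappa:.3f} ({landis_koch(kappa)})")

# Bands for the kappa values reported in the abstract.
for reported in (0.718, 0.636, 0.538, 0.455):
    print(reported, landis_koch(reported))
```

Under this scale, the reported values 0.718 and 0.636 fall in the "substantial" band and 0.538 and 0.455 in the "moderate" band, matching the wording of the Findings.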