TY - JOUR
T1 - Inter-Rater Disagreements in Applying the Montreal Classification for Crohn's Disease
T2 - The Five-Nations Survey Study
AU - Ukashi, Offir
AU - Amiot, Aurelien
AU - Laharie, David
AU - Menchén, Luis
AU - Gutiérrez, Ana
AU - Fernandes, Samuel
AU - Pessarelli, Tommaso
AU - Correia, Fábio
AU - Gonzalez-Muñoza, Carlos
AU - López-Cardona, Julia
AU - Calabrese, Giulio
AU - Ferreiro-Iglesias, Rocio
AU - Tamir-Degabli, Natalie
AU - Dussias, Nikolas Konstantine
AU - Mousa, Amjad
AU - Oliveira, Raquel
AU - Richard, Nicolas
AU - Veisman, Ido
AU - Sharif, Kassem
AU - Ben-Horin, Shomron
AU - Soutullo-Castiñeiras, Carlos
AU - Dragoni, Gabriele
AU - Rotulo, Silvia
AU - Favale, Agnese
AU - Calméjane, Louis
AU - Bazin, Thomas
AU - Elosua, Alfonso
AU - Lopes, Sara
AU - Felice, Carla
AU - Mauriz, Violeta
AU - Rodrigues, Inês Coelho
AU - Jougon, Julia
AU - Botto, Inês
AU - Tavares de Sousa, Helena
AU - Bertani, Lorenzo
AU - Abadía, Paula Ripoll
AU - Bernardi, Alice De
AU - Zabana, Yamile
AU - Serra-Ruiz, Xavier
AU - Viola, Anna
AU - Barreiro-de Acosta, Manuel
AU - Yanai, Henit
AU - Armuzzi, Alessandro
AU - Magro, Fernando
AU - Kopylov, Uri
N1 - Publisher Copyright:
© 2025 The Author(s). United European Gastroenterology Journal published by Wiley Periodicals LLC on behalf of United European Gastroenterology.
PY - 2025
Y1 - 2025
N2 - Background: The Montreal classification has been widely used in Crohn's disease since 2005 to categorize patients by the age of onset (A), disease location (L), behavior (B), and upper gastrointestinal tract and perianal involvement. With evolving management paradigms in Crohn's disease, we aimed to assess the performance of gastroenterologists in applying the Montreal classification. Methods: An online survey was conducted among participants at an international educational conference on inflammatory bowel diseases. Participants classified 20 theoretical Crohn's disease cases using the Montreal classification. Agreement rates with the inflammatory bowel diseases board (three expert gastroenterologists whose consensus rating was considered the gold standard) were calculated for gastroenterologist specialists and fellows/specialists with ≤ 2 years of clinical experience. A majority vote < 75% among participants was considered a notable disagreement. The same cases were classified using three large language models (LLMs), ChatGPT-4, Claude-3, and Gemini-1.5, and assessed for agreement with the board and gastroenterologists. Fleiss Kappa was used to assess within-group agreement. Results: Thirty-eight participants from five countries completed the survey. In defining the Montreal classification as a whole, specialists (21/38 [55%]) had a higher agreement rate with the board compared to fellows/young specialists (17/38 [45%]) (58% vs. 49%, p = 0.012) and to LLMs (58% vs. 18%, p < 0.001). Disease behavior classification was the most challenging, with 76% agreement among specialists and fellows/young specialists and 48% among LLMs compared to the inflammatory bowel diseases board. Regarding disease behavior, within-group agreement was moderate (specialists: k = 0.522, fellows/young specialists: k = 0.532, LLMs: k = 0.577; p < 0.001 for all). Notable points of disagreement included: defining disease behavior concerning obstructive symptoms, assessing disease extent via video capsule endoscopy, and evaluating treatment-related reversibility of the disease phenotype. Conclusions: There is significant inter-rater disagreement in applying the Montreal classification, particularly for disease behavior in Crohn's disease. Improved education or revisions to phenotype criteria may be needed to enhance consensus on the Montreal classification.
AB - Background: The Montreal classification has been widely used in Crohn's disease since 2005 to categorize patients by the age of onset (A), disease location (L), behavior (B), and upper gastrointestinal tract and perianal involvement. With evolving management paradigms in Crohn's disease, we aimed to assess the performance of gastroenterologists in applying the Montreal classification. Methods: An online survey was conducted among participants at an international educational conference on inflammatory bowel diseases. Participants classified 20 theoretical Crohn's disease cases using the Montreal classification. Agreement rates with the inflammatory bowel diseases board (three expert gastroenterologists whose consensus rating was considered the gold standard) were calculated for gastroenterologist specialists and fellows/specialists with ≤ 2 years of clinical experience. A majority vote < 75% among participants was considered a notable disagreement. The same cases were classified using three large language models (LLMs), ChatGPT-4, Claude-3, and Gemini-1.5, and assessed for agreement with the board and gastroenterologists. Fleiss Kappa was used to assess within-group agreement. Results: Thirty-eight participants from five countries completed the survey. In defining the Montreal classification as a whole, specialists (21/38 [55%]) had a higher agreement rate with the board compared to fellows/young specialists (17/38 [45%]) (58% vs. 49%, p = 0.012) and to LLMs (58% vs. 18%, p < 0.001). Disease behavior classification was the most challenging, with 76% agreement among specialists and fellows/young specialists and 48% among LLMs compared to the inflammatory bowel diseases board. Regarding disease behavior, within-group agreement was moderate (specialists: k = 0.522, fellows/young specialists: k = 0.532, LLMs: k = 0.577; p < 0.001 for all). Notable points of disagreement included: defining disease behavior concerning obstructive symptoms, assessing disease extent via video capsule endoscopy, and evaluating treatment-related reversibility of the disease phenotype. Conclusions: There is significant inter-rater disagreement in applying the Montreal classification, particularly for disease behavior in Crohn's disease. Improved education or revisions to phenotype criteria may be needed to enhance consensus on the Montreal classification.
KW - Crohn's disease
KW - complicated disease phenotype
KW - inflammatory bowel diseases
KW - large language models
KW - montreal classification
UR - http://www.scopus.com/inward/record.url?scp=85215297278&partnerID=8YFLogxK
U2 - 10.1002/ueg2.12757
DO - 10.1002/ueg2.12757
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 39825768
AN - SCOPUS:85215297278
SN - 2050-6406
JO - United European Gastroenterology Journal
JF - United European Gastroenterology Journal
ER -