Building models for generating realistic, synthetic conversational data to assess clinicians’ motivational interviewing (MI) skills. A data-centric challenge to advance healthcare AI.
We aim to build models for generating realistic, synthetic conversational data used to train automated assessment systems for Motivational Interviewing (MI).
These skills are evaluated using the Motivational Interviewing Treatment Integrity (MITI) rubric. However, building reliable models is hindered by the scarcity of high-quality, privacy-compliant data.
The Goal: Generate synthetic "volleys" (conversation turns) to train models in classifying pairs of Behavior Codes (BCs), helping identify proficiency or deficiency in clinical interviewing skills.
Participants are expected to produce high-quality datasets representing a wide range of clinical conversations to enhance the performance of a frozen baseline model (DistillBert).
We invite the community to develop Generative AI (GenAI) methods for creating synthetic conversation turns that can substantially improve the performance of models trained to recognize behavior codes (BCs) in the context of motivational interviews.
A Behavior Code (BC) is a discrete, observable clinician action (e.g., asking a question, giving information) counted during the coding of a motivational interviewing session.
These codes allow raters to tally how often particular clinician behaviours occur, helping assess adherence to MI-consistent versus MI-inconsistent practice.
Our ultimate goal is to generate valuable data for training models for the automatic assessment of clinicians’ motivational-interviewing skills.
These skills (crucial for promoting behavior change among patients) can be evaluated by using the Motivational Interviewing Treatment Integrity (MITI) rubric.
This is a data-centric competition: participants are expected to produce high-quality datasets representing a wide range of clinical conversations (rather than training a model) to enhance the performance of a frozen baseline model used for BC classification.
We encourage participants to include samples featuring clients from diverse backgrounds, varied conversation topics, and conversing with different types of health professionals.
Submissions must generate content for these specific categories, covering both MI-consistent behaviors and anti-MI behaviors.
Neutral education without persuasion.
Open and closed queries seeking client response.
Repeating or rephrasing without adding meaning.
Adding substantial meaning or emphasis.
Overt attempts to change opinions without autonomy.
Participants in this competition must provide three datasets, one for each pair of considered BCs, each containing at most 100 labeled conversation turns. These datasets will be used to fine-tune pretrained models. The fine-tuned models will then generate predictions for a held-out dataset. The performance of the fine-tuned models on this held-out dataset will serve as the primary evaluation metric for ranking participants.
The BC pairs are: (1) Simple Reflection vs. Complex Reflection; (2) Open Question vs. Closed Question; (3) Persuasion vs. Giving Information.
Contains
Contains
Contains
Start of the development phase.
Start of the final phase.
End of evaluation campaign.
Publication of official results.
Deadline for paper submission.
Acceptance notification.
Camera-ready submission deadline.
Publication of proceedings.
Workshop with SEPLN 2026.
The researchers behind MIRROR@IberLEF2026

Primary Contact
INAOE, Mexico
Research Scientist at INAOE. Vice chair of IAPR TC12. Organizer of challenges at NeurIPS, ICPR, and more.
hugo.jair@gmail.com
Professor of Global Public Health
University of Michigan, USA
Expert in evidence-based clinical communication for behavior change. Lectures on clinical AI applications.
jpiette@umich.edu
Researcher
INAOE, Mexico
Member of AMNLP. Research focuses on NLP in social media, sentiment analysis, and irony detection.
dirazuhf@inaoep.mx
Research Scientist
INAOE, Mexico
Organizer of the Annual Mexican Workshop of Language Technologies. Former president of AMNLP.
villasen@inaoep.mx
Research Scientist
INAOE, Mexico
Author of 200+ papers in text mining and authorship analysis. Program chair for IBERAMIA.
mmontesg@inaoep.mx
PhD Student
INAOE, Mexico
PhD Student in Computer Science, focusing in Speech and Language
ca_olachea@inaoep.mx
PhD Student
INAOE, Mexico
PhD student in Computer Science, focusing on Vision and Language.
arellano.luis@inaoep.mx
Primary Contact
INAOE, Mexico
Research Scientist at INAOE. Vice chair of IAPR TC12. Organizer of challenges at NeurIPS, ICPR, and more.
hugo.jair@gmail.com
Professor of Global Public Health
University of Michigan, USA
Expert in evidence-based clinical communication for behavior change. Lectures on clinical AI applications.
jpiette@umich.edu
Researcher
INAOE, Mexico
Member of AMNLP. Research focuses on NLP in social media, sentiment analysis, and irony detection.
dirazuhf@inaoep.mx
Research Scientist
INAOE, Mexico
Organizer of the Annual Mexican Workshop of Language Technologies. Former president of AMNLP.
villasen@inaoep.mx
Research Scientist
INAOE, Mexico
Author of 200+ papers in text mining and authorship analysis. Program chair for IBERAMIA.
mmontesg@inaoep.mx
PhD Student
INAOE, Mexico
PhD Student in Computer Science, focusing in Speech and Language
ca_olachea@inaoep.mx
PhD Student
INAOE, Mexico
PhD student in Computer Science, focusing on Vision and Language.
arellano.luis@inaoep.mxSupported By


