TY - GEN
T1 - Harmony in Diagnosis
T2 - 13th International Conference on Complex Networks and their Applications, COMPLEX NETWORKS 2024
AU - Satel, Sanad
AU - Kour, George
AU - Zinger, Eyal
AU - Ganzach, Yoav
AU - Cohen, Sarel
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - This research explores the variability in decision-making among clinical psychologists by applying machine learning techniques to reverse-engineer their judgments. Using a dataset of 861 patients evaluated by 29 judges (both experienced psychologists and trainees), we trained decision tree and linear regression models for each judge to capture their decision patterns. We employed methods such as Jaccard similarity and pairwise Mean Squared Error (MSE) to quantify the distance between judges’ models, providing a clear metric of variability. Additionally, we applied SMOTE for data augmentation to enhance model training and improve robustness in comparisons. To gain deeper insights into the decision-making process, we used Large Language Models (LLMs) to explain and compare individual and pairwise models, highlighting key differences in clinical judgment. Our findings reveal significant variability in how judges weigh psychological traits, offering valuable insights into the potential for improving consistency and understanding judgment discrepancies in clinical practice. You can also find the related code in the following GitHub Repository.
AB - This research explores the variability in decision-making among clinical psychologists by applying machine learning techniques to reverse-engineer their judgments. Using a dataset of 861 patients evaluated by 29 judges (both experienced psychologists and trainees), we trained decision tree and linear regression models for each judge to capture their decision patterns. We employed methods such as Jaccard similarity and pairwise Mean Squared Error (MSE) to quantify the distance between judges’ models, providing a clear metric of variability. Additionally, we applied SMOTE for data augmentation to enhance model training and improve robustness in comparisons. To gain deeper insights into the decision-making process, we used Large Language Models (LLMs) to explain and compare individual and pairwise models, highlighting key differences in clinical judgment. Our findings reveal significant variability in how judges weigh psychological traits, offering valuable insights into the potential for improving consistency and understanding judgment discrepancies in clinical practice. You can also find the related code in the following GitHub Repository.
KW - medical decision making
KW - medical ML
KW - relations between judges
UR - http://www.scopus.com/inward/record.url?scp=105002040330&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-82435-7_17
DO - 10.1007/978-3-031-82435-7_17
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:105002040330
SN - 9783031824340
T3 - Studies in Computational Intelligence
SP - 207
EP - 217
BT - Complex Networks and Their Applications XIII - Proceedings of The 13th International Conference on Complex Networks and Their Applications
A2 - Cherifi, Hocine
A2 - Donduran, Murat
A2 - Rocha, Luis M.
A2 - Cherifi, Chantal
A2 - Varol, Onur
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 10 December 2024 through 12 December 2024
ER -