TY - JOUR
T1 - SEGLLM
T2 - 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
AU - Malkiel, Itzik
AU - Alon, Uri
AU - Yehuda, Yakir
AU - Keren, Shahar
AU - Barkan, Oren
AU - Ronen, Royi
AU - Koenigstein, Noam
N1 - Publisher Copyright:
©2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Transcriptions of phone calls are of significant value across diverse fields, such as sales, customer service, healthcare, and law enforcement. Nevertheless, the analysis of these recorded conversations can be an arduous and time-intensive process, especially when dealing with long and multifaceted dialogues. In this work, we propose a novel method, which we name SegLLM, for efficient and accurate call segmentation and topic extraction. SegLLM is composed of offline and online phases. The offline phase is applied once to a given list of topics and involves generating a distribution of synthetic sentences for each topic using a large language model (LLM). The online phase is applied to every call separately and scores the similarity between the transcripted conversation and the topic anchors found in the offline phase. The proposed paradigm provides an accurate and efficient method for call segmentation and topic extraction that does not require labeled data, thus making it a versatile approach applicable to various domains.
AB - Transcriptions of phone calls are of significant value across diverse fields, such as sales, customer service, healthcare, and law enforcement. Nevertheless, the analysis of these recorded conversations can be an arduous and time-intensive process, especially when dealing with long and multifaceted dialogues. In this work, we propose a novel method, which we name SegLLM, for efficient and accurate call segmentation and topic extraction. SegLLM is composed of offline and online phases. The offline phase is applied once to a given list of topics and involves generating a distribution of synthetic sentences for each topic using a large language model (LLM). The online phase is applied to every call separately and scores the similarity between the transcripted conversation and the topic anchors found in the offline phase. The proposed paradigm provides an accurate and efficient method for call segmentation and topic extraction that does not require labeled data, thus making it a versatile approach applicable to various domains.
KW - Call Segmentation
KW - LLM
KW - Transformers
UR - http://www.scopus.com/inward/record.url?scp=105001477546&partnerID=8YFLogxK
U2 - 10.1109/ICASSP48485.2024.10446156
DO - 10.1109/ICASSP48485.2024.10446156
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.conferencearticle???
AN - SCOPUS:105001477546
SN - 1520-6149
SP - 11361
EP - 11365
JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Y2 - 14 April 2024 through 19 April 2024
ER -