TY - JOUR
T1 - Measuring flight-destination similarity
T2 - A multidimensional approach
AU - Goldstein, Anat
AU - Hajaj, Chen
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2024/3/15
Y1 - 2024/3/15
N2 - E-tourism websites offer users a vast array of travel destinations and opportunities, necessitating tools that enable destination comparison and intelligent search capabilities. One key requirement for such tools is the ability to measure the similarity between destinations. Over the years, various similarity measurement techniques have been proposed, including user-based and content-based approaches. However, many of these techniques require data preparation or prior domain knowledge from experts. In contrast, this study proposes an innovative approach that requires no prior domain knowledge of flight destinations or their relationships, and utilizes only readily available data. Our approach draws upon concepts from image recognition and natural language processing (NLP) to extract hidden aspects of destinations. Using data from a flight-search website as a testbed, we analyze similarity metrics based on state-of-the-art methods for image recognition, NLP, and product-network analysis. We then compare these metrics to those obtained by human subjects. Our findings suggest that no single method dominates in all aspects, leading us to propose a hybrid method that leverages the strengths of each. The proposed method can be readily applied to measure product similarity in other domains.
AB - E-tourism websites offer users a vast array of travel destinations and opportunities, necessitating tools that enable destination comparison and intelligent search capabilities. One key requirement for such tools is the ability to measure the similarity between destinations. Over the years, various similarity measurement techniques have been proposed, including user-based and content-based approaches. However, many of these techniques require data preparation or prior domain knowledge from experts. In contrast, this study proposes an innovative approach that requires no prior domain knowledge of flight destinations or their relationships, and utilizes only readily available data. Our approach draws upon concepts from image recognition and natural language processing (NLP) to extract hidden aspects of destinations. Using data from a flight-search website as a testbed, we analyze similarity metrics based on state-of-the-art methods for image recognition, NLP, and product-network analysis. We then compare these metrics to those obtained by human subjects. Our findings suggest that no single method dominates in all aspects, leading us to propose a hybrid method that leverages the strengths of each. The proposed method can be readily applied to measure product similarity in other domains.
KW - E-commerce
KW - E-tourism
KW - Image-Embeddings
KW - Multi-Modal Product Representation
KW - Product Network
KW - Product Similarity
KW - Text-Embeddings
UR - http://www.scopus.com/inward/record.url?scp=85173560273&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2023.121802
DO - 10.1016/j.eswa.2023.121802
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85173560273
SN - 0957-4174
VL - 238
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 121802
ER -