Audio analysis of customer calls for predicting purchase intentions: A novel approach to e-commerce insights

Yu, Miao

Audio analysis of customer calls for predicting purchase intentions: A novel approach to e-commerce insights

dc.contributor.author	Yu, Miao
dc.contributor.supervisor	Li, Kin Fun
dc.date.accessioned	2024-11-27T16:19:14Z
dc.date.available	2024-11-27T16:19:14Z
dc.date.issued	2024
dc.degree.department	Department of Electrical and Computer Engineering
dc.degree.level	Master of Engineering MEng
dc.description.abstract	Client audio recordings represent a valuable resource for many types of businesses. Utilizing these recordings to identify potential customers can help enhance purchase rates and reduce marketing costs, particularly with different kinds of machine learning methods that automatically label different groups, including positive, neutral, and negative buyers, instead of manual analysis. Though previous research has predominantly focused on text content analysis for this purpose, audio features, which effectively capture voice nuances such as tone, pitch, rhythm, and interaction patterns between interviewers and interviewees, may impact the model performance. This project explored an innovative method. It firstly investigates the effectiveness of emotion detection through audio features, leveraging two datasets: the Toronto Emotional Speech Set (TESS) and the Surrey Audio-Visual Expressed Emotion Dataset (SAVEE). Furthermore, hierarchical clustering techniques are applied to explore the relationship between emotion-related audio features and customer categories using audio data provided by VINN Auto, an e-commerce firm. Next, Exploratory Data Analysis (EDA) is conducted to find the correlation between interaction-related audio features and customer categories, including positive, neutral, and negative buyers within the same dataset after labeling it. Using supervised learning, the results indicate that integrating audio features, including emotion-related and interaction pattern features, can affect the performance of models like Support Vector Machines (SVM), Decision Tree, and Extreme Gradient Boosting (XGBoosts), particularly when combined with traditional audio content-related features such as Term Frequency-Inverse Document Frequency (TF-IDF) scores while applying adjusted weight configuration for positive class. After these exploration, an ensemble method using a soft voting mechanism across these three models is developed to assess whether it can enhance the identification of potential purchasers. The approach of combining emotion-related audio features, interaction pattern features, and content-based features like TF-IDF scores with tailored weight configurations highlights the value of collaborating audio features in customer identification tasks compared with only using content-based features like TF-IDF scores. It could be a robust strategy for improving classification outcomes for the relevant analysis in the future.
dc.description.scholarlevel	Graduate
dc.identifier.uri	https://hdl.handle.net/1828/20805
dc.language.iso	en
dc.rights	Available to the World Wide Web
dc.subject	purchase intention
dc.subject	protential purchasers
dc.subject	audio
dc.subject	emotion-related features
dc.subject	interaction-pattern features
dc.subject	text
dc.subject	content-related features
dc.title	Audio analysis of customer calls for predicting purchase intentions: A novel approach to e-commerce insights
dc.type	project

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Yu_Miao_MEng_2024.pdf
Size:: 3.9 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.62 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Master's Projects