Blue Network - for SDAD Seminar Series

Social and Decision Analytics Seminar Series: Multiple Instance Classification with Attention Block for Word-level Sentiment Analysis

Event Details

Wednesday, November 16, 2022
1:00pm-2:00pm Eastern Time (ET)


This event is part of our Social and Decision Analytics Seminar Series.

Jing Cao's Headshot

Speaker: Dr. Jing Cao, professor of statistics in the Department of Statistical Science at Southern Methodist University

Abstract: As a branch of machine learning, multiple instance learning (MIL) learns from a collection of labeled bags, each containing a set of instances. Each instance is described by a feature vector. Since its emergence, MIL has been applied to solve various problems including content-based image retrieval, object tracking/detection, and computer-aided diagnosis. In this study, we apply MIL to text sentiment analysis. The current neural-network-based approaches in text analysis enjoy high classification accuracies but usually lack interpretability. The proposed MIL model treats each text document as a bag, where the words are the instances. The model has a two-layered structure. The first layer identifies whether a word is essential or not (i.e., primary instance), and the second layer assigns a sentiment score over the individual words of a document. The motivation of our approach is that by the combination of the attention mechanism from neural networks with a relatively simple statistical model, we can combine the best of two worlds: the interpretability of a statistical model and the high predictive performance of neural-network models. The proposed method is applied to a wine tasting review dataset to demonstrate its interpretability.

Event link TBD.