TY - GEN
T1 - Fortunately, Discourse Markers Can Enhance Language Models for Sentiment Analysis
AU - Ein-Dor, Liat
AU - Shnayderman, Ilya
AU - Spector, Artem
AU - Dankin, Lena
AU - Aharonov, Ranit
AU - Slonim, Noam
N1 - Publisher Copyright:
Copyright © 2022, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2022/6/30
Y1 - 2022/6/30
N2 - In recent years, pretrained language models have revolutionized the NLP world, achieving state-of-the-art performance in various downstream tasks. However, in many cases, these models do not perform well when labeled data is scarce and the model is expected to perform in the zero- or few-shot setting. Recently, several works have shown that continual pretraining, or performing a second phase of pretraining (inter-training) that is better aligned with the downstream task, can lead to improved results, especially in the scarce-data setting. Here, we propose to leverage sentiment-carrying discourse markers to generate large-scale weakly-labeled data, which in turn can be used to adapt general-purpose language models to the task of sentiment classification. In addition, we propose a new method for adapting sentiment classification models to new domains. This method is based on automatic identification of domain-specific sentiment-carrying discourse markers. Extensive experimental results show the value of our approach on various benchmark datasets. Code, models and data are available at https://github.com/ibm/tslm-discourse-markers.
AB - In recent years, pretrained language models have revolutionized the NLP world, achieving state-of-the-art performance in various downstream tasks. However, in many cases, these models do not perform well when labeled data is scarce and the model is expected to perform in the zero- or few-shot setting. Recently, several works have shown that continual pretraining, or performing a second phase of pretraining (inter-training) that is better aligned with the downstream task, can lead to improved results, especially in the scarce-data setting. Here, we propose to leverage sentiment-carrying discourse markers to generate large-scale weakly-labeled data, which in turn can be used to adapt general-purpose language models to the task of sentiment classification. In addition, we propose a new method for adapting sentiment classification models to new domains. This method is based on automatic identification of domain-specific sentiment-carrying discourse markers. Extensive experimental results show the value of our approach on various benchmark datasets. Code, models and data are available at https://github.com/ibm/tslm-discourse-markers.
UR - http://www.scopus.com/inward/record.url?scp=85147546852&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85147546852
T3 - Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022
SP - 10608
EP - 10617
BT - AAAI-22 Technical Tracks 10
PB - Association for the Advancement of Artificial Intelligence
T2 - 36th AAAI Conference on Artificial Intelligence, AAAI 2022
Y2 - 22 February 2022 through 1 March 2022
ER -