Will it blend? Blending weak and strong labeled data in a neural network for argumentation mining

Eyal Shnarch; Carlos Alzate; Lena Dankin; Martin Gleize; Yufang Hou; Leshem Choshen; Ranit Aharonov; Noam Slonim

doi:10.18653/v1/p18-2095

ACL 2018

Conference paper

15 Jul 2018

Abstract

Obtaining high-quality labeled data for natural language understanding tasks is often slow, error-prone, complicated, and expensive. With the widespread use of neural networks, this issue becomes more acute, since these networks require large amounts of labeled data to produce satisfactory results. We propose a methodology to blend high-quality but scarce labeled data with noisy but abundant weak labeled data during the training of neural networks. Experiments in the context of topic-dependent evidence detection with two forms of weak labeled data show the advantages of the blending scheme. In addition, we provide a manually annotated data set for the task of topic-dependent evidence detection.
