Few-shot Text Classification with Saliency-equivalent Concatenation

Ying Jia Lin, Yu Fang Chang, Hung Yu Kao, Hsin Yang Wang, Mu Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In few-shot text classification, the lack of significant features limits models from generalizing to data not included in the training set. Data augmentation is a solution to the classification tasks; however, the standard augmentation methods in natural language processing are not feasible in few-shot learning. In this study, we explore data augmentation in few-shot text classification. We propose saliency-equivalent concatenation (SEC)1. The core concept of SEC is to append additional key information to an input sentence to help a model understand the sentence easier. In the proposed method, we first leverage a pre-trained language model to generate several novel sentences for each sample in datasets. Then we leave the most relevant one and concatenate it with the original sentence as additional information for each sample. Our experiments on the two fewshot text classification tasks verified that the proposed method can boost the performance of meta-learning models and outperform the previous unsupervised data augmentation methods.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE 5th International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages74-81
Number of pages8
ISBN (Electronic)9781665471206
DOIs
Publication statusPublished - 2022
Event5th IEEE International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022 - Laguna Hills, United States
Duration: 2022 Sept 192022 Sept 21

Publication series

NameProceedings - 2022 IEEE 5th International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022

Conference

Conference5th IEEE International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022
Country/TerritoryUnited States
CityLaguna Hills
Period22-09-1922-09-21

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems

Fingerprint

Dive into the research topics of 'Few-shot Text Classification with Saliency-equivalent Concatenation'. Together they form a unique fingerprint.

Cite this