Published in: Cambridge University Press, Natural Language Engineering, 29(2), pp. 287–315, 2022

DOI: 10.1017/s1351324921000486

Towards improving the robustness of sequential labeling models against typographical adversarial examples using triplet loss

Journal article published in 2022 by Can Udomcharoenchaikit, Prachya Boonkwan, and Peerapon Vateekul
This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed
Postprint: archiving forbidden
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Many fundamental tasks in natural language processing (NLP), such as part-of-speech tagging, text chunking, and named-entity recognition, can be formulated as sequence labeling problems. Although neural sequence labeling models have shown excellent results on standard test sets, they are very brittle when presented with misspelled text. In this paper, we introduce an adversarial training framework that enhances robustness against typographical adversarial examples. We evaluate the robustness of sequence labeling models with an adversarial evaluation scheme that includes typographical adversarial examples, generated either without access (black-box) or with full access (white-box) to the target model's parameters. We conducted extensive experiments on three languages (English, Thai, and German) across three sequence labeling tasks. The experiments show that the proposed adversarial training framework provides better resistance to adversarial examples on all tasks, and we found that the model's robustness on the chunking task can be further improved by including a triplet loss constraint.
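
The abstract does not spell out how the typographical perturbations or the triplet loss constraint are implemented, so the sketch below is only an illustrative reading of those two ideas, not the authors' actual method. The character edit operations (swap, delete, substitute), the toy embedding encoder, and the margin value are assumptions made for demonstration.

# Illustrative sketch only: the abstract does not specify the perturbation
# operations or the loss configuration. The edit operations, the toy
# embedding encoder, and the margin value below are assumptions.
import random

import torch
import torch.nn as nn


def perturb_word(word: str, rng: random.Random) -> str:
    """Apply one random character-level edit, a common black-box typo model."""
    if len(word) < 2:
        return word
    i = rng.randrange(len(word) - 1)
    op = rng.choice(["swap", "delete", "substitute"])
    if op == "swap":        # transpose two adjacent characters
        return word[:i] + word[i + 1] + word[i] + word[i + 2:]
    if op == "delete":      # drop one character
        return word[:i] + word[i + 1:]
    # substitute one character with a random lowercase letter
    return word[:i] + rng.choice("abcdefghijklmnopqrstuvwxyz") + word[i + 1:]


rng = random.Random(0)
print(perturb_word("labeling", rng))  # e.g. a transposition or dropped letter

# Triplet loss: pull the representation of a clean token (anchor) toward its
# misspelled counterpart (positive) and away from an unrelated token (negative).
encoder = nn.Embedding(1000, 64)       # stand-in for the sequence encoder
triplet = nn.TripletMarginLoss(margin=1.0)
anchor = encoder(torch.tensor([1]))    # clean token
positive = encoder(torch.tensor([2]))  # its misspelled form
negative = encoder(torch.tensor([3]))  # an unrelated token
loss = triplet(anchor, positive, negative)

In this reading, minimizing the triplet loss encourages the encoder to produce representations that change little under typos, which is one plausible way a triplet constraint could improve robustness to typographical adversarial examples.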