4.7 Review

Exploring sequence-to-sequence taxonomy expansion via language model probing

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 239

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2023.122321

Keywords

Taxonomy; Taxonomy expansion; Sequence to sequence; Structural features

A taxonomy is a knowledge graph used in semantic entailment and other natural language processing tasks. Taxonomy expansion enriches an existing taxonomy by adding new concepts. Our method, TaxoSeq, recasts taxonomy expansion as a sequence-to-sequence task, which lets it use the taxonomy's structural features and handle a variety of expansion cases. It outperforms other methods on SemEval's benchmark datasets.
A taxonomy is a knowledge graph that encodes a concept hierarchy. It plays a significant role in semantic entailment and is widely used in downstream natural language processing tasks. Unlike building a taxonomy from scratch, taxonomy expansion aims to enrich an existing taxonomy by adding new concepts. However, existing methods often model only part of the semantic relationships in the taxonomy and so may miss important features. Moreover, many recent models treat the task as insertion only, which limits them when the new concept cannot be added by a simple insertion into the taxonomy. We therefore propose TaxoSeq, a method that converts taxonomy expansion into a sequence-to-sequence setting, thereby exploiting the full set of structural features and naturally handling more expansion cases. Built on pre-trained language models such as T5, our approach achieves significant improvements over other methods on three public SemEval benchmark datasets.
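
To make the sequence-to-sequence framing concrete, here is a minimal sketch of how a taxonomy position might be written out as text for a model such as T5. The abstract does not specify TaxoSeq's actual input or output format, so the toy taxonomy, the "expand taxonomy:" prefix, the " > " separator, and the helper names `path_to_root` and `make_example` are all assumptions made for illustration only.

```python
from typing import Dict, List, Optional, Tuple

# Toy taxonomy as child -> parent edges (hypothetical data; the root maps to None).
PARENT: Dict[str, Optional[str]] = {
    "entity": None,
    "animal": "entity",
    "mammal": "animal",
    "dog": "mammal",
    "cat": "mammal",
}


def path_to_root(concept: str, parent: Dict[str, Optional[str]]) -> List[str]:
    """Return the chain root -> ... -> concept by following parent links."""
    chain: List[str] = []
    node: Optional[str] = concept
    while node is not None:
        chain.append(node)
        node = parent[node]
    return chain[::-1]


def make_example(
    query: str, anchor: str, parent: Dict[str, Optional[str]]
) -> Tuple[str, str]:
    """Build one (source, target) text pair for attaching `query` under `anchor`.

    The prefix and separator are illustrative choices, not the paper's format.
    """
    source = f"expand taxonomy: {query}"
    target = " > ".join(path_to_root(anchor, parent) + [query])
    return source, target


source, target = make_example("wolf", "mammal", PARENT)
print(source)
print(target)
```

In this sketch the target string carries the whole root-to-concept path rather than a single parent, which is one simple way a text-to-text model could be exposed to structure beyond one edge. Pairs like these could then be used to fine-tune a pre-trained encoder-decoder model.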

Reviews

Primary Rating

4.7
Not enough ratings
