Article

Constraction: a tool for the automatic extraction and interactive exploration of linguistic constructions

Journal

LINGUISTICS VANGUARD

Publisher

WALTER DE GRUYTER GMBH
DOI: 10.1515/lingvan-2022-0122

Keywords

linguistic constructions; construction extraction; natural language processing; computer-assisted language learning

This article introduces an open-source tool, Constraction, for the automatic extraction and interactive exploration of linguistic constructions from textual corpora. Constraction features a generic algorithm and a browser-based interface, allowing customizable layers of linguistic annotation and visual representation of extracted patterns. Case studies demonstrate the utility of Constraction in language research and pedagogy.
A central task in empirical and quantitative language studies is the extraction of linguistic constructions important to linguistic theory and application. The great number and variety of such constructions increasingly necessitates computer-assisted extraction, which often proves challenging as it entails a simultaneous analysis of multiple layers of linguistic information latent in large-scale corpora. To address this, we present Constraction, an open-source tool for the automatic extraction and interactive exploration of linguistic constructions from arbitrary textual corpora. Constraction features a generic algorithm that integrates customizable layers of linguistic annotation (e.g., lexical, syntactic, and semantic) to identify constructional patterns of varying sizes and abstraction levels. Its browser-based interface allows users to configure various extraction parameters and enables visual, interactive exploration of the extracted patterns. We demonstrate the utility of Constraction through case studies and discuss its potential applications in language research and pedagogy.
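
As a rough illustration of what integrating several annotation layers into pattern extraction can look like, the Python sketch below enumerates contiguous token windows and lets each slot be expressed at a different level of abstraction (lemma or part of speech), then keeps the patterns that recur. It is not Constraction's actual algorithm or API: the function name `extract_patterns`, its parameters, the frequency threshold, and the toy corpus are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import product

def extract_patterns(sentences, layers=("lemma", "pos"), min_n=2, max_n=3, min_freq=2):
    """Count contiguous patterns whose slots mix annotation layers.

    Hypothetical helper for illustration only, not part of Constraction.
    `sentences` is a list of sentences; each sentence is a list of dicts
    mapping a layer name (e.g. "lemma", "pos") to that token's value.
    """
    counts = Counter()
    for sent in sentences:
        for n in range(min_n, max_n + 1):          # pattern size
            for start in range(len(sent) - n + 1):
                window = sent[start:start + n]
                # Each slot may be rendered on any layer, which yields
                # patterns at different abstraction levels.
                for choice in product(layers, repeat=n):
                    pattern = tuple(
                        f"{layer}:{tok[layer]}" for layer, tok in zip(choice, window)
                    )
                    counts[pattern] += 1
    # Keep only patterns that occur at least `min_freq` times.
    return {p: c for p, c in counts.items() if c >= min_freq}

# Toy, hand-annotated corpus (assumed data, not taken from the article).
toy_corpus = [
    [  # "she gave him books"
        {"lemma": "she", "pos": "PRON"},
        {"lemma": "give", "pos": "VERB"},
        {"lemma": "he", "pos": "PRON"},
        {"lemma": "book", "pos": "NOUN"},
    ],
    [  # "he gives her flowers"
        {"lemma": "he", "pos": "PRON"},
        {"lemma": "give", "pos": "VERB"},
        {"lemma": "she", "pos": "PRON"},
        {"lemma": "flower", "pos": "NOUN"},
    ],
]

patterns = extract_patterns(toy_corpus)
# A partially abstract pattern shared by both sentences:
print(patterns[("pos:PRON", "lemma:give", "pos:PRON")])  # 2
```

In this sketch the fully lexical bigram `("lemma:she", "lemma:give")` occurs only once and is filtered out, while the more abstract `PRON give PRON` pattern survives because both sentences instantiate it. The number of candidate patterns grows exponentially with window size and number of layers, so a practical tool would need pruning or an association measure rather than a raw frequency cutoff.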
