4.4 Article

DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using Bi-Directional Recurrent Neural Networks

Journal

PROCEEDINGS OF THE VLDB ENDOWMENT
Volume 14, Issue 5, Pages 813-821

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.14778/3446095.3446103

Keywords

-

Funding

  1. Scientific and Technological Research Council of Turkey (TUBITAK) [118E724]

Ask authors/readers for more resources

Translating natural language queries (NLQs) into structured query language (SQL) in relational databases is a challenging task. This study proposes a deep learning based supervised approach called DBTagger that utilizes POS tags of NLQs to address the keyword mapping problem. DBTagger achieves state-of-the-art accuracy results and is significantly faster than traditional methods, making it practical for various relational databases.
Translating Natural Language Queries (NLQs) to Structured Query Language (SQL) in interfaces deployed in relational databases is a challenging task, which has been widely studied in database community recently. Conventional rule based systems utilize series of solutions as a pipeline to deal with each step of this task, namely stop word filtering, tokenization, stemming/lemmatization, parsing, tagging, and translation. Recent works have mostly focused on the translation step overlooking the earlier steps by using adhoc solutions. In the pipeline, one of the most critical and challenging problems is keyword mapping; constructing a mapping between tokens in the query and relational database elements (tables, attributes, values, etc.). We define the keyword mapping problem as a sequence tagging problem, and propose a novel deep learning based supervised approach that utilizes POS tags of NLQs. Our proposed approach, called DBTagger (DataBase Tagger), is an end-to-end and schema independent solution, which makes it practical for various relational databases. We evaluate our approach on eight different datasets, and report new state-of-the-art accuracy results, 92.4% on the average. Our results also indicate that DBTagger is faster than its counterparts up to 10000 times and scalable for bigger databases.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available