4.6 Article

Integrating Text Classification into Topic Discovery Using Semantic Embedding Models

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Chemistry, Multidisciplinary

Comparison of Topic Modelling Approaches in the Banking Context

Bayode Ogunleye et al.

Summary: Topic modelling is a crucial task in various applications, and traditional approaches like LDA have shown good performance but lack consistency due to data sparseness and inability to understand word order. This study introduces the use of KernelPCA and K-means clustering in the BERTopic architecture, which produced coherent topics with a high coherence score of 0.8463 when applied to a new dataset of tweets from Nigerian bank customers.

APPLIED SCIENCES-BASEL (2023)

Article Biology

Lexicon-based sentiment analysis to detect opinions and attitude towards COVID-19 vaccines on Twitter in Italy

Rosario Catelli et al.

Summary: The paper presents a method that combines Natural Language Processing (NLP) and Sentiment Analysis (SA) to analyze sentiments and opinions on COVID-19 vaccination in Italy. The study focuses on a dataset of vaccine-related tweets published between January 2021 and February 2022. By filtering out irrelevant tweets, a total of 353,217 tweets were analyzed. The approach categorizes opinion holders into four classes (Common users, Media, Medicine, Politics) and utilizes NLP tools and domain-specific lexicons to enhance sentiment analysis. The results show an overall negative sentiment, especially among Common users, and different attitudes towards specific events during the examined period.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Engineering, Multidisciplinary

TopicStriKer: A topic kernels-powered approach for text classification

Nikhil Chandran et al.

Summary: TopicStriKer is a model that combines unsupervised topic modeling with supervised string kernels for text classification tasks. It reduces the document corpus using co-occurring topic words and topic proportions per document, and utilizes string kernels for classification, resulting in improved accuracy and reduced training time.

RESULTS IN ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Emerging Trends: SOTA-Chasing

Kenneth Ward Church et al.

Summary: Pursuing state-of-the-art (SOTA) numbers in research papers can have costs, such as missing out on more promising opportunities and potentially leading to unrealistic expectations. Lack of leadership and uncertain reviewing processes are identified as the root causes of SOTA-chasing. This phenomenon is compared to the replication crisis in scientific literature.

NATURAL LANGUAGE ENGINEERING (2022)

Article Genetics & Heredity

TextNetTopics: Text Classification Based Word Grouping as Topics and Topics' Scoring

Malik Yousef et al.

Summary: Medical document classification is a challenging research problem within text classification, and TextNetTopics proposes a novel approach of feature selection based on Bag-of-topics instead of the traditional Bag-of-words. The approach, using the G-S-M method, scores topics to select the top topics for training the classifier, leading to improved accuracy.

FRONTIERS IN GENETICS (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Hierarchical Topic Model Inference by Community Discovery on Word Co-occurrence Networks

Eric Austin et al.

Summary: The popular topic modelling algorithm, Latent Dirichlet Allocation, only produces a simple set of topics. In contrast, the novel algorithm called Community Topic mines communities from word co-occurrence networks to generate topics with a hierarchical structure. Compared to other models, Community Topic uncovers a more coherent topic hierarchy with a tighter relationship between parent and child topics, and it can find this hierarchy more quickly. This algorithm also allows researchers to discover sub- and super-topics on demand, facilitating corpus exploration.

DATA MINING, AUSDM 2022 (2022)

Article Computer Science, Information Systems

Topic Modeling for Interpretable Text Classification From EHRs

Emil Rijcken et al.

Summary: The article discusses the use of topic models for text classification of clinical notes in predictive tasks and how to select a suitable topic model. The study found that there is no correlation between interpretability and predictive performance, with the proposed fuzzy topic modeling algorithm showing the best interpretability while two state-of-the-art methods perform the best in predictive performance.

FRONTIERS IN BIG DATA (2022)

Article Computer Science, Information Systems

Enhancing Big Social Media Data Quality for Use in Short-Text Topic Modeling

Belal Abdullah Hezam Murshed et al.

Summary: This study addresses the issue of poor quality microblog data and proposes a Social Media Data Cleansing Model (SMDCM) to improve data quality for Short-Text Topic Modelling (STTM). By evaluating six topic modelling methods, it was found that GLTM and WNTM were the most effective when applying SMDCM techniques, achieving optimal topic coherence and accuracy values.

IEEE ACCESS (2022)

Article

redBERT

Chaitanya Pandey

International Journal of Open Source Software and Processes (2021)

Article Chemistry, Multidisciplinary

Bert-Based Latent Semantic Analysis (Bert-LSA): A Case Study on Geospatial Data Technology and Application Trend Analysis

Quanying Cheng et al.

Summary: Geospatial data plays a crucial role in research and applications across various fields. This paper introduces a new method for topic discovery, which effectively determines development trends and generates coherent topics by assigning literature to different topics. Through this method, text content can be better revealed and new research hotspots can be identified.

APPLIED SCIENCES-BASEL (2021)

Proceedings Paper Computer Science, Software Engineering

Unsupervised Topic Discovery in User Comments

Christoph Stanik et al.

Summary: On social media platforms, users' comments are valuable but difficult to manage manually. Researchers have proposed automated methods to extract valuable comments and provide insights through topic aggregation. The topic discovery approach using deep natural language processing algorithms achieved high consistency.

29TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2021) (2021)

Review Computer Science, Information Systems

The application of artificial intelligence and data integration in COVID-19 studies: a scoping review

Yi Guo et al.

Summary: The study identified 7 key areas where AI was applied in COVID-19 research but found a lack of heterogeneous data integration in these applications. Most AI applications adopted a single-sourced approach, potentially leading to biased algorithms.

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION (2021)

Article Computer Science, Artificial Intelligence

Lexicon-Grammar based open information extraction from natural language sentences in Italian

Raffaele Guarasci et al.

EXPERT SYSTEMS WITH APPLICATIONS (2020)

Article Computer Science, Information Systems

Deep Sentiment Classification and Topic Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using LSTM Recurrent Neural Network Approach

Hamed Jelodar et al.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2020)

Article Computer Science, Artificial Intelligence

Topic discovery in massive text corpora based on Min-Hashing

Gibran Fuentes-Pineda et al.

EXPERT SYSTEMS WITH APPLICATIONS (2019)

Article Computer Science, Information Systems

Word Sense Disambiguation Using Cosine Similarity Collaborates with Word2vec and WordNet

Korawit Orkphol et al.

FUTURE INTERNET (2019)

Review Computer Science, Information Systems

Text Classification Algorithms: A Survey

Kamran Kowsari et al.

INFORMATION (2019)

Review Computer Science, Information Systems

Textual Analysis for Online Reviews: A Polymerization Topic Sentiment Model

Lijuan Huang et al.

IEEE ACCESS (2019)

Article Computer Science, Information Systems

Online Sales Prediction: An Analysis With Dependency SCOR-Topic Sentiment Model

Lijuan Huang et al.

IEEE ACCESS (2019)

Article Computer Science, Artificial Intelligence

Unsupervised learning of human action categories using spatial-temporal words

Juan Carlos Niebles et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2008)