Natural language processing (NLP) for hate speech detection in Vietnamese language: challenges and implementation

Van Cong Pham, Thair Al-Dala’in

Research output: Chapter in Book / Conference Paper › Chapter › peer-review

Abstract

This study emphasizes the importance of natural language processing (NLP) for identifying hate speech in Vietnamese on social media, synthesizing findings from previous papers. One finding is that Bidirectional Encoder Representations from Transformers (BERT) and its variants have outperformed Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks in accuracy and reliability. The reviewed research also shows that the complexity and diversity of Vietnamese, including homonyms, polysemy, slang, and regional languages, pose major linguistic challenges that hinder hate speech detection with NLP tools. A substantial research gap remains in this field, as well as in the tools used to process and filter such content. To improve the capacity to identify and reduce harmful speech and to support the development of healthier online communities, future research should concentrate on understanding and resolving these obstacles.

Original language: English
Title of host publication: Proceedings of the 3rd International Conference on Advances in Computing Research (ACR'25)
Editors: Kevin Daimi, Abeer Al Sadoon
Place of Publication: Switzerland
Publisher: Springer
Pages: 113-126
Number of pages: 14
ISBN (Electronic): 9783031876479
ISBN (Print): 9783031876462
DOIs
Publication status: Published - 2025
Event: International Conference on Advances in Computing Research - Radisson Hotel Nice Airport, Nice, France
Duration: 7 Jul 2025 → 8 Jul 2025
Conference number: 3rd

Publication series

Name: Lecture Notes in Networks and Systems
Volume: 1346
ISSN (Print): 2367-3370
ISSN (Electronic): 2367-3389

Conference

Conference: International Conference on Advances in Computing Research
Abbreviated title: ACR
Country/Territory: France
City: Nice
Period: 7/07/25 → 8/07/25

Keywords

  • Bidirectional Encoder Representations from Transformers (BERT)
  • Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM)
  • Hate Speech (HS)
  • Natural language processing (NLP)
  • Vietnamese
