

Click counts are tied to the amount of money that online advertisers pay news sites. This business model has pushed some news sites to resort to clickbait, i.e., headlines that use hyperbolic or intriguing words, and sometimes unfinished sentences, to deliberately tease readers. Some Indonesian online news sites have also adopted clickbait, which indirectly degrades the credibility of other, established news sites. To detect clickbait headlines, a neural network was trained that uses the pre-trained multilingual bidirectional encoder representations from transformers (BERT) language model as an embedding layer, followed by a 100-node hidden layer and a sigmoid classifier. Trained on a dataset of 6,632 headlines and evaluated with 5-fold cross-validation, the classifier achieved an accuracy of 0.914, an F1-score of 0.914, a precision of 0.916, and a receiver operating characteristic area under the curve (ROC-AUC) of 0.92. The results show that multilingual BERT is usable for Indonesian text classification and can be enhanced further. Future possibilities, societal impact, and limitations of clickbait detection are discussed.
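The evaluation protocol named in the abstract (5-fold cross-validation reporting accuracy, precision, and F1) can be illustrated with a minimal sketch in plain Python. This is not the authors' code: the helper names `k_fold_indices`, `binary_metrics`, and `cross_validate`, the toy headlines, and the keyword baseline are all hypothetical, and the sketch assumes unshuffled contiguous folds, clickbait as the positive class (label 1), and simple averaging of per-fold scores. ROC-AUC is left out because it needs predicted scores rather than hard labels.

```python
from typing import Callable, Dict, List, Sequence, Tuple


def k_fold_indices(n: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Split indices 0..n-1 into k contiguous folds of near-equal size.

    Returns one (train_indices, test_indices) pair per fold.
    Assumption: no shuffling or stratification, to keep the sketch short.
    """
    sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        folds.append((train, test))
        start += size
    return folds


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, recall, and F1 with label 1 as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


def cross_validate(
    X: Sequence[str],
    y: Sequence[int],
    fit_predict: Callable[[List[str], List[int], List[str]], List[int]],
    k: int = 5,
) -> Dict[str, float]:
    """Run k-fold cross-validation and average each metric over the folds."""
    per_fold = []
    for train, test in k_fold_indices(len(X), k):
        preds = fit_predict([X[i] for i in train], [y[i] for i in train], [X[i] for i in test])
        per_fold.append(binary_metrics([y[i] for i in test], preds))
    return {name: sum(m[name] for m in per_fold) / k for name in per_fold[0]}


if __name__ == "__main__":
    # Hypothetical toy data and baseline; a real run would plug in the trained classifier.
    headlines = ["You won't believe this!", "Parliament passes budget"] * 5
    labels = [1, 0] * 5

    def keyword_baseline(train_X, train_y, test_X):
        # Ignores the training data and flags headlines ending in "!" as clickbait.
        return [1 if text.endswith("!") else 0 for text in test_X]

    print(cross_validate(headlines, labels, keyword_baseline, k=5))
```

Any model can be dropped in through the `fit_predict` callback, provided it trains on the training fold and returns one 0/1 label per test headline.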

Original language: English
Pages (from-to): 2921-2930
Number of pages: 10
Journal: International Journal of Electrical and Computer Engineering
Issue number: 3
Publication status: Published - Jun 2023


  • Adult literacy
  • Clickbait
  • Natural language processing
  • Online news


Title: Flagging clickbait in Indonesian online news websites using fine-tuned transformers
