Comparing Automated Methods to Detect Explicit Content in Song Lyrics

Michael Fell 1 Elena Cabrio 2 Michele Corazza 2 Fabien Gandon 2
2 WIMMICS - Web-Instrumented Man-Machine Interactions, Communities and Semantics
CRISAM - Inria Sophia Antipolis - Méditerranée , Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : The Parental Advisory Label (PAL) is a warning label that is placed on audio recordings in recognition of profanity or inappropriate references, with the intention of alerting parents of material potentially unsuitable for children. Since 2015, digital providers such as iTunes, Spotify, Amazon Music and Deezer also follow PAL guidelines and tag such tracks as "explicit". Nowadays, such labelling is carried out mainly manually on voluntary basis, with the drawbacks of being time consuming and therefore costly, error prone and partly a subjective task. In this paper, we compare automated methods ranging from dictionary-based lookup to state-of-the-art deep neural networks to automatically detect explicit contents in English lyrics. We show that more complex models perform only slightly better on this task, and relying on a qualitative analysis of the data, we discuss the inherent hardness and subjectivity of the task.
Complete list of metadatas

Cited literature [16 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02281137
Contributor : Michael Fell <>
Submitted on : Sunday, September 8, 2019 - 1:11:08 PM
Last modification on : Tuesday, September 10, 2019 - 1:18:35 AM

File

Explicit_Lyrics_RANLP_final.pd...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02281137, version 1

Collections

Citation

Michael Fell, Elena Cabrio, Michele Corazza, Fabien Gandon. Comparing Automated Methods to Detect Explicit Content in Song Lyrics. Conference on Recent Advances in Natural Language Processing (RANLP), Sep 2019, Varna, Bulgaria. ⟨hal-02281137⟩

Share

Metrics

Record views

33

Files downloads

19