Skip to Main content Skip to Navigation
New interface
Conference papers

Comparing Automated Methods to Detect Explicit Content in Song Lyrics

Michael Fell 1 Elena Cabrio 2 Michele Corazza 2 Fabien Gandon 2 
2 WIMMICS - Web-Instrumented Man-Machine Interactions, Communities and Semantics
CRISAM - Inria Sophia Antipolis - Méditerranée , Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : The Parental Advisory Label (PAL) is a warning label that is placed on audio recordings inrecognition of profanity or inappropriate references, with the intention of alerting parents of material potentially unsuitable for children.Since 2015, digital providers – such as iTunes,Spotify, Amazon Music and Deezer – also follow PAL guidelines and tag such tracks as “explicit”. Nowadays, such labelling is carried out mainly manually on voluntary basis, with the drawbacks of being time consuming and therefore costly, error prone and partly a subjective task. In this paper, we compare auto-mated methods ranging from dictionary-basedlookup to state-of-the-art deep neural networks to automatically detect explicit contents in English lyrics. We show that more complex models perform only slightly better on this task, and relying on a qualitative analysis of thedata, we discuss the inherent hardness and subjectivity of the task.
Complete list of metadata

Cited literature [16 references]  Display  Hide  Download
Contributor : Michael Fell Connect in order to contact the contributor
Submitted on : Sunday, September 8, 2019 - 1:11:08 PM
Last modification on : Thursday, August 4, 2022 - 4:54:59 PM
Long-term archiving on: : Thursday, February 6, 2020 - 7:08:39 PM


Files produced by the author(s)


  • HAL Id : hal-02281137, version 1


Michael Fell, Elena Cabrio, Michele Corazza, Fabien Gandon. Comparing Automated Methods to Detect Explicit Content in Song Lyrics. RANLP 2019 - Recent Advances in Natural Language Processing, Sep 2019, Varna, Bulgaria. ⟨hal-02281137⟩



Record views


Files downloads