Conference paper, 2019

Learning Dynamic Author Representations with Temporal Language Models

Edouard Delasalles
Sylvain Lamprier
Ludovic Denoyer

Abstract

Language models are at the heart of numerous works, notably in the text mining and information retrieval communities. These statistical models aim to extract word distributions, from simple unigram models to recurrent approaches with latent variables that capture subtle dependencies in texts. However, these models are learned from word sequences only; authors' identities and publication dates are seldom considered. We propose a neural model, based on recurrent language modeling, that aims to capture language diffusion tendencies in author communities through time. By conditioning language models on author and temporal vector states, we leverage the latent dependencies between text contexts. This allows us to outperform several temporal and non-temporal language baselines on two real-world corpora, and to learn meaningful author representations that vary through time.
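To make the conditioning idea concrete, below is a minimal sketch of a recurrent language model whose input is augmented with author and time embeddings, in the spirit of the abstract. It is not the authors' implementation: the class name, embedding dimensions, the LSTM choice, and the simple concatenation scheme are all illustrative assumptions.

```python
import torch
import torch.nn as nn

class AuthorTemporalLM(nn.Module):
    """Sketch: an LSTM language model conditioned on author and time vectors.

    Hypothetical architecture for illustration only; dimensions and the
    concatenation-based conditioning are assumptions, not the paper's model.
    """

    def __init__(self, vocab_size, num_authors, num_timesteps,
                 word_dim=128, cond_dim=32, hidden_dim=256):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, word_dim)
        self.author_emb = nn.Embedding(num_authors, cond_dim)    # author vector state
        self.time_emb = nn.Embedding(num_timesteps, cond_dim)    # temporal vector state
        # The recurrent cell sees word embeddings concatenated with both condition vectors.
        self.lstm = nn.LSTM(word_dim + 2 * cond_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, words, author_ids, time_ids):
        # words: (batch, seq_len); author_ids, time_ids: (batch,)
        seq_len = words.size(1)
        w = self.word_emb(words)                                  # (batch, seq_len, word_dim)
        cond = torch.cat([self.author_emb(author_ids),
                          self.time_emb(time_ids)], dim=-1)       # (batch, 2 * cond_dim)
        cond = cond.unsqueeze(1).expand(-1, seq_len, -1)          # repeat over time steps
        h, _ = self.lstm(torch.cat([w, cond], dim=-1))
        return self.out(h)                                        # next-word logits

# Usage: score token sequences written by given authors at given dates.
model = AuthorTemporalLM(vocab_size=10000, num_authors=500, num_timesteps=20)
words = torch.randint(0, 10000, (4, 30))
logits = model(words, torch.tensor([1, 2, 3, 4]), torch.tensor([0, 5, 5, 19]))
print(logits.shape)  # torch.Size([4, 30, 10000])
```

In such a setup, the author and time embeddings are learned jointly with the language model, so authors' representations can drift across time periods while the word-level model stays shared.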

Dates and versions

hal-02466142, version 1 (04-02-2020)

Identifiers

HAL Id: hal-02466142
DOI: 10.1109/ICDM.2019.00022

Cite

Edouard Delasalles, Sylvain Lamprier, Ludovic Denoyer. Learning Dynamic Author Representations with Temporal Language Models. 2019 IEEE International Conference on Data Mining (ICDM), Nov 2019, Beijing, China. pp.120-129, ⟨10.1109/ICDM.2019.00022⟩. ⟨hal-02466142⟩