Skip to Main content Skip to Navigation
Conference papers

Generating all Possible Palindromes from Ngram Corpora

Abstract : We address the problem of generating all possible palindromes from a corpus of Ngrams. Palin-dromes are texts that read the same both ways. Short palindromes (" race car ") usually carry precise , significant meanings. Long palindromes are often less meaningful, but even harder to generate. The palindrome generation problem has never been addressed, to our knowledge, from a strictly combinatorial point of view. The main difficulty is that generating palindromes require the simultaneous consideration of two interrelated levels in a sequence: the " character " and the " word " levels. Although the problem seems very combina-torial, we propose an elegant yet non-trivial graph structure that can be used to generate all possible palindromes from a given corpus of Ngrams, with a linear complexity. We illustrate our approach with short and long palindromes obtained from the Google Ngram corpus. We show how we can control the semantics, to some extent, by using arbitrary text corpora to bias the probabilities of certain sets of words. More generally this work addresses the issue of modelling human virtuosity from a combinatorial viewpoint, as a means to understand human creativity.
Complete list of metadatas

Cited literature [3 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01344082
Contributor : Jean-Charles Regin <>
Submitted on : Monday, July 11, 2016 - 12:31:46 PM
Last modification on : Thursday, March 5, 2020 - 12:20:24 PM

File

palindromes.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01344082, version 1

Collections

Citation

Alexandre Papadopoulos, Pierre Roy, Jean-Charles Régin, François Pachet. Generating all Possible Palindromes from Ngram Corpora. IJCAI 2015, Jul 2015, Buenos Aires, Argentina. ⟨hal-01344082⟩

Share

Metrics

Record views

161

Files downloads

119