A Workflow For On The Fly Normalisation Of 17th c. French

Abstract : Although neural machine translation (NMT) has proven to be the most efficient solution for normalising pre-orthographic texts, the amount of training data it requires remains an obstacle. In this paper, we address for the first time the case of normalising modern French, and we propose a workflow to create the parallel corpus that an NMT solution requires.

https://hal.archives-ouvertes.fr/hal-02276150
Contributor : Simon Gabay
Submitted on : Monday, September 2, 2019 - 12:36:10 PM
Last modification on : Thursday, September 5, 2019 - 1:25:59 AM

File

DH2019_final.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-02276150, version 1

Citation

Simon Gabay, Marine Riguet, Loïc Barrault. A Workflow For On The Fly Normalisation Of 17th c. French. DH2019, ADHO, Jul 2019, Utrecht, Netherlands. ⟨hal-02276150⟩
