Skip to Main content Skip to Navigation
Conference papers

Building Document Treatment Chains Using Reinforcement Learning and Intuitive Feedback

Esther Nicart 1 Bruno Zanuttini 1 Hugo Gilbert 2 Bruno Grilhères 3 Fredéric Praca 4
1 Equipe MAD - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image et Instrumentation de Caen
2 DECISION
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : We model a document treatment chain as a Markov Decision Process, and use reinforcement learning to allow the agent to learn to construct and continuously improve custom-made chains " on the fly ". We build a platform which enables us to measure the impact on the learning of various models, web services, algorithms, parameters, etc. We apply this in an industrial setting, specifically to an open source document treatment chain which extracts events from massive volumes of web pages and other open-source documents. Our emphasis is on minimising the burden of the human analysts, from whom the agent learns to improve guided by their feedback on the events extracted. For this, we investigate different types of feedback, from numerical feedback, which requires a lot of tuning, to partially and even fully qualitative feedback, which is much more intuitive, and demands little to no user calibration. We carry out experiments, first with numerical feedback, then demonstrate that intuitive feedback still allows the agent to learn effectively.
Document type :
Conference papers
Complete list of metadata

Cited literature [25 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01534282
Contributor : Bruno Zanuttini Connect in order to contact the contributor
Submitted on : Wednesday, June 7, 2017 - 2:53:37 PM
Last modification on : Friday, October 8, 2021 - 7:12:12 PM
Long-term archiving on: : Friday, September 8, 2017 - 12:41:17 PM

File

ICTAI16_v0-1.pdf
Files produced by the author(s)

Identifiers

Citation

Esther Nicart, Bruno Zanuttini, Hugo Gilbert, Bruno Grilhères, Fredéric Praca. Building Document Treatment Chains Using Reinforcement Learning and Intuitive Feedback. 28th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2016), Nov 2016, San Jose, United States. pp.635 - 639, ⟨10.1109/ICTAI.2016.0102⟩. ⟨hal-01534282⟩

Share

Metrics

Record views

305

Files downloads

262