Cheating to achieve Formal Concept Analysis over a large formal context

Abstract : Researchers are facing one of the main problems of the Information Era. As more articles are made electronically available, it gets harder to follow trends in the different domains of research. Cheap, coherent and fast to construct knowledge models of research domains will be much required when information becomes unmanageable. While Formal Concept Analysis (FCA) has been widely used on several areas to construct knowledge artifacts for this purpose (Ontology development, Information Retrieval, Software Refactoring, Knowledge Discovery), the large amount of documents and terminology used on research domains makes it not a very good option (because of the high computational cost and humanly-unprocessable output). In this article we propose a novel heuristic to create a taxonomy from a large term-document dataset using Latent Semantic Analysis and Formal Concept Analysis. We provide and discuss its implementation on a real dataset from the Software Architecture community obtained from the ISI Web of Knowledge (4400 documents).
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00654576
Contributor : Victor Codocedo <>
Submitted on : Thursday, December 22, 2011 - 12:38:30 PM
Last modification on : Thursday, February 7, 2019 - 3:47:47 PM
Long-term archiving on : Friday, March 23, 2012 - 2:30:11 AM

File

codocedo.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00654576, version 1

Collections

Citation

Victor Codocedo, Carla Taramasco, Hernan Astudillo. Cheating to achieve Formal Concept Analysis over a large formal context. The Eighth International Conference on Concept Lattices and their Applications - CLA 2011, Oct 2011, Nancy, France. pp.349-362. ⟨hal-00654576⟩

Share

Metrics

Record views

503

Files downloads

248