Are research landscapes from submitted project proposals or from the S&T literature similar ? A comparison using text mining and clustering
Résumé
This work aims at studying and comparing, within the scientific field of Information and Communication Technologies (ICT), two types of scientific production, both asking for consequent research efforts. The first considered data set is a corpus of records extracted from a bibliographic database and representing the results of research works published in the scientific and technological literature. The second one is a corpus of records extracted from a database collecting the information related to the proposals answering the calls for projects launched under the aegis of the European Commission in relation to the Seventh Framework Programme (FP7). After the application of a text mining approach operated with tools coming from the NLP (natural language processing) domain, a clustering step supplies a representation of each corpus by producing a 2 thematic map of clusters. Then, with the help of an expert, a content analysis is produced allowing comparing the map and the content of the clusters obtained for each of the two corpora under two criteria: the distribution of the developed works by topic and their potential applicability. This work intends to answer the question: Are the works published by the community of ICT researchers in scientific and technical literature and those developed in projects submitted for funding equivalent in terms of their potential applicability?
Origine : Fichiers produits par l'(les) auteur(s)
Loading...