Code2vect: An efficient heterogenous data classifier and nonlinear regression technique - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Comptes Rendus Mécanique Année : 2019

Code2vect: An efficient heterogenous data classifier and nonlinear regression technique

Résumé

The aim of this paper is to present a new classification and regression algorithm based on Artificial Intelligence. The main feature of this algorithm, which will be called Code2Vect, is the nature of the data to treat: qualitative or quantitative and continuous or discrete. Contrary to other artificial intelligence techniques based on the “Big-Data,” this new approach will enable working with a reduced amount of data, within the so-called “Smart Data” paradigm. Moreover, the main purpose of this algorithm is to enable the representation of high-dimensional data and more specifically grouping and visualizing this data according to a given target. For that purpose, the data will be projected into a vectorial space equipped with an appropriate metric, able to group data according to their affinity (with respect to a given output of interest). Furthermore, another application of this algorithm lies on its prediction capability. As it occurs with most common data-mining techniques such as regression trees, by giving an input the output will be inferred, in this case considering the nature of the data formerly described. In order to illustrate its potentialities, two different applications will be addressed, one concerning the representation of high-dimensional and categorical data and another featuring the prediction capabilities of the algorithm.
Fichier principal
Vignette du fichier
1-s2.0-S1631072119301731-main.pdf (1.2 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-02390399 , version 1 (04-12-2019)

Licence

Paternité - Pas d'utilisation commerciale - Pas de modification

Identifiants

Citer

Clara Argerich Martin, Rubén Ibáñez Pinillo, Anaïs Barasinski, Francisco Chinesta. Code2vect: An efficient heterogenous data classifier and nonlinear regression technique. Comptes Rendus Mécanique, 2019, 347 (11), pp.754-761. ⟨10.1016/j.crme.2019.11.002⟩. ⟨hal-02390399⟩
94 Consultations
56 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More