| HAL : halshs-00278312, version 1 |
| Fiche détaillée | Export this paper |
|
|
| JADT, Rome : Italy (1995) |
|
|
|
|
| ON COUNTING MEANINGFUL UNITS IN TEXTS |
|
|
| Maurice Gross 1 |
|
|
| (1995) |
|
|
| We analyse a sample text. By identifying compounds and other sequences of words between which strong dependencies hold, we embed simple words that have no meaning by themselves into larger units that do carry specific meaning. Hence, the counts of simple words, and those of the units marked by our method become quite different. The analysis presented is operational to a large extent. |
|
|
|
|
|
|
|
|
|
|
| 1 : | Laboratoire d'automatique documentaire et linguistique (LADL) |
| CNRS : UMR247 – Université Paris VII - Paris Diderot | |
|
|
|
|
|
|
|
|
| Discipline | : | Humanities and Social Sciences/Linguistics Computer Science/Document and Text Processing |
|
|
| Electronic dictionaries – Electronic Grammars – Parsing – Compound Words – Units of Meaning – Corpus Analysis |
|
|
| Liste des fichiers attachés à ce document : | |||||
|
|
|
| halshs-00278312, version 1 | |
| http://halshs.archives-ouvertes.fr/halshs-00278312 | |
| oai:halshs.archives-ouvertes.fr:halshs-00278312 | |
| Contributeur : Eric Laporte | |
| Submitted on : Sunday, 11 May 2008 15:50:23 | |
| Updated on : Tuesday, 13 May 2008 10:01:58 | |