JRC EuroVoc Indexer - JEX
|
| Language |
Version |
Indexing
(basic) |
Indexing and Training
(advanced) |
|---|---|---|---|
bg |
1.0 |
download (18 MB)
|
download (89 MB) |
cs |
1.0 |
download (20 MB)
|
download (75 MB) |
da |
1.0 |
download (29 MB)
|
download (116 MB) |
de |
1.0 |
download (32 MB)
|
download (131 MB) |
el |
1.0 |
download (27 MB)
|
download (156 MB) |
en |
1.0 |
download (15 MB)
|
download (99 MB) |
es |
1.0 |
download (17 MB)
|
download (110 MB) |
et |
1.0 |
download (21 MB)
|
download (72 MB) |
fi |
1.0 |
download (35 MB)
|
download (121 MB) |
fr |
1.0 |
download (24 MB)
|
download (117 MB) |
hu |
1.0 |
download (14 MB)
|
download (72 MB) |
it |
1.0 |
download (25 MB)
|
download (117 MB) |
lt |
1.0 |
download (18 MB)
|
download (117 MB) |
lv |
1.0 |
download (19 MB)
|
download (72 MB) |
mt |
1.0 |
download (16 MB)
|
download (68 MB) |
nl |
1.0 |
download (25 MB)
|
download (117 MB) |
pl |
1.0 |
download (18 MB)
|
download (76 MB) |
pt |
1.0 |
download (24 MB)
|
download (116 MB) |
ro |
1.0 |
download (22 MB)
|
download (119 MB) |
sk |
1.0 |
download (18 MB)
|
download (75 MB) |
sl |
1.0 |
download (18 MB)
|
download (70 MB) |
sv |
1.0 |
download (28 MB)
|
download (115 MB) |
You can find more information on JEX in the documents listed below, depending on your interests and needs.
The user manual gives an easy-to-understand overview of the software and explains how to use it, step by step:
The following document, published in 2012, explains JEX, its history and possible uses. It describes the documents JEX was trained on, gives an overview of the indexing methodology and presents automatic evaluation results for all 22 languages. It also explains how to use JEX:
This third document, mostly targeted at the scientific community, explains the categorisation algorithm in more depth and also describes the results of a manual evaluation of the automatic classification, performed by specialised human EuroVoc indexers, for English and Spanish documents.
You find many more related publications on the publications page of the JRC's Language Technology website.
We would like to thank Bruno Pouliquen, who has developed a major part of the main assignment method, and Mladen Kolar, who has implemented an initial Java version of the tool. We would like to mention the support of Victoria Fernandez-Mera from the Spanish Congress of Deputies and Elisabet Lindkvist from the Swedish Riksdagen, who gave us a lot of advice on practices relating to manual EuroVoc indexing and who helped us to thoroughly evaluate the software. Finally, we are grateful to the Publications Office of the European Commission for having provided their collection of manually EuroVoc-indexed documents. The initial work on JEX was funded as a JRC Exploratory Research Project. The preparation of the first public release of JEX, in May 2012, was partially funded under the JRC’s Innovative Project Competition scheme.
Keywords (English, German, French):
EuroVoc, automatic EuroVoc indexing, multilingual, multilingual classification, multilingual categorization, controlled vocabulary indexing, official European Union languages; Klassifikation von Dokumenten, kontrolliertes Vokabular, Mehrsprachigkeit, automatische Verschlagwortung, sprachübergreifend, Computer-Linguistik, Taxonomie, Ontologie; indexation de documents, linguistique informatique, multilingue, traitement du langage naturel, linguistique, vocabulaire contrôlé, thésaurus.
Please send comments on this page to Ralf Steinberger (Email address format: Firstname.Lastname@jrc.ec.europa.eu)
Last update: 15 May 2012