NewsExplorer
MedISys
NewsBrief
EMM-Labs
Together since 1957 – 50th anniversary


Ralf  Steinberger

European Commission
Joint Research Centre - Ispra site
Institute for the Protection and Security of the Citizen (IPSC)

T.P. 267
21027 Ispra (VA), Italy

Ralf.Steinberger _@_ jrc.ec.europa.eu  (spam protection)

http://langtech.jrc.ec.europa.eu
http://emm.newsbrief.eu/overview.html
http://emm.newsexplorer.eu/
http://medusa.jrc.it/

Tel: +39 - 0332 78 6271 + 5648
Fax: +39 - 0332 78 5154

 

How NOT to spell my name:
Ralph Steinberger, Ralph Steinburger, Ralph Stienberger, Ralph Stienburger,

Ralf Steinburger, Ralf Stienburger, Ralf Steimberger, Ralf Stijnberger, Ralf Steimbergher,

Ralf Steinbergher, Ralf Stainbergher, Ralf Steinberg, Ralf Stienberg, Ralf Stienburg.

 

Professional Profile    Publications  
Professional Experience    Reports  
University Education   Hobbies and Interests
Languages Disclaimer

 

Professional Profile

I am a computational linguist with specialisation in multilingual and cross-lingual applications. My personal aim has always been to apply scientific knowledge to produce applications for real-life environments. Rather than aiming for monolingual performance optimisation, our focus always is on higher language coverage (typically 10 to 20 languages).


As a linguist and scientist, I have worked in the framework of different grammar theories, but I believe that systems using statistical and heuristic information in addition to linguistic knowledge can achieve better results. I started my LT career working with rule-based approaches (machine translation). The JRC requirement of covering many languages while working in a small team led me to statistical and Machine Learning approaches.


From an application point of view, I worked on machine translation, computer-assisted language learning, dictionary conversion, multilingual document generation, information extraction, keyword assignment and classification. Furthermore, I have initiated and supervised work on multilingual sentiment analysis (opinion mining), information extraction (including named entity recognition, event and relation extraction, geo-tagging, quotation extraction), document clustering and classification, document navigation, visualisation of extracted data, language recognition, summarisation, and various social networks, based on different types of input extracted from text.


Professional Experience


1998 - today

Language Technology Project Manager at the European Commission's Joint Research Centre (JRC) in Ispra (I).


Main focus: News analysis, multilingual document retrieval, information extraction and information visualisation, using mainly multilingual thesauri and nomenclatures, as well as statistical and Machine Learning techniques.


Our tool set includes multilingual tools for language recognition; automatic keyword identification; cross-lingual assignment of thesaurus indexing terms; the identification and disambiguation of geographical references in text (geo-tagging); creation of animated geographical maps on the basis of place names mentioned in text; document similarity calculation; clustering; classification; summarisation; terminology extraction; named entity recognition; event recognition, visualisation of the contents of large document collections and document navigation, sentiment analysis, software for document retrieval using a web crawling software agent. (See the JRC's Language Technology page)

1994 - 1997

Senior Research Scientist at Sharp Laboratories of Europe (SLE) in Oxford, UK: responsible for Language Technology products (multilingual document generator and multilingual phrase book); data mining from electronic dictionaries, machine translation, summarisation.

IV - VII 1994

Research Fellow at Kyushu Institute of Technology (KIT) in Iizuka, Japan.

1991 - 1994

Research Associate at the Language Technology Department of the University of Manchester - Institute of Science and Technology (UMIST) in the UK: machine translation and computer-assisted language learning (CALL).


Deputy head and technical manager of the UMIST teams of EC-funded MLAP machine translation projects TRADE and CAT2-EDS.

1991

Visiting Scientist at Institute for Applied Information Science (IAI) in Saarbrücken (D): machine translation.

1986 - 1990

Production management, marketing, public relations, sales promotion in the industrial rubber foam company PANA Schaumstoff GmbH in Geretsried (D)
(part time / full time).

1984 - 1985

Teacher assistant at Lycée Louis-Le-Grand in Paris (F).

1981 - 1982

All-round executive training in the textiles firm PANA Werk KG in Wolfratshausen (D).


University Education


1992 - 1994

Ph.D. in Computational Linguistics at University of Manchester - Institute of Science and Technology (UMIST) in Manchester (UK): A Study of Word Order Variation in German with special Reference to Modifier Placement. (read PDF version)

1983 - 1991

Magister Artium (M.A.) ‘with distinction’ in Theoretical Linguistics with French linguistics and Spanish linguistics at Ludwig Maximilians Universität (LMU) München (D), in parallel to working. Studies in Berlin and Munich.

1980

Abitur with specialisation in French and Mathematics at Gymnasium Pullach (D).


Language Skills


German

Native language

English

7 years at high-school level, lived in the UK for 6 years.

French

7 years at high-school level, lived in France for ~ 24 months; university studies of French linguistics, language, literature and mediaeval studies.

Italian

Courses at university and at the JRC; living in Italy since 1998.

Spanish

4 months of intensive, full-time language courses in Spain. University studies of Spanish linguistics, language, literature and mediaeval studies.

Japanese

Basic notions of the grammar and of the writing systems.


Hobbies and Interests


I am interested in projects to support developing countries. Besides engaging in private funding activities, I am an active member of the charitable EC – JRC organisation Association Europe – Third World (Europa - Terzo Mondo, ETM).


My favourite sports are table tennis (see me play with various partners: 1, 2, 3), volleyball and tennis. I also like to play softball and Frisbee.

 

I love to travel, especially long-distance, or to experience a country through longer work-related stays.

 

I am interested in photography, and there especially in capturing people. A few photos from trips to Senegal (two trips), Cameroon, India, Mali, Ethiopia, Zambia and more are online.

 

I like playing chess. I am interested in cinema, going to the theatre and visiting art galleries, although our children recently did not leave a lot of time for these activities.

 

I actively enjoy meeting new people at social gatherings.

Publications      (Please contact the author for papers not available here) (Look on Google Scholar)

  • Tanev Hristo, Bruno Pouliquen, Vanni Zavarella & Steinberger Ralf (2010). Automatic Expansion of a Social Network Using Sentiment Analysis. In: Annals of Information Systems, Special Issue on Data Mining for Social Network Data.
  • Kabadjov Mijail, Josef STeinberger, Ralf Steinberger, Massimo Poesio & Bruno Pouliquen (2010). Enhancing N-gram-based Summary Evaluation Using Information Content and a Taxonomy. In: Proceedings of the 32nd European Conference on Information Retrival (ECIR'2010). Milton Keynes, UK, 28-31 March 2010.
  • Steinberger Ralf & Bruno Pouliquen (2009). Cross-lingual Named Entity Recognition. In: Satoshi Sekine & Elisabete Ranchhod (eds.): Named Entities - Recognition, Classification and Use, Benjamins Current Topics, Volume 19, pp. 137-164. John Benjamins Publishing Company. ISBN 978-90-272-8922 3. (Order online)
  • Steinberger Ralf, Bruno Pouliquen & Erik van der Goot (2009). An Introduction to the Europe Media Monitor Family of Applications. In: Fredric Gey, Noriko Kando & Jussi Karlgren (eds.): Information Access in a Multilingual World - Proceedings of the SIGIR 2009 Workshop (SIGIR-CLIR'2009), pp. 1-8. Boston, USA. 23 July 2009. (PDF)
  • Pouliquen Bruno & Ralf Steinberger (2009). Automatic Construction of Multilingual Name Dictionaries. In: Cyril Goutte, Nicola Cancedda, Marc Dymetman & George Foster (eds.): Learning Machine Translation. MIT Press - Advances in Neural Information Processing Systems Series (NIPS). (Order online)
  • Koehn Philipp, Alexandra Birch & Ralf Steinberger (2009). 462 Machine Translation Systems for Europe. In: Laurie Gerber, Pierre Isabelle, Roland Kuhn, Nick Bemish, Mike Dillinger & Marie-Josée Goulet (eds.): Proceedings of the Twelfth Machine Translation Summit (MT-Summit XII), pages 65-72. Ottawa, Canada, 26-30 August 2009. (PDF)
  • Tanev Hristo, Vanni Zavarella, Jens Linge, Mijail Kabadjov, Jakub Piskorski, Martin Atkinson & Ralf Steinberger (2009). Exploiting Machine Learning Techniques to Build an Event Extraction System for Portuguese and Spanish. In: linguaMÁTICA Journal:2, pp. 55-66. Available at: http://linguamatica.com/index.php/linguamatica/article/view/37.
  • Balahur-Dobrescu Alexandra & Ralf Steinberger (2009). Rethinking sentiment analysis in the news: from theory to practice and back. 'Workshop on Opinion Mining and Sentiment Analysis' (WOMSA), held at the 2009 CAEPIA-TTIA 13th Conference of the Spanish Association for Artificial Intelligence, pp. 1-12. Sevilla, Spain, 13.11.2009. (PDF)
  • Steinberger Ralf (2009). Preface. In: Tadić Marco, Bojana Dalbelo Bašić, Marie-Francine Moens (eds.): Technologies for the Processing and Retrieval of Semi-Structured Documents - Experience from the CADIAL Project, pp. vii-ix. Croatian Language Technologies Society, Zagreb, Croatia. (Table-of-Contents; Cover)
  • Balahur-Dobrescu Alexandra, Mijail Kabadjov, Josef Steinberger, Ralf Steinberger & Andrés Montoyo (2009). Summarizing Opinions in Blog Threads. Proceedings of the 23rdPACLIC), pp. 606-613, Hong Kong, 3-5 December 2009.
  • Kabadjov Mijail, Josef Steinberger, Bruno Pouliquen, Ralf Steinberger & Massimo Poesio (2009). Multilingual Statistical News Summarisation: Preliminary Experiments with English. Proceedings of 'IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology', pp. 519-522; Workshop 'Intelligent Analysis and Processing of Web News Content' (IAPWNC). Milano, Italy, 15.09.2009. (PDF)
  • Balahur Alexandra, Ralf Steinberger, Erik van der Goot, Bruno Pouliquen & Mijail Kabadjov (2009). Opinion Mining on Newspaper Quotations. Proceedings of 'IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology', pp. 523-526; Workshop 'Intelligent Analysis and Processing of Web News Content' (IAPWNC). Milano, Italy, 15.09.2009. (PDF)
  • Steinberger Ralf (2009). Linking News Content Across Languages. In: Kristiina Jokinen & Eckhard Bick (eds.) NEALT Proceedings Series Vol.4 - Proceedings of the 17th Nordic Conference of Computational Linguistics (NODALIDA'2009), p. 4-5, Odense, Denmark, 14-16 May 2009.
  • Linge Jens, Ralf Steinberger, Thomas Weber, Roman Yangarber, Erik van der Goot, Delilah Al Khudhairy & Nikolaos Stilianakis (2009). Internet Surveillance Systems for Early Alerting of Health Threats. EuroSurveillance Vol. 14, Issue 13. Stockholm, Sweden, 2 April 2009. (PDF)
  • Yangarber Roman, Peter von Etter & Ralf Steinberger (2009). Automatic Epidemiological Surveillance from On-line News in MedISys and PULS. Proceedings of the International Meeting on Emerging Diseases and Surveillance (IMED'2009), Vienna, 13-16 February 2009.
  • Norguet Jean-Pierre, Esteban Zimányi & Ralf Steinberger (2009). Semantic analysis of web site audience by integrating web usage mining and web content mining. In I-Hsien Ting (editor): Web Mining Applications in E-commerce and E-services, Springer Verlag book series Studies in Computational Intelligence, October 2008. (Purchase online)
  • Steinberger Ralf, Pouliquen Bruno & Camelia Ignat (2008). Using language-independent rules to achieve high multilinguality in Text Mining. In: Fogelman-Soulié Françoise, Domenico Perrotta, Jakub Piskorski & Ralf Steinberger (eds.): Mining Massive Data Sets for Security. pp. 217-240. IOS Press, Amsterdam, The Netherlands. (PDF)
  • Steinberger Ralf, Flavio Fuart, Erik van der Goot, Clive Best, Peter von Etter & Roman Yangarber (2008). Text Mining from the Web for Medical Intelligence. In: Fogelman-Soulié Françoise, Domenico Perrotta, Jakub Piskorski & Ralf Steinberger (eds.): Mining Massive Data Sets for Security. pp. 295-310. IOS Press, Amsterdam, The Netherlands. (PDF)
  • Fogelman-Soulié Françoise , Perrotta Domenico, Jakub Piskorski & Ralf Steinberger (eds.) (2008): Mining Massive Data Sets for Security. IOS Press, Amsterdam, The Netherlands. (Purchase online)
  • Best Clive, Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger & Hristo Tanev (2008). Automatic Event Extraction for the Security Domain. In: Intelligence and Security Informatics - Techniques and Applications, Volume 135/2008, pp. 17-43, Studies in Computational Intelligence Series, Springer, Heidelberg/New York. (Purchase online)
  • Pouliquen Bruno & Ralf Steinberger (2008). Story tracking: linking similar news over time and across languages . In Proceedings of the 2nd workshop Multi-source Multilingual Information Extraction and Summarization (MMIES'2008) held at CoLing'2008. Manchester, UK, 23 August 2008. (PDF)
  • Atkinson Martin, Jakub Piskorski, Bruno Pouliquen, Ralf Steinberger, Hristo Tanev & Vani Zavarella (2008). Online-monitoring of security-related events. In Proceedings of the 22nd International Conference on Computational Linguistics (CoLing'2008). Manchester, UK, 18-22 August 2008. (PDF)
  • Steinberger Ralf, Flavio Fuart, Bruno Pouliquen & Erik van der Goot (2008). MedISys: A Multilingual Media Monitoring Tool for Medical Intelligence and Early Warning. In: Proceedings of the International Disaster and Risk Conference (IDRC'2008), pp. 612-614, Davos, Switzerland. (PDF)
  • Yangarber Roman, Peter von Etter & Ralf Steinberger (2008). Content Collection and Analysis in the Domain of Epidemiology. In Proceedings of the 1st international MIE'2008 workshop on describing medical web resources (DRMed), held at the 21st International Congress of the European Federation for Medical Informatics. Göteborg, Sweden, 27 May 2008. (PDF)
  • Steinberger Ralf & Bruno Pouliquen (2007). Cross-lingual Named Entity Recognition. In: Satoshi Sekine & Elisabete Ranchhod (eds.) Journal Linguisticae Investigationes, Special Issue on Named Entity Recognition and Categorisation, LI 30:1, pp. 135-162. John Benjamins Publishing Company. ISSN 0378-4169. (Purchase online)
  • Pouliquen Bruno, Ralf Steinberger, Clive Best (2007). Automatic detection of quotations in multilingual news. Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP'2007), pp. 487-492. Borovets, Bulgaria, 27-29 September 2007. (PDF)
  • Pouliquen Bruno, Ralf Steinberger, Jenya Belyaeva (2007). Multilingual multi-document continuously updated social networks. Proceedings of the Workshop Multi-source Multilingual Information Extraction and Summarization (MMIES'2007) held at RANLP'2007, pp. 25-32. Borovets, Bulgaria, 26 September 2007. (PDF)
  • Yangarber Roman, Clive Best, Peter von Etter, Flavio Fuart, David Horby & Ralf Steinberger (2007). Combining Information about Epidemic Threats from Multiple Sources. Proceedings of the Workshop Multi-source Multilingual Information Extraction and Summarization (MMIES'2007) held at RANLP'2007, pp. 41-48. Borovets, Bulgaria, 26 September 2007.
  • Pouliquen Bruno & Ralf Steinberger (2007). Acquisition and Use of Multilingual Name Dictionaries. Proceedings of the Workshop Acquisition and Management of Multilingual Lexicons (AMML'2007) held at RANLP'2007. Borovets, Bulgaria, 26 September 2007.
  • Piskorski Jakub, Hristo Tanev, Bruno Pouliquen & Ralf Steinberger (eds.) (2007). Proceedings of the Workshop on Balto-Slavonic Natural Language Processing 2007 (BSNLP'2007) - Special Theme: Information Extraction and Enabling Technologies. Held at the 45th Annual Meeting of the Association for Computational Linguistics (ACL'2007). Prague, Czech Republic, 29 June 2007. (PDF of the Preface) (Full BSNLP Proceedings)
  • Steinberger Ralf,  Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaž Erjavec, Dan Tufiş, Dániel Varga (2006). The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages. Proceedings of the 5thInternational Conference on Language Resources and Evaluation (LREC'2006), pp. 2142-2147. Genoa, Italy, 24-26 May 2006.
  • Pouliquen Bruno, Marco Kimler, Ralf Steinberger,  Camelia Ignat, Tamara Oellinger, Ken Blackler, Flavio Fuart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best (2006). Geocoding multilingual texts: Recognition, Disambiguation and Visualisation. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'2006), pp. 53-58. Genoa, Italy, 24-26 May 2006.
  • Norguet Jean-Pierre, Esteban Zimányi & Ralf Steinberger (2006). Semantic analysis of web site audience. 21st Annual ACM Symposium on Applied Computing (ACM SAC'2006), Dijon, France, 23-27.04.2006. Pages 525-529.
  • Žižka Jan, Jiří Hroza, Bruno Pouliquen, Camelia Ignat & Ralf Steinberger (2006). The selection of electronic text documents supported by only positive examples. Proceedings of the 8th International Conference on the Statistical Analysis of Textual Data (JADT'2006). Besançon, 19-21 April 2006.
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat & Tamara Oellinger (2006). Building and displaying name relations using automatic unsupervised analysis of newspaper articles. Proceedings of the 8th International Conference on the Statistical Analysis of Textual Data (JADT'2006). Besançon, 19-21 April 2006.
  • Best Clive, Bruno Pouliquen, Ralf Steinberger, Eric van der Goot, Ken Blackler, Flavio Fuart, Tamara Oellinger & Camelia Ignat (2006). Towards automatic event tracking. In: Sharad Mehrota, Daniel Zeng, Hsinchun Chen, Bhavani Thuraisingham & Fei-Yue Wang (Eds.): Intelligence and Security Informatics - Proceedings of IEEE International Conference on Intelligence and Security Informatics (ISI'2006), San Diego, California, USA, 23-24.05.2006. Lecture Notes in Computer Science, LNCS 3975, pp. 26-34. Springer-Verlag, Berlin Heidelberg, New York. ISBN: 978-3-540-34478-0.
  • Norguet Jean-Pierre, Esteban Zimányi & Ralf Steinberger (2006). Improving web sites with web usage mining, web content mining, and semantic analysis. In: Jirí Wiedermann, Gerard Tel, Jaroslav Pokorný, Mária Bieliková, Július Štuller (Eds.): SOFSEM 2006: Theory and Practice of Computer Science. 32nd Conference on Current Trends in Theory and Practice of Computer Science, Merin, Czech Republic, 21.-27.01.2006. Lecture Notes in Computer Science, LNCS 3831, pages 430-439. ISBN: 978-3-540-31198-0. Springer-Verlag, Berlin, Heidelberg, New York.
  • Steinberger Ralf, Bruno Pouliquen, Camelia Ignat (2005). Navigating multilingual news collections using automatically extracted information. Journal of Computing and Information Technology - CIT 13, 2005, 4, 257-264. Available online at: http://cit.zesoi.fer.hr/downloadPaper.php?paper=767. ISSN: 1330-1136.
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Irina Temnikova, Anna Widiger, Wajdi Zaghouani & Jan Žižka (2005). Multilingual person name recognition and transliteration. Journal CORELA - Cognition, Représentation, Langage. Numéros spéciaux, Le traitement lexicographique des noms propres. Available online at: http://edel.univ-poitiers.fr/corela/document.php?id=490. ISSN 1638-5748.
  • Erjavec Tomaž, Camelia Ignat, Bruno Pouliquen & Ralf Steinberger (2005). Massive multilingual corpus compilation: Acquis Communautaire and totale. Journal Archives of Control Sciences, Volume 15(LI), 2005, No. 4, pages 529-540.
  • Steinberger Ralf, Bruno Pouliquen, Camelia Ignat (2005). Navigating multilingual news collections using automatically extracted information. In: Vesna Lužar-Stiffler & Vesna Hljuz Dobrić (Eds.): Proceedings of the 27th International Conference 'Information Technology Interfaces' (ITI'2005), pp. 27-34. Cavtat / Dubrovnik, Croatia, June 20-23, 2005.
  • Montejo-Ráez Arturo, L. Alfonso Ureña-López & Ralf Steinberger (2005). Text categorisation using bibliographic records: beyond document content. Procesamiento del Lenguaje Natural, núm. 35 (2005), pp. 119-126. Proceedings of the 21st Conference of the Spanish Society for Natural Language Processing (SEPLN'2005). Granada, Spain, 14-16 September 2005.
  • Ignat Camelia, Bruno Pouliquen, Ralf Steinberger & Tomaž Erjavec (2005). A tool set for the quick and efficient exploration of large document collections. Proceedings of the Symposium on Safeguards and Nuclear Material Management. 27th Annual Meeting of the European SAfeguards Research and Development Association (ESARDA-2005). London, UK, 10-12 June 2005.
  • Tomaž Erjavec, Camelia Ignat, Bruno Pouliquen & Ralf Steinberger (2005). Massive multilingual corpus compilation; Acquis Communautaire and totale. In: 2nd Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics (L&T'05). Poznań, Poland, 21-23 April 2005.
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Irina Temnikova, Wajdi Zaghouani & Jan Žižka (2005). Detection of person names and their translations in multilingual news. Colloque Traitement lexicographique des noms propres, Tours, 24 March 2005.
  • Best Clive, Erik van der Goot, Ken Blackler, Teófilo Garcia, David Horby, Ralf Steinberger and Bruno Pouliquen (2005). Mapping World Events. In: Peter van Oosterom, Siyka Zlatanova & Elfriede M. Fendel (eds.) Geo-information for Disaster Management. pp. 683-696. Springer. ISBN: 3-540-24988-5.
  • Pouliquen Bruno, Ralf Steinberger & Camelia Ignat (2004). Automatic Linking of Similar Texts Across Languages. In:  N. Nicolov, K. Bontcheva, G. Angelova & R. Mitkov (eds.): Current Issues in Linguistic Theory 260 - Recent Advances in Natural Language Processing III. Selected Papers from RANLP'2003. John Benjamins Publishers, Amsterdam.
  • Steinberger Ralf, Pouliquen Bruno & Camelia Ignat (2004). Providing cross-lingual information access with knowledge-poor methods. In: Informatica. An international Journal of Computing and Informatics. Volume 28. Special Issue.
  • Montejo-Ráez Arturo & Ralf Steinberger (2004). Why keywording matters. In. High Energy Physics Libraries Webzine, Issue 10, December 2004. Available at http://library.cern.ch/HEPLW/10/papers/2/. (PDF)
  • Ralf Steinberger, Pouliquen Bruno & Camelia Ignat (2004). Exploiting Multilingual Nomenclatures and Language-Independent Text Features as an Interlingua for Cross-lingual Text Analysis Applications. In: Proceedings of the 4th Slovenian Language Technology Conference. Information Society 2004 (IS'2004). Ljubljana, Slovenia, 13-14 October 2004. (PDF)
  • Montejo-Ráez Arturo, Luís Alfonso Ureña-López, Ralf Steinberger (2004). Adaptive selection of base classifiers in one-against-all learning for large multi-labeled collections. In: J.L. Vicedo, P. Martínez-Barco, R. Muñoz et al. (eds). Advances in Natural Language Processing: 4th International Conference, España for Natural Language Processing (EsTAL'2004), Proceedings, Alicante, Spain, 20-22 October 2004. Springer Lecture Notes in Computer Science, LNCS 3230, pages 1-12. Springer-Verlag, Berlin Heidelberg. ISBN: 3-540-23498-5. (PDF)
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Emilia Käsper & Irina Temnikova (2004). Multilingual and Cross-lingual News Topic Tracking. In: Proceedings of the 20th International Conference on Computational Linguistics (CoLing'2004). Geneva, Switzerland, 23-27 August 2004. (PDF)
  • Pouliquen Bruno, Ralf Steinberger, Camelia Ignat & Tom de Groeve (2004). Geographical Information Recognition and Visualisation in Texts Written in Various Languages. In: Proceedings of the 19th Annual ACM Symposium on Applied Computing (SAC'2004), Special Track on Information Access and Retrieval (SAC-IAR), vol. 2, pp. 1051-1058. Nicosia, Cyprus, 14 - 17 March 2004.
  • Pouliquen Bruno, Ralf Steinberger & Camelia Ignat (2003). Automatic Identification of Document Translations in Large Multilingual Document Collections. In: Proceedings of the International Conference Recent Advances in Natural Language Processing (RANLP'2003), pp. 401-408. Borovets, Bulgaria, 10 - 12 September 2003. (PDF)
  • Ignat Camelia, Bruno Pouliquen, António Ribeiro & Ralf Steinberger (2003). Extending an Information Extraction Tool Set to Central and Eastern European Languages. In: Proceedings of the International Workshop Information Extraction for Slavonic and other Central and Eastern European Languages (IESL'2003), held at RANLP'2003, pp. 33-39. Borovets, Bulgaria, 8 - 9 September 2003. (PDF)
  • Pouliquen Bruno, Steinberger Ralf, Camelia Ignat (2003). Automatic Annotation of Multilingual Text Collections with a Conceptual Thesaurus. In: Proceedings of the Workshop Ontologies and Information Extraction at the Summer School The Semantic Web and Language Technology - Its Potential and Practicalities (EUROLAN'2003). Bucharest, Romania, 28 July - 8 August 2003 (PDF).
  • Steinberger Ralf, Bruno Pouliquen, Stefan Scheer & António Ribeiro (2003). Continuous Multi-Source Information Gathering and Classification. In: Proceedings of the International Conference on Computational Intelligence for Modelling, Control and Automation (CIMCA'2003). Vienna (A), 12-14 February 2003 (PDF).
  • Steinberger Ralf, Bruno Pouliquen & Johan Hagman (2002). Cross-lingual Document Similarity Calculation Using the Multilingual Thesaurus Eurovoc. In: A. Gelbukh (ed.) Computational Linguistics and Intelligent Text Processing, Third International Conference, CICLing'2002. Springer Lecture Notes in Computer Science, LNCS 2276, pp. 415-424. Mexico-City, Mexico, 17-23 February 2002. Springer-Verlag, Berlin Heidelberg. ISBN: 3-540-43219-1. (PDF).
  • Steinberger Ralf  (2001). Cross-lingual Keyword Assignment. Proceedings of the XVII Congress of the Spanish Society for Natural Language Processing (SEPLN'2001). Procesamiento del Lenguaje Natural, Revista No 27, pp. 273-280. Jaén, Spain, September 2001. ISSN 1135-5948. (PDF).
  • Steinberger Ralf, Stefan Scheer & Johan Hagman (2001). Language Engineering. ISIS Annual Report 2000, pages 47-48. Office for Official Publications of the European Communities, Luxembourg, 2001. ISBN 92-894-0602-X.
  • Steinberger Ralf, Johan Hagman & Stefan Scheer (2000). Using Thesauri for Information Extraction and for the Visualisation of Multilingual Document Collections. Proceedings of the Workshop on Ontologies and Lexical Knowledge Bases (OntoLex’2000), pp. 130-141. Sozopol, Bulgaria, September 2000. (PDF)
  • Hagman Johan, Domenico Perrotta, Ralf Steinberger & Aristide Varfis (2000). Document Classification and Visualisation to Support the Investigation of Suspected Fraud. Working Notes of the Workshop on Machine Learning and Textual Information Access (MLTIA) at the Fourth European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD’2000), 12 pages. Lyon, September 2000. (PDF, Cover)
  • Garg Anjula, Thomas Barbas & Ralf Steinberger (2000). Information Management and Computer Communications for Anti-Fraud. ISIS Annual Report 1999, pages 79-80. Office for Official Publications of the European Communities, Luxembourg, 2000. ISBN 92-828-9029-5.
  • Barbas Thomas & Ralf Steinberger (1999). New Information Infrastructures. ISIS Annual Report 1998, pages 25-26. Office for Official Publications of the European Communities, Luxembourg, 1999. ISBN 92-828-6645-9.
  • Hagman Johan, Ralf Steinberger, Domenico Perrotta & Aristide Varfis (1999). Approaches to document classification and visualisation. Working Notes of the Workshop on Text Mining at the Sixth International Joint Conference on Artificial Intelligence (IJCAI'99), pages 36-37. Stockholm, August 1999. JRC reference number: ORA 60278. (PDF)
  • Sanfilippo Antonio & Ralf Steinberger (1997): Automatic Selection and Ranking of Translation Candidates. Proceedings of the 7th Conference on Theoretical and Methodological Issues in Machine Translation: "MT Yesterday, Today, and Tomorrow" (TMI'97), Santa Fe, New Mexico, USA. (PDF)
  • Steinberger Ralf (1994): Treating `Free Word Order' in Machine Translation. In: Proceedings of the 15th International Conference for Computational Linguistics (COLING 1994), Vol. I, pp. 69-75, Kyoto, Japan. (PDF)
  • Steinberger Ralf (1994): Lexikoneinträge für deutsche Adverbien (Dictionary Entries for German Adverbs). In: Harald Trost (Hg.): Informatik Xpress 6: Tagungsband KONVENS '94 Verarbeitung natürlicher Sprache (2. Konferenz zur Verarbeitung natürlicher Sprache), pages 320-329, Vienna. (PDF)
  • Steinberger Ralf & Paul Bennett (1994): Automatic Recognition of Theme, Focus and Contrastive Stress. In: Peter Bosch & Rob van der Sandt (eds.): Focus and Natural Language Processing, Proceedings of a conference in celebration of the 10th anniversary of the Journal of Semantics, Working Paper 6 of the IBM Institute for Logic and Linguistics, Vol. I, pages 205-214, Meinhard-Schwebda (Germany). (PDF)
  • Steinberger Ralf () (1994): ヨーロッパの現在のMT活動 (Current MT Activities in Europe). In: AAMT Journal - The Asia-Pacific Association for Machine Translation, No. 7, June 1994, pages 10-14, Tokyo (An English version appeared in the English edition of the AAMT Journal). (PDF)
  • Steinberger Ralf (1994): A study of German word order in German, with special reference to modifier placement. Ph.D. Thesis, Umist, Manchester, UK. (PDF)
  • Steinberger Ralf (1993): Grenzen und Möglichkeiten der Maschinellen Übersetzung (Machine Translation: Prospects and Limitations). In: Informatik Forum - Fachzeitschrift für Informatik, Band 7, Doppelheft 1/2, 6/93, Vienna, Austria.
  • Steinberger Ralf (1992): Beschreibung der Adverbstellung im deutschen und englischen Satz im Hinblick auf Maschinelle Übersetzung (Adverb placement in German and English with special reference to Machine Translation). EUROTRA-D Working Paper 23, Saarbrücken (IAI), 2/92 (47 pages) (PDF)
  • Steinberger Ralf (1992): Der Skopus von Gradpartikeln: Seine Übersetzung und seine Implementierung im Maschinellen Übersetzungssystem CAT2 (Scope of degree modifiers: Translation and implementation in the CAT2 MT formalism). EUROTRA-D Working Paper 24, Saarbrücken (IAI), 4/92 (35 pages). (PDF)

Reports (Restricted Distribution)     Please contact the author for a copy

  • Best Clive, Ralf Steinberger & Stamatia Halkia (2007). Web Mining and Intelligence (EMM) - Support to External Security Unit. Activity Report 2005/2006. European Communities 2007. 17 pages. ISBN 92-79-03400-6.
  • Ribeiro António & Ralf Steinberger (2004). IDoRA for OLAF - Final project report. JRC Technical Note, 23 pages. March 2004.
  • Steinberger Ralf & Bruno Pouliquen (2003). Cross-lingual Indexing. Final Report for the IPSC Exploratory Research Project. JRC Internal Note, 30 pages. October 2003. (PDF)
  • Pedersen Jane & Ralf Steinberger (2002).Evaluation of Multilingual Name Recognition Software - Thing Finder (TM) 2.2. JRC Technical Note No. I.02.120, 29 pages. December 2002 (PDF).
  • Scheer Stefan, Ralf Steinberger & Giovanni Valerio (2000): A Methodology to Retrieve, to Manage, to Classify and to Query Open Source Information - Results of the OSILIA Project. JRC Technical Note No. I.01.016. 35 pages. (PDF)
  • Steinberger Ralf (2000): Evaluation of DMP's Linguistic Software - Comments on the linguistic software distributed by Document Management Partners (DMP) in Antwerp (B). Report for OLAF. 16 pages.
  • Steinberger Ralf, Johan Hagman & Thomas Barbas (2000): Modus Operandi Final Project Report – Summary and Conclusions. JRC Technical Note No. I00.88. 17 pages.
  • Steinberger Ralf (2000): Software Solutions to Overcome the Language Barrier. JRC Technical Note No. I.00.91. 10 pages.
  • Steinberger Ralf & Johan Hagman (2000): Commercial Keyword Identification and Clustering Software. JRC Technical Note No. I.00.90. 19 pages.
  • Steinberger Ralf (2000): The Free Text Field of the IRENE Database. JRC Technical Note No. I.00.89. 28 pages.
  • Steinberger Ralf (2000): Fraud-Related Multi-Word Expressions - English, French and German. Modus Operandi deliverable 7. 50 pages.
  • Hagman Johan & Ralf Steinberger (1999): Clustering of 1500 IRENE Record Text Files. Modus Operandi deliverable 15. 50 pages.
  • Steinberger Ralf (1999). Language Engineering Technologies and their use for TF-UCLAF. JRC Technical Note No. I.99.83. 28 pages.
  • Steinberger Ralf (1997): Multilingual Phrase Book Design Study - Issues regarding the extension of the Japanese-English Phrase Book to German, French, Spanish and Italian. Sharp internal document. 15 pages.
  • Johnson Ian, Osamu Nishida, Junzo Ogawa, Ralf Steinberger (1997): Multilingual Phrase Book Data Format (v. 3) - Representation of the Multilingual Phrase Book data, Sharp internal document. 23 pages.
  • Steinberger Ralf (1997): Multilingual Phrase Book Instructions (v. 2) - Task description for translators, Sharp internal document.
  • Steinberger Ralf (1997): Sharp Abridgement Machine (SAM), Sharp internal document. 9 pages.
  • Steinberger Ralf (1997): Multilingual Document Generator - Instructions, Sharp internal document. 16 pages.
  • Steinberger Ralf (1996): Conversion of machine-readable dictionaries to electronic dictionaries. Sharp internal document. 20 pages.
  • Steinberger Ralf (1995): Evaluation of the Sharp Intelligent Dictionary (SID). Sharp internal document. 11 pages.
  • Steinberger Ralf, Chris Chambers, Ingrid Weber & Blaise Nkwenti-Azeh (1994): English Coverage Definition. Internal report Nr. 6 for the MLAP project TRADE on the linguistic phenomena occurring in a legal social security text, 34 pages, Barcelona
  • Steinberger Ralf & Chris Chambers (1994): English Test Suite. Internal report Nr. 7 for the MLAP project TRADE including a suite of sentences for the testing of the TRAnslation DEmonstrator, 15 pages, Barcelona
  • Mazzini Gianpaolo, Maite Melero & Ralf Steinberger (1994): Corpus Study and Coverage Definition. Internal report for the MLAP project TRADE, Barcelona
  • Steinberger Ralf (1994): The Legal Sublanguage in the English Version of the `United Nations Convention on Contracts for the International Sale of Goods'. Report on work carried out at the Kyushu Institute of Technology for the project 法律エキスパートシステム (Legal Expert System), 50 pages, Iizuka, Japan
  • Steinberger Ralf (1993): Cost, Calculation & Financing: Description of the possible Cost Factors. In: S. Krauwer (ed.), A. Bech, B. Maegaard, M. Mendes, R. Steinberger & N. Underwood: How to produce an application - the long way from a brilliant idea to a commercial product, CCL Report 93/1, pages 19-37, Manchester (also appeared as EUROTRA internal paper, Luxembourg).
  • Steinberger Ralf (1993): Corpus Annotation and Use of Corpora. Internal Report for the CALL project of the Teaching and Learning Technology Programme, 4/93, 7 pages, Manchester
  • Steinberger Ralf & Cécile Potier (1992): How to deal with `empty' subjects in sentential verb complements, UMIST - CCL Report 92/14, Manchester (62 pages, also appeared as the final report of the EUROTRA Contrastive Research Cluster Sentential Complementation, Luxembourg).
  • Steinberger Ralf (1992): Empty subjects in sentential verb complements (French-English). Linguistics. EUROTRA intermediate report, 5/92 (30 pages), Luxembourg
  • Steinberger Ralf (1992): Empty subjects in sentential verb complements (French-English). Implementation. EUROTRA intermediate report, 9/92 (11 pages), Luxembourg
  • Steinberger Ralf (1992): Report of the Implementation of the French-English Transfer Module. EUROTRA final report, 12/92 (7 pages), Luxembourg


Site Meter

Please send comments on this page to Ralf Steinberger (Email address format: Firstname.Lastname@jrc.ec.europa.eu)

Last update:  18 January 2010