IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Combining Machine Learning and Natural Language Processing for Language-Specific, Multi-Lingual, and Cross-Lingual Text Summarization: A Wide-Ranging Overview

Combining Machine Learning and Natural Language Processing for Language-Specific, Multi-Lingual, and Cross-Lingual Text Summarization: A Wide-Ranging Overview
View Sample PDF
Author(s): Luca Cagliero (Politecnico di Torino, Italy), Paolo Garza (Politecnico di Torino, Italy) and Moreno La Quatra (Politecnico di Torino, Italy)
Copyright: 2020
Pages: 31
Source title: Trends and Applications of Text Summarization Techniques
Source Author(s)/Editor(s): Alessandro Fiori (Candiolo Cancer Institute – FPO, IRCCS, Italy)
DOI: 10.4018/978-1-5225-9373-7.ch001

Purchase


Abstract

The recent advances in multimedia and web-based applications have eased the accessibility to large collections of textual documents. To automate the process of document analysis, the research community has put relevant efforts into extracting short summaries of the document content. However, most of the early proposed summarization methods were tailored to English-written textual corpora or to collections of documents all written in the same language. More recently, the joint efforts of the machine learning and the natural language processing communities have produced more portable and flexible solutions, which can be applied to documents written in different languages. This chapter first overviews the most relevant language-specific summarization algorithms. Then, it presents the most recent advances in multi- and cross-lingual text summarization. The chapter classifies the presented methodology, highlights the main pros and cons, and discusses the perspectives of the extension of the current research towards cross-lingual summarization systems.

Related Content

Luca Cagliero, Paolo Garza, Moreno La Quatra. © 2020. 31 pages.
Amal M. Al-Numai, Aqil M. Azmi. © 2020. 29 pages.
Junsheng Zhang, Wen Zeng. © 2020. 27 pages.
Mohamed Atef Mosa. © 2020. 37 pages.
Sandhya P., Mahek Laxmikant Kantesaria. © 2020. 25 pages.
Xin Zhao, Zhe Jiang, Jeff Gray. © 2020. 36 pages.
Jochen L. Leidner. © 2020. 29 pages.
Body Bottom