DP

Hi, I'm Akib Sadmanee — a computer scientist with avid interest in Natural Language Processing, a subfield of artificial intelligence that makes computers understand human languages. I work as a Graduate Research Assistant in the daprtment of Information and Computer Science at the University of Hawai'i at Mānoa under the supervision of Dr. Mahdi Belcaid. My research focus is structured text summarization of topic specific multi-documents. This is an essential area of research within the field of natural language processing. It pertains to the task of generating a concise and coherent summary from a collection of documents that are focused on a particular subject matter.

Multi-document vs Single-Document text summary

Multi-documents text summarization is substantially different from single document summarization. When working with text data from multiple documents, we need to consider some issues unique to multi-document text summarization.

1. Different opinions from different authors

Different research papers on the same topic may propose contradicting ideas. When summarizing the research papers, we can not say A is True. and A is False. without the context.

2. Same opinion in multiple documents

Different research papers may focus on the same topic. When writing a concise summary, we should not overwhelm the summary with similar Information.

3. Coherence between ideas

When multiple documents introduce multiple different ideas, it becomes really hard to maintain a harmonious coherence and logical flow between the ideas.

4. Control over content

Different documents may talk about different aspects of a topic and it is important to have control over what gets included in the summary and what not which is most of the time not the case for single-document summaries.


Thus, multi-documents text summarization can be considered more complicated than single document text summarization. Even though chatGPT is doing an incredible job at doing text summarization when the document is small enough, it still fails to summarize large volume of text from multiple text documents. Nor does it allow the user to have any control over what to include in the summary and what to exclude. This makes the research on multi-documents text summarization still an important problem to solve.