NLP Based Text Analytics and Visualization of Political Speeches

Published in Elsevier Scopus - International Journal of Recent Technology and Engineering (IJRTE), 2019

Received the BEST PAPER AWARD at NCRAC-2019, Coimbatore

View paper here

Abstract

This paper presents a case study of implementing computational methods like Natural Language Processing (NLP) to perform Text Analytics and Visualization on political speech transcripts. The speech transcripts are published on websites, social media, and documents in large volumes and multiple languages. These transcripts are available in unstructured textual format and thus they are a part of big-data requiring analytics to derive insights from it. In this experiment, a significantly large volume of speech transcripts are analyzed and graphical visualizations are generated such as Lexical Dispersion Plot, Time Series Plot, WordCloud, Bar-Graphs using various Python libraries. The study has been useful in identifying issues highlighted across a large number of speech transcripts. So far, the linguists have tried to perform analysis using manual linguistic approaches which are extremely time-consuming and complex to understand the Political Discourse. Our experiment of applying NLP based text analytics proves to be a very efficient technique for Political Discourse Analysis (PDA).