ISSN 2079-3537

Scientific Visualization, 2019, volume 11, number 4, pages 13-26, DOI: 10.26583/sv.11.4.02

Approaches to visualizing big text data at the stage of collection and pre-processing

Authors: E. A. Makarova¹, D. G. Lagerev², F. Y. Lozbinev³

Bryansk State Technical University

¹ ORCID: 0000-0002-5410-5890, m4karova.e@yandex.ru

² ORCID: 0000-0002-2702-6492, LagerevDG@mail.ru

³ ORCID: 0000-0002-8745-6910

Abstract

This paper describes text data analysis in support of management decision making. We examine in detail the collection of text data for further analysis and the use of visualization to make personnel more efficient during data collection and pre-processing. We propose a modification of the algorithm for creating an “n-gram cloud” visualization that makes the visualization accessible to people with visual impairments, as well as a method for visualizing vector representation models (word embeddings) of n-grams. Based on this research, we implemented the part of a software package that is responsible for creating interactive visualizations in a browser and interacting with them.

Keywords: visualization, natural language processing, web application accessibility.