Lydia, E. Laxmi and Kannan, S. and SumanRajest, S. and Satyanarayana, S. (2021) Correlative study and analysis for hidden patterns in text analytics unstructured data using supervised and unsupervised learning techniques. International Journal of Cloud Computing, 9 (2/3). p. 150. ISSN 2043-9989
![[thumbnail of 2132.pdf]](https://ir.vistas.ac.in/style/images/fileicons/text.png)
2132.pdf
Download (305kB)
Abstract
Abstract: Two-third of the data generated by the internet is unstructured text in
the form of e-mails, audio, video, pdf files, word documents, text documents.
Extraction of these unstructured text patterns using mining techniques achieve
quick access to outcomes. Textual data available at online contains different
patterns and when those huge incoming unstructured data enters into the system
creates a problem while organising those documents into meaningful groups.
This paper discusses document classification using supervised learning by
focusing on the concept-based algorithm and also deals with the hiddenpatterns in the documents using unsupervised clustering technique andtopic-based modelling for the analysis and improvement of systematicarrangement of documents by applying k-means and LDA algorithm. Finally,this presents comparative study and importance of clustering than classification
for unstructured documents.
Item Type: | Article |
---|---|
Subjects: | Computer Science > Design and Analysis of Algorithm |
Divisions: | Computer Science Engineering |
Depositing User: | Mr IR Admin |
Date Deposited: | 10 Sep 2024 10:42 |
Last Modified: | 10 Sep 2024 10:42 |
URI: | https://ir.vistas.ac.in/id/eprint/5459 |