Analysing effective methodologies used for text clustering using weighted algorithms

Sree Dharinya S

Profiles Research Units Publications

Articles

Analysing effective methodologies used for text clustering using weighted algorithms

Published in Asian Research Publishing Network

2018

Volume: 13

Issue: 3

Pages: 1145 - 1149

Abstract

Clustering of text documents is an important technique for enhancing automated learning. Matching is the technique used in order to relate or match the various set of related documents. Clustering groups a set of documents which are similar and dissimilar for unsupervised learning where the user has learning materials which are from raw data which requires further classification. Established feature extraction strategies intend to change over the representation of the major dimensional data set into a lower-dimensional informational collection by anticipating process through mathematical changes. The concept of feature clustering is to aggregate the features into clusters with a high level of pair wise semantic relations. Each cluster is dealt as a single new feature, and, hence, feature dimensionality can be radically lessened. HAC, K-Means, TF/IDF-weighted vectors and cosine similarities is used for the various vectors of data and is applied to text in a direct way to optimize the vectors. © 2006-2018 Asian Research Publishing Network (ARPN).

About the journal

Journal	ARPN Journal of Engineering and Applied Sciences
Publisher	Asian Research Publishing Network
ISSN	18196608
Open Access	No

Authors (1)

Sree Dharinya S

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT