Project Details

Project Name: A Similarity Measure for Text Classification and Clustering

Project Code: ILSW005DM

Summary:

Measuring the similarity between documents is an important operation in the text processing field. In this paper, a new similarity measure is proposed. To compute the similarity between two documents with respect to a feature, the proposed measure takes the following three cases into account: a) the feature appears in both documents, b) the feature appears in only one document, and c) the feature appears in none of the documents. For the first case, the similarity increases as the difference between the two involved feature values decreases. Furthermore, the contribution of the difference is normally scaled. For the second case, a fixed value is contributed to the similarity. For the last case, the feature has no contribution to the similarity. The proposed measure is extended to gauge the similarity between two sets of documents. The effectiveness of our measure is evaluated on several real-world data sets for text classification and clustering problems. The results show that the performance obtained by the proposed measure is better than that achieved by other measures.
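
The three cases above can be sketched in a few lines of Python. The summary does not give the formula, so this is only a minimal illustration under stated assumptions: case a uses a Gaussian-shaped term (one reading of "normally scaled"), case b subtracts a fixed penalty `lam`, and the total is averaged over the features present in at least one document and rescaled to [0, 1]. The names `similarity`, `sigma` and `lam` are hypothetical, and `sigma` is assumed to hold one positive scale value per feature.

```python
import math


def similarity(d1, d2, sigma, lam=1.0):
    """Illustrative three-case similarity between two feature vectors.

    d1, d2 : non-negative feature values (e.g. term weights), same length.
    sigma  : assumed per-feature scale, each value > 0.
    lam    : assumed fixed penalty for a feature present in only one document.
    """
    if not (len(d1) == len(d2) == len(sigma)):
        raise ValueError("d1, d2 and sigma must have the same length")

    score = 0.0   # accumulated per-feature contributions
    active = 0    # number of features present in at least one document

    for x, y, s in zip(d1, d2, sigma):
        if x > 0 and y > 0:
            # Case a: present in both documents. The contribution grows
            # as the difference between the two values shrinks.
            score += 0.5 * (1.0 + math.exp(-((x - y) / s) ** 2))
            active += 1
        elif x > 0 or y > 0:
            # Case b: present in only one document. Fixed contribution.
            score -= lam
            active += 1
        # Case c: absent from both documents. No contribution.

    if active == 0:
        # Assumed convention for two empty documents.
        return 0.0

    # The average lies in [-lam, 1]; shift and scale it to [0, 1].
    return (score / active + lam) / (1.0 + lam)
```

Under these assumptions, two identical documents score 1, two documents with no shared features score 0, and features absent from both documents leave the result unchanged.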

More Details

Technologies Used: ASP.NET MVC, MS SQL, JavaScript, HTML, CSS, Bootstrap, Entity Framework
Modules: NA
Algorithm Used: NA
