Sir, I am looking for a Java project to do as my third-year mini project. I am an engineering student and I like the topic "text clustering", so could you advise me on this project and suggest how to build it?
Hi, did you get the frequent-term-based text clustering code? Would you be willing to share it with me? My email is zhangxueson[at]gmail.com. Looking forward to your reply.
Abstract
Text-mining methods have become a key component of homeland-security technologies, as they help explore ever-growing masses of digital documents effectively in the search for relevant information. This research presents a model for document clustering that arranges unstructured documents into content-based homogeneous groups. The overall paradigm is hybrid because it combines pattern-recognition grouping algorithms with semantic-driven processing. First, a semantic-based metric measures distances between documents by combining content-based and behavioral analysis. Such a metric takes into account the lexical properties, the structure, and the styles that characterize the processed documents. In a second step, the model relies on a Radial Basis Function (RBF) kernel-based mapping to cluster the documents. As a result, the main novelty of the proposed approach is that it exploits the implicit mapping of RBF kernel functions to tackle the crucial task of normalizing similarities, while embedding semantic information in the whole mechanism. Experimental results on the Reuters and 20 Newsgroups databases validate the proposed approach.
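Since the thread asks for Java, here is a minimal sketch of the two steps the abstract describes: a distance that mixes content and style features, followed by clustering in the space induced by an RBF kernel. The abstract does not give the exact metric or the clustering algorithm, so everything specific here is an assumption for illustration: the weighted-sum distance, the parameters alpha and sigma, kernel k-means as the clustering step, and all class and method names.

```java
import java.util.Arrays;

final class RbfKernelClustering {

    /** Euclidean distance between two equal-length feature vectors. */
    static double euclidean(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double diff = a[i] - b[i];
            sum += diff * diff;
        }
        return Math.sqrt(sum);
    }

    /** Assumed hybrid metric: alpha * content distance + (1 - alpha) * style distance. */
    static double hybridDistance(double[] contentA, double[] contentB,
                                 double[] styleA, double[] styleB, double alpha) {
        return alpha * euclidean(contentA, contentB)
                + (1.0 - alpha) * euclidean(styleA, styleB);
    }

    /** RBF kernel matrix: K[i][j] = exp(-d(i, j)^2 / (2 * sigma^2)). */
    static double[][] kernelMatrix(double[][] content, double[][] style,
                                   double alpha, double sigma) {
        int n = content.length;
        double[][] k = new double[n][n];
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < n; j++) {
                double d = hybridDistance(content[i], content[j], style[i], style[j], alpha);
                k[i][j] = Math.exp(-(d * d) / (2.0 * sigma * sigma));
            }
        }
        return k;
    }

    /**
     * Kernel k-means on a precomputed kernel matrix. The squared distance from
     * document i to the mean of cluster c in the implicit feature space is
     * K[i][i] - 2 * sum_{j in c} K[i][j] / |c| + sum_{j, l in c} K[j][l] / |c|^2.
     */
    static int[] kernelKMeans(double[][] k, int[] initialLabels, int clusters, int maxIterations) {
        int n = k.length;
        int[] labels = initialLabels.clone();
        for (int iter = 0; iter < maxIterations; iter++) {
            int[] size = new int[clusters];
            double[] within = new double[clusters]; // sum of K over all pairs inside each cluster
            for (int i = 0; i < n; i++) {
                size[labels[i]]++;
                for (int j = 0; j < n; j++) {
                    if (labels[i] == labels[j]) within[labels[i]] += k[i][j];
                }
            }
            int[] next = new int[n];
            for (int i = 0; i < n; i++) {
                double[] cross = new double[clusters]; // sum of K[i][j] over members j of each cluster
                for (int j = 0; j < n; j++) cross[labels[j]] += k[i][j];
                double best = Double.POSITIVE_INFINITY;
                next[i] = labels[i];
                for (int c = 0; c < clusters; c++) {
                    if (size[c] == 0) continue; // skip empty clusters
                    double dist = k[i][i] - 2.0 * cross[c] / size[c]
                            + within[c] / ((double) size[c] * size[c]);
                    if (dist < best) {
                        best = dist;
                        next[i] = c;
                    }
                }
            }
            if (Arrays.equals(next, labels)) break; // converged
            labels = next;
        }
        return labels;
    }

    public static void main(String[] args) {
        // Toy data: hypothetical term-weight vectors and one style feature per document.
        double[][] content = {{0, 0}, {0, 1}, {10, 10}, {10, 11}};
        double[][] style = {{1}, {1}, {5}, {5}};
        double[][] k = kernelMatrix(content, style, 0.5, 2.0);
        int[] labels = kernelKMeans(k, new int[] {0, 0, 0, 1}, 2, 20);
        System.out.println(Arrays.toString(labels));
    }
}
```

On this toy data the first two documents end up in one cluster and the last two in the other. Kernel k-means is sensitive to the starting labels and can stall in a poor local optimum, so for a real project it is worth trying several initial assignments. In practice the content vectors would come from something like TF-IDF weights and the style vectors from features such as sentence length or punctuation rates, which are again assumptions and not details taken from the abstract.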