I am looking for a project which is based on text mining
Posts: 6,843
Threads: 4
Joined: Mar 2015
java code for effective pattern discovery for text mining
Abstract
Abstract:
An innovative and effective pattern discovery is proposed to overcome the low-frequency and misinterpretation problems for text mining. The proposed technique uses two processes, pattern deploying and pattern evolving, to refine the discovered patterns in text documents. We focus on the development of a knowledge discovery model to effectively use and update the discovered patterns and apply it to the field of text mining.
Aim & Objective
• To solve the misinterpretation problem effective pattern discovery technique is designed
• It also considers the influence of patterns from the negative training examples to find ambiguous (noisy) patterns and try to reduce their influence for the low-frequency problem.
Problem Statement
Most existing text mining methods adopted term-based approaches; they all suffer from the problems of polysemy and synonymy. Phrase-based approaches could perform better than the term based ones, as phrases may carry more “semantics” like information. There are two fundamental issues regarding the effectiveness of pattern-based approaches: low frequency and misinterpretation.
Contribution
This technique inner pattern evolution, only changes a pattern’s term supports within the pattern. A threshold is usually used to classify documents into relevant or irrelevant categories. The basic idea of updating patterns is explained as follows: complete conflict offenders are removed from d-patterns first. For partial conflict offenders, their term supports are reshuffled in order to reduce the effects of noise documents.