seminar class · 28-03-2011, 04:33 PM

DESCRIPTION:
A data stream is a massive, open-ended sequence of data elements continuously generated at a rapid rate. Mining data streams is more difficult than mining static databases because streaming data is huge, high-speed and continuous. In this paper, we propose a new one-pass algorithm called DSM-MFI (Data Stream Mining for Maximal Frequent Itemsets), which mines the set of all maximal frequent itemsets in landmark windows over data streams.
The proposed algorithm DSM-MFI is composed of four steps. First, it reads a block of transactions from the buffer in main memory and sorts the items of each transaction in lexicographical order. Second, it constructs and maintains the in-memory summary data structure SFI-forest (Summary Frequent Itemset forest). Third, it prunes the infrequent patterns from the summary data structure. Fourth, it searches the current summary data structure for the set of all maximal frequent itemsets.
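The four steps can be illustrated with a small Java sketch. This is not the DSM-MFI implementation: the post does not describe the internal layout of the SFI-forest, so a plain hash map of itemset counts stands in for it here. The class name `BlockMaximalMiner`, the `minSupport` parameter, and the pruning rule (drop anything below `minSupport` times the number of transactions seen so far) are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

// Illustrative sketch only: a flat map of itemset counts replaces the
// SFI-forest, whose structure is not given in the description above.
class BlockMaximalMiner {
    private final Map<TreeSet<String>, Integer> counts = new HashMap<>();
    private final double minSupport;   // assumed: minimum support as a fraction
    private int transactionsSeen = 0;  // counted from the landmark (stream start)

    BlockMaximalMiner(double minSupport) {
        this.minSupport = minSupport;
    }

    // Steps 1 to 3 for one block of transactions.
    void processBlock(List<List<String>> block) {
        for (List<String> transaction : block) {
            // Step 1: a TreeSet puts the items in lexicographical order.
            TreeSet<String> sorted = new TreeSet<>(transaction);
            List<String> items = new ArrayList<>(sorted);
            transactionsSeen++;
            // Step 2: update the summary by counting every non-empty subset.
            // This is exponential in transaction length, so toy inputs only.
            for (int mask = 1; mask < (1 << items.size()); mask++) {
                TreeSet<String> subset = new TreeSet<>();
                for (int i = 0; i < items.size(); i++) {
                    if ((mask & (1 << i)) != 0) {
                        subset.add(items.get(i));
                    }
                }
                counts.merge(subset, 1, Integer::sum);
            }
        }
        // Step 3: prune itemsets that are infrequent so far (assumed rule).
        double threshold = minSupport * transactionsSeen;
        counts.values().removeIf(count -> count < threshold);
    }

    // Step 4: after pruning, every remaining itemset is frequent; keep those
    // with no proper superset among the remaining itemsets.
    List<TreeSet<String>> maximalFrequentItemsets() {
        List<TreeSet<String>> maximal = new ArrayList<>();
        for (TreeSet<String> candidate : counts.keySet()) {
            boolean hasFrequentSuperset = false;
            for (TreeSet<String> other : counts.keySet()) {
                if (other.size() > candidate.size() && other.containsAll(candidate)) {
                    hasFrequentSuperset = true;
                    break;
                }
            }
            if (!hasFrequentSuperset) {
                maximal.add(candidate);
            }
        }
        return maximal;
    }

    public static void main(String[] args) {
        BlockMaximalMiner miner = new BlockMaximalMiner(0.5);
        miner.processBlock(Arrays.asList(
                Arrays.asList("c", "a", "b"), Arrays.asList("b", "a")));
        miner.processBlock(Arrays.asList(
                Arrays.asList("c", "d"), Arrays.asList("d", "c", "a")));
        System.out.println(miner.maximalFrequentItemsets());
    }
}
```

With a minimum support of 0.5 and the four transactions above, the maximal frequent itemsets are {a, b}, {a, c} and {c, d}; the print order depends on the hash map. Note that pruning after every block discards counts, so an itemset that becomes frequent later is undercounted. A real one-pass miner has to bound that error, which this sketch does not attempt.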
A new summary data structure called the summary frequent itemset forest (abbreviated as SFI-forest) is developed for incrementally maintaining the essential information about maximal frequent itemsets embedded in the stream so far. Theoretical analysis and experimental studies show that the proposed algorithm is efficient and scalable for mining the set of all maximal frequent itemsets over the entire history of the data streams.
SYSTEM REQUIREMENTS
Hardware Requirements :
• Pentium IV 500 MHz
• 512 MB RAM
• 500 MB Free Hard disk space
• Color Monitor
Software Requirements:
Front End : Java
Datasets : Synthetic datasets
Operating System : Windows XP
Tools : EditPlus, Microsoft FrontPage for editing
