19-04-2011, 11:34 AM
Presented by:
Shaikh Farhat I
INTRODUCTION
The Web is huge, dynamic and diverse, which raises scalability, temporal and multimedia data issues respectively.
→ As a result, we are drowning in information and facing information overload. Information users can encounter problems when interacting with the Web.
→ Highly dynamic…
Explosive growth in the amount of content on the Internet.
→ Web search engines return thousands of results, which makes them difficult to browse.
→ Online repositories are growing rapidly.
→ Online organizations generate a huge amount of data… How can this data be put to best use?
→ Finding relevant information: many of the search results are irrelevant, and search engines are unable to index all the information available on the Web.
→ Creating new knowledge out of the information available on the Web: this presumes that we already have a collection of Web data and want to extract potentially useful knowledge from it.
→ Personalization of the information: this problem is often associated with the type and presentation of the information.
→ Learning about consumers or individual users: this problem is about knowing what the customers do and want.
ROLE OF WEB MINING
→ Web mining techniques can be used directly or indirectly to solve the information overload problems described above.
Directly: applying web mining techniques addresses the problem itself, i.e. the problem is attacked with web mining techniques.
Indirectly: web mining techniques are used as part of a bigger application that addresses the problems mentioned above.
OTHER APPROACHES
→ Web mining is NOT the only useful tool; other useful techniques include:
1. Databases (DB)
2. Information retrieval (IR)
3. Natural language processing (NLP)
In-depth syntactic and semantic analysis
4. Web document community
Standards, manually appended meta-information, maintained directories, etc.
DEFINITION OF WEB MINING
→ Web mining is the use of data mining techniques to automatically discover and extract information from Web documents and services.
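To make the definition concrete, here is a minimal sketch of extracting information from a single Web document. It is only an illustration, not a method from the presentation: the class name PageMiner, the function mine_page, the sample page and the example.com URLs are all hypothetical, and counting words stands in for a real data mining step. It uses only the Python standard library.

```python
from collections import Counter
from html.parser import HTMLParser
import re


class PageMiner(HTMLParser):
    """Hypothetical helper: collects hyperlinks and visible words from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []          # href values of <a> tags, in document order
        self.words = Counter()   # lowercase word -> number of occurrences
        self._skip_depth = 0     # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Count only text the reader would see, not script or style content.
        if self._skip_depth == 0:
            self.words.update(re.findall(r"[a-z]+", data.lower()))


def mine_page(html):
    """Return (links, word_counts) extracted from an HTML string."""
    miner = PageMiner()
    miner.feed(html)
    miner.close()
    return miner.links, miner.words


# Made-up sample page, used only for illustration.
SAMPLE_HTML = """
<html><head><title>Web mining</title><style>p {color: red}</style></head>
<body>
<p>Web mining applies data mining to Web data.</p>
<a href="http://example.com/a">mining tools</a>
<a href="http://example.com/b">more data</a>
</body></html>
"""

links, words = mine_page(SAMPLE_HTML)
print(links)                  # hyperlinks found in the page
print(words.most_common(3))   # most frequent visible words
```

The link list and the word counts are the kind of raw material that a larger application could feed into further analysis over many pages.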