09-06-2012, 05:39 PM
SEMINAR ON A NEW TREND IN DATA WAREHOUSING
DATA WAREHOUSING.doc (Size: 1.18 MB / Downloads: 2)
ABSTRACT:
The problem with majority of data on the web is that it is difficult to use on a large scale, because there is no global system for publishing data in such a way as it can be easily processed by anyone. Everyone using the WWW has the problem that who can you trust to send you e-mails; how can I know s u r e if a transaction really occurred. So the semantic web can be seen as a huge engineering solution… but it is more than that.
The Semantic Web is a mesh of information linked up in such a way as to be easily procesable by machines, on a global scale. The Semantic Web provides a common framework that allows data to be shared and reused across application. It is a collaborat- ive effort led by W3C. The Semantic Web is about common formats for integration and combination of data drawn from diverse sources, where the original Web mainly concentrates on the interchange of documents.
The Semantic Web approach instead develops languages for expressing information in a machine processable form. This development of Semantic Web is occurring in atleast two areas: from the infrastructural, all-embracing, position as espoused by the W3C/MIT and other academically -focused organizations.
PROBLEMS WITH THE WWW:
Data that is generally hidden away in HTML files is often useful in some contexts, but not in others. The problem with the majority o f data on the web that is in this form at the moment is that it is difficult to use on large scale, because there is no global system for publishing data in such a way as it can be easily processed by anyone. Technically WWW means a set of protocols and languages driven by a strong standards approach namely URI, HTTP, HTML, and HML.
WEB ONTOLOGY LANGUAGE (OWL):
OWL is an RDF-based language for Ontology modeling. It enable class and instance definition, using relations and properties such as Properties (price is a property of product), subclass Of (Employee is subclass Of person).
OWL ontologies can be developed independently, having concepts reference each other. Network effect is shown in second figure.
BOTTLENECKS:
Sufficient metadata is the main bottleneck of the Semantic Web. There is a loop:
- Without metadata, no applications will be built
- Without applications, no one will create metadata
The gap between academic and commercial is called THE META DATA GAP.
META DATA CHASM:
Ontology creation requires companies and organization to standardize their concepts. It is much harder than to standardize than communication protocols. Ontology creation requires large investments. Because ontologies reduce the uncertainty of information, their benefits will be revealed mainly in the long run.