COMP20008 Study Guide - Final Guide: Interquartile Range, Recommender System, Pearson Product-Moment Correlation Coefficient

190 views23 pages
CVS vs Excel
CSV is flat file
Excel: operations, formatting
HTML vs XML
XML and HTML were designed with different goals:
XML was designed to carry data - with focus on what data is
HTML was designed to display data - with focus on how data looks
XML tags are not predefined like HTML tags are
XML Namespace:
Aim: to provide uniquely name element or attribute
Avoid ambiguity, uniquely describe what I mean using the URL
First look at its bracket first than look at it's parents or grandparents , NOT
SIBLING
XML attribute vs elements
Elements: to store information. Attribute identify elements.
find more resources at oneclass.com
find more resources at oneclass.com
Unlock document

This preview shows pages 1-3 of the document.
Unlock all 23 pages and 3 million more documents.

Already have an account? Log in
Some of the problems with attributes are:
attributes cannot contain multiple values (child elements can)
attributes are not easily expandable (for future changes)
attributes cannot describe structures (child elements can)
attributes are more difficult to manipulate by program code
attribute values are not easy to test against a DTD
What I am trying to say here is that metadata (data about data) should be stored as attributes, and that
data itself should be stored as elements.
XML vs JSON
XML allows complex schema definitions (via regular expressions)
allows formal validation
makes you consider the data design more closely
JSON is more streamlined, lightweight and compressed
Which appeals to programmers looking for speed and efficiency
Widely used for storing data in noSQL databases
find more resources at oneclass.com
find more resources at oneclass.com
Unlock document

This preview shows pages 1-3 of the document.
Unlock all 23 pages and 3 million more documents.

Already have an account? Log in
XML schema: describe the structure of document
The purpose is to define the legal building blocks of an XML document: the
elements and attributes that can appear in a document, number of child
elements, data types
find more resources at oneclass.com
find more resources at oneclass.com
Unlock document

This preview shows pages 1-3 of the document.
Unlock all 23 pages and 3 million more documents.

Already have an account? Log in

Document Summary

Xml namespace: aim: to provide uniquely name element or attribute, avoid ambiguity, uniquely describe what i mean using the url, first look at its bracket first than look at it"s parents or grandparents , not. What i am trying to say here is that metadata (data about data) should be stored as attributes, and that data itself should be stored as elements. Xml allows complex schema definitions (via regular expressions) Makes you consider the data design more closely. Which appeals to programmers looking for speed and efficiency. Widely used for storing data in nosql databases. The purpose is to define the legal building blocks of an xml document: the elements and attributes that can appear in a document, number of child elements, data types. Name space has to be parents, not sibling. To measure data quality which are: accuracy, completeness, consistency, timeliness, believability.