EECS 1520 Chapter 3: Chapter 3 _ cse
mahika.c06 and 40040 others unlocked
54
EECS 1520 Full Course Notes
Verified Note
54 documents
Document Summary
Chapter 3 data representation: this chapter explores how we represent and store the various kids of data a computer manages. Analog and digital data: computers are finite. In 1960, a survey revealed 60 different character sets were in use: at ibm alone there were 9 different sets, by agreeing to use one particular character set, computer manufacturers made the processing of text data easier. The unicode character set: however, even the extended version of the ascii character set is not enough for international use, the unicode character set uses 16 bits per character. The unicode character set can represent 2^16, or over 65 thousand characters: the unicode is a superset of the ascii code. Run-length encoding: frequently used words are replaced with a single character for example the word (cid:498)as(cid:499) is replaced with (^), and the word (cid:498)that(cid:499) is replaced with ($). This type of repetition doesn"t generally take place in english text, but often occurs in large data stream.