STAT 100 Lecture Notes - Lecture 4: Xml, Json, Linear Algebra

21 views2 pages
Data cleaning and exploratory data analysis
Data frame :
series
:Anamed icolumn of data With an index
indexes :mapping from key to
rows
Dataram :collection of Series with common index
Methods .
.
Filtering on predicts and slicing
df .Ioc :Location by index
df .iloc =location by integer address
group by and pivot
Data cleaning :process of transforming raw
data to facilitate subsequent analysis
Exploratory path Analysis CEPA )
The process of transforming ,visualizing ,and
summarizing data to -
Build confirm understanding of the data
identify potential issue
(inform subsequent analysis
werid pattern Leven if they don't exist )
conclusion ,subsets
Discover potential hypothesis
Key data properties to consider in EDA
-structure :shape
-Granularity =how fine
-scope :how Lin )complete is data
Unlock document

This preview shows half of the first page of the document.
Unlock all 2 pages and 3 million more documents.

Already have an account? Log in

Get access

Grade+20% off
$8 USD/m$10 USD/m
Billed $96 USD annually
Grade+
Homework Help
Study Guides
Textbook Solutions
Class Notes
Textbook Notes
Booster Class
40 Verified Answers
Class+
$8 USD/m
Billed $96 USD annually
Class+
Homework Help
Study Guides
Textbook Solutions
Class Notes
Textbook Notes
Booster Class
30 Verified Answers

Related textbook solutions

Related Documents