“A picture is worth a thousand words”
What is Data Visualization?
The pictorial format represents some form of collected data about the world which will help us in making decisions.
From a Data Scientist’s prospective, how visualization will help us?
Why Do we have to Visualize Data?
- It compresses the huge volume to easy to an understandable level.
- Many Answers, we can get in many ways in less time.
- Helps in decision-making.
How Do We Visualization?
The gestalt principle is a human perception principle that describes how we recognize and simplify complex visual patterns.
-
Principle of figure and ground

Principle of figure and ground -
Principle of similarity

Principle of similarity -
Principle of the focal point

Principle of the focal point -
Principle of continuity

Principle of continuity -
Principle of proximity

Principle of proximity -
Principle of common region

Principle of common region -
Principle of closure

7 Stages of Visualizing the Data
- Acquire: Obtaining data.
- Parse: Structuring data.
- Filter: Removing noise from data.
- Mine: Exploring the undiscovered patterns in data.
- Represent: Visualizing the data.
- Refine: Improving the visualization.
- Interact: Analysing the important one.
Data Science Process
-
Data understanding process: You can view the different format data in the following types
-
- Structured Data
- Unstructured Data
- Semi-Structured Data
2. Data preparation process:
-
- Clearing unnecessary data like duplicate, corrupt, irrelevant, and inaccurate.
- Finding the missing data
- Strategy to find the missing data.
- Relative proxy data
- Building the data pipeline
3. Modelling Process: Building a strategy for accessing data and putting it through different visualization tools.
4. Evaluation Process: Studying and evaluating the model data and finalizing the key goal statement and solution.
5. Deployment process: Applying the solution in the business.
We can compare things through
- Horizontal bar graphs

Horizontal bar graphs - Column bar graphs

Column bar graphs - Stacked bar graphs

Stacked bar graphs
A distribution chart can be used for Distribution visualization
Pie chart can be used for representing the composition

Bubble charts and word clouds can be used to visualize different variables.
Data Types for Visualization
- Nominal Scale: Categories, types (Fruits: Apple, grapes, banana)
- Ordinal Scale: Orderly Representation (AAA, AA, A)
- Interval Scale: Fixed interval set (0-5, 10-50)
- Ration Scale:(Share value, Employee salary)
Effectiveness of visual Encodings
- Detection
- Assembly
- Estimation
Edward Tufte’s Design Principles
- Principle 1: Maximizing Data-Ink Ratio

Maximizing Data-Ink Ratio - Principle 2: Minimizing Chart Junk
- Unintentional optical art
- The grid
- The Duck
- Principle 3: Minimizing Lie Factor

Minimizing Lie Factor
How Do we Visualize Single Measure?
- Visualizing basic comparison charts
- Bar chart

bar-graphs - Big number

Big number - Histogram

Histogram - Box and whisker chart

Box and whisker chart
- Bar chart
- Visualization for composition
- Pie chart
- Icon array
How Do we Visualize Multiple Measure?
- Visual Comparision
- Stacked bar chart
- Box chart and boxen chart
- Violin Chart
- Strip chart and swarm chart
- Visualizing relationship
- Scatter chart and joint plot
- Pair plot
- Heat map
- Parallel coordinates chart
- Dual axis chart
- Visualizing trends
- Line chart
Various forms of unstructured text data
- Document
- Emails
- Social media
Visualizing Text Data
- .txt format
- Excel format
Visual Storytelling
” Data should be able to speak to the eyes, as eye is the best judge of proportion, being able to estimate it with more quickness and accuracy than any other of our organ”
Storytelling with Data
Secrets of Great Design
How to define the stats with story
The Neuroscience of Branding
Branding
Typography
Color Psychology
Tableau Full Course
Power BI Full Course
Additional Tips


