"Data visualization is the process of displaying data/information in graphical charts, figures and bars. It is used as means to deliver visual reporting to users for the performance, operations or general statistics of an application, network, hardware or virtually any asset." from techopedia.com
"Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively."
"don’t need vis when fully automatic solution exists and is trusted." from Prof. Tamara Munzner
Webster defines Visualization as:
Hamming: "The purpose of computing is insight not numbers"
What are the advantages? (adapted from [Ware 2000])
How do we make good visualizations? (adapted from [Tufte 1983])
Charles Joseph Minard's 1861 graphic
showing Napoleon's losses during his 1812 march to and from Moscow -
possibly the best statistical graph ever drawn ... why?
The image is discussed in detail on p41 of The
Visual Display of Quantitative Information
The chart show 6 variables
"Visual Analytics is the science of analytical reasoning facilitated by interactive visual interfaces. People use visual analytics tools and techniques to synthesize information and derive insight from massive, dynamic, ambiguous, and often conflicting data, provide timely, defensible, and understandable assessments; and communicate assessment effectively for action. The overall goal is to detect the expected and discover the unexpected. "
Here is a table from the US Environmental Protection Agency from a
few years ago. What are the problems with this table?
A better version of the table (font, background, alignment):
A better version would be:
Using color to highlight the pattern (one color): Below is an example of a bar chart from a report to investors published by one of the largest companies in the world.
This course will also deal with Visual Analytics - using
interactive visualizations to enhance the analysis of large amounts of
data - that is, the visualization is not the end-product but rather it
is the means by which people can understand complex phenomena
So, what is the difference between InfoVis and VAST: no clear border
Example: Scatterplot and visual features
Source Sector
Total Emissions
Electricity Generation
652,314
Fires
14,520,530
Fossil Fuel Combustion
1,499,367
Industrial Processes
2,414,055
Miscellaneous
33,786
Non Road Equipment
22,414,896
On Road Vehicles
62,957,908
Residential Wood Combustion
2,704,197
Road Dust
0
Solvent Use
3,294
Waste Disposal
2,018,496
Source Sector
Total Emissions
Electricity
Generation
652,314
Fires
14,520,530
Fossil Fuel
Combustion
1,499,367
Industrial
Processes
2,414,055
Miscellaneous
33,786
Non Road Equipment
22,414,896
On Road Vehicles
62,957,908
Residential Wood
Combustion
2,704,197
Road Dust
0
Solvent Use
3,294
Waste Disposal
2,018,496
Here is a sample table with two values: Yes/No.
Yes
No
Yes
Yes
Yes
Yes
No
No
No
Yes
No
Yes
Yes
Yes
Yes
No
No
No
No
Yes
Yes
Yes
No
No
Yes
Yes
-
Yes
Yes
Yes
Yes
-
-
-
Yes
-
Yes
Yes
Yes
Yes
-
-
-
-
Yes
Yes
Yes
-
-
Yes
Yes
No
Yes
Yes
Yes
Yes
No
No
No
Yes
No
Yes
Yes
Yes
Yes
No
No
No
No
Yes
Yes
Yes
No
No
Yes
Stop making charts when a table is better.
"Graphs are essentially a visual display of quantitative information along two axes. Visuals are used as a way for our brains to quickly understand information, which is a powerful tool if used correctly. Graphs can show a large amount of data quickly in a way that is easy to process, without distracting people with a bunch of numbers.""
"When debating table vs. graph, ask yourself how the data will be used, consider your target audience, and decide the best way to map out your information. Think about the utility of your visual and let that help drive your decision-making."
This is the bar charts of the population of the USA over the last 8 years. The left chart shows how 3D hurts the visualization. The right chart is a less fancy but much more effective in 2D.
Here is an example how visualization lies: from Time Magazine (4/9/79) via Tufte. Please avoid this.
These examples are charts from a recent version of Excel. 3D distorts the view making it more difficult (less accurate as well) to read the visualization.
A 2D version is much better. Picking up the right colors is also helpful: we could use colors to relate consoles. For example, Playstation 2 and Playstation 3 are both in (darker and lighter) blue.
Here is another bad example from Fox news:
Five Ways to Fail at Data Visualization
More examples of bad visualizations can be found at WTF visualizations
Or The 9 Worst Data Visualizations Ever Created
Or MANY more Flowing data: Ugly Charts