| title | Data visualization |
|---|---|
| pagetitle | Data visualization |
| subtitle | from a communication perspective |
| date | <span style="display:block;margin-top:-1em;margin-bottom:1em;font-size:.8em;font-style:normal;">with Mikel Madina</span><span style="font-style:normal;">European Data Incubator</span><br />The WWW, 4<small>th</small> November 2020 <div style="background-color:tomato;border-radius:5px;color:white;font-weight:bold;padding:.6em;width:10em;text-align:center;transform:rotate(10deg);position:absolute;top:.5em;right:-1em;font-size:.6em;font-family:Roboto;line-height:.9em;">GLOBAL PANDEMIC edition</div> |
| author | Miren Berasategi |
| theme | deusto |
| deusto | true |
artefact (US artifact) noun
1 An object made by a human being, typically one of cultural or historical interest.‘gold and silver artefacts’
2 Something observed in a scientific investigation or experiment that is not naturally present but occurs as a result of the preparative or investigative procedure.
‘the curvature of the surface is an artefact of the wide-angle view’
The Oxford Dictionary of English
::: notes
Dataviz are manufactured products, are not "found in nature" -> they carry interpretations by the creator, so it's important to be aware of this
Now let's dissect the artefact...
:::
Maximum data density is 1:1, and this is not usually the case:
| data points | < | pixels |
| observations |
::: notes
Observations need to be transformed into visual marks.
Data Density: An overall characteristic of the visual elements, used in combination, is data density. Data density refers to the number of useful data points you can get into an area.
We can only represent as many marks (as representations of observations) as we have pixels: in a picture of 1000 x 1000 pixels, the maximum of observations to fit is 1 million. How big is that?
More than 1 pixel is used to represent each data point, + complimentary pixels for titles, labels, white space.
Simplification/Reduction is required.
:::
Some strategies to overcome this constraint:
- Filter observations
- Split data into multiple charts
- Augmented visualizations
- Densify
::: notes
Not only that so much data can be displayed, but that it cannot actually be correctly interpreted.
1, 2, and 3, escape densification; 4 represents density.
:::
1. Filter observations
- By design, communicating a selection of data
- By allowing users to filter according to their interests
- Innovative filtering (i.e. Smart brushing)
::: notes
To make a selection of the data to display.
- by design, using criteria present in the data.
- by users, requires interaction / UX design.
innovation in the part of how the user interacts with the data: smart brushing
:::
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/VU117h6wT6w?controls=0&start=580"}
::: notes
Instead of selecting by line, allows the user to select by pattern. Useful for data exploration
run up to now they're translating the brush from left to right
Interpretation is up to the specific dataset and the objectives of the viz, but the point is that while it may allow for more complex insight, it also requires complex interaction capacity on the part of the user
:::
2. Split data into multiple charts
Facets, trellis, small multiples.
::: notes
Small multiples are sets of charts of the same type, with the same scale, presented together at a small size and with minimal detail, usually in a grid of some kind (in this case grid irrelevant). The term was at least popularized by Edward Tufte, appearing first in his seminal book Visual Display of Quantitative Information in 1983. He says of them:
Small multiples are economical: once viewers understand the design of one [chart], they have immediate access to the data in all the other [charts]… as the eye moves from one [chart] to the next, the consistency of the design allows viewers to focus on changes in the data rather than on changes in graphical design.
These will show up again later.
:::
3. Augmented visualizations
. . .
::: notes
adding redundant views on the same data
:::
4. Densify
- Escaping overplotting in scatterplots
. . .
::: notes
Crowded scatterplots are a classic example. What do you think we could do?
:::
4. Densify
- Escaping overplotting in scatterplots
::: notes
We can restort to a graphic resource like the alpha channel: still not useful
:::
4. Densify
- Escaping overplotting in scatterplots
::: notes
Actually requires/involves statistical simplification.
Color is in logaritmic scale, so density is more apparent.
:::
4. Densify
- Timelines
::: notes
a stock market dataset of 3,500 historical New York Stock Exchange closing prices.
:::
4. Densify
- Timelines
::: notes
Dense clusters of lines are easy to spot in blue, while bright yellow shows areas with few stock price lines. The drop that came with the financial crisis in 2008 is clearly visible. Similarly, we can see two dense bands of stock values around $15 and $25, showing that companies (or customers) tend toward round stock prices.
This is similar to what happens with marathon finishing times, what is called "Reference-Dependent Preferences" that model behaviour towards arbitrary references for no evident reasons other than the reference itself.
:::
::: notes
Graph showing a large dataset of marathon finishing times (n=9,789,093)
We provide visual and statistical evidence that round numbers (e.g., a four-hour marathon) serve as reference points in this environment and as a result produce significant bunching of performance at these round numbers. Bunching is driven by planning and adjustments in effort provision near the finish line and cannot be explained by explicit rewards (e.g., qualifying for the Boston Marathon), peer effects, or institutional features (e.g., pacesetters).
:::
A couple more things at this level of the number of observations:
- the time factor
- nothingness
- uncertainty, projections and other non-factual data
::: notes
Back to topic.
:::
The time factor
Require an additional effort for operational intelligence, where immediate decision making could be a requirement.
Source: Aragues 2018
::: notes
- A static visualization that uses the data that is available when the visualization is created. Any new data requires a refresh.
- From Windows Task Manager:
Ctrl+Shift+Esc- A real-time visualization that looks like the static one but updates itself constantly.
- A streaming data visualization that shows the flow of data and the impact it has on the statistics.
Perception of moving elements poses an additional difficulty for interpretation.
:::
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/hVimVzgtD6w?controls=0&start=149"}
::: notes
Global Development-Health to Swedish undegraduate students, top-grade instution. Did a pre-test: asked about child mortality on 5 pairs of countries (with double difference), mean 1.8/5 CI 0.4 = his course made sense. Chimpanzees 2.5. Repeated on professors, 2.4/0.4
Hans Rosling shifted the widespread view of a world divided into us/west and them/developing countries
:::
The time factor
A note of caution on using animations:
- They rely on working memory (from one frame to the next), so change should be obvious
- Users needs to find the area of interest on their own, so they may miss the point (selective attention)
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/vJG698U2Mvo?controls=0"}
::: notes
we are missing a lot of what goes on around us, and that we have no idea that we are missing so much.
https://forms.gle/fyWU4enxBwhMQtnC6
:::
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/IGQmdoK_ZfY?controls=0"}
::: notes
Subjects who knew about videos like this one consistently spotted the gorilla in the new video, but they were slightly less likely to notice the other events.
Foreknowledge that unexpected events might occur does not enhance the ability to detect other such events.
https://forms.gle/sZWuRP9Ht2AQyCH78
:::
How to communicate nothingness? (Kirk 2014)
- Null Absence of measurement
- Zero Absence of amount/magnitude
::: notes
first question we should ask: what is missing? what can we learn from the gaps?
null in the age of information overload, data absence is more and more significant. may help in the detection of faulty sensors. If models are built filtering nulls, this insight may be lost. -> flat electroencephalogram, absence of brainwaves is important to notice
:::
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/JqzAuqNPYVM?start=542&controls=0&stop=607"}
::: notes
run up to John Snow
:::
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/JqzAuqNPYVM?start=669"}
::: notes
run up to the only option we've got in many respects
:::
How to communicate nothingness?
- Null Absence of measurement
- Zero Absence of amount/magnitude
- Blank Try to use nothing to represent something
::: notes
zero can be represented by
- position + axis
- area + axis
- color shading
- density
- angle? we can't see 0º
- area only? we can't see
the last two require an alternative (like annotation)
blank gives us space to breath. Requires courage, implies losing the potential to densify, add advertising.
There is one thing we don't want to make visible: the design. Should not interfere.
:::
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/JqzAuqNPYVM?start=1153"}
::: notes
glass ceiling only
:::
How to communicate nothingness?
- Null Absence of measurement
- Zero Absence of amount/magnitude
- Blank Try to use nothing to represent something
- The design should be invisible
{data-transition="fade" data-background-iframe="https://www.youtube.com/embed/JqzAuqNPYVM?start=1363"}
::: notes
Invisible design, should not interfere with reading the graph. There is quite a lot of blank space, but very little non-data-ink.
Run up to Antoine de Saint-Exupery
:::
Communicating uncertainty, projections,
and other non-factual data is challenging.
. . .
Communicating uncertainty, projections,
and other non-factual data is challenging.
::: notes
Better ways to do this are still being investigated.
Margins of error must also be communicated, and they are often mistakenly interpreted as the probability for an event to happen.
This was about how to reduce the number of observations to represent in a graph. Now, about the number of variables, there is some translation to be made.
:::
A mark is a basic graphical element in an image.
::: notes
Marks represent observations/groups of observations. We have a limited space to include those.
:::
A visual channel is a way to control the appearance of marks.
::: notes
Channels represent information about the marks (variables), qualitites.
Don't take up space, BUT there are only so many channels we can use in a visual representation.
One and only one attribute/variable should be used per channel. Multiple channels per attribute are possible (redundant encoding).
All channels are not equal: the selection of marks and channels should be guided by the principles of expressivenes and effectiveness. i.e. if communicating magnitude, no sense in using size to represent anything else.
:::
It is required to reduce dimensionality (statistically): PCA, factors, clustering.
::: notes
Graphic variables than can be used are limited, a reduction is required.
Once the most important attributes/variables for the desired insight have been identified, the selection of marks and channels should ensure that they are encoded with the highest ranked.
:::
::: notes
The choice of which attributes/variables to encode with position is the most central choice in visual encoding.
The combination of these elements of visual language has generated commonly used ways, or idioms, to represent certain types of data (i.e. line graphs for time-dependent unique variables).
However, sometimes the "classical" or pre-set ways or conventions to represent information will not be the most effective (e.g. line graphs for time progression). In that case,
:::
A word of caution:
- will need to be custom coded
- readers will require training
- correct interpretation may be more time demanding
. . .
Xenographics: Weird but (sometimes) useful charts
::: notes
Many examples of this in Xenographics.
We have seen some Xenographics charts already: denselines
- covid?
:::
::: notes
Compact the area chart by slicing it horizontally, and then then shifting the slices to baseline zero. It’s like a combo area chart and heatmap.
:::
::: notes
Small multiple line charts on a scatter plot. Each line shows change in job count for an industry. Horizontally, they’re organized by average salary, and vertically, they’re organized by relative change since the end of the recession. Green represents growth and red represents decline. Hovering allows more detailed exploration of each line.
Another approach could be using small multiples: adding multiple simple charts, instead of making a complex one. Another way to use multiple charts are
:::
1. Data visualization as an artefact Multiple Linked Views (MLV) {data-transition="slide-in fade-out"}
In a MLV system, a dataset is shown in multiple simple visualizations, with the data items shown in the different charts corresponding to each other. The charts in each visualization can be used to highlight, control, or filter the data items shown in the others.
(Meyer & Fihser 2018)
::: notes
To make an analogy with language, when idioms are not enough to communicate a complex message, combinations of idioms, that is, more complex paragraphs, are required.
:::
{data-background-iframe="https://www.youtube.com/embed/kiXMn2VPZek" data-transition="fade"}
(Lagner, Kister & Dachselt 2019)
::: notes
Multiple views, with an innovative UI. Different levels of viewing: overview vs detail.
AND linked views; a selection in one part drives change in another
= complex interaction on the part of users.
:::
1. Data visualization as an artefact Multiple Linked Views (MLV) {data-transition="fade-in slide-out"}
Another example:
{data-background-iframe="https://www.youtube.com/embed/8j9r9JKIqww" data-transition="fade"}
::: notes
Realidad virtual: Still purely graphic, tú te metes en el gráfico.
:::
{data-background-iframe="https://www.youtube.com/embed/p4fB_OfoaZA?start=167" data-transition="fade"}
::: notes
Realidad aumentada: El gráfico se saca a la realidad y se coordina con elementos de la realidad física.
Run up to eyes
:::
::: notes
Not exactly visualization.
Sound: improves accesibility. Keyboard navigation enables sound cues for each representation, and the whole chart can be sonified.
Already mentioned with electroencephalograms.
:::
The modern approach to data visualization is focused on quickly making data visualization.
(Meeks 2018)
::: notes
These artefacts, or products, we are making to be told to others, or communicated. We will now see the implications of this social action applied to data visualization through a series of articles published by Elijah Meeks: Data visualization Engineer at Apple, formerly Netflix.
Even if this section shows the most communicative-social science view, he is actually an engineer and wrote a book about D3js. He is advocating this communicative view into the more "this is an engineering problem" take.
He sees a great problem in current data visualization.
:::
Focus on speed affects:
- how data visualization products are designed
- what tools are used to create them
- the role of the creator in relation to the product
- how engagement with readers in envisioned
::: notes
This breackneck pace is a real data visualization constraint.
The fast mode of dataviz is real and important, but when we let it become our only view into what dataviz is, we limit ourselves in planning for how to build, support and design dataviz.
Which doesn't mean being fast will not be important sometimes, BUT it can not be at the expense of the fact that [NEXT]
:::
Ultimately, data visualization is not a technical problem, it's a design problem and, more than that, a communication problem.
(Meeks 2018)
. . .
Let's look at what charts say, mean, and do.
::: notes
Which is the main point of my presentation here today. So let's go slow down a bit
:::
2. Data visualization as a communication product What charts say {data-transition="slide-in fade-out"}
Explicitly
Charts do "show me the data" (actually, it's more that they tell the data than actually show it).
Means choosing the right specific chart to use in order to display and query the data.
. . .
How to improve: Expose data cleanly and clearly. Aim for either query or validation. Distinguish accuracy vs. precision.
::: notes
Accuracy refers to the closeness of a measured value to a standard or known value. For example, if in lab you obtain a weight measurement of 3.2 kg for a given substance, but the actual or known weight is 10 kg, then your measurement is not accurate. In this case, your measurement is not close to the known value.
Precision refers to the closeness of two or more measurements to each other. Using the example above, if you weigh a given substance five times, and get 3.2 kg each time, then your measurement is very precise. Precision is independent of accuracy. You can be very precise but inaccurate, as described above. You can also be accurate but imprecise.
For example, if on average, your measurements for a given substance are close to the known value, but the measurements are far from each other, then you have accuracy without precision.
:::
Implicitly
No chart is an unbiased view of the data, as data visualization is a manufactured artefact.
All data is transformed to be in a chart, and the inaction of not designing that transformation carries just as strong an implication as the action of transforming it.
(Meeks 2018)
::: notes
there is manufacture = subjectivity from the first moment of the data coding/gathering: choice of variables, variable names, gathering means...
:::
| {height="450"} | {height="450"} |
::: notes
This chart would be an example of "not designing" or using the usual ways of doing dataviz. However this is actually a remake of this original chart:
Carries a very strong message. Red, bars point down, looks like blood. Very evocative, emphasized by the title.
There is deliberate choice in:
- Rotation: There's still a peak, but the decline is more apparent
- Title: To match the main takeaway of the graph
- Color: not like blood anymore
- "Situation in Iraq is improving" --> Completely different message!
Neither of those charts is lying. The opinionated nature of charts should be acknowledged and embraced.
:::
Implicitly
The implicit channel of a data visualization (the title and other framing elements) can be even more powerful than the explicit channel.
. . .
How to improve: Style should be intentional, purposeful and thematically appropriate, not the result of defaults or superficial decisions.
::: notes
As these usually appeal to emotion instead of reason.
:::
About the underlying system
[...] all charts display data and all data is a proxy for the systems that created and measured that data.
(Meeks 2018)
::: notes
There's sort of a cruel irony to the way we simultaneously share these data products so enthusisastically while bemoaning the privacy issues that are necessary for their creation.
:::
About the underlying system
::: notes
Do you ever remember telling Facebook what football team you root for?
:::
About the underlying system
[...] all charts display data and all data is a proxy for the systems that created and measured that data.
(Meeks 2018) How to improve: Caution not to reveal an underlying system that is proprietary or confidential.
Descriptively
- internally: axes, labels, annotations
- externally: surrounding text, figure descriptions, discussions
. . .
Unlike the implicit channel, the descriptive channel is active and purposeful (not subconscious).
. . .
How to improve: Consider annotations, labels, axis elements as part of the data visualization.
::: notes
In the way they are described.
vs. implicit channel: NOT "read this as a swear of blood, dripping down your screen" :::
2. Data visualization as a communication product What charts say {data-transition="fade-in slide-out"}
By being more explicit in our own understanding of what charts say and how we can systematically describe what they say, we can grow more capable of using the channels available in that expression to our advantage.
(Meeks 2018)
. . .
2. Data visualization as a communication product What charts mean {data-transition="slide-in fade-out"}
Intentionally
The mode and purpose of a chart should be well understood by the chart maker and immediately apparent to the chart reader.
(Meeks 2018)
::: notes
When they get perceived by the receivers, they generate meaning.
Charts mean more than just what they say.
:::
Historically
Charts are products of their time.
It is important to provide background about the data sources, to enable checking whether they are still based on relevant priorities, dimensions and metrics.
Historically
Culturally
Charts should be adapted to the culture they will be consumed in.
::: notes
In broad terms (as western culture) but also in the most local sense (as company intra-culture).
:::
Contextually
A chart might end up serving as context: design and provide a version of the chart that is suitable for inclusion alongside other charts.
Enable removing and adjusting data visualization elements to reduce complexity, not based on screen size as in responsive data visualization, but on priority.
::: notes
A chart that’s being integrated into another display or presentation can find itself in front of an audience that does not have the same domain knowledge as the primary audience, so this is yet another reason to have its data sources, dimensions and metrics more explicitly described.
:::
2. Data visualization as a communication product What charts mean {data-transition="fade-in slide-out"}
Meaning-making may sound too soft to the kind of technical professionals that make and read data visualization but communication without meaning is just noise.
(Meeks 2018)
2. Data visualization as a communication product What charts do {data-transition="slide-in fade-out"}
The most important thing about a chart is its impact.
::: notes
Impact is the effect that should be caused by the chart.
Like does it drive decision-making or not. Different kinds of impact:
:::
Provide insights
Identify and emphasize the insights that the readers might expect.
. . .
::: notes
In this example those consist of the maxima/minima of the data (whether as an extent or as individual features), volume of the total data represented, points of interest, volatility of change or overall trend in the direction of change.
:::
Cause change
As difficult to measure as it is important.
How have they impacted business decisions? How were they used in presentations? Where they modified (changed colours, cropped, annotated) somehow?
::: notes
Understanding how a particular chart was an effective piece of evidence or motivation is key to developing more effective data visualization.
- add murdoch crop
:::
Cause visual literacy
All data visualization was, at some point, complex data visualization, until an audience grew comfortable and literate enough to read it.
(Meeks 2018)
::: notes
Languages are live things, that evolve over time, and require breakthrough.
[...] the new form cannot be introduced until it is understood but can only be understood once it's introduced.
:::
Cause visual literacy
Create new charts
::: notes
Catalogs of data visualization have the tendency to treat every form as a category of one without drawing connections between them, which disguises how charts lead to other charts.
chart lineage
:::
2. Data visualization as a communication product What charts do {data-transition="fade-in slide-out"}
All communication is evaluated based on content, but persuasive communication, which is all data visualization unless it is purely decorative, is rightly also evaluated based on effect.
(Meeks 2018)
::: notes
persuasive in the sense of driving decision or actions on users.
Ultimately, what charts do may be the one most important thing.
:::
::: notes
To put a product to use, transmit a message, prevent making noise, it is required to adapt to your audience.
Couple ways to adapt that something we tell to those others: data counseling, responsive data visualization.
:::
[...] brings domain expertise into the operationalization process to help inform decisions about good proxies as well as to uncover insights using the resulting visualizations.
(Meyer & Fisher 2018)
::: notes
proxies = the best indicator to answer to the questions that you have.
:::
Based on interviews (1) for
- gaining an understanding of the questions and data
- get feedback on proxies, explorations, and visualization prototypes (2)
::: notes
- gaining... FIRST/BEFORE
- get feedback on... AFTER
:::
Interviews
The role of the interviewer is to ask questions that will guide the stakeholders toward elucidating the information necessary for working through an operationalization process and designing visualizations.
(Meyer & Fisher 2018)
Interviews
Identify stakeholders:
- analysts
- data producers
- gatekeepers
- decision makers
- connectors
Interviews
Require practice and experience.
Semistructured: be prepared, but also be open.
- start with open ended questions (problem, data, context)
- use the conversation to search out the more abstract question
::: notes
Are kind of an art.
- unstructured: can uncover unknown needs and goals, but it can take a significant amount of time to get to anything useful.
- structured: while efficient, leave little room for discovering new insights.
:::
Interviews
Use traditional conversation / interpersonal communication skills to prevent dead ends: keep them talking
- rephrase responses back
- ask the same or similar questions in different ways
- explore a completely different conversational topic
::: notes
Some techniques to achieve this:
:::
Interviews
Contextual interviews
- take place in the stakeholder's work environment
- consist of demonstrations of the tools and data inspection methods currently in use
::: notes
often starts by asking the stakeholder to either walk through a specific analysis task they have already performed or conduct someof their work for that day with the interviewer present. The stakeholder talks through each step they are taking.
:::
Rapid prototyping
[...] is a process of trying out many visualization ideas as quickly as possible and getting feedback from stakeholders on their efficacy.
(Meyer & Fisher 2018)
Rapid prototyping
≠ fast data visualization
. . .
≈ agile/lean methodologies
and user-centered design
::: notes
NOT to be confused with fast dataviz, which is more related to using the defaults or leaving un-designed elements.
:::
Rapid prototyping
::: notes
low-fi:
- sketched on paper or on a whiteboard,
- digital mockups that may include some controls for explaining interaction ideas (slides = digital mockups)
- can incorporate charts generated in Excel or Tableau with fake or sampled data (= existing chart tools).
- Great for communicating the gist of an idea in an interview, fast and easy to produce.
- Rapidly establish whether the visualizationd designer is on the same page as the stakeholders.
hi-fi:
- custom visualizations which must be created from scratch. Largely contain the core functionality.
- Risk being confused with final
- Meant to be thrown away, the point is not to worry about the code other than to confirm that ideas can work.
:::
Rapid prototyping
Prototypes are made to obtain feedback on them: get to the stakeholders early and often.
Focus not on whether they like it or not, but rather on what the visualization can and cannot do (contextual interview where the stakeholder uses the visualization).
Responsive web design, and responsive data visualization are not simply a way to make our content accessible on smaller screens. We need to build an ergonomic web that feels natural regardless of device type.
(Hinderman 2018)
::: notes
= implies more than simply using fluid measuers
:::
Unknowns require adaptability.
- the context in which the user is trying to consume the visualization
- changes in the data that is being displayed
::: notes
[...] every single thing that you can do to make your visualization more clear, more usable, or more communicative is a reaction to either the context in which the user is trying to consume it, or a change in the data being displayed.
- the user: core of responsive web design as a whole
- the data: unique to data visualization. This is what makes a visualization FULLY responsive vs. simply scaling them at different viewport sizes, like an image.
:::
Output side (the client)
Making things work in all screen types by redrawing charts to fit its container.
Match CSS breakpoints + add any new ones as the content requires: group data to fit (trade-off precision for reduced rendering complexity and performance).
::: notes
With a combination of CSS and D3.
Breakpoints: build for your smallest supported screen: then, as you find a spot at which it makes sense to redraw, create a new breakpoint there.
:::
Input side (the data)
Adapting at breakpoints. No need to just redraw the exact same elements:
As long as the message being conveyed by the data is the same, and the point you're trying to prove is always present, you should prove it with as much firepower as you have available.
(Hinderman 2018, p.361)
::: notes
We need to KNOW
- what is the message being conveyed by the data (what does it SAY)
- what is the point you're trying to prove (what does it MEAN)
:::
Input side (the data)
Adapting at interaction points.
[...] present a rational default but enable users to dig into more complex or specific layers of data when the device's capabilities limit the presentation of both at the same time.
(Hinderman 2018, p.362)
::: notes
The most crucial and digestive information should be prioritised, but interaction should enable more granularity.
:::
Different views on heartrate depending on device:
Different views on heartreate depending on device:
Different views on heartreate depending on device:
::: notes
In adapting to different devices, perception of different kinds of charts may differ.
:::
{data-background-iframe="https://www.youtube.com/embed/MwJmjTBfOx0" data-transition="fade-in slide-out"}
::: notes
a vast majority of smartwatch interactions last under 5 s. Some chart types are not apropriate for certain devices.
:::
::: notes
Data visualization is a communication product.
There is no such thing as raw data, and there is neither such a thing as a raw data visualization (aseptic, unbiased view on the data). Even if we understood datavis as a photograph of the data, photos are long taken as political products, with a meaning, and one view of reality and not another one. But what's more, I wouldn't even say that datavis are photos of the data, they are more illustrations, drawings of the data, more crafted than photographs. And as such, we should draw them with purpose, and we should draw them FOR the user/the reader, meaning we should make them adapted, and responsive.
:::
This presentation is available at
http://mrn.bz/EDI2020
{width="50" style="vertical-align:middle;"}
@ DeustoKomunika
{width="50" style="vertical-align:middle;padding-left:25px;"}
@ DeustoData @ mberasategi {style="padding-left:25px;"} @ neregauzak{style="padding-left:25px;"}
- D3.js
A JavaScript library for manipulating documents based on data. D3 helps you bring data to life using HTML, SVG, and CSS. - ggplot2
Data visualization package for the statistical programming language R. - IEEEVIS 2020
Conference on Scientific visualization, Information visualization and Visual Analytics. Papers from the 2019 edition. - matplotlib, seaborn
Just two of the many data visualization libraries available for Python - Open Access Vis
A collection of open access visualization research at the VIS 2018 conference. - Xenographics
Weird but (sometimes) useful charts. - Data visualization Catalogue
A library of different information visualization types.
Eric J. Allen, Patricia M. Dechow, Devin G. Pope, George Wu (2017) "Reference-Dependent Preferences: Evidence from Marathon Runners". Management Science 63(6):1657-1672. https://doi.org/10.1287/mnsc.2015.2417
Anthony Aragues (2018), Visualizing Streaming Data. O'Reilly Media
Jeremy Ashkenas & Alicia Parlapiano (2014), "How the Recession Reshaped the Economy, in 255 Charts" in TheUpshot at The New York Times
Tanja Blascheck, Lonni Besançon, Anastasia Bezerianos, Bongshin Lee, Petra Isenberg (2019). "Glanceable Visualization: Studies of Data Comparison Performance on Smartwatches". IEEE Transactions on Visualization and Computer Graphics 25(1) 10.1109/TVCG.2018.2865142`
Matthew Brehmer, Bongshin Lee, Petra Isenberg, Eun Kyoung Choe (2019). "Visualizing Ranges over Time on Mobile Phones: A Task-Based Crowdsourced Evaluation". IEEE Transactions on Visualization and Computer Graphics 25(1) 10.1109/TVCG.2018.2865234
Brendan Gregg (2016), "The Flame Graph". ACM Queue 14(2)
Luc Guillemot (2018), "How Does This Data Sound?"
Jeffrey Heer, Nicholas Kong, Maneesh Agrawala (2009), "Sizing the Horizon: The Effects of Chart Size and Layering on the Graphical Perception of Time Series Visualizations". ACM Human Factors in Computing Systems (CHI), pp. 1303 - 1312 10.1145/1518701.1518897
Bill Hinderman (2015), Building Responsive Data Visualization for the Web. O'Reilly
Christophe Hurter, Nathalie Henry Riche, Steven M. Drucker, Maxime Cordeil, Richard Alligier, Romain Vuillemot (2018), "FiberClay: Sculpting Three Dimensional Trajectories to Reveal Structural Insights", IEEE Transactions on Visualization and Computer Graphics 25(1)
Andy Kirk (2014), "The Design of Nothing: Null, Zero, Blank", OpenVis Conference 2014
Ihor Kovalyshyn (2017), "When Scatter Plot Doesn't Work"
Ricardo Langner, Ulrike Kister, Raimund Dachselt (2019). "Multiple Coordinated Views at Large Displays for Multiple Users: Empirical Findings on User Behavior, Movements, and Distances", IEEE Transactions on Visualization and Computer Graphics 25(1) (proc. InfoVis 2018) 10.1109/TVCG.2018.2865235
Elijah Meeks (2017), "Strategic Innovation in Data Visualization Will Not Come From Tech"
— (2018), "Data Visualization, Fast and Slow"
Dan Meth (2009), "The Trilogy Meter"
Miriah Meyer & Danyel Fisher (2018), Making Data Visual. O'Reilly Media
Dominik Moritz and Danyel Fisher (2018), "Visualizing a Million Time Serieswith the Density Line Chart" arXiv:1808.06019v2 [cs.HC]
Tamara Munzner (2015). Visualization Analysis and Design. CRC Press
Jonas Schöley (2018), "Choropleth maps with tricolore"
Daniel J. Simons (2010), "Monkeying around with the gorillas in our midst: familiaritywith an inattentional-blindness task does not improve thedetection of unexpected events", i-Perception, vol.1, pp. 3–6 10.1068/i0386
Ronell Sicat, Jiabao Li. DXR: A Toolkit for Building Immersive Data Visualizations
{height="50px" style="display:inline-block;margin-right:2em;"}
{height="50px" style="display:inline-block;"}
{height="50px" style="display:inline-block;"}
Copyright © 2019 University of Deusto
This work (except for the quoted images, whose rights are reserved to their owners) is licensed under the Creative Commons “Attribution-ShareAlike” License. To view a copy of this license, visit [http://creativecommons.org/licenses/by-sa/3.0/](http://creativecommons.org/licenses/by-sa/3.0/)

































