Published
Edited
Sep 20, 2020
Insert cell
md`# Day Twelve, Sept. 5, 2020: Scales`
Insert cell
d3 = require("d3@6")
Insert cell
scale = d3.scaleLinear()
Insert cell
md`

\`d3\` scales are really difficult to understand. I read [Jerome Cukier's great page on 'd3:scales and color'](http://www.jeromecukier.net/2011/08/11/d3-scales-and-color/), [Scale Functions](https://www.d3indepth.com/scales/) on [D3 in Depth](https://www.d3indepth.com/), [Using scales](http://jonathansoma.com/tutorials/d3/using-scales/) by [Jonathan Soma](http://jonathansoma.com/tutorials/d3/using-scales/) and [D3 attr(), style() and classed()](https://www.carlosrendon.me/unfinished_d3_book/d3_attr.html) by [Carlos Rendon](https://www.carlosrendon.me/). However, I still didn't understand it very well. That was until I saw [this tutorial](https://www.dashingd3js.com/d3js-scales) on Dashing D3.js.

`
Insert cell
md`

One thing I didn't understant, and one thing that many tutorials didn't touch upon, or worst, keep being misleading, is **What does D3 scale capture, or maintain?**.

[Using scales](http://jonathansoma.com/tutorials/d3/using-scales/) by Jonathan Soma is especially misleading, although I have to say [his tutorial](http://jonathansoma.com/tutorials/d3/) is the best I have ever seen online. Why did I say that his tuorial on D3 scales is misleading? It's because it doen't capture the real purpose of D3 scale.

For example, he used this example where dataset, or input, is [1, 2, 3, 4, 5, 50]. If our svg is only 120 pixels wide, then it's natural to come up with a good idea that the output range be [0, 100], this way, D3 will map input to output on a 1:2 scale.

1 will mapped to be 2, 2 to 4, 3 to 6, 4 to 8, 5 to 10, and 50 to 100. that is, the output will be [2, 4, 6, 8, 10, 100]. Nothing wrong, right?

It's not wrong. It's just misleading because it is a very special case.

`
Insert cell
md`

If that is the case for all d3 examples, then we would think that any number in the output range divided by the corresponding number in the input domain will get the same quotient. **But that is not the case at all!**

For example, if we have this dataset: [55, 98, 25, 68, 109] and we only have a screen that's 1080 pixes wide. You can use your own versions of input domain and output range, but here, let's say we'll set the input domain to be [10, 110]. What should be the output range? Again, you can use **anything** you like, but here, let's say the output range is [5, 350]. Why? No special reason. I just thought about those numbers in my mind.
`
Insert cell
scale.domain([10,110]).range([5,350])
Insert cell
md` The input of 55 will be:`
Insert cell
scale(55)
Insert cell
md` Similarly:`
Insert cell
scale(98)
Insert cell
scale(25)
Insert cell
scale(68)
Insert cell
scale(109)
Insert cell
md`

Now, let's calculate.

`
Insert cell
scale(109)/109
Insert cell
scale(68)/68
Insert cell
scale(25)/25
Insert cell
scale(98)/98
Insert cell
scale(55)/55
Insert cell
md` **You can see that they are not equal to each other at all!! **

If d3 scale functions did not maintain the ratio, what does it maintain? What does it capture?

**The difference!**

I came to this understanding after reading [this tutorial](https://www.dashingd3js.com/d3js-scales), although I have to point it out that the example in this tutorial is also misleading.`
Insert cell
md` What did I mean by "d3 scale is maintaining the **difference**:

If we devide the difference between two numbers in the output range by the difference between two corresponding numbers in the input domain, we'll get (almost) exactly the same quotient.

Let me take the above example to illustrate this:

`
Insert cell
(scale(98)-scale(25)) / (98-25)
Insert cell
(scale(109)-scale(55)) / (109-55)
Insert cell
(scale(68)-scale(22)) / (68-22)
Insert cell
md` In fact, it doesn't have to be the five numbers above. Any numbers within our input domain is okay:

`
Insert cell
(scale(100)-scale(22)) / (100-22)
Insert cell
md` Why 3.45? How did d3 come up with this number. Well, you will know the mechinism after reading [this tutorial](https://www.dashingd3js.com/d3js-scales). Simply because (350-5)/(110-10) = 3.45.

Then, you may ask, why is that we were getting 3.4499999999999997 above, not exactly 3.45?

I don't know.

One thing I noticed is that if the starting point of both the input domain and the output range is zero, then d3 scale is not only preserving the difference, but also the ratio.

For example. Let the input domain to be [0, 66] and the output range [0, 109].
`
Insert cell
scale.domain([0,66]).range([0,109])
Insert cell
(scale(98)-scale(60)) / (98-60)
Insert cell
(scale(25)-scale(7)) / (25-7)
Insert cell
md` Same right?

Also:`
Insert cell
scale(68)/68
Insert cell
scale(35)/35
Insert cell
md` One thing I didn't quite understand is this:

`
Insert cell
109/66
Insert cell
md` Which is not exactly 1.6515151515151514
`
Insert cell
md`
Go to [Day 13](https://observablehq.com/@hongtaoh/day-thirteen-2020-09-06).
`
Insert cell

One platform to build and deploy the best data apps

Experiment and prototype by building visualizations in live JavaScript notebooks. Collaborate with your team and decide which concepts to build out.
Use Observable Framework to build data apps locally. Use data loaders to build in any language or library, including Python, SQL, and R.
Seamlessly deploy to Observable. Test before you ship, use automatic deploy-on-commit, and ensure your projects are always up-to-date.
Learn more