I require more time to go deeper but presently I have no time. Therefore let me take some annotations here for making easier future efforts.
To go to the details, one can go to the more technical book by Pearl himself, Causality. However, it happened I went to browse some chapters of the very good Shalizi's book on statistics. Chapters from the 20th are a reasonable starting point. Shalizi book itself is not fully explicative but a compromise where some theorems are not deminstrated but assumed and make explicit. Nice enough but requiring in any case the appropriate dedication. Shalizi seems to be a voracious reader, and in the bibliography of his chapter 21, he cites some fundamentals work to put in line for a full understanding the topic. His subsequent chapters also enlarge the vision to information theory, and is connections between the science of causal statistics. Cool. While postponing the study and trying to grasp concepts, I fully report the bibliography I came across here below (mostly from Shalizi's).
All of these are also a good reading for those who believe that data science is a practice which springs out from nowhere.
- Chalak, K., & White, H. (2012). Causality, Conditional Independence, an Graphical Separation in Settable Systems. Neural Computation, 1–60.
- Cover, Thomas M. and Joy A. Thomas (2006). Elements of Information Theory. New York: John Wiley, 2nd edn.
- Dinno, A. (2017). An Introduction to the Loop Analysis of Qualitatively Specified Complex Causal Systems (pp. 1–23).
- Guttorp, Peter (1995). Stochastic Modeling of Scientific Data. London: Chapman and Hall.
- Jordan, Michael I. (ed.) (1998). Learning in Graphical Models, Dordrecht. Kluwer Academic.
- Kindermann, Ross and J. Laurie Snell (1980). Markov Random Fields and their Ap- plications. Providence, Rhode Island: American Mathematical Society. URL http://www.ams.org/online_bks/conm1/.
- Lauritzen, S.L., Dawid, A.P., Larsen, B.N., Leimer, H.G. (1990), Independence properties of directed Markov fields, Networks, 20, 491-505
- Lauritzen, S.L. (1996) Graphical Models. New York: Oxford University Press.
- Loehlin, John C. (1992). Latent Variable Models: An Introduction to Factor, Path, and Structural Analysis. Hillsdale, New Jersey: Lawrence Erlbaum Associates, 2nd edn.
- Moran, P. A. P. (1961). “Path Coefficients Reconsidered.” Australian Journal of Statis- tics, 3: 87–93. doi:10.1111/j.1467-842X.1961.tb00314.x.
- Pearl, J. (2000). Causality- Models, Reasoning, and Inference (pp. 1–386). Cambridge University Press.
- Wright,S., The Method of Path Coefficients. Annals of Mathematical Statistics 5:161-215.
- Wysocki, W. (1992). “Mathematical Foundations of Multivariate Path Analysis.” In- ventiones Mathematicae, 21: 387–397. URL https://eudml.org/doc/263277.