Easily listen to Learning Bayesian Statistics

Artwork for podcast Learning Bayesian Statistics

#135 Bayesian Calibration and Model Checking, with Teemu Säilynoja

Behavioral & Social Sciences Episode 135 • 25th June 2025 • Learning Bayesian Statistics • Alexandre Andorra

Speaker: 00:00:05

Today, I'm excited to have Teemu Sainiounoria on the show, a doctoral researcher and data

scientist at Aalto University in Finland.

: 00:00:14

Teemu works within the Bayesian workflow research group led by none other than Aki Betari,

where he focuses on model validation through calibration assessments and predictive

: 00:00:25

checking.

: 00:00:26

In our conversation,

: 00:00:27

Temu dives deep into his research on simulation-based calibration, that you may know under

the acronym SBC, and visual predictive checking.

: 00:00:38

He explains why these methods are essential tools for validating vision models, how

visualizations complement numerical metrics, and common pitfalls to avoid in interpreting

: 00:00:51

these visuals.

: 00:00:53

We'll also explore his recent work on posterior SBC, a novel approach designed to ensure

models are calibrated specifically for datasets at hand, particularly useful when data

: 00:01:05

collection is expensive or limited.

: 00:01:08

A word of caution, you will hear some construction noise on my end for this episode.

: 00:01:14

This is really Murphy's Law in action.

: 00:01:18

I unfortunately had no control over this and I'm sorry about it.

: 00:01:23

But I still kept the episode because, well, Teemu had really great stuff for us, so I hope

that you will still be able to enjoy it.

: 00:01:34

This is Learning Basics Statistics, episode 135, recorded May 9, 2025.

: 00:01:45

Welcome Bayesian Statistics, a podcast about Bayesian inference, the methods, the

projects, and the people who make it possible.

: 00:02:07

I'm your host, Alex Andorra.

: 00:02:09

You can follow me on Twitter at alex-underscore-andorra.

: 00:02:12

like the country.

: 00:02:13

For any info about the show, LearnBasedStats.com is Laplace to be.

: 00:02:17

Show notes, becoming a corporate sponsor, unlocking Beijing Merch, supporting the show on

Patreon, everything is in there.

: 00:02:24

That's LearnBasedStats.com.

: 00:02:26

If you're interested in one-on-one mentorship, online courses, or statistical consulting,

feel free to reach out and book a call at topmate.io slash Alex underscore and Dora.

: 00:02:37

See you around, folks.

: 00:02:39

and best patient wishes to you all.

: 00:02:41

And if today's discussion sparked ideas for your business, well, our team at Pimc Labs can

help bring them to life.

: 00:02:48

Check us out at pimc-labs.com.

: 00:02:54

Teemu Saillunoja, welcome to Learning Patient Statistics.

: 00:03:00

Thank you.

: 00:03:01

Glad to be here.

: 00:03:02

Thanks for inviting Yeah, I'm super happy to have you here.

: 00:03:05

That's great.

: 00:03:06

Well, one more people in the Alto University group is now on the show.

: 00:03:11

Yes.

: 00:03:13

You are really going to collect them all.

: 00:03:15

Yeah, exactly.

: 00:03:16

It's like Pokemons.

: 00:03:19

I guess I hope you guys evolve so that then I can invite you back on the show.

: 00:03:41

Welcome to Learning Basin Statistics podcasting.

: 00:03:47

Thank you.

: 00:03:49

I will edit that and make it like I was the one saying that.

: 00:03:56

So, Temu, I have lot of questions for you today because you do so many things.

: 00:04:00

uh But first, as usual, we're going to talk about your origin story.

: 00:04:07

So can you tell us what you're doing nowadays and how you ended up working on this?

: 00:04:13

Yeah, currently I'm...

: 00:04:16

Finishing up my doctoral thesis at Aalto University in Akivector's Bayesian workflow

research group.

: 00:04:24

uh And I've been focusing on calibration assessments and predictive checking in Bayesian

workflow.

: 00:04:33

So the group is kind of structured so that everyone is focusing on some aspect of the

great workflow.

: 00:04:40

And also earlier this week, I happily

: 00:04:45

joined PiMC Labs, so I'm very excited to see what that adventure is bringing to me.

: 00:04:54

Yeah, yeah, that's great addition to the team, obviously.

: 00:04:59

So obviously I'm not working there anymore, but I've seen you around.

: 00:05:04

So yeah, that's awesome.

: 00:05:06

I was happy to see that.

: 00:05:08

and I am mostly looking forward to see what magic you'll do first there.

: 00:05:16

And actually, how did you end up doing patient's dance?

: 00:05:21

Do you remember when and how you were first introduced to them?

: 00:05:26

Yeah.

: 00:05:28

My first introduction was during my master's studies.

: 00:05:32

I did my master's in...

: 00:05:35

mathematics at Helsinki University and there was this course for advanced inverse problems

and it just happened to be that year that it was held ah as Bayesian inversion I think was

: 00:05:53

the name of the course that year we had a visiting professor, W.

: 00:05:57

Hellin who works ah with spatial modeling and does

: 00:06:05

just a lot of Bayesian side for that.

: 00:06:09

yeah.

: 00:06:11

But it was a very theoretical course.

: 00:06:14

Didn't do almost any, well, we didn't do anything computational until the course was over.

: 00:06:21

And then the course had a project work, which where we then got to do some X-ray

tomography, actually.

: 00:06:30

Like we ourselves got to pick an object and went to a lab.

: 00:06:35

took the X-ray and got the raw data and then ah had to code up our own MCMC sampler in

MATLAB and uh come up with some, come up with the priors and get to know what was inside

: 00:06:51

the object.

: 00:06:54

So yeah, but then there was quite a break.

: 00:06:58

I continued my masters for a year and then I studied, went to industry, did some.

: 00:07:04

data science in an ad tech company and then some time later the opportunity rose for

doctoral studies ah in Aalto and with Akki Vestari.

: 00:07:19

So then that brought me back to base.

: 00:07:24

yeah, yeah.

: 00:07:28

And actually, what drew you specifically to what you're doing nowadays, which is mainly,

as you were saying, calibration assessments and visual predictive checkings for vision

: 00:07:41

workflows?

: 00:07:43

Yeah.

: 00:07:44

Well, of course, it has a lot to do with also your supervisor, but I guess you kind of

gravitate towards where your interests intersect with the expertise of your...

: 00:07:56

of your supervisor.

: 00:07:57

ah Personally, I'm quite a visual thinker.

: 00:08:02

I basically need to have a blackboard by me or a whiteboard or whatever.

: 00:08:07

Something easy to draw on and then low threshold to also just wipe everything off if I'm

not happy with what I wrote there.

: 00:08:17

So that's where the visual predictive checking comes from.

: 00:08:20

ah I really enjoy like a well thought graph.

: 00:08:25

and how visualizations can aid in communication.

: 00:08:31

But then calibration assessment, so simulation-based calibration, which I'm sure we will

touch on more deeply.

: 00:08:41

first it came, this Talz et al.

: 00:08:45

paper was when I was working on this graphical test for uniformity and goodness of it.

: 00:08:54

paper back in 2020.

: 00:08:59

The original SPC paper or the first paper to actually call it SPC was under work and there

you need a uniformity test and that's where the uniformity test that we were making kind

: 00:09:20

of met and found a use.

: 00:09:24

That was my first touch to calibration checking, so assessing the inference in base air

modeling.

: 00:09:34

Nice.

: 00:09:35

That makes sense.

: 00:09:39

I think indeed it's a good time now to try and dive into a bit more of what you're doing.

: 00:09:48

Can you give us an overview of simulation-based calibration?

: 00:09:53

SBC is and why it's so important in Bayesian modeling?

: 00:09:58

Yeah, yeah.

100

: 00:10:01

So simulation-based calibration, we like to call it simulation-based calibration checking

just to underline that we are not actually doing any calibration, of not calibrating the

101

: 00:10:11

model, but checking if the model would be calibrated or if the inference and the model

implementation together would be calibrated.

102

: 00:10:22

kind of you could in a way think of it as a check for your model implementation and

inference algorithm together that they are working as you would expect.

103

: 00:10:39

What I mean by model implementation especially now with PPLs, probabilistic programming

languages, essentially you don't need to write your own

104

: 00:10:50

samplers anymore but you define your model in something that's a bit more accessible uh

and then the sampling algorithm hopefully is already ah there for you made by some other

105

: 00:11:05

brilliant minds.

106

: 00:11:07

So in simulation-based calibration we essentially um check that the way you

107

: 00:11:20

written your model is actually like the goal is to check that you haven't done mistakes in

model implementation and also your inference is working and how we approach this is is

108

: 00:11:31

that we draw first ah realizations from prior you you come up with usually that's an easy

part you you somehow randomly draw from your prior parameter values and then you generate

109

: 00:11:46

prior predictive data from those

110

: 00:11:49

parameter values, which is also usually an easy step.

111

: 00:11:55

if we step a uh bit further away and look at what we have now that we have these parameter

values and some data, well, ah this would actually be, if you condition on the data, then

112

: 00:12:15

the distribution of these parameter values should just be your posterior.

113

: 00:12:19

So and what now we manage to essentially create a posterior sample without any model

fitting.

114

: 00:12:28

But then if we want to now compare if our inference is working properly, we actually run

the inference algorithm to this prior predictive sample and we again receive a posterior

115

: 00:12:42

from our MCMC samples or from our variational inference or whatever we use for having this

116

: 00:12:49

posterior approximation.

117

: 00:12:51

And now we can compare this to the original parameters that we drew from the prior.

118

: 00:12:58

And these should be from the same distribution conditional on the data.

119

: 00:13:04

We repeat this many times and essentially do some test for this if it holds that they are

from the same distribution and ah

120

: 00:13:19

What has been handy is just to rank the prior draw among this posterior sample that

corresponds to that particular prior and that particular draw and that particular prior

121

: 00:13:36

predictive sample.

122

: 00:13:38

And if our inference is working as we would expect and wanted to work, then this rank

should be uniformly distributed.

123

: 00:13:46

So that's where the uniformity test then comes.

124

: 00:13:48

Council saw in that I was mentioning earlier.

125

: 00:13:53

Yeah, and also...

126

: 00:13:58

Can you hear that?

127

: 00:14:00

I heard, yeah.

128

: 00:14:03

You have a very large bee in your office.

129

: 00:14:06

Yeah, exactly.

130

: 00:14:09

So thankfully that's not that.

131

: 00:14:11

But as Murphy's Law states...

132

: 00:14:13

m Anything that can go wrong will go wrong.

133

: 00:14:17

Well, it's uh exceptional because they are doing some remodeling of my building.

134

: 00:14:24

And today was like super calm.

135

: 00:14:27

Nothing, no noise at all.

136

: 00:14:30

And today, exactly as we started recording.

137

: 00:14:33

they started piercing the wall, like, here, on my balcony.

138

: 00:14:39

So, it's absolutely terrible, I apologize for that, but could not, absolutely could not

control that.

139

: 00:14:47

So what I will do though is I will make my questions as brief as possible, especially when

the noise is here.

140

: 00:14:56

So, thanks in advance for your patience, dude.

141

: 00:15:02

142

: 00:15:03

So uh thanks.

143

: 00:15:05

That's a great uh presentation and summary of what SBC is.

144

: 00:15:09

um To make that clear, maybe two things.

145

: 00:15:12

One, um that can be done on the prior specification of the model, but then also on the

posterior um to see that the model is actually well calibrated.

146

: 00:15:23

And I think you recommend doing both.

147

: 00:15:26

And then you can go into that very new paper that you have out with other authors.

148

: 00:15:31

149

: 00:15:33

which is exactly what posterior SPC and how to do that.

150

: 00:15:37

And second, how can people do that?

151

: 00:15:40

Like concretely, listeners who just listened to you and were like, yeah, that sounds good.

152

: 00:15:46

That sounds great.

153

: 00:15:47

I definitely want to try that in my models.

154

: 00:15:50

If I'm using Stan or PIMC or any other PPL, how can I do that?

155

: 00:15:55

Prior SPC or posterior SPC or both?

156

: 00:15:59

Both.

157

: 00:16:00

Both.

158

: 00:16:01

Okay, everything.

159

: 00:16:02

160

: 00:16:02

Well, for prior SPC, it's quite simple.

161

: 00:16:07

If you can code this prior generating, prior predictive generation yourself, then you can

just basically code it yourself.

162

: 00:16:17

But also for both PyMC and for Stan, there are already packages.

163

: 00:16:22

For Stan, there's a package called SPC um by Angie Moon and

164

: 00:16:32

and Shiyoung Kim and Mati Modrak and myself, which essentially ah gives you very good

framework for running SPC and also then assessing the uniformity.

165

: 00:16:46

also it gives you the framework for doing the actual computation, but also then assessing

the results.

166

: 00:16:52

And then for Python, there's a package called Simuk by Arvis developers.

167

: 00:17:00

which also does the same at least for PMC models and BAMBI and I think some other models

too but now I'm not super certain about that but that's also there ready you if you have a

168

: 00:17:18

PMC model you can just give it to the give it to the package and it will do ah run is busy

for you and give you some calibration assessment plots already.

169

: 00:17:29

um This might take a bit while because you need to refit your model multiple times.

170

: 00:17:34

ah In the R package for Stan, you can do parallel inference very easily.

171

: 00:17:46

So essentially SPC, you would be generating these prior predictive samples multiple times

to get an overall assessment for uniformity.

172

: 00:17:57

if you have a cluster or something at hand, can just

173

: 00:18:00

put them there, have these simulation iterations essentially run as parallel as you want.

174

: 00:18:09

For posterior SPC, which I didn't get to even explain, but essentially our new paper, um

this standard way of SPC that I was now describing has a view.

175

: 00:18:26

issues uh or drawbacks.

176

: 00:18:28

Essentially if you have very vague or weak priors and some non-linearities in your model,

you might have regions of the parameter space in the prior that where the model inference

177

: 00:18:40

is somehow not working.

178

: 00:18:43

Like you have very pathological parts of your prior space which you would maybe not even

see in your real data.

179

: 00:18:52

Maybe you would, maybe you wouldn't, but

180

: 00:18:55

Anyway, so prior SPC might give you essentially a false positive saying that, well, in a

way false positive if you then would go and fit your model to data that would not produce

181

: 00:19:09

a posterior in this problematic area.

182

: 00:19:13

So we have been developing posterior SPC, which in its simplest form, you just replace the

prior

183

: 00:19:23

from private specie with posterior and you create posterior predictive samples.

184

: 00:19:28

So you essentially focus in the area of the parameter space that is most of interest to

you with your data set.

185

: 00:19:37

uh And if you're worried about um using all of your data and then creating more data, ah

you can only use part of your data and do essentially

186

: 00:19:51

some kind of cross-validation, so using some random partitions of your data.

187

: 00:19:57

But that's been very short the idea of PostUoSPC.

188

: 00:20:03

For PostUoSPC, you need to make a little bit of extra coding around these packages.

189

: 00:20:10

I think Simuk is not yet supporting PostUoSPC, but I have had talks with Osvaldo Martin,

who is actually the...

190

: 00:20:20

developer for Simuc so that we could add post your SPC option for Simuc also.

191

: 00:20:29

And in the SPC package you can run post your SPC ah by defining your data generating

process so that you create the data not from prior predicted draws but from post your

192

: 00:20:46

predicted draws.

193

: 00:20:47

194

: 00:20:49

bit of hand coding but at least you are given quite clear pieces on where to do the

changes.

195

: 00:20:58

And in our paper I also have a github repository with examples on how to do this.

196

: 00:21:07

Yeah, and I put that so these two packages are in the show notes both for R and Python.

197

: 00:21:15

So if I understand correctly, posterior SBC is a bit more computationally intensive than

prior SBC.

198

: 00:21:23

Is that right?

199

: 00:21:25

Yes, but only by one model fit.

200

: 00:21:28

Yeah, because you need to have the posterior and then you draw posterior predictive

samples and then brief it to that.

201

: 00:21:35

And of course, if you have a very large data set and you have your posterior and then you

create posterior predictive samples and while the posterior updating usually happens so

202

: 00:21:46

that you then just give this essentially double sized data set and take that.

203

: 00:21:53

if you're

204

: 00:21:56

model fitting is scaling badly by the size of your dataset then it might be

computationally more expensive or then you need to not use your whole observation dataset.

205

: 00:22:08

But yes.

206

: 00:22:11

And so do you recommend always doing both prior and posterior SPC?

207

: 00:22:19

Not necessarily.

208

: 00:22:21

Prior SPC, if it's very hard to reason about your priors, like you have a very complex big

model and setting these priors is not necessarily.

209

: 00:22:32

um For example, you have a very large data set, so your data is anyway going to dominate

in your procedure and your prior is kind of...

210

: 00:22:49

not that impactful, then you could consider just running PostU SPC and then using

essentially focusing on your computational time just to the region that you are actually

211

: 00:23:02

interested in.

212

: 00:23:07

Okay.

213

: 00:23:08

Yeah.

214

: 00:23:08

Yeah.

215

: 00:23:09

see.

216

: 00:23:10

That's, that's useful.

217

: 00:23:11

And like these kind of practical, practical advice.

218

: 00:23:16

And I'm curious also what, are some of the challenges you faced um in developing posterior

SBC uh and even prior SBC and how did you overcome them?

219

: 00:23:31

Like when you came up with the methods?

220

: 00:23:34

Well,

221

: 00:23:36

I think one of these challenges is then, ah well, possibly the inference time.

222

: 00:23:43

Like, ah well, you are doing multiple model fits.

223

: 00:23:47

So ah sometimes, especially if you are trying to build a model and it's not necessarily

the final model.

224

: 00:23:59

So you're still also in the process of iterating with your models.

225

: 00:24:02

226

: 00:24:04

stopping to run SPC might not be ah the optimal ah workflow.

227

: 00:24:12

ah So there you might look at running just very small, short number of simulations just to

get like an idea that okay, at least everything is somewhat okay.

228

: 00:24:30

Like you would detect some gross

229

: 00:24:32

grows problems quite quickly.

230

: 00:24:36

But also what we noticed that when you're only fitting your model once, it doesn't

necessarily impact you so much if the very start of the sampling is slow.

231

: 00:24:52

But then when you're 500 or 1000 refits of your model, yeah.

232

: 00:24:59

So for example,

233

: 00:25:03

being smart with how you initialize your MCMC chains might actually come and affect you

quite a lot.

234

: 00:25:10

in the paper for one example, I use Pathfinder to get initial states for the chains and

then only start MCMC.

235

: 00:25:20

So it kind of, requirements for computation ah kind of gave me challenges that I wouldn't

have.

236

: 00:25:31

otherwise considered but they were actually challenges that you would kind of have anyway

if you're fitting your model.

237

: 00:25:40

Yeah, yeah.

238

: 00:25:41

For sure.

239

: 00:25:42

I don't think there is that many more challenges anyways because of simulation-based

calibration.

240

: 00:25:55

mean, the only challenge is you need to feed the model multiple times.

241

: 00:26:00

But that's the main bottleneck, right?

242

: 00:26:02

Because otherwise, if you have sampling problems, it's not because of SPC, it's because of

the model.

243

: 00:26:09

um So yeah, like I can guess that, yeah, if you need um some form of variational inference

to sample the model, then you'll need that also for SBC.

244

: 00:26:24

Yeah.

245

: 00:26:25

But here we were just in like kind of running a very short pathfinder chain to come up

with initial values for the MCMC.

246

: 00:26:37

So we were then anyway, in the end, fitting the model with.

247

: 00:26:43

And how, like, yeah, what's your experience doing that actually?

248

: 00:26:46

Because I know that's what Bob Carpenter intended when he developed um Pathfinder.

249

: 00:26:54

So basically to use Pathfinder as an initialization for Nuts.

250

: 00:26:57

um In your experience, that help a lot?

251

: 00:27:02

And in which circumstances?

252

: 00:27:06

Like this particular example was a ODE model where the, especially with the priors, which

kind of at first look they look sensible.

253

: 00:27:18

So when you're setting the priors, kind of have, you have reasons why you put the priors

you have there.

254

: 00:27:25

But what happens is that you quite often generate these um multimodal posteriors where

there's a very large

255

: 00:27:35

like in area very large posterior mode and then a very small very high peak in the

density.

256

: 00:27:46

That's actually the one that you would find more reasonable for like this was a ODE model

for a lockable terra situation so predator prey model and the very large

257

: 00:28:04

mode was essentially just a posterior mode for where most of the variation in your data

was just measurement error.

258

: 00:28:15

So not necessarily the mode that you would want to explore so much.

259

: 00:28:21

So Pathfinder was very good at getting the change to start from the mode that gives you

260

: 00:28:31

gives you a posterior that actually finds the seasonal trends and dynamics between the

species.

261

: 00:28:41

And also Akivehtari has another case study, this birthday case study, which is in the, if

you look at the cover of the base and data analysis book, the third edition, uh there's a

262

: 00:28:57

picture from that case study and there also

263

: 00:29:00

They have a big GP model where they do also Pathfinder to get good initial values for the

chains.

264

: 00:29:11

Nice, yeah, this is super cool.

265

: 00:29:13

Yeah, I can guess how um challenging the audio sampling must have been.

266

: 00:29:20

uh And is that actually related to why SBC is also interesting uh in amortized patient

inference?

267

: 00:29:31

um Can you maybe elaborate a bit on that and tell us even how that's important in these

settings?

268

: 00:29:41

Yeah, well, I'm not a big fan of The reason why we really like SPC there is slightly

different.

269

: 00:29:53

um We were lucky to have Marvin Smith, who your listeners might already know, um visiting

Aalto University as he actually recently defended his thesis.

270

: 00:30:10

um very successfully.

271

: 00:30:13

um He was co-supervised by Aki and Paul Berkner.

272

: 00:30:19

So he was visiting Aalto and he's of course an expert on amortized space and inference.

273

: 00:30:24

um yeah, in amortized space and inference essentially what SPC is good for there is that

you don't have for amortized space and inference these

274

: 00:30:38

these convergence checks, called standard convergence checks that we have for MCMC, R-HATS

and such.

275

: 00:30:47

how do you know the quality of your posterior for the data set that you then observe and

the...

276

: 00:30:55

When you've gone through all the trouble of training your posterior, for your neural

network to give you a...

277

: 00:31:07

posterior approximation.

278

: 00:31:08

Well then you can run posterior SPC when you have a new data set.

279

: 00:31:16

When you get an observation for an amortized Bayesian, what do you call it, amortized

Bayesian model, something that you have online already ready to run inference.

280

: 00:31:26

And in amortized Bayesian inference, of course, the inference part is almost instant.

281

: 00:31:31

And also making predictions after you have this in inference is also almost instant.

282

: 00:31:36

So then you just, well, you have maybe 500 times something that's almost instant.

283

: 00:31:42

Still, it doesn't take too long.

284

: 00:31:43

So it's very fast to run SPC or posterior SPC.

285

: 00:31:49

kind of we don't have the main drawback of SPC in amortized base and inference.

286

: 00:31:56

And we are also missing some standard good checks for the quality of our posterior.

287

: 00:32:03

So here.

288

: 00:32:05

Post-USPC is actually quite useful for checking if your model is calibrated with the new

data or is this perhaps out of distribution observation and we don't have any guarantees

289

: 00:32:18

of the quality of the post-USPC then.

290

: 00:32:22

Also, prior SBC works with amortized base and inference.

291

: 00:32:25

Before you see any data, you can check how your parameter recovery, for example, is

working.

292

: 00:32:33

Okay, yeah.

293

: 00:32:33

I mean, that makes sense.

294

: 00:32:34

Yeah.

295

: 00:32:35

been a lot.

296

: 00:32:37

No, no.

297

: 00:32:37

All what you're saying makes sense.

298

: 00:32:40

And actually, I think SBC definitely makes a ton of sense.

299

: 00:32:44

ah Sorry, but the noise again.

300

: 00:32:48

301

: 00:32:50

Yeah, SBC makes a ton of sense in the armatization framework for sure because you already

have the ability to just sample once you have trained the neural network you have the

302

: 00:33:01

ability to just sample from the posterior distribution in a matter of seconds.

303

: 00:33:07

It's just for free so you know why not do SBC?

304

: 00:33:10

Yeah, you've already paid the cost so you can just reap the benefits.

305

: 00:33:15

Right, yeah, exactly.

306

: 00:33:17

307

: 00:33:19

So actually, also something I wanted to ask you about, because you said you were doing

that a lot.

308

: 00:33:25

And I certainly am also a visual learner and visual thinkers, having a blackboard, a

notepad.

309

: 00:33:34

I always have a notepad with me, blackboard when I can.

310

: 00:33:38

it's really how I think also, when I need to understand some concepts, seeing the code,

seeing the formula is actually extremely helpful.

311

: 00:33:47

to me and so you do a lot of work that I definitely appreciate on visual predictive

checking.

312

: 00:33:53

And you've released recently a set of guidelines for visualizations, which I put in the

show notes.

313

: 00:34:01

I definitely encourage people uh to look into that because that's also my use that all the

time at my work.

314

: 00:34:08

So the work you do and also with Osvaldo, Osvaldo Martin works a lot on that with the new

version of RVs.

315

: 00:34:15

And he also has

316

: 00:34:17

uh He is this project online of an online book about exploratory analysis of patient

models, where he demonstrates all the cool thing you can do with Arviz once you have fit a

317

: 00:34:30

patient model.

318

: 00:34:31

But first, can you tell us why is visual checking so crucial in the patient monitoring

workflows?

319

: 00:34:39

Because there are certain needs, at least in the classical machine learning world, much

more of an emphasis on

320

: 00:34:47

statistical metrics much more than on the plants and

321

: 00:34:51

and even sometimes I've encountered maybe even a distressed of plots because they are

visual whereas metrics seem more objective because you they are numbers.

322

: 00:35:07

you are not the one making the interpretation you just have a number and some threshold.

323

: 00:35:12

Exactly whereas a plot you always like often I've encountered often people who can be

shocked by that

324

: 00:35:22

By the fact that in the Bayesian framework we use plots a lot and that seems very

subjective to them.

325

: 00:35:31

Well, like if you're doing kind of by the book the workflow, you should be having a prior

predictive plot even before you look at your data at all.

326

: 00:35:40

kind of what might be very odd for a more kind of frequentist machine learning side.

327

: 00:35:49

Well,

328

: 00:35:50

Visualizations, like my view on that, why they are so important, I like instead of just

having like some kind of numerical assessments is well a very good example is this

329

: 00:36:08

unscromed squarted where you have essentially very different datasets all giving the same

summary statistics.

330

: 00:36:15

Very classical example where visualizations gives you immediately.

331

: 00:36:20

something you were missing with numbers.

332

: 00:36:24

And then in kind of Bayesian modeling, visual predictive checking comes into two stages of

the workflow.

333

: 00:36:33

Like as I said, first with prior predictive checking.

334

: 00:36:36

So before looking at your data at all, you can do prior predictions and see what you are

actually predicting from the model and assessing your priors.

335

: 00:36:50

by through that.

336

: 00:36:52

And then once you have done fitting, then you can do course, post-re predictive checking,

comparing your post-re predictions to your data, which sometimes is frowned upon because

337

: 00:37:04

you're kind of double using the data ah in that you're not predicting out of sample.

338

: 00:37:11

But if that's very much a worry, you can do live on our predictions quite easily.

339

: 00:37:17

These days you have

340

: 00:37:19

SisLU to give you with one model fit ah good approximations of what you would be

predicting with this LIBONOUT uh predictive distribution.

341

: 00:37:32

And then also kind of on the same SisLU also gives you an estimate of how good this

approximation is.

342

: 00:37:42

that's quite a low hanging fruit for

343

: 00:37:46

for predictive checking if you are worried for predicting on an observation that you've

used in fitting your model.

344

: 00:37:53

I think I don't remember the original question anymore, but yeah.

345

: 00:37:59

were at least the thoughts that the question was raising in me.

346

: 00:38:04

Yeah, yeah, yeah.

347

: 00:38:05

No, that's great.

348

: 00:38:06

that's, I think you...

349

: 00:38:08

You answered my question, which was basically, know, can you explain why visual checks are

important?

350

: 00:38:13

And basically your point is, well, they are important because they don't substitute two

metrics, but they complement them.

351

: 00:38:22

so you might miss things that you, if you only look at metrics, you will miss things that

you won't miss with plots and vice versa.

352

: 00:38:32

Especially assessing your prior predictive distribution with just

353

: 00:38:38

metrics could be, well I've never tried that but that could be quite a challenging task

whereas visual predictive checking is quite...

354

: 00:38:51

Well, it's quite visual.

355

: 00:38:52

You see what you're doing.

356

: 00:38:54

Yeah, yeah, So I think it's less of a problem.

357

: 00:38:58

Like once you're doing Bayesian stance, you have to do prior checks.

358

: 00:39:03

But I think it's less of a problem for new people in the Bayesian world because they don't

have any anchor.

359

: 00:39:10

Because by definition, they didn't have priors before.

360

: 00:39:14

So it's not that they come and they are used to doing something with the prior samples.

361

: 00:39:20

Where the switching thinking might need to happen is when you have the Poiseuille samples,

because then you have a whole zoo in the classic machine learning world of metrics, uh

362

: 00:39:34

which are like, it's a very rich field of the literature.

363

: 00:39:40

And so here people are more anchored and you might not always have, you know, one-to-one

364

: 00:39:49

comparison of the matrix.

365

: 00:39:51

for instance, the calibration of a Bayesian model to me is much more easy.

366

: 00:39:59

much more intuitive to the calibration of very frequentist model because you don't have

the binning to do and so on.

367

: 00:40:06

And actually now we have the new calibration plot in Arviz, in the new version of Arviz

1.0.

368

: 00:40:12

In Arviz plots, there is this new calibration plot that I use all the time.

369

: 00:40:18

this is super useful.

370

: 00:40:20

And you can, since it's based on the Bayesian uh

371

: 00:40:25

ETIs or HDIs, well then you can actually interpret it as, well, 90 % of the data was in

the 90 % interval.

372

: 00:40:34

So this is well calibrated.

373

: 00:40:37

So yeah, in a frequentist model, if I remember correctly, that's not really the

interpretation you can make of it.

374

: 00:40:43

So there is some stuff like that where you need to be careful how you explain it to

stakeholders.

375

: 00:40:51

Yeah.

376

: 00:40:51

And in those cases, maybe that's also kind of the

377

: 00:40:54

You said the mistrust to visualizations.

378

: 00:40:57

If that's the case that your visualization is showing a 95 % interval and then it's not

actually what it's supposed to show or the interpretation is not the first one that you

379

: 00:41:11

would think of.

380

: 00:41:13

That kind of gives you, might make you mistrust visualizations a bit more.

381

: 00:41:19

That's true.

382

: 00:41:19

Yeah, that's a point.

383

: 00:41:21

Yeah, I like that.

384

: 00:41:22

um And actually, could you summarize your main recommendations on choosing visualization

types based on data characteristics and that?

385

: 00:41:32

Like that question is directly basically making you summarize the blog post of yours that

I've...

386

: 00:41:38

Well, it's not even a blog post.

387

: 00:41:39

I think it's a paper you've submitted uh that's still in review, uh to the best of my

knowledge.

388

: 00:41:47

So that's in the show notes.

389

: 00:41:48

390

: 00:41:49

But yeah, free to um tell us the summary of that.

391

: 00:41:54

You can also share your screen and share the paper if you want for people watching on

YouTube.

392

: 00:41:59

But yeah, basically, you give us the rundown for that?

393

: 00:42:04

Yeah, yeah.

394

: 00:42:06

I don't have the paper right now, so I'm trying my best.

395

: 00:42:09

I think the summary in short is that we look at...

396

: 00:42:19

and that what people usually use for visualizations and where we find the most common kind

of pitfalls or chances for issues.

397

: 00:42:29

And as you said, the recommendations are kind of based on what your data characteristics

would be.

398

: 00:42:34

ah If you're looking at continuous data, very often you would use kernel density, just

density plots, or maybe a histogram, which might be very fine.

399

: 00:42:48

400

: 00:42:49

But if your data is bounded, usually the default density plot implementations might give

you a little bit of a...

401

: 00:43:02

They don't do very well with bounds, like strict bounds in the most common implementation.

402

: 00:43:08

There are packages like in R-site, there's ggdist that automatically tries to detect if

your data is bounded and adjust your KDE to actually do this.

403

: 00:43:18

uh boundary correction.

404

: 00:43:21

But if you don't make a boundary correction and you are unlucky enough to use an

implementation that doesn't do that, your visualization is...

405

: 00:43:32

Well basically what you see is not what you get or what you have.

406

: 00:43:36

So your visualization is misleading you a bit.

407

: 00:43:40

The model you're fitting...

408

: 00:43:42

Or the model...

409

: 00:43:43

If you think of your visualization of a model of the data to summarize what you're seeing.

410

: 00:43:47

what you're having.

411

: 00:43:49

That model is biased or miscalibrated in some aspects.

412

: 00:43:54

And then, so kind of, my most important recommendation was just be to think of your data a

bit and perhaps use ah two different, for example, two different visualization methods and

413

: 00:44:10

see if the conclusion you draw would be different.

414

: 00:44:14

Because, yeah.

415

: 00:44:16

uh thinking a bit more.

416

: 00:44:18

Then if you go to discrete data, discrete is a bit challenging.

417

: 00:44:23

You have rootograms which would be a good visualization for count data, especially if you

have a large range of counts.

418

: 00:44:36

If you have a very large range of counts, you could almost very often you can just use a

continuous visualization to give you a summary of the data.

419

: 00:44:46

But then if you have discrete data with small number of individual states, then most

visualization packages, especially for predictive checking, em because if you're just

420

: 00:45:05

looking at the data, then a bar graph is usually what you would use for just kind of a

summary of the discrete data if you're using just the 1D.

421

: 00:45:15

visualization.

422

: 00:45:17

But then once you're doing predictive checking the bar graph is not anymore ah very

useful.

423

: 00:45:24

The only information you gain is essentially that is your model doing as well or worse

than an intercept only model.

424

: 00:45:35

which we saw in the paper ah also as an example.

425

: 00:45:41

Yeah, this is a very good and practical paper.

426

: 00:45:45

uh Really the kind of paper I really love.

427

: 00:45:48

So thank you for doing that.

428

: 00:45:49

um That's super helpful.

429

: 00:45:52

And I definitely encourage people to take a look at it because...

430

: 00:45:57

That's how to make justice here on the podcast.

431

: 00:46:02

Yeah, it does actually look like a blog post, like you said, a bit.

432

: 00:46:07

Because it's done in Quarto, it's an HTML page.

433

: 00:46:12

It's for this journal of visualization and interaction, which has a totally open review

process.

434

: 00:46:19

The review of the paper is a GitHub review.

435

: 00:46:23

Everything is happening in GitHub through issues.

436

: 00:46:26

So we thought that this is an excellent thing to pursue this open review where and this

kind of more ah like not to be chained to PDFs essentially because especially for

437

: 00:46:50

visualizations and if you would have any interactions ah these days

438

: 00:46:56

It's quite rare to anyway print your paper or actually read a journal, like a paper

journal.

439

: 00:47:02

So why not use something a bit more feature rich.

440

: 00:47:13

Really, people don't read journals anymore.

441

: 00:47:15

That's weird.

442

: 00:47:20

but and actually something I recommend people to do is like printing because the paper is

organized um around different types of data.

443

: 00:47:31

like any like every types of data are a new section.

444

: 00:47:35

So something I recommend people to do that I'm going to do at work actually, because it's

the kind of paper you want to have your favorite tab.

445

: 00:47:42

So, but even better, you can like print in A3 each section, you know, so that you have the

example of the plots.

446

: 00:47:51

And so that way you can have that on the walls of your office.

447

: 00:47:57

then like, each time you work with, you know, order data, boom, have the poster right here

and you can use that.

448

: 00:48:03

Or normal data, boom, you have it here.

449

: 00:48:06

So, and I think this is like, at least for me who

450

: 00:48:11

really learn like that, that will be super helpful.

451

: 00:48:13

So I recommend people to do that.

452

: 00:48:15

I'm definitely going to do that.

453

: 00:48:18

So anyway, it's going to end up in a physical form.

454

: 00:48:24

Yeah, exactly.

455

: 00:48:25

Yeah.

456

: 00:48:26

Yeah.

457

: 00:48:26

And so like for your future papers, think about that.

458

: 00:48:29

They will like, how will people consume the paper?

459

: 00:48:33

Yeah, I'll try to have a poster format also.

460

: 00:48:36

Yeah, exactly.

461

: 00:48:39

Actually, I think I remember you mentioning, em not here, but I think I've seen you maybe

write in this paper that visualizations are kind of like models themselves.

462

: 00:48:53

uh Can you explain this perspective and how does it help improve how you think about

predictive checking?

463

: 00:49:03

Yeah, yeah.

464

: 00:49:05

Well,

465

: 00:49:06

This is most clear when you're looking at the density plot because I would be very

surprised if it wouldn't be the case, but when you're making a density plot, you're

466

: 00:49:19

essentially fitting a KD kernel density estimate with Gaussian kernels uh running some

heuristic of deciding the bandwidth for the kernel and then plotting the density of that

467

: 00:49:33

density estimate.

468

: 00:49:33

So you are actually

469

: 00:49:35

literally fitting a model to your data.

470

: 00:49:37

ah But also if you think of a histogram, you could think of this as a step function to

approximate your density.

471

: 00:49:48

Also, your data density is just a step function.

472

: 00:49:50

ah So in that sense, once you think of visualizations as models fit on your data, then you

have goodness of fit tests and you can actually assess that is this visualization

473

: 00:50:06

representing my data trust like truthfully or is there some bias or something that's

missing.

474

: 00:50:16

For example in the case of boundedness or maybe you have a data set that's otherwise

continuous but you have some point masses or something like this and then KDE would not do

475

: 00:50:30

well with steps or point masses in your data set.

476

: 00:50:37

So then this is more, we give a recommendation in the paper that's more for kind of people

developing the packages for visualizations because you have quite lightweight checks for

477

: 00:50:50

goodness of fit.

478

: 00:50:51

So you could have when implementing a visualization, you could just have under the hood a

goodness of fit test and give the user a warning if there's something very bad.

479

: 00:51:04

Let them know that, hey,

480

: 00:51:06

We saw that you're trying to visualize something that it might be the case that your data

is actually bounded or discrete and you're using a continuous visualization.

481

: 00:51:15

So take this into account.

482

: 00:51:17

of proceed with caution.

483

: 00:51:20

Right.

484

: 00:51:20

Yeah.

485

: 00:51:21

Yeah.

486

: 00:51:21

I saw as well though, did that for instance, in the new RVs when you do some plots and

these are binary data, for instance.

487

: 00:51:32

I don't remember exactly which plot, but then it will give you a warning if it sees its

binary outcome data.

488

: 00:51:39

It will output a warning and tell you, maybe you want to...

489

: 00:51:44

We see you have binary outcomes.

490

: 00:51:47

Maybe you want to use plot-pattern calibration plot instead of that one.

491

: 00:51:53

I think it's actually the calibration plot.

492

: 00:51:55

I don't remember, but this warning is Most likely it is.

493

: 00:51:59

Yeah.

494

: 00:52:00

We were lucky to have Osvaldo join the BASEM workflow group in January.

495

: 00:52:08

So we've had discussions on Arvis and Baseplot, which I'm then developing also.

496

: 00:52:14

I've been contributing to Baseplot quite a lot recently.

497

: 00:52:16

ah yeah, these are kind of being developed in uh a uh kind of somehow parallel, but also

having discussion with each other.

498

: 00:52:31

Yeah, and what we've seen as kind of the most common mistake, so in Aalto we have this

annual Bayesian data analysis course for I think roughly three and a half hundred, like

499

: 00:52:48

350 students every year start the course.

500

: 00:52:51

So it's quite a large course and then we have a project work for the course where the

students go through Bayesian workflow.

501

: 00:53:00

and give a presentation of their data analysis essentially.

502

: 00:53:03

And then ah the default post-seo predictive check in currently that you get, which is

partly my fault because I need to change it in baseplot, but what you get for example BRMS

503

: 00:53:23

is a KDE plot where you have overlaid your data as a KDE and then your

504

: 00:53:29

post your prediction samples, a couple of those KDE's.

505

: 00:53:33

And then a lot of these projects have binary response variables.

506

: 00:53:38

So you just have, you would have just zeros and ones and you have a KDE for that.

507

: 00:53:42

So you are not getting, you're getting a very odd choice of visualization.

508

: 00:53:49

And on top of that, you're not getting any additional information aside from just, are you

doing better than an interceptor on the model?

509

: 00:53:58

And are you doing worse than an interceptor numeral actually?

510

: 00:54:04

Yeah, so in that case, we also plan to have for base plot a warning that, now this might

not be what you want to do.

511

: 00:54:16

Yeah, I think that's cool.

512

: 00:54:19

And I think in the future, a warning to try and improve would be the one on the Pareto K

shape issue in the compareLueCv function.

513

: 00:54:29

Because it's very.

514

: 00:54:32

It's very technical, I think when users see that, they don't really know what to do as the

alternative.

515

: 00:54:41

It's like, but I don't even know what that part of K shape means.

516

: 00:54:46

Yeah, that is...

517

: 00:54:48

Yeah, and this is one thing that we also want to do is have warnings that then...

518

: 00:54:57

The warning itself would be quite short, but it would have a link, but hey.

519

: 00:55:02

For more information, here's a vignette.

520

: 00:55:06

Go look at the documentation page, and this is where we explain it.

521

: 00:55:10

Give an example that this is what's happening.

522

: 00:55:13

Yeah, Fisher.

523

: 00:55:16

That would be helpful.

524

: 00:55:22

In general, I'm curious, how do you approach uncertainty visualization in your projects?

525

: 00:55:29

And why do you think, and do you think it's overlooked by practitioners or not?

526

: 00:55:34

uh What is the state of visualization around uncertainty so far in your eyes?

527

: 00:55:42

ah That's a very good question.

528

: 00:55:50

It's of course very central for Bayesian modeling, but it's also for, especially if you

have a user that's coming from

529

: 00:55:59

from, like, is not trained in Bayesian statistics or doesn't have a lot of experience with

probabilistic modeling.

530

: 00:56:08

So then this might be something that kind of comes as a surprise or as something that's

hard to interpret.

531

: 00:56:17

that's when thinking of visualizations, it's...

532

: 00:56:24

in general when thinking of visualizations, it's very important to think of your target

audience.

533

: 00:56:28

Like for example in the paper we don't talk of ECDFs as visualizations for your data, uh

but these are quite commonly used in some fields and then for that audience that would be

534

: 00:56:45

a good visualization.

535

: 00:56:50

So in that sense uh

536

: 00:56:54

thinking of visualizations in general is very much thinking of your audience and what you

want to convey and then uncertainty visualizations.

537

: 00:57:05

Yeah, there are some kind of basic or not very basic actually, some mistakes that you do

very easily.

538

: 00:57:15

Like for example, you have a predictive model and you show them

539

: 00:57:22

for continuous predictions, you show the posterior mean and then some central interval

around it.

540

: 00:57:28

And then there was this very recent example of a hurricane in the States and you do that.

541

: 00:57:39

And it looks like this massive cone that's going to go over the land.

542

: 00:57:44

The visualization is not conveying that actually what the model is trying to say is that

we have multiple possible

543

: 00:57:52

paths of these predictions and it's going to be one of these.

544

: 00:57:56

So you should be using instead of this kind of natural default you should be possibly just

giving a collection of lines, individual lines and showing that one of these it could be

545

: 00:58:11

any of this.

546

: 00:58:12

um So it's not easy the topic of uncertainty visualization.

547

: 00:58:22

I think Matthew Kay, Jessica Halman, they do excellent work on this topic.

548

: 00:58:26

uh So, yeah, and I believe at least Jessica Halman, think you've had as a guest also.

549

: 00:58:35

Yes, yeah, I did.

550

: 00:58:37

And I will put her episode in the show notes.

551

: 00:58:40

552

: 00:58:43

May have had day and met you I think at least we were in contact I have to check if I

already had him on the show.

553

: 00:58:51

Yeah, he's a very busy person Hey, yeah, we were at least in contact.

554

: 00:58:56

That's what I can tell you about and um Actually, what do you so I'll start playing this

out here because I know it's late for you, but I want to pick your brain on

555

: 00:59:09

the trends that you see shaping the near future of probabilistic modeling and also where

would you like your research to go next?

556

: 00:59:19

Yeah, if only I could see the future.

557

: 00:59:22

think amortized inference is definitely going to become popular, even more popular ah than

it is now.

558

: 00:59:30

Now, for example, for base in experimental design, that's a...

559

: 00:59:36

you can essentially adjust your experiment on the fly when you're getting data, which

would not be possible with MCMC.

560

: 00:59:48

So there what we have done with SPC and hopefully other people also come up with good

diagnostics for validating then these um

561

: 01:00:05

posterior affirmation in amortized patient inference.

562

: 01:00:12

What other cases do you see for amortized patient inference?

563

: 01:00:17

like here, that would be like online learning and change of the design analysis, if I

understood correctly.

564

: 01:00:26

Do you foresee any other cases where it will be particularly helpful?

565

: 01:00:32

Well...

566

: 01:00:33

It's not my expertise, but I would say that, well, essentially, when you need to have a

very fast posterior inference, this, well, which essentially is this online learning or

567

: 01:00:48

something like quick decision making, perhaps some autonomous, I don't know, like robots,

like something where you need to be very fast.

568

: 01:01:00

And it's essentially

569

: 01:01:03

um worth the...

570

: 01:01:07

it's better to pay the cost of computation in advance and then...

571

: 01:01:12

um For me, I would...

572

: 01:01:19

in my mind that sounds like a very very good feature to have in your pocket as a modeler.

573

: 01:01:27

But if needed, if this is my use case, I would have at least the

574

: 01:01:34

some knowledge or some basic ability to also do this, like have a bit amortized model in

my back pocket for that need.

575

: 01:01:47

Yeah.

576

: 01:01:48

Yeah.

577

: 01:01:48

So cases where it's worth paying the cost of inference upfront.

578

: 01:01:56

And then once you have trained the neural network,

579

: 01:01:59

you will get uh posterior uh samples for free.

580

: 01:02:06

You still have to pay the cost of training the neural network, which can be more costly

than training M-SimC for a lot of models.

581

: 01:02:17

Here, have to see if that works in your case, folks, because sometimes it will be even

longer to use our anticipation inference than M-SimC.

582

: 01:02:28

You have to feed the requirements basically.

583

: 01:02:33

And on the MCMC side also like, what like with NathPy for example now, advances on the

MCMC side are also not to be discounted.

584

: 01:02:47

And also like there's this very recent, there's a lot of interesting stuff happening with

running a lot of very short chains in parallel.

585

: 01:02:57

which also sounds quite promising and interesting.

586

: 01:03:05

Like, usually the default is running for change, but what if you run 400?

587

: 01:03:16

Yeah, for sure.

588

: 01:03:17

That's interesting.

589

: 01:03:20

And actually now you can do...

590

: 01:03:23

591

: 01:03:24

adaptation of the chains with normalizing flows in NutPy, and then NutPy will use that as

the initialization for MCMC.

592

: 01:03:33

So these can be somewhat similar, like, you know, in the idea to amortized inference.

593

: 01:03:41

It's not exactly the same, but yeah, like basically you would need to train normalizing

flows and then use that for MCMC.

594

: 01:03:48

NutPy does that out of the box for you.

595

: 01:03:52

and earn other food.

596

: 01:03:54

For both Stan and Pintsy, again!

597

: 01:03:58

it needs to be a model that has a particularly complicated posterior geometry because you

need to try a neural network first.

598

: 01:04:06

that's not a complicated enough model for MCMC.

599

: 01:04:10

It will still be much faster to run MCMC than run the neural network and then MCMC.

600

: 01:04:18

it's not going to always be useful, but for some cases, it's going to be a game changer.

601

: 01:04:26

Yeah, knowing which tool to use for which case.

602

: 01:04:31

Now essentially the toolbox is getting more and more tools in it.

603

: 01:04:36

Yeah, that's true.

604

: 01:04:41

605

: 01:04:42

Yeah, and actually I already had Matthew Kay on the show, can confirm.

606

: 01:04:47

There's this episode 66, that's in the show notes, folks, so if you wanna listen to that.

607

: 01:04:52

And Jessica Hullman was also on the show, episode 73, and that's also in the show notes.

608

: 01:05:00

yeah, show notes are big for these episodes, so that's awesome, I'm happy about that.

609

: 01:05:04

That's a good sign.

610

: 01:05:05

611

: 01:05:07

Awesome.

612

: 01:05:08

um Temu, anything to add before I ask you the last two questions?

613

: 01:05:14

The traditional two questions?

614

: 01:05:17

those ones.

615

: 01:05:18

Yeah.

616

: 01:05:21

No, think you've had very, very good questions.

617

: 01:05:26

Thank you for those.

618

: 01:05:27

619

: 01:05:29

Thank you.

620

: 01:05:31

I tried, but you know, I've had five years of training, so that's a good sign that I made

progress on that front.

621

: 01:05:39

See you, podcast host.

622

: 01:05:42

Exactly.

623

: 01:05:45

Awesome.

624

: 01:05:45

Well, Teemu, that was great.

625

: 01:05:47

But of course, before letting you go, I'm going to ask you the last two questions I ask

every guest at the end of the show.

626

: 01:05:54

So first one, if you had unlimited time and resources, which problem would you try to

solve?

627

: 01:06:00

Yeah, I've been because I've been listening to the podcast for quite a while now.

628

: 01:06:06

And I've been thinking that if I would be put on this spot.

629

: 01:06:11

ah I would probably come up with a very boring answer of word piece or something.

630

: 01:06:17

But then now, actually today, ah earlier today, I was walking my dog and this came to my

mind that it's not necessarily such a boring answer to say that I would like to, we have a

631

: 01:06:33

lot of problems at the moment in the world, but I would like to solve communication

between people.

632

: 01:06:39

And I think this is a very m

633

: 01:06:40

in a way also a very Bayesian problem because you have the receiver has a latent model of

the world, their understanding and you as a communicator you should be understanding you

634

: 01:06:52

should be kind of able to assess what's that model and then fit your communication to that

also so perhaps yeah communication misunderstandings understanding other people

635

: 01:07:10

um I wouldn't shoot anything lower than that with unlimited resources.

636

: 01:07:17

For sure.

637

: 01:07:18

Yeah, No, I love that.

638

: 01:07:20

Love that.

639

: 01:07:20

Yeah.

640

: 01:07:21

And for sure it's related to priors and...

641

: 01:07:24

how to elicitate priors from people, yeah all of that related to basically...

642

: 01:07:29

uh update the beliefs of recipient.

643

: 01:07:35

Yeah, related to the Socratic methods in a way for sure.

644

: 01:07:41

Street epistemology and all that good stuff.

645

: 01:07:43

646

: 01:07:45

Love that, love that Tim.

647

: 01:07:47

uh And second question, if you could have dinner with any great scientific mind, dead,

alive or fictional, who would it be?

648

: 01:07:54

ah It would be dead.

649

: 01:07:57

ah So, I have a background in mathematics and especially in the 19th century and before

that, mathematicians had this bad habit of dying very young.

650

: 01:08:11

So...

651

: 01:08:12

So I would pick, I went with this in mind and I found Gotthold Eisenstein who was a

mathematician in the 19th century, a German mathematician who worked on analysis and

652

: 01:08:27

number theory and actually kind of what he managed to do before meeting his untimely death

at the age of 29 was he solved issues that then allowed Gauss.

653

: 01:08:41

to further his research.

654

: 01:08:46

I would, and also was a very interesting person, spent a time in prison for some political

opinions and things like this in Germany.

655

: 01:08:57

then having a dinner and trying to maybe obtain some knowledge from a person who would

probably have had a lot more to give to the world.

656

: 01:09:11

I love that, yeah, yeah.

657

: 01:09:13

Great.

658

: 01:09:14

I love that.

659

: 01:09:15

And you're the first one to answer that.

660

: 01:09:17

So congrats, Tim.

661

: 01:09:20

Thank you.

662

: 01:09:23

Yeah, I also thought that it can't be some very obvious scientific mind.

663

: 01:09:29

Must be some slightly niche.

664

: 01:09:32

It can be, can be, that's fine.

665

: 01:09:34

is judging you.

666

: 01:09:37

Awesome, well, thank you so much Teemu.

667

: 01:09:39

I'm gonna let you go to sleep because it's late for you in Finland.

668

: 01:09:43

You've been kind enough to stay up for me to accommodate my American schedule.

669

: 01:09:51

thank you so much.

670

: 01:09:52

problem, it was a pleasure.

671

: 01:09:54

Sorry again.

672

: 01:09:56

to you and all of you folks for the construction noises that you must have heard from time

to time.

673

: 01:10:02

They seem to have stopped now, but yeah, like you know.

674

: 01:10:06

As uh Epictetus said, this is

675

: 01:10:10

a thing in life I cannot control.

676

: 01:10:13

I tried to keep my calm.

677

: 01:10:16

That was not easy but I kept my calm through the construction noises so I'm happy with

that.

678

: 01:10:22

um And I hope you still could enjoy the episode.

679

: 01:10:26

Thankfully Temu...

680

: 01:10:27

681

: 01:10:28

was the one with many more things to say than myself so not too much construction noises

at your head.

682

: 01:10:36

uh As usual, I'll put resources and links to your website and socials in the show notes.

683

: 01:10:45

Teemu, feel free to add anything in there also if you think I missed some.

684

: 01:10:51

And thanks again for taking the time and being on this show.

685

: 01:10:56

Thank you.

686

: 01:11:01

This has been another episode of Learning Bayesian Statistics.

687

: 01:11:04

Be sure to rate, review, and follow the show on your favorite podcatcher, and visit

learnbaystats.com for more resources about today's topics, as well as access to more

688

: 01:11:15

episodes to help you reach true Bayesian state of mind.

689

: 01:11:19

That's learnbaystats.com.

690

: 01:11:21

Our theme music is Good Bayesian by Baba Brinkman, fit MC Lass and Meghiraam.

691

: 01:11:26

Check out his awesome work at bababrinkman.com.

692

: 01:11:29

I'm your host.

693

: 01:11:30

Alex and Dora.

694

: 01:11:31

can follow me on Twitter at Alex underscore and Dora like the country.

695

: 01:11:35

You can support the show and unlock exclusive benefits by visiting Patreon.com slash

LearnBasedDance.

696

: 01:11:43

Thank you so much for listening and for your support.

697

: 01:11:45

You're truly a good Bayesian.

698

: 01:11:47

Change your predictions after taking it from

699

: 01:11:50

And if you're thinking I'll be less than amazing Let's adjust those expectations Let me

show you how to be a good base here Change calculations after taking fresh data in Those

700

: 01:12:04

predictions that your brain is making Let's get them on a solid foundation

More Episodes

135. #135 Bayesian Calibration and Model Checking, with Teemu Säilynoja