#112 Advanced Bayesian Regression, with Tomi Capretto

Proudly sponsored by PyMC Labs, the Bayesian Consultancy. Book a call, or get in touch!

Our theme music is « Good Bayesian », by Baba Brinkman (feat MC Lars and Mega Ran). Check out his awesome work!

Visit our Patreon page to unlock exclusive Bayesian swag ;)

Takeaways:

Teaching Bayesian Concepts Using M&Ms: Tomi Capretto uses an engaging classroom exercise involving M&Ms to teach Bayesian statistics, making abstract concepts tangible and intuitive for students.
Practical Applications of Bayesian Methods: Discussion on the real-world application of Bayesian methods in projects at PyMC Labs and in university settings, emphasizing the practical impact and accessibility of Bayesian statistics.
Contributions to Open-Source Software: Tomi’s involvement in developing Bambi and other open-source tools demonstrates the importance of community contributions to advancing statistical software.
Challenges in Statistical Education: Tomi talks about the challenges and rewards of teaching complex statistical concepts to students who are accustomed to frequentist approaches, highlighting the shift to thinking probabilistically in Bayesian frameworks.
Future of Bayesian Tools: The discussion also touches on the future enhancements for Bambi and PyMC, aiming to make these tools more robust and user-friendly for a wider audience, including those who are not professional statisticians.

Chapters:

05:36 Tomi's Work and Teaching

10:28 Teaching Complex Statistical Concepts with Practical Exercises

23:17 Making Bayesian Modeling Accessible in Python

38:46 Advanced Regression with Bambi

41:14 The Power of Linear Regression

42:45 Exploring Advanced Regression Techniques

44:11 Regression Models and Dot Products

45:37 Advanced Concepts in Regression

46:36 Diagnosing and Handling Overdispersion

47:35 Parameter Identifiability and Overparameterization

50:29 Visualizations and Course Highlights

51:30 Exploring Niche and Advanced Concepts

56:56 The Power of Zero-Sum Normal

59:59 The Value of Exercises and Community

01:01:56 Optimizing Computation with Sparse Matrices

01:13:37 Avoiding MCMC and Exploring Alternatives

01:18:27 Making Connections Between Different Models

Thank you to my Patrons for making this episode possible!

Yusuke Saito, Avi Bryant, Ero Carrera, Giuliano Cruz, Tim Gasser, James Wade, Tradd Salvo, William Benton, James Ahloy, Robin Taylor,, Chad Scherrer, Zwelithini Tunyiswa, Bertrand Wilden, James Thompson, Stephen Oates, Gian Luca Di Tanna, Jack Wells, Matthew Maldonado, Ian Costley, Ally Salim, Larry Gill, Ian Moran, Paul Oreto, Colin Caprani, Colin Carroll, Nathaniel Burbank, Michael Osthege, Rémi Louf, Clive Edelsten, Henri Wallen, Hugo Botha, Vinh Nguyen, Marcin Elantkowski, Adam C. Smith, Will Kurt, Andrew Moskowitz, Hector Munoz, Marco Gorelli, Simon Kessell, Bradley Rode, Patrick Kelley, Rick Anderson, Casper de Bruin, Philippe Labonde, Michael Hankin, Cameron Smith, Tomáš Frýda, Ryan Wesslen, Andreas Netti, Riley King, Yoshiyuki Hamajima, Sven De Maeyer, Michael DeCrescenzo, Fergal M, Mason Yahr, Naoya Kanai, Steven Rowland, Aubrey Clayton, Jeannine Sue, Omri Har Shemesh, Scott Anthony Robson, Robert Yolken, Or Duek, Pavel Dusek, Paul Cox, Andreas Kröpelin, Raphaël R, Nicolas Rode, Gabriel Stechschulte, Arkady, Kurt TeKolste, Gergely Juhasz, Marcus Nölke, Maggi Mackintosh, Grant Pezzolesi, Avram Aelony, Joshua Meehl, Javier Sabio, Kristian Higgins, Alex Jones, Gregorio Aguilar, Matt Rosinski, Bart Trudeau, Luis Fonseca, Dante Gates, Matt Niccolls, Maksim Kuznecov, Michael Thomas, Luke Gorrie, Cory Kiser, Julio, Edvin Saveljev, Frederick Ayala, Jeffrey Powell, Gal Kampel, Adan Romero, Will Geary, Blake Walters, Jonathan Morgan and Francesco Madrisotti.

Links from the show:

Tomi’s website: https://tomicapretto.com/
Tomi on GitHub: https://github.com/tomicapretto
Tomi on Linkedin: https://www.linkedin.com/in/tom%C3%A1s-capretto-a89873106/
Tomi on Twitter: https://x.com/caprettotomas
Advanced Regression course (get 10% off if you’re a Patron of the show): https://www.intuitivebayes.com/advanced-regression
Bambi: https://bambinos.github.io/bambi/
LBS #35 The Past, Present & Future of BRMS, with Paul Bürkner: https://learnbayesstats.com/episode/35-past-present-future-brms-paul-burkner/
LBS #1 Bayes, open-source and bioinformatics, with Osvaldo Martin: https://learnbayesstats.com/episode/1-bayes-open-source-and-bioinformatics-with-osvaldo-martin/
patsy - Describing statistical models in Python: https://patsy.readthedocs.io/en/latest/
formulae - Formulas for mixed-models in Python: https://bambinos.github.io/formulae/
Introducing Bayesian Analysis With m&m's®: An Active-Learning Exercise for Undergraduates: https://www.tandfonline.com/doi/full/10.1080/10691898.2019.1604106
Richly Parameterized Linear Models Additive, Time Series, and Spatial Models Using Random Effects https://www.routledge.com/Richly-Parameterized-Linear-Models-Additive-Time-Series-and-Spatial-Models-Using-Random-Effects/Hodges/p/book/9780367533731
Dan Simpson’s Blog (link to blogs with the ‘sparse matrices’ tag): https://dansblog.netlify.app/#category=Sparse%20matrices
Repository for Sparse Matrix-Vector dot product: https://github.com/tomicapretto/dot_tests

Transcript

This is an automatic transcript and may therefore contain errors. Please get in touch if you're willing to correct them.

Speaker: 00:00:04

Today I am thrilled to host my friend Tommy Capretto, a multifaceted data scientist from

PMC Labs, a dedicated statistics educator at Universidad Nacional de Rosario, and an avid

2

: 00:00:18

contributor to the open source software community, especially known for his work on Bambi.

3

: 00:00:24

In our conversation, Tommy shares insights from his dual role as an industry practitioner

and an academic,

4

: 00:00:32

We dive deep into the practicalities and pedagogical approaches of teaching complex

statistical concepts, making them accessible and engaging.

5

: 00:00:41

We also explored Tommy's contributions to BEMBEE, which he describes as BRMS for Python.

6

: 00:00:49

And indeed, it is a Python library designed to make patient modeling more approachable for

beginners and non -experts.

7

: 00:00:57

This discussion leads us into the heart of our newly launched course,

8

: 00:01:01

Advanced Regression with Bambi and Pimc, where Tommy, Ravin Kumar and myself unpack the

essentials of regression models, tackle the challenges of parameter identifiability and

9

: 00:01:13

overparameterization, and address overdispersion and the new zero -sum normal

distribution.

10

: 00:01:20

So whether you're a student, a professional, or just a curious mind, I'm sure this episode

is packed with insights that will enrich your understanding.

11

: 00:01:30

statistical world.

12

: 00:01:31

This is Learn Invasion Statistics, episode 112, recorded June 24, 2024.

13

: 00:01:40

Welcome Bayesian Statistics, a podcast about Bayesian inference, the methods, the

projects, and the people who make it possible.

14

: 00:01:48

I'm your host, Alex Andorra.

15

: 00:01:51

You can follow me on Twitter at alex -underscore

16

: 00:02:10

like the country.

17

: 00:02:11

For any info about the show, learnbasedats .com is Laplace to be.

18

: 00:02:15

Show notes, becoming a corporate sponsor, unlocking Beijing Merch, supporting the show on

Patreon, everything is in there.

19

: 00:02:22

That's learnbasedats .com.

20

: 00:02:24

If you're interested in one -on -one mentorship, online courses, or statistical

consulting, feel free to reach out and book a call at topmate .io slash alex underscore

21

: 00:02:34

and dora.

22

: 00:02:35

See you around, folks, and best Beijing wishes to you

23

: 00:02:42

Hello, my dear Vagans!

24

: 00:02:44

A quick note before today's episode.

25

: 00:02:47

STANCON 2024 is approaching!

26

: 00:02:50

It's in Oxford, UK this year from September 9 to 13, and it's shaping up to be an

incredible event for anybody interested in statistical modeling and Vagans in France.

27

: 00:03:02

Actually, we're currently looking for sponsors to help us offer more scholarships and make

STANCON more accessible to everyone.

28

: 00:03:10

And we

29

: 00:03:11

encourage you to buy your tickets as soon as possible.

30

: 00:03:15

Not only will this help with making a better conference, but this will also support our

scholarship fund.

31

: 00:03:23

For more details on tickets, sponsorships or community involvement, you'll find the

Stencon website in the show notes or counting on you.

32

: 00:03:32

OK, on to the show

33

: 00:03:39

Mi capretto, bienvenido a Learning Basics Statistics.

34

: 00:03:44

Hello Alex, muchas gracias.

35

: 00:03:47

Thank you.

36

: 00:03:47

Yeah, thanks a lot for taking the time.

37

: 00:03:50

That's actually a bit weird to talk to you in Spanish in English now because we're still

talking Spanish.

38

: 00:03:56

Yeah.

39

: 00:03:57

for the benefit of the world, we're gonna do that in English.

40

: 00:04:03

So it's awesome.

41

: 00:04:04

I'm really happy to have you on the show because with

42

: 00:04:09

You started as a colleague with the gears now.

43

: 00:04:14

You're definitely a friend, or at least I consider you a friend.

44

: 00:04:19

I will tell you after the recording if I consider you friend, depending on how it goes.

45

: 00:04:25

That's smart move, smart move.

46

: 00:04:27

I've lost quite a few friends because of my editing skills.

47

: 00:04:33

Yeah, so I mean, it's a long

48

: 00:04:37

interview I've had a lot of people who say you should have Tomica Preto on the show and I

always answered yeah I'll come to the show very soon don't worry we're finishing working

49

: 00:04:50

on a project right now together so well I'll invite him at that point so that he can talk

about the project and you guys maybe know what the project is about but you'll see at

50

: 00:05:07

at the middle of the episode, more or less, people.

51

: 00:05:09

But I mean, if you listen to the show regularly, you know which project I'm talking about.

52

: 00:05:16

But first, Tommy, we'll talk a bit about you.

53

: 00:05:21

Yeah, basically, can you tell people what you're doing nowadays?

54

: 00:05:28

You know, and yeah, like, what do your days look like?

55

: 00:05:36

So I'm doing quite a lot of things regularly.

56

: 00:05:42

Mainly, I work at Pimes Loves with a great team.

57

: 00:05:48

We work in very interesting projects doing basin stats.

58

: 00:05:54

I have the pleasure to be working with the people making the tool that I love.

59

: 00:06:02

That's amazing.

60

: 00:06:05

It's also great, I don't know, when we are working on a project, we realize time -seen is

to be able to do something or there's something broken.

61

: 00:06:16

We are not wondering, is this going to be fixed at some point in time?

62

: 00:06:22

Are the developers working on it?

63

: 00:06:24

We can just go and change the things.

64

: 00:06:29

Well, we have to be responsible because otherwise the community will hate us

65

: 00:06:34

changing the things all the time, but I definitely really like it.

66

: 00:06:40

So I work at Pimesy Labs, it's my main job.

67

: 00:06:45

I've been at Labs for around three years, I think.

68

: 00:06:51

In parallel, I also teach in university here in Argentina.

69

: 00:06:56

I live in Rosario, Argentina, which is like the third largest city.

70

: 00:07:04

in the country.

71

: 00:07:04

After, so far, these nerds don't know Argentina.

72

: 00:07:09

We'll see if I know Argentina in well enough after Buenos Aires, of course, and Córdoba.

73

: 00:07:15

Yeah, I think that's the correct order.

74

: 00:07:18

And of course, for the football fans, the city of Angel Di Maria and Lyon ABC, of course.

75

: 00:07:27

Yeah, correct.

76

: 00:07:29

And for some niche fans of football,

77

: 00:07:34

Like if you are from the UK or from some very particular area of Spain Also Marcelo

Gielsa, which is a coach A very particular coach He is also from I didn't know he was from

78

: 00:07:45

Rosario too, ok Yeah, yeah, yeah, we have very particular characters in the city Yeah, now

I understand why he's called El Loco Ok, ok Yeah, yeah That's how we call Tommy inside

79

: 00:07:58

Pimcy Labs You are not supposed to tell that to people yeah, right, ooh, I'm sorry

80

: 00:08:04

I'm not supposed to say a lie to you.

81

: 00:08:06

On the show I can't.

82

: 00:08:10

Yeah, and so yeah, I live here in Rosario.

83

: 00:08:13

In Rosario, I don't know why I'm telling that in English.

84

: 00:08:18

I teach in our national university.

85

: 00:08:21

There's a program in statistics, which is a program where I studied.

86

: 00:08:28

Now I'm teaching also based in statistics.

87

: 00:08:31

There's a whole course

88

: 00:08:33

dedicated to Basin Statistics in the final year of the career.

89

: 00:08:38

It's a new course.

90

: 00:08:40

It started in 2023.

91

: 00:08:42

That was the first edition.

92

: 00:08:44

Now we are finishing the second edition.

93

: 00:08:49

The students hate and love us at the same time because we make them work a lot, but at the

end of the day they learn or at least that's what they say.

94

: 00:09:02

what we find in the things that they present.

95

: 00:09:07

So yeah, those are my two main activities today.

96

: 00:09:10

I'm also an open source developer contributing mainly to Bambi, Pimc, Arvies.

97

: 00:09:19

Sometimes I am creating a random repository to play with something or some educational

tool.

98

: 00:09:30

Yeah.

99

: 00:09:30

And from time to time I teach courses.

100

: 00:09:33

I've just finalized teaching a Python course.

101

: 00:09:40

But yeah, it's like a mixture between statistics, computers, basic statistics, Python,

also R, which was my first language.

102

: 00:09:53

And yeah, that's the world we're living in.

103

: 00:09:57

Yeah, yeah, definitely.

104

: 00:09:59

You do a lot of things for sure.

105

: 00:10:01

Yeah, I think we can go in different directions, but I'm actually curious if you can talk

about...

106

: 00:10:10

I know you have an exercise in your class where you teach patient base and stance, and you

introduce them with &Ms.

107

: 00:10:19

yes!

108

: 00:10:20

Can you talk a bit about that exercise on the show?

109

: 00:10:23

I think it will be interesting for our listeners.

110

: 00:10:26

yeah, yeah, definitely.

111

: 00:10:28

To be completely honest and fair, is not our idea.

112

: 00:10:33

I mean, it's an idea that was actually published on a paper.

113

: 00:10:39

I don't remember the name of the paper, but I'm gonna find it.

114

: 00:10:42

I have it.

115

: 00:10:43

And I'm gonna give you the real source of the game.

116

: 00:10:52

But we have adapted that.

117

: 00:10:54

Basically, the first day you enter

118

: 00:10:58

a base in classroom, the teachers present you a problem saying, hey, something happened

with MMMs.

119

: 00:11:07

In our case, we used the local version, which are called Rocklets.

120

: 00:11:13

It's basically the same.

121

: 00:11:14

It's chocolate, different colors.

122

: 00:11:16

And we tell them, hey, the owner of the factory suspects that there's something happening

with the machine that creates

123

: 00:11:29

the MMMs of a particular color and you need to figure out what's happening.

124

: 00:11:36

And so we give them, so we divide the students in groups, we give them a bag to the

different groups and they have to open the bag, they have to count the number of pieces

125

: 00:11:50

that they have of the different colors.

126

: 00:11:53

At that point, the students realize that what they care about is whether it

127

: 00:11:57

that particular color or not and the idea is to start thinking like in a statistical plus

basin way like what is the quantity we are trying to estimate or what is the quantity that

128

: 00:12:17

will tell us the answer and then you say okay we are talking about a proportion all right

and do we know anything about that proportion?

129

: 00:12:27

Well, it's a proportion.

130

: 00:12:28

It can be between 0 and 1.

131

: 00:12:30

It's a continuous quantity.

132

: 00:12:32

And then, okay, we are going to work manually, so let's discretize that proportion.

133

: 00:12:39

And we have 11 values from 0 to 1.

134

: 00:12:44

And then, okay, what else do we know about that proportion?

135

: 00:12:50

Are all the values equally likely?

136

: 00:12:54

And you can notice that we are starting to build a prior.

137

: 00:12:58

And students are like, no, we have five colors.

138

: 00:13:04

The probability of this color being present 80 % of the time is not the same as the

probability of this color being present 20 % of the time, for example.

139

: 00:13:15

And so we start like in a very manual way to build a probability distribution, which is

the prior for the proportion of items that are of that

140

: 00:13:28

And then we say, okay, what's the kind of the data that we are collecting?

141

: 00:13:36

And we end up saying, okay, this is a binomial experiment.

142

: 00:13:39

And we talk about the different assumptions, independence, constant probability.

143

: 00:13:45

And then, okay, how can we combine this information together?

144

: 00:13:48

And we naturally talk about the Bayesian theorem.

145

: 00:13:56

And yeah, we do all the math by hand with very simple numbers, but in a very intuitive way

with a problem that is interesting for students because they know those chocolates, they

146

: 00:14:17

can feel it makes sense to put what they know about the problem into a probability

distribution.

147

: 00:14:24

because they know that they know something about the problem.

148

: 00:14:27

And doing some very simple math using probability rules that they already know, we can

arrive a solution in a basic way.

149

: 00:14:37

And the end of that lesson is, okay, everything we did so far is what we are going to do

in this course.

150

: 00:14:46

Like we are going to learn more about this approach to do statistics.

151

: 00:14:52

And yeah.

152

: 00:14:53

In the very end, they can eat the data, basically.

153

: 00:14:58

And that's really interesting.

154

: 00:15:02

In the very first edition, we used Rocklets, which are like &M's.

155

: 00:15:07

And in the second edition, we used Gummy Bears.

156

: 00:15:12

But the logic was more or less the same, but we changed the product.

157

: 00:15:17

And I don't know what you're going to do in the next edition, but it will have some

158

: 00:15:23

involved.

159

: 00:15:25

It's definitely very interesting and I'm fascinated by these approaches to introduce stats

to people which are more intuitive.

160

: 00:15:40

The student is involved in the problem from the very beginning.

161

: 00:15:44

You don't start with a list of 10 abstract concepts

162

: 00:15:52

Perhaps they know how to follow, but it's less attractive.

163

: 00:15:55

So yeah, we do that and I really like that approach.

164

: 00:15:59

Yeah, yeah.

165

: 00:16:01

I mean, that's definitely super fun.

166

: 00:16:02

That's why I want you to do that on the show.

167

: 00:16:06

think it's a great way to stance and we'll definitely add that to the show notes as you

were saying.

168

: 00:16:14

And for next year, well, I think you definitely should do that with Alpha Chores.

169

: 00:16:22

Let's see if we have the budget to do that.

170

: 00:16:24

Yeah, it's gonna be a bit more budget, yeah for sure.

171

: 00:16:27

mean, the best would be with empanadas, but that should be not very...

172

: 00:16:32

that shouldn't be very easy to do, you know, like the empanada can break.

173

: 00:16:35

Nah, it's gonna be a whole mess.

174

: 00:16:37

you know...

175

: 00:16:38

Yeah, I know, and that usually happens like early in the morning, so the students will be

like, what are we doing here?

176

: 00:16:49

Yeah, it's

177

: 00:16:51

It's a nice confusion because it creates a nice, an impact.

178

: 00:16:55

Like they enter the classroom and instead of having people saying, Hey, this is my name.

179

: 00:17:01

we are going to work on that.

180

: 00:17:02

It's like, Hey, you have this problem.

181

: 00:17:05

Take some gummy bears.

182

: 00:17:07

And they're like, what?

183

: 00:17:08

What's happening?

184

: 00:17:09

So that's, it's attractive.

185

: 00:17:12

Yeah.

186

: 00:17:12

Yeah.

187

: 00:17:12

No, for sure.

188

: 00:17:14

most of your students are like, do they already know about stance?

189

: 00:17:20

Yes.

190

: 00:17:20

you're teaching them the Beijing way?

191

: 00:17:23

Yeah, yeah, so at that point...

192

: 00:17:25

What's their most, you know, what's the most confusing part to them?

193

: 00:17:30

How do they react to that new framework?

194

: 00:17:36

I would say in general, we had good experiences, especially at the end of the journey.

195

: 00:17:44

But in the very beginning, so when they start the course...

196

: 00:17:49

They already have like 20 courses, let's say 15 because other courses are focused on

mathematics or programming, but they already have like 15 courses about statistics, but

197

: 00:18:04

they are all about the non -basin approach.

198

: 00:18:09

So frequentist approach.

199

: 00:18:11

They know a lot about maximum likelihood estimation and all the properties.

200

: 00:18:19

At that point, they already spent hours writing mathematical formulas and demonstrating

results and all that.

201

: 00:18:30

But they are very new to Bayesian statistics, because all they know about Bayes is Bayes

rules.

202

: 00:18:38

That's the only thing they know.

203

: 00:18:40

And they also know there's an estimation method called the Bayesian method, but

204

: 00:18:49

they are not using that at that point.

205

: 00:18:52

And one thing that there may be other things, but one thing that takes some time for them

to adapt is, okay, parameters are not fixed anymore.

206

: 00:19:09

And I put a probability distribution on top of that because in all the courses they took

before our course,

207

: 00:19:17

there's a lot of emphasis on how to interpret confidence intervals, p -values and

classical statistics.

208

: 00:19:25

At that point, they are not the typical student that is confused about interpreting

confidence intervals, p -values and frequency stats because they practice that a lot.

209

: 00:19:39

But then it's hard for them to switch from parameters are fixed

210

: 00:19:47

our interval either contains the parameter or not, but we don't know it, to, parameters

are random quantities and we put probability distributions on top of them.

211

: 00:20:00

So there's a cost there, which is not huge.

212

: 00:20:05

And what was really nice for us, Monte Carlo is something that really helped us from very

early we start

213

: 00:20:15

computing quantities of interest with Monte Carlo, when they realize the power in that

approach, they're like, I really like this.

214

: 00:20:27

Because I have a probability distribution and I'm interested in this particular

probability, or I'm interested in a probability involving two random variables, or in many

215

: 00:20:40

things.

216

: 00:20:40

Once they discover how powerful that approach

217

: 00:20:46

They're like, this is really nice.

218

: 00:20:51

But yeah, it's a challenge, but I really like it.

219

: 00:20:56

And I think at the end of the day, they also like it and they see the power in the

approach.

220

: 00:21:03

In fact, I have a student that's right now working on a Google Summer of Code project with

Bambi.

221

: 00:21:12

So it's based in stats.

222

: 00:21:13

And it seems I'm going to have another student working on a hierarchical model for his

223

: 00:21:20

So yeah, it's really nice.

224

: 00:21:25

Nice, yeah, yeah, for sure.

225

: 00:21:27

Who is the...

226

: 00:21:29

So I know also, I think if I remember correctly, there is...

227

: 00:21:33

So you know Gabriel, who works on BEMI.

228

: 00:21:37

I don't remember his last name right now, do you?

229

: 00:21:39

It's hard, it's Gabriel Stech -Schulte.

230

: 00:21:43

I don't know...

231

: 00:21:44

yes, something like that.

232

: 00:21:45

So sorry, Gabriel.

233

: 00:21:47

But Gabriel is also a patron of the show.

234

: 00:21:49

of Learn Based Stats, so he's really in the Bayesian state of mind.

235

: 00:21:54

Thank you so much Gabriel for all the support to Learn Based Stats, but also, and even

more importantly, the work you do on Bambi.

236

: 00:22:03

I know you've helped me a few months ago on a PR for HSGP, where I was testing Bambi's

HSGP capabilities to the limit.

237

: 00:22:17

Thank you so much, Gabriel and Tony, of course, for developing Bambi all the time and

pushing the boundaries on that.

238

: 00:22:27

I know Gabriel.

239

: 00:22:29

So he was working in the industry and now he's back to academia, but in a more research

role.

240

: 00:22:34

And sorry, Gabriel, I don't remember all the details about this, but I do remember he was

doing something very cool, applying Basin stats.

241

: 00:22:42

So I'm like nudging

242

: 00:22:46

publicly to someday tell the world about what he does.

243

: 00:22:51

Because I remember being like, this is quite interesting.

244

: 00:22:57

So yeah.

245

: 00:22:57

Definitely.

246

: 00:22:58

Yeah, for sure.

247

: 00:23:00

Yeah.

248

: 00:23:01

Actually, let's talk about Bambi.

249

: 00:23:04

I think it's going to be very interesting to listeners.

250

: 00:23:06

So yeah, can you tell us what Bambi is about basically and why would people

251

: 00:23:17

use it.

252

: 00:23:18

The way I usually do that is at least people know or I tell them it's like BRMS in Python.

253

: 00:23:26

If you're interested in BRMS and don't know what that is, I think it's episode 35 with

Paul Berkner, he was on the show, I put that in the show notes.

254

: 00:23:35

But if you want now Tommy's definition of Bambi, so one of the main core devs of Bambi,

well here it is folks.

255

: 00:23:45

To be honest, your definition was already really good because it's one of the definitions

I usually give when I know the other party knows about VRMS.

256

: 00:24:01

basically, if you don't know R, I can tell you like in 30 seconds, R has a very particular

syntax to specify regression models.

257

: 00:24:16

where you basically say, okay, this is my outcome variable, use a symbol, which is a

tilde, and you say, these are my predictors.

258

: 00:24:24

And you pass that to a function together with a data frame, which is a very convenient

structure.

259

: 00:24:30

And that function knows how to map the names of the predictors to parameters and variables

in the model.

260

: 00:24:43

It knows how to take a model formula

261

: 00:24:47

and a data frame and some other information that's not always needed, and it constructs a

model with that information.

262

: 00:24:58

So that's like very built in into R.

263

: 00:25:03

Like if you go back to, I think to the S language, the formula syntax already existed.

264

: 00:25:10

Then the R language has the formula syntax in the base packages.

265

: 00:25:15

And a lot of packages built by people in R use the formula syntax to specify regression

models.

266

: 00:25:26

And a lot of people also extended the formula syntax to account for other things, like one

extension that we incorporated in Bambi is the syntax to have what in frequency stats you

267

: 00:25:43

call random effects.

268

: 00:25:47

that appeared I think the first time in the LME4 package which is a very popular package

in R to work with mixed effects model which is another name for hierarchical models it's

269

: 00:26:02

crazy how many names you have for that so basically in R you have this formula syntax and

this very short way of writing a statistical model

270

: 00:26:12

and lot of people created a lot of packages to have a larger variety of models.

271

: 00:26:20

Then go to Python.

272

: 00:26:20

Let's go to Python.

273

: 00:26:22

Python is a more general programming language.

274

: 00:26:26

It has great support for statistics, machine learning, basic stats, and all that.

275

: 00:26:32

But you don't have something like a model formula built in the language.

276

: 00:26:41

I think one of the very first attempts to build that, which was extremely successful, it's

Patsy, which is a library developed by...

277

: 00:26:54

I don't remember the name of the guy, sorry.

278

: 00:26:57

I think it's Nathaniel, but I don't remember the last name.

279

: 00:27:01

But that's like...

280

: 00:27:05

As far as I know, the first package and the largest package that brought the model

formulas to Python, and then other libraries started to build on top of that Patsy

281

: 00:27:16

library.

282

: 00:27:17

For example, stats models.

283

: 00:27:20

And stats models allows you not to copy and paste your R code, but basically to say, this

is in R how I will create a linear regression model.

284

: 00:27:32

Okay, in Python, what do I need to

285

: 00:27:35

Okay, I need a pandas data frame, model formula that it passed in a string and it works

the same way.

286

: 00:27:42

And so as it happened in R with people creating packages to extend those capabilities, the

same happened in Python.

287

: 00:27:54

Like you have stats models, which is very popular, but there are also many other

libraries.

288

: 00:28:00

And one of those libraries is Bambi, which extends

289

: 00:28:03

the model formula and uses the model formula in a basin context.

290

: 00:28:09

BAMB is stands for basin model building interface.

291

: 00:28:15

It uses a model formula and a syntax very similar to the syntax that you find in R to

create basin models.

292

: 00:28:28

I think what's great about it is that you're not only creating the model, but you also

have lot of functionalities to work with the model.

293

: 00:28:37

For example, obtain predictions, which is not trivial in many cases, or compute some

summary of interest, or help you to find prayers that are sensible for the problem that

294

: 00:28:52

you have.

295

: 00:28:53

And so yeah, I joined.

296

: 00:28:56

the Bambi project, I think it was in 2020 or 2021, while working with Osvaldo, he was my

director in Conicet, which is like a national institute for science and technology here in

297

: 00:29:16

Argentina.

298

: 00:29:19

yeah, and I really liked the interface and I saw many points that could be

299

: 00:29:26

improved, mainly that Bambi didn't support the syntax for random effects.

300

: 00:29:36

Actually, no Python library supported that because Patsy didn't support that.

301

: 00:29:43

And at that point in time, I was learning about programming languages and I was like,

well, maybe it's time to write a parser for model formulas.

302

: 00:29:54

And that's what I did.

303

: 00:29:55

And that was my first big contribution to Bambi.

304

: 00:30:06

And then we started to add, I don't know, more model families.

305

: 00:30:11

So Bambi now supports many more likelihood functions.

306

: 00:30:16

We started to add better default priors because the goal of these libraries is to allow

you to

307

: 00:30:24

a quick iteration.

308

: 00:30:26

It's not that we are rooting for, you should all use default priors and automatic priors.

309

: 00:30:32

No, please don't do that.

310

: 00:30:34

But if you want to have something quick and iterate quick, then that's not a bad idea.

311

: 00:30:41

Once you more or less have like a more refined idea of your model, then you can sit down

and say, okay, let's really think really.

312

: 00:30:53

about the priors.

313

: 00:30:55

So to summarize Bambi is a package built on top of PyMC.

314

: 00:31:01

I didn't mention that before.

315

: 00:31:05

That allows people to write, fit and work with base models in Python without having to

write a model in a probabilistic programming language.

316

: 00:31:21

There's a trade -off.

317

: 00:31:23

Like you can write a very complex model in two or three lines of code.

318

: 00:31:30

If you want full flexibility, you should use a PIMC.

319

: 00:31:37

And to conclude, said BAMBEE is the BRMS of Python.

320

: 00:31:46

We always take like BRMS as an inspiration and also as

321

: 00:31:53

Yeah, what we want to have in many cases because implementing Bambi, I learned a lot about

BRMS and how great it is actually because the complexities it can handle and the variety

322

: 00:32:10

of models and kind of things you can have in a model in BRMS is huge.

323

: 00:32:17

I mean, I'm not aware of any other interface like this that supports

324

: 00:32:22

as many things, base and non -base.

325

: 00:32:25

I mean, it's really amazing.

326

: 00:32:30

And yeah, we are always taking ideas from VRMS.

327

: 00:32:39

Yeah, Yeah, great, Samari.

328

: 00:32:42

Thanks to me.

329

: 00:32:43

And even like, brief history of Bambi, I love that.

330

: 00:32:46

So in the show notes, I added the link to

331

: 00:32:52

that you mentioned and also the link to the very first Learn Bay Stats episode which was

with Osvaldo Maldini.

332

: 00:33:03

So it was episode number one.

333

: 00:33:05

It's definitely a vintage one, people.

334

: 00:33:08

Feel free to...

335

: 00:33:09

I have a fun story about that.

336

: 00:33:11

yeah?

337

: 00:33:13

I don't know if I told you about this story but when Osvaldo recorded that...

338

: 00:33:17

think I know.

339

: 00:33:18

Yeah, you know, know.

340

: 00:33:19

When Osvaldo...

341

: 00:33:20

but I don't know if the public know

342

: 00:33:22

knows about that story.

343

: 00:33:24

So Osvaldo and I used to work like in the same building, not in the exact same office, but

his office was in front of my office.

344

: 00:33:35

So if he was talking to someone, I could listen.

345

: 00:33:38

Not very clearly, but I could realize he was talking.

346

: 00:33:43

And some random day I was in the office and I noticed that he was talking English, but

alone.

347

: 00:33:51

Like, not with another person.

348

: 00:33:53

And I said, what is he doing?

349

: 00:33:55

And then after that, he told me, yes, I was interviewed in a podcast that this other guy

who's been contributing to Arby's is starting.

350

: 00:34:05

And yeah, I think it's very cool.

351

: 00:34:07

I think it went very well.

352

: 00:34:10

And at that point in time, I didn't know you, but I knew there was a podcast guy and it

turns out that I witnessed

353

: 00:34:21

the first recording of Learned Basics Statistics, which is pretty fun.

354

: 00:34:26

And look where we are now.

355

: 00:34:30

Pretty interesting.

356

: 00:34:32

Yeah, this is really cool.

357

: 00:34:33

I love that story.

358

: 00:34:35

It was already all linked together.

359

: 00:34:38

I love that.

360

: 00:34:40

Yeah.

361

: 00:34:42

Yeah.

362

: 00:34:44

I really love Bendy for what you said, you just said, I think.

363

: 00:34:49

It's a great way to start and iterate very fast on the model.

364

: 00:34:56

And then if you validate the concept, then you can switch to PIMC and build the model

again, but then build on top of that.

365

: 00:35:06

And that's going to make all your modeling workflow way faster.

366

: 00:35:11

Yeah.

367

: 00:35:11

really love that.

368

: 00:35:12

Another thing also that's really good is for teaching, especially beginners,

369

: 00:35:17

that will abstract away a lot of the choices that need to be made in the model.

370

: 00:35:22

As you were saying, it's not necessarily what you want to do all the time, but at least to

start with, you know, it's like when you start learning a new sport.

371

: 00:35:32

Yes, there are tons of nuances to learn, but, you know, if you focus on one or two things,

you already have the Pareto effect.

372

: 00:35:42

Well, then Bambi allows you to do that, and I think that's extremely valuable.

373

: 00:35:48

Yeah, and another point I'm realizing I forgot to mention is that it lowers the the

entrance barrier.

374

: 00:35:56

Like, there are a lot of people who are not statisticians, but they do stats because they

have experiments or they have they are studying something and they have data and they have

375

: 00:36:09

some level of familiarity with some models and they know that that's the model they want

to fit.

376

: 00:36:15

But probably writing PIMC

377

: 00:36:17

and working with indexes and demons and quarts is too much and going to Stan and typing

everything is also too much and they don't work with R and they want some higher level

378

: 00:36:34

interface to work with, then Bambi is also what they use.

379

: 00:36:39

And yeah, I also really like that.

380

: 00:36:42

It makes basic stats

381

: 00:36:48

more welcoming for people that are not experts at writing code, which is completely fine.

382

: 00:36:57

Because a lot of people out there are trying to solve already difficult problems and

adding the extra complexity of being an expert in a PPL maybe too much.

383

: 00:37:09

So that's also another reason to have these interfaces.

384

: 00:37:13

Yeah, yeah, yeah.

385

: 00:37:14

I definitely completely agree.

386

: 00:37:18

I that's also...

387

: 00:37:21

So basically, if people are curious about Bambi and get started with that, I definitely

recommend taking a look at the Bambi's website that I put in the show notes.

388

: 00:37:35

also, well, probably then about our new course, Tommy, that's the project that was in the

notes.

389

: 00:37:43

So this is all I am happy to have you on the show here, please.

390

: 00:37:46

So the course is called Advanced Regression with Bambi and Pimc.

391

: 00:37:53

Precisely, it's on the intuitive -based website, so of course I put that in the show notes

for people who want to take a look at it.

392

: 00:38:00

If you're a patron of the show, have 10 % off.

393

: 00:38:03

This is the only discount that we do, so I hope you appreciate it.

394

: 00:38:10

That's how special you are.

395

: 00:38:13

Thank you so much, patrons.

396

: 00:38:16

And yeah, maybe Tommy tell us about, you know, the course and what it is about and for

whom in particular that would be.

397

: 00:38:29

We spent a lot of time on this course.

398

: 00:38:31

It took us two years to develop.

399

: 00:38:33

So, yeah, I'm super happy about it.

400

: 00:38:36

I'm also super happy that it's done.

401

: 00:38:41

But yeah, maybe give us the elevator pitch for the course who that before.

402

: 00:38:46

and why would people even care about it?

403

: 00:38:50

So the Advanced Regression Course is a very interesting course with a lot of material,

with a lot of very well thought material, which in all cases went through a lot of

404

: 00:39:07

reviews.

405

: 00:39:10

As the title says, it's a course about regression, but also as the title says,

406

: 00:39:16

it's an advanced regression course.

407

: 00:39:19

It doesn't mean it starts from the beginning being extremely advanced and it doesn't mean

it involves the craziest mathematical formulas that you're going to see in your life, but

408

: 00:39:32

it means it's the course you have to take if you want to give, sorry, if you want to take

that second or third step in your learning journey.

409

: 00:39:46

Like for example, if you took an introductory course like yours or another introductory

course and you feel that's not enough or you are open to learn more, you are eager to

410

: 00:39:59

learn more, then that's the course for you.

411

: 00:40:03

Of course, it has a base in approach and it uses a lot of Python, Bambi and Pimc.

412

: 00:40:15

Every time I talk about regression, I want to qualify something.

413

: 00:40:23

I remember a conversation I had with colleagues when I was just starting in a previous

job.

414

: 00:40:32

They were telling me they were taking a course about statistics, like those courses where

you have a ton of topics, but only very lightly colored.

415

: 00:40:44

And they were like, yeah, the first two units is regression.

416

: 00:40:47

And this is a lot.

417

: 00:40:48

And I was telling them, in university, I had six courses about regression.

418

: 00:40:54

It was not just two units in a course.

419

: 00:41:03

And that's because I think in many cases, people think that regression is something very

simple.

420

: 00:41:09

It's the linear regression that you learn in

421

: 00:41:14

basic statistics course, like you have a predictor and you have an outcome variable and

you have a predictor, then that's simple linear regression.

422

: 00:41:25

You have multiple predictors, you have multiple linear regression.

423

: 00:41:28

And that's it.

424

: 00:41:30

That's all linear regression gives you.

425

: 00:41:33

And all the rest are crazier things that fall under the machine learning umbrella.

426

: 00:41:40

But in the course, we see that that's

427

: 00:41:43

the whole story.

428

: 00:41:46

So many things are regressions or if you don't like the term maybe we can give you a

better term in the future but so many things are linear models which sounds pretty basic

429

: 00:42:02

right?

430

: 00:42:03

You say this is a linear model this is a linear equation it's like this is for dummies but

if you're curious take the course and and you will see

431

: 00:42:14

With linear models, you can do a lot of crazy things.

432

: 00:42:18

Of course, we start with simple linear regression and we do multiple linear regression.

433

: 00:42:23

But then very quickly, go to logistic regression, Poisson regression, we talk about

categorical regression, multinomial regression, when your outcome is categories and you

434

: 00:42:39

have multiple categories.

435

: 00:42:41

And then it goes crazy.

436

: 00:42:45

and we have zero inflation and we have overdispersion and we finalize the course talking

about hierarchical models in the context of regressions and it ends with a very

437

: 00:43:01

interesting model that you developed.

438

: 00:43:06

So the course is very complete, it starts

439

: 00:43:13

A few things that we assume people know but we like review them.

440

: 00:43:19

But then very soon we start covering new things.

441

: 00:43:25

I think in all cases we show how to do things with Bambi and how to do them with Pine T.

442

: 00:43:32

We have a lot of visualizations.

443

: 00:43:36

Our editor did an amazing job at editing the video so we also have animations and all

that.

444

: 00:43:43

Yeah, it's a product I'm proud of.

445

: 00:43:46

Yeah, it's nice.

446

: 00:43:50

Yeah, definitely.

447

: 00:43:52

There is so much that we've done, in this foreign territory.

448

: 00:43:56

Well, I learned so much because...

449

: 00:43:58

Me too.

450

: 00:44:00

Yeah, as you were saying, it sounds like what a regression is, just something from the

past.

451

: 00:44:07

But it's actually used all the time.

452

: 00:44:11

You know, even the big LMs now, in the end, it's a lot of dot products and dot products

are matrices multiplied with vectors and, you know, a linear regression is actually not

453

: 00:44:28

that far from that.

454

: 00:44:29

It's actually exactly that.

455

: 00:44:31

So if you learn and understand really the nitty gritty of hard regressions, complex

456

: 00:44:41

you already know a lot of things you're going to need to to need.

457

: 00:44:47

You're going to need to know when doing Bayesian modeling in the trenches.

458

: 00:44:53

That's, that's for sure.

459

: 00:44:54

And that's also why I learned so much in this course, because I had to really dig into the

regression models.

460

: 00:45:02

And, we show you how to do that from simple regression to binomial regression.

461

: 00:45:09

Poisson regression, stuff you guys obviously at least have heard about, but then we teach

you more niche and advanced concepts like zero inflated regressions, over dispersed

462

: 00:45:25

regression, which is one of the chapters you worked on, Tommy, and you folks are gonna

learn a lot on that, like not only how to do the models, but then what to do with the

463

: 00:45:36

models after.

464

: 00:45:37

how to diagnose them, how to become confident about the model's predictions.

465

: 00:45:44

And also we teach you about a personal favorite of mine, which is the categorical and

multinomial regressions, which I use a lot for electoral forecasting.

466

: 00:45:57

But also you're going to use them a lot, for instance, for any more than two categories,

you're going to use a multinomial or a categorical.

467

: 00:46:07

And that's just extremely important to know about them because they are not trivial.

468

: 00:46:13

There are lot of subtleties and difficulties and we show you how to handle that.

469

: 00:46:19

I that's personally, I learned so much.

470

: 00:46:22

Something I really loved is what you did in in the over dispersed lesson, you know, where

you were diagnosing the over dispersion and coming up with a bunch

471

: 00:46:36

custom plots to show that the model is under dispersed.

472

: 00:46:41

Yeah, that's a term.

473

: 00:46:43

Compared to the data.

474

: 00:46:44

And also then coming up with a test statistic, a custom test statistic to actually see

whether the model is under dispersed or not.

475

: 00:46:55

And I think that's really powerful because that shows you also that in the invasion

framework, I often get that question from beginners.

476

: 00:47:04

can I compute

477

: 00:47:05

test statistics, because that's a magic one in the fragrances framework.

478

: 00:47:09

I'm like, yeah, sure.

479

: 00:47:10

But you can also invent your own test statistics for your own purpose here.

480

: 00:47:14

You don't have to use a pre -baked test statistic.

481

: 00:47:18

You have posterior samples.

482

: 00:47:19

can do whatever you want with them.

483

: 00:47:24

I thought that was like, that's definitely one of my favorite parts of the course.

484

: 00:47:31

And something I realized we forgot to mention, and I really like,

485

: 00:47:35

about the course and I really like having that in the course is all the different parts

where we talk about parameter identifiability and overparameterization and it's like we

486

: 00:47:48

don't tell you, take this outcome, take these three predictors and put them into the

machine and you're good to go.

487

: 00:47:57

I think that's probably, that will be a difficult part the first time you encounter

488

: 00:48:04

in the course, but we cover it multiple times in multiple lessons.

489

: 00:48:11

And the reason is it's a very important topic that's covered in many places, but I think

with not enough emphasis.

490

: 00:48:22

So we did our best to include that topic in many lessons to show it from different angles,

show how it can happen under

491

: 00:48:34

synchro stances, and that's something I'm really proud about.

492

: 00:48:41

How much time and effort we invested in non -identifiability, parameter redundancy, and

all that.

493

: 00:48:48

And the different approaches to deal with that, that's something I'm proud of.

494

: 00:48:56

I'm very happy we did that.

495

: 00:49:01

Yeah, definitely.

496

: 00:49:02

That's a very good point.

497

: 00:49:04

I think I finally understand overparameterization by working on this course because we see

it from, I think from lesson two or three, up until the last lesson, which is lesson nine.

498

: 00:49:16

Yes.

499

: 00:49:17

And we see it repeatedly.

500

: 00:49:19

And I think that's really good because it's a hard concept that's related to an

unidentifiability.

501

: 00:49:26

That happens a lot in models, not only Bayesian models, all the, like any statistical

model, but it's

502

: 00:49:33

mathematical thing.

503

: 00:49:38

And then it appears all the time in models.

504

: 00:49:41

And that's related to an identifiability, but it's hard to understand.

505

: 00:49:44

So you have to repeat it and really, really understand what that means.

506

: 00:49:50

then only then you can develop an intuition of what that really is and when it happens.

507

: 00:49:56

So yeah, definitely that's, that's also something I personally learned a lot and enjoyed a

lot in this.

508

: 00:50:03

in building this course.

509

: 00:50:06

Yeah, me too.

510

: 00:50:08

What would you say is your favorite part of all the curriculum right now and also what is

the part that was much more complicated than you anticipated?

511

: 00:50:20

Good question.

512

: 00:50:29

I don't know if this is a favorite part, but something I really like about the course is

how many visualizations we created.

513

: 00:50:37

Like in every model, we always created a visualization to explore the posterior, to plot

predictions, to do things like that.

514

: 00:50:47

I really like when you create a model and you don't just show two numbers, you make a

beautiful thing to communicate what you found.

515

: 00:50:59

That's something I really like.

516

: 00:51:03

definitely, my favorite parts are the more advanced parts, like starting perhaps in lesson

five, lesson six, when we talk about categorical regression, multinomial regression, and

517

: 00:51:17

then everything that happens after that.

518

: 00:51:19

Because I think that every lesson has many things to learn.

519

: 00:51:26

So I couldn't say, okay, this

520

: 00:51:30

the part I enjoy the most because I enjoy all of them but definitely the second half and

something that was difficult actually while working on the lesson about over dispersion I

521

: 00:51:50

looked through a lot of books, papers and all that and it was not easy at all to

522

: 00:52:00

many references, examples, datasets, very well worked examples from end end.

523

: 00:52:15

Honestly, I thought I would find a lot more, many more resources, and it was not that

easy.

524

: 00:52:25

I read papers

525

: 00:52:28

from 50 years ago.

526

: 00:52:31

Those scanned papers, like written in machines.

527

: 00:52:36

Yeah, that was harder than what I anticipated.

528

: 00:52:41

Crafting that lesson required a lot of reading, not only for the complexity, but also to

find resources that helped me build the lesson.

529

: 00:52:56

Yeah, definitely that

530

: 00:52:58

challenging and unanticipated.

531

: 00:53:00

Yeah, that lesson was hard, for sure.

532

: 00:53:02

that was difficult one.

533

: 00:53:06

Yeah, I mean, for me, I think my favorite part was really, as I was saying, Not learning,

but really getting to another level of understanding of an identifiability and of

534

: 00:53:25

parameterization.

535

: 00:53:27

And also, the next level in my understanding of the zero -sum normal distribution.

536

: 00:53:35

Because I had to use it a lot in the whole lesson.

537

: 00:53:38

And so, I mean, in the lessons, in all the lessons I'm teaching in this course, so three

of them, I'm using zero -sum normal.

538

: 00:53:44

So I had a really deep, deep...

539

: 00:53:46

And actually, that's something that, yeah, the students have said from the beta version

that

540

: 00:53:58

Yeah, it's very interesting to see how you solve one of the unidentifiability that can

happen in models.

541

: 00:54:07

So like, for instance, with multinomial models, one of the probabilities, like the last

category's probability is entirely determined by the n minus one previous categories.

542

: 00:54:22

So that's basically what an overparameterization is.

543

: 00:54:26

If you put the parameter

544

: 00:54:28

the end categories, then your model is overparameterized because the last category is

entirely determined once you know about the end minus one, the previous end minus ones.

545

: 00:54:41

And so there are at least two ways to solve that as we show in the course.

546

: 00:54:46

One of the classic ones, and it's the one that automatically implemented in BAMBi is

reference encoding.

547

: 00:54:53

So you take one of the categories and you consider that

548

: 00:54:57

is the reference in O and you fix it to an arbitrary number.

549

: 00:55:03

So fix that parameter to an arbitrary number.

550

: 00:55:05

Usually it's zero.

551

: 00:55:08

And then all the other categories, these parameters are in reference to that category.

552

: 00:55:13

So you could do that, but you can also do, and that's what we show you also in the course,

you can also say, well, instead of fixing one category to zero, I'm going to fix the

553

: 00:55:26

other categories to zero.

554

: 00:55:28

And that way you can still have n parameters, one for each category, which is really cool

because that way you don't have to think about one category as a reference.

555

: 00:55:41

And you just use a zero for normal distribution instead of normal distribution.

556

: 00:55:46

And that distribution is going to make sure that the sum of the categories sum to zero.

557

: 00:55:52

So that will depend when you prefer one or the

558

: 00:55:56

But usually when you don't have a natural placebo, you will probably prefer the zero

-subnormal parameterization because then there is no obvious reference.

559

: 00:56:07

Whereas a placebo is an obvious reference, you probably want all the parameters in

reference to that category.

560

: 00:56:14

But the zero -subnormal is going to be in reference to the average of all the categories.

561

: 00:56:20

And you can actually model an average for all the categories

562

: 00:56:25

this parameterization and then all the categories will be an offset of that baseline.

563

: 00:56:32

So that was definitely something super interesting that helped me pass the level in my

understanding of the distribution in that course.

564

: 00:56:43

And definitely a lot of better testers appreciated it.

565

: 00:56:47

I guess you want to say something also, but that's only because you know the zero sum

novel quite well.

566

: 00:56:52

Yeah, yeah.

567

: 00:56:52

But something like

568

: 00:56:56

Something nice I wanna say about the zero -sum normal.

569

: 00:57:02

In PyMC, the Serious or Normal is implemented as a distribution, which I think it would be

better if we could say, okay, this is a normal distribution plus transformation or a

570

: 00:57:15

restriction.

571

: 00:57:17

But having something called Serious or Normal and being able to use that as problem as any

other PyMC distribution is very convenient because the user doesn't have to deal with all

572

: 00:57:31

the details.

573

: 00:57:32

to get that constraint.

574

: 00:57:34

While if in PyMC you wanna have like other encoding, like you wanna have reference level,

you have to do it in a very manual way.

575

: 00:57:47

You have to create a vector of normals with shape n minus one.

576

: 00:57:53

Then you have to concatenate a serial to that other vector.

577

: 00:57:57

And then you get a new vector and that's vector you use in your model.

578

: 00:58:03

And you end up having like a constant in your trace and then Arvis complains about not

being able to compute our hat, for example, because they are all zeros or all constant.

579

: 00:58:22

And the zeros on normal is also like more appealing for the general users.

580

: 00:58:28

They just replace normal with zeros on normal.

581

: 00:58:33

and you're good to go.

582

: 00:58:35

That doesn't mean we shouldn't think about what we're doing.

583

: 00:58:38

I'm just talking about from like user experience, it's much easier to use a

SerialSumNormal and also more intuitive in most of the cases.

584

: 00:58:51

But yeah, I think the summary and how this relates to the course is think about parameter

restrictions that you add to the model.

585

: 00:59:02

think about how that changes the meaning of the parameters and then be responsible with

what you do.

586

: 00:59:13

But know that there's not a single recipe for solving that kind of problems.

587

: 00:59:20

Yeah, yeah.

588

: 00:59:21

Yeah, and that's also why we have the whole community in intuitive ways and we have the

discourse that people can ask questions because unfortunately there is no...

589

: 00:59:31

one size fits all.

590

: 00:59:32

I mean, I say unfortunately, that's actually pretty cool because otherwise, I guess what

we're doing would be pretty boring.

591

: 00:59:43

Time is running by and I think we've covered that topic quite well.

592

: 00:59:47

I I could talk about regression quite a long time, but I think that's a good overview.

593

: 00:59:54

And of course, if people are interested in some of the topics we talked about here,

594

: 00:59:59

Let me know and I can do a special episode about some parts of Regressions that you're

interested in or you're really wondering about.

595

: 01:00:08

Or we can even do a modern webinar showing you some things, some answers to the most

frequently asked questions you have about Regressions.

596

: 01:00:22

for sure, let us know about that.

597

: 01:00:24

And well, if we made you curious to take the course.

598

: 01:00:29

That's awesome.

599

: 01:00:29

I think this will be a lot of hours well invested.

600

: 01:00:34

Yeah, because it's nine lessons.

601

: 01:00:37

It's, I don't know how many hours of videos, but a lot.

602

: 01:00:41

You have lifetime access to that.

603

: 01:00:44

have exercises, which are very important.

604

: 01:00:47

Folks, I know I sound like a very old professor here, but actually I think the most

valuable of the course is not only watching the videos, but also doing the exercises.

605

: 01:00:58

and going through the solutions that you have all on the repo and asking questions on the

discourse, answering questions on the discourse, being part of that community.

606

: 01:01:08

Basically that's really how you're going to get the most out of yeah, like it's, you can

not learn how to ride a horse by just watching people riding horses.

607

: 01:01:23

It's the same with patient modeling.

608

: 01:01:26

If you just watch the videos, that will be entertaining for sure, but you're not gonna get

the most out of it.

609

: 01:01:32

So, yeah.

610

: 01:01:33

And if you do take the course, please say hi.

611

: 01:01:37

You are gonna be very happy to have you there and definitely wanna hear from you.

612

: 01:01:44

Tell me maybe, yeah, something I wanted to ask you before letting you go is, I know you've

done some work lately about sparse matrices.

613

: 01:01:56

If I remember correctly, in PyTentor, is that something you think would be useful here to

share a bit for listeners?

614

: 01:02:04

Yeah, yeah, can, I It's a topic I really like and I wish I knew more about that and always

like trying to learn.

615

: 01:02:15

Like there's some depth at which I know nothing about how that works.

616

: 01:02:23

But basically,

617

: 01:02:25

You already mentioned this, many things can be expressed as dot products.

618

: 01:02:30

And a subset of those many things can be expressed as a dot product between a matrix and a

vector.

619

: 01:02:39

That happens all the time in linear models.

620

: 01:02:43

That's basically the gist of linear model.

621

: 01:02:47

And in a subset of those cases, one

622

: 01:02:52

the matrix of that dot product is very sparse.

623

: 01:02:58

And if it's very sparse...

624

: 01:02:59

So define a sparse...

625

: 01:03:00

Yeah, define a closed matrix for example.

626

: 01:03:05

You have many entries in a matrix, but most of them, the great majority of them, are zero.

627

: 01:03:14

So it means in the multiplication they are not going to contribute anything to the final

628

: 01:03:19

If you do a dot product between a sparse matrix and a dense vector, dense is the opposite

of a sparse, meaning that you can have some zeros, but you don't have so many zeros to the

629

: 01:03:34

point where non -series are the rare value.

630

: 01:03:40

Anyway, if you have a big sparse matrix and a dense vector and you multiply them, you do a

dot product.

631

: 01:03:49

you're going to spend a lot of time computing things that are serial and will always be

serial and contribute nothing to the end result.

632

: 01:04:02

Of course there are, like, for a long time there have been structures to store these

special matrices in computers in such a way that you save space because

633

: 01:04:16

If you have a huge matrix with a lot of zeros stored in a dense way, that takes memory.

634

: 01:04:26

If you don't tell the computer those values are all the same, it doesn't know about that.

635

: 01:04:31

So it's going to take a lot of memory to store that matrix.

636

: 01:04:35

But with a sparse matrix, first you can save a lot of space into storage of the matrix.

637

: 01:04:46

And then you can exploit the sparsity to do less computations.

638

: 01:04:56

And at the end of the day, have computations that run faster.

639

: 01:05:00

And if you are doing MCMC, which means that you are evaluating the log P and its

derivative many, many times, it means you're multiplying.

640

: 01:05:15

If you're doing

641

: 01:05:16

matrix and vector multiplication a lot of times.

642

: 01:05:20

So gaining time, making that computation faster is something that we want to have.

643

: 01:05:31

yeah, PyTensor has some support for sparse matrices and sparse objects in general.

644

: 01:05:41

But as far as I know, that support comes from

645

: 01:05:46

old Tiano days.

646

: 01:05:47

There has been some maintenance, but not a lot of features have been added.

647

: 01:05:55

And yeah, for some projects at Labs, I've been writing my custom things to do dot products

between sparse matrices and dense vectors.

648

: 01:06:08

Unfortunately, I didn't have time yet to put that into PyTensor, but I want to do that

649

: 01:06:15

someone wants to collaborate on that endeavor, I'm more than happy.

650

: 01:06:25

But yeah, I think it's something that we should do more.

651

: 01:06:28

And the main motivation was that I wanted Bambi to do that by default, because Bambi is

doing the simple thing of multiplying big dense matrices.

652

: 01:06:44

when some of those matrices could have been sparse.

653

: 01:06:49

It's definitely not like new theory or new computational techniques, but it's taking

things that already exist and making them usable, first available and then usable for the

654

: 01:07:07

wider community.

655

: 01:07:08

And I don't know, I have fun doing those kinds of things.

656

: 01:07:13

Yeah, I mean, I think this is extremely valuable.

657

: 01:07:17

I hope you'll have time to include that in Python.

658

: 01:07:23

In a few weeks or months.

659

: 01:07:24

I mean, if I had time, but I definitely helped you, Matt.

660

: 01:07:29

Unfortunately, now with the new job and the other projects that have

661

: 01:07:42

to finish, like, don't have a lot of time for that.

662

: 01:07:46

yeah, but I mean, this is also definitely something that I want to learn more about

because it happens quite a lot.

663

: 01:07:54

And this is extremely frustrating.

664

: 01:07:56

Yeah, it's just like your brain, it feels weird because your brain when it sees a zero, it

knows if this term is not going to be useful.

665

: 01:08:07

So you can kind of get rid of it when you do the computation

666

: 01:08:11

You you do any computation by hand, you get rid of the zeros very easy.

667

: 01:08:15

But the computation does, the computer doesn't know that.

668

: 01:08:18

So you have to tell it because otherwise it spends a lot of time doing useless

computation.

669

: 01:08:24

And then in the end it's like, yeah, that's a zero.

670

: 01:08:27

But then you spent a lot of seconds doing that.

671

: 01:08:29

And that's stupid.

672

: 01:08:31

But you have to tell it, right?

673

: 01:08:33

It's what I tell with computers a lot, Computers are very powerful, but often they are

very dumb.

674

: 01:08:40

So you need to tell them exactly what you want.

675

: 01:08:43

And that's basically what you're trying to do here.

676

: 01:08:47

That's really interesting because that also happens very frequently, doesn't it?

677

: 01:08:51

Yeah, yeah.

678

: 01:08:53

For those who are curious about it and want to take a deeper dive, Daniel Simpson, he has

a very interesting blog.

679

: 01:09:02

And in that blog, he has many posts about doing things with sparse mentacies.

680

: 01:09:09

because I didn't mention this, but these matrices can have particular structures and if

they have that particular structure, can exploit some property of matrices and then do the

681

: 01:09:22

computation even faster.

682

: 01:09:25

like dot products, inverses, transposes, and things like that, determinants.

683

: 01:09:32

If you have matrices with particular structures, you can exploit those structures to save

684

: 01:09:39

and perhaps also memory.

685

: 01:09:41

And Daniel wrote a lot of posts doing things with sparse matrices using Jax, which, know,

PyTensor has these multiple backends.

686

: 01:09:56

It has a C backend, it has a Numba backend and a Jax backend.

687

: 01:10:02

And what has been frustrating to be honest is that the support for sparse matrices

688

: 01:10:09

varies a lot in those backends.

689

: 01:10:14

And that's one of the reasons that makes it harder to have something available that works

for most of the cases.

690

: 01:10:25

So in my use case, I implemented what I needed for the particular model that I had.

691

: 01:10:34

But if you want to have something public,

692

: 01:10:39

available for the wider community, it should work in more than just one single case.

693

: 01:10:47

But yeah, I think what's needed is a few people with some time to work on that and that

should be it because many things are already invented.

694

: 01:11:03

I'm not saying the task is trivial, not at all.

695

: 01:11:05

I'm saying it's...

696

: 01:11:09

It's about investing time, programming, designing, testing, and all that.

697

: 01:11:16

Yeah.

698

: 01:11:17

Yeah, so you heard it, folks.

699

: 01:11:19

Really, if you're interested in working on that, and you don't need to be an expert on

that because we have people like Tommy on the Pimesy repo who can mentor you.

700

: 01:11:30

If you're interested in that and you want to dive a bit into open source, please contact

me and I'll put you in contact

701

: 01:11:38

the appropriate authorities, as we say.

702

: 01:11:43

And yeah, so we should definitely put that blog post by Dan Simpson in the show notes,

Tommy, if you can do that.

703

: 01:11:50

also, is there anything you can share already in the show notes from your custom

implementation?

704

: 01:11:59

Yeah, I have all the repository that is public.

705

: 01:12:05

Perhaps I can update

706

: 01:12:07

with the latest things.

707

: 01:12:10

But I do have a few things to share.

708

: 01:12:15

Both implementations and experiments of myself testing those implementations.

709

: 01:12:22

Nice.

710

: 01:12:23

Which implementations are those?

711

: 01:12:25

In which cases could people use them?

712

: 01:12:28

Just Matrix.

713

: 01:12:31

If you write it, it's SPMB.

714

: 01:12:34

It's a sparse matrix.

715

: 01:12:38

SPMB, think.

716

: 01:12:39

But basically sparse matrix dense vector multiplication.

717

: 01:12:44

That's what I care about.

718

: 01:12:46

But that's in PyTensor.

719

: 01:12:51

PyTensor, C, Numba, JAX, many things.

720

: 01:12:55

But yeah, it's PyTensor with different backends.

721

: 01:13:00

Okay, so it would be like, for instance, you could use that function that's written in

PyTensor.

722

: 01:13:08

in a PyMC model.

723

: 01:13:09

Yeah, yeah, that's the goal and that's what I did in my use case.

724

: 01:13:12

Yeah, yeah, yeah.

725

: 01:13:14

It's like you have a sparse matrix multiplication somewhere in your PyMC model.

726

: 01:13:18

Instead of just doing pm .math .dot, you would use that custom...

727

: 01:13:22

Another function.

728

: 01:13:24

You would use that custom PyTensor function.

729

: 01:13:27

Correct.

730

: 01:13:28

Yeah, but the problem I was telling is, let's say you want to use the great new, not

PySum, okay, then you need a number backend to be

731

: 01:13:37

so you have that sparse thing implemented in Lumber and so on.

732

: 01:13:42

That definitely would awesome to have people help out on that.

733

: 01:13:52

I definitely love to that, unfortunately, I cannot extend my days.

734

: 01:13:59

That's really fascinating work.

735

: 01:14:02

That's really cool.

736

: 01:14:04

I'm hoping to have to do that at one point for work.

737

: 01:14:07

So you are forced to do it?

738

: 01:14:09

Yeah, either for the Marlins or for the Labs project.

739

: 01:14:14

Because then I'm forced to dive into and do it and probably do a PR to finally push that

to Pytancer Universe.

740

: 01:14:22

That's how a lot of my PRs end up being, you know.

741

: 01:14:26

That'd be great, I'd say.

742

: 01:14:27

I'd love that.

743

: 01:14:29

I love that because I've definitely been beaten by that before.

744

: 01:14:35

that's, yeah.

745

: 01:14:37

I had also looked into implementing a sparse Softmax implementation in Pytensor.

746

: 01:14:44

If I remember correctly, that didn't need to be very hard and I didn't have a lot of time

to work on that project, so I had to abandon it.

747

: 01:14:54

But yeah, definitely that'd be super fun.

748

: 01:14:57

Great, so, Tommy, it's already been a lot of time, maybe I just have one more question

before I go to last two questions.

749

: 01:15:07

Now I know you, learn a lot of stuff, we kind of work similarly so I think something I'd

like to ask you is what are you thinking about these days?

750

: 01:15:19

What do you want to learn in the coming week or coming month?

751

: 01:15:24

that's an interesting question.

752

: 01:15:30

I've been learning more about hierarchical models.

753

: 01:15:35

So it seems like, but shouldn't you already know about that topic?

754

: 01:15:39

Yeah, but turns out there are a lot of things to learn.

755

: 01:15:43

And so I've been learning about basic modeling and hierarchical models, like in multiple

ways, definitely gaining intuition through like computer exercises.

756

: 01:15:59

helped me a lot.

757

: 01:16:01

But lately, I went to more formal sources to have a look at the math and have a look at

the properties to better understand assumptions, consequences of those assumptions, trying

758

: 01:16:17

to understand when we can avoid computations.

759

: 01:16:24

In some point, my understanding was, okay, we have HMC.

760

: 01:16:28

This is the best thing in the world.

761

: 01:16:30

we pass any model between quotes because it's not any model but let's say any model and it

just works.

762

: 01:16:36

Okay, yes, you can have some problems but let's say it just works.

763

: 01:16:43

But then I've been learning more about those cases where you can avoid using such a

sampler or you can...

764

: 01:16:52

I know it sounds boring to write your own MCMC routine but if you have a A model

765

: 01:16:59

that you know very well and that's the model you want to use and nuts is going to take 30

hours because you have millions of parameters probably it's worth it like having a look at

766

: 01:17:15

the theory and realizing if you can do something more and I'm learning about that and I

really like it it's challenging I think that with

767

: 01:17:27

the experience of having worked a lot with BASIN models, is much easier to digest all

that.

768

: 01:17:36

So that's one of the things that I'm learning about.

769

: 01:17:42

Another thing that I'm always learning, and there's a book that we have been sharing

lately with folks at Labs and on Twitter.

770

: 01:17:53

The book is called, Richly Parametrized

771

: 01:17:56

linear models or something like that.

772

: 01:17:58

But something about models with a lot of parameters and how to work with those models.

773

: 01:18:04

And the book is great.

774

: 01:18:07

I enjoyed it.

775

: 01:18:09

And the topic is the connection between many different models that seem to be different,

but how they are connected to each other.

776

: 01:18:20

And I really enjoy that.

777

: 01:18:23

Like, you have a spline model.

778

: 01:18:27

You have a model with splines and then you have a hierarchical model but if you have these

particular priors and you go to the models that distribution it matches that other thing

779

: 01:18:37

and seeing those connections between the different models and modeling approaches is

really nice because it may seem boring at some point but that's how you

780

: 01:18:57

really grasp the depths of something.

781

: 01:19:03

So yeah, those are two things I'm learning about these days and I enjoy learning about

those things.

782

: 01:19:12

Yeah, I can tell you, you love learning about new things.

783

: 01:19:18

I do too, I think that's why also we work so well together.

784

: 01:19:25

And if you have a link to the book you just mentioned...

785

: 01:19:28

Yeah, I will share the book to edit.

786

: 01:19:31

I'm very bad at remembering exact names.

787

: 01:19:35

Fortunately, I can just search my computer so I know one or two words and then I can get

what I want.

788

: 01:19:41

That's cool.

789

: 01:19:43

sounds about right.

790

: 01:19:46

Well, Tommy, that's great.

791

: 01:19:48

I think it's time to call it a show.

792

: 01:19:50

We've got a lot of ground.

793

: 01:19:52

Of course, a ton of questions I

794

: 01:19:54

Still ask you, let's be respectful of your time.

795

: 01:19:58

But before, I'll let you go, of course.

796

: 01:20:00

I'm gonna ask you the last questions.

797

: 01:20:02

I'll ask you if you had guests at the end of the show.

798

: 01:20:06

you could...

799

: 01:20:08

No, sorry.

800

: 01:20:09

First one is if you had unlimited time and resources, which problem would you try to

solve?

801

: 01:20:16

I don't know if this problem has like a particular name, but you know, I enjoyed...

802

: 01:20:23

working with samples obtained with MCMC methods.

803

: 01:20:29

And it's really nice learning about how they work and how to diagnose them and all that.

804

: 01:20:35

But if we could have just a method that gives us real samples from any posterior

distribution that we work with, or we could have a very clever machine that knows the

805

: 01:20:50

details about every model

806

: 01:20:52

without us noticing, it uses a specific method to give us draws from the posterior,

meaning that you don't need to worry about divergences, convergence, and things like that,

807

: 01:21:08

where you can just focus in the analysis of the outcome.

808

: 01:21:12

I will work on that.

809

: 01:21:13

Because, and it's something I've been thinking more these days, like, now I need to wait

for the compilation.

810

: 01:21:21

and now I need to wait a few hours to get the draws.

811

: 01:21:26

If I could have something that saved me from that, even though I enjoy learning about how

it works and how to improve it depending on the kind of problems I'm having, yeah, I would

812

: 01:21:42

definitely like getting rid of MCMC and just do MC.

813

: 01:21:52

But I don't know if it's possible.

814

: 01:21:54

But if I'm here to dream, I'm going to have like...

815

: 01:22:00

Yeah, a very ambitious dream.

816

: 01:22:02

sure.

817

: 01:22:03

Yeah, Let's dream big.

818

: 01:22:05

Yeah, I agree with that.

819

: 01:22:06

Kind of having a...

820

: 01:22:07

Yeah, what I often dream about is having kind of like a Javi's like Iron Man.

821

: 01:22:13

I mean, like, can you try that version of the model?

822

: 01:22:17

Something like that.

823

: 01:22:19

that'd be fantastic.

824

: 01:22:21

Yeah.

825

: 01:22:22

Nice.

826

: 01:22:23

then second question.

827

: 01:22:24

If you could have dinner with any great scientific mind that alive or fictional, who would

it be?

828

: 01:22:31

And keep in mind that you cannot say myself because you already had dinner with me.

829

: 01:22:37

then we have to finish the recording.

830

: 01:22:39

Yeah, I I knew you were going to answer myself and I definitely appreciate that.

831

: 01:22:44

But you already had dinner with me, so you have to choose one of us.

832

: 01:22:47

Yeah,

833

: 01:22:51

Again, let me explain the answer.

834

: 01:22:54

I don't know why, but I'm a fan of movies and documentaries about World War II.

835

: 01:23:02

And one movie I enjoyed a lot and like I was really into the movie with a lot of attention

and very interested in what was happening was the, I think in English it is called the

836

: 01:23:16

Imitation Game, but in Spanish we call it...

837

: 01:23:21

the Enigma code or something like that.

838

: 01:23:26

And I really enjoyed that movie.

839

: 01:23:29

And I was fascinated seeing the machine moving the things and making noise, trying to

crack the machines to understand the message and then using like, okay, now we have the

840

: 01:23:46

information.

841

: 01:23:46

What do we do with that information?

842

: 01:23:48

So definitely...

843

: 01:23:51

I'm talking about Alan Turing and we have dinner with him to talk about everything.

844

: 01:23:57

How he was recruited, how they come with ideas, how they used it, what was hard about

making choices because it was both a technical problem but also a political, human

845

: 01:24:14

problem.

846

: 01:24:16

And then to talk about what happened after that.

847

: 01:24:19

So yeah, I think

848

: 01:24:22

The bad thing about that dinner would be that I would like it to last for many hours

because I would have many questions.

849

: 01:24:33

But yeah, that would be one person I would like to have dinner with to interview and ask a

lot of things.

850

: 01:24:42

Yeah, Great choice.

851

: 01:24:43

Fantastic choice.

852

: 01:24:45

Invite him at Christmas.

853

: 01:24:49

Christmas dinner

854

: 01:24:50

takes hours, so I think that's That's a very good opportunity.

855

: 01:24:55

Whether in France or Argentina, they always last hours, so you know.

856

: 01:25:00

That's good.

857

: 01:25:02

Awesome.

858

: 01:25:02

Well, thanks a That was a blast to finally have you on the show.

859

: 01:25:09

More than 100 episodes after you eavesdropped on Osvaldo's door at the Cunicet.

860

: 01:25:22

In Spanish, I think you would say, a little bit Quechua, and yeah, I'm sure.

861

: 01:25:29

yeah, yeah.

862

: 01:25:30

Yeah, that's great to have you on the show, And as usual, we'll put a link to your

website, to your socials, to a lot of links for those who want to dig deeper.

863

: 01:25:46

Thanks again, Tommy, for taking the time and being on this show.

864

: 01:25:50

Thank you, it was a lot of fun to be honest.

865

: 01:25:54

if Alex happens to invite you to the podcast, you have to say yes.

866

: 01:26:00

Thank you, Alex.

867

: 01:26:07

This has been another episode of Learning Bayesian Statistics.

868

: 01:26:10

Be sure to rate, review, and follow the show on your favorite podcatcher, and visit

learnbayestats .com for more resources about today's topics, as well as access to more

869

: 01:26:21

episodes to help you reach true Bayesian state of mind.

870

: 01:26:25

That's learnbayestats .com.

871

: 01:26:27

Our theme music is Good Bayesian by Baba Brinkman.

872

: 01:26:30

Fit MC Lance and Meghiraam.

873

: 01:26:32

Check out his awesome work at bababrinkman .com.

874

: 01:26:35

I'm your host.

875

: 01:26:36

Alex Andorra.

876

: 01:26:37

can follow me on Twitter at Alex underscore Andorra, like the country.

877

: 01:26:41

You can support the show and unlock exclusive benefits by visiting Patreon .com slash

LearnBasedDance.

878

: 01:26:49

Thank you so much for listening and for your support.

879

: 01:26:51

You're truly a good Bayesian.

880

: 01:26:53

Change your predictions after taking information in.

881

: 01:26:57

And if you're thinking I'll be less than amazing, let's adjust those expectations.

882

: 01:27:03

Let me show you how to be a good Bayesian Change calculations after taking fresh data in

Those predictions that your brain is making Let's get them on a solid foundation

Share Episode

Shownotes

Transcripts

Follow

Links

Chapters

Video

More from YouTube