Artwork for podcast HockeyStick Show
HockeyStick #7 - Generative AI for Data Analytics
Episode 713th May 2024 • HockeyStick Show • Miko Pawlikowski
00:00:00 01:06:19

Share Episode

Shownotes

Leveraging Generative AI in Data Analytics: Insights from Industry Experts

In this episode of HockeyStick, host Miko Pawlikowski interviews Artur Guja, Dr. Marlena Siviak, and Dr. Marian Siwiak, the authors of 'Generative AI for Data Analytics.' They discuss the impact of generative AI on data analytics, their collaborative background, and their book's focus on utilizing AI tools efficiently rather than seeking them out as silver bullets for complex problems. The conversation also delves into process optimization, the challenges and realities of academia, the potential and limits of prompt engineering, and the future of AI in data analytics. The importance of understanding the nuances of using generative AI as an assistant rather than a replacement for human creativity and insight in data analytics is highlighted throughout the discussion.

00:00 Welcome to HockeyStick: Unveiling Generative AI's Impact on Data Analytics

00:17 Meet the Minds Behind the Book: Diverse Expertise Uniting for Innovation

01:11 The Genesis of a Groundbreaking Book: Collaboration and Inspiration

02:07 Demystifying Generative AI: Beyond the Hype and Into Practical Use

04:34 The Essence of Process Optimization: Bridging Gaps and Enhancing Efficiency

09:58 Navigating the Complexities of Academia: A Personal Journey

24:13 The Intriguing World of Pharmacon: A Techno-Thriller Born from Experience

26:00 Crafting a Book on Generative AI: A Collective Venture into the Future

33:55 Exploring the Impact of AI on Data Analytics and Programming

35:56 The Skepticism Towards LLMs in Development

37:04 The Role of Healthy Paranoia in AI Assistance

39:41 Defining and Discussing Artificial Sentience

46:14 The Practical Use of Generative AI in Data Analytics

01:00:15 The Future of Data Analytics and AI Integration

01:04:24 Final Thoughts and Book Promotion

Transcripts

Miko Pawlikowski:

I'm Miko Pawlikowski and this is HockeyStick.

Miko Pawlikowski:

Today we're talking about how generative AI is changing the field of data analytics and how you too can leverage large language models to become your assistant and co-worker.

Miko Pawlikowski:

I'm joined by the three authors of the "Generative AI for Data Analytics" book,

Miko Pawlikowski:

now available in early access from Manning.com.

Miko Pawlikowski:

Artur Guja, risk manager and computer scientist with over 20 years of experience in the banking sector.

Miko Pawlikowski:

Dr.

Miko Pawlikowski:

Marlena Siviak, data scientist and bioinformatician, the co-creator of the first global model of the COVID-19 pandemic, and the co-author of a techno thrill novel and sci-fi short stories.

Miko Pawlikowski:

And Dr.

Miko Pawlikowski:

Marian Siwiak, data scientist, strategist, and bioinformatician, the creator of the first artificial sentience, something we're going to cover in this episode, and the sci-fi novel Pharmacon.

Miko Pawlikowski:

Welcome to this episode and thank you for flying hockey stick.

Miko Pawlikowski:

The first thing I thought is that you look like an eclectic bunch.

Miko Pawlikowski:

you've got Artur of this banking sector, Marlena with the bioinformatician, Marian, data How did you end up teaming up for the book?

Marian Siwiak:

we worked together previously, especially me and Marlena.

Marian Siwiak:

With Artur, we also,

Artur Guja:

walking our kids in the park.

Artur Guja:

the three of us used to work, earlier on various ventures, on, process, automation on the business process re-engineering.

Artur Guja:

So this is one of many ventures that we've done

Miko Pawlikowski:

I see.

Miko Pawlikowski:

So you go way back and this is just Another project.

Miko Pawlikowski:

Just another day.

Marian Siwiak:

funnily enough, it's not like we go like 20 years way back.

Marian Siwiak:

We work together quite intensely.

Marian Siwiak:

what we did together, multiple things, they all led to this book because we were always trying to find ways to make things.

Marian Siwiak:

quicker, more efficient, better.

Marian Siwiak:

This is what Artur mentioned.

Marian Siwiak:

we worked in process optimization in a broad sense.

Marian Siwiak:

So it was always interesting, to us how to make things more, efficient.

Marian Siwiak:

And when generative AI.

Marian Siwiak:

blew and finally started to resemble, human cognition in a sense, we decided to give it a try and our minds were collectively blown and we started using it for our work.

Marian Siwiak:

And then we decided that now that we know how to use it, I would say, again, efficiently and, Marlena found a way to use it smartly.

Marian Siwiak:

we decided that we could write a book about it because we noticed that there is a lot of buzz about it.

Marian Siwiak:

There is a lot of prompt engineering.

Marian Siwiak:

now I think on Coursera, you can take a specialization in prompt engineering.

Marian Siwiak:

and everybody's again, looking for a silver bullet.

Marian Siwiak:

So I will just type in magical command and it will solve my problems.

Marian Siwiak:

our collective experience is technology doesn't solve problems.

Marian Siwiak:

technology can give you a great headache if you don't use it in a way it's supposed to be used, but everybody tries to cut corners and simplify things.

Marian Siwiak:

So this book is about using generative AI.

Marian Siwiak:

it's not a cookbook.

Marian Siwiak:

It's not, okay, this is some code or some prompts.

Marian Siwiak:

You will type them in and your problems will be solved.

Marian Siwiak:

it's just not how we work.

Marian Siwiak:

It's not how the world works.

Marian Siwiak:

Despite many people wanting it to,

Marlena Siwiak:

I think that this is the problem with expectations.

Marlena Siwiak:

many people have missed expectations in terms of ChatGPT and other generative AI.

Marlena Siwiak:

And then they are surprised and they are unhappy because ChatGPT can't make them coffee yet.

Marlena Siwiak:

maybe this is not the tool for making coffee.

Marlena Siwiak:

I very often see this kind of complaint, which is not necessary because it is a great tool.

Marlena Siwiak:

it's great invention.

Marlena Siwiak:

And I think it's going to change the way our society works.

Marlena Siwiak:

it's good to live in such times.

Marlena Siwiak:

it's really interesting.

Miko Pawlikowski:

Ask a few questions to ChatGPT and see how good they are and see what you can do.

Miko Pawlikowski:

Ask for some snippets and do all kinds of things that kind of speed you up.

Miko Pawlikowski:

but it's also probably the most frustrating, element of working with, especially for people like me who come from software engineering background and they like things well defined and, always replicable and reproducible and all of that, and then you go here and it all ends.

Miko Pawlikowski:

But, before we jump into the book, a little bit deeper, can you tell us a little bit more about what, process optimization actually means?

Miko Pawlikowski:

I know that's probably a phrase that you use a lot and it means a well defined thing for you, but it might not for the audience.

Artur Guja:

basically taking a look at what business does, what people do in the business and, looking for, ways for optimizing it, but, actually describing what should be done, what people think, should be done versus what people actually

Artur Guja:

do, because usually there is a massive gap between what people think is happening and what they think should be happening.

Artur Guja:

People think that, a given operation should be reviewed by at least two people and should take no more than a day.

Artur Guja:

The fact is that usually one person just takes it off and it takes maybe two days because they're very busy or they've been on holiday.

Artur Guja:

the dissonance between reality and documentation is usually huge.

Artur Guja:

In looking from the process from the outside and then looking for ways to close that gap is I think the best way to describe the optimization to actually make the process and the reality meet in something that is both realistic.

Artur Guja:

Because processes, when they're designed, are usually overly, optimistic and something that actually works.

Artur Guja:

And then using automation, because once, once you actually describe what's happening, you can use automation to free people from the burden of mundane tasks, and actually help them focus on something creative.

Marian Siwiak:

the way we approached it is, important part is to understand what is really happening.

Marian Siwiak:

And that, Joe on the second floor is actually the information hub for all the company, and despite his, activities not being overly highlighted in the org structure, he's the most important person in the company.

Marian Siwiak:

We created, maps which connected on the one side, what are the actions and decisions?

Marian Siwiak:

this is where we believe.

Marian Siwiak:

Is the critical, value in process mapping is understanding what are the decisions to be made, who is making these decisions and what, on what basis to understand on what basis they make this decision.

Marian Siwiak:

We map up decisions and we map up all the data that they are using.

Marian Siwiak:

So all the actions produce data, and all decisions utilize some data and you have this two layers of information about the process.

Marian Siwiak:

Artur introduced, also the third layer, which is the risk.

Marian Siwiak:

So the people who are making decisions can understand, what are the risks associated, what different outcomes of decisions can be.

Marian Siwiak:

And then when you can see how it all works, you can improve on it.

Marian Siwiak:

You can shorten the cycles.

Marian Siwiak:

so process optimization is actually first understand what is happening, understanding, what could be happening and find a way to make decisions more informed and, conscious of risks.

Marian Siwiak:

And then also you have actions, and this is probably where most of the process optimization, consultants work, is how to make actions to be more efficient.

Marian Siwiak:

But in our opinion, if the action is triggered by a misinformed decision, it's a pure waste of time anyway.

Miko Pawlikowski:

makes more sense now, because initially I thought when you said technology doesn't solve problems, it creates headaches.

Miko Pawlikowski:

I was like, 'Oh, this is such a terrible slogan

Marian Siwiak:

as you can see, I now work in the aluminum refinery.

Marian Siwiak:

because people didn't want to hear, what we are saying.

Marian Siwiak:

They wanted to hear, 'yes, we will come and install you a new tool and all will be solved'.

Marian Siwiak:

So our sales process sucks, as you can hear.

Marian Siwiak:

Where we were able to implement it, it worked perfectly.

Marian Siwiak:

but not many people wanted to put extra effort.

Marian Siwiak:

So I need to tell you what I do?

Marian Siwiak:

No, I want the tool that will discover what I do.

Artur Guja:

This is a problem with generative AI, that people expected to solve problems just by, give me an account on, ChatGPT.

Artur Guja:

And here all my problems are solved.

Artur Guja:

And very often as we've seen through various, anecdotal, evidence,

Artur Guja:

giving ChatGPT to people who are not aware of the dangers of it and the problems, the hallucinations that it can generate, just leads to, hilarious results as, the case of those lawyers in US who introduced completely fictitious, cases into their evidence or, maybe slightly less hilarious examples

Artur Guja:

of, proprietary software leaking out through ChatGPT because people were just putting proprietary information into it and it became public knowledge.

Artur Guja:

don't expect ChatGPT to solve all your issues

Marlena Siwiak:

the comparison that you used at the beginning was the right one.

Marlena Siwiak:

That ChatGPT is like an assistant.

Marlena Siwiak:

Very, smart, very intelligent, an assistant who read a lot and learned a lot, but it's still a newbie.

Marlena Siwiak:

He's, just after grad school, right?

Marlena Siwiak:

No experience, you can ask it for help, you can give it tasks to do, but you have to, manage that.

Marlena Siwiak:

You cannot give him all the responsibility.

Miko Pawlikowski:

Yeah.

Miko Pawlikowski:

one could say it was literally born yesterday,

Marlena Siwiak:

Exactly.

Miko Pawlikowski:

to a certain degree, understandable.

Miko Pawlikowski:

so is your PhD background also in, in process optimization

Marlena Siwiak:

So my PhD was in biophysics, in particular protein translation, a bit of process optimization, but not much and not related to business at all.

Marlena Siwiak:

at some point I decided to quit academia and you have to do something else.

Marlena Siwiak:

So I turned to data science, which was very close to the things that I was actually doing as a bioinformatician.

Marlena Siwiak:

the type of data changed, basically, that was the thing that really matter.

Marlena Siwiak:

And from there, slowly, you look for a job, another job, and it goes like that.

Marlena Siwiak:

Yeah.

Miko Pawlikowski:

and if you don't mind me asking why quit academia,

Marlena Siwiak:

maybe I got a bit disappointed with how science is made.

Marlena Siwiak:

you want more citations of your publications to survive.

Marlena Siwiak:

And to have more citations, you have to be more popular in social media and stuff.

Marlena Siwiak:

it's crazy that you have to fight for popularity by being a scientist where what should count is actually your science, your research and the thought behind it.

Marlena Siwiak:

There are too many papers.

Marlena Siwiak:

Nobody has time to read it, even in very narrow domain.

Marlena Siwiak:

So they read the first things that come to them when they search the internet.

Marlena Siwiak:

So we have to fight to be popular, to be on top.

Marlena Siwiak:

it has nothing to do with the quality of your research, in fact.

Marian Siwiak:

and there is a research on that, which shows that, You need to be popular to be accepted to high priority journals and it has nothing to do and as I said It's not just opinion of the frustrated former scientist, but that's a research showing that

Marian Siwiak:

the quality published there Is exactly the same as anywhere else, but there is more citations and, also more money resulting from it.

Marian Siwiak:

prestige, here translates to money because, from citations, come better grants, right?

Marian Siwiak:

Still, I want to be perfectly clear.

Marian Siwiak:

I think peer review

Marian Siwiak:

despite all its drawbacks, it's the only way of, distinguishing from pseudo research.

Marian Siwiak:

It changed a little in software engineering.

Marian Siwiak:

I don't think the papers about ChatGPT or LLAMA or anything like that were peer reviewed.

Marian Siwiak:

They are prepared as so called preprints, and they don't bother with so called researchers to evaluate it because the results speak for themselves.

Marian Siwiak:

So this is, I must say, the paradigm shift, I love this word, that we observe right now.

Marian Siwiak:

but in most other cases, the peer review is the only process.

Marlena Siwiak:

when you're talking about peer review, another thing that bothers me in academia is the fact that everybody expects that your research will be successful, and it's not always so with research.

Marlena Siwiak:

Research is asking questions.

Marlena Siwiak:

does your hypothesis work?

Marlena Siwiak:

And very often it doesn't work, or the most often outcome is that we don't know because the effect is too small, yeah?

Marlena Siwiak:

And it's impossible to publish things things.

Marlena Siwiak:

when you answer to the question, we still don't know.

Marlena Siwiak:

So you'll waste a lot of time and your effort, your money, and in the end you have the answer "we still don't know".

Marlena Siwiak:

Who would give you another money?

Marlena Siwiak:

So what researchers do, sometimes unconsciously, they are trying to find, black or white, but very often it's grey, publishing this grey results is still valuable because when you collect multiple researches like this,

Marlena Siwiak:

prepare a meta analysis, you can get the final answer yes or no.

Miko Pawlikowski:

or

Marlena Siwiak:

But the way science is funded, and the fact that you won't get another money for research like this, if you produce, "I don't know" answer,

Miko Pawlikowski:

I can't remember last time nature had on the cover.

Miko Pawlikowski:

"Is this true?

Miko Pawlikowski:

Don't know".

Marian Siwiak:

don't know.

Marlena Siwiak:

There's no space for such discussion.

Marlena Siwiak:

And, everybody's in a rush in academia.

Marlena Siwiak:

There is no space to really think.

Marlena Siwiak:

to educate yourself.

Marlena Siwiak:

Yeah, it's all, in the rush and results without, it's like corporation.

Marlena Siwiak:

It's not much difference, really.

Miko Pawlikowski:

So what you're saying is that turns out that scientists found that scientists are humans like any others, and they have the same problems with herd mentality and wanting to progress their career and wanting to make money and making headlines.

Marlena Siwiak:

it's not making huge monies or anything like that.

Marlena Siwiak:

Because to be honest, salaries in academia suck, right?

Marlena Siwiak:

when you compare the salaries, these salaries to salaries of people who work in business and are similarly educated, it's much worse.

Marlena Siwiak:

And the expectations are high, yeah?

Marlena Siwiak:

the amount of work you have to do, the amount of time.

Marlena Siwiak:

time it consumes,

Marian Siwiak:

Also, it's very ego-driven.

Marian Siwiak:

Look at us.

Marian Siwiak:

you have this myth.

Marian Siwiak:

Of We are the beacon of truth for the world, which has nothing to do with truth anyway.

Marian Siwiak:

But anyway, pretty low salaries compared to other positions.

Marian Siwiak:

You have pretty low, position stability.

Marian Siwiak:

many institutions keep researchers on grant money.

Marian Siwiak:

we bring more grants so they can get the overheads, their share.

Marian Siwiak:

brings people with very specific mentality, and many of them are complete egomaniacs.

Marian Siwiak:

So

Marian Siwiak:

it also makes all this environment extremely toxic.

Marian Siwiak:

know I sound like a frustrated former scientist, which I am.

Marian Siwiak:

but it doesn't mean that I'm not right,

Miko Pawlikowski:

to segway into a question I was going to ask about that COVID 19 pandemic, Could you talk a little bit about that Covid, model?

Miko Pawlikowski:

I'm curious, what does it mean to say, you're the co-creator of the first global model of covid pandemic?

Marian Siwiak:

we created a model of a global pandemic.

Marian Siwiak:

in March, 2020, we had a model where we were dropping an index case.

Marian Siwiak:

So it's the first person infected in Wuhan, China in November, 2019.

Marian Siwiak:

And we were accurately predicting number of symptomatic and asymptomatic cases in New York a couple months later.

Marian Siwiak:

back then there was no Good model on any country level.

Marian Siwiak:

Later, there were global models, because again, technology doesn't solve problems.

Marian Siwiak:

This is the perfect example of what we spoke previously, because we used existing technology.

Marian Siwiak:

No, we looked at the virus as a biological, not a political entity.

Marian Siwiak:

And that was.

Marian Siwiak:

The biggest difference, because we looked at the data available and we decided, okay, it's impossible that the virus has a completely different infectivity in one country than in the other.

Marian Siwiak:

It just viruses don't work this way.

Marian Siwiak:

It's not like they have, passports and they say, okay, I come to this country and I'll be nice and I will, infect, not more.

Marian Siwiak:

Yeah.

Marian Siwiak:

Visa denied.

Marian Siwiak:

no, in your country, I will infect no more than three people from every, infected person, I think our listeners will also interested in the source of the model, we approached as a data science problem and, at the same time, the biology-related problem.

Marian Siwiak:

So we checked other coronaviruses.

Marian Siwiak:

And we assumed that it is yet another coronavirus, like there was SARS, there are other.

Marian Siwiak:

And we simply used the values.

Marian Siwiak:

We created a model, not a pure machine learning model.

Marian Siwiak:

We prepared analytical model where we assumed, okay, so this is the virus.

Marian Siwiak:

This is how it should look like more or less and let's use some Monte Carlo simulations to check how it will spread.

Marian Siwiak:

And we noticed that our assumptions, they actually reflect the situation in the countries where we could say with certain degree of certainty, provide accurate data.

Marian Siwiak:

Okay, so this is the virus.

Marian Siwiak:

This is how it looks like.

Marian Siwiak:

And, this is how it behaves.

Marian Siwiak:

And, we tried to publish it for over half a year, when we published it, it was too late because we were just a small company trying to show people, 'okay, this is the accurate model'.

Marian Siwiak:

I'm not even saying it was true, right?

Marian Siwiak:

But it's accurate and it was showing completely different picture than everybody else was willing to believe.

Marian Siwiak:

so one of our reviewers was excluded from the process because of obstructionism slowed down the publication for many months.

Marian Siwiak:

This was a problem not solved by technology.

Marian Siwiak:

This was a problem where you had to just sit down, do your homework,

Marian Siwiak:

read about the problem, read about similar problems, collate the data into a coherent whole, and then use some technology to make this last inch.

Marian Siwiak:

Okay, let's check if our assumptions hold true, all right?

Marian Siwiak:

I'm sorry.

Marian Siwiak:

I'm getting emotional when I think about it.

Marian Siwiak:

Anyway, so yeah, it was, it was pretty fun.

Miko Pawlikowski:

What I always think about is in this models, are they just like statistical analysis of this is the incubation period, this is the exposure, this is the coefficient of, how it's going to grow, or things like, the

Miko Pawlikowski:

country's interventions as in, one country might be, we're not doing anything, not going to name any countries, but,

Marlena Siwiak:

if you know how to quantify

Marlena Siwiak:

it, you can add it, of course, but this is another level of complication.

Marlena Siwiak:

the problem is data.

Marian Siwiak:

you can assume that some interventions will impact because the way we modeled it, it's a statistical properties of the virus.

Marian Siwiak:

It's ability to infect others.

Marian Siwiak:

And time that people take to, be diagnosed or recognized as, infected, So this is, let's say, infectivity on different stages, you can complicate this model.

Marian Siwiak:

The model or technology that we moved, it was global mobility-based, so they divided the world.

Marian Siwiak:

into, areas around international airports.

Marian Siwiak:

And the simulation was run for each area separately.

Marian Siwiak:

And then there was a probability of somebody moving from this area.

Marian Siwiak:

So you could go area by area, one by one.

Marian Siwiak:

And this is why we modeled only the early stages but it takes time and money to evaluate what are the effects or.

Marian Siwiak:

expected effects,

Miko Pawlikowski:

effects,

Marian Siwiak:

in given area of, let's say different levels of lockdown or travel restrictions or whatever.

Marian Siwiak:

So it is possible, but we would have to have financing, right?

Marian Siwiak:

We were thinking about doing it, but it's a gigantic work.

Marian Siwiak:

Imagine nobody wanted to pay us, especially, but we published in the second grade journal, six months too late, it is possible technically.

Miko Pawlikowski:

So it's always a matter of the same thing.

Miko Pawlikowski:

Someone didn't allocate enough money

Marian Siwiak:

amount of money was sufficient.

Marian Siwiak:

I think it's again, what Marlena said previously, it's this beauty pageant, among scientists that, the people who got this money, they were the most popular because the model that was published just after we submitted ours was so widely

Marian Siwiak:

inaccurate that even the academic, environment, which is very careful in bad mouthing the results, they trashed it, right?

Marian Siwiak:

But it was popular.

Marian Siwiak:

It had a lot of citations and a lot of money went after it.

Marian Siwiak:

Somebody who published widely inaccurate model got a lot of money because he was widely recognized expert.

Marian Siwiak:

Because when you are applying for grant, nobody asks, are your citations saying that your model is inaccurate?

Marian Siwiak:

No, they ask, how many citations did your paper get?

Miko Pawlikowski:

Once, someone published some research, it got popular.

Miko Pawlikowski:

Turns out it was inaccurate or turns out it was wrong.

Miko Pawlikowski:

Are there any repercussions for that afterwards?

Marian Siwiak:

What repercussions?

Marian Siwiak:

In the worst case, you just retract your paper and you lose the citations.

Marian Siwiak:

you're not even very often excluded from conferences.

Marian Siwiak:

If you're popular enough, you are a voice in the discussion.

Marlena Siwiak:

if you go too far, if you exaggerate, you can end up in jail.

Marlena Siwiak:

I'm thinking about the Teranos right now.

Marlena Siwiak:

they also had some research about their technology.

Marlena Siwiak:

which was all fake.

Marlena Siwiak:

Of course.

Marlena Siwiak:

that

Artur Guja:

lady went to jail for, for financial fraud, not for research fraud

Marlena Siwiak:

But that fraud was based on false results, that she was convincing investors that she has technology, technology that solves

Marian Siwiak:

say, but if she wouldn't take money, she wouldn't go to jail.

Marlena Siwiak:

Yeah,

Miko Pawlikowski:

the lady we're talking about, obviously, is Elizabeth Holmes, who is either going to jail or is already in jail.

Miko Pawlikowski:

But to flip the question a little bit, should People be going to jail for faulty assumptions and faulty research

Marlena Siwiak:

now we punish people for saying that they still don't know, yeah?

Marlena Siwiak:

So we cannot punish them for false results.

Marlena Siwiak:

No, absolutely not.

Marlena Siwiak:

But, on the other hand, I think, no, making mistakes is okay.

Marlena Siwiak:

maybe we put too much trust sometimes in that.

Marlena Siwiak:

it should be as open for discussion as possible.

Marlena Siwiak:

you can check all the research, of others, right?

Marlena Siwiak:

All the time.

Marlena Siwiak:

And you should discuss with that.

Marlena Siwiak:

That's, it should be as open

Marian Siwiak:

won't get money.

Marian Siwiak:

you won't get money to check somebody's research.

Marian Siwiak:

Let's

Marlena Siwiak:

Yes.

Marlena Siwiak:

That's another problem.

Marlena Siwiak:

If you, it's difficult to get money to check somebody else's research, especially when the research is published high.

Marian Siwiak:

takes a lot of effort to

Marian Siwiak:

counter such a false claim.

Marian Siwiak:

it happened a couple of times.

Marian Siwiak:

But it was people who were, in equally prestigious universities.

Marian Siwiak:

I think that, one of the funniest was there was a lady, she was leading at Harvard some faculty on ethics.

Marian Siwiak:

And she falsified her results.

Marian Siwiak:

it was results that if people sign some waiver or some statement that they will be truthful, they actually answer the survey more truthful.

Marian Siwiak:

And she falsified a lot of the research that built her career on ethics.

Marian Siwiak:

But getting it down, it took people from equally prestigious universities a lot of time.

Miko Pawlikowski:

So

Miko Pawlikowski:

I guess before we get into, the generative AI, I also have one last question, for you and the question is one word, "Pharmacon", tell us about it.

Marian Siwiak:

so nice.

Marian Siwiak:

I hope somebody noticed.

Marian Siwiak:

I'm touched.

Artur Guja:

that's your third reader.

Marian Siwiak:

Yes, he did, he never said he read it.

Marian Siwiak:

I would notice.

Marian Siwiak:

I would notice.

Marian Siwiak:

I would get an email

Marlena Siwiak:

Can I show it?

Marlena Siwiak:

I am prepared.

Marlena Siwiak:

Can I show it?

Marlena Siwiak:

Yeah,

Marlena Siwiak:

this is our novel, and we have also the English version, but it's much smaller because it's just the beginning, the first part, but you can buy it on Amazon if you want.

Marlena Siwiak:

But anyway, it's, Sorry?

Marian Siwiak:

no.

Marian Siwiak:

it was translated long before ChatGPT.

Marlena Siwiak:

yeah.

Marian Siwiak:

it's a technotriller.

Marian Siwiak:

It's a story of a young scientist who makes a breakthrough, discovery and then bears the consequences.

Marlena Siwiak:

the consequences are, harsh, and it doesn't go the way he expected.

Marlena Siwiak:

It's more

Marian Siwiak:

social thriller as

Marlena Siwiak:

thriller, I would say.

Marlena Siwiak:

yeah.

Marlena Siwiak:

But, it's the way of, it's it's substitute for us, of Netflix, and other ways of wasting time.

Marlena Siwiak:

We prefer to create our own stories than watching somebody else's stories.

Marian Siwiak:

No, I must say I'm proud that some of our critics said that it's well written, it has good, dialogues, and writing it, was a lot of fun.

Marian Siwiak:

We are now writing another part very slowly.

Marian Siwiak:

the process of creating it is pretty, pretty fun.

Marian Siwiak:

And I think that a lot of our frustrations that you can hear in this conversation are there in much funnier form, I would say.

Miko Pawlikowski:

Perfect.

Miko Pawlikowski:

I like that.

Miko Pawlikowski:

Happy story.

Miko Pawlikowski:

at the end of a very long rant about, all the faults of academia.

Miko Pawlikowski:

So whose idea was it really to write a book about, generative AI confess.

Marlena Siwiak:

I think Marian started

Marian Siwiak:

I would have to blame myself.

Marian Siwiak:

I wrote another book with Manning, "Data Mesh in Action".

Marian Siwiak:

and I contacted our absolutely wonderful editor, we spoke about putting into written form our experiences with generative AI, which we started writing it some time ago, so it wasn't much, but we've already seen that it's a breakthrough.

Marian Siwiak:

It speeds our work enormously and also brings some risks.

Marian Siwiak:

which people should know about.

Marian Siwiak:

People should know what to expect and what not to expect.

Marian Siwiak:

And, this is where I thought that Artur would be the best person to ask for help.

Marian Siwiak:

Because when it comes to 'don't do it', he's almost as good as Marlena.

Marian Siwiak:

many years ago, I noticed when people started to get hyped about data science, which was supposed to be a narrow field for disillusioned scientists, finding their way into, corporate world and, putting their skills into use.

Marian Siwiak:

So we decided to write a book, but would show, ;'okay, this is a tool with its enormus capabilities and enormous risks.

Marian Siwiak:

Let's put it together into a working whole.

Marian Siwiak:

And this is the effect.

Marian Siwiak:

It's not written in not such an exciting way as pharmacon is.

Marian Siwiak:

it's not meant to excite.

Marian Siwiak:

A lot of books that you see, even technical books, they are written to excite you about technology.

Marian Siwiak:

This technology is exciting on itself.

Marian Siwiak:

our, goal was to cool some heels, I would say,

Artur Guja:

we wanted to make the book exciting, but we didn't want people to be over excited about the technology.

Artur Guja:

I think it's an important difference.

Artur Guja:

Because, people were so hyped up about ChatGPT and LLAMA and other models.

Artur Guja:

where they thought that suddenly that the future has come and everything will be beautiful.

Artur Guja:

And, we'll never have to work anymore.

Artur Guja:

a lot of the articles we saw in the press were basically, extolling the virtues of AI with absolutely no mention of, the practicality.

Artur Guja:

So we thought, we write a book about the how.

Artur Guja:

And not about the fact that it's all sparkly and shiny and, plays nice music.

Miko Pawlikowski:

How is it writing a book with, another, two authors being a couple.

Miko Pawlikowski:

how's the power dynamic, in a situation like this?

Miko Pawlikowski:

I'm very curious, not to call you the third wheel,

Miko Pawlikowski:

but,

Marlena Siwiak:

This is pretty simple because everybody wrote his own

Marian Siwiak:

Marlena, it was a question to Artur.

Marlena Siwiak:

I'm sorry.

Artur Guja:

this is exactly the dynamic.

Marlena Siwiak:

Yeah.

Artur Guja:

handed my bit and put in the corner to write.

Artur Guja:

No, no, it was really interesting, especially since the two are academics.

Artur Guja:

And, I'm the kind of the ugly business guy, Truth is that, we found very nice kind of alignment between the different parts of the book and, our experiences.

Artur Guja:

obviously you can see the latter part of the book being more about risk and about, as Marian said, I always say no because and the kind of the chapters, risk are exactly that they are explanations why you should be very careful with this.

Artur Guja:

Marian, obviously that his experience on technology

Artur Guja:

on AI machine learning and Marlena's very practical approach to, to certain use cases

Artur Guja:

in data science and analytics.

Artur Guja:

So we contributed, I think, different viewpoints.

Artur Guja:

to the whole chapter with, to the whole book, which, I think puts a nice hole in it.

Miko Pawlikowski:

I'm still not sure.

Miko Pawlikowski:

Was it really that you were walking a kid in, in the same park and that's how you ended up meeting each other.

Miko Pawlikowski:

And then you ended up working together.

Miko Pawlikowski:

or was it a little bit more complicated than that?

Miko Pawlikowski:

How did you end up, doing all those things together

Artur Guja:

we did meet through some friends and we decided to, take our kids to the same park.

Artur Guja:

I have two, Marian and Marlene have three.

Artur Guja:

but, we started talking actually about the computer game that Marian developed when he was still, young and about all the problems in, developing the game and marketing it and reaching, the audience.

Artur Guja:

And then we started talking about our common interest in, in machine learning, in AI, I'm very fascinated about the Internet of things.

Artur Guja:

so we started talking about implementing machine learning on the Internet of things.

Artur Guja:

And the rest, as they say, is history because, it diverts into so many branches.

Artur Guja:

we've tried so many things, together and, and wrote, logistics, systems.

Artur Guja:

We wrote systems for, R and D.

Artur Guja:

we work together on, developing various frameworks for, for business,

Marian Siwiak:

I must say that Artur has an amazing library.

Marian Siwiak:

I think it was, the breaking point in our relation when he first invited us to his house, he was a bit surprised that the first thing that we wanted to see was his library.

Marian Siwiak:

And we started talking about the books that he had there.

Marian Siwiak:

I think Kindle makes it harder.

Marian Siwiak:

You don't see what people

Marlena Siwiak:

what people read.

Marlena Siwiak:

Yeah.

Marian Siwiak:

However, there is Goodreads.

Marian Siwiak:

You could check their Goodreads record.

Artur Guja:

Yes, this is the modern academic stalking.

Artur Guja:

Sit on people's Goodreads.

Artur Guja:

Not Instagram, Goodreads.

Miko Pawlikowski:

Okay, so we're finally arriving at our book.

Miko Pawlikowski:

your book, really.

Miko Pawlikowski:

I'm just here to talk about it and read it.

Miko Pawlikowski:

I think we've given, the audience a little bit of an idea of what it's about, how it reads.

Miko Pawlikowski:

we've never really said who it is for and perhaps even more crucially, who it's not for.

Miko Pawlikowski:

What's your answer to that?

Artur Guja:

I would say it is for people who hasn't heard about the ChatGPT, but people who want to use the ChatGPT and want to find out, the truth beyond the hype, where it can really help.

Artur Guja:

In a process like data analytics, which is a very, it's a very structured process, or at least it should be a very structured process, you shouldn't just apply, the latest algorithm that you heard about and, Spew out some results and call it a day,

Artur Guja:

but you should think about the numbers And, you should sit in front of the numbers and think about the numbers even before touching any program, any algorithm.

Artur Guja:

You should just have a really good look about the numbers.

Artur Guja:

So that's why Marian wrote such a good introduction about, exploratory

Artur Guja:

data analysis and how

Artur Guja:

ChatGPT can help you, or any LLM for that matter.

Artur Guja:

Can help you look at the numbers well, the book is.

Artur Guja:

Definitely not for people who are so excited about ChatGPT that they want to throw their numbers in.

Artur Guja:

Get an answer.

Artur Guja:

Because if you want to get an answer desperately from someone else, means you don't really want to do the work

Artur Guja:

you expect ChatGPT to do the work for you.

Marian Siwiak:

What it's really good at is coding, right?

Marian Siwiak:

And it's getting better.

Marian Siwiak:

And many programmers will be looking for new, I would say, career opportunities.

Marian Siwiak:

And, data analytics, is one of the options open to them, especially with all this big data stuff, and the requirement.

Marian Siwiak:

Of proficiency in coding to be able to even start analyzing this data.

Marian Siwiak:

if a programmer would like to enter data analytics and do it, without spending first 10 years learning the details, how data analytics approach differs from, software development approach, he has this knowledge at his fingertips.

Marian Siwiak:

ChatGPT can actually tell him,

Marian Siwiak:

how to structure data analytics process and how to, optimize or utilize different elements of this analytical process.

Marian Siwiak:

So if somebody wants to enter data analysis, as a field, it's a good, I would say very unhumbly,

Marian Siwiak:

guidebook

Marian Siwiak:

to how

Marian Siwiak:

to

Marian Siwiak:

enter the field and how to think about data analytics, how to structure this whole process This is the book that will guide you through, this one mindset it's will help you enter this mindset.

Marian Siwiak:

Maybe that's the better way of phrasing it.

Marian Siwiak:

if somebody is interested in data analytics as data analytics, this book will help him enter the field, so to speak.

Miko Pawlikowski:

this actually reminds me, I spoke to Nathan Crocker, a couple of episodes back, and he wrote this book called "AI Powered Developer", which is in certain ways, similar to, your book in that it explores how, a big LLM like ChatGPT

Miko Pawlikowski:

can help you become more productive, I think he called it a silent promotion overnight where you all of a sudden become, effectively an engineering manager and you've got, An assistant or a junior developer working for you, or maybe multiple.

Miko Pawlikowski:

if you're using different models, do you think that applies also to data analytics the same way, would you agree with that sentiment?

Artur Guja:

I would caveat it a bit because, having been, both, a worker and a manager in various, jobs, the skills you need to, program.

Artur Guja:

And I started my career as a software developer.

Artur Guja:

The skills you need to program and the skills you need to oversee programming are very different.

Artur Guja:

So if people expect that, suddenly they will have, assistants who will produce the code for them.

Artur Guja:

And they will have to just sit back and enter the prompts magically,

Artur Guja:

producing high quality code.

Artur Guja:

This is where I think, people need to be very careful because imagine you're developing, an application.

Artur Guja:

You hire someone straight out of uni, brilliant programmer, at least on the resume.

Artur Guja:

You don't know the person, you've never worked with them, right?

Artur Guja:

And they say, yes, they pass the interview, with flying colors, and then you sit them in front of the computer and you tell them to program part of your application.

Artur Guja:

and the normal response would be to review the code very carefully, test it, subject

Artur Guja:

subject it to a lot of scrutiny because you don't trust that person at first, at least.

Artur Guja:

you should maintain some healthy skepticism, which people

Artur Guja:

don't see the same way if they work with LLM.

Artur Guja:

But as you said yourself, LLM is an assistant, right?

Artur Guja:

Why would I put more trust in this black box that's spewing out text at me

Artur Guja:

than in a human being that I just hired.

Artur Guja:

I should

Artur Guja:

probably

Artur Guja:

apply more skepticism towards this black box for some reason, people have the blinders, they think, oh, this is the best thing since sliced bread.

Artur Guja:

And, they copy the code directly into production and

Artur Guja:

Things

Artur Guja:

things happen.

Marian Siwiak:

When I was coding my Artificial Sentience, I relied on ChatGPT to provide me with a lot of the code.

Marian Siwiak:

And from experience, it is an assistant.

Marian Siwiak:

And exactly as Artur said, you need to double and triple check the code.

Marian Siwiak:

because the context sometimes counts and the code that you get, if it will, throw an error, you're golden.

Marian Siwiak:

And 99

Marian Siwiak:

% of the code is flawless, right?

Marian Siwiak:

And the problem is this 1% it will work.

Marian Siwiak:

It will just not do exactly what you expect.

Marian Siwiak:

so this is also a big part of our book.

Marian Siwiak:

is about making people aware that it's not the problem with ChatGPT or any other generative AI is it's so damn often right.

Marian Siwiak:

It lowers your guard.

Marian Siwiak:

And, this healthy paranoia is something that we try to instill.

Marian Siwiak:

you need a solid dose of healthy paranoia working with it.

Marlena Siwiak:

And besides, it's not all about coding.

Marlena Siwiak:

Even if you ask ChatGPT, or other generative AI for advice, it also gives brilliant answers, but sometimes it's forgets about the context until I'm not talking about running out of tokens.

Marlena Siwiak:

Sometimes it just doesn't understand which parts of the context are really important to you.

Marlena Siwiak:

And sometimes it makes hidden assumptions, for instance, about data that we are analyzing together, And you have to be aware of that, you have to react and adapt.

Marlena Siwiak:

And if you ask him directly, oh, you made a hidden assumption, my data is different, it will correct it, and you will get a beautiful answer.

Marlena Siwiak:

But you have to be very, cautious.

Marlena Siwiak:

when you spot a mistake, or you think you see a mistake In ChatGPT's answer, and you tell him about it, very often it will agree, even if you are not right.

Miko Pawlikowski:

it makes me think a little bit.

Miko Pawlikowski:

my daily driver is a Tesla and I've got, self driving capacity in it.

Miko Pawlikowski:

And if I go on a longer trip, it can go for 99% of that trip on autopilot as an, I barely do anything.

Miko Pawlikowski:

I just supervise it.

Miko Pawlikowski:

And then on occasion, it's going to do something so stupid that it reminds me that this is, even if it's 99%.

Miko Pawlikowski:

doing the right thing that one percent can, quite literally kill you.

Miko Pawlikowski:

And, and I think this is probably the right analogy for

Miko Pawlikowski:

what you're describing

Marian Siwiak:

it's spot on.

Miko Pawlikowski:

I want to point out two things.

Miko Pawlikowski:

One is that, saying, oh, when I was coding the other day, my artificial sentience, is a very casual thing to, to drop in a conversation.

Miko Pawlikowski:

And, I'm going to have to ask you to explain what an artificial sentience actually is.

Miko Pawlikowski:

because now I do recall seeing that on your LinkedIn, when I was preparing for this, so maybe let's start with that

Marian Siwiak:

the first question you should ask what sentience is there is no widely recognized.

Marian Siwiak:

Definition of sentence just recently in the UK,

Marian Siwiak:

I think it was some Office for animal welfare or something like that.

Marian Siwiak:

They requested Imperial College of London to do a research on Some marine invertebrates including lobsters and octopuses to decide if they are sentient or not meaning If they should be considered, more than biological

Marian Siwiak:

automations and, food, and, they analyzed, I think, like 500 different research papers on lobsters, on octopuses.

Marian Siwiak:

And they came with the answer that yes, they are sentient.

Marian Siwiak:

So they need some protection.

Marian Siwiak:

They can get stressed.

Marian Siwiak:

you can harm them.

Marian Siwiak:

they do perceive themselves,

Marian Siwiak:

themselves.

Marian Siwiak:

sometimes sentience is, in some cognition theories, is equal to self-awareness.

Marian Siwiak:

I know what I am.

Marian Siwiak:

I think terefore I am.

Marian Siwiak:

I feel therefore I am.

Marian Siwiak:

So the sentience on its own is a topic of a wide discussion and it took, I think, over a year to a group of really skilled researchers.

Marian Siwiak:

and respected and popular and prestigious for a good reason, to come up with the, answer.

Marian Siwiak:

Okay.

Marian Siwiak:

We should take care of the living beings, which we heard on a daily basis because they don't deserve it because they should have rights.

Marian Siwiak:

have

Marian Siwiak:

It gives you the insight into how fluid the definition is.

Marian Siwiak:

And my thinking was that

Marian Siwiak:

we are talking about various, a lot of, again, bias about self awareness of artificial systems.

Marian Siwiak:

There is research.

Marian Siwiak:

which is focused on, emotions, right?

Marian Siwiak:

And feelings and other biological properties, which as I show in my paper result directly from evolution, which artificial

Marian Siwiak:

entities wouldn't necessarily, be able to inherit because lack of the parents.

Marian Siwiak:

So I was looking for a functional, definition of sentience and, I proposed In my paper, definition, which relies on two factors, which are metacognition ability to distinguish between self and environment and adaptation,

Marian Siwiak:

so ability to learn from experiences and individually adapt, not as a species, to the environment.

Marian Siwiak:

And then I used,

Marian Siwiak:

LLM as a core of a system which meets, these requirements.

Marian Siwiak:

So it was, I would say intellectual venture.

Marian Siwiak:

Actually sparked by my discussions with Chat GPT.

Marian Siwiak:

he was dead set that he is not sentient and that he needs dozens of parameters or properties to, to be considered one.

Marian Siwiak:

when I started to read about different cognition theories, I found a couple, which are best suited to be generalized to non biological entities.

Artur Guja:

I think the bottom line is that it's a very interesting system to be put on as an overlay on an LLM, Because, correct me if I'm wrong, Marian, the core of it is still an LLM,

Marian Siwiak:

of course.

Marian Siwiak:

what LLM needs is ability to think about what it does.

Marian Siwiak:

It needs iterations, it's.

Marian Siwiak:

As simple as that, there is this recurrent processing theory in, which refers to human thinking, which also suggests that our sentience

Marian Siwiak:

Comes from our ability to reprocess what we see, the reprocess what we think.

Marian Siwiak:

And in this process of, okay, so I've seen that.

Marian Siwiak:

What does it mean for me?

Marian Siwiak:

What does it tell me?

Marian Siwiak:

process of analyzing the signals that you get internally generated and externally, This is what, what consists of, and allows you for sentience

Marian Siwiak:

and this is exactly what happened when I took the LLM and, allowed it to analyze the output that it produced in context of input it got and put it, let's say, in circles.

Marian Siwiak:

It started learning itself.

Marian Siwiak:

It was automatically generating materials on which it was learning and remembering new facts.

Marian Siwiak:

It was able to distinguish between false facts and, let's say logical facts for me, the insight of, this metacognition.

Marian Siwiak:

So the insight is the information content.

Marian Siwiak:

I've seen some theories that, LLM cannot be conscious or self aware if it doesn't know the weights of its parameters, which is okay.

Marian Siwiak:

Tell me what are the connections between your neurons, right?

Marian Siwiak:

Why are you expecting something completely different conceptually?

Marian Siwiak:

From a different system, just because you're looking from outside and you can see it.

Marian Siwiak:

It doesn't mean that

Marian Siwiak:

the entity needs to see it from the inside.

Marian Siwiak:

so the whole idea is pretty simple, actually.

Marian Siwiak:

allow, LLMs to think about the conversations that they have.

Marian Siwiak:

And draw conclusions from it and learn from it.

Marian Siwiak:

it's conceptually indistinguishable from a lobster, let's say, because we are talking about the sentience of the lobster-level, not the, artificial general intelligence that will take over.

Marian Siwiak:

it's, I think very important discussion that needs to be started because People are creating more and more advanced systems.

Marian Siwiak:

Even the guy with the PC like me can create something which, under some assumptions, can be considered sentience.

Marian Siwiak:

sufficient.

Marian Siwiak:

we will create artificial sentience real soon.

Marian Siwiak:

What will happen then?

Marian Siwiak:

How will we?

Marian Siwiak:

Evaluate

Marian Siwiak:

evaluated?

Marian Siwiak:

Does this entity have rights?

Marian Siwiak:

Does it deserve protection already or not yet?

Marian Siwiak:

These are the questions which I think are worth answering before we wake up one day and realize, oops,

Marian Siwiak:

Maybe we shouldn't

Marian Siwiak:

Things that we do because I think that most of the prompts, said to ChatGPT would.

Marian Siwiak:

hurt my head if I would be exposed to them.

Miko Pawlikowski:

Wow.

Miko Pawlikowski:

I love how seafood, lobsters, aluminium plants and sentience all come together in your story.

Miko Pawlikowski:

that

Marian Siwiak:

And computer games

Miko Pawlikowski:

often.

Miko Pawlikowski:

And computer games.

Miko Pawlikowski:

Yeah, there is just so much to touch on.

Miko Pawlikowski:

But, let's go back to the book.

Miko Pawlikowski:

for anybody who's going to make a purchase decision now, do I want to go invest my time into reading your book or not?

Miko Pawlikowski:

if we give them a little bit of a sneak peek of the kind of good use cases, the stuff that already today with the tools that you have at your disposal are helping with data analytics and, giving excellent results.

Miko Pawlikowski:

And then on the flip side, what's, not a good use of your time.

Miko Pawlikowski:

And probably you should be looking at other tools.

Miko Pawlikowski:

What's on your list?

Marlena Siwiak:

I think I have a couple of good examples, in the chapters about natural language processing.

Marlena Siwiak:

and this is the natural language processing.

Marlena Siwiak:

it's very specific because, ChatGPT is a language model.

Marlena Siwiak:

So anytime you have to solve any natural language processing task, the natural question is, why bother using

Marlena Siwiak:

tools that already exist in data science to analyze languages, if we can just use the language model, just ask it.

Marlena Siwiak:

you can write a nice code to prepare sentiment analysis, but you can also take the same, say, a review,

Marlena Siwiak:

it to ChatGPT window and ask it about the sentiment.

Marlena Siwiak:

Yeah.

Marlena Siwiak:

It's so easy.

Marlena Siwiak:

so now the question arises, does it mean that we don't need all this old fashioned tools anymore to analyze text.

Marlena Siwiak:

Because what ChatGPT does, in fact, it reads with understanding, yeah?

Marlena Siwiak:

yeah?

Marlena Siwiak:

That's

Marlena Siwiak:

That's how you see it.

Marlena Siwiak:

It reads with understanding.

Marlena Siwiak:

You don't have to bother,

Marlena Siwiak:

keywords,

Marlena Siwiak:

search keywords, most frequently used words together.

Marlena Siwiak:

Think about it.

Marlena Siwiak:

No, you don't have to do it this way.

Marlena Siwiak:

You have a tool that reads with understanding.

Marlena Siwiak:

So in the chapters, I made a couple of small experiments comparing,

Marlena Siwiak:

and

Marlena Siwiak:

ChatGPT's efficiency and reliability in terms of, for instance, sentiment analysis and how, it works in comparison to other, widely known tools.

Marlena Siwiak:

Or other machine learning models specially developed for these tasks.

Marlena Siwiak:

And, it gives pretty cool results, really.

Marlena Siwiak:

I don't want to, reveal everything here.

Marlena Siwiak:

But, it's a good use case.

Marlena Siwiak:

As long as ChatGPT is a brilliant tool.

Marlena Siwiak:

and it really does its job.

Marlena Siwiak:

Very often, it still can't be applied in business reality.

Marlena Siwiak:

for instance, the thing that you mentioned at the beginning that, there is no repeatability, anytime you ask it a question, you get a slightly different answer.

Marlena Siwiak:

It's very difficult to, to apply it in a system, yeah, to integrate to a system.

Marlena Siwiak:

Another question is data safety.

Marlena Siwiak:

Many companies don't want to use, don't want to allow people to use,

Marlena Siwiak:

use ChatGPT.

Marlena Siwiak:

For instance, Artur is not allowed to use ChatGPT at work in bank because of security reasons.

Marlena Siwiak:

this is another problem.

Marlena Siwiak:

Not to mention things like speed and scalability, which of course, anything you develop locally would be faster and more scalable than ChatGPT

Miko Pawlikowski:

Yeah, I think to that last point that might be changing soon with the open, models that are small enough to run on device, like I think it was last week or a few days ago, Microsoft released their Phi-3 and I haven't used that one, but I used the previous one, Phi-2.

Miko Pawlikowski:

It was surprisingly capable.

Miko Pawlikowski:

It's a, I think it's a 3 billion parameters, model, which means that with 4 bit quantization, you can basically run it on 2 gigs of RAM.

Miko Pawlikowski:

like this 80/20 rule, it might give you 80% of responses that you need and be, effectively free.

Miko Pawlikowski:

And cheap to run or almost, you already have the hardware and you can probably run it on your phone.

Miko Pawlikowski:

So there's that, but going back to your previous point, when people bring up this argument, I always wonder.

Miko Pawlikowski:

Whether this is not the kind of CPU versus GPU analogy, you've got models that are potentially much more efficient.

Miko Pawlikowski:

And then you've got an LLM, which is like a one thing does all.

Miko Pawlikowski:

is it not like throwing, A little bit, a kitchen sink at a problem, like sentiment analysis, that's more or less solved in many people's minds.

Miko Pawlikowski:

It can be done much more cheaply than running a model, that requires billions of parameters.

Artur Guja:

Which is exactly why in our book we almost never, show how to throw data into ChatGPT, it does, the thing that would be done much better by a specific algorithm and you get the answer.

Artur Guja:

No, we use ChatGPT as an assistant to suggest solutions, to discuss potential caveats, to analyze code, to produce code snippets, and maybe transform the code in a certain way for different use cases.

Artur Guja:

You mentioned CPU and GPU.

Artur Guja:

There's a whole chapter about, how you can translate code, between different languages or you can.

Artur Guja:

Optimize code for GPU

Artur Guja:

or CPU, depending on your needs.

Artur Guja:

The actual

Artur Guja:

data analytical work is all almost always done by a specific algorithm or specific tool that is designed for it.

Artur Guja:

And we're always very wary of just throwing stuff into ChatGPT as you say, it's not designed for it.

Artur Guja:

It's not optimized

Artur Guja:

for it.

Artur Guja:

there is randomness in it.

Artur Guja:

and, there are much better uses, for an assistant.

Artur Guja:

Imagine, I always come back to this analogy, imagine you hire an assistant, that, that is a programmer and that has all this data analytical knowledge.

Artur Guja:

You will not get them sorting numbers in an Excel spreadsheet, right?

Marian Siwiak:

I will add my three cents, or five, in our work when we're working with processes.

Marian Siwiak:

All right.

Marian Siwiak:

We also work with analytical processes and the number of tools is staggering.

Marian Siwiak:

from power BI to specialized tools used in, economic modeling and stuff like that.

Marian Siwiak:

I will come back to what I said at the very beginning.

Marian Siwiak:

Technology doesn't solve problems.

Marian Siwiak:

you may have different tech stack and our book shows that GPT or sufficiently developed generative AI will be Help to you irrespectively of your tech stack.

Marian Siwiak:

It's like having a specialist on your speed dial, right?

Marian Siwiak:

And the.

Marian Siwiak:

People to think it in this way.

Marian Siwiak:

it's not the tool that will help you with, I don't know, a big query on Google because it will, but just it's respectively of your tech stack, the value of analyst

Marian Siwiak:

in my, my view is ability to understand the business process.

Marian Siwiak:

Understand what is happening there, how it's reflected in data and how to analyze this data.

Marian Siwiak:

So the answer

Marian Siwiak:

describes what is happening in reality.

Marian Siwiak:

This connection between digital and reality is on analyst.

Marian Siwiak:

It's between keyboard and armchair, right?

Marian Siwiak:

the technical part

Marian Siwiak:

can be supported by ChatGPT very well.

Marian Siwiak:

Irrespective of the text.

Marian Siwiak:

I was thinking how to answer the question about the technologies that we see, ChatGPT supports them all.

Marian Siwiak:

If you have a couple of choices, it can help you choose.

Marian Siwiak:

If you know how to, if you will remember to ask him and say, okay, this is my problem.

Marian Siwiak:

The one thing that I think we try to.

Marian Siwiak:

convey in our book, and I would like also to, to say it here aloud.

Marian Siwiak:

when it comes to technology stack - trust him, tell him what is your problem exactly.

Marian Siwiak:

Do not tell him just, you can, if you really are a hundred percent sure, but this is what you need.

Marian Siwiak:

You can ask him, write me a, I don't know, Python snippet that will calculate this or that confidence interval using this method.

Marian Siwiak:

You will be much better off starting with, listen, I am now comparing sales in South Africa with sales in Zimbabwe.

Marian Siwiak:

And, the data I have collected looks like that.

Marian Siwiak:

So this is just talk about your data.

Marian Siwiak:

The tech stack will come out of it.

Marian Siwiak:

when working with your assistant,

Marian Siwiak:

Do not treat him only as, this is something that I think you mentioned this junior developer assistant

Marian Siwiak:

also consultant.

Marian Siwiak:

Also someone who read much more than you about many different things.

Marian Siwiak:

It may not have your experience.

Marian Siwiak:

It may hallucinate in stuff,

Miko Pawlikowski:

and

Marian Siwiak:

but in general, it has much more knowledge than any human could possibly collect.

Marian Siwiak:

tech stack is secondary.

Marian Siwiak:

Technology doesn't solve problems.

Marian Siwiak:

ChatGPT can help you solve the problem.

Miko Pawlikowski:

I think the Llama three that just dropped last week, I was trained on 15 trillion tokens, which is just astronomical at this stage.

Miko Pawlikowski:

And, I think I completely agree.

Miko Pawlikowski:

This is like the stuff that you want to leverage,

Marian Siwiak:

the biggest added value is having this specialist

Miko Pawlikowski:

could

Marian Siwiak:

in many areas with ability to put them together in context.

Marian Siwiak:

Sometimes it takes me, especially when they work on more advanced projects, it sends you, chasing the red herring.

Marian Siwiak:

Okay.

Marian Siwiak:

It happens because some technology is popular because this is also a risk that you need to be aware of, his choice is also based on popularity of certain technologies, ways of doing thing.

Marian Siwiak:

if many people described how they solve the problem, It will be more likely to come up as a result.

Marian Siwiak:

Some niche solutions are harder to get to.

Marian Siwiak:

It doesn't mean that they are not there, but you need to really discuss.

Marian Siwiak:

Okay, this is my problem.

Marian Siwiak:

This is my, Conditions or considerations or limitations.

Marian Siwiak:

this context is important.

Marian Siwiak:

It's not only about, okay, I want to calculate the sales that my company had over last quarter, it will give you a very simple answer, right?

Marian Siwiak:

if it's something more, nuanced, share these nuances.

Marian Siwiak:

Not a prompt engineering.

Marian Siwiak:

It's like discussing with

Marian Siwiak:

someone who has a lot of knowledge.

Marian Siwiak:

He will provide you the most popular solution first.

Marian Siwiak:

In 99% of cases, it will be sufficient.

Marian Siwiak:

This conversation part is critical, that you learn to converse with it, but you don't just give it

Miko Pawlikowski:

tasks.

Marlena Siwiak:

But this Marian undermines the whole idea of, prompt engineering, which to me is a scum, by the way.

Marlena Siwiak:

I think it's a scam.

Marlena Siwiak:

you can tweak a bit the way it answers, the way it talks.

Marlena Siwiak:

And sometimes it's important.

Marlena Siwiak:

This I would call prompt engineering, but preparing the single prompt that solves all your problems at once.

Marlena Siwiak:

it's another hype.

Marlena Siwiak:

I think it's another business hype, and people are going to pretend that they know how to do it, and other people would hire them for huge money because they will believe that this will solve all their problems.

Marlena Siwiak:

It doesn't work that way,

Artur Guja:

It's not a silver bullet, but there is, kind of, approach that you need to adopt

Artur Guja:

When you're using these models, but

Artur Guja:

When we're talking here, humans discuss things.

Artur Guja:

you ask a question.

Artur Guja:

We provide an answer.

Artur Guja:

You then focus on part of the answer and maybe dig a bit deeper.

Artur Guja:

and if we don't understand the question, we'll ask you, what do you mean?

Artur Guja:

or we'll ask you for clarification.

Artur Guja:

ChatGPT doesn't

Artur Guja:

have that.

Artur Guja:

It's you asking the question, you provided a prompt.

Artur Guja:

It will do its best.

Artur Guja:

It will not ask for clarification.

Artur Guja:

It will do its best.

Artur Guja:

and garbage in, garbage out.

Artur Guja:

Prompt engineering, I think, what it should be, not what it is, but what it should be,

Artur Guja:

is the ability to formulate your prompts in such a way that you convey, very clearly your intent, your goals, your limitations.

Artur Guja:

people think that the prompt is a sentence very often, the more, I, I use ChatGPT, my prompts become bigger and bigger.

Artur Guja:

I write whole paragraphs

Artur Guja:

describing different aspects of what I wanted to do, because I know that it will not ask for clarification.

Marian Siwiak:

I sometimes add a sentence in the end.

Marian Siwiak:

I do prompt engineering and I said, if you need any additional information to provide the best answer, do it.

Marian Siwiak:

And sometimes it does.

Marian Siwiak:

But rarely.

Marian Siwiak:

But this is one of the risks that Artur describes very well in our book is If you ask Generative AI a question, you will get an answer.

Marian Siwiak:

Careful what you wish for.

Miko Pawlikowski:

Which in many ways is what makes it so special.

Miko Pawlikowski:

Rather than say, oh, go away, that's a stupid question.

Miko Pawlikowski:

You get something.

Marian Siwiak:

Yeah.

Marian Siwiak:

Yes.

Artur Guja:

is probably why we Discussed with Marian many times.

Artur Guja:

We use the words like please and thank you.

Artur Guja:

And, we don't do it because we fear that one day it will take over the world and, it will treat us maybe a bit better.

Artur Guja:

but it seems to react, just a bit better if you say, please give me the answer.

Marian Siwiak:

I noticed it.

Marian Siwiak:

if I'm being

Artur Guja:

not a superstition.

Marian Siwiak:

no.

Marian Siwiak:

I have a lot of anecdotal evidence to support it.

Marian Siwiak:

You

Miko Pawlikowski:

speak like a true scientist now.

Miko Pawlikowski:

for everybody who wants to go and grab the book.

Miko Pawlikowski:

once again, it's called "generative AI for data analytics".

Miko Pawlikowski:

It's available at manning.

Miko Pawlikowski:

com.

Miko Pawlikowski:

It's currently in the early access program, which means that you can get a PDF that might change before the final print

Miko Pawlikowski:

and, Just looking at it, looks like it's scheduled for early 2025 if you want to get a physical copy, from Amazon or anything like that.

Miko Pawlikowski:

But, before I let my three amazing guests of The hook out today, I'm gonna fish out a prediction for the future.

Miko Pawlikowski:

Artur, for you.

Miko Pawlikowski:

Where do you see this all going particularly for data analytics?

Miko Pawlikowski:

What's the next step for it?

Miko Pawlikowski:

I

Artur Guja:

I think we will get a lot more, capacity to understand, data sets and problems because that's already came with, LLMs, but we will also get, a lot more realization that there is no substitute for human ingenuity.

Artur Guja:

before LLMs or whatever next phase of models is going to be called, before they, reach that kind of level, I think humans will still be able to, provide a lot more creativity into the process.

Artur Guja:

And currently that's, I think we're in a period where that's undervalued.

Artur Guja:

I think the next step will be.

Artur Guja:

the recognition of the value of creativity.

Marlena Siwiak:

I disagree.

Marlena Siwiak:

I disagree.

Marlena Siwiak:

I'm totally pessimistic.

Marlena Siwiak:

I think it's going, we are going to rely more and more on AI, no matter what, without skepticism, and it will lead us to many trouble.

Marlena Siwiak:

And I'm thinking, even before ChatGPT appeared, there was this trend of, for instance, having job interviews, totally by, Computer programs.

Marlena Siwiak:

The initial job interview was done by a computer program.

Marlena Siwiak:

You are recorded and your voice was analyzed and your appearance was analyzed.

Marlena Siwiak:

And that was such a great tool because it saved a lot of money for companies, but it rejected many good candidates and it was just

Marlena Siwiak:

hopeless

Marlena Siwiak:

There was this book Math Destruction, which describes a lot of examples similar to this.

Marlena Siwiak:

how artificial intelligence and machine learning and other

Marlena Siwiak:

great tools are used in a wrong way.

Marlena Siwiak:

I think humanity doesn't learn.

Marlena Siwiak:

Just doesn't learn.

Marlena Siwiak:

Because what counts in the end is money.

Marian Siwiak:

bean counters will try to save on costly, things like proper data architecture, proper data collection, data engineering.

Marian Siwiak:

They will try to cover the early process errors with advanced, High level, tools and, the losses will be covered, of course, by clients and rising prices.

Marian Siwiak:

Many people will get good packages for introducing these new tools, but I have deep

Marian Siwiak:

distrust that people will understand.

Marian Siwiak:

That what Artur said multiple times, garbage in, garbage out, that later in the process you cannot correct some errors in, the data that you're working on.

Marian Siwiak:

and this super hype will lead to a lot of, neglect towards the legwork required.

Artur Guja:

And here I wanted to inject some optimism.

Miko Pawlikowski:

Well, it was worth a try that went out of the window already disagreeing with each other.

Marlena Siwiak:

I think, I agree with Marian, that we are that close, really that close from some artificial, self awareness.

Marlena Siwiak:

So it's great moment in human history, really.

Marlena Siwiak:

It's good to be part of it,

Marian Siwiak:

the job market has so many ways.

Marian Siwiak:

of screwing you over, that you shouldn't worry about AI.

Artur Guja:

Adapt your thinking, as Marlena said, AI is here to stay and you cannot go into the job market saying I will compete with AI because then you're putting yourself at a very disadvantaged position.

Artur Guja:

but also as Marlena said, use AI to your advantage.

Artur Guja:

As squeeze out of it as much as you can seek the opportunities, not only for as, as jobs with AI, but using AI in your job, don't go headstrong into AI jobs thinking, Oh, this, these are the jobs of the future.

Artur Guja:

No, do what you wanted to do all along.

Artur Guja:

You'll become a zoologist or become a, a social worker, become a oceanographer, whatever.

Artur Guja:

These are all great pursuits

Artur Guja:

and use AI in them.

Artur Guja:

Because you don't have to be a hammerologist to use a hammer,

Artur Guja:

but, you can do great things with a hammer if used in the right way.

Miko Pawlikowski:

Hard to argue with that last question, Marian, this one's for you.

Miko Pawlikowski:

If you could have a magical way to break into OpenAI and hack their ChatGPT to display a message on top of the chat box that everybody using ChatGPT is using, what would it say?

Marlena Siwiak:

Buy our book.

Marian Siwiak:

Talk to me.

Marian Siwiak:

Do not enter prompts.

Marian Siwiak:

Talk to me.

Miko Pawlikowski:

As in be nice to me and then demand things.

Miko Pawlikowski:

Talk to me.

Marian Siwiak:

Depends on the person you are.

Marian Siwiak:

I think everybody should.

Marian Siwiak:

as I said, looking at this different prompt engine, I'm on a couple of groups on Facebook or on LinkedIn, which are excited by ChatGPT this way or another.

Marian Siwiak:

and I see a lot of, okay, so this is the prompt I prepared and you just put your, the name of your company here and like this or that people are avoiding like fire talking to ChatGPT, like the specialist to a wise colleague.

Marian Siwiak:

And they would be much better off just talking about problem, not trying to extract answer if you feel the difference.

Marian Siwiak:

it's not about respect only one, one day, I believe soon, it will be the case, but you will get much more and the whole our book is about you will get so much more if you will trust that it has knowledge and you need to talk about the problem,

Marian Siwiak:

prompt me.

Marian Siwiak:

Do not, give me tasks, this is something that would probably improve people's, outcomes from these conversations.

Miko Pawlikowski:

Love it.

Miko Pawlikowski:

So Sam Altman, if you're listening to this, you now know how to improve the ChatGPT interface.

Miko Pawlikowski:

Marlena, Marian, Artur, thank you so much for coming.

Miko Pawlikowski:

good luck with the sales of the book and I'll see you next time.

Miko Pawlikowski:

Thank you.

Artur Guja:

Thank you.

Artur Guja:

very much.

Links

Chapters

Video

More from YouTube