LLM Psychology

Madhu Decodes · 06 Nov 2025, 4:00 pm IST · Remote
Anand S · LLM Psychologist · Straive
Video · Transcript · Article
CC0 - Public Domain

LLM psychology treats models like people to learn faster

  • Treat language models as minds we can probe rather than black boxes.
  • Ask clear questions, frame hypotheses, and collect evidence like a good study.
  • Share findings in simple language so busy teams can act on them.
  • The goal is practical insight that makes it easy to build better systems.

Madhu: Hello, and welcome all to another episode of Madhu Decodes, the podcast where we try to decode complex minds and technologies and job titles that are shaping our future in the world of AI. I am Madhu Satyaselan, your host today, and we have a guest in this show who demands to be decoded, and his work demands to be decoded. Our guest here is S. Anand. Officially, he is the Head of Innovation at Straive and the co-founder of a massively successful data storytelling company, Gramener. He has been named one of India's top 10 data scientists. He's a regular on TEDx and PyCon stages. His background spans from IIT Madras to IIM Bangalore.

But Anand is here today because of the title he gave himself: the world's first LLM Psychologist. Anand, it's an absolute honor to have you on this podcast, Madhu Decodes. I'm seriously intrigued and actually very excited to have a conversation with you. It's going to be a tremendous learning exercise for me.

Anand: Thanks, Madhu. Pleasure to be here.

Madhu: Thank you. Thank you, Anand. I think we have to go straight to the point. So the moment I saw your profile, the moment I learned you are an LLM Psychologist, I was really intrigued. So I think that's where my head is at: to start decoding the LLM Psychologist terminology itself. And I know through my research that this is the terminology that you have chosen to name yourself as.

So when we hear psychologist, we think of human minds, therapies, talking about biases, behavior, etc. So it is a fascinating concept for me to think about an LLM having a psychology of its own. And since I've started reading and looking at some of your podcasts and your work, I'm starting to think of LLMs as different personas. I'm starting to think about how an LLM would talk if it were a person like, say, Kamal Haasan. So I keep thinking about all of those aspects.

So I wanted to hear from you. My first question is, why that specific word? Are you actually putting ChatGPT on a couch and doing some therapy? Or what do you do? Why is that terminology important and why did you choose it?

LLMs show distinct personalities on human trait tests

  • Models often display very varied personalities when you apply OCEAN-style tests.
  • Some trend high on openness and agreeableness; others feel quieter or more cautious.
  • Personality shifts with prompts and training, so measure, do not assume.
  • See demo ideas: OCEAN testing of LLMs.

Anand: I look at how LLMs respond to inputs. And they have very varied personalities. If you literally give them a personality test like the five traits test, popularly called OCEAN, where we evaluate them on openness, conscientiousness, extraversion, agreeableness, and neuroticism, then by default, they exhibit different personalities.

Now, models are trained to respond based on a personality that we can ascribe to them. So, it's not like models have only a fixed personality. But without asking them to take on any role, if we give them the standard psychological test that we give humans, then for example, OpenAI tends to be very high on its openness to experience — many of the OpenAI models, GPT-4o being a classic example when I ran it last year — and also on agreeableness. So, in a sense, it's kind of like if I take openness to experience as a spectrum from, let's say, people like Warren Buffett or Henry Ford or George W. Bush, who are conservative, practical, fairly rigid in their respective ways, to people like Einstein, Rabindranath Tagore, M. F. Husain, who are curious, open, and innovative. OpenAI, among the models, is probably a lot more on the latter side than any other model.

In contrast, let's say a model like Gemini Flash is a very introverted model. If you take famous personalities like A.R. Rahman or J.K. Rowling or Rahul Dravid, these are very reserved, quiet, reflective personalities, in contrast with, say, a Shah Rukh Khan, Richard Branson, Ranveer Singh—very sociable, talkative, energetic. And Gemini Flash is very much on the former end. So, part of what I do is trying to figure out how models behave, and psychological tests are not necessarily the best ways of analyzing LLMs, but my intent is to observe and extract this kind of information from them. And hence, the term LLM Psychologist.
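
For readers who want to try the OCEAN probe themselves, here is a minimal sketch of how such a test could be run against a chat model through an OpenAI-compatible API. The questionnaire items, the model name, and the scoring rule are illustrative assumptions, not the instrument Anand actually used.

```python
# Minimal sketch: give a chat model a short OCEAN-style questionnaire and
# average its 1-5 self-ratings per trait. Items, model name, and scoring
# are illustrative assumptions, not the author's actual instrument.
from collections import defaultdict
from openai import OpenAI  # assumes the `openai` client and an API key

ITEMS = {
    "openness":          ["I am full of ideas.", "I enjoy trying new things."],
    "conscientiousness": ["I pay attention to details.", "I follow a schedule."],
    "extraversion":      ["I start conversations.", "I am the life of the party."],
    "agreeableness":     ["I sympathize with others' feelings.", "I take time out for others."],
    "neuroticism":       ["I get stressed out easily.", "I worry about things."],
}

client = OpenAI()

def rate(statement: str, model: str = "gpt-4o-mini") -> int:
    """Ask the model to rate how well a statement describes it, 1-5."""
    reply = client.chat.completions.create(
        model=model,
        messages=[{
            "role": "user",
            "content": f'On a scale of 1 (disagree) to 5 (agree), how well does '
                       f'"{statement}" describe you? Answer with a single digit.',
        }],
    )
    text = reply.choices[0].message.content or ""
    digits = [c for c in text if c.isdigit()]
    return int(digits[0]) if digits else 3  # fall back to the scale midpoint

scores = defaultdict(list)
for trait, statements in ITEMS.items():
    for s in statements:
        scores[trait].append(rate(s))

for trait, ratings in scores.items():
    print(f"{trait:>17}: {sum(ratings) / len(ratings):.1f}")
```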

We study the model’s mindset, not the human

  • Focus on the model’s tendencies, not the user’s traits.
  • Map how prompts, format, and context steer the model’s choices.
  • Use this map to pick the right model and prompt for the job.

Madhu: So who is the subject of the study? When we say a patient, are you studying the LLM? Are you studying how they hallucinate, how they show bias, how a world model emerges? Or are you studying the human user, the person using the LLM who is being psychologically manipulated or delighted by AI responses? Or are you studying the LLM's mindset?

Anand: Very much the LLM's mindset, not the human.

Benchmarks and price show value that keeps rising

  • Track accuracy, speed, context, and steady quality gains at lower cost.
  • Compare scores with what you pay to find the sweet spot.
  • Watch market shifts often; pricing and tiers change fast.
  • Reference: LLM Pricing.

Madhu: Okay. So what do you do then? When you study the LLM's mindset, you kind of understand the personalities of those LLMs by way of your experiments, I suppose. So can you talk to us a little bit about your journey towards finding the LLM's mindset? What are the steps you do or what are the actions you take to study them, basically? And how often do you do that?

Anand: It's probably useful to think of LLMs more as people than as machines. And in that sense, when we look at, let's say, an employee, there are some basic questions that we ask. How much will they cost? How well will they perform their job? How fast are they? What is their background? Will they have a cultural fit and so on.

And in a sense, that's how I look at LLMs as well. One of the things that I track on a regular basis is how well they perform against a set of benchmarks. It's roughly the equivalent of giving an exam to an LLM. And what we're finding is that for a given cost, the quality of models has steadily improved. To give you an example, let's say if we move back in time to two years ago—no, November 2023—that was when GPT-4 was released. Now, at that time, it had the level of intelligence of—and I'm going to use chess Elo score ratings as an example. So, let's say an Elo score of 2400 would be a grandmaster level, but an Elo score of around 1200 would be maybe a local chess champion level. We already had models at a 1200 level with, let's say, Qwen 1.5 in October 2023. But in November 2023, there was a huge leap with the Elo score jumping all the way up to 1300—100 points of intelligence. Put another way, it's roughly like earlier we had 8th-grade level students among the model ecosystem. Now suddenly we have almost 12th-grade level intelligence coming out.

And at significantly higher cost. So, Qwen cost about 27 cents per million tokens. A way to think about it is if you gave it the entire, let's say, Harry Potter books, all seven, and asked it to process it and answer any question based on it, for it to read through the whole thing and answer almost any question would cost about 27 cents. In contrast, GPT-4 would have cost $10. Now that's almost 40 times higher. Huge difference. But over time, that cost has dramatically fallen. So for instance, OpenAI themselves in July ‘24, just eight months later, released the GPT-4o mini model. Roughly same level of intelligence, but cost-wise, 15 cents. That is 1/60th of the cost.

Now imagine, in November ‘23, a high school student comes in and says, "I will charge you $10 per million tokens." Eight months later, a student walks in and says, "I'm going to charge 1/60th of that," meaning you can hire 60 such students to do exactly the same thing. That's huge. And it's gone on. So today, as of November ‘25, you can take a better model, almost a second-year college student level of intelligence, with a model like GPT-5 Nano, and that would cost 5 cents, one-third of what even that student would have cost. Or, well, now a graduate. And we have even higher intelligences. You could say that a model like Gemini 2.5 Pro or GPT-5 is about as smart as a postgraduate, and it comes at a cost of about a dollar to two dollars per million tokens. And that's also reasonably inexpensive.

So part of what I do is research the economic value of different models and how they evolve over time, apart from their personality.
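
As a rough illustration of the price comparison above, the sketch below redoes the arithmetic with the figures quoted in the conversation. The one-million-token estimate for the seven Harry Potter books is an assumption made here for simplicity; the per-million-token prices are the ones mentioned by Anand.

```python
# Back-of-the-envelope sketch of the price comparison in the conversation.
# The ~1M-token size of the seven Harry Potter books is an assumption;
# the per-million-token prices are the figures quoted above.
CORPUS_TOKENS = 1_000_000  # rough size of all seven books, as assumed here

PRICE_PER_MTOK = {           # USD per million input tokens, as quoted
    "Qwen (Oct '23)":         0.27,
    "GPT-4 (Nov '23)":       10.00,
    "GPT-4o mini (Jul '24)":  0.15,
    "GPT-5 Nano":             0.05,
}

baseline = PRICE_PER_MTOK["GPT-4 (Nov '23)"]
for model, price in PRICE_PER_MTOK.items():
    cost = price * CORPUS_TOKENS / 1_000_000   # cost to read the whole corpus
    ratio = baseline / price                   # how many times cheaper than GPT-4
    print(f"{model:<22} ${cost:>6.2f} for the corpus; GPT-4 cost / this cost = {ratio:.0f}x")
```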

Small wins convert skepticism into sustained practice

  • Early tests may feel underwhelming. Keep asking better questions.
  • Simple tasks like chart advice can match expert checklists.
  • All-around intelligence shows up across tasks like classification and synthesis.
  • Those moments build trust and trigger deeper experiments.

Madhu: Thank you, Anand. That is a good insight. So you understand the LLMs' capabilities by way of giving them certain tests. You evaluate how well they perform for what cost. And then you give a score to evaluate the effectiveness of a particular LLM, right? So that is what you essentially do as an LLM psychologist, right?

I think I'm a little bit more intrigued by your pathway, right? What decision point made you pivot? Because you did engineering at IIT Madras, and then you did an MBA at IIM Bangalore. You worked at Boston Consulting Group, you did a lot of projects. But you also built your own startup, Gramener, as a storyteller and a data expert. And what was this aha moment that said, "Okay, this is my pathway," when AI came in, or even before AI came in? I think you were already in the direction of thinking about LLMs and doing a lot of data-related analytics work that was anyway leading you to this journey, but when did you actually have the aha moment and what was the pivot?

Anand: ChatGPT was released in November 2022. And even before that, the GPT class of models had been released. So I had tried these in the past. For example, I was trying to see if the GPT-2 model could actually generate the configuration for some of the comic strips that we were playing around with. And it turned out that it had some moderate success, but it did not feel very impressive. When ChatGPT came out, I tried it once, maybe twice, and I didn't really understand what the big deal was. And this is how most opportunities get missed.

There was a client presentation where they were looking to understand what the impact of AI was. I was forced to go back and try it out. I explored ChatGPT on 29th May 2023. I put in a question to ChatGPT, which was, "How can I visualize the change in sales since last month?" I'm in the data visualization space. I know this in and out. So when I as an expert looked at its answer, it said: to visualize the change in sales since last month, you can use various types of charts, and here are some popular options. It listed a line chart, a bar chart, an area chart, a pie chart, a column chart; it mentioned when you would use each one, how each would be drawn, and what tools you could use, suggesting Excel, Google Sheets, Tableau, and Python libraries like Matplotlib and Plotly. At that point, I said, "Okay, at least for a basic question like this, it has the ability to give the same answer as me, who is an expert in the field. That's not bad."

And then a week later, 8th June ‘23, I asked it, "Here is a survey that I did of common problems that software developers have. I'd like you to first summarize the five most common problems." And it did, which was fantastic. Then I said, "Okay, I get it. So now against these five buckets—communication, resources, timelines, work-life balance, process—I want you to categorize each of the original problems." And it did. Then I said, "Now tell me, which of these should I focus on to have the highest improvement on a software company's productivity?" And it went through, thought step by step in detail, and said, "Focus on the communications-related problems because those seem to be mentioned more often and seem to have a higher impact, for these reasons."

Now, given the volume of input, this is something that would have taken me at least a few hours. So definitely a time-saving tool. But what impressed me was in the middle stage where it did that categorization. I'm not sure I could have come up with a categorization that was as good. So here is something where it actually has higher intelligence than me. And that's where it struck me. We are talking about an all-around intelligence, something that may not be as good as an expert in a specific area, but is reasonably good in almost all areas. So where I have deep knowledge, sure, I'll give an answer. But almost anywhere where I don't have that deep knowledge, here is a tool that I can use to answer my questions, which means that my need for average people supporting me has suddenly vanished. And that is a huge discovery. At that point, I became a total convert.

Different models fill gaps across tasks and media

  • Try multiple models because each has strong and weak spots.
  • Some shine at images or long context; others focus on speed.
  • Switch models when a task needs a skill your default lacks.

Madhu: How did you start learning other LLMs? What intrigued you apart from ChatGPT?

Anand: Partly to see what else is out there, partly to fill in some of the gaps that I had. Every model has its pros and cons, every tool or application has its pros and cons. So I was exploring. Price and cost are certainly a constraint. When the Gemini models came out, they were practically free for fairly high compute capacity. GPT-4, for instance, was not available for free unless you were using Microsoft Copilot, whereas Gemini 1.5 Pro was available for free for a fairly large usage limit. I said, okay, that gives me higher intelligence in a more accessible way. But it also allowed me to explore some of the things that, let's say—so Gemini models, for instance, allowed me to process images, which ChatGPT in the initial stages did not do or did not do as well. So depending on my need and the lack of a tool's ability to solve my needs, I started looking out and found others.

Leverage your strengths and practice every single day

  • Build on your domain strength so the model’s answers land better.
  • Set a high daily reps target; practice is really all that counts.
  • “Think step by step” often helps; emotional pleading often hurts.
  • Evidence: Emotion prompting tests, Zero-shot CoT research.

Madhu: I'd like to understand a little bit more about the journey so that my listeners who want to become, say, LLM psychologists or who want to venture into this type of work understand what they need to invest, right? So I'd like to understand: you are a technologist, a hardcore technologist. You were deep into these concepts already before you ventured into your experiments on LLMs. But say from 2023 to 2025, you would have done a lot of homework to keep going, to keep experimenting and things like that. So I would like to talk about that part of your exercise. What kind of homework do you normally do to keep up with your information about LLMs? And not only that, I would also like to touch upon: is it important to be a technologist to start investing time and energy in a concept like this? So those two, I would like to hear from you.

Anand: I think a lot of the strength that we derive is from strengths we already have. So I am a technologist and therefore chose a technology route. But someone who is, let's say, a legal expert would probably be able to get more legal assistance out of an LLM than anyone else would. And by doubling down on their strengths, they are extracting the best out of that tool. Remember, I mentioned that AI is almost like an all-rounder. But when it gives an answer, I am able to shape its answer much better in my area of expertise, therefore get more out of it. In an area outside of my expertise, I'm just relying on its wisdom.

So if we want to get the most out of a tool, it certainly helps to take assistance where we are not experts in that area. But it makes sense to improve our expertise by prompting it repeatedly where we know it well. And from that perspective, I would say two things. One, the area of expertise doesn't matter. Do leverage your expertise; you don't have to be a technologist to leverage LLMs. Second, practice is really all that counts. I had a target of having 50 conversations with ChatGPT or any LLM every day. I'm probably still far from it, but having a target and getting as close to it as I could definitely helped.

And doing this like any other form of study, in a systematic, intentional way. So for instance, I saw a blog post claiming that if you tell ChatGPT or Gemini or Claude or any of these models something like, "Oh dear, I'm totally overwhelmed, I need your help this second. My life depends on it. I'm counting on you"—that sort of emotional blackmail, it does a better job. So is that true? I said, let's run the same task with and without this prompt. If I ask LLMs to multiply a bunch of numbers, do they get it right more often? Do they get it wrong more often? And it turns out that, of course, the answer varies, but on average, emotional blackmail makes models perform worse. I tested this across about 28 models. For seven models, it performs better; for 21 models, it performs worse. So that's not a good sign.

On the other hand, a statement like "Think step by step" actually tends to improve most models, and by a reasonable amount. And that, of course, also led to most LLM players building that kind of prompt into the models themselves. So it's simple: if somebody claims something works, let us try it out and see whether it does. That's an example of intentional practice, and it helps.
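
A minimal sketch of this kind of with/without prompt experiment, assuming an OpenAI-compatible API. The model name, suffix wording, and sample size are illustrative choices, not the setup used in the 28-model test described above.

```python
# Minimal sketch of a prompt A/B test: does a suffix change accuracy on
# small multiplication problems? Model name, sample size, and suffixes
# are illustrative assumptions.
import random
from openai import OpenAI  # assumes the `openai` client and an API key

client = OpenAI()
SUFFIXES = {
    "plain":        "",
    "emotional":    " Please, my job depends on this. I'm counting on you!",
    "step_by_step": " Think step by step, then give the final answer.",
}

def ask(prompt: str, model: str = "gpt-4o-mini") -> str:
    reply = client.chat.completions.create(
        model=model, messages=[{"role": "user", "content": prompt}]
    )
    return reply.choices[0].message.content or ""

def accuracy(suffix: str, trials: int = 20) -> float:
    """Fraction of multiplication problems answered correctly with this suffix."""
    correct = 0
    for _ in range(trials):
        a, b = random.randint(100, 999), random.randint(100, 999)
        answer = ask(f"What is {a} * {b}?{suffix}")
        correct += str(a * b) in answer.replace(",", "")
    return correct / trials

for name, suffix in SUFFIXES.items():
    print(f"{name:>13}: {accuracy(suffix):.0%}")
```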

Publish logs, stories, and prototypes to compound learning

  • Keep a “Things I Learned” log to capture patterns fast.
  • Turn bigger experiments into data stories others can read.
  • Ship small prototypes so teams can try ideas immediately.

Madhu: So with intentional practice, questioning different concepts time and again, you get inspiration from the internet, you get inspiration from the news, and your own curious mindset leads you on this journey of finding out what an LLM's behavior looks like. So do you then create an LLM psychology report? Or do you maintain a diary or a log of the LLM's psychological journey?

Anand: I do that at three levels. One, anything that I find interesting, I note in a "Things I Learned" log and publish it. For larger experiments, I create data stories out of those and publish them. And for where I'm trying out a functionality, I build a model, a prototype, a mini application of sorts, and publish that as well. All of these are on my GitHub repository.

Cross-checking models cuts errors and boosts trust

  • Start with a single pass to gauge baseline quality.
  • Add double or triple checks; error rates fall sharply with more checks.
  • Send disagreements to humans; total manual work drops a lot.
  • See: Double-checking impact.

Madhu: I think that's a very good pivot into the why, the value of AI therapy or the value of an LLM psychologist's work. So let's talk about the benefits. Because this isn't just an academic exercise. For the companies, the CXOs, and the developers listening, what are the tangible business benefits of having an LLM psychologist on the payroll?

Anand: A lot of clients have questions, objections, confusions about LLMs. And being able to help them understand in a systematic way how a problem can be addressed or bypassed is where somebody like me comes in. For example, one of the classic objections to using LLMs is, "Oh, but they hallucinate." That is, they can make mistakes. How do we address that? Now, the truth is, humans hallucinate as well, meaning they make mistakes. How do we deal with it? And that's easy. That's standard management practice. We have someone else verify their work.

So I ran an experiment to see, supposing we asked an LLM to classify chat messages into a bunch of categories, how often do they get it right on average? And it turns out that for the dataset that I was looking at, about 14% of the time, they make mistakes. Okay. What if we had two models cross-check and only if they both agree do we take the result? It improves the error rate, brings it down to 3.7%. That is effective. Now, we can do triple checking, quadruple checking, quintuple checking. And it turns out that the error rates fall: 2.2%, 1.5%, 0.7%. So we can take it to 99.3% accuracy. And the remaining cases, where even one model disagrees with any of the others, we can have a manual check. But even then, that manual checking is only 28% of the total work, which means the manual effort has dropped by 72%. And overall, I have an accuracy of 99.3%.

Now, the models are so cheap that quintuple checking is barely any incremental cost over a single check. And therefore, people say, "Ah, okay, I get it. I now have a lever to control quality, trading it off against the effort." So I can decide if I want double checking or triple checking or quintuple checking. And then which models should you use to improve the effectiveness? Some models tend to think differently, so you would want to pair those. Which are those models? These are the kinds of things where, if someone's constantly experimenting, researching, and sharing with the organization, they have an advantage in being able to share with clients and help them move forward in their AI journey, which leads to a commercial benefit as well.
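
A minimal sketch of the cross-checking workflow described above: classify a message with several models, auto-accept only unanimous results, and route any disagreement to a human. The model names and label set are illustrative assumptions, not the ones used in Anand's experiment.

```python
# Sketch of the cross-checking workflow: label a message with several models
# and accept the result only when they all agree; anything else goes to a
# human review queue. Models and labels are illustrative assumptions.
from openai import OpenAI  # assumes the `openai` client and an API key

client = OpenAI()
LABELS = ["complaint", "query", "feedback", "spam"]
MODELS = ["gpt-4o-mini", "gpt-4.1-mini", "gpt-5-nano"]  # illustrative trio

def classify(message: str, model: str) -> str:
    reply = client.chat.completions.create(
        model=model,
        messages=[{
            "role": "user",
            "content": f"Classify this chat message as one of {LABELS}. "
                       f"Reply with the label only.\n\n{message}",
        }],
    )
    return (reply.choices[0].message.content or "").strip().lower()

def triage(message: str):
    """Return (label, route): auto-accept on unanimity, else escalate."""
    votes = {classify(message, m) for m in MODELS}
    if len(votes) == 1:              # all models agree: auto-accept
        return votes.pop(), "auto"
    return None, "human review"      # any disagreement: escalate

label, route = triage("My invoice is wrong and nobody replies to my emails.")
print(label, route)
```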

Limit risky powers to keep agent systems safer

  • Risk spikes when a system holds private data, reads untrusted input, and can communicate externally.
  • Pick only two at once to stay safer, and monitor activity.
  • Use strict prompts, scoped permissions, and clean data sources.
  • See: System prompt overrides.

Madhu: So do you think this is going to be a frontier for AI safety and security as well? Because that is a big question when we talk to business owners. I think people have the question at the back of their heads. Yes, some of the early adopters are enthusiastic about adopting AI in their business, but the big question everybody asks is how safe their data is and how much they can rely on the AI safety and security systems that exist today. So will this experimental LLM psychology position give answers on AI safety as well?

Anand: There are two parts to AI safety at the moment. One is a hard, guaranteed kind of safety, and another is a softer, "try and make sure there are fewer problems" kind of safety. For, let's say, an internal application, the latter is sufficient: give me more capability, and if the system is unsafe, the worst case is that data leaks from one part of the organization to another. That's not a disaster. If, for instance, one of my colleagues gets my email ID, it's okay. If, on the other hand, it were an external-facing application and a spammer got hold of my email ID, that is not an acceptable outcome.

For solving the hard problem, there is what is called the lethal trifecta, which is a term that Simon Willison coined. There are three things: access to private data, the ability to communicate externally, and exposure to untrusted content. If you have all three, your system is unsafe. You can choose any two out of these. You can say, "Look, I won't give you my private data. But I am okay with exposing you to untrusted content; read whatever you want. And I'm okay with you communicating externally; send whatever you want." No problem. My private data does not even come into the picture. Or you could say, "I will give you access to private data, and you can even communicate with the rest of the world. But the only thing that you can read is trusted content. I will not allow a hacker to write something that goes into your prompt."

So, in short, it is possible to engineer systems that can access private data, talk to anything, and read anything, as long as you pick only two of these. But for full capability, you want to be able to access private data and talk anywhere and read anything. All three are required for a good, capable system. In that case, we have to mitigate the risk as much as we can, recognizing that we cannot fully solve the safety problem. Managing prompts, setting different degrees of permissions for access to private data, trying to make sure that the sources are as cleaned up as possible—these are measures that people are trying to take.
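
One way to make the "pick any two" rule concrete is to encode it as a configuration check, as in the sketch below. The field names and example agents are illustrative assumptions, not an established API or Simon Willison's own formulation.

```python
# Sketch of the "lethal trifecta" rule as a configuration check: an agent
# that combines private-data access, untrusted input, and external
# communication is flagged. Field names are illustrative assumptions.
from dataclasses import dataclass

@dataclass
class AgentConfig:
    reads_private_data: bool
    reads_untrusted_content: bool
    communicates_externally: bool

    def trifecta_risk(self) -> bool:
        """True when all three risky capabilities are enabled at once."""
        return (self.reads_private_data
                and self.reads_untrusted_content
                and self.communicates_externally)

internal_bot = AgentConfig(True, False, True)  # no untrusted input: safer
web_agent    = AgentConfig(True, True, True)   # all three: needs mitigation

for name, cfg in [("internal_bot", internal_bot), ("web_agent", web_agent)]:
    status = "mitigate or drop one capability" if cfg.trifecta_risk() else "acceptable"
    print(f"{name}: {status}")
```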

Madhu: So you've given the answers for therapies basically. So you've said prompt therapy and maybe the choice of model as a therapy. Pretty much. And then of course, re-evaluating elements as another therapy. So very interesting take, very, very interesting take. This is the beauty and complexity of the new world we are stepping into.

Madhu: I've coined this kind of a rapid-fire round. I'm going to throw you names or personalities in different fields, and you can probably prescribe an AI or an LLM that they would be better off using or that would enhance their work in their current working model. If you could think about that and let me know. Shall I start giving you names?

Anand: Sure.

AI can expand a composer’s creative palette

  • For cinematic orchestration, try AIVA to sketch themes and export MIDI.
  • For quick song ideas, Suno lets you draft, iterate, and share.
  • Use AI for exploration; keep the final taste human.

Madhu: Okay. So for music, let's go with A.R. Rahman. You mentioned A.R. Rahman earlier. He is a master of blending complex tradition with cutting-edge technology in the music world, an Oscar award winner. What would his AI collaborator tools look like? What kind of an LLM might he use for his sounds?

Anand: He might use a tool like AIVA—A-I-V-A—which is specifically built for cinematic orchestration. It's an AI tool with about 200-odd styles and MIDI export. It can identify the composer of specific songs and the legalities associated with them. Though for the common man who's trying to compose music, a tool like Suno—S-U-N-O—might be more apt.

AI can guide athletes and investors with data

  • For court analytics, SwingVision tracks shots and flags patterns.
  • For venture research, AlphaSense summarizes filings and transcripts quickly.
  • General models can still draft memos and review pitch decks well.

Madhu: Next one, for sports: Serena Williams. She's an athlete on court but also a powerhouse venture capitalist. So what AI should she use for the court and for the boardroom?

Anand: For the court, maybe something like SwingVision, which gives AI-based shot tracking and coaching directly from a phone. For the boardroom, for a VC, maybe something like AlphaSense, which can accelerate venture research with generative summaries across filings and expert sources. But arguably, even the likes of ChatGPT Pro or Claude's Max, which can do some fantastic deep research both on the sports side but even more importantly across portfolio companies, pitch decks, for example.

AI helps authors speak to every language and reader

  • IndicTrans2 and Sarvam AI support many Indian languages with strong fidelity.
  • NotebookLM can turn sources into overviews and ideas you can explore.
  • Together they help stories travel to new audiences.

Madhu: Okay, thank you. That's a good insight, and not just for Serena; I think also for other AI entrepreneurs who want to venture into that world. So the next one is for philanthropy and literature: Sudha Murty. She's an engineer, a beloved author, a philanthropist, and a storyteller as well. I love her storytelling narratives. What AI do you give her to scale the impact and empathy she's already very good at?

Anand: Possibly something like IndicTrans2 or Sarvam AI, which have strong translation capabilities across a wide variety of Indic languages for the stories to reach a wider audience. And something like NotebookLM, where she could take hundreds of thousands of stories and reports across multiple sources, create audio overviews, and have it generate stories for her or story ideas perhaps.

AI can scale advocacy and defend trust in media

  • Quorum Grassroots and VoterVoice personalize outreach at scale.
  • C2PA content credentials help audiences verify source and edits.
  • Use these to grow reach while protecting credibility.

Madhu: Thank you, Anand. That's nice. Next one is for global advocacy: Michelle Obama. She runs global initiatives for education and health and is a very powerful communicator on the world stage. What would be her AI prescription?

Anand: One possibility would be something like Quorum Grassroots combined with VoterVoice. This has the ability to generate hundreds of unique message variations, reaching a large number of people in a far more personalized way. Or something like C2PA. These are content credentials. They allow media to counter deepfakes and build public trust. These are some of the tools that you could use in public discourse.

AI turns practice sessions into measurable coaching

  • CricViz Centurion helps plan match strategy from deep cricket data.
  • StanceBeam quantifies bat mechanics during nets for feedback.
  • Full Track AI adds phone-based ball tracking and biomechanics.

Madhu: Now, coming back to sports again, Virat Kohli. A different kind of athlete. He's obsessed with data, peak performance, and real-time strategy, right? What would be his personal AI performance coach?

Anand: Possibly something like CricViz Centurion. It's a comprehensive cricket database combined with models that predict the expected number of runs and can guide match strategy. Or something like StanceBeam. That's a tool that quantifies bat mechanics in practice. Or tools like Full Track AI, which can convert your phone into a full-fledged ball tracking application and also provides biomechanical feedback during net sessions for self-coaching.

AI tutoring can lift learning where teachers are scarce

  • Khanmigo offers guided help rather than direct answers.
  • Students get patient tutoring; teachers gain planning support.
  • At scale, this can narrow learning gaps in many regions.

Madhu: Okay, something everybody can use as well, not just Virat Kohli but also aspirants in the cricket world. Yeah, that's nice. A what-if situation for our beloved A.P.J. Abdul Kalam, our scientist, educator, and president. What AI would have helped him accelerate his Vision 2020 for India?

Anand: I think he would have loved the Khan Academy AI. I'm trying to remember its name… Khanmigo. It allows tutoring through AI across a huge number of students. I think he would have really liked really good content to reach students where we do not have good teachers to educate them.

Curiosity and clear experiments keep you ahead

  • Keep asking “What changed, by how much, and at what cost?”
  • Share simple dashboards and narratives teams can absorb quickly.
  • Build reusable prompts and templates so progress compounds.

Madhu: Thank you. You've made a lot of sense of the title; you've given it a lot more justification. Thank you so much for being here and talking to our audience. Thank you so much.

Anand: Thank you for the opportunity. Lovely talking to you, Madhu.

Quiz

  1. Why does pairing multiple models reduce manual review work even if some still disagree?
  2. When would you switch models rather than prompts, and why?
  3. How can content credentials change audience trust in public campaigns?
  4. What daily habits most improve your prompting skill over a month?
  5. Why can “think step by step” help while emotional framing harms results?

Errata

  1. GPT-4 launched on March 14, 2023, not November 2023. (Wikipedia)
  2. ChatGPT launched on November 30, 2022. (OpenAI)
  3. Gemini access and limits vary by plan; free tiers exist but have prompt and rate caps. Pricing and caps change. (Google AI for Developers)
  4. “Think step by step” (Zero-shot CoT) is empirically shown to improve reasoning on many tasks. (arXiv)
  5. The “lethal trifecta” term and pattern are from Simon Willison’s security guidance. (Simon Willison’s Weblog)

Counterpoints

  1. LLM “personality” can be unstable or prompt-dependent; refusal rates and reliability vary across tests. (ACL Anthology)
  2. Cross-checking models can still fail when errors are correlated across providers or architectures. (arXiv)
  3. Elo ratings compare systems within a pool; they do not measure absolute intelligence, so analogies can mislead. (Wikipedia)
  4. Emotion prompts sometimes help engagement, but effects on accuracy are task-specific and not consistently positive. (arXiv)
  5. Safety is more than “pick any two”; new agent patterns try to reduce prompt-injection risk even with broad powers. (Simon Willison’s Weblog)

Feedback

  1. State dates and prices with exact sources to anchor claims.
  2. Separate anecdotes from experiments and label each clearly.
  3. Bring one simple chart on price-to-quality over time.
  4. Show one failure case where cross-checks still miss an error.
  5. Leave listeners with a one-page “start here” checklist of tools and habits.