Anand's LinkedIn Archive

LinkedIn Profile

May 2024

Manas Pratim Haloi, you're right! I missed Replicate's 5c/MTok for llama-3-8b-instruct. https://llmpricecheck.com/replicate/meta-llama-3-8b-instruct

I've revised the site. Yes, mistral-7b does drop off. Thanks for this!
Vishnu Agnihotri, Claude 3 Opus is excellent. But it costs $15 per million tokens, and the latest GPT-4 Turbo, which is about as good on Elo score, is only $10. So it doesn't make the cut.
There are 4 frontier #LLMs today. No other (popular) model beats them on BOTH cost and quality.

llama-3-8b-instruct
claude-3-haiku-20240307
llama-3-70b-instruct
gpt-4o-2024-05-13

This list changes rapidly. But in practice, it means there's little reason to use any other LLM. No other popular model beats them on both cost and quality (quality measured by the LMSYS Arena Elo score).
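One way to read "no other model beats them on both cost and quality" is as a Pareto frontier: keep a model only if no other model is at least as cheap and at least as good, and strictly better on one of the two. Here is a minimal sketch of that filter. The `Model` type, the model names, and all the numbers are hypothetical placeholders for illustration, not real prices or arena scores, and this is not necessarily how the site computes its list.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    cost: float  # USD per million tokens (lower is better)
    elo: float   # arena-style quality score (higher is better)


def dominates(a: Model, b: Model) -> bool:
    """True if a is no worse than b on both axes and strictly better on one."""
    no_worse = a.cost <= b.cost and a.elo >= b.elo
    strictly_better = a.cost < b.cost or a.elo > b.elo
    return no_worse and strictly_better


def frontier(models: list[Model]) -> list[Model]:
    """Keep the models that no other model dominates, sorted by cost."""
    kept = [m for m in models if not any(dominates(o, m) for o in models)]
    return sorted(kept, key=lambda m: m.cost)


# Hypothetical numbers for illustration only.
models = [
    Model("model-a", 0.05, 1150),
    Model("model-b", 0.25, 1180),
    Model("model-c", 0.50, 1170),   # dominated by model-b
    Model("model-d", 5.00, 1290),
    Model("model-e", 15.00, 1250),  # dominated by model-d
]

frontier_names = [m.name for m in frontier(models)]
print(frontier_names)
```

With these made-up numbers, model-c and model-e drop out because a cheaper model scores higher. That is the same kind of reasoning the changelog entries in this post use when a model is dropped.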

I opened Straive + Gramener's keynote yesterday at marcus evans Group's Digitech forum with this. Strange that this is not well known. Especially as switching from GPT-4 to Claude 3 Haiku can shrink a $1.2 million Gen AI budget to just $10K.
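The budget claim is a 120x reduction, which is what a straight per-token price ratio would give for the same token volume. Here is a rough sketch of that arithmetic. The two prices are my assumption (list input prices of about $30 and $0.25 per million tokens); the post itself does not state them, and output tokens are ignored.

```python
# Assumed list prices in USD per million input tokens (not stated in the post).
GPT4_PRICE = 30.00
HAIKU_PRICE = 0.25


def rescaled_budget(budget: float, old_price: float, new_price: float) -> float:
    """Cost of the same token volume at a different per-token price."""
    return budget * new_price / old_price


new_budget = rescaled_budget(1_200_000, GPT4_PRICE, HAIKU_PRICE)
print(f"${new_budget:,.0f}")
```

Real workloads mix input and output tokens at different prices, so the actual saving depends on that mix.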

See the interactive version at https://lnkd.in/eM7zRJrN

10 May 2024: mistral-7b-instruct-v0.2 was dropped since llama-3-8b-instruct is available for cheaper on Replicate.
19 May 2024: gemini-1.5-pro-api-0409-preview and gpt-4-turbo-2024-04-09 were dropped since gpt-4o-2024-05-13 is half the price at similar quality.
Oh, wish I could make it! I'll look forward to the recordings (if they're possible)
If I could time-travel, I'd pick 250 BC. Ashoka was turning into one of the most famous emperors of India and Archimedes was growing into one of the greatest mathematicians of all time.

Parallel Lives is a beautiful visualization by Jan Willem Tulp that shows who lived when and how their lifetimes overlapped, with each person sized by their prevalence on Wikipedia. I'm a history fan and have spent several hours scrolling through the site:

https://lnkd.in/g_hY86uR