AIs tried to run a business
Also: Skillsmaxxing with o3
Have you seen o3's geoguessing skills yet? Paired with web search and a long prompt, it pulls off some incredible feats.
Howdy wizards,
A special welcome to the 286 new subscribers who joined last week.
It's time to get comfortable and brew up some serious gourmet shit.
Here's what's brewing in AI.

DARIO'S PICKS
Have you wondered if AI can make money by itself?
A new study put different AI models to the test in operating a vending machine business, on their own.
Here's the deal:
Researchers created 10 AI agents, gave them $500 each and ran a simulation where they operated their own vending machine. The goal? Maximise profits.
Agents could use tools like web search, email (for interacting with simulated suppliers) and even a sub-agent to do the physical stocking of the vending machines. Oh, and they also ran the test on a human, for comparison!
Results-wise, the AIs did really well until they really didn't. At seemingly random times they'd have a breakdown, forgetting orders they had placed and whatnot... kinda like humans without coffee.
Claude 3.5 Sonnet and o3-mini managed to beat the human tester, but the variance was really high, since they sometimes sold nothing. For the time being, humans are more reliable for this type of task (though I'd love to see an updated study with the latest reasoning models).
Why it matters: This test feels viscerally more 2025 than the MMLU-SuperGLUE-9000 benchmarks we've mostly seen so far. While big tech is busy gaming leaderboard positions (check the LMArena controversy), we could be testing whether these systems can actually handle a basic business without randomly forgetting what a Snickers bar is halfway through. Tracking how good AI is at acquiring capital could also be helpful in other ways, like if we're gonna take the whole p(Doom) thing with a grain of seriousness for a second.
Btw, wouldn't it be cool to have a living benchmark with LLMs operating an ACTUAL vending machine? Like, the AI buys real products to stock, interactions with suppliers are real, customers are real rather than an economic model, etc. Heck, I'd buy an iced tea from o3 in a heartbeat. Don't say I don't bring you solid ideas.

IN PARTNERSHIP WITH SYNTHFLOW
The Future of Voice AI Is Here
Discover why forward-thinking enterprises are rapidly adopting Voice AI Agents. This guide breaks down the $47.5B market shift, highlights emerging trends, and offers practical steps for successful implementation.
Learn how leading teams are using Voice AI to boost efficiency, elevate customer experience, and start delivering measurable results in as little as 3 weeks.

UP CLOSE
In this mini-series I share different ways I'm using AI from week to week, as well as practical tips & tricks I discover and actually use.
if you are not skillsmaxxing with o3 at minimum 3 hours every day, ngmi
- Sam Altman (@sama)
5:51 PM · Apr 25, 2025
OpenAI's CEO Sam Altman tweeted last week that anyone who isn't "skillsmaxxing with o3" for at least 3 hours a day is "ngmi" (not gonna make it). Translation: to keep up with AI, you should be spending a lot of time learning with OpenAI's recently launched o3 model.
It's brilliant advice.
o3 is an amazing model. Apart from its sheer intelligence, it's the first model that, for me, actually feels precise when working with data. It's incredible at dissecting and analysing PDFs and images, and its responses are so well-formatted: succinct enough that you actually bother to read them, with tidy, readable tables at exactly the right time. That makes all the difference when you're skillsmaxxing for extended periods of time.
My tips:
Start experimenting with o3 for daily tasks and as a thought partner in your work this week; invest in a paid ChatGPT plan if you don't have one already.
You don't need pre-made prompts or elaborate guides. Just think of a problem and ask it to solve it for you. When it doesn't work, try your best to refine your question, just as you would when speaking to a human who didn't understand your instructions well.
Don't limit yourself to explaining with text; upload images and screenshots to illustrate your point if you can.
You'll be an AI whisperer in no time.
If you're an academic, Ethan Mollick recommends testing it by giving it one of your papers in PDF format and asking it to critique it, adding that "I cannot emphasize enough that you need to try this with o3 (or potentially Gemini 2.5). Other models just are not 'smart' enough".
Crucially, ensure you're using the o3 model specifically, unless you want to end up like this guy:
i skillmaxxed with gpt 3.5 and now i'm retarded please help
- djcows (@djcows)
5:56 PM · Apr 25, 2025
Next week I'll show you a little research tool I put together with the help of o3. It might help you do research on a specific topic (more on that next week) and give you an approach to sense-check your results. Stay tuned!

VERY IMPORTANT AI TRAINING
After countless requests (I mean zero), I am sharing with you the soundtrack that puts me at peak efficiency every Sunday morning as I scramble to finish and hit the publish button on this newsletter.
A delectable selection of bangers such as "Coffee", "Damn Fine Coffee" and "Delorean Dynamite", scientifically proven by the whatplugin Auditory Research Division to increase AI learning by up to 57%.
Listen to the playlist you absolutely didn't ask for here.
Playlist sucks? Complaints department is closed, so send me a song suggestion.

CONTEXT WINDOWS
Howdy! Congrats on digesting my entire newsletter for yet another week. You're the best! And because you're the best I want you to have a clear overview of how leading companies are implementing AI.
That's where my database of AI case studies comes in: it's the best organised and most comprehensive view you'll find. How about discovering which AI strategies are gaining traction in your industry 6 months before your competitors even notice them?
Now, you probably expect me to start selling you something... which is a good guess, but I'm not looking for cash. I'm looking for you to pay with your friends to get access to this unparalleled resource.
How? Just refer 3 friends to this newsletter (yes, they'll thank you for it) and receive immediate, lifetime access in your inbox. You can see your sharing link and how many referrals you currently have below.

THAT'S ALL FOLKS!
Was this email forwarded to you? Sign up here. Want to get in front of 14,000 AI enthusiasts? Work with me. This newsletter is written & curated by Dario Chincha.
Affiliate disclosure: To cover the cost of my email software and the time I spend writing this newsletter, I sometimes link to products and other newsletters. Please assume these are affiliate links. If you choose to subscribe to a newsletter or buy a product through any of my links then THANK YOU, it will make it possible for me to continue to do this.