- What's brewing in AI
🧙🏼 o1 has an IQ of ≈120
Also: Audio Overviews is incredible
This issue is brought to you by
Companies will pay to meet your most talented friends
Share tech job opportunities with your network and earn big rewards. Join Draftboard now and start receiving up to $10k for every successful referral!
Happy Monday, wizards.
Have you already tried OpenAI's o1 model and run into this problem too?
Dario's Picks
The most important news stories in AI this week
o1 scores 120 on Mensa's IQ test, a cut above all other models. Maxim Lott has been tracking AI models' performance on the IQ test made by Mensa Norway. OpenAI's o1 scored 120, well above the average human IQ.
To rule out the possibility that o1 had already been trained on the Mensa test, which would give it an unfair advantage over older models, the author also ran every model on a second, "offline" version of the test. On this version all models performed worse, but the lead of OpenAI's new o1 model over the second-best model (Claude 3.5 Opus) was still large.
OpenAI's new o1 model is a BIG breakthrough in AI intelligence, if IQ tests say anything.
I gave it the Norway Mensa IQ test, and it blows other AIs out of the water.
I'm surprised!... Because there hadn't been public progress in the last 6mo.
Link to full analysis below:
Maxim Lott (@maximlott)
7:50 PM • Sep 14, 2024
Why it matters: I'd take the IQ score of 120 with a grain of salt. The "offline" version of the test yielded a lower score for all models, although the differences between the models remained fairly consistent. Also, remember that these tests don't reflect the full spectrum of human intelligence; they focus on a narrow set of logic and reasoning tasks.
PS: Take a look at the article's first example, the most difficult question o1 was able to solve. I tried it myself and failed! AI is not only more knowledgeable than most humans, but also excels at pattern recognition tasks like this. No wonder all the CAPTCHAs around the web are waay more difficult now.
Continued after the ad…
From our partners
Doing the same boring work again and again is exhausting.
What if you had a personal AI assistant who could do the job for you?
Google's Audio Overviews feature turns information into an engaging, AI-generated discussion. Prepare to have your mind blown: Audio Overviews takes documents, slides, charts, or simply a URL and turns them into an engaging audio discussion between two AI hosts. It sounds close to a natural conversation between two experienced radio hosts. For context, this new feature is part of NotebookLM, a research assistant developed by Google and powered by Gemini 1.5 Pro.
I've made a simple demonstration for you below. It was generated with one click(!) by providing the URL of the article mentioned above about OpenAI's o1 model scoring 120 on the IQ test.
Why it matters: The realism of this is astonishing. Audio Overviews can turn the boring information we sometimes have to consume into something much more engaging, with zero effort. I can honestly count the number of wow-experiences with AI like this on one hand.
Memo is changing the way people study. A while back, we reviewed the most popular tools that can turn study materials (like a PDF deck) into notes, flashcards, and quizzes in seconds. Memo has surged in popularity lately and reportedly has a user base of over 100,000 students now. The backstory: Memo, formerly known as PDF2Anki, was created by two medical students who used flashcards to memorise large amounts of knowledge during their studies and decided to streamline the manual creation of these cards with AI.
Why it matters: I can't help but imagine the number of hours I would have saved if this had existed while I was a student. Under the hood, it uses frontier AI models to instantly create helpful study notes. If you're a student, I'd warmly recommend checking it out.
Was this email forwarded to you? Sign up here. Want to get in front of 13,000 AI enthusiasts? Work with me. This newsletter is written & curated by Dario Chincha.
Affiliate disclosure: To cover the cost of my email software and the time I spend writing this newsletter, I sometimes link to products and other newsletters. Please assume these are affiliate links. If you choose to subscribe to a newsletter or buy a product through any of my links, then THANK YOU. It will make it possible for me to continue to do this.