šŸ§™šŸ¼ Meta's Video Gen

Also: Apple intelligence is around the corner

Happy Monday, wizards.

And welcome to the 100th edition of this newsletter. Really appreciate all of you who read and follow the stuff I share here every day. Wouldnā€™t be much fun without you!

Without further ado ā€“ hereā€™s the most interesting things happening in AI today.

DARIOā€™S PICKS

Screenshot: Meta | Edit: Whatā€™s brewing in AI

Meta just released a new AI model series that can generate and edit videos + generate soundtracks and sound effects for your video clips. It claims to outperform or be on par with current top video models on on all quality metrics: OpenAIā€™s Sora, Runway Gen3 and Kling 1.5. Apparently, it also beats them all on realness and aesthetics.

The system can generate up to 16 seconds long clips with a syncronised audio track. Mark Zuckerberg announced the new model is coming to Instagram next year ā€“ along with a video of himself pushing about 20,000 chicken nuggets on leg day. Check it out.

ā€Ž Why it mattersā€Ž ā€Ž The video generated with these models look truly impressive, and itā€™s interesting to see whatā€™s coming. That being said, I think many of us are growing slightly tired of new yet-to-be-realeased video models claiming to beat other yet-to-be-released video models.

TOGETHER WITH DRAFTBOARD

Why should company employees have all the fun with referral bonuses?

Meet Draftboard: A platform where referral bonuses are open to everyone. Share job opportunities from companies like SeatGeek, Via, Formlabs, Bilt, Triple Whale, and OneSignal with your network. Earn when your friends get hired.

Hereā€™s How:

  • Seamless Sharing: Copy and share your unique referral link via text, email, or social media.

  • Diverse Companies: Access job postings from over 80 premier companies across multiple sectors.

  • Real-Time Updates: Track the progress of your referrals as they advance through the hiring process.

DARIOā€™S PICKS

2. Three recent case studies from OpenAI: agents, role-playing and AI web development

Thereā€™s some fascinating recent case studies shared by OpenAI which I havenā€™t featured in this newsletter yet:

Creative Agents ā€“ Media & Entertainment

  • Altera uses OpenAI's GPT-4o to create ā€œdigital humansā€ that play Minecraft, AI agents that play the game with users like friends. Altera is combining GPT-4o and a brain-inspired multi-module system ā€“ letting these agents operate autonomously for up to four hours.

Customer Agents ā€“ Education

  • Speak is an app for practicing new languages thatā€™s making the experience of language learning a lot more fun. It OpenAI's new Realtime API to power its role-play feature, which allows users to practice conversations in a new language with the same speed and naturalness of the speech-to-speech model (same model as Advanced Voice Mode in ChatGPT).

Code Agents ā€“ Software & IT

  • Coframe uses GPT-4o to build an AI engineering assistant that generates new sections of a website based on existing code and images. By fine-tuning GPT-4o with vision and text, they improved the consistency of visual style and layout by that could be generated by 26%.

ā€Ž Why it mattersā€Ž ā€Ž Alteraā€™s use case shows that weā€™re getting closer to autonomous agents that can do tasks without human intervention. Speak and Coframe are good showcases of what can be done with OpenAIā€™s latest API improvements: the Realtime API and the vision fine-tuning capability.

PS Iā€™ve started tracking all the use cases of companies implementing AIā€”and the technology they use for itā€”from all around the web. Iā€™m building something to make it easier for everyone to see whatā€™s working with AI. Stay tuned for it in the next days!

FROM OUR PARTNERS

200+ hours of research on AI tools, prompting techniques & hacks packed in a solid 3 hour masterclass.

DARIOā€™S PICKS

Bloomberg is reporting that Apple Intelligence is going to be launched on October 28th. Initially planned for mid-October, Apple is apparently taking extra time to fix bugs and ensuring their private cloud compute servers can handle all the traffic that will come their way.

The release is expected to feature several of the cool stuff Apple has been showcasingā€”the new Siri interface, writing tools, notification summaries, and more. However, the full brand-new Siri experience (the best and most innovative part of it), isnā€™t set to release until later, possibly delaying into 2025.

The experience isnā€™t going to be available to everyone though ā€“ youā€™ll need an iPhone 15 Pro or later to install Apple Intelligence. Ipads and Macbooks with A1 chips or later will also be able to use it.

ā€Ž Why it mattersā€Ž ā€Ž Many testers have been underwhelmed by Apple Intelligence so far, as it uses GPT-3.5-level intelligence for all native features to power everything locally on the phone. Thatā€™s great for privacy, but not so much for performance. I still think having an AI on your phone, integrated with your apps, could be a massive upgrade in terms of the user experience though.

THATā€™S ALL FOLKS!

Was this email forwarded to you? Sign up here.

Want to get in front of 13,000 AI enthusiasts? Work with me.

This newsletter is written & curated by Dario Chincha.

What's your verdict on today's email?

Login or Subscribe to participate in polls.

Affiliate disclosure: To cover the cost of my email software and the time I spend writing this newsletter, I sometimes link to products and other newsletters. Please assume these are affiliate links. If you choose to subscribe to a newsletter or buy a product through any of my links then THANK YOU ā€“ it will make it possible for me to continue to do this.