- What's brewing in AI
- Posts
- š§š¼ AI in healthcare gone wrong
š§š¼ AI in healthcare gone wrong
Also: How 4 major SaaS companies use AI
Happy Monday, wizards.
I still have a few sponsorship slots available in this newsletter and on whatplugin.ai for Q4! Leading AI startups are advertising through these channels because of the trust Iāve built with my audience.
Ready to reach thousands of early adopters and top professionals? Fill out this quick form, and Iāll be in touch with more details.
ā
Letās dive into whatās brewing in AI today (oh, and here's what you mightāve missed from last week)
DARIOāS PICKS
š·Excited to present our paper, āCareless Whisper: Speech-to-text Hallucination Harmsā at @FAccTConference! š·We assess Whisper (OpenAIās speech recognition tool) for transcribed hallucinations that donāt appear in audio input. Paper link: arxiv.org/abs/2402.08021, thread š
ā Allison Koenecke (@allisonkoe)
4:13 PM ā¢ Jun 3, 2024
A recent study finds that Whisper, OpenAIās model for speech-to-text transcription, occasionally hallucinatesāinventing entire sentences during moments of silence in recordings. Whisper is widely used by several AI companies that do clinical note-taking, including Nabla, which is used by more than 30,000 clinicians, and has processed over 7 million medical conversations so far.
The study, conducted by researchers from Cornell and the University of Washington, found that Whisper added inaccurate or nonsensical phrases in about 1% of transcriptionsāthatās 70k potential transcripts with hallucinations from Nabla alone. Some hallucinations included invented medical conditions or even phrases like āThank you for watching!ā (thatās right, OpenAI trained the model on a bunch of YouTube videos).
ā Why it mattersā ā AI making occasional errors isnāt just limited to medical transcriptions ā itās inherent in the nature of LLMs. One physician in the comment section of The Vergeās original article, noted that you canāt just copy-paste using these tools; proofreading and verifying the AI-generated notes is part of the processāsimilar to traditional dictation. The big win here is that these tools can reduce documentation time from 5-10 minutes to just 1ā2 minutes per patient, allowing doctors to spend more time where it matters most: treating patients.
PS a bit off-topic but can we hear it for creative research titles like this š¶
TOGETHER WITH WRITER
Writer RAG tool: build production-ready RAG apps in minutes
RAG in just a few lines of code? Weāve launched a predefined RAG tool on our developer platform, making it easy to bring your data into a Knowledge Graph and interact with it with AI. With a single API call, writer LLMs will intelligently call the RAG tool to chat with your data.
Integrated into Writerās full-stack platform, it eliminates the need for complex vendor RAG setups, making it quick to build scalable, highly accurate AI workflows just by passing a graph ID of your data as a parameter to your RAG tool.
DARIOāS PICKS
2. How 4 major SaaS companies are using LLMs
Iāve taken a look at how some of the big tech companies are using AI models to power up their workflows and products.
Hereās 4 fresh examples on how AI is being used by software companies (and some results):
Microsoft utilizes Copilot and the new autonomous agents internally across sales, customer support, marketing, and HR departments. Their sales team uses Copilot to power insights and automate routine tasks, achieving a 9.4% increase in revenue per seller and closed 20% more deals. Customer support resolves cases nearly 12% faster, marketing saw a 21.5% increase in conversion rates on Azure.com with a custom agent assisting buyers, and HR improved answer accuracy by 42% with an employee self-service agent.
Notion uses Claude to improve its product, offering AI features like Q&A, autofill, and writing assistance. These features help businesses like Osaka Gas reduce search time by 35% and save Remote.com 10 minutes per search across 300 queries daily. Notionās use of Claude even eliminates the need for additional AI tools for some companies like dbt Labs, who estimate they save over $35k annually.
Gumroad, an e-commerce platform for digital creators, uses Claude 3.5 Sonnet to enable customer support teams to fix issues with code and contribute to product development. This has led to a 300% increase in feature shipping, faster feature deployment, and reduced context switching for engineers. Claude assists them in writing code, locating files, and resolving customer issues.
Zoom collaborates with Perplexity to enhance its AI Companion by integrating multiple AI models, including Anthropic, OpenAI, and Meta, to provide richer insights during Zoom calls. The AI Companion 2.0 understands user workflows and tracking tasks, in addition to meeting summaries.
ā Why it mattersā ā From making their product more useful to customers to super-empowering their non-technical teams internally ā SaaS companies are embracing AI across their operations. Iām especially impressed by Gumroad: Their customer support team is coding!
Jobs are certainly a-changing.
FROM OUR PARTNERS
Doing the same boring work again and again is exhausting.
What if you had a personal AI assistant who could do the job for you?
THATāS ALL FOLKS!
Was this email forwarded to you? Sign up here. Want to get in front of 13,000 AI enthusiasts? Work with me. This newsletter is written & curated by Dario Chincha. |
What's your verdict on today's email? |
Affiliate disclosure: To cover the cost of my email software and the time I spend writing this newsletter, I sometimes link to products and other newsletters. Please assume these are affiliate links. If you choose to subscribe to a newsletter or buy a product through any of my links then THANK YOU ā it will make it possible for me to continue to do this.