Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- OpenAI launches GPT-4o – OpenAI unveiled GPT-4o, a new flagship “omni-model” with advanced, real-time voice and vision capabilities that is significantly faster, 50% cheaper for developers, and now available to free ChatGPT users.
- Google reveals major AI updates at I/O 2024 – At its annual developer conference, Google showcased Project Astra, a real-time multimodal AI assistant, integrated “AI Overviews” into its main search results, and announced Gemini 1.5 Flash, a new speed-focused model.
- Scale AI raises $1B, doubling valuation to $13.8B – The essential data labeling and evaluation provider for AI companies has raised $1 billion in a Series F funding round with investment from Amazon, Meta, and others.
- Apple nears deal with OpenAI to bring ChatGPT to iOS 18 – Apple is reportedly finalizing an agreement with OpenAI to integrate its technology into the iPhone’s next operating system, with an announcement expected at WWDC in June.
- Microsoft is developing a new large AI model, MAI-1 – Microsoft is training a massive new in-house model, codenamed MAI-1, with 500 billion parameters, signaling its intent to compete directly with models from Google, Anthropic, and its partner OpenAI.
- ElevenLabs launches AI music generator – The AI voice company has released a new text-to-music tool that allows users to create high-quality songs and instrumentals from simple text prompts.
- TikTok introduces AI-generated avatars for branded content – TikTok is launching tools for creators to use pre-scripted “Symphony” avatars or custom avatars in their videos, alongside a new policy to label AI-generated content.
- Poe now lets you chat with multiple AI bots at once – Quora’s AI platform, Poe, has introduced a feature that allows users to call on multiple different AI models, like GPT-4o and Claude 3, within a single chat thread.
- Hugging Face and Google Cloud expand partnership – The partnership allows developers to easily train and deploy open models from Hugging Face on Google Cloud’s infrastructure, with no data egress fees for data going between the two platforms.
- SEC charges company with “AI-washing” – The SEC has charged investment adviser Delphia and its partner Global Predictions for making false and misleading statements about their use of AI, marking a significant regulatory crackdown.
- OpenAI superalignment team leaders depart over safety concerns – Jan Leike, a key leader of OpenAI’s superintelligence safety team, resigned, citing a shift in company culture where “safety culture and processes have taken a backseat to shiny products.”
- Google, Intel, and others form UALink to challenge NVIDIA’s dominance – A new industry group including AMD, Google, Microsoft, and Meta is creating an open standard called Ultra Accelerator Link (UALink) for connecting AI chips in servers, aiming to compete with NVIDIA’s proprietary NVLink.
- Amazon is building ‘Metis’, a new advanced AI chatbot to rival ChatGPT – Amazon is developing a powerful new AI assistant, codenamed ‘Metis’, which will be powered by an advanced Olympus model and provide smart, agent-like capabilities.
- Google unveils Veo, its most capable text-to-video model yet – Google announced Veo, a new generative video model designed to compete with OpenAI’s Sora, capable of creating high-quality, 1080p videos over a minute long with a sophisticated understanding of cinematic language.
- Google reveals Imagen 3, its highest quality text-to-image model – As part of its I/O announcements, Google introduced Imagen 3, its best-performing image generation model to date, with improved detail, photorealism, and significantly better text rendering.
- Self-driving startup Wayve raises $1.05B from SoftBank and NVIDIA – The UK-based autonomous driving company secured over $1 billion to advance its “embodied AI” approach, which aims to teach cars to drive like humans using video data.
- Runway teases its new Gen-3 Alpha video model – Video-generation startup Runway shared a sneak peek of its upcoming Gen-3 Alpha model, showing significant improvements in generating detailed and consistent human characters.
- Microsoft releases Phi-3-vision, a small multimodal model – Microsoft has launched Phi-3-vision, a new 4.2B parameter model that can analyze images and text, making powerful multimodal capabilities available in a smaller, more cost-effective package.
- Robot solves Rubik’s Cube in a record-breaking 0.305 seconds – A robot from Mitsubishi Electric set a new world record for solving a Rubik’s Cube, beating the previous record by 0.07 seconds thanks to its high-speed motors and AI-powered color recognition.
- Apple releases OpenELM, a family of open source AI models – Apple has launched OpenELM (Open-source Efficient Language Models), a family of small models designed to run efficiently on-device rather than in the cloud.
- How to improve GPT-4o’s vision capabilities – An OpenAI cookbook provides practical tips and prompting techniques, such as using “chain of thought” and specifying output formats, to get better and more consistent results from GPT-4o’s vision analysis features.
- Viggle AI lets you control any character with motion videos – A new free AI tool allows users to generate a video of any character performing specific movements by combining a still image of the character with a motion-capture video.
- The full AlphaFold 3 paper is now published in Nature – The research paper detailing DeepMind’s groundbreaking AlphaFold 3 model, which can predict the structure and interactions of nearly all biological molecules, is now officially available.
- GitHub Copilot Workspace enters technical preview – GitHub has launched a preview of Copilot Workspace, a new AI-native environment designed to help developers brainstorm, plan, build, and test code from a project’s initial concept.
Trending AI Tools:
- ChatGPT / GPT-4o – OpenAI’s new flagship model that understands and generates text, audio, and images in real-time, now with a desktop app and free access for all users.
- Google’s Project Astra – A real-time, multimodal AI assistant from Google designed to see, hear, and understand the world around you through a device’s camera.
- Google Gemini 1.5 Pro/Flash – Google’s latest powerful and efficient multimodal AI models, with Gemini 1.5 Pro featuring a massive one million token context window.
- Veo – Google’s most capable text-to-video model designed to generate high-quality, 1080p videos over a minute long in various cinematic styles.
- Imagen 3 – Google’s new text-to-image model that generates photorealistic, lifelike images with a deep understanding of natural language and prompts.
- Poe by Quora – An AI platform and app that provides access to a wide variety of chatbots and models, including GPT-4o, Claude 3, and Llama 3.
- Suno – An AI music and song generator that creates realistic audio, including vocals and instruments, from a simple text prompt.
- Llama 3 – Meta’s latest family of powerful open-source large language models available for developers and researchers.
- Claude 3 – A family of powerful AI models from Anthropic, including the highly capable Opus variant, known for its strong performance and large context windows.
- Viggle AI – A free AI video generator that can animate any character in a photo using a reference motion video, effectively making them dance.
- Hume AI – An AI model with an ‘Empathic Voice Interface’ designed to understand the emotional tone and expression in human speech to have more natural conversations.
- Krea AI – A creative suite of AI tools that can generate high-quality images and videos in real-time as you type or adjust parameters.
- Mindgrasp – An AI learning assistant that instantly creates accurate notes, summaries, and answers questions from any document, video lecture, or recording.
- Scribe – An AI tool that automatically documents your processes and creates step-by-step visual guides by recording your screen.
- Opus Clip – An AI video tool that repurposes a single long video into multiple short, viral-ready clips for social media platforms.
- Durable – An AI-powered website builder that can generate a complete business website with copy, images, and a contact form in under a minute.
- Fireflies.ai – An AI meeting assistant that joins your calls to automatically record, transcribe, summarize, and analyze voice conversations.
- ElevenLabs – A popular AI voice generator used for creating realistic text-to-speech audio and cloning voices for various applications.
- HeyGen – An AI video platform for creating professional videos featuring realistic AI-generated avatars and voiceovers from text.
- Midjourney – A renowned AI image generator famous for producing artistic and high-quality visuals from natural language text prompts.
- Runway – An advanced suite of AI-powered tools for content creation, specializing in video generation and editing.
- Otter.ai – An AI transcription service that provides real-time notes for meetings, interviews, and lectures, complete with summaries and speaker identification.
- Descript – An all-in-one audio and video editor that simplifies editing by allowing you to modify the content by just editing the text transcript.
- Synthesia – An AI video creation platform that enables users to produce studio-quality videos with AI avatars and voiceovers in minutes.
- Notion AI – A set of AI features integrated directly into the Notion workspace to help write, summarize, brainstorm, and organize information.
- Gamma – An AI-powered alternative to traditional slide decks that allows you to generate polished presentations, documents, or webpages from a prompt.
- Veed – An online video editing platform that uses AI to simplify tasks like adding subtitles, removing background noise, and creating professional content.
- Superhuman – A premium, AI-enhanced email client designed to make you faster and more efficient at managing your inbox.
- Lindy – An AI assistant that automates tasks like calendar management, email drafting, and meeting scheduling to save you time.
- Tome – An AI-powered storytelling and presentation tool that helps you create and design compelling narratives from a simple text prompt.
- Chatbase – A tool that allows you to build and train custom AI chatbots using your own data and documents.
- Glean – An AI-powered work assistant that provides enterprise-level search and knowledge discovery across all of a company’s applications.
Sponsors:
- Runhouse – A platform that helps you scale your AI applications without worrying about infrastructure.
- Lemon.io – A marketplace for hiring vetted software engineers.
- Brilliant – An online platform that helps you build your math, data, and computer science skills with fun and interactive courses.
- Masterworks – An award-winning platform for investing in shares of blue-chip art.
- AssemblyAI – A simple API to transcribe and understand speech with superhuman accuracy.
- Setapp for Teams – A service that bundles over 240 Mac and iOS apps into a single subscription to boost team productivity.
- Cohere – A platform that provides access to the world’s most powerful language models to power your applications.
- Beehiiv – An email newsletter platform built for growth.