Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- OpenAI launched GPT-4o – a new flagship “omnimodel” that natively processes text, audio, and vision in real-time and is being rolled out to all users for free.
- Google announced a suite of AI updates at its I/O conference – including Project Astra for real-time multimodal assistance, the faster Gemini 1.5 Flash model, and new generative media tools.
- Scarlett Johansson stated OpenAI’s ‘Sky’ voice assistant sounds “eerily similar” to her own – after she had previously declined an offer to voice the system, prompting OpenAI to pause its use.
- Jan Leike announced he is joining Anthropic – after resigning as co-lead of OpenAI’s superalignment team over safety concerns, he will continue his work at the rival AI company.
- Google is rolling out AI Overviews to its search results – which are AI-generated summaries at the top of results for all users in the US, with plans for a global expansion.
- Apple is reportedly finalizing an agreement with OpenAI – to integrate ChatGPT features into the upcoming iOS 18.
- The European Union has given final approval to its landmark AI Act – establishing the world’s first comprehensive legal framework for artificial intelligence.
- Google introduced Veo – its most advanced text-to-video generation model, designed to compete with OpenAI’s Sora by creating high-quality, 1080p videos over a minute long.
- Microsoft announced “Copilot+ PCs” – a new category of Windows computers with powerful neural processing units designed to run AI tasks locally and efficiently.
- Meta introduced Chameleon – a new family of early-stage multimodal models that can understand and generate interleaved images and text in any sequence.
- Nvidia announced blowout quarterly earnings – with revenue soaring 262% to $26 billion, driven by massive demand for its AI chips, and also announced a 10-for-1 stock split.
- Perplexity launched Pages – a new feature that can convert prompts or research into comprehensive, well-structured articles or reports.
- Google unveiled Imagen 3 – its most advanced text-to-image model yet, which offers improved photorealism, detail, and a better understanding of natural language prompts.
- OpenAI has partnered with Reddit – to bring its content to ChatGPT, allowing the chatbot to learn from and display information from Reddit discussions.
- Krea AI launched a new real-time video tool – that allows users to generate and enhance videos from text prompts or by uploading source videos.
- AI music generator Suno released version 3.5 – which is now available to all users and features improved audio quality and longer song extensions up to four minutes.
- Pika Labs added a new sound effects feature – which can automatically generate sound effects to match the content of user-created videos.
- Midjourney introduced a new `–sref random` command – that allows users to apply a random, unexpected style to their image generations.
- Amazon implemented a new AI feature for reviews – which summarizes thousands of product reviews into a short paragraph to help customers quickly understand user sentiment.
- Cognition AI released a technical blog post on Devin – detailing how its AI software engineer achieves a high score on the SWE-bench benchmark.
- Mistral AI launched a new self-service platform – called la Plateforme, released new embedding models, and made its models generally available on Microsoft Azure.
- Cleanlab has raised $25 million in Series A funding – a startup focused on data-centric AI that automatically finds and fixes issues in datasets.
- Google announced Trillium, its 6th generation TPU – which delivers a 4.7x improvement in compute performance per chip over the previous generation.
- The SEC is reportedly scrutinizing OpenAI investors – as part of its investigation into CEO Sam Altman’s brief ouster and reinstatement.
- Galileo launched Luna – a new platform for enterprises to build and evaluate trustworthy conversational AI applications by generating high-quality training data.
- Researchers proposed ‘Principle-Driven Self-Alignment’ – a method for AI alignment that uses a few human-defined principles to guide a model’s behavior.
- A new study reveals LLMs are more persuasive than humans – showing that models like GPT-4 are more effective at persuading people in debates.
- Modal Labs has raised $16 million in a Series A – a platform that helps developers run generative AI models without managing complex infrastructure.
Trending AI Tools:
- Llama 3 – Meta’s latest family of open-source large language models, designed to be a top-performing and widely accessible foundation for AI applications.
- Meta AI – A new AI assistant from Meta, powered by Llama 3, that is integrated across Facebook, Instagram, WhatsApp, and Messenger.
- Viggle – An AI tool that can animate any static character image to move and dance based on a reference video clip.
- Ideogram 1.0 – A text-to-image model that excels at generating images with coherent and readable text.
- Luma Dream Machine – A new text-to-video generation model known for creating high-quality, realistic, and fluid video clips from prompts.
- GitHub Copilot Workspace – An AI-native development environment that helps developers go from an idea to a full coding plan and implementation using natural language.
- Cognition AI – Devin – An AI software engineer designed to autonomously handle entire development projects from a single prompt.
- Stable Diffusion 3 – The latest version of Stability AI’s text-to-image model, featuring significant improvements in typography and multi-subject prompts.
- Google VLOGGER – A Google AI model that can create realistic, controllable videos of people talking and gesturing from a single still image.
- Pika – An AI platform for generating and editing high-quality videos from text prompts and still images.
- Freepik Pikaso – A real-time sketch-to-image AI tool that transforms simple drawings and text prompts into detailed images instantly.
- Emu Video & Emu Edit – Meta’s new models for high-quality text-to-video generation and precise video editing based on text and image instructions.
- Snowflake Cortex – A fully managed service enabling businesses to build AI applications using their own enterprise data within the Snowflake ecosystem.
- Dora – An AI-powered website builder that can generate professional, animated, and 3D websites from a single text prompt.
- NVIDIA Chat with RTX – A tech demo that lets you run a personalized AI chatbot on your local PC, using your own documents and files as its knowledge base.
- Opus Clip – An AI tool that automatically turns long videos into short, shareable clips designed for social media platforms.
- ElevenLabs Dubbing Studio – A tool for automatically translating and dubbing videos into different languages while preserving the original speaker’s voice.
- AlphaCodium – An AI code generation tool that improves accuracy by using a test-based, multi-hypothesis approach for solving coding problems.
- Leonardo AI – Phoenix – A new image generation model from Leonardo AI that excels at photorealism and following complex user prompts.
- Krea AI – A suite of AI tools for creatives that offers real-time image generation, upscaling, and enhancement.
- Guidde – A platform that helps create video documentation and how-to guides with AI-generated voiceovers and step-by-step instructions.
- Recap – An AI meeting tool that transcribes, summarizes, and extracts key insights and action items from your calls.
- PlayHT – An AI voice generator that creates realistic text-to-speech audio and can clone voices for various applications.
- FineShare – An AI-powered online voice generator for creating high-quality voiceovers and dubbing for videos and other content.
Sponsors:
- DeepLocal – DeepLocal provides expert AI product management, design, and engineering services to help teams build and deploy impactful products faster.
- Brilliant – Brilliant helps you build quantitative skills in math, science, and computer science with fun, interactive lessons.
- Text-Em-All – Text-Em-All provides reliable mass texting and automated calling services for businesses.
- Rows – Rows is a modern spreadsheet that integrates data from any source and uses AI to help with analysis and creating interactive reports.
- Masterworks – Masterworks is an exclusive platform that allows you to invest in shares of multi-million dollar paintings by artists like Banksy and Basquiat.
- AE Studio – AE Studio is a development and data science studio that helps businesses build and ship their big ideas.
- Responsible AI Institute – The Responsible AI Institute is hosting a workshop in NYC to help you prepare for AI regulations and build customer trust through responsible AI systems.