Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Mistral launches Codestral and new models – Mistral has released Codestral, a 22-billion-parameter model specifically for code generation, alongside new embedding models and a partnership with Microsoft.
- OpenAI forms new safety committee – OpenAI has formed a new Safety and Security Committee, led by CEO Sam Altman, to oversee the development and safety of its future AI models.
- Scarlett Johansson’s lawyers demand answers from OpenAI – Lawyers for Scarlett Johansson are demanding an explanation from OpenAI about how its ‘Sky’ voice was created after she declined an offer to voice the assistant.
- Microsoft’s new Recall feature sparks privacy concerns – Microsoft’s new ‘Recall’ feature for Copilot+ PCs is facing major privacy and security criticism for its practice of constantly taking screenshots of user activity.
- Perplexity launches Pages to turn prompts into articles – AI search engine Perplexity has launched ‘Pages,’ a new feature that generates comprehensive, well-structured articles and reports from a single prompt.
- Elon Musk’s xAI raises $6 billion – Elon Musk’s AI startup, xAI, has raised $6 billion in a Series B funding round to build out its infrastructure and accelerate product development.
- NVIDIA stock surpasses $1,000 after blowout earnings report – NVIDIA reported a massive 262% revenue increase year-over-year, causing its stock to surge past the $1,000 mark and announcing a 10-for-1 stock split.
- Google’s AI Overviews generate bizarre answers – Google’s new AI Overviews in search are providing dangerously incorrect and strange answers, such as advising users to add glue to pizza.
- AI agent startup Adept secures over $100M in new funding – Adept AI, a startup building an AI agent that can automate software tasks, has reportedly secured over $100 million in a new funding round.
- Humane reportedly seeking a buyer for its AI Pin business – Following poor reviews of its AI Pin, hardware startup Humane is reportedly exploring a sale of the company for an asking price between $750 million and $1 billion.
- Google Search ranking algorithm details leaked – A massive leak of over 2,500 internal Google documents has revealed previously unknown details about how its search engine ranks content, some of which contradict its public statements.
- Microsoft releases Phi-3-vision, a small multimodal model – Microsoft has launched Phi-3-vision, a powerful 4.2B parameter multimodal model that can analyze images and text and is small enough to run on a mobile device.
- Google unveils Veo, its most advanced text-to-video model – Google introduced Veo, a new text-to-video model designed to compete with OpenAI’s Sora, capable of generating high-quality, minute-long videos in 1080p.
- ElevenLabs launches text-to-sound-effects generator – Voice AI startup ElevenLabs has released a new tool that can generate sound effects, short instrumental tracks, and soundscapes from text prompts.
- GitHub announces Copilot Workspace for AI-native development – GitHub has unveiled Copilot Workspace, a new environment that uses AI agents to help developers brainstorm, plan, code, and test projects using natural language.
- Pika Labs adds sound effects to its AI video generator – Pika Labs has integrated an AI-powered sound effects generator into its platform, allowing users to add audio to their video creations with text prompts.
- Suno releases v3.5 model for improved AI music generation – AI music creation platform Suno has launched its v3.5 model, featuring improved audio quality and the ability to generate songs up to four minutes long.
- Klarna’s AI assistant is now doing the work of 700 agents – Fintech company Klarna reports its AI assistant handles two-thirds of all customer service chats, maintaining satisfaction scores on par with human agents.
- Apple reportedly finalizing deal to add ChatGPT to iOS 18 – Apple is reportedly closing in on a deal with OpenAI to integrate ChatGPT features directly into the iPhone’s next operating system, iOS 18.
- Researchers use an LLM to control nuclear fusion plasma – Scientists at Princeton have successfully used a large language model to control the unstable plasma inside a nuclear fusion reactor, a significant step in developing clean energy.
- Microsoft forms new consumer AI team led by Inflection AI co-founders – Microsoft has created a new group called Microsoft AI to oversee its consumer products like Copilot and Edge, led by Mustafa Suleyman and Karén Simonyan.
- AI model discovers a new class of antibiotics – Researchers have used a deep learning model to discover a new compound capable of killing a deadly, drug-resistant superbug.
- Reka launches Reka Core, a powerful new multimodal model – AI startup Reka has released Reka Core, its new flagship multimodal language model that shows competitive performance against industry leaders like GPT-4 and Claude 3 Opus.
- Midjourney updates /describe command for style consistency – Midjourney has enhanced its
/describefeature to provide style suggestions from an uploaded image, making it easier to create new images with a consistent aesthetic. - Legal AI startup Harvey raises $80 million Series B – Harvey, a company building generative AI platforms for law firms and professional services, has raised $80 million in a new funding round led by Kleiner Perkins.
- The AI revolution is accelerating drug discovery – Pharmaceutical giants like Eli Lilly and Merck are using generative AI to design novel molecules from scratch, potentially cutting years off the traditional drug development process.
- Aizip raises $21M to build ‘tiny’ AI models for edge devices – AI startup Aizip has raised $21 million to develop small, efficient AI models for applications in the Internet of Things (IoT) and edge computing.
- Rabbit reports it has sold 100,000 R1 devices – Despite mixed reviews, Rabbit’s CEO announced the company has sold 100,000 units of its R1 AI-powered gadget, doubling initial projections.
- Getty Images and NVIDIA partner on a commercially safe AI image generator – Getty Images has launched a new AI image generator trained exclusively on its licensed stock photo library, offering full copyright indemnification for the content produced.
- How an author used AI to uncover a complex fraud – A writer details their experience using AI tools to analyze documents and identify inconsistencies that exposed a con artist’s elaborate deception.
- Survey reveals trends in AI engineering jobs and tools – A new survey from Section and Latent Space highlights key trends among AI engineers, including top salaries, preferred GPU providers, and the most-used frameworks.
- Unstructured open-sources its core data transformation platform – Unstructured has open-sourced its core platform for processing complex unstructured data, making it easier for developers to build enterprise-grade RAG applications.
- Mindtrip is a new AI-powered travel planner – A new tool called Mindtrip uses AI to create personalized travel itineraries based on user preferences, including flights, hotels, and activities.
Trending AI Tools:
- GPT-4o – OpenAI’s new flagship model that accepts text, audio, and image as input and produces text, audio, and image output.
- Veo – Google’s most advanced text-to-video model, designed to create high-quality, realistic, and consistent video clips from prompts.
- Imagen 3 – Google’s latest text-to-image model, offering incredible detail, photorealism, and accurate text rendering in generated images.
- Gems – A feature that allows users to create custom, specialized versions of the Gemini model for specific tasks or topics.
- Project Astra – Google’s vision for a universal AI assistant that can understand and respond to the world in real-time through continuous video and audio input.
- Viggle AI – A tool for generating character-consistent videos by animating a static image according to a motion prompt video.
- Story Diffusion – An open-source model designed for generating a series of consistent images and videos from a story description.
- Gemini 1.5 Flash – A lightweight, fast, and cost-efficient version of Google’s Gemini 1.5 Pro model, optimized for high-volume, high-frequency tasks.
- Opus Clip – An AI tool that automatically repurposes long videos into polished, viral short clips for social media platforms.
- Functionary V2 – An open-source large language model that excels at reliably calling functions and using external tools.
- Pika – A video generation platform that can create and edit videos from text prompts, images, or existing video clips.
- Music AI Sandbox – A suite of AI-powered music creation tools from Google designed to help artists generate new musical ideas.
- Kindred Tales – An AI-powered service that helps you create personalized, illustrated storybooks for children.
- Magic Loops – A platform that allows you to build internal tools and automate workflows using natural language prompts.
- Trieve – An open-source search infrastructure built for RAG applications, combining traditional search with vector search capabilities.
- GPT-4o Desktop App – A native macOS application that provides seamless access to ChatGPT and GPT-4o directly from your desktop.
- Runway – A comprehensive creative suite with AI magic tools for video editing, generation, and special effects.
- ElevenLabs – A voice AI research and deployment company providing realistic text-to-speech and voice cloning capabilities.
- Midjourney – An AI image generator known for creating highly artistic and stylized images from text prompts via Discord.
- UFO – A UI-focused agent from Microsoft that can automate tasks across different applications on the Windows operating system.
- Aerochain – An open-source framework for building and deploying Retrieval Augmented Generation (RAG) pipelines efficiently.
- Majax – A framework for building and training large language models on JAX, used for creating modern web applications.
- Reflexion – An open-source framework for building agents that can learn from past failures to improve their performance over time.
- Submagic – An AI tool for creators to generate trendy captions, B-roll, transitions, and sound effects for short-form videos.
- Google AI Studio – A web-based developer tool for prototyping and running prompts with Google’s latest Gemini models.
- Suno – An AI music creation tool that generates songs, complete with vocals and instrumentation, from a simple text prompt.
- Luma Labs – An AI research company specializing in 3D capture and text-to-video generation.
- Kling – A text-to-video model from Kuaishou capable of producing high-quality 1080p video up to two minutes long.
- Krea – A creative suite that uses real-time AI to generate and enhance images, patterns, and 3D textures.
- Gamma – An AI-powered tool that allows you to create engaging presentations, documents, and webpages from a simple prompt.
- HeyGen – A video platform that uses generative AI to create studio-quality videos with AI avatars and voice cloning.
- Descript – An all-in-one editor that makes audio and video editing as simple as editing a text document.
- Synthesia – An AI video creation platform for enterprises that turns text into professional videos with AI avatars in minutes.
- Captions – A mobile app that uses AI to automate video editing for creators, including adding dynamic captions, removing filler words, and more.
- Talknotes 2.0 – An app that transforms your rambling voice memos into clearly summarized notes, transcripts, and action items.
- Audio-to-Video by Petals AI – A tool that generates compelling videos from an audio input file.
- Video-to-Video by Fal AI – An API that provides a simple way to transform existing videos into new styles using text prompts.
- Vidyo.ai – An AI video repurposing tool that helps creators turn long-form content into short, shareable clips for social media.
- Vizard – A tool that helps marketers and content teams repurpose webinars, conference recordings, and interviews into clips.
- Pictory – An AI video generator that enables you to create and edit professional-quality videos using text, ideal for content marketing.
- Stico – A single, unified model that can generate images directly from speech without needing a text transcription step.
- Wondercraft – An AI-powered audio studio that makes it easy to create podcasts, ads, and audiobooks with professional voice cloning.
- Veed – An online video suite with a wide range of AI-powered tools for professional editing, recording, and hosting.
- Learn LM – An interactive visual explainer designed to help people understand the core concepts behind how language models work.
- MusicFX – A Google AI experiment that lets you create music by describing the sound or mood you’re looking for.
- Openlayer – An evaluation platform for large language model applications to help developers identify and fix model failures.
- Parea AI – A developer-focused platform for debugging, testing, and monitoring large language model applications.
- Trillium – Google’s 6th generation of its custom AI accelerator, the Tensor Processing Unit (TPU), designed for high-performance model training.
- Haiper – A text-to-video generation tool focused on creating high-quality and creative video content.
Sponsors:
- Vultr – Provides optimized cloud compute solutions, including cloud GPUs, for building, testing, and deploying AI models.
- RunPod – Offers a GPU cloud platform designed for building, training, and scaling AI applications.
- The AI Solopreneur – A newsletter that teaches how to leverage AI to launch and grow a one-person business.
- AE Studio – A development, data science, and design studio that helps companies build and implement custom AI solutions.
- Midjourney Mastery – A best-selling Udemy course that teaches you how to create professional AI art.
- Iterate – A platform that helps you get insights from your users with AI-powered surveys and analysis.
- AssemblyAI – Provides AI models for speech-to-text, summarization, and understanding audio and video data through a simple API.
- Opus Clip – An AI-powered tool that repurposes long videos into short, viral clips for social media with a single click.
- Taplio – An all-in-one tool for growing a personal brand and generating business opportunities on LinkedIn using AI.
- Tweet Hunter – An all-in-one tool designed to help you grow your audience and monetize your presence on X (formerly Twitter).
- Rows – A modern spreadsheet application that allows you to import data from various tools and includes an AI Analyst feature to help analyze your data.
- Growth School – Offers an AI & ChatGPT for work workshop designed to help professionals become more productive.
- Masterworks – A platform that allows you to invest in shares of multi-million dollar art by artists like Banksy and Basquiat.
- beehiiv – A newsletter platform built for growth, helping creators launch, grow, and monetize their publications.