Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- ElevenLabs Previews Text-to-Music Model – Voice AI startup ElevenLabs has unveiled a new text-to-music generation model, allowing users to create full songs with vocals from a simple text prompt.
- OpenAI to Announce AI Search Product on Monday – OpenAI is expected to announce its new AI-powered search product on Monday, May 13th, positioning it as a direct competitor to Google.
- Apple Unveils New iPad Pro with M4 Chip for AI – Apple has unveiled its new M4 chip in the latest iPad Pro, branding it as a ‘powerhouse for AI’ with a significantly faster Neural Engine for on-device AI tasks.
- Humane is Looking for a Buyer – Just weeks after launching its controversial AI Pin, Humane is reportedly seeking a buyer for the company, with a potential price tag between $750 million and $1 billion.
- Biden Administration Appoints Leaders for U.S. AI Safety Institute – The White House has announced key leadership appointments for the U.S. AI Safety Institute, including a director from the NSA and a chief technology officer from Google DeepMind.
- Microsoft’s VASA-1 Creates Realistic Talking Videos from a Single Photo – Microsoft Research has developed VASA-1, an AI model that can create highly realistic, talking-head videos from a single image and an audio clip, featuring synchronized lip movements and naturalistic facial expressions.
- Stack Overflow partners with OpenAI – Stack Overflow has announced a partnership with OpenAI to integrate its knowledge base into ChatGPT, providing verified and accurate technical information to developers.
- Google Builds “Googler” AI Chatbot for Internal Use – Google has reportedly built an internal AI chatbot named “Googler” to help employees code and answer internal questions.
- Midjourney Tests New ‘Describe’ Feature – Midjourney is testing a new
/describefeature that generates four detailed text prompts based on an uploaded image, allowing users to recreate similar styles and compositions. - Alibaba Releases New Qwen1.5-110B Model – Alibaba has released Qwen1.5-110B, a new large language model that ranks among the top performers on the AlpacaEval 2.0 leaderboard.
- US and China to Hold High-Level AI Talks – High-level officials from the U.S. and China are set to hold their first formal talks on artificial intelligence in Geneva to discuss the risks and safety concerns of the technology.
- Wayve raises $1.05B for AI-powered self-driving cars – UK-based self-driving car startup Wayve has raised $1.05 billion in a funding round led by SoftBank to advance its AI-driven autonomous vehicle technology.
- Meta Launches New AI-Powered Ad Tools – Meta is launching new AI-powered tools for advertisers, including features for full image and video creation and text overlays, to enhance campaign creation on its platforms.
- Mistral AI Reportedly Raising $600M at a $6B Valuation – French AI startup Mistral is reportedly in talks to raise $600 million at a $6 billion valuation, with investors like DST, General Catalyst, and Lightspeed Venture Partners expected to participate.
- OpenAI Forms New Team for Safety and Security – OpenAI is forming a new “Safety and Security” team to oversee the security of its models and infrastructure, led by the company’s Head of Security.
- Inside Apple’s Troubled Car Project and its Pivot to AI – A new report details the internal struggles and eventual cancellation of Apple’s decade-long self-driving car project, codenamed “Project Titan,” and the company’s subsequent shift to generative AI.
- Google DeepMind Introduces New AI Red-Teaming Methods – Google DeepMind has introduced new AI-based red-teaming methods that use one language model to find flaws in another, which could improve the safety evaluation process for AI systems.
- Perplexity Launches “Discover” for Related Search Queries – AI search engine Perplexity has launched a “Discover” feature that suggests related search queries on its homepage, helping users explore topics more deeply.
- OpenAI Adds New Data Controls for Business Customers – OpenAI has introduced new data management controls for its ChatGPT Team and Enterprise customers, allowing them to exclude their data from being used to train models.
- ‘AI Steve’ is Running for Parliament in the UK – A businessman from Sussex is running for UK parliament as “AI Steve,” a political avatar that will use AI to interact with constituents and help formulate policies.
- Canva adds new suite of AI-powered design tools – Canva has introduced a suite of new AI-powered design tools for its enterprise customers, including features for generating images, transforming text, and editing videos.
- Spotify tests AI-generated playlists from text prompts – Spotify is reportedly testing a new feature that allows users to create playlists using AI-powered text prompts.
- Dropbox Open-Sources Its Password Manager – Dropbox has open-sourced its password manager, Passwords, allowing developers and companies to build their own secure credential management solutions.
- Character.ai adds voice to all AI characters – The Character.ai platform now allows users to have voice conversations with all of its millions of AI characters.
- UK AI Safety Institute releases ‘Inspect’ model evaluation tool – The UK’s AI Safety Institute has released ‘Inspect,’ an open-source software library for testing the capabilities and safety of AI models.
- How to Run Llama 3 on an M3 MacBook Pro Locally – A step-by-step guide details how to set up and run Meta’s Llama 3 model locally on an M3 MacBook Pro for free.
- GitHub Copilot Workspace Waitlist Opens – GitHub has opened the waitlist for its Copilot Workspace, a new environment that uses AI agents to help developers progress from an idea to a coded solution.
- My experience using Devin at a hackathon – A developer shares their experience using the AI software engineer ‘Devin’ at a hackathon, providing insights into its current capabilities and limitations.
- Rabbit R1 Leaks All Its Keys, Posing a Security Risk – A security researcher discovered that the Rabbit R1’s entire system could be compromised by leaking just one API key, raising significant security questions about the device.
- Microsoft introduces Phi-3-vision multimodal model – Microsoft has released Phi-3-vision, a new 4.2B parameter multimodal model that can reason over images and extract and process text from them.
- Google introduces a new method to understand LLM behavior – Google has introduced a new method that uses linear probes to better understand and predict the internal behavior of large language models.
- The Evolution of RAG: Past, Present, and Future – An article explores the history and future direction of Retrieval-Augmented Generation (RAG), a key technique for making large language models more accurate and up-to-date.
- Open-source AI-powered sales agent – An open-source project called PrivateGPT-Sales has been launched, offering an AI-powered sales agent that can automate outreach and manage prospect information privately.
- AI Tool Helps Identify At-Risk Children – An AI tool developed in New Zealand is helping social workers identify children at high risk of future harm with 76% accuracy.
- LinkedIn launches new AI tool for B2B marketers – LinkedIn has launched an AI-powered tool called “The B2B Edge” to help marketers create better-performing ad copy for their campaigns.
- E2B launches open-source AI agent development platform – E2B has launched an open-source platform that provides a secure cloud environment for developers to build, test, and deploy AI agents.
- Research Paper on Improving LLM-based Web Navigation – A new research paper explores techniques for improving the performance of LLM-based agents in navigating and interacting with websites by using HTML-based models.
- OpenAI spent $930,000 on lobbying in Q1 2024 – OpenAI has disclosed that it spent $930,000 on lobbying the U.S. government in the first quarter of 2024, focusing on regulations and research.
- Create an AI version of yourself with fine-tuning – A tutorial explains how to create a personalized AI chatbot of yourself by fine-tuning a large language model on your own data.
- Scientists Use AI to Analyze Whale Songs – Scientists are using AI to analyze the vocalizations of sperm whales, identifying a complex communication system akin to a “phonetic alphabet.”
- Suno AI CEO Discusses the Future of Music – An interview with the CEO of Suno AI discusses the future of music creation with generative AI and how the company is approaching copyright issues.
- Google’s AI Flood Forecasting Expands to 80 Countries – Google’s AI-powered flood forecasting tools are now operational in 80 countries, providing real-time alerts to help governments and aid organizations save lives.
- How to Build an AI that Earns Money – An essay explores the challenges and potential of creating autonomous AI agents that can generate income by completing tasks online.
- Building a Custom GPT with Actions from Scratch – A tutorial provides a step-by-step guide on how to build a custom GPT that can interact with external APIs using actions.
- Perplexity Pro Users Can Now Add Custom Instructions – Perplexity Pro users can now add custom instructions to their profile to personalize their search experience and get more tailored answers from the AI.
Trending AI Tools:
- GPT-4o – OpenAI’s new flagship “omnimodel” that natively processes and generates text, audio, and images in real-time.
- Claude 3.5 Sonnet – Anthropic’s fastest and most cost-effective model to date, excelling at complex tasks with top-tier vision capabilities.
- Project Astra – Google’s vision for a universal, real-time multimodal AI agent designed to be helpful in everyday life.
- Recall – A new Windows feature that uses AI to create a searchable photographic memory of everything you do on your PC.
- Artifacts – A feature in Claude that generates and displays content like code or website designs in a dedicated window next to the conversation.
- Veo – Google’s most capable text-to-video generation model, designed to create high-quality, 1080p videos over a minute long.
- Imagen 3 – Google’s latest text-to-image model that generates incredibly detailed, photorealistic images with improved text rendering.
- Llama 3 – Meta’s latest family of powerful open-source large language models available for broad use.
- Gemini 1.5 Pro – A powerful, multimodal Google model with a massive one million token context window for processing large amounts of information.
- Suno – An AI music generation tool that creates original songs, complete with vocals and instruments, from a simple text prompt.
- Pika – An AI-powered platform for generating and editing videos from text prompts or still images.
- Perplexity – An AI-powered “answer engine” that provides direct, cited answers to user questions by searching the web.
- GitHub Copilot Workspace – An AI-native developer environment that helps plan, build, and test code from natural language specifications.
- Ideogram – An AI image generator known for its remarkable ability to reliably render coherent text within generated images.
- HeyGen – An AI video platform for creating professional spokesperson videos with realistic avatars and voice cloning.
- Fine – An AI agent designed to automate software development by handling complex programming tasks based on high-level instructions.
- Diagram – An AI design companion that automates various design tasks and generates UI components within Figma.
- ElevenLabs – A leading AI voice synthesis and cloning platform for creating realistic, human-like audio.
- Midjourney – A popular and powerful AI image generator known for producing images with a distinct, artistic aesthetic.
- Opus Clip – An AI tool that automatically turns long videos into short, engaging clips perfectly formatted for social media platforms.
- Firefly – Adobe’s family of creative generative AI models integrated directly into Photoshop, Illustrator, and other Adobe apps.
- Leonardo AI – A comprehensive platform for creating and editing high-quality visual assets, from concept art to game assets, using AI.
- Krea – An AI tool that offers real-time image and video generation and enhancement, allowing for interactive creative control.
- Recraft – A generative AI design tool specialized in creating and editing vector art, illustrations, and 3D graphics with a consistent style.
- Playground – An online AI image creator that combines a powerful editor with various models to generate and refine visual content.
- Superlist – A modern productivity app that combines lists, tasks, and notes with AI-powered features for both individuals and teams.
- Galileo AI – An AI tool that generates editable UI designs for apps and websites from a simple text description.
- Tome – An AI-powered tool for creating entire presentations, documents, and microsites from a single prompt.
- Gamma – An AI presentation maker that generates polished and ready-to-use slides, documents, or webpages in seconds.
- Lindy – An AI assistant that automates administrative tasks like calendar management, email drafting, and meeting scheduling.
- Vapi – A developer platform for building, testing, and deploying realistic, low-latency voice-based AI agents.
- Vizcom – An AI-powered design tool that transforms simple sketches into highly rendered product concepts in seconds.
- Durable – An AI website builder that creates a complete business website with copy, images, and a contact form in under a minute.
- Uizard – An AI-powered design tool that helps users create wireframes, mockups, and prototypes for apps and websites from text prompts.
- Taskade – An AI-powered productivity platform that offers a unified workspace for real-time collaboration on tasks, notes, and projects.
- Brevian – A no-code platform that enables users to build custom AI agents and automate complex business workflows.
- Looka – An AI-powered platform that helps entrepreneurs design a custom logo and build a cohesive brand identity.
- Splash – An AI-powered music platform that allows anyone to create and perform unique songs using generative AI tools.
- Paperclips – An AI-powered desktop organizer that automatically sorts your screenshots and other files.
- SEC Insights – An AI tool designed for searching and analyzing SEC filings to quickly extract key financial data and insights.
- Trillia – An AI-powered search engine built specifically for finance professionals to analyze financial data and documents.
- Mindtrip – An AI travel planner that creates personalized itineraries and provides recommendations based on your preferences.
- Momo – An AI-powered travel planning app that helps users discover destinations and build personalized trip itineraries.
- Replicate – A cloud platform that makes it easy for developers to run and fine-tune open-source machine learning models via an API.
- Talkie – A developer tool for building performant, real-time voice agents that can handle interruptions and complex conversations.
- Chatbot UI – An open-source chat interface that provides a clean frontend for various AI models, including OpenAI and Google models.
- Gems (Google Search) – An AI feature in Google Search that allows users to create customized, shareable search result pages tailored to specific queries.
- Music AI Sandbox (Google) – A suite of experimental AI music generation tools from Google for creating novel sounds and melodies.
- LearnLM (Google) – A new family of AI models from Google that are fine-tuned for learning and educational applications.
- CodeGemma – A family of lightweight, open code models from Google built for code completion and generation tasks.
- PaliGemma – An open, lightweight vision-language model from Google designed for image captioning and visual Q&A.
- Coolicons – A free and open-source collection of icons designed for use in various digital projects.
Sponsors:
- Brave – Brave Leo is a smart AI assistant built right into the Brave browser, offering capabilities like summarizing pages, answering questions, and generating new content.
- Wethos – Wethos’ AI-powered platform helps you scope projects, create proposals, and streamline payments effortlessly.
- Clay – Clay is hosting a free, virtual AI Revenue Summit featuring top speakers to teach you how to use AI to drive revenue.
- Masterworks – Masterworks allows you to invest in blue-chip art from artists like Banksy and Basquiat, which has historically outpaced the S&P 500.
- Gcore – Gcore offers a powerful AI cloud with top NVIDIA GPUs like the H100 and L40S, providing a cost-effective infrastructure for training and inference.
- DoMore.ai – DoMore.ai offers a free newsletter with the world’s best ChatGPT prompts to help you become more productive.
- Guidde – Guidde is an AI-powered tool that helps you create stunning video documentation and how-to guides in seconds.
- Runway – Runway’s Gen-2 is a video generation model that can create compelling videos from simple text prompts or by applying styles to existing videos.
- MarketingProfs – MarketingProfs is a marketing education company that provides training, events, and resources for modern marketers.
- Drift – Drift is a conversational marketing and sales platform that helps businesses connect with buyers in real-time.
- Persado – Persado is an AI platform that generates motivational language for digital marketing to increase customer engagement and conversions.
- HubSpot – HubSpot is a leading CRM platform offering software and support to help businesses grow.
- Sitecore – Sitecore is a digital experience platform that provides content management, commerce, and personalization solutions.
- Jasper – Jasper is an AI content platform that helps teams create high-quality content for marketing, social media, and more.
- Writer – Writer is a full-stack generative AI platform that helps enterprises build AI applications and workflows securely.
- BrandOps – BrandOps is a brand intelligence platform that provides insights into brand performance and competitive landscapes.
- Code3 – Code3 is a performance marketing agency that specializes in driving business results across various digital channels.
- HAIN (Human-Aware AI Network) – The Human-Aware AI Network (HAIN) focuses on building and promoting AI that is ethical and human-centric.
- LTIMindtree – LTIMindtree is a global technology consulting and digital solutions company helping enterprises accelerate digital transformation.
- Notified – Notified is a communications cloud for public relations, investor relations, and marketing professionals to manage and measure their strategies.
- SOCi – SOCi is a marketing platform for multi-location brands, helping to manage social media, reviews, and local listings at scale.
- Tiled – Tiled is an interactive content platform that allows users to create engaging microapp experiences without code.
- Content at Scale – Content at Scale is an AI writing platform designed to produce long-form, SEO-optimized blog posts that rank.
- Regie.ai – Regie.ai is an AI-powered sales engagement platform that helps teams create and manage personalized sales campaigns.
- Truescope – Truescope is a media intelligence company providing insights and analytics from media coverage.
- Venngage – Venngage is an online tool for creating infographics, presentations, reports, and other visual designs.