Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Google’s VLOGGER AI can create a talking, moving avatar from a single photo – Google Research introduced VLOGGER, an AI model that can generate a controllable, talking, and moving 3D avatar from a single still photograph and an audio clip.
- Tim Cook confirms Apple’s AI is coming this fall – Apple CEO Tim Cook confirmed that the company will reveal its plans for generative AI later this year, stating it will “break new ground.”
- Nvidia’s revenue skyrockets 265% on AI chip demand – Nvidia announced quarterly revenue of $22.1 billion, a 265% increase from a year ago, as demand for its AI chips continues to surge.
- Pika adds AI-powered sound effects to its video generator – AI video platform Pika has launched a new feature that automatically adds sound effects to generated videos based on the content or a user’s text prompt.
- Microsoft partners with and invests in French AI startup Mistral – Microsoft announced a multi-year partnership with French AI company Mistral, making Mistral’s models available on the Azure cloud platform and investing in the startup.
- Figure AI raises $675M from Jeff Bezos, Nvidia, Microsoft, and OpenAI – Humanoid robotics startup Figure AI announced a $675 million funding round from a coalition of top tech companies to accelerate the development of its AI-powered robots.
- Stability AI releases research paper for Stable Diffusion 3 – Stability AI published the technical details of its upcoming Stable Diffusion 3 model, which uses a new architecture to significantly improve its ability to follow complex text prompts and generate higher quality images.
- Google’s Genie AI can create playable games from a single prompt – Google DeepMind introduced Genie, a new AI that can generate an endless variety of interactive, playable 2D worlds from a single text or image prompt.
- GitHub Copilot Enterprise is now generally available – GitHub has officially launched Copilot Enterprise for $39 per user per month, allowing companies to customize the AI coding assistant with their own private codebase.
- What’s the difference between an AI PC and a regular PC? – The Verge published a guide explaining what an “AI PC” is, how it differs from a traditional computer, and whether the new hardware is worth the upgrade.
- EU lawmakers give final approval to landmark AI rules – The European Parliament has passed the AI Act, creating the world’s first comprehensive legal framework for artificial intelligence.
- Elon Musk to open-source Grok chatbot – Elon Musk announced that his company xAI will open-source its Grok AI model, a move that comes shortly after he sued OpenAI for allegedly abandoning its own open-source mission.
- Hugging Face and Google Cloud announce major partnership expansion – Hugging Face and Google Cloud are deepening their partnership, allowing developers to train and deploy open models from Hugging Face directly on Google’s infrastructure with one click.
- Microsoft introduces ScreenAgent, an AI that can navigate user interfaces – Microsoft Research unveiled ScreenAgent, a vision-language model designed to understand and interact with graphical user interfaces to automate complex tasks based on simple instructions.
- Klarna’s AI assistant is doing the work of 700 agents – Klarna revealed its new AI-powered customer service assistant handled 2.3 million conversations in its first month, performing the equivalent work of 700 full-time human agents.
- ElevenLabs launches a new model for generating text-to-sound effects – AI voice company ElevenLabs has released a new tool that can generate sound effects, short instrumental tracks, and soundscapes from text prompts.
- Perplexity launches a ‘Discover’ feed for trending topics – AI search engine Perplexity has introduced a new Discover feed that provides users with a curated stream of popular and interesting questions being asked on the platform.
- AI is used to create a ‘new’ Nirvana song – The “Lost Tapes of the 27 Club” project used AI to analyze the music of Kurt Cobain and create a new song in the style of Nirvana, titled “Drowned in the Sun,” to raise awareness for mental health.
- Stanford unveils Mobile ALOHA 2, a robot that learns by watching humans – Researchers at Stanford University have developed Mobile ALOHA 2, a dual-arm mobile robot system that can be trained to perform complex tasks like cooking and cleaning by imitating human actions.
- AI software engineering startup Magic AI raises $117 million – Magic, a startup aiming to create an AI software engineer, has raised $117 million in a new funding round with backing from CapitalG, Nat Friedman, and Daniel Gross.
- A deep dive into how prompt injection attacks work – A comprehensive guide explains the vulnerabilities of prompt injection, a method used to trick large language models into ignoring their original instructions and following malicious ones instead.
- Google DeepMind announces SynthID for watermarking AI videos – Google has expanded its SynthID watermarking tool to support AI-generated videos, embedding an invisible marker to help identify synthetic content and combat misinformation.
- Animate Anyone can create realistic character animations from a single image – Researchers from Alibaba developed Animate Anyone, a new AI framework that can bring a still image of a person to life by applying a sequence of movements to it.
- Nomic AI launches Atlas to visualize unstructured data – Nomic AI has released Atlas, a tool that helps developers and enterprises interact with and visualize large, unstructured datasets of text and images.
Trending AI Tools:
- Meta AI (Llama 3) – Meta’s new open-source large language model and the AI assistant it powers, integrated across Facebook, Instagram, and WhatsApp.
- Reka Core – A powerful, next-generation multimodal model that understands text, images, video, and audio, competing with leading proprietary models.
- Adobe Premiere Pro (AI features) – Professional video editing software that is integrating new generative AI features for object manipulation and shot extension.
- Devin – An AI software engineer designed to autonomously handle and complete complex development projects from a simple prompt.
- Viggle AI – A free-to-try tool that animates a character from a still photo according to a specified motion prompt video.
- Mindtrip – An AI-powered travel agent that creates personalized itineraries and assists with booking flights and accommodations.
- Replicate – A platform that allows developers to run and fine-tune open-source models like Llama 3 through a cloud API.
- Udio – An AI music generation tool for creating high-quality, full-length songs with vocals from simple text prompts.
- Luma Labs – Dream Machine – An upcoming and highly anticipated text-to-video model designed to create high-quality, realistic video clips.
- Pika – An AI video generator that now allows users to automatically add a wide range of sound effects to their creations.
- Clipse – An AI tool that automatically analyzes long-form videos to identify the most viral-worthy moments and edits them into short clips.
- Magnific – An AI image upscaler and enhancer that has added a new feature to maintain character and style consistency across multiple images.
- HeyGen – An AI video platform for creating realistic avatar videos, now with enhanced features for adding pauses and controlling speech pace.
- Jellypod – A service that turns your selected online articles, newsletters, and documents into a personalized, ready-to-listen podcast.
- Superhuman – An AI-powered email client designed to help users process their inbox faster with features like summarization and automated sorting.
- Groq – A cloud platform providing developers with access to language models at exceptionally high speeds for real-time applications.
- Magic – A new diffusion-based model for code generation that aims to improve performance and reasoning on complex programming tasks.
- Chisel – An open-source tool that generates live, editable React and HTML/CSS UI components from text prompts or images.
- Defog – A tool that connects to your database and converts plain English questions into optimized SQL queries to retrieve data.
- Sunburst – A Chrome extension that provides quick access to various AI models like GPT-4 and Claude from any webpage.
- Storia – A platform that uses AI to help users create, illustrate, and customize personalized children’s storybooks.
- Promptly – A no-code platform that enables users to build and launch their own AI applications, chatbots, and agents without writing code.
- Openlayer – A platform for developers to test and evaluate their LLM applications to identify and fix model failures before production.
- Hugging Chat – A web interface from Hugging Face that allows you to chat with and test various open-source language models, including Llama 3.
- AI Story Generate – An AI-powered platform with tools for creating characters, plots, and worlds to help you write compelling stories.
- Open LLaMA 3 Chat – An open-source, locally runnable user interface for chatting with Meta’s Llama 3 model.
Sponsors:
- Brave – Brave Leo is a private, browser-native AI assistant that can summarize pages, answer questions, translate, write code, and more.
- AE Studio – AE Studio helps founders and executives build custom software and AI solutions.
- Mastering YouTube – Mastering YouTube is a live course that teaches you how to grow a YouTube channel from 0 to over 1 million subscribers.
- CrowdStrike – CrowdStrike’s Fal.Con is an annual cybersecurity conference focused on how leading companies prevent breaches.
- VAST Data – VAST Data provides insights on how enterprises can overcome challenges with AI adoption and leverage it for success.
- Masterworks – Masterworks is a platform for investing in blue-chip art from renowned artists like Banksy and Basquiat.
- Amplifyer – Amplifyer provides a free directory of AI tools to help businesses find the right AI solutions.
- MeetGeek – MeetGeek is an AI meeting assistant that records, transcribes, summarizes, and shares key highlights from your meetings.
- Guidde – Guidde is a platform that uses AI to create video documentation for explaining complex tasks.