Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Pika 1.0 – Pika Labs unveiled Pika 1.0, a new text-to-video platform with advanced editing features, and has opened the waitlist for public access.
- Perplexity Pages – Perplexity launched Pages, a new feature that automatically converts search queries or topics into comprehensive, customizable, and well-cited articles or reports.
- Microsoft and G42 invest $1B in Kenya – Microsoft and G42 are co-investing $1 billion to build a geothermal-powered data center in Kenya, marking a major U.S.-backed digital infrastructure initiative in East Africa.
- Microsoft VASA-1 – Microsoft Research introduced VASA-1, a new AI model capable of generating highly realistic talking-head videos from just a single image and an audio file.
- Apple develops ReALM – Apple has developed a new family of small AI models called ReALM that can run on-device and outperform models like GPT-4 at understanding on-screen and conversational context.
- Humane AI Pin reviews are negative – The first reviews for the Humane AI Pin are overwhelmingly negative, with critics panning the device for being slow, unreliable, and ultimately not useful.
- Runway introduces Motion Brush – Runway introduced Motion Brush, a new tool that allows users to add controlled motion to specific parts of a static image to create a video.
- OpenAI appoints new board member – OpenAI has appointed Sue Desmond-Hellmann, the former CEO of the Bill & Melinda Gates Foundation, to its board of directors.
- Google releases Pali-3 vision model – Google released Pali-3, a new, smaller vision-language model that excels at localized visual tasks like object detection and reading text within images.
- GitHub Copilot Workspace – GitHub unveiled Copilot Workspace, a new AI-powered environment that helps developers brainstorm, plan, build, and test code from start to finish using natural language prompts.
- Scale AI raising funds at $13.8B valuation – AI data-labeling company Scale AI is reportedly raising $1 billion in a new funding round that values the company at $13.8 billion.
- xAI seeks funding at $18B valuation – Elon Musk’s AI startup, xAI, is in talks to raise up to $6 billion in a funding round that could value the company at $18 billion.
- US and China to hold AI safety talks – The United States and China are scheduled to hold their first high-level bilateral talks on the risks and safety of artificial intelligence in Geneva.
- OpenAI fires researchers for allegedly leaking info – OpenAI has reportedly fired two researchers, including one with ties to chief scientist Ilya Sutskever, for allegedly leaking information.
- Adept AI raises $150M – AI agent startup Adept is reportedly in the process of raising $150 million from key partners including Microsoft and Nvidia.
- Intel shares new Gaudi 3 AI chip details – Intel shared new performance details for its Gaudi 3 AI chip, claiming it can train certain AI models 1.7 times faster than Nvidia’s H100 GPU.
- Amazon Music launches AI playlist generator – Amazon Music introduced an AI-powered playlist generator named Maestro, which allows users to create custom playlists using conversational prompts and emojis.
- Google’s VLOGGER AI – Google researchers revealed VLOGGER, an AI model that can generate realistic and controllable videos of a person talking and moving from just a single still photograph.
- AI may outperform humans in creativity – A new study published in Scientific Reports found that AI chatbots, specifically ChatGPT-4, can outperform the average human in tests of divergent thinking, a key component of creativity.
- Google’s Lumiere 3D model – Google Research introduced Lumiere 3D, a diffusion model capable of generating dynamic 3D scenes with realistic camera movements from a single text prompt.
- Ideogram improves text-in-image generation – AI image generator Ideogram has released an update to its model that dramatically improves the reliability and coherence of rendering text within images.
- Mistral models available for fine-tuning on AWS – Amazon Web Services announced that developers can now fine-tune Mistral’s open-source models directly within the Amazon SageMaker platform.
- AI helps discover new class of antibiotics – Stanford researchers have utilized an AI model to discover a new class of antibiotics effective against drug-resistant Staphylococcus aureus (MRSA).
- Waymo continues U.S. expansion – Waymo continues its U.S. expansion by opening its fully autonomous ride-hailing service to the public in Los Angeles and Austin, with Atlanta on the map for future service.
- ElevenLabs launches AI song contest – Voice AI company ElevenLabs has launched a contest for its new AI music generation tool, inviting users to create and submit original songs for prizes.
Trending AI Tools:
- OpenAI’s GPT-4o – OpenAI’s new flagship multimodal model that natively processes text, audio, and vision in real-time and is available for free.
- Google’s Veo – Google’s most capable video generation model to date, designed to create high-quality, 1080p videos from text prompts.
- GitHub Copilot Workspace – An AI-native developer environment for brainstorming, planning, building, testing, and running code using natural language.
- Google’s Project Astra – A real-time, multimodal AI assistant from Google that can see, hear, and understand the world around it to be contextually helpful.
- ChatGPT Desktop App – A native macOS application that integrates ChatGPT directly into your desktop workflow for seamless access.
- Google’s Imagen 3 – Google’s highest quality text-to-image model, excelling at photorealism, detail, and understanding complex text prompts.
- Google’s Gemini 1.5 Flash – A lighter, faster, and more cost-efficient version of the Gemini 1.5 Pro model, optimized for high-volume tasks.
- Viggle AI – A video generation tool that can animate a character in a photo to perform a dance move described by a text prompt.
- Reka Core – A powerful, frontier-class multimodal language model from startup Reka that rivals leading models from major labs.
- Suno – An AI music and song generator that creates complete songs, including lyrics and vocals, from a simple text prompt.
- Google’s Genie – A foundation model from Google DeepMind that can generate interactive, playable 2D video games from a single image or text prompt.
- MusicFX – An experimental Google tool for creating music, featuring a “DJ Mode” for generating continuous mixes by combining prompts and genres.
- Trillium – Google’s 6th generation Tensor Processing Unit (TPU), offering a significant improvement in compute performance for AI workloads.
- VLOGGER – A research project from Google that can generate controllable videos of a person speaking and moving from a single still photograph.
- Ploom – A research tool from Google that can animate a single image of a plush toy or stuffed animal, making it walk and move realistically.
- Visual Electric – An AI image generator with a unique “creative direction” feature that helps users refine and guide their image creations.
- Video to Music by RunwayML – A RunwayML tool that automatically generates royalty-free music tailored to the mood, theme, and rhythm of your video clips.
- Udio – A powerful AI music creation tool that generates high-quality, full-length songs with vocals based on text prompts.
- Mindgrasp – An AI learning assistant that instantly creates notes, summaries, flashcards, and answers questions from any document, presentation, or video.
- Opus – An AI-powered video editing tool that automatically repurposes long videos into viral-ready short clips for social media.
- MyShell – A decentralized platform for creating, sharing, and discovering unique AI-native applications and chatbots.
- Eightify – An AI-powered tool that generates quick summaries of YouTube videos, allowing you to grasp the key points without watching the entire video.
- DuckDuckGo AI Chat – A privacy-focused service that provides anonymous access to popular AI chatbots without requiring an account.
- Stable Audio – An AI tool from Stability AI for generating high-quality, royalty-free music and sound effects for creative projects.
- Trieve – An API and set of tools for developers to easily add complex semantic and hybrid search capabilities to their applications.
- Ghostwriter by Replit – An AI-powered coding assistant within the Replit IDE that helps developers write, debug, and refactor code more efficiently.
- AI Ask – A no-code tool that allows you to create a personalized AI chatbot trained on your own website content.
- Chatling – A no-code platform for building and deploying custom AI chatbots trained on your business’s data to handle customer support.
- AeroVene – An AI tool that analyzes and optimizes e-commerce product pages to improve customer engagement and increase sales.
- AI Story Generation by The Neuron – A simple tool from The Neuron newsletter that generates a short, illustrated story from a single-sentence prompt.
- Typeform – An online platform for creating engaging forms and surveys, which has been incorporating AI to enhance its features.
- MusicGen – A music generation model from Meta that creates short musical compositions from text descriptions or melodic inputs.
- Jukebox – An open-source neural net from OpenAI that generates music, including rudimentary singing, as raw audio in various genres and styles.
Sponsors:
- Vercel – An AI SDK and API that make it easy to build and stream generative UI from your chosen LLM.
- Page One – An AI-powered tool that summarizes any article into a one-page read.
- Insight Labs – Provides elite AI/ML teams from the top 1% of talent to supercharge AI startups.
- AE Studio – A development, data science, and design studio that collaborates to build innovative products for startups and enterprises.
- Writer – A full-stack generative AI platform designed for enterprise use.
- Mito – A spreadsheet that automatically writes Python code for you as you edit it.
- Clay – An AI-powered sales platform that fills your pipeline with qualified leads.
- Gladwell & Grant – A law firm offering legal expertise and deep technical knowledge in intellectual property, AI, and emerging technologies.
- Athina AI – An LLM evaluation platform for developers to monitor, evaluate, and improve their LLM applications.