Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Meta Releases Llama 3.1 – Meta released its new Llama 3.1 family of models, including a powerful 405B parameter version that is the largest open model available and rivals GPT-4o in performance.
- Apple Delays AI Features in Europe – Apple announced it will delay the launch of its new AI features, including Apple Intelligence, in the European Union this year due to regulatory uncertainties with the Digital Markets Act.
- Perplexity AI Launches Pages – Perplexity AI launched Pages, a new feature that instantly converts search queries or prompts into comprehensive, shareable articles with citations.
- Google DeepMind Unveils Kling – Google DeepMind unveiled Kling, a new text-to-video AI model capable of generating high-quality, physically plausible 1080p videos up to two minutes long.
- OpenAI Acquires Rockset – OpenAI acquired Rockset, a real-time analytics database company, to enhance the retrieval and indexing infrastructure for its products.
- Stanford Study Finds LLMs More Persuasive Than Humans – A new Stanford study found that large language models like GPT-4 are significantly more persuasive than humans, winning debates against them 81.7% of the time.
- Figure’s Humanoid Robots Begin Work at BMW – Figure’s humanoid robots have begun autonomous work at a BMW manufacturing plant in the US, marking a milestone for AI robotics in real-world commercial environments.
- Pika Labs Introduces “Modify Region” Feature – Pika Labs introduced a “Modify Region” feature that allows users to select an area in a generated video and change it using a text prompt.
- Scale AI Raising $1 Billion – Data labeling and AI infrastructure company Scale AI is raising $1 billion in a new funding round that values the company at $13.8 billion.
- ElevenLabs Launches AI Sound Effects Generator – ElevenLabs launched a new AI model that can generate a wide range of sound effects from simple text descriptions.
- Microsoft Research Reveals VASA-1 – Microsoft Research revealed VASA-1, an AI model that can create highly realistic talking head videos from a single portrait image and an audio speech file.
- Unitree Robotics Launches G1 Humanoid Agent – Unitree Robotics launched the G1, a highly agile and human-like agent AI avatar that costs just $16,000.
- Mistral Considers Creating U.S. Entity – French AI company Mistral is reportedly considering the creation of a U.S. entity to better serve American customers and navigate the political landscape.
- Hugging Face and Pollen Mobile Partner on Decentralized AI – Hugging Face and Pollen Mobile are partnering to create a decentralized AI network by providing GPU access to Pollen’s network of mobile hotspots.
- Amazon Rolls Out AI Shopping Assistant Rufus – Amazon is now rolling out its AI-powered shopping assistant, Rufus, to all customers on its U.S. mobile app.
- UK AI Safety Institute Opens San Francisco Office – The UK’s AI Safety Institute is opening its first overseas office in San Francisco to collaborate with US AI labs and government agencies.
- Wayve Raises $1.05 Billion for Self-Driving Tech – Autonomous driving startup Wayve raised $1.05 billion from investors including SoftBank, Nvidia, and Microsoft to develop its AI-powered self-driving technology.
- Snowflake Announces New AI Features – Snowflake announced several new AI features, including an AI-powered SQL editor and an observability tool, to help businesses build AI applications on their data.
- Legal AI Startup Harvey Raises $80 Million – Legal AI startup Harvey raised $80 million in a Series B funding round, valuing the company at $715 million.
- Google Research Releases AMIE for Medical Conversations – Google Research released AMIE (Articulate Medical Intelligence Explorer), a research AI system designed for diagnostic conversations that demonstrated expertise and empathy in a study.
- Godot Game Engine Integrates AI Features – Godot, a popular open-source game engine, is integrating AI features including a text-to-texture tool and an AI-powered chatbot for asking coding questions.
- Google Search Tests “Notes” on Results – Google Search is testing a new feature that allows users to add their own notes directly onto search results.
- AI Music Generator Suno Raises $125 Million – AI music generator Suno raised $125 million in a new funding round to expand its team and develop its technology further.
- Claude 3.5 Sonnet Now Available on Google Cloud’s Vertex AI – Claude 3.5 Sonnet, Anthropic’s fastest and most affordable new model, is now generally available on Google Cloud’s Vertex AI.
- Reka Upgrades ‘Core’ Model with Multimodality – AI startup Reka has upgraded its ‘Core’ model with multimodality, allowing it to process and understand images, videos, and audio in addition to text.
- GitHub Introduces Copilot for Azure – GitHub introduced Copilot for Azure, a new feature that allows developers to use natural language to ask questions and manage resources in Microsoft’s cloud platform.
- Google Releases Responsible AI Toolkit – Google released the Responsible AI Toolkit, a set of tools to help developers evaluate and improve the fairness and safety of AI models built on structured data.
- AI Solves International Math Olympiad Problems – A research paper shows that AI models can be trained to solve complex international math olympiad geometry problems at a level approaching human gold medalists.
- Researchers Discover “Cramming” in LLMs – Researchers found that Large Language Models can suffer from a phenomenon called “cramming,” where they prioritize information from the beginning and end of their context window.
- Replicate Launches Árbol Spanish Language Model – Replicate launched Árbol, a new open-source 7B parameter Spanish language model trained from scratch on a large dataset of Spanish text.
- ‘Anything to Lottie’ Converts Media to Animation – A new open-source tool called ‘Anything to Lottie’ uses AI to convert images, GIFs, and videos into lightweight Lottie animations.
- AI Generates Full-Length Beatles-Style Song – You can now listen to a full-length song in the style of The Beatles, generated entirely by an AI called Udio after being prompted with “Here Comes the Sun”.
- Spear AI Turns API Docs into SDKs – Spear AI is a new tool that can automatically turn API documentation into fully functional and type-safe SDKs in minutes.
Trending AI Tools:
- GitHub Copilot Workspace – An AI-native developer environment for brainstorming, planning, building, and testing code from start to finish.
- Stable Audio 2.0 – A model from Stability AI for generating high-quality, full-length music tracks with coherent structure from text prompts.
- Command R+ – A powerful, scalable large language model from Cohere designed for real-world enterprise use cases and RAG.
- FigJam AI – An AI assistant integrated into Figma’s online whiteboard to help generate, summarize, and organize ideas.
- Ideogram 1.0 – A text-to-image model that excels at reliably generating images with text and typography.
- Grok-1.5 – xAI’s latest model with improved reasoning capabilities, multimodal understanding, and a 128K token context length.
- WizardLM-2 – A new family of state-of-the-art open-source large language models from Microsoft, excelling in complex chat and multilingual tasks.
- Suno – An AI music and song generator that creates realistic audio, including vocals, from a simple text prompt.
- Character.ai – A platform that lets you create and interact with AI-powered chatbots based on fictional or real personalities.
- Perplexity – An AI-powered answer engine that provides direct, sourced answers to questions by searching the web.
- ChatGPT – OpenAI’s flagship conversational AI model that can answer questions, write content, generate code, and more.
- Claude – A family of large language models developed by Anthropic, known for its focus on safety and large context windows.
- Devin – An autonomous AI software engineer that can handle entire development projects from a single prompt.
- MindEye2 – A model that reconstructs what a person is seeing with high fidelity from fMRI brain scans.
- Viggle – An AI tool for generating videos with controllable motion, allowing you to animate characters based on text or video prompts.
- Open-Sora-Plan – An open-source project that aims to reproduce OpenAI’s text-to-video model Sora.
- Pika – An idea-to-video platform that allows you to generate and edit videos in various styles using AI.
- Runway – A creative suite of AI magic tools for creators, best known for its text-to-video and video-to-video generation.
- ElevenLabs – An AI voice generator that creates realistic, human-like text-to-speech and voice clones.
- Jam – An AI debugging assistant that helps developers understand and fix bugs faster by analyzing bug reports.
- Arc Search – A mobile browser that uses AI to read multiple web pages and provide a single summarized answer to your query.
- Jamba – A new production-grade Mamba-based language model from AI21 Labs with a massive 256K context window.
- Databricks DBRX – A state-of-the-art open, general-purpose large language model from Databricks.
- Chat with RTX – A demo app that lets you run a personalized chatbot locally on your PC using your own documents and content.
- PaliGemma – Google’s open vision-language model designed for fine-tuning on specific visual language tasks like image captioning.
- CodeGemma – A family of lightweight, open code models from Google for code completion and generation tasks.
- Midjourney – A popular AI image generator known for creating artistic and high-quality visuals from text prompts.
- Gemini – Google’s family of powerful multimodal AI models capable of understanding text, images, video, and code.
- Notion AI – An AI assistant integrated into Notion for summarizing notes, drafting content, and organizing information.
- Tome – An AI-powered tool for creating and designing compelling presentations, documents, and stories from a prompt.
- OpenVoice V2 – An improved version of a versatile instant voice cloning model that requires only a small audio sample from a target speaker.
- DINO 1.5 – A self-supervised model from Meta AI for high-performance computer vision tasks that requires no fine-tuning.
- Guidde – A tool that helps create video documentation and how-to guides with AI-generated voiceovers and steps.
- Lindy – An AI assistant to automate tasks like calendar management, email drafting, and contact management.
- Poly – An AI-powered tool that generates high-quality, customizable, and commercially-licensed textures for 3D designs from a text prompt.
- Clipboard – An AI tool that automates data entry by copying information from documents like PDFs and images into spreadsheets.
- Foundry – A no-code platform for building and deploying AI agents and workflows.
- Trieve – An open-source search infrastructure for building multi-modal and RAG-powered search applications.
- Sidekick – An AI-powered meeting scheduler that simplifies finding mutual availability without back-and-forth emails.
- Bento – An open-source platform for building, shipping, and scaling production-ready AI applications.
- Cosmopedia – A massive dataset of 25 million samples of synthetic data generated by Mixtral for pre-training language models.
Sponsors:
- Brilliant.org – An interactive learning platform for mastering AI, math, data science, and computer science.
- Guidde – An AI-powered platform that helps teams create video documentation in seconds.
- RunPod – A cloud platform that allows you to rent GPUs for AI and machine learning tasks.
- Growth School – Offers a 3-hour workshop to learn how to leverage ChatGPT and other AI tools to automate work and boost productivity.
- The Mindstream Post – The premium edition of the Mindstream newsletter, offering a monthly ‘State of AI’ report and access to an exclusive community.
- Masterworks – A platform that allows you to invest in fractional shares of blue-chip art from renowned artists.
- Promptlayer – A platform for prompt engineers to visually build, manage, and track their prompts.