Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Perplexity launches Pages to turn search results into articles – Perplexity launched Pages, a new feature that lets users convert their search queries or threads into comprehensive, well-structured articles and reports that can be easily shared.
- Anthropic’s Claude gets tool use to connect with external APIs – Anthropic has updated its Claude 3 models with a “tool use” feature, enabling them to interact with external data sources and APIs to perform complex, real-world tasks.
- Google unveils Lumiere, a text-to-video model with realistic motion – Google Research introduced Lumiere, a new text-to-video diffusion model designed to generate videos with coherent and realistic motion by creating the entire video in a single pass.
- ElevenLabs raises $80M at a $1.1B valuation for AI voice generation – Voice AI startup ElevenLabs raised an $80 million Series B round co-led by Andreessen Horowitz, Nat Friedman, and Daniel Gross to expand its research and product offerings.
- OpenAI reportedly in talks for funding at a $100B valuation – OpenAI is in early discussions to raise a new funding round that would value the company at $100 billion or more, making it one of the world’s most valuable startups.
- Midjourney releases V6 alpha with in-image text and better realism – The alpha test of Midjourney V6 is now available, featuring significantly improved prompting, coherence, and the ability to generate legible text within images.
- Apple open-sources Ferret, a multimodal model that can identify objects in images – Apple has released Ferret, an open-source multimodal large language model capable of understanding and reasoning about specific regions within an image.
- Leaked Google memo reveals 2024 AI plans to beat OpenAI – An internal Google document outlines the company’s top AI priorities for 2024, including launching the Gemini Ultra model and developing AI-powered tools that feel personal and helpful.
- Rabbit sells out its fourth batch of 10,000 R1 devices – The AI hardware startup Rabbit announced that the fourth production run of its R1 device sold out in less than a day, bringing total pre-orders to 40,000 units.
- Microsoft VALL-E 2 generates high-quality text-to-speech audio – Microsoft has introduced VALL-E 2, a new text-to-speech model that can generate natural-sounding speech and maintain the speaker’s voice from a short audio sample.
- AI music startup Suno is in talks for new funding – Suno, a startup known for its AI-powered music and speech generation tools, is reportedly in discussions to raise a new round of funding.
- Amazon rolls out AI-generated review summaries in its shopping app – Amazon is now using generative AI to create short summaries of customer reviews for products, highlighting key features and common feedback for shoppers.
- AI can detect Parkinson’s from brain scans 7 years before symptoms appear – Researchers have developed an AI system that can identify signs of Parkinson’s disease in brain scans years before the first physical symptoms emerge.
- Ex-DeepMind scientists launch stealth AI startup in Paris – A team of former Google DeepMind researchers has founded a new stealth startup, UltimateLLM, to build more efficient and powerful large language models.
- Character.ai founder says AI companions are the “killer consumer app” – Noam Shazeer, CEO of Character.ai and a co-inventor of the Transformer architecture, argues that conversational AI companions will become the most impactful application for consumers.
- Google may rebrand its Bard chatbot to Gemini – Google is reportedly planning to change the name of its AI chatbot from Bard to Gemini, aligning the product’s branding with its most powerful underlying model.
- Apple researchers find a way to run LLMs on iPhones – Apple researchers have developed a new technique using flash memory that allows large language models to run effectively on iPhones and other devices with limited RAM.
- Defense tech company Anduril acquires Ad Hoc Research – AI-focused defense contractor Anduril has acquired Ad Hoc Research, a company specializing in microelectronics and secure communication, to enhance its autonomous systems’ capabilities.
- Poe introduces multi-bot chats to talk to several AIs at once – Quora’s AI platform Poe has launched a new feature allowing users to include multiple different AI bots within a single chat conversation by @-mentioning them.
- Reka’s multimodal model beats Gemini Ultra on key benchmark – AI startup Reka announced that its latest multimodal model, Reka-V, has achieved the top position on the public MMMU benchmark for advanced multimodal understanding.
- The New AI Copyright Debate – A new essay explores the evolving and complex legal arguments surrounding generative AI and copyright, moving beyond simple claims of data theft to consider the transformative nature of AI training.
- Waymark, an AI video platform for local ads, raises $10M – Waymark raised a $10M Series A extension to expand its AI-powered platform that enables local businesses to quickly create professional video commercials.
- Hugging Face provides free GPU access to students – Hugging Face is now offering free access to its compute resources, including GPUs, as part of the GitHub Student Developer Pack to support students learning AI.
- Magic Hour is a new tool for creating consistent characters – Magic Hour is a new AI service that lets you train a model on your own characters to generate consistent images of them in various styles and scenes.
- A guide to prompting with Midjourney V6 – A detailed guide explains how to leverage the new features of Midjourney V6, which now has a much better understanding of natural language for prompting.
Trending AI Tools:
- Runway Gen-3 Alpha – A new flagship video model for generating highly detailed, expressive, and realistic video clips from text and images.
- Luma Dream Machine – A publicly available text-to-video model that generates high-quality, realistic 5-second video clips from text prompts.
- Perplexity Pages – An AI tool that turns research queries and prompts into comprehensive, beautifully formatted, and shareable articles or reports.
- Nemotron-4 340B – NVIDIA’s new family of open models that developers can use to generate high-quality synthetic data for training other LLMs.
- Krea AI Video Enhancement – A feature that allows users to upscale and enhance videos to 4K resolution, improving detail and clarity.
- Viggle – An AI model that creates videos of people dancing by applying a motion prompt video to a static character image.
- Suno – An AI music generator that creates original songs, complete with vocals and instruments, from a simple text prompt.
- PowerInfer – A high-speed inference engine that enables large language models to run efficiently on a personal computer with a single consumer-grade GPU.
- Decoherence – A platform for developers to build, test, and deploy AI agents with custom tools, memory, and models.
- Bland AI – A platform for building and scaling AI phone agents that can handle inbound and outbound calls with realistic human-like conversation.
- Scenario – An AI platform that launched a new tool for generating consistent videos and motion from a single image or text prompt.
- Ideogram – An AI image generator known for its superior ability to reliably render coherent and stylized text within images.
- Superagent – An open-source framework for building and managing AI agents with features like memory, document retrieval, and tool usage.
- Udico – An AI-powered tool that generates high-quality User-Generated Content (UGC) style videos for marketing campaigns.
- YouTube’s AI Comment Summaries – An experimental YouTube feature that uses AI to organize large comment sections into easy-to-digest themes for creators.
- Klarna’s AI Assistant – The shopping app’s AI chatbot which now handles the majority of customer service chats, equivalent to the work of 700 full-time agents.
- WebCheck – An all-in-one OSINT (Open Source Intelligence) tool for investigating websites, domains, and IP addresses.
- Defog.ai – An AI tool that translates natural language questions into SQL, helping users query and visualize their data without writing code.
- AI-pply – A tool that automates the job application process by creating personalized resumes and cover letters tailored to each job listing.
- Recraft – An AI design tool for generating and editing vector art, illustrations, icons, and 3D graphics in a consistent brand style.
- Summarize.tech – A tool that uses AI to provide concise summaries of any long YouTube video, such as lectures, live events, or government meetings.
- Midjourney – A popular AI image generator known for creating artistic and high-quality images from text prompts.
- Chroma – An open-source embedding database designed for building and scaling AI applications that use large language models.
- Voiceflow – A collaborative platform for teams to design, prototype, and launch conversational AI assistants for any channel.
- Neat Video – A video filter plug-in designed to reduce visible noise, flicker, and grain in digital video footage from any source.
- Focus Bear – A productivity app that helps users build healthier habits and block distracting websites and applications to improve focus.
Sponsors:
- Incogni – A service that helps you take back your private data from data brokers, which could be used to train AI models.
- Masterworks – An exclusive platform that allows you to invest in shares of multi-million dollar paintings by artists like Banksy and Basquiat.
- AE Studio – A development, data science, and design studio that helps companies build and integrate custom AI solutions.
- FourthBrain – Offers a webinar on managing the generative AI product lifecycle, from proof-of-concept to deployment.
- CommandBar – An AI-powered user assistance platform for SaaS companies to improve user onboarding and feature adoption.