Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation, but this should, for the most part, ensure no AI news ever gets missed.
The Latest AI News:
- Google showcased Project Astra – a real-time, multimodal AI assistant that can see, hear, and speak to understand and respond to the world around it.
- Google is rolling out “AI Overviews” – providing AI-generated answers to complex queries at the top of its search results for all users in the US.
- Google announced Gemini 1.5 Flash – a new smaller, faster, and more cost-efficient model optimized for high-volume, low-latency tasks.
- OpenAI and Reddit announced a partnership – bringing Reddit’s content to ChatGPT and new AI-powered features to Reddit users and moderators.
- Google unveiled Veo and Imagen 3 – its most capable text-to-video generation model and its highest-quality text-to-image model yet.
- Google expanded Gemini 1.5 Pro’s context window to 2 million tokens – a new industry record, available in private preview for developers.
- OpenAI paused the use of its “Sky” voice in ChatGPT – after users noted its strong resemblance to Scarlett Johansson, who had previously declined to voice the assistant.
- Scale AI raised $1 billion – in a funding round that nearly doubles its valuation to $13.8 billion.
- Anthropic launched “tool use” for its Claude models – allowing them to interact with external APIs and tools to perform complex, real-world tasks.
- Google announced Trillium – its sixth-generation Tensor Processing Unit (TPU), which offers a 4.7x improvement in compute performance per chip over the previous generation.
- Perplexity introduced Perplexity Pages – a new feature that allows users to create and share comprehensive, visually appealing articles from a single prompt.
- Microsoft’s annual Build developer conference is taking place – with expected announcements focusing on integrating AI across its products, from Windows to Azure.
- Apple is nearing a deal with OpenAI – to integrate some of its technology into iOS 18.
- Researchers released Llama 3-V – a new open-source multimodal model that its developers claim outperforms GPT-4V, Llama 3, and Gemini Ultra on several benchmarks.
- Microsoft researchers introduced a “Stop-and-Think” technique – which enables language models to solve complex math problems at a human level by breaking them into smaller steps.
- CoreWeave is acquiring Lambda Labs – a deal between the AI cloud providers that could be valued at over $1 billion.
- Meta AI introduced Chameleon – a family of early-stage, multimodal models that can understand and generate text and images in any sequence.
- The Senate AI working group released a roadmap for AI policy – recommending $32 billion in annual funding for non-defense AI research and development.
- Meta released JASCO – a new model for text-to-music generation that allows for greater user control over elements like harmony, rhythm, and chords.
- Youdao launched QAnything – a local, privacy-focused alternative to ChatGPT that allows users to chat with their documents offline.
- Google DeepMind introduced a set of “AI safety” principles – including a “responsible development” framework and a method for red-teaming generative models.
- A new paper from Chinese researchers presents Long-Context Language Models (LC-LLMs) – models that can handle up to 2 million tokens of context.
- Mistral AI released Codestral – a new 22B parameter model specialized for code generation, supporting over 80 programming languages.
- Microsoft and Khan Academy announced a partnership – to provide free access to Khanmigo for Teachers, an AI-powered teaching assistant, to all K-12 educators in the United States.
- A new research paper explores “Contextual Compression” – a method to improve the efficiency of Retrieval-Augmented Generation (RAG) by filtering irrelevant information from retrieved documents.
- Researchers developed an algorithm for a two-legged robot – enabling it to learn to walk from scratch in just 20 minutes using a new approach to reinforcement learning.
- The UK’s AI Safety Institute opened its first international office – in San Francisco to collaborate more closely with US AI companies and researchers.
- Wayve released PRISM – a new dataset from the self-driving car startup aimed at improving the prediction and planning capabilities of autonomous driving systems.
- A new tool called RAGAS allows developers to evaluate RAG pipelines – letting them score the performance of their Retrieval-Augmented Generation systems without relying on ground-truth labels.
- Pika added a sound effects generation feature – to its AI video generation platform.
Trending AI Tools:
- Luma Dream Machine – A publicly available text-to-video model that generates high-quality, realistic 5-second video clips from text prompts and images.
- Perplexity Pages – An AI tool that converts research queries into comprehensive, well-structured articles and reports, complete with citations and a shareable link.
- Kling – A text-to-video model from Chinese developer Kuaishou capable of generating high-quality 1080p videos up to two minutes long with complex motion.
- Claude 3.5 Sonnet – Anthropic’s fastest and most intelligent AI model to date, excelling at complex tasks like code generation, multi-step workflows, and visual analysis.
- ChatGPT-4o – OpenAI’s flagship multimodal AI that accepts text, audio, and image inputs to generate human-like conversational output.
- Runway Gen-3 Alpha – Runway’s newest and most powerful video generation model, offering significant improvements in creating realistic humans, detailed scenes, and consistent motion.
- Midjourney – An advanced AI image generator known for producing high-quality, artistic, and stylistically coherent visuals from text prompts.
- Krea AI – A real-time AI design suite that can generate and upscale images, create patterns, and transform existing videos into different styles.
- Suno – An AI music generation tool that creates complete songs, including vocals and instrumentation, from a simple text description.
- Ideogram – An AI image generator that specializes in reliably rendering coherent and stylized text within its generated images.
- Viggle – An AI video tool that animates a static character image with motion captured from a separate reference video, making it dance or move realistically.
- Splash – An AI music creation platform featuring unique vocal models that allows users to generate songs and even create their own AI singer from a text prompt.
- Fine-Tuner.ai – A no-code platform that enables users to create, train, and host their own fine-tuned AI models for specific tasks.
- Chie – A native macOS application for running open-source language models locally on your device for fast, private, and offline AI assistance.
- Phind – An AI search engine and pair programmer for developers that provides instant answers, code examples, and technical explanations.
- Vizcom – An AI-powered design tool that transforms hand sketches, drawings, and text prompts into professional-grade product concepts and renders.
- Sizzle – An AI tool that automatically detects and clips exciting moments from Twitch or YouTube gaming streams to create highlight videos.
- Explainpaper – A tool that helps researchers and students understand complex academic papers by providing AI-generated explanations for highlighted text.
- Consensus – An AI search engine designed for researchers that extracts and synthesizes findings directly from a database of over 200 million scientific papers.
- AI Storyboard Generator (by Boords) – A tool that uses AI to automatically generate a complete visual storyboard, including images and scenes, from a simple video script.
- Pika – An AI video platform that can generate and edit videos from text prompts, images, or existing video clips.
- HeyGen – An AI video generator that allows users to create professional videos featuring realistic avatars and translated voice cloning.
- RecapioGPT – An AI tool that quickly generates comprehensive summaries of long YouTube videos, articles, and podcasts.
- Mindgrasp – An AI learning assistant that instantly creates notes and summaries and answers questions from any uploaded document, video, or audio file.
- Gamma – An AI tool that creates polished presentations, documents, and webpages from a text prompt, eliminating the need for manual design and formatting.
- Leonardo AI – A full-featured generative AI suite for creating and editing visual assets, including images, game art, and textures.
- Adobe Firefly – Adobe’s family of creative generative AI models that are integrated into its suite of apps like Photoshop for ethical image generation and editing.
- Poe – A chatbot application by Quora that provides access to a wide range of different AI models and user-created bots in one place.
- Superhuman – An AI-powered email client designed for speed, featuring tools for summarizing threads, drafting replies, and inbox automation.
- Motion – An AI-powered app that combines your calendar, project manager, and to-do list to automatically plan your optimal daily schedule.
- Fireworks AI – A platform providing developers with fast and efficient access to a wide variety of open-source and fine-tuned generative AI models.
- HubSpot AI – A collection of AI-powered tools integrated into the HubSpot CRM to assist with content creation, marketing campaigns, and sales automation.
- Glean – An AI-powered search and knowledge discovery tool for the enterprise that connects to and searches across a company’s internal applications.
- Deforum – A tool for creating AI-generated animations and video transformations, known for its use with the Stable Diffusion model.
- Lexis+ AI – A generative AI assistant for legal professionals that aids in legal research, document summarization, and drafting.
- Shortform – A service that provides in-depth, high-quality summaries and analyses of non-fiction books.
Sponsors:
- Brave Leo – A free AI assistant built into the Brave browser that answers questions, creates content, and writes code with a focus on speed and privacy.
- Intel – Provides tools like oneAPI and the AI Developer Cloud to help build and scale AI applications across different hardware.
- Browse.AI – A no-code platform that allows you to train a robot to extract and monitor data from any website.
- AE Studio – A development, data science, and design studio that works with companies to create innovative products.
- CommandBar – A platform that helps SaaS companies quickly build and deploy AI chatbots and other user-facing AI features.
- The Edge – A newsletter that sends a single, practical AI tip every day to help you get ahead.
- Writer – A full-stack generative AI platform designed for enterprise use.
- beehiiv – A newsletter platform that helps creators and businesses run and monetize their publications.
- Semrush App Center – A collection of over 40 marketing tools, including many powered by AI, to enhance digital marketing campaigns.
- Masterworks – An art investment platform that allows individuals to invest in shares of blue-chip art.
- Descript – An AI-powered audio and video editor that allows you to edit media as easily as editing a text document.