Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Perplexity launches ‘Pages’ to turn search queries into articles – Perplexity introduced Pages, a new feature that allows users to generate and share visually organized articles and reports from their search queries.
- Microsoft unveils major AI updates at Build conference – At its annual Build conference, Microsoft announced a host of AI upgrades, including the new Phi-3-vision multimodal model, real-time video translation, and advanced features for its Copilot assistant.
- OpenAI’s former chief scientist launches new AI company – Ilya Sutskever, a co-founder and former chief scientist of OpenAI, has started a new company called Safe Superintelligence Inc. (SSI) focused entirely on safely developing powerful AI.
- Google introduces ‘Veo’, its most advanced text-to-video model – Google announced Veo, a new generative AI model designed to compete with OpenAI’s Sora by creating high-quality, minute-long 1080p videos from text prompts.
- Runway releases Gen-3 Alpha, its next-generation video model – Runway has launched Gen-3 Alpha, its newest video generation model that offers significant improvements in speed, control, and the creation of detailed, consistent characters and scenes.
- OpenAI partners with Reddit to train ChatGPT – OpenAI has signed a deal with Reddit to bring the platform’s content to ChatGPT, allowing the AI to learn from and display real-time information from Reddit’s communities.
- Google unveils Imagen 3 for photorealistic image generation – Google’s new text-to-image model, Imagen 3, generates highly realistic images with fewer visual errors and a much better understanding of complex, descriptive prompts.
- ElevenLabs launches a text-to-sound-effects generator – AI voice company ElevenLabs has released a new tool that can generate a wide range of sound effects and short audio tracks from simple text descriptions.
- OpenAI signs content licensing deal with Dotdash Meredith – OpenAI has partnered with publisher Dotdash Meredith to license content from brands like Investopedia and Better Homes & Gardens for training its AI models.
- Suno releases version 3.5 of its AI music generator – AI music creation tool Suno has updated its model to version 3.5, featuring improved audio quality and the ability to create songs up to four minutes long.
- TikTok is developing AI-powered virtual influencers – TikTok is reportedly experimenting with AI-generated virtual influencers that can appear in scripted ad videos to promote products on the platform.
- Hugging Face is reportedly raising a new funding round – AI startup Hugging Face is said to be in talks to raise at least $200 million in a new funding round that would maintain its $4.5 billion valuation.
- MIT study finds AI is augmenting, not replacing, most jobs – A study from MIT reveals that companies are predominantly using AI to enhance employee capabilities and create new tasks rather than for automating jobs away.
- Stack Overflow partners with OpenAI to improve AI coding answers – Stack Overflow has partnered with OpenAI to provide data via its new OverflowAPI, helping ChatGPT give more accurate and attributed programming answers.
- Humane is reportedly seeking a buyer after Ai Pin’s poor reception – Following weak reviews for its wearable Ai Pin, tech startup Humane is reportedly exploring a sale of the company for between $750 million and $1 billion.
- Cohere releases Command R+, a new enterprise-grade AI model – Cohere has launched Command R+, a highly scalable conversational model designed for enterprise applications like advanced retrieval-augmented generation (RAG) and tool use.
- Meta releases technical report for Llama 3.1 – Meta has published the technical report for its newest model, Llama 3.1, detailing its architecture, training process, and performance benchmarks.
- Pika adds sound effects generation to its video platform – AI video generator Pika has introduced a new feature allowing users to add sound effects to their creations by either describing the sound they want or letting AI suggest one.
- YouTube tests AI feature to let viewers “jump ahead” – YouTube is experimenting with a new “Jump ahead” feature that uses AI to analyze viewing patterns and let users skip directly to the most popular parts of a video.
- NVIDIA details Project GR00T for humanoid robots – NVIDIA’s Project GR00T is a general-purpose foundation model designed to help humanoid robots understand natural language and learn new skills by observing human actions.
- A developer’s guide to fine-tuning Mistral models – A new guide offers a comprehensive, code-included tutorial on how to fine-tune open-source models from Mistral AI for specialized tasks.
- Researchers scale Rectified Flow models for high-resolution images – A new research paper presents a method for scaling Rectified Flow models, a more efficient alternative to diffusion models, to generate high-quality, high-resolution images.
- MusicRL uses reinforcement learning to improve AI music – The MusicRL paper introduces a method that uses reinforcement learning from human feedback to enhance the quality and prompt-adherence of text-to-music AI models.
- A survey of LLM-based autonomous agents – A comprehensive academic paper surveys the current state of autonomous agents built on large language models, covering their design, applications, and challenges.
- Chatling – Chatling is an AI tool that lets you build and deploy a custom chatbot trained on your own website data and documents.
- SummerEyes – SummerEyes is an AI-powered browser extension that can summarize any text on the internet, from articles to emails, in a single click.
- Guidde – Guidde is an AI tool that helps teams create video documentation and step-by-step guides significantly faster than with traditional methods.
- Rationale – Rationale is a decision-making tool that uses AI to help you analyze choices by generating pros and cons lists, SWOT analyses, or other frameworks.
Trending AI Tools:
- Luma Dream Machine – A new, publicly available text-to-video model for generating high-quality and coherent video clips from text prompts.
- Kling – A new text-to-video model from Kuaishou capable of generating up to 2 minutes of 1080p video at 30fps.
- Perplexity – An AI-powered search engine that can now generate structured, shareable articles called ‘Pages’ from simple prompts.
- Codestral – A new 22B open-weight generative AI model from Mistral specifically designed for code generation tasks.
- Runway Gen-3 – An upcoming text-to-video model with major improvements in speed and quality, including a new Motion Brush tool for precise movement control.
- ElevenLabs Dubbing – A tool that automatically dubs videos into 29 different languages while preserving the original speaker’s voice.
- Claude 3.5 Sonnet – Anthropic’s latest and most intelligent AI model, featuring a new ‘Artifacts’ workspace for real-time editing and collaboration on AI-generated content.
- Suno – An AI music generation tool that creates songs with lyrics and vocals from text prompts.
- HeyGen – An AI video platform that features tools for creating avatar videos, voice cloning, and automatic video translation.
- Sora – OpenAI’s high-fidelity text-to-video model known for creating cinematic and realistic scenes.
- Microsoft VASA-1 – A Microsoft research project that creates lifelike talking faces from a single static image and a speech audio clip.
- Udio – An AI music generation tool that creates high-quality songs with vocals from text prompts.
- Pika – An AI video generation platform that can create and edit videos from text prompts and still images.
- Viggle – An AI video generation tool that allows you to animate a static character image using a motion video.
- Midjourney – A popular AI image generator that creates detailed and artistic images from natural language descriptions.
- Leonardo AI – An AI platform for generating high-quality game assets, concept art, and other visual content from text prompts.
- Ideogram – An AI image generator known for its superior ability to reliably render coherent text within images.
- Genie by LottieFiles – An AI tool that generates Lottie animations from text prompts or static images.
- ChatGPT – A conversational AI from OpenAI capable of a wide range of text generation and understanding tasks.
- Luminal – An AI agent that learns and automates browser tasks by observing your actions.
- Synthesia Express – A tool for creating AI avatar videos simply by typing text, without needing a complex video editor.
- Genspark – An AI-native search engine that generates custom summaries and webpages called ‘Sparks’ in response to user queries.
- Mindgrasp – An AI learning assistant that creates notes, summaries, and answers questions from documents, videos, and meetings.
- Mindverse – A platform that allows you to create a personal AI trained on your own data for customized knowledge retrieval.
- Motion – An AI-powered tool that uses algorithms to automatically plan your day by managing your calendar, projects, and meetings.
- Diagram – An AI-powered design tool and plugin for Figma that automates and enhances the design workflow.
- Vimeo AI – A suite of AI-powered tools within Vimeo for script generation, a teleprompter, and automatic video editing.
- Danswer – An open-source enterprise question-answering tool that connects to workplace apps like Slack and Google Drive to find information.
- Bland AI – A platform for developers to build, test, and monitor production-ready, human-like AI voice agents for phone calls.
- Vapi – A developer platform for building, testing, and deploying realistic, low-latency AI voice agents.
- Hugging Face – A platform and community that provides tools and resources for building, training, and deploying machine learning models.
- Brev – A development environment that provides fast access to pre-configured, powerful GPUs for building AI applications.
- Amazon Bedrock – A fully managed AWS service that offers a choice of high-performing foundation models from leading AI companies via a single API.
- Godot – An open-source game engine that can be used as a versatile environment for building and simulating AI agents.
- Magika – Google’s open-source, AI-powered file-type identification system designed for fast and accurate detection.
Sponsors:
- tldv – An AI meeting recorder that transcribes, summarizes, and captures key moments from your calls.
- edX – Offers boot camps in high-growth tech fields like AI and machine learning, coding, and data analytics to help you master job-ready skills.
- AE Studio – A development, data science, and design studio that helps businesses create custom software and AI solutions.
- Flatfile – Provides an AI-assisted data import solution to help businesses clean, validate, and import customer data.
- Masterworks – An investment platform that allows you to invest in shares of multi-million dollar paintings by artists like Banksy and Basquiat.
- HubSpot – Offers a CRM platform with software and support to help businesses grow better.
- MarketingProfs – Provides training, events, and resources for modern marketing professionals.
- Rest of World – A non-profit journalism organization that documents the impact of technology outside the Western bubble.
- Aprimo – A digital asset management and marketing resource management platform.
- BrandOps – A brand management platform that provides competitive intelligence and marketing performance analytics.
- Campaigner – An email marketing automation platform that helps businesses create and manage personalized campaigns.
- Sitecore – A digital experience platform that helps create personalized and engaging customer interactions.
- Welcome – A marketing orchestration platform that helps teams manage campaigns, content, and performance.