Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Runway unveiled Gen-3 Alpha – its new text-to-video model that promises significant improvements in generation quality, structure, and motion.
- Suno launched V3.5 – a free update to its AI music generator featuring an extended song structure up to 4 minutes and improved audio quality.
- Meta released Llama 3.1 – a new family of open-source models including a 405B parameter version that rivals top proprietary models like GPT-4o.
- ElevenLabs introduced a new AI music generation tool – that allows users to create complete songs from text prompts, specifying genres, moods, and instruments.
- Alibaba released Qwen2.5 – a new large language model that has surpassed GPT-4 on the AlpacaEval 2.0 leaderboard for chatbot performance.
- Apple announced the international launch of the Vision Pro – making it available in countries like China, Japan, Singapore, Australia, Canada, France, Germany, and the UK.
- Perplexity introduced new visual and interactive search features – including the ability to generate images and charts directly within search results.
- AI legal startup Harvey laid off a significant portion of its go-to-market team – signaling a shift in its business strategy.
- Microsoft released Florence-2 – a new open-source vision model that excels at a wide range of tasks like captioning, object detection, and segmentation.
- Pika Labs launched a new feature that automatically adds sound effects to videos – allowing users to generate audio by prompting or letting the AI decide.
- Microsoft has made Inflection-2.5 available on its Azure AI platform – the model powering the Pi chatbot, for developers to build with.
- YouTube is testing new AI-powered tools for creators – including a feature that generates video ideas and outlines based on audience interests.
- Google DeepMind developed VLOGGER – an AI model that can create realistic, controllable videos of people talking, gesturing, and moving from a single photo.
- Cohere’s powerful Command R and R+ models are now generally available on Amazon Bedrock – expanding options for enterprise AI developers.
- Researchers have developed an AI tool that can detect early signs of pancreatic cancer – from CT scans up to three years before a human diagnosis is possible.
- Waymo’s self-driving taxi service is now available to the general public in San Francisco – without a waitlist.
- A new report suggests that AI-generated images of hands are still a reliable indicator of fake content – despite recent improvements in image generation technology.
- Microsoft and Coca-Cola have entered a five-year, $1.1 billion strategic partnership – to explore and implement AI technologies across the beverage company’s operations.
- Google.org is committing $5 million to expand its AI-powered flood and wildfire forecasting systems – globally to help communities prepare for natural disasters.
- Reka’s multimodal model, Reka Core, is now available on Microsoft Azure – offering capabilities in text, image, video, and audio processing.
- Poe introduced multi-bot chats – allowing users to bring multiple different AI models into a single conversation to leverage their unique strengths.
- GitHub Copilot Workspace is now in technical preview – offering an AI-native environment to help developers go from an idea to a full coding plan.
- A new study indicates that while AI can boost productivity in creative tasks – it may also lead to a decrease in the overall novelty and originality of human work.
- Hugging Face released an official implementation of I-JEPA – Meta’s self-supervised learning model for computer vision, making it easier for researchers to use.
- Databricks announced its acquisition of Tabular – a data management company, in a deal reportedly valued between $1 billion and $2 billion.
- AssemblyAI released Universal-1 – a new multilingual speech-to-text model trained on 12.5 million hours of audio, setting new standards for transcription accuracy.
- Character.ai is increasing the price of its c.ai+ subscription – from $9.99 to $14.99 per month for new users.
- Google DeepMind developed Safe Diffusion – a new method to fine-tune text-to-image models to prevent them from generating not-safe-for-work or other harmful content.
- NVIDIA announced that its NIM inference microservices are now available on Amazon SageMaker – making it easier to deploy optimized AI models.
- The SEC has charged a promoter with fraud – for their role in a crypto asset trading scheme that falsely claimed to use AI for high returns.
- A new guide demonstrates how to fine-tune Google’s Gemma 7B model – for specific tasks using a single consumer-grade GPU.
- OpenAI has announced $1.5 million in grants to support AI initiatives across Latin America – focusing on education, community building, and practical applications.
Trending AI Tools:
- Perplexity – An AI-powered search engine that provides direct, verifiable answers with cited sources.
- Luma Dream Machine – An AI model for creating high-quality, realistic videos from text and image prompts.
- Pika – An AI video platform that can generate and edit videos from text, images, or existing video clips.
- Runway – A comprehensive suite of AI magic tools for professional video generation, editing, and creation.
- Sora – OpenAI’s text-to-video model capable of generating high-fidelity, minute-long videos from text prompts.
- Leonardo AI – An AI-powered platform for generating high-quality images, textures, and other visual assets for creative projects.
- Ideogram – An AI image generator specializing in rendering coherent and creative text within images.
- HeyGen – An AI video platform that creates realistic avatar videos with translated and dubbed speech for professional use.
- ElevenLabs – An advanced AI platform for generating realistic text-to-speech audio and cloning voices.
- Krea – A suite of AI tools for real-time image generation, upscaling, and transforming existing videos with AI.
- Suno – An AI tool that generates original songs, including lyrics, instruments, and vocals, from a simple text prompt.
- Midjourney – A popular AI image generation service that creates artistic and stylized images via prompts on Discord.
- Lindy – An AI assistant that automates calendar management, email drafting, and note-taking, now available on mobile.
- Hugging Face – A community and platform providing tools and access to build, train, and deploy machine learning models.
- Codestral – A 22B open-weight AI model from Mistral specifically designed for code generation and completion tasks.
- Cozy – An open-source, self-hostable tool for automating tasks and connecting apps, described in plain English.
- Fabric – An open-source framework that enhances Large Language Models by allowing them to use specific tools and capabilities.
- ChatGPT – OpenAI’s flagship conversational AI for a wide range of text-based tasks including writing, coding, and brainstorming.
- Opus Clip – An AI-powered tool that automatically repurposes long videos into shorter, viral-ready clips for social media.
- Gamma – An AI tool for quickly creating polished and engaging presentations, documents, and webpages from text prompts.
- Tome – An AI-powered storytelling and presentation platform that helps users build compelling narratives and visuals automatically.
- Julius AI – An AI data analyst that can analyze and visualize data from spreadsheets and files through conversational prompts.
- Ollama – A tool that simplifies running and managing large language models like Llama and Mistral on your local machine.
- ComfyUI – A powerful and modular node-based graphical interface for experimenting with Stable Diffusion image generation pipelines.
- Morphic – An AI-powered visual development platform that enables users to create and launch web applications without writing code.
- ThinkDiffusion – A cloud platform that provides pre-configured, powerful GPUs to run AI models like Stable Diffusion without any setup.
- Beautiful.ai – A presentation maker that uses AI to automatically design slides, ensuring a professional look with minimal effort.
- Stability AI Developer Platform – An API platform providing access to Stability AI’s suite of models, including the new Stable Diffusion 3.
- OpenRouter – A platform that aggregates and provides access to a wide variety of LLMs through a single, unified API.
- Vercel – The Vercel AI SDK allows developers to easily build and stream generative AI interfaces in their web applications.
- Modal – A serverless cloud platform designed for developers to run code, machine learning models, and apps on-demand.
- Together AI – A cloud platform that enables developers to build and run open-source generative AI models quickly and efficiently.
- Fireworks AI – A fast and cost-effective inference platform for developers to serve and scale open-source generative AI models.
- Lepton – A platform designed to help developers efficiently build, run, and scale various AI models and applications.
- Groq – A technology company providing ultra-low latency inference for AI models on its custom-built Language Processing Unit (LPU) hardware.
- Gradio – An open-source Python library used to quickly build and share simple web app demos for machine learning models.
- AlphaCTR – An AI tool that analyzes past performance data to help generate and optimize high-performing ad creatives.
- Prem AI – A platform that simplifies the process of deploying and self-hosting open-source AI models for enhanced privacy and control.
- Glean – An enterprise AI search and knowledge discovery solution that connects and understands all of a company’s internal data.
- Continual – An AI platform that brings modern Text-to-SQL capabilities and predictive modeling to the modern data stack.
Sponsors:
- Revelo – A platform for hiring vetted, English-speaking, remote developers from a talent pool of 300,000+ in Latin America.
- Masterworks – An investment platform that allows you to diversify your portfolio by investing in shares of blue-chip art from artists like Banksy and Basquiat.
- Brave – Brave’s AI assistant, Leo, offers free, private access to features like summarizing pages and answering questions directly in the browser.
- Incogni – A service that helps you automatically remove your personal data from data broker websites that sell it without your consent.
- Dust – A platform that allows you to build and deploy powerful large language model apps.
- Rows – A spreadsheet application with a built-in AI Analyst that can analyze your data and create summaries, reports, and graphs in seconds.
- CommandBar – CommandBar offers a free AI-powered user assistance platform that helps users with a combination of search, an AI chatbot, and product tours.