Note from Matt: Below is an AI-generated list of AI news and tools. Our bot scours the internet daily to find AI news. Because it’s AI, it often finds older news or non-AI news and adds it to the mix. It also seems to break links often. We’re constantly working on improving the automation but this should, for the most part, ensure no AI news ever gets missed
The Latest AI News:
- Meta released Llama 3.1 – a new family of models including an 8B, 70B, and a massive 405B parameter version accessible via an API.
- Pika launched a “Sound Effects” feature – that automatically generates and syncs relevant sounds to videos with a single prompt.
- Apple published a detailed paper on Private Cloud Compute – its new system for securely processing complex AI requests in the cloud while preserving user privacy.
- Google DeepMind unveiled RT-Trajectory – a new robotics model that can learn complex tasks from a single video demonstration.
- AI search startup Perplexity raised $250 million in new funding – increasing its valuation to $3 billion.
- Figure is integrating OpenAI’s new models into its humanoid robots – to enable more natural conversation and complex reasoning.
- Adobe Research introduced E-MO – a generative AI model that can create expressive audio tracks for animated characters from a text prompt and a style image.
- Midjourney launched a new “Style Random” feature – which allows users to apply a random, cohesive style to their generations for unexpected results.
- Runway released Gen-3 Alpha – its new flagship video generation model, which offers significant improvements in realism, character consistency, and motion.
- The UK’s AI Safety Institute opened its first international office in San Francisco – to collaborate more closely with US AI labs and government agencies.
- Microsoft introduced Florence-2 – a powerful open-source vision model capable of handling a wide variety of vision and vision-language tasks using simple text prompts.
- French AI leader Mistral hired Matthieu Baret as its new CFO – a former finance chief at Doctolib, to guide its rapid growth.
- YouTube is testing an AI-powered “Jump ahead” feature – that allows viewers to skip directly to the most interesting parts of a video.
- Microsoft Edge is introducing a new AI translation feature – that provides real-time translation and dubbing for videos on sites like YouTube and LinkedIn.
- The EU AI Act has been formally approved – finalizing the world’s first comprehensive AI law.
- OpenAI’s head of go-to-market is departing – Aliisa Rosenthal is leaving the company after less than a year in the role.
- A new guide shows how to fine-tune Llama 3 8B – on a single consumer GPU in under 15 minutes.
- Google Research released VLOGGER – an AI model that can generate realistic, controllable videos of people from a single still photograph.
- Google announced Med-Gemini – a new family of highly capable multimodal models specifically trained for the medical field to assist with diagnostics and research.
- Microsoft Research revealed VASA-1 – an AI model that can create hyper-realistic talking face videos from a single portrait image and an audio clip.
- Snowflake released Arctic – a new dense-and-sparse mixture-of-experts language model that is open-source and designed for enterprise-level AI tasks.
- The United States and China held their first formal talks on AI risks – agreeing to continue the dialogue in Geneva.
- AI Pin startup Humane is reportedly exploring a sale – following poor reviews, the company is considering a sale for between $750 million and $1 billion.
- Researchers used AI to read a 2,000-year-old Herculaneum scroll – revealing details about Plato’s final hours.
- An IPPR report warns 8 million UK jobs are at risk from AI – without proactive government intervention in the coming years.
- LinkedIn is rolling out new AI tools for job seekers – helping Premium subscribers discover jobs, write resumes, and draft cover letters more efficiently.
- A new paper introduces “Let’s Think Dot by Dot” – an AI prompting technique that improves visual reasoning by guiding models to analyze images point by point.
- A new tutorial shows how to build a full-stack AI app – powered by Meta’s Llama 3 and Next.js.
- Legal AI startup Harvey raised $80 million – in a Series B funding round, valuing the company at $715 million.
- A demo of Devin, the “first AI software engineer,” received criticism – for potentially misrepresenting some of its autonomous coding abilities.
- ‘The Image of the Whale’ demonstrates AI’s creative potential – a new sci-fi short film blends live-action footage with AI-generated scenes.
- Animated Drawings is a free tool from Meta AI – that lets you upload children’s drawings and automatically animate the characters to move and dance.
Trending AI Tools:
- GPT-4o – OpenAI’s new flagship model that natively accepts text, audio, and image inputs to generate multimodal outputs.
- Project Astra – Google’s vision for a universal, multimodal AI assistant that can see and hear the world to be contextually helpful.
- Veo – Google’s most advanced text-to-video model for creating high-quality, realistic, and stylized videos over a minute long.
- Imagen 3 – Google’s latest text-to-image model that generates photorealistic, lifelike images with incredible detail and fewer artifacts.
- Music AI Sandbox – A suite of AI-powered music creation tools from Google designed to open new creative pathways for artists.
- Suno – An AI music generator that creates realistic songs with vocals and instruments from a simple text prompt.
- Lindy – An AI agent platform that automates complex workflows like email management, scheduling, and contract drafting.
- Trill AI – A tool that allows you to create AI cover songs using your own voice.
- Gems – Customizable versions of the Google Gemini model that can be tailored for specific tasks or styles.
- Gemini 1.5 Flash – A lightweight, faster, and more cost-efficient version of Google’s Gemini 1.5 Pro model for high-frequency tasks.
- AudioMage – A text-to-audio diffusion model from Google that generates high-fidelity audio like music and sound effects.
- Runway – A comprehensive suite of AI-powered tools for advanced video generation, editing, and creation.
- HeyGen – An AI video platform that creates realistic studio-quality videos with customizable avatars and voice cloning.
- Sora – OpenAI’s text-to-video model capable of generating high-fidelity, imaginative scenes from text instructions.
- Luma Dream Machine – An upcoming text-to-video model from Luma Labs designed to generate high-quality, realistic video clips.
- Midjourney – A popular AI image generator known for producing highly detailed, artistic, and aesthetically pleasing images from text prompts.
- Viggle – A video generation AI that animates a static character image according to a reference motion video.
- Pika – An AI video generator that can create and edit videos in various styles from text prompts or images.
- Opus Clip – An AI tool that automatically turns long videos into engaging short-form clips for social media.
- TalkToPDF – A tool that allows you to chat with and ask questions of your PDF documents using AI.
- Scribe – An AI-powered tool that automatically generates step-by-step guides and tutorials by capturing your screen.
- Durable – An AI website builder that generates a professional business website with copy and images in seconds.
- AdCreative AI – An AI platform that generates conversion-focused ad creatives, text, and headlines for marketing campaigns.
- Interview Jarvis – An AI-powered tool that helps you prepare for job interviews by providing personalized feedback and practice sessions.
- Windsor – An AI platform for sending personalized video messages to customers at scale to increase engagement.
- Veed – An online video editor that simplifies video creation with a suite of AI-powered tools and features.
- Typeframes – A simple tool for creating professional-looking product videos in minutes without needing design skills.
- Framer – A web design and publishing platform that incorporates AI to help you build and launch professional websites quickly.
- AI Story Generate – A creative writing assistant that helps you generate unique stories, scripts, and other text-based content.
- ChatGPT – OpenAI’s conversational AI model designed to understand and generate human-like text in response to a wide range of prompts.
- Google Gemini – A family of powerful, multimodal AI models from Google that can reason across text, images, video, and audio.
- Reka Core – A high-end multimodal language model that understands text, images, audio, and video for complex contextual reasoning.
- PaliGemma – Google’s powerful and lightweight open-source vision-language model designed for tasks like image captioning and visual Q&A.
- Tonal – A mobile app that provides real-time transcription, translation, and summarization of spoken conversations.
- VEGAS – An AI model from Adobe Research that generates editable vector graphics from text prompts.
- Glean – An enterprise AI search platform that connects and understands all of a company’s knowledge to provide instant answers.
- Brevian – A no-code platform for building and deploying custom AI agents to automate business processes.
- AgentHub – A platform that enables users to build, deploy, and manage autonomous AI agents.
- Superagent – An open-source framework for developers to build, manage, and deploy AI assistants and agents.
- Galileo Luna – An AI evaluation platform for LLM developers to quickly detect, fix, and monitor hallucination-related risks.
- Modal – A cloud platform that simplifies running serverless AI/ML models, batch jobs, and other computational tasks.
- Vercel – A cloud platform for frontend frameworks that provides infrastructure for deploying AI-powered applications.
- FlowGPT – A platform for sharing and discovering the best prompts for large language models like ChatGPT.
- Hugging Face – A community and data science platform that provides tools for building, training, and deploying machine learning models.
Sponsors:
- Imagine AI Conference 2024 – A conference in Las Vegas focusing on the practical application and commercialization of AI in business.
- AE Studio – A development, data science, and design studio that helps solve business problems with custom software and AI solutions.
- Work OS – An auth platform that provides enterprise-ready features like SSO and SCIM to accelerate B2B SaaS app development.
- Brilliant – An interactive learning platform that helps you build math and computer science skills through hands-on problem-solving.
- HubSpot – A platform offering AI-powered tools like a Campaign Assistant to help businesses with marketing and sales.
- Guidde – An AI-powered platform that helps you create video documentation and how-to guides instantly.
- Masterworks – An investment platform that allows you to invest in shares of multi-million dollar art by artists like Banksy and Basquiat.
- Read AI – An AI tool that automatically writes meeting summaries, transcripts, and playback videos for platforms like Zoom, Google Meet, and Microsoft Teams.
- CommandBar – An AI user assistance platform that helps users find what they need in your app through features like search, help articles, and in-app assistance.