Digestly

Feb 7, 2025

OpenAI & Google Just Made Their Best Models Free

Matt Wolfe - OpenAI & Google Just Made Their Best Models Free

OpenAI's new 03 Mini model is now available across all tiers, offering superior performance in math and science, except for the 01 Pro model. It is accessible to free users via ChatGPT with enhanced features like search and reasoning. The model's Chain of Thought feature, however, is criticized for being less transparent than Deep Seek R1. Additionally, OpenAI launched Deep Research, a tool for Pro users that provides detailed strategic insights, exemplified by its ability to create a comprehensive YouTube strategy. Despite its high cost, Deep Research is valued for its in-depth analysis and accuracy, achieving a 26.6% accuracy rate in recent benchmarks. Meanwhile, Google released Gemini 2.0 models, which are cost-effective and competitive in performance, offering developers a cheaper alternative to other APIs. Google's Imagine 3 AI image generator is now accessible via API, and their Gemini models are available for free use on AI Studio. Other notable updates include GitHub Copilot's new agent mode for code iteration and error correction, and the rapid growth of the Cursor tool, which democratizes software creation.

Key Points:

  • OpenAI's 03 Mini model excels in math and science, available to all users, including free ChatGPT users.
  • Deep Research by OpenAI offers strategic insights but is costly, available only to Pro users.
  • Google's Gemini 2.0 models are cost-effective, offering competitive performance and free access on AI Studio.
  • GitHub Copilot's agent mode enhances code iteration and error correction capabilities.
  • Cursor tool's rapid growth highlights its role in democratizing software creation.

Details:

1. 🚀 OpenAI's 03 Mini Model Released

  • The 03 Mini model outperforms other models in math and PhD-level science questions, except for 01 Pro.
  • It's highly effective in coding and software engineering, being the most powerful model available apart from 01 Pro, which costs $200 a month.
  • The 03 Mini model is available across all tiers, including API access, with unlimited access for Pro users.
  • Plus and team users will have triple the rate limits compared to 01 Mini.
  • Free users can access 03 Mini via Chat GPT by selecting the 'reason' button, and can combine it with search on free plans.
  • OpenAI updated the Chain of Thought feature for both free and paid users as of February 6th.
  • The summarized Chain of Thought might hinder debugging as it doesn't provide full transparency, unlike deep seek R1.

2. 🔍 Introduction of Deep Research by OpenAI

2.1. Launch and Availability

2.2. Naming and Functionality

2.3. User Experience and Value

2.4. Cost vs. Value

2.5. Performance Metrics

2.6. Research Capabilities

2.7. Global Accessibility

2.8. Economic Impact and Future Developments

3. 📰 OpenAI's New Features and Google Announcements

  • Chat GPT search functionality is now universally accessible on chatgp.com without requiring sign-up, providing an easier alternative to traditional search engines like Google.
  • The memory capacity for Chat GPT Plus, Pro, and Team users has been increased by 25%, which is expected to enhance the overall user experience by supporting more complex interactions.
  • OpenAI recently conducted a Reddit AMA with key leaders, including discussions about upcoming projects such as a new image generator and advancements in voice mode technology.
  • Sam Alman, a representative from OpenAI, addressed the need for the company to reassess its open-source strategy, highlighting internal debates and the potential for a strategic realignment.

4. 🌟 Google's Gemini 2.0 and AI Model Comparisons

  • Google released three new AI models: Gemini 2.0 Flash, Flashlight, and Pro, with Gemini 2.0 Pro being their best state-of-the-art model.
  • Gemini 2.0 Flash and Flashlight models have a 1 million token context window, while the Pro model has a 2 million token context window, offering greater processing capacity.
  • The Gemini 2.0 Flash model is priced at 10 cents per million tokens, which is significantly more cost-effective compared to competitors like GPT-4 at $10 per million tokens and Claude 3.5 Sonet at $15 per million tokens.
  • Blind testing positioned the Gemini 2.0 Flash Thinking model as the number one overall model based on user preferences, demonstrating its superior performance.
  • Gemini models occupy three of the top five spots in user preference rankings, surpassing the new OpenAI model 03 Mini, indicating high user satisfaction and preference.
  • The high adoption and preference for Gemini models are reflected in usage rankings, showing they are trending and favored over other AI models.

5. 🤖 Chatbase AI Enhancements for Customer Experience

5.1. Model Rankings and Access

5.2. Chatbase AI for Customer Experience

6. 🔒 Google's Shift in AI Ethics

  • Google has removed its previous pledge not to use AI for weapons and surveillance, marking a significant shift from its original ethical stance which strictly prohibited such applications.
  • This change comes after Google reversed the acquisition terms of DeepMind, which initially included a condition against the use of AI for weapons and surveillance.
  • Key figures involved include Demis Hassabis, CEO of DeepMind, who supports the change due to competitive pressures in global AI leadership and the complex geopolitical landscape.
  • Mustafa Suleyman, co-founder of DeepMind and proponent of the original non-weaponization rule, is now at Microsoft, indicating a shift in leadership and possibly influencing the policy change.
  • The implications of this shift could affect Google's business strategy and global AI ethics, potentially altering perceptions of Google's commitment to ethical AI development.

7. ⚡ Fast Outputs from Mistral AI and Chatbot Developments

  • Mistral AI's chatbot, available at chat.m.ai, offers functionalities similar to ChatGPT, including web search, image generation, code interpretation, and a canvas mode for code and writing.
  • The Pro Plan costs $15 per month and provides additional access and reduced message limits, but the free version remains highly functional.
  • Mistral AI's chatbot is noted for its speed, capable of producing 1,000 tokens per second, making it exceptionally fast.
  • A video demonstration showed the chatbot generating a 'kawaii calculator' in real-time, with follow-up tasks like creating a nature-themed calculator completed in seconds without speeding up the video.
  • The chatbot's capabilities, including generating a functioning calculator in HTML in 2 seconds, are available for free to all users.

8. 🛡️ Anthropic's Security Challenges and Amazon Alexa Updates

8.1. Anthropic's Security Challenges

8.2. Amazon Alexa Updates with Anthropic's AI

9. 🛠️ GitHub Copilot's New Agent Mode

  • GitHub Copilot's new agent mode can iterate on its own code, recognize errors, and fix them automatically.
  • It can suggest terminal commands and ask for user execution, enhancing user interaction.
  • The mode includes self-healing capabilities by analyzing runtime errors, indicating the use of reasoning models.
  • Agent mode not only performs requested tasks but also infers and completes additional necessary subtasks.
  • The feature reduces manual intervention by catching its own errors, improving user efficiency.
  • These enhancements provide quality of life improvements by automating error correction and terminal interactions.

10. 📈 Cursor's Rapid Growth as a SaaS Company

  • Cursor achieved $100 million in annual recurring revenue within one year, making it the fastest-growing SaaS company in history.
  • In comparison, DocuSign took 10 years to reach the same revenue milestone, highlighting Cursor's rapid growth.
  • Cursor's tools empower users globally to create software solutions for personal and professional workflows, even without coding knowledge.
  • The ability to quickly build tools, such as a file conversion app in 15 minutes, demonstrates Cursor's capacity to save time and enhance productivity.
  • Cursor's success is attributed to its democratization of software creation, enabling non-coders to develop functional applications.

11. 🎨 Image Editing with Grok on X

  • Users can edit images directly in Grok on X by generating an image and then selecting it for editing.
  • To edit, users click the 'edit with Gro' button, allowing them to input specific prompts for changes.
  • Examples include altering the color of the sky or other specific image modifications.

12. 🎥 Pika Labs' AI Video Innovations

12.1. Pika Labs' Feature: Peak Editions

12.2. Pika Labs' Feature: Pika Scenes

13. 🚀 Video Upscaling with Topaz Labs' Project Starlight

  • Topaz Labs released Project Starlight, the first diffusion model for video restoration, transforming low-quality videos into high-resolution versions.
  • Example provided: Muhammad Ali fight video, with a comparison showing significant quality improvement from grainy, pixelated footage to a clear, detailed upscaled version.
  • Additional example: VHS quality video enhanced to a much better resolution, demonstrating the model's capability.
  • Project Starlight is in early access; engagement through likes and comments may be required for access.

14. 🔬 New Research in AI Deepfakes and Video Models

  • The Omnium tool enables the creation of deepfakes using just a single image and audio file, facilitating the synthesis of realistic videos with minimal input. This technology has significant potential implications for the media industry, particularly in content creation and personalization.
  • Video Jam enhances video model training by improving coherence and understanding of physics in video synthesis, leading to more realistic representations of human movement. This advancement could revolutionize fields such as virtual reality and gaming by providing more lifelike and interactive experiences.
  • The new training methods developed in Video Jam are expected to be incorporated into existing tools like Runway and Pika. This integration could significantly enhance the capabilities of these platforms, allowing for more sophisticated video models and broader applications in various industries.

15. 🎶 The Beatles Win Grammy with AI-Assisted Song

  • The Beatles won a Grammy for a song created with AI technology, highlighting the growing role of AI in music production and innovation.
  • The award signifies a pivotal moment in the music industry where traditional and AI-assisted creativity are blending.
  • This achievement may inspire further integration of AI in artistic processes, potentially leading to new legislative considerations regarding AI's role in creative works.
  • The use of AI in this context shows potential for both innovation and ethical considerations, prompting discussions on regulation and artistic integrity.

16. 🎥 Channel Updates and AI Tool Resources

  • The Beatles won a Grammy using AI technology to enhance John Lennon's old vocals, highlighting the potential of AI in music production.
  • The channel offers weekly breakdown videos covering significant AI news, aiming to keep viewers informed about the latest developments.
  • Experimentation with new video styles, thumbnails, and titles is ongoing, with viewer feedback encouraged to improve content delivery.
  • Futur Tools website provides a curated list of AI tools, updated daily, with easy filtering options to find specific tools for various needs.
  • A newsletter is available to deliver the latest AI news and tools updates directly to subscribers' inboxes twice a week, along with access to an AI income database.
View Full Content
Upgrade to Plus to unlock complete episodes, key insights, and in-depth analysis
Starting at $5/month. Cancel anytime.