Waveform: Google I/O 2024, Camera Debates & Sony's AI Fiasco

·2h 01m
Shared point

The State of Smartphone Cameras and AI Editing

The Problem with Computational Photography

• Smartphones from US manufacturers are increasingly over-processing images, resulting in "flat" photos with crushed highlights and shadows.
• The goal of these companies is often to create utility photos (e.g., of signs or menus) rather than artistic ones.
• Modern computational photography algorithms often struggle to apply contrast, which is essential for realistic, 3D-looking images.

The Sony and Google AI Backlash

• Sony caused a massive stir on social media with an AI feature demonstration that was widely criticized for producing poor, over-processed results.
• Google’s announcements at the Android Show and Google I/O faced similar backlash, with users noting that their "before and after" image editing examples looked aggressively unnatural.
• The panel discussed how non-creative users often equate "brighter" with "better," causing manufacturers to prioritize brightness and information density over tone curves and artistic quality.

Deep Dive: Google I/O 2024

Model Overload and Naming Issues

• The hosts expressed frustration with Google's fragmented naming conventions, noting that disparate teams often release products with conflicting or confusing branding (e.g., Google Photos vs. Google Pics).
• The emphasis on agentic AI and token usage numbers felt more like a corporate flex than a benefit to average consumers.

New Features and Tools

Gemini 3.5 Flash: A faster, more efficient model aimed at enterprise use.
Gemini Omni World Model: A multimodal tool capable of interpreting text, audio, and video to generate interactive scenes.
Gemini Spark: A cloud-based personal agent designed to run in the background and manage user data autonomously.

Smart Glasses and Future Hardware

• The panel analyzed the transition to Android XR and the new audio-focused smart glasses powered by Gemini.
• Key concerns were raised about privacy, specifically the need for recording indicators to prevent unwanted filming in public spaces.
• There is an ongoing debate about whether AI assistants should be proactive (ordering coffee, scheduling) or should remain reactive, given the potential for error and the loss of user agency.

"I think a lot of people just feel really insulted given how many serious flaws there are in the world right now when people like Google and Meta come out and be like, 'No, the biggest thing we need to solve is how do we get your Google Docs to write themselves.'"

Topics

Chapters

6 chapters
Waveform: The MKBHD Podcast
AI chat — answers grounded in episodes