The market's attention has definitively pivoted to visual AI. Where text-based models once drove initial curiosity, it is now the ability to generate or manipulate images and video that captivates mobile users and drives download spikes. This signifies a maturation of basic conversational AI, pushing novelty into the multimodal domain.
While companies like Google and Meta are attracting downloads with their visual model releases, the intelligence reveals a clear monetization gap. OpenAI stands apart by converting its visual engagement into substantial revenue. This indicates a superior product experience or a more effective monetization strategy that its competitors have yet to crack, positioning OpenAI as the leader in tangible commercial impact from multimodal AI.
Expect a wave of visual AI feature integrations across all major app categories, driven by the immediate download bump. However, most will fail to replicate OpenAI's revenue success, leading to a crowded field of visually-enabled apps that struggle to retain paying users, eventually commoditizing visual AI features unless truly differentiated value is offered.