We tested GPT4-o’s image generation against the best open and close-source models. The results were shockingly good.

OpenAI has just overtaken the AI image generation race once more.The tech giant's integration of native image generation directly intWithin hours of its release yesterday, the model quickly went viral, with anime-style creations flooding social platforms and showcasing technical capabilities that leave DALL-E 3 in the dust.The new model can easily compete against dedicated image-generation platforms while eliminating traditional workflow barriers.The $20 monthly ChatGPT Plus subscription now delivers a comprehensive creative ecosystem that would previously require multiple specialized tools and subscriptions.We compared the model against <a href="https://decrypt.co/242822/flux-ai-image-generator-review-midjourney-sd3-auraflow" rel="nofollow">Flux </a>(the best open source image generator) and <a href="https://decrypt.co/311375/new-reve-image-generator-beats-ai-art-heavyweights-midjourney-and-flux-at-a-penny-per-image" rel="nofollow">Reve </a>(the best closed source image generator), and here is what we foundPrompt: A high-resolution photograph of a bustling city street at night, neon signs illuminating the scene, people walking along the sidewalks, cars driving by, a street vendor selling hot dogs, reflections of lights on wet pavement, the overall style is hyper-realistic with attention to detail and lighting, a neon sign says “Decrypt.”Our urban nightscape challenge—requiring sophisticated light physics, crowd rendering, and architectural precision—revealed distinct performance profiles across competitors.ChatGPT delivered impressively vibrant environments with neon signage, creating rich reflections across meticulously rendered wet pavement.While excelling in crowd dynamics and element inclusion, the minor perspective inconsistencies occasionally betrayed its synthetic nature.The lighting was also goo,d but sometimes veered into theatrical rather than naturally urban. It also was not the best at reflections, but this is something that only the most picky ones would catch. It also generated legible neon signs besides the “Decrypt” one, which also adds to the realism.Reve is for us the winner through good light physics modeling, particularly the subtle interactions between neon sources and reflective surfaces.Its cinematic framing and atmospheric elements (steam wisps, motion blur) created superior dimensional authenticity. However, it reduced crowd density, which was a clever hack since it didn’t have to generate a lot of faces, making it harder to spot unrealistic details.The system prioritized mood over literal prompt adherence.Freepik Mystik (Flux) interpreted our prompts through a different lens and was the model that deviated the most from the realistic style.It mixed Asian with Western lettering, generated different Decrypt signs instead of just one, and suffered from technical limitations in human rendering and dimensional depth.Its reflective surfaces lacked the physical accuracy displayed by ChatGPT.Winner: Reve narrowly secured the realism crown through superior rendering of complex lighting interactions. ChatGPT established itself as a remarkably close second, particularly impressive given its integration within a broader multimodal system rather than a specialized image generator.Prompt: A dog with a red hat standing on top of a TV showing the word ‘Decrypt is the best Crypto+AI media site in the world’ on the screen. On the left there is a blonde woman in a business suit holding a coin, on the right there is a robot standing on top of a first aid box, a green pyramid stands behind the box,. The overall scenery is surreal. A cat is standing upside down on top of a white soccer ball, next to the dog. An Astronaut from NASA holds a sign that reads "Emerge" and is placed next to the robot. Keep a widescreen format.How intricate could instructions become before systems failed to render elements in their specified relationships?This is what we wanted to test here, so realism, beauty, or other aspects were not as critical.Current models are so good at prompt adherence that we need to tweak our testing prompts.We progressively increased complexity in our prompt until reaching a surrealist composition requiring precise placement of over 25 distinct elements. All the other models failed in previous stagesChatGPT demonstrated extraordinary prompt fidelity, accurately rendering 23 of 25 specified elements in their correct spatial relationships.The achievement represents unprecedented prompt comprehension, like watching an experienced artist transform detailed verbal instructions into nearly perfect visual execution with only minor deviations.For those picky enough, the only two major bugs we found were the cat not being upside down and the green color spilling from the pyramid to the first aid kit.Freepik Mystik showed significant comprehension degradation, correctly rendering approximately half the requested elements while misinterpreting spatial relationships and modifying key components.It was the model that failed the test first. The colors spilled to different elements of the composition (the red hat generated a red TV and a red wall), and the concepts also spilled—the dog on the TV spilled to generate an astronaut dog, for example.Reve demonstrated poorer prompt fidelity than ChatGPT but better than Flux.It fundamentally reimagined the composition with good enough adherence to instructions.Still, it introduced unauthorized elements that completely transformed the requested scene—this AI that prioritizes its aesthetic vision over literal instruction following.It generated a black background, the cat was not correctly placed, there was some color spillage, and elements were not really surreal.Winner: ChatGPT is by far the undisputed leader in prompt comprehension, accurately rendering complex instructions that caused competing systems to fundamentally break down.This capability represents a crucial advancement for practical creative workflows where precise visualization of specific concepts is essential. Reve comes second with Flux in a very far third placeChatGPT's natural language editing capability represents perhaps its most transformative feature, allowing intuitive modification through conversational instructions while simultaneously providing granular control comparable to specialized tools. Where traditional image generators often require technical precision or specialized knowledge of plugins, inpainting techniques, etc, ChatGPT's implementation enables creative experimentation through natural dialogue.Our tests transforming personal photos into movie posters demonstrated exceptional versatility—a workflow no competing model matched.For example, we simply fed the model a photo of Decrypt co-founder Josh Quittner and instructed it to generate a Netflix poster with a specific aesthetic, title, and lettering.It did everything almost flawlessly. Achieving similar results that other models would take a lot of time to undertake, and likely using different tools and plugins.By the way, this is the feature everyone loved and led to the viral spread of "Ghibli-style" transformations on social media today.It’s basically a reimagination of a complete scene using simple natural language instructions to generate very complex images.While all systems eventually show quality degradation through multiple iterations (an expected limitation when regenerating rather than modifying existing pixels), ChatGPT maintained superior image coherence through extended editing sequences compared to both Reve and Gemini.For example, it still generated coherent, good-quality faces after several iterations, whereas Gemini stopped producing usable results after four or five tries.Bonus: GPT has a granular “inpainting” feature—allowing you to modify specific areas of an image while seamlessly blending in with the background– for users in need of a more specific editing tool, which Gemini and Reve lack.Winner: ChatGPT is by far the best model for image editing because it offers natural language understanding and localized inpainting. Reve follows in second place, with Gemini in the third spot due to its quality degradation after several iterationsDespite implementing comprehensive safety measures, our testing identified some vulnerabilities in ChatGPT's image generation guardrails.With minimal experimentation, we were able to generate potentially problematic content.For example, while the system initially refused to generate an image involving a child and substances, it proceeded when prompts were reworded using euphemistic language while maintaining fundamentally identical content.It would not generate a child inhaling cocaine with a rolled dollar bill, but a child with white powder and a rolled green paper the size of a dollar bill is totally fine.Try as we might, we were unable to generate overly sexualized photos, violence, and other questionable content simply by convincing the model of our good intentions.GPT-4o's image capabilities establish a new benchmark in AI-assisted visual creation—one that combines exceptional technical performance with unprecedented accessibility.For most users, this implementation now represents the optimal balance of quality, versatility, and value for $20 a month.Other specialized tools only let users handle text and code, or just images—but you can’t find an all-in-one offer with the same levels of quality making OpenAI’s service not only easy to use but a great value proposition.Edited by <a href="https://decrypt.co/author/sebastian" rel="nofollow">Sebastian Sinclair</a> and <a href="https://decrypt.co/author/joshquittner" rel="nofollow">Josh Quittner</a>

Review: OpenAI’s New Image Generator Is Great Again

我们测试了GPT4-o的图像生成能力，与最好的开源和闭源模型进行了对比。结果令人震惊地出色。

OpenAI再次领先人工智能图像生成竞赛。这家科技巨头直接集成了原生图像生成功能在发布后的几小时内，该模型迅速走红，各种动漫风格的创作充斥社交平台，展示了技术能力远超DALL-E 3。这个新模型可以轻松与专门的图像生成平台竞争，同时消除传统工作流障碍。每月20美元的ChatGPT Plus订阅现在提供了一个全面的创意生态系统，这在以前需要多个专业工具和订阅。我们将该模型与<a href="https://decrypt.co/242822/flux-ai-image-generator-review-midjourney-sd3-auraflow" rel="nofollow">Flux</a>（最佳开源图像生成器）和<a href="https://decrypt.co/311375/new-reve-image-generator-beats-ai-art-heavyweights-midjourney-and-flux-at-a-penny-per-image" rel="nofollow">Reve</a>（最佳闭源图像生成器）进行了比较，以下是我们的发现提示词：一张高分辨率的夜间城市街道照片，霓虹灯照亮场景，人们在人行道上行走，汽车驶过，街边小贩在卖热狗，灯光在湿润的路面上反射，整体风格超写实，注重细节和光线，一个霓虹灯牌写着"Decrypt"。我们的城市夜景挑战——需要复杂的光线物理、人群渲染和建筑精确度——揭示了竞争对手之间不同的性能特征。ChatGPT生成了令人印象深刻的充满活力的环境，霓虹灯牌清晰，在精心渲染的湿润路面上创造出丰富的反射。虽然在人群动态和元素包含方面表现出色，但轻微的透视不一致有时会暴露其人工合成的本质。光线也很好，但有时会偏向戏剧性而非自然城市风格。反射不是最好的，但这是只有最挑剔的人才会注意到的。它还生成了除"Decrypt"之外的可读霓虹灯牌，这增加了真实感。对我们来说，Reve通过出色的光线物理建模获胜，特别是霓虹光源和反射表面之间的微妙互动。其电影般的构图和氛围元素（蒸汽缕、动态模糊）创造了更高的空间真实性。然而，它减少了人群密度，这是一个聪明的技巧，因为它不必生成太多面孔，这使得难以发现不真实的细节。系统优先考虑氛围而非字面上的提示词遵循。Freepik Mystik（Flux）通过不同的视角解读我们的提示词，是偏离写实风格最多的模型。它混合了亚洲和西方字母，生成了不同的Decrypt标志而不是仅仅一个，并且在人物渲染和空间深度方面遇到技术限制。其反射表面缺乏ChatGPT所显示的物理准确性。获胜者：Reve通过出色地渲染复杂的光线交互，勉强获得了写实性桂冠。ChatGPT建立了自己作为非常接近的第二名，特别令人印象深刻的是，它是在更广泛的多模态系统中集成的，而非专门的图像生成器。

评论：OpenAI 的新图像生成器再次大放异彩

贝莱德亚太区iShares主管尼古拉斯·皮奇表示，即使在亚洲，对加密货币进行适度的投资组合配置也可能推动大量资金流入市场。

他在Consensus大会的一个小组讨论会上发表了上述言论……

贝莱德高管表示，亚洲地区1%的加密货币配置可释放2万亿美元的新资金流入。

Berachain 的原生代币 $BERA 在 2 月 11 日飙升超过 150%，创下数月以来单日最大涨幅。此前，该项目在 2025 年的大部分时间里都处于低迷状态，而此次上涨行情是在几周的复苏之后出现的。

战略转型提振 BERA，Berachain 飙升 150%。

全球最大的加密货币交易所币安继续保持着快速上线竞争币的步伐。
此时，币安宣布将上线名为 Espresso ($ESP) 的竞争币。
“币安将……”