We tested GPT-4o’s image generation against the best open- and closed-source models. The results were shockingly good.

OpenAI has just taken the lead in the AI image generation race once more.

The tech giant's integration of native image generation directly into… Within hours of its release yesterday, the model went viral, with anime-style creations flooding social platforms and showcasing technical capabilities that leave DALL-E 3 in the dust.

The new model can easily compete against dedicated image-generation platforms while eliminating traditional workflow barriers.

The $20 monthly ChatGPT Plus subscription now delivers a comprehensive creative ecosystem that would previously have required multiple specialized tools and subscriptions.

We compared the model against <a href="https://decrypt.co/242822/flux-ai-image-generator-review-midjourney-sd3-auraflow" rel="nofollow">Flux</a> (the best open-source image generator) and <a href="https://decrypt.co/311375/new-reve-image-generator-beats-ai-art-heavyweights-midjourney-and-flux-at-a-penny-per-image" rel="nofollow">Reve</a> (the best closed-source image generator), and here is what we found.

Prompt: A high-resolution photograph of a bustling city street at night, neon signs illuminating the scene, people walking along the sidewalks, cars driving by, a street vendor selling hot dogs, reflections of lights on wet pavement, the overall style is hyper-realistic with attention to detail and lighting, a neon sign says “Decrypt.”

Our urban nightscape challenge, which requires sophisticated light physics, crowd rendering, and architectural precision, revealed distinct performance profiles across competitors.

ChatGPT delivered impressively vibrant environments with neon signage, creating rich reflections across meticulously rendered wet pavement. While it excelled in crowd dynamics and element inclusion, minor perspective inconsistencies occasionally betrayed its synthetic nature. The lighting was also good, but sometimes veered into theatrical rather than naturally urban.
It was not the best at reflections either, but only the pickiest viewers would catch that. It also generated legible neon signs besides the “Decrypt” one, which adds to the realism.

For us, Reve is the winner thanks to its good light physics modeling, particularly the subtle interactions between neon sources and reflective surfaces. Its cinematic framing and atmospheric elements (steam wisps, motion blur) created superior dimensional authenticity. However, it reduced crowd density, which was a clever hack: with fewer faces to generate, unrealistic details are harder to spot. The system prioritized mood over literal prompt adherence.

Freepik Mystik (Flux) interpreted our prompts through a different lens and was the model that deviated the most from the realistic style. It mixed Asian with Western lettering, generated several different Decrypt signs instead of just one, and suffered from technical limitations in human rendering and dimensional depth. Its reflective surfaces lacked the physical accuracy displayed by ChatGPT.

Winner: Reve narrowly secured the realism crown through superior rendering of complex lighting interactions. ChatGPT established itself as a remarkably close second, which is particularly impressive given that it is integrated within a broader multimodal system rather than being a specialized image generator.

Prompt: A dog with a red hat standing on top of a TV showing the word ‘Decrypt is the best Crypto+AI media site in the world’ on the screen. On the left there is a blonde woman in a business suit holding a coin, on the right there is a robot standing on top of a first aid box, a green pyramid stands behind the box,. The overall scenery is surreal. A cat is standing upside down on top of a white soccer ball, next to the dog. An Astronaut from NASA holds a sign that reads "Emerge" and is placed next to the robot.
Keep a widescreen format.

How intricate could instructions become before systems failed to render elements in their specified relationships? This is what we wanted to test here, so realism, beauty, and other aspects were not as critical.

Current models are so good at prompt adherence that we had to tweak our testing prompts. We progressively increased the complexity of our prompt until we reached a surrealist composition requiring precise placement of over 25 distinct elements. All the other models failed at earlier stages.

ChatGPT demonstrated extraordinary prompt fidelity, accurately rendering 23 of 25 specified elements in their correct spatial relationships. The achievement represents unprecedented prompt comprehension, like watching an experienced artist transform detailed verbal instructions into nearly perfect visual execution with only minor deviations. For those picky enough, the only two major bugs we found were the cat not being upside down and the green color spilling from the pyramid onto the first aid kit.

Freepik Mystik showed significant comprehension degradation, correctly rendering approximately half the requested elements while misinterpreting spatial relationships and modifying key components. It was the model that failed the test first.
The colors spilled onto different elements of the composition (the red hat produced a red TV and a red wall), and concepts spilled over too: the dog on the TV led to an astronaut dog, for example.

Reve demonstrated poorer prompt fidelity than ChatGPT but better than Flux. It fundamentally reimagined the composition with good enough adherence to instructions. Still, it introduced unauthorized elements that completely transformed the requested scene; this is an AI that prioritizes its aesthetic vision over literal instruction following. It generated a black background, the cat was not correctly placed, there was some color spillage, and the elements were not really surreal.

Winner: ChatGPT is the undisputed leader in prompt comprehension, accurately rendering complex instructions that caused competing systems to fundamentally break down. This capability represents a crucial advancement for practical creative workflows where precise visualization of specific concepts is essential. Reve comes second, with Flux in a very distant third place.

ChatGPT's natural language editing capability represents perhaps its most transformative feature, allowing intuitive modification through conversational instructions while simultaneously providing granular control comparable to specialized tools. Where traditional image generators often require technical precision or specialized knowledge of plugins, inpainting techniques, and so on, ChatGPT's implementation enables creative experimentation through natural dialogue.

Our tests transforming personal photos into movie posters demonstrated exceptional versatility, a workflow no competing model matched. For example, we simply fed the model a photo of Decrypt co-founder Josh Quittner and instructed it to generate a Netflix poster with a specific aesthetic, title, and lettering. It did everything almost flawlessly.
Achieving similar results with other models would take a lot of time and would likely require different tools and plugins.

By the way, this is the feature everyone loved, and it led to the viral spread of "Ghibli-style" transformations on social media today. It’s basically a reimagination of a complete scene, using simple natural language instructions to generate very complex images.

While all systems eventually show quality degradation over multiple iterations (an expected limitation when regenerating rather than modifying existing pixels), ChatGPT maintained superior image coherence through extended editing sequences compared to both Reve and Gemini. For example, it still generated coherent, good-quality faces after several iterations, whereas Gemini stopped producing usable results after four or five tries.

Bonus: For users in need of a more specific editing tool, GPT has a granular “inpainting” feature, which lets you modify specific areas of an image while blending the changes seamlessly into the background. Gemini and Reve lack this.

Winner: ChatGPT is by far the best model for image editing because it offers natural language understanding and localized inpainting.
Reve follows in second place, with Gemini in third due to its quality degradation after several iterations.

Despite implementing comprehensive safety measures, our testing identified some vulnerabilities in ChatGPT's image generation guardrails. With minimal experimentation, we were able to generate potentially problematic content. For example, while the system initially refused to generate an image involving a child and substances, it proceeded when prompts were reworded using euphemistic language while keeping the content fundamentally identical.

It would not generate a child inhaling cocaine with a rolled dollar bill, but a child with white powder and a rolled green paper the size of a dollar bill was totally fine. Try as we might, we were unable to generate overly sexualized photos, violence, or other questionable content simply by convincing the model of our good intentions.

GPT-4o's image capabilities establish a new benchmark in AI-assisted visual creation, one that combines exceptional technical performance with unprecedented accessibility. For most users, this implementation now represents the optimal balance of quality, versatility, and value for $20 a month. Other specialized tools only let users handle text and code, or just images, but you can’t find an all-in-one offer with the same level of quality, which makes OpenAI’s service not only easy to use but also a great value proposition.

Edited by <a href="https://decrypt.co/author/sebastian" rel="nofollow">Sebastian Sinclair</a> and <a href="https://decrypt.co/author/joshquittner" rel="nofollow">Josh Quittner</a>

Review: OpenAI’s New Image Generator Is Great Again

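Nicholas Peach, head of iShares for Asia-Pacific at BlackRock, said that even a modest portfolio allocation to cryptocurrency in Asia could drive substantial capital inflows into the market.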

He made the remarks during a panel discussion at the Consensus conference…

BlackRock executive says a 1% cryptocurrency allocation in Asia could unlock $2 trillion in new inflows.

For investors in 2026, the question is no longer "whether to allocate" but "how much to allocate, and through which instruments."

ARK Invest: Bitcoin's Path to Institutionalization

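Binance, the world's largest cryptocurrency exchange, continues its rapid pace of listing altcoins.
Binance has now announced that it will list an altcoin called Espresso ($ESP).
"Binance will…"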