四大AI绘图模型评测:Midjourney、Adobe、SD以及DALLE

四家分别为:

Midjourney V6、Adobe Firefly 3、Stable Diffusion 3、Dalle 3。

编辑搜图

至于评测方式,我依然会从细节质量、审美(构图色彩等)、语义理解这三个维度来评测,剔除掉了风格多样化这个指标(没法测)。

细节质量、审美、语义理解每个类别14个case,总和42个Case(42这个数字的代表意义懂的都懂哈哈哈哈)

同时每个Prompt我会在AI绘图模型中roll3次出12张图,取效果最具有代表性的那个图,尽量减少偏见。同时为了保证公平,基本不会搞特别复杂的prompt。

同时,为了有最后整体可视化的评分让大家看着更直观,所以我会进行打分。在每个案例中,第一名为4分,第二为3分,第三为2分,最后一名为1分,最后计算平均分。

虽然每个case数量都不是很多,但是这也差不多了,而且是我个人的极限了。为了避免文章太长阅读体验极差,我就每个类别只放8个Case来做展示。

OK,让我们开始吧。

一. 细节质量

主要测试AI绘图对于细节的表现能力,比如人物面部皮肤的质感、比如织物纹理的细节、场景细微元素的细节等等,这个是对模型精度和输出质量一个非常重要的考量。

1.Prompt:

Selfie of charming kpop girl, outdoors, evening time, brunette, casual giggle, 2 bun tied hairstyle

编辑搜图

Midjourney > SD3 > Adobe > Dalle

-

2.Prompt:

Portrait of a 2000s blonde woman posing on a sports car, white wired headphones, expressionless, 2000s hairstyle, 2000s fashion, sun rays, light teal and amber,Cinestill 50D

编辑搜图

Midjourney > SD3 > Adobe > Dalle

3.Prompt:

Photo of smiling Labrador wearing sunglasses and straw hat sitting on the beach bench with glass of cocktail, beach scene, realistic

编辑搜图

4.Prompt:

a sports car drifting in a middle of partitions in a festival of vape and there is people around the car vaping, cinematic mood

编辑搜图

SD3 > Adobe > Midjourney > Dalle

5.Prompt:

Realistic illustrations,The drumstick hits the frame and the drum bounces up water droplets

编辑搜图

Midjourney > Adobe > Dalle > SD3

6.Prompt:

a house design inside of the perfect beach house, rustic malibu in style, the beach and surf included in the photos, Photography

编辑搜图

ourney > Adobe > SD3 > Dalle

7.Prompt:

beautiful blonde model made out of porcelain, long hair, wearing sci-fi light mecha armor, in the style of balanced symmetry, white and blue LED lights on armor

编辑搜图

Midjourney > SD3 > Adobe > Dalle

8.Prompt:

Delicious hamburger, floating in the air, food professional photography, studio lighting, studio background

编辑搜图

Midjourney > Adobe > SD3 > Dalle

剩下case略。

在细节质量部分,Midjourney基本以绝对的优势压倒性胜利。

编辑搜图

二. 审美

主要测试AI绘图的审美能力,一张图好不好看,是美是丑,除了细节之外,更多的还需要看模型的审美能力,比如构图、色彩、光影等等,审美强,出的图才好看。

Creatures from the Book of Mountains and Seas of China, a golden alien tiger with a resting bird on its back, attack posture, with light and golden particles emitting in the air

编辑搜图

Midjourney > SD3 > Dalle > Adobe

2.Prompt:

A strong man riding a steel dragon flying in the sky, panorama, steel mecha, futuristic tech wind

编辑搜图

Midjourney > Dalle > SD3 > Adobe

An abstract three-dimensional sculpture in the shape of an orchid, composed of gemstones and frosted viscous materials, in the style of tesseract, light-filled, sparkling water reflections, sunrays shine upon it

编辑搜图

Midjourney > Adobe > SD3 > Dalle

woman smiling and having a cup of 7-eleven coffee outside a 7-eleven convenience store in the morning in the style of 90's anime, 1990s anime texture and colors, thick line work

编辑搜图

fantasy greatsword made from crimson metal, oil painting

编辑搜图

a dark ocean with great Sturm, Captive Souls Pirate's Redemption, ship emerging out of the fog, Giant octopus reaching out of the waters to pull down the ship

编辑搜图

warhammer 40K, Islamic space marine, white armor, black and gold trim,  matte paintin

编辑搜图

Midjourney > SD3 > Adobe > Dalle

oil painting of an angel with wings spread above the forest, light beam from its eyes illuminates path in bright green and blue colors

编辑搜图

Midjourney > Adobe > SD3

在审美部分,Midjourney依然以绝对的优势压倒性胜利,而以设计起家的Adobe,反而拉了最大的跨。

编辑搜图

三. 语义理解

主要测试AI绘图对于复杂语义的理解能力,能否将文本内容都能清晰的表达出来并保证生成图片的质量。

Portrait photograph of an anthropomorphic tortoise seated on a New York City subway train

编辑搜图

Dalle > Midjourney

A businessman on a throne. The AI agents gathered behind him like royal guards. Photo Real

编辑搜图

Dalle > Midjourney

A cup of coffee sitting on a table in front of a window, outside the window is a futuristic city; a futuristic monorail can be seen close by, many lush plants around, shot from ground floor, clouds above

编辑搜图

Dalle > Adobe > SD3 > Midjourney

A hyper-realistic image of an anthropomorphic corn cob working as a cashier at a convenience store, depicted with a cheerful expression while laughing. The corn cob, dressed in the store's uniform, features a friendly face with eyes and a mouth on the husk, showing a big, joyful smile. The scene captures the corn cob scanning items at the cash register, wearing a typical convenience store uniform that includes a neat polo shirt and a name tag

编辑搜图

Editorial photography of astronaut cooking Christmas colorful chocolate honey cookies on spaceship, Christmas honey cookies floating around astronaut, no gravity, in spaceship, levitated

编辑搜图

a close up hyper realistic image of a medieval knight facing off against the grim reaper. Dramatic lighting

编辑搜图

Dalle = Midjourney > Adobe > SD3

a very pretty young woman smilling flying over an aztec city with a dog, both the woman and the dog are flying, she is wearing an aztec outfit, the dog is wearing a colourful collar. they both seem to be having fun, ultra realistic

编辑搜图

> Adobe > SD3

dungeons and dragons, high detailed, fantastic realism, female centaur with unicorn horn on head, hyper realistic

编辑搜图

> SD3 >  Dalle> Adobe

Dalle3和Midjourney基本上处于领先地位,Dalle还是领先一筹。Adobe继续垫底。

编辑搜图

最后总结

在四个大模型三个维度评完了以后,我相信大家应该能对这几个大模型有大概的了解了。

但是为了更直观一些,我再来做个雷达图吧。

编辑搜图

细节质量方面,MJ V6 > SD3 > Adobe Fiefly 3 > Dalle 3。

审美方面,MJ V6 > SD3 >  Dalle 3 > Adobe Fiefly 3。

语义理解方面,Dalle 3 > MJ V6> SD3 > Adobe Fiefly 3。

MJ依然稳坐头把交椅,很多人跟我说,啥XX大模型在什么什么参数评测中已经超越了MJ啥啥的,我每次都点点头:哦。

而Adobe Fiefly 3的全面拉胯以至于我几度怀疑自己是不是选错了模型,直到我再三确认我选的确实就是Fiefly  Image 3预览版。

就...拉胯的令人难以置信。

而SD3至少在我以API方式接入使用下,也没有很多自媒体或者其他人吹的那么神乎其神。

希望这个评测,能抛砖引玉吧,让大家对AI绘图综合有一些了解。

更建议的是,自己上手去试试。

又跑了十几个小时,虽然跟大家说的是只有42个Case,但是背后跑了不知道多少。希望能对大家有所帮助吧。

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

小菜狗编程笔记

你的鼓励将是我最大的动力!

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值