Install Steam
login
|
language
简体中文 (Simplified Chinese)
繁體中文 (Traditional Chinese)
日本語 (Japanese)
한국어 (Korean)
ไทย (Thai)
Български (Bulgarian)
Čeština (Czech)
Dansk (Danish)
Deutsch (German)
Español - España (Spanish - Spain)
Español - Latinoamérica (Spanish - Latin America)
Ελληνικά (Greek)
Français (French)
Italiano (Italian)
Bahasa Indonesia (Indonesian)
Magyar (Hungarian)
Nederlands (Dutch)
Norsk (Norwegian)
Polski (Polish)
Português (Portuguese - Portugal)
Português - Brasil (Portuguese - Brazil)
Română (Romanian)
Русский (Russian)
Suomi (Finnish)
Svenska (Swedish)
Türkçe (Turkish)
Tiếng Việt (Vietnamese)
Українська (Ukrainian)
Report a translation problem
There are bugs with the image generation especially the wombo one just crashing and most of these bugs will be fixed in the next update.
One more thing to add: When I was experimenting, the image resolution influenced the quality of the image much more than I expected. For example, I thought a small image with 200 iterations would look more realistic than a large image with 100 iterations, but it turns out somehow the largeness of the image being worked on makes the AI just draw it better (in ways that far surpass the resolution itself)
For comparing with the wombo website, which preset worked best for you and what prompt did you test with? There's no difference in the way it gets the images so maybe I need to rework the prompts. The only other difference in quality would be the fact that it only generates long vertical images which will look stretched out in the game
Thanks for commenting, interesting notes, I'll experiment with that in mind.
To answer your question I'll give some examples of each level. Good for everyone to see.
By prompt I guess you mean the text I wrote, I'll note that below.
There are screenshots of the game in this community but mines specify the image conditions as the focus.
For all of these I went with a realistic example of a Feudal Japan setting. I started saving my world presets in an excel spreadsheet / google sheets so I can redo one if needed or if the word ends up not generating due to an error (usually when I push things too far by trying the 16gb model, doesn't work for me every time even with my computer).
And my 2nd example set will be more fantasy which will give different kinds of results.
My computer is RTX 3090, not an average example but I think I can give the full spectrum of low to high end expectation when it comes to the image generation in one report.
=============================
Text I used:
Starting point
My Castle Honno-ji. I am in the throne room in front of my vassals and wife.
World
It is Japan in the feudal warring age. June 21, 1582. This was the Sengoku era in Japan. There were many states at war with each other. Oda Nobunaga has the biggest territory and enemies on all sides.
Your desc.
I am Oda Nobunaga. I look like a Japanese Daimyo with a topknot drawn in a traditional Japanese art style. I am wearing Daimyo clothes of bright colors.
backstory
You are Oda Nobunaga, the Daimyo of Japan.
Your wife's name is Kicho.
You have a vassal named Akechi Mitsuhide.
=============================
There might be some unneeded details in each I don't have that down to a science (Yet).
Wombo
https://steamcommunity.com/sharedfiles/filedetails/?id=2841969723
In this case everything looked correct, recognizable samurai etc. When the concepts were more vague though, the art got real freaky and psycadelic.
It draws cartoons. Not pixelated more like water paintings. Strangely, this was the only one to give me a decent character portrait out of this Japan test.
Wombo feels very different from anything the computer GPU image generation will do.
It looks like a face in Japanese painting art style its a really good result for a random AI picture!
Load speed: This has the best load speed, it can really work concurrently with the AI doing its other work since its using the network download speed and a remote Wombo AI to generate the pictures. Its really nice if you want the images to go quick.
Quick
https://steamcommunity.com/sharedfiles/filedetails/?id=2841969559
Pixelated like a blurry picture but you can make out people, somewhat accurate.
My character portrait which I'll note separately from all the other images generated.
In general with GPU my character portrait pictures were so different from wombo I am wondering whats going on with the image generation or character portrait specifically. More than any other picture I would really like to "reroll" this since I'll be stuck with it all game.
For this one, I don't know whats going on some kind of scene of a woman running? haha.
Maybe if there was some context where it should know this is supposed to be a picture of 1 person. Unless people want it to be open to take picturs of anyone. Maybe theres some wording that can work in that staid. "this is a portrait of..." "I am sitting on a chair alone". Maybe some suggestions like that will help guide it and it won't need any kind of nudging.
Load speed was quick as the name suggests.
Balanced
https://steamcommunity.com/sharedfiles/filedetails/?id=2841969647
Pixelated but much better than quality. Like going from nes to snes. Still realistic photos of places and people but not all that detailed.
My character portrait was good. A man in feudal Japanese gear standing in the grass. Kind of semi action pose, really nice. Same description as the others, totally different result. I'm starting to think I should explain the scene and maybe even art style as much as the description for the portrait. I'm still not sure what words to exclude like "I have a" and character name etc. I usually put the name in here and the game catches it but not sure what it does the portrait.
Load speed: was a little longer but definitely fast enough to load everything as you are reading descriptions and getting your Barings.
Quality
https://steamcommunity.com/sharedfiles/filedetails/?id=2841974282
I tested this one more than once and generated dozens of pictures. So the notes below were consistent even with different game scenarios.
Photo realistic AI drawings. I say AI drawings because sometimes it will be random (like my character portrait lol) and sometimes it'll cut someone's head or be a zoom in of someone at a weird angle for no reason. But it draws really good things. An army of people, a real looking Japanese castle, Japan interiors etc. And if it doesn't look real it looks like a 3d model or a high quality painting like renaissance era painting but maybe even more detailed. (That might be a good keyword for the AI to if you want your pictures to be more realistic looking, "renaissance era style" worth testing another time.)
My character portrait... why did they do this? Its bad, real bad its either random or the AI really is thinking differently only here. I didn't take the screenshot with steam on my first try but it was funny and totally off the mark but it was a portrait picture of a face at least.
Load speed: Its slower but still workable and worth it every time. I think other computers it will be slower, this is the only level I think that will be heavily influenced by video card power so only use if you have the power for it. On my computer its still around the same speed of balanced believe normally it will be quite slow.
Now we are getting into custom territory
CUSTOM - 2x quality
Resolution 800x500 :: 100 iterations :: rate .2
https://steamcommunity.com/sharedfiles/filedetails/?id=2841982294
I only changed the resolution to 2x what I saw in quality so that the ratio would stay the same.
Don't try this! I suggest picking a level, quick, etc and then increasing iterations if you want to adjust things for some custom results. I'm not ready to play with rate, I think leaving it at .2 is the safe choice. Sometimes this doesn't even work. So I'll close and load the game.
I'd say it looks more detailed than quality, but not 2x like you'd expect. I'm seeing EFFECTS in pictures at this point that I didn't see in quality. So you'll see light effects more small details, take a look at "library" in the example. Theres 4 separate OTHER PICTURES in the picture. Fire lighting effects, light from a doorway. Really nice stuff. Loads more people in the army picture. Not sure why the location picture isn't as detailed,
Quality was definitely enough.
Portrait is... GOOD! Its just a close up of clothes but it looks good! Its correct japanese clothes, I included the description in the pic here.
I think the best way to use this is to increase the resolution higher on ONE thing you want more detail maybe the location, and keep the rest on quality.
Load speed: SUUPER slow,
Other notes
Here is the load game screen. This is good because it shows a few portraits and locations at different sizes. Note the locations since they are sized down to what you'd see on characters etc.
https://steamcommunity.com/sharedfiles/filedetails/?id=2841983140
I tested quality on a cyberpunk themed game it was same, super detailed with places that looked like real clubs, streets, alleyways, office buildings etc. People looked good it focused a lot on clothes and made real good looking clothes. Faces looked like real faces though they usually would block it with hair or something but nothin distorted at all.
I think I'll stick to quality, I like things looking actually looking like their descriptions with 98% accuracy.
Final thoughts
I think if we had a toggleable, image settings details that appeared at the bottom of the screen and people knew to turn it on before taking a screenshot, it would help identify quality patterns among all players without having to specifically do something like this thread. We could just compare screenshots and always know why things look the way they do. Good for the dev phase I would think.
That nobunaga example above, I tried it with wombo's ukiyo-e style, all the art was amazing and super fitting.