By gerogero
Updated: April 9, 2025
Complex prompt structure and order:
Simple Prompt Example:
1girl, hatsune miku, angel, masterpiece, general
Result: Basic, but creative interpretation
Complex/Upsampled Prompt (using TIPO):
1girl, hatsune miku.
An illustration of a girl with long white hair and wings. she is wearing a school uniform with a red bow on her head and a pair of headphones on her ears. the wings are spread out behind her, creating a sense of movement and energy. the overall style of the illustration is anime-inspired.
solo, skirt, feathered wings, necktie, smile, very long hair, collared shirt, long hair, headset, blue eyes, aqua necktie, looking at viewer, black footwear, black skirt, twintails, grey shirt, bare shoulders, detached sleeves, full body, zettai ryouiki, closed mouth, miniskirt, sleeveless, boots, thighhighs, shirt, standing, wing collar, aqua hair, sleeveless shirt, pleated skirt, angel wings, absurdly long hair, wings, black thighhighs,
masterpiece, general.
Result: More detailed, consistent results with specific elements (wings, uniform details, etc.) being more precisely rendered
With this negative prompt:
worst quality, comic, multiple views, bad quality, low quality, lowres, displeasing, very displeasing, bad anatomy, bad hands, scan artifacts, monochrome, greyscale, twitter username, jpeg artifacts, 2koma, 4koma, guro, extra digits, fewer digits, jaggy lines, unclear
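If you assemble prompts in a script, the layout of the complex prompt above (subject tags, a natural-language description, detail tags, then quality and rating tags) can be sketched roughly like this. This is only a minimal illustration of the ordering; it does not call TIPO, and the function name and parameters are hypothetical.

```python
def assemble_upsampled_prompt(subject_tags, description, detail_tags, quality_tags):
    """Join four prompt sections in the order used by the complex example.

    Hypothetical helper: it only mirrors the layout shown in the guide and
    does not generate or upsample anything by itself.
    """
    sections = [
        ", ".join(subject_tags) + ".",   # e.g. "1girl, hatsune miku."
        description.strip(),             # natural-language paragraph
        ", ".join(detail_tags) + ",",    # long list of detail tags
        ", ".join(quality_tags) + ".",   # e.g. "masterpiece, general."
    ]
    return "\n".join(sections)


prompt = assemble_upsampled_prompt(
    subject_tags=["1girl", "hatsune miku"],
    description="An illustration of a girl with long white hair and wings.",
    detail_tags=["solo", "feathered wings", "angel wings"],
    quality_tags=["masterpiece", "general"],
)
print(prompt)
```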
(masterwork, x, y, z, masterpiece, best quality, hyper-detailed, 8k uhd::1.4),
x = image type, i.e. sketch, photo, portrait, manga page, etc.
y = character name(s)
z = artist (if specified)
(followed by character count (1girl, 2girls), character description, location, action, other details)
Example:
(masterwork, portrait, princess midna, fan no hitori, award-winning, masterpiece, best quality, hyper-detailed, 8k uhd::1.4), 1girl, large breasts, blue eyes, skinny, slim, sexy, gaunt, smile, blue textured bodysuit, cleavage, fur trim, outdoors, castle, looking at viewer, anime coloring, shiny skin,Cinematic Light,
Negative prompt: lowres, worst quality, bad quality, bad anatomy, sketch, jpeg artifacts, signature, watermark, artist name, old, oldest
Steps: 26, baseModel: SDXL, quantity: 4, width: 832, height: 1216, Seed: 2455922073, draft: false, nsfw: true, workflow: txt2img, Clip skip: 2, CFG scale: 6, Sampler: Euler a, fluxMode: undefined
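The x/y/z template above can be filled in programmatically. The sketch below assumes you want the weighted block written exactly as in the template, including the `::1.4` suffix; AUTOMATIC1111-style front ends typically use a single colon for weights, so adjust the separator for your tool. The function names and parameters are hypothetical.

```python
def build_quality_block(image_type, characters, artist=None, weight=1.4, sep="::"):
    """Build the leading weighted block: (masterwork, x, y, z, ...::1.4),

    image_type -> x, characters -> y, artist -> z (optional).
    `sep` is copied from the guide's template; change it if your UI differs.
    """
    parts = ["masterwork", image_type, *characters]
    if artist:
        parts.append(artist)
    parts += ["masterpiece", "best quality", "hyper-detailed", "8k uhd"]
    return f"({', '.join(parts)}{sep}{weight}),"


def build_prompt(image_type, characters, count_tag, details, artist=None):
    """Weighted block first, then character count and the remaining details."""
    block = build_quality_block(image_type, characters, artist)
    return f"{block} {', '.join([count_tag, *details])}"


print(build_prompt(
    image_type="portrait",
    characters=["princess midna"],
    count_tag="1girl",
    details=["blue eyes", "smile", "outdoors", "castle", "looking at viewer"],
    artist="fan no hitori",
))
```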
I’ve been experimenting with Illustrious XL a lot lately and thought I’d share some of what I’ve learned.
My testing has revealed several key differences from other model types like Pony:
On a subjective note, I tend to prefer the images it produces over those from something like Pony. I also find it easier to work with, and I get good images out of it faster.
Based on both testing and official documentation:
As I mentioned above, you definitely need to massage the prompt a bit more than with most other models, at least at this point in time (Illustrious XL v0.1). In practice, that means you need quality tags in your positive prompt and a fairly extensive negative prompt to get it to work well.
From the paper, their actual example quality tags are much simpler:
Technically these are all you need to produce a good image, according to the Illustrious paper. I’ve also corroborated this in my own testing.
However, I’ve found that the following tags produce fairly consistent results when combined with the tags above:
perfect quality, best quality, absolutely eye-catching,
and for more realistic/detailed imagery:
perfect quality, best quality, absolutely eye-catching, ambient occlusion, raytracing,
After a lot of experimentation, I’ve found that this negative prompt works best in nearly every case:
lowres, (bad), bad anatomy, bad hands, extra digits, multiple views,fewer, extra, missing, text, error, worst quality, jpeg artifacts, low quality, watermark, unfinished, displeasing, oldest, early,chromatic aberration, signature,artistic error, username, scan
or as a shorter alternative:
lowres, worst quality, bad quality, bad anatomy, sketch, jpeg artifacts, signature, watermark, artist name, old, oldest
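Long tag lists like these tend to pick up uneven spacing and repeated tags as they get copied around. A small cleanup helper, sketched here under the assumption that tags are separated by plain commas, can tidy them before you paste them into a UI. The function name is hypothetical, and it does not understand weighted groups that contain commas inside parentheses, so keep those out of the input.

```python
def normalize_tags(prompt):
    """Split on commas, collapse whitespace, drop empties and duplicates.

    Order is preserved; duplicate detection is case-insensitive.
    Note: a group such as "(a, b:1.2)" would be split at its inner comma,
    so this sketch is only meant for flat tag lists.
    """
    seen = set()
    tags = []
    for raw in prompt.split(","):
        tag = " ".join(raw.split())  # trim and collapse inner whitespace
        key = tag.lower()
        if tag and key not in seen:
            seen.add(key)
            tags.append(tag)
    return ", ".join(tags)


print(normalize_tags("lowres, (bad), multiple views,fewer, extra,  lowres, early,chromatic aberration"))
```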
Here are a couple of examples showing how about 90% of my prompts look:
1girl, felici4, anatomically correct, proper proportions,
long white hair, large breasts, domino mask, black lips, athletic build,
well-defined standing pose, dynamic pose,
detailed urban environment, night scene, city lights,
glowing blonde hair, blue eyes, looking at viewer, shining skin,
from below angle, professional lighting,
masterpiece, best quality, absurdres, newest
-or-
perfect quality, high quality, masterpiece, absolutely eye-catching, ambient occlusion, raytracing, felici4, 1girl, long hair, white hair, large breasts, (mask), domino mask, blue eyes, black lips, skinny, glowing blonde hair, rainbow inner hair, looking at viewer, shining glossy skin, perfect huge breast, (goosebumps:1.1), solo, excessive sweat, ((from below))
Negative prompt: lowres, (bad), bad anatomy, bad hands, extra digits, multiple views,fewer, extra, missing, error, worst quality, jpeg artifacts, low quality, watermark, unfinished, displeasing, oldest, early,chromatic aberration, signature,artistic error, username, scan
Steps: 24, baseModel: SDXL, quantity: 4, width: 832, height: 1216, Seed: 1934634232, draft: false, nsfw: true, workflow: txt2img, Clip skip: 2, CFG scale: 5.5, Sampler: Euler a,
1girl, ashley_grah4m, anatomically correct, proper proportions,
neon lighting, glowing blonde hair, rainbow inner hair, blue eyes,
well-defined pose, standing pose, balanced pose,
detailed environment, professional lighting, clear composition,
looking at viewer, seductive expression,
masterpiece, best quality, absurdres, newest
-or-
masterpiece, best quality, ashley_grah4m, 1girl, neon, glowing blonde hair, rainbow inner hair, blue eyes,looking at viewer, seductive
Negative prompt: lowres, (bad), bad anatomy, bad hands, extra digits, multiple views,fewer, extra, missing, text, error, worst quality, jpeg artifacts, low quality, watermark, unfinished, displeasing, oldest, early,chromatic aberration, signature,artistic error, username, scan
Steps: 24, baseModel: SDXL, quantity: 4, width: 832, height: 1216, Seed: 1115576560, draft: false, nsfw: true, workflow: txt2img, Clip skip: 2, CFG scale: 5.5, Sampler: Euler a,
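If you want to reuse the generation settings listed under each example, the "key: value" lines can be turned into a dictionary with a few lines of code. This is a minimal sketch that assumes fields are comma-separated and that no value contains a comma; the function name is hypothetical and all values are kept as strings.

```python
def parse_settings(line):
    """Parse a 'Steps: 24, CFG scale: 5.5, ...' line into a dict of strings."""
    settings = {}
    for field in line.split(","):
        if ":" not in field:
            continue  # skips the empty piece left by a trailing comma
        key, value = field.split(":", 1)
        settings[key.strip()] = value.strip()
    return settings


line = ("Steps: 24, baseModel: SDXL, quantity: 4, width: 832, height: 1216, "
        "Seed: 1115576560, draft: false, nsfw: true, workflow: txt2img, "
        "Clip skip: 2, CFG scale: 5.5, Sampler: Euler a,")
settings = parse_settings(line)
print(settings["Sampler"], settings["CFG scale"], settings["width"], settings["height"])
```

Convert values yourself where you need numbers, for example `int(settings["Steps"])` or `float(settings["CFG scale"])`.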
Based on paper findings and testing, backgrounds require specific attention:
detailed environment, [location type], clear composition
[material types], [structural elements], [decorative elements]
[time of day], [lighting type], [atmosphere effects]
Indoor Scenes:
luxurious room, detailed architecture, marble floor, ornate furniture,
crystal chandeliers, tall windows, decorative columns,
warm ambient lighting, soft shadows, volumetric lighting
Outdoor Urban:
detailed cityscape, modern architecture, glass buildings,
city streets, urban details, store fronts,
night scene, neon lighting, street lamps, ambient occlusion
Natural Settings:
detailed landscape, rolling hills, dense forest,
rocky outcrops, flowing water, detailed foliage,
golden hour lighting, atmospheric haze, dynamic clouds
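The three-line background template and the scene examples above follow the same pattern, so it can be sketched as a small helper: the first line sets the environment, the second lists materials and structures, and the third covers time of day, lighting and atmosphere. The function and parameter names are hypothetical and only mirror the bracketed placeholders in the template.

```python
def build_background(location, materials, structures, decorations,
                     time_of_day, lighting, atmosphere):
    """Fill in the three-line background template from lists of tags."""
    lines = [
        ["detailed environment", location, "clear composition"],
        [*materials, *structures, *decorations],
        [time_of_day, *lighting, *atmosphere],
    ]
    # Tags within a line are joined with ", "; lines are joined with ",\n"
    return ",\n".join(", ".join(tags) for tags in lines)


print(build_background(
    location="luxurious room",
    materials=["marble floor"],
    structures=["tall windows", "decorative columns"],
    decorations=["crystal chandeliers", "ornate furniture"],
    time_of_day="night scene",
    lighting=["warm ambient lighting"],
    atmosphere=["soft shadows", "volumetric lighting"],
))
```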
1girl, sprThja, anatomically correct, proper proportions, full body, toned figure, large breasts, black hair, long flowing hair, rabbit ears, dark alluring eyes, confident smile, standing pose, well-defined pose, balanced pose, looking at viewer, purple leotard, deep cleavage, purple leggings, purple gloves, yellow cape, detached collar, luxurious detailed room, marble floor, ornate furniture, detailed architecture, night scene, professional lighting, warm ambient lighting, soft shadows, clear composition, masterpiece, best quality, absurdres, newest
If you want to give them a try, here are some of my own models that I’ve trained on various Illustrious merges as well as on Illustrious itself:
Camera angles and views:
from above,
from below,
close-up,
portrait,
POV,
birds-eye,
wide shot,
isometric,
(add "view" after the tag where it fits)
Lighting:
Cinematic Light,
Hollywood Lighting,
Backlighting,
Rim lighting,
Soft lighting,
harsh lighting,
Dramatic light,
film-style contrast,
soft shadows,
harsh shadows,
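One way to see what the tags above actually do is to append each camera and lighting combination to the same base prompt and generate them all with a fixed seed. The sketch below only builds the prompt strings; the function name is hypothetical and the tag lists are a small subset of the ones in this guide.

```python
from itertools import product

CAMERA_TAGS = ["from above", "from below", "close-up"]
LIGHTING_TAGS = ["Cinematic Light", "Rim lighting"]


def prompt_variants(base_prompt, camera_tags, lighting_tags):
    """Return one prompt per (camera, lighting) pair, in list order."""
    return [
        f"{base_prompt}, {camera}, {light}"
        for camera, light in product(camera_tags, lighting_tags)
    ]


for variant in prompt_variants("1girl, solo, masterpiece, best quality",
                               CAMERA_TAGS, LIGHTING_TAGS):
    print(variant)
```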