A dark fantasy dryad with bark-like wooden skin and skull decorations, featuring intricate details and a haunting forest spirit appearance.
A soul knight with glowing white hair levitates in a misty, moonlit fantasy tavern night setting, surrounded by ghostly fog and shadowy medieval buildings.

Recommended Prompts

photoshoot, photo studio, RAW photo, editorial photograph, film stock photograph, cinematic, posing, amateur photo, analog, raw, f2, 35mm, an (amateur photo), flash photography, taken on an old camera, polaroid, 8k, highly detailed, (high quality, best quality:1.3), Extremely high-resolution, film grain, Kiki from Netherlands, long (waist-length:1.2) combed auburn hair, ((macro lens)), dark atmosphere, ((focused on eyes)), feminine expressions, photography, dslr, 35mm, Fujifilm Superia Premium 400, Nikon D850 film stock photograph, Kodak Portra 400 f1.6 lens, 8k, UHD

amateur photo of a --subject-- --doing something-- / --wearing something--, analogue, raw, polaroid, looking at camera, (from side:1.2), detailed skin texture, natural skin texture, perfect eyes, perfect iris, high detail eyes, detailed iris, detailed cloth texture, detailed facial features, aesthetic, depth of field, cinematic light,(high quality, best quality:1.3), Extremely high-resolution, Kodak Portra 400 camera f1.6 lens, Fujifilm Superia Premium 400, Nikon D850, <lora:add_detail:0.33>

photoshoot, photo studio, RAW photo, posing, editorial photograph, cinematic, film stock photograph, 35mm, 8k, photography, Nikon D850 film stock photograph, UHD

Recommended Negative Prompts

(worst quality, low quality:1.4), illustration, 3d, 2d, painting, cartoons, sketch, blur, blurry, blurred, bokeh, unclear, grainy, low resolution, downsampling, aliasing, dithering, distorted, jpeg artifacts, compression artifacts, overexposed, high-contrast, bad-contrast, poorly drawn, cropped, out of frame, [deformed, disfigured, suspenders], ((deformed iris, deformed pupils:1.4)), (physically-abnormal:2), (joint-inequalities:2), (limbs-inequalities:2), deformed, disfigured, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, mutation, mutated, extra limbs, extra legs, extra arms, dull eyes, mismatched eyes, bad anatomy, signature, watermark, artist name, text, error

(worst quality, low quality:1.4), illustration, 3d, 2d, painting, cartoons, sketch, blur, blurry, blurred, bokeh, unclear, grainy, low resolution, downsampling, aliasing, dithering, distorted, jpeg artifacts, compression artifacts, overexposed, high-contrast, bad-contrast, poorly drawn, cropped, out of frame, deformed, disfigured, dull eyes, mismatched eyes, bad anatomy, signature, watermark, artist name, text, error

Recommended Parameters

samplers

DPM++ 2S a Karras

steps

20 - 40

cfg

4 - 9

resolution

1024x1024, 1024x576, 576x1024, 1536x1536, 1024x1024

Recommended Hires (High-Resolution) Parameters

upscaler

4xLSDIRplusC, 4x Foolhardy Remacri, 4x NKMD Superscale, ESRGAN 4x, 4xUltraSharp

upscale

1.5

steps

15 - 20

denoising strength

0.6

Tips

Try to run a generation without any negatives first for best results.

Start with very simple prompts before adding complexity.

Negative embeddings and loras were not used by the author; feel free to experiment with them.

Inpainting is rarely needed; only use to fix specific issues.

Use manual MBW merging for improved quality.


PhotoPedia XL 📷


Press on ❤️ to follow / consider leaving a review on model review. Feedback is appreciated!

Read the info below to get the high quality images (click on show more)⬇

7 / January / 23 - Still working on a dataset to fine-tune the model. Currently sitting on 700 ish photos, a lot more planned to add.

❗ Update - 11 / December / 23

  • 4.5 is out. Very happy how it turned out!

  • MBW'd the living hell out of this one. Must have went back and forth through more than 30 merges to get here.

    • You can right click the images and open them in new tab and zoom in quite a lot to check smaller details.

  • Addressing certain issues such as hands / eyes. Eyes are much better. Hands are better as well. Generations can still produce some nasty monstrosities though

  • Slight overall feel change

    • Edited the model info

      • Merge recipe further down the page

      • Updating generation parameters and suggestions for 4.5

      • Will post links to suggested resources soon + more

    • Will be posting more galleries soon, keep an eye out


📣 GENERATION PARAMETERS & SUGGESTIONS ( As of 4.5 )

ℹ️ 4.5 generations were made with the following settings

H/W: 1024/1024 - 1024/576 - 576/1024 - I tend to stick with 1024/1024
Sampler - DPM++ 2S a Karras is the main sampler I used for 4.5
Steps - 40 - Can go lower
CFG - 4-9 - I went lower than usual, usually 4-5

Generate an image with Hires enabled

Hires upscale: 1.5 
Hires steps: 15-20
Hires denoising: ~ 0.6
Upscaler: 4xLSDIRplusC

Send to Extras. Upscale once by 1.5. Send the upscaled image to Extras again, 1.5 upscale again. I usually run this in batches.

Upscale 1: 4x Foolhardy Remacri - 1.5 upscale in Extras tab
Upscale 2: 4x NKMD Superscale

I have not used ADetailer, negative embeddings or loras.

ℹ️ MAIN PROMPT

photoshoot, photo studio, RAW photo, editorial photograph, film stock photograph, cinematic, posing, amateur photo, analog, raw, f2, 35mm, an (amateur photo), flash photography, taken on an old camera, polaroid, 8k, highly detailed, (high quality, best quality:1.3), Extremely high-resolution, film grain, Kiki from Netherlands, long (waist-length:1.2) combed auburn hair, ((macro lens)), dark atmosphere, ((focused on eyes)), feminine expressions, photography, dslr, 35mm, Fujifilm Superia Premium 400, Nikon D850 film stock photograph, Kodak Portra 400 f1.6 lens, 8k, UHD

ℹ️ NEGATIVE PROMPT

(worst quality, low quality:1.4), illustration, 3d, 2d, painting, cartoons, sketch, blur, blurry, blurred, bokeh, unclear, grainy, low resolution, downsampling, aliasing, dithering, distorted, jpeg artifacts, compression artifacts, overexposed, high-contrast, bad-contrast, poorly drawn, cropped, out of frame, [deformed, disfigured, suspenders], ((deformed iris, deformed pupils:1.4)), (physically-abnormal:2), (joint-inequalities:2), (limbs-inequalities:2), deformed, disfigured, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, mutation, mutated, extra limbs, extra legs, extra arms, dull eyes, mismatched eyes, bad anatomy, signature, watermark, artist name, text, error

Try to run a generation without any negatives first. I am personally too lazy to test each one of them and their impact on generations and the current prompt did just fine x)

Try to run a generation with very simple prompts before you start adding more

ℹ️ I did not use any negative embeddings or loras to change the outputs. Should you use them? As I always say, feel free to experiment, I am pretty sure that some of the negative embeddings will work very well with the model. I generally try not to use too any.

ℹ️ INPAINTING

I almost never bother with it. I am happy with most of the generations as they are (well, discarding the bad stuff that comes along from time to time), unless I really love the image and want to fix something wrong with it.

ℹ️ ON-SITE GENERATIONS

I have not spent much time testing it out. No suggestions as of now. I run SD locally.

MY A1111

•  version: v1.6.0  •  python: 3.10.6  •  torch: 2.0.1+cu118  •  xformers: 0.0.20  •  gradio: 3.41.2

MY HARDWARE

•  13th Gen Intel(R) Core(TM) i7-13700KF / 24 cores / 5.4 GHz

•  NVIDIA GeForce RTX 4070 / 12 GB

•  RAM 32 GB / 5200 MHz


Further plans

Updating


ℹ️ License & Use

This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.

  • 1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content.

  • 2. The authors claims no rights on the outputs you generate, you are free to use them and are accountable for their use which must not go against the provisions set in the license.

  • 3. You may re-distribute the weights. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the modified CreativeML OpenRAIL-M to all your users (please read the license entirely and carefully).

You agree not to use the Model or Derivatives of the Model:

  • In any way that violates any applicable national, federal, state, local or international law or regulation

  • For the purpose of exploiting, harming or attempting to exploit or harm minors in any way

  • To generate or disseminate verifiably false information and/or content with the purpose of harming others

  • To generate or disseminate personal identifiable information that can be used to harm an individual

  • To defame, disparage or otherwise harass others

  • For fully automated decision making that adversely impacts an individual’s legal rights or otherwise creates or modifies a binding, enforceable obligation

  • For any use intended to or which has the effect of discriminating against or harming individuals or groups based on online or offline social behavior or known or predicted personal or personality characteristics

  • To exploit any of the vulnerabilities of a specific group of persons based on their age, social, physical or mental characteristics, in order to materially distort the behavior of a person pertaining to that group in a manner that causes or is likely to cause that person or another person physical or psychological harm

  • For any use intended to or which has the effect of discriminating against individuals or groups based on legally protected characteristics or categories

  • To provide medical advice and medical results interpretation

  • To generate or disseminate information for the purpose to be used for administration of justice, law enforcement, immigration or asylum processes, such as predicting an individual will commit fraud/crime commitment (e.g. by text profiling, drawing causal relationships between assertions made in documents, indiscriminate and arbitrarily-targeted use).

You are solely responsible for any legal liability resulting from unethical use of this model(s)


OLDER INFO BELLOW


💭 Although the end goal of the merges is to reach a certain level of realism, I do recommend you to try all the versions provided, they have slight differences, but I guess certain versions might be more appealing to your eye. I personally cannot decide on which is my favorite so far, they all have their uniqueness.


📣 GENERATION PARAMETERS & SUGGESTIONS (used in v4 and older)

ℹ️ I ran most of the generations with the following settings

Sampler - I used many. Try running DPM++ 2S a Karras
Steps - 65 - Can go lower / higher, depending on your sampler.
CFG - 9 - You probably might want to go lower on this.
Hires upscale: 1.5 
Hires steps: 15-20
Hires denoising: ~ 0.5
Upscaler - ESRGAN 4x / 4xUltraSharp

I have not used ADetailer so far with this merge, having some issues with it. I do recommend it though. 

I didn't inpaint generated images.

I didnt use any negative embeddings. I suggest testing on your own to see which one you like the most as some might change the image more than others.

I used Detail Tweaker LoRA usually around 0.3.

Upscale in A1111 in Extras

OR

Upscale outside A1111 - Using Waifu2x GUI 

ℹ️ Prompt Example

amateur photo of a --subject-- --doing something-- / --wearing something--, analogue, raw, polaroid, looking at camera, (from side:1.2), detailed skin texture, natural skin texture, perfect eyes, perfect iris, high detail eyes, detailed iris, detailed cloth texture, detailed facial features, aesthetic, depth of field, cinematic light,(high quality, best quality:1.3), Extremely high-resolution, Kodak Portra 400 camera f1.6 lens, Fujifilm Superia Premium 400, Nikon D850, <lora:add_detail:0.33>


📌 Merge info / recipe

1️⃣ Type 1 ✔️First merge

2️⃣ Type 2 ✔️Fine blend between digital / realistic

  • RealCartonXL + RealVisionXL = M21 ( Sum Twice / Normal / MBW )

  • LEOHello + XXMixXL = M22 ( Sum Twice / Normal / MBW )

  • PhotoPedia XL + M21 + M22 = M23 ( Sum Twice / Normal / MBW )

  • M23 + M21 + M22 = PhotoPedia XL Type 2 ( 3 Sum / Normal / MBW )

3️⃣ Type 3 ✔️ Increased realism / Slightly more NSFW

  • PhotoPedia 1 + XXMixXL + PhotoPedia 2 = PP1 ( 3 Sum / Normal / 0.45A - 0.27B )

  • PhotoPedia 2 + PP1 + RealVisionXL = PP2 ( Sum Twice / Normal / 0.57A - 0.32B )

  • PhotoPedia 2 + LEOHello + RealVisionXL = PP3 ( Sum Twice / Normal / MBW )

  • PhotoPedia 2 + Devil + JuggXL5 = PhotoPedia XL Type 3 ( Add diff / Normal / 1.0 A )

4️⃣Step 4 - ✔️ Manual MBW

  • PhotoPedia 3 + JuggXL5 = PE1 ( Weight Sum / cosineA / MBW )

    • alpha,beta : ( 0.8, 0.25 )

      • weights_alpha : [ 0.7, 0.8, 0.8, 0.4, 0.2, 0.9, 0.2, 0.4, 0.5, 0.5, 0.9, 0.7, 0.5, 0.6, 0.3, 0.9, 0.2, 0.4, 0.9, 0.8, 0.2, 0.3, 0.4, 0.6, 0.7 ]

  • PE1 + RealVisionXL + PE1 = PE2 ( Sum Twice / cosineA / MBW )

    • alpha,beta : ( 0.8, 0.25 )

      • weights_alpha : [ 0.7, 0.8, 0.8, 0.4, 0.2, 0.9, 0.2, 0.4, 0.5, 0.5, 0.9, 0.7, 0.5, 0.6, 0.3, 0.9, 0.2, 0.4, 0.9, 0.8, 0.2, 0.3, 0.4, 0.6, 0.7 ]

      • weights_beta : [ 0.5, 0.4, 0.6, 0.6, 0.4, 0.4, 0.6, 0.5, 0.4, 0.5, 0.5, 0.6, 0.4, 0.6, 0.6, 0.3, 0.7, 0.6, 0.4, 0.6, 0.4, 0.6, 0.4, 0.4, 0.6, 0.7 ]

  • PE2 + JuggXL5 + PE2 = PE3 ( Sum Twice / cosineA / MBW )

    • alpha,beta : ( 0.8, 0.25 )

      • weights_alpha : [ 0.7, 0.8, 0.8, 0.4, 0.2, 0.9, 0.2, 0.4, 0.5, 0.5, 0.9, 0.7, 0.5, 0.6, 0.3, 0.9, 0.2, 0.4, 0.9, 0.8, 0.2, 0.3, 0.4, 0.6, 0.7 ]

      • weights_beta : [ 0.5, 0.4, 0.6, 0.6, 0.4, 0.4, 0.6, 0.5, 0.4, 0.5, 0.5, 0.6, 0.4, 0.6, 0.6, 0.3, 0.7, 0.6, 0.4, 0.6, 0.4, 0.6, 0.4, 0.4, 0.6, 0.7 ]

  • PE3 + RealVisionXL = PhotoPedia XL Type 4 ( Weight Sum / cosineA / MBW )

    • MBW - RING08_5

      • weights : [ 0.2, 0 ,0, 0, 0.3, 0.3, 0.4, 1, 1, 1, 0.3, 0.4, 0.4, 1, 0.8, 1, 1, 0.4, 0.2, 0.2 ]

  • Initially, I wanted to test AutoMBW for this step, but I decided to leave it for another time, it takes way too long. Instead, I went with manual block editing.

  • If the weights look silly, they probably are x) As long as it improves the overall quality!

5️⃣Step 5 WIP / Unreleased / Will get to it eventually

  • PhotoPedia XL Type 4 + LCM / Turbo ?


6️⃣Step 6 Unreleased / No ETA

Further down the line - possible fine-tuning on own dataset


Contributor

Previous
experimental - m100-style - v0.1 20Epoch
Next
Anime Art-Style - Anime_Martz v1

Model Details

Model type

Checkpoint

Base model

SDXL 1.0

Model version

4.5

Model hash

3865942aef

Creator

Discussion

Please log in to leave a comment.

Images by PhotoPedia XL - 4.5

A dark fantasy dryad with bark-like wooden skin and skull decorations, featuring intricate details and a haunting forest spirit appearance.
A soul knight with glowing white hair levitates in a misty, moonlit fantasy tavern night setting, surrounded by ghostly fog and shadowy medieval buildings.

base model Images

Photorealistic scene of undead characters including zombies and skeletons walking through a spooky cemetery illuminated by glowing jack-o'-lanterns under a dark, ominous sky.

digital Images

Photorealistic black and white portrait featuring a young woman with curly hair, cat-eye shaped dark eyes, and vibrant red lips against a red background.
Highly detailed digital illustration of a mandrill's head with vibrant red face, yellow eyes, intricate black and white patterns, and feathered fur texture on black background.
Close-up portrait of a Nepalese woman with smooth pale light brown skin, metallic cross earrings, iridescent blue and gold face markings, glossy red lips, and shimmering eyeshadow against a clear blue sky.
Elegant woman in red floral dress and large red hat standing under a tree on a green slope, with a man and a woman by the rocky coastline and a distant tower, in Pierre Brissaud illustration style.
Digital artwork showing a figure standing before a towering vertical blue aura with glitch effects and black fog on a dark background.
Hyper-realistic cyberpunk woman with crimson red hair and red glasses wearing a black leather jacket and tight futuristic pants, standing on a neon-lit city street at night.
Adorable anthropomorphic fluffy monster wearing a colorful cat costume hooded jacket, bright large eyes, standing with a playful and happy expression
Surreal sci-fi landscape with glitch art and graffiti, featuring hovering geometric buildings.
Young witch with ginger hair and green eyes rides a broomstick under the full moon, wearing a black dress and witch hat.
Blue anthropomorphic wolf character in anime style, created using Stable Diffusion AI.

photo Images

A realistic mouse wearing detailed golden armor and a dark cloak standing on a cobblestone street with cinematic dramatic lighting and film grain effects.
A semi-realistic portrayal of an exotic belly dancer with long dark braided hair, wearing a flowing green dress with gold jewelry, standing in an Arabian garden adorned with pink rose vines on sandstone walls.
A dark, dramatic urban street scene at night with wet pavements reflecting street lamps and a stormy, ominous sky above a deserted city.
Photorealistic ballet dancer posed on pointe with arms extended, wearing an intricately patterned dress forming a cascading Fibonacci spiral under chiaroscuro lighting.
Closeup portrait of an anthro white wolf female soldier with striking red eyes, visible fangs, and a slightly open mouth showing her tongue, wearing a soldier helmet in natural light.
A small turquoise colored glass cat figure with large eyes and pink nose, sitting on an open hand against a dark background.
Close-up photorealistic portrait of Elsa with blonde braided hair, blue eyes, and soft realistic lighting.
A surreal scene where a river flows out of an oil painting depicting mountains and a ship, cascading onto a beige sofa and spreading across a wooden floor in a warm living room.
Portrait of a Japanese girl wearing a floral kimono with exposed shoulders and detailed black tattoos, her hair decorated with vibrant flowers and falling petals, set against a serene lake background with soft backlighting.
Close-up side profile of a young fashion model with shoulder-length black hair and green eyes, wearing a beige pinstripe blazer and posing with an unamused expression in natural light with sunflare

polaroid Images

Close-up of a cute baby toy wearing a detailed pink robotic suit with analog vintage film grain effect and blue and pink tones.
Close-up portrait of a woman in an office wearing a white blouse and brown skirt, with rim lighting and detailed skin texture, captured with cinematic film style.
A dark, hooded cyborg samurai adorned in an intricate metallic mask holds a glowing red neon orb staff, featuring translucent details and a hyperpunk aesthetic.
A hippie girl wearing a peach off-shoulder dress stands in a yellow flower field at dawn in a Poland landscape with muted colors and soft focus.
Closeup of a beautiful mysterious woman with a subtle confident smile, wearing an elegant blue sequined haute couture dress with fireworks exploding in the background.
A young nun wearing a traditional habit with a large cross necklace inside a rococo church, illuminated by soft sunburst light, captured in a vintage, film grain style polaroid photo.
Close-up portrait of a young woman with long auburn hair wearing a light denim jacket, shot in a dark studio with cinematic lighting and film grain effect.
Close-up portrait of a woman with red hair and blue eyes wearing a black blazer and white shirt against a plain background, captured with 35mm analog film style.
Photorealistic medium shot of a pale woman with long black hair and brown eyes, smiling happily while wearing a denim jacket and black hoodie, with a ruined city background.
Teen girl with black twintails wearing a formal black dress in a softly lit indoor setting with cinematic lighting and bokeh background.

realism Images

A young blonde princess with braided hair crouching beside a campfire in a forest clearing during a tribal party, surrounded by figures in the background near bonfires.
Realistic portrayal of a woman with striking emerald green eyes, wearing a crown made of delicate crystal shards and a gown resembling frozen waterfalls, illuminated by refracted icy blue and silver light in a dark glacial cave.
An office worker sitting at a desk with his head in his hands, illuminated by a glowing laptop screen, surrounded by stacks of reports and energy drink cans resembling golden chalices, under luxurious Baroque curtains.
A roaring Tyrannosaurus Rex chasing a young woman walking in a dense jungle, depicted in the detailed style of Sergey Krasovskiy.
A woman holding a lit candle with a pitch black dark background illuminating half of her face with warm candlelight.
A detailed digital painting of a rusted military propeller plane flying mid-air over the ocean, with spinning propellers and an open cockpit showing passengers, under a clear blue sky with clouds.
Close-up photo of a redheaded girl with freckles and blue eyes standing among tall grasses in intense sunlight, showcasing detailed natural features and analog film grain effect.
A supernatural female face with glowing eyes emerging from jungle foliage and glowing plants, a luminous waterfall flows from her mouth, digital fantasy art.
Dramatic close-up portrait of an elderly man with white hair and glowing yellow eyes, wearing dark detailed armor and holding a round shield against a solid black background.
A rusty and malfunctioning vintage coffee maker emitting synthetic steam, with a robotic arm twitching, sitting on a stained countertop under flickering fluorescent lights.