Tuesday, December 26, 2023

My Top 10 AI Images of 2023 - - - - - - - - - - - #4 The Hanged Man challenge

AI excels in numerous areas, but it also has its limitations. These limitations can be quite frustrating, particularly when you consider how AI can effortlessly generate incredibly detailed images within seconds. Yet, we still encounter peculiar errors and anomalies. As of this writing, some of the most common challenges include: generating coherent text messages, depicting realistic human hands, simulating archery skills, and my personal favorite, recreating "The Hanged Man" card from a traditional tarot card set.

This might be my favorite image of my Top 10. Quite a hauntingly beautiful image.
I added details of him being infected by a fungus - more on that later . . .

I stumbled upon this AI limitation in a Reddit group chat. A user was expressing their frustration with attempting to create a tarot card set using AI-generated images but faced a significant obstacle – they couldn't get AI to depict "The Hanged Man." Many of us in the group, including myself, attempted to generate the image, but no one came close to the traditional depiction of the card. (Which typically features a man hanging upside-down by a rope tied to one of his legs, the other leg bent horizontally, with the other end of the rope secured to a tree branch above.)

An example of the classic Hanged Man tarot card

Some argue this happens because the AI is censored from showing a hanging person. Others said, AI doesn't quite understand what "upside-down" means and still needs to learn about terms of placement, locations, directions, and alignment. I agree with the latter. Often you can not direct AI where you want things to be placed in a text prompt, it's very hit-or-miss.

My first attempt at The Hanged Man (left) and the flipped version (right)
The gravity-defying skull chains, lowered branches and bottom light source ruin the hanging aspect.
Even though he has two left hands, I liked this image enough to use as a reference for the top image.

Certain individuals believed that a potential solution to the problem was to create an image of a hanging man and then flip it upside-down to achieve the desired effect. However, this approach proved to be ineffective. This is primarily due to the presence of elements like the ground, trees, and tree branches in the background of this particular tarot card. Additionally, AI algorithms incorporate factors such as gravity's impact on clothing and the positioning of upper lighting sources in their image generation process. Consequently, when you flip an AI-generated image, it often still appears as if you simply flipped the original AI image.