Found 46 bookmarks
Newest
Nano Banana can be prompt engineered for extremely nuanced AI image generation | Max Woolf's Blog
Nano Banana can be prompt engineered for extremely nuanced AI image generation | Max Woolf's Blog
While I've had reasonably good results from using a similar image prompt structure with Gemini (Nano Banana) that I use with Midjourney and other tools, this article describes more complex prompting strategies like structuring prompts with JSON. There are also some easier tips like adding MUST in all caps to signal importance.
The reason is that information asymmetry between what generative image AI can and can’t do has only grown in recent months: many still think that ChatGPT is the only way to generate images and that all AI-generated images are wavy AI slop with a piss yellow filter.
·minimaxir.com·
Nano Banana can be prompt engineered for extremely nuanced AI image generation | Max Woolf's Blog
Whisk AI - Free AI Image Generator
Whisk AI - Free AI Image Generator
Whisk is more about enhancing your image prompts than about the image generation itself. If you give it even a basic, vague prompt, it will suggest enhancements so you get better results faster. This is an experimental tool from Google Labs, so expect that the performance may be uneven and that the tool may disappear in the future without warning.
·whiskailabs.com·
Whisk AI - Free AI Image Generator
In Defense of AI Art — Craig Boehman
In Defense of AI Art — Craig Boehman
Craig Boehman responds to the criticisms of AI art from his perspective as an artist while sharing examples of his work and notes on the tools he used to generate and refine his art.
If you’re an artist and using AI in whole or in part to create your art, do you think someone “off the street” is going to employ AI and create something better than you, a seasoned pro? Consider for a moment the smartphone camera. “Anyone can take pictures these days, photography isn’t art”. A photographer with a smartphone could surely do better, right? The differences between an average smartphone user and what a professional photographer can do with a smartphone are potentially as vast and wide as the Grand Canyon.
“The fear has sometimes been expressed that photography would in time entirely supersede the art of painting. Some people seem to think that when the process of taking photographs in colors has been perfected and made common enough, the painter will have nothing more to do.” — Henrietta Clopath, 1901
·craigboehman.com·
In Defense of AI Art — Craig Boehman
WaveSpeedAI
WaveSpeedAI
WaveSpeed gives you access to multiple different image and video generation tools by paying for credits based on what you actually use. This looks like a promising option to test out tools or generate a few videos without committing to a larger subscription.
·wavespeed.ai·
WaveSpeedAI
Ideogram Character
Ideogram Character
This is another tool I'm adding to my list to test out. Give Ideogram a single reference image and then generate multiple remixed images. In the demo, I notice that the expressions don't seem to change much, so that might be a limitation. The basic character creation is free, with additional features for more precise editing on their paid subscription plan.
·about.ideogram.ai·
Ideogram Character
Real or AI Quiz: Can You Tell the Difference? » Britannica
Real or AI Quiz: Can You Tell the Difference? » Britannica
Test your ability to distinguish between real and AI images with this quiz. I work with AI images a lot, and I still only got 8/10 on this quiz. I appreciate the explanations about what to look for and tips for critical review of media.
Texture and Pattern Repetition: AI sometimes struggles with complex textures or patterns, leading to noticeable repetition or awkward transitions. Students should look for unnatural patterns in textures like hair, skin, clothing, or background elements.
·britannicaeducation.com·
Real or AI Quiz: Can You Tell the Difference? » Britannica
This Image Wasn’t a Stock Photo – and It Changed the Way I Build Training
This Image Wasn’t a Stock Photo – and It Changed the Way I Build Training
Michelle Bonkosky shares her process for using ChatGPT for generating a unique image for a training workbook. I appreciate how she shows the process of iterating and refining her prompts; that's a key point. She also includes some sample prompts for images for training assets.
·chelab.substack.com·
This Image Wasn’t a Stock Photo – and It Changed the Way I Build Training
Creating with Gen-4 Image References
Creating with Gen-4 Image References
Runway Gen-4 offers a more controlled workflow for generating images with consistent characters and scenes. Add either a single reference image of a character or multiple reference images to combine multiple characters in a scene or specify the setting or objects. This is a more complex workflow than just chatting with ChatGPT, but it gives you more precision and more consistent results. This is Runway's documentation on using image references.
·help.runwayml.com·
Creating with Gen-4 Image References
Supporting Learning with AI-Generated Images: A Research-Backed Guide - MIT Sloan Teaching & Learning Technologies
Supporting Learning with AI-Generated Images: A Research-Backed Guide - MIT Sloan Teaching & Learning Technologies
Suggestions and examples for using AI-generated images in meaningful ways to support learning, without adding confusing or distracting images. Consider cognitive load and the purpose of your images.
A study by Sung and Mayer (2012) suggests that any graphic in a learning experience will fall into one of these three categories: Instructive images: These visuals directly support learning and facilitate essential cognitive processing of core concepts. For example, a diagram illustrating Porter’s Five Forces can help students better understand this business strategy framework. Decorative images: These graphics enhance aesthetics but don’t influence learning. For example, an image of a business handshake can be visually appealing but won’t support or obstruct students’ understanding of negotiation strategies. Distracting images: Sung and Mayer call this category “seductive” images. While these visuals may relate to the topic, they impede learning because they require extraneous cognitive processing. As an example, consider a complex organizational chart of a full corporation in a lesson on team leadership. The image connects broadly to the lesson but also highlights a lot of irrelevant details, distracting students from the key concepts.
·mitsloanedtech.mit.edu·
Supporting Learning with AI-Generated Images: A Research-Backed Guide - MIT Sloan Teaching & Learning Technologies
The recent history of AI in 32 otters
The recent history of AI in 32 otters
Ethan Mollick shows the progression of AI image and video generation with iterations of a prompt about otters using wifi on a plane. He also explains the difference between diffusion and multimodal image generation models (Midjourney vs ChatGPT). These tools get such different results because the underlying technology and approach is different.
While LLMs generate text one word at a time, always moving forward, diffusion models start with random static and transform the entire image simultaneously through dozens of steps. It is like the difference between writing a story sentence by sentence versus starting with a marble block and gradually sculpting it into a statue, every part of the image is being refined at once, not built up sequentially.
But what makes diffusion models interesting is not their increasing ability to make photorealistic images, but rather the fact that they can create images in various styles.
Unlike diffusion models that transform noise into images, multimodal generation lets Large Language Models directly create images by adding tiny patches of color one after another, just as they add words one after another. This gives AIs deep control over the images it creates.
·oneusefulthing.org·
The recent history of AI in 32 otters
AI image generators tend to exaggerate stereotypes
AI image generators tend to exaggerate stereotypes
The examples in this article are all from older images, but the problems of bias in AI image generators remain. Unless you are explicitly prompting to avoid stereotypes, AI image generators reflect the bias of the images they trained on. Even if you do prompt to avoid stereotypes, it can still be a problem.
·snexplores.org·
AI image generators tend to exaggerate stereotypes
How to achieve character consistency
How to achieve character consistency
How to video from Flora about how to create images with character consistency across different scenes. This is a more time consuming and technical process involving training a LoRA (low-rank adaptation) on an initial set of images for a character. This probably works best with real people, but there may be ways to adapt this workflow for elearning with generated characters. This is more effort than I would do for most projects, but might be worth exploring if I need something higher end for a specific project.
·youtube.com·
How to achieve character consistency
AI Art Generator: Free AI Image Generator & Editor | OpenArt
AI Art Generator: Free AI Image Generator & Editor | OpenArt
Generate consistent character images in multiple scenes starting from a single image. You can also use image to video tools.This integrates with other tools and gives you the option to train your own AI model with your style for illustrations. The free plan is limited, but there are paid plans at different levels.
·openart.ai·
AI Art Generator: Free AI Image Generator & Editor | OpenArt
Say What You See - Google Arts & Culture
Say What You See - Google Arts & Culture
Learn how to write better prompts for images with this tool. Describe what you see in an image and see how close your generated image is to the original. This tool uses AI to analyze your results and how accurate you were.
·artsandculture.google.com·
Say What You See - Google Arts & Culture
FLORA
FLORA
Combine AI text, image, and video generation tools together for more complex workflows. I haven't tested this tool yet, but it might be worth using the free plan to experiment and see what's possible.
·florafauna.ai·
FLORA
AI-Generated Images: Missed Opportunities and Moral Shrugs
AI-Generated Images: Missed Opportunities and Moral Shrugs
Tom McDowell reflects on how much of our current AI image generation is very derivative and replicates specific artists. I agree with a lot of this. We could use these tools to create brand new and useful images, but there's a lot of use right now that feels problematic.
·idtips.substack.com·
AI-Generated Images: Missed Opportunities and Moral Shrugs
The power of generative marketing: Can generative AI create superhuman visual marketing content?
The power of generative marketing: Can generative AI create superhuman visual marketing content?
This abstract of a marketing research paper explains how they compared AI-generated images and ads to human-created ads. The AI images were not just comparable, they were better than what people created. While this research is specific to marketing, I think it's relevant to images we use in training and elearning.
First, we prompt seven state-of-the-art generative text-to-image models (DALL-E 3, Midjourney v6, Firefly 2, Imagen 2, Imagine, Realistic Vision, and Stable Diffusion XL Turbo) to create 10,320 synthetic marketing images, using 2,400 real-world, human-made images as input. 254,400 human evaluations of these images show that AI-generated marketing imagery can surpass human-made images in quality, realism, and aesthetics. Second, we give identical creative briefings to commissioned human freelancers and the AI models, showing that the best synthetic images also excel in ad creativity, ad attitudes, and prompt following.
·papers.ssrn.com·
The power of generative marketing: Can generative AI create superhuman visual marketing content?
illustration.app - AI Vector Illustration Generator
illustration.app - AI Vector Illustration Generator
I haven't tested it out yet, but this looks like another viable option for creating vector illustrations in consistent styles. One feature I haven't seen elsewhere is the color palette generator. Give it a word like desert, nature, or neon, and it will generate a color palette that you can use in your images. You can edit and recolor images or make the background transparent directly in this tool.
·illustration.app·
illustration.app - AI Vector Illustration Generator