Replies: 1 comment 4 replies
-
Docs for segmentation are here https://github.com/mcmonkeyprojects/SwarmUI/blob/master/docs/Features/Prompt%20Syntax.md#automatic-segmentation-and-refining Also you can just type a Most issues with quality on segment come down to one of:
For example re "yes you can literally just type legs in", here I used the prompt I also enabled two advanced parameters: That generated this initial image: .. So, yeah, literally
You're just preaching to the choir here, while simultaneously not realizing you yourself are well advanced beyond the stage of a normal user. Normal people don't know or care what a yolo model is, but would love to easily add detail to parts of an image, which is why segment by default just lets you type in something and it searches for what you want without needing any models or any other nonsense. Customizing yolo models is an advanced/poweruser feature.
re regional prompting, that's intentionally undocumented as I want to build out a proper interface for it (atm it's non-obvious and I don't want to write guides for how to do something that shouldn't be so hard to do in the first place). There are video tutorials for Swarm available btw, such as this one https://www.youtube.com/watch?v=HKX8_F1Er_w (Cefurkan 90 minute zero-to-hero with swarm) |
Beta Was this translation helpful? Give feedback.
-
Hello mcmonkey,
first of all, thank you for your great work.
I come from Auto1111 and ForgeUI, have been experimenting with SwarmUI almost daily for about 2 months now and find SwarmUI a very useful user interface that is fast and stable.
I like a lot of things, but some aspects are catastrophic from a user's perspective and lead to frustration, especially because there's no proper guide for it. While it's helpful that each function has a small explanation (behind the question mark on the side), most explanations are incomprehensible to me.
Like most users, I'm not a python expert. Therefore, it took days to understand how Segmentation works. Countless attempts with yoloV8 models and tinkering with the values. It's great that you can also optimize a second face with it, but it took forever to figure out how to do that. And I haven't gotten any further with segmenting so far. You say: "segment:AnyTextHere", but that doesn't work. How do I approach it when I want to improve specific areas like hair, legs, nose, hands, ...? This requires detailed instructions and links to the usable models. For me it's just impossible to figure out.
Now I've read an issue mentioned that after "segment:text..." you should enter the entire prompt plus the area to be segmented. No idea if that works, but why? It's unnecessarily complicated. If you want to segment multiple areas, the prompt would become unmanageably long.
I can't get along with "object" and "region" at all. It only produces garbage or, in most cases, no change at all. How exactly does it work? Please also provide examples. It could be helpful, I think.
Please keep in mind that most users are neither programmers nor python experts. The majority wants to turn ideas into images and not pursue a degree in computer science. Spending hours searching in various forums is absolutely frustrating.
Do you know why Auto1111 and ForgeUI are so popular? Because they are easy to handle. SwarmUI is a great user interface, but it's still too complicated.
My tips and wishes (what's urgently needed):
Please expand your "Basic Usage" guide to include all basic functions. All functions of SwarmUI should at least be described there and provided with brief examples. Especially: Masking, Regional Prompting, Segmentation, Img2Img/Edit/Inpaint etc.
If possible, incorporate the segmentation (possibly also the "object" and "region" functions) into the dropdown menu on the left side. Yes, there's already something there, but I don't understand it. It should be simply structured and understandable, like Refiner and ControlNet are - big praise for that by the way!
Really great would be extensions like Adetailer (for segmentation) or something like ForgeCouple (for regional prompting) or similarly structured functions in the dropdown.
Thanks again for your great work. I would recommend SwarmUI to everyone if it were easier to handle, or if the usage of the above-mentioned functions were clearly explained in the "Basic Usage" guide of the Readme.
Best regards,
svenjuga
Beta Was this translation helpful? Give feedback.
All reactions