This is a simplified guide to an AI model called Photomaker maintained by Tencentarc. If you like these kinds of analysis, you should join AImodels.fyi or follow us on Twitter.
Model overview
PhotoMaker
is a text-to-image AI model developed by TencentARC that allows users to input one or a few face photos along with a text prompt to receive a customized photo or painting within seconds. The model can be adapted to any base model based on SDXL or used in conjunction with other LoRA modules. PhotoMaker
produces both realistic and stylized results, as shown in the examples on the project page. Similar models include photomaker, GFPGAN, and PixArt-XL-2-1024-MS.
Model inputs and outputs
PhotoMaker
takes one or more face photos and a text prompt as input, and generates a customized photo or painting as output. The model is capable of producing both realistic and stylized results, allowing users to experiment with different artistic styles.
Inputs
- Face photos: One or more face photos that the model can use to generate the customized image.
- Text prompt: A description of the desired image, which the model uses to generate the output.
Outputs
- Customized photo/painting: The generated image, which can be either a realistic photo or a stylized painting, depending on the input prompt.
Capabilities
PhotoMaker
is capable of generating ...
Top comments (0)