Tailoring Multimodal AIGC to Time-Honored Brands: A Stable Diffusion-Based Framework for Visual Generation and Evaluation

Authors

  • Xinbao Zhang Nanfang College Guangzhou Author
  • Jinjian Li Nanfang College Guangzhou Author
  • Shizhen Zhang Hubei University of Arts and Science Author
  • Yuwei Chen Nanfang College Guangzhou Author

DOI:

https://doi.org/10.70695/IAAI202504A6

Keywords:

Time-honored Brand; Stable Diffusion; Cultural Feature Embedding; Multimodal Control; Efficient Parameter Fine-Tuning; Reliability Calibration; Visual Generation

Abstract

To address the dual requirements of cultural expression and engineering implementation in the visual design of time-honored brands, this study proposes an adaptive optimization architecture based on Stable Diffusion. The framework employs Textual Inversion to derive composable cultural tokens and utilizes LoRA/DreamBooth parameters for the efficient fine-tuning of both generic and proprietary styles. By integrating ControlNet and IP-Adapter, the system achieves a fusion of layout and style priors, while a dual-channel gating mechanism enables collaborative control over semantics and composition. During inference, reliability in prompt adherence is calibrated through CFG-Rescale, attention reweighting, and temperature scaling. Extensive experiments on publicly available multimodal datasets and real-world brand scenarios demonstrate a significant improvement in the alignment between objective metrics and human evaluations. The method's stability and necessity are confirmed through robustness tests and component ablation studies, while A/B testing reveals its distinct advantages in cost-effectiveness and operational efficiency. This research ultimately provides a replicable and verifiable technical solution for the visual generation needs of both cultural heritage and commercial brands.

Published

2025-12-31

How to Cite

Zhang, X., Li, J., Zhang, S., & Chen, Y. (2025). Tailoring Multimodal AIGC to Time-Honored Brands: A Stable Diffusion-Based Framework for Visual Generation and Evaluation. Innovative Applications of AI, 2(4), 86-99. https://doi.org/10.70695/IAAI202504A6