SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation

Haitao Zhou1
Chuang Wang1
1Beihang University   2The University of Hong Kong

SVGDreamer teaser
Given a text prompt, SVGDreamer++ can generate a variety of vector graphics. SVGDreamer++ is a versatile tool that can work with various vector styles without being limited to a specific prompt suffix. We utilize various colored suffixes to indicate different styles. The style is governed by vector primitives.
SVGDreamer teaser
Our proposed tool, SVGDreamer++, excels at generating editable vector graphics.
TL;DR: SVGDreamer++ comprehensively improves the editability and visual generation quality of SVGDreamer.

Abstract

Recently, text-guided scalable vector graphics (SVG) synthesis has demonstrated significant potential in domains such as iconography and sketching. However, SVGs generated from existing Text-to-SVG methods often lack editability and exhibit deficiencies in visual quality and diversity. In this paper, we propose a novel text-guided vector graphics synthesis method to address these limitations. To enhance the editability of output SVGs, we introduce a Hierarchical Image VEctorization (HIVE) framework that operates at the semantic object level and supervises the optimization of components within the vector object. This approach facilitates the decoupling of vector graphics into distinct objects and component levels. Our proposed HIVE algorithm, informed by image segmentation priors, not only ensures a more precise representation of vector graphics but also enables fine-grained editing capabilities within vector objects. To improve the diversity of output SVGs, we present a Vectorized Particle-based Score Distillation (VPSD) approach. VPSD addresses over-saturation issues in existing methods and enhances sample diversity. A pre-trained reward model is incorporated to re-weight vector particles, improving aesthetic appeal and enabling faster convergence. Additionally, we design a novel adaptive vector primitives control strategy, which allows for the dynamic adjustment of the number of primitives, thereby enhancing the presentation of graphic details. Extensive experiments validate the effectiveness of the proposed method, demonstrating its superiority over baseline methods in terms of editability, visual quality, and diversity. We also show that our new method supports up to six distinct vector styles, capable of generating high-quality vector assets suitable for stylized vector design and poster design.


Methodology


🚀🚀 SVGDreamer upgrade to → SVGDreamer++
1. SIVE upgrade to → HIVE + Adaptive Vector Primitives Control
2. VPSD upgrade to → VPSD + Adaptive Vector Primitives Control

method
Overview of SVGDreamer++.
Our method consists of two phases: Hierarchical image vectorization (Sec. 4.1) and optimized synthesis of diverse SVGs via VPSD~(Sec. 3.2). And an additional module, called Adaptive Vector Primitives Control (Sec. 4.2), can be plugged into HIVE and VPSD in a plug-and-play way. In HIVE we introduced two stages of mask generation (as shown in the dotted box). Coarse mask generation guided by prompt words and fine-grained mask generation guided by attention distribution are used to decouple the components of vector graphics. The result from HIVE can be used as input for further generation of VPSD. We maintain $k$ sets of SVG parameters in VPSD for obtaining diverse results. In addition, the brown dotted box represents adaptive vector primitive control technology, which dynamically builds vector paths based on gradient graphs to improve the quality of SVG synthesis.

Citation

@article{xing2024svgdreamer++,
  title={SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation},
  author={Xing, Ximing and Yu, Qian and Wang, Chuang and Zhou, Haitao and Zhang, Jing and Xu, Dong},
  journal={arXiv preprint arXiv:2411.17832},
  year={2024}
}


Acknowledgements

We thank Ximing Xing for providing us with the source code of the web page to help us build the project home page.