HieroSA (Chinese)

Paper | GitHub

We propose HieroSA (Hieroglyph Stroke Analyzer) 🏺, a framework for capturing stroke-level structural representations of hieroglyphic and logographic scripts. It automatically converts characters into normalized stroke-segment representations ✍️, without relying on handcrafted rules or script-specific priors.

HieroSA supports both modern logographic scripts and ancient hieroglyphs 🌍, enabling cross-lingual structural generalization. Experimental results demonstrate that it effectively captures character-level structure and semantics 🧩, providing a solid foundation for downstream analysis and understanding of hieroglyphic writing systems.

More Details

Please refer to our GitHub Repository for more details about this model, including environment setup and inference scripts.

Citation

If you find our work helpful for your research, please consider citing our work.

@article{luo2026hierosa,
    title={Enabling Stroke-Level Structural Analysis of Hieroglyphic Scripts without Language-Specific Priors}, 
    author={Fuwen Luo and Zihao Wan and Ziyue Wang and Yaluo Liu and Pau Tong Lin Xu and Xuanjia Qiao and Xiaolong Wang and Peng Li and Yang Liu},
    journal={arXiv preprint arXiv:2601.05508},
    year={2026}
}

Downloads last month: 14

Safetensors

Model size

5B params

Tensor type

BF16

Model tree for roufaen/HieroSA

Base model

Qwen/Qwen3-VL-4B-Instruct

Finetuned

(142)

this model

Paper for roufaen/HieroSA

Enabling Stroke-Level Structural Analysis of Hieroglyphic Scripts without Language-Specific Priors

Paper • 2601.05508 • Published 5 days ago