[1]杨 起,刘牧耕,马 郓.一种面向UI 手稿识别的数据集制作方法[J].郑州大学学报(工学版),2022,43(06):1-7.[doi:10.13705/j.issn.1671-6833.2022.06.009]
 YANG Q,LIU M G,MA Y.An Efficient Approach to Creating Hand-Drawn Dataset for UI Manuscript Recognition[J].Journal of Zhengzhou University (Engineering Science),2022,43(06):1-7.[doi:10.13705/j.issn.1671-6833.2022.06.009]
点击复制

一种面向UI 手稿识别的数据集制作方法()
分享到:

《郑州大学学报(工学版)》[ISSN:1671-6833/CN:41-1339/T]

卷:
43卷
期数:
2022年06期
页码:
1-7
栏目:
出版日期:
2022-09-02

文章信息/Info

Title:
An Efficient Approach to Creating Hand-Drawn Dataset for UI Manuscript Recognition
作者:
杨 起 刘牧耕 马 郓
北京大学深圳研究生院信息工程学院;北京大学计算机学院;北京大学人工智能研究院;

Author(s):
YANG QLIU M GMA Y
School of Information Engineering, Shenzhen Graduate School of Beijing University; School of Computer, Peking University; Institute of Artificial Intelligence Research at Peking University;

关键词:
Keywords:
DOI:
10.13705/j.issn.1671-6833.2022.06.009
文献标志码:
A
摘要:
UI 手稿识别是图像目标检测技术在软件工程领域的重要应用。 由于 UI 手稿图像与自然图像有着较大的差异,而且主要依靠人工绘制,所以制作用于深度学习模型训练的 UI 手稿数据集往往比较困难,耗费大量人力。 针对此问题,通过对 UI 手稿数据集的制作流程进行优化改进,提出了一种 UI 手稿数据集快速制作方法 UIsketcher。 在 UIsketcher 方法中,用户只需要完成一些基础 UI 组件的绘制,不需要任何框选标注,即可自动生成用于深度学习模型训练的数据集。 与传统方法进行对比实验,结果表明:用户只需要绘制相对于传统方法 25%的组件数量,即可得到相似的训练效果;若绘制传统方法 75%的组件数量,训练效果将更好,可达到比传统方法更高的准确率。
Abstract:
UI manuscript recognition is one of the important applications of image object detection in the area of software engineering. Due to the significant difference between UI manuscript images and natural images where UI manuscript images usually need to be drawn manually, it is difficult to build UI manuscript dataset for deep learning because of the dependency on tremendous manual efforts. To address the issue, in this study an approach called UIsketcher was proposed to efficiently generate UI manuscript dataset based on optimizing the current workflow. In UIsketcher, users should just draw some basic elements without labeling, and then the dataset could be automatically generated for training deep learning model. According to the experiment with UIsketcher, only 25% drawing workload of the traditional methods could get the similar training results. If the workload was 75%, the final accuracy was even better than that of traditional methods.
更新日期/Last Update: 2022-10-03