期刊
ACM TRANSACTIONS ON GRAPHICS
卷 42, 期 4, 页码 -出版社
ASSOC COMPUTING MACHINERY
DOI: 10.1145/3592391
关键词
material appearance; perception; descriptions
This paper introduces a novel dataset called text2fabric, which links free-text descriptions to various fabric materials. The dataset consists of 15,000 natural language descriptions associated with 3,000 corresponding images of fabric materials. By analyzing the dataset, the authors identify a compact lexicon, set of attributes, and key structure that accurately describe fabrics and provide directions for generalization to other materials.
We introduce text2fabric, a novel dataset that links free-text descriptions to various fabric materials. The dataset comprises 15,000 natural language descriptions associated to 3,000 corresponding images of fabric materials. Traditionally, material descriptions come in the form of tags/keywords, which limits their expressivity, induces pre-existing knowledge of the appropriate vocabulary, and ultimately leads to a chopped description system. Therefore, we study the use of free-text as a more appropriate way to describe material appearance, taking the use case of fabrics as a common item that non-experts may often deal with. Based on the analysis of the dataset, we identify a compact lexicon, set of attributes and key structure that emerge from the descriptions. This allows us to accurately understand how people describe fabrics and draw directions for generalization to other types of materials. We also show that our dataset enables specializing large vision-language models such as CLIP, creating a meaningful latent space for fabric appearance, and significantly improving applications such as fine-grained material retrieval and automatic captioning.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据