🌙

一个模式统治所有:Schema.org如何塑造搜索世界

One schema to rule them all: How Schema.org models the world of search

Journal of the Association for Information Science and Technology (JASIST) · 2023
被引 23
ABS 3

中文导读

研究了Schema.org这一跨领域结构化数据模型的元数据词汇,通过分析其发布历史和层级结构,揭示其主题领域和知识表示方式,并讨论其在事实核查和COVID-19中的全球意义。

Abstract

Abstract Several industry‐specific metadata initiatives have historically facilitated structured data modeling for the web in domains such as commerce, publishing, social media, and so forth. The metadata vocabularies produced by these initiatives allow developers to “wrap” information on the web to provide machine‐readable signals for search engines, advertisers, and user‐facing content on apps and websites, thus assisting with surfacing facts about people, places, and products. A universal iteration of such a project called Schema.org started in 2011, resulting from a partnership between Google, Microsoft, Yahoo, and Yandex to collaborate on a single structured data model across domains. Yet, few studies have explored the metadata vocabulary terms in this significant web resource. What terms are included, upon what subject domains do they focus, and how does Schema.org represent knowledge in its conceptual model? This article presents findings from our extraction and analysis of the documented release history and complete hierarchy on Schema.org 's developer pages. We provide a semantic network visualization of Schema.org , including an analysis of its modularity and domains, and discuss its global significance concerning fact‐checking and COVID‐19. We end by theorizing Schema.org as a gatekeeper of data on the web that authors vocabulary that everyday web users encounter in their searches.

计算机科学万维网元数据信息检索数据科学