经济学中的文本算法

Text Algorithms in Economics

Annual Review of Economics · 2023
被引 167 · 同刊同年前 7%
人大 A-ABS 3

中文导读

综述了经济学中用于算法文本分析的方法,介绍了文档、词语和序列的向量表示,定义了四项核心实证任务,并指出了当前文献的局限性,尤其关注算法输出的验证挑战。

Abstract

This article provides an overview of the methods used for algorithmic text analysis in economics, with a focus on three key contributions. First, we introduce methods for representing documents as high-dimensional count vectors over vocabulary terms, for representing words as vectors, and for representing word sequences as embedding vectors. Second, we define four core empirical tasks that encompass most text-as-data research in economics and enumerate the various approaches that have been taken so far to accomplish these tasks. Finally, we flag limitations in the current literature, with a focus on the challenge of validating algorithmic output.

文本算法文本表示实证任务