迁移学习与会计披露的文本分析：将大数据方法应用于小数据集

Transfer Learning and Textual Analysis of Accounting Disclosures: Applying Big Data Methods to Small(er) Datasets

Accounting Horizons · 2021

被引 47 · 同刊同年前 5%

人大 BABS 3

Federico Siano · 波士顿大学
Peter D. Wysocki · 波士顿大学

中文导读

介绍并应用机器迁移学习方法分析会计披露，以BERT语言模型和季度盈余披露的情感分析为例，展示预训练、微调和上下文捕捉等概念，证明该方法易于实施、成本低且性能优于现有文本分析工具。

Abstract

SYNOPSIS We introduce and apply machine transfer learning methods to analyze accounting disclosures. We use the examples of the new BERT language model and sentiment analysis of quarterly earnings disclosures to demonstrate the key transfer learning concepts of: (1) pre-training on generic “Big Data,” (2) fine-tuning on small accounting datasets, and (3) using a language model that captures context rather than stand-alone words. Overall, we show that this new approach is easy to implement, uses widely available and low-cost computing resources, and has superior performance relative to existing textual analysis tools in accounting. We conclude with suggestions for opportunities to apply transfer learning to address important accounting research questions. Data Availability: Data are available from the public sources cited in the text. JEL Classifications: G31; G32; M21; M41.

会计学文本分析迁移学习自然语言处理大数据

阅读原文 ↗