无情感词典:一种用于商业和治理文档领域特定文本分析的新方法

Sentiment-devoid lexicons: A novel method for domain-specific textual analysis in business and governance documents

INFORMATION & MANAGEMENT · 2024
被引 2
人大 A-ABS 3

中文导读

提出并测试了一种从无情感文档中构建领域特定词典的方法,用于分析SEC调查相关文档,该词典在预测IT控制弱点、IT审计费用和网络风险方面优于五个基准词典。

Abstract

Our study proposes and tests a method for developing domain-specific dictionaries tailored for textual analysis in information systems research. Traditionally, dictionaries have been widely used for content classification according to sentiment; however, we introduce an alternative approach focused on creating dictionaries from sentiment-devoid documents. We demonstrate this method by developing a dictionary specific to Securities and Exchange Commission (SEC) investigations. Analyzing 150,432 publicly available SEC documents, we gained insights into the semantics of communications between the SEC and firms. To evaluate the dictionary, we analyzed SEC comment letters to predict the likelihood of firms reporting information technology control weaknesses (ITCWs), information technology audit fees, and cyber risks . Our dictionary outperformed five benchmarking dictionaries, explaining a higher proportion of variance in ITCW likelihood, information technology audit fees , and cyber risks. This study enhances the effectiveness of dictionaries in analyzing sentiment-devoid business and governance documents and results in a specialized dictionary for SEC communications.

公司治理文本分析自然语言处理证券交易委员会