Supervised Machine Learning for Text Analysis in R
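This book explains in detail how to incorporate text data into supervised learning workflows, covering natural language features, machine learning, and deep learning, and emphasizing the key steps and potential pitfalls in processing text.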
The authors divide the book into three parts: natural language features, machine learning with text, and deep learning with text. Overall, this book provides an excellent and practical guide for incorporating textual data into supervised learning workflows. From a technical perspective, it rigorously details the key steps in processing and preparing text, including the potential pitfalls and biases that the unwary can introduce into analyses.