A data-driven condition-based preventive maintenance strategy for high gravity reactors in petrochemical industry: from the perspective of reinforcement learning