A Reinforcement Learning-Based Framework to Generate Routing Solutions and Correct Violations in VLSI Physical Design