Fine-tuning Deep RL with Gradient-Free Optimization