Once the trained policy has been trained, it can be deployed in all the supported as well as custom environments.
Setting the task as GRID-Isaac-CustomRL-v0 and specifying the environment in the env.yaml enables users to use the trained policy in diverse environments.
A sample agent.yaml file for inference is shown below: