Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Passing current trajectory step to select_action function #174

Open
Federico-PizarroBejarano opened this issue Nov 5, 2024 · 0 comments
Assignees
Labels
invalid This doesn't seem right

Comments

@Federico-PizarroBejarano
Copy link
Collaborator

In some controllers we are getting the current step of the trajectory from the environment, via the info, such as

step = self.extract_step(info)
In others, we keep track of the step in the controller itself via self.traj_step, such as We should choose one and standardize it.

The benefit of the info approach is that the iteration is coming from the environment, and thus the controller is stationary, i.e., if you call a select_action function twice with the same arguments you will get the same action. The drawback is that in real experiments on real hardware, there is no info dictionary since the environment isn't giving you more information. One way around this is creating an info dictionary in the real experiment script and passing that in with the relevant data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
invalid This doesn't seem right
Projects
None yet
Development

No branches or pull requests

3 participants