robopal.wrappers.human_demo_wrapper module¶

class robopal.wrappers.human_demo_wrapper.Collection(num_samples: int = 0, model_file: str = None, states: List[numpy.ndarray] = <factory>, actions: List[numpy.ndarray] = <factory>, rewards: List[numpy.ndarray] = <factory>, dones: List[numpy.ndarray] = <factory>, obs: Dict[str, List[numpy.ndarray]] = <factory>, next_obs: Dict[str, List[numpy.ndarray]] = <factory>)[源代码]¶

基类：object

actions: List[ndarray]¶

dones: List[ndarray]¶

model_file: str = None¶

next_obs: Dict[str, List[ndarray]]¶

num_samples: int = 0¶

obs: Dict[str, List[ndarray]]¶

rewards: List[ndarray]¶

states: List[ndarray]¶

class robopal.wrappers.human_demo_wrapper.HumanDemonstrationWrapper(env: ManipulateEnv, *, device: BaseDevice = None, collections_dir: str = '/home/docs/checkouts/readthedocs.org/user_builds/robopal/checkouts/latest/docs/source/../../robopal/collections/collections_1748360434_2707884', collect_freq=1, max_collect_horizon=100, is_drop_unsuccess_exp=True, is_drop_invalid_action=True, saved_action_type='velocity', is_render_actions=False)[源代码]¶

基类：object

close()[源代码]¶

get_action()[源代码]¶: compute next actions from the device outputs

reset(seed=None, options=None)[源代码]¶

save_collection()[源代码]¶: Save the collected data in the hdf5 file.

step(action: ndarray | Dict[str, ndarray])[源代码]¶: Input actions should be normalized, since the action will un-normalize before applying to the environment.