Skip to content

question regarding to the settings and evaluating the training #4

Open
@waynezw0618

Description

Hi Novatig:
thanks for replying. the updated readme is helpful. Simultaneously I have read your paper of glider, seems the controling results are sensitive to the training methods and the setting. now I implement the a boat navigation case based on the boatNav. everything works fine with runTest for me. then I use RACE to train, the case is to control the boat to turning some angle and move to another location without limitation the final nose angle and termination speed in the reward. but I get quite little success sample during training. most of the time the boat got lost, does that influence the training? would you please make some suggestion? RACE? DQN? PPO? or change some setting?
furthermore, after training, I want to evaluate solution by restart, I use the command "boatNav --restart /public3/home/sc51719/soft/smarties/apps/boadNav/evalue RACE.json " but I got many trajectories. which i suppose to be one trajectory with successful reach the point i specified, since I suppose restart is going to load the trained network to evaluate the performance of one task. but there are even results of the boat got lost. what is meaning of that? sometimes there is event not any reach the harbor
can you provide me some suggestion?

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions