question regarding to the settings and evaluating the training

Hi Novatig:
thanks for replying. the updated readme is helpful. Simultaneously I have read your paper of glider, seems the controling results are sensitive to the training methods and the setting. now I implement the a boat navigation case based on the boatNav.  everything works fine with runTest for me. then I use RACE to train, the case is to control  the boat to turning some angle and move to another location without limitation the final nose angle and termination speed in the reward. but I get quite little success sample during training. most of the time the boat got lost, does that influence the training? would you please make some suggestion? RACE? DQN? PPO? or change some setting?
furthermore,  after training, I want to evaluate solution by restart, I use the command  "boatNav --restart /public3/home/sc51719/soft/smarties/apps/boadNav/evalue RACE.json " but I got many trajectories. which i suppose to be one trajectory with successful reach the point i specified, since I suppose restart is going to load the trained network to evaluate the performance of one task. but there are even results of  the boat got lost. what is meaning of that?  sometimes there is event not any reach the harbor
can you provide me some suggestion?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

question regarding to the settings and evaluating the training #4

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

question regarding to the settings and evaluating the training #4

Description

Activity

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions