* Add consistent scores validation
* Revert osworld_run_maestro.py changes
* Update for autoglm-v
* Update run_autoglm.py

---------

Co-authored-by: hanyullai <hanyullai@outlook.com>