data/
├── objectgoal_hm3d/
│   ├── train/
│   ├── val/
│   └── val_mini/
├── scene_datasets/
│   └── hm3d/
│       ├── minival ...
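A quick way to confirm that a local checkout matches the layout above is to test for each listed directory. The following is only a minimal sketch: the helper name `missing_dirs` and the constant `EXPECTED_DIRS` are hypothetical, the root is assumed to be `data`, and only directories fully visible in the listing are checked, since the listing is truncated after `minival`.

```python
from pathlib import Path

# Directories visible in the layout above, relative to the data root.
# Entries past the truncation point ("minival ...") are deliberately left out.
EXPECTED_DIRS = [
    "objectgoal_hm3d/train",
    "objectgoal_hm3d/val",
    "objectgoal_hm3d/val_mini",
    "scene_datasets/hm3d",
]


def missing_dirs(root):
    """Return the expected sub-directories that do not exist under `root`."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    # "data" is an assumed root; pass a different path if yours differs.
    missing = missing_dirs("data")
    if missing:
        print("Missing directories:", ", ".join(missing))
    else:
        print("All listed directories are present.")
```

An empty list from `missing_dirs` means every directory named in the listing was found; it says nothing about the contents of those directories or about entries hidden by the truncation.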
Abstract: Traditional user simulators often rely on manually designed agendas, so the responses they generate lack diversity and spontaneity. However, building user simulators with large language ...
Abstract: With the assistance of language descriptions, Visual-Language (VL) object tracking can obtain more accurate semantic information than traditional Visual-Only object tracking. However, ...