Major Merge of Linfeng's project on "Seeing is Believing" by lf-zhao · Pull Request #303 · rai-opensource/predicators

lf-zhao · 2025-03-30T04:50:59Z

Change logs

This PR includes the major work for RSS 2025 submission "Seeing is Believing: Belief-Space Planning with Foundation Models as Uncertainty Estimators".

Add belief-space VLM-based predicates and operators
Add new VLM-based belief state update perceiver for Spot
Add synthetic/mock Spot robot environment (real-image-based synthetic environment, "RISE")
Add visualization tools and tests for the synthetic tasks
Include a few real-image synthetic tasks for experiments
Update VLM/LLM-based planners and many of their interfaces
Include scripts for running experiments
Many other changes

Note for BKLVA Approach

BKLVA (Belief-space Planning with K-fluents and Language-based Goal Grounding) is an approach for integrated perception and belief-space planning that uses large vision-language models (VLMs) as state estimation modules. It extends the task planning and perception pipeline to a symbolic belief space using belief-space predicates and operators.

Key Features

VLM Perception: Uses vision-language models to evaluate visual predicates from images
Task Planner: PDDL-based symbolic planner that generates plans in belief space
Belief Space Planning: Support for tasks involving uncertainty about object properties
Execution Monitoring: Detects unexpected outcomes and triggers replanning when needed
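
To make the belief-space and execution-monitoring features above concrete, here is a minimal sketch. It is not the predicators API: `BeliefState`, `needs_replan`, and the atom string `Empty(cup1)` are hypothetical names chosen for illustration. It assumes each ground atom is either known true, known false, or unknown, and that an uninformative observation never overwrites something already known.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# Hypothetical names throughout; this is not the predicators API.
Atom = str  # e.g. "Empty(cup1)"


@dataclass
class BeliefState:
    """Maps each ground atom to True, False, or None (unknown)."""
    values: Dict[Atom, Optional[bool]] = field(default_factory=dict)

    def known(self, atom: Atom) -> bool:
        return self.values.get(atom) is not None

    def update(self, atom: Atom, observed: Optional[bool]) -> None:
        # An uninformative observation (None) never erases known information.
        if observed is not None:
            self.values[atom] = observed


def needs_replan(belief: BeliefState, expected: Dict[Atom, bool]) -> bool:
    """Replan if any known atom contradicts what the plan expected."""
    return any(
        belief.known(atom) and belief.values[atom] != want
        for atom, want in expected.items()
    )


belief = BeliefState({"Empty(cup1)": None})   # emptiness is initially unknown
belief.update("Empty(cup1)", False)           # stand-in for a VLM answer
replan = needs_replan(belief, {"Empty(cup1)": True})
```

Here the plan expected the cup to be empty, the (stand-in) observation says otherwise, and `replan` is therefore `True`. Unknown atoms never trigger replanning on their own in this sketch.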

Quick Start

Running Experiments

Directly run:

# Example on two-cup pick-place synthetic task
python predicators/main.py --env mock_spot_pick_place_two_cup \
  --approach oracle --seed 0 --perceiver mock_spot_perceiver \
  --mock_env_vlm_eval_predicate True --num_train_tasks 0 \
  --num_test_tasks 1 --bilevel_plan_without_sim True \
  --horizon 20
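
For sweeping seeds without retyping the command above, a small helper can assemble the same argument list. This is only a sketch: `build_main_command` is a hypothetical helper, not part of the repository, and it reuses exactly the flags shown in the command above without executing anything.

```python
import shlex
from typing import List


def build_main_command(env: str, seed: int, horizon: int = 20) -> List[str]:
    """Assemble the argument list for the command shown above (hypothetical helper)."""
    return [
        "python", "predicators/main.py",
        "--env", env,
        "--approach", "oracle",
        "--seed", str(seed),
        "--perceiver", "mock_spot_perceiver",
        "--mock_env_vlm_eval_predicate", "True",
        "--num_train_tasks", "0",
        "--num_test_tasks", "1",
        "--bilevel_plan_without_sim", "True",
        "--horizon", str(horizon),
    ]


# One command per seed; pass each list to subprocess.run to actually launch it.
commands = [build_main_command("mock_spot_pick_place_two_cup", s) for s in range(3)]
printable = [shlex.join(c) for c in commands]
```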

Using mock_experiments.py:

# Run all planners on a single environment
python scripts/mock_experiments.py --env mock_spot_drawer_cleaning

# Run specific planner
python scripts/mock_experiments.py --env mock_spot_drawer_cleaning --planner vlm_closed_loop

Using run_local_experiments.sh (recommended for systematic evaluation):

# Run experiments with multiple seeds
./predicators_deploy/run_local_experiments.sh 5 mock_spot_drawer_cleaning mock_spot_sort_weight

Available Planners

BKLVA (Oracle): Basic bilevel planning with VLM perception
BKLVA with Execution Monitoring: Oracle with replanning on unexpected outcomes
BKLVA Open Loop: Oracle without replanning
LLM Closed Loop: LLM-based planning with execution monitoring
VLM Closed Loop: VLM-based planning with execution monitoring
VLM Captioning: Scene captioning for state estimation
VLM Captioning Open Loop: Scene captioning without replanning

Available Environments

Pick and Place Tasks:
- mock_spot_pick_place: Simple pick and place
- mock_spot_pick_place_two_cup: Two-cup manipulation
Belief-Space Tasks:
- mock_spot_cup_emptiness: Cup content detection
- mock_spot_drawer_cleaning: Drawer manipulation
- mock_spot_sort_weight: Weight-based sorting

Environment Setup

export PYTHONPATH="${PYTHONPATH:-$PWD}"
export PYTHONUNBUFFERED=1
export PYTHONHASHSEED=0
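
When launching runs from Python rather than a shell, the same three settings can be applied to a subprocess environment. A sketch, where `experiment_env` is a hypothetical helper: `${PYTHONPATH:-$PWD}` keeps an existing non-empty `PYTHONPATH` and otherwise falls back to the current directory, `PYTHONUNBUFFERED=1` turns off output buffering, and `PYTHONHASHSEED=0` disables hash randomization.

```python
import os
from typing import Dict, Mapping


def experiment_env(base: Mapping[str, str], cwd: str) -> Dict[str, str]:
    """Return a copy of `base` with the three variables above applied."""
    env = dict(base)
    # ${PYTHONPATH:-$PWD}: keep an existing non-empty value, else use cwd.
    env["PYTHONPATH"] = env.get("PYTHONPATH") or cwd
    env["PYTHONUNBUFFERED"] = "1"
    env["PYTHONHASHSEED"] = "0"
    return env


# Pass the result as `env=` to subprocess.run when starting an experiment.
env = experiment_env(os.environ, os.getcwd())
```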

Results Organization

results_deploy/<timestamp>_<env>_<planner>/: Experiment results
runlogs/run_<env>_planner_<planner>_seed_<N>.txt: Detailed logs
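
When collecting results, the run-log naming pattern above can be parsed back into its parts. A minimal sketch: `RunLog` and `parse_runlog_name` are hypothetical names, and the regular expression assumes that environment names never contain the substring `_planner_`.

```python
import re
from typing import NamedTuple, Optional


class RunLog(NamedTuple):
    env: str
    planner: str
    seed: int


# Mirrors run_<env>_planner_<planner>_seed_<N>.txt from the layout above.
_PATTERN = re.compile(
    r"^run_(?P<env>.+?)_planner_(?P<planner>.+)_seed_(?P<seed>\d+)\.txt$"
)


def parse_runlog_name(name: str) -> Optional[RunLog]:
    """Split a run-log file name into env, planner, and seed, or return None."""
    m = _PATTERN.match(name)
    if m is None:
        return None
    return RunLog(m["env"], m["planner"], int(m["seed"]))


parsed = parse_runlog_name(
    "run_mock_spot_drawer_cleaning_planner_vlm_closed_loop_seed_0.txt"
)
```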

Documentation

For detailed documentation, see:

lf-zhao added 30 commits January 21, 2025 00:37

add unposed image dataclass

8ded816

add UnposedImageWithContext

ca205e9

major update, now collector can complete collection and running: update saving/loading logic+format, processing code, loading and saving each state, cleaning up, and more

51dee2a

major update to mock env: update Mock Obs classes for saving and for loading, update loading and saving logic for env + individual states, cleanup

aa8a1c6

move mock obs into separate file to avoid cyclic import

8c8382c

the manual creator class is basically dummy now

d695b8b

minor

7a1683d

new docs: guide of testing planning using mock robot env

3812630

update to perceiver

cc4e0c2

test on loading

915bf81

major update to test manual collected env: fully working

f5981f6

update another test - this may be outdated

4c910cf

update docs on perceiver

c845572

[in progress] major update to mock perceiver

6a169f3

major update to mock spot perceiver: correctly handle predicate value updates, include processing unknown/known predicates without information loss

40bf20c

minor

b683ec6

minor

a5424af

major update: add consistency check for unknown/known, make sure update has no information loss, clean up

ea93041

working on updating the ground atoms saving logic

43fbd44

update

e4af8e4

add new field for saving atom and new GroundTruthPredicate class for loading it

aa55d28

major update: clean up state storage logic, separate state graph building/exploration and planning, save ground atoms correctly in state,

a5980c8

update the test for saving env with images

b0a396a

update previous test

2f57972

update test

fee4c9f

update; not fully working with latest ver

6fec032

update test for new init

d82f218

new utils for computing ground atom combinations without state

e6e3975

major update to env: use GroundTruePredicate that directly loads from saved env for predicate labels

facabb9

fixing saving and loading

125a1c4

lf-zhao added 22 commits March 24, 2025 22:24

minor cleaning of the mock env file

ae331b7

update experiment run script

f60ac2f

major, new script for running experiments locally in bash

d8abbf3

major, collecting results for running locally in bash/python

77f944c

add readme and another viz script

bcb0368

script for kill local runs

8dd22ef

update

e4cf2c9

update

24f1f0f

update for saving a name for result collection

247f298

fix cup emptiness one

958b80c

update experiment

2a9b1d9

fix OR goals in belief case

f8be4a2

similar fix for VLM planner - handling OR goals properly

d0c9a23

add new task: retrieving two green cups (one larger and one smaller) from drawer

938dc30

add new 2-cup picking task env; minor fixes

7717795

add GT factory

ab341a8

fixes for OR goal atoms; other update

1e1c473

remove

55d6cf7

add new upgraded scripts for ssh (not fully tested)

0dca556

add script for running local experiments

4b3d1ee

add scripts for running ssh remote experiments

80a7995

update commands for BKLVA approach

7e19337

lf-zhao requested a review from NishanthJKumar March 30, 2025 04:50

lf-zhao assigned lf-zhao and linfeng-z Mar 30, 2025

lf-zhao added 5 commits April 10, 2025 15:43

minor update

a79fc09

add a ray version of sam2

b5c0f10

add server side for gemini pointing + sam2

ad43d77

add utils for gemini pointing

9c2db03

add client side for gemini pointing + sam2

a40f8ce

lf-zhao wants to merge 499 commits into master from lis-spot/merging-linfeng-submission-latest