Skip to content

Implementation of Sparse Graphical Memory (SGM) and Search on the Replay Buffer (SoRB) in PyTorch

License

Notifications You must be signed in to change notification settings

etaoxing/sgm-sorb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Sparse Graphical Memory (SGM) and Search on the Replay Buffer (SoRB) in PyTorch

Example usage

pip install -e .

python run_PointEnv.py configs/config_PointEnv.py

Results

SoRB (re-planning with closest waypoint) trajectory visualization

Search comparison

policy: no search
start: [0.03271197 0.99020872]
goal: [0.81310241 0.028764  ]
steps: 300
----------
policy: search
start: [0.03271197 0.99020872]
goal: [0.81310241 0.028764  ]
steps: 127

SoRB (open loop planning) trajectory visualization

Search comparison

policy: no search
start: [0.03271197 0.99020872]
goal: [0.81310241 0.028764  ]
steps: 300
----------
policy: search
start: [0.03271197 0.99020872]
goal: [0.81310241 0.028764  ]
steps: 111

State graph visualization

  1. SoRB state graph (per critic in ensemble) SoRB state graph

  2. SGM state graph (ensembled)

SGM state graph

Initial SparseSearchPolicy (|V|=202, |E|=1894) has success rate 0.20, evaluated in 14.26 seconds
Filtered SparseSearchPolicy (|V|=202, |E|=986) has success rate 0.80, evaluated in 8.44 seconds
Took 10000 cleanup steps in 84.45 seconds
Cleaned SparseSearchPolicy (|V|=202, |E|=955) has success rate 1.00, evaluated in 6.69 seconds

Credits

References

[1]: Michael Laskin, Scott Emmons, Ajay Jain, Thanard Kurutach, Pieter Abbeel, Deepak Pathak, "Sparse Graphical Memory for Robust Planning", 2020.

[2]: Benjamin Eysenbach, Ruslan Salakhutdinov, Sergey Levine, "Search on the Replay Buffer: Bridging Planning and Reinforcement Learning", 2019.

Releases

No releases published

Packages

No packages published

Languages