Skip to content

Commit

Permalink
readme
Browse files Browse the repository at this point in the history
  • Loading branch information
yrlu committed May 28, 2017
1 parent e623499 commit e2f44a1
Show file tree
Hide file tree
Showing 9 changed files with 20 additions and 1 deletion.
21 changes: 20 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@ Implementations of selected inverse reinforcement learning / imitation learning
#### Algorithms implemented

- [linear inverse reinforcement learning (Ng & Russell 2000)](#linear-inverse-reinforcement-learning)
- [maximum entropy inverse reinforcement learning (Ziebart et al. 2008)](#maximum-entropy-inverse-reinforcement-learning)

## Linear inverse reinforcement learning

Expand All @@ -13,4 +14,22 @@ Implementations of selected inverse reinforcement learning / imitation learning
$ python linear_irl_gridworld.py --act_random=0.3 --gamma=0.5 --l1=10 --r_max=10
```

<img src="imgs/rmap_gt.jpg" width="200"> <img src="imgs/vmap_gt.jpg" width="200"> <img src="imgs/rmap_lirl.jpg" width="200"> <img src="imgs/rmap_lirl_3d.jpg" width="200">
<img src="imgs/rmap_gt.jpg" width="200"> <img src="imgs/vmap_gt.jpg" width="200"> <img src="imgs/rmap_lirl.jpg" width="200"> <img src="imgs/rmap_lirl_3d.jpg" width="200">

## Maximum entropy inverse reinforcement learning

- Following Ziebart et al. 2008 paper: [Maximum Entropy Inverse Reinforcement Learning](https://www.aaai.org/Papers/AAAI/2008/AAAI08-227.pdf)
- `$ python maxent_gridworld.py --help` for options descriptions

<!-- ```
$ python maxent_gridworld.py --gamma=0.8 --n_trajs=100 --l_traj=20 --no-rand_start --learning_rate=0.01 --n_iters=20
```
<img src="imgs/rmap_gt_maxent.jpg" width="200"> <img src="imgs/vmap_gt_maxent.jpg" width="200"> <img src="imgs/rmap_maxent.jpg" width="200"> <img src="imgs/rmap_maxent_3d.jpg" width="200">
-->

```
$ python maxent_gridworld.py --height=10 --width=10 --gamma=0.8 --n_trajs=100 --l_traj=50 --no-rand_start --learning_rate=0.01 --n_iters=20
```

<img src="imgs/rmap_gt_maxent_10.jpg" width="200"> <img src="imgs/vmap_gt_maxent_10.jpg" width="200"> <img src="imgs/rmap_maxent_10.jpg" width="200"> <img src="imgs/rmap_maxent_3d_10.jpg" width="200">
Binary file added imgs/rmap_gt_maxent.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added imgs/rmap_gt_maxent_10.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added imgs/rmap_maxent.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added imgs/rmap_maxent_10.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added imgs/rmap_maxent_3d.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added imgs/rmap_maxent_3d_10.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added imgs/vmap_gt_maxent.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added imgs/vmap_gt_maxent_10.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit e2f44a1

Please sign in to comment.