
[Feature] Consistent Dropout #2399

Merged — 12 commits merged into pytorch:main on Sep 10, 2024
Conversation

N00bcak (Contributor) commented Aug 15, 2024

Description

Introduces Consistent Dropout (Hausknecht & Wagener, 2022) to TorchRL.

Consists of the following changes:

  • ConsistentDropout PyTorch module
  • Companion TensorDictModule: ConsistentDropoutModule
  • Tests for ConsistentDropoutModule
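The core idea from Hausknecht & Wagener (2022) can be sketched in plain PyTorch. This is a hypothetical minimal sketch, not TorchRL's actual ConsistentDropout implementation: the class name `ConsistentDropoutSketch` and the `refresh_mask` method are invented for illustration. The point is that the dropout mask is sampled once and then reused on every forward pass until explicitly refreshed, so the policy behaves deterministically within a trajectory.

```python
import torch

class ConsistentDropoutSketch(torch.nn.Module):
    """Hypothetical sketch: dropout with a mask that persists across calls."""

    def __init__(self, p=0.5):
        super().__init__()
        self.p = p
        self.mask = None

    def refresh_mask(self, shape):
        # Bernoulli keep-mask, scaled by 1/(1-p) so the expected
        # activation magnitude is unchanged (inverted dropout).
        keep = torch.bernoulli(torch.full(shape, 1.0 - self.p))
        self.mask = keep / (1.0 - self.p)

    def forward(self, x):
        # Sample the mask lazily on first use; reuse it afterwards.
        if self.mask is None:
            self.refresh_mask(x.shape)
        return x * self.mask
```

In an RL setting, `refresh_mask` would be invoked at episode reset, so the same mask applies to every step of a trajectory.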

Motivation and Context

Addresses a feature request and fleshes out PR draft #1587.

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds core functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)
  • Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

pytorch-bot commented Aug 15, 2024
🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2399

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 8 Unrelated Failures

As of commit 3231780 with merge base 6aa4b53:

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Aug 15, 2024
@vmoens changed the title from [New Feature] Consistent Dropout to [Feature] Consistent Dropout on Aug 30, 2024
@vmoens added the enhancement (New feature or request) label on Aug 30, 2024
vmoens (Contributor) commented Aug 30, 2024

I need to make a second pass over this but here's what I got running:

from torchrl.modules import ConsistentDropoutModule, get_primers_from_module
from tensordict.nn import TensorDictModule as Mod, TensorDictSequential as Seq
import torch
from torchrl.envs import GymEnv

m = Seq(
    Mod(torch.nn.Linear(3, 4), in_keys=["observation"], out_keys=["intermediate"]),
    ConsistentDropoutModule(p=0.5, input_shape=(4,), in_keys="intermediate"),
    Mod(torch.nn.Linear(4, 1), in_keys=["intermediate"], out_keys=["action"]),
)
primer = get_primers_from_module(m)
env = GymEnv("Pendulum-v1").append_transform(primer)
r = env.reset()
# The mask key name is auto-generated per module instance.
env.rollout(32, m)["next", "mask_13266468624"]

With this, the dropout mask is generated during reset, so it stays fixed for the rest of the rollout.

This works, but as you can see we need to change the size of the primer, which means the same primer can't be reused across envs with different batch sizes:

from torchrl.modules import ConsistentDropoutModule, get_primers_from_module
from tensordict.nn import TensorDictModule as Mod, TensorDictSequential as Seq
import torch
from torchrl.envs import GymEnv, SerialEnv, StepCounter

m = Seq(
    Mod(torch.nn.Linear(3, 4), in_keys=["observation"], out_keys=["intermediate"]),
    # The leading 2 in input_shape matches the batch size of the SerialEnv below.
    ConsistentDropoutModule(p=0.5, input_shape=(2, 4), in_keys="intermediate"),
    Mod(torch.nn.Linear(4, 1), in_keys=["intermediate"], out_keys=["action"]),
)
primer = get_primers_from_module(m)
env0 = GymEnv("Pendulum-v1").append_transform(StepCounter(5))
env1 = GymEnv("Pendulum-v1").append_transform(StepCounter(6))
env = SerialEnv(2, [lambda env=env0: env, lambda env=env1: env])
env = env.append_transform(primer)
r = env.reset()

There should be a way to solve this though!

@vmoens vmoens merged commit 0ad8e59 into pytorch:main Sep 10, 2024
63 of 69 checks passed