
Sigmoid with residue #869

Closed

Conversation

opfromthestart
Contributor

I don't know if this is something that should be in dfdx, but it is useful for my use case, where I need my model to return probabilities between 0 and 1 and each input is independent of all the others. With a normal sigmoid I get vanishing gradients, so I made this.

@opfromthestart
Contributor Author

Right now it enforces a minimum gradient of 0.0001; this should maybe be configurable.

@coreylowman
Owner

Since this can be accomplished by doing (x.negate().exp() + 1.0).recip(), I'm inclined not to merge this. I know that this specialized kernel will be more efficient, but I think this is niche enough that I don't want to add it to core.
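For reference, a minimal scalar sketch of that composition (plain Rust rather than the dfdx tensor API; the function name is illustrative):

```rust
// Forward pass of sigmoid written as the composition above:
// (exp(-x) + 1).recip() == 1 / (1 + e^(-x))
fn sigmoid_composed(x: f64) -> f64 {
    ((-x).exp() + 1.0).recip()
}

fn main() {
    for x in [-4.0_f64, 0.0, 4.0] {
        println!("sigmoid({x}) = {}", sigmoid_composed(x));
    }
}
```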

Thanks for the PR!

@opfromthestart
Contributor Author

opfromthestart commented Oct 27, 2023

This is not entirely right. While the forward pass is the same, the backward pass is different: it makes sure that the gradients through the function are always at least some small epsilon. As in, if x=100000000, sigmoid(x)=1, which should make sigmoid_dx(x)=0, but instead it is 0.00001. I should definitely refactor the forward pass to use that composition, however.
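A scalar sketch of that distinction, assuming the backward pass simply clamps the derivative and using an illustrative epsilon of 1e-4 (the function names here are hypothetical, not the PR's actual kernel code):

```rust
/// Standard sigmoid derivative: s(x) * (1 - s(x)).
/// For large |x| this underflows toward 0, which is the vanishing-gradient problem.
fn sigmoid_dx(x: f64) -> f64 {
    let s = 1.0 / (1.0 + (-x).exp());
    s * (1.0 - s)
}

/// Residue-style backward pass: clamp the gradient so it never drops below
/// eps, keeping some learning signal even when the forward output has
/// saturated at 0 or 1.
fn sigmoid_dx_residue(x: f64, eps: f64) -> f64 {
    sigmoid_dx(x).max(eps)
}

fn main() {
    let x = 100_000_000.0;
    println!("plain:   {}", sigmoid_dx(x));               // prints 0
    println!("residue: {}", sigmoid_dx_residue(x, 1e-4)); // prints 0.0001
}
```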
