Because the absolute value function is not differentiable at x = 0, this is a subgradient rather than a gradient. In practice, though, the weights essentially never become exactly 0, so it is effectively equivalent to the gradient.
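To spell this out, the subdifferential of the L1 term S·|w| is

```math
\partial\big(S\,|w|\big) =
\begin{cases}
\{\,S \cdot \mathrm{sign}(w)\,\} & w \neq 0,\\
[-S,\; S] & w = 0,
\end{cases}
```

so whatever value `torch.sign` returns at 0 (any of -1, 0, or 1), multiplying it by S still lands in [-S, S], i.e. the code picks a valid subgradient everywhere.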
Unlike PyTorch, Torch has no automatic differentiation, so I found this to be the most convenient way to do what we wanted, and we just used it.
The L1 sparsity penalty itself should be `torch.abs(weight)`; can you explain this line in more detail?
```lua
-- subgradient of the L1 penalty S * |weight|
local subgradient = S * torch.sign(weight)
```
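For a bit more context, here is a minimal sketch of how such a subgradient can be folded into a hand-rolled Torch training step. The toy model, the coefficient name `S`, and the learning rate are illustrative assumptions, not taken from the repo:

```lua
require 'torch'
require 'nn'

-- toy model: a single linear layer (illustrative only)
local model = nn.Linear(10, 1)
local parameters, gradParameters = model:getParameters()
local criterion = nn.MSECriterion()

local S = 1e-4            -- L1 penalty coefficient (assumed name)
local learningRate = 0.01

-- one SGD step with an L1 penalty on the weights
local input  = torch.randn(10)
local target = torch.randn(1)

gradParameters:zero()
local output = model:forward(input)
local loss = criterion:forward(output, target)
model:backward(input, criterion:backward(output, target))

-- add the L1 subgradient S * sign(w) to the data-loss gradient
gradParameters:add(S, torch.sign(parameters))

-- gradient descent update: w <- w - lr * (dL/dw + S * sign(w))
parameters:add(-learningRate, gradParameters)
```

Since there is no autodiff in Torch, the penalty's subgradient is simply added to the backprop gradients by hand before the parameter update, which is what the quoted line does.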