About the first-order approximation #6

tht106 · 2020-12-31T03:05:42Z

Hi, thank you for this fascinating work and providing a demo of MLDG.

Two quick questions:

Did you use the first-order approximation in the MLP version of MLDG? The code in ops.py looks like a first-order approximation.

```python
    if not stop_gradient:
        grad_weight = autograd.grad(meta_loss, weight, create_graph=True)[0]

        if bias is not None:
            grad_bias = autograd.grad(meta_loss, bias, create_graph=True)[0]
            bias_adapt = bias - grad_bias * meta_step_size
        else:
            bias_adapt = bias

    else:
        grad_weight = Variable(autograd.grad(meta_loss, weight, create_graph=True)[0].data, requires_grad=False)

        if bias is not None:
            grad_bias = Variable(autograd.grad(meta_loss, bias, create_graph=True)[0].data, requires_grad=False)
            bias_adapt = bias - grad_bias * meta_step_size
        else:
            bias_adapt = bias

    return F.linear(inputs,
                    weight - grad_weight * meta_step_size,
                    bias_adapt)
else:
    return F.linear(inputs, weight, bias)
```

I am also wondering about the meaning of the `--stop_gradient` parameter. What happens when it is set to true?

CinKKKyo · 2023-05-12T00:45:58Z

The meaning of the `--stop_gradient` parameter confuses me too. Did you figure it out?

CharmsGraker · 2023-11-28T06:10:22Z

Setting stop_gradient=True avoids the large computational cost of executing the meta-optimization.
In my opinion, with stop_gradient=True the whole algorithm can be regarded as optimizing the two training objectives F(theta) and G(theta) alternately.
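
To make the difference concrete, here is a minimal sketch on a toy scalar problem. It is my own illustration, not code from the repository: the function name `mldg_gradient`, the quadratic losses, and the values of `a`, `b`, `alpha` and `beta` are all assumptions, and `.detach()` stands in for the older `Variable(grad.data, requires_grad=False)` idiom used in ops.py. It compares the gradient of F(theta) + beta * G(theta - alpha * dF/dtheta) with and without cutting the graph through the inner gradient.

```python
import torch
from torch import autograd


def mldg_gradient(theta, stop_gradient, alpha=0.1, beta=1.0):
    """Gradient of F(theta) + beta * G(theta_adapt) w.r.t. theta (toy example).

    F and G are hypothetical quadratic meta-train / meta-test losses.
    """
    a, b = 1.0, 3.0  # illustrative targets for the two losses

    # Meta-train loss F and its gradient; create_graph=True keeps the graph
    # so that the gradient itself can be differentiated later.
    meta_train_loss = 0.5 * (theta - a) ** 2
    grad = autograd.grad(meta_train_loss, theta, create_graph=True)[0]

    if stop_gradient:
        # Treat the inner gradient as a constant: no second-order term.
        grad = grad.detach()

    # One inner step, then the meta-test loss G at the adapted parameter.
    theta_adapt = theta - alpha * grad
    meta_test_loss = 0.5 * (theta_adapt - b) ** 2

    total = meta_train_loss + beta * meta_test_loss
    return autograd.grad(total, theta)[0]


full = mldg_gradient(torch.tensor(0.0, requires_grad=True), stop_gradient=False)
first_order = mldg_gradient(torch.tensor(0.0, requires_grad=True), stop_gradient=True)
print(full.item())         # about -3.61: G' is scaled by (1 - alpha * F'')
print(first_order.item())  # about -3.9: the (1 - alpha * F'') factor is dropped
```

In this toy case the only difference is the factor (1 - alpha * F''(theta)) on the meta-test gradient. With stop_gradient=True that factor is dropped, so the update is the sum of the plain gradient of F at theta and the plain gradient of G at the adapted parameter, which matches the reading that F and G are simply being trained together without second-order terms.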
