9赞

pytorch动态网络以及权重共享实例

作者：mobiledu2402851377 | 2021-11-03 03:29

今天小编就为大家分享一篇pytorch动态网络以及权重共享实例，具有很好的参考价值，希望对大家有所帮助。一起跟随小编过来看看吧

pytorch 动态网络+权值共享

pytorch以动态图著称，下面以一个栗子来实现动态网络和权值共享技术:

# -*- coding: utf-8 -*-
import random
import torch


class DynamicNet(torch.nn.Module):
  def __init__(self, D_in, H, D_out):
    """
    这里构造了几个向前传播过程中用到的线性函数
    """
    super(DynamicNet, self).__init__()
    self.input_linear = torch.nn.Linear(D_in, H)
    self.middle_linear = torch.nn.Linear(H, H)
    self.output_linear = torch.nn.Linear(H, D_out)

  def forward(self, x):
    """
    For the forward pass of the model, we randomly choose either 0, 1, 2, or 3
    and reuse the middle_linear Module that many times to compute hidden layer
    representations.

    Since each forward pass builds a dynamic computation graph, we can use normal
    Python control-flow operators like loops or conditional statements when
    defining the forward pass of the model.

    Here we also see that it is perfectly safe to reuse the same Module many
    times when defining a computational graph. This is a big improvement from Lua
    Torch, where each Module could be used only once.
    这里中间层每次向前过程中都是随机添加0-3层，而且中间层都是使用的同一个线性层，这样计算时，权值也是用的同一个。
    """
    h_relu = self.input_linear(x).clamp(min=0)
    for _ in range(random.randint(0, 3)):
      h_relu = self.middle_linear(h_relu).clamp(min=0)
    y_pred = self.output_linear(h_relu)
    return y_pred


    # N is batch size; D_in is input dimension;
    # H is hidden dimension; D_out is output dimension.
    N, D_in, H, D_out = 64, 1000, 100, 10

    # Create random Tensors to hold inputs and outputs
    x = torch.randn(N, D_in)
    y = torch.randn(N, D_out)

    # Construct our model by instantiating the class defined above
    model = DynamicNet(D_in, H, D_out)

    # Construct our loss function and an Optimizer. Training this strange model with
    # vanilla stochastic gradient descent is tough, so we use momentum
    criterion = torch.nn.MSELoss(reduction='sum')
    optimizer = torch.optim.SGD(model.parameters(), lr=1e-4, momentum=0.9)
    for t in range(500):
      # Forward pass: Compute predicted y by passing x to the model
      y_pred = model(x)

      # Compute and print loss
      loss = criterion(y_pred, y)
      print(t, loss.item())

      # Zero gradients, perform a backward pass, and update the weights.
      optimizer.zero_grad()
      loss.backward()
      optimizer.step()

这个程序实际上是一种RNN结构，在执行过程中动态的构建计算图

References: Pytorch Documentations.

以上这篇pytorch动态网络以及权重共享实例就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持。

推荐阅读

程序员
在Visual Studio中更改或添加默认编辑器

如何解决《在VisualStudio中更改或添加默认编辑器》经验，为你挑选了1个好方法。 ... [详细]
程序员
Twitter文本js,不计算包含URL的文本的长度#!

如何解决《Twitter文本js,不计算包含URL的文本的长度#!》经验，为你挑选了1个好方法。 ... [详细]
程序员
在下面的java程序中,我不了解执行流程和"this"关键字执行情况？

如何解决《在下面的java程序中,我不了解执行流程和"this"关键字执行情况？》经验，为你挑选了1个好方法。 ... [详细]
程序员
谷歌iframe'在底部引起额外的填充

如何解决《谷歌iframe'在底部引起额外的填充》经验，为你挑选了1个好方法。 ... [详细]
程序员
总数列表直到阈值

如何解决《总数列表直到阈值》经验，为你挑选了1个好方法。 ... [详细]
程序员
使用BUFG来驱动时钟负载

如何解决《使用BUFG来驱动时钟负载》经验，为你挑选了1个好方法。 ... [详细]
程序员
ggplot2 1.01中没有更多geom_label()？

如何解决《ggplot21.01中没有更多geom_label()？》经验，为你挑选了1个好方法。 ... [详细]
程序员
在单元测试中使用TestHiveContext/HiveContext

如何解决《在单元测试中使用TestHiveContext/HiveContext》经验，为你挑选了0个好方法。 ... [详细]
程序员
在预推钩中克隆GIT仓库时出现"工作树已经存在"的例外情况

如何解决《在预推钩中克隆GIT仓库时出现"工作树已经存在"的例外情况》经验，为你挑选了1个好方法。 ... [详细]
程序员
检查哈希是否包含另一个哈希

如何解决《检查哈希是否包含另一个哈希》经验，为你挑选了1个好方法。 ... [详细]
程序员
使用find()方法在投影中出错

如何解决《使用find()方法在投影中出错》经验，为你挑选了1个好方法。 ... [详细]
程序员
skflow回归预测多个值

如何解决《skflow回归预测多个值》经验，为你挑选了1个好方法。 ... [详细]
程序员
Laravel Elixir Browserify失败!:意外的令牌 - 使用VueJs

如何解决《LaravelElixirBrowserify失败!:意外的令牌-使用VueJs》经验，为你挑选了1个好方法。 ... [详细]
程序员
强制子类在重写时调用父方法

如何解决《强制子类在重写时调用父方法》经验，为你挑选了2个好方法。 ... [详细]
程序员
如何确定字段在SQL Server 2008 R2中是否具有前导零？

如何解决《如何确定字段在SQLServer2008R2中是否具有前导零？》经验，为你挑选了1个好方法。 ... [详细]
程序员
警告:建议不要在没有服务器身份验证的情况下建立SSL连接

如何解决《警告:建议不要在没有服务器身份验证的情况下建立SSL连接》经验，为你挑选了1个好方法。 ... [详细]
程序员
webstorm忽略.tsconfig.json文件中的'excluded'目录

如何解决《webstorm忽略.tsconfig.json文件中的'excluded'目录》经验，为你挑选了0个好方法。 ... [详细]
程序员
VBA中的SQL语句

如何解决《VBA中的SQL语句》经验，为你挑选了1个好方法。 ... [详细]
程序员
android - 子活动完成时刷新父活动

如何解决《android-子活动完成时刷新父活动》经验，为你挑选了1个好方法。 ... [详细]
程序员
R中的绘图函数与对数刻度参数显示负值

如何解决《R中的绘图函数与对数刻度参数显示负值》经验，为你挑选了1个好方法。 ... [详细]

mobiledu2402851377

这个屌丝很懒，什么也没留下！

关注作者

Tags | 热门标签

RankList | 热门文章