megengine.module.Conv2d¶
- class Conv2d(in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, conv_mode='cross_correlation', compute_mode='default', **kwargs)[源代码]¶
- 对输入张量进行二维卷积 - For instance, given an input of the size \((N, C_{\text{in}}, H, W)\), this layer generates an output of the size \((N, C_{\text{out}}, H_{\text{out}}, W_{\text{out}})\) through the process described as below: \[\text{out}(N_i, C_{\text{out}_j}) = \text{bias}(C_{\text{out}_j}) + \sum_{k = 0}^{C_{\text{in}} - 1} \text{weight}(C_{\text{out}_j}, k) \star \text{input}(N_i, k)\]- 其中 \(\star\) 是有效的2D互相关运算; \(N\) 是批大小; \(C\) 表示通道数; \(H\) 是以像素为单位输入平面的高度; \(W\) 是以像素为单位的平面宽度。 - 通常,输出的特征图的形状可以被下面的方式推导出来: - input: \((N, C_{\text{in}}, H_{\text{in}}, W_{\text{in}})\) - output: \((N, C_{\text{out}}, H_{\text{out}}, W_{\text{out}})\) where \[\text{H}_{out} = \lfloor \frac{\text{H}_{in} + 2 * \text{padding[0]} - \text{dilation[0]} * (\text{kernel_size[0]} - 1) - 1}{\text{stride[0]}} + 1 \rfloor\]\[\text{W}_{out} = \lfloor \frac{\text{W}_{in} + 2 * \text{padding[1]} - \text{dilation[1]} * (\text{kernel_size[1]} - 1) - 1}{\text{stride[1]}} + 1 \rfloor\]- 当 groups == in_channels 且 out_channels == K * in_channels ,其中 K 是正整数,该操作也被称为深度方向卷积(depthwise convolution)。 - In other words, for an input of size \((N, C_{in}, H_{in}, W_{in})\), a depthwise convolution with a depthwise multiplier K, can be constructed by arguments \((in\_channels=C_{in}, out\_channels=C_{in} \times K, ..., groups=C_{in})\). - 参数
- in_channels ( - int) – 输入数据中的通道数。
- out_channels ( - int) – 输出数据中的通道数。
- kernel_size ( - Union[- int,- Tuple[- int,- int]]) – size of weight on spatial dimensions. If kernel_size is an- int, the actual kernel size would be- (kernel_size, kernel_size).
- padding ( - Union[- int,- Tuple[- int,- int]]) – 输入数据空域维度两侧的填充(padding)大小。仅支持填充0值。默认:0
- dilation ( - Union[- int,- Tuple[- int,- int]]) – 二维卷积运算的空洞(dilation)。默认:1
- groups ( - int) – number of groups into which the input and output channels are divided, so as to perform a “grouped convolution”. When- groupsis not 1,- in_channelsand- out_channelsmust be divisible by- groups, and there would be an extra dimension at the beginning of the weight’s shape. Default: 1
- bias ( - bool) – 是否将偏置(bias)加入卷积的结果中。默认:True
- conv_mode ( - str) – Supports cross_correlation. Default: cross_correlation
- compute_mode ( - str) – When set to “default”, no special requirements will be placed on the precision of intermediate results. When set to “float32”, “float32” would be used for accumulator and intermediate result, but only effective when input and output are of float16 dtype.
 
 - 注解 - weightusually has shape- (out_channels, in_channels, height, width),
- if groups is not 1, shape will be - (groups, out_channels // groups, in_channels // groups, height, width)
 
- biasusually has shape- (1, out_channels, *1)
 - 实际案例 - import numpy as np import megengine as mge import megengine.module as M m = M.Conv2d(in_channels=3, out_channels=1, kernel_size=3) inp = mge.tensor(np.arange(0, 96).astype("float32").reshape(2, 3, 4, 4)) oup = m(inp) print(oup.numpy().shape) - 输出: - (2, 1, 2, 2) - 方法 - apply(fn)- 对当前模块中的所有模块应用函数 - fn,包括当前模块本身。- buffers([recursive])- 返回该模块中对于buffers的一个可迭代对象。 - calc_conv(inp, weight, bias)- children(**kwargs)- 返回一个可迭代对象,可遍历所有属于当前模块的直接属性的子模块。 - disable_quantize([value])- 设置 - module的- quantize_diabled属性,并返回- module。- eval()- 当前模块中所有模块的 - training属性(包括自身)置为- False,并将其切换为推理模式。- forward(inp)- load_state_dict(state_dict[, strict])- 加载一个参数字典,这个字典通常使用 - state_dict得到。- modules(**kwargs)- 返回一个可迭代对象,可以遍历当前模块中的所有模块,包括其本身。 - named_buffers([prefix, recursive])- 返回可遍历模块中 key 与 buffer 的键值对的可迭代对象,其中 - key为从该模块至 buffer 的点路径(dotted path)。- named_children(**kwargs)- 返回可迭代对象,可以遍历属于当前模块的直接属性的所有子模块(submodule)与键(key)组成的”key-submodule”对,其中'key'是子模块对应的属性名。 - named_modules([prefix])- 返回可迭代对象,可以遍历当前模块包括自身在内的所有其内部模块所组成的key-module键-模块对,其中'key'是从当前模块到各子模块的点路径(dotted path)。 - named_parameters([prefix, recursive])- 返回一个可迭代对象,可以遍历当前模块中key与 - Parameter组成的键值对。其中- key是从模块到- Parameter的点路径(dotted path)。- named_tensors([prefix, recursive])- Returns an iterable for key tensor pairs of the module, where - keyis the dotted path from this module to the tensor.- parameters([recursive])- 返回一个可迭代对象,遍历当前模块中的所有 - Parameter- register_forward_hook(hook)- 给模块输出注册一个回调函数。 - 给模块输入注册一个回调函数。 - replace_param(params, start_pos[, seen])- Replaces module's parameters with - params, used by- ParamPackto- rtype
 - state_dict([rst, prefix, keep_var])- tensors([recursive])- Returns an iterable for the - Tensorof the module.- train([mode, recursive])- 当前模块中所有模块的 - training属性(包括自身)置为- mode。- 将所有参数的梯度置0。