Dynamic Convolution

Dynamic convolution is a novel operator design that increases the representational power of lightweight CNNs, without increasing their computational cost or altering their depth or width. Developed by Chen et al., dynamic convolution uses multiple parallel convolution kernels, with the same size and input/output dimensions, in place of a single kernel per layer. How dynamic convolution works The different convolution kernels in dynamic convolution are generated attention weights through a squ

Grouped Convolution

What is Grouped Convolution? A Grouped Convolution is a type of convolutional neural network (CNN) that uses multiple kernels per layer, resulting in multiple channel outputs per layer. The main purpose of using Grouped Convolutions in a neural network is to make the network learn a varied set of low-level and high-level features. This leads to wider networks that are better at recognizing different types of data. The History of Grouped Convolution The idea of using Grouped Convolutions was

Groupwise Point Convolution

A Groupwise Point Convolution is a special type of convolution that is used in image processing, computer vision, and deep learning. It involves using multiple sets of convolution filters to process a single input image, which leads to improved accuracy and efficiency when compared to standard convolution techniques. What is convolution? Convolution is a mathematical operation that is used to combine two functions in order to create a third function that describes how one function modifies th

Invertible 1x1 Convolution

An invertible 1x1 convolution is a type of mathematical operation used in flow-based generative models. Its purpose is to reverse the ordering of channels within an image. This technique is used to create more complex and dynamic images for a variety of purposes, such as in computer graphics or machine learning. What is a convolution? Before diving further into what an invertible 1x1 convolution is, it's important to understand the basics of a convolution. A convolution is a mathematical oper

Lightweight Convolution

Explaining LightConv at an 8th Grade Level LightConv is a way to analyze sequences of data, like music, speech, or text, to understand patterns and predict what comes next. It does this by breaking the sequence down into smaller parts, called channels, and looking at how those parts interact with each other. One of the key things that makes LightConv different from other methods is that it has a fixed context window. That means it only looks at a certain number of parts at a time, rather than

Masked Convolution

Masked Convolution is a type of convolution that is used for image generation models. It is introduced with the PixelRNN generative models for producing better images with only those pixels that are already visited. In this article, we will delve deeper into the concept of masked convolution, its use cases, and its benefits. What is Masked Convolution? Convolution is a mathematical operation that is used for image processing tasks such as feature extraction, object detection, and image classi

Mixed Depthwise Convolution

Understanding MixConv: Mixing up Multiple Kernel Sizes In the world of convolutional neural networks (CNNs), there is a type of convolution called depthwise convolution. A depthwise convolution applies a single kernel size to all channels. However, a new and more innovative type of convolution has been developed and is called MixConv or Mixed Depthwise Convolution. This type of convolution mixes up multiple kernel sizes in a single convolution and is based on the insight that depthwise convolut

Octave Convolution

Octave Convolution (OctConv) is a method that reduces the memory and computation cost of storing and processing feature maps that vary spatially "slower" at a lower spatial resolution. By taking in feature maps containing tensors of two frequencies one octave apart, OctConv extracts information directly from the low-frequency maps without the need of decoding it back to the high-frequency. The Motivation Behind Octave Convolution The motivation behind Octave Convolution is that in natural ima

Pointwise Convolution

Pointwise Convolution is a method used in image processing that involves a small kernel, known as a 1x1 kernel which iterates through every single point of an image. The kernel has a depth based on the number of channels present in an input image making it one of the most efficient classes of convolutions. What is Convolution? Convolution is a mathematical operation used in image and signal processing where two functions are multiplied together and then integrated over an interval. In image p

PP-OCR

Understanding PP-OCR: A Revolutionary OCR System PP-OCR is an OCR system that comprises three main components, namely text detection, detected boxes rectification, and text recognition. OCR stands for Optical Character Recognition, which is the technology that enables computers to recognize printed or written text characters. Unlike the traditional OCR systems, PP-OCR is a revolutionary OCR system that can recognize text areas in images with high precision and accuracy. Text Detection: Locati

Selective Kernel Convolution

A Selective Kernel Convolution is a type of convolution that is used in deep learning to enable neurons to adjust their receptive field sizes among multiple kernels with different kernel sizes. In simple terms, this means that the convolution is able to adaptively adjust the size and shape of the filters that it uses to analyze data. What Is Convolution? Before diving deeper into Selective Kernel Convolution, it's important to understand what convolution is. Convolution is a mathematical proc

ShapeConv

Understanding ShapeConv: A Shape-aware Convolutional Layer for Depth Feature Processing in Indoor RGB-D Semantic Segmentation ShapeConv is a type of convolutional layer that is designed for extensively processing the depth feature in indoor RGB-D semantic segmentation. This convolutional layer has been engineered for efficient and purposeful depth feature decomposition before any processing happens, making it a valuable tool for researchers and developers looking to enhance their depth feature

Span-Based Dynamic Convolution

Span-Based Dynamic Convolution is a cutting-edge technique used in the ConvBERT architecture to capture local dependencies between tokens. Unlike classic convolution, which relies on fixed parameters shared for all input tokens, Span-Based Dynamic Convolution uses a kernel generator to produce different kernels for different input tokens, providing higher flexibility in capturing local dependencies. The Limitations of Classic and Dynamic Convolution Classic convolution is limited in its abili

Spatially Separable Convolution

Overview of Spatially Separable Convolution in Deep Learning In the world of deep learning, convolution is one of the basic operations used in image processing, natural language processing and many other fields. A convolution is a mathematical operation that is used to extract features and patterns from input data. It is the building block of convolutional neural networks (CNNs), which are a type of deep learning model that is very good at recognizing patterns in images and video. One of the k

Submanifold Convolution

Submanifold Convolution (SC) is a computer science technique used in tasks with sparse data, such as semantic segmentation of 3D point clouds. Introduction to Submanifold Convolution In recent times, computer scientists and data analysts have been striving to come up with better ways to effectively and efficiently handle data. One such technique is the submanifold convolution (SC). This method has been developed to help perform tasks that involve sparse data, such as 3D semantic segmentation

Switchable Atrous Convolution

Overview of Switchable Atrous Convolution (SAC) Switchable Atrous Convolution (SAC) is a technique used in computer vision to improve the accuracy of object detection in images. It works by changing the computation of the convolutional layers in a neural network, allowing for different atrous rates and switch functions to be used. The result is a more accurate and efficient object detection system. What is Convolution? Convolution is a mathematical operation used in computer vision to analyz

Prev 12 2 / 2