Neural Network Architecture

34 papers

3 followers

About this topic

Neural Network Architecture refers to the structured design of artificial neural networks, encompassing the arrangement of layers, types of neurons, and connections between them. It determines how data is processed and learned, influencing the network's performance in tasks such as classification, regression, and pattern recognition.

Key research themes

1. How can neural network architecture be optimized for computational efficiency without sacrificing accuracy?

This research area focuses on designing and scaling neural network architectures to achieve high accuracy on specified tasks while minimizing computational complexity and hardware resource usage. It is critical for deploying neural networks on resource-limited devices and speeding up inference by reducing operations and hardware area.

Finding Storage- and Compute-Efficient Convolutional Neural Networks

by Daniel Becking

2021, Master's Thesis, Technische Universität Berlin

Key finding: Proposed a two-step paradigm integrating compound model scaling (a lightweight NAS approach) and Entropy-Constrained Trained Ternarization (EC2T), a simultaneous pruning and ternary quantization algorithm, which compresses... Read more
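To make "simultaneous pruning and ternary quantization" concrete, here is a minimal sketch of plain threshold-based ternarization. It is not the EC2T algorithm from the thesis: the function names and the threshold ratio of 0.7 are illustrative assumptions, and no entropy constraint or training loop is modeled.

```python
def ternarize(weights, threshold_ratio=0.7):
    """Map each weight to one of {-a, 0, +a}.

    Weights whose magnitude falls below the threshold are pruned to 0;
    the rest share a single magnitude a (the mean of the surviving magnitudes).
    The threshold rule is an assumption chosen for illustration.
    """
    mean_abs = sum(abs(w) for w in weights) / len(weights)
    threshold = threshold_ratio * mean_abs
    kept = [abs(w) for w in weights if abs(w) > threshold]
    scale = sum(kept) / len(kept) if kept else 0.0
    return [0.0 if abs(w) <= threshold else (scale if w > 0 else -scale)
            for w in weights]


def sparsity(ternary):
    """Fraction of weights that were pruned to zero."""
    return sum(1 for w in ternary if w == 0.0) / len(ternary)


weights = [0.9, -0.8, 0.05, -0.02]
ternary = ternarize(weights)
```

Because zero is one of the three levels, a single pass both prunes small weights and quantizes the remaining ones, which is the general idea the key finding refers to.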

An Efficient Approach for Neural Network Architecture

by Kasem Khalil

2019

Key finding: Introduced a neural network hardware design that reduces the number of physical hidden layers by half (from N to N/2) through multiplexing input and output layers while maintaining the same accuracy as traditional N-layer... Read more

Architecture of A Novel Low-Cost Hardware Neural Network

by Kasem Khalil

2020, 2020 IEEE 63rd International Midwest Symposium on Circuits and Systems (MWSCAS)

Key finding: Designed a neural network architecture sharing multipliers and adders between two hidden layers, cutting the number of these critical hardware components by half and reducing hardware cost by 63%. The method maintained... Read more

Finding the Optimal Topology of an Approximating Neural Network

by Stoyan Cheresharov

2023, Mathematics

Key finding: Derived analytical formulas to estimate upper bounds on the number of hidden layers and neurons in networks trained via algorithms using the Jacobi matrix (e.g., Levenberg-Marquardt). These bounds aid in selecting compact yet... Read more

Heuristic Architecture Search Using Network Morphism for Chest X-Ray Classification

by Pavlo Radiuk

2021

Key finding: Developed a heuristic architecture search method leveraging network morphism combined with hill-climbing and functional saving, achieving competitive chest X-ray classification accuracy (73.2% validation accuracy, 84.5% AUC)... Read more

2. What methodologies and algorithms enable automated search and optimization of neural network architectures to improve performance and reduce manual design efforts?

This research theme investigates algorithmic frameworks and search strategies such as genetic algorithms, evolutionary methods, modular search spaces, and heuristics to automate the process of neural network architecture design. Automating architecture search accelerates model development, improves generalization, and allows discovering architectures difficult to design manually, helping in diverse tasks from image classification to medical imaging.

A Framework for Exploring and Modelling Neural Architecture Search Methods

by Nadiia Hrypynska

2022

Key finding: Proposed a systematic framework that categorizes and benchmarks NAS methods by summarizing architecture search decisions and strategies, applying quantitative and qualitative metrics for prototyping and comparison. This... Read more

Heuristic Architecture Search Using Network Morphism for Chest X-Ray Classification

by Pavlo Radiuk

2021

Key finding: Presented a novel heuristic architecture search using enforced hill-climbing and network morphism to efficiently explore architectures. The method found high-performing architectures within 28 GPU hours on medical image... Read more

Use of genetic algorithms with backpropagation in training of feedforward neural networks

by Michael McInerney

2024, IEEE International Conference on Neural Networks

Key finding: Developed hybrid training algorithms combining genetic algorithms (GA) and backpropagation (BP) that leverage GA’s global search to escape local minima and BP’s efficiency in fine-tuning. The GA-BP hybrids achieved faster... Read more

Modular search space for automated design of neural architecture

by Pavlo Radiuk

2021, Proceedings of the O.S. Popov ONAT

Key finding: Proposed a modularized neural architecture search space composed of parameterized building blocks derived from NAS-Bench-201 benchmark, represented as multisectoral networks described unambiguously by vectors. Applied to a... Read more

Design of ANN Based Non-Linear Network Using Interconnection of Parallel Processor

by Nitish Pathak

2023, Computer Systems Science and Engineering

Key finding: Explored an ANN design leveraging massive parallelism with many interconnected processing elements distributed over parallel processors, achieving effective optimization for nonlinear resource allocation problems.... Read more

3. How do architectural elements and training hyperparameters influence neural network learning dynamics and generalization?

This theme examines the role of architectural design choices, such as the number of layers, neurons, and activation functions, as well as learning hyperparameters like learning rate and regularization, on convergence, error minimization, and avoidance of local minima. Understanding these influences is vital to achieve stable and efficient learning with good generalization while preventing issues like overfitting or chaotic training behavior.

Design and regularization of neural networks: the optimal use of a validation set

by C. Svarer

2024, Neural Networks for Signal Processing VI. Proceedings of the 1996 IEEE Signal Processing Society Workshop

Key finding: Derived novel gradient-based algorithms for estimating regularization parameters and optimizing neural net architectures using a validation set. Proposed iterative schemes jointly optimizing weights and hyperparameters that... Read more

Multilayer neural networks – as determined systems

by Ivan Kuno

2023, Computational Problems of Electrical Engineering

Key finding: Analyzed the effect of learning rate (η) on multilayer neural network training, observing bifurcation and chaotic behavior when η exceeds a critical threshold (~0.62 for a 3-layer network with 4 neurons per layer). Found that... Read more
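Why a learning rate can have a critical threshold at all is easiest to see on a toy problem. The sketch below runs gradient descent on a one-parameter quadratic loss L(w) = curvature · w² / 2, where each update multiplies w by (1 − η · curvature), so iterates shrink only while η < 2 / curvature. This is an illustrative assumption, not the paper's multilayer network: a linear toy like this only shows the onset of divergence, not the bifurcation or chaotic behavior reported, and the value ~0.62 is specific to the paper's setup.

```python
def run_gradient_descent(learning_rate, curvature=1.0, w0=1.0, steps=50):
    """Gradient descent on L(w) = curvature * w**2 / 2 (a toy loss)."""
    w = w0
    for _ in range(steps):
        gradient = curvature * w          # dL/dw
        w = w - learning_rate * gradient  # w <- (1 - lr * curvature) * w
    return w


# For curvature = 1 the stability threshold is lr = 2.
stable = run_gradient_descent(learning_rate=0.5)    # below the threshold
unstable = run_gradient_descent(learning_rate=2.5)  # above the threshold
```

Below the threshold the iterate contracts toward the minimum at 0; above it the iterate oscillates in sign with growing magnitude.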

Use of genetic algorithms with backpropagation in training of feedforward neural networks

by Michael McInerney

2024, IEEE International Conference on Neural Networks

Key finding: Identified limitations of backpropagation training related to sensitivity to learning rate and momentum and susceptibility to local minima. Showed that integrating GA with BP alleviates these issues by global exploration with... Read more

All papers in Neural Network Architecture

Deep neural networks optimization for resource-constrained environments: techniques and models

by Raafi Careem

2025, Indonesian Journal of Electrical Engineering and Computer Science

This paper aims to present a comprehensive review of advanced techniques and models, with a specific focus on deep neural networks (DNNs) for resource-constrained environments (RCE). The paper contributes by highlighting the RCE devices,... more

Radar Tracking System Using Contextual Information on a Neural Network Architecture in Air Combat Maneuvering

by José Castillo

2025, International Journal of Distributed Sensor Networks

Air surveillance radar tracking systems present a variety of known problems related to uncertainty and lack of accuracy in the radar measurements used as their source. In this work, we feature the theoretical aspects of a... more

Applying Singular Value Decomposition (SVD) to CNNs: A Path Toward Lightweight and Efficient Architecture

by Dr. Khawla Hussein

2025, Journal of Information Systems Engineering and Management

This paper aims to investigate whether applying the Singular Value Decomposition (SVD) technique can reduce the workload of Convolutional Neural Networks (CNNs) without compromising image classification results. Usual methods for reducing... more
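The workload reduction from SVD comes down to parameter-count arithmetic. As a rough sketch (the function names are hypothetical, and how the paper reshapes convolution kernels into matrices is not stated here), a rows × cols weight matrix truncated to rank r can be stored as two factors with r · (rows + cols) entries in total:

```python
def dense_params(rows, cols):
    """Entries in the original weight matrix W."""
    return rows * cols


def low_rank_params(rows, cols, rank):
    """Entries in the factors of W ≈ A @ B, with A: rows x rank, B: rank x cols."""
    return rank * (rows + cols)


def break_even_rank(rows, cols):
    """Largest rank for which the factored form is strictly smaller than W."""
    return (rows * cols - 1) // (rows + cols)


ratio = low_rank_params(512, 512, 64) / dense_params(512, 512)
```

For a 512 × 512 layer truncated to rank 64, the factored form keeps a quarter of the parameters. Whether accuracy survives at a given rank is an empirical question that this arithmetic does not answer.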

BEYOND SCALE: RE-EVALUATING THE FOUNDATIONS OF ARTIFICIAL INTELLIGENCE

by Vlad Arbatov

2025

The scaling hypothesis, the long-standing paradigm that posited predictable performance gains in artificial intelligence from increasing model size, data, and compute, is encountering significant and compounding limitations. Empirical... more

Surrogate-Assisted Evolutionary Deep Learning Using an End-to-End Random Forest-Based Performance Predictor

by Gary Yen

2025, IEEE Transactions on Evolutionary Computation

Convolutional neural networks (CNNs) have shown remarkable performance in various real-world applications. Unfortunately, the promising performance of CNNs can be achieved only when their architectures are optimally constructed. The... more

Completely Automated CNN Architecture Design Based on Blocks

by Gary Yen

2025, IEEE Transactions on Neural Networks and Learning Systems

The performance of Convolutional Neural Networks (CNNs) highly relies on their architectures. In order to design a CNN with promising performance, extensive expertise in both CNNs and the investigated problem domain is required, which is... more

Dropout is a special case of the stochastic delta rule: faster and more accurate deep learning

by Stephen Hanson

2025, arXiv (Cornell University)

Channel-Prioritized Convolutional Neural Networks for Sparsity and Multi-fidelity

by Chin-Laung Lei

2025

We propose a novel training procedure for convolutional neural networks (CNNs) that allows dynamic trade-offs between different resource and performance requirements. Our approach prioritizes the channels to enable structured sparsity and... more

NetBooster: Empowering Tiny Deep Learning By Standing on the Shoulders of Deep Giants

by Zhongzhi Yu

2025, arXiv (Cornell University)

Tiny deep learning has attracted increasing attention driven by the substantial demand for deploying deep learning on numerous intelligent Internet-of-Things devices. However, it is still challenging to unleash tiny deep learning's full... more

Simulation of Single and Multilayer of Artificial Neural Network using Verilog

by Rajesh Vansdadiya

2024, International Journal For Scientific Research and Development

Artificial neural networks play an important role in VLSI circuits for finding and diagnosing multiple faults in digital circuits. In this paper, examples of single-layer and multilayer neural networks are discussed, followed by an implementation of those... more

R&D on Bulk-Synchronous Parallel (BSP) Computing for General-Purpose Processors (GPP) and BSP Computing Systems: A Paradigmatic Approach

by Dimitrios Sargiotis

2024

This study explores the integration of neural network emulators into Computer-Aided Design (CAD) environments, aiming to enhance neural architecture modeling. The objective is to merge the computational power of neural networks with the... more

DETECTION OF DIABETIC RETINOPATHY USING OPTIMAL MOBILENETV2 MODEL

by Preeti Payal

2024, Industrial Engineering Journal

Diabetic retinopathy (DR) is a major cause of vision loss globally, making early detection vital for effective intervention. However, manual screening is prone to errors and inefficiency. Automated solutions using deep learning models... more

The Φπε Framework: A Unified Approach to Quantum Coherence, Consciousness, and Universal Harmony

by Andrew Kadziolka

2024, academia.edu

This research paper presents an advanced exploration of the Φπε framework, an integrative model that unifies quantum mechanics, consciousness studies, and metaphysical principles under the auspices of harmony (Φ), quantum entanglement... more

L0-ARM: Network Sparsification via Stochastic Binary Optimization

by paper bot

2024

We consider network sparsification as an L0-norm regularized binary optimization problem, where each unit of a neural network (e.g., weight, neuron, or channel, etc.) is attached with a stochastic binary gate, whose parameters are jointly... more
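The gating idea in this abstract can be sketched in a few lines. Assuming each gate is an independent Bernoulli variable with its own keep probability (the names below are hypothetical), the expected L0 norm of the gated network is simply the sum of those probabilities, which is what makes the penalty expressible in terms of the gate parameters. The gradient estimator used to train the gates is not shown here.

```python
import random


def sample_gates(keep_probs, rng):
    """Draw one binary gate per unit: 1 keeps the unit, 0 removes it."""
    return [1 if rng.random() < p else 0 for p in keep_probs]


def apply_gates(weights, gates):
    """Zero out every weight whose gate is closed."""
    return [w * g for w, g in zip(weights, gates)]


def expected_l0(keep_probs):
    """Expected number of active units under independent Bernoulli gates."""
    return sum(keep_probs)


rng = random.Random(0)          # fixed seed so the sketch is repeatable
weights = [0.4, -1.2, 0.7]      # illustrative values only
keep_probs = [0.9, 0.5, 0.1]    # one gate parameter per weight
gates = sample_gates(keep_probs, rng)
sparse_weights = apply_gates(weights, gates)
```

Driving a keep probability toward 0 removes the corresponding unit almost surely, while driving it toward 1 keeps it, so learning the probabilities jointly with the weights amounts to learning the sparsity pattern.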

GNN-RL Compression: Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning

by Sixing Yu

2024, arXiv (Cornell University)

Model compression is an essential technique for deploying deep neural networks (DNNs) on power and memory-constrained resources. However, existing model-compression methods often rely on human expertise and focus on parameters' local... more

Deep Neural Network Compression With Single and Multiple Level Quantization

by Aojun Zhou

2024, Proceedings of the AAAI Conference on Artificial Intelligence

Network quantization is an effective solution to compress deep neural networks for practical usage. Existing network quantization methods cannot sufficiently exploit the depth information to generate low-bit compressed network. In this... more

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

by Vithursan Thangarasa

2024, arXiv (Cornell University)

Recent research has focused on weight sparsity in neural network training to reduce FLOPs, aiming for improved efficiency (test accuracy w.r.t training FLOPs). However, sparse weight training often sacrifices accuracy, requiring extended... more

Authors: Vithursan Thangarasa, Shreyas Saxena, Abhay Gupta, Sean Lie.

Figure 1: Top-1 accuracy vs. training FLOPs for variants of ResNet on ImageNet. Sparse-IFT provides significant accuracy gains across different models and sparsity levels while using the same FLOP budget as its dense counterpart.

Figure 2: Different members of the Sparse-IFT family, each parameterized by a single hyperparameter (i.e., the sparsity level s). Black and white squares denote non-active and active weights, respectively. The green block indicates a non-linear activation function (e.g., ReLU). Derived with sparsity set at 50% as an example, all transformations are Iso-FLOP to the dense feedforward function, making them suitable drop-in replacements for it. Details about each member are in Section 2.3.
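The Iso-FLOP property can be checked with simple arithmetic. The sketch below assumes one particular way of spending the saved FLOPs: widening both the input and output dimension of a layer by the same factor k while keeping only a fraction (1 − s) of the weights. The FLOP count then scales by k² · (1 − s), which equals 1 when k = 1 / sqrt(1 − s). Whether this matches the paper's Sparse Wide construction in every detail is an assumption, and the function names are hypothetical.

```python
import math


def widening_factor(sparsity):
    """Factor k such that a k-times wider layer at this sparsity is Iso-FLOP."""
    return 1.0 / math.sqrt(1.0 - sparsity)


def dense_flops(d_in, d_out):
    """Multiply-accumulates of a dense d_in x d_out layer per input vector."""
    return d_in * d_out


def sparse_wide_flops(d_in, d_out, sparsity):
    """Multiply-accumulates after widening both dimensions and sparsifying."""
    k = widening_factor(sparsity)
    return (k * d_in) * (k * d_out) * (1.0 - sparsity)


k_half = widening_factor(0.5)    # about 1.41x wider at 50% sparsity
k_75 = widening_factor(0.75)     # 2x wider at 75% sparsity
```

At 50% sparsity, the example setting in the Figure 2 caption, the layer can be about 1.41 times wider in each dimension at the same theoretical FLOP cost as the dense layer.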

Figure 3: The relationship between the structure and weights of Sparse-IFT ResNet-18 networks is analyzed from a graph perspective in terms of performance. The Pareto curvature heatmap visually represents the classification performance, with varying color gradients symbolizing the spectrum from low to high test accuracy on CIFAR-100.

Figure 4: Ablation studies with Sparse-IFT on the ResNet-18 model for CIFAR-100 across sparsity ∈ {50%, 75%, 90%}. (left) Sparse Wide IFT trained with dynamic unstructured and structured sparsity. (middle) Sparse-IFT family members trained with RigL, where Sparse Wide performs the best. (right) Sparse Wide IFT trained in a sparse and dense manner.

Figure 5: Benchmarking unstructured sparsity during (left) inference on Neural Magic's DeepSparse runtime and (right) training acceleration on the Cerebras CS-2. In both setups, we measure the relative increase in latency or training speed for Sparse-IFT variants against the dense model.

Figure 6: The dynamic interplay between (top row) the Iterative Mean Difference Bound and test accuracy, as well as the correlation between the Ramanujan Gap (Δr) and test accuracy throughout the training process. This illustrates the evolving relationship between spectral graph properties and network performance, shedding light on the connectivity dynamics of the Sparse-IFT networks trained with DST.

Figure 7: Measured speedup versus theoretical speedup at varying sparsity levels for a GPT-3 layer 12k x 12k matrix multiplication (MatMul) (Lie, 2021).

Table 1: Cardinality of search space for sparsity masks of different members of the Sparse-IFT family.

Table 2: Sparse Wide IFT with ResNet-18 trained using various sparse training methods on CIFAR-100 across different sparsity levels (columns). Best accuracy for each sparse training method is highlighted in bold.

Sparse Doped maintains a constant search space by allocating FLOPs between a low-rank and an unstructured sparse weight matrix. Therefore, dynamic sparse training (DST) becomes crucial for effectively traversing this larger sparse parameter subspace, as discussed in Section 3.

Table 3: Sparse Wide IFT with various efficient architectures on CIFAR-100 across different levels of sparsity (columns).

Table 4: Sparse-IFT on ImageNet. Best result for each transformation and architecture is highlighted in bold.

Table 5: Sparse Wide IFT variants of ResNet-18 as backbones for: (a) object detection on MS COCO, (b) semantic segmentation on Cityscapes.

Table 7: Evaluation of the importance of utilizing the non-linear activation across different members of Sparse-IFT with ResNet-18 on CIFAR-100 across different values of sparsity (columns). Non-linear activations enhance the representational capacity of Sparse-IFT, leading to higher accuracy. All reported results are the average over 3 random seeds.

Table 8: Results with ResNet-18 on CIFAR-100 across different values of sparsity (columns). Best accuracy for each sparse training method is highlighted in bold. The original dense ResNet-18 model obtains an accuracy of 77.0 ± 0.2. All reported results are over 3 random seeds.

Table 9: Evaluation of Sparse Wide and Sparse Parallel IFT with various compute-efficient architectures on CIFAR-100 across different values of sparsity (columns). Using Sparse Parallel IFT, all architectures outperform the dense baseline by a significant margin.

Table 10: Comparison of structured sparse and unstructured sparse methods on CIFAR-100 test accuracy on ResNet-18.

We also set up and experimented using the method proposed by Jiang et al. (2022) to train with fine-grained sparse block structures dynamically. However, the algorithm uses agglomerative clustering, which led to a much slower runtime and quickly ran out of memory even at 50% sparsity using the Sparse Wide IFT on a single Nvidia V100 (16 GB).

Table 11: Object detection results on COCO minival in the RetinaNet framework. Sparse Wide IFT configurations of RetinaNet outperform the dense baseline by a large margin on all metrics while using similar FLOPs.

Table 12: Semantic segmentation results on the Cityscapes val set using DeepLabV3+. Sparse Wide IFT configurations of ResNet-18 backbones outperform the dense baseline on all metrics while using similar FLOPs.

Table 13: Size, architecture, and learning hyperparameters (batch size and learning rate) of the GPT-3 Small model, which is trained using Chinchilla optimal configurations (approximately 20 tokens per parameter).

Table 14: Sizes and architecture definitions of the dense GPT-3 Small model and its Sparse Wide IFT variants.

Table 15: Performance evaluation of dense and Sparse Wide IFT GPT-3 Small models at 50% and 75% sparsity levels across five tasks (i.e., ARC, HellaSwag, TruthfulQA, MMLU, and Winogrande) on the Open LLM Leaderboard.

Neuroplasticity-Based Pruning Method for Deep Convolutional Neural Networks

by Nancy Guadalupe Arana Daniel

2024, Applied Sciences

In this paper, a new pruning strategy based on the neuroplasticity of biological neural networks is presented. The novel pruning algorithm proposed is inspired by the knowledge remapping ability after injuries in the cerebral cortex.... more

Noisy Heuristics NAS: A Network Morphism based Neural Architecture Search using Heuristics

by Suman Sapkota

2024, arXiv (Cornell University)

Network Morphism based Neural Architecture Search (NAS) is one of the most efficient methods; however, knowing where and when to add new neurons or remove dysfunctional ones is generally left to black-box Reinforcement Learning models.... more

A Comprehensive Survey on Hardware-Aware Neural Architecture Search

by Naigang Wang

2024, ArXiv

Neural Architecture Search (NAS) methods have been growing in popularity. These techniques have been fundamental to automate and speed up the time-consuming and error-prone process of synthesizing novel Deep Learning (DL) architectures.... more

Fig. 1. Generic CNN architecture. For each layer, an operator is chosen from a pre-defined list (convolution, dilated convolution, depthwise convolution, max pooling, batch normalization, ...).

For instance, certain problems require task-specific models, e.g. EfficientNet [11] for image classification and ResNeSt [12] for semantic segmentation, instance segmentation and object detection. These networks differ in the configuration of their architectures and their hyperparameters. The hyperparameters here refer to the pre-defined properties related to the architecture or the training algorithm.

Fig. 2. Accuracy of various CNN models on ImageNet for the image classification task, plotted against the number of parameters. Inspired by [14].

Fig. 3. Overview of conventional NAS components.

Fig. 6. Overview of efficient deep learning techniques.

Fig. 7. The agent-environment interaction in a Markov Decision Process. Source: [38].

As illustrated in Fig. 7, at each step the agent observes the state of the environment and receives a reward for its previous action. It then selects its next action. The reward guides the agent to improve its policy such that better actions are chosen in the future. The policy of an agent is the algorithm that allows it to choose between multiple actions.

Fig. 8. Overview of the evolutionary algorithm steps.

A genetic algorithm is a type of evolutionary algorithm that encodes the individuals into numerical vectors, called chromosomes. The chromosome is represented as a set of parameters that defines a particular individual. A selection criterion is used to select a set of candidate individuals, of which the fittest are mutated and recombined by crossover to create the next generation. This algorithm is the simplest version of the evolutionary methods, as the chromosomes are numerical vectors and the mutations can be simple permutations.
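The genetic-algorithm loop described above can be sketched as follows. This is a minimal illustration, not the survey's method: the fitness function, the target vector, and all names are hypothetical, and in a real NAS setting the fitness would come from training or estimating the accuracy of the encoded architecture.

```python
import random

TARGET = [4, 8, 16, 32]  # hypothetical "ideal" chromosome for the toy objective


def fitness(chromosome):
    """Toy objective: higher is better, 0 means the chromosome equals TARGET."""
    return -sum((gene - goal) ** 2 for gene, goal in zip(chromosome, TARGET))


def crossover(parent_a, parent_b, rng):
    """One-point crossover: prefix from one parent, suffix from the other."""
    point = rng.randrange(1, len(parent_a))
    return parent_a[:point] + parent_b[point:]


def mutate(chromosome, rng):
    """Permutation-style mutation: swap the genes at two positions."""
    child = list(chromosome)
    i, j = rng.sample(range(len(child)), 2)
    child[i], child[j] = child[j], child[i]
    return child


def evolve(population, generations, n_parents, rng):
    """Keep the fittest individuals and refill the population with children."""
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        parents = ranked[:n_parents]  # selection; parents survive unchanged
        children = []
        while len(parents) + len(children) < len(population):
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, rng), rng))
        population = parents + children
    return max(population, key=fitness)


rng = random.Random(0)
initial = [[rng.randint(1, 32) for _ in range(4)] for _ in range(12)]
best = evolve(initial, generations=20, n_parents=4, rng=rng)
```

Because the selected parents are carried over unchanged, the best fitness in the population never decreases from one generation to the next. That is a design choice of this sketch rather than a requirement of genetic algorithms in general.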

Fig. 9. Overview of different hardware-aware NAS designs.

Fig. 10. Architecture search spaces types. (a) Global search space, (b) Cell-based search space, and (c) Hierarchical search space. In orange the operators considered during the search.

Fig. 11. Statistics on targeted platforms

Fig. 12. Statistics about the type of networks described by the HW-NAS search spaces

Table: Comparison of hardware cost measurement methods. The hardware cost is measured on a Tesla K80 GPU.

Fig. 14. Comparison of hardware cost measurement methods. LUT stands for Look Up Table. The speedups are calculated w.r.t. the real-time measurements. The exact statistics are displayed in Table V.

Fig. 16. Results of different search algorithms on NAS-Bench-201.

... search without any further training achieves decent results within 17 seconds of search. By dividing the benchmarks into N mini-batches, they increase their training efficacy. The higher this number (N) is, the higher the probability of over-fitting on the benchmark. Therefore, using small datasets with complex search algorithms does not yield any good results in terms of accuracy of the final architecture or efficiency of the search.

Table: Summary of hardware cost estimation methods.

Table: A detailed overview of the 15 most popular HW-NAS works. HWT: hardware transferability; P: proxy datasets; THPO: training hyperparameter optimization; OS: open source. For a complete list of all HW-NAS works, please visit the following link: HTTPS://TINYURL.COM/Y6458SKT

Neural Architecture Transfer

by Wolfgang Banzhaf

2024, IEEE Transactions on Pattern Analysis and Machine Intelligence

Neural architecture search (NAS) has emerged as a promising avenue for automatically designing task-specific neural networks. Most existing NAS approaches require one complete search for each deployment specification of hardware or... more

EDGE AI: QUANTIZATION AS THE KEY TO ON-DEVICE SMARTNESS

by IAEME Publication

2024, IAEME PUBLICATION

In recent developments, the significance of Edge AI has come to the forefront. Edge devices, which encompass a wide array of IoT devices and embedded systems, benefit from the deployment of efficient and compact neural network models.... more

Blockout: Dynamic Model Selection for Hierarchical Deep Networks

by Howard Zhou

2024, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most deep architectures for image classification, even those that are trained to classify a large number of diverse categories, learn shared image representations with a single model. Intuitively, however, categories that are more similar... more

Thanks for Nothing: Predicting Zero-Valued Activations with Lightweight Convolutional Neural Networks

by Moran Shkolnik

2024, arXiv (Cornell University)

Convolutional neural networks (CNNs) achieve state-of-the-art results for various tasks at the price of high computational demands. Inspired by the observation that spatial correlation exists in CNN output feature maps (ofms), we... more

Completely Automated CNN Architecture Design Based on Blocks

by Shreya Sinha

2024, IEEE Transactions on Neural Networks and Learning Systems

Motor pattern generation is robust to neural network anatomical imbalance favoring inhibition but not excitation

by Myriam de Graaf

2024

Animals display rich and coordinated motor patterns during walking and running. Previous modelling as well as experimental results suggest that the balance between excitation and inhibition in neural networks may be critical for... more

LSTM Framework for Classification of Radar and Communications Signals

by Jesus Grajal

2024, arXiv (Cornell University)

Although radar and communications signal classification are usually treated separately, they share similar characteristics, and methods applied in one domain can be potentially applied in the other. We propose a simple and unified scheme... more

Design and Analysis of Convolutional Neural Network for RF Signal Modulation Classification for In-Orbit Deployment

by Guru Subramanyam

2024, 2021 IEEE Cognitive Communications for Aerospace Applications Workshop (CCAAW)

Pruning Convolutional Filters Using Batch Bridgeout

by Najeeb Stuman Khan

2024, IEEE Access

State-of-the-art computer vision models are rapidly increasing in capacity, where the number of parameters far exceeds the number required to fit the training set. This results in better optimization and generalization performance.... more

LCS: Learning Compressible Subspaces for Adaptive Network Compression at Inference Time

by Anurag Ranjan

2024, ArXiv

When deploying deep learning models to a device, it is traditionally assumed that available computational resources (compute, memory, and power) remain static. However, real-world computing systems do not always provide stable resource... more

Figure 1: (a) Depiction of our method for learning a linear subspace of networks w* parameterized by @ € [a1, 2]. When compressing with compression function f and compression level , we ob- tain a spectrum of networks which demonstrate an efficiency-accuracy trade-off. (b) Our algorithm.

Figure 2: Analysis of observed batch-wise means 4 and stored BatchNorm means ys during testing for models trained with TopK unstructured sparsity. The models are trained with different target sparsities and evaluated with various inference-time sparsities. (a)-(b): The distribution of | — f2| across all layers. (c)-(d): The average value of |sz — | for individual layers. (e)-(f): The correlation between the average of |4z — | and test set error. Note that in (b) and (d), sparsities of 0 and 0.493 produce near-identical results, thus those curves are overlapping.

Figure 3: Our method for structured sparsity using a linear subspace (LCS+L+IN) and a point sub- space (LCS+P+IN) compared to Universal Slimming (US) (Yu & Huang, 2019), Network Slimming (NS) (Yu et al., 2018), and Learning Efficient Convolutions (LEC) (Liu et al., 2017). LEC does not provide an open-source implementation of their method for ResNet18, so we omit it. We do not allow fine-tuning or recalibration.

Figure 4: Our method for unstructured sparsity using a linear subspace (LCS+L+GN) and a point subspace (LCS+P+GN) compared to networks trained for a particular TopK target. The TopK target refers to the fraction of weights that remain unpruned during training.

Figure 5: Our method for quantization using a linear subspace (LCS+L+GN) and a point subspace (LCS+P+GN) compared to networks trained for a particular bit width target.

Figure 6: Standard evaluation of a linear subspace with network f(w* (a) ,y(a)) (Learned line), and evaluation when evaluating with reversed compression levels, f(w*(a, (1 — «)) (Reversed line).

Figure 7: Analysis of the mean absolute difference between observed batch-wise means $\hat{\mu}$ and stored BatchNorm means $\mu$ during testing for cPreResNet models trained with NS (Yu et al., 2018) or US (Yu & Huang, 2019). (a)-(b): The distribution of $|\mu - \hat{\mu}|$ across all layers. (c)-(d): The average value of $|\mu - \hat{\mu}|$ for each individual BatchNorm layer. (e)-(f): The correlation between the average of $|\mu - \hat{\mu}|$ and test set error.

Figure 8: Analysis of the mean absolute difference between observed batch-wise means $\hat{\mu}$ and stored BatchNorm means $\mu$ during testing for cPreResNet models trained with different quantization bit widths. (a)-(b): The distribution of $|\mu - \hat{\mu}|$ across all layers. (c)-(d): The average value of $|\mu - \hat{\mu}|$ for each individual BatchNorm layer. (e)-(f): The correlation between the average of $|\mu - \hat{\mu}|$ and test set error.

Figure 9: Our method for structured sparsity using a linear subspace (LCS+L+IN) and a point subspace (LCS+P+IN), compared to Universal Slimming (US) (Yu & Huang, 2019), Network Slimming (NS) (Yu et al., 2018), and Learning Efficient Convolutions (LEC) (Liu et al., 2017).

Figure 10: Our method for quantization using a linear subspace (LCS+L+GN) and a point subspace (LCS+P+GN) compared to networks trained for a particular bit width target.

EcoNAS: Finding Proxies for Economical Neural Architecture Search

by Shuai Yi

2024, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Synchronization-Aware NAS for an Efficient Collaborative Inference on Mobile Platforms

by Yung-Kyun Noh

2024, Proceedings of the 24th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems

Previous neural architecture search (NAS) approaches for mobile platforms have achieved great success in designing a slim-but-accurate neural network that is generally well-matched to a single computing unit such as a CPU or GPU. However,... more

Single Circuit in V1 Capable of Switching Contexts during Movement Using an Inhibitory Population as a Switch

by Doris Voina

2024, Neural Computation

A biologically inspired architecture with switching units can learn to generalize across backgrounds

by Doris Voina

2024

Humans and other animals navigate different landscapes and environments with ease, a feat that requires the brain’s ability to rapidly and accurately adapt to different visual domains, generalizing across contexts/backgrounds. Despite... more

Stimulus novelty uncovers coding diversity in visual cortical circuits

by Xiaoxuan Jia

2024

The detection of novel stimuli is critical to learn and survive in a dynamic environment. Though novel stimuli powerfully affect brain activity, their impact on specific cell types and circuits is not well understood. Disinhibition is one... more

Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural Representation

by Stephen Fitzmeyer

2024, arXiv (Cornell University)

Implicit Neural Representation (INR) is an innovative approach for representing complex shapes or objects without explicitly defining their geometry or surface structure. Instead, INR represents objects as continuous functions. Previous... more

Sparse Coding in a Dual Memory System for Lifelong Learning

by Dr Fahad Sarfraz

2024, arXiv (Cornell University)

Efficient continual learning in humans is enabled by a rich set of neurophysiological mechanisms and interactions between multiple memory systems. The brain efficiently encodes information in non-overlapping sparse codes, which... more

Signals Intelligence System with Software-Defined Radio

by Petru Cotfas

2024, Applied Sciences

In this paper, we present the implementation of a system that identifies the modulation of complex radio signals. This is realized using an artificial intelligence model developed, trained, and integrated with Microsoft Azure cloud. We... more

Figure 1. Component interconnections. The implemented system has an architecture based on the symbiosis between the two components: hardware and software. They intertwine to create a homogeneous and efficient system. Figure 1 shows how the system components are interconnected.

How the components of the system work and communicate can be seen in Figure 2. In the upper part of Figure 2, the components that are used to train and validate the artificial intelligence network are presented. The database, after it is prepared, is loaded in the Microsoft Azure Auto ML Service, where the artificial intelligence network is configured. Following the training and validation part, the trained model is realized. In the lower part of the image, the components used to test the trained model are presented. The SDR platform is used to send and receive modulated signals. The modulated samples received are inserted into the integrated trained model (integrated into Microsoft Azure cloud) that generates the answer with the identified modulation type. In this database, we find signals that are modulated using the following 24 modulation schemes: OOK, 4ASK, 8ASK, BPSK, QPSK, 8PSK, 16PSK, 32PSK, 16APSK, 32APSK, 64APSK, 128APSK, 16QAM, 32QAM, 64QAM, 128QAM, 256QAM, AM-SSB-WC, AM-SSB-SC, AM-DSB-WC, AM-DSB-SC, FM, GMSK, and OQPSK.

Figure 4. BPSK modulation/demodulation topology. The first implemented topology can be seen in Figure 4. In this topology, “BPSK” transmission and reception are implemented. The data source is a random one that generates 8-bit integers with values between 0 and 255. The integers are divided into 1-bit symbols and “BPSK” modulated. The modulated symbols are then interpolated to achieve a rate of eight samples per symbol. After sampling, an RRC (root-raised-cosine) filter is applied to prepare the transmission samples. The transmission is carried out with a sampling frequency equal to 1 MHz. The RF frequency on which the data are transmitted is equal to 2.45 GHz. The transmitted signal can be seen in Figure 5. The reception of the signal is performed on the same RF frequency as the transmission and with the same sampling frequency. The received samples are written in a “CSV” file. This file is then used in order to test the trained model.
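The baseband part of the transmit chain described above (random bytes, 1-bit symbols, BPSK mapping, eight samples per symbol, RRC filtering) can be sketched in NumPy as follows. The roll-off factor (0.35), the filter span (8 symbols), and the function names are assumptions for illustration only, since the text does not state them. The RF stage at 2.45 GHz is handled by the SDR hardware and is not modelled here.

```python
import numpy as np

def rrc_taps(beta, sps, span):
    """Root-raised-cosine taps, unit energy; time is measured in symbol periods."""
    n = np.arange(-(span * sps) // 2, (span * sps) // 2 + 1)
    t = n / sps
    h = np.empty(t.shape)
    for i, ti in enumerate(t):
        if np.isclose(ti, 0.0):
            # Limit of the closed form at t = 0
            h[i] = 1.0 - beta + 4.0 * beta / np.pi
        elif np.isclose(abs(ti), 1.0 / (4.0 * beta)):
            # Limit at the singular points t = +/- 1 / (4 * beta)
            h[i] = (beta / np.sqrt(2.0)) * (
                (1.0 + 2.0 / np.pi) * np.sin(np.pi / (4.0 * beta))
                + (1.0 - 2.0 / np.pi) * np.cos(np.pi / (4.0 * beta)))
        else:
            num = (np.sin(np.pi * ti * (1.0 - beta))
                   + 4.0 * beta * ti * np.cos(np.pi * ti * (1.0 + beta)))
            den = np.pi * ti * (1.0 - (4.0 * beta * ti) ** 2)
            h[i] = num / den
    return h / np.sqrt(np.sum(h ** 2))

def bpsk_transmit(num_bytes, sps=8, beta=0.35, span=8, seed=0):
    """Random bytes -> bits -> BPSK symbols -> zero-stuffed upsampling -> RRC filter."""
    rng = np.random.default_rng(seed)
    data = rng.integers(0, 256, size=num_bytes, dtype=np.uint8)  # 8-bit integers 0..255
    bits = np.unpackbits(data)                                   # 1-bit symbols
    symbols = 1.0 - 2.0 * bits.astype(np.float64)                # bit 0 -> +1, bit 1 -> -1
    upsampled = np.zeros(symbols.size * sps)
    upsampled[::sps] = symbols                                   # eight samples per symbol
    samples = np.convolve(upsampled, rrc_taps(beta, sps, span))
    return data, symbols, samples

data, symbols, samples = bpsk_transmit(num_bytes=4)
print(symbols.size, samples.size)
```

On the receive side, filtering with the same taps and sampling once per symbol recovers the symbol signs in this noise-free setting.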

Figure 6. Received BPSK signal test.

Table 1. BladeRF 2.0 micro xA4 [18]. For the artificial neural network implementation, several solutions were analyzed. The analyzed solutions were selected in order to allow the implementation without being an expert in this field. The first two analyzed platforms were DLHUB [21] and ANNHUB [22]. Both of them are developed by the same company, ANSCenter (Sydney, Australia) [23]. The two platforms offer the user the opportunity to create training models in a simple and efficient way without the need for advanced knowledge in the field of artificial intelligence. The resulting model can be exported to multiple programming languages (LabVIEW, C++, Arduino, Python), where it can be integrated to form an entire system. Following the integrations and tests performed with these two platforms, the following disadvantages were found:

Table 2. Obtained dataset. The extracted data is written in a “CSV” file to ensure compatibility with the “Azure AutoML” service that is used to train and integrate the intelligent network. It must be clearly defined which data constitutes the inputs from which the characteristics necessary for training will be extracted and which data represent the outputs according to which the classification of the obtained results will be carried out. In order to extract the samples from the dataset, a shell script was used. The script has put the data in a separate folder for each modulation and inside the modulation folder in separate folders for test and training. The extracted complex data is written to a “CSV” (comma-separated values) file, which is the final database that is used for training, validation, and testing. In Table 2, the internal structure and content of the database can be seen.

Table 3. Accuracy of models. The most important characteristic of the trained model is the confusion matrix. It can be seen in Table 4. As can be seen in the confusion matrix, the best results are recorded for the modulations: FM, BPSK, GMSK, OOK, and 4ASK. The worst results are recorded for the modulations: 8PSK, 16APSK, 32APSK, OPSK, and 8ASK.
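Reading best and worst classes off a confusion matrix, as done above, amounts to comparing the diagonal against the row totals. The sketch below assumes rows are true classes and columns are predicted classes. The three-class matrix is made up purely for illustration and does not reproduce Table 4.

```python
import numpy as np

def per_class_accuracy(confusion):
    """Fraction of each true class (row) that was predicted correctly (diagonal)."""
    confusion = np.asarray(confusion, dtype=float)
    row_totals = confusion.sum(axis=1)
    correct = np.diag(confusion)
    # Classes with no samples get an accuracy of 0 instead of a division error.
    return np.divide(correct, row_totals,
                     out=np.zeros_like(row_totals), where=row_totals > 0)

labels = ["BPSK", "QPSK", "8PSK"]   # illustrative subset of modulation names
confusion = [[9, 1, 0],             # made-up counts, not results from the paper
             [2, 6, 2],
             [1, 4, 5]]
scores = per_class_accuracy(confusion)
for name, score in sorted(zip(labels, scores), key=lambda pair: -pair[1]):
    print(name, score)
```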

Table 6. Cloud-based neural networks vs. local neural networks.

Table 7. Comparison of cloud-based neural networks and local neural networks results. The main further research direction consists of improving the neural network model by using a database with more modulation types that have a wider range of noise and interference. Another future research direction consists of the integration of the trained model into a system that will detect modulation type in real-time and will be able to demodulate it directly without human intervention.

Increased region of surround stimulation enhances contextual feedback and feedforward processing in human V1

by Isa Rao

2024, bioRxiv (Cold Spring Harbor Laboratory)

The majority of synaptic inputs to the primary visual cortex (V1) are non-feedforward, instead originating from local and anatomical feedback connections. Animal electrophysiology experiments show that feedback signals originating from... more

COMMUNICATION SIGNALS MODULATIONS CLASSIFICATION BASED ON NEURAL NETWORK ALGORITHMS

by YAHYA BENREMDANE

2024, Computer Science & Information Technology (CS & IT)

This paper aims to find an automatic solution for the modulation’s classification of different types of radio signals by relying on Artificial Intelligence. This project is part of a long process of Communications Intelligence looking for... more

Evaluating Robustness to Noise and Compression of Deep Neural Networks for Keyword Spotting

by Miguel Arjona Ramírez

2023, IEEE Access

Keyword Spotting (KWS) has been the subject of research in recent years given the increase of embedded systems for command recognition such as Alexa, Google Home, and Siri. Performance, model size, processing time, and robustness to noise are fundamental in these systems. Furthermore, applications in embedded systems demand computationally efficient models that can be implemented in current technology. In this work, an approach for keyword recognition is evaluated using three deep learning models, namely LeNet-5, SqueezeNet, and EfficientNet-B0. We evaluate transfer learning, pruning and quantization strategies in training and test using noisy and clean speech signals. In addition, compression techniques such as pruning and quantization were assessed in terms of the size reduction of the model footprint and the accuracy obtained in each case. Using Google's Speech Commands dataset and additive babble noise signal, our keyword recognition approach achieves an accuracy of 94.6% using an unstructured pruning of 80% of the parameters of the original SqueezeNet network with a reduction of 70% in the model size.

INDEX TERMS Speech recognition, machine learning algorithms, speech analysis, spectral analysis, pruning, quantization, keyword spotting.

I. INTRODUCTION

Voice commands are becoming a natural way to interact with consumer electronic devices [1], [2]. Systems with speech command recognition such as Amazon's Alexa, Apple's Siri, and Google's Assistant are examples of this popularity. These smart devices often use some embedded system (e.g., microcontrollers [3], microprocessors, field-programmable gate arrays, or dedicated devices [4]) with limited resources, making the implementation of speech recognition algorithms dependent on hardware limitations [5]. Typically, microcontrollers, the cheapest approach, have small memory capacity (i.e., a few kilobytes) and require energy-saving strategies since, in most cases, these edge devices are always active and they are generally powered by batteries. Additionally, they must have low latency and

The associate editor coordinating the review of this manuscript and approving it for publication was Mounim A. El Yacoubi.
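To make the footprint side of quantization concrete for the keyword-spotting abstract above, here is a minimal sketch of symmetric 8-bit weight quantization in NumPy. The per-tensor scale and the function names are assumptions for illustration. The paper's exact quantization scheme is not given in this excerpt, and the sketch does not attempt to reproduce its reported numbers.

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-tensor quantization of float weights to int8."""
    scale = float(np.max(np.abs(w))) / 127.0
    if scale == 0.0:
        scale = 1.0  # all-zero tensor: any scale works
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Map int8 codes back to approximate float32 weights."""
    return q.astype(np.float32) * np.float32(scale)

rng = np.random.default_rng(0)
w = rng.standard_normal(1000).astype(np.float32)
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print(w.nbytes, q.nbytes)           # float32 storage vs. int8 storage
print(np.max(np.abs(w - w_hat)))    # worst-case rounding error
```

Each weight drops from four bytes to one, and the rounding error per weight is bounded by half the scale.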

Leveraging spiking deep neural networks to understand the neural mechanisms underlying selective attention

by H. Steven Scholte

2023

Spatial attention enhances sensory processing of goal-relevant information and improves perceptual sensitivity. Yet, the specific neural mechanisms underlying the effects of spatial attention on performance are still contested. Here, we... more

Cyclic orthogonal convolutions for long-range integration of features

by Jezabel Rodriguez Garcia

2023, ArXiv

In Convolutional Neural Networks (CNNs) information flows across a small neighbourhood of each pixel of an image, preventing long-range integration of features before reaching deep layers in the network. We propose a novel architecture... more

Are Straight-Through gradients and Soft-Thresholding all you need for Sparse Training?

by Antoine Vanderschueren

2023, arXiv (Cornell University)

Turning the weights to zero when training a neural network helps in reducing the computational complexity at inference. To progressively increase the sparsity ratio in the network without causing sharp weight discontinuities during... more
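The two ingredients named in the title can be sketched generically in NumPy. Soft-thresholding shrinks weights toward zero so that they cross zero continuously as the threshold grows. A straight-through update applies the gradient computed at the sparse weights directly to the dense weights. The function names and the fixed thresholds are assumptions for illustration. The abstract is truncated here, so this is not a reconstruction of the authors' training procedure.

```python
import numpy as np

def soft_threshold(w, t):
    """Shrink every weight toward zero by t; weights with |w| <= t become zero."""
    return np.sign(w) * np.maximum(np.abs(w) - t, 0.0)

def straight_through_step(w_dense, grad_wrt_sparse, lr):
    """Apply the gradient taken at the sparse weights unchanged to the dense weights."""
    return w_dense - lr * grad_wrt_sparse

rng = np.random.default_rng(0)
w_dense = rng.standard_normal(1000)
for t in (0.0, 0.5, 1.0):
    w_sparse = soft_threshold(w_dense, t)
    print(t, np.mean(w_sparse == 0.0))  # sparsity ratio grows with the threshold
```

Because the dense weights keep receiving updates, a weight that was zeroed at one threshold can still become active again later.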

Are Straight-Through gradients and Soft-Thresholding all you need for Sparse Training?

by Antoine Vanderschueren

2023, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Adaptation of MobileNetV2 for Face Detection on Ultra-Low Power Platform

by L. Andrea Dunbar

2023, 2022 9th Swiss Conference on Data Science (SDS)

Designing Deep Neural Networks (DNNs) running on edge hardware remains a challenge. Standard designs have been adopted by the community to facilitate the deployment of Neural Network models. However, not much emphasis is put on adapting... more

As large as it gets: Learning infinitely large Filters via Neural Implicit Functions in the Fourier Domain

by Margret Keuper

2023, arXiv (Cornell University)

Motivated by the recent trend towards the usage of larger receptive fields for more context-aware neural networks in vision applications, we aim to investigate how large these receptive fields really need to be. To facilitate such study,... more
