Neural Information Processing: 30th International Conference, ICONIP 2023, Changsha, China, November 20–23, 2023, Proceedings, Part VIII (Communications in Computer and Information Science, 1962) 981998131X, 9789819981311

The nine-volume set constitutes the refereed proceedings of the 30th International Conference on Neural Information Processing, ICONIP 2023, held in Changsha, China, during November 20–23, 2023.


Language: English | Pages: 639 | Year: 2023


Table of contents :
Preface
Organization
Contents – Part VIII
Theory and Algorithms
Multi-model Smart Contract Vulnerability Detection Based on BiGRU
1 Introduction
2 Related Work
3 S-BiGRU Smart Contracts Detection Method
3.1 S-BiGRU Multi-model Smart Contracts Vulnerability Detection Framework
3.2 Model Training
4 Experiment and Analysis
4.1 Evaluation Indicators
4.2 Smart Contract Dataset
4.3 Model Parameter Selection
4.4 Experimental Design and Results
5 Conclusion
References
Time-Warp-Invariant Processing with Multi-spike Learning
1 Introduction
2 Method
2.1 Neuron Model
2.2 Time-Warp Invariance
2.3 Time-Warp-Invariant Multi-spike Learning Rule
3 Application to Speech Recognition
3.1 Dataset
3.2 Result
4 Conclusion
References
ECOST: Enhanced CoST Framework for Fast and Accurate Time Series Forecasting
1 Introduction
2 Proposed Method
2.1 Timing Decomposition
2.2 Feature Extraction
2.3 Contrast Learning
2.4 Prediction
3 Experiments
3.1 Datasets
3.2 Evaluation Setup
3.3 Implementation Details
4 Results and Analyses
5 Conclusion
References
Adaptive CNN-Based Image Compression Model for Improved Remote Desktop Experience
1 Introduction
2 Related Works
3 Algorithm for Remote Desktop Image and Compression Quality Evaluation
4 Convolutional Neural Network-Based Image Compression Codec
4.1 Overview
4.2 Attention Mechanism
4.3 Loss Function
5 Experiments and Analysis
5.1 Data Set and Parameters
5.2 Experimental Results
6 Conclusion
References
LCformer: Linear Convolutional Decomposed Transformer for Long-Term Series Forecasting
1 Introduction
2 Related Works
3 Approach
3.1 LCformer
3.2 Local Convolution Module
3.3 Linear Self Attention Mechanism
3.4 Sequence Decomposition Structure
3.5 Encoder
3.6 Decoder
4 Experiments
4.1 Data Set and Evaluation Metrics
4.2 Implementation Settings
4.3 Performance Comparison
4.4 Case Study
5 Model Analysis
5.1 Ablation Studies
5.2 Time Complexity Analysis
6 Conclusion
References
Fashion Trend Forecasting Based on Multivariate Attention Fusion
1 Introduction
2 Related Work
2.1 Fashion Trend Analysis
2.2 Fashion Trend Forecasting Model
3 Methodology
3.1 Multivariate Sequence Pre-processing
3.2 MAFT Model
4 Experiments
4.1 Datasets and Setup
4.2 Analysis of Fashion Trends
4.3 Discussion of Results
4.4 Ablation Studies
5 Conclusions
References
On Searching for Minimal Integer Representation of Undirected Graphs
1 Introduction
2 Graphs with Minimal Integers
2.1 Enumerative Representation of Graphs
2.2 Minimal Integer Representation
3 Computational Experiments
3.1 Results and Discussion
4 Conclusion
References
Multi-scale Multi-step Dependency Graph Neural Network for Multivariate Time-Series Forecasting
1 Introduction
2 Preliminary
2.1 Definition of Graph Neural Network
2.2 Definition of the Problem
3 Methodology
3.1 Overview
3.2 Temporal Convolution Module
3.3 Graph Structure Learning
3.4 Graph Convolution Module
3.5 Skip Connection Layer and Output Module
4 Experiments
4.1 Experimental Setup
4.2 Main Results
4.3 Ablation Study
4.4 Visual Experiment
5 Conclusions
References
Q-Learning Based Adaptive Scheduling Method for Hospital Outpatient Clinics
1 Introduction
2 System Model
2.1 M/G/K Queuing Model
2.2 Work Flow
2.3 Q-Learning Dynamic Scheduling Model
3 Experimental Analysis
3.1 Waiting Time
3.2 Adaptive Scheduling
4 Conclusion
References
Temporal Task Graph Based Dynamic Agent Allocation for Applying Behavior Trees in Multi-agent Games
1 Introduction
2 Background
2.1 Behavior Trees
2.2 Dynamic Graph Embedding
3 Method
3.1 Modelling Dynamic Subtask with Temporal Graph Network
3.2 Agent-Allocation Mechanism
3.3 The Overall Training
4 Experiments
4.1 Performance Analysis
4.2 Ablation Studies
5 Conclusion
References
A Distributed kWTA for Decentralized Auctions
1 Introduction
1.1 Centralized kWTA Network
1.2 Distributed kWTA Network
1.3 Decentralized Auction
1.4 Organization of the Paper
2 Our Distributed kWTA
2.1 Properties
2.2 Illustration
3 Decentralized Auction Mechanisms
3.1 Second Price Auction
3.2 First Price Auction
4 Conclusions
References
Leveraging Hierarchical Similarities for Contrastive Clustering
1 Introduction
2 Related Work
2.1 Deep Clustering
2.2 Hierarchical Clustering
3 Contrastive Learning Using Hierarchical Data Similarities for Deep Clustering
3.1 Construction of the Positive and Negative Sample Pairs
3.2 Learning Clustering-Oriented Latent Space
4 Experimental Result
4.1 Datasets
4.2 Implementation Details
4.3 Performance Comparison
4.4 Ablation Studies
4.5 Qualitative Study
5 Conclusion
References
Reinforcement Learning-Based Consensus Reaching in Large-Scale Social Networks
1 Introduction
2 Model Formulation
2.1 Model Settings
2.2 Reward Settings
3 Algorithm
4 Experiment
4.1 Ablation Study
4.2 Comparative Analyses
5 Conclusions
References
Lateral Interactions Spiking Actor Network for Reinforcement Learning
1 Introduction
2 Method
2.1 Population Encoder
2.2 Lateral Interactions Spiking Neuron Network
2.3 Surrogate Gradient
2.4 Population Decoder
3 Experiment
3.1 Benchmarking LISAN Against Mainstream Models
3.2 Discussion of Different Input Codings for LISAN
4 Conclusion
References
An Improved YOLOv5s for Detecting Glass Tube Defects
1 Introduction
2 Analysis YOLOv5s for Glass Tube Defect Detection
3 Improved YOLOv5s
3.1 CBAM
3.2 CARAFE
3.3 Loss Function and Learning Rate
4 Experimental Results and Analysis
4.1 Visual Results of YOLOv5s and Our Improved Model
4.2 Ablation Experiments
4.3 Comparative Experiments
5 Conclusion
References
Label Selection Approach to Learning from Crowds
1 Introduction
2 Related Work
2.1 Learning from Crowdsourced Annotations
2.2 Selective Prediction
3 Label Selection Layer
3.1 Problem Settings
3.2 Label Selection Layer
4 Experiments
4.1 Image Classification
4.2 Rating Regression
4.3 Named Entity Recognition
5 Analysis
6 Conclusion
References
New Stability Criteria for Markov Jump Systems Under DoS Attacks and Packet Loss via Dynamic Event-Triggered Control
1 Introduction
2 Problem Formulation
2.1 System Description
2.2 DoS Attacks
2.3 Dynamic Event-Triggered Mechanism
2.4 Switched Markov Jump System Model
3 Main Results
4 Numerical Example
5 Conclusion
References
Image Blending Algorithm with Automatic Mask Generation
1 Introduction
2 Related Work
2.1 Image Blending
2.2 Image Segmentation and Detection
3 Proposed Approach
3.1 Mask Generation
3.2 Seamless Blending
3.3 Style Refinement
4 Experiments
4.1 Experiment Setup
4.2 Result Comparison
4.3 Ablation Study
5 Conclusion and Future Work
6 Authorship Statement
References
Design of Memristor-Based Binarized Multi-layer Neural Network with High Robustness
1 Introduction
2 Memristor-Based Layers
2.1 Memristor Model
2.2 Memristor-Based Binarized FC Layer
2.3 Memristor-Based Batch Normalization Layer
2.4 Binary Activation Function
3 Simulation Experiments
3.1 Pattern Classification Result
3.2 Impact of Noises
4 Conclusions
References
FEDEL: Frequency Enhanced Decomposition and Expansion Learning for Long-Term Time Series Forecasting
1 Introduction
2 Related Work
3 Definition of LTSF
4 FEDEL Module
4.1 Time Feature Embedding Module
4.2 Frequency Enhanced Module
4.3 Dependency Capture Module
5 Experiment
5.1 Implementation Details
5.2 Evaluation Metrics
5.3 Main Results
5.4 Results Analysis
6 Conclusion
References
On the Use of Persistent Homology to Control the Generalization Capacity of a Neural Network
1 Introduction
2 Motivation
3 Related Works
4 Proposed Approach
4.1 Fundamental Background
4.2 Pruning Based on the Relevance of NN's Units
4.3 Topological Functional Graph and Model Selection
5 Experimental Settings and Results
5.1 Datasets
5.2 Results and Discussion
6 Conclusion and Perspectives
References
Genetic Programming Symbolic Regression with Simplification-Pruning Operator for Solving Differential Equations
1 Introduction
2 Preliminaries
2.1 Genetic Programming for Symbolic Regression (GPSR)
2.2 Physics-Regularized Fitness Function
3 Proposed Algorithm
3.1 Overall Framework
3.2 Simplification-Pruning Operator
3.3 Hybrid Optimization Strategy
3.4 Fitness Evaluating Function
4 Experimental Design and Result Analysis
4.1 Detailed Problem Description
4.2 Performance Comparison
4.3 Sampling Number and Performance
5 Conclusion
References
Explainable Offensive Language Classifier
1 Introduction
2 Related Work
3 Proposed Method
4 Experiments
5 Conclusion
References
Infrared Target Recognition Technology Based on Few Shot Learning
1 Introduction
2 Infrared Image Generation Based on Generative Adversarial Network Model
2.1 Structure Design of Generative Adversarial Network Model
2.2 Generator Design of Cycle Generative Adversarial Network Model
2.3 Design of Discriminator for Cycle Generative Adversarial Network Model
2.4 Loss Function Design of Cycle Generative Adversarial Network Model
3 Design of Target Recognition Model Based on Lightweight Network
4 Simulation Test and Result Analysis
4.1 Infrared Image Enhancement
4.2 Analysis of the Impact of Infrared Augmented Image on Recognition Accuracy
4.3 Infrared Target Recognition Test
5 Conclusion
References
An Approach to Mongolian Neural Machine Translation Based on RWKV Language Model and Contrastive Learning
1 Introduction
2 Related Work
2.1 Data Augmentation
2.2 Transformer Improve
3 Method
3.1 Contrastive Learning Data Augmentation
3.2 RWKV Language Model
4 Experiment
4.1 Experiment Setup
4.2 Model Selection
4.3 Results
5 Conclusion
References
Event-Triggered Constrained H∞ Control Using Concurrent Learning and ADP
1 Introduction
2 Problem Description
3 Event-Triggered Constrained H∞ Control
3.1 Event-Triggering Mechanism of the H∞ Control
3.2 Implementation Using Concurrent Learning and ADP
3.3 Stability Analysis
4 Simulation Results
5 Conclusions
References
Three-Dimensional Rotation Knowledge Representation Learning Based on Graph Context
1 Introduction
2 Related Work
3 Methodology
3.1 Relational 3D Rotation Modeling Based on Quaternion Multiplication
3.2 Fuses Graph Context Information
3.3 Model Training
4 Experiment and Result Analysis
4.1 Data Set
4.2 Description of Evaluation Methods and Experimental Settings
4.3 Experimental Results and Analysis
5 Conclusion
References
TextBFA: Arbitrary Shape Text Detection with Bidirectional Feature Aggregation
1 Introduction
2 Related Work
3 Methodology
3.1 Overview
3.2 Bidirectional Feature Aggregator
3.3 Bilateral Decoder with Lateral Connection (BDLC)
3.4 Training
4 Experiments
4.1 Datasets
4.2 Implementation Details
4.3 Ablation Study
4.4 Comparisons with State-of-the-Art Methods
5 Conclusion
References
Bias Reduced Methods to Q-learning
1 Introduction
2 Background
2.1 Maximum Estimator Bias in Maximum Estimator
2.2 Cross-Validation Estimator
3 Bias Reduced Methods to Q-learning
3.1 Underestimation Bias of Our Estimators
3.2 Our Proposed Methods to Q-learning
4 Experiment
5 Conclusion and Future Work
References
A Neural Network Architecture for Accurate 4D Vehicle Pose Estimation from Monocular Images with Uncertainty Assessment
1 Introduction
2 Related Work
2.1 Vehicle Pose Estimation
2.2 Keypoints and Uncertainty
3 Structure of the Pose Estimation System
3.1 Estimation of 2D Keypoints
3.2 Estimation of 3D Keypoints
3.3 Selection of the Best Estimated Keypoints
3.4 Training Process
3.5 Pose Estimation
4 Dataset Preparation for Supervised Learning
5 Uncertainty Estimation
5.1 Estimation of Keypoints Uncertainty
5.2 Propagation Using Unscented Transform
6 Evaluation Results
6.1 Keypoints 2D
6.2 Keypoints 3D
6.3 A3DP Metrics
6.4 Uncertainty Assessment
7 Conclusions
References
PSO-Enabled Federated Learning for Detecting Ships in Supply Chain Management
1 Introduction
1.1 Key Contributions
2 Related Work
3 Proposed Work
3.1 Dataset Collection and Preparation
3.2 Dataset Pre-processing
3.3 Dataset Partitioning
3.4 CNN
3.5 Methodology
4 Results and Analysis
4.1 Performance of the Proposed Approach
4.2 Evaluation Metrics
4.3 Performance Comparison with Other Models
5 Conclusion
References
Federated Learning Using the Particle Swarm Optimization Model for the Early Detection of COVID-19
1 Introduction
1.1 Key Contributions
2 Related Work
3 Proposed Work
3.1 Dataset Collection and Preparation
3.2 Dataset Pre-processing
4 Results and Analysis
4.1 Performance of the Proposed Approach
4.2 Evaluation Metrics
4.3 Performance Comparison with Other Models
5 Conclusion
References
Domain Generalization via Implicit Domain Augmentation
1 Introduction
2 Related Work
3 Methodology
3.1 Preliminaries
3.2 Style Augment
3.3 Augmented Features
3.4 Implicit Domain Augmentation
4 Experiments
4.1 Generalization for Category Classification
4.2 Generalization for Instance Retrieval
4.3 Further Analysis
5 Conclusion
References
Unconstrained Feature Model and Its General Geometric Patterns in Federated Learning: Local Subspace Minority Collapse
1 Introduction
2 Preliminary and Related Work
2.1 Normalized Gram Matrix and Generalized Coherence Estimate
3 Reformulated FL: Unconstrained Feature Model
3.1 Motivation of Reformulation
3.2 Categories of FL
3.3 Deep Learning Formulation
3.4 FL Modeling with Client-Class Sampling
3.5 Observations and Insights
3.6 Related Theorems
4 Experiments
4.1 Where's Client-Class Sampling Information Found in FL?
4.2 Local Subspace Minority Collapse: The Geometric Pattern in FL
4.3 Settings
5 Analyses
5.1 Local Subspace Minority Collapse
5.2 The Lost Sampling Information
5.3 The Frame Drift
6 Inspiration and Discussions
7 Conclusion
References
Mutually Guided Dendritic Neural Models
1 Introduction
2 Methodology
2.1 Dendritic Neural Model
2.2 Mutually Guided Dendritic Neural Models
3 Experiments Setup
3.1 Datasets
3.2 Evaluation Metrics
3.3 Experimental Design
4 Results
4.1 Iris Dataset
4.2 Breast Cancer Dataset
4.3 Glass Dataset
5 Conclusion
References
Social-CVAE: Pedestrian Trajectory Prediction Using Conditional Variational Auto-Encoder
1 Introduction
2 Related Work
2.1 Research on Social Interaction Model
2.2 Research on Multimodal Pedestrian Trajectory Prediction
3 Method
3.1 Data Preparation
3.2 Overall Model Architecture
3.3 Inference Model
3.4 Generative Model
3.5 Observation Encoding
3.6 Latent Variables Estimation
3.7 Trajectory Decoding
3.8 Training
4 Experimental Results and Analysis
4.1 Experimental Setup
4.2 Evaluation Indicator
4.3 Results and Analyses
5 Conclusion
References
Distributed Training of Deep Neural Networks: Convergence and Case Study
1 Introduction
1.1 Objective
1.2 Methodology
2 Related Work
3 Preliminaries: Successive Approximations. Synchronous and Asynchronous Algorithms
4 Convergence Study
4.1 Mathematical Modeling
5 Empirical Validation
5.1 The Links Between the Mathematical Model and the Implementation
5.2 Use Case: Independent Subnet Training
5.3 Experiments
6 Discussion and Perspectives
7 Conclusion
References
Chinese Medical Intent Recognition Based on Multi-feature Fusion
1 Introduction
2 Related Work
2.1 D-CNN
2.2 Bi-GRU
2.3 Attention Mechanism
3 Our Method
3.1 The Text Encoding Layer
3.2 The Feature Extraction Layer
3.3 The Feature Fusion Layer
3.4 The Intent Recognition Layer
4 Experiments
4.1 Datasets
4.2 Experimental Settings
4.3 Baselines
4.4 Results
4.5 Ablation Experiments
4.6 Selection of Parameters
5 Conclusion
References
Multi-mobile Object Motion Coordination with Reinforcement Learning
1 Introduction
2 Related Work
2.1 Traditional Solution Methods for Motion Coordination
2.2 Double Deep Q-Network
2.3 Reinforcement Learning Methods for Solving Motion Coordination Problems
2.4 Path Chessboard Graph
3 State with Time and Reward Model Algorithms for Motion Coordination
3.1 Centralized System
3.2 State Setting
3.3 Action Setting
3.4 State with Time Model
3.5 Dynamically Updated Reward Model
4 Experiments and Result Analysis
4.1 Experiment Platform
4.2 Evaluation Metrics
4.3 Experiment Results and Analysis
4.4 Validation of the Effectiveness of the State with Time Model
5 Conclusion
References
Comparative Analysis of the Linear Regions in ReLU and LeakyReLU Networks
1 Introduction
2 Related Work
3 Linear Regions of Networks
4 "Dying ReLU" Based on Piecewise Linear Functions
5 Experiments
6 Conclusion
References
Heterogeneous Graph Prototypical Networks for Few-Shot Node Classification
1 Introduction
2 Related Work
2.1 Heterogeneous Graph Node Classification
2.2 Few-Shot Learning
3 Preliminaries
4 Methodology
4.1 Graph Structural Module
4.2 Meta-Learning Module
4.3 Complexity Analysis
5 Experiments
5.1 Datasets
5.2 Baselines
5.3 Few-Shot Node Classification
5.4 Robustness Analysis
5.5 Comparison of Variant Models
6 Conclusion
References
Improving Deep Learning Powered Auction Design
1 Introduction
2 Related Works
3 Auction Design Through Deep Learning
4 Methods
4.1 Discrete Distribution Sampling
4.2 Item-Wise Payment Network
5 Experiments
5.1 ALGNet
5.2 RegretFormer
6 Conclusion
References
Link Prediction with Simple Path-Aware Graph Neural Networks
1 Introduction
2 Related Work
3 Preliminaries
4 Methods
4.1 GAEs Are Not Good Enough: Awareness of Paths
4.2 What Existing GNNs Have Overlooked: Awareness of Simple Paths
4.3 Designing Efficient GNNs for Link Prediction
5 Experiments
6 Conclusion
References
Restore Translation Using Equivariant Neural Networks
1 Introduction
2 Related Works
3 Equivariant Neural Networks
3.1 Equivariance in Tensor Space
3.2 Equivariant Operators
3.3 Equivariant Neural Networks
4 Translation Restorer
4.1 Method
4.2 Existence of the Restorer
5 Experiments
6 Conclusion
A Proof of Theorem 1
B Proof of Lemma 1
C Proof of Lemma 2
D Experimental Settings
References
SDPSAT: Syntactic Dependency Parsing Structure-Guided Semi-Autoregressive Machine Translation
1 Introduction
2 Related Works
3 Methodology
3.1 Layer-and-Length-Based Syntactic Labeling
3.2 Two-Stage Decoding
3.3 Mixed Training Strategy
4 Experiments
4.1 Setup
4.2 Inference and Evaluation
4.3 Result
5 Analysis and Discussion
6 Conclusion
References
Author Index

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li (Eds.)

Communications in Computer and Information Science

1962

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part VIII


Communications in Computer and Information Science

Editorial Board Members
Joaquim Filipe, Polytechnic Institute of Setúbal, Setúbal, Portugal
Ashish Ghosh, Indian Statistical Institute, Kolkata, India
Raquel Oliveira Prates, Federal University of Minas Gerais (UFMG), Belo Horizonte, Brazil
Lizhu Zhou, Tsinghua University, Beijing, China

1962

Rationale
The CCIS series is devoted to the publication of proceedings of computer science conferences. Its aim is to efficiently disseminate original research results in informatics in printed and electronic form. While the focus is on publication of peer-reviewed full papers presenting mature work, inclusion of reviewed short papers reporting on work in progress is welcome, too. Besides globally relevant meetings with internationally representative program committees guaranteeing a strict peer-reviewing and paper selection process, conferences run by societies or of high regional or national relevance are also considered for publication.

Topics
The topical scope of CCIS spans the entire spectrum of informatics ranging from foundational topics in the theory of computing to information and communications science and technology and a broad variety of interdisciplinary application fields.

Information for Volume Editors and Authors
Publication in CCIS is free of charge. No royalties are paid, however, we offer registered conference participants temporary free access to the online version of the conference proceedings on SpringerLink (http://link.springer.com) by means of an http referrer from the conference website and/or a number of complimentary printed copies, as specified in the official acceptance email of the event. CCIS proceedings can be published in time for distribution at conferences or as postproceedings, and delivered in the form of printed books and/or electronically as USBs and/or e-content licenses for accessing proceedings at SpringerLink. Furthermore, CCIS proceedings are included in the CCIS electronic book series hosted in the SpringerLink digital library at http://link.springer.com/bookseries/7899. Conferences publishing in CCIS are allowed to use Online Conference Service (OCS) for managing the whole proceedings lifecycle (from submission and reviewing to preparing for publication) free of charge.

Publication process
The language of publication is exclusively English. Authors publishing in CCIS have to sign the Springer CCIS copyright transfer form, however, they are free to use their material published in CCIS for substantially changed, more elaborate subsequent publications elsewhere. For the preparation of the camera-ready papers/files, authors have to strictly adhere to the Springer CCIS Authors' Instructions and are strongly encouraged to use the CCIS LaTeX style files or templates.

Abstracting/Indexing
CCIS is abstracted/indexed in DBLP, Google Scholar, EI-Compendex, Mathematical Reviews, SCImago, Scopus. CCIS volumes are also submitted for the inclusion in ISI Proceedings.

How to start
To start the evaluation of your proposal for inclusion in the CCIS series, please send an e-mail to [email protected].

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li Editors

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part VIII

Editors Biao Luo School of Automation Central South University Changsha, China

Long Cheng Institute of Automation Chinese Academy of Sciences Beijing, China

Zheng-Guang Wu Institute of Cyber-Systems and Control Zhejiang University Hangzhou, China

Hongyi Li School of Automation Guangdong University of Technology Guangzhou, China

Chaojie Li School of Electrical Engineering and Telecommunications UNSW Sydney Sydney, NSW, Australia

ISSN 1865-0929 ISSN 1865-0937 (electronic) Communications in Computer and Information Science ISBN 978-981-99-8131-1 ISBN 978-981-99-8132-8 (eBook) https://doi.org/10.1007/978-981-99-8132-8 © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd. The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore Paper in this product is recyclable.

Preface

Welcome to the 30th International Conference on Neural Information Processing (ICONIP 2023) of the Asia-Pacific Neural Network Society (APNNS), held in Changsha, China, November 20–23, 2023.

The mission of the Asia-Pacific Neural Network Society is to promote active interactions among researchers, scientists, and industry professionals who are working in neural networks and related fields in the Asia-Pacific region. APNNS has Governing Board Members from 13 countries/regions – Australia, China, Hong Kong, India, Japan, Malaysia, New Zealand, Singapore, South Korea, Qatar, Taiwan, Thailand, and Turkey. The society's flagship annual conference is the International Conference of Neural Information Processing (ICONIP).

The ICONIP conference aims to provide a leading international forum for researchers, scientists, and industry professionals who are working in neuroscience, neural networks, deep learning, and related fields to share their new ideas, progress, and achievements.

ICONIP 2023 received 1274 papers, of which 394 papers were accepted for publication in Communications in Computer and Information Science (CCIS), representing an acceptance rate of 30.93% and reflecting the increasingly high quality of research in neural networks and related areas. The conference focused on four main areas, i.e., "Theory and Algorithms", "Cognitive Neurosciences", "Human-Centered Computing", and "Applications". All the submissions were rigorously reviewed by the conference Program Committee (PC), comprising 258 PC members, who ensured that every paper had at least two high-quality single-blind reviews. In fact, 5270 reviews were provided by 2145 reviewers. On average, each paper received 4.14 reviews.

We would like to take this opportunity to thank all the authors for submitting their papers to our conference, and our great appreciation goes to the Program Committee members and the reviewers who devoted their time and effort to our rigorous peer-review process; their insightful reviews and timely feedback ensured the high quality of the papers accepted for publication. We hope you enjoyed the research program at the conference.

October 2023

Biao Luo Long Cheng Zheng-Guang Wu Hongyi Li Chaojie Li

Organization

Honorary Chair
Weihua Gui - Central South University, China

Advisory Chairs
Jonathan Chan - King Mongkut's University of Technology Thonburi, Thailand
Zeng-Guang Hou - Chinese Academy of Sciences, China
Nikola Kasabov - Auckland University of Technology, New Zealand
Derong Liu - Southern University of Science and Technology, China
Seiichi Ozawa - Kobe University, Japan
Kevin Wong - Murdoch University, Australia

General Chairs
Tingwen Huang - Texas A&M University at Qatar, Qatar
Chunhua Yang - Central South University, China

Program Chairs
Biao Luo - Central South University, China
Long Cheng - Chinese Academy of Sciences, China
Zheng-Guang Wu - Zhejiang University, China
Hongyi Li - Guangdong University of Technology, China
Chaojie Li - University of New South Wales, Australia

Technical Chairs
Xing He - Southwest University, China
Keke Huang - Central South University, China
Huaqing Li - Southwest University, China
Qi Zhou - Guangdong University of Technology, China


Local Arrangement Chairs
Wenfeng Hu - Central South University, China
Bei Sun - Central South University, China

Finance Chairs
Fanbiao Li - Central South University, China
Hayaru Shouno - University of Electro-Communications, Japan
Xiaojun Zhou - Central South University, China

Special Session Chairs
Hongjing Liang - University of Electronic Science and Technology, China
Paul S. Pang - Federation University, Australia
Qiankun Song - Chongqing Jiaotong University, China
Lin Xiao - Hunan Normal University, China

Tutorial Chairs
Min Liu - Hunan University, China
M. Tanveer - Indian Institute of Technology Indore, India
Guanghui Wen - Southeast University, China

Publicity Chairs
Sabri Arik - Istanbul University-Cerrahpaşa, Turkey
Sung-Bae Cho - Yonsei University, South Korea
Maryam Doborjeh - Auckland University of Technology, New Zealand
El-Sayed M. El-Alfy - King Fahd University of Petroleum and Minerals, Saudi Arabia
Ashish Ghosh - Indian Statistical Institute, India
Chuandong Li - Southwest University, China
Weng Kin Lai - Tunku Abdul Rahman University of Management & Technology, Malaysia
Chu Kiong Loo - University of Malaya, Malaysia
Qinmin Yang - Zhejiang University, China
Zhigang Zeng - Huazhong University of Science and Technology, China


Publication Chairs
Zhiwen Chen - Central South University, China
Andrew Chi-Sing Leung - City University of Hong Kong, China
Xin Wang - Southwest University, China
Xiaofeng Yuan - Central South University, China

Secretaries
Yun Feng - Hunan University, China
Bingchuan Wang - Central South University, China

Webmasters
Tianmeng Hu - Central South University, China
Xianzhe Liu - Xiangtan University, China

Program Committee Rohit Agarwal Hasin Ahmed Harith Al-Sahaf Brad Alexander Mashaan Alshammari Sabri Arik Ravneet Singh Arora Zeyar Aung Monowar Bhuyan Jingguo Bi Xu Bin Marcin Blachnik Paul Black Anoop C. S. Ning Cai Siripinyo Chantamunee Hangjun Che

UiT The Arctic University of Norway, Norway Gauhati University, India Victoria University of Wellington, New Zealand University of Adelaide, Australia Independent Researcher, Saudi Arabia Istanbul University, Turkey Block Inc., USA Khalifa University of Science and Technology, UAE Umeå University, Sweden Beijing University of Posts and Telecommunications, China Northwestern Polytechnical University, China Silesian University of Technology, Poland Federation University, Australia Govt. Engineering College, India Beijing University of Posts and Telecommunications, China Walailak University, Thailand City University of Hong Kong, China


Wei-Wei Che Huabin Chen Jinpeng Chen Ke-Jia Chen Lv Chen Qiuyuan Chen Wei-Neng Chen Yufei Chen Long Cheng Yongli Cheng Sung-Bae Cho Ruikai Cui Jianhua Dai Tao Dai Yuxin Ding Bo Dong Shanling Dong Sidong Feng Yuming Feng Yun Feng Junjie Fu Yanggeng Fu Ninnart Fuengfusin Thippa Reddy Gadekallu Ruobin Gao Tom Gedeon Kam Meng Goh Zbigniew Gomolka Shengrong Gong Xiaodong Gu Zhihao Gu Changlu Guo Weixin Han Xing He Akira Hirose Yin Hongwei Md Zakir Hossain Zengguang Hou

Qingdao University, China Nanchang University, China Beijing University of Posts & Telecommunications, China Nanjing University of Posts and Telecommunications, China Shandong Normal University, China Tencent Technology, China South China University of Technology, China Tongji University, China Institute of Automation, China Fuzhou University, China Yonsei University, South Korea Australian National University, Australia Hunan Normal University, China Tsinghua University, China Harbin Institute of Technology, China Xi’an Jiaotong University, China Zhejiang University, China Monash University, Australia Chongqing Three Gorges University, China Hunan University, China Southeast University, China Fuzhou University, China Kyushu Institute of Technology, Japan VIT University, India Nanyang Technological University, Singapore Curtin University, Australia Tunku Abdul Rahman University of Management and Technology, Malaysia University of Rzeszow, Poland Changshu Institute of Technology, China Fudan University, China Shanghai Jiao Tong University, China Budapest University of Technology and Economics, Hungary Northwestern Polytechnical University, China Southwest University, China University of Tokyo, Japan Huzhou Normal University, China Curtin University, Australia Chinese Academy of Sciences, China


Lu Hu Zeke Zexi Hu He Huang Junjian Huang Kaizhu Huang David Iclanzan Radu Tudor Ionescu Asim Iqbal Syed Islam Kazunori Iwata Junkai Ji Yi Ji Canghong Jin Xiaoyang Kang Mutsumi Kimura Masahiro Kohjima Damian Kordos Marek Kraft Lov Kumar Weng Kin Lai Xinyi Le Bin Li Hongfei Li Houcheng Li Huaqing Li Jianfeng Li Jun Li Kan Li Peifeng Li Wenye Li Xiangyu Li Yantao Li Yaoman Li Yinlin Li Yuan Li Yun Li Zhidong Li Zhixin Li Zhongyi Li

Jiangsu University, China University of Sydney, Australia Soochow University, China Chongqing University of Education, China Duke Kunshan University, China Sapientia University, Romania University of Bucharest, Romania Cornell University, USA Edith Cowan University, Australia Hiroshima City University, Japan Shenzhen University, China Soochow University, China Zhejiang University, China Fudan University, China Ryukoku University, Japan NTT, Japan Rzeszow University of Technology, Poland Pozna´n University of Technology, Poland NIT Kurukshetra, India Tunku Abdul Rahman University of Management & Technology, Malaysia Shanghai Jiao Tong University, China University of Science and Technology of China, China Xinjiang University, China Chinese Academy of Sciences, China Southwest University, China Southwest University, China Nanjing Normal University, China Beijing Institute of Technology, China Soochow University, China Chinese University of Hong Kong, China Beijing Jiaotong University, China Chongqing University, China Chinese University of Hong Kong, China Chinese Academy of Sciences, China Academy of Military Science, China Nanjing University of Posts and Telecommunications, China University of Technology Sydney, Australia Guangxi Normal University, China Beihang University, China


Ziqiang Li Xianghong Lin Yang Lin Huawen Liu Jian-Wei Liu Jun Liu Junxiu Liu Tommy Liu Wen Liu Yan Liu Yang Liu Yaozhong Liu Yong Liu Yubao Liu Yunlong Liu Zhe Liu Zhen Liu Zhi-Yong Liu Ma Lizhuang Chu-Kiong Loo Vasco Lopes Hongtao Lu Wenpeng Lu Biao Luo Ye Luo Jiancheng Lv Yuezu Lv Huifang Ma Jinwen Ma Jyoti Maggu Adnan Mahmood Mufti Mahmud Krishanu Maity Srimanta Mandal Wang Manning Piotr Milczarski Malek Mouhoub Nankun Mu Wenlong Ni Anupiya Nugaliyadde

University of Tokyo, Japan Northwest Normal University, China University of Sydney, Australia Zhejiang Normal University, China China University of Petroleum, China Chengdu University of Information Technology, China Guangxi Normal University, China Australian National University, Australia Chinese University of Hong Kong, China Taikang Insurance Group, China Guangdong University of Technology, China Australian National University, Australia Heilongjiang University, China Sun Yat-sen University, China Xiamen University, China Jiangsu University, China Chinese Academy of Sciences, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China University of Malaya, Malaysia Universidade da Beira Interior, Portugal Shanghai Jiao Tong University, China Qilu University of Technology, China Central South University, China Tongji University, China Sichuan University, China Beijing Institute of Technology, China Northwest Normal University, China Peking University, China Thapar Institute of Engineering and Technology Patiala, India Macquarie University, Australia University of Padova, Italy Indian Institute of Technology Patna, India DA-IICT, India Fudan University, China Lodz University of Technology, Poland University of Regina, Canada Chongqing University, China Jiangxi Normal University, China Murdoch University, Australia


Toshiaki Omori Babatunde Onasanya Manisha Padala Sarbani Palit Paul Pang Rasmita Panigrahi Kitsuchart Pasupa Dipanjyoti Paul Hu Peng Kebin Peng Dawid Połap Zhong Qian Sitian Qin Toshimichi Saito Fumiaki Saitoh Naoyuki Sato Chandni Saxena Jiaxing Shang Lin Shang Jie Shao Yin Sheng Liu Sheng-Lan Hayaru Shouno Gautam Srivastava Jianbo Su Jianhua Su Xiangdong Su Daiki Suehiro Basem Suleiman Ning Sun Shiliang Sun Chunyu Tan Gouhei Tanaka Maolin Tang Shu Tian Shikui Tu Nancy Victor Petra Vidnerová


Kobe University, Japan University of Ibadan, Nigeria Indian Institute of Science, India Indian Statistical Institute, India Federation University, Australia Giet University, India King Mongkut’s Institute of Technology Ladkrabang, Thailand Ohio State University, USA Jiujiang University, China University of Texas at San Antonio, USA Silesian University of Technology, Poland Soochow University, China Harbin Institute of Technology at Weihai, China Hosei University, Japan Chiba Institute of Technology, Japan Future University Hakodate, Japan Chinese University of Hong Kong, China Chongqing University, China Nanjing University, China University of Science and Technology of China, China Huazhong University of Science and Technology, China Dalian University of Technology, China University of Electro-Communications, Japan Brandon University, Canada Shanghai Jiao Tong University, China Institute of Automation, China Inner Mongolia University, China Kyushu University, Japan University of New South Wales, Australia Shandong Normal University, China East China Normal University, China Anhui University, China University of Tokyo, Japan Queensland University of Technology, Australia University of Science and Technology Beijing, China Shanghai Jiao Tong University, China Vellore Institute of Technology, India Institute of Computer Science, Czech Republic


Shanchuan Wan Tao Wan Ying Wan Bangjun Wang Hao Wang Huamin Wang Hui Wang Huiwei Wang Jianzong Wang Lei Wang Lin Wang Shi Lin Wang Wei Wang Weiqun Wang Xiaoyu Wang Xin Wang Xin Wang Yan Wang Yan Wang Yonghua Wang Yongyu Wang Zhenhua Wang Zi-Peng Wang Hongxi Wei Guanghui Wen Guoguang Wen Ka-Chun Wong Anna Wróblewska Fengge Wu Ji Wu Wei Wu Yue Wu Likun Xia Lin Xiao Qiang Xiao Hao Xiong Dongpo Xu Hua Xu Jianhua Xu

University of Tokyo, Japan Beihang University, China Southeast University, China Soochow University, China Shanghai University, China Southwest University, China Nanchang Institute of Technology, China Southwest University, China Ping An Technology, China National University of Defense Technology, China University of Jinan, China Shanghai Jiao Tong University, China Shenzhen MSU-BIT University, China Chinese Academy of Sciences, China Tokyo Institute of Technology, Japan Southwest University, China Southwest University, China Chinese Academy of Sciences, China Sichuan University, China Guangdong University of Technology, China JD Logistics, China Northwest A&F University, China Beijing University of Technology, China Inner Mongolia University, China Southeast University, China Beijing Jiaotong University, China City University of Hong Kong, China Warsaw University of Technology, Poland Institute of Software, Chinese Academy of Sciences, China Tsinghua University, China Inner Mongolia University, China Shanghai Jiao Tong University, China Capital Normal University, China Hunan Normal University, China Huazhong University of Science and Technology, China Macquarie University, Australia Northeast Normal University, China Tsinghua University, China Nanjing Normal University, China


Xinyue Xu Yong Xu Ngo Xuan Bach Hao Xue Yang Xujun Haitian Yang Jie Yang Minghao Yang Peipei Yang Zhiyuan Yang Wangshu Yao Ming Yin Qiang Yu Wenxin Yu Yun-Hao Yuan Xiaodong Yue Paweł Zawistowski Hui Zeng Wang Zengyunwang Daren Zha Zhi-Hui Zhan Baojie Zhang Canlong Zhang Guixuan Zhang Jianming Zhang Li Zhang Wei Zhang Wenbing Zhang Xiang Zhang Xiaofang Zhang Xiaowang Zhang Xinglong Zhang Dongdong Zhao Xiang Zhao Xu Zhao


Hong Kong University of Science and Technology, China Beijing Institute of Technology, China Posts and Telecommunications Institute of Technology, Vietnam University of New South Wales, Australia Chongqing Jiaotong University, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China Chinese Academy of Sciences, China Chinese Academy of Science, China City University of Hong Kong, China Soochow University, China Guangdong University of Technology, China Tianjin University, China Southwest University of Science and Technology, China Yangzhou University, China Shanghai University, China Warsaw University of Technology, Poland Southwest University of Science and Technology, China Hunan First Normal University, China Institute of Information Engineering, China South China University of Technology, China Chongqing Three Gorges University, China Guangxi Normal University, China Chinese Academy of Science, China Changsha University of Science and Technology, China Soochow University, China Southwest University, China Yangzhou University, China National University of Defense Technology, China Soochow University, China Tianjin University, China National University of Defense Technology, China Wuhan University of Technology, China National University of Defense Technology, China Shanghai Jiao Tong University, China


Liping Zheng Yan Zheng Baojiang Zhong Guoqiang Zhong Jialing Zhou Wenan Zhou Xiao-Hu Zhou Xinyu Zhou Quanxin Zhu Yuanheng Zhu Xiaotian Zhuang Dongsheng Zou

Hefei University of Technology, China Kyushu University, Japan Soochow University, China Ocean University of China, China Nanjing University of Science and Technology, China PCN&CAD Center, China Institute of Automation, China Jiangxi Normal University, China Nanjing Normal University, China Chinese Academy of Sciences, China JD Logistics, China Chongqing University, China

Contents – Part VIII

Theory and Algorithms

Multi-model Smart Contract Vulnerability Detection Based on BiGRU . . . 3
Shuxiao Song, Xiao Yu, Yuexuan Ma, Jiale Li, and Jie Yu

Time-Warp-Invariant Processing with Multi-spike Learning . . . 15
Xiaohan Zhou, Yuzhe Liu, Wei Sun, and Qiang Yu

ECOST: Enhanced CoST Framework for Fast and Accurate Time Series Forecasting . . . 26
Yao Wang, Chuang Gao, and Haifeng Yu

Adaptive CNN-Based Image Compression Model for Improved Remote Desktop Experience . . . 37
Hejun Wang, Kai Deng, Yubing Duan, Mingyong Yin, Yulong Wang, and Fanzhi Meng

LCformer: Linear Convolutional Decomposed Transformer for Long-Term Series Forecasting . . . 53
Jiaji Qin, Chao Gao, and Dingkun Wang

Fashion Trend Forecasting Based on Multivariate Attention Fusion . . . 68
Jia Chen, Yi Zhao, Saishang Zhong, and Xinrong Hu

On Searching for Minimal Integer Representation of Undirected Graphs . . . 82
Victor Parque and Tomoyuki Miyashita

Multi-scale Multi-step Dependency Graph Neural Network for Multivariate Time-Series Forecasting . . . 95
Wenchang Zhang, Kaiqiang Zhang, Linling Jiang, and Fan Zhang

Q-Learning Based Adaptive Scheduling Method for Hospital Outpatient Clinics . . . 112
Wenlong Ni, Lingyue Lai, Xuan Zhao, and Jue Wang

Temporal Task Graph Based Dynamic Agent Allocation for Applying Behavior Trees in Multi-agent Games . . . 124
Xianglong Li, Yuan Li, Jieyuan Zhang, Xinhai Xu, and Donghong Liu


A Distributed kWTA for Decentralized Auctions . . . 136
Gary Sum, John Sum, Andrew Chi-Sing Leung, and Janet C. C. Chang

Leveraging Hierarchical Similarities for Contrastive Clustering . . . 148
Yuanshu Li, Yubin Xiao, Xuan Wu, Lei Song, Yanchun Liang, and You Zhou

Reinforcement Learning-Based Consensus Reaching in Large-Scale Social Networks . . . 169
Shijun Guo, Haoran Xu, Guangqiang Xie, Di Wen, Yangru Huang, and Peixi Peng

Lateral Interactions Spiking Actor Network for Reinforcement Learning . . . 184
Xiangyu Chen, Rong Xiao, Qirui Yang, and Jiancheng Lv

An Improved YOLOv5s for Detecting Glass Tube Defects . . . 196
Zhibo Wei and Liying Zheng

Label Selection Approach to Learning from Crowds . . . 207
Kosuke Yoshimura and Hisashi Kashima

New Stability Criteria for Markov Jump Systems Under DoS Attacks and Packet Loss via Dynamic Event-Triggered Control . . . 222
Huizhen Chen, Haiyang Zhang, Lianglin Xiong, and Shanshan Zhao

Image Blending Algorithm with Automatic Mask Generation . . . 234
Haochen Xue, Mingyu Jin, Chong Zhang, Yuxuan Huang, Qian Weng, and Xiaobo Jin

Design of Memristor-Based Binarized Multi-layer Neural Network with High Robustness . . . 249
Xiaoyang Liu, Zhigang Zeng, and Rusheng Ju

FEDEL: Frequency Enhanced Decomposition and Expansion Learning for Long-Term Time Series Forecasting . . . 260
Rui Chen, Wei Cui, Haitao Zhang, and Qilong Han

On the Use of Persistent Homology to Control the Generalization Capacity of a Neural Network . . . 274
Abir Barbara, Younés Bennani, and Joseph Karkazan

Genetic Programming Symbolic Regression with Simplification-Pruning Operator for Solving Differential Equations . . . 287
Lulu Cao, Zimo Zheng, Chenwen Ding, Jinkai Cai, and Min Jiang


Explainable Offensive Language Classifier . . . 299
Ayushi Kohli and V. Susheela Devi

Infrared Target Recognition Technology Based on Few Shot Learning . . . 314
Jing Zhang, Yangyang Ma, and Weihong Li

An Approach to Mongolian Neural Machine Translation Based on RWKV Language Model and Contrastive Learning . . . 327
Xu Liu, Yila Su, Wu Nier, Yatu Ji, Ren Qing Dao Er Ji, and Min Lu

Event-Triggered Constrained H∞ Control Using Concurrent Learning and ADP . . . 341
Shan Xue, Biao Luo, Derong Liu, and Dongsheng Guo

Three-Dimensional Rotation Knowledge Representation Learning Based on Graph Context . . . 353
Xiaoyu Chen, Yushui Geng, and Hu Liang

TextBFA: Arbitrary Shape Text Detection with Bidirectional Feature Aggregation . . . 365
Hui Xu, Qiu-Feng Wang, Zhenghao Li, Yu Shi, and Xiang-Dong Zhou

Bias Reduced Methods to Q-learning . . . 378
Patigül Abliz and Shi Ying

A Neural Network Architecture for Accurate 4D Vehicle Pose Estimation from Monocular Images with Uncertainty Assessment . . . 396
Tomasz Nowak and Piotr Skrzypczyński

PSO-Enabled Federated Learning for Detecting Ships in Supply Chain Management . . . 413
Y Supriya, Gautam Srivastava, K Dasaradharami Reddy, Gokul Yenduri, Nancy Victor, S Anusha, and Thippa Reddy Gadekallu

Federated Learning Using the Particle Swarm Optimization Model for the Early Detection of COVID-19 . . . 425
K. Dasaradharami Reddy, Gautam Srivastava, Yaodong Zhu, Y. Supriya, Gokul Yenduri, Nancy Victor, S. Anusha, and Thippa Reddy Gadekallu

Domain Generalization via Implicit Domain Augmentation . . . 437
Zhijie Rao, Qi Dong, Chaoqi Chen, Yue Huang, and Xinghao Ding


Unconstrained Feature Model and Its General Geometric Patterns in Federated Learning: Local Subspace Minority Collapse . . . 449
Mingjia Shi, Yuhao Zhou, Qing Ye, and Jiancheng Lv

Mutually Guided Dendritic Neural Models . . . 465
Yanzi Feng, Jian Wang, Peng Ren, and Sergey Ablameyko

Social-CVAE: Pedestrian Trajectory Prediction Using Conditional Variational Auto-Encoder . . . 476
Baowen Xu, Xuelei Wang, Shuo Li, Jingwei Li, and Chengbao Liu

Distributed Training of Deep Neural Networks: Convergence and Case Study . . . 490
Jacques M. Bahi, Raphaël Couturier, Joseph Azar, and Kevin Kana Nguimfack

Chinese Medical Intent Recognition Based on Multi-feature Fusion . . . 504
Xiliang Zhang, Tong Zhang, and Rong Yan

Multi-mobile Object Motion Coordination with Reinforcement Learning . . . 516
Shanhua Yuan, Sheng Han, Xiwen Jiang, Youfang Lin, and Kai Lv

Comparative Analysis of the Linear Regions in ReLU and LeakyReLU Networks . . . 528
Xuan Qi, Yi Wei, Xue Mei, Ryad Chellali, and Shipin Yang

Heterogeneous Graph Prototypical Networks for Few-Shot Node Classification . . . 540
Yunzhi Hao, Mengfan Wang, Xingen Wang, Tongya Zheng, Xinyu Wang, Wenqi Huang, and Chun Chen

Improving Deep Learning Powered Auction Design . . . 556
Shuyuan You, Zhiqiang Zhuang, Haiying Wu, Kewen Wang, and Zhe Wang

Link Prediction with Simple Path-Aware Graph Neural Networks . . . 570
Tuo Xu and Lei Zou

Restore Translation Using Equivariant Neural Networks . . . 583
Yihan Wang, Lijia Yu, and Xiao-Shan Gao

SDPSAT: Syntactic Dependency Parsing Structure-Guided Semi-Autoregressive Machine Translation . . . 604
Xinran Chen, Yuran Zhao, Jianming Guo, Sufeng Duan, and Gongshen Liu

Author Index . . . 617

Theory and Algorithms

Multi-model Smart Contract Vulnerability Detection Based on BiGRU

Shuxiao Song, Xiao Yu(B), Yuexuan Ma, Jiale Li, and Jie Yu
School of Computer Science and Technology, Shandong University of Technology, Zibo 255049, Shandong, China
[email protected]

Abstract. Smart contracts have been under constant attack from outside, and frequent security incidents have caused great economic losses to the virtual currency market, so their security has attracted much attention in the academic community. Traditional smart contract detection methods rely heavily on expert rules, resulting in low detection precision and efficiency. This paper explores the effectiveness of deep learning methods for smart contract detection and proposes a multi-model smart contract vulnerability detection method that combines a Bi-directional Gated Recurrent Unit (BiGRU) with the Synthetic Minority Over-sampling Technique (SMOTE). In a comparative study on the vulnerability detection of 10312 smart contract codes, the method achieves an identification accuracy of 90.17% and a recall rate of 97.7%. Compared with other deep network models, the method used in this paper shows superior performance in terms of recall and accuracy.

Keywords: Block Chain · Smart Contracts · Deep Learning · Vulnerability

1 Introduction

Bugs in software can harm it significantly and even kill it in some cases. The same is true for smart contracts on the blockchain; they are more susceptible to hacking since they frequently carry economic value, and once there are exploitable weaknesses, they often lead to significant financial losses. On major blockchain platforms like Ethereum, EOS, and VNT Chain, tens of thousands of smart contracts are already in use, and the number is constantly increasing. However, as the quantity of smart contracts increases, security breaches in smart contracts have also been reported. In 2016, hackers exploited a reentrancy vulnerability in the DAO to steal 3.6 million Ether, resulting in a loss of over $60 million. In 2017, a multi-signature wallet vulnerability in Parity was attacked, resulting in the freezing of $300 million in Ethereum. In 2022, an access control vulnerability in the $CF token contract resulted in a $1.9 million loss. Smart contract security has drawn much attention and has become a prominent area of research.

Once a smart contract is deployed on the blockchain, it cannot be modified, so it is especially important to audit it before it goes on the chain [1]. To find security vulnerabilities in smart contracts before they are uploaded, professionals must complete the audit of smart contracts, which requires a lot of human resources.

To save resources, this paper takes the Solidity smart contract source code as the research object and proposes the S-BiGRU smart contract detection method, a multi-model smart contract vulnerability detection method based on BiGRU. Since the key code locations of the smart contract source code are strongly correlated with the contract functionality, the capture of contextual information is very important for smart contract vulnerability detection. The method uses the SMOTE algorithm to balance the positive and negative classes in the dataset, and it uses the BiGRU model to effectively capture the features and contextual information in the smart contract source code to identify vulnerabilities. In the following, this method is referred to as the S-BiGRU method. The multi-model detection mentioned here refers to training an independent binary classification model for each detectable vulnerability and integrating the outputs of the multiple models in the vulnerability detection phase to derive the final smart contract vulnerability detection results. This approach leverages the variability between different vulnerability types: by training a specialized model for each vulnerability type, the features and patterns of a particular vulnerability can be captured and identified more accurately, and integrating the final outputs can improve the overall vulnerability detection performance and accuracy.

This paper contributes as follows:

• Detection of Solidity code for smart contracts using the method combining BiGRU and SMOTE, achieving a vulnerability identification accuracy of 90.17% on 10312 samples;
• The SMOTE algorithm is used to deal with the severely skewed positive and negative classes in the smart contract dataset, achieving better results.
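To make the multi-model integration idea concrete, the sketch below shows one way independent per-vulnerability binary classifiers could be combined at detection time. This is a hypothetical illustration rather than the authors' code: the detector interface, the 0.5 decision threshold, and the token-based input are assumptions.

```python
# Hypothetical sketch: integrating independent per-vulnerability binary
# classifiers into one multi-model detector (not the authors' implementation).
from typing import Callable, Dict, List

# Each detector maps a preprocessed contract (a token sequence) to the
# probability that the corresponding vulnerability is present.
Detector = Callable[[List[str]], float]

def detect_vulnerabilities(tokens: List[str],
                           detectors: Dict[str, Detector],
                           threshold: float = 0.5) -> Dict[str, bool]:
    """Run every per-vulnerability model and integrate the outputs."""
    return {name: model(tokens) >= threshold for name, model in detectors.items()}

if __name__ == "__main__":
    # Dummy stand-ins for trained per-vulnerability S-BiGRU classifiers.
    detectors = {
        "reentrancy": lambda toks: 0.93 if "call.value" in toks else 0.10,
        "access_control": lambda toks: 0.20,
    }
    print(detect_vulnerabilities(["function", "withdraw", "call.value"], detectors))
    # -> {'reentrancy': True, 'access_control': False}
```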

2 Related Work The DAO incident in 2016 brought a huge impact on Ethereum, which not only exposed the smart contract security issue but also drew the attention of the majority of blockchainrelated people. Compared to traditional software flaws, smart contracts are hidden and modifiable. As a program running on a decentralized blockchain, the lifecycle of a smart contract and the nature of the development language provide credibility and opportunities for hackers to exploit them. Unlike traditional programs, smart contracts cannot be patched to fix vulnerabilities due to their tamper-evident nature. Therefore, detecting vulnerability before a smart contract is put on the chain is especially important. In recent years, automated smart contract detection tools have been gradually introduced. The main methods used are symbolic execution [2–4], fuzzy testing [5, 6], formal verification [7, 8], deep learning [9, 10], etc. The following is a brief introduction and comparison of the above methods. Symbolic execution is a static analysis method that explores different execution paths of a smart contract by symbolizing the variables in the smart contract and constructing constraints. The advantage of symbolic execution is its ability to find accurate vulnerabilities and anomalous behavior. Still, its disadvantage is that it is prone to path


explosion problems, leading to very time-consuming analysis for large contracts. Fuzz testing is a dynamic approach that simulates contract execution by generating many random or mutated inputs to find potential vulnerabilities. Its advantage is that it can quickly generate test cases and find unknown vulnerabilities; its disadvantage is that it is difficult to cover complex execution paths and it does not provide root-cause analysis of contract vulnerabilities. Formal verification converts the concepts, judgments, and reasoning in a contract into a formal model through a formal language to eliminate ambiguities in the contract, and then verifies the correctness and security of the functions in the smart contract with rigorous logic and proofs. Its advantage is that it can provide rigorous mathematical proofs to ensure the contract satisfies the specification and security properties; its disadvantage is that the specification and proofs can be very cumbersome for complex contracts. Deep learning uses multi-layer neural networks to extract and learn features of smart contracts for risk and vulnerability prediction. Its advantage is that it can handle large amounts of data and complex features; its disadvantage is that it requires large amounts of training data and computational resources, and it may be difficult to explain the reasons for the prediction results.

3 S-BiGRU Smart Contracts Detection Method In this section, a basic overview of the S-BiGRU smart contract detection model is first presented in Sect. 3.1, and then the training and prediction parts of the model are described in detail in Sect. 3.2. 3.1 S-BiGRU Multi-model Smart Contracts Vulnerability Detection Framework To detect vulnerabilities in Ethereum smart contracts, this paper proposes a BiGRU-based smart contract vulnerability detection scheme, named the S-BiGRU multi-model smart contract detection method, whose overall architecture is shown in Fig. 1. The figure is divided into two parts. The top half, dominated by blue arrows, shows the training of the S-BiGRU model: first, the dataset is preprocessed, for example by removing blank lines, non-ASCII characters, and other irrelevant comments. Second, the preprocessed Solidity source code is parsed into a series of tokens that are embedded into feature vectors using Word2Vec, denoted as word vectors. Then, minority-class samples are generated using the SMOTE algorithm to solve the data imbalance problem. Finally, these word vectors are fed into the model for training to obtain a model capable of detecting smart contract vulnerabilities for subsequent detection work. The bottom half of the figure, dominated by red arrows, shows the vulnerability detection stage: the data to be detected are preprocessed to obtain the corresponding word vectors, fed into the trained vulnerability detection models, and the integrated results are output. 3.2 Model Training Feature Extraction. Smart contracts are generally multi-line programs written in Solidity, an Ethereum high-level language. However, some lines of code in a contract

Fig. 1. Overall architecture of the S-BiGRU multi-model smart contract vulnerability detection framework: Solidity datasets and data to be tested undergo data preprocessing, tokenization, and word-vector embedding before model training and vulnerability detection. Evaluation metrics: Accuracy (ACC), Precision (PRE), Recall (TPR), False positive rate (FPR), F1-Score (F1).

Θ(x) = 1, if x ≥ Vth; Θ(x) = 0, otherwise (6)

When the membrane potential exceeds the firing threshold Vth, the neuron fires a spike and resets its membrane potential to zero. This mechanism is known as “Hard Reset”. Lateral Interactions and Soft Reset Improved Model Soft Reset. Unlike hard reset, which immediately resets the membrane potential to its initial value, soft reset allows a partial reset of the membrane potential after crossing the threshold. We contend that the preservation of residual voltage after neuron firing confers enhanced efficacy upon SNNs in complex spatiotemporal representation. Hence, to implement the soft reset mechanism in our model, we make the following modification to Eq. 4:

Vi(t) = β(Vi(t − 1) − Oi(t − 1) · Vth) + Ii(t) (7)

Lateral Interactions. Currently, a plethora of studies shows that incorporating internal connections within neural networks enhances computational performance in many fields [2,18]. Consequently, we propose incorporating internal connections among neurons into the construction of the SNN. We aim to investigate the potential enhancement of processing performance for RL tasks by introducing lateral interactions among neurons within the same layer in the spiking actor network.


Furthermore, we make the following enhancements to Eq. 7 in our study:

Vi(t) = β(Vi(t − 1) − Oi(t − 1) · Vth) + Ii(t) + Oj(t − 1) · WN (8)

where Oj(t − 1) is the spiking output of the neighbors in the same layer at time t − 1, and WN is a learnable matrix which characterizes the interconnections among neurons within the same layer to increase network flexibility. We refer to the constructed model as the Lateral Interactions Spiking Neuron (LISN). To observe the difference in information representation ability between the LISN model and the LIF model, we present sinusoidal wave signals to simulate the voltage inputs (weighted sums of pre-synaptic neuron outputs) to the neurons. We also simulate four neighboring neurons in the LISN model. The lateral interaction weight matrix WN is initialized with random values from a normal distribution with mean 0 and standard deviation 0.05. Under identical voltage inputs, the contrasting dynamics of the LISN and the traditional LIF neuron are depicted in Fig. 2: the two neurons exhibit distinct voltage dynamics, leading to disparate spike patterns. Specifically, the LISN neuron model produces a total of 6 output spikes, while the LIF model generates 9 spikes (see Fig. 2b). This observation highlights that the LISN neuron, with the incorporation of soft reset and interconnection, is less susceptible to continuous fluctuations in input voltages. Furthermore, for

Fig. 2. Contrasting dynamics between LISN and traditional LIF neurons with the same voltage inputs and threshold


the sake of clarity in presentation, we keep the parameters of WN fixed. However, it is important to note that in actual experimental settings, WN is learned during the training iterations, which enables LISN to robustly process intricate input information.
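The dynamics of Eqs. 6-8 can be summarized in a short NumPy sketch; the absence of self-connections in WN, the sinusoidal input current, and the simulation length are illustrative assumptions rather than settings taken from the paper.

import numpy as np

def lisn_step(V, O_prev, I_t, W_N, beta=0.5, V_th=0.5):
    # One LISN update (Eq. 8): soft reset plus lateral input from same-layer neighbours.
    V_new = beta * (V - O_prev * V_th) + I_t + W_N @ O_prev
    O_new = (V_new >= V_th).astype(float)       # spike when the threshold is crossed (Eq. 6)
    return V_new, O_new

# Toy simulation: four neighbouring neurons driven by a sinusoidal input current.
rng = np.random.default_rng(0)
n = 4
W_N = rng.normal(0.0, 0.05, size=(n, n))        # learnable in the paper; fixed here
np.fill_diagonal(W_N, 0.0)                      # assumption: no self-connection
V, O = np.zeros(n), np.zeros(n)
for t in range(100):
    I_t = 0.6 * np.sin(2 * np.pi * t / 25) * np.ones(n)
    V, O = lisn_step(V, O, I_t, W_N)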

2.3 Surrogate Gradient

The non-differentiable nature of the firing function (Eq. 8) poses challenges in training SNNs by backpropagation. To overcome the gradient-vanishing issue caused by the non-differentiable firing function, researchers have introduced the concept of the surrogate gradient to facilitate gradient propagation in deep SNNs. In our work, we use a rectangular function to approximate the gradient of a spike:

d(x) = 1, if |x − Vth| < 0.5; d(x) = 0, otherwise (9)

where d is the pseudo-gradient, x is the membrane voltage, and Vth is the firing threshold.
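A minimal PyTorch sketch of this rectangular surrogate gradient is given below; the tensor shape in the usage example is illustrative.

import torch

class RectSurrogateSpike(torch.autograd.Function):
    # Heaviside spike in the forward pass, rectangular pseudo-gradient (Eq. 9) in the backward pass.
    @staticmethod
    def forward(ctx, v, v_th):
        ctx.save_for_backward(v)
        ctx.v_th = v_th
        return (v >= v_th).float()

    @staticmethod
    def backward(ctx, grad_output):
        (v,) = ctx.saved_tensors
        window = (torch.abs(v - ctx.v_th) < 0.5).float()   # d(x) from Eq. 9
        return grad_output * window, None                  # no gradient w.r.t. the threshold

v = torch.randn(8, requires_grad=True)       # toy membrane voltages
spikes = RectSurrogateSpike.apply(v, 0.5)
spikes.sum().backward()                      # gradients flow through the rectangular window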

2.4 Population Decoder

In the population decoding process, spikes generated at the output layer are accumulated within a predefined time window to calculate the average firing rate. Subsequently, the output action is obtained by applying a weighted sum to the computed average firing rate.
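The decoding step can be sketched as follows; the population size, action dimension, and the sharing of decoding weights across action dimensions are simplifying assumptions.

import torch
import torch.nn as nn

class PopulationDecoder(nn.Module):
    # Average output-population firing rates over the time window and map them to actions.
    def __init__(self, pop_size=10, action_dim=3):
        super().__init__()
        self.pop_size, self.action_dim = pop_size, action_dim
        self.decode = nn.Linear(pop_size, 1, bias=False)   # assumed shared across dimensions

    def forward(self, spikes):                 # spikes: (T, action_dim * pop_size)
        rates = spikes.float().mean(dim=0)     # average firing rate over the time window
        rates = rates.view(self.action_dim, self.pop_size)
        return self.decode(rates).squeeze(-1)  # one continuous action value per dimension

decoder = PopulationDecoder()
actions = decoder(torch.randint(0, 2, (5, 30)))   # T = 5 window, 3 action dimensions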

3 Experiment

In this section, we adopt LISAN as the actor network, with DNNs as the critic networks. We evaluate LISAN in different environments and compare it with mainstream models.

3.1 Benchmarking LISAN Against Mainstream Models

Environment. We choose four classic MuJoCo continuous robot control tasks from the OpenAI gym (see Fig. 3) as our RL environments; the dimension information of each environment is shown in Table 1. Benchmarking. We first compare LISAN to the existing models DAN and PopSAN [17]. To investigate how soft reset and lateral interactions work on LIF neurons, we integrate them into PopSAN for comparison: PopSAN+SR (soft reset) and PopSAN+LI (lateral interactions). All models are trained using the TD3 algorithm combined with deep critic networks of the same structure. The hyperparameter configurations of these models are as follows:


• DAN: Actor network (256, relu, 256, relu, tanh), learning rate = 1e−3; critic network (256, relu, 256, relu, linear), learning rate = 1e−3; mini-batch size n = 100; reward discount factor γ = 0.99; soft target update factor μ = 0.005; policy delay factor d = 2; maximum size of the replay buffer is 1M.
• PopSAN: Actor network (Population Encoder, 256, LIF, 256, LIF, Population Decoder); input population size for a single state dimension is 10; output population size for a single action dimension is 10; firing threshold Vth = 0.5; current decay factor α = 0.5; voltage decay factor β = 0.5; SNN time window T = 5; the remaining parameters are the same as DAN.
• LISAN: Actor network (Population Encoder, 256, LISN, 256, LISN, Population Decoder); the remaining parameters are the same as PopSAN.
• PopSAN+SR: PopSAN improved with Soft Reset.
• PopSAN+LI: PopSAN improved with Lateral Interactions.
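For reference, the hyperparameters listed above can be gathered into a small Python configuration sketch; the key names are illustrative and not taken from the authors' code.

TD3_COMMON = dict(
    actor_lr=1e-3, critic_lr=1e-3, batch_size=100, gamma=0.99,
    soft_update=0.005, policy_delay=2, replay_buffer_size=1_000_000,
)
POPSAN_EXTRA = dict(
    in_pop_size=10, out_pop_size=10, v_threshold=0.5,
    current_decay=0.5, voltage_decay=0.5, time_window=5,
)
LISAN_CONFIG = {**TD3_COMMON, **POPSAN_EXTRA}   # LISAN reuses the PopSAN settings with LISN neurons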

Fig. 3. The overview of four OpenAI gym tasks in the simulation environment: (a) Ant: make a quadruped crawling robot run as fast as possible; (b) HalfCheetah: make a biped robot walk as fast as possible; (c) Walker2d: make a biped robot walk quickly; (d) Hopper: make a 2D robot hop forward

Training. In our experiment, we perform ten independent trainings on each model, using the same 10 random seeds to ensure consistency. During our training procedure, we train the models in each environment for one million steps, and we test every 10K steps. During each test, we calculate the average reward across 10 episodes with each episode capped at 1K steps.
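A sketch of this evaluation protocol, assuming a policy callable and a Gym environment with the classic 4-tuple step API, is given below.

def evaluate(policy, env, episodes=10, max_steps=1000):
    # Average return over 10 evaluation episodes, each capped at 1K steps.
    returns = []
    for _ in range(episodes):
        obs, done, ep_return, steps = env.reset(), False, 0.0, 0
        while not done and steps < max_steps:
            obs, reward, done, _ = env.step(policy(obs))
            ep_return += reward
            steps += 1
        returns.append(ep_return)
    return sum(returns) / len(returns)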

Table 1. Dimension information of four OpenAI gym environments

Environment   State Dimension   Action Dimension
Ant           111               8
HalfCheetah   17                6
Walker2d      17                6
Hopper        11                3

Result. The experimental results are shown in Fig. 4 and Table 2. LISAN achieves the best performance in both simple and complex continuous control tasks. On the other hand, DAN only performs well in simple environments and fails to perform well on reinforcement learning tasks that involve high-dimensional environments and actions. Furthermore, we notice that the enhancement in model performance resulting from the soft reset mechanism is inconsistent. The residual voltage generated by soft reset proves advantageous to the model only in specific scenarios where it captures more dynamic information and leads to improved performance. It is also evident that LISAN exhibits a notable performance improvement even in the absence of soft reset.

Fig. 4. Learning curves for different algorithms in the MuJoCo environment


Table 2. The maximum average return over 10 random seeds. LISAN achieves the best performance marked in bold

Actor Network   Ant     HalfCheetah   Walker2d   Hopper
DAN             10588   5038          4745       3685
PopSAN          5736    10899         5605       3772
PopSAN+SR       5799    11295         5379       3590
PopSAN+LI       5811    11170         5816       3675
LISAN           5965    11893         5901       3781

3.2 Discussion of Different Input Codings for LISAN

We employ four different population coding methods [20] to encode the input state information, namely pure population coding (Cpop), population and Poisson coding (Cpop + Cpoi), population and deterministic coding (Cpop + Cdet), and population and uniform coding (Cpop + Cuni). We apply these coding methods to LISAN and test their performance on the Hopper task. Figure 5 shows the four integrated population-based coding methods with LISAN; all of them achieve good performance in the Hopper environment. Notably, population and deterministic coding achieves the highest rewards thanks to its faster convergence speed, while pure population coding achieves the fewest rewards. These observations highlight the robust adaptability and processing capabilities of our LISAN model across diverse coding methods.

Fig. 5. Comprehensive comparison of the impact of various input coding methods on Hopper task


4 Conclusion

In this paper, we propose the Lateral Interactions Spiking Actor Network (LISAN) for deep reinforcement learning. LISAN utilizes the lateral interaction neuron model and incorporates a soft reset mechanism for efficient training, which enables it to handle complex information while maintaining energy efficiency. Through extensive experiments conducted on four continuous control tasks from the OpenAI Gym, we demonstrate that LISAN outperforms the state-of-the-art deep neuron model as well as the SNN model with the same hybrid architecture. Additionally, we evaluate the performance of LISAN under different encoding methods and show that LISAN achieves promising results across all of them. We hope that our work can serve as a fundamental building block for the emerging field of Spiking-RL and can be extended to a wide range of tasks in future research. Acknowledgments. This work was supported by the National Natural Science Foundation of China (Grant No. 62206188), the China Postdoctoral Science Foundation (Grant No. 2022M712237), the Sichuan Province Innovative Talent Funding Project for Postdoctoral Fellows, and the 111 Project under grant B21044.

References
1. Arulkumaran, K., Deisenroth, M.P., Brundage, M., Bharath, A.A.: A brief survey of deep reinforcement learning. arXiv preprint arXiv:1708.05866 (2017)
2. Cheng, X., Hao, Y., Xu, J., Xu, B.: LISNN: improving spiking neural networks with lateral interactions for robust object recognition. In: IJCAI, pp. 1519–1525 (2020)
3. Evanusa, M., Sandamirskaya, Y., et al.: Event-based attention and tracking on neuromorphic hardware. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2019)
4. Florian, R.V.: Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput. 19(6), 1468–1502 (2007)
5. Frémaux, N., Sprekeler, H., Gerstner, W.: Reinforcement learning using a continuous time actor-critic framework with spiking neurons. PLoS Comput. Biol. 9(4), e1003024 (2013)
6. Han, B., Srinivasan, G., Roy, K.: RMP-SNN: residual membrane potential neuron for enabling deeper high-accuracy and low-latency spiking neural network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13558–13567 (2020)
7. Harris, K.D., Csicsvari, J., Hirase, H., Dragoi, G., Buzsáki, G.: Organization of cell assemblies in the hippocampus. Nature 424(6948), 552–556 (2003)
8. Hu, S., Zhu, F., Chang, X., Liang, X.: UPDeT: universal multi-agent reinforcement learning via policy decoupling with transformers. arXiv preprint arXiv:2101.08001 (2021)
9. Lahijanian, M., et al.: Resource-performance tradeoff analysis for mobile robots. IEEE Robot. Autom. Lett. 3(3), 1840–1847 (2018)
10. Memmesheimer, R.M., Rubin, R., Ölveczky, B.P., Sompolinsky, H.: Learning precisely timed spikes. Neuron 82(4), 925–938 (2014)
11. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)


12. Niroui, F., Zhang, K., Kashino, Z., Nejat, G.: Deep reinforcement learning robot for search and rescue applications: exploration in unknown cluttered environments. IEEE Robot. Autom. Lett. 4(2), 610–617 (2019)
13. Ratliff, F., Hartline, H.K., Lange, D.: The dynamics of lateral inhibition in the compound eye of limulus. I. Studies on Excitation and Inhibition in the Retina: A Collection of Papers from the Laboratories of H. Keffer Hartline, p. 463 (1974)
14. Sandamirskaya, Y.: Dynamic neural fields as a step toward cognitive neuromorphic architectures. Front. Neurosci. 7, 276 (2014)
15. Schöner, G., Spencer, J.P.: Dynamic Thinking: A Primer on Dynamic Field Theory. Oxford University Press, Oxford (2016)
16. Silver, D., et al.: Mastering the game of go with deep neural networks and tree search. Nature 529(7587), 484–489 (2016)
17. Tang, G., Kumar, N., Yoo, R., Michmizos, K.: Deep reinforcement learning with population-coded spiking neural network for continuous control. In: Conference on Robot Learning, pp. 2016–2029. PMLR (2021)
18. Wu, Z., et al.: Modeling learnable electrical synapse for high precision spatiotemporal recognition. Neural Netw. 149, 184–194 (2022)
19. Yuan, M., Wu, X., Yan, R., Tang, H.: Reinforcement learning in spiking neural networks with stochastic and deterministic synapses. Neural Comput. 31(12), 2368–2389 (2019)
20. Zhang, D., Zhang, T., Jia, S., Xu, B.: Multi-sacle dynamic coding improved spiking actor network for reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 59–67 (2022)

An Improved YOLOv5s for Detecting Glass Tube Defects

Zhibo Wei and Liying Zheng(B)

School of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
[email protected]

Abstract. Existing algorithms for detecting glass tube defects suffer from low recognition accuracy and huge scales. This paper proposes an improved YOLOv5s to solve these problems. Specifically, the Convolutional Block Attention Module (CBAM) is used in the improved YOLOv5s to enhance feature extraction. Then, Content-Aware ReAssembly of FEatures (CARAFE) is used to replace the original upsampling method, which is beneficial for feature fusion. Next, the Efficient Intersection over Union (EIoU) loss is substituted for the original Complete Intersection over Union (CIoU) loss of YOLOv5s, which improves the regression accuracy. Finally, we adopt Cosine Warmup to accelerate convergence as well as improve the detection performance. The experimental results show that, compared with the original YOLOv5s, our improved YOLOv5s increases the mean Average Precision (mAP) by 8% while decreasing the number of model parameters by 5.8%. Moreover, the improved detector reaches 98 Frames Per Second (FPS), which meets the requirement of real-time detection. Keywords: YOLOv5s · Defect Detection · Model Scale Reduction · Attention Module · Loss · Cosine Warmup

1 Introduction The manufacture of glass tubes is susceptible to objective factors and the surrounding environment, resulting in many types of defects such as shatters, scratches and blemishes. Thus, glass tube manufacturing depends heavily on various defect detection methods. So far, the detection of glass tube defects is still a hot and open problem. Generally speaking, the traditional manual detection method is greatly limited by the experience of inspectors, suffering from slow speed, poor consistency, and high labor costs. Though mechanical contact detection methods meet industrial production requirements in terms of quality, they are generally expensive, inflexible and slow. Deep learning has recently been applied to various fields of defect detection. Li et al. [1] proposed a detector based on regions of interest. With the Multi-Layer Perceptron (MLP) [2] and deep learning, their detector effectively improves the detection accuracy. Jin et al. [3] proposed a multi-channel automatically encoded convolutional network model to address detection errors caused by small differences in the feature space in glass detection, improving recognition accuracy and reducing training time.


The You Only Look Once (YOLO) model [4, 5] is a representative one-stage detector. Compared with two-stage detectors based on region extraction, such as Region-CNN (R-CNN) [6], YOLO has shorter training time, smaller space occupation, relatively lower computational cost and faster detection speed. The first version, YOLOv1 [7], limits the number of predictable objects, and its accuracy is not satisfactory. Later, the second version, YOLOv2 [8], uses tricks such as global average pooling, leading to more stable and faster training as well as better performance for detecting small targets. Then, YOLOv3 [9] uses multi-scale prediction and DarkNet53, which greatly improves accuracy. YOLOv4 [10] integrates many tricks that improve accuracy to further enhance the performance of the model. YOLOX [11] incorporates mechanisms such as data augmentation and anchor-free detection. YOLOv6 [12] optimizes the model by improving the Cross Stage Partial Network (CSPNet) [13]. YOLOv7 [14] proposes planned model structural re-parameterization and coarse-to-fine guided label assignment. YOLOv8 [15] mainly uses the Context-aware Cross-level Fusion network (C2f) to replace the Concentrated-Comprehensive Convolution block (C3) [16] to make the model more lightweight. Thanks to the excellent detection performance of the abovementioned YOLO models, they are widely applied to defect detection tasks, including glass tube defect detection. This paper aims to improve the detection performance for glass tube defects. We improve YOLOv5s [17, 18] to meet the requirements of glass tube manufacturing. Firstly, the Convolutional Block Attention Module (CBAM) [19] is used to improve the feature extraction of the model. Secondly, the lightweight upsampling operator Content-Aware ReAssembly of FEatures (CARAFE) [20] is used to replace the original nearest neighbor interpolation, which results in less degradation of image quality and improved detection accuracy. Then, the Efficient Intersection over Union (EIoU) [21] is chosen to optimize the calculation of the bounding box regression loss: the width-to-height ratio regression of the Complete Intersection over Union (CIoU) [22] is split into separate regressions of the width and height values to improve the regression accuracy. Finally, the Cosine Warmup [23] strategy is used to preheat and adjust the learning rate, making training more stable and yielding faster convergence and higher accuracy. The remainder of this paper is organized as follows. Section 2 describes the structure of the original YOLOv5s and the problems in using it for defect detection in glass tubes. Section 3 describes the improved YOLOv5s based on CBAM, CARAFE, EIoU and Cosine Warmup. Section 4 presents experimental results and analysis, and Sect. 5 gives some conclusions and future work.

2 Analysis YOLOv5s for Glass Tube Defect Detection YOLOv5 is a one-stage target detection algorithm. It divides the image into grids, and each grid predicts detection boxes for classification and localization. Compared with the previous four versions, YOLOv5 is more lightweight and faster. Among the variants of YOLOv5, the model depth of YOLOv5s is the least, and its feature map width is the smallest. Furthermore, YOLOv5s is the fastest one in YOLOv5 family. As shown in Fig. 1, YOLOv5s consists of three modules, i.e. feature extraction, feature fusion, and prediction. Here, the feature extraction consists of Convolutional


Fig. 1. YOLOv5s

Neural Network (Conv) layers, C3, and Spatial Pyramid Pooling-Fast (SPPF) [24, 25]. It extracts features and performs pooling connections to transform an input image into feature maps. The feature fusion uses Feature Pyramid Networks (FPN) [26] and performs multi-scale fusion of the feature maps. With sampling, concatenation (Concat), and Conv, the feature fusion obtains quite rich features. The prediction module includes convolutional layers, pooling layers, and fully connected layers. It uses K-Means clustering, the CIoU loss, and Non-Maximum Suppression (NMS) [27] to get the final regression results. As shown in Fig. 2, defects of glass tubes are divided into three categories: shatters, scratches, and blemishes. Shatter defects are relatively easy to identify, as they have clear boundaries and a large damaged area. Most scratches are long frictional lines. Blemishes are essentially small, black, dotted targets. Since lighter blemishes do not differ much from the surrounding pixels, they are difficult to detect. In addition, in some cases, the same defect type has different appearances. For example, most tube scratches are continuous and clearly

Fig. 2. Types of glass tube defects: (a) shatters; (b) scratches; (c) blemishes.


defined black friction lines, but there are also discontinuous ones that resemble tiny shatters or small blemishes.

Fig. 3. Examples of positioning errors and missed detections: (a) positioning errors for shatters; (b) missed detections for scratches; (c) missed detections for blemishes.

Firstly, we directly use the original YOLOv5s shown in Fig. 1 to detect glass tube defects. The experimental results show that its accuracy is relatively low. As shown in Fig. 3, missed detections frequently occur for scratches and blemishes, and positioning errors sometimes occur for shatters. Due to their obvious appearance features, the detection accuracy for shatters is higher, but as shown in Fig. 3(a), the target location deviations for shatters are severe compared to the other two defects. When a glass tube is rotated, the scratches may overlap with the boundary of the tube body; in this case, YOLOv5s cannot learn the complete features, leading to missed detections. Further, blemishes are usually small, and YOLOv5s is weak at detecting small targets, resulting in poor performance for blemishes with the original YOLOv5s. In addition, other problems such as large training time cost and low detection efficiency need to be improved for real applications. In summary, the results of the original YOLOv5s show the following issues: low accuracy, false detection, missed detection, and target box deviation. Therefore, this paper improves YOLOv5s to alleviate these problems.

3 Improved YOLOv5s To solve the abovementioned problems of YOLOv5s for glass tube defect detection, we modify its structure and propose an improved YOLOv5s, shown in Fig. 4. Compared with the original one, the improved model uses the CBAM hybrid attention mechanism to enhance the feature extraction of the backbone of YOLOv5s. Besides, we substitute the lightweight upsampling operator CARAFE for the original nearest neighbor interpolation to improve the feature fusion of the neck of YOLOv5s. Further, we use


EIoU to optimize the bounding box regression. Finally, the Cosine Warmup is used to adjust the learning rate decay and enhance the model performance.
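The CBAM block referred to above admits a compact implementation; the following PyTorch module is an illustrative sketch of CBAM [19] (channel attention followed by spatial attention), not the exact module used in this paper, and the reduction ratio, spatial kernel size, and feature-map shape are assumptions. Section 3.1 describes where the block is placed in the backbone.

import torch
import torch.nn as nn

class CBAM(nn.Module):
    def __init__(self, channels, reduction=16, kernel_size=7):
        super().__init__()
        self.mlp = nn.Sequential(                 # shared MLP for channel attention
            nn.Conv2d(channels, channels // reduction, 1, bias=False),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1, bias=False),
        )
        self.spatial = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2, bias=False)

    def forward(self, x):
        # Channel attention from average- and max-pooled descriptors.
        avg = self.mlp(x.mean(dim=(2, 3), keepdim=True))
        mx = self.mlp(x.amax(dim=(2, 3), keepdim=True))
        x = x * torch.sigmoid(avg + mx)
        # Spatial attention from channel-wise average and max maps.
        s = torch.cat([x.mean(dim=1, keepdim=True), x.amax(dim=1, keepdim=True)], dim=1)
        return x * torch.sigmoid(self.spatial(s))

feat = torch.randn(1, 256, 40, 40)    # e.g. an intermediate feature map
out = CBAM(256)(feat)                 # same shape, attention-reweighted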

Fig. 4. Improved YOLOv5s network structure

3.1 CBAM YOLOv5s does not pay enough attention to the key content of an input image. To solve this problem, we add the attention mechanism CBAM to YOLOv5s. Specifically, we only replace the C3 module at the 8th layer of the backbone with CBAM. We use CBAM instead of other attention mechanisms, such as SE [28] and CA [29], because CBAM connects the Channel Attention Module (CAM) and the Spatial Attention Module (SAM) in its structure, which combines the advantages of SE and CA in terms of function. Such a strategy is more helpful for extracting the characteristic information of glass tube defects. When an image is input to YOLOv5s with CBAM, the attention mechanism forces the model to concentrate on key areas for detection and discard unrelated information, which enhances the performance of small target detection. 3.2 CARAFE The original YOLOv5s uses nearest neighbor interpolation for upsampling, taking the gray value of the pixel nearest to the position of the sample point to be measured as the gray value of this point. Such a method causes jagged and mosaic artifacts and degrades image quality. We replace the nearest neighbor interpolation at the 11th and 15th layers of YOLOv5s with CARAFE, setting them to [512, 3, 2] and


[256, 3, 2]. Besides, their upsampling factor is 2, and the feature map sizes are 40 × 40 and 80 × 80. The improved feature fusion module uses a bigger receptive field to aggregate a larger range of contextual information when upsampling, and the dynamically generated adaptive kernel samples based on the content of the features. The information around the sampling point in the local area receives more attention than before, thereby reducing image loss and enhancing feature fusion. 3.3 Loss Function and Learning Rate In training and inference, YOLOv5s uses the CIoU loss given by (1) as the bounding box loss function:

L_CIoU = 1 − IoU + ρ²(b, b^gt)/c² + αv  (1)

CIoU only uses the width-to-height ratio as an influence factor and does not use the width and height themselves, so the width and height of the prediction box cannot both increase (or both decrease) during regression. EIoU solves this problem well by regressing the width and height values separately. Thus, we substitute the EIoU loss given in (2) for CIoU:

L_EIoU = 1 − IoU + ρ²(b, b^gt)/c² + ρ²(w, w^gt)/c_w² + ρ²(h, h^gt)/c_h²  (2)

where IoU is the ratio of the intersection and the union of the prediction box and the ground-truth box, ρ(·) is the Euclidean distance between the center points of the prediction box and the ground-truth box, c is the diagonal distance of the smallest enclosing rectangle of the prediction box and the ground-truth box, and c_w and c_h are the width and height of the smallest enclosing box covering the prediction box and the ground-truth box. Finally, because the weights of the model are randomly initialized at the beginning of training, a large learning rate at the beginning may lead to model oscillation and a non-optimal solution. Therefore, in this paper, we choose Cosine Warmup to preheat and dynamically adjust the learning rate decay: we start with a small learning rate, gradually increase it to a pre-set maximum value, and then decay it via cosine annealing.
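As a reference for Eq. (2) and the warm-up schedule, the following is a minimal Python/PyTorch sketch; it is an illustrative implementation rather than the exact code used in the paper, and the warm-up step counts and learning-rate bounds are assumed values.

import math
import torch

def eiou_loss(pred, target, eps=1e-7):
    # EIoU (Eq. 2) for boxes given as (x1, y1, x2, y2).
    ix1, iy1 = torch.max(pred[:, 0], target[:, 0]), torch.max(pred[:, 1], target[:, 1])
    ix2, iy2 = torch.min(pred[:, 2], target[:, 2]), torch.min(pred[:, 3], target[:, 3])
    inter = (ix2 - ix1).clamp(min=0) * (iy2 - iy1).clamp(min=0)
    area_p = (pred[:, 2] - pred[:, 0]) * (pred[:, 3] - pred[:, 1])
    area_t = (target[:, 2] - target[:, 0]) * (target[:, 3] - target[:, 1])
    iou = inter / (area_p + area_t - inter + eps)

    pcx, pcy = (pred[:, 0] + pred[:, 2]) / 2, (pred[:, 1] + pred[:, 3]) / 2
    tcx, tcy = (target[:, 0] + target[:, 2]) / 2, (target[:, 1] + target[:, 3]) / 2
    wp, hp = pred[:, 2] - pred[:, 0], pred[:, 3] - pred[:, 1]
    wt, ht = target[:, 2] - target[:, 0], target[:, 3] - target[:, 1]

    # Smallest enclosing box.
    cw = torch.max(pred[:, 2], target[:, 2]) - torch.min(pred[:, 0], target[:, 0])
    ch = torch.max(pred[:, 3], target[:, 3]) - torch.min(pred[:, 1], target[:, 1])
    c2 = cw ** 2 + ch ** 2 + eps

    return (1 - iou
            + ((pcx - tcx) ** 2 + (pcy - tcy) ** 2) / c2
            + (wp - wt) ** 2 / (cw ** 2 + eps)
            + (hp - ht) ** 2 / (ch ** 2 + eps))

def warmup_cosine_lr(step, warmup_steps=500, total_steps=30000, max_lr=0.01, start_lr=1e-4):
    # Cosine Warmup sketch: linear warm-up to max_lr, then cosine decay.
    if step < warmup_steps:
        return start_lr + (max_lr - start_lr) * step / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return 0.5 * max_lr * (1 + math.cos(math.pi * progress))

loss = eiou_loss(torch.tensor([[0., 0., 10., 10.]]), torch.tensor([[1., 1., 9., 11.]])).mean()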

4 Experimental Results and Analysis The experimental environment is configured with an Intel Core i7-11800H + NVIDIA GeForce RTX 3060 + CUDA 11.6 + Python 3.10.1. We use mAP to evaluate the mean average precision over all detected defect types and Frames Per Second (FPS) to evaluate the detection speed. Further, we use Params to indicate the number of model parameters and Floating Point Operations (FLOPs) to measure the computational complexity of the model. Our glass tube defect dataset consists of 1200 images. The training set, test set, and validation set are divided in a ratio of 8:1:1. The size of the input images is 640 × 640. We use Mosaic data augmentation and early stopping while training. In addition, the batch size, the initial learning rate, and the coefficient of the EIoU loss are 8, 0.01, and 0.05, respectively.


4.1 Visual Results of YOLOv5s and Our Improved Model Some visual results of YOLOv5s and our model are shown in Figs. 5, 6 and 7, with the confidence indicated above each bounding box. It can be seen from Fig. 5 that the improved model corrects the problem of bounding boxes deviating from the targets in the detection results of the original YOLOv5s, and the confidence is improved significantly. As shown in Fig. 6, the improved algorithm largely solves the problem of missed detections and increases the confidence. As shown in Fig. 7, the number of successful detections of small blemishes by our model is greatly improved, and the confidence is also increased. In summary, the improved YOLOv5s outperforms the original one in detecting all three defect types, and the problems of false detection, missed detection and positioning error are alleviated.

Fig. 5. Detection results of shatters: (a) images to be examined; (b) results of the original YOLOv5s; (c) results of the improved YOLOv5s.

4.2 Ablation Experiments Five ablation experiments are conducted, and the results are listed in Table 1. We can see from Table 1 that after the feature extraction network uses CBAM instead of C3, the Params are reduced by 7.6%, while the mAP is increased by 2.9%. Using CARAFE for kernel prediction and feature reorganization increases the Params by 0.13M, while the mAP is increased by 2.5%. Using EIoU does not affect Params and FLOPs, but the mAP is increased by 1.6%. Finally, after using the Cosine Warmup strategy to preheat and adjust the learning rate decay, the mAP is increased by 1%. Table 1 also shows that the original YOLOv5s has 7.03M of Params, 16.0G of FLOPs, and 66.9% of mAP for the three types of defects. After improvement, the Params of our model are reduced by 5.8%, the FLOPs are reduced by 1.9% and the mAP is increased by 8%.


Fig. 6. Detection results of scratches: (a) images to be examined; (b) results of the original YOLOv5s; (c) results of the improved YOLOv5s.

Fig. 7. Detection results of blemishes: (a) images to be examined; (b) results of the original YOLOv5s; (c) results of the improved YOLOv5s.

4.3 Comparative Experiments We compare our improved YOLOv5s with the original one, the Single Shot Multi-Box Detector (SSD) [31], YOLOv3, YOLOv7 and Faster R-CNN [32]. The results are listed in Table 2. From Table 2 one can see that the Params of SSD is 9.29 times larger than that of ours. Its mAP and FPS are 17.6% and 45.9% lower than ours. Moreover, the Params of YOLOv3 is 61.52M, and the mAP is 64.3% which is even worse than the original YOLOv5s. The Params of YOLOv7 is 5.51 times higher than ours, and the mAP is 1.2%

Table 1. Ablation results

Experiment number   CBAM   CARAFE   EIoU   Cosine Warmup   Params (M)   FLOPs (G)   mAP@0.5 (%)
N1                  -      -        -      -               7.03         16.0        66.9
N2                  ✓      -        -      -               6.49         15.4        69.8
N3                  ✓      ✓        -      -               6.62         15.7        72.3
N4                  ✓      ✓        ✓      -               6.62         15.7        73.9
N5                  ✓      ✓        ✓      ✓               6.62         15.7        74.9

lower than ours. The Params of Faster R-CNN is 20.7 times larger than that of ours. Its FPS is very small and its mAP is 11.1% lower than ours. Table 2 illustrates that the mAP, FPS and Params of the improved YOLOv5s are the best among the six algorithms. The mAP reaches 74.9%. The FPS increases by 3.1% compared with the original YOLOv5s and reaches 98, which satisfies the actual detection requirements. The Params is the smallest of them all.

Table 2. Comparison results of our model with five other detectors

Algorithms         mAP (%)   FPS   FLOPs (G)   Params (M)
YOLOv5s            66.9      95    16.0        7.03
SSD                57.3      62    1.1         22.20
YOLOv3             64.3      53    193.9       61.52
YOLOv7             73.7      86    103.5       36.50
Faster R-CNN       63.8      6     370.3       137.00
Improved YOLOv5s   74.9      98    15.7        6.62

5 Conclusion In this paper, we have proposed an improved YOLOv5s for glass tube defect detection by introducing CBAM, CARAFE, EIoU, and Cosine Warmup into the original YOLOv5s detector. We have confirmed the effectiveness of each component of the scheme through ablation experiments. Compared with the original model, the Params of the improved YOLOv5s are reduced by 5.8%, the FLOPs are decreased by 1.9%, and the mAP is increased by 8%. The FPS reaches 98, which meets the requirements of actual glass tube defect detection. Our future research will focus on further optimizing the detection accuracy and the number of parameters of the model.


Acknowledgments. This work is supported by the National Key R&D Program of China [grant number 2021YFF0603904] and the National Natural Science Foundation of China [grant number 61771155].

References 1. Li, C., et al.: A novel algorithm for defect extraction and classification of mobile phone screen based on machine vision. Comput. Ind. Eng. 146, 106530 (2020) 2. Taud, H., Mas, J.F.: Multilayer perceptron (MLP). In: Olmedo, M.T.C., Paegelow, M., Mas, J.-F., Escobar, F. (eds.) Geomatic approaches for modeling land change scenarios. LNGC, pp. 451–455. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-60801-3_27 3. Jin, Y., et al.: A fuzzy support vector machine-enhanced convolutional neural network for recognition of glass defects. Int. J. Fuzzy Syst. 21, 1870–1881 (2019) 4. Redmon, J. et al.: You only look once: unified, real-time object detection. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788. IEEE, Las Vegas, Nevada, USA (2016) 5. Terven, J., Cordova-Esparza, D.: A comprehensive review of YOLO: From YOLOv1 to YOLOv8 and beyond. arXiv: 2304.00501v2, 2023, Accessed 19 May 2020 6. Girshick, R. et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587. Columbus, OH, USA (2014) 7. Lu, J., et al.: A vehicle detection method for aerial image based on YOLO. J. Comput. Commun. 6(11), 98–107 (2018) 8. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517–6525. IEEE, Honolulu, HI, USA (2017) 9. Redmon, J., Farhadi, A.: YOLOv3: An incremental improvement. arXiv: 1804.02767v1, Accessed 8 Apr 2018 10. Bochkovskiy, A., Wang, C.: Liao H. YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv: 2004.10934v1, Accessed 23 Apr 2020 11. Ge, Z. et al.: Yolox: Exceeding yolo series in 2021. arXiv: 2107.08430v2, Accessed 6 Aug 2021 12. Li, C. et al.: YOLOv6: A single-stage object detection framework for industrial applications. arXiv: 2209.02976v1, Accessed 7 Sep 2022 13. Wang, C. et al.: CSPNet: a new backbone that can enhance learning capability of CNN. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1571–1580. IEEE, Seattle, WA, USA, 2020 14. Wang, C., Bochkovskiy, A., Liao, M.: YOLOv7: Trainable bag-of-freebies sets new state -of-the-art for real-time object detectors. arXiv: 2207.02696v1, Accessed 6 July 2022 15. Kim, J., Kim, N., Won, C.: High-Speed drone detection based on Yolo-V8. In: 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1– 2. IEEE, Rhodes Island, Greece (2023) 16. Park, H. et al.: C3: Concentrated-comprehensive convolution and its application to semantic segmentation. arXiv: 1812.04920v3, Accessed 28 July 2019 17. Yang, G. et al.: Face mask recognition system with YOLOV5 based on image recognition. In: 6th International Conference on Computer and Communications (ICCC), pp. 1398–1404. IEEE, Chengdu, China (2020)


18. Zheng, L. et al.: A fabric defect detection method based on improved YOLOv5. In: 7th International Conference on Computer and Communications (ICCC), pp. 620–624. IEEE, Chengdu, China (2021) 19. WOO, S. et al.: CBAM: convolutional block attention module. In: 16th European Conference on Computer Vision (ECCV), pp. 3–19. Springer, Munich, Germany (2018). https://doi.org/ 10.1007/978-3-030-01234-2_1 20. Wang, J. et al.: Carafe: content-aware reassembly of features. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 3007–3016. IEEE, Seoul, Korea (South) (2019) 21. Zhang, Y., et al.: Focal and efficient IOU loss for accurate bounding box regression. Neurocomputing 506, 146–157 (2022) 22. Števuliáková, P., Hurtik, P.: Intersection over Union with smoothing for bounding box regression. arXiv: 2303.15067v2, Accessed 28 Mar 2023 23. Loshchilov, I., Hutter, F.: Stochastic gradient descent with warm restarts. arXiv: 1608.03983v5, Accessed 3 May 2017 24. Xia, K., et al.: Mixed receptive fields augmented YOLO with multi-path spatial pyramid pooling for steel surface defect detection. Sensors 23(11), 5114 (2023) 25. Liu, P., et al.: A lightweight object detection algorithm for remote sensing images based on attention mechanism and YOLOv5s. Remote Sens. 15(9), 2429 (2023) 26. Lin, T. et al.: Feature pyramid networks for object detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 936–944. IEEE, Honolulu, HI, USA (2017) 27. Hosang, J., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6469–6477. IEEE, Honolulu, HI, USA (2017) 28. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7132–7141. IEEE, Salt Lake City, USA (2018) 29. Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 13708– 13717. IEEE, Nashville, TN, USA (2021) 30. Zheng, Z. et al.: Distance-IoU loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 12993–13000. AAAI, New York, USA (2020) 31. Liu, W. et al.: SSD: Single Shot MultiBox Detector. In: 14th European Conference on Computer Vision (ECCV), pp. 21–37. Springer, Amsterdam, The Netherlands (2016). https://doi. org/10.1007/978-3-319-46448-0_2 32. Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. Adv. Neural. Inf. Process. Syst. 28, 91–99 (2015)

Label Selection Approach to Learning from Crowds

Kosuke Yoshimura(B) and Hisashi Kashima

Kyoto University, Kyoto, Japan
[email protected], [email protected]

Abstract. Supervised learning, especially supervised deep learning, requires large amounts of labeled data. One approach to collect large amounts of labeled data is by using a crowdsourcing platform where numerous workers perform the annotation tasks. However, the annotation results often contain label noise, as the annotation skills vary depending on the crowd workers and their ability to complete the task correctly. Learning from Crowds is a framework which directly trains the models using noisy labeled data from crowd workers. In this study, we propose a novel Learning from Crowds model, inspired by SelectiveNet proposed for the selective prediction problem. The proposed method called Label Selection Layer trains a prediction model by automatically determining whether to use a worker’s label for training using a selector network. A major advantage of the proposed method is that it can be applied to almost all variants of supervised learning problems by simply adding a selector network and changing the objective function for existing models, without explicitly assuming a model of the noise in crowd annotations. The experimental results show that the performance of the proposed method is almost equivalent to or better than the Crowd Layer, which is one of the state-of-the-art methods for Deep Learning from Crowds, except for the regression problem case. Keywords: Crowdsourcing · Human Computation Human-in-the-Loop Machine Learning

1


Introduction

Supervised learning, especially deep supervised learning, requires large amounts of labeled data. One popular way to collect large amounts of labeled data is to use human annotators through crowdsourcing platforms for human-intelligence tasks such as image classification [10] and medical image classification [1]. To achieve an accurate prediction model, we require high-quality labeled data; however, since annotation skills vary depending on crowd workers, collecting properly labeled data at the required scale has limitations. Therefore, numerous studies have been conducted on obtaining higher-quality labeled data based on the annotation results collected from crowd workers [2,9,17,31,32]. A simple strategy is to collect responses from multiple crowd workers for each data instance, and to


aggregate the answers using majority voting or averaging. However, the majority voting fails to consider natural human differences in the skills and abilities of crowd workers. Dawid and Skene proposed a method to aggregate responses that considers workers’ abilities [9], which was originally aimed at integrating doctors’ examination results. Besides this, Whitehill et al. proposed GLAD, which further introduces the task’s difficulty [32] into the model in addition to the workers’ abilities. A more direct approach is to train machine learning prediction models directly from crowdsourced labeled data rather than from high-quality labeled data, which is Learning from Crowds [21]. Raykar et al. proposed a method that alternates between parameter updating and EM algorithm-based answer integration within a single model using crowdsourced answers [21]. Additionally, they experimentally demonstrated that their proposed method performs better than a two-stage approach that first integrates crowdsourced labeled data by majority voting and then trains the model with this data as input [21]. In addition to classical machine learning methods, Learning from Crowds has also been studied for deep learning models [1,5,6,24]. Rodrigues and Pereira stated that the EM algorithm-based Deep Learning from Crowds method poses a new challenge: how to schedule the estimation of true labels by the EM algorithm for updates of learnable parameters [24]. The reason is that the label estimation for the whole training dataset using the EM algorithm may be computationally expensive. Moreover, using the EM algorithm in each mini-batch may not contain sufficient information to correctly estimate the worker’s abilities. Rodrigues and Pereira proposed the Crowd Layer to solve this problem [24]. However, a problem with Crowd Layer and other existing methods is that they require explicitly designed models of the annotation noise given by crowd annotators. As the appropriate form of the noise model is task-dependent and is not necessarily obvious. In this study, we propose a Label Selection Layer for Deep Learning from Crowds. The proposed method automatically selects training data from the crowdsourced annotations. It is inspired by SelectiveNet [13], an architecture proposed in selective prediction. The Label Selection Layer can be added to any DNN model and can be applied simply by rewriting the loss function. The significant feature of the Label Selection Layer is that it does not require a generative model of worker annotation. This feature will allow us to easily apply deep learning from crowds to any machine learning problem setting. This advantage is more pertinent in complex tasks such as structured output prediction rather than in simple tasks such as classification and regression problems, where annotation generation models are easier to represent in confusion matrices or additive noises. Another feature of the proposed method is that it does not use the EM algorithm. Therefore, the issue of using the EM algorithm stated in the previous study [24] does not arise. The contributions of this study are fourfold: – We propose a Label Selection Layer inspired by SelectiveNet as a novel Deep Learning from Crowds method.


– We propose four variations of the Label Selection Layer: Simple, Class-wise, Target-wise, and Feature-based Label Selection Layer.
– We perform an experimental comparison of the proposed method with existing methods using real-world datasets to demonstrate its performance.
– We discuss the advantages and disadvantages of the proposed method based on the experimental results.

2 Related Work

This section summarizes related work from two aspects; we first review the existing research related to Learning from Crowds, which learns prediction models from noisy crowdsourced annotations. We next introduce selective prediction and related problems, which allow prediction models to neglect “too hard” instances. In particular, we explain in detail SelectiveNet [13], which is one of the typical solutions to the selective prediction problem, because this is the direct foundation of our proposed method.

2.1 Learning from Crowdsourced Annotations

Extensive research was conducted for obtaining high-quality labels from noisy labels collected in crowdsourcing platforms [2–4,17,26,31,32]. As the quality of answers obtained from crowd workers is not always high, the most common method is to collect answers with redundancy [28]. The simplest method is to collect responses from multiple workers for each task and merge these responses by majority voting. Dawid and Skene proposed a method to estimate the true label based on the estimated latent variables of each worker’s ability using an EM algorithm [9]. Their original goal was to estimate the correct medical examination result when there were multiple doctors’ examinations for patients. Whitehill et al. proposed a label aggregation method by simultaneously modeling the worker’s abilities and task difficulties as latent variables [32]. Welinder et al. proposed a response aggregation method that models worker skills and object attributes using multidimensional vectors [31]. Oyama et al. proposed a response aggregation method to collect the confidence scores of crowd workers for their own answers and to estimate both the confidence scores and the worker’s ability [17]. Baba and Kashima proposed a quality control method for more general tasks such as those requiring unstructured responses [2]. Their approach is to have a worker’s answers scored by another worker and to model respondent ability and scorer bias with a graded response model. For complex tasks such as translation and ranking, Braylan and Lease proposed three modeling methods focusing on the distance between labels rather than the labels themselves. Their methods apply to tasks where requesters can appropriately define the distance function between labels [3]. Braylan and Lease also proposed a framework for response integration by decomposing and merging responses for several complex tasks, such as text span labeling and bounding box annotation [4]. This framework can

210

K. Yoshimura and H. Kashima

handle multi-object annotation tasks that distance-based methods cannot. As a specialized method for sequential labeling, Sabetpour et al. proposed AggSLC, which considers worker’s responses, confidence, prediction results by machine learning model, and the characteristics of the target sequential labeling task [26]. Wang and Dang proposed a method for generating appropriate sentences for a sentence-level task using a transformer-based generative model [30]. This generative model inputs sentences that workers answered and outputs an integrated sentence. Wang and Dang also proposed a method to create pseudo-sentences as training data for the model [30]. Extensive research has been conducted to obtain better-performing predictors using crowdsourced labeled data [1,5,6,21,24]. One intuitive approach is aggregating labels collected from multiple workers and then training them using the aggregated labels. By contrast, Raykar et al. proposed a framework called Learning from Crowds, which directly trains a machine learning model endto-end using the answers obtained from the crowd workers as input [21]. Their method alternately aggregates labels based on the EM algorithm and updates the model’s parameters in a single model. Kajino et al. stated that many learningfrom-crowds approaches have non-convex optimization problems and proposed a new learning-from-crowds approach to make the problem convex by using individual classifiers [14]. Chu et al. assumed that the causes of label noise can be decomposed into common worker factors and individual worker factors and proposed CoNAL, which models these factors [8]. Takeoka et al. successfully improved the classifier’s performance by using the information that workers answered that they did not know in the context of learning from crowds [29]. Albarqouni et al. proposed a framework for estimating the true label from crowdsourced responses based on an EM algorithm and using it to train a deep learning model, in the context of a medical image classification problem [1]. Rodrigues and Pereira investigated the Learning from Crowds method based on the EM algorithm in case of deep learning models [24]. The EM algorithmbased method had drawbacks such as it required adjustment of the timing of updating latent variables and learning parameters, and it was computationally expensive. Rodrigues and Pereira proposed Crowd Layer that is directly connected to the output layer of the DNNs and acts as a layer that transforms the output results into the answers of each annotator [24]. Although the nature of the Crowd Layer limits it learning to predict the labels correctly, their experiments have shown that DNN models with the Crowd Layer can produce a highperformance predictor. Chen et al. proposed SpeeLFC, which can represent each annotator’s answer in the form of conditional probability, given the prediction of the output layer, because of the lack of interpretability of the Crowd Layer parameters compared to the probabilistic approaches [6]. They focused on the security aspect of learning-from-crowds systems. They stated that learning-fromcrowds models are vulnerable to adversarial attacks. Therefore, they proposed A-LFC, which uses adversarial examples in the training step for this issue [5].

2.2 Selective Prediction

Machine learning models do not always return correct outputs. Therefore, selecting only highly accurate predictions is one way to obtain a more reliable predictor. This problem setup is called Selective Prediction [13], Reject Option [7], and Learning to Defer [16], with minor differences in definition among them. Research in selective prediction has been conducted since 1970, initially as machine learning with a reject option [7]. There is a wide range of research related to selective prediction; here, we describe research on selective prediction in the case of deep learning models. One simple and effective method is to set a threshold on the confidence score obtained from the prediction model [7]. Geifman and El-Yaniv studied the application of this framework to deep learning [12]. They also proposed SelectiveNet, which allows end-to-end selective prediction in a single model [13]. We explain the details of SelectiveNet [13], which is the inspiration for our proposed method. SelectiveNet is a neural network model for selective prediction, and it consists of a base model whose role is to extract features and three final output layers connected to it. The three final output layers are the prediction head, the selective head, and the auxiliary head. The prediction head outputs a prediction for the target variable (denoted as f(x)). The selective head selects whether to adopt the output of the prediction head (denoted as g(x)). The auxiliary head (denoted as h(x)) is a structure used to stabilize the training and, similar to the prediction head, it predicts the target variable; however, SelectiveNet only uses the auxiliary head in the training step. SelectiveNet aims to train the model so that, for any given set S = {(x_n, y_n)}_{n=1}^m, where x_n is the input feature of the n-th instance and y_n is its true label, the empirical selective risk is minimal while keeping the empirical coverage at a predefined value. The empirical selective risk is defined by

r̂(f, g|S) = ( (1/m) Σ_{n=1}^{m} ℓ(f(x_n), y_n) · g(x_n) ) / φ̂(g|S),  (1)

(3)

where λ > 0 is a hyperparameter that balances the two terms, and Ψ (a) = 2 max (0, a) . Here, the loss function for the auxiliary head is defined as Lh =

m 1  (h(xn ), yn ), m n=1

(4)

212

K. Yoshimura and H. Kashima

Fig. 1. The structure using Simple Label Selection Layer in the case of four workers. The ground truth label of this input data is ‘Cat.’ Because the annotated labels by workers i = 3, 4 are incorrect, label selector g(3) and g(4) inactivate loss for labels by workers i = 3, 4, ‘Dog’ and ‘Fox,’ respectively

and the overall objective function of SelectiveNet is

L = \alpha L_{(f,g)} + (1 - \alpha) L_h,   (5)

where α is a hyperparameter.
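To make the objective above concrete, the following is a minimal PyTorch-style sketch of Eqs. (1)-(5) under our own assumptions: the tensor names (`f_out`, `g_out`, `h_out`) and default hyperparameter values are illustrative choices, not code released with [13].

```python
import torch
import torch.nn.functional as F

def selectivenet_loss(f_out, g_out, h_out, y, c=0.8, lam=32.0, alpha=0.5):
    """Hedged sketch of the SelectiveNet objective (Eqs. (1)-(5)).

    f_out, h_out: (batch, n_classes) logits of the prediction / auxiliary heads.
    g_out:        (batch,) selection scores in (0, 1) from the selective head.
    y:            (batch,) integer class labels.
    """
    per_example = F.cross_entropy(f_out, y, reduction="none")              # l(f(x_n), y_n)
    coverage = g_out.mean()                                                # Eq. (2)
    selective_risk = (per_example * g_out).mean() / coverage.clamp_min(1e-8)  # Eq. (1)
    penalty = torch.clamp(c - coverage, min=0.0) ** 2                      # Psi(c - coverage)
    loss_fg = selective_risk + lam * penalty                               # Eq. (3)
    loss_h = F.cross_entropy(h_out, y)                                     # Eq. (4)
    return alpha * loss_fg + (1 - alpha) * loss_h                          # Eq. (5)
```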

3 Label Selection Layer

We propose a novel learning-from-crowds method inspired by SelectiveNet [13], called the Label Selection Layer. The proposed method selects only reliable labels for training. It can be added to any DNN model and applied simply by rewriting the loss function.

3.1 Problem Settings

The goal of the learning-from-crowds scenario is to train an accurate model using labels given by annotators. We define f (= o ∘ h) as a DNN model, where o is an output layer and h is a feature extractor. Let {(x_n, y_n)}_{n=1}^{N} be a training data set, where x_n ∈ R^d is a d-dimensional feature vector and y_n = {y_n^i}_{i∈I} is an annotation vector provided by |I| annotators, with y_n^i representing the label given to the n-th sample by the i-th annotator. Note that not every annotator necessarily annotates every sample. We assume that each sample has a latent ground-truth label. In the K-class classification setting, for example, y_n^i is an element of {1, 2, ..., K}, where each integer value denotes one of the candidate labels.
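As an illustration of this problem setting, the following minimal sketch (ours, not code from the paper) stores crowd annotations as a dense label matrix with a missing-label marker; the sizes are illustrative placeholders.

```python
import torch

# Minimal illustration of the learning-from-crowds data layout described above.
# y_crowd[n, i] holds worker i's label for sample n, or MISSING if worker i
# did not annotate that sample (annotators need not label every sample).
MISSING = -1

N, D, I, K = 1000, 8, 59, 8            # samples, feature dim, workers, classes (illustrative)
X = torch.randn(N, D)                   # feature vectors x_n in R^d
y_crowd = torch.full((N, I), MISSING)   # annotation vectors y_n = {y_n^i}

def annotation_mask(y_crowd):
    """Boolean mask of which (sample, worker) pairs actually carry a label."""
    return y_crowd != MISSING
```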

3.2 Label Selection Layer

The loss function of the proposed method reflects only the losses for the labels selected by the Label Selection Layer. Let g(i, l, x, h) be the label selector function corresponding to the label l given to a sample x by the i-th worker, computed using the


feature extractor h of the DNN model f. In this case, we can write the adoption rate φ of the training data over the whole annotation set A from crowd workers as

\phi(g|A) = \frac{1}{|A|}\sum_{n}\sum_{y_n^i \in A_n} g(i, y_n^i, x_n, h),   (6)

where A_n denotes the set of all annotations for the n-th sample. From this equation, the empirical risk of a DNN model f that learns by selectively using annotated labels is

\hat{r}(f, g|A) = \frac{\frac{1}{|A|}\sum_{n}\sum_{y_n^i \in A_n} \ell(f(x_n), y_n^i)\, g(i, y_n^i, x_n, h)}{\phi(g|A)},   (7)

where \ell(\cdot,\cdot) is a loss function chosen according to the target task; for example, cross-entropy loss for classification, mean squared error for regression, and so on. The numerator of Equation (7) is the loss value for each annotated label weighted by the output of the corresponding label selector. To restrict the adoption rate φ of the training data using the hyperparameter c, the final loss function used in training can be expressed as

L(f, g) = \hat{r}(f, g|A) + \lambda \Psi(c - \phi(g|A)),   (8)

where λ > 0 is a hyperparameter that balances the two terms and Ψ(a) = (max(0, a))^2. In the inference phase, we use only the predicted value f(x) as usual; the label selection layer selects useful training data only in the training phase.

Here, we describe the differences between the proposed method and SelectiveNet. The most significant difference lies in what the selection mechanism selects. The selective head g of SelectiveNet weights the loss \ell(f(x_n), y_n) between the ground truth and the prediction, and therefore takes a higher value for correct predictions. In contrast, the label selection layer g of the proposed method weights the loss \ell(f(x_n), y_n^i) between each worker's answer and the prediction, and therefore takes a higher value for workers' answers that are the same as (or close to) the prediction.

We now discuss possible choices of the label selector function g. We propose four variations of the label selection layer: the Simple, Class-wise, Target-wise, and Feature-based Label Selection Layers.

Simple Label Selection Layer. The Simple Label Selection Layer defines the label selector g using only one learnable scalar weight parameter w_i per worker. Thus, we define g(i, l, x, h) = g(i) = σ(w_i), where σ(·) is the sigmoid function. Figure 1 illustrates the Simple Label Selection Layer. This approach assumes that each worker's annotation skill is constant across samples. Although its expressive power is limited owing to the small number of parameters, it has the advantage that it can be combined with any DNN model independent of the problem setup.


Class-Wise Label Selection Layer. The Class-wise Label Selection Layer performs label selection in classification problems. It defines the label selector g using learnable scalar weight parameters w_{(i,l)} corresponding to each combination of worker and annotated label value. Thus, we define g(i, l, x, h) = g(i, l) = σ(w_{(i,l)}). This method assumes that each worker's annotation skill depends only on the label assigned by the worker and not on the individual sample. Although it applies only to classification problems, it is more expressive than the Simple Label Selection Layer.

Target-Wise Label Selection Layer. The Target-wise Label Selection Layer performs label selection in regression problems. It defines the label selector g using a learnable scalar weight parameter w_i and a scalar bias parameter b_i for each worker, together with the continuous target value l. Thus, we define g(i, l, x, h) = g(i, l) = σ((l · w_i + b_i)^{d_0}), where d_0 is a hyperparameter. This method assumes that each worker's annotation skill depends only on the continuous label given by the worker and not on the individual sample. Although it applies only to regression problems, it is more expressive than the Simple Label Selection Layer.

Feature-Based Label Selection Layer. The Feature-based Label Selection Layer defines the label selector g using the feature extractor output h(x) of the DNN model f and a learnable weight vector W_i per worker. Thus, we define g(i, l, x, h) = g(i, x, h) = σ(W_i h(x)). This method assumes that the correctness of each worker's annotation depends on who labels which sample.
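The selectors above need very few parameters. The following PyTorch sketch is our own illustration of three of the variants (module and argument names are ours); the Target-wise variant is analogous, using a per-worker affine map of the continuous label.

```python
import torch
import torch.nn as nn

class SimpleSelector(nn.Module):
    """g(i) = sigmoid(w_i): one learnable scalar per worker."""
    def __init__(self, n_workers):
        super().__init__()
        self.w = nn.Parameter(torch.zeros(n_workers))
    def forward(self, worker_ids, labels=None, features=None):
        return torch.sigmoid(self.w[worker_ids])

class ClassWiseSelector(nn.Module):
    """g(i, l) = sigmoid(w_{i,l}): one scalar per (worker, class) pair."""
    def __init__(self, n_workers, n_classes):
        super().__init__()
        self.w = nn.Parameter(torch.zeros(n_workers, n_classes))
    def forward(self, worker_ids, labels, features=None):
        return torch.sigmoid(self.w[worker_ids, labels])

class FeatureBasedSelector(nn.Module):
    """g(i, x, h) = sigmoid(W_i h(x)): one weight vector per worker."""
    def __init__(self, n_workers, feat_dim):
        super().__init__()
        self.W = nn.Parameter(torch.zeros(n_workers, feat_dim))
    def forward(self, worker_ids, labels, features):
        return torch.sigmoid((self.W[worker_ids] * features).sum(dim=-1))
```

With any of these selectors, the per-annotation losses in Eq. (7) are weighted by the selector outputs before the coverage penalty of Eq. (8) is applied, so the selector module slots into an ordinary training loop without changing the base DNN.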

4 Experiments

This section describes comparative experiments between the proposed methods and existing methods. We conducted the experiments on three real datasets covering classification, regression, and named entity recognition problems. Table 1 shows the base model architectures used in each experimental setting. We used Adam [15] to train all the models. All the models used in the experiments were implemented in PyTorch [19] and PyTorch Lightning [11]. The code is available at https://github.com/ssatsuki/label-selection-layer.

4.1 Image Classification

We first address a multi-class classification problem using the LabelMe dataset, which was also used in the previous study [24]. This dataset consists of 11,688 images, and each image is assigned one of eight candidate classes. The dataset was divided into three parts: training, validation, and test data with 1,000, 500, and 1,180 labels, respectively. The validation and test data were assigned ground truth labels, and the validation data were used to determine hyperparameters and when to stop training. Only labels annotated by 59


Table 1. The base model architecture used for the experiments with the LabelMe, the Movie Reviews, and CoNLL-2003 NER datasets. Descriptions of each layer and function are in the form of “Input Size, Details.” The number in parentheses indicates the kernel size

crowd workers were available as training data when training the proposed and conventional methods. The annotation results were collected from Amazon Mechanical Turk [25]. We used the LabelMe data with the same preprocessing as in the previous work [24]. We set the maximum number of epochs to 50 and the mini-batch size to 64. The evaluation metrics of this experiment are accuracy, one-vs-rest macro AUC, macro precision, and macro recall.

For comparison with the proposed methods, we used the base model trained using ground truth and the Crowd Layer combined with the output layer of the base model. As variations of the Crowd Layer, we selected MW, VW, VB, and VW+B proposed in [24]. These variants apply an annotator-specific transformation to the K-dimensional output vector σ: a K × K trainable weight matrix W_i (MW), an element-wise (Hadamard-product) trainable weight vector w_i (VW), a K-dimensional trainable bias vector b_i (VB), and both the element-wise weights and the bias (VW+B), respectively. Regarding the hyperparameters of the proposed method, we fixed λ = 32 and selected the c with the best performance on the validation data. We ran each method 100 times, and the average performance is shown in Table 2.

The comparison experiments on the LabelMe dataset confirmed that one of the proposed methods, the Feature-based Label Selection Layer, performed better than the Crowd Layer in all evaluation metrics. However, the performance of the remaining proposed methods is worse than that of the Crowd Layer. Based


Table 2. The results of accuracy comparison for the image classification dataset: LabelMe. Higher is better in all evaluation metrics. Bold indicates the best average value of the results of 100 trials, excluding those trained using Ground Truth Label

| Method | Acc | OvR Macro AUC | Macro Prec | Macro Rec |
| w/ Ground Truth | .907 (±.0056) | .994 (±.0005) | .911 (±.0053) | .911 (±.0046) |
| Crowd Layer [24] MW | .824 (±.0266) | .979 (±.0262) | .832 (±.0419) | .833 (±.0295) |
| Crowd Layer [24] VB | .824 (±.0153) | .983 (±.0105) | .834 (±.0230) | .832 (±.0157) |
| Crowd Layer [24] VW | .816 (±.0096) | .983 (±.0012) | .828 (±.0079) | .825 (±.0091) |
| Crowd Layer [24] VW+B | .824 (±.0089) | .984 (±.0010) | .834 (±.0069) | .832 (±.0083) |
| (Proposed) Label Selection Layer Simple | .796 (±.0091) | .978 (±.0012) | .811 (±.0080) | .804 (±.0091) |
| (Proposed) Label Selection Layer Class-wise | .796 (±.0092) | .979 (±.0012) | .813 (±.0074) | .804 (±.0094) |
| (Proposed) Label Selection Layer Feature-based | .839 (±.0114) | .985 (±.0014) | .849 (±.0096) | .848 (±.0110) |

Table 3. The results of comparison for the regression dataset: Movie Reviews. Bold indicates the best average value of the results of 100 trials, excluding those trained using Ground Truth Label

| Method | MAE (↓) | RMSE (↓) | R2 (↑) |
| w/ Ground Truth | 1.061 (±.050) | 1.346 (±.066) | 0.447 (±.055) |
| Crowd Layer [24] S | 1.188 (±.077) | 1.465 (±.079) | 0.344 (±.073) |
| Crowd Layer [24] B | 1.120 (±.039) | 1.465 (±.047) | 0.393 (±.041) |
| Crowd Layer [24] S+B | 1.140 (±.049) | 1.419 (±.049) | 0.386 (±.043) |
| (Proposed) Label Selection Layer Simple | 1.162 (±.093) | 1.442 (±.095) | 0.364 (±.088) |
| (Proposed) Label Selection Layer Target-wise | 1.204 (±.050) | 1.494 (±.055) | 0.318 (±.053) |
| (Proposed) Label Selection Layer Feature-based | 1.173 (±.063) | 1.460 (±.068) | 0.349 (±.061) |

on these results, it is possible that, in the classification problem setting, using features for label selection can improve performance.

4.2 Rating Regression

We use the Movie Reviews dataset [18,22] for rating regression. This dataset consists of English review comments collected from the Internet in [18], and crowd workers on Amazon Mechanical Turk predicted the writers' review scores from the comments [22]. Each annotated score takes a value between 0 and 10, but the scores are not necessarily integers. There were 5,006 reviews in English, divided into 1,498 and 3,508 review comments for the training and test sets, respectively. All 135 workers predicted scores for this training set. We normalized each score y_n^i given by worker i for review comment n using the mean \bar{y} and standard deviation s of the ground truth scores of the training data as (y_n^i − \bar{y}) / s. We converted each review comment by the tokenizer into a sequence of up to 1,000 tokens.


Table 4. The results of comparison for the named entity recognition dataset: CoNLL-2003 NER. Bold indicates the best average value of the results of 100 trials, excluding those trained using Ground Truth Label

| Method | Precision (↑) | Recall (↑) | F1 score (↑) |
| w/ Ground Truth | 65.04 (±2.24) | 66.33 (±1.08) | 65.66 (±1.35) |
| Crowd Layer [24] MW (pretrained w/ MV) | 54.95 (±2.14) | 47.79 (±1.61) | 51.11 (±1.61) |
| Crowd Layer [24] VB (pretrained w/ MV) | 61.33 (±1.85) | 33.71 (±2.07) | 43.47 (±1.84) |
| Crowd Layer [24] VW (pretrained w/ MV) | 62.42 (±2.13) | 36.81 (±1.75) | 46.28 (±1.70) |
| Crowd Layer [24] VW+B (pretrained w/ MV) | 61.31 (±2.08) | 36.32 (±1.63) | 45.59 (±1.50) |
| (Proposed) Label Selection Layer Simple (pretrained w/ MV) | 64.03 (±1.53) | 33.59 (±1.88) | 44.03 (±1.78) |
| (Proposed) Label Selection Layer Class-wise (pretrained w/ MV) | 63.84 (±1.41) | 35.33 (±1.84) | 45.46 (±1.71) |
| (Proposed) Label Selection Layer Feature-based (pretrained w/ MV) | 63.49 (±1.71) | 31.60 (±2.07) | 42.17 (±2.03) |

We used pre-trained GloVe [20] as the embedding layer and froze the embedding parameters. We set the maximum number of epochs to 100 and the mini-batch size to 128. The evaluation metrics of this experiment are mean absolute error (MAE), root mean squared error (RMSE), and the R2 score. For comparison with the proposed method, we used a base model trained using ground truth and a Crowd Layer combined with the output layer of the base model. As variations of the Crowd Layer, we used S, B, and S+B proposed in [24]. S, B, and S+B use s_i μ, μ + b_i, and s_i μ + b_i, respectively, where s_i is a trainable scaling scalar, b_i is a trainable bias scalar, and μ is the output scalar. Regarding the hyperparameters of the proposed methods, we set λ = 32 and selected the best c from {0.1, 0.2, ..., 0.9}. The hyperparameter of the Target-wise Label Selection Layer was fixed to d_0 = 3. We ran each method 100 times, and the average performance is shown in Table 3. All of the proposed variants performed worse than the best Crowd Layer variant. This is probably because the proposed method does not support learning from scaled or shifted annotation results, which is possible with the Crowd Layer.

4.3 Named Entity Recognition

As a learning task with a more complex output form, we address named entity recognition (NER). We used a dataset consisting of the CoNLL-2003 shared task data [27] and newly collected annotations based on it in [22]. We call it the CoNLL-2003 NER dataset in this study. The CoNLL-2003 NER dataset consists of 9,235 text sequences, split into training, validation, and test data with 4,788, 1,197, and 3,250 sequences, respectively.¹ For the sequences in the training data, annotation results were collected from 49 workers on Amazon Mechanical Turk in [23]. The validation data were used to determine hyperparameters and when to stop training.

¹ Note that the experimental setup in the previous study [24] differs from our split because they only split the data into training and test sets.


We used pre-trained GloVe [20] as the embedding layer; the parameters of this embedding layer are trainable. We set the maximum number of epochs to 30 and the mini-batch size to 64. The evaluation metrics of this experiment are recall, precision, and F1 score. For comparison with the proposed method, we used the base model trained using ground truth and the Crowd Layer combined with the output layer of the base model. As variations of the Crowd Layer, we selected MW, VW, and VW+B, the same as in the LabelMe experiments. On the CoNLL-2003 NER dataset, we did not succeed in training the Crowd Layer model directly on the worker annotation set. Therefore, referring to the published experimental code of the Crowd Layer [24], we first pre-trained the DNN model for 5 epochs using labels obtained by aggregating the annotators' answers with majority voting; we then connected the Crowd Layer to the DNN model and continued training with the worker annotations. We trained the proposed method in the same way as the Crowd Layer. Regarding the hyperparameters of the proposed method, we fixed λ = 32 and selected the c with the best performance on the validation data. We ran each method 100 times, and the average performance is shown in Table 4.

The results show that all the proposed methods performed better than any variation of the Crowd Layer in terms of precision. Because the threshold-based adjustment of the precision/recall trade-off used in binary classification is not available here, the proposed method, which achieves high precision without such adjustment, is helpful when mistakes are hard to tolerate. A notable feature of the Label Selection Layer is that no generative model of (errors in) worker annotations is required. This advantage is particularly evident in rather complex tasks such as structured output prediction, as in this NER task, in contrast to the Crowd Layer, which requires correct assumptions about the transformation from true label sequences to the sequences given by the workers.
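For readers unfamiliar with the majority-voting pre-training step mentioned above, the following is a minimal sketch of token-level majority voting over worker tag sequences; the function name and tag values are our own illustrative choices, not the authors' implementation.

```python
from collections import Counter

def majority_vote(worker_tag_sequences):
    """Token-level majority voting over NER tag sequences from several workers.

    worker_tag_sequences: list of equal-length tag lists, one per worker that
    annotated the sentence (workers who skipped the sentence are simply absent).
    Returns one aggregated tag sequence used to pre-train the base model.
    """
    length = len(worker_tag_sequences[0])
    aggregated = []
    for pos in range(length):
        votes = Counter(seq[pos] for seq in worker_tag_sequences)
        aggregated.append(votes.most_common(1)[0][0])
    return aggregated

# Example: three workers tagging a 4-token sentence.
print(majority_vote([
    ["B-PER", "O", "O", "B-LOC"],
    ["B-PER", "O", "O", "O"],
    ["O",     "O", "O", "B-LOC"],
]))  # -> ['B-PER', 'O', 'O', 'B-LOC']
```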

5 Analysis

Finally, we analyzed how well the proposed methods capture the quality of each worker's responses. Table 5 shows the Pearson correlation coefficients between the mean label selection score over the training examples corresponding to each worker and that worker's actual accuracy or RMSE. Since a higher selection score has a larger impact on model training, workers with high accuracy or low RMSE should receive higher selection scores for a high-performance model to be learned. For LabelMe, all of the proposed methods show a relatively high positive correlation, which suggests that all of them can estimate the quality of workers' responses. For CoNLL-2003, the Simple Label Selection Layer shows a slightly positive correlation, while the Feature-based Label Selection Layer shows almost no correlation on average. This suggests that the NER task has a complex structure, and therefore a simple model is more likely to learn well than a complex one. The fact that the Class-wise Label Selection


Table 5. For each training dataset, the mean and standard deviation over 30 trials of Pearson's correlation coefficient between the metric score based on each worker's answers and the mean of the outputs of the selection layer. N/A means that the proposed method cannot be applied to the target dataset. Except for the Feature-based Label Selection Layer on CoNLL-2003, the proposed methods show a positive correlation when the metric is accuracy and a negative correlation when the metric is RMSE, which indicates that they estimate the quality of the workers' responses correctly

| Dataset (Metric) | Simple | Class-wise | Target-wise | Feature-based |
| LabelMe (Acc.) | 0.54 (±0.04) | 0.34 (±0.00) | N/A | 0.44 (±0.05) |
| MovieReview (RMSE) | −0.68 (±0.11) | N/A | −0.15 (±0.05) | −0.61 (±0.10) |
| CoNLL-2003 (Acc.) | 0.19 (±0.18) | 0.90 (±0.00) | N/A | −0.03 (±0.15) |

Layer shows a very high correlation may be due to the characteristics of the dataset: the correlation might be overestimated because of label imbalance, with most tokens carrying the O tag, which means they are not named entities. For MovieReview, the method with the stronger negative correlation is considered better at capturing the quality of the workers' actual responses. All of the proposed methods successfully capture this quality, as they show negative correlations with the RMSE on the MovieReview dataset. Since the absolute value of the correlation coefficient is smaller for the Target-wise variant than for the other two methods, we consider that it does not capture the quality of each worker's responses well.

6 Conclusion

We proposed a new approach to deep learning from crowds: the Label Selection Layer (LSL). The proposed method is inspired by SelectiveNet [13], which was proposed in the context of selective prediction, and trains models by selectively using the labels given by the workers. The LSL can be applied to any deep learning model simply by adding the layer and rewriting the loss function. This flexibility is possible because the LSL does not assume an internal generative model. As variations of the LSL, we proposed the Simple, Class-wise, Target-wise, and Feature-based LSL. In the regression setting, the proposed method was worse than the Crowd Layer, but in the classification and named entity recognition settings, the proposed method performed as well as or better than the Crowd Layer. In particular, the experiments in the named entity recognition setting showed that all proposed method variations outperform any Crowd Layer variation in precision. We demonstrated that the proposed method can easily be applied to complex tasks such as structured output prediction and achieves high precision. The proposed method's performance and the analysis results suggest that the Feature-based Label Selection model is superior for classification problems, whereas the Simple Label Selection model is superior for problems with complex structure, such as the NER setting.


In future work, we will examine whether the proposed method achieves higher performance in learning-from-crowds settings for various kinds of structured data, so that it can be applied to different problem settings without significant changes. Additionally, we plan to evaluate further selector variations, as a more fine-grained selection of labels is likely possible by using both the labels given by workers and the features provided as input to the model.

References

1. Albarqouni, S., Baur, C., Achilles, F., Belagiannis, V., Demirci, S., Navab, N.: AggNet: deep learning from crowds for mitosis detection in breast cancer histology images. IEEE Trans. Med. Imaging 35, 1313–1321 (2016)
2. Baba, Y., Kashima, H.: Statistical quality estimation for general crowdsourcing tasks. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2013)
3. Braylan, A., Lease, M.: Modeling and aggregation of complex annotations via annotation distances. In: Proceedings of The Web Conference 2020 (2020)
4. Braylan, A., Lease, M.: Aggregating complex annotations via merging and matching. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2021)
5. Chen, P., Sun, H., Yang, Y., Chen, Z.: Adversarial learning from crowds. In: Proceedings of the AAAI Conference on Artificial Intelligence (2022)
6. Chen, Z., et al.: Structured probabilistic end-to-end learning from crowds. In: Proceedings of the 29th International Joint Conference on Artificial Intelligence (2020)
7. Chow, C.: On optimum recognition error and reject tradeoff. IEEE Trans. Inf. Theory 16, 41–46 (1970)
8. Chu, Z., Ma, J., Wang, H.: Learning from crowds by modeling common confusions. In: Proceedings of the AAAI Conference on Artificial Intelligence (2021)
9. Dawid, A.P., Skene, A.M.: Maximum likelihood estimation of observer error-rates using the EM algorithm. J. Roy. Stat. Soc. Series C (Appl. Stat.) 28, 20–28 (1979)
10. Deng, J., et al.: ImageNet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)
11. Falcon, W., The PyTorch Lightning team: PyTorch Lightning (2019)
12. Geifman, Y., El-Yaniv, R.: Selective classification for deep neural networks. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (2017)
13. Geifman, Y., El-Yaniv, R.: SelectiveNet: a deep neural network with an integrated reject option. In: Proceedings of the 36th International Conference on Machine Learning (2019)
14. Kajino, H., Tsuboi, Y., Kashima, H.: A convex formulation for learning from crowds. In: Proceedings of the AAAI Conference on Artificial Intelligence (2012)
15. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: 3rd International Conference on Learning Representations (2015)
16. Mozannar, H., Sontag, D.: Consistent estimators for learning to defer to an expert. In: Proceedings of the 37th International Conference on Machine Learning (2020)
17. Oyama, S., Baba, Y., Sakurai, Y., Kashima, H.: Accurate integration of crowdsourced labels using workers' self-reported confidence scores. In: Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence (2013)


18. Pang, B., Lee, L.: Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (2005)
19. Paszke, A., et al.: PyTorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
20. Pennington, J., Socher, R., Manning, C.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (2014)
21. Raykar, V.C., et al.: Learning from crowds. J. Mach. Learn. Res. 11, 1297–1322 (2010)
22. Rodrigues, F., Lourenço, M., Ribeiro, B., Pereira, F.C.: Learning supervised topic models for classification and regression from crowds. IEEE Trans. Pattern Anal. Mach. Intell. 39, 2409–2422 (2017)
23. Rodrigues, F., Pereira, F., Ribeiro, B.: Sequence labeling with multiple annotators. Mach. Learn. 95, 165–181 (2014)
24. Rodrigues, F., Pereira, F.C.: Deep learning from crowds. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence (2018)
25. Russell, B.C., Torralba, A., Murphy, K.P., Freeman, W.T.: LabelMe: a database and web-based tool for image annotation. Int. J. Comput. Vis. 44, 157–173 (2008)
26. Sabetpour, N., Kulkarni, A., Xie, S., Li, Q.: Truth discovery in sequence labels from crowds. In: 2021 IEEE International Conference on Data Mining (2021)
27. Tjong Kim Sang, E.F., De Meulder, F.: Introduction to the CoNLL-2003 shared task: language-independent named entity recognition. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 (2003)
28. Sheng, V.S., Provost, F., Ipeirotis, P.G.: Get another label? Improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2008)
29. Takeoka, K., Dong, Y., Oyamada, M.: Learning with unsure responses. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence (2020)
30. Wang, S., Dang, D.: A generative answer aggregation model for sentence-level crowdsourcing task. IEEE Trans. Knowl. Data Eng. 34, 3299–3312 (2022)
31. Welinder, P., Branson, S., Perona, P., Belongie, S.: The multidimensional wisdom of crowds. In: Advances in Neural Information Processing Systems (2010)
32. Whitehill, J., Wu, T.F., Bergsma, J., Movellan, J., Ruvolo, P.: Whose vote should count more: optimal integration of labels from labelers of unknown expertise. In: Advances in Neural Information Processing Systems (2009)

New Stability Criteria for Markov Jump Systems Under DoS Attacks and Packet Loss via Dynamic Event-Triggered Control

Huizhen Chen1, Haiyang Zhang1,2(B), Lianglin Xiong1,3(B), and Shanshan Zhao1

1 School of Mathematics and Computer Science, Yunnan Minzu University, Kunming 650500, China
2 Faculty of Mechanical and Electrical Engineering, Kunming University of Science and Technology, Kunming 650500, China
[email protected]
3 School of Media and Information Engineering, Yunnan Open University, Kunming 650504, China
[email protected]

Abstract. In this paper, the exponential mean square stability of Markov jump systems under packet loss and denial-of-service (DoS) attacks is studied via a dynamic event-triggered mechanism (DETM). Different from existing results, this paper considers not only the impact of periodic DoS attacks on the system but also random packet loss during the sleeping period of DoS attacks. Firstly, the Bernoulli distribution is used to model the phenomenon of random packet loss, and the zero-input strategy is used to counteract the impact of DoS attacks and random packet loss on the system. Then, based on the DETM, different controllers are designed for the two stages of DoS attacks, and a new switched Markov jump system model is obtained. Different from the previous piecewise Lyapunov-Krasovskii functional approach, this paper obtains less conservative stability criteria by constructing a common Lyapunov-Krasovskii functional. Finally, the effectiveness of the proposed method is illustrated through a simulation experiment.

Keywords: DoS attacks · Packet loss · Dynamic event-triggered mechanism · Markov jump systems · Exponential mean square stability · Common Lyapunov-Krasovskii functional

1 Introduction

As a special kind of hybrid system, Markov jump systems can well simulate structural changes in dynamic systems. Owing to this characteristic, Markov jump systems have attracted the attention of many scholars [1,2]. For instance, a switched Markov jump model is constructed for the case of network data failure caused by DoS attacks in [3].


In recent years, wireless local area networks have been widely used owing to their convenient transmission and high speed. At the same time, however, they have introduced a series of problems such as cyber-attacks, packet loss, and network delay. DoS attacks mainly affect the operation of the system by preventing the transmission of data. In addition, with the increasing strain on network communication resources, packet loss often occurs. In the existing research, DoS attacks and packet loss are mainly characterized by Markov processes [4,5] and Bernoulli distributions [6,7], and the hold-input strategy [8,9] and the zero-input strategy [10] are adopted to counteract their impact on the system. However, there has been little research that considers both DoS attacks and packet loss in continuous-time systems.

The DETM is an effective approach to improve network resource utilization. It introduces an internal dynamic variable on the basis of the static event-triggered mechanism, which effectively increases the time interval between two consecutive triggering instants. Recently, the DETM has been widely applied in multiple fields [11,12]. For instance, a DETM is proposed for Markov jump systems to relieve network communication pressure in [13,14]. Inspired by these works, in the presence of DoS attacks and packet loss, this paper designs the controller based on a DETM so that the system runs smoothly and efficiently. The contributions are as follows: (1) This article simultaneously considers the impact of DoS attacks and random packet loss on Markov jump systems. (2) A Bernoulli stochastic process is used to model packet loss, and the zero-input strategy is used to counteract the impact of DoS attacks and packet loss. (3) Different controllers are designed based on the DETM at different stages of DoS attacks. (4) Different from the piecewise Lyapunov-Krasovskii functional method previously used for DoS attacks, this paper constructs a common Lyapunov-Krasovskii functional over the action period and the sleeping period of DoS attacks; the resulting exponential mean square stability criteria are less conservative. (5) The effectiveness of the method is verified by a simulation.

Notation: In this article, R and N represent the set of real numbers and the set of natural numbers, respectively. P > 0 means that P is a positive definite matrix. R^{s×m} is the set of s × m real matrices. R^n is the n-dimensional Euclidean space. ∗ denotes a symmetric entry in a symmetric matrix. sym{T} = T + T^T.

2 Problem Formulation

2.1 System Description

Consider the Markov jump systems as follows:

\dot{x}(t) = A(\iota_t)x(t) + B(\iota_t)u(t) + f(t, x), \quad x(s) = \theta(s), \ s \in [-l, 0],   (1)

where x(t) ∈ R^n is the state vector of the system and u(t) ∈ R^m is the control input of the system. A(ι_t) ∈ R^{n×n} and B(ι_t) ∈ R^{n×m} are known constant matrices. f(t, x) is a nonlinear function that satisfies the following Assumption 1.


Assumption 1 [15]. i) For \hat{x}_1, \hat{x}_2 ∈ D, f(t, x) is one-sided Lipschitz if \langle f(t,\hat{x}_1) - f(t,\hat{x}_2), \hat{x}_1 - \hat{x}_2 \rangle \le \rho_0 \|\hat{x}_1 - \hat{x}_2\|^2 holds, where ρ_0 ∈ R is the one-sided Lipschitz constant, and for Ψ, Ω ∈ R^n, \langle \Psi, \Omega \rangle = \Psi^T \Omega denotes the inner product. ii) For \hat{x}_1, \hat{x}_2 ∈ D, f(t, x) is quadratically inner-bounded if \|f(t,\hat{x}_1) - f(t,\hat{x}_2)\|^2 \le \beta_0 \|\hat{x}_1 - \hat{x}_2\|^2 + \alpha_0 \langle f(t,\hat{x}_1) - f(t,\hat{x}_2), \hat{x}_1 - \hat{x}_2 \rangle holds, where β_0, α_0 ∈ R are known constants.

Remark 1. The area D is our operational area. The one-sided Lipschitz condition has the inherent advantages of the Lipschitz condition and is more general.

{ι_t, t ≥ 0} is the Markov jump process taking values in the finite set H. λ_{ij} ≥ 0 (i, j ∈ H, i ≠ j) represents the transition rate of the system from mode i at time t to mode j at time t + π, and \lambda_{ii} = -\sum_{j=1, j \ne i}^{L} \lambda_{ij} is satisfied. M = {λ_{ij}} (i, j ∈ H = {1, 2, ..., L}) is the transition rate matrix, which satisfies

\Pr\{\iota_{t+\pi} = j \mid \iota_t = i\} = \begin{cases} \lambda_{ij}\pi + o(\pi), & i \ne j, \\ 1 + \lambda_{ii}\pi + o(\pi), & i = j, \end{cases}

where π > 0 and \lim_{\pi \to 0} o(\pi)/\pi = 0. For ι_t = i, i ∈ H, we denote A(ι_t) = A_i, B(ι_t) = B_i, and so on.

Definition 1 [16]. For all t ≥ 0, if there exist constants ε > 0 and ϱ > 0 such that E\|x(t)\|^2 \le \varepsilon e^{-\varrho t} \sup_{-\tau \le s \le 0} E\|\theta(s)\|^2 holds, then the Markov jump system (1) is said to be exponentially mean-square stable.

2.2 DoS Attacks

In this paper, random packet loss and periodic DoS attacks are considered in the control channel between the zero-order holder (ZOH) and the actuator. It is assumed that packet loss and DoS attacks are two independent processes, and random packet loss within the action period is not considered. The periodic DoS attacks can be expressed as

\varsigma_{DoS}(t) = \begin{cases} 0, & t \in [nF, nF + F_{off}), \\ 1, & t \in [nF + F_{off}, (n+1)F), \end{cases}

where F is the period of the DoS attacks. The (n+1)-th (n ∈ N) cycle of DoS attacks is divided into two stages: the sleeping period [nF, nF + F_{off}) and the action period [nF + F_{off}, (n+1)F). During the action period of DoS attacks, data cannot be transmitted, while during the sleeping period data can be transmitted; in this stage, we consider the influence of random packet loss on the system. A random variable ς(t) satisfying the Bernoulli distribution is introduced to model the phenomenon of random packet loss:

\varsigma(t) = \begin{cases} 0, & \text{a control packet loss occurs}, \\ 1, & \text{no control packet loss occurs}, \end{cases}

with Pr{ς(t) = 1} = ς and Pr{ς(t) = 0} = 1 − ς, where 0 < ς < 1 is a known constant.
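As a small illustration of the two disturbance models just described, the following NumPy sketch (ours, not the authors' code) generates the periodic DoS indicator and a Bernoulli packet-delivery sequence on a sampling grid; the values F = 2 s, F_off = 1.6667 s and ς = 0.6 are placeholders borrowed from the numerical example later in the paper.

```python
import numpy as np

def dos_and_packet_loss(t_grid, F=2.0, F_off=1.6667, p_success=0.6, seed=0):
    """Generate the periodic DoS indicator and a Bernoulli packet-delivery sequence.

    Returns (dos, received): dos[k] = 1 during the action period of the attack
    (transmission blocked), and received[k] = 1 when the control packet sent at
    t_grid[k] is delivered during the sleeping period.
    """
    rng = np.random.default_rng(seed)
    phase = np.mod(t_grid, F)                    # position inside the current DoS cycle
    dos = (phase >= F_off).astype(int)           # action period [nF + F_off, (n+1)F)
    bernoulli = rng.random(t_grid.shape[0]) < p_success
    received = ((dos == 0) & bernoulli).astype(int)
    return dos, received

# Example: sampling period h = 0.05 s over one attack cycle.
h = 0.05
t = np.arange(0.0, 2.0, h)
dos, received = dos_and_packet_loss(t)
```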

2.3 Dynamic Event-Triggered Mechanism

In order to reduce the frequency of data transmission, the following DETM based on sampled data is proposed. The sampler samples the system state with a fixed period h (h > 0), and only the sampled states at instants ph (p ∈ N) that satisfy the triggering condition of the following DETM are transmitted. For t ∈ [nF, nF + F), t_k^{n+1}h (k ∈ N, t_k^{n+1}h ∈ {ph | p ∈ N}) denotes the latest event-triggered instant, t_k^{n+1}h + jh is the current sampled instant, and the next event-triggered instant t_{k+1}^{n+1}h is decided by the following rule:

t_{k+1}^{n+1}h = t_k^{n+1}h + \min_{j \ge 1, j \in \mathbb{N}} \{ jh \mid -e^T(t_k^{n+1}h + jh) H e(t_k^{n+1}h + jh) + \sigma x^T(t_k^{n+1}h) H x(t_k^{n+1}h) + \delta \beta(t_k^{n+1}h + jh) \le 0 \}, \quad t_k^{n+1}h \in [nF, nF + F_{off}),   (2)

where t_{1,0} = 0, k ∈ \tilde{K}^n, \tilde{K}^n = {0, 1, ..., \bar{k}_n}, \bar{k}_n ≜ sup{k ∈ N | t_k^{n+1}h ≤ nF + F_{off}}. σ ∈ (0, 1) is a given constant, and H > 0 is a weight matrix to be determined. e(t_k^{n+1}h + jh) = x(t_k^{n+1}h) − x(t_k^{n+1}h + jh) is the error between the latest transmitted state and the current sampled state. The dynamic variable β(t) is governed by the following rule:

\dot{\beta}(t) = -2\kappa_1 \beta(t) - \delta \beta(ph) + x^T(ph)\Pi x(ph), \quad t \in [ph, (p+1)h),   (3)

where κ_1 > 0 and δ > 0 are given constants, Π > 0 is a weight matrix to be determined, and the initial condition satisfies β(0) ≥ 0.

Lemma 1 [17]. For t ∈ [0, ∞), given constants β(0) ≥ 0, κ_1 > 0, h > 0 and a matrix Π > 0, if 0 < \delta \le -2\kappa_1 + \frac{2\kappa_1}{1 - e^{-2\kappa_1 h}}, the dynamic variable β(t) defined in (3) satisfies β(t) ≥ 0.

Remark 2. The triggering rule (2) is designed on the basis of periodic sampling, so the Zeno phenomenon does not occur. Compared with the hold-input strategy in [8,18], the zero-input strategy saves energy and reduces the complexity of the equations. In this paper, when a control packet is lost due to random packet loss or DoS attacks, the zero-input strategy is adopted, that is, the input of the actuator is set to zero. Based on this, the controller can be designed as follows:



u(t) = \begin{cases} \varsigma(t) K_i x(t_k^{n+1}h), & t \in [t_k^{n+1}h, t_{k+1}^{n+1}h) \cap \Pi_1^n, \\ 0, & t \in \Pi_2^n, \end{cases}   (4)

where Π_1^n ≜ [nF, nF + F_{off}) and Π_2^n ≜ [nF + F_{off}, (n+1)F), and K_i is the controller gain matrix.
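To show how the triggering rule (2), the dynamic variable (3), the Bernoulli packet loss, and the zero-input strategy of (4) interact, the following is a schematic NumPy sketch of one sampling period of the closed loop. It is our own illustration under stated assumptions (a forward-Euler step for (3), placeholder gains, and no mode switching), not the authors' implementation.

```python
import numpy as np

def detm_control_step(x_sample, x_trig, beta, K, H, Pi,
                      sigma, delta, kappa1, h, p_loss, dos_active, rng):
    """One sampling period of the event-triggered loop (schematic sketch only).

    x_sample : current sampled state x(ph)
    x_trig   : state at the latest event-triggered instant x(t_k h)
    beta     : internal dynamic variable of the DETM
    """
    e = x_trig - x_sample                                  # error between held and current state
    # triggering rule (2): release a new packet once this quantity becomes non-positive
    if sigma * (x_trig @ H @ x_trig) + delta * beta - (e @ H @ e) <= 0.0:
        x_trig = x_sample.copy()
    # dynamic rule (3), approximated by one forward-Euler step of length h
    beta = beta + h * (-2.0 * kappa1 * beta - delta * beta + x_sample @ Pi @ x_sample)
    # Bernoulli packet loss during the sleeping period; the DoS action period blocks everything
    delivered = (not dos_active) and (rng.random() < 1.0 - p_loss)
    u = K @ x_trig if delivered else np.zeros(K.shape[0])  # zero-input strategy otherwise
    return u, x_trig, beta
```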

2.4 Switched Markov Jump System Model

Denote q_{\bar{k}_n}^n ≜ sup{j ∈ N | t_{\bar{k}_n}^{n+1}h + jh < nF + F_{off}} and q_k^n ≜ sup{j ∈ N | t_{k+1}^{n+1}h > t_k^{n+1}h + jh}, where k ∈ {0, 1, ..., \bar{k}_n − 1}. We have T_{k,n}^{j} = [t_k^{n+1}h + (j−1)h, t_k^{n+1}h + jh), j = 1, 2, ..., q_k^n; T_{k,n}^{q_k^n+1} = [t_k^{n+1}h + q_k^n h, t_{k+1}^{n+1}h); T_{\bar{k}_n,n}^{j} = [t_{\bar{k}_n}^{n+1}h + (j−1)h, t_{\bar{k}_n}^{n+1}h + jh), j = 1, 2, ..., q_{\bar{k}_n}^n; and T_{\bar{k}_n,n}^{q_{\bar{k}_n}^n+1} = [t_{\bar{k}_n}^{n+1}h + q_{\bar{k}_n}^n h, nF + F_{off}).

Based on the above sampling intervals, two piecewise functions are defined. (1) For k ∈ {0, 1, ..., \bar{k}_n − 1}, \hat{q}_k^n = q_k^n; for k = \bar{k}_n, \hat{q}_{\bar{k}_n}^n = sup{j ∈ N | (t_{\bar{k}_n}^{n+1} + j)h < nF + F_{off}}, and 0 ≤ l_k^n(t) < l, l = h:

l_k^n(t) = \begin{cases} t - t_k^{n+1}h, & t \in T_{k,n}^{1}, \\ t - t_k^{n+1}h - h, & t \in T_{k,n}^{2}, \\ \quad \vdots \\ t - t_k^{n+1}h - \hat{q}_k^n h, & t \in T_{k,n}^{\hat{q}_k^n+1}. \end{cases}   (5)

(2) For k ∈ \tilde{K}^n,

e_{kn}(t) = \begin{cases} 0, & t \in T_{k,n}^{1}, \\ x(t_k^{n+1}h) - x(t_k^{n+1}h + h), & t \in T_{k,n}^{2}, \\ \quad \vdots \\ x(t_k^{n+1}h) - x(t_k^{n+1}h + q_k^n h), & t \in T_{k,n}^{q_k^n+1}. \end{cases}   (6)

From (5) and (6), for t ∈ [t_k^{n+1}h, t_{k+1}^{n+1}h) ∩ Π_1^n, we have

x(t_k^{n+1}h) = x(t - l_k^n(t)) + e_{kn}(t).   (7)

Denote \tilde{\Pi}_1^n = [\tilde{t}_1^n, \tilde{t}_2^n) = \cup_{k=0}^{\bar{k}_n} \cup_{i=1}^{q_k^n+1} T_{k,n}^{i} \cap \Pi_1^n and \tilde{\Pi}_2^n = [\tilde{t}_2^n, \tilde{t}_1^{n+1}) = \Pi_2^n. Combining (1), (4) and (7), the switched Markov jump system is as follows:

\begin{cases} \dot{x}(t) = A_i x(t) + \varsigma(t) B_i K_i (x(t - l_k^n(t)) + e_{kn}(t)) + f(t, x), & t \in \tilde{\Pi}_1^n, \\ \dot{x}(t) = A_i x(t) + f(t, x), & t \in \tilde{\Pi}_2^n. \end{cases}   (8)

3 Main Results

Next, a common Lyapunov functional is constructed, and then the sufficient conditions for the exponential mean square stability of the system (8) are obtained, and the controller gain matrix and event-triggered parameters are obtained. Theorem 1. For positive scalars F, Fof f , l, σ, δ, κ1 , κ2 , ε1 , ε2 , ν and κ1 + κ2 > 0, satisfying Lemma 1 and λ := −κ2 F + (κ1 + κ2 ) Fof f > 0, if there exist symmetric matrices Πi , Hi , Qi > 0, Ri > 0, P¯i > 0 (i ∈ 1, . . . , L) and matrices N11i , N12i , N21i , N22i , M11i , M12i , M21i , M22i (i ∈ 1, . . . , L) with appropriate dimensions, such that for m = 1, 2 and q = 1,2,

New Stability Criteria Under DoS Attacks and Packet Loss



Δm(q)i

Φm1i ∗ ∗ ∗ ∗

 ⎢ lΓˆm1 − 2ν P¯i − ν 2 Ri ∗ ∗ ∗ ⎢ 2αl ⎢ Φm3(q)i 0n×n −e Ri ∗ ∗ =⎢ 2αl ⎢ Φm4(q)i 0 0 −3e R ∗ n×n n×n i ⎢ ⎣ Φ ˆm5 0n×n 0n×n 0n×n Φm6 ˆm7 Φ 0n×n 0n×n 0n×n 0n×n

⎤ ∗ ∗ ⎥ ⎥ ∗ ⎥ ⎥  0, ∗ ⎥ ⎥ ∗ ⎦ Φm8

227

(9)

where

  Φ11i = sym e¯T11 Γˆ11 + (ε2 α0 − ε1 ) e¯T11 P¯i e¯16 − ε2 e¯T16 e¯16 + Ξ11i + Ξ12i 

+ e¯T11 2κ1 P¯i + Qi + ii P¯i e¯11 − e−2κ1 l e¯T13 Qi e¯13 + e¯T12 Πi e¯12 − e¯T17 Hi e¯17  T T

T T T + σ(¯ e12 + e¯17 ) Hi (¯ e12 + e¯17 ) , Ξ11i = le−2κ1 l E15 N11i E11 + E15 N12i E12 ,   Φ21i = sym e¯T21 Γˆ21 + (ε2 α0 − ε1 ) e¯T21 P¯i e¯26 − ε2 e¯T26 e¯26 + Ξ21i + Ξ22i 

+ e¯T21 −2κ2 P¯i + Qi + ii P¯i e¯21 + e¯T22 Πi e¯22 − e−2κ1 l e¯T23 Qi e¯23 ,  T T

 T T T T Ξ12i = le−2κ1 l E16 N21i E13 + E16 N22i E14 , Ξ21i = le−2κ1 l E25 M11i E21 +

 T T

T T −2κ1 l T T E26 M21i E23 + E26 M22i E24 , E25 M12i E22 , Ξ22i = le  ¯ ¯ Φm6 = diag −P1 , · · · , −Pi−1 , −P¯i+1 , · · · , −P¯L , Φ13(1)i = lN11i E15 , Φ13(2)i = lN21i E16 , Φ14(1)i = lN12i E15 , Φ14(2)i = lN22i E16 , Φ23(1)i = lM11i E25 , Φ23(2)i = lM21i E26 , Φ24(1)i = lM12i E25 , Φ24(2)i = lM22i E26 , Υ11 = lΣ11 , Υ12 = lΣ12 ,  T T

 T T T T Ξ11 = le−2κ1 l E15 N11 E11 + E15 N12 E12 , Ξ12 = le−2κ1 l E16 N21 E13 +

 T T

T T −2κ1 l T T E25 M11 E21 + E25 M12 E22 , Ξ22 = le−2κ1 l E16 N22 E14 , Ξ21 = le  T T

T T T E26 M21 E23 + E26 M22 E24 , Υ21 = lΣ21 , Υ22 = lΣ22 , Σ11 = le−2κ1 l E15    T −1 1 T −1 T −1 T N21 R N21 R N11 + N12 R N12 E15 , Σ12 = le−2κ1 l E16 N11 3    1 T −1 1 T −1 T T −1 + N22 R N22 E16 , Σ21 = le−2κ1 l E25 R M11 + M12 R M12 E25 , M11 3 3    1 T −1 T T −1 R M21 + M22 R M22 E26 , Φ11 = sym e¯T11 Pi Γ1 1 Σ22 = le−2κ1 l E26 M21 3 T T + (ε2 α0 − ε1 ) e¯11 e¯16 − ε2 e¯16 e¯16 + Ξ11 + Ξ12 − e−2κ1 l e¯T13 Q¯ e13 + e¯T12 Π e¯12 T

− e¯T17 H e¯17 + e¯T11 (2κ1 Pi + Q + ii Pi ) e¯11 + σ(¯ e12 + e¯17 ) H (¯ e12 + e¯17 ) ,  T T T Φ21 = sym e¯21 Pi Γ21 + (ε2 α0 − ε1 ) e¯21 e¯26 − ε2 e¯26 e¯26 + Ξ21 + Ξ22 + e¯T22 Π e¯22 − e−2κ1 l e¯T23 Q¯ e23 + e¯T21 (−2κ2 Pi + Q + ii Pi ) e¯21 , Γ21 = Ai e¯21 + e¯26 , Γ11 = e12 + e¯17 ) + e¯16 , Θm = l2 Γ T RΓm1 , Φˆ17 = Φ17 P¯i , Φˆ27 = Φ27 P¯i , Ai e¯11 + ςBi Ki (¯ m1

Φ17 = [ε1 ρ0 + ε2 β0 0n×6n ] , Φ27 = [ε1 ρ0 + ε2 β0 0n×5n ] , Γˆ21 = Ai P¯i e¯21 + e¯26 , 1 Γˆ11 = Ai P¯i e¯11 + ςBi Yi (¯ e12 + e¯17 ) + e¯16 , Φ18 = Φ28 = − (ε1 ρ0 + ε2 β0 ) , 2

228

H. Chen et al.

 √

 i1 · · · i(i−1) 0n×6n · · · 0n×6n  √  i1 · · · i(i−1) = 0n×5n · · · 0n×5n

Φ15 = Φ25



i(i+1) 0n×6n  i(i+1) 0n×5n

T √ · · · iL , Φˆ15 = Φ15 P¯i , E11 = e¯11 − e¯12 , · · · 0n×6n T √ · · · iL , Φˆ25 = Φ25 P¯i , E21 = e¯21 − e¯22 , · · · 0n×5n

e24 , Φ13(1) = lN11 E15 , Φ13(2) = lN21 E16 , E23 = e¯22 − e¯23 , E22 = e¯21 + e¯22 − 2¯ Φ23(1) = lM11 E25 , Φ23(2) = lM21 E26 , E13 = e¯12 − e¯13 , E24 = e¯22 + e¯23 − 2¯ e25 , Φ14(1) = lN12i E15 , Φ14(2) = lN22 E16 , Φ24(1) = lM12 E25 , E12 = e¯11 + e¯12 − 2¯ e14 , Φ24(2) = lM22 E26 , E14 = e¯12 + e¯13 − 2¯ e15 , E15 = col {¯ e11 , e¯12 , e¯14 } , e12 , e¯13 , e¯15 } , E25 = col {¯ e21 , e¯22 , e¯24 } , E26 = col {¯ e22 , e¯23 , e¯25 } , E16 = col {¯  e¯1u = 0n×(u−1)n In 0n×(7−u)n , u = 1, 2, · · · , 7,  e¯2j = 0n×(j−1)n In 0n×(6−j)n , j = 1, 2, · · · , 6. Then, the switched Markov jump system (8) is exponentially mean-square stable. And the controller gain matrix and event-triggered parameters are Ki = Yi P¯i−1 and Hi = P¯i H P¯i , Πi = P¯i Π P¯i , respectively. Proof. Construct the common Lyapunov-Krasovskii functional: W (t) = W (x (t) , ιt , t) = W1 (x (t) , ιt , t) + β (t) ,

(10)

where β (t) is defined in (3). Q > 0, R > 0, Pi > 0. And W1 (x (t) , ιt , t) = xT (t) t t e−2κ1 (t−η) xT (η)Qx(η)dη + l ∫t−l ∫θt e−2κ1 (t−η) x˙ T (η)Rx(η)dηdθ. ˙ P (ιt ) x(t) + ∫t−l The infinitesimal operator [19] of W1 (x (t) , ιt , t) is

 L −2κ1 l T LW1 (x (t) , ιt , t) = 2xT (t)Pi x(t) ˙ + xT (t) P x (t − l)Q j=1 ij j x(t) − e

t x(t − l) + xT (t)Qx(t) + l2 x˙ T (t)Rx(t) ˙ − 2κ1 ∫t−l e−2κ1 (t−η) xT (η)Qx(η)dη t t t −2κ1 (t−η) T −2κ1 (t−η) T x˙ (η)Rx(η)dη ˙ − 2κ1 l ∫t−l ∫θ e x˙ (η)Rx(η)dηdθ. ˙ −l ∫t−l e (11)

Case 1: When t ∈ t˜n1 , t˜n2 , ςDoS (t) = 0. When the event-triggered condition is not met at time t, it can be obtained according to (2) and (7): T

−δβ (t − lkn (t)) < σ[x (t − lkn (t)) + ekn (t)] H [x (t − lkn (t)) + ekn (t)] −eTkn (t)Hekn (t),

(12)

Combining (3) and (12), we have β˙ (t)  − 2κ1 β (t) + xT (t − lkn (t)) Πx (t − lkn (t)) − eTkn (t)Hekn (t) T

+ σ[x (t − lkn (t)) + ekn (t)] H [x (t − lkn (t)) + ekn (t)] ,

(13)

According to (10), (11) and (13), we have LW (t)   −2κ1 W (t) + 2κ1 xT (t)Pi x(t) + xT (t)Qx(t) + 2xT (t)Pi x˙ (t) L T 2 T +xT (t) ˙ (t)R j=1 ij Pj x(t) + x (t − lkn (t)) Πx (t − lkn (t)) + l x T

x(t) ˙ + σ[x (t − lkn (t)) + ekn (t)] H [x (t − lkn (t)) + ekn (t)] − eTkn (t)H t e−2κ1 (t−η) x˙ T (η)Rx(η)dη. ˙ ekn (t) − e−2κ1 l xT (t − l)Qx(t − l) − l ∫t−l

(14)

New Stability Criteria Under DoS Attacks and Packet Loss

229

 t 1 Let ξ2 (t) = col x(t), x (t − lkn (t)) , x(t − l), lkn1(t) t−lkn (t) x (η) dη, l−lkn (t)   t−lkn (t) x (η) dη, f (t, x) , ξ1 (t) = col {ξ2 (t), ekn (t)}. From Lemma 1 in [20], t−l t − l ∫t−l e−2κ1 (t−η) x˙ T (η) Rx˙ (η) dη  ξ1T (t) {lkn (t) Σ11 + (l − lkn (t)) Σ12

+sym {Ξ11 + Ξ12 }} ξ1 (t) .

(15)

From Assumption 1, it exists scalars ε1 > 0 and ε2 > 0, 2ε1 ρ0 xT (t) x (t) − ε1 xT (t) f (t, x) − ε1 f T (t, x) x (t)  0, 2ε2 β0 xT (t) x (t) + 2ε2 α0 xT (t) f (t, x) − 2ε2 f T (t, x) f (t, x)  0.

(16)

Combining (14), (15) and (16), we have LW (t)  −2κ1 W (t) + ξ1T (t) Δ1 ξ1 (t) , 

(17)

L ¯11 + j=i ij Pj e

where Δ1 =Φ1 +Θ1 +lkn (t) Σ11 +(l − lkn (t)) Σ12 , Φ1= Φ11 +¯ eT11  sym (ε1 ρ0 + ε2 β0 ) e¯T11 e¯11 .

, ςDoS (t) = 1. Let κ1 + κ2 > 0, then −2κ1 < 2κ2 . Case 2: When t ∈ t˜n2 , t˜n+1 1 ˙ From Lemma 1, we have β (t)  2κ2 β (t) + xT (t − lkn (t)) Πx (t − lkn (t)) . Similar to the case 1, we have LW (t)  2κ2 W (t) + ξ2T (t) Δ2 ξ2 (t) ,

(18)



L ¯21 + where Δ2= Φ2 +Θ2 +lkn (t) Σ21 +(l − lkn (t)) Σ22 , Φ2= Φ21 +¯ eT21 j=i ij Pj e  T sym (ε1 ρ0 + ε2 β0 ) e¯21 e¯21 . {Q, R, H, Π, N11 , N12 , N21 , N22 , M11 , M 12 , M21 , M22 } , Define P¯i = Pi−1 , ∈  ¯i P¯i , O(1) = diag P¯i , P¯i , P¯i , P¯i , P¯i , In , P¯i , In , P¯i , P¯i , Ji , In , O(2) = diag = P i  P¯i , P¯i , P¯i , P¯i , P¯i , In , In , P¯i , P¯i , Ji , In , Ji = diag {In , · · · , In }, Yi = Ki P¯i . Using !" # L−1

inequality −R−1 = −P¯i Ri −1 P¯i  −2ν P¯i + ν 2 Ri and Schur complement, for q = 1, 2, Δm  0 (m = 1, 2) is equivalent to ⎤ ⎡ Φm1 ∗ ∗ ∗ ∗ ∗

 ⎢ lΓm1 − 2ν P¯i − ν 2 Ri ∗ ∗ ∗ ∗ ⎥ ⎥ ⎢ 2αl ⎢ Φm3(q) 0n×n −e R ∗ ∗ ∗ ⎥ ⎥  0, (19) Δm(q) = ⎢ ⎢ Φm4(q) 0n×n 0n×n −3e2αl R ∗ ∗ ⎥ ⎥ ⎢ ⎣ Φm5 0n×n 0n×n 0n×n Φm6 ∗ ⎦ 0n×n 0n×n 0n×n 0n×n Φm8 Φm7 Then pre- and post-multiplying (19) by O(l) , respectively. We can get (9). It can be further deduced from (9), (17) and (18) $

LW (t)  −2κ1 W (t) , t ∈ t˜n1 , t˜n2 ,

(20) LW (t)  2κ2 W (t) , t ∈ t˜n2 , t˜n+1 . 1

230

H. Chen et al.

From (20), we have $ EW (t) 





˜n Ee−2κ1 (t−t1 ) W t˜n1 , t ∈ t˜n1 , t˜n2 , 



n ˜ . Ee2κ2 (t−t2 ) W t˜n2 , t ∈ t˜n2 , t˜n+1 1

(21)

Case1: When t ∈ t˜n1 , t˜n2 , from (21) and t˜01 = 0, we have EW (t)  





˜n ˜n−1  EW t˜n−1 e−2κ1 (t−t1 ) EW t˜n1  e2(κ1 +κ2 )(F −Fof f )−2κ1 (t−t2 ) EW t˜n−1 2 1 ˜n−1 e−2κ1 (t−t1 ) e2(κ1 +κ2 )(F −Fof f )  · · ·  e−2κ1 t+2n(κ1 +κ2 )(F −Fof f ) W (0). From equation λ := −κ2 F + (κ1 + κ2 ) , Fof f > 0, t  t˜n1  nF and t  t˜n2 = nF + Fof f , one has EW (t)  e−2λn W (0)  e2λ

Fof f F



e− F t W (0) .

(22)

, similar to case 1 Case2: When t ∈ t˜n2 , t˜n+1 1 2λ

EW (t)  e− F t W (0) . Denote q1 =

min

i∈{1,...,L}

{λmin (Pi )} , q2 =

min

i∈{1,...,L}

(23) {λmax (Pi )} , q3 = q2 +lλmax

(Q) + l2 λmax (R). As known by the formula (10), there is a scalar γ > 1 satisfies  % 2 2 sup θ (s)l + β (0) . EW (t)  q1 Ex (t) , W (0)  γq3 E (24) −ls0

For β (0)   given θ and β %(0), there exists a scalar q4 > 0 that satisfies   2 2 5 q4 E sup θ (s)l . From (22), (23) and (24), we have E x (t)  ϑq q1 −ls0  %  Fof f  λ 2 e− F t E sup θ (s)l , where ϑ = max e2λ F , 1 and q5 = γq3 +q4 . There−ls0

fore, the system (8) is exponentially stable in mean square.

4

Numerical Example 

   −1.4 0 1 0.5 For system (1) , the parameters are given: A1 = , A2 = , B1 1 −1 0 −1.5     0.8 0.5 = , B2 = , F = 2 s, Fof f = 1.6667s, h = 0.05s, κ1 = 0.1, κ2 = 0.5 0.8 0.5, σ = 0.1, δ = 0.3, m = 0.1, ε1 = 0.1, ε2 = 0.7, α0 = −0.3, β0 = 0.1, ρ0 = −0.3, f (t, x) = 6 sin (0.1x (t)) . The initial conditions  are β(0) = 20 and x (0) = −3 3 T [0.3 − 0.5] . The transition rate matrix is M = . When ς = 0.6, the 5 −5 following controller gain matrix and weight matrices are obtained:     220.01 123.41 1.44 −0.16 , Π1 = , K1 = [−13.0201 − 6.5045], H1 = 123.41 103.03 −0.16 1.86

New Stability Criteria Under DoS Attacks and Packet Loss

231

Fig. 1. The state response of the system under DETM 2

Markov Jumps

Mode r(t)

2.5

1.5

Control Input u(t)

1

u(t) DoS

2 1.5 1 0.5 0

5

10

15

20

10

12

Time t

0.5

0

-0.5

-1

-1.5

0

2

4

6

8

Time (sec)

14

16

18

20

Fig. 2. The control input of the system under DETM



K_2 = [−14.0542, −7.9044], H_2 = [89.82, 119.63; 119.63, 215.10], Π_2 = [1.16, −0.13; −0.13, 2.32]. Figures 1 and 2 show the state response and the control input of system (8) under the DETM, respectively. The control input is always zero during the action period of the DoS attacks. Since the zero-input strategy is used when packet loss occurs, the control input is also zero in some intervals during the sleeping period of DoS attacks before the system reaches the steady state, which is due to packet loss. Figure 3 shows the state response of system (8) without control. According to Figs. 1-3, the DETM (2) enables system (8) to achieve exponential mean square stability under the combined influence of packet dropouts and DoS attacks. Figure 4 shows the relationship between the release instants and the release intervals; the maximum triggering interval is about 1.16 s, which shows that our method can effectively alleviate the network congestion problem.

Fig. 3. The state response of the system without control



20

Fig. 4. The release instants and release intervals of the system under DETM

5 Conclusion

In this paper, the exponential mean square stability of Markov jump systems with packet loss and DoS attacks is studied via a DETM. Considering the influence of packet loss and DoS attacks on the system, the controller is designed based on the DETM. Then, by constructing a common Lyapunov-Krasovskii functional, new stability criteria for the switched Markov jump system under DoS attacks and packet loss are obtained. Finally, the practicability of the proposed method is verified through a simulation example. In the future, our work will be developed in the following direction: for continuous Markov jump systems, random packet loss will be considered during the sleeping period of DoS attacks, and the success rate of DoS attacks will be considered during the action period.

Acknowledgements. The research of the first corresponding author was supported in part by the Basic Research Youth Fund Project of Yunnan Science and Technology Department under Grant 202101AU070050, and in part by the Scientific Research Fund Project of Yunnan Provincial Department of Education under Grant 2022J0447. The research of the second corresponding author was supported by the National Nature Science Foundation of China under Grant 12061088. The research of the fourth author was supported by the Yunnan Provincial Science and Technology Department under Grant 2023Y0566.

Statements and Declarations. Competing interests: The authors declare that there is no competing financial interest or personal relationship that could have influenced the work reported in this paper.

References

1. Yang, T., Wang, Z., Huang, X., et al.: Aperiodic sampled-data controller design for stochastic Markovian jump systems and its application. Int. J. Robust Nonlinear Control 31(14), 6721–6739 (2021)
2. Chen, G., Xia, J., Park, J.H., et al.: Sampled-data synchronization of stochastic Markovian jump neural networks with time-varying delay. IEEE Trans. Neural Netw. Learn. Syst. 33(8), 3829–3841 (2021)
3. Zeng, P., Deng, F., Liu, X., et al.: Event-triggered H∞ control for network-based uncertain Markov jump systems under DoS attacks. J. Franklin Inst. 358(6), 2895–2914 (2021)



4. Qiu, L., Yao, F., Zhong, X.: Stability analysis of networked control systems with random time delays and packet dropouts modeled by Markov chains. J. Appl. Math. 2013, 715072 (2013)
5. Zhang, Y., Xie, S., Ren, L., et al.: A new predictive sliding mode control approach for networked control systems with time delay and packet dropout. IEEE Access 7, 134280–134292 (2019)
6. Su, L., Ye, D.: Observer-based output feedback H∞ control for cyber-physical systems under randomly occurring packet dropout and periodic DoS attacks. ISA Trans. 95, 58–67 (2019)
7. Li, L., Zhang, G., Ou, M.: A new method to non-fragile piecewise H∞ control for networked nonlinear systems with packet dropouts. IEEE Access 8, 196102–196111 (2020)
8. Huang, L., Guo, J., Li, B.: Observer-based dynamic event-triggered robust H∞ control of networked control systems under DoS attacks. IEEE Access 9, 145626–145637 (2021)
9. Liu, C., Xiong, R., Xu, J., et al.: On iterative learning control for remote control systems with packet losses. J. Appl. Math. 2013, 245072 (2013)
10. Lu, R., Shi, P., Su, H., et al.: Synchronization of general chaotic neural networks with nonuniform sampling and packet missing: a switched system approach. IEEE Trans. Neural Netw. Learn. Syst. 29(3), 523–533 (2016)
11. Xu, Y., Sun, J., Wang, G., et al.: Dynamic triggering mechanisms for distributed adaptive synchronization control and its application to circuit systems. IEEE Trans. Circuits Syst. I Regul. Pap. 68(5), 2246–2256 (2021)
12. Hu, S., Yue, D., Cheng, Z., et al.: Co-design of dynamic event-triggered communication scheme and resilient observer-based control under aperiodic DoS attacks. IEEE Trans. Cybern. 51(9), 4591–4601 (2020)
13. Wang, Y., Chen, F., Zhuang, G., et al.: Dynamic event-based mixed H∞ and dissipative asynchronous control for Markov jump singularly perturbed systems. Appl. Math. Comput. 386, 125443 (2020)
14. Wang, Y., Chen, F., Zhuang, G.: Dynamic event-based reliable dissipative asynchronous control for stochastic Markov jump systems with general conditional probabilities. Nonlinear Dyn. 101(1), 465–485 (2020). https://doi.org/10.1007/s11071-020-05786-1
15. Huong, D.C., Huynh, V.T., Trinh, H.: Dynamic event-triggered state observers for a class of nonlinear systems with time delays and disturbances. IEEE Trans. Circ. Syst. II Express Briefs 67, 3457–3461 (2020)
16. Wang, R., Luo, H.: Mean-square exponential stability analysis for uncertain stochastic neutral systems with nonlinear perturbations and distributed delays. In: 15th International Conference on Computational Intelligence and Security (CIS), Macao, China, pp. 353–357. IEEE (2019)
17. Zhao, N., Shi, P., Xing, W.: Dynamic event-triggered approach for networked control systems under denial of service attacks. Int. J. Robust Nonlinear Control 31(5), 1774–1795 (2021)
18. Li, H., Li, X., Zhang, H.: Optimal control for discrete-time NCSs with input delay and Markovian packet losses: hold-input case. Automatica 132, 109806 (2021)
19. Li, H., Shi, P., Yao, D.: Adaptive sliding-mode control of Markov jump nonlinear systems with actuator faults. IEEE Trans. Autom. Control 62(4), 1933–1939 (2016)
20. Zeng, H.B., He, Y., Wu, M., et al.: New results on stability analysis for systems with discrete distributed delay. Automatica 60, 189–192 (2015)

Image Blending Algorithm with Automatic Mask Generation

Haochen Xue1, Mingyu Jin2, Chong Zhang1, Yuxuan Huang1, Qian Weng1, and Xiaobo Jin1(B)

1 Department of Intelligent Science, School of Advanced Technology, Xi'an Jiaotong-Liverpool University, Suzhou, China
{Haochen.Xue20,Chong.zhang19,Yuxuan.Huang2002,Qian.Weng22}@student.xjtlu.edu.cn, [email protected]
2 Electrical and Computer Engineering, Northwestern University, Evanston, IL, USA
[email protected]

Abstract. In recent years, image blending has gained popularity for its ability to create visually stunning content. However, current image blending algorithms mainly have the following problems: manually creating image blending masks requires a lot of manpower and material resources, and existing image blending algorithms cannot effectively solve the problems of brightness distortion and low resolution. To this end, we propose a new image blending method with automatic mask generation: it combines semantic object detection and segmentation with mask generation to achieve deep blended images, while relying on our proposed new saturation loss and a two-stage iteration of the PAN algorithm to fix brightness distortion and low-resolution issues. Results on publicly available datasets show that our method outperforms other classical image blending algorithms on various performance metrics, including PSNR and SSIM.

Keywords: Image Blending · Mask Generation · Image Segmentation · Object Detection

1 Introduction

Image blending is a versatile technique used in various applications [19,20] where different images must be combined to create a unified and visually appealing final image. It involves taking a selected part of an image (usually an object) and seamlessly integrating it into another image at a specified location. The ultimate goal of image fusion is to obtain a uniform and natural composite image. This task poses two significant challenges: relatively low localization accuracy in

H. Xue and M. Jin contributed equally to this work.



Fig. 1. Image blending process based on automatic mask generation: the first stage generates basic blending results through SAM; the second stage generates more detailed blending results after fusing PAN

GP-GAN and Poisson image editing are currently popular image blending methods [16]. In these methods, the user selects an object in the source image with an associated mask, and GP-GAN or Poisson editing is used to generate a high-quality blend of the source and target images. However, the images generated by GP-GAN and Poisson image editing are not sufficiently realistic: the composite image suffers from brightness distortion and often exhibits excessive brightness in small pixel clusters, which compromises the overall realism of the image.

All image fusion algorithms require a mask as input to cut out the object to be fused from the source image, but the masks used by previous algorithms are all handmade. These handcrafted masks are insufficient to accurately represent the location of the foreground, which may lead to poor image fusion. Traditional segmentation methods for automatically generating masks mainly include RCNN [5], which was subsequently replaced by more powerful methods, such as the Segment Anything Model (SAM) proposed by Meta [8]. However, SAM has its limitations in image blending, as it tends to capture all objects [17] in a picture, whereas image blending requires a mask for one specific object.

In our work, we reconstruct the deep image blending algorithm [20] using the Pixel Aggregation Network (PAN) and a new loss function, which iteratively improves the image blending process [11,15]. To address the limitations of manually clipped masks, we apply DINO and use target text to select the desired objects, resulting in better image blending. However, there remains a potential problem that other researchers may not have noticed: precise segmentation of objects may not always yield the best results. The blended image may lose important details if the mask does not carry relevant information about the original image. We apply a classic erosion-dilation step to address this challenge, which helps preserve important details of the original image for better blending results. Evaluation metrics including PSNR, SSIM,


and MSE on multiple image datasets show that our blended images outperform those of previous models such as GP-GAN and Poisson image editing. Our experiments show that combining DINO (DETR with improved denoising anchor boxes) [18] and SAM can generate more accurate masks than RCNN. The images generated by the hybrid algorithm have consistent brightness, higher resolution, and smoother gradients. The whole process is illustrated in Fig. 1. Our work mainly contributes the following three aspects:

– We propose an automatic mask generation method based on object detection and SAM segmentation, where erosion and dilation operations are used to manipulate the resulting mask for better image blending.
– We propose a new loss function, called saturation loss, for deep image blending algorithms to account for sudden changes in contrast at the seams of blended images.
– We use PAN to process blended images, solving the problem of low image resolution and distortion of individual pixel grey values in blended images.

The remainder of this paper is organized as follows: in Sect. 2, we introduce previous related work on image segmentation and detection and on image blending; Sect. 3 gives a detailed introduction to our method; in Sect. 4, our algorithm is compared with other algorithms. Finally, we summarize our algorithm and possible future research directions.

2 Related Work

2.1 Image Blending

The simplest approach to image blending (copy-and-paste) is to directly copy pixels from the source image and paste them onto the destination image. Still, this technique can lead to noticeable artefacts due to sudden intensity changes at the composition's boundaries. Therefore, researchers have proposed advanced methods that use complex mathematical models to integrate source and destination images and improve the overall aesthetics of the blended image. A traditional approach to image blending is Poisson image editing, first proposed by Pérez et al. [13], which exploits the concept of solving Poisson's equation to blend images seamlessly and naturally. This method transforms the source and target images into the gradient domain, thus obtaining the gradient of the target image. Another image blending technique is the gradient-domain blending algorithm [10]; the basic idea is to decompose the source and target images into gradient and Laplacian domains and combine them using weighted averaging. Deep Image Blending [20] borrows the gradient term from Poisson image editing and turns it into a loss function, adding texture and content fidelity losses, resulting in higher image quality than Poisson image editing. Our work further optimizes Deep Image Blending to generate more realistic images. Image inpainting is a technique [9] that uses learned semantic information and real image statistics to fill in missing pixels in an image, done by deep learning


models trained on large datasets. However, this technique does not perform well for large-scale image blending. Besides image blending, several other popular image editing tasks include image denoising, image super-resolution, image inpainting, image harmonization, style transfer, etc. With the rise of generative adversarial networks (GANs), these editing tasks have become increasingly important for improving the quality of generated results, as in GP-GAN [1,2,7]. Image super-resolution involves using deep learning models to learn image texture patterns and upsample low-resolution images to high-resolution images. PAN is often used to continuously refine and enhance the details in the image through a series of iterations. This process leverages the power of neural networks to make predictions and complement the image's resolution; the trained model then predicts high-resolution details missing from the low-resolution input. We use PAN to increase the image's resolution and make the blended image more realistic.

2.2 Image Segmentation and Detection

In the past, Regions with CNN Features (RCNN) [5] was the best region-based method for semantic segmentation based on object detection. RCNN can be used on top of various CNN structures and shows significant performance improvements over traditional CNN structures. Fast R-CNN is an improved version of RCNN that improves speed and accuracy by sharing the feature extraction process. The YOLO [14] algorithm treats the target detection task as a regression problem and realizes real-time target detection by dividing the image into grids and predicting the bounding box and category in each grid cell; still, the accuracy of its target localization is relatively low. DINO is an advanced object detection and segmentation framework used by our pipeline to identify the most important objects from images segmented via SAM [4]. DINO introduces improved anchor box and mask prediction branches to implement a unified framework that supports all image segmentation tasks, including instance, panoptic and semantic segmentation. Mask DINO is an extension of DINO that leverages this architecture to support more general image segmentation tasks. The model is trained end-to-end on a large-scale dataset and can accurately detect and segment objects in complex scenes. Mask DINO extends DINO's architecture and training process to support image segmentation tasks, making it an effective tool for segmentation applications. Essentially, an ideal image segmentation algorithm should be able to recognize unknown or new objects by segmenting them from the rest of the image. SegNet [3] achieves pixel-level image segmentation through an encoder-decoder structure, but it struggles with small objects and requires a lot of computation. The core idea of CRFasRNN [12] is to combine conditional random fields (CRFs) and recurrent neural networks (RNNs); compared with SegNet, CRFasRNN requires less computation and yields finer segmentation results. Facebook Meta AI has developed a new advanced artificial intelligence model called the Segment Anything Model (SAM) [8] that can extract any object in an image with a single click. SAM leverages cutting-edge deep learning techniques and computer vision


algorithms to accurately segment any object in an image. SAM can efficiently cut out objects from any type of image, making the segmentation process faster and more precise. This new technology is a breakthrough in computer vision and image processing because it can save a lot of time and effort when editing images and videos.

3 Proposed Approach

In this section, we first describe how to automatically generate a synthetic mask; then how to blend the source and target images to produce the initial result; finally, we propose a new saturation loss to refine the blended result image. The overall framework is shown in Fig. 2.

Fig. 2. The overall framework of our method: 1) generate the mask of the object through DINO and SAM; 2) use VGG19 to optimize the content loss, gradient loss and style loss to obtain an initial blended image; 3) integrate the PAN algorithm and replace the gradient loss with the saturation loss to obtain a more realistic blended image

3.1 Mask Generation

Below we describe in detail how to automatically generate high-quality masks. First, we use DINO to detect a specific region in an image based on a textual description and generate a box around that object, as shown in Fig. 4 (the input word is "bird"). We then feed the box into SAM and extract the mask of


the region. Combining DINO and SAM, we solve the problem that SAM can only segment objects but cannot select a specific object. In Fig. 5, we can observe that our algorithm can precisely identify the desired object in the image and generate an accurate mask. This method saves time and effort compared to traditional manual editing methods. It is worth noting that after obtaining the yellow-purple mask, the mask image needs to be converted into a black-and-white image. In object detection, DINO performs better than RCNN thanks to its self-supervised learning method and the advantages of the Transformer network, which enable it to better capture global features. For semantic segmentation, SAM can better capture key information in images and achieve more accurate pixel-level object segmentation through its multi-scale attention mechanism and attention to spatial features. Therefore, the mask generation achieved by the combination of these two methods outperforms traditional convolutional neural networks, as shown in Fig. 3. We use IoU to measure the quality of the mask. It can be seen from the figure that the combination of DINO and SAM not only gives visually better segmentation but also outperforms RCNN in terms of IoU.

Finally, an erosion and dilation operation [6] is performed to better refine the mask. The overall process of the mask operation is as follows: first, an erosion operation is applied to shrink the sharp and edge areas of the mask; second, a dilation operation is performed on the eroded mask to expand its edges, ensuring that the mask completely covers the target object and maintains a smooth boundary. Through these operations, we improve the coverage of the mask and make the mask input of the image blending algorithm more accurate. In the image fusion stage, the processed mask can also carry part of the source image information, making the final fused image more natural.
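To make the erosion-dilation refinement concrete, a minimal sketch using OpenCV is given below; the kernel size and iteration counts are illustrative assumptions rather than values reported by the authors, and the upstream SAM call is hypothetical.

```python
import cv2
import numpy as np

def refine_mask(mask, kernel_size=5, erode_iters=1, dilate_iters=2):
    """Erode then dilate a binary mask so its boundary is smooth and slightly
    enlarged, letting the blend carry some source-image context."""
    # Ensure a binary 0/255 mask (SAM outputs may be boolean or 0/1 arrays).
    mask = (mask > 0).astype(np.uint8) * 255
    kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (kernel_size, kernel_size))
    # Erosion shrinks sharp protrusions and thin edge artifacts.
    eroded = cv2.erode(mask, kernel, iterations=erode_iters)
    # Dilation grows the eroded mask back out so it fully covers the object.
    refined = cv2.dilate(eroded, kernel, iterations=dilate_iters)
    return refined

# Example: refine a mask produced by the DINO + SAM pipeline.
# sam_mask = sam_mask_for("bird")        # hypothetical upstream call
# refined = refine_mask(sam_mask)
```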

3.2 Seamless Blending

Seamless blending is the first stage of deep image blending [20]. The style loss L_style is used to transfer the style information of the background image to the resulting image; it measures the texture difference between the generated image and the background, making the style of the generated image unified, more harmonious and authentic. The content loss L_cont measures the pixel difference between the object in the blended image and the object in the source image; it ensures the fidelity of the content and avoids the content smearing and loss of detail caused by style transfer. The gradient loss L_grad computes the pixel-wise difference between the source image and the target image along the edge for smooth blending of edges. At this stage, through continuous iteration, the blending edge gradually becomes smoother, and the texture of the blended object gradually resembles the background image without losing any details. Specifically, we optimize the following loss function in the first stage:

L_1 = \lambda_{grad} L_{grad} + \lambda_{cont} L_{cont} + \lambda_{style} L_{style},   (1)


where λ_grad, λ_cont and λ_style represent the weight of each loss. After the first stage of processing, the blending edge of the object is very smooth, but there are still significant differences between the blended object and the background in terms of similarity and illuminance, which may affect the quality and realism of the image. To solve this problem, we need to continue to optimize the image in terms of style refinement.
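As a small illustration of how the first-stage objective of Eq. (1) can be assembled, the sketch below simply combines already-computed loss tensors with per-term weights; the loss terms here are placeholders, and the weights mirror the stage-1 settings reported later in Table 1, so this is a sketch rather than the authors' implementation.

```python
import torch

def stage_one_loss(l_grad, l_cont, l_style,
                   w_grad=1e4, w_cont=1.0, w_style=1e3):
    """Weighted sum of the stage-1 terms (Eq. 1); weights follow Table 1."""
    return w_grad * l_grad + w_cont * l_cont + w_style * l_style

# Usage with placeholder scalar loss tensors computed elsewhere:
l_grad = torch.tensor(0.12)   # gradient loss on the blend boundary
l_cont = torch.tensor(0.50)   # content loss vs. the source object
l_style = torch.tensor(0.08)  # style (texture) loss vs. the background
total = stage_one_loss(l_grad, l_cont, l_style)
```

In the second stage the gradient term is replaced by the saturation loss introduced in Sect. 3.3, with the weights changed accordingly.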

3.3 Style Refinement

In the second stage, although we use the same network architecture, it serves a different task: achieving consistency between the background and the source object in the blended image and ensuring that the generated image is more realistic. We therefore propose a new saturation loss to replace the previous gradient loss; it computes the difference in saturation gradient between the background image and the blended image to account for blends with unrealistic lighting and large contrast differences. The texture of the object in the final result image remains consistent with the source image.

Fig. 3. Comparison of traditional RCNN algorithm and DINO+SAM algorithm in segmentation tasks

Since the basic brightness of the blended background image differs from that of the target image, there is a certain contrast difference after blending, so the naked eye can perceive that a blending operation has taken place. In different colour spaces, each channel of the fused image behaves differently. We observed obvious differences at the fusion seams of the R, G, and B channels in RGB colour coordinates. However, after converting the fused image to the HSV colour model, the saturation channel of the blended image shows a sudden change in saturation value at the edge where the source image and the target image are blended, as shown in Fig. 6. To solve this sudden saturation change at the blending boundary, we propose a new saturation loss to make the saturation change of


Fig. 4. Object detection by DINO and segmentation by SAM

Fig. 5. Mask generation by SAM: 1) left: before the erosion-dilation operation; 2) right: after the erosion-dilation operation

Fig. 6. Pixel comparison of the S channel of both the fusion image and the original image on the blending boundary


the blended image more realistic and natural. The detailed process is shown in Fig. 7: first, we convert the RGB colour coordinates of the background image and the blended image to HSV colour coordinates and extract their saturation channels; then we calculate the saturation gradient difference between the blended image M and the background image B (H and W are the height and width, respectively):

L_{sat} = \frac{g(M) - g(B)}{HW},   (2)

where the function g(X) represents the sum of the gradient values along the row and column directions of the saturation channel X_s of the image X:

g(X) = \sum_{i=1}^{H} \sum_{j=1}^{W} |X_s(i+1, j) - X_s(i, j)| + |X_s(i, j+1) - X_s(i, j)|.   (3)

Finally, we optimize the following loss under the original framework:

L_2 = \lambda_{sat} L_{sat} + \lambda_{cont} L_{cont} + \lambda_{style} L_{style},   (4)

where λ_sat, λ_cont, and λ_style are the weight coefficients of each term.
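A minimal PyTorch sketch of the saturation-gradient term in Eqs. (2)-(3) is given below; the RGB-to-HSV conversion is a simple re-implementation (only the saturation channel is needed) and is an assumption, not the authors' code.

```python
import torch

def saturation_channel(img):
    """Saturation channel of an RGB tensor with shape (3, H, W), values in [0, 1]."""
    cmax, _ = img.max(dim=0)
    cmin, _ = img.min(dim=0)
    return torch.where(cmax > 0, (cmax - cmin) / cmax.clamp(min=1e-8),
                       torch.zeros_like(cmax))

def sat_gradient_sum(s):
    """g(X): sum of absolute saturation differences along rows and columns (Eq. 3)."""
    return (s[1:, :] - s[:-1, :]).abs().sum() + (s[:, 1:] - s[:, :-1]).abs().sum()

def saturation_loss(blend_rgb, background_rgb):
    """L_sat: normalized difference of saturation gradients (Eq. 2)."""
    h, w = blend_rgb.shape[-2:]
    g_blend = sat_gradient_sum(saturation_channel(blend_rgb))
    g_bg = sat_gradient_sum(saturation_channel(background_rgb))
    return (g_blend - g_bg) / (h * w)
```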

Fig. 7. Calculation steps of the saturation loss: 1) convert the image from RGB space to HSV space and extract the S channel; 2) calculate the saturation gradient on the original image and the blended image respectively; 3) compute the average difference of the two saturation gradient values

4 Experiments

In this section, we describe the experimental setup, compare our method with other classical methods, and conduct ablation experiments on our method. The SSIM and PSNR scores of each image are tested multiple times to ensure data stability.

4.1 Experiment Setup

The experimental settings are shown in Table 1 below. In the loss function, the weight of the gradient loss in the first stage is set higher, because the main goal of the first stage is to solve the gradient problem and make the blending edge smoother. In the second stage, in order to solve the lighting and texture problems of the fused image, the weights of the style loss and saturation loss are set to 10^5.

Table 1. Experimental hyperparameter settings

Parameter   λ_grad   λ_style   λ_cont   λ_sat
Stage 1     10^4     10^3      1        0
Stage 2     0        10^5      1        10^5

4.2 Result Comparison

Fig. 8. Comparison of the effect of image blending between our method and other methods on the same input

As shown in Fig. 8, the size of all the images is 512 × 512. The copy-paste method copies the source image to the corresponding location on the destination image. Deep reconciliation requires training a neural network to learn visual patterns from a set of images and use them to create realistic compositions or remove unwanted elements. Poisson blending computes the gradients of two images and minimizes


their difference to seamlessly blend the two images. This technique preserves the overall structure of an image while describing the flow of colours from one image to another. Finally, GP-GAN is a generative adversarial network (GAN) that uses a pre-trained generator network to generate high-quality images similar to the training data; the generator network is pretrained on large image datasets and then fine-tuned on smaller datasets to generate higher-quality images. Unfortunately, these methods produce results with unrealistic borders and lighting.

As can be seen from the qualitative results in Fig. 8, our algorithm produces the most visually appealing results in terms of blending borders, textures, and colour lighting. Copy-paste image fusion requires precise control over the alignment of the target object with the background image, otherwise it leads to obvious incongruity and artefacts, and the inconsistency of image style produces obvious artificial boundaries. GP-GAN produces worse visual results at the blending boundary and in colour lighting, where overall colours are darker and edges are not handled well; while it brings rich colour to the raw edges of an image, it introduces inconsistencies in style and texture. Compared with these algorithms, our method erodes and dilates the mask edges, which prevents jagged edges or visible artefacts, and the two-stage algorithm adds more consistent style and texture to the blended image.

Furthermore, we follow the experimental protocol of the deep image blending algorithm and conduct comparative experiments on 20 sets of data. We compare the results of these methods on PSNR (peak signal-to-noise ratio), SSIM (structural similarity) and MSE (mean squared error), as shown in Table 2. It can be seen that the average performance of our method achieves the best results on PSNR and SSIM, while its MSE is slightly worse than Poisson Blending. This is mainly because our method does not simply migrate the source image to the target image, but further refines the blended result for style and saturation consistency to make the generated picture more realistic, resulting in a slightly larger fitting error (MSE). Compared with Deep Image Blending, the images generated by our optimized model perform better in terms of PSNR, SSIM, and MSE, which also illustrates the superiority of our model.

Table 2. Quantitative comparison of average results between our method and other methods on PSNR, SSIM and MSE metrics

Method                 PSNR    SSIM   MSE
GP-GAN                 19.94   0.73   833.81
Poisson Blending       21.77   0.71   472.46
Deep Image Blending    22.11   0.72   712.64
Copy and Paste         17.98   0.57   866.41
Ours                   23.59   0.78   617.51

4.3 Ablation Study

Fig. 9. The results (512 × 512) of the ablation experiment: (+) and (-) respectively indicate that a certain part of the algorithm participates or does not participate

We take three images as examples to conduct ablation experiments to analyze the role of the PAN component and the saturation loss in our method. From Fig. 9, we can observe the following:

PAN in blending refinement: we keep only the saturation loss and remove the PAN component from the model. Experimental results show that the image resolution is obviously reduced, resulting in a loss of clarity.

Saturation loss in the second stage: if we remove the saturation loss and keep the PAN module, the object keeps its original saturation in the resulting image, which is not consistent with the background.

Both PAN and saturation loss: when both components are used at the same time, the generated results are more realistic, and the style of the source and target images is more consistent, especially in the blending edge area.


Table 3. Quantitative results of our algorithm's ablation experiments on 3 sets of images, where the three values in each cell represent the results on different images

Metrics                 PSNR                 SSIM             MSE
Baseline                17.85/18.32/17.99    0.57/0.61/0.67   718.20/661.94/594.32
+PAN                    20.77/20.16/22.09    0.74/0.69/0.79   543.98/721.69/401.49
+Saturation Loss        19.79/19.94/22.29    0.6/0.62/0.81    681.91/658.20/383.38
+PAN+Saturation Loss    29.57/24.58/23.64    0.72/0.83/0.79   612.95/568.47/402.93

Table 3 further gives the quantitative results of the ablation experiments; the three numbers in each cell are the results for the 3 images. As far as PSNR is concerned, the combination of PAN + saturation loss performs best. Regarding SSIM, using PAN alone may give better results. Finally, for MSE, using both PAN and saturation loss does not always perform best, since our method does not simply fit a merge of the source and target images.

5 Conclusion and Future Work

In our work, we address the low accuracy and low efficiency of manually cut masks by generating masks through object detection and segmentation algorithms. Specifically, we combine the DINO and SAM algorithms to generate masks. Compared with the traditional RCNN algorithm, the resulting mask covers objects better and generalizes more strongly. We perform erosion and dilation operations on the mask to avoid sharp protrusions. However, this step has a limitation: if an object A overlaps with another object B, performing image blending after eroding and dilating the mask of A may introduce B's information into the blending process, which may lead to unrealistic results. Finally, we also propose a new loss, called saturation loss, to address brightness distortion in generated images. Results on multiple image datasets show that our method can outperform previous image fusion methods such as GP-GAN and Poisson image editing. Future work includes proposing new evaluation criteria that better reflect human perception and aesthetics to improve the objectivity and accuracy of evaluation. Another potential research direction is how to deal with object occlusion in image fusion.

6 Authorship Statement

Haochen Xue proposed the idea of automatic mask generation and constructed the overall framework of the image fusion project. Mingyu Jin used PAN to process images, resulting in higher-quality images. Chong Zhang proposed a new loss and used erosion and dilation to optimize the mask. Yuxuan Huang completed the comparison and ablation experiments. Qian Weng participated in the revision of the article and undertook the polishing of the article. Xiaobo Jin supervised this work and made a comprehensive revision and reconstruction.


Acknowledgments. This work was partially supported by the “Qing Lan Project” in Jiangsu universities, National Natural Science Foundation of China under No. U1804159 and Research Development Fund with No. RDF-22-01-020.

References

1. Alsaiari, A., Rustagi, R., Thomas, M.M., Forbes, A.G., et al.: Image denoising using a generative adversarial network. In: 2019 IEEE 2nd International Conference on Information and Computer Technologies (ICICT), pp. 126–132. IEEE (2019)
2. Arjovsky, M., Bottou, L.: Towards principled methods for training generative adversarial networks. arXiv:1701.04862 (2017)
3. Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
4. Chen, J., Yang, Z., Zhang, L.: Semantic segment anything (2023). www.github.com/fudan-zvg/Semantic-Segment-Anything
5. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
6. Gonzalez, R.C.: Digital Image Processing. Pearson Education, India (2009)
7. Goodfellow, I., et al.: Generative adversarial networks. Commun. ACM 63(11), 139–144 (2020)
8. Kirillov, A., et al.: Segment anything. arXiv:2304.02643 (2023)
9. Ledig, C., et al.: Photo-realistic single image super-resolution using a generative adversarial network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4681–4690 (2017)
10. Leventhal, D., Gordon, B., Sibley, P.G.: Poisson image editing extended. In: Finnegan, J.W., McGrath, M. (eds.) International Conference on Computer Graphics and Interactive Techniques, SIGGRAPH 2006, Boston, Massachusetts, USA, July 30–August 3, 2006, Research Posters, p. 78. ACM (2006)
11. Liu, S., Qi, L., Qin, H., Shi, J., Jia, J.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759–8768 (2018)
12. Monteiro, M., Figueiredo, M.A.T., Oliveira, A.L.: Conditional random fields as recurrent neural networks for 3D medical imaging segmentation. arXiv:1807.07464 (2018)
13. Pérez, P., Gangnet, M., Blake, A.: Poisson image editing. ACM Trans. Graph. 22(3), 313–318 (2003)
14. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
15. Wang, K., Liew, J.H., Zou, Y., Zhou, D., Feng, J.: PANet: few-shot image semantic segmentation with prototype alignment. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9197–9206 (2019)
16. Wu, H., Zheng, S., Zhang, J., Huang, K.: GP-GAN: towards realistic high-resolution image blending. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 2487–2495 (2019)
17. Xie, G., et al.: SAM: self-attention based deep learning method for online traffic classification. In: Proceedings of the Workshop on Network Meets AI & ML, pp. 14–20 (2020)
18. Zhang, H., et al.: DINO: DETR with improved denoising anchor boxes for end-to-end object detection. arXiv:2203.03605 (2022)
19. Zhang, H., Han, X., Tian, X., Jiang, J., Ma, J.: Image fusion meets deep learning: a survey and perspective. Inf. Fusion 76, 323–336 (2021)
20. Zhang, L., Wen, T., Shi, J.: Deep image blending. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 231–240 (2020)

Design of Memristor-Based Binarized Multi-layer Neural Network with High Robustness

Xiaoyang Liu1, Zhigang Zeng2, and Rusheng Ju1(B)

1 College of Systems Engineering, National University of Defense Technology, Changsha, China
[email protected], [email protected]
2 School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, China
[email protected]

Abstract. Memristor-based neural networks are promising to alleviate the bottleneck of neuromorphic computing devices based on the von Neumann architecture. Various memristor-based neural networks, built with different memristor-based layers, have been proposed in recent years. However, memristor-based neural networks with full-precision weight values are affected by memristor conductance variations, which have negative impacts on network performance. In contrast, binarized neural networks have only two weight states, so binarized neural networks built with memristors suffer little from conductance variations. In this paper, a memristor-based binarized fully connected layer and a memristor-based batch normalization layer are designed. Based on the proposed layers, a memristor-based binarized multi-layer neural network is built. The effectiveness of the network is substantiated through simulation experiments on pattern classification tasks. The robustness of the network is also explored, and the results show that the network has high robustness to conductance variations.

Keywords: Artificial intelligence · Neural networks · Binarized neural network · Memristor

1 Introduction

Memristor-based neural networks (MNNs) have the ability of in-memory computing, alleviating the "memory wall" problem existing in traditional von Neumann architecture-based computing devices [1–10]. They also have the characteristic of parallel computing, which can further accelerate the computing process of neural networks. However, subject to manufacturing processes, memristors exhibit conductance variations: the conductance cannot be precisely adjusted, so weight values cannot be written precisely to the memristor conductance. The resulting conductance variations cause inaccurate weight expression and lead to calculation errors. Binarized neural networks (BNNs) [11], in contrast, have only two weight states, which can be represented by the high and low conductance states of a memristor, respectively. It is much easier to adjust the memristor conductance to the high and low states than to the full-precision values needed in traditional MNNs, so memristor-based BNNs suffer little from conductance variations.

There have been some works on memristor-based BNNs [12–16]. In [12], a kind of MNN is designed based on memristor binary synapses. A memristor crossbar array is proposed in [13] to implement binarized neural networks. A binary memristor crossbar is tested on MNIST recognition in [14]. In [15], a BNN accelerator is built based on binary memristors, and the accelerator shows high robustness. A memristor-based BNN proposed in [16] shows high parallel computing capability. However, previous works have mostly focused on concept validation rather than a complete design of memristor-based binarized networks. The main contributions of this paper are summarized as follows:

1. A memristor-based binarized fully connected (FC) layer and a memristor-based batch normalization (BN) layer are designed.
2. Based on these layers, a memristor-based binarized multi-layer neural network (MBMNN) is built.
3. The effectiveness of MBMNN is substantiated through simulation experiments on pattern classification tasks.

MBMNN is robust to conductance variations because only two states are used. The adjustment of the memristor conductance of MBMNN is also convenient compared with MNNs with full-precision conductance values, since MBMNN does not need precise writing voltages to adjust the conductance. MBMNN aims to accelerate the inference process of neural networks.

This work was supported by the Natural Science Foundation of China under Grants 62206306, 62103425, U1913602, and 61936004, the Natural Science Foundation of Hunan Province under Grant 2022JJ40559, the National Key R&D Program of China under Grant 2021ZD0201300, the Innovation Group Project of the National Natural Science Foundation of China under Grant 61821003, and the 111 Project on Computational Intelligence and Intelligent Control under Grant B18024.

2 Memristor-Based Layers

In this section, the adopted memristor model is introduced, and based on the memristor model, the memristor-based binarized FC layer and the BN layer are designed. The binary activation function is also introduced.

2.1 Memristor Model

The memristor is an electronic device whose resistance or conductance could be adjusted by applying external stimuli [17,18]. The memristor model used here is


the AIST memristor model, which depicts the behavior of the AIST memristor [19,20]. Other memristors with a voltage threshold and a fast-switching characteristic are also appropriate. In the AIST model,

v(t) = i(t) R(t),   (1)

R(t) = R_{on} x(t) + R_{off} (1 - x(t)),   (2)

where t is the time variable, v(t) is the voltage applied to the memristor, i(t) is the current flowing through the memristor, R(t) is the resistance of the memristor, x(t) = w(t)/D where w(t) is the internal state variable and D is the device thickness, and R_{on} and R_{off} are the minimum and maximum resistance values, respectively. The state variable evolves as

dx(t)/dt = \begin{cases} \mu_v \frac{R_{on}}{D^2} \frac{i_{off}}{i(t) - i_0} f(x(t)), & 0 < V_{on} < v(t) \\ 0, & V_{off} \le v(t) \le V_{on} \\ \mu_v \frac{R_{on}}{D^2} \frac{i(t)}{i_{on}} f(x(t)), & v(t) < V_{off} < 0 \end{cases}   (3)

where \mu_v is the average ion mobility; i_{on}, i_{off}, and i_0 are constants; and f(x(t)) is the window function. V_{on} is the positive threshold voltage, and voltages larger than V_{on} increase the conductance; V_{off} is the negative threshold voltage, and voltages smaller than V_{off} decrease the conductance.
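To make the dynamics of Eqs. (1)-(3) concrete, the sketch below numerically integrates the state equation with a simple Euler step. All parameter values and the Joglekar-style window function are illustrative assumptions, not values taken from the paper or from the AIST device references.

```python
import numpy as np

def window(x, p=2):
    """Joglekar-style window keeping x within [0, 1] (illustrative choice)."""
    return 1.0 - (2.0 * x - 1.0) ** (2 * p)

def simulate_memristor(v, dt=1e-6, x0=0.5,
                       R_on=100.0, R_off=16e3, D=10e-9, mu_v=1e-14,
                       V_on=0.2, V_off=-0.2, i_on=1e-4, i_off=1e-5, i_0=1e-7):
    """Euler integration of the threshold-type state equation (Eq. 3).
    v is an array of applied voltages sampled every dt seconds."""
    x = x0
    xs, Rs = [], []
    for vk in v:
        R = R_on * x + R_off * (1.0 - x)           # Eq. (2)
        i = vk / R                                  # Eq. (1)
        if vk > V_on:                               # positive write: conductance increases
            dx = mu_v * R_on / D**2 * i_off / (i - i_0) * window(x)
        elif vk < V_off:                            # negative write: conductance decreases
            dx = mu_v * R_on / D**2 * i / i_on * window(x)
        else:                                       # below threshold: the state is retained
            dx = 0.0
        x = float(np.clip(x + dx * dt, 0.0, 1.0))
        xs.append(x); Rs.append(R)
    return np.array(xs), np.array(Rs)

# Example: a train of positive write pulses gradually drives the device toward R_on.
pulse = np.tile(np.r_[np.full(50, 1.0), np.zeros(50)], 20)
states, resistances = simulate_memristor(pulse)
```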

2.2 Memristor-Based Binarized FC Layer

The memristor-based binarized FC layer is shown in Fig. 1. In the inference phase, S_1 is closed and the input voltage V^i_m (m = 1, 2, ..., M) is applied to the layer. V_b is the bias voltage, and V^o_n (n = 1, 2, ..., N) is the output voltage, which is the input to the activation function or the next layer. In the forward propagation,

V_f = -R_f \left( \sum_{m=1}^{M} \frac{V^i_m}{R_s} + \frac{V_b}{R_s} \right),   (4)

and the current of each column is

I_n = \sum_{m=1}^{M} \frac{V^i_m}{R_{mn}} + \frac{V_b}{R_{bn}} + \frac{V_f}{R_f} = \sum_{m=1}^{M} V^i_m (G_{mn} - G_s) + V_b (G_{bn} - G_s),   (5)

where n = 1, 2, ..., N, and V^i_m = V_r · x_m, where V_r is the read voltage and x_m is the real input value. R_{mn} and G_{mn} are the resistance and conductance values of the memristor-based synapse at the cross point of the mth row and the nth column, respectively. The weight is represented by G_{mn} - G_s, where G_s = (G_{on} + G_{off})/2. When G_{mn} = G_{on}, G_{mn} - G_s = (G_{on} - G_{off})/2 represents the weight value of +1, and when G_{mn} = G_{off}, G_{mn} - G_s = (G_{off} - G_{on})/2 represents the weight value of -1. The output voltage of each column is

V_n = -I_n R_a.   (6)

When adjusting the conductance of the memristors in the nth column, S_1 is open and S_2 is closed, and the row writing voltages V^w_{rm} and the column writing voltages V^w_{cn} and V^w_b are applied to the memristor crossbar. Writing voltages are thus applied to the corresponding memristor to update its conductance to the high or the low state.
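A small numpy sketch of the column currents and output voltages in Eqs. (5)-(6) may help; the conductance and voltage values are illustrative assumptions, and the code is a circuit-level view rather than the authors' simulation setup.

```python
import numpy as np

def binarized_fc_forward(x, W_sign, b_sign,
                         V_r=0.1, V_b=0.1, G_on=1e-2, G_off=1e-4, R_a=1e3):
    """Binary weights are mapped to high/low conductances, column currents are
    summed (Eq. 5), and the output voltage is -I_n * R_a (Eq. 6).
    x: real inputs (M,); W_sign: +/-1 weights (M, N); b_sign: +/-1 biases (N,)."""
    G_s = (G_on + G_off) / 2.0
    G = np.where(W_sign > 0, G_on, G_off)          # weight +1 -> G_on, -1 -> G_off
    G_b = np.where(b_sign > 0, G_on, G_off)
    V_in = V_r * x                                  # read voltages V_m^i = V_r * x_m
    I = V_in @ (G - G_s) + V_b * (G_b - G_s)        # Eq. (5), one current per column
    V_out = -I * R_a                                # Eq. (6)
    return V_out

# The sign of V_out is opposite to the weighted sum, which is why the comparator
# of the binary activation function takes it on its negative input port.
rng = np.random.default_rng(0)
x = rng.random(784)
W = np.sign(rng.standard_normal((784, 1024)))
b = np.sign(rng.standard_normal(1024))
v = binarized_fc_forward(x, W, b)
```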


Fig. 1. The memristor-based binarized FC layer. V^i_m (m = 1, 2, ..., M) is the input and V^o_n (n = 1, 2, ..., N) is the output of the layer. V^w_{rm}, V^w_{cn}, and V^w_b are writing voltages that update the memristor conductance.


Fig. 2. The memristor-based BN layer. V^i_1 to V^i_M are input voltages, and V̄^i is the average value of the input voltages of one mini-batch. G_{γ,m} (m = 1, 2, ..., M) is the conductance of the memristor representing the mth value of the parameter vector γ, and G_{β,m} represents the value of the parameter β. V^w_{rm}, V^w_{cn}, and V^w_b are writing voltages that update the memristor conductance.

Fig. 3. The binary activation function.

2.3 Memristor-Based Batch Normalization Layer

Batch normalization is used to normalize the outputs of middle layers. Parameters in the BN layer of a BNN are full-precision values, so the memristor conductances here are not binary and middle conductance states are also used. The memristor-based BN layer is shown in Fig. 2. The output current of each column is

I_m = (V^i_m - \bar{V}^i)(G_{\gamma,m} - G_s) + V_b (G_{\beta,m} - G_s),   (7)

where m = 1, 2, ..., M, V^i_m is the mth input voltage, V_b is the bias voltage, G_{\gamma,m} and G_{\beta,m} are the memristor conductance values representing the parameters γ and β, respectively, and \bar{V}^i is the average value of the input voltages of one mini-batch,

\bar{V}^i = \frac{V_r}{Z} \sum_{z=1}^{Z} x_z,   (8)

where Z is the number of input patterns in one mini-batch. The output voltage is

V^o_m = -I_m R_a.   (9)

The memristor conductance is updated by opening S_1 and closing S_2. The row writing voltage V^w_{rm} and the column writing voltages V^w_{cm} and V^w_b are applied to the memristor in the mth row and the mth column. The pulse width of the writing voltages can be determined by a lookup table which records the relation between conductance values and pulse widths of writing voltages.
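As with the FC layer, a short numpy sketch of Eqs. (7)-(9) is given below; conductance and voltage values are illustrative assumptions.

```python
import numpy as np

def bn_layer_forward(V_batch, G_gamma, G_beta,
                     V_b=0.1, R_a=1e3, G_on=1e-2, G_off=1e-4):
    """Crossbar view of the BN layer: subtract the mini-batch mean voltage (Eq. 8),
    scale by the gamma conductances and shift by the beta conductances (Eq. 7),
    then read the output as -I_m * R_a (Eq. 9).
    V_batch: input voltages of one mini-batch, shape (Z, M)."""
    G_s = (G_on + G_off) / 2.0
    V_bar = V_batch.mean(axis=0)                                     # Eq. (8)
    I = (V_batch - V_bar) * (G_gamma - G_s) + V_b * (G_beta - G_s)   # Eq. (7)
    return -I * R_a                                                  # Eq. (9)

# Example with random full-precision gamma/beta conductances for M = 1024 columns.
rng = np.random.default_rng(0)
out = bn_layer_forward(rng.random((32, 1024)) * 0.1,
                       rng.uniform(1e-4, 1e-2, 1024),
                       rng.uniform(1e-4, 1e-2, 1024))
```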

2.4 Binary Activation Function

The binary activation function (BAF) can be realized by a comparator [21]. The BAF circuit is shown in Fig. 3. V^i receives the output of the previous layer. Because the sign of the output voltage of the previous layer is the opposite of the actual value, V^i is fed to the negative port of the comparator. The output is

V^o = \begin{cases} V_H, & V^i \le 0 \\ V_L, & V^i > 0 \end{cases}   (10)

where V_H and V_L are the high and low voltage levels, respectively. The BAF circuit can be placed at the output ports of layers as needed.

3 Simulation Experiments

Because only two conductance states are used in MBMNN, it suffers little from variations of the memristor conductance, and an ex-situ training method is appropriate. MBMNN is built from the proposed FC and BN layers. The network structure is shown in Table 1. FC and BN denote the FC layer and the BN layer, respectively, and the content represents the number of output units (FC layer and BAF) or the output channels (BN layer). The network model is first trained in software, and then the trained weights are downloaded to the memristors to perform inference tasks. The robustness of MBMNN to memristor conductance variations is also discussed.

Table 1. The Structure of MBMNN

Layer   Units
FC1     1024
BN2     1024
BAF     1024
FC2     10
BN2     10

3.1 Pattern Classification Result

The pattern classification simulation experiment is carried out on the MNIST (Modified National Institute of Standards and Technology) [22] dataset. Image pixel values are first converted to input voltages through

V^i = x V_r,   (11)

where x is the pixel value. The training dataset is divided into a train set and a validation set. The test error rate is obtained with the network that has the best validation error rate. For clear illustration, the weight values of every training epoch are written to MBMNN to obtain train and test errors. Results are shown in Fig. 4. The test error rate is about 1.98%. It can be seen that MBMNN has satisfying performance in classification tasks.

Fig. 4. Curves of train and test errors of MBMNN with training epochs.

Fig. 5. The impact of different weight noise levels on test error rates of MBMNN.

Fig. 6. The impact of different input noise levels on test error rates of MBMNN.

3.2 Impact of Noises

In practice, there are variations in the resistance or conductance adjustment of memristors, and other factors, such as the supply voltage and changes in temperature, also cause conductance variations. These variations lead to weight noises. The supply voltages may also have errors, which lead to input noises. These noises result in imprecise computation in MBMNN. To study the impact of the noises on MBMNN, Gaussian noises with expectation 0 and standard deviation from 2% to 16% (with an interval of 2%) of the current resistance value are added as weight noises in the FC layer and the BN layer, and Gaussian noises with expectation 0 and standard deviation from 5% to 30% (with an interval of 5%) of the input voltages are added to the supply voltages as input noises. The conductance value and input voltage after adding noises are

G_{aft} = 1 / \left( \frac{1}{G_{bef}} \times (1 + s) \right) = \frac{G_{bef}}{1 + s},   (12)

V^i_{aft} = V^i_{bef} (1 + s),   (13)

where G_{bef} and G_{aft} are the conductance values before and after adding noises, respectively, and V^i_{bef} and V^i_{aft} are the input voltages before and after adding noises, respectively. The impacts of weight noises and input noises are shown in Fig. 5 and Fig. 6, respectively. To substantiate the robustness of MBMNN to conductance variations, a comparison of the test errors of MBMNN with binary conductance states and an MNN with full-precision conductance states is shown in Fig. 7. The two networks have the same structure. It can be seen that when the noise level is low, the latter performs better, but as the noise level increases, its test error rises fast and exceeds that of MBMNN when the noise level exceeds about 12%.
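A minimal numpy sketch of the noise model in Eqs. (12)-(13) is shown below; the conductance values and layer sizes are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def add_weight_noise(G, level):
    """Perturb memristor conductances following Eq. (12): a Gaussian factor s is
    applied to the resistance, so G_aft = G_bef / (1 + s)."""
    s = rng.normal(0.0, level, size=G.shape)
    return G / (1.0 + s)

def add_input_noise(V, level):
    """Perturb input (read) voltages following Eq. (13): V_aft = V_bef * (1 + s)."""
    s = rng.normal(0.0, level, size=V.shape)
    return V * (1.0 + s)

# Example: sweep weight-noise levels from 2% to 16% in 2% steps, as in the paper.
G = rng.choice([1e-2, 1e-4], size=(784, 1024))   # binary conductance states (illustrative)
for level in np.arange(0.02, 0.17, 0.02):
    G_noisy = add_weight_noise(G, level)
    # ... evaluate the network with G_noisy and record the test error
```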

Fig. 7. Test errors of MNNs with binary conductance values and full precision conductance values under different conductance variation levels.

Therefore, MBMNN is more robust to conductance variations. Moreover, writing full-precision weight values to memristor conductances is more complex than writing binary weight values.

4 Conclusions

The in-memory and parallel computing abilities of memristor devices make them promising for building acceleration devices for artificial neural networks. A memristor-based BN layer and a binarized FC layer are presented in this paper, and based on these layers MBMNN is constructed. MBMNN suffers little from the conductance variations of memristors, and its conductance values are also easier to adjust compared with MNNs with full-precision weight values. MBMNN is validated through pattern classification tasks. The simulation experiments also show that MBMNN has high robustness to memristor conductance variations, so it is applicable for hardware realization. Future work will focus on realizing more complex MNNs such as memristor-based convolutional neural networks.

References

1. Yakopcic, C., Hasan, R., Taha, T.: Memristor based neuromorphic circuit for ex-situ training of multi-layer neural network algorithms. In: International Joint Conference on Neural Networks, Killarney, Ireland, pp. 1–7. IEEE (2015)
2. Tanaka, G., Nakane, R., Yamane, T., et al.: Waveform classification by memristive reservoir computing. In: Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, E.S. (eds.) ICONIP 2017. LNCS, vol. 10637, pp. 457–465. Springer, Cham (2017)
3. Li, C., Belkin, D., Li, Y., et al.: Efficient and self-adaptive in-situ learning in multilayer memristor neural networks. Nature Commun. 9(1), 2385 (2018)
4. Yao, P., Wu, H., Gao, B., et al.: Fully hardware-implemented memristor convolutional neural network. Nature 577(7792), 641–646 (2020)
5. Cao, Z., Sun, B., Zhou, G., et al.: Memristor-based neural networks: a bridge from device to artificial intelligence. Nanoscale Horizons 8(6), 716–745 (2023)
6. Yi, S., Kendall, J., Williams, R., et al.: Activity-difference training of deep neural networks using memristor crossbars. Nat. Electron. 6(1), 45–51 (2023)
7. Li, Y., Su, K., Zou, X., et al.: Research progress of neural synapses based on memristors. Electronics 12(15), 3298 (2023)
8. Bak, S., Park, J., Lee, J., et al.: Memristor-based CNNs for detecting stress using brain imaging signals. IEEE Trans. Emerg. Topics Comput. Intell. 1–10 (2023). https://doi.org/10.1109/TETCI.2023.3297841
9. Liu, X., Zeng, Z., Wunsch, D., II.: Memristor-based LSTM network with in situ training and its applications. Neural Netw. 131, 300–311 (2020)
10. Liu, X., Zeng, Z., Wunsch, D., II.: Memristor-based HTM spatial pooler with on-device learning for pattern recognition. IEEE Trans. Syst. Man Cybern. Syst. 52(3), 1901–1915 (2022)
11. Courbariaux, M., Hubara, I., Soudry, D., et al.: Binarized neural networks: training deep neural networks with weights and activations constrained to +1 or -1. arXiv preprint arXiv:1602.02830 (2016)
12. Secco, J., Poggio, M., Corinto, F.: Supervised neural networks with memristor binary synapses. Int. J. Circuit Theory Appl. 46(1), 221–233 (2018)
13. Kim, Y., Jeong, W., Tran, S., et al.: Memristor crossbar array for binarized neural networks. AIP Adv. 9(4), 045131 (2019)
14. Pham, K., Tran, S., Nguyen, T., Min, K.: Asymmetrical training scheme of binary-memristor-crossbar-based neural networks for energy-efficient edge-computing nanoscale systems. Micromachines 10(2), 141 (2019)
15. Qin, Y., Kuang, R., Huang, X., Li, Y., Chen, J., Miao, X.: Design of high robustness BNN inference accelerator based on binary memristors. IEEE Trans. Electron Dev. 67(8), 3435–3441 (2020)
16. Chen, J., Wen, S., Shi, K., Yang, Y.: Highly parallelized memristive binary neural network. Neural Netw. 144, 565–572 (2021)
17. Strukov, D., Snider, G., Stewart, D., et al.: The missing memristor found. Nature 453(7191), 80 (2008)
18. Dai, Y., Li, C.: An expanded HP memristor model for memristive neural network. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds.) ICONIP 2012. LNCS, vol. 7667, pp. 647–653. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-34500-5_76
19. Li, Y., Zhong, Y., Zhang, J., et al.: Activity-dependent synaptic plasticity of a chalcogenide electronic synapse for neuromorphic systems. Sci. Rep. 4, 4906 (2014)
20. Zhang, Y., Li, Y., Wang, X., et al.: Synaptic characteristics of Ag/AgInSbTe/Ta-based memristor for pattern recognition applications. IEEE Trans. Electron Dev. 64(4), 1806–1811 (2017)
21. Zhang, Y., Wang, X., Friedman, E.: Memristor-based circuit design for multilayer neural networks. IEEE Trans. Circuits Syst. I Regul. Pap. 65(2), 677–686 (2018)
22. Yann, L., Léon, B., Yoshua, B., et al.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)

FEDEL: Frequency Enhanced Decomposition and Expansion Learning for Long-Term Time Series Forecasting

Rui Chen, Wei Cui, Haitao Zhang(B), and Qilong Han

College of Computer Science and Technology, Harbin Engineering University, Harbin 150006, China
{zhanghaitao,hanqilong}@hrbeu.edu.cn

Abstract. Long-term Time Series Forecasting (LTSF) is widely used in various fields, for example, power planning. LTSF requires models to capture subtle long-term dependencies in time series effectively. However, several challenges hinder the predictive performance of existing models, including the inability to exploit the correlation dependencies in time series fully, the difficulty in decoupling the complex cycles of time series in the time domain, and the error accumulation of iterative multi-step prediction. To address these issues, we design a Frequency Enhanced Decomposition and Expansion Learning (FEDEL) model for LTSF. The model has a linear complexity with three distinguishing features: (i) an extensive capacity depth regime that can effectively capture complex dependencies in long-term time series, (ii) decoupling of complex cycles using sparse representations of time series in the frequency domain, (iii) a direct multi-step prediction strategy to generate the prediction series, which can improve the prediction speed and avoid error accumulation. We have conducted extensive experiments on eight real-world large-scale datasets. The experimental results demonstrate that the FEDEL model performs significantly better than traditional methods and outperforms the current SOTA model in the field of LTSF in most cases.

Keywords: LTSF · Series Decomposition · Frequency Enhancement

1 Introduction

Time series are widely available in energy, transportation, healthcare, finance, etc. Time series forecasting plays a crucial role in these areas, for example in early warning, advance planning, and resource scheduling. There is an urgent need to further extend the prediction horizon in these practical applications. Over the past decades, solutions to time series forecasting have evolved from traditional statistical methods (e.g., ARIMA [1]) and machine learning techniques (e.g., GBRT [5]) to deep learning-based solutions (e.g., LSTM [9]), achieving better and better prediction performance. However, most existing models are only designed for short-term series forecasting, and their performance decreases sharply with increasing prediction length.

In the field of Long-term Time Series Forecasting (LTSF), three major challenges have hindered the performance of existing models. First, long-term time series have complex correlations, yet most existing studies process time series directly without considering how to tap into those unique correlations [13]. Following simple assumptions, some models based on series decomposition can only capture certain coarse and common features [15] and fail to analyze complex dependencies comprehensively. Second, a typical real-world time series is usually composed of various periods. For example, Fig. 1 shows a region's 3-year time series of hourly electricity loads. Stripping out different highly coupled cycles in the time domain is difficult. Third, traditional autoregressive or iterative multi-step (IMS) forecasting models produce significant error accumulation effects in forecasting. This inherent drawback makes it impossible to apply most traditional models to LTSF.

This work was supported by the National Key R&D Program of China under Grant No. 2020YFB1710200 and the Heilongjiang Key R&D Program of China under Grant No. GA23A915.

Fig. 1. We visualize the electricity load time series in a region of California to show diversified periods. In the upper part, we depict the whole time series with the length of three years, and in the bottom part, we plot three segments with the lengths of half year, one month, and one week, respectively.

To address the three aforementioned challenges, we propose a novel Frequency Enhanced Decomposition and Expansion Learning (FEDEL) model for LTSF. Inspired by the seasonal-trend decomposition method [4], FEDEL's core idea is to build a deep neural network that processes the trend and seasonal parts separately. For the trend part, we capture short-term local and long-term global dependencies. For the seasonal part, we perform a layer-by-layer extension enhancement to analyze the complex dependencies. Finally, the trend and seasonal parts are combined for end-to-end learning. The contributions of this paper are summarized as follows:

• For complex correlations in long-term time series, we propose an extension architecture based on residual learning [6] and build a large-capacity depth system to model complex dependencies in long-term time series.


• For the difficulty of decoupling the cycles in the time domain, we transform the series to the frequency domain with the help of Fourier analysis. We decouple the series using a Frequency Enhanced Module (FEM) and enhance the appropriate cycles using learnable parameters.
• To address the error accumulation problem of IMS prediction, our model adopts a direct multi-step (DMS) prediction strategy in the LTSF process and performs vector output to avoid error accumulation.

We have conducted extensive experiments on publicly available real-world datasets, including univariate and multivariate time series forecasting. FEDEL demonstrates superior performance in handling LTSF. Compared to traditional models (LSTM, TCN), the prediction error can be reduced by more than 50% and even up to 80% on some datasets. FEDEL can also achieve better results on most datasets than the current SOTA LTSF models.

2 Related Work

Time series forecasting is a topic that has been studied for a long time, and various models have been well-developed over the decades. In the early stages, researchers developed simple and effective statistical modeling methods, such as Exponential Moving Average (EMA) [7], Autoregressive Moving Average (ARMA) [16], Autoregressive Integrated Moving Average (ARIMA) [3], the unified state-space modeling approach, as well as various other extensions. However, these statistically based methods only consider the linear dependencies of the future time series signal on past observations. Researchers use hybrid designs combining statistically based methods with deep learning to deal with more complex dependencies. DeepAR [13] uses RNNs combined with autoregressive techniques to model the probability distribution of future series. LSTNet [9] introduces a CNN with a Recurrent and Recurrent-skip layer to capture both short-term and long-term temporal patterns. N-Beats [12] proposes a deep neural architecture based on backward and forward residual links. Many studies based on the Temporal Convolutional Network (TCN) [2,11] have attempted to model temporal causality using causal convolution. The models above mainly focus on modeling temporal relationships through recurrent connections, temporal attention, or causal convolution.

Since its introduction, Transformer [14] has demonstrated excellent performance in sequence modeling, such as in Natural Language Processing (NLP) and Computer Vision (CV). However, applying Transformer to LTSF is computationally prohibitive because of the quadratic complexity in the sequence length L in both memory and time. LogTrans [10] introduces local convolution and LogSparse attention to Transformer, which reduces the complexity to O(L(log L)^2). Reformer [8] uses Locality-Sensitive Hashing (LSH) attention to reduce the complexity to O(L log L). Informer [18] extends Transformer with KL-divergence-based ProbSparse attention and achieves O(L log L) complexity. Autoformer [17] uses Auto-Correlation instead of the self-attention mechanism to


achieve efficient sub-series connectivity and reduce the complexity to O(L log L). FEDformer [19] proposes Fourier-enhanced and wavelet-enhanced blocks that can capture essential structures in time series through frequency-domain mapping; randomly selecting a fixed number of Fourier components reduces the model complexity to O(L).

3 Definition of LTSF

The problem of LTSF has been studied for a long time, but there has yet to be a clear definition for the concept of LTSF. The LTSF problem in this paper is defined as follows:

(i) The lookback window has a fixed size W.
(ii) Suppose that the input at moment t is X^t = {x^t_1, x^t_2, ..., x^t_W | x^t_i ∈ R^{d_x}}, and the corresponding predicted output is Y^t = {y^t_1, y^t_2, ..., y^t_L | y^t_i ∈ R^{d_y}}. The output length L is at least equal to the input length W and can be much larger than W.
(iii) The output can be either univariate (d_x = 1, d_y = 1, or d_x > 1, d_y = 1) or multivariate (d_x > 1, d_y > 1).
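A minimal sketch of how such (X^t, Y^t) pairs can be built from a raw series follows; the window and horizon values used in the example are illustrative assumptions.

```python
import numpy as np

def make_windows(series, W, L):
    """Build (input, target) pairs for LTSF: series has shape (T, d);
    each input covers W past steps and each target the next L steps."""
    T = len(series)
    xs, ys = [], []
    for t in range(T - W - L + 1):
        xs.append(series[t:t + W])          # X^t, shape (W, d_x)
        ys.append(series[t + W:t + W + L])  # Y^t, shape (L, d_y) (here d_y = d_x)
    return np.stack(xs), np.stack(ys)

# Example: a univariate hourly signal with lookback W = 96 and horizon L = 336 (L >= W).
load = np.sin(np.arange(10000) * 2 * np.pi / 24)[:, None]
X, Y = make_windows(load, W=96, L=336)
```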

4 FEDEL Module

The general structure of FEDEL is shown in Fig. 2. All stacks have the same structure. Each stack has three output branches: the trend part Trend_i (Eq. 1), the seasonal part Season_i (Eq. 2), and the input to the next stack X_i (Eq. 3):

Trend_i = SeriesDecomp(X_{i-1}),   (1)

Season_i = FrequencyEnhanced(X_{i-1} - Trend_i),   (2)

X_i = FrequencyEnhanced(X_{i-1} - Trend_i) + (X_{i-1} - Trend_i).   (3)

For the 0th stack, only T rend0 and Season0 are kept for subsequent processing, and X1 is discarded. For the following stacks, only the Xi part is kept as input to stacki+1 , T rendi , and Seasoni are discarded. This section continues with the following: (i) Time Feature Embedding Module (TFEM) embeds local positional information and global temporal features and provides a uniform representation. (ii) Frequency Enhanced Module (FEM) that converts time series to the frequency domain and performs period enhancement using the Fourier transform. (iii) Dependency Capture Module (DCM) captures both short-term local dependencies and long-term global dependencies of the time series.
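The following is a minimal PyTorch-style sketch of how one such stack could be wired from Eqs. (1)–(3). The sub-module names `series_decomp` (SDM) and `freq_enhanced` (FEM) are placeholders for the components described in the next subsections; this is an illustrative sketch, not the authors' implementation.

```python
import torch.nn as nn

class FEDELStack(nn.Module):
    """One FEDEL stack: split the input into trend, seasonal, and next-stack parts (Eqs. 1-3)."""
    def __init__(self, series_decomp: nn.Module, freq_enhanced: nn.Module):
        super().__init__()
        self.series_decomp = series_decomp   # SDM, e.g. a moving-average decomposition
        self.freq_enhanced = freq_enhanced   # FEM, the frequency enhanced module

    def forward(self, x_prev):
        trend = self.series_decomp(x_prev)          # Eq. (1)
        residual = x_prev - trend
        season = self.freq_enhanced(residual)       # Eq. (2)
        x_next = season + residual                  # Eq. (3), residual connection to the next stack
        return trend, season, x_next
```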


Fig. 2. FEDEL model architecture. The central part of FEDEL consists of n stacks. Stacks are stacked based on the residual network. Each stack contains a Series Decomp Module (SDM) and a Frequency Enhanced Module (FEM). The output of the layer-0 stack consists of two parts: the trend part and the seasonal part. The trend part (Trend 0) is processed by the Dependency Capture Module (DCM). The seasonal part (Season 0) goes through the Time Feature Embedding Module (TFEM) and is then processed by the following stacks. The processed trend and seasonal parts are passed through Multi-Layer Perceptrons (MLP) separately and summed up as the output.

4.1

Time Feature Embedding Module

The CNN and RNN models deal with the trend part and can capture temporal dependencies through the convolutional or recurrent structure without relying on embedded timestamps. For the processing of the seasonal part, the model itself cannot adequately capture temporal dependencies, so we artificially embed the timestamps. In the LTSF problem, capturing long-term dependencies requires global information such as hierarchical time stamps (week, month, and year) and agnostic time stamps (holidays, events). We use a uniform representation to mitigate the issue. Figure 3 gives an intuitive overview. Suppose we have the t-th time series X^t and p types of time stamps. The feature dimension after input representation is d_model. We first preserve the local context by using a fixed position embedding:

$$\mathrm{PE}_{(pos,\,2j)} = \sin\!\left(pos \,/\, (2L_x)^{2j/d_{model}}\right) \quad (4)$$

$$\mathrm{PE}_{(pos,\,2j+1)} = \cos\!\left(pos \,/\, (2L_x)^{2j/d_{model}}\right) \quad (5)$$

where $j \in \{1, 2, \cdots, d_{model}/2\}$. Each global timestamp is represented by a learnable embedding $SE_{(pos)}$. Each $SE_{(pos)}$ has a fixed vocabulary size, up to 60 when minutes are used as the finest granularity. To align the dimensions, we project the scalar context $x_i^t$ into a $d_{model}$-dimensional vector $u_i^t$ with 1-D convolutional filters (kernel width = 3, stride = 1).
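A minimal sketch of such an embedding module, combining the scalar projection, the fixed sinusoidal position embedding of Eqs. (4)–(5), and learnable global-stamp embeddings. The stamp vocabulary sizes and module names are assumptions for illustration only, not the authors' code; d_model is assumed even.

```python
import torch
import torch.nn as nn

class TimeFeatureEmbedding(nn.Module):
    """Sketch of TFEM: scalar projection + fixed positional + learnable global-stamp embeddings."""
    def __init__(self, c_in, d_model, max_len, stamp_sizes=(60, 24, 7, 32, 13)):
        super().__init__()
        # 1-D conv projection of the scalar inputs (kernel width 3, stride 1)
        self.value_proj = nn.Conv1d(c_in, d_model, kernel_size=3, padding=1)
        # Fixed sinusoidal position embedding, Eqs. (4)-(5)
        pe = torch.zeros(max_len, d_model)
        pos = torch.arange(0, max_len).unsqueeze(1).float()
        div = torch.pow(float(2 * max_len), torch.arange(0, d_model, 2).float() / d_model)
        pe[:, 0::2] = torch.sin(pos / div)
        pe[:, 1::2] = torch.cos(pos / div)
        self.register_buffer("pe", pe)
        # One learnable embedding table per global stamp type (minute, hour, weekday, day, month, ...)
        self.stamp_emb = nn.ModuleList(nn.Embedding(s, d_model) for s in stamp_sizes)

    def forward(self, x, x_mark):
        # x: (batch, length, c_in) scalar series; x_mark: (batch, length, n_stamps) integer time stamps
        u = self.value_proj(x.transpose(1, 2)).transpose(1, 2)
        out = u + self.pe[: x.size(1)]
        for k, emb in enumerate(self.stamp_emb):
            out = out + emb(x_mark[..., k])
        return out
```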

Fig. 3. Time Feature Embedding Module. The inputs’ embedding consists of three separate parts: a scalar projection, the local timestamp (position), and global time stamp embeddings (minute, hour, week, month, holiday, etc.).

4.2

Frequency Enhanced Module

We use the Discrete Fourier Transform (DFT) to transform the time series from the time domain to the frequency domain, and the result is then converted back to the time domain by the Inverse Discrete Fourier Transform (IDFT). Assuming that the length of the series in the time domain is N, the relationship between the samples in the time and frequency domains is shown in Eq. 6:

$$X_k = \sum_{n=0}^{N-1} x_n e^{-j2\pi kn/N}, \quad k = 0, 1, 2, \ldots, N-1 \quad (6)$$

where $X_k$ is the frequency domain representation of $x_n$, e is the natural constant, and j is the imaginary unit. Correspondingly, the IDFT is defined as in Eq. 7:

$$x_n = \sum_{k=0}^{N-1} X_k e^{j2\pi kn/N}, \quad n = 0, 1, 2, \ldots, N-1 \quad (7)$$

where $x_n$ is the time domain representation of $X_k$. The complexity of the DFT is $O(N^2)$. With the Fast Fourier Transform (FFT), the computation complexity can be reduced to $O(N \log N)$.
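As a quick sanity check of the forward/inverse transform pair, the snippet below uses NumPy's FFT (which runs in O(N log N)) and verifies that the time series is recovered after the round trip. The series length 96 merely mirrors the input length used later in the experiments.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=96)              # a length-N series in the time domain
X = np.fft.fft(x)                    # frequency-domain samples X_k
x_rec = np.fft.ifft(X).real          # back to the time domain
assert np.allclose(x, x_rec)         # reconstruction is exact up to floating-point error
```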


The model is prone to overfitting if all frequency components are simply retained, as many high-frequency components are unpredictable noise. And the model tends to underfit if only the low-frequency part is retained, as some sharp changes in trends indicate the occurrence of important events. Therefore, we choose to uniformly sample the frequency components to take both low- and high-frequency components into account and reduce the computation complexity. The input of the FEM is $x_{in} \in T^{N \times D}$, and the Fourier transform of $x_{in}$ is denoted as $Q \in F^{N \times D}$. In the frequency domain, only M uniformly sampled frequency components are retained, denoted as $\tilde{Q} \in F^{M \times D}$, as shown in Eq. 8:

$$\tilde{Q} = \mathrm{Select}(Q) = \mathrm{Select}(\mathrm{FFT}(x_{in})) \quad (8)$$

$R \in F^{M \times D \times D}$ is a learnable parameter with random initialization, so that certain frequency components are given more weight during the training process. As shown in Eq. 9:

$$\tilde{Y} = \tilde{Q} \odot R \quad (9)$$

The operation $\odot$ is defined as in Eq. 10:

$$\tilde{Y}_{m,d_o} = \sum_{d_i=0}^{D} \tilde{Q}_{m,d_i} \cdot R_{d_i,d_o,m} \quad (10)$$

where $d_i = 1, 2, \ldots, D$ is the input channel and $d_o = 1, 2, \ldots, D$ is the output channel. To keep the series dimension unchanged, $\tilde{Y}$ is then zero-padded to $F^{N \times D}$. The final output of the FEM is given by Eq. 11 (Fig. 4):

$$X_{out} = \mathrm{IFFT}(\mathrm{Padding}(\tilde{Q} \odot R)) \quad (11)$$

Fig. 4. Frequency Enhanced Module
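A minimal sketch of how Eqs. (8)–(11) could be realized with torch.fft. The uniform frequency sampling, complex-weight initialization, and tensor layout are assumptions for illustration; the paper's Select operation and implementation details may differ.

```python
import torch
import torch.nn as nn

class FrequencyEnhancedModule(nn.Module):
    """Sketch of the FEM: FFT, keep M uniformly sampled frequencies,
    reweight them with a learnable tensor R, zero-pad, and invert."""
    def __init__(self, seq_len, d_model, n_modes):
        super().__init__()
        n_freq = seq_len // 2 + 1                                   # rfft output length
        self.register_buffer("index", torch.linspace(0, n_freq - 1, n_modes).long())  # Select
        # Learnable complex weights R, stored here as (M, D_in, D_out), randomly initialized
        self.R = nn.Parameter(0.02 * torch.randn(n_modes, d_model, d_model, dtype=torch.cfloat))

    def forward(self, x):                                           # x: (batch, seq_len, d_model)
        Q = torch.fft.rfft(x, dim=1)                                # Eq. (8): FFT along time
        Q_sel = Q[:, self.index, :]                                 # keep the M sampled components
        Y_sel = torch.einsum("bmi,mio->bmo", Q_sel, self.R)         # Eqs. (9)-(10)
        Y = torch.zeros_like(Q)                                     # zero-padding back to full length
        Y[:, self.index, :] = Y_sel
        return torch.fft.irfft(Y, n=x.size(1), dim=1)               # Eq. (11): back to time domain
```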

4.3

Dependency Capture Module

CNNs have strong local feature extraction ability because of their convolution kernels. However, convolution kernels also limit the ability of CNNs to handle long-term dependencies in time series. When applied to LTSF, the gradient of an RNN cannot always be kept in a reasonable range and will inevitably suffer from gradient vanishing or gradient explosion. The forget gate in LSTM removes useless information to mitigate the problems above. As shown in Fig. 5, we extract deep local dependencies from the original series with convolutional and pooling layers to obtain an effective representation. The representation is then fed into the LSTM to capture the long-term dependencies of the series. The DCM combines the advantages of CNN and LSTM, reducing the computational cost while improving the model's prediction accuracy.
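A minimal sketch of such a CNN-then-LSTM module; the layer sizes and names are illustrative assumptions rather than the paper's configuration.

```python
import torch.nn as nn

class DependencyCaptureModule(nn.Module):
    """Sketch of the DCM: conv + pooling for local patterns, then an LSTM for long-term dependencies."""
    def __init__(self, c_in, hidden, kernel_size=3):
        super().__init__()
        self.local = nn.Sequential(
            nn.Conv1d(c_in, hidden, kernel_size, padding=kernel_size // 2),
            nn.ReLU(),
            nn.MaxPool1d(kernel_size=2),      # pooling shortens the sequence, reducing the LSTM cost
        )
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)

    def forward(self, x):                     # x: (batch, length, c_in), e.g. the trend part
        h = self.local(x.transpose(1, 2)).transpose(1, 2)
        out, _ = self.lstm(h)
        return out
```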

Fig. 5. Dependency Capture Module

5

Experiment

We extensively evaluate the proposed FEDEL on eight real-world datasets, covering four mainstream time series forecasting applications: energy, traffic, economics, and weather. The datasets are introduced as follows. ETT (Electricity Transformer Temperature) [18] contains data collected from electricity transformers, including load, oil temperature and other characteristics from July 2016 to July 2018. ETTm1 and ETTm2 are recorded every 15 min, ETTh1 and ETTh2 every hour.


Traffic1 is a collection of hourly data from the California Department of Transportation, which describes the road occupancy rates measured by different sensors on San Francisco Bay area freeways. Electricity2 contains the hourly electricity consumption of 321 customers from 2012 to 2014. Exchange records the daily exchange rates of eight different countries ranging from 1990 to 2016 [9]. Weather3 is recorded every 10 min for the whole year of 2020 and contains 21 meteorological indicators, such as air temperature, humidity, etc. The overview of the datasets is shown in Table 1.

Table 1. Overview of dataset information

Datasets      ETTh(1, 2)   ETTm(1, 2)   Traffic   Electricity   Exchange   Weather
Variates      7            7            862       321           8          21
Timesteps     17,420       69,680       17,544    26,304        7,588      52,696
Granularity   1 h          15 min       1 h       1 h           1 day      10 min

We split all datasets into training, validation and test sets in chronological order, with a ratio of 6:2:2 for the ETT datasets and 7:1:2 for the other datasets.

5.1

Implementation Details

Our method is trained with the L2 loss, using the ADAM optimizer with an initial learning rate of $10^{-4}$. The batch size is set to 32. The training process is early-stopped within ten epochs. All experiments are repeated three times, implemented in PyTorch, and conducted on a single NVIDIA Quadro RTX 8000 (48 GB) GPU. The number of stack layers in FEDEL is set to 3.

5.2

Evaluation Metrics

We use $\hat{y}$ to denote the predicted value of $y$, and use Mean Square Error ($\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}(y_i - \hat{y}_i)^2$) and Mean Absolute Error ($\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}|y_i - \hat{y}_i|$) to evaluate model performance.
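A tiny NumPy illustration of the two metrics (the example values are arbitrary):

```python
import numpy as np

def mse(y, y_hat):
    return np.mean((np.asarray(y) - np.asarray(y_hat)) ** 2)

def mae(y, y_hat):
    return np.mean(np.abs(np.asarray(y) - np.asarray(y_hat)))

# Example: y = [1.0, 2.0, 3.0], y_hat = [1.1, 1.9, 3.3]
# gives MSE ~= 0.0367 and MAE ~= 0.1667
print(mse([1.0, 2.0, 3.0], [1.1, 1.9, 3.3]), mae([1.0, 2.0, 3.0], [1.1, 1.9, 3.3]))
```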

5.3

Main Results

To compare performances under different future horizons, we fix the input length I to 96 and define prediction lengths O ∈ {96, 192, 336, 720}. This setting precisely meets the definition of LTSF.

1 http://pems.dot.ca.gov
2 https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014
3 https://www.bgc-jena.mpg.de/wetter/


FEDEL is compared with FEDformer, Autoformer, Informer, Pyraformer, LogTrans, LSTM, and TCN. Here are the results in both the multivariate and univariate settings. Optimal results are highlighted in bold, and suboptimal results are highlighted by underlining. Table 2. Multivariate LTSF results on eight datasets.

For each dataset, every row gives one model's MSE and MAE at prediction lengths 96 / 192 / 336 / 720.

Electricity
  FEDEL       MSE 0.179 0.195 0.213 0.245 | MAE 0.294 0.308 0.322 0.344
  FEDformer   MSE 0.193 0.201 0.214 0.246 | MAE 0.308 0.315 0.329 0.355
  Autoformer  MSE 0.201 0.222 0.231 0.254 | MAE 0.317 0.334 0.338 0.361
  Informer    MSE 0.274 0.296 0.300 0.373 | MAE 0.368 0.386 0.394 0.439
  Pyraformer  MSE 0.386 0.386 0.378 0.376 | MAE 0.449 0.443 0.443 0.445
  LogTrans    MSE 0.258 0.266 0.280 0.283 | MAE 0.357 0.368 0.380 0.376
  LSTM        MSE 0.375 0.442 0.439 0.980 | MAE 0.437 0.473 0.473 0.814
  TCN         MSE 0.985 0.996 1.000 1.438 | MAE 0.813 0.821 0.824 0.784

Exchange
  FEDEL       MSE 0.122 0.211 0.341 0.847 | MAE 0.256 0.349 0.453 0.713
  FEDformer   MSE 0.148 0.271 0.460 1.195 | MAE 0.278 0.380 0.500 0.841
  Autoformer  MSE 0.197 0.300 0.509 1.447 | MAE 0.323 0.369 0.524 0.941
  Informer    MSE 0.847 1.204 1.672 2.478 | MAE 0.752 0.895 1.036 1.310
  Pyraformer  MSE 0.376 1.748 1.874 1.943 | MAE 1.105 1.151 1.172 1.206
  LogTrans    MSE 0.968 1.040 1.659 1.941 | MAE 0.812 0.851 1.081 1.127
  LSTM        MSE 1.453 1.846 2.136 2.984 | MAE 1.049 1.179 1.231 1.427
  TCN         MSE 3.004 3.048 3.113 3.150 | MAE 1.432 1.444 1.459 1.458

Traffic
  FEDEL       MSE 0.568 0.599 0.601 0.607 | MAE 0.360 0.355 0.326 0.358
  FEDformer   MSE 0.587 0.604 0.621 0.626 | MAE 0.366 0.373 0.383 0.382
  Autoformer  MSE 0.613 0.616 0.622 0.660 | MAE 0.388 0.382 0.337 0.408
  Informer    MSE 0.719 0.696 0.777 0.864 | MAE 0.391 0.379 0.420 0.472
  Pyraformer  MSE 2.085 0.867 0.869 0.881 | MAE 0.468 0.467 0.469 0.473
  LogTrans    MSE 0.684 0.685 0.734 0.717 | MAE 0.384 0.390 0.408 0.396
  LSTM        MSE 0.843 0.847 0.853 1.500 | MAE 0.453 0.453 0.455 0.805
  TCN         MSE 1.438 1.463 1.479 1.499 | MAE 0.784 0.794 0.799 0.804

Weather
  FEDEL       MSE 0.266 0.408 0.377 0.576 | MAE 0.366 0.466 0.418 0.536
  FEDformer   MSE 0.217 0.276 0.339 0.403 | MAE 0.296 0.336 0.380 0.428
  Autoformer  MSE 0.266 0.307 0.359 0.419 | MAE 0.336 0.367 0.395 0.428
  Informer    MSE 0.300 0.598 0.578 1.059 | MAE 0.384 0.544 0.523 0.741
  Pyraformer  MSE 0.896 0.622 0.739 1.004 | MAE 0.556 0.624 0.753 0.934
  LogTrans    MSE 0.458 0.658 0.797 0.869 | MAE 0.490 0.589 0.652 0.675
  LSTM        MSE 0.369 0.416 0.455 0.535 | MAE 0.406 0.435 0.454 0.520
  TCN         MSE 0.615 0.629 0.639 0.639 | MAE 0.589 0.600 0.608 0.610

ETTh1
  FEDEL       MSE 0.363 0.409 0.458 0.651 | MAE 0.410 0.431 0.461 0.603
  FEDformer   MSE 0.376 0.420 0.459 0.506 | MAE 0.419 0.448 0.465 0.507
  Autoformer  MSE 0.449 0.500 0.521 0.514 | MAE 0.459 0.482 0.496 0.512
  Informer    MSE 0.865 1.008 1.107 1.181 | MAE 0.713 0.792 0.809 0.865
  Pyraformer  MSE 0.664 0.790 0.891 0.963 | MAE 0.612 0.681 0.738 0.782
  LogTrans    MSE 0.878 1.037 1.238 1.135 | MAE 0.740 0.824 0.932 0.852
  LSTM        MSE 2.341 2.410 2.768 2.820 | MAE 1.175 1.253 1.374 1.399
  TCN         MSE 3.241 2.274 3.287 3.313 | MAE 1.536 1.547 1.584 1.591

ETTh2
  FEDEL       MSE 0.321 0.411 0.434 0.401 | MAE 0.380 0.424 0.475 0.441
  FEDformer   MSE 0.346 0.429 0.496 0.463 | MAE 0.388 0.439 0.487 0.474
  Autoformer  MSE 0.358 0.456 0.482 0.515 | MAE 0.397 0.452 0.486 0.511
  Informer    MSE 3.755 5.602 4.721 3.647 | MAE 1.525 1.931 1.835 1.625
  Pyraformer  MSE 0.645 0.788 0.907 0.963 | MAE 0.597 0.683 0.747 0.783
  LogTrans    MSE 2.116 4.315 1.124 3.188 | MAE 1.197 1.635 1.604 1.540
  LSTM        MSE 2.430 2.479 2.696 2.817 | MAE 1.164 1.231 1.367 1.402
  TCN         MSE 3.201 3.273 3.292 3.302 | MAE 1.511 1.539 1.554 1.588

ETTm1
  FEDEL       MSE 0.364 0.410 0.438 0.538 | MAE 0.406 0.431 0.453 0.479
  FEDformer   MSE 0.379 0.426 0.445 0.543 | MAE 0.419 0.441 0.459 0.490
  Autoformer  MSE 0.505 0.553 0.621 0.671 | MAE 0.475 0.496 0.537 0.561
  Informer    MSE 0.672 0.795 1.212 1.166 | MAE 0.571 0.669 0.871 0.823
  Pyraformer  MSE 0.543 0.557 0.754 0.908 | MAE 0.510 0.537 0.655 0.724
  LogTrans    MSE 0.600 0.837 1.124 1.153 | MAE 0.546 0.700 0.832 0.820
  LSTM        MSE 1.892 2.031 2.273 2.533 | MAE 1.042 1.113 1.196 1.208
  TCN         MSE 2.898 2.989 3.073 3.114 | MAE 1.260 1.312 1.312 1.341

ETTm2
  FEDEL       MSE 0.184 0.244 0.319 0.433 | MAE 0.280 0.322 0.357 0.500
  FEDformer   MSE 0.203 0.269 0.325 0.421 | MAE 0.287 0.328 0.366 0.415
  Autoformer  MSE 0.255 0.281 0.339 0.433 | MAE 0.339 0.340 0.372 0.432
  Informer    MSE 0.365 0.533 1.363 3.379 | MAE 0.453 0.563 0.887 1.338
  Pyraformer  MSE 0.435 0.730 1.201 3.625 | MAE 0.507 0.673 0.845 1.451
  LogTrans    MSE 0.768 0.989 1.334 3.048 | MAE 0.642 0.757 0.872 1.328
  LSTM        MSE 1.997 2.111 2.365 2.453 | MAE 1.075 1.107 1.186 1.214
  TCN         MSE 2.872 2.995 3.082 3.109 | MAE 1.167 1.307 1.341 1.353

Count: FEDEL 52, FEDformer 12, Autoformer 0, Informer 0, Pyraformer 0, LogTrans 0, LSTM 0, TCN 0.

5.4

Results Analysis

Tables 2 and 3 summarize the performance of our model, the SOTA models in the field of LTSF, and the traditional models on different datasets.

Table 3. Univariate LTSF results on four industrial datasets. For each dataset, every row gives one model's MSE and MAE at prediction lengths 96 / 192 / 336 / 720.

ETTh1
  FEDEL       MSE 0.065 0.102 0.104 0.140 | MAE 0.200 0.238 0.243 0.288
  FEDformer   MSE 0.079 0.104 0.119 0.142 | MAE 0.215 0.245 0.270 0.299
  Autoformer  MSE 0.071 0.114 0.107 0.126 | MAE 0.206 0.262 0.258 0.283
  Informer    MSE 0.193 0.217 0.202 0.183 | MAE 0.377 0.395 0.381 0.355

ETTh2
  FEDEL       MSE 0.127 0.180 0.223 0.216 | MAE 0.265 0.327 0.370 0.387
  FEDformer   MSE 0.128 0.185 0.231 0.278 | MAE 0.271 0.330 0.378 0.420
  Autoformer  MSE 0.153 0.204 0.246 0.268 | MAE 0.306 0.351 0.389 0.409
  Informer    MSE 0.213 0.227 0.242 0.291 | MAE 0.373 0.387 0.401 0.439

ETTm1
  FEDEL       MSE 0.030 0.156 0.074 0.095 | MAE 0.139 0.176 0.213 0.243
  FEDformer   MSE 0.033 0.158 0.084 0.102 | MAE 0.140 0.186 0.231 0.250
  Autoformer  MSE 0.056 0.081 0.076 0.110 | MAE 0.183 0.216 0.218 0.267
  Informer    MSE 0.109 0.151 0.427 0.438 | MAE 0.277 0.310 0.591 0.586

ETTm2
  FEDEL       MSE 0.065 0.100 0.127 0.181 | MAE 0.174 0.241 0.287 0.360
  FEDformer   MSE 0.067 0.102 0.130 0.178 | MAE 0.198 0.245 0.279 0.325
  Autoformer  MSE 0.065 0.118 0.154 0.182 | MAE 0.189 0.256 0.305 0.335
  Informer    MSE 0.088 0.132 0.180 0.300 | MAE 0.225 0.283 0.336 0.435

Count: FEDEL 28, FEDformer 3, Autoformer 1, Informer 0.

From Table 2, we can observe the following: (i) FEDEL achieved optimal results in 52 out of 64 experiments (8 datasets, 2 evaluation metrics per dataset, 4 different prediction lengths), far exceeding the 12 of the current SOTA model FEDformer. (ii) Compared with the traditional LSTM and TCN models, FEDEL improves significantly. For example, on the Electricity dataset with input length 96 and prediction length 96, FEDEL reduces MSE by 52.3% and MAE by 32.7% compared with LSTM, and reduces MSE by 81.8% and MAE by 63.8% compared with TCN. LSTM and TCN are typical representatives of RNN models and CNN models, respectively. The performance of FEDEL demonstrates the effectiveness of combining CNN with LSTM and corroborates the effectiveness of the DMS prediction strategy. (iii) FEDEL achieves the best results for different prediction lengths on the Electricity, Exchange, Traffic, ETTh2, and ETTm1 datasets, but its performance on the Weather dataset could be better. We partially visualize the Electricity and Weather datasets in Fig. 6. It can be seen that the Electricity dataset has apparent periodicity, while the Weather dataset has weak periodicity. This indicates that FEDEL can effectively capture the periodicity of time series in the frequency domain and has a better prediction effect for series with strong periodicity, but performs worse on series without clear periodicity. This suggests that it is effective to transfer the time series from the time domain to the frequency domain in order to capture its seasonal characteristics.

Fig. 6. Electricity and Weather data visualization. (a) Weather. (b) Electricity.

For univariate forecasting, we exclude traditional models that underperform in LTSF. FEDEL achieved optimal results in 28 and suboptimal results in 3 out of 32 experiments (4 datasets, 2 evaluation metrics per dataset, 4 different prediction lengths). We visualize the performance of four different models on four different datasets for the input length 96 and prediction length 96 conditions in Fig. 7.

Fig. 7. Performance of different models on the ETT datasets: (a) ETTh1, (b) ETTh2, (c) ETTm1, (d) ETTm2.


As can be seen, the prediction of FEDEL is the closest to the real results compared with the three most advanced models in the field of LTSF. FEDEL and FEDformer have a better ability to capture complex cycles than Autoformer and Informer, confirming the benefit of decoupling complex cycles in the frequency domain. FEDformer has too many parameters and ignores the processing of the trend part, so its prediction accuracy is lower than that of FEDEL.

6

Conclusion

In this paper, we investigate the problem of LTSF and propose a Frequency Enhanced Decomposition and Expansion Learning model called FEDEL. Specifically, we design a deep extension architecture to capture complex correlation dependencies in time series, use a Frequency Enhanced Module to decouple complex cycles of series, and apply direct multi-step prediction to avoid error accumulation. Extensive experiments on real datasets have demonstrated the superiority of FEDEL in the field of LTSF. However, FEDEL’s ability to handle the trend part of time series can be further enhanced, and this is an area that deserves more future investigation.

References
1. Ariyo, A.A., Adewumi, A.O., Ayo, C.K.: Stock price prediction using the ARIMA model. In: 2014 UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, pp. 106–112. IEEE (2014)
2. Borovykh, A., Bohte, S., Oosterlee, C.W.: Conditional time series forecasting with convolutional neural networks. stat 1050, 17 (2018)
3. Box, G.E., Jenkins, G.M., MacGregor, J.F.: Some recent advances in forecasting and control. J. Roy. Stat. Soc.: Ser. C (Appl. Stat.) 23(2), 158–179 (1974)
4. Cleveland, R.B., Cleveland, W.S., McRae, J.E., Terpenning, I.: STL: a seasonal-trend decomposition. J. Off. Stat. 6(1), 3–73 (1990)
5. Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)
6. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
7. Holt, C.C.: Forecasting trends and seasonals by exponentially weighted moving averages. ONR Memorandum 52(52), 5–10 (1957)
8. Kitaev, N., Kaiser, L., Levskaya, A.: Reformer: the efficient transformer. In: International Conference on Learning Representations (2020)
9. Lai, G., Chang, W.C., Yang, Y., Liu, H.: Modeling long- and short-term temporal patterns with deep neural networks. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 95–104 (2018)
10. Li, S., et al.: Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
11. Van den Oord, A., et al.: WaveNet: a generative model for raw audio. In: 9th ISCA Speech Synthesis Workshop, pp. 125–125 (2016)
12. Oreshkin, B.N., Carpov, D., Chapados, N., Bengio, Y.: N-BEATS: neural basis expansion analysis for interpretable time series forecasting. In: International Conference on Learning Representations (2020)
13. Salinas, D., Flunkert, V., Gasthaus, J., Januschowski, T.: DeepAR: probabilistic forecasting with autoregressive recurrent networks. Int. J. Forecast. 36(3), 1181–1191 (2020)
14. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
15. Vecchia, A.: Periodic autoregressive-moving average (PARMA) modeling with applications to water resources. JAWRA J. Am. Water Resour. Assoc. 21(5), 721–730 (1985)
16. Whittle, P.: Hypothesis Testing in Time Series Analysis, vol. 4. Almqvist & Wiksells boktr. (1951)
17. Wu, H., Xu, J., Wang, J., Long, M.: Autoformer: decomposition transformers with auto-correlation for long-term series forecasting. Adv. Neural Inf. Process. Syst. 34, 22419–22430 (2021)
18. Zhou, H., et al.: Informer: beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 11106–11115 (2021)
19. Zhou, T., Ma, Z., Wen, Q., Wang, X., Sun, L., Jin, R.: FEDformer: frequency enhanced decomposed transformer for long-term series forecasting. In: International Conference on Machine Learning, pp. 27268–27286. PMLR (2022)

On the Use of Persistent Homology to Control the Generalization Capacity of a Neural Network

Abir Barbara1,2(B), Younès Bennani1, and Joseph Karkazan2

1 Université Sorbonne Paris Nord, LIPN, CNRS UMR 7030, La Maison des Sciences Numériques, Villetaneuse, France
{Abir.Barbara,Younes.Bennani}@sorbonne-paris-nord.fr
2 Deep Knowledge - APHP-Hôpital Avicenne, Bobigny, France
{Abir.Barbara,Joseph.Karkazan}@deepknowledge.fr

Abstract. Analyzing neural network (NN) generalization is vital for ensuring effective performance on new, unseen data, beyond the training set. Traditional methods involve evaluating NN across multiple testing datasets, a resource-intensive process involving data acquisition, preprocessing, and labeling. The primary challenge is determining the optimal capacity for training observations, requiring adaptable adjustments based on the task and available data information. This paper leverages Algebraic Topology and relevance measures to investigate NN behavior during learning. We define NN on a topological space as a functional topology graph and compute topological summaries to estimate generalization gaps. Simultaneously, we assess the relevance of NN units, progressively pruning network units. The generalization gap estimation helps identify overfitting, enabling timely early-stopping decisions and identifying the architecture with optimal generalization. This approach offers a comprehensive insight into NN generalization and supports the exploration of NN extensibility and interpretability.

Keywords: Topological Data Analysis · Machine Learning · Deep Neural Networks · Generalization capacity

1

Introduction

Algebraic topology is a branch of mathematics devoted to the study of properties of topological spaces. It has been employed in the data analysis field, specifically through the application of Topological Data Analysis (TDA) for data representation [7]. In recent years, it has been utilized as an analytical tool to investigate the generalization capability of deep learning models, by considering an entire model as a graph and analyzing it using TDA in order to determine its ability to perform well on previously unseen data. A prominent method for incorporating algebraic topology in the realm of deep learning is through the use of persistent homology [2], which allows the examination of topological features of a space by analyzing how they change as the space is filtered at various scales. This approach can be utilized to analyze the structure of a deep learning model and to gain a deeper understanding of the relationship between this structure and the model's generalization performance. Furthermore, the utilization of algebraic topology in the field of deep learning is an active area of research, with many ongoing studies and developments aimed at further harnessing the power of these mathematical tools to improve the performance and interpretability of deep learning models.

Neural Networks, including deep ones, address complex problems without needing specific algorithms or explicit problem-solving instructions. They excel at identifying data patterns, correlations, and relationships, with their output providing solutions, a revolutionary departure from precise programming. However, a challenge arises from the "Black Box" nature of these models, where their inner workings remain largely opaque. This lack of interpretability and explainability can lead to issues such as understanding the learning process, potential under-fitting, over-fitting, or achieving good generalization.

In this paper, our approach relies on the topological characteristics of the neural network, examining node persistence [12]. Our method provides a novel avenue to estimate the generalization gap and evaluate the network's performance without relying on many testing or validation sets. In this regard, we were inspired by previous research to determine the relationship between the topological summaries of the neural network's structure and its generalization gap. However, instead of representing the neurons by their activations, which are related to the training examples, we have employed saliency measurements [12] to more accurately define the contribution of each node in the neural network and to reduce the network's size by eliminating irrelevant nodes. Moreover, by definition, saliency measurements, in contrast to activations, take into consideration the architecture of the consecutive layers of the neural network. Additionally, to define the distance between every two nodes, instead of relying on a computationally expensive and complex distance [4] or a non-distance measure [1], we utilize the Euclidean distance, which is not only a distance metric in line with TDA theory but also easier to calculate, and hence computationally inexpensive. Our approach, built upon existing research, has obtained state-of-the-art performance when predicting the generalization gap and has the advantage of being less computationally expensive and not requiring a thorough understanding of the training process or dataset at hand.

The rest of this paper will be divided into the following sections: Sect. 2 will provide a theoretical overview of generalization, Sect. 3 will present the state of the art, Sect. 4 will delve into the fundamental background of topological data analysis and relevance measurements, along with our proposed approach. Section 5 will present the experiments and results, followed by a final conclusion and perspectives.

2

Motivation

A crucial problem when learning a NN is overfitting; one possibility to remedy this is to use regularization methods. These methods generally consist of adding constraints on the search space of the solution Fw. The majority of these methods can be grouped into two main groups: methods based on implicit regularization, and methods based on explicit regularization. The first group mainly comprises approaches based on the use of prior knowledge of the problem. The second group consists of techniques which introduce penalization terms into the cost function. In this paper we propose an approach belonging mainly to the first group. It describes a method which can help to determine an optimal NN architecture, based on an analysis of the behavior of the functional graph associated with the NN. This graph encodes the functional behavior of the NN. The key idea of our approach is to derive a set of topological summaries measuring the topological behavior of the NN. Topological features of the network have been demonstrated to be efficient in explaining and interpreting NNs [1]; therefore, in this research study, we propose an approach based on topological tools that enable us to analyze the generalization gap behavior of a NN during learning. Figure 1 illustrates the processing pipeline of the proposed approach.

Fig. 1. Pipeline of our approach using TDA and HVS measure

We mainly present in this paper the following contributions:
• An approach for extracting topological information from a NN based on persistent homology.
• Demonstrating that not the entire network but just the persistent nodes accurately define the topology of the NN and correlate with the generalization gap.
• An improvement in the time complexity of calculating the topological properties of a NN, using a relevance measure that allows progressive NN pruning.
• An enhancement of the correlation between a NN's structure and its generalization capacity as a result of an improved feature extraction method and node selection.

3

Related Works

Analyzing the generalization gap is a critical task in machine learning that determines the performance of a model on an unseen dataset. However, the critical questions we seek to answer in this paper are whether there is a relationship between the neural network structure and its generalization capacity, and whether it is possible to predict the generalization gap without relying on a testing set. This has the potential to save significant resources and time when developing and optimizing machine learning models. If we can study the generalization capacity of a neural network just by analyzing its topology, or if we can accurately predict the generalization gap early on, we can focus our efforts on improving the model in specific areas where it fails, rather than testing it with different data sets multiple times. The paper [8] describes a competition conducting a performance evaluation of several complexity measures in predicting deep learning models' generalization capacity. The best methods used by participants do not take into account the complete architecture of the network, which may be important for accurately predicting the generalization gap; instead, they rely on internal partial representations [9]. There are numerous other methods available to estimate the generalization gap, each with its strengths and limitations, such as the generation of a meta-dataset for testing [5]. However, this method has the disadvantage of taking a significant amount of time and resources, as well as potentially not discovering overfitting. Additionally, it is uncertain whether the model truly learns using this method. Other methods are based on the Vapnik-Chervonenkis dimension (VC_dim) [10], but they have several limitations and drawbacks. First of all, the VC dimension can be challenging to compute for complex models; moreover, it is not always a tight bound, meaning the actual generalization gap may be smaller than the VC_dim suggests. An alternative approach is the use of Topological Data Analysis (TDA) to predict the generalization gap. TDA has been applied to various machine learning tasks [2], such as studying the input of neural networks to improve the model's performance, or analyzing its architecture [1,4,11].

4

Proposed Approach

4.1

Fundamental Background

Persistent homology [3] is a technique of algebraic topology that can be used to extract topological features from high-dimensional data, and it is a powerful tool for understanding the topology of neural networks. It uses a concept called filtration, which involves building a sequence of simplicial complexes from a given input, which in our case is the structure of the network. A simplicial complex is a mathematical structure that is used to represent a collection of simplices, which are geometric shapes formed by a set of points in a space. These simplices can be of any dimension, such as points, lines, triangles, and higher-dimensional shapes. In a simplicial complex, the simplices are connected together in a way that is consistent with the concept of a face, where a face is a subset of the points that form a simplex. This means that any subset of points that forms a face of one of the simplices in the collection is also a simplex within the collection. Each simplicial complex captures different topological features at different scales, and those that persist for the largest number of simplicial complexes are considered to be the "true" topological features. The resulting persistence diagrams can then be used to study the topology of the data set, such as the number and distribution of connected components, loops, and voids. The main advantage of using persistent homology is that it provides a robust and efficient way of analyzing the topology of the network; the topological properties extracted using persistent homology are stable and robust to noise and small perturbations in the network [3].

One of the most widely used algorithms for computing the persistent homology of a data set is the Vietoris-Rips algorithm (VR). This algorithm constructs a sequence of simplicial complexes by connecting points in the data set that are within a certain distance of each other, as shown in Fig. 2. The distance threshold is gradually increased, resulting in a filtration of simplicial complexes that capture the topology of the data set at different scales. Expanding upon this notion, given a finite set of points X, which we refer to as the vertex set, and an r greater than zero, the Vietoris-Rips complex V(X, r) is defined as the collection of simplices σ in X whose diameter is smaller than 2r, i.e., the distance between two points of an element σ is always less than 2r:

$$V(X, r) = \{\sigma \subseteq X \mid \mathrm{diam}(\sigma) < 2r\}$$

The basic idea is to study topological invariants (the number of connected components, cycles, voids and n-dimensional voids), which are topological properties that remain invariant under a continuous transformation of the form of the data (i.e. homeomorphisms). Moreover, the topological invariants encode the topological information of the data in a multi-scale representation and are robust under noise [3]. The k-th Betti number, which is equal to the dimension of $H_k(T)$ (the k-th homology group), $\beta_k = \dim(H_k)$, is also the number of the k-dimensional topological invariants for the space T. This justifies the usage of homology groups. Interpretation:
- $\beta_0(T)$ is the number of connected components
- $\beta_1(T)$ is the number of "holes"
- $\beta_2(T)$ is the number of "cavities" ("2-dimensional holes")
- ...
- $\beta_n(T)$ is the number of "n-dimensional holes"

A space must first be represented as a simplicial complex in order to establish its persistent homology; this process is known as triangulation. The persistence diagram (PD) is an effective tool for representing topological features: a PD is a collection of points, where each point represents a topological feature of the data set. Specifically, each point in the persistence diagram corresponds to a topological feature that appears and disappears at different scales in the data. The location of a point (b, d) in the diagram is determined by its birth and death scales, and those that persist longer are considered to be the "true" topological features.


Fig. 2. Filtrations of simplicial complexes

The x-coordinate b of a point in the persistence diagram corresponds to the scale at which the topological feature appears (referred to as the birth scale), and the y-coordinate d corresponds to the scale at which the feature disappears (referred to as the death scale). To succinctly encapsulate the information presented in the persistence diagram, various numerical or vector-based persistence summaries or descriptors have been proposed, such as persistence entropy and the persistence landscape. However, in this study, the following topological feature will be utilized. The Average Life: the average life of a point or feature in the persistence diagram is the difference between d and b, i.e., d − b. Therefore the average life of a persistence diagram is:

$$\mathrm{avg} = \frac{1}{C}\sum_{i=1}^{C}(d^i - b^i)$$

where C is the number of topological properties in the persistence diagram.
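A small helper one might use to compute this summary from a diagram given as (birth, death) pairs; the example values are arbitrary.

```python
import numpy as np

def average_life(diagram):
    """Average life of a persistence diagram given as an array of (birth, death) pairs."""
    diagram = np.asarray(diagram, dtype=float)
    lives = diagram[:, 1] - diagram[:, 0]    # d^i - b^i for each feature
    return lives.mean()                      # (1/C) * sum_i (d^i - b^i)

# Example: three features born at 0 and dying at 0.2, 0.5, 0.9 -> average life = 0.5333...
print(average_life([(0.0, 0.2), (0.0, 0.5), (0.0, 0.9)]))
```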

4.2

Pruning Based on the Relevance of NN's Units

We use HVS (Heuristic for Variable Selection) [12] as a measure for quantifying the relevance of each unit in a NN, representing the contribution of each unit in comparison to the other units. It is used to identify the most informative or relevant units in the neural network. The architecture of a simple feed-forward NN, denoted as W(I, H, O), is comprised of three distinct layers: an input layer (I), a hidden layer (H), and an output layer (O). The connection weight value $w_{ij}$ between two units i and j serves as an indicator of the significance of their link. This value can be positive or negative depending on the nature of the connection weight, i.e., whether it is an excitatory or an inhibitory synapse. In order to focus solely on the strength of the connection weights, the absolute value $|w_{ij}|$ of the associated weight value is utilized to quantify the strength of each connection weight. In the present context, the partial contribution of a unit j, which is connected to a unit i through a connection weight $w_{ij}$, is defined as:

$$\pi_{i,j} = \frac{|w_{i,j}|}{\sum_{k \in \mathrm{fan\text{-}in}(i)} |w_{i,k}|}$$

where fan-in(i) is the set of nodes that have a connection to node i, so $\pi_{i,j}$ measures the strength of the relation between node j and node i. The relevance measure is the relative contribution of unit j to the output decision of the network and is defined as the weighted sum of its partial contributions, which are calculated by taking into account the relative contributions of the units to which it sends connections. This leads to the following mathematical expression:

$$S_j = \sum_{i \in \mathrm{fan\text{-}out}(j)} \pi_{i,j}\,\delta_i \quad \text{with} \quad \delta_i = \begin{cases} 1 & \text{if unit } i \in O \\ S_i & \text{if unit } i \in H \end{cases}$$
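A minimal NumPy sketch of how this relevance could be computed layer by layer for a feed-forward network, propagating from the output layer backwards. The layer sizes and helper name are illustrative assumptions, not the authors' code.

```python
import numpy as np

def hvs_saliency(weights):
    """HVS-style relevance (sketch). weights[l][i, j] is the weight from unit j in layer l
    to unit i in layer l+1. Returns a list of saliency vectors, one per non-output layer."""
    saliencies = []
    delta_above = np.ones(weights[-1].shape[0])   # delta_i = 1 for output units
    for W in reversed(weights):
        A = np.abs(W)
        pi = A / A.sum(axis=1, keepdims=True)     # pi[i, j] = |w_ij| / sum_k |w_ik| (fan-in of i)
        S = pi.T @ delta_above                    # S_j = sum_{i in fan-out(j)} pi[i, j] * delta_i
        saliencies.insert(0, S)
        delta_above = S                           # hidden-unit deltas are their own saliencies
    return saliencies

# Toy example: a 4-5-3-3 feed-forward network with random weights (sizes chosen arbitrarily)
rng = np.random.default_rng(0)
Ws = [rng.normal(size=(5, 4)), rng.normal(size=(3, 5)), rng.normal(size=(3, 3))]
for layer, S in enumerate(hvs_saliency(Ws)):
    print(f"layer {layer} unit saliencies:", np.round(S, 3))
```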

4.3

Topological Functional Graph and Model Selection

In this study, we investigate the topology of a neural network at each iteration during the learning process, aiming to understand and visualize the network's behavior during training and to relate this behavior to the network's generalization capacity. To accomplish this, we take as the set of vertices in our analysis the collection of the neural network units or nodes, X = {node | node ∈ N}, where N is our neural network. In our analysis, each node is represented by its saliency measurement:

$$S_{node} = \sum_{i \in \mathrm{fan\text{-}out}(node)} \pi_{i,node}\,\delta_i$$

which accurately characterizes the relevance of the node within the neural network. Moreover, these measures are specific to the model and provide an accurate representation of the neural network, while activation values are sensitive to individual examples and iterations, and provide a more comprehensive characterization of the data rather than the model. Subsequently, upon computing the saliency assessment, we exclude the nodes comprising the lowest 10 percent in terms of saliency measurements. This decision is grounded in their lack of persistence and their inability to effectively characterize the model's underlying structure. The metric space (X, d) is employed to define the Vietoris-Rips filtration, with d denoting the Euclidean distance defined on the saliency measurements of each pair of nodes. We investigate β0, which represents the number of connected components; all the other Betti numbers are null because the set of points has no higher-dimensional holes. As a result, instead of dealing with the set of points, one may generate a family of simplicial complexes given a range of i ∈ I values. This family of simplicial complexes is known as a filtration.


A filtration in Algebraic Topology is an increasing sequence (Ki)i∈I of simplicial complexes which are ordered by inclusion. In other words, if K1 and K2 are two complexes where K1 appears before K2 in the filtration, then K1 is a subcomplex of K2. The index set I denotes distance (this is how the Vietoris-Rips complex V is constructed). Through this process, we compute the appearance and disappearance of k-dimensional homology features, which enables us to generate the persistence diagram of a k-dimensional homology group of the network at each iteration. Using the average life of each persistence diagram, we demonstrate experimentally that there is a correlation between those topological summaries and the generalization gap, thus enabling the prediction of the testing error without the need for a testing set.
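A sketch of this per-iteration summary, assuming the ripser package is available (any Vietoris-Rips implementation would do). Each node is represented by its scalar saliency, and H0 persistence is computed on the resulting 1-D point cloud with the Euclidean distance; the saliency values below are arbitrary.

```python
import numpy as np
from ripser import ripser   # assumption: the ripser package is installed

def topological_summary(saliencies):
    points = np.asarray(saliencies, dtype=float).reshape(-1, 1)   # one coordinate per node
    h0 = ripser(points, maxdim=0)["dgms"][0]                      # H0 persistence diagram
    h0 = h0[np.isfinite(h0[:, 1])]                                # drop the infinite bar
    return np.mean(h0[:, 1] - h0[:, 0])                           # average life of the diagram

print(topological_summary([0.05, 0.07, 0.31, 0.33, 0.90]))
```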

5

Experimental Settings and Results

5.1

Datasets

Our method was tested and compared with the state-of-the-art method [1] on several public classification datasets: Dermatology, Credit Default, Ionosphere, Website Phishing, Mice Protein Expression, Car Evaluation, Seeds, Haberman's Survival, Wholesale Customer Region, Iris, and Waveform, from the UCI Machine Learning Repository [6]. In order to effectively evaluate and compare our proposed method with other methods, we constructed small neural networks with the aim of facilitating manipulation and allowing for a sufficient amount of experimentation time with various datasets. However, it is important to note that the proposed method is not limited to the architectures utilized in these initial experiments. To this end, we employed a sequential neural network architecture consisting of an input layer with a number of neurons equal to the number of features in the dataset, followed by two hidden layers, and an output layer with a number of neurons corresponding to the number of classes in the classification task. We trained these networks for 100 epochs in each experiment, i.e., for each dataset. During the training process, we computed the saliency measures at each training step. Then we eliminated the 10 percent of units with the smallest saliency measurements, and from the remaining nodes we generated persistence diagrams to capture persistent homology. We then summarized those diagrams in one topological feature per iteration (the average life). We then produced plots in which the x-axis represents the topological characterization of the network at each epoch and the y-axis represents the generalization gap at each epoch; through these graphical representations, we were able to observe the correlation. Subsequently, we employed linear regression to model these relationships and to extract quantitative insights.
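A hedged sketch of this last regression step: regress the per-epoch generalization gap on the per-epoch average-life summary and report R-squared. The arrays below are synthetic stand-ins, not the paper's data.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

avg_life = np.array([0.12, 0.15, 0.21, 0.26, 0.30, 0.33]).reshape(-1, 1)   # topological summary per epoch
gen_gap = np.array([0.02, 0.03, 0.05, 0.07, 0.08, 0.09])                    # generalization gap per epoch

reg = LinearRegression().fit(avg_life, gen_gap)
print("R squared:", reg.score(avg_life, gen_gap))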

5.2

Results and Discussion

The outcomes of our experiments have led us to understand that we can investigate the explainability and the interpretability of NNs through their topological features and based on their relevance measurements, particularly when it comes to the generalization gap. In this part of the experimentation and evaluation of our approach, we first examine the impact of our choice of measure of correlation between the units of a NN (the HVS measure) on the quality of the estimation of the generalization gap. This impact is compared to an approach used in other works, which uses the correlation of the activations of the NN units and thus requires the training examples to estimate the correlations of the activations of the different NN units [1,4]. Secondly, we show the results obtained by our approach for controlling and adjusting the generalization capacity of a NN. This adjustment allows selecting the best NN architecture adapted to the studied problem.

We observe a dependency between the topology of the NN and its generalization capacity, as shown in Fig. 3. We notice this connection also for the other datasets used in this study. We can approximate this dependency by a linear correlation. As demonstrated in Table 1, the high values of R squared indicate a strong correlation between the topology of the neural network (through the average-life topological characterization of the network) and its generalization capacity. Additionally, Table 1 illustrates that this correlation is even more significant when utilizing our approach compared to the state-of-the-art approaches [1]. This is mainly due to the quality of saliency measurements in representing neurons; in fact, a neuron's saliency measurement summarizes its importance in the entire graph, its placement in the neural network's architecture, and its pertinence, so tracking the evolution of the topology of the neuron graph during learning through the saliency measurements is more representative of the neural network than using only activations, which are highly representative of examples and data but lack information about the architecture and the pertinence of each neuron in the graph. We also explored alternative network characterizations (midlife, entropy), yielding similar and improved results compared to the state of the art. However, for brevity, we present only the average-life characterization in this paper, as all of those characterizations encapsulate the essence of the persistence diagram.

In this study, we approximated this correlation by a linear regression, which we considered to be a reasonable approach. However, in future studies, we may delve deeper into this relationship by seeking to derive an equation that more explicitly links the generalization gap to the topological features based on saliency measurements. Also, we would like to highlight that our algorithm is flexible and can be applied to any DNN architecture.

Estimating the generalization gap based on the network's topological properties could improve our understanding of network behavior and make it easier to build robust and effective machine learning models. However, it should be highlighted that these correlation parameters show model variability. This suggests that more research is needed to develop a non-model-specific strategy that can precisely predict the generalization gap using these topological properties.

Table 1. R-squared comparison: evaluating the performance of our regression model against the state of the art, for average life

Datasets                   State-of-the-art method   Our method
Dermatology                0.95                      0.97
Credit default             0.73                      0.75
Ionosphere                 0.90                      0.90
Website Phishing           0.86                      0.87
Mice Protein Expression    0.60                      0.62
Car                        0.75                      0.75
Seeds                      0.88                      0.89
Haberman's Breast Cancer   0.83                      0.89
The Wholesale Customer     0.90                      0.91
Iris                       0.75                      0.79
Waveform                   0.83                      0.84

To illustrate how our approach works, we present the results on the well-known Iris database. Figure 3 shows the evolution of the generalization gap as a function of the evolution of the NN's topological characteristics during learning. It can be seen from this graph that the generalization gap at the start of training is quite small, but the network has not yet reached its good performance. As the learning process progresses and the NN parameters are adapted, the gap increases and reaches a maximum, before gradually decreasing. This decrease in the gap shows that the NN is on the right learning path. After this phase, the generalization gap begins to increase, indicating that the network is just beginning to memorize the learning data: the start of overfitting.

Fig. 3. Generalization gap estimation as a function of topological characteristics of the NN for the Iris dataset

Parallel to this learning process, our approach estimates the relevance of each NN unit and proceeds to prune the least relevant units for the problem under study. This pruning makes it possible to adjust the architecture of the NN according to the quality and quantity of information provided by the training data. This pruning is stopped as soon as overfitting is detected, by analyzing the generalization gap.

The graphs below do not directly represent the evolution of the NN architecture, but rather the correlation between the various NN units. These graphs encode the functional behavior of the NN. It can be seen that, at the start of learning, the NN units are not very correlated, resulting in a less dense graph of connections. In these graphs, two units are connected when the difference between their HVS is small compared with a predefined threshold. As learning iterations progress, the correlation network becomes denser and the selection of relevant features stabilizes. This process of adjusting the NN architecture is guided by the NN's topological information, which gives us an estimate of the generalization gap.

Fig. 4. Evolution of the functional graph of the NN for the Iris dataset. On the left, the functional graph at the beginning of the training, followed by that of the middle of the training; finally, the one on the right represents the graph at the end of the training and indicates the best network.

By analyzing the evolution of the NN functional graph presented in Fig. 4, we can monitor the impact of the pruning of units deemed irrelevant, and identify the units correlated with each class at the end of the learning process. In other words, after pruning, the functional graph can be used to find the set of units associated with each class: the list of input variables and the list of hidden neurons linked to that class. This provides a simple explanation and characterization of each class. The starting architecture for the Iris data is , and at the end of training it becomes . Neurons are numbered from 0 to the total number of neurons in the network minus 1. Input neurons (0,1,2,3) are in pink, the first hidden layer (4,5) in green, the second layer (6,7,8) in light blue and those of the output layer (9,10,11) in gray. We can notice that, in the case of this simple example, our approach deleted an input (0) deemed irrelevant.

6

Conclusion and Perspectives

In conclusion, this study proposes an innovative approach that utilizes the generalization gap and a relevance measure during neural network training to tailor the architecture to problem complexity and data characteristics. It employs persistent homology to analyze the network topology at each training iteration, revealing how neurons cooperate and aiding in visualizing network development. This approach excels in predicting generalization gaps using the data-independent HVS relevance measure, offering computational efficiency and not relying on dataset usage at every training stage. It contributes to ongoing efforts to enhance neural network interpretability, explainability, and generalization. However, a key limitation lies in the computational complexity, especially for deep networks with many layers and neurons, necessitating suitable sampling techniques and automated overfitting detection. Further research will explore the fundamental connection between our approach and structural risk minimization, enabling generalization bounds without VC-dim estimation.

References
1. Ballester, R., Clemente, X.A., Casacuberta, C., Madadi, M., Corneanu, C.A., Escalera, S.: Towards explaining the generalization gap in neural networks using topological data analysis (2022). https://doi.org/10.48550/ARXIV.2203.12330, https://arxiv.org/abs/2203.12330
2. Chazal, F., Michel, B.: An introduction to topological data analysis: fundamental and practical aspects for data scientists (2017). https://doi.org/10.48550/ARXIV.1710.04019, https://arxiv.org/abs/1710.04019
3. Cohen-Steiner, D., Edelsbrunner, H., Harer, J., Mileyko, Y.: Lipschitz functions have Lp-stable persistence. Found. Comput. Math. 10(2), 127–139 (2010)
4. Corneanu, C., Madadi, M., Escalera, S., Martinez, A.: Computing the testing error without a testing set (2020). https://doi.org/10.48550/ARXIV.2005.00450, https://arxiv.org/abs/2005.00450
5. Deng, W., Zheng, L.: Are labels always necessary for classifier accuracy evaluation? In: Proceedings of the CVPR (2021)
6. Dua, D., Graff, C.: UCI machine learning repository (2017). https://archive.ics.uci.edu/ml
7. Hensel, F., Moor, M., Rieck, B.: A survey of topological machine learning methods. Front. Artif. Intell. 4, 681108 (2021). https://doi.org/10.3389/frai.2021.681108
8. Jiang, Y., et al.: NeurIPS 2020 competition: predicting generalization in deep learning. arXiv preprint arXiv:2012.07976 (2020)
9. Lassance, C., Béthune, L., Bontonou, M., Hamidouche, M., Gripon, V.: Ranking deep learning generalization using label variation in latent geometry graphs. CoRR abs/2011.12737 (2020). https://arxiv.org/abs/2011.12737
10. Vapnik, V.N., Chervonenkis, A.Y.: On the uniform convergence of relative frequencies of events to their probabilities. Theor. Probab. Appl. 16(2), 264–280 (1971)
11. Watanabe, S., Yamana, H.: Topological measurement of deep neural networks using persistent homology. CoRR abs/2106.03016 (2021). https://arxiv.org/abs/2106.03016
12. Yacoub, M., Bennani, Y.: HVS: a heuristic for variable selection in multilayer artificial neural network classifier. In: Dagli, C., Akay, M., Ersoy, O., Fernandez, B., Smith, A. (eds.) Intelligent Engineering Systems Through Artificial Neural Networks, vol. 7, pp. 527–532 (1997)

Genetic Programming Symbolic Regression with Simplification-Pruning Operator for Solving Differential Equations

Lulu Cao1, Zimo Zheng1, Chenwen Ding2, Jinkai Cai3, and Min Jiang1(B)

1 School of Informatics, Xiamen University, Fujian, China
[email protected], [email protected]
2 College of Engineering, Shantou University, Guangdong, China
3 School of Information Science and Technology, Hainan Normal University, Hainan, China

Abstract. Differential equations (DEs) are important mathematical models for describing natural phenomena and engineering problems. Finding analytical solutions for DEs has theoretical and practical benefits. However, traditional methods for finding analytical solutions only work for some special forms of DEs, such as separable variables or transformable to ordinary differential equations. For general nonlinear DEs, analytical solutions are often hard to obtain. The current popular method based on neural networks requires a lot of data to train the network and only gives approximate solutions with errors and instability. It is also a black-box model that is not interpretable. To obtain analytical solutions for DEs, this paper proposes a symbolic regression algorithm based on genetic programming with the simplification-pruning operator (SP-GPSR). This method introduces a new operator that can simplify the individual expressions in the population and randomly remove some structures in the formulas. Moreover, this method uses multiple fitness functions that consider the accuracy of the analytic solution satisfying the sampled data and the differential equations. In addition, this algorithm also uses a hybrid optimization technique to improve search efficiency and convergence speed. This paper conducts experiments on two typical classes of DEs. The results show that the proposed method can effectively find analytical solutions for DEs with high accuracy and simplicity.

Keywords: Differential equations · Symbolic regression · Genetic programming · Genetic operator

This work was partly supported by the National Natural Science Foundation of China under Grant No. 62276222.

1

Introduction

A differential equation (DE) is an equation that involves an unknown function and its derivatives, usually describing the relationship between multiple variables, such as time, space, temperature, pressure, etc. By solving a DE, researchers can obtain an expression that describes the unknown function, which is called an analytical solution of the DE. Researchers can use the analytical solution of a DE to simulate various phenomena in natural sciences and engineering, such as heat conduction, wave propagation, quantum mechanics, etc. The analytical solution of a DE can provide insights into the potential physical mechanisms and mathematical properties of the problem. The analytical solution can also serve as a benchmark for validating numerical methods and experimental results [9]. However, traditional methods for finding analytical solutions for DEs are only applicable to some special forms of DEs, such as those that are separable or reducible to ordinary differential equations. For general nonlinear DEs, it is often impossible to find analytical solutions. The solution process requires some advanced mathematical techniques and methods, such as the Laplace transform, Fourier transform, Green's function, etc., which are challenging for beginners. Therefore, people have developed numerical methods to simulate the process and results of solving DEs with computers [7].

The commonly used numerical methods for DEs are of the following four types: finite difference, finite element [19], finite volume [8], and spectral methods [4]. The finite difference method is simple and easy to use. The finite element method is popular in engineering and fluid dynamics and works well for complex geometries. The finite volume method works with algebraic equations, and meshless methods are useful for difficult problems but need more time and work. Spectral methods use the fast Fourier transform and basis functions to obtain accurate solutions when the solutions are smooth. Finite element and spectral methods are similar but differ in the domain of the basis functions. These numerical methods have their own advantages and disadvantages and need to be selected and compared according to specific problems and requirements. Numerical methods can handle more general and complex DEs but have some errors and instabilities. Numerical methods also require a lot of computational resources and time, so there needs to be a balance between efficiency and accuracy.

Data-driven machine learning methods are a new research direction for solving DEs by improving numerical methods or learning solutions from data. The machine learning methods for DEs are of the following three types. Data-driven discretization methods [1] use machine learning to optimize the discretization in numerical methods, such as neural networks for spatial derivatives or deep reinforcement learning for grid partitioning [3]. They improve accuracy and stability on low-resolution grids. Data-driven dimensionality reduction methods [15] use machine learning to find coordinate systems or models to simplify DEs, such as autoencoders or manifold learning for low-dimensional features, or neural networks or support vector machines for nonlinear transformations or kernels.


They reduce computation time and resources. Data-driven solver methods [17] use machine learning to approximate solutions or solvers of DEs, such as neural networks or Gaussian process regression for functional forms, or neural networks or convolutional neural networks for iterative updates; they avoid discretizing the DEs and reduce errors and instabilities.

Data-driven machine learning methods are usually numerical: they fit data by optimizing parameters to obtain numerical solutions of DEs. These methods have several drawbacks. They are black-box models that are hard to explain and do not give a deep understanding of the physical or mathematical mechanisms behind the data. Their generalization ability is limited: it depends on specific data sets and parameter settings, and the results may be inaccurate or unstable if the data change or contain noise. They also require a lot of time and computational resources, especially for high-dimensional or complex equations.

Therefore, this paper uses Genetic Programming for Symbolic Regression (GPSR) to solve DEs. This method has several advantages. It is interpretable: it produces a clear and structured mathematical formula that gives a deep understanding of the physical or mathematical mechanisms behind the data. It generalizes well: a universal and concise mathematical formula adapts to different data sets and parameter settings and resists noise. It is computationally efficient: a heuristic search and adaptive mutation process can find good solutions within limited time and resources. GPSR has been successfully applied to physical and mechanical problems due to its inherent interpretability [2,10,14].

However, existing GPSR methods can only obtain approximate expressions and are very slow when directly applied to solve differential equations analytically. To improve the speed and accuracy of generating analytical solutions for specific differential equations, we introduce a new operator, called the simplification-pruning operator. This operator merges the like terms of the expressions represented by the individuals, simplifies them, and prunes away redundant terms by setting some constants to zero. Meanwhile, we design multiple fitness functions that take into account both how well the solutions fit the sample data and how well they satisfy the differential equations. In addition, we adopt a hybrid optimization strategy that combines genetic programming with gradient descent to improve search efficiency and convergence speed. We conduct experiments on two typical classes of differential equations, and the results show that our proposed method can quickly find analytical solutions for differential equations with high accuracy and conciseness.

2 Preliminaries

2.1 Genetic Programming for Symbolic Regression (GPSR)

Genetic Programming (GP) is a type of Evolutionary Computation (EC), and EC has made a lot of progress [5,11–13,18]. Genetic Programming for Symbolic Regression is a method that uses GP to solve Symbolic Regression (SR) problems. Symbolic Regression tries to discover a mathematical formula that best fits a given dataset. Genetic Programming is an optimization and search method that simulates the natural evolutionary process; it can automatically generate and evolve computer programs to solve a given problem.

The basic steps of Genetic Programming for Symbolic Regression are shown in Algorithm 1. First, a primitive set is defined that contains all the functions and terminals that can be used to construct a mathematical formula, such as addition (+), subtraction (−), multiplication (×), sin, variables, and constants. Then a fitness function is defined that measures how well each individual fits the given dataset, such as the mean absolute error (MAE). Next, an initial population P0 is randomly generated, with each individual represented by a tree whose internal nodes are functions and whose leaf nodes are terminals (Step 1). After that, the fitness value of each individual in the population is evaluated (Step 3), and some good individuals are selected as the parent population (Step 4). Genetic operations such as crossover and mutation are then applied to the parents to produce new offspring individuals (Step 5). This process is repeated until a termination condition is met, such as reaching the maximum number of iterations or finding a satisfactory solution.

Algorithm 1. Genetic Programming for Symbolic Regression
Require: P (a primitive set of functions and terminals), f (fitness function), N (population size), T (maximum number of iterations)
1: Generate an initial population P0;
2: for t = 0 to T − 1 do
3:   Compute f(p) for each p ∈ Pt;
4:   Select a parent population Mt from Pt based on f;
5:   Apply crossover and mutation to Mt to generate Pt+1;
6: end for
7: Return argmin_{p ∈ P_T} f(p);
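To make the loop in Algorithm 1 concrete, the following is a minimal, self-contained sketch of a GPSR run in Python. It is an illustration only, not the authors' implementation: the primitive set, the nested-tuple tree representation, truncation selection, and the toy dataset (y = 2x) are all simplifying assumptions.

```python
# Minimal GPSR sketch: expression trees are nested tuples ('op', left, right) or terminals.
import random, math

FUNCS = {'+': lambda a, b: a + b, '-': lambda a, b: a - b, '*': lambda a, b: a * b}
TERMINALS = ['x', 1.0, 2.0]

def random_tree(depth=3):
    """Grow a random expression tree from the primitive set."""
    if depth == 0 or random.random() < 0.3:
        return random.choice(TERMINALS)
    return (random.choice(list(FUNCS)), random_tree(depth - 1), random_tree(depth - 1))

def evaluate(tree, x):
    if tree == 'x':
        return x
    if isinstance(tree, (int, float)):
        return tree
    op, left, right = tree
    return FUNCS[op](evaluate(left, x), evaluate(right, x))

def fitness(tree, xs, ys):
    """Mean absolute error of the tree on the dataset (lower is better)."""
    try:
        err = sum(abs(evaluate(tree, x) - y) for x, y in zip(xs, ys)) / len(xs)
        return err if math.isfinite(err) else float('inf')
    except (OverflowError, ValueError):
        return float('inf')

def random_subtree(tree):
    if not isinstance(tree, tuple) or random.random() < 0.4:
        return tree
    return random_subtree(random.choice([tree[1], tree[2]]))

def crossover(a, b):
    """Replace one random subtree of a with a random subtree of b."""
    if not isinstance(a, tuple) or random.random() < 0.3:
        return random_subtree(b)
    op, left, right = a
    return (op, crossover(left, b), right) if random.random() < 0.5 else (op, left, crossover(right, b))

def mutate(tree, depth=2):
    """Replace a random subtree with a freshly generated one."""
    if not isinstance(tree, tuple) or random.random() < 0.3:
        return random_tree(depth)
    op, left, right = tree
    return (op, mutate(left, depth), right) if random.random() < 0.5 else (op, left, mutate(right, depth))

def gpsr(xs, ys, pop_size=50, generations=30):
    population = [random_tree() for _ in range(pop_size)]
    for _ in range(generations):
        scored = sorted(population, key=lambda t: fitness(t, xs, ys))
        parents = scored[: pop_size // 2]                 # simple truncation selection
        offspring = [mutate(crossover(random.choice(parents), random.choice(parents)))
                     for _ in range(pop_size - len(parents))]
        population = parents + offspring
    return min(population, key=lambda t: fitness(t, xs, ys))

best = gpsr([0, 1, 2, 3, 4], [0, 2, 4, 6, 8])             # toy data: y = 2x
print(best, fitness(best, [0, 1, 2, 3, 4], [0, 2, 4, 6, 8]))
```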

2.2 Physics-Regularized Fitness Function

To make the GPSR algorithm more accurate in evolving solutions for differential equations, Oh et al. [16] introduce a physics-regularized fitness function into the standard GPSR to constrain the generated solutions to satisfy the preset differential equations. Specifically, suppose we want to solve a set of l differential equations as in Eq. 1:

L^{(k)}(f(X)) = 0, \quad k = 1, \dots, l   (1)

where X is a p-dimensional vector of variables, X = (x^{(1)}, x^{(2)}, \dots, x^{(p)}), L^{(k)} is an arbitrary differential operator, and f(X) is the unknown function to be discovered. The physics-regularized fitness function is defined as in Eq. 2:

F_k^{Pr} = \| L^{(k)}(\tilde{f}(X)) \|^2, \quad k = 1, \dots, l   (2)

where \tilde{f}(X) is the model proposed by GPSR.
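As an illustration of how the physics-regularized term can be evaluated in practice, the sketch below computes F_k^Pr for one candidate expression with SymPy. The candidate expression, the fourth-order operator d^4u/dx^4 − c (the constant-load form used later in Sect. 4), and the collocation points are assumptions chosen for the example; this is not the code of [16] or of this paper.

```python
import numpy as np
import sympy as sp

x = sp.Symbol('x')
c, l = 1.0, 2.0
u_tilde = 0.041 * x**4 - 0.166 * x**3 + 0.333 * x     # a candidate proposed by GPSR

# Residual of the differential operator applied to the candidate: L(u) = d^4 u / dx^4 - c
residual = sp.diff(u_tilde, x, 4) - c
residual_fn = sp.lambdify(x, residual, 'numpy')

# Mean squared residual on sample points (a collocation-style estimate of F^Pr)
xs = np.linspace(0.0, l, 11)
F_pr = float(np.mean(np.asarray(residual_fn(xs), dtype=float) ** 2))
print(F_pr)
```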

3 Proposed Algorithm

In this section, we describe the details of the proposed SP-GPSR method. First, we present the overall framework of SP-GPSR in Sect. 3.1. Then, we introduce the simplification-pruning operator (Sect. 3.2), the hybrid optimization strategy (Sect. 3.3), and the fitness function (Sect. 3.4), which are the key steps of SP-GPSR.

3.1 Overall Framework

Fig. 1. The framework of SP-GPSR

In this work, we propose a simplification-pruning operator for GPSR, called SP-GPSR, to make the existing Genetic Programming for Symbolic Regression faster and more accurate in generating analytical solutions for specific differential equations. SP-GPSR follows the basic framework of GPSR. Figure 1 shows the overall framework proposed in this work, and the details of the evolution process are given in Algorithm 2. During the evolutionary loop, we first randomly generate a population Qt of size n (Step 4); the purpose of this step is to increase population diversity. Secondly, we optimize the individuals in the populations by fine-tuning their constants with gradient descent (Sect. 3.3) and evaluate each individual with the fitness function (Sect. 3.4) (Step 5). Thirdly, we select some good individuals as the parent population Mt and apply the crossover operator, the mutation operator, and the simplification-pruning operator to Mt (Steps 6–8). Finally, we select individuals as the next generation Pt+1 using the NSGA-II algorithm [6] (Steps 9–10).

Algorithm 2. SP-GPSR Framework
Require: n (population size), T (maximum iteration number), DE (differential equations), D (observation datasets), pc (crossover probability), pm (mutation probability), ps (simplification-pruning probability)
Ensure: Sol (a mathematical expression that fits the specific differential equations)
1: Generate an initial population P0 of size n, where each individual is a tree representing a mathematical expression;
2: t ← 0
3: while t < T do
4:   Randomly generate a population Qt of size n;
5:   Perform local optimization and evaluation on each individual in Pt and Qt;
6:   Select the parent population Mt from Pt and Qt using NSGA-II;
7:   Apply the crossover operator and the mutation operator to Mt;
8:   Apply the simplification-pruning operator to Mt;
9:   Perform local optimization and evaluation on Mt;
10:  Select n individuals from Pt ∪ Qt ∪ Mt as Pt+1 according to NSGA-II;
11:  t ← t + 1
12: end while
13: return the optimal solution Sol

The crossover operator randomly selects subtrees of two parent individuals and exchanges them to produce two new child individuals. The mutation operator randomly selects an internal node of an individual as a mutation point, randomly generates a new subtree, and replaces the subtree rooted at the mutation point to produce a new individual. The simplification-pruning operator, which is the core of our work, is introduced in Sect. 3.2. The hybrid optimization strategy and the fitness evaluation function, which are also key techniques that make our work better than previous works, are introduced in Sect. 3.3 and Sect. 3.4, respectively.

3.2 Simplification-Pruning Operator

Genetic programming (GP) often generates redundant expressions that contain components that can be eliminated. These redundant components increase the complexity and computational cost of the expressions and reduce their interpretability and generalization ability. For example, suppose we use GP to find a function that fits the data points in Eq. 3:

x = [0, 1, 2, 3, 4], \quad y = [0, 2, 4, 6, 8].   (3)

GP may generate the symbolic expression in Eq. 4:

f(x) = (x + 1) \times (x + 1) - x \times x - 1.   (4)


This expression perfectly fits the data points, but it contains redundant components: Eq. 4 can be simplified to 2x. Such redundant components make the expression hard to understand and optimize. Moreover, the constant coefficients in the symbolic expressions generated by GP are sometimes epistatic, meaning that they interact with other variables or operators, which makes local optimization difficult. For example, GP may generate the symbolic expression in Eq. 5:

f(x) = (x + 0.5)^2 + x \times x - x.   (5)

Equation 5 contains an epistatic constant coefficient 0.5, which is added to x and then multiplied by itself. If we want to perform local optimization on 0.5, we have to consider its interaction effects with both x and itself. Expanding (x + 0.5)^2 in Eq. 5, we get x \times x + 2 \times 0.5 \times x + 0.5 \times 0.5, so the value of 0.5 affects both the coefficient of x and the constant term. If the true expression has a positive coefficient for x and a negative constant term, we cannot determine whether to increase or decrease the value of 0.5 during local optimization; in other words, we cannot simply adjust its value independently. Therefore, the symbolic expressions generated by GP are rather redundant, and their constant coefficients are often epistatic, making them hard to optimize locally.

To solve this problem, we propose a new operator: the simplification-pruning operator. The operator simplifies the individual expressions in the population to address the issues above and, by setting some constants to zero, randomly removes certain parts of an expression to increase the diversity of the population. Here is an example to illustrate the simplification-pruning operator. Suppose there is an individual expression as in Eq. 6:

u(x) = (x + 2)^2 + \sin(x) - 3.   (6)

We can simplify Eq. 6 to get Eq. 7:

u(x) = x^2 + 4x + \sin(x) + 1.   (7)

Then we randomly select some constant terms and set them to zero, as in Eq. 8:

u(x) = x^2 + 0x + \sin(x) + 0.   (8)

Simplifying the zeroed expression again, removing redundant zero terms and parentheses, we obtain a new individual, Eq. 9, as the result of the simplification-pruning operator. Besides reducing the redundancy and complexity of the formulas in genetic programming, this operator also increases the diversity of the population without increasing its complexity.

u(x) = x^2 + \sin(x).   (9)
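The following is a minimal sketch of the simplification-pruning idea using SymPy. It is an assumption-laden illustration rather than the paper's operator, which acts on GP expression trees: here simplification is delegated to sympy.expand/simplify, and pruning drops additive terms that carry an explicit numeric constant with probability ps.

```python
import random
import sympy as sp

x = sp.Symbol('x')

def simplify_prune(expr, ps=0.8):
    """Merge like terms, then randomly zero out some constant coefficients (illustrative)."""
    expr = sp.expand(sp.simplify(expr))            # e.g. Eq. 6 -> Eq. 7
    kept = []
    for term in sp.Add.make_args(expr):
        coeff, _ = term.as_coeff_Mul()
        prunable = term.is_number or coeff != 1    # terms with an explicit constant
        if prunable and random.random() < ps:
            continue                               # set that constant to zero (drop the term)
        kept.append(term)
    return sp.simplify(sp.Add(*kept))              # clean up the zeroed expression (Eq. 9)

u = (x + 2)**2 + sp.sin(x) - 3                     # the individual of Eq. 6
print(simplify_prune(u))                           # e.g. x**2 + sin(x), depending on the draws
```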

3.3 Hybrid Optimization Strategy

Suppose we have an observation dataset D = {(X_i, y_i)}, i ∈ {1, 2, \dots, q}, based on the differential equation to be solved; the dataset D contains q observation points. After obtaining the expressions from GP, we perform local optimization on the individuals in the population: we use gradient descent on the dataset D to fine-tune the constants in each individual so that they come closer to the true solution. This is the hybrid optimization strategy, and it improves the search efficiency and convergence speed.
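A minimal sketch of this constant fine-tuning step is given below, assuming a fixed expression structure with three free constants and synthetic observations of the exact solution of Eq. 18 with c = 1, l = 2; plain gradient descent with a numerical gradient stands in for whatever optimizer is used in practice.

```python
import numpy as np

def u(theta, x):
    """Candidate structure with free constants: theta0*x^4 + theta1*x^3 + theta2*x."""
    return theta[0] * x**4 + theta[1] * x**3 + theta[2] * x

def mse(theta, xs, ys):
    return np.mean((u(theta, xs) - ys) ** 2)

def numeric_grad(theta, xs, ys, eps=1e-6):
    g = np.zeros_like(theta)
    for j in range(len(theta)):
        d = np.zeros_like(theta); d[j] = eps
        g[j] = (mse(theta + d, xs, ys) - mse(theta - d, xs, ys)) / (2 * eps)
    return g

# Synthetic observations of the exact solution u = (1/24)(x^4 - 4x^3 + 8x) on [0, 2]
xs = np.linspace(0.0, 2.0, 20)
ys = (1.0 / 24) * (xs**4 - 4 * xs**3 + 8 * xs)

theta = np.array([0.1, -0.1, 0.1])        # constants produced by GP, to be fine-tuned
for _ in range(5000):                      # plain gradient descent
    theta -= 0.01 * numeric_grad(theta, xs, ys)
print(theta)  # drifts toward [1/24, -1/12, 1/3] ~ [0.042, -0.083, 0.333] as the MSE decreases
```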

3.4 Fitness Evaluating Function

In the SP-GPSR, a multi-objective function can be used to optimize both the data fitting error and the physics-regularized fitness term. It is defined as in Eq. 13:

F(X) = (f_1(X), f_2(X), \dots, f_{l+1}(X))   (13)

where f_1(X) is the data fitting error, as shown in Eq. 14, and f_{k+1}(X) is the physics regularization term of the k-th differential equation, as shown in Eq. 15:

f_1(X) = \frac{1}{q} \sum_{i=1}^{q} \left( f(X_i) - y_i \right)^2   (14)

f_{k+1}(X) = F_k^{Pr}   (15)

where the definition of F_k^{Pr} is given in Eq. 2.

4 Experimental Design and Result Analysis

4.1 Detailed Problem Description

Euler-Bernoulli Equation. The Euler-Bernoulli equation is a differential equation that describes the deflection of a beam. It has the form of Eq. 16:

EI \frac{\partial^4 w(x)}{\partial x^4} = q(x)   (16)


where w(x) is the deflection of the beam, E is Young's modulus, I is the moment of inertia, and q(x) is the distributed load on the beam. The analytical solution of the Euler-Bernoulli equation depends on the boundary conditions and the load distribution. Here is a simple example. Suppose we have a simply supported beam of length l, with boundary conditions w(0) = w(l) = 0 and w''(0) = w''(l) = 0, and let q(x) be a uniform load. Then Eq. 16 can be rewritten with a constant right-hand side as Eq. 17:

\frac{\partial^4 w(x)}{\partial x^4} = c   (17)

The solution of Eq. 17 is shown in Eq. 18:

u(x) = \frac{c}{24} \left( x^4 - 2lx^3 + l^3 x \right)   (18)

Poisson’s Equation. Poisson’s equation is an elliptic partial differential equation of broad utility in theoretical physics. For example, the solution to Poisson’s equation is the potential field caused by a given electric charge or mass density distribution; with the potential field known, one can then calculate electrostatic or gravitational fields. Poisson’s equation has the form as Eq. 19: Δϕ = f (x)

(19)

where Δ is the Laplace operator. Suppose f (x) is a source term defined by d −dπ 2 i=1 sin (πxi ), where d refers to the dimension. Then the solution of Eq. 19 is shown in Eq. 20: d  sin (πxi ) . (20) ϕ(x) = i=1
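As a quick sanity check (added for illustration, not part of the original experiments), the following SymPy snippet verifies that the analytical solutions in Eqs. 18 and 20 satisfy Eqs. 17 and 19; the choice d = 2 for the Poisson case is arbitrary.

```python
import sympy as sp

x, c, l = sp.symbols('x c l')

# Euler-Bernoulli: u'''' should equal c, and u should vanish at x = 0 and x = l
u = c / 24 * (x**4 - 2 * l * x**3 + l**3 * x)
assert sp.simplify(sp.diff(u, x, 4) - c) == 0
assert u.subs(x, 0) == 0 and sp.simplify(u.subs(x, l)) == 0

# Poisson (d = 2): the Laplacian of phi should equal -d*pi^2 * prod(sin(pi*x_i))
x1, x2 = sp.symbols('x1 x2')
phi = sp.sin(sp.pi * x1) * sp.sin(sp.pi * x2)
laplacian = sp.diff(phi, x1, 2) + sp.diff(phi, x2, 2)
assert sp.simplify(laplacian + 2 * sp.pi**2 * phi) == 0
print("both analytical solutions verified")
```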

4.2 Performance Comparison

In this section, we demonstrate the ability of SP-GPSR to solve a differential equation and a partial differential equation problem. We also implement the GPSR with a hybrid optimization strategy (HGPSR) and the GPSR with both a hybrid optimization strategy and a physics-regularized fitness function (PHGPSR). We compare the proposed SP-GPSR with HGPSR and PHGPSR under the same experimental settings (shown in Table 1) to verify the effectiveness of the proposed method. Table 2 and Table 3 show the simplified optimal solutions of all algorithms, with three significant digits retained. Our method obtained the exact analytical solution within a limited number of iterations, while the other methods only obtained approximate solutions that fit the observed data but are not the exact solution. This demonstrates that our method can accurately and efficiently solve differential equations. We noticed that sometimes the best solutions of the other methods, after simplification, had the correct functional structure, but the constant coefficients were not consistent with the true solution. This is because the GP-generated symbolic expressions are redundant, and their constant coefficients interact with multiple variables or constants, making them difficult to optimize by gradient descent. This further illustrates the effectiveness of the simplification-pruning operator.

Table 1. GPSR hyperparameters

Equation        | Population size | Generations | pc  | pm  | ps  | Operators
Euler-Bernoulli | 100             | 20          | 0.6 | 0.6 | 0.8 | +, −, ×
Poisson         | 50              | 20          | 0.6 | 0.6 | 0.8 | +, ×, sin

Table 2. Optimal Solution of SP-GPSR, HGPSR and PHGPSR in Euler-Bernoulli Equation

c   | l | SP-GPSR                        | HGPSR                      | PHGPSR
1   | 2 | 0.041x^4 − 0.166x^3 + 0.333x   | −0.326x^2 + 0.658x − 0.009 | 0.008x^4 − 0.017x^3 + 0.008x
1   | 4 | 0.041x^4 − 0.333x^3 + 2.66x    | 8.54x^2 − 14.2x            | 6.57e−6x^4 + 9.86e−6x^3 + 4.32e−11x^2 − 3.00x − 3.28e−6
3   | 2 | 0.124x^4 − 0.499x^3 + 0.999x   | −1.09x^2 + 0.899x          | 9.29e−6x^4 + 0.000193x^3 + 0.000346x^2 + 2.47e−5x
3   | 4 | 0.124x^4 − 0.999x^3 + 7.999x   | −1.40x^2 + 5.37x           | −1.49e−8x^4 − 1.00x^3 + 2.00x^2 + 1.00x
5   | 2 | 0.208x^4 − 0.834x^3 + 1.66x    | −1.36x^2 + 2.49x           | 0.491x^4 − 0.120x^3 − 0.245x^2 − 1.12x − 0.752
5   | 4 | 0.208x^4 − 1.66x^3 + 13.3x     | −4.00x^2 − 5.82x           | 1.10x^4 + 0.103x^3 − 2.20x^2 + 0.907x − 0.0928
0.5 | 2 | 0.0209x^4 − 0.0835x^3 + 0.166x | 3.22x^2 − 5.80x − 1.30     | 4.01e−7x^4 + 1.67e−5x^3 + 3.14e−5x^2 + 1.02x + 0.0251
0.5 | 4 | 0.0208x^4 − 0.166x^3 + 1.33x   | 0.0311x^2 + 2.03x          | 2.00x^4 − 1.00x^3 − x + 1.00

Table 3. Optimal Solution of SP-GPSR, HGPSR and PHGPSR in Poisson's Equation

d | SP-GPSR                     | HGPSR                 | PHGPSR
1 | sin(3.14x1)                 | sin(3.10x1)           | sin(3.14x1)
2 | sin(3.14x1) × sin(3.14x2)   | −0.504 × sin(0.261x2) | 0.964 × sin(0.983x1) × sin(0.983x2)

4.3 Sampling Number and Performance

In this experiment, we investigated the effect of the sampling number on the performance of SP-GPSR. We varied the sampling number from 3 to 11 with a step size of 2 and ran the SP-GPSR algorithm independently 30 times for each sampling number. We measured performance by the success rate, defined as the proportion of runs that found an exact solution within a given number of generations. Figure 2 shows the cumulative distribution of the success rate of finding analytical solutions of the Euler-Bernoulli equation at c = 5, l = 2. The x-axis represents the number of generations, and the y-axis represents the cumulative distribution of the success rate. The plot shows that the success rate is not sensitive to the choice of sampling number, and the variability of the performance is likewise unaffected. Therefore, we can conclude that the sampling number does not have a significant impact on the performance of SP-GPSR.


Fig. 2. Cumulative Distribution of Successful SP-GPSR Model Evolution

5 Conclusion

In this paper, we proposed a novel symbolic regression algorithm based on genetic programming with the simplification-pruning operator (SP-GPSR) to find analytical solutions to differential equations (DEs). Our method can generate simple and interpretable expressions that satisfy both the sampled data and the DEs. We conducted experiments on two typical classes of DEs. The experimental results have shown that our method can effectively find analytical solutions for DEs with high accuracy and simplicity. For future work, we plan to combine our method with transfer learning techniques, which can leverage the knowledge learned from previous problems to accelerate the solving process of new problems. Another direction is to incorporate some advanced techniques in genetic programming.

References

1. Bar-Sinai, Y., Hoyer, S., Hickey, J., Brenner, M.P.: Learning data-driven discretizations for partial differential equations. Proc. Natl. Acad. Sci. 116(31), 15344–15349 (2019)
2. Bomarito, G., Townsend, T., Stewart, K., Esham, K., Emery, J., Hochhalter, J.: Development of interpretable, data-driven plasticity models with symbolic regression. Comput. Struct. 252, 106557 (2021)
3. Brenner, M.: Machine learning for partial differential equations. In: APS March Meeting Abstracts, vol. 2021, pp. P61–003 (2021)
4. Bueno-Orovio, A., Pérez-García, V.M., Fenton, F.H.: Spectral methods for partial differential equations in irregular domains: the spectral smoothed boundary method. SIAM J. Sci. Comput. 28(3), 886–900 (2006). https://doi.org/10.1137/040607575
5. Cao, L., Jiang, M., Feng, L., Lin, Q., Pan, R., Tan, K.C.: Hybrid estimation of distribution based on knowledge transfer for flexible job-shop scheduling problem. In: 2022 4th International Conference on Data-driven Optimization of Complex Systems (DOCS), pp. 1–6. IEEE (2022)
6. Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)
7. Denis, B.: An overview of numerical and analytical methods for solving ordinary differential equations. arXiv preprint arXiv:2012.07558 (2020)
8. Eymard, R., Gallouët, T., Herbin, R.: Finite volume methods. Handb. Numer. Anal. 7, 713–1018 (2000)
9. Henner, V., Nepomnyashchy, A., Belozerova, T., Khenner, M.: Partial differential equations. In: Ordinary Differential Equations: Analytical Methods and Applications, pp. 479–545. Springer, Cham (2023)
10. Hernandez, A., Balasubramanian, A., Yuan, F., Mason, S.A., Mueller, T.: Fast, accurate, and transferable many-body interatomic potentials by symbolic regression. NPJ Comput. Mater. 5(1), 112 (2019)
11. Hong, H., Jiang, M., Feng, L., Lin, Q., Tan, K.C.: Balancing exploration and exploitation for solving large-scale multiobjective optimization via attention mechanism. In: 2022 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2022)
12. Hong, H., Jiang, M., Yen, G.G.: Improving performance insensitivity of large-scale multiobjective optimization via Monte Carlo tree search. IEEE Trans. Cybern. (2023)
13. Jiang, M., Wang, Z., Guo, S., Gao, X., Tan, K.C.: Individual-based transfer learning for dynamic multiobjective optimization. IEEE Trans. Cybern. 51(10), 4968–4981 (2020)
14. Karpatne, A., et al.: Theory-guided data science: a new paradigm for scientific discovery from data. IEEE Trans. Knowl. Data Eng. 29(10), 2318–2331 (2017)
15. Meuris, B., Qadeer, S., Stinis, P.: Machine-learning-based spectral methods for partial differential equations. Sci. Rep. 13(1), 1739 (2023)
16. Oh, H., Amici, R., Bomarito, G., Zhe, S., Kirby, R., Hochhalter, J.: Genetic programming based symbolic regression for analytical solutions to differential equations. arXiv preprint arXiv:2302.03175 (2023)
17. Sacchetti, A., Bachmann, B., Löffel, K., Künzi, U., Paoli, B.: Neural networks to solve partial differential equations: a comparison with finite elements. IEEE Access 10, 32271–32279 (2022). https://doi.org/10.1109/ACCESS.2022.3160186
18. Wang, Z., Hong, H., Ye, K., Zhang, G.E., Jiang, M., Tan, K.C.: Manifold interpolation for large-scale multiobjective optimization via generative adversarial networks. IEEE Trans. Neural Netw. Learn. Syst. (2021)
19. Ying, L.: Partial differential equations and the finite element method. Math. Comput. 76(259), 1693–1694 (2007). https://www.ams.org/journals/mcom/2007-76-259/S0025-5718-07-02023-6/home.html

Explainable Offensive Language Classifier

Ayushi Kohli(B) and V. Susheela Devi

Indian Institute of Science, Bangalore, Karnataka, India
{ayushikohli,susheela}@iisc.ac.in

Abstract. Offensive content in social media has become a serious issue, due to which its automatic detection is a crucial task. Deep learning approaches for Natural Language Processing (NLP) have proven to be at or even above human-level accuracy for offensive language detection tasks, which justifies the deployment of deep learning models for these tasks. However, in contrast to humans, there is one key aspect these models lack: explainability. In this paper, we provide an explainable model for offensive language detection in the case of multi-task learning. Our model achieved an F1 score of 0.78 on the OLID dataset and 0.85 on the SOLID dataset. We also provide a detailed analysis of the model's interpretability.

Keywords: Deep Learning · NLP · explainability and interpretability

1 Introduction

Be it social media platforms, e-commerce, or government organizations, all are in some way affected by offensive content on media. This calls for automation in detecting comments or tweets as offensive or not offensive. Using deep learning for text classification involving millions or billions of text samples is quite common. Although these NLP models achieve high performance in text classification, they act as a black box to us, so there is a growing need to make them explainable. To do so, we have two choices: either design inherently interpretable models, or add explainability to an already designed model via post hoc explanations [26]. Post hoc explanation methods [26] like LIME [1], SHAP [2], Integrated Gradients [3], etc., are quite popular, yet they suffer from several limitations, such as a lack of stability, consistency, faithfulness, and robustness. One of the major concerns is stability: given the same test point, we get different explanations each time the explainability algorithm executes, which reduces trust in the model. There has been significantly less work on designing models that are inherently interpretable; this paper is our attempt towards the same. Interpretability [25] can be classified into the following two categories:
• Model Interpretability: aims to explain what the model has learned.
• Prediction Interpretability: aims to explain, for a given test data point, how the model arrived at its prediction.


In this paper, we have first designed a model for multi-task [24] tweet classification. The task is based on a three-level hierarchy comprising three subtasks: (i) Offensive Language Detection (Subtask A), (ii) Categorization of Offensive Language (Subtask B), and (iii) Offensive Language Target Identification (Subtask C). Secondly, we discuss the interpretability of our model. First, we take the word embeddings of each word, which are then given to a CNN [4]. CNNs have been studied for interpretability in computer vision tasks. Images form a continuous space, and the spatial relationships between pixels are captured by CNNs, unlike text data, which forms a discrete space. Because of this, applying CNNs to text is like applying brute force: they do not look for specific patterns in the text as they do in images; instead, they try out each and every combination of n-grams. This is analogous to computer vision, where each filter of a convolution layer specializes in detecting a particular pattern, for example, horizontal edges. Similarly, for text data, each filter specializes in detecting different types of n-grams; for example, one filter may capture all the positive sentiment words like good, great, fantastic, etc. Our main contributions are as follows:
• We provide an interpretable model for hierarchical multi-level tweet classification with a high F1 score [23] for both the OLID [5] and SOLID [6] datasets.
• We calculate a threshold for each filter; all the n-grams scoring above this threshold contribute to the classification decision.
• With the help of clustering [22], we find that the filters are not homogeneous; that is, each filter activates on multiple distinct types of n-grams.
• We associate each filter with a class and analyze to which class a particular filter relates the most.

2 Related Work

In [7], the authors discuss various methods like Naive Bayes [14], SVM [15], and LSTM [16], and explainability methods like LIME [1] and LRP [17], to make offensive language detection explainable. An approach for interpreting CNNs for text by exploiting the local linear structure of ReLU-DNNs [18] has been developed in [8]. In [9], the authors discuss the interpretability aspects of ReLU-DNNs [18], that is, how a deep ReLU network can be decomposed into many local linear models, which are interpretable. The role of CNNs in text classification tasks, along with interpretability aspects, is discussed in [10]. The authors in [11] have proposed a model for multi-task tweet classification; however, our model improves on theirs in that it is designed to be inherently interpretable and is simpler (Fig. 1).

3 Proposed Method

Fig. 1. Our proposed model for offensive tweet classification (subtask A), categorization of offensive language (subtask B), and offensive language target identification (subtask C).

1. Model Architecture: Here, we present our multi-task [24] model for offensive tweet classification for subtasks A, B, and C. First, we obtain the word embedding corresponding to each word of each sentence of the dataset with the help of word2vec [12]. These word embeddings are given to the convolution layer, where the convolution operation is performed between the word embeddings and filters of different sizes, followed by max-pooling. The output of max-pooling is given to a dense layer, which outputs its decision for subtask A (offensive/not offensive), B (targeted insult and threats/untargeted), or C (individual/group/other). As shown in the diagram, the word embeddings generated by word2vec are simultaneously given as input to three convolution layers, one for each subtask.
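A simplified PyTorch sketch of this architecture is given below. It is an illustration under assumed details (32 filters per size, ReLU after the convolutions, random tensors standing in for word2vec vectors), not the authors' code.

```python
import torch
import torch.nn as nn

class MultiTaskTextCNN(nn.Module):
    def __init__(self, emb_dim=300, filter_sizes=(3, 4, 5), n_filters=32,
                 n_classes=(2, 2, 3)):                      # subtasks A, B, C
        super().__init__()
        def branch():
            return nn.ModuleList(nn.Conv1d(emb_dim, n_filters, k) for k in filter_sizes)
        self.branches = nn.ModuleList(branch() for _ in n_classes)
        self.heads = nn.ModuleList(
            nn.Linear(n_filters * len(filter_sizes), c) for c in n_classes)

    def forward(self, emb):                   # emb: (batch, seq_len, 300) word2vec vectors
        emb = emb.transpose(1, 2)             # -> (batch, 300, seq_len) for Conv1d
        outputs = []
        for convs, head in zip(self.branches, self.heads):
            # each filter scores every n-gram window; max-pooling keeps the best score
            pooled = [conv(emb).relu().max(dim=2).values for conv in convs]
            outputs.append(head(torch.cat(pooled, dim=1)))
        return outputs                        # logits for subtasks A, B and C

model = MultiTaskTextCNN()
logits_a, logits_b, logits_c = model(torch.randn(4, 50, 300))   # a dummy batch
```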

2. Text Processing by CNNs: We now look at the way CNNs process text data. The word embeddings are given to the convolution layers, which consist of filters of different sizes; in our experiments, we have used bi-gram, tri-gram, and 4-gram filters. Filters act as n-gram detectors: they assign weights to all possible n-grams and thus give the highest weight to the most relevant n-gram. After the convolution operation between the filters and the word embeddings, max-pooling is performed, and the resulting max-pooled vector is given to the dense layer, which produces the final classification result. As the final classification decision depends only on the max-pooled vector, the n-grams not in the max-pooled vector clearly do not participate in the classification decision. But what about the n-grams in the max-pooled vector? Does the entire max-pooled vector contribute to the classification decision? We can separate the n-grams in the max-pooled vector into two categories: n-grams contributing to the classification decision and n-grams not contributing to it. The contributing n-grams are those in the max-pooled vector that lead to the classification output. The non-contributing n-grams are those that do not influence the classification output; they were chosen in the max-pooled vector only because some element had to be chosen. We will now separate these two types of n-grams in the max-pooled vector.

3. Determining filter thresholds: The max-pooled vector is multiplied by the weight matrix of the dense layer. The dense layer has as many nodes as there are classes, so the output is the class corresponding to the highest score resulting from the multiplication between the weight matrix and the max-pooled vector. In the max-pooled vector, one value comes from each filter. Thus, in a way, we multiply the filter scores (max-pooled vector) with the class weights (weight matrix of the dense layer) to arrive at the final classification decision, and each filter can be associated with a class. We associate each data point with the class assigned by the filter to that point, and we also have a true label corresponding to every data point. This can be understood in more detail through the following toy example. Let us take the following sentence and form bi-grams from it, as shown in Table 1: "Brilliant movie must watch it!" Table 1 shows the scores assigned to these bi-grams when passed through the filters.

Table 1. Filter scores assigned to different bi-grams.

n-grams  | Brilliant, movie | movie, must | must, watch | watch, it | it, !
Filter 1 | 0.8              | 0.6         | 0.4         | 0.3       | 0.1
Filter 2 | 0.4              | 0.3         | 0.7         | 0.5       | 0.3
Filter 3 | 0.1              | 0.6         | 0.4         | 0.2       | 0.3
Filter 4 | 0.3              | 0.5         | 0.4         | 0.1       | 0.6

Table 2 shows the maximum score assigned by each filter to the n-grams (the maximum of a row of Table 1), forming the max-pooled vector.

Table 2. Max-pooled vector

Filter 1 | 0.8
Filter 2 | 0.7
Filter 3 | 0.6
Filter 4 | 0.6

The first three rows of Table 3 show the dense-layer weights (a 3 × 4 matrix). The last row shows the class associated with each filter, that is, the class with the maximum weight for that filter (the maximum of each column).

Table 3. Dense layer weights (first three rows) and the class associated with each filter (last row)

Filters          | Filter 1 | Filter 2 | Filter 3 | Filter 4
class 1          | 1        | 1        | 2        | 3
class 2          | 2        | 4        | 3        | 7
class 3          | 5        | 1        | 6        | 4
associated class | 3        | 2        | 3        | 2

This weight matrix is then multiplied with the max-pooled vector (Table 2) to obtain the classification outcome (Table 4):

\begin{pmatrix} 1 & 1 & 2 & 3 \\ 2 & 4 & 3 & 7 \\ 5 & 1 & 6 & 4 \end{pmatrix} \begin{pmatrix} 0.8 \\ 0.7 \\ 0.6 \\ 0.6 \end{pmatrix} = \begin{pmatrix} 4.5 \\ 10.4 \\ 10.7 \end{pmatrix}

Table 4. Classification output

class 1 | 4.5
class 2 | 10.4
class 3 | 10.7
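The toy computation of Tables 2–4 can be reproduced with a few lines of NumPy; the snippet below is only an illustration of the filter-to-class association described above.

```python
import numpy as np

max_pooled = np.array([0.8, 0.7, 0.6, 0.6])     # Table 2: one score per filter
W = np.array([[1, 1, 2, 3],                      # Table 3: dense-layer weights,
              [2, 4, 3, 7],                      # one row per class
              [5, 1, 6, 4]])

class_scores = W @ max_pooled                    # Table 4: [4.5, 10.4, 10.7]
predicted_class = class_scores.argmax() + 1      # class 3
filter_to_class = W.argmax(axis=0) + 1           # class associated with each filter
print(class_scores, predicted_class, filter_to_class)   # [3 2 3 2], Table 3's last row
```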

As shown in Table 3, we used the trained weight matrix to associate each filter with a class; this associated class is the maximum of each column. There also exists a true class for every data point. For every filter, we have the max-pooled vector score assigned by the filter to the n-gram, the class associated with the filter, and the actual class of every data point. We now label each point (a point being the max-pooled vector score corresponding to a filter) as 1 if the class associated with the filter matches the actual class, and 0 otherwise. So we now have points of the form (max-pooled vector score, 1) or (max-pooled vector score, 0), and we plot them, resulting in a one-dimensional scatter plot as shown in Fig. 2; we produce such a plot for every filter. The point separating the two classes determines the threshold for that filter. As shown in Fig. 2, the threshold, that is, the point separating the blue and magenta points, is around 6.5. We use these filter thresholds to find whether an n-gram activates a given filter: if the n-gram score is above the filter threshold, the n-gram is relevant to the filter; otherwise, it is irrelevant. In this way, we find which n-grams were responsible for the classification decision. This is analogous to what filters do in the case of images: for images, there is a filter responsible for detecting horizontal edges; similarly, for text, we can say that there is a filter that activates on a particular set of n-grams. So, if we take all the n-grams in our corpus and pass them through a specific filter, the filter assigns scores to these n-grams through the convolution operation, and all the n-grams whose scores are above the threshold of the filter are the n-grams described by that filter.

Fig. 2. n-gram scores

Now we know which n-grams activate a filter, but an n-gram is composed of n words. Can we further know which words in the n-gram are responsible for activating the filter? The n-gram score is a sum of individual word scores and can therefore be decomposed into individual word scores by considering the inner products between every word embedding and the filter. So, we decompose n-gram scores into individual word scores.

4. Clustering: We now perform clustering [22] on these individual word scores (a sketch of this step is given after this list). For clustering, we have used the Hierarchical Agglomerative Clustering [13] algorithm, which does not require prior knowledge of the number of clusters. This clustering gave us some useful insights into filter behavior. We found that each filter captures different semantic classes of n-grams, and each class has some words that activate the filter and some that do not. Thus, each filter forms multiple clusters of n-grams.

5. Interpretability: Interpretability can be classified into two categories: i) Model Interpretability, ii) Prediction Interpretability.
• Model Interpretability: aims to find what the model has learned during training. i) We have found the class each filter associates with the most. ii) The threshold value for each filter captures which n-grams are activated by the filter. iii) Through clustering, we found the linguistic patterns identified by each filter.
• Prediction Interpretability: aims to explain how the model arrived at a particular prediction for a given input. For example, in sentiment classification, if the model classifies a given input as positive, prediction interpretability aims to find which words in the input text made the model classify it as positive. Here, instead of looking at all the words, we only look at the n-grams present in the max-pooled vector. We also find which word in the n-gram influences the classification decision the most.
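The following is a small sketch of the clustering step described in item 4 above. The word-score triples are taken from Table 14, while the use of scikit-learn's AgglomerativeClustering with a distance threshold (so the number of clusters is not fixed in advance) and the threshold value itself are assumptions made for illustration.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering

# Per-word scores of tri-grams that activate one filter (rows = tri-grams, from Table 14)
word_scores = np.array([
    [0.15, 7.32, 5.43],   # "the b******* flag"
    [3.24, 0.80, 7.32],   # "made up b*******"
    [4.56, 3.55, 7.32],   # "insanely ridiculous b*******"
    [0.34, 5.13, 3.44],   # "a beautiful person"
    [0.15, 5.13, 4.43],   # "the beautiful bears"
    [0.25, 0.64, 5.13],   # "she is beautiful"
])

# Hierarchical agglomerative clustering without fixing the number of clusters in advance
clustering = AgglomerativeClustering(n_clusters=None, distance_threshold=5.0)
labels = clustering.fit_predict(word_scores)
print(labels)   # one cluster id per tri-gram; similar score patterns end up together
```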

4 Experiments

We now present the experiments we performed to justify the above-discussed theory.

Datasets Used: We have performed experiments on two datasets for offensive language classification.
• Offensive Language Identification Dataset (OLID) [5]: This dataset was collected from Twitter. It comprises 14,100 tweets, of which 13,240 are in the training set and the remaining 860 are in the test set. There are three levels of labels: i) Offensive/Not Offensive, ii) Targeted Insult and Threat/Untargeted, iii) Individual/Group/Other. The relationship between these levels is hierarchical: if a tweet is offensive, it can be targeted or untargeted; if a tweet is offensive and targeted, it can target an individual, a group, or other.
• Semi-Supervised Offensive Language Identification Dataset (SOLID) [6]: SOLID is similar to OLID, with the same three subtasks and the same hierarchical relationship between them. However, while the size of OLID is limited, SOLID contains 9,089,140 tweets for subtask A and 1,88,973 tweets for subtasks B and C. SOLID provides average confidence (AVG CONF) values lying between 0 and 1, which are converted to the respective classes by setting the threshold to 0.4; this way, SOLID is converted into a classification dataset.

Data Pre-processing Steps: First, all characters are converted to lowercase and all extra spaces are removed, followed by the following pre-processing steps applied to both datasets:
• Replacing @USER tokens: all tweets with multiple @USER tokens had them replaced by a single @USERS token.
• All emojis were converted to their respective words.
• All hashtags in the tweets were segmented by capital words.

Model Components Used
• Word2Vec: To obtain the word embeddings, we have used the word2vec [12] algorithm. For each word, we obtained an embedding of 300 dimensions.
• CNN: We have used filter sizes of {3, 4, 5} words and {8, 32, 64} filters.
• Loss function: Weighted cross-entropy loss [19] is used, weighted over all three subtasks. The loss function is

L = \sum_{i \in \{A, B, C\}} w_i L_i

where L_i is the cross-entropy loss for subtask i and the weights w_i are the loss weights for subtasks A, B, and C, found by cross-validation [20], with

\sum_{i \in \{A, B, C\}} w_i = 1
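A minimal PyTorch sketch of this weighted multi-task loss is shown below; the weight values and the dictionary-based interface are placeholders for illustration, since the paper determines the weights by cross-validation.

```python
import torch
import torch.nn.functional as F

weights = {'A': 0.5, 'B': 0.3, 'C': 0.2}          # assumed values; they must sum to 1

def multitask_loss(logits, targets):
    """logits/targets are dicts keyed by subtask, e.g. logits['A'] of shape (batch, n_classes)."""
    return sum(weights[t] * F.cross_entropy(logits[t], targets[t]) for t in weights)

# Example with a batch of 4 tweets
logits = {'A': torch.randn(4, 2), 'B': torch.randn(4, 2), 'C': torch.randn(4, 3)}
targets = {'A': torch.randint(0, 2, (4,)), 'B': torch.randint(0, 2, (4,)),
           'C': torch.randint(0, 3, (4,))}
print(multitask_loss(logits, targets))
```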

Table 5. F1 scores for subtask A

Dataset | F1 score
OLID    | 0.78
SOLID   | 0.85

Table 5 shows the macro-F1 [21] scores for subtask A for both datasets. We can see that the model performs better in the case of SOLID.

Table 6. Top tri-gram scores and word scores from a sample of eight filters for the OLID dataset

Filter | n-gram              | top n-gram score | word scores
0      | black hearted troll | 14.93            | 4.5, 4.76, 5.67
1      | piece of s***       | 10.81            | 4.1, 0.12, 6.59
2      | you are correct     | 6.19             | 1.45, 0.54, 4.2
3      | he is black         | 5.9              | 0.24, 0.11, 5.55
4      | so very important   | 3.78             | 0.34, 0.21, 3.23
5      | drunk crazy man     | 9.57             | 3.56, 4.67, 1.34
6      | all the s***        | 7.37             | 0.24, 0.35, 6.78
7      | humor or knowledge  | 9.14             | 3.56, 0.11, 5.47

Table 7. Top tri-gram scores and word scores from a sample of eight filters for the SOLID dataset

Filter | n-gram                   | top n-gram score | word scores
0      | who the f***             | 8.15             | 1.25, 0.12, 6.78
1      | white domestic terrorist | 12.35            | 2.67, 3.9, 5.78
2      | a sunny day              | 6.01             | 0.13, 3.34, 2.54
3      | hatred for america       | 9.04             | 4.9, 0.68, 3.46
4      | a binary choice          | 4.38             | 0.08, 2.8, 1.5
5      | f****** idiot spanish    | 13.92            | 6.99, 3.28, 3.65
6      | f****** f****** finally  | 16.87            | 6.99, 6.99, 2.89
7      | out of season            | 6.34             | 2.4, 0.44, 3.5


Tables 6 and 7 show the maximum tri-gram scores from eight filters for the OLID and SOLID datasets. We can see that filters 0, 1, 3, 5, and 6 detect offensive tri-grams, whereas filters 2, 4, and 7 detect non-offensive tri-grams, for both datasets.

Table 8. Top tri-gram scores and word scores from a sample of eight filters for the OLID dataset by taking all possible tri-grams of the dataset.

Filter | word 1            | word 2          | word 3
0      | gun—7.4           | memory—5.5      | sandyhook—6.5
1      | manifesto—5.1     | congress—3.7    | commonsense—6.6
2      | goldsmith—3.74    | collusion—2.4   | impeccable—4.5
3      | polygraph—4.2     | canada—3.23     | black—5.55
4      | hoarding—2.5      | million—1.3     | legalizes—3.8
5      | cannabis—3.64     | money—4.8       | righteous—2.23
6      | discretion—2.6    | skeptics—2.06   | s***—6.78
7      | overbearing—4.56  | servitude—3.89  | arrogantly—6.1

Table 9. Top tri-gram scores and word scores from a sample of eight filters for the SOLID dataset by taking all possible tri-grams of the dataset.

Filter | word 1            | word 2            | word 3
0      | tindernobody—2.3  | lisp—3.45         | f***—6.78
1      | pussy—2.5         | credit—3.5        | adorable—2.67
2      | dirty—2.7         | concerned—4.05    | emotional—6.54
3      | narcissists—4.3   | psychopaths—5.7   | walmart—4.2
4      | stupid—5.4        | vibe—1.23         | ugly—3.5
5      | garbage—7.5       | bro—3.3           | obama—4.2
6      | irrelevant—3.76   | pathological—3.3  | trump—4.03
7      | ditch—7.23        | mirror—7.01       | gucci—4.21

Table 8 shows the highest tri-gram scores from eight filters when every possible tri-gram from the entire dataset is passed through the filters for the OLID dataset. The maximum n-gram score, that is, the sum of the individual word scores, is higher here (7.4 + 5.5 + 6.5 = 19.4) than in Table 6, where the maximum n-gram score was 14.93 and the n-grams were taken as given in the dataset. We can see that each word score for a particular filter in this table is higher than the word score of the respective filter in Table 6. Table 9 shows the highest tri-gram scores from eight filters when every possible tri-gram from the entire dataset is passed through the filters for the SOLID dataset. Here the maximum tri-gram score is 18.45 (7.23 + 7.01 + 4.21 = 18.45), which is again higher than the maximum n-gram score in Table 7, that is, 16.87. Thus, the maximum n-gram score occurs when all possible combinations of n-grams are taken from the dataset. We can also see that the word scores for a particular filter in this table are higher than the word scores of the respective filter in Table 7, with some exceptions. Moreover, in Tables 8 and 9 no pattern is formed as in Tables 6 and 7, where some filters detected offensive tri-grams and some non-offensive tri-grams; here, the filters are simply selecting the words in every tri-gram that lead to its maximum activation score.

Table 10. Top-8 tri-gram scores from a filter for the OLID dataset

No | n-gram           | word scores
1  | piece of s***    | 4.1, 0.12, 6.59
2  | make s*** holes  | 3.4, 6.59, 5.45
3  | women make s***  | 3.77, 3.4, 6.59
4  | a s*** hole      | 0.23, 6.59, 5.44
5  | all the s***     | 0.55, 0.12, 6.59
6  | like that s***   | 2.2, 0.32, 6.59
7  | with good reason | 1.22, 5.43, 3.5
8  | only good person | 3.45, 5.43, 4.78

Table 10 shows some useful insights captured by a particular filter for the OLID dataset. This filter captures some offensive tri-grams (numbers 1 to 6) and some non-offensive tri-grams (numbers 7 and 8). Table 11 shows the top eight tri-grams captured by a filter for the SOLID dataset; tri-grams 1, 2, 3, 4, 5, and 7 are offensive, whereas 6 and 8 are non-offensive. Thus, we can see from Tables 10 and 11 that a filter can detect both types of tri-grams, offensive and non-offensive, and it is clear from this observation that a particular filter can detect multiple different types of n-grams and is not restricted to a single type. Table 12 shows the maximum tri-gram scores from a particular filter for the OLID dataset, where tri-grams are formed by taking all possible combinations of words from the entire dataset; it is clear from Tables 10 and 12 that this filter focuses on words like s***, good, etc. Table 13 shows the maximum tri-gram scores from a particular filter for the SOLID dataset, where tri-grams are again formed by taking all possible combinations of words from the entire dataset; this filter focuses on words like f***, happy, etc. Table 14 (Appendix) shows the result of clustering applied to the tri-gram scores for the OLID dataset. One cluster focuses on offensive words like b*******, whereas the other focuses on non-offensive words like beautiful.


Table 11. Top-8 tri-gram scores from a filter for the SOLID dataset

No | n-gram                 | word scores
1  | who the f***           | 2.25, 0.11, 6.43
2  | what the f***          | 2.5, 0.11, 6.43
3  | f****** boring person  | 6.5, 3.22, 3.77
4  | scared as f***         | 4.27, 0.66, 6.43
5  | a f****** ford         | 0.31, 6.5, 5.33
6  | happy everyone is      | 4.22, 5.34, 0.35
7  | so f****** bored       | 0.34, 6.5, 3.4
8  | happy birthday to      | 4.22, 5.13, 0.41

Table 12. Top-8 tri-gram scores from a particular filter for the OLID dataset by taking all possible tri-grams of the dataset.

No | word 1         | word 2          | word 3
1  | thank—1.48     | s***—6.59       | control—5.45
2  | what—3.42      | good—5.43       | control—5.45
3  | strict—3.2     | result—6.47     | audience—5.44
4  | people—1.03    | s***—6.59       | selective—5.54
5  | food—3.34      | good—5.43       | testify—5.44
6  | circus—3.54    | s***—6.59       | murder—4.55
7  | diamond—2.6    | criminal—6.13   | control—5.45
8  | democracy—5.5  | good—5.43       | violence—4.56

Table 15 (Appendix) shows the result of clustering applied to the tri-gram scores for the SOLID dataset. Here, one cluster focuses on offensive words like a**, the second cluster focuses on words like congratulate, while the third focuses on words like pretty, etc. One conclusion that can be drawn from Tables 14 and 15 (Appendix) is the following: filters do not detect one particular type of n-gram; rather, they detect multiple different types of n-grams, thus forming different clusters. N-grams within a cluster are similar to one another, but a filter forms multiple clusters and not just a single one, which clearly shows that a filter does not detect only a particular type of n-gram. Figure 3 (Appendix) shows the results when clustering is applied to a filter for both datasets. As shown in Fig. 3(a), two clusters are formed for OLID, showing the non-homogeneous nature of the filter. For the SOLID dataset (Fig. 3(b)), the filter forms multiple clusters, confirming this non-homogeneous behavior.


Table 13. Top-8 tri-gram scores from a particular filter for the SOLID dataset by taking all possible tri-grams of the dataset.

No | word 1         | word 2              | word 3
1  | happy—4.22     | and—0.13            | f***—6.43
2  | s***—4.56      | rapist—4.78         | f***—6.43
3  | racist—3.45    | police—6.34         | apologies—4.55
4  | happy—4.22     | drug—5.44           | bored—6.41
5  | racist—3.54    | hypocrites—5.45     | buddy—5.43
6  | happy—4.22     | fight—3.42          | f******—6.5
7  | her—1.27       | embarrassment—2.32  | b*******—4.52
8  | spoiling—2.46  | disgusting—3.67     | f***—6.5

5 Conclusion

In this paper, we have presented an interpretable model for text classification for offensive language detection, together with a detailed analysis of the internal working of the entire model. We looked at interpretability through two lenses: model interpretability and prediction interpretability. Through model interpretability, we have shown the class each filter associates with, the threshold value for each filter, and the clusters of n-grams that activate a particular filter. Through prediction interpretability, we have shown which n-grams are responsible for predicting the class of the input sentence; here we only used the n-grams present in the max-pooled vector, as they alone contribute to the classification task. Finally, clustering gave us a useful insight into the working of the filters: a filter does not form a single cluster of n-grams; rather, a single filter forms multiple different clusters of n-grams. There has been relatively little work in the area of inherently interpretable models for NLP, and this paper is one such attempt; more work needs to be done in this area. In future work, we can try other model architectures and examine their internal workings for interpretability.

Acknowledgments. We gratefully acknowledge the financial support provided by Fujitsu Research India Private Limited. This funding played a crucial role in facilitating our research and enabling us to achieve our research objectives.


Appendix

Due to space constraints, we have added the results of clustering applied to the SOLID and OLID datasets here.

Table 14. Clustering the tri-gram scores of a filter for the OLID dataset

n-gram                        | word scores      | cluster
the b******* flag             | 0.15, 7.32, 5.43 | 0
made up b*******              | 3.24, 0.80, 7.32 | 0
insanely ridiculous b*******  | 4.56, 3.55, 7.32 | 0
another b******* story        | 3.24, 7.32, 3.86 | 0
country has b*******          | 4.33, 0.34, 7.32 | 0
a beautiful person            | 0.34, 5.13, 3.44 | 1
the beautiful bears           | 0.15, 5.13, 4.43 | 1
she is beautiful              | 0.25, 0.64, 5.13 | 1

Table 15. Clustering the tri-gram scores of a filter for the SOLID dataset

n-gram                  | word scores      | cluster
his weirdo a**          | 1.13, 3.23, 6.45 | 0
by fake a**             | 0.23, 3.42, 6.45 | 0
your crusty a**         | 0.56, 3.23, 6.45 | 0
to congratulate others  | 0.22, 5.57, 4.45 | 1
congratulate with one   | 5.57, 0.23, 2.54 | 1
pretty good stuff       | 5.32, 3.46, 4.86 | 2
pretty easy to          | 5.32, 3.23, 0.22 | 2
please pretty please    | 4.23, 5.32, 4.23 | 2


Fig. 3. Clustering applied to a filter for both datasets.

References

1. Ribeiro, M.T., Singh, S., Guestrin, C.: Why Should I Trust You?: Explaining the Predictions of Any Classifier (2016)
2. Lundberg, S., Lee, S.-I.: A Unified Approach to Interpreting Model Predictions (2017)
3. Sundararajan, M., Taly, A., Yan, Q.: Axiomatic Attribution for Deep Networks (2017)
4. O'Shea, K., Nash, R.: An Introduction to Convolutional Neural Networks (2015)
5. Zampieri, M., Malmasi, S., Nakov, P., Rosenthal, S., Farra, N., Kumar, R.: Predicting the Type and Target of Offensive Posts in Social Media (2019)
6. Rosenthal, S., Atanasova, P., Karadzhov, G., Zampieri, M., Nakov, P.: SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification (2020)
7. Risch, J., Ruff, R., Krestel, R.: Explaining Offensive Language Detection (2020)
8. Zhao, W., Singh, R., Joshi, T., Sudjianto, A., Nair, V.N.: Self-interpretable Convolutional Neural Networks for Text Classification (2021)
9. Sudjianto, A., Knauth, W., Singh, R., Yang, Z., Zhang, A.: Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and Simplification (2020)
10. Jacovi, A., Shalom, O.S., Goldberg, Y.: Understanding Convolutional Neural Networks for Text Classification (2018)
11. Dai, W., Yu, T., Liu, Z., Fung, P.: Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection (2020)
12. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representations in Vector Space (2013)
13. Müllner, D.: Modern hierarchical, agglomerative clustering algorithms (2011)
14. Rish, I.: An empirical study of the Naive Bayes classifier (2001)
15. Evgeniou, T., Pontil, M.: Workshop on Support Vector Machines: Theory and Applications (1999)
16. Hochreiter, S., Schmidhuber, J.: Long Short-term Memory (2010)
17. Montavon, G., Binder, A., Lapuschkin, S., Samek, W., Müller, K.-R.: Layer-wise relevance propagation: an overview. In: Samek, W., Montavon, G., Vedaldi, A., Hansen, L.K., Müller, K.-R. (eds.) Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. LNCS (LNAI), vol. 11700, pp. 193–209. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28954-6_10
18. Agarap, A.F.M.: Deep Learning using Rectified Linear Units (ReLU) (2018)
19. Özdemir, Ö., Sönmez, E.B.: Weighted Cross-Entropy for Unbalanced Data with Application on COVID X-ray images (2020)
20. Berrar, D.: Cross-validation
21. Sokolova, M., Japkowicz, N., Szpakowicz, S.: Beyond accuracy, f-score and ROC: a family of discriminant measures for performance evaluation. In: Sattar, A., Kang, B. (eds.) AI 2006. LNCS (LNAI), vol. 4304, pp. 1015–1021. Springer, Heidelberg (2006). https://doi.org/10.1007/11941439_114
22. Bindra, K., Mishra, A.: A Detailed Study of Clustering Algorithms (2017)
23. Lipton, Z.C., Elkan, C., Naryanaswamy, B.: Thresholding Classifiers to Maximize F1 Score (2014)
24. Zhang, Y., Yang, Q.: A Survey on Multi-Task Learning (2021)
25. Zhang, Y., Tino, P., Leonardis, A., Tang, K.: A Survey on Neural Network Interpretability (2021)
26. Lakkaraju, H., Adebayo, J., Singh, S.: Explaining Machine Learning Predictions: State-of-the-Art, Challenges, Opportunities (2020)

Infrared Target Recognition Technology Based on Few Shot Learning

Jing Zhang1,2, Yangyang Ma3, and Weihong Li2(B)
1 College of Aeronautics, Northwestern Polytechnical University, Xi'an 710072, China
2 First Aircraft Design and Research Institute of Aviation Industry, Xi'an, China
[email protected]
3 Luoyang Electro-Optic Equipment Research Institute of Aviation Industry, Luoyang 471000, China

Abstract. Aiming at the problems of sparse sample data and the difficulty of embedded implementation under limited resources in military applications of infrared target recognition, this paper proposes a lightweight target recognition technology based on few-shot learning. The technology improves the generator and discriminator network structures of a Cycle Generative Adversarial Network model and realizes the migration from visible images to infrared images, so as to expand the training data. Through improvements to the YOLOV5s network, the recognition accuracy is improved without increasing the magnitude of the model parameters, and the high real-time processing capability is retained. The experimental results show that the generative adversarial network model designed in this work can process visible images and generate near-infrared images; after adding the generated training data, the model accuracy is effectively improved. The improved YOLOV5s model is 2% higher than the original model in mAP 0.5 and is easier to embed.

Keywords: Generative adversarial network · Infrared target recognition · Convolutional neural network

1 Introduction

As one of the important detection sensors on aircraft, the infrared sensor has the advantages of reliable operation and strong anti-interference ability, and infrared target recognition is one of the key technologies for improving the intelligent application of airborne infrared sensors. At present, target recognition usually adopts deep learning methods [1] to realize intelligent recognition of targets in images by building deep convolutional neural network models. Target recognition based on deep learning is mainly divided into two-stage [2,3] and one-stage [4–6] approaches. A two-stage deep learning detection and recognition algorithm works in two steps. First, a series of candidate


frames are extracted from the image through a Region Proposal Network, and then each candidate frame is further classified and regressed by the second-stage convolutional network to obtain the recognition results. A one-stage detection and recognition algorithm does not extract candidate target regions in advance; it solves the target location, category and confidence directly by regression, without a saliency detection process, so it is faster than the former. However, either kind of deep learning method requires a large number of training samples, otherwise it is difficult to obtain recognition models with high generalization and robustness. Compared with visible-light data, infrared target databases are relatively sparse, especially for military targets. Therefore, the small sample size is one of the major problems faced by infrared target recognition. In addition, deep learning models are often complex, with hundreds of millions of parameters occupying large amounts of memory and computational resources. In military applications, due to the constraints of environmental adaptability, hardware, volume and power consumption of the equipment, the resources of the hardware platform available to the target recognition model are limited, and it is difficult to deploy complex deep learning models.

In view of the lack of infrared images, this paper uses data augmentation based on a generative adversarial network to process visible images and generate near-infrared images [7,8]. Among generative adversarial networks, Pix2Pix [9] and CycleGAN [10] provide general frameworks for converting visible images to infrared images. Aiming at the problems of unclear texture and missing structure in infrared images generated by these general methods, this paper takes a visible image as input and outputs the equivalent infrared image, improving the quality of the generated infrared image through improved generator and discriminator networks. The generator network is designed based on SENet [11] to make effective use of the deep and shallow features extracted from the visible image. Through feature statistics of the generated infrared image, the discriminator network guides the gray level and structural information of the generated image, weakens the constraint on the generator network, and releases more of the generator network's potential.

Aiming at the embedded implementation problem caused by the complexity of deep learning models, this paper adopts a lightweight network design. At present, research on lightweight networks follows two main directions: first, constructing complex high-precision models and then compressing the parameters and streamlining the structure, without losing accuracy, through network pruning [12]; second, designing lightweight models [13] and then further improving their accuracy through training or design improvements [14]. However, the first kind of method suffers from reduced generalization in practical applications. Current network compression methods are limited to finite data sets, whereas in open application scenarios the "unimportant" network connections pruned away during iterative pruning are often unimportant only on the current data set and can play a key role in other scenarios. This leads to reduced generalization of the pruned model when it is applied. In this paper, YOLOV5s [15], which balances accuracy and speed, is selected for improvement.

2 Infrared Image Generation Based on Generative Adversarial Network Model

A target database is necessary for training deep learning algorithms. In practical applications, image acquisition and target annotation require a lot of work, which lengthens the iteration cycle of the model. At present, most data sets consist of visible images, such as COCO, ImageNet and VisDrone. A training-sample learning method based on a generative adversarial network can simulate the real data distribution and generate sample data to improve the performance of the target recognition model under few-shot conditions.

2.1 Structure Design of Generative Adversarial Network Model

The generative adversarial network constructs an adversarial game framework consisting of a generator model and a discriminator model [16]. The generator model G learns the joint probability distribution of samples and labels P(X, Y) from the observed data to capture high-order correlations, and generates new data that follows the sample distribution by learning the essential characteristics of the real data. The discriminator model D samples the data produced by the generator and the real samples in the training set, predicts the probability of the data source, and identifies whether an input sample is authentic. The generator model G continuously improves its data generation ability according to the discrimination results of D, making the generated results ever closer to the real samples and harder for the discriminator to distinguish; the discriminator model D improves its ability to distinguish true from false by continually learning the difference between generated data and real samples. The two models are mutually antagonistic and promote each other, so that their respective performances improve iteratively until they converge and reach a Nash equilibrium. Finally, the generator model can produce data that is indistinguishable from real data, while the discriminator model cannot tell the generated data from the real samples, yielding a generator model with superior performance.

Among generative adversarial network methods, the cycle generative adversarial network needs neither labeled data nor paired data, yet achieves good conversion between image domains, which makes it highly practical. Considering the lack of paired visible and infrared images in practical applications, this paper uses the cycle generative adversarial network, an unpaired style-transfer network, as the basic architecture.


By designing a bidirectional cycle learning strategy, it completes the bidirectional constraint between the source domain and the target domain and expands the infrared data set by transferring from visible light.
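A minimal sketch of this bidirectional cycle training idea, shown for the generator update only; the tiny single-convolution networks, loss weights and example tensors are placeholders, not the paper's actual architecture (the improved generator and discriminator are described in the following subsections):

```python
import torch
import torch.nn as nn

# Hypothetical stand-in networks; the paper's SENet-based generator and
# multi-scale discriminator are described in Sects. 2.2 and 2.3.
G_vis2ir = nn.Conv2d(3, 3, 3, padding=1)   # visible -> infrared generator
G_ir2vis = nn.Conv2d(3, 3, 3, padding=1)   # infrared -> visible generator
D_ir = nn.Conv2d(3, 1, 3, padding=1)       # discriminator for infrared images
D_vis = nn.Conv2d(3, 1, 3, padding=1)      # discriminator for visible images

bce, mse = nn.BCEWithLogitsLoss(), nn.MSELoss()
opt_g = torch.optim.Adam(list(G_vis2ir.parameters()) + list(G_ir2vis.parameters()), lr=1e-4)

def generator_step(x_vis, x_ir, lambda_cyc=10.0):
    """One generator update of the bidirectional (unpaired) cycle scheme."""
    fake_ir, fake_vis = G_vis2ir(x_vis), G_ir2vis(x_ir)
    # Adversarial terms: generated images should fool the corresponding discriminator.
    adv = bce(D_ir(fake_ir), torch.ones_like(D_ir(fake_ir))) + \
          bce(D_vis(fake_vis), torch.ones_like(D_vis(fake_vis)))
    # Cycle-consistency terms: translating forth and back should recover the input.
    cyc = mse(G_ir2vis(fake_ir), x_vis) + mse(G_vis2ir(fake_vis), x_ir)
    loss = adv + lambda_cyc * cyc
    opt_g.zero_grad()
    loss.backward()
    opt_g.step()
    return loss.item()

# Example with random unpaired batches of visible and infrared images.
print(generator_step(torch.randn(2, 3, 64, 64), torch.randn(2, 3, 64, 64)))
```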

2.2 Generator Design of Cycle Generative Adversarial Network Model


The generator of the Cycle Generative Adversarial network is composed of three parts: encoder, transformation and decoder. In the first stage, the encoder is composed of three convolution layers, which mainly extract and encode the features of the input image. In the second stage, the transformation layer is composed of several ResNet blocks [17]; a ResNet block adds the input to the output of two convolution layers, ensuring that the output image stays as structurally similar to the input image as possible. The generator is shown in Fig. 1.

Fig. 1. Generator Network Structure Diagram

In the transformation layer of the generator, the input features are fused with the convolved output features. The transformation layer consists of nine identical residual convolution blocks [17] and an intermediate SENet network. In this paper, the structure of SENet is improved and designed as a channel attention module. The channel attention module learns the relationships between the channels of the input features, emphasizing the channels, and the corresponding image regions, that are important for the whole. Each feature channel can be regarded as a feature detector. After a global average pooling operation, the globally compressed feature vector of the current feature map is obtained; after a ReLU activation, it is passed through a bottleneck structure constructed from 1×1 convolution layers.
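A minimal sketch of an SE-style channel attention block of the kind described here (global average pooling followed by a 1×1 bottleneck with ReLU and sigmoid gating); the reduction ratio and tensor sizes are illustrative assumptions rather than the paper's exact configuration:

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    """SE-style channel attention: squeeze (global average pooling) then
    excite (1x1 bottleneck with ReLU and a sigmoid gate)."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)          # globally compressed feature vector
        self.fc = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w = self.fc(self.pool(x))                     # per-channel weights in (0, 1)
        return x * w                                  # reweight the input feature map

# Example: reweighting a 256-channel feature map inside the transformation layer.
feat = torch.randn(1, 256, 64, 64)
print(ChannelAttention(256)(feat).shape)              # torch.Size([1, 256, 64, 64])
```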

2.3 Design of Discriminator for Cycle Generative Adversarial Network Model

For the discriminator, unlike the PatchGAN [18] discriminator, which only focuses on the receptive field of the image, this paper designs a multi-scale local discriminator to distinguish real from fake between the visible image and the synthetic infrared image. The structure is shown in Fig. 2.

Fig. 2. Discriminator Network Structure

Specifically, the discriminator extracts features of the input image through a pre-trained ViT model and then reshapes the resulting features. It uses average pooling to downsample the features, extracts features through convolution layers of identical structure at three different scales, and then computes statistical features. Finally, the total loss is obtained by summing the losses produced by the linear layers.

2.4 Loss Function Design of Cycle Generative Adversarial Network Model

After the structure of the generator and discriminator is determined, the loss function of the whole network model is defined as

$L = L_{adv} + \alpha L_{cys} + \lambda L_{percep} + \beta L_{SSIM},$  (1)

where $\alpha$, $\lambda$ and $\beta$ are hyperparameters. By changing these values, a regularization effect on the whole model can be achieved: larger values give the network model a better reconstruction loss but reduce its generalization, while smaller values increase the universality of the model but may introduce artificial defects.

$L_{adv}$ is the adversarial loss, whose purpose is to enable the network to learn the distribution of images in the target domain. In the process of generating an infrared image from a visible image, with generator $G_Y$ and discriminator $D_X$, where $x$ is the real visible image and $y$ is the real infrared image,

$L_{adv} = \mathbb{E}\big[\log D_X(y)\big] + \mathbb{E}\big[1 - \log D_X(G_Y(x))\big].$  (2)

$L_{cys}$ is the cycle-consistency loss, where $x$ is the real visible image, $G_Y(x)$ is the generated infrared image, and $G_X(G_Y(x))$ is the visible image reconstructed by the cycle generator; the loss is defined by the $L_2$ norm:

$L_c(G_X, G_Y) = \sum_t \big\| x - G_X(G_Y(x)) \big\|_2.$  (3)

$L_{percep}$ is the perceptual loss, where $x_r$ is the real visible image, $x_f$ is the generated visible image, and $\phi$ denotes the feature values of the VGG network output layers. The visual perception quality of the generated image is measured by comparing the Euclidean distances of features at different levels:

$L_{percep} = \frac{1}{N} \sum_{k=1}^{N} \big( \phi^k_{i,j}(x_r) - \phi^k_{i,j}(x_f) \big)^2.$  (4)

$L_{SSIM}$ is the structural-consistency loss. Adding this loss further improves the visual quality of the image while preventing deformation of the generated image, where $N$ and $M$ are the dimensions of the image:

$L_{SSIM} = \frac{1}{NM} \sum_{p=1}^{P} \big( 1 - SSIM(p) \big).$  (5)
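A brief sketch of how the combined objective in Eq. (1) might be assembled from the individual terms; the weight values and the dummy feature maps are illustrative placeholders rather than the paper's settings:

```python
import torch
import torch.nn.functional as F

def perceptual_loss(feats_real, feats_fake):
    """Eq. (4): mean squared distance between feature maps of the real and
    generated images taken from N layers of a fixed feature extractor (e.g. VGG)."""
    return sum(F.mse_loss(fr, ff) for fr, ff in zip(feats_real, feats_fake)) / len(feats_real)

def total_loss(l_adv, l_cyc, l_percep, l_ssim, alpha=10.0, lam=1.0, beta=0.5):
    """Eq. (1): weighted sum of adversarial, cycle, perceptual and SSIM terms.
    The weight values here are placeholders, not the paper's tuned hyperparameters."""
    return l_adv + alpha * l_cyc + lam * l_percep + beta * l_ssim

# Example with dummy feature maps from two layers and scalar dummy losses.
fr = [torch.randn(1, 64, 32, 32), torch.randn(1, 128, 16, 16)]
ff = [torch.randn(1, 64, 32, 32), torch.randn(1, 128, 16, 16)]
print(total_loss(torch.tensor(0.7), torch.tensor(0.2),
                 perceptual_loss(fr, ff), torch.tensor(0.05)))
```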

3 Design of Target Recognition Model Based on Lightweight Network

Based on the YOLOV5s network, the input is set to 1024×1024 and the following improvements are made. A prediction branch is added to the structure, performing target recognition on a larger-resolution feature map to improve the recognition of small targets. In addition, the SiLU activation function is changed to ReLU for subsequent embedded implementation. The network structures before and after modification are shown in Fig. 3, where the red part is the added prediction branch. Following the structural design idea of PANet [19], the deep low-resolution, high-semantic features are upsampled and fused with the shallow low-semantic features, so that semantic description and feature information complement each other. This has the following advantages: in shallow-layer prediction the feature-map resolution is large, so even small targets retain some feature information, which helps small-target recognition; and the two-way fusion of shallow and deep features improves the descriptive power of the features.


In addition, the activation function is the piecewise linear LeakyReLU rather than a nonlinear function such as SiLU or Mish. This change of activation function is mainly motivated by embedded implementation: the computational complexity of a nonlinear activation function is higher than that of a (piecewise) linear one, which hinders deployment on hardware platforms. The activation function curves are shown in Fig. 4.
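For reference, the activations mentioned above differ in the elementary operations they require, which is the basis of the embedded-deployment argument; a small definition-only sketch (the negative slope value is an illustrative choice):

```python
import torch
import torch.nn.functional as F

x = torch.linspace(-3.0, 3.0, 7)

relu = torch.clamp(x, min=0.0)               # max(0, x): one comparison per element
leaky_relu = torch.where(x > 0, x, 0.1 * x)  # piecewise linear, negative slope 0.1
silu = x * torch.sigmoid(x)                  # needs an exponential per element
mish = x * torch.tanh(F.softplus(x))         # needs exp, log and tanh per element

print(relu, leaky_relu, silu, mish, sep="\n")
```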


Fig. 3. Schematic Diagram Of Network Structure


Fig. 4. Schematic Diagram Of Activation Function.

4 Simulation Test and Result Analysis

The data used in the experiments are 1683 infrared vehicle images collected by UAV; 842 images are randomly selected as the training set and the remaining images as the test set. Before training the recognition model, the generative adversarial network is trained on the infrared training images and 5265 visible images, and the infrared data are augmented.

Fig. 5. Network Generation Loss Curve

4.1 Infrared Image Enhancement

In this paper, 256×256 patches obtained by randomly cropping the images during training are used as network input, and the mini-batch size is 1. To improve training stability, the generator network is first pre-trained, during which the loss function contains only the adversarial term. Adam is used for optimization with an initial learning rate of 10^-4; every two epochs the learning rate is reduced by 10%, and after it falls below 10^-6 training continues for one more epoch and then stops. After pre-training, the complete target loss function is used for adversarial training, again initialized with a learning rate of 10^-4 and optimized with Adam. When training is half finished, the cycle-consistency loss weight is linearly halved to prevent training from becoming unstable. When the training loss stops changing, the learning rate is halved; if the loss does not change for 5 epochs, training stops.
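A compact sketch of the two-stage schedule described above (generator pre-training with step-wise learning-rate decay, then adversarial training with halving on plateau); the stand-in generator and the exact stopping logic are simplified assumptions:

```python
import torch

G = torch.nn.Conv2d(3, 3, 3, padding=1)              # stand-in for the generator
opt = torch.optim.Adam(G.parameters(), lr=1e-4)

# Stage 1: pre-training with only the adversarial loss; every two epochs the
# learning rate is reduced by 10% until it falls below 1e-6.
sched = torch.optim.lr_scheduler.StepLR(opt, step_size=2, gamma=0.9)
epoch = 0
while opt.param_groups[0]["lr"] > 1e-6:
    # ... run one pre-training epoch here ...
    sched.step()
    epoch += 1

# Stage 2: adversarial training with the full objective of Eq. (1), restarting
# at lr = 1e-4 and halving the learning rate when the training loss plateaus.
for g in opt.param_groups:
    g["lr"] = 1e-4
plateau = torch.optim.lr_scheduler.ReduceLROnPlateau(opt, mode="min", factor=0.5, patience=5)
# for epoch in range(num_adv_epochs):
#     loss = ...          # full objective of Eq. (1)
#     plateau.step(loss)  # halves the learning rate when loss stops improving
print("pre-training epochs:", epoch)
```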

Fig. 6. Network Discrimination Loss Curve

Figures 5 and 6 show the generation loss and the discrimination loss curves of the network during training, respectively. It can be seen that the overall generation loss shows a gradually decreasing trend while the discrimination loss curve is relatively stable, indicating the effectiveness of the network. Through the training of the generation and discrimination networks, a complete generative adversarial network model is obtained, which then processes visible images to generate near-infrared images, as shown in Fig. 7.


Fig. 7. Infrared Image Generated Based On Visible Light

4.2 Analysis of the Impact of Infrared Augmented Image on Recognition Accuracy

In order to further verify the impact of infrared data augmentation based on visible light migration on target recognition performance, comparative tests with different data sets were set up based on the original YOLOV5s model. The experimental results on the test set are shown in Table 1.

Table 1. Target recognition based on augmented infrared images.

Group | Number of real samples | Number of augmented samples | mAP@0.5 | mAP@0.5 lifting
1 | 500 | 0 | 57.5% | –
2 | 500 | 500 | 60.6% | (+) 3.1%
3 | 800 | 0 | 74.1% | –
4 | 800 | 800 | 80.6% | (+) 6.5%

By comparing the recognition results of experimental groups 1 and 2 and groups 3 and 4, it can be seen that, compared with the original real data sets, the data sets with doubled data volume improve target recognition performance by 3.1% and 6.5%, respectively. This verifies that target recognition performance can be effectively improved by increasing the number of generated samples.

4.3 Infrared Target Recognition Test

Through the cycle generative adversarial network model designed in this paper, 5265 infrared images are generated from visible images, and the training images are further expanded to 12647 by rotation and translation. The initial learning rate during training is set to 2.5×10^-4, momentum to 0.949, decay to 5×10^-4, and the number of iterations to 150. Training adopts linearly scaled learning rates, learning-rate warm-up, unbiased decay and other techniques to accelerate the convergence of the model. The common one-stage target recognition methods, including YOLOV4CSP [20], YOLOV5X [15] and YOLOv7X [21], are compared, with the network input set to 1024 for a unified comparison. Training and testing were carried out on an NVIDIA RTX 2080 Ti graphics card, and FPS was measured. Table 2 shows the test results. Among them, "Improved YOLOv5s for three branch prediction" linearizes the activation function on the basis of the YOLOV5s network structure, and "Improved YOLOv5s for four branch prediction" adds an additional prediction branch on top of that.

Table 2. Test comparison results.

Method | mAP@0.5 | Parameter quantity (MB) | FPS
YOLOv7X | 0.8909 | 143.1 | 16
YOLOv4CSP | 0.8865 | 173.3 | 17
YOLOv4CSP | 0.7871 | 146.5 | 21
YOLOv5s | 0.8141 | 14.6 | 30
Improved YOLOv5s for three branch prediction | 0.8010 | 14.6 | 31
Improved YOLOv5s for four branch prediction | 0.8339 | 14.6 | 29

The experimental results show that, compared with the more complex YOLOV5X and YOLOv7X, YOLOV5s and the improved model in this paper show a certain gap in mAP@0.5, but they have clear advantages in parameter count and processing speed. From the perspective of practical application, the improved YOLOV5s in this paper is preferable for the following reasons. Although the improved YOLOV5s with four-branch prediction is worse than YOLOV5X and YOLOv7X in mAP@0.5, its best precision and best recall are only about 3% lower, a gap that is not significant in application; what matters more is the amount of model parameters and the running speed. From the perspective of engineering implementation, nonlinear activation functions perform far worse than linear operators on embedded hardware platforms, and this gap is larger than on the server side, because embedded implementations often need to access external memory, resulting in large data-throughput times.


In addition, compared with the original YOLOV5s, the improved four-branch YOLOV5s gains about 2% in mAP@0.5 and generalizes better, while the model size remains only 14.6 MB with low computational complexity. Figure 8 shows an example of the recognition results.

Fig. 8. Example Of Identification Results.

5 Conclusion

In this paper, infrared images are generated by designing a cycle generative adversarial network model to achieve data augmentation, and YOLOV5s is improved to achieve higher-precision target recognition while maintaining real-time processing speed. The experimental results show that the performance of the infrared target recognition model under few-shot conditions can be effectively improved by expanding the infrared image set. In addition, the improved YOLOV5s model has higher recognition accuracy than the original model; even with a 1024×1024 input, its processing speed reaches 29 fps, which gives it high application value.

References

1. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. IEEE Computer Society (2014)
2. Girshick, R.: Fast R-CNN. Comput. Sci. (2015)
3. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS (2016)
4. Wang, Y., Wang, C., Zhang, H., Zhang, C., Fu, Q.: Combining single shot multibox detector with transfer learning for ship detection using Chinese Gaofen-3 images. In: 2017 Progress in Electromagnetics Research Symposium - Fall (PIERS-FALL). IEEE (2017)
5. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517–6525. IEEE (2017)
6. Qiang, W., He, Y., Guo, Y., Li, B., He, L.: Exploring underwater target detection algorithm based on improved SSD. Xibei Gongye Daxue Xuebao/J. Northwestern Polytech. Univ. (4) (2020)
7. Liu, B., Tan, C., Li, S., He, J., Wang, H.: A data augmentation method based on generative adversarial networks for grape leaf disease identification. IEEE Access PP(99), 1-1 (2020)
8. Chen, Z., Fang, M., Chai, X., Fu, F., Yuan, L.: U-GAN model for infrared and visible images fusion. Xibei Gongye Daxue Xuebao/J. Northwestern Polytech. Univ. 38(4), 904–912 (2020)
9. Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. IEEE (2016)
10. Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. IEEE (2017)
11. Tan, W.R., Chan, C.S., Aguirre, H.E., Tanaka, K.: Fuzzy qualitative deep compression network. Neurocomputing (2017)
12. Yanchen, W.U., Wang, Y.: A research on underwater target recognition neural network for small samples. J. Northwestern Polytech. Univ. 40(1), 40–46 (2022)
13. Heydari, M., Sadough, S.M.S., Chaudhry, S.A., Farash, M.S., Mahmood, K.: An improved one-to-many authentication scheme based on bilinear pairings with provable security for mobile pay-TV systems. Multimedia Tools Appl. 76(12), 14225–14245 (2017)
14. Dai, X., Jiang, Z., Wu, Z., Bao, Y., Zhou, E.: General instance distillation for object detection (2021)
15. Li, C., Wand, M.: Precomputed real-time texture synthesis with Markovian generative adversarial networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9907, pp. 702–716. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46487-9_43
16. Goodfellow, I.J., Pouget-Abadie, J., Mirza, M., et al.: Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, 8–13 Dec 2014, pp. 2672–2680. MIT Press, Cambridge (2014)
17. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. IEEE (2016)
18. Jocher, G.: Ultralytics/YOLOV5: v5.0 - YOLOV5-P6 1280 models, AWS, Supervise.ly and YouTube integrations (2021)
19. Liu, S., Qi, L., Qin, H., Shi, J., Jia, J.: Path aggregation network for instance segmentation. IEEE (2018)
20. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: Scaled-YOLOv4: scaling cross stage partial network. In: Computer Vision and Pattern Recognition. IEEE (2021)
21. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv e-prints (2022)

An Approach to Mongolian Neural Machine Translation Based on RWKV Language Model and Contrastive Learning

Xu Liu, Yila Su(B), Wu Nier, Yatu Ji, Ren Qing Dao Er Ji, and Min Lu

Inner Mongolia University of Technology, 49 Aimin Street, Hohhot, Inner Mongolia, China
[email protected]

Abstract. Low-resource machine translation (LMT) is a challenging task, especially for languages with limited resources such as Mongolian. In this paper, we propose a novel Mongolian-to-Chinese machine translation approach based on the RWKV language model and augmented with contrastive learning. Traditional methods that perturb the embedding layer often suffer from issues such as semantic distortion and excessive perturbation, leading to training instability. To address these problems, we introduce a contrastive learning approach combined with adversarial perturbation. Additionally, the RWKV language model, as a new architecture, has been shown to be more efficient in training and inference time than traditional Transformer models in various natural language processing tasks. In this work, we employ the RWKV language model as the core of our machine translation model. We evaluate our approach on a benchmark dataset of Mongolian-to-Chinese parallel sentences. The experimental results demonstrate that our method outperforms state-of-the-art approaches in Mongolian machine translation. Furthermore, our research indicates that the proposed approach significantly mitigates the training instability caused by adversarial perturbation and demonstrates the effectiveness of employing the RWKV language model to improve translation performance.

Keywords: Low-resource machine translation · RWKV language model · Contrastive learning · Adversarial perturbation

1 Introduction

In the context of globalization and rapid advancements in information technology, machine translation plays an increasingly important role in facilitating cross-language communication. However, low-resource machine translation (LMT) tasks, particularly for languages like Mongolian with limited resources, still face significant challenges [3]. To overcome this issue, researchers have


been exploring innovative approaches to improve the quality and efficiency of Mongolian-to-Chinese machine translation.

Traditional data augmentation methods have achieved some success in machine translation and other NLP (natural language processing) tasks, but they still have some limitations [2]. Conventional methods often rely on rule-based or model-based expansions, such as synonym replacement or syntactic transformation. However, these methods fail to capture the true distribution of the data accurately, resulting in noticeable discrepancies between generated samples and real samples and introducing the problem of semantic distortion [1]. On the other hand, adversarial perturbation, another commonly used data augmentation method, generates new samples by perturbing the original samples [12]. However, excessive perturbation can lead to significant differences between generated and original samples, introducing unnecessary noise that reduces translation quality and causes training instability [8]. In Fig. 1, the x-axis represents epochs, and the y-axis represents the changes in BLEU scores. On the left, we can see the translation results after applying traditional data augmentation to the Mongolian language. It becomes apparent that traditional data augmentation falls short of addressing the existing overfitting phenomenon. However, on the right, when incorporating adversarial perturbations, we observe a reduction in overfitting. Nonetheless, this improvement comes at the expense of introducing training instability.

To overcome the limitations of traditional data augmentation methods, this work proposes an improved data augmentation approach based on contrastive learning. Contrastive learning constrains the level of perturbation by comparing the similarity between original and perturbed samples, thereby reducing the problems of semantic distortion and excessive perturbation. Specifically, we treat the original samples as positive samples and the perturbed samples as negative samples and optimize the similarity measure between samples during the training process. This approach effectively balances the level of perturbation, reducing semantic distortion while avoiding excessive perturbation, thus improving translation quality and accuracy. Compared to traditional data augmentation methods and adversarial perturbation, contrastive learning has several advantages: it does not rely on additional training and computational resources, making it easily applicable to existing model frameworks, and it better captures the true distribution of the data, generating samples that are closer to real samples and enhancing model generalization [7].

Furthermore, this work introduces a new language model architecture, the RWKV language model [13], as the backbone model for improving Mongolian-to-Chinese machine translation. The RWKV language model combines the strengths of traditional Transformers [18] and RNN [10] models and better captures long-range dependencies in input sequences by introducing a Transformer-inspired attention mechanism called TimeMix. Compared to traditional RNN models, the RWKV language model improves translation accuracy and fluency [19]. Additionally, the RWKV language model is more efficient in terms of


training and inference time compared to traditional Transformer models, reducing computational costs and speeding up the training and inference process. In summary, this work aims to improve the translation quality and efficiency of Mongolian-to-Chinese machine translation. By introducing contrastive learning as an improved data augmentation method and incorporating the use of the RWKV language model as the backbone model, we hope to address the issues associated with traditional methods and achieve better translation performance.

Fig. 1. On the left, the translation results after applying traditional data augmentation in Mongolian language are displayed. On the right, the translation results when adversarial perturbations are added.

Finally, the main contributions of this work include:

– We introduce a novel data augmentation approach for low-resource machine translation using contrastive learning.
– We apply the RWKV model to Mongolian-to-Chinese machine translation for the first time.
– We demonstrate a synergistic enhancement in Mongolian-to-Chinese machine translation by combining the contrastive learning data augmentation with the RWKV language model.

2 Related Work

2.1 Data Augmentation

In reference [9], the authors introduced data augmentation techniques such as synonym replacement and random noise addition for text data. This augmentation method involved adding random noise to the text and replacing words in the sentence with synonyms to enhance the data. However, the effectiveness of synonym replacement methods was limited for Mongolian due to a lack of sufficient synonym resources. Moreover, Mongolian possessed unique grammar rules and features, and randomly adding noise might not have adhered to the language’s rules, resulting in unnatural or incomprehensible sentences. In reference [4], the authors applied the back-translation (also known as round-trip translation) method in the field of neural machine translation. This technique involved


translating text into another language and then translating it back. This approach helped capture the meaning of the text more accurately and improved the model's performance in downstream tasks. Sugiyama [15] utilized back-translation in machine translation. However, due to the substantial differences between Mongolian and Chinese, this approach may lead to inaccurate translations and loss of the original sentence meanings. Adversarial perturbation [11] was initially used in the field of computer vision and showed significant results. In reference [12], adversarial perturbation was applied to text by perturbing the embeddings at the embedding layer to achieve data augmentation, and it proved effective in NLP tasks. However, directly applying adversarial perturbation to NLP tasks can lead to the problem of high variance [8]. Therefore, in this work, contrastive learning is proposed to address the issues associated with random perturbations: by training a model through contrastive learning, the aim is to guide the random perturbation and generate higher-quality perturbations, thus reducing excessive perturbation and semantic distortion.

2.2 Transformer Improvements

The Transformer has achieved high performance in the field of NLP. However, its self-attention mechanism has a time complexity of O(N²) and therefore lacks computational efficiency. To address this issue, various improved models based on the Transformer's self-attention have emerged, such as Reformer [5] and Linformer [20], which aim to linearize self-attention. Although these models improve the efficiency of self-attention, they have not outperformed the original Transformer in overall performance. Another approach to improving the time complexity of self-attention is to replace it with alternative components; models such as MLP-Mixer [17], Synthesizer [16], gMLP [22], aMLP [6] and AFT [21] have been proposed. However, these methods have their own limitations. For example, while AFT is faster than the Transformer, it has a design flaw that prevents it from effectively handling very long texts. In this neural machine translation (NMT) task, the RWKV model is based on the Transformer block, but it replaces self-attention with position encoding and Time-Mixing, and the feed-forward network (FFN) is replaced with Channel-Mixing. Experimental results show that this approach can not only handle longer texts but also reduce the computation time of the model.

3 Method

3.1 Contrastive Learning Data Augmentation

The strategy of improving embedding-perturbation data augmentation with contrastive learning can be described as follows. Assume we have an input sample x, and we perturb it at the embedding layer to obtain the perturbed sample x'. Our goal is to train a model with contrastive learning so that the perturbed sample becomes more similar to the original sample while


preserving semantic information and remaining robust to perturbations. First, we use an embedding function f(x) to map the input sample x to an embedding vector in a vector space, denoted f(x) = v, where v represents the embedding vector of the input sample x. Next, we apply the embedding operation to both the input sample x and the perturbed sample x', obtaining their embedding vectors v and v', respectively, i.e., v = f(x) and v' = f(x'). Then, we introduce the strategy of contrastive learning to measure the similarity between samples. One commonly used method in contrastive learning is training with positive and negative sample pairs. For a positive sample pair (x, x'), we want their embedding vectors to be close to each other in the vector space, i.e., ||v − v'|| should be minimized. To improve the model's generalization capability, we introduce negative sample pairs. For each positive sample pair (x, x'), we randomly select a sample x_neg that is unrelated to x and perturb it to obtain the perturbed sample x'_neg. Then, we aim to maximize the distance between the embedding vectors of the negative sample pair (x, x'_neg) in the vector space, i.e., ||v − v'_neg|| should be maximized. Taking these objectives into account, we can define a contrastive loss function, such as the triplet loss:

$L_{triplet} = \max\big(0,\ \|v - v'\| - \|v - v'_{neg}\| + \mathrm{margin}\big).$  (1)

Here, ||·|| denotes the Euclidean distance, and margin is a predefined boundary value used to adjust the distance between positive and negative sample pairs. By minimizing the contrastive loss function, our model learns a representation that better preserves semantic similarity in the embedding space [14]. Therefore, even with the variations introduced by the embedding perturbation, the model can maintain semantic consistency through contrastive learning and alleviate the impact of the perturbations. This approach enhances the effectiveness of data augmentation more reliably, reducing semantic distortion and excessive perturbation, and improving the model's generalization ability and translation quality.
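A minimal sketch of the triplet objective in Eq. (1), applied to embeddings of an anchor sentence, its perturbed version (positive) and a perturbed unrelated sentence (negative); the embedding dimension, perturbation scale and margin are illustrative values, not the paper's settings:

```python
import torch
import torch.nn.functional as F

def triplet_loss(v, v_pos, v_neg, margin: float = 1.0) -> torch.Tensor:
    """Eq. (1): hinge on the gap between the anchor-positive and anchor-negative
    Euclidean distances. v is the embedding of the original sample, v_pos of its
    perturbed version, v_neg of a perturbed unrelated sample."""
    d_pos = torch.norm(v - v_pos, dim=-1)
    d_neg = torch.norm(v - v_neg, dim=-1)
    return F.relu(d_pos - d_neg + margin).mean()

# Example: a batch of 4 sentence embeddings of dimension 512; the positive is the
# anchor plus a small embedding-layer perturbation, the negative is unrelated.
v = torch.randn(4, 512)
v_pos = v + 0.01 * torch.randn(4, 512)     # perturbed version of the anchor
v_neg = torch.randn(4, 512)                # perturbed unrelated sample
print(triplet_loss(v, v_pos, v_neg))
```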

3.2 RWKV Language Model

Figure 3 provides a visual representation of the RWKV architecture. The RWKV architecture consists of a series of stacked residual blocks; each residual block is composed of a Time-Mixing sub-block and a Channel-Mixing sub-block and is connected through residuals, as shown on the left side of Fig. 3. This architecture is designed to improve the original Transformer model by incorporating


Fig. 2. On the left, the traditional approach of adversarial perturbation is depicted, which suffers from the drawback of introducing random perturbation terms, leading to training instability and ultimately unstable translation results. On the right, the use of a contrastive learning model to guide adversarial perturbations is illustrated. This approach has the advantage of minimizing the variance of samples generated by adversarial perturbations.

Time-Mixing and Channel-Mixing, aiming to enhance the computational efficiency and performance of the model.

Time-Mixing. The Time-Mixing block, shown on the right of Fig. 4, is a key component of the RWKV architecture. It models the temporal dependencies between tokens in a sequence, which is important for many natural language processing tasks. The Time-Mixing block can model complex temporal dependencies between tokens while maintaining numerical stability and avoiding vanishing gradients, which makes it well suited for language modeling, where accurate next-token prediction is critical.

Channel-Mixing. The Channel-Mixing block is illustrated on the left of Fig. 4. Its purpose is to allow the model to learn more complex interactions between different channels (i.e., feature dimensions) of the input data, which is useful for tasks with complex dependencies between features. In Channel-Mixing, the input is linearly projected and divided into two parts: R (Representation) and K (Key). R is the representation of the current time step, while K is the representation of the previous time step. This design allows the model to use information from the past time step to assist the computation at the current time step. The Channel-Mixing sub-block then uses a recurrent structure to mix R and K: through linear interpolation, the R of the current time step is mixed with the K of the previous time step, and this interpolation operation can be adjusted independently for each linear projection as needed.
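A schematic sketch of the token-shift interpolation that the Channel-Mixing sub-block relies on, written as a simplified PyTorch module; this illustrates the mixing idea only, not the full RWKV implementation (the time-decay machinery of Time-Mixing is omitted, and the layer sizes are illustrative):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ChannelMix(nn.Module):
    """Simplified channel-mixing: interpolate each token with the previous one,
    then apply a squared-ReLU feed-forward gated by a sigmoid receptance."""
    def __init__(self, d: int):
        super().__init__()
        self.mix_k = nn.Parameter(torch.full((d,), 0.5))   # interpolation weights (K path)
        self.mix_r = nn.Parameter(torch.full((d,), 0.5))   # interpolation weights (R path)
        self.key = nn.Linear(d, 4 * d, bias=False)
        self.value = nn.Linear(4 * d, d, bias=False)
        self.receptance = nn.Linear(d, d, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:    # x: (batch, time, d)
        x_prev = F.pad(x, (0, 0, 1, -1))                   # shift the sequence one step in time
        k = self.key(x * self.mix_k + x_prev * (1 - self.mix_k))
        r = torch.sigmoid(self.receptance(x * self.mix_r + x_prev * (1 - self.mix_r)))
        return r * self.value(torch.relu(k) ** 2)          # sigmoid gate on the output

x = torch.randn(2, 16, 64)                                  # (batch, time, features)
print(ChannelMix(64)(x).shape)                               # torch.Size([2, 16, 64])
```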


Fig. 3. The RWKV residual block with a final head for language modeling (left) and the RWKV language model architecture (right)

Fig. 4. The Channel-Mixing block is shown on the left, while the Time-Mixing is illustrated on the right

In summary, by combining Channel-Mixing and Time-Mixing, the RWKV architecture can significantly improve computational efficiency while retaining its effectiveness in handling complex NLP tasks. Channel-Mixing and Time-Mixing are two complementary components that address input data information at


different levels. Channel-Mixing focuses on interactions across the channel dimensions of the input data, while Time-Mixing emphasizes interactions along the temporal sequence dimension. This hierarchical approach allows the model to learn relationships within different dimensions more effectively, thereby enhancing its expressive capabilities. In terms of computational efficiency, both Channel-Mixing and Time-Mixing employ several optimization strategies:

– Parameter Sharing: In Time-Mixing, as each time step's calculation involves the previous time step's input, parameter sharing can reduce redundant computations. Similarly, for each channel in Channel-Mixing, parameter sharing helps decrease the computational load.
– Linear Operations: Most computations in both Channel-Mixing and Time-Mixing are linear weighted summations, which can be performed efficiently.
– Element-wise Operations: Channel-Mixing utilizes element-wise ReLU activation and sigmoid gating, which have efficient implementations on modern hardware.
– Locality: Many computations involve localized information, considering only a small portion of the input. This enhances computation locality, leading to improved cache hit rates.

Fig. 5. The RWKV model is shown on the left, and the Transformer model is shown on the right

Compared to the traditional Transformer architecture in Fig. 5, RWKV (Receptance Weighted Key Value) introduces several improvements that enhance its performance in tackling complex NLP tasks. The key enhancements of RWKV over the traditional Transformer are:


– Hybrid Structure: RWKV incorporates a hybrid structure of Channel-Mixing and Time-Mixing. While the traditional Transformer relies heavily on the self-attention mechanism, RWKV extends this by introducing interactions in both the temporal and channel dimensions. This combined approach allows RWKV to capture correlations in the data comprehensively.
– Recurrent Structure: In Time-Mixing, RWKV introduces a recurrent structure by modeling time dependencies through linear interpolation between the current input and the input at the previous time step. This enables the model to effectively capture long-range dependencies in time-series data.
– Parameter Sharing and Locality: RWKV employs parameter sharing to reduce redundant computations. Additionally, because the majority of calculations are localized, the model's computational efficiency is improved, which contributes significantly to both training and inference speed.

Figure 5 illustrates the comparison between RWKV and the Transformer architecture.

4 Experiment

4.1 Experiment Setup

In this experiment, we use the Adam optimizer with weight decay and warm-up learning-rate scheduling set to 0.001, a batch size of 64, and a maximum tokenizer sequence length of 128 or 64. We train the model for 200 epochs and evaluate after each epoch to compute its BLEU score; a brief sketch of this configuration is given after the method list below.

Introduction to the Dataset. We use two datasets in this experiment:

– English-French (EN-FR): 22.5 million French-English sentence pairs; 30% of the data is used for validation and the remaining 70% for training.
– Mongolian-Chinese (MN-CH): 1.7 million Mongolian-Chinese sentence pairs; 30% of the data is used for validation and the remaining 70% for training.

Methods. The compared augmentation strategies are:

– TR: Traditional data augmentation strategies, including random word replacement, deletion and addition.
– AP: Adding an adversarial random perturbation strategy.
– CLAP: Using a contrastive learning strategy to optimize adversarial perturbations.
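A brief sketch of the optimizer configuration listed above (Adam with weight decay and a warm-up learning-rate schedule); the placeholder model and the warm-up length are assumptions, since the paper only reports the 0.001 value, the batch size, the sequence lengths and the number of epochs:

```python
import torch

model = torch.nn.Linear(512, 512)   # placeholder for the translation model
optimizer = torch.optim.Adam(model.parameters(), lr=0.001, weight_decay=0.001)

# Linear warm-up of the learning rate over the first warmup_steps updates
# (warmup_steps itself is an illustrative value, not reported in the paper).
warmup_steps = 4000
scheduler = torch.optim.lr_scheduler.LambdaLR(
    optimizer, lr_lambda=lambda step: min(1.0, (step + 1) / warmup_steps))

batch_size, max_len, epochs = 64, 128, 200   # settings reported above
```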

4.2 Model Selection

For this experiment, the selection of Transformer, RWKV, LSTM, AFT, Performer, AdaMRA, Synthesizer, and Linformer models was driven by the intention to thoroughly explore a spectrum of attention mechanism enhancements


and classic architectures in diverse contexts. This ensemble of models was chosen to comprehensively analyze their respective contributions and effectiveness in various scenarios, shedding light on the efficacy of attention-related strategies and their impact on the task at hand.

– Transformer: a standard Transformer.
– Performer: a strategy that improves the Transformer attention mechanism through nonlinear mapping.
– Linformer: an improved strategy for the Transformer attention mechanism through low-rank matrix approximation.
– RWKV: an improved strategy combining the Transformer and the RNN.
– LSTM: a long short-term memory network based on the RNN.
– AFT: a strategy that improves the Transformer attention mechanism through an AFT layer.

4.3 Results

The results presented in Table 1 illustrate the effectiveness of the three methods (TR, AP, CLAP) on the MN-CH task. These methods, namely traditional data augmentation, adversarial perturbation, and contrastive-learning-guided perturbation, were evaluated in terms of their impact on performance. The RWKV model consistently achieved a relatively stable score of 38.1 when the token length was either 128 or 64. Notably, AFT performed exceptionally well with a score of 38.1 when the token length was 64, although its performance decreased slightly when the token length was set to 128. Conversely, the LSTM model exhibited the lowest performance among the evaluated methods. Moving on to the EN-FR task, Table 2 presents the performance of the same three methods (TR, AP, CLAP). While these methods demonstrated some improvement in translation performance on the EN-FR dataset, the observed enhancement was not as substantial as in the MN-CH task. Similarly to the MN-CH task, AFT exhibited better performance when the token length was 64 than when it was set to 128. In both tasks, the RWKV model consistently showed good performance across different token lengths. Based on Fig. 6, the left side displays the translation results for the Mongolian language using TR, AP and CLAP. We observe that the traditional data augmentation method yields slightly lower results. Although there is some improvement when using adversarial perturbation, it fails to address the issue of unstable training. However, when CLAP is employed, it effectively alleviates overfitting. On the right side are the validation translation results when CLAP is incorporated; using this approach not only mitigates overfitting but also helps alleviate training instability. In summary, the results indicate that traditional data augmentation alone may not yield significant improvements, while adversarial perturbation alone cannot fully address training instability. However, incorporating contrastive-learning-guided perturbation (CLAP) proves effective in alleviating overfitting and training instability. This suggests that a combination of these techniques can lead to more robust and stable translation performance.

Table 1. MN-CH.

Model | TR | AP | CLAP
Transformer (128 tokens) | 33.1 | 35.1 | 37.1
AFT (128 tokens) | 33.1 | 35.1 | 37.7
RWKV (128 tokens) | 33.1 | 36.1 | 38.1
Reformer (128 tokens) | 33.1 | 35.1 | 36.1
LSTM (128 tokens) | 28.1 | 29.1 | 33.1
Linformer (128 tokens) | 32.1 | 34.5 | 36.1
Transformer (64 tokens) | 33.1 | 35.1 | 37.1
AFT (64 tokens) | 33.1 | 35.1 | 38.1
RWKV (64 tokens) | 33.1 | 36.1 | 38.1
Reformer (64 tokens) | 33.1 | 35.1 | 36.1
LSTM (64 tokens) | 28.1 | 29.1 | 33.1
Linformer (64 tokens) | 32.1 | 34.5 | 36.1

Table 2. EN-FR.

Model | TR | AP | CLAP
Transformer (128 tokens) | 41.5 | 42.1 | 43.8
AFT (128 tokens) | 40.5 | 44.3 | 44.6
RWKV (128 tokens) | 42.4 | 44.3 | 44.6
Reformer (128 tokens) | 42.1 | 44.3 | 45.4
LSTM (128 tokens) | 34.8 | 35.3 | 36.8
Linformer (128 tokens) | 39.2 | 41.3 | 42.8
Transformer (64 tokens) | 41.5 | 42.3 | 43.8
AFT (64 tokens) | 42.1 | 42.3 | 42.9
RWKV (64 tokens) | 42.4 | 42.8 | 43.1
Reformer (64 tokens) | 41.7 | 41.9 | 42.9
LSTM (64 tokens) | 34.8 | 36.2 | 38.4
Linformer (64 tokens) | 40.2 | 41.3 | 41.8

Fig. 6. The left side displays the translation results for Mongolian language using TR (traditional data augmentation), AP (adversarial perturbation), and CLAP (contrastive learning guided adversarial perturbation). On the right side, we have the validation translation results when CLAP was incorporated.

5 Conclusion

In summary, this work introduces a novel approach to Mongolian-to-Chinese machine translation by incorporating the RWKV language model and contrastive learning-based data augmentation. Our aim was to address challenges posed by traditional data augmentation methods and enhance translation quality. By leveraging the strengths of the RWKV model and innovative data augmentation, we sought to achieve more accurate and fluent translations in a low-resource language translation scenario. The primary contribution of this work lies in the successful combination of the RWKV language model and contrastive learning-based data augmentation. This approach not only mitigates the issues of semantic distortion and excessive perturbation common in traditional augmentation methods but also significantly improves translation quality. The RWKV language model, a hybrid of Transformer and RNN architectures, enables better capturing of long-range dependencies, thus elevating translation accuracy and fluency. Furthermore, the model’s computational efficiency optimizes training and inference times compared to conventional Transformer models. Looking forward, several research directions emerge. Expanding the application of this approach to other language pairs and domains can validate its generalizability. Further optimizing the RWKV model, exploring hybrid architectures, and developing fine-tuning strategies tailored to this approach hold potential for even more substantial improvements. Additionally, investigating interpretability, scalability, and real-time translation aspects can address specific challenges and enhance the applicability of this approach in practical scenarios. These future endeavors will undoubtedly contribute to advancing the field of machine translation and fostering more effective and efficient translation systems.


References

1. Chen, J., Tam, D., Raffel, C., Bansal, M., Yang, D.: An empirical survey of data augmentation for limited data learning in NLP. Trans. Assoc. Comput. Ling. 11, 191–211 (2023)
2. Feng, S.Y., et al.: A survey of data augmentation approaches for NLP. arXiv preprint arXiv:2105.03075 (2021)
3. Haddow, B., Bawden, R., Barone, A.V.M., Helcl, J., Birch, A.: Survey of low-resource machine translation. Comput. Linguist. 48(3), 673–732 (2022)
4. Hayashi, T., et al.: Back-translation-style data augmentation for end-to-end ASR. In: 2018 IEEE Spoken Language Technology Workshop (SLT), pp. 426–433. IEEE (2018)
5. Kitaev, N., Kaiser, Ł., Levskaya, A.: Reformer: the efficient Transformer. arXiv preprint arXiv:2001.04451 (2020)
6. Lalis, J.T., Maravillas, E.: Dynamic forecasting of electric load consumption using adaptive multilayer perceptron (AMLP). In: 2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM), pp. 1–7. IEEE (2014)
7. Le-Khac, P.H., Healy, G., Smeaton, A.F.: Contrastive representation learning: a framework and review. IEEE Access 8, 193907–193934 (2020)
8. Lee, S., Lee, D.B., Hwang, S.J.: Contrastive learning with adversarial perturbations for conditional text generation. arXiv preprint arXiv:2012.07280 (2020)
9. Liu, P., Wang, X., Xiang, C., Meng, W.: A survey of text data augmentation. In: 2020 International Conference on Computer Communication and Network Security (CCNS), pp. 191–195. IEEE (2020)
10. Medsker, L.R., Jain, L.: Recurrent neural networks. Design Appl. 5, 64–67 (2001)
11. Moosavi-Dezfooli, S.M., Fawzi, A., Fawzi, O., Frossard, P.: Universal adversarial perturbations. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1765–1773 (2017)
12. Morris, J.X., Lifland, E., Yoo, J.Y., Grigsby, J., Jin, D., Qi, Y.: TextAttack: a framework for adversarial attacks, data augmentation, and adversarial training in NLP. arXiv preprint arXiv:2005.05909 (2020)
13. Peng, B., et al.: RWKV: reinventing RNNs for the Transformer era. arXiv preprint arXiv:2305.13048 (2023)
14. Robey, A., Chamon, L., Pappas, G.J., Hassani, H., Ribeiro, A.: Adversarial robustness with semi-infinite constrained learning. Adv. Neural. Inf. Process. Syst. 34, 6198–6215 (2021)
15. Sugiyama, A., Yoshinaga, N.: Data augmentation using back-translation for context-aware neural machine translation. In: Proceedings of the Fourth Workshop on Discourse in Machine Translation (DiscoMT 2019), pp. 35–44 (2019)
16. Tay, Y., Bahri, D., Metzler, D., Juan, D.C., Zhao, Z., Zheng, C.: Synthesizer: rethinking self-attention for Transformer models. In: International Conference on Machine Learning, pp. 10183–10192. PMLR (2021)
17. Tolstikhin, I.O., et al.: MLP-Mixer: an all-MLP architecture for vision. Adv. Neural. Inf. Process. Syst. 34, 24261–24272 (2021)
18. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems 30 (2017)
19. Wang, L.: RRWKV: capturing long-range dependencies in RWKV. arXiv preprint arXiv:2306.05176 (2023)
20. Wang, S., Li, B.Z., Khabsa, M., Fang, H., Ma, H.: Linformer: self-attention with linear complexity. arXiv preprint arXiv:2006.04768 (2020)
21. Zhai, S., et al.: An attention free transformer. arXiv preprint arXiv:2105.14103 (2021)
22. Zhang, W., et al.: GMLP: building scalable and flexible graph neural networks with feature-message passing. arXiv preprint arXiv:2104.09880 (2021)

Event-Triggered Constrained H∞ Control Using Concurrent Learning and ADP

Shan Xue1, Biao Luo2(B), Derong Liu3, and Dongsheng Guo1

1 School of Information and Communication Engineering, Hainan University, Haikou 570100, China
[email protected], [email protected]
2 School of Automation, Central South University, Changsha 410083, China
[email protected]
3 School of System Design and Intelligent Manufacturing, Southern University of Science and Technology, Shenzhen 518055, China
[email protected]

Abstract. In this paper, an optimal control algorithm based on concurrent learning and adaptive dynamic programming is developed for event-triggered constrained H∞ control. First, the H∞ control system under consideration is based on an event-triggered constrained input and a time-triggered external disturbance, which saves resources and reduces the network bandwidth burden. Second, in the implementation of the control scheme, a critic neural network is designed to approximate the unknown value function. Moreover, concurrent learning techniques participate in the weight training, making the implementation simple and effective. Lastly, the stability of the system and the effectiveness of the algorithm are demonstrated through theorem proofs and simulation results.

Keywords: Adaptive dynamic programming · event-triggered control · neural networks · H∞ control · concurrent learning

1 Introduction

In the early 1950s, when the American mathematician R. Bellman and others studied the optimization of multi-stage decision-making processes, they put forward the famous principle of optimality, thus creating dynamic programming [1]. Dynamic programming is clearly effective for solving multi-stage decision-making problems, but it also has certain limitations. On the one hand, it does not have a unified processing method and must be handled according to the various properties of the problem, combined with a certain amount of skill. On the other hand, when the dimension of the variables increases, the total computation and storage increase sharply, leading to the "curse of dimensionality" [2,3]. Fortunately, in this era of rapid development of machine learning, algorithms for approximately solving dynamic programming problems have been proposed. They usually use an approximator such as a neural network (NN) to approximate


the unknown components in the dynamic programming problem. Typical approximators include the critic NN and the actor NN, which play the role of agents. The agent learns in a trial-and-error manner and continuously adjusts its strategy based on the rewards or punishments obtained through interaction with the environment [4], eventually reaching the goal of optimizing system performance. Such algorithms are called reinforcement learning, adaptive critic designs [5], approximate dynamic programming, neural dynamic programming, adaptive dynamic programming [6], neuro-dynamic programming [7], etc., by the academic community. In this article, we use the acronym "ADP" to uniformly represent such algorithms.

However, ADP usually solves dynamic programming problems in a time-triggered manner. For example, [8–13] solve the optimal control problems of continuous-time and discrete-time systems, respectively. It should be pointed out that this widely used time-triggered ADP is effective, but it does not achieve the purpose of saving resources. In order to save resources and reduce the burden on the network, the event-triggering mechanism has attracted attention [14]. Compared with the time-triggered mode, its main idea is that strategy learning is executed only when the environment requires it; whether it is required depends on the design requirements and control objectives. In the field of ADP, related event-triggered work already exists and has solved basic dynamic programming problems [15–19].

Constrained inputs and external disturbances are common and cannot be ignored in engineering control [20,21]. Therefore, it is meaningful to solve the optimal control strategy for this kind of control problem. In addition, in the traditional dynamic programming problem, persistence of excitation (PE) conditions are usually used to ensure sufficient exploration and learning of the environment. However, traditional PE conditions are difficult to verify. In order to relax the need for PE conditions, a simple and effective technique, namely concurrent learning, was proposed [22]. In recent years, this concurrent learning technique has been applied to problems such as input regulation, output regulation [23] and gaming [24].

Therefore, in this paper, an algorithm using concurrent learning and ADP is proposed to solve the constrained event-triggered H∞ control problem. The concurrent learning technique is applied to the weight training of the critic NN. The learning of the control system is based on an event-triggered control input and a time-triggered disturbance. This method not only solves the optimal control problem of the H∞ system, but also saves resources and reduces the burden on the network bandwidth, and the use of concurrent learning techniques makes the implementation simple and effective.

The rest of the paper is arranged as follows. In Sect. 2, the problem description of the constrained H∞ control system is presented. In Sect. 3, the event-triggering mechanism, the implementation method using concurrent learning and ADP, and the stability analysis are given. The simulation results are shown in Sect. 4. Finally, the conclusion is given in Sect. 5.


Notation: N, R^n and R^{n×m} denote the set of non-negative integers, the n-dimensional Euclidean space and the space of n × m real matrices, respectively. I_p is the p-dimensional identity matrix. The superscript T, λ_min(·) and ‖·‖ denote the transpose, the minimum eigenvalue and the 2-norm of a vector or matrix, respectively. ∇V ≜ ∂V/∂x and (·)^+ denote the gradient operator and the right continuity of a function, respectively.

2 Problem Description

Let us consider the following nonlinear system
\dot{x}(t) = f(x) + g(x)u(t) + h(x)w(t),  (1)
where x ∈ R^m, u(t) ∈ R^n with the positive upper bound b_u, and w(t) ∈ R^p denote the state, control input and external disturbance, respectively, and w(t) ∈ L_2[0, ∞). f(x) ∈ R^m, g(x) ∈ R^{m×n}, and h(x) ∈ R^{m×p}. The aim of this paper is to design u(x) such that the system (1) with w(t) ≡ 0 is asymptotically stable and the following L_2-gain condition is satisfied [25],
\int_0^{\infty}\big(x^T Q x + Y(u)\big)\,dt \le \rho^2 \int_0^{\infty} w^T w\,dt,  (2)
where Q ∈ R^{m×m} is a positive definite matrix,
Y(u) = 2\int_0^{u} b_u\,\zeta^{-T}\!\Big(\frac{\nu}{b_u}\Big)d\nu = 2\sum_{j=1}^{n}\int_0^{u_j} b_u\,\zeta^{-1}\!\Big(\frac{\nu_j}{b_u}\Big)d\nu_j,  (3)
ζ(·) represents a monotone bounded odd function whose first derivative has an upper bound, and the positive constant ρ denotes the prescribed level of disturbance attenuation. Without loss of generality, ζ(·) is selected as the hyperbolic tangent function in this paper. According to (2), define the following value function
V(x(t)) = \int_t^{\infty}\big(x^T Q x + Y(u) - \rho^2 w^T w\big)\,d\nu,  (4)

where t ≥ 0. For an admissible control input, differentiating (4) yields the following Lyapunov equation
x^T Q x + Y(u) - \rho^2 w^T w + (\nabla V(x))^T (f + gu + hw) = 0.  (5)
Based on (5), define the following Hamiltonian
H(x) = x^T Q x + Y(u) - \rho^2 w^T w + (\nabla V(x))^T (f + gu + hw).  (6)
The H∞ control problem is usually regarded as a two-player zero-sum game, in which the two players, u and w, try to minimize and maximize the value function, respectively. According to Bellman's optimality principle, the optimal value function V^*(x) satisfies
\min_u \max_w \big[x^T Q x + Y(u) - \rho^2 w^T w + (\nabla V^*(x))^T (f + gu + hw)\big] = 0,  (7)


where V^*(0) = 0; (7) is called the Hamilton–Jacobi–Isaacs (HJI) equation. According to the HJI equation (7), the optimal control input u^*(x) and the worst-case disturbance w^*(x) satisfy
u^*(x) = -b_u\,\zeta\!\Big(\frac{1}{2b_u} g^T \nabla V^*(x)\Big),  (8)
w^*(x) = \frac{1}{2\rho^2} h^T \nabla V^*(x),  (9)
respectively, where (u^*(x), w^*(x)) is the saddle-point solution. Substituting u^*(x) and w^*(x) into (7), we have
x^T Q x + 2\int_0^{-b_u\zeta(\frac{1}{2b_u}g^T\nabla V^*(x))} b_u\,\zeta^{-T}\!\Big(\frac{\nu}{b_u}\Big)d\nu + (\nabla V^*(x))^T f - (\nabla V^*(x))^T g\,b_u\,\zeta\!\Big(\frac{1}{2b_u} g^T \nabla V^*(x)\Big) + \frac{1}{4\rho^2}(\nabla V^*(x))^T h h^T \nabla V^*(x) = 0.  (10)
It is noted that the HJI equation is a nonlinear partial differential equation and it is difficult to obtain its analytic solution. Next, we develop an event-triggered constrained H∞ control method to solve the HJI equation.
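For reference, (8)–(9) can be evaluated directly once an approximation of ∇V*(x) is available from a critic. The NumPy sketch below is illustrative only; all arguments (the gradient, g, h, b_u, ρ) are assumed to be supplied by the user and ζ is taken as tanh, as in the paper.

```python
import numpy as np

def optimal_inputs(grad_V, g, h, b_u, rho):
    """Evaluate the saddle-point policies (8)-(9) for a given critic gradient.

    grad_V : (m,) gradient of the value function at x (assumed available)
    g      : (m, n) input gain matrix g(x)
    h      : (m, p) disturbance gain matrix h(x)
    b_u    : positive control bound
    rho    : prescribed disturbance-attenuation level
    """
    # u*(x) = -b_u * tanh(g^T grad_V / (2 b_u)); zeta is chosen as tanh here
    u_star = -b_u * np.tanh(g.T @ grad_V / (2.0 * b_u))
    # w*(x) = h^T grad_V / (2 rho^2)
    w_star = h.T @ grad_V / (2.0 * rho ** 2)
    return u_star, w_star
```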

3 Event-Triggered Constrained H∞ Control

3.1 Event-Triggering Mechanism of the H∞ Control

Define the time sequence of the event-triggering mechanism as
\tau_0, \ldots, \tau_k, \tau_{k+1}, \ldots,  (11)
where k ∈ N, τ_0 = 0, and τ_k is the k-th sampling instant. Based on (11), the state sequence is
x(\tau_0), \ldots, x(\tau_k), x(\tau_{k+1}), \ldots.  (12)
Thus, the control sequence is defined as
u(x(\tau_0)), \ldots, u(x(\tau_k)), u(x(\tau_{k+1})), \ldots.  (13)
The H∞ control system (1) based on (13) becomes
\dot{x}(t) = f(x) + g(x)u(x(\tau_k)) + h(x)w(t).  (14)
Then, the optimal control input (8) under the event-triggering mechanism is
u^*(x(\tau_k)) = -b_u\,\zeta\!\Big(\frac{1}{2b_u} g^T(x(\tau_k)) \nabla V^*(x(\tau_k))\Big).  (15)
For all t ∈ [\tau_k, \tau_{k+1}), define the following triggering error e_\tau(t),
e_\tau(t) = x(\tau_k) - x(t).  (16)

The sampling instant under the event-triggering mechanism is aperiodic and it depends on the defined triggering rule. Generally, when the triggering error exceeds a given threshold, the triggering rule is said to be violated. When the triggering rule is violated, an event is triggered and the triggering error is reset to zero.
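This mechanism can be summarized by the following illustrative Python sketch; `policy`, `plant_step` and `threshold` are hypothetical placeholders for the controller, the system dynamics and the state-dependent triggering rule of Sect. 3.3, and are not part of the paper.

```python
import numpy as np

def event_triggered_run(x0, policy, plant_step, threshold, T, dt=0.01):
    """Minimal event-triggered loop: the control input is only recomputed
    when the triggering error e_tau(t) = x(tau_k) - x(t) violates the rule."""
    x, x_k = x0.copy(), x0.copy()          # current state and last sampled state
    u = policy(x_k)                        # control held constant between events
    events = 0
    for _ in range(int(T / dt)):
        e_tau = x_k - x                    # triggering error
        if np.linalg.norm(e_tau) ** 2 > threshold(x, u):   # rule violated -> event
            x_k = x.copy()                 # sample the state, reset the error to zero
            u = policy(x_k)                # update the event-triggered control input
            events += 1
        x = plant_step(x, u, dt)           # flow dynamics under the held input
    return events
```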

3.2 Implementation Using Concurrent Learning and ADP

Given the powerful nonlinear approximation ability of neural networks (NNs), the following NN with i hidden neurons is employed to approximate the optimal value function V^*(x),
V^*(x) = P^T \theta(x) + \varepsilon(x),  (17)
where P ∈ R^i, θ(x) ∈ R^i and ε(x) are the ideal weights, the activation function and the approximation error of the NN, respectively, and i > 0. Using \hat{P} to approximate the unknown P, the estimate of V^*(x) is
\hat{V}(x) = \hat{P}^T \theta(x),  (18)
where (18) is the critic NN. Based on (17) and (18), we have
\tilde{P} = P - \hat{P},  (19)
where \tilde{P} is the weight estimation error. Based on (18), the Hamiltonian (6) becomes
\hat{H}(x) = x^T Q x + Y(u) - \rho^2 w^T w + \hat{P}^T \nabla\theta(x)(f + gu + hw).  (20)
In the process of critic NN training, \hat{P} should be designed to minimize \hat{H}(x). Note that, based on (18), the following event-triggered control input and time-triggered disturbance are employed
\hat{u}(x(\tau_k)) = -b_u\,\zeta\!\Big(\frac{1}{2b_u} g^T(x(\tau_k)) \hat{P}^T \nabla\theta(x(\tau_k))\Big),  (21)
\hat{w}(x) = \frac{1}{2\rho^2} h^T(x) \hat{P}^T \nabla\theta(x).  (22)
Considering concurrent learning, for all t_c ∈ [\tau_k, \tau_{k+1}), the following data set is stored
C \triangleq \{x(t)\,|\,t = t_c\}_{c=1}^{l},  (23)
where l ∈ N. For x(t_c), the Hamiltonian is
\hat{H}(x(t_c)) = x^T(t_c) Q x(t_c) + Y(u) - \rho^2 w^T w + \hat{P}^T \nabla\theta(x(t_c))(f + gu + hw).  (24)

According to (20) and (24), define the following objective function
E(x, x(t_c)) = \frac{1}{2}\hat{H}^T(x)\hat{H}(x) + \frac{1}{2}\hat{H}^T(x(t_c))\hat{H}(x(t_c)).
Using
\frac{\partial E(x, x(t_c))}{\partial \hat{P}} = \nabla\theta(x)(f + gu + hw)\hat{H}(x) + \sum_{c=1}^{l}\nabla\theta(x(t_c))(f + gu + hw)\hat{H}(x(t_c)),  (25)


the following gradient descent algorithm is designed
\dot{\hat{P}} = -r\frac{\pi \hat{H}(x)}{(1+\pi^T\pi)^2} - r\sum_{c=1}^{l}\frac{\pi(t_c)\hat{H}(x(t_c))}{(1+(\pi(t_c))^T\pi(t_c))^2},  (26)
where π = ∇θ(x)(f + g\hat{u}(x(τ_k)) + h\hat{w}(x)), π(t_c) = ∇θ(x(t_c))(f + g\hat{u}(x(τ_k)) + h\hat{w}(x(t_c))), and r > 0 is the learning rate. In this paper, {π(t_c)}_{c=1}^{l} needs to satisfy rank(Ξ) = i, where Ξ = [π(t_1), ..., π(t_c)], to replace the traditional PE condition. According to [26], the linear independence of {θ(x_l)}_{c=1}^{l} ensures that {π(t_c)}_{c=1}^{l} is linearly independent. According to (26), we have
\dot{\tilde{P}} = r\frac{\pi \hat{H}(x)}{(1+\pi^T\pi)^2} + r\sum_{c=1}^{l}\frac{\pi(t_c)\hat{H}(x(t_c))}{(1+(\pi(t_c))^T\pi(t_c))^2}
 = -r\frac{\pi\pi^T\tilde{P}}{(1+\pi^T\pi)^2} - r\sum_{c=1}^{l}\frac{\pi(t_c)(\pi(t_c))^T\tilde{P}}{(1+(\pi(t_c))^T\pi(t_c))^2} + r\frac{\pi\eta}{(1+\pi^T\pi)^2} + r\sum_{c=1}^{l}\frac{\pi(t_c)\eta(t_c)}{(1+(\pi(t_c))^T\pi(t_c))^2},  (27)

where η = ∇ε(x)(f + gu + hw) and η(t_c) = ∇ε(x(t_c))(f + gu + hw).
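A minimal NumPy sketch of one discretized step of (26) is given below (unit step size, variable names of our own choosing); it is meant only to make the structure of the concurrent-learning update explicit, with the current sample and the stored data set both contributing to the gradient.

```python
import numpy as np

def critic_update(P_hat, pi, H_hat, memory, r):
    """One discretized step of the gradient law (26).

    P_hat  : (i,) critic weight estimate
    pi     : (i,) regressor at the current state
    H_hat  : scalar Hamiltonian residual at the current state
    memory : list of (pi_c, H_c) pairs at the stored points x(t_c),
             with residuals recomputed using the current weights
    r      : learning rate
    """
    grad = pi * H_hat / (1.0 + pi @ pi) ** 2
    for pi_c, H_c in memory:              # concurrent-learning terms over the data set C
        grad += pi_c * H_c / (1.0 + pi_c @ pi_c) ** 2
    return P_hat - r * grad               # explicit Euler step (unit step size) of (26)
```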

3.3 Stability Analysis

The following assumption is proposed for the analysis of the theorem [27–29].

Assumption 1. ∇θ(x) and g(x) are Lipschitz continuous with constants C_σ > 0 and C_g > 0. g(x), ∇θ(x), ∇ε(x) and η have positive upper bounds b_g, b_θ, b_ε and b_η, respectively.

Theorem 1. Consider the nonlinear system (1) with the event-triggered control input (21) and the time-triggered disturbance input (22). Let Assumption 1 hold. The threshold satisfies
\|E_\tau(t)\|^2 = \frac{\lambda_{\min}(Q)(1 - \Phi_3^2)\|x\|^2 + Y(\hat{u}(x(\tau_k))) - \rho^2 \hat{w}^T\hat{w}}{\Phi_4^2\,\|\hat{P}\|^2},  (28)
where Φ_3 and Φ_4 are defined in the following proof. When the critic network is trained using (26), x and \tilde{P} are uniformly ultimately bounded.


Proof. Select the following Lyapunov function
L = \frac{1}{2}\tilde{P}^T\tilde{P} + V^*(x) + V^*(x(\tau_k)).  (29)
The introduction of the event-triggering mechanism makes the system an impulsive system, so the proof is divided into two cases.
Case 1: For the flow dynamics, i.e., t ∈ [\tau_k, \tau_{k+1}). Denote
L_p \triangleq \frac{1}{2}\tilde{P}^T\tilde{P}.  (30)
Based on (27), we have
\dot{L}_p = -\frac{r\tilde{P}^T\pi\pi^T\tilde{P}}{(1+\pi^T\pi)^2} - \sum_{c=1}^{l}\frac{r\tilde{P}^T\pi(t_c)(\pi(t_c))^T\tilde{P}}{(1+(\pi(t_c))^T\pi(t_c))^2} + \frac{r\tilde{P}^T\pi\eta}{(1+\pi^T\pi)^2} + \sum_{c=1}^{l}\frac{r\tilde{P}^T\pi(t_c)\eta(t_c)}{(1+(\pi(t_c))^T\pi(t_c))^2}
 \le -\frac{1}{2} r\lambda_{\min}(\Phi_1)\|\tilde{P}\|^2 + \frac{1}{2} r\Phi_2,  (31)
where
\Phi_1 = \frac{\pi\pi^T}{(1+\pi^T\pi)^2} + \sum_{c=1}^{l}\frac{\pi(t_c)(\pi(t_c))^T}{(1+(\pi(t_c))^T\pi(t_c))^2}, \quad \Phi_2 = \eta^T\eta + \sum_{c=1}^{l}(\eta(t_c))^T\eta(t_c).

For the second term \dot{V}^*(x), it satisfies
\dot{V}^*(x) = (\nabla V^*(x))^T\big(f(x) + g(x)\hat{u}(x(\tau_k)) + h(x)\hat{w}(x)\big).  (32)
Based on the time-triggered HJI equation, we obtain
(\nabla V^*(x))^T f(x) = -x^T Q x - Y(u^*) - \rho^2 w^{*T} w^* - (\nabla V^*(x))^T g u^*.  (33)
From (8) and (9), we have
(\nabla V^*(x))^T g = -2 b_u\,\zeta^{-T}\!\Big(\frac{u^*}{b_u}\Big),  (34)
(\nabla V^*(x))^T h = 2\rho^2 w^{*T}.  (35)
Considering (33)–(35), (32) becomes
\dot{V}^*(x) = -x^T Q x - Y(u^*) - \rho^2 w^{*T} w^* + 2 b_u\,\zeta^{-T}\!\Big(\frac{u^*}{b_u}\Big)\big(u^* - \hat{u}(x(\tau_k))\big) + 2\rho^2 w^{*T}\hat{w}
 = 2 b_u \int_{\hat{u}(x(\tau_k))}^{u^*}\Big(\zeta^{-1}\!\Big(\frac{u^*}{b_u}\Big) - \zeta^{-1}\!\Big(\frac{\nu}{b_u}\Big)\Big)^T d\nu - x^T Q x - Y(\hat{u}(x(\tau_k))) + \rho^2\big(2 w^{*T}\hat{w} - w^{*T} w^*\big).  (36)

Denote
\zeta^{-1}\!\Big(\frac{u^*}{b_u}\Big) \triangleq \upsilon^*(x) = -\frac{1}{2b_u} g^T(x)\big(P^T\nabla\theta(x) + \nabla\varepsilon(x)\big),
\zeta^{-1}\!\Big(\frac{\hat{u}(x(\tau_k))}{b_u}\Big) \triangleq \hat{\upsilon}(x(\tau_k)) = -\frac{1}{2b_u} g^T(x(\tau_k))\hat{P}^T\nabla\theta(x(\tau_k)).

Then, the first term of (36) satisfies
2 b_u \int_{\hat{u}(x(\tau_k))}^{u^*}\Big(\zeta^{-1}\!\Big(\frac{u^*}{b_u}\Big) - \zeta^{-1}\!\Big(\frac{\nu}{b_u}\Big)\Big)^T d\nu \le 2 b_u \|u^* - \hat{u}(x(\tau_k))\|\,\|\upsilon^*(x) - \hat{\upsilon}(x(\tau_k))\|.  (37)

Since the upper bound of the first derivative of the hyperbolic tangent function is 1, we have \|u^* - \hat{u}(x(\tau_k))\| \le b_u \|\upsilon^*(x) - \hat{\upsilon}(x(\tau_k))\|; thus, (37) satisfies
2 b_u \int_{\hat{u}(x(\tau_k))}^{u^*}\Big(\zeta^{-1}\!\Big(\frac{u^*}{b_u}\Big) - \zeta^{-1}\!\Big(\frac{\nu}{b_u}\Big)\Big)^T d\nu \le 2 b_u^2 \|\upsilon^*(x) - \hat{\upsilon}(x(\tau_k))\|^2
 \le 0.5\,\|g^T(P^T\nabla\theta + \nabla\varepsilon) - g^T(x(\tau_k))\hat{P}^T\nabla\theta(x(\tau_k))\|^2
 \le \|\nabla\theta\,g - \nabla\theta(x(\tau_k))g(x(\tau_k))\|^2\|\hat{P}\|^2 + \|g^T\tilde{P}^T\nabla\theta + g^T\nabla\varepsilon\|^2
 \le \|(\nabla\theta - \nabla\theta(x(\tau_k)))g + \nabla\theta(x(\tau_k))(g - g(x(\tau_k)))\|^2\|\hat{P}\|^2 + 2\|g^T\tilde{P}^T\nabla\theta\|^2 + 2\|g^T\nabla\varepsilon\|^2
 \le 2(C_\sigma^2 b_g^2 + b_\theta^2 C_g^2)\|e_\tau(t)\|^2\|\hat{P}\|^2 + 2 b_g^2 b_\theta^2\|\tilde{P}\|^2 + 2 b_g^2 b_\varepsilon^2.  (38)

For \rho^2(2 w^{*T}\hat{w} - w^{*T} w^*) in (36), we have
\rho^2\big(2 w^{*T}\hat{w} - w^{*T} w^*\big) \le \rho^2 \hat{w}^T\hat{w}.  (39)
Considering (38) and (39), (36) becomes
\dot{V}^*(x) \le 2(C_\sigma^2 b_g^2 + b_\theta^2 C_g^2)\|e_\tau(t)\|^2\|\hat{P}\|^2 + 2 b_g^2 b_\theta^2\|\tilde{P}\|^2 + 2 b_g^2 b_\varepsilon^2 - \lambda_{\min}(Q)\|x\|^2 - Y(\hat{u}(x(\tau_k))) + \rho^2\hat{w}^T\hat{w}.  (40)
Since \dot{V}^*(x(\tau_k)) = 0, we have
\dot{L} \le -\Big(\frac{1}{2} r\lambda_{\min}(\Phi_1) - 2 b_g^2 b_\theta^2\Big)\|\tilde{P}\|^2 + \frac{1}{2} r\Phi_2 + 2 b_g^2 b_\varepsilon^2 + 2(C_\sigma^2 b_g^2 + b_\theta^2 C_g^2)\|e_\tau(t)\|^2\|\hat{P}\|^2 - \lambda_{\min}(Q)\|x\|^2 - Y(\hat{u}(x(\tau_k))) + \rho^2\hat{w}^T\hat{w}.  (41)


Therefore, if the triggering rule is satisfied, i.e., eτ (t)2
0, the lower the score of the negative case triplet, the greater the corresponding sampling weight.

4 Experiment and Result Analysis

4.1 Data Set

In this experiment, we use two well-known benchmark datasets in the field of knowledge graphs: WN18RR and FB15K-237. The WN18RR dataset contains a variety of common-sense knowledge across different domains. FB15K-237 is the result of rearranging relationships and resampling negative triples from the original FB15K dataset. Using these datasets, we can evaluate the performance and effectiveness of our model in capturing and understanding complex relationships in the knowledge graph (Table 1).

4.2 Description of Evaluation Methods and Experimental Settings

The link prediction task is to find a missing item in a triple. To evaluate the model, several evaluation metrics are used:


Table 1. Statistics of datasets.

Dataset      Entities  Relations  Training  Validation  Test    Indegree
WN18RR       40943     11         86835     3034        3134    2.12
FB15K-237    14541     237        272115    17535       20466   18.7

Mean Rank (MR): It calculates the average ranking of all the predicted entities. Lower MR values indicate better performance. Mean Reciprocal Rank (MRR): It calculates the average of the reciprocal ranks of all predicted entities; it emphasizes ranking the correct entity higher in the list and provides a measure of overall accuracy. Hits@N: It calculates the hit ratio of the predicted entity among the top N candidate entities, i.e., the proportion of correct predictions within the top N ranks; common choices for N are 1, 3, or 10. MRR is a widely used evaluation method in search-ranking algorithms and better reflects a model's accuracy; it is based on the reciprocal of the rank of the first correct answer in the list of all possible candidates. The formula for calculating MRR is:
MRR = \frac{1}{n}\sum_{i=1}^{n}\frac{1}{rank(i)},  (23)

where n represents the number of test-set triples and rank(i) represents the rank of the score of the i-th original correct triple among the scores of all candidate triples. A higher MRR indicates that the model is better at ranking the correct missing entity in a triple. Hits@n measures the proportion of test-set triples for which the correct entity or relationship is predicted within the top N scores, i.e., the percentage of correct predictions ranked among the top N candidates. The hyperparameters are set as follows: self-adversarial negative sampling temperature α ∈ {0.5, 1, 1.5}, embedding dimension of entities and relationships k ∈ {128, 256, 512}, fixed margin γ ∈ {6, 12, 15, 18, 21}, batch size ∈ {128, 256, 512}; the number of negative cases n in the training phase is fixed at 256, and d_n is set to 3k.
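For reference, the three metrics can be computed from the ranks of the correct entities as in the following sketch (an illustrative helper written for this text, not the authors' evaluation code).

```python
import numpy as np

def link_prediction_metrics(ranks, hits_at=(1, 3, 10)):
    """Compute MR, MRR and Hits@N from the ranks of the correct entities,
    matching the definitions above (Eq. (23) for MRR)."""
    ranks = np.asarray(ranks, dtype=float)
    metrics = {
        "MR": ranks.mean(),             # mean rank, lower is better
        "MRR": (1.0 / ranks).mean(),    # mean reciprocal rank, higher is better
    }
    for n in hits_at:
        metrics[f"Hits@{n}"] = (ranks <= n).mean()
    return metrics

# e.g. link_prediction_metrics([1, 4, 2, 120]) -> MR 31.75, MRR ~0.44, Hits@1 0.25, ...
```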

4.3 Experimental Results and Analysis

To validate the effectiveness of the proposed Context 3D model, it is compared with several widely used knowledge graph representation learning models: TransE, DistMult, ComplEx, RotatE, ConvE, and R-GCN, which incorporates graph context information. The comparison is conducted on the WN18RR and FB15K-237 datasets, and the findings are summarized as follows (Table 2):


Table 2. Link prediction results of different models on the WN18RR and FB15K-237 datasets.

             WN18RR                                      FB15K-237
Model        MR    MRR    Hits@1  Hits@3  Hits@10        MR   MRR    Hits@1  Hits@3  Hits@10
TransE       3382  0.293  -       -       0.465          357  0.226  -       -       0.465
RotatE       3341  0.475  0.427   0.491   0.572          177  0.338  0.240   0.381   0.532
DistMult     5111  0.420  0.390   0.440   0.481          254  0.240  0.155   0.262   0.418
ComplEx      5260  0.430  0.420   0.460   0.501          338  0.246  0.158   0.276   0.428
ConvE        4185  0.431  0.402   0.441   0.521          237  0.331  0.241   0.364   0.501
R-GCN        -     -      -       -       -              -    0.249  0.151   0.264   0.417
Context 3D   2854  0.472  0.429   0.489   0.568          162  0.352  0.267   0.402   0.548

1) Context 3D outperforms the basic models (TransE, DistMult, and ComplEx) in various evaluation metrics. This demonstrates that Context 3D exhibits superior performance and capability in knowledge graph representation learning tasks. Compared to R-GCN, Context 3D also shows certain improvements. Although it may not achieve optimal results on all metrics, it consistently achieves sub-optimal performance, confirming the effectiveness of the proposed method. On the FB15K-237 dataset, Context 3D achieves optimal results on most metrics, except for sub-optimal performance in Hits@1. This further validates the superiority of Context 3D in capturing relational patterns in knowledge graphs. On the WN18RR dataset, Context 3D achieves optimal results in MR, MRR, and Hits@3, and sub-optimal results in Hits@1 and Hits@10. This indicates that the Context 3D model can deliver relatively good performance across different datasets.

2) The WN18RR and FB15K-237 datasets contain three relational patterns that are prevalent in the data: symmetry, antisymmetry, and composition. Compared to the RotatE model, Context 3D demonstrates improvements in various evaluation metrics. This improvement can be attributed to Context 3D's ability to maintain the order of relationships in composite relational patterns.

3) Compared with the WN18RR dataset, the improvement brought by combining the entity graph context is more significant on the FB15K-237 dataset. This difference is mainly due to the fact that entities in FB15K-237 have richer graph context information. In knowledge graphs with more neighborhood relationships, Context 3D can make more efficient use of entity graph context information, thus improving the performance of representation learning tasks. This further emphasizes the importance of integrating the entity graph context and provides guidance for model and dataset selection.

To assess the capability of the Context 3D model in handling diverse complex relationships, experiments were conducted on the FB15K-237 dataset, and different relationship categories were analyzed. The results are presented in Fig. 2, and the following observations can be made:


Fig. 2. Relationship Category Testing on FB15K-237 Dataset

For 1-1 relationships, the Context 3D model demonstrates superior performance in predicting both head and tail entities. This indicates that incorporating the graph context information of entities enhances the accuracy of entity embeddings. In the case of complex relationships, particularly those falling into the N-1, 1-N, and N-N categories, the Context 3D model outperforms other models by a significant margin. This suggests that the fusion of graph context information enables the model to effectively differentiate between different entities within these complex relationship categories.

5 Conclusion

In this paper, a 3D rotation knowledge graph representation learning model, Context 3D, is proposed, which incorporates the entity graph context. Entities are represented as vectors in three-dimensional space to better capture various relational patterns. The model learns a representation of the entity graph context to enhance the embedded entity representations. Experimental results show that, compared with existing representation learning models, the performance of the Context 3D model is improved.

Acknowledgments. This work was supported by the Natural Science Foundation of Shandong Province (No. ZR2022LZH008) and The 20 Planned Projects in Jinan (Nos. 2021GXRC046, 202228120).


TextBFA: Arbitrary Shape Text Detection with Bidirectional Feature Aggregation

Hui Xu1,2(B), Qiu-Feng Wang3, Zhenghao Li1, Yu Shi1, and Xiang-Dong Zhou1

1 Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China
  {xuhui,lizh,shiyu,zhouxiangdong}@cigit.ac.cn
2 Chongqing School, University of Chinese Academy of Sciences (UCAS Chongqing), Chongqing, China
3 Department of Intelligent Science, School of Advanced Technology, Xi'an Jiaotong-Liverpool University (XJTLU), Suzhou, China
  [email protected]

Abstract. Scene text detection has achieved great progress recently; however, it remains challenging to detect arbitrarily shaped text in scene images with complex backgrounds, especially unobvious and long texts. To tackle this issue, we propose an effective text detection network, termed TextBFA, which strengthens the text feature by aggregating high-level semantic features. Specifically, we first adopt a bidirectional feature aggregation network to propagate and collect information on feature maps. Then, we exploit a bilateral decoder with lateral connection to recover the low-resolution feature maps for pixel-wise prediction. Extensive experiments demonstrate the detection effectiveness of the proposed method on several benchmark datasets, especially on inconspicuous text detection.

Keywords: Scene text detection · inconspicuous text · feature aggregation

1 Introduction

Scene text detection plays an important role in locating text regions in natural images. Recently, regular-shape text detection methods have achieved great progress; however, it remains challenging to detect arbitrarily shaped texts, such as the examples shown in Fig. 5. Nowadays, researchers [1,9,13,28] have started to focus on arbitrary shape text detection for practical applications. Such methods can be roughly classified into two categories: regression-based [19,21,22] and segmentation-based methods



[12,18,23]. Segmentation-based methods clearly have advantages in the geometric representation of arbitrarily shaped texts. However, existing segmentation-based methods [9,30] rely heavily on the prediction accuracy of the Feature Pyramid Network (FPN) [10] framework and neglect the high correlation between pixels within a text instance. Inconspicuous text, including text with complex illumination, small scale, obstacles and other inconspicuous features, as shown in Fig. 1 and Fig. 5, is still challenging to detect in the wild. If we regard FPN as an encoder-decoder structure, such text detection methods can be divided into encoder, decoder and post-processing stages, as shown in the top pipeline in Fig. 1. Most methods improve the model in the post-processing part, including belonging-relation learning between pixels [28] and boundary fine-tuning [27], etc. In contrast, we add a bidirectional feature aggregation module between the encoder and the decoder to improve the model in the feature extraction stage.

Fig. 1. Feature aggregation illustration. The top pipeline represents most current segmentation-based text detection methods, while the bottom pipeline illustrates our detector incorporating feature aggregation.

In this paper, we propose a novel text detector for arbitrary shape text detection, which increases the cohesion between pixels within the text by feature aggregation. Our text detection network is mainly composed of a CNN-based encoder, a bidirectional feature aggregation module and a bilateral up-sampling decoder, as shown in Fig. 2. Inspired by RESA [29], we design the bidirectional feature aggregation network to collect information within the features and propagate spatial information along both horizontal and vertical directions. The aggregated feature Fs is combined from three branches: horizontal aggregator, vertical aggregator and the direct link. Each aggregator first slices the feature map along the current direction, and then aggregates each sliced feature with another sliced feature adjacent to a certain stride. Each spatial location is updated synchronously through multiple steps and finally can gather information in the whole space. In this way, the relationship between pixels inside the text is better mined, and the background information is also strengthened.


Fig. 2. The overview framework of TextBFA, which is composed by encoder, bidirectional feature aggregator and decoder. ‘Tn ’ and ‘Bn ’ represent “top-to-bottom” and “bottom-to-top” respectively at the n-th layer in vertical aggregator, and ‘Ln ’ and ‘Rn ’ represent “left-to-right” and “right-to-left” respectively in horizontal aggregator.

After that, we design a bilateral decoder with lateral connection to up-sample the low-resolution feature maps into pixel-wise predictions. In summary, the main contributions of this paper are three-fold:
1) We propose a novel unified end-to-end trainable framework for arbitrary shape text detection, which can strengthen the cohesion between the pixels inside the text.
2) To the best of our knowledge, our work presents one of the very first attempts to perform feature aggregation for inconspicuous text detection.
3) The proposed method is demonstrated to be effective for inconspicuous text detection and achieves competitive performance on benchmark datasets.

2 Related Work

Scene text detection methods based on deep learning can be roughly divided into two categories: top-down methods and bottom-up methods.
Top-Down Methods. Methods of this type explicitly encode text instances with contours of the text region. Different from generic objects, texts are often presented in irregular shapes with various aspect ratios. MOST [7] focused on text instances of extreme aspect ratios and applied Position-Aware Non-Maximum Suppression (PA-NMS) to produce the final predictions. TextRay [19] formulated the text contours in polar coordinates. TextBPN [27] adopted a GCN-based adaptive boundary deformation model to generate accurate text instances. Although these methods have achieved good performance in quadrilateral text detection, the constrained representation capability of a point sequence limits the detection performance on arbitrarily shaped text. In addition, they always need some hand-designed processes, such as anchor generation and NMS.


Bottom-Up Methods. Methods of this type first detect fundamental components (e.g., pixels and segments) and then combine these components to produce the detection results. DB [9] performed an adaptive binarization process in a segmentation network, which simplifies the post-processing and enhances the detection performance. Seglink++ [17] took small segments of a text instance as the fundamental elements and linked them together to form the bounding boxes. CRAFT [1] detected text regions by exploring affinities between characters. Zhang et al. [28] applied a GCN to infer the linkages between different text components. TCM [25] focused on turning the CLIP model directly into a text detector without a pretraining process. These bottom-up methods have fewer constraints than the top-down ones on the representation of arbitrary shapes, but their performance is still affected by the quality of instance segmentation.

3 Methodology

3.1 Overview

The pipeline of the proposed method is shown in Fig. 2. We use the backbone network to extract shared features. The bidirectional feature aggregator consists of a vertical aggregator and a horizontal aggregator. We exploit the bilateral decoder with lateral connection to perform feature concatenation and upsampling based on the aggregated feature Fs. After that, text boundaries can be obtained under the guidance of pixel-wise predictions.

3.2 Bidirectional Feature Aggregator

The bidirectional feature aggregator consists of a vertical aggregator and a horizontal aggregator. The vertical aggregator is composed of "top-to-bottom" and "bottom-to-top" layers, and the horizontal aggregator is composed of "right-to-left" and "left-to-right" layers. As examples, we only present the structure of feature aggregation in two transfer directions, "bottom-to-top" and "left-to-right", as shown in Fig. 3. In "bottom-to-top" layers, the feature is passed up from the bottom slice of the feature map with a certain stride. Similarly, in "top-to-bottom" layers, the feature is passed down from the top slice. The same is true of "left-to-right" and "right-to-left" layers. The feature map X is split into H slices in the horizontal direction or W slices in the vertical direction. In each transfer direction, we utilize several layers with different shift strides to perform feature aggregation. Given a 3D feature map tensor X of size C × H × W, where C, H and W denote the number of channels, rows and columns respectively, the aggregated feature map is calculated as follows,
F_s = Concat(K_v ⊗ F_v, K_h ⊗ F_h, K_x ⊗ X),  (1)
where K_v, K_h and K_x are 1 × 1 convolution kernels to change the number of feature channels. F_v and F_h are the feature maps output by the vertical aggregator and the horizontal aggregator, respectively. Let L = {L_1, L_2, ..., L_n}, R =


{R_1, R_2, ..., R_n}, T = {T_1, T_2, ..., T_n} and B = {B_1, B_2, ..., B_n} represent the aggregation layers of "left-to-right", "right-to-left", "top-to-bottom" and "bottom-to-top", respectively, where n is the number of layers. The features F_v and F_h are obtained as follows,
F_v = T_n(T_{n-1}(\ldots(T_1(B_n(B_{n-1}(\ldots(B_1(X))))))))
F_h = R_n(R_{n-1}(\ldots(R_1(L_n(L_{n-1}(\ldots(L_1(X)))))))),  (2)

In the current vertical aggregator, we perform “bottom-to-top” layers first and then “top-to-bottom” layers. In fact, it doesn’t matter if the order of the

Fig. 3. Illustration of Bidirectional Feature Aggregator. Similarly, the feature propagation in “top-to-bottom” layers is sequential downward from the top feature slice by stride, in the opposite direction to “bottom-to-top” layers. The same is true for “right-to-left” layers.


two are switched. The same goes for the horizontal aggregator. The process of feature aggregation in four types of layers can be represented as follows, 

X'^t_{c_o,i,j} = X^t_{c_o,i,j} + \beta f(Z^t_{c_o,i,j}),  (3)
where X^t denotes the feature at the t-th layer (e.g., L_t, R_t, T_t, B_t), c_o, i and j represent the indexes of channel, row and column, respectively, and Z^t is the intermediate result for information passing. After that, X^t is updated to X'^t by a feature aggregation. f(·) is a nonlinear activation function such as ReLU, and β is a hyper-parameter to control the degree of feature aggregation. The information to be transmitted is calculated as follows,
Z^t_{c_o,i,j} = \sum_{n_{in}, w_k} K_{n_{in}, c_o, w_k} \otimes X^t_{n_{in}, p_h, p_w},  (4)

Bilateral Decoder with Lateral Connection (BDLC)

The bilateral up-sampling decoder in RESA [29] directly upsamples the output of the last layer, which only considers features at a single scale. However, multi-scale detection still performs better especially for small objects, and the text is usually small target in a scene image. Inspired by FPN [10], we design a bilateral decoder with lateral connection to associate low-level feature maps across resolutions and semantic levels as shown in Fig. 4. Each lateral connection merges feature maps of the same spatial size between encoder and decoder. Though bottom feature maps in encoder are of low-level

TextBFA

371

Fig. 4. A building block illustrating the bilateral upsampling and lateral connection, which constructs our high-resolution feature maps. Non-bt-1D block is composed by two 3 × 1 convolution layers and two 1 × 3 convolution layers as described in [15].

semantics, their activations are more accurately localized as they were subsampled fewer times. With a low-resolution feature map, we double the spatial resolution by up-sampling in two pathways: bilinear interpolation based and transpose convolution based. The upsampled feature map is then merged with the corresponding feature map in the encoder by element-wise addition. This process is iterated until the finest resolution map is generated. The bilinear interpolation based pathway utilize the bilinear upsampling to generate a relatively rough feature map of 2× size. A 1 × 1 convolution is applied to reduce the number of channels, which follows a BN and a ReLU. This pathway can quickly recover the obvious contour features, although some details may be lost. By contrast, the transpose convolution based pathway can get more refined features due to more learnable kernel parameters. We combine the two passways by adding their outputs together. 3.4

Training

The proposed model is trained in an end-to-end manner, with the following objective function, L = λtr Ltr + λtcl Ltcl + λs Lscale + λo Lori ,

(7)

where Ltr and Ltcl are the cross-entropy loss for text region and text center region. To solve the sample imbalance problem, OHEM [16] is adopted for Ltr with the ratio between negative and positive samples being 3 : 1. Lscale and Lori are the Smoothed-L1 loss for text height maps and orientation. The loss function can be formulated as    sˆ−s  Lscale , (8) = SmoothedL1 ˆ s Lori θ−θ

372

H. Xu et al.

where sˆ and θˆ are the predicted values of the scale and the orientation, while s and θ are their groundtruth correspondingly. Details of Lori and θ can be found in [24]. The hyper-parameters, λtr , λtcl , λs and λo are all set to 1 in our experiments.

4

Experiments

We varify the effectiveness of our model on three representative benchmarks: TD500 [3], Total-Text [2] and CTW1500 [11]. We pre-train the network on SynthText [5] and MLT2017 [14] datasets, then fine-tune it on the official training set of the corresponding dataset. 4.1

Datasets

The datasets used for the experiments in this paper are briefly introduced below: SynthText [5] is a large scale dataset that contains about 800K synthetic images, which is only used to pre-train our model. TD500 [3] consists of 300 training images and 200 test images, including multi-oriented and long text. The texts in the images are either English or Chinese scripts, annotated by rotate rectangles. Total-Text [2] is one of the most important arbitrarily-shaped scene text benchmarks, which contains 1255 training images and 300 test images. The images were collected from various scenes, including text-like background clutter and low-contrast texts, with word-level polygon annotations. CTW1500 [11] contains both Englist and Chinese texts with text-line level annotations, where 1000 images for training, and 500 images for testing. It also contains horizontal, multi-orientated and curved texts. 4.2

Implementation Details

We use the ResNet50 [6] pre-trained on ImageNet as our backbone. The model is implemented in Pytorch and trained on 8 NVIDIA GTX GeForce 1080T GPUs using Adam optimizer [8]. We pre-train the network on Synth-Text and MLT2017 dataset and then fine-tune it on the official training set of the target dataset. We adopt data augmentation strategies, e.g., randomly rotating the images with degree in range of [−15◦ , 15◦ ], random brightness and contrast on input images, and random crop, where we make sure that no text has been cut. In the pretraining stage, the randomly cropped images are resized to 512 × 512. In the fine-tuning stage, multi-scale training is used. The short side of the image is resized to {640, 800, 960}, and the long sizd of the image is maintained to 1280. Blurred texts labeled as DO NOT CARE are ignored during training. The initial learning rate for pre-training and fine-tuning is 1e−3 and 1e−4 , respectively. Both of them are reduced 0.8× every 50 epoches.

TextBFA

4.3

373

Ablation Study

We conduct the ablation study to verify the effectiveness of bidirectional feature aggregator and BDLC. Effect of Each Component: To verify the performance gain of the proposed bidirectional feature aggregator and BDLC, we conduct several ablation experiments on CTW1500. We summarize the detection performance of each module in Table 1. As we can see, both modules can improve text detection performance, which verifies the capabilities of proposed modules. It is worth noting that the version without BDLC uses transposed convolution for upsampling. Table 1. Ablation experiments of each module on CTW1500. Baseline Aggregator BDLC F-measure (%) 

82.0 

84.1





82.5



84.8

Effectiveness of Lateral Connection in Decoder: To evaluate the effectiveness of the proposed BDLC, we conduct servel experiments on CTW1500. We compare the performance of decoders with and without lateral connection. In addition, we compare the performance of BDLC with unilateral decoders based on bilinear upsampling and transpose convolution upsampling. The results in Table 2 shows that our decoder outperforms others. Table 2. The comparison between different upsampling decoders. † refers to the removal of lateral connection.

4.4

Method

Precision (%) Recall (%) F-measure (%)

bilinear based

87.3

81.3

84.2

transpose convolution based 87.5

81.8

84.5

BDLC†

87.2

81.5

84.3

BDLC

87.9

82.0

84.8

Comparisons with State-of-the-Art Methods

We compare our method with recent state-of-the-art methods on TD500, TotalText and CTW1500. Before this, we conducted qualitative analysis on the detection results of some inconspicuous texts, taking the results of TextBPN [27] as a comparison.

374

H. Xu et al.

Qualitative Analysis. As shown in Fig. 5, some texts with unobvious features are not detected or detected incompletely by TextBPN. In the first row of Fig. 5, the shaded part of the text “GEMBIRA” is not completely detected in TextBPN. The text “the” and “BREWING CO” in the second and third rows are also ignored by TextBPN due to their small scale and surrounded by larger scale text. The long text in the detection results of TextBPN are easily cut off by gaps or icons, such as the baseball representing the letter “O” in the fourth row and the bottle icon in the fifth row. Evaluation on Text Benchmarks. We evaluate the proposed method onTD500, Total-Text and CTW1500 to test its performance for multi-oriented texts, curved texts and long texts. As shown in Table 3, the proposed method

Fig. 5. Detection results of inconspicuous text on Total-Text and CTW1500 dataset. Table 3. Comparison with related methods on TD500, CTW1500 and Total-Text. Method

CTW1500 Total-Text TD500 P(%) R(%) F(%) P(%) R(%) F(%) P(%) R(%) F(%)

TextSnake [13]

69.7

85.3

75.6

82.7

74.5

78.4

83.2

73.9

78.3

TextDragon [4]

84.5

82.8

83.6

85.6

75.7

80.3

-

-

-

TextField [23]

83.0

79.8

81.4

81.2

79.9

80.6

87.4

75.9

81.3

CARFT [1]

86.0

81.1

83.5

87.6

79.9

83.6

88.2

78.2

82.9

LOMO [26]

85.7

76.5

80.8

87.6

79.3

83.3

-

-

-

ContourNet [22] 83.7

84.1

83.9

86.9

83.9

85.4

-

-

-

Boundary [20]

-

-

-

88.9

85.0

87.0

-

-

-

DRRG [28]

85.9

83.0

84.5

86.5

84.9

85.7

88.1

82.3

85.1

FCENet [30]

87.6

83.4

85.5 89.3

82.5

85.8

-

TextBPN [27]

86.5

83.6

85.0

90.7 85.2

87.9 86.6

Ours

87.9 82.0

84.8

88.9

86.2

83.7

-

-

84.5

85.6

88.3 82.9

85.5

TextBFA

375

Fig. 6. Detection results on different benchmarks.

achieves a competitive performance. TextBPN [27] has the best overall performance, which adopts graph convolutional network (GCN) to model the relationship between the pixels and group them into fine representations. In other words, it adds a complex GCN-based post-processing process after pixel-wise prediction. In contrast, our approach has no additional post-processing, but achieves a competitive performance by deeply mining the information of the features themselves. The visualization of the detection results are shown in Fig. 6. Through the analysis of experimental results, we find that some texts with small intervals are easily to be integrated into one long text in feature aggregation, which is what we will focus on in the future.

5

Conclusion

This paper focuses on the detection of visually inconspicuous text in natural scene images. We propose a novel arbitrary-shaped text detection method, which uses spatial aggregation of features to enhance the cohesion between the pixels inside the text. Different from previous text detection methods that directly use FPN [10] for pixel-level prediction, we add the bidirectional feature aggregation network before up-sampling to make full use of higher-level semantic informations. After that, bilateral upsampling decoder with lateral connection is adopted to obtain more accurate pixel-level prediction. The experimental results demonstrate that our method have a good performance on inconspicuous texts. Acknowledgments. This research is supported by the National Natural Science Foundation of China under No. 62106247.

376

H. Xu et al.

References 1. Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection. In: CVPR, pp. 9365–9374 (2019) 2. Ch’Ng, C.K., Chan, C.S., Liu, C.L.: Total-text: toward orientation robustness in scene text detection. Doc. Anal. Recognit. 23(1), 31–52 (2019) 3. Cong, Y., Xiang, B., Liu, W., Yi, M., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: CVPR, pp. 1083–1090 (2012) 4. Feng, W., He, W., Yin, F., Zhang, X.Y., Liu, C.L.: Textdragon: an end-to-end framework for arbitrary shaped text spotting. In: ICCV, pp. 9076–9085 (2019) 5. Gupta, A., Vedaldi, A., Zisserman, A.: Synthetic data for text localisation in natural images. In: CVPR (2016) 6. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016) 7. He, M., Liao, M., Yang, Z., Zhong, H., Bai, X.: Most: a multi-oriented scene text detector with localization refinement. In: CVPR, pp. 8813–8822 (2021) 8. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014) 9. Liao, M., Wan, Z., Yao, C., Chen, K., Bai, X.: Real-time scene text detection with differentiable binarization. In: AAAI, vol. 34, no. 07, pp. 11474–11481 (2020) 10. Lin, T.Y., Dollar, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: CVPR, pp. 2117–2125 (2017) 11. Liu, Y., Jin, L., Zhang, S., Luo, C., Zhang, S.: Curved scene text detection via transverse and longitudinal sequence connection. PR 90, 337–345 (2019) 12. Long, S., Qin, S., Panteleev, D., et al.: Towards end-to-end unified scene text detection and layout analysis. In: CVPR, pp. 1049–1059 (2022) 13. Long, S., Ruan, J., Zhang, W., He, X., Wu, W., Yao, C.: Textsnake: a flexible representation for detecting text of arbitrary shapes. In: ECCV (2018) 14. Nayef, N., Yin, F., Bizid, I., et al.: ICDAR2017 robust reading challenge on multilingual scene text detection and script identification-RRC-MLT. In: ICDAR (2017) 15. Romera, E., Alvarez, J.M., Bergasa, L.M., Arroyo, R.: Erfnet: efficient residual factorized convnet for real-time semantic segmentation. T-ITS 19(1), 1–10 (2017) 16. Shrivastava, A., Gupta, A., Girshick, R.: Training region-based object detectors with online hard example mining. In: CVPR, pp. 761–769 (2016) 17. Tang, J., Yang, Z., Wang, Y., Zheng, Q., Bai, X.: Detecting dense and arbitraryshaped scene text by instance-aware component grouping. PR 96, 106954 (2019) 18. Tang, J., Zhang, W., Liu, H., et al.: Few could be better than all: Feature sampling and grouping for scene text detection. In: CVPR (2022) 19. Wang, F., Chen, Y., Wu, F., Li, X.: Textray: contour-based geometric modeling for arbitrary-shaped scene text detection. In: ACM MM, pp. 111–119 (2020) 20. Wang, H., Lu, P., Zhang, H., et al.: All you need is boundary: toward arbitraryshaped text spotting. In: AAAI (2020) 21. Wang, X., Jiang, Y., Luo, Z., et at.: Arbitrary shape scene text detection with adaptive text region representation. In: CVPR, pp. 6449–6458 (2019) 22. Wang, Y., Xie, H., Zha, Z.J., Xing, M., Fu, Z., Zhang, Y.: Contournet: taking a further step toward accurate arbitrary-shaped scene text detection. In: CVPR, pp. 11753–11762 (2020) 23. Xu, Y., Wang, Y., Zhou, W., et al.: Textfield: learning a deep direction field for irregular scene text detection. TIP 28(11), 5566–5579 (2019)

TextBFA

377

24. Yang, M., Guan, Y., Liao, M., He, X., Bai, X.: Symmetry-constrained rectification network for scene text recognition. In: ICCV (2019) 25. Yu, W., Liu, Y., Hua, W., Jiang, D., Ren, B., Bai, X.: Turning a clip model into a scene text detector. In: CVPR (2023) 26. Zhang, C., Liang, B., Huang, Z., En, M., Han, J., Ding, E., Ding, X.: Look more than once: an accurate detector for text of arbitrary shapes. In: CVPR, pp. 10552– 10561 (2019) 27. Zhang, S., Zhu, X., Yang, C., Wang, H., Yin, X.: Adaptive boundary proposal network for arbitrary shape text detection. In: ICCV (2021) 28. Zhang, S.X., Zhu, X., Hou, J.B., Liu, C., Yang, C., Wang, H., Yin, X.C.: Deep relational reasoning graph network for arbitrary shape text detection. In: CVPR, pp. 9699–9708 (2020) 29. Zheng, T., Fang, H., Zhang, Y., Tang, W., Cai, D.: Resa: recurrent feature-shift aggregator for lane detection. In: AAAI, vol. 35, no. 4, pp. 3547–3554 (2021) 30. Zhu, Y., Chen, J., Liang, L., Kuang, Z., Jin, L., Zhang, W.: Fourier contour embedding for arbitrary-shaped text detection. In: CVPR, pp. 3123–3131 (2021)

Bias Reduced Methods to Q-learning Patig¨ ul Abliz(B) and Shi Ying School of Computer Science, Wuhan University, Wuhan 430072, China [email protected]

Abstract. It is well known that Q-learning (QL) suffers from overestimation bias, which is caused by using the maximum action value to approximate the maximum expected action value. To solve overestimation issue, overestimation property of Q-learning is well studied theoretically and practically. In general, most work on reducing overestimation bias is to find different estimators to replace maximum estimator in order to mitigate the effect of overestimation bias. These works have achieved some improvement on Q-learning. In this work, we still focus on overestimation bias reduced methods. In these methods, we focus on M samples action values, and one of these samples estimated by remaining samples’ maximum actions. We select median and max members from these new samples which are estimated by maximum actions. We call these max and median members as Bias Reduced Max Q-learning (BRMQL) and Bias Reduced Median Q-learning (BRMeQL). We first theoretically prove that BRMQL and BRMeQL suffer from underestimation bias and analyze the effect of number of M Q-functions on the performance of our algorithms. Then we evaluate the BRMQL and BRMeQL on benchmark game environments. At last, we show that BRMQL, and BRMeQL less underestimate the Q-value than Double Q-learning (DQL) and perform better than several other algorithms on some benchmark game environments. Keywords: Bias · Q-learning · Underestimation Bias · Overestimation Bias · Action Value Samples · Double Q-learning Cross-Validation Estimator

1

·

Introduction

Reinforcement learning (RL) concerns sequential decision problems, which are formulated as Markov decision processes (MDPs) [1]. Its goal is to maximize the future cumulative reward signal [2] by mapping states to actions. Q-learning (QL), a useful practical method, was introduced by [3] to address this problem. The convergence of Q-learning to the optimal action value has been shown in [4–6] under the condition that all policies end up in a zero-reward absorbing state. Q-learning has been used to solve a wide variety of practical RL problems, such as meal delivery systems [7], mobile robot navigation [8], and soccer games [9]. Even though Q-learning is a useful method and simple to update [10], it suffers from overestimation bias [11–13]. Overestimation bias causes poor performance


in the learning process, especially in stochastic environments, including non-optimal convergence [11,14] and instability in the learning process [15,16]. To solve overestimation bias, double Q-learning (DQL) [13] uses two action-value estimators Q1(st, at) and Q2(st, at), and Q1(st+1, argmax_a Q2(st+1, a)) is used to update Q-learning, and vice versa. In this way, different estimators are used to evaluate the maximum action and its value, so DQL mitigates the overestimation problem in some settings. Twin delayed deep deterministic policy gradient (TD3) [17] uses two action-value estimators: one of them is used to evaluate the maximum action, and the minimum of the two action values evaluated at that maximum action is used to update the Q-function. TD3 suffers from underestimation bias because of the min operation. Maxmin Q-learning (Maxmin QL) [18] selects the minimum of M action-value functions, and this min estimator is used to update the Q-function; Maxmin QL has the same underestimation problem as TD3. Quasi-Median Q-learning (QMQ) [19] selects the median value of the M action-value estimators instead of the min value proposed in Maxmin QL. QMQ reduces the overestimation bias of Q-learning and mitigates the underestimation issue of DQL, TD3 and Maxmin QL, because the median estimator is a mild estimator. Random Ensembled Q-learning (REDQ) [20] also uses M action-value estimators, as Maxmin QL does; the difference is that REDQ minimizes over a random subset of the M action-value estimators, which controls the overestimation bias and the variance of the Q estimate in a more flexible way. Averaged DQN [21] is based on the average of the M action-value estimates, which leads to a stable training procedure and reduces the approximation-error variance in the target values. Adaptive Ensemble Q-learning (AdaEQ) [22] adapts the ensemble size in ensemble methods such as REDQ, Maxmin Q-learning and Averaged DQN towards minimizing the estimation bias: it drives the bias to be nearly zero, thereby coping with the impact of the time-varying approximation errors. By not fixing the ensemble size M, AdaEQ addresses both overestimation and underestimation bias. Monte Carlo Bias Correction Q-learning (MCBCQL) [23] subtracts a bias estimate from the action-value estimates; it tackles the overestimation bias without using a multiple-estimator scheme or hyperparameter tuning. Heterogeneous Update Strategy Q-learning (HetUp QL) [24] finds the optimal action by enlarging the normalized gap, and controls overestimation or underestimation bias by using the optimal action to decide whether a state-action pair should be overestimated or underestimated; thus HetUp can be a flexible bias-control method.

Most previous works lack consideration of underestimation bias. Besides, both overestimation bias and underestimation bias may improve learning performance empirically, depending on the environment [18]. Thus, reducing overestimation bias through underestimating methods is sometimes desirable. Most previous works aim to solve overestimation bias through the mean or min of M action-value estimators, or through a single estimator, or through two estimators. Moreover, previous methods directly evaluate the maximum action through the Q estimator itself or another Q estimator, which causes


overestimation bias or underestimation bias. Thus, how about evaluating one Q estimator's maximum action through the remaining M − 1 Q estimators, or evaluating the M − 1 Q estimators' maximum actions on the one remaining Q estimator? In this way, the new methods update the Q-function by selecting one member from the newly constructed M − 1 Q estimates. The new methods thus benefit from exploring among M − 1 new Q estimates, which is better than exploring among only M = 2 Q estimators. As a result, the new methods may avoid serious underestimation or overestimation bias problems.

Thus motivated, in this work we focus on reducing overestimation bias through underestimating methods, in particular methods based on M action-value estimators. We describe Bias Reduced Max Q-learning (BRMQL) and Bias Reduced Median Q-learning (BRMeQL), both based on M action-value estimators. In these methods, we randomly choose one estimator that is evaluated at the other M − 1 estimators' maximum actions; evaluating this one estimator at those maximum actions yields M − 1 new estimates. We choose the max and the median of these M − 1 estimates to estimate the maximum action value in Q-learning. Theoretically, BRMeQL and BRMQL suffer from underestimation bias and thus reduce the overestimation bias of Q-learning. To visualize the bias properties of our methods, we test the median of these M − 1 estimates on the Meta-Chain MDP example (Fig. 2), and test the max and median on the Taxi-v3 toy environment (Fig. 4). The test results show that BRMQL underestimates the Q-value less than BRMeQL, and BRMeQL underestimates the Q-value less than DQL. Finally, BRMQL and BRMeQL perform better than some of the compared algorithms on some of the current benchmark game environments.

The contributions of this work are as follows:
1. We theoretically analyze the overestimation problem of the maximum estimator over i.i.d. random variables. We then describe our estimators and prove their underestimation property over i.i.d. random variables. Our estimators reduce overestimation bias through an underestimating method and benefit from the sample size M by exploring among the M − 1 newly constructed action-value estimates; thus they can reduce both underestimation and overestimation bias. The bias properties of our methods differ from each other and depend on the sample size and the specific conditions they meet.
2. We incorporate Q-learning into the above i.i.d. random-variable setting. We then introduce BRMeQL and BRMQL and theoretically prove that they underestimate the Q-value less than DQL; moreover, BRMQL underestimates the Q-value less than BRMeQL. We visualize the bias properties of these algorithms on the Meta-Chain MDP and the Taxi-v3 toy environment.
3. We empirically investigate their performance on four benchmark game environments. Finally, we show that BRMDQN and BRMeDQN perform better than some other methods on some of the current environments.

Paper Outline: The rest of the paper is organized as follows. Section 2 introduces the background used in the following sections. Section 3 presents our methods'


theoretical proofs on i.i.d. random variables and introduces our methods into Q-learning. Section 4 evaluates our methods on several discrete control problems. Section 5 concludes the paper with some future work.

2 Background

In this section, we first introduce the maximum estimator and its bias [12], and then its application in Q-learning. We then introduce the cross-validation estimator [25], which will be useful in the next part.

2.1 Maximum Estimator

Bias in Maximum Estimator. Consider a set of random variables (RVs) U = {U_1, ..., U_M}, M ≥ 2, and let μ_1, ..., μ_M be the finite means of these RVs. The maximum value μ^*(U) is defined as
\mu^*(U) \equiv \max_i \mu_i \equiv \max_i E\{U_i\}.  (1)

Most often the distribution of Ui is unknown, so we get a set of samples S. Now, we consider how best to estimate μ∗ (U ). The maximum estimator (ME) is a common estimator which is used to estimate μ ˆ(S) ≈ μ∗ (U ). When Si ∈ S, μ ˆi is the average of Si . Then ME overestimates the μ∗ averagely [25,26]. Now we depict the reason of overestimation in ME. Let i∗ denote the maximal μ1 , ..., μ ˆM }. Let j ∗ denote the maximal estimated estimated value μ ˆi∗ = max {ˆ ˆi∗ ≤ μj ∗ − μ ˆi∗ ≤ μj ∗ − μ ˆj ∗ value μj ∗ = max {μ1 , ..., μM }. Then we have μi∗ − μ . Then we take expectations on condition μ, we have E[μi∗ − μ ˆi∗ |μ] ≤ E[μj ∗ − μi |μ1 , ..., μM ] = μi , namely μ ˆj ∗ |μ] = 0. Integrating on uncertain μ, we get that E[ˆ ˆi∗ ] ≤ 0. Then ME overestimates conditionally unbiased. At last we have E[μi∗ − μ the μ∗ even if the value estimates are conditionally unbiased. Next part, we introduce Maximum estimator application of Q-learning. Q-learning. Markov Decision process is defined with four tuple (S, A, P, R). S is finite state space including |S| sates, A is finite action space including |A| actions. The state transition function is P : S × A × S ← [0, 1] and its   values is P (s, a, s ) = P rob[st+1 = s |st = s, at = a]. The stochastic reward function is described as R : S × A → R. The reward evaluates the immediate effect of action at , namely the transition from st to st+1 , but does not imply anything ∞ about its long-term effects. The agent goal is to maximize the object E[ t=1 γ t−1 R(st , at )], where γ ∈ (0, 1) is a discount factor. γ gives a trade-off to the importance of delayed and immediate rewards. A similar optimal state-action value function is defined as Q∗ : S × A → R for all (s, a) ∈ S × A and given as follows: 

Q∗ (st , at ) = E[R(st , at ) + γ max Q∗ (st+1 , a )] 

(2)

a

where s_{t+1} is the next state, drawn from the distribution P(s_t, a_t, ·). Equation (2) is the Bellman optimality equation.
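To make the overestimation of the maximum estimator concrete, the short simulation below is our own illustration (not code from the paper); the Gaussian noise, sample sizes, and variable names are assumptions chosen only for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
M, n_samples, n_trials = 8, 10, 10_000
mu = np.zeros(M)                 # all true means are 0, so mu* = max_i mu_i = 0

me_estimates = []
for _ in range(n_trials):
    # each variable U_i is observed through n_samples noisy samples
    samples = rng.normal(loc=mu, scale=1.0, size=(n_samples, M))
    mu_hat = samples.mean(axis=0)         # unbiased estimate of each mu_i
    me_estimates.append(mu_hat.max())     # maximum estimator of mu*

print("true mu*               :", mu.max())
print("mean of ME over trials :", np.mean(me_estimates))  # clearly positive
```

Even though every per-variable estimate is unbiased, the average of the maximum clearly exceeds μ_* = 0, which is exactly the effect analysed above.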

2.2 Cross-Validation Estimator

We now give some definitions that will be useful in what follows, using the random variables introduced in the maximum estimator section. The optimal indices set of the RVs U is defined as

\mathcal{O}(U) \equiv \{\, i \mid \mu_i = \mu_* \,\} \qquad (3)

and the maximal indices set of a sample set S as

\mathcal{M}(S) \equiv \Big\{\, i \mid \hat{\mu}_i(S) = \max_j \hat{\mu}_j(S) \,\Big\} \qquad (4)

Note that these two sets may differ. In fact, μ_* can be written as a weighted average of the means of all optimal variables:

\mu_* = \frac{1}{|\mathcal{O}(U)|} \sum_{i=1}^{M} I\big(i \in \mathcal{O}(U)\big)\, \mu_i \qquad (5)

where I is the indicator function. Although O(U) and the μ_i are unknown, we can approximate them with M(C) and μ̂_i(D) for sample sets C and D, obtaining

\mu_* \approx \hat{\mu}_* \equiv \frac{1}{|\mathcal{M}(C)|} \sum_{i=1}^{M} I\big(i \in \mathcal{M}(C)\big)\, \hat{\mu}_i(D) \qquad (6)

If C = D = S, this reduces to the maximum estimator μ̂_*^{ME}. With two disjoint sample sets it is called the two-fold cross-validation estimator, and splitting the data into K folds gives the K-fold cross-validation estimator. The cross-validation estimator (CVE) incurs underestimation bias [27].
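The following sketch mirrors the cross-validation estimator of Eq. (6) under our own illustrative choices (Gaussian samples, one variable with a larger mean): the maximal indices are selected on one fold C and their values are read off on a disjoint fold D.

```python
import numpy as np

rng = np.random.default_rng(1)
M, n, n_trials = 8, 20, 10_000
mu = np.array([0.3] + [0.0] * (M - 1))        # true means, mu* = 0.3

cve_estimates = []
for _ in range(n_trials):
    samples = rng.normal(mu, 1.0, size=(n, M))
    C, D = samples[: n // 2], samples[n // 2 :]               # two disjoint folds
    winners = np.flatnonzero(C.mean(0) == C.mean(0).max())    # maximal indices set M(C)
    cve_estimates.append(D.mean(0)[winners].mean())           # evaluate them on D, Eq. (6)

print("mu* =", mu.max(), " mean CVE =", np.mean(cve_estimates))
# the mean stays below mu* = 0.3, illustrating the underestimation bias of the CVE
```

Because index selection and value evaluation use disjoint data, whenever a sub-optimal index is occasionally selected the estimate drops below μ_*.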

3 Bias Reduced Methods to Q-learning

In this section, we first show that our estimators suffer from underestimation bias and give the theoretical analysis; we then introduce our estimators to Q-learning.

3.1 Underestimation Bias of Our Estimators

We first introduce some notation. Each sample set S is split into K disjoint sets S^k, and we define μ̂_i^k ≡ μ̂_i(S^k). For each k ∈ [K] we construct an indices set â^k and a value set v̂^k: the K − 1 sets S \ S^k are used to determine the indices, â_i^k ≡ μ̂_i(S \ S^k), and the remaining set is used to build the values, v̂_i^k ≡ μ̂_i(S^k) ≡ μ̂_i^k. It is obvious that E[â_i^k] = E[v̂_i^k] = μ_i. From expression (4) in Sect. 2.2, M^k = M(S \ S^k) is the maximal indices set, and the value of these indices is

\hat{\mu}^k_* \equiv \frac{1}{|\mathcal{M}^k|} \sum_{i \in \mathcal{M}^k} \hat{v}^k_i \qquad (7)


Averaging over the K folds gives the K-fold cross-validation estimator:

\hat{\mu}^{CV}_* \equiv \frac{1}{K} \sum_{k=1}^{K} \hat{\mu}^k_* = \frac{1}{K} \sum_{k=1}^{K} \frac{1}{|\mathcal{M}^k|} \sum_{i \in \mathcal{M}^k} \hat{v}^k_i \qquad (8)

Lemma 1. If E[μ̂_i^k | U] = μ_i (the estimates are unbiased), then E[μ̂_*^{CV} | U] is negatively biased, namely E[μ̂_*^{CV} | U] ≤ μ_*.

Lemma 1 states that the CV estimator is negatively biased. We now give Theorem 1, which provides the theoretical foundation showing that our estimators suffer from underestimation bias.

Theorem 1. Let μ̂^{K_1} = {μ̂_1^{K_1}, ..., μ̂_M^{K_1}}, ..., μ̂^{K_K} = {μ̂_1^{K_K}, ..., μ̂_M^{K_K}} be K disjoint estimator sets. For any K_i ∈ [K], the remaining K − 1 sets are used to build maximal indices, namely j_p ∈ arg max_j μ̂_j^{K_p} for K_p ∈ [K] \ K_i; if there are multiple such j_p for some K_p, we randomly choose one. We evaluate each index on μ̂^{K_i}; then μ̂_{j_p}^{K_i} is negatively biased.

Theorem 1 implies that our estimators suffer from underestimation bias. We now prove it.

Proof of Theorem 1. For any K_p ∈ [K] \ K_i and j_p ∈ M^{K_i}, where M^{K_i} = M(S \ S^{K_i}), we have

\mathbb{E}\big[\hat{\mu}^{K_i}_{j_p}\big]
  = P(j_p \in \mathcal{O})\,\mathbb{E}\big[\hat{\mu}^{K_i}_{j_p} \mid j_p \in \mathcal{O}\big] + P(j_p \notin \mathcal{O})\,\mathbb{E}\big[\hat{\mu}^{K_i}_{j_p} \mid j_p \notin \mathcal{O}\big]
  = P(j_p \in \mathcal{O})\,\max_{j_p}\mathbb{E}\{U_{j_p}\} + P(j_p \notin \mathcal{O})\,\mathbb{E}\big[\hat{\mu}^{K_i}_{j_p} \mid j_p \notin \mathcal{O}\big]
  \le P(j_p \in \mathcal{O})\,\max_{j_p}\mathbb{E}\{U_{j_p}\} + P(j_p \notin \mathcal{O})\,\max_{j_p}\mathbb{E}\{U_{j_p}\}
  = \max_{j_p}\mathbb{E}\{U_{j_p}\} \qquad (9)

Thus E[μ̂_{j_p}^{K_i}] ≤ max_{j_p} E{U_{j_p}} = μ_*, with strict inequality if and only if there exists a j_p ∉ O such that P(j_p ∈ M^{K_i}) > 0. The result holds for all K_p ≠ K_i, K_p ∈ [K], so we obviously have

\mathbb{E}\Big[\max_{j_p} \hat{\mu}^{K_i}_{j_p}\Big] \le \max_i \mathbb{E}\{U_i\} = \mu_* \qquad (10)
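A sketch of the construction used in Theorem 1 (our own illustrative code, not the authors'): each of the other folds proposes a maximal index, and those indices are evaluated on the held-out fold K_i. BRMQL and BRMeQL then take the max or the median of the resulting values.

```python
import numpy as np

rng = np.random.default_rng(2)
M, K, n, n_trials = 8, 4, 40, 20_000
mu = np.array([0.3] + [0.0] * (M - 1))            # mu* = 0.3

vals = []
for _ in range(n_trials):
    samples = rng.normal(mu, 1.0, size=(n, M))
    folds = np.array_split(samples, K)            # K disjoint sets S^k
    i = rng.integers(K)                           # held-out fold K_i
    for p in range(K):
        if p != i:
            j_p = int(np.argmax(folds[p].mean(0)))   # maximal index proposed by fold K_p
            vals.append(folds[i].mean(0)[j_p])       # its value on the held-out fold

print("mu* =", mu.max(), " mean of evaluated values =", np.mean(vals))
# the mean stays below mu*, illustrating the negative bias claimed in Theorem 1
```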

3.2 Our Proposed Methods to Q-learning

Now we introduce our methods. First, we store M Q-function estimators Q^1, Q^2, ..., Q^M, and the i-th member is selected uniformly at random from them. Unlike DQL, we use each of the remaining M − 1 estimators to evaluate the next-state maximum actions, namely {a*_j | arg max_a Q^j(s_{t+1}, a), j ∈ [M] \ i}; if a Q-function has multiple maximum actions, we randomly choose one. Recall that N actions are available at state s_{t+1} for the M − 1 Q-function estimators. We evaluate each of these maximum actions through {Q^i(s_{t+1}, a*_j) | j ∈ [M] \ i}, which gives M − 1 action values at state s_{t+1}. We then select the max or the median of these M − 1 next-state action values {Q^i(s_{t+1}, a*_j), j ∈ [M] \ i}.

Next, we mathematically show that our method underestimates less than the Double Estimator (DE). We use the DE as the baseline estimator that our method aims to approximate, because the bias between our method and the DE is easy to calculate and the DE is known to underestimate the Q-value. If the bias between our method and the DE is greater than 0, our method underestimates the Q-value less than the DE; otherwise, it underestimates more. Here, we visualize the bias of the median action values (BRMeQL) on a simple Meta-chain MDP game. The bias between the Q^i(s_{t+1}, a*_j) estimator and the DE is defined as

B_j = Q^i(s_{t+1}, a^*_j) - Q^{DE}(s_{t+1}, a^*_j), \qquad (11)

where B_j, j ∈ [M] \ i, may follow different distributions. Here we model them as independent uniform random variables on the interval [−ε, ε] (ε > 0); we borrow this uniform, independent assumption from [11]. Q^{DE}(s_{t+1}, a*_j) denotes the Double Estimator. In the following we show that the max of B_j, j ∈ [M] \ i, reduces overestimation in the Q-value function and underestimates the Q-value less than the DE. We choose the max of the B_j and compute its bias relative to the DE because all other members of {B_j, j ∈ [M] \ i} are no larger than the max, so we can indirectly infer whether they underestimate the Q-value more or less than the max does.

Theorem 2. When B_j, j ∈ [M] \ i, are independent uniform random variables on [−ε, ε] (ε > 0), then

\mathbb{E}\Big[\max_{j \in [M]\setminus i} B_j\Big] = \frac{\epsilon\,[N(M-1)-1]}{N(M-1)+1} \qquad (12)

Here, we introduce Lemma 2 as a tool to prove Theorem 2.

Lemma 2. Let S_1, ..., S_M be M i.i.d. random variables from an absolutely continuous distribution with CDF F(s) and PDF f(s), and denote μ ≝ E[S_i] and σ² ≝ Var[S_i] < +∞. Arranging S_1, ..., S_M in non-decreasing order gives S_(1) ≤ S_(2) ≤ ... ≤ S_(M). Obviously S_(M) is the maximum of {S_1, ..., S_M}; denoting its CDF and PDF by F_{S_(M)}(s) and f_{S_(M)}(s) respectively, we have

F_{S_{(M)}}(s) = F(s)^M \qquad (13)
f_{S_{(M)}}(s) = M\, f(s)\, F(s)^{M-1} \qquad (14)


Proof of Lemma 2. Consider the CDF of S_(M): F_{S_(M)}(s) = P(S_(M) ≤ s) = P(S_1 ≤ s, ..., S_M ≤ s) = P(S_1 ≤ s) ··· P(S_M ≤ s) = F(s)^M. The PDF of S_(M) is then f_{S_(M)}(s) = dF_{S_(M)}(s)/ds = M f(s) F(s)^{M−1}. We now prove Theorem 2.

Proof of Theorem 2. Let B(x) and b(x) denote the CDF and PDF of B_j, and let B_(M)(x) and b_(M)(x) denote the CDF and PDF of max_{j∈[M]\i} B_j. Since the B_j are sampled from Uniform(−ε, +ε), we have b(x) = 1/(2ε) and B(x) = 1/2 + x/(2ε). By Lemma 2, B_(M)(x) = [1/2 + x/(2ε)]^{M−1} and b_(M)(x) = (M−1) b(x) B(x)^{M−2} = [(M−1)/(2ε)] [1/2 + x/(2ε)]^{M−2}. The expected bias is then

\mathbb{E}\Big[\max_{j \in [M]\setminus i} B_j\Big]
  = \mathbb{E}\Big[\max_{j \in [M]\setminus i} Q^i(s_{t+1}, a^*_j) - Q^{DE}(s_{t+1}, a^*_j)\Big]
  = \int_{-\epsilon}^{+\epsilon} (M-1)\, x\, b(x)\, B(x)^{M-2}\, dx
  = \int_{-\epsilon}^{+\epsilon} N(M-1)\, x\, \frac{1}{2\epsilon} \Big[\frac{1}{2} + \frac{x}{2\epsilon}\Big]^{N-1} \Big(\Big[\frac{1}{2} + \frac{x}{2\epsilon}\Big]^{N}\Big)^{M-2} dx
  = \frac{\epsilon\,[N(M-1)-1]}{N(M-1)+1} \qquad (15)

In Theorem 2, ε[N(M−1)−1] / [N(M−1)+1] ≥ 0, which implies that the bias between BRMQL and the DE is greater than 0; thus BRMQL's value is greater than the DE's value. Even though BRMQL suffers from underestimation bias theoretically, it does not underestimate the Q-value as much as the DE. In this way, BRMQL is a less-underestimating method and therefore a natural first choice for reducing overestimation bias in Q-learning. When the number of available actions N is fixed, the bias between BRMQL and the DE increases with M, which implies that BRMQL's underestimation bias decreases with increasing sample size M.
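A quick Monte Carlo check of Eq. (15) (our own illustrative script): following the proof, the maximisation is treated as a maximum over N(M − 1) i.i.d. Uniform(−ε, ε) variables, and the empirical mean is compared with the closed form ε[N(M−1)−1]/[N(M−1)+1]. The values of ε, N and M are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(3)
eps, N, M, n_trials = 0.5, 3, 5, 200_000
K = N * (M - 1)                                   # number of uniform variables in the proof

empirical = rng.uniform(-eps, eps, size=(n_trials, K)).max(axis=1).mean()
closed_form = eps * (K - 1) / (K + 1)

print(f"empirical   E[max B_j] = {empirical:.4f}")
print(f"closed form (Eq. 15)   = {closed_form:.4f}")
```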

Fig. 1. A simple Meta-chain MDP game with four states. This MDP is commonly used to illustrate the over- or under-estimation bias of different estimators.

We visualize the bias of BRMeQL on the MDP example in Fig. 2; the MDP itself is shown in Fig. 1. This simple game has four states: two non-terminal states A_1 and A_2 and two terminal states A_3 and A_4. Every episode starts in A_1, with a choice between the right action and the left action. The right action leads to the terminal state A_3 with zero return; the left action leads to A_2 with zero return. From A_2, every left action results in immediate termination with a return drawn from a Gaussian N(−0.1, 1). Thus, in any trajectory, the left actions yield an expected return of −0.1, and the optimal policy selects the left action from A_1 only 5% of the time in this simple game.

Fig. 2. Maximization bias example. Percentage of left actions taken by QL, DQL, and BRMeQL with parameter settings ε = 0.1, α = 0.1, and γ = 1.

From Theorems 1 and 2 and from the MDP example above, we obtain the following Theorems 3 and 4.

Theorem 3. Under the conditions stated above we have:

bias\big(Q^{DE}(s_{t+1}, \hat{a}^*_j)\big) \le bias\Big(\max_{j \in [M]\setminus i} Q^i(s_{t+1}, \hat{a}^*_j)\Big) \le bias\Big(\max_a Q(s_{t+1}, a)\Big) \qquad (16)

Theorem 4. Under the conditions stated above we have:

bias\big(Q^{DE}(s_{t+1}, \hat{a}^*_j)\big) \le bias\Big(\operatorname*{median}_{j \in [M]\setminus i} Q^i(s_{t+1}, \hat{a}^*_j)\Big) \le bias\Big(\max_{j \in [M]\setminus i} Q^i(s_{t+1}, \hat{a}^*_j)\Big) \qquad (17)

Figure 2 indicates that BRMeQL's bias differs for different M, and that its underestimation bias decreases as M increases. In this example, for small M BRMeQL converges to the optimal policy quickly, which implies that it mitigates overestimation efficiently. Thus, BRMeQL introduces a different bias into the Q-value function for different M.


Algorithm 1. BRMQL
Input: step size α, exploration parameter ε > 0.
Initialize M action-value functions {Q^1, Q^2, ..., Q^M} randomly. Observe the initial state s_t.
for t = 0, ..., T in each episode do
    From s_t, choose action a_t using the ε-greedy policy on Σ_{i=1}^M Q^i(s_t, a)
    Take action a_t, observe s_{t+1}, r_t
    From {Q^1, Q^2, ..., Q^M}, uniformly select an estimator Q^i
    Note {a*_j | arg max_a Q^j(s_{t+1}, a), j ∈ [M] \ i}
    Note Q̂(s_{t+1}, a*_j) = max_{j ∈ [M]\i} Q^i(s_{t+1}, a*_j)
    Update: randomly choose any k̃ ∈ [M]
        Q^k̃(s_t, a_t) ← (1 − α_t) Q^k̃(s_t, a_t) + α_t [r_t + γ Q̂(s_{t+1}, a*_j)]
    s_t ← s_{t+1}
end for
Return {Q^1, Q^2, ..., Q^M}
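A compact tabular rendering of Algorithm 1 in Python (our own sketch, not the authors' code; a gymnasium-style discrete environment and the shown hyper-parameters are assumptions). Passing use_median=True gives the BRMeQL variant.

```python
import numpy as np

def brmql(env, M=4, episodes=500, alpha=0.1, eps=0.1, gamma=1.0, use_median=False):
    """Tabular BRMQL/BRMeQL; env is assumed to expose reset()/step() and
    discrete observation_space.n / action_space.n (gymnasium-style API)."""
    rng = np.random.default_rng(0)
    nS, nA = env.observation_space.n, env.action_space.n
    Q = rng.normal(0.0, 0.01, size=(M, nS, nA))     # M action-value tables
    reduce_fn = np.median if use_median else np.max

    for _ in range(episodes):
        s, _ = env.reset()
        done = False
        while not done:
            # eps-greedy on the sum of the M estimators
            a = env.action_space.sample() if rng.random() < eps \
                else int(np.argmax(Q[:, s].sum(axis=0)))
            s2, r, terminated, truncated, _ = env.step(a)
            done = terminated or truncated

            i = rng.integers(M)                      # estimator to be evaluated
            a_star = [int(np.argmax(Q[j, s2])) for j in range(M) if j != i]
            q_hat = reduce_fn([Q[i, s2, aj] for aj in a_star])

            k = rng.integers(M)                      # randomly chosen table to update
            target = r if terminated else r + gamma * q_hat
            Q[k, s, a] += alpha * (target - Q[k, s, a])
            s = s2
    return Q
```

For example, brmql(gym.make("Taxi-v3"), M=4, use_median=True) would reproduce the kind of tabular BRMeQL run used in the Taxi-v3 bias experiment.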

From the results of Fig. 2 and Theorems 2, 3, and 4, we conclude that BRMQL underestimates the Q-value less than the DE and BRMeQL, so BRMQL can serve as a less-underestimating, overestimation-reducing method for Q-learning. Theoretically, the biases of BRMQL and BRMeQL lie between those of DQL and QL and are related to the sample size M, so they can act as flexible bias-control methods for Q-learning. We give our algorithm in Algorithm 1; it describes BRMQL, and BRMeQL is obtained by replacing the max operator with the median operator. Writing Q̂(s_{t+1}, a*_j) for the selected value, we incorporate it into Q-learning with the update rule

\hat{Q}(s_{t+1}, a^*_j) = \max_{j \in [M]\setminus i} Q^i(s_{t+1}, a^*_j) \qquad (18)

We randomly choose any k̃ ∈ [M] and update

Q^{\tilde{k}}(s_t, a_t) \leftarrow (1 - \alpha_t)\, Q^{\tilde{k}}(s_t, a_t) + \alpha_t \big[r_t + \gamma\, \hat{Q}(s_{t+1}, a^*_j)\big] \qquad (19)

4 Experiment

In this section, we first evaluate our algorithms on the benchmark environments Catcher, Pixelcopter, LunarLander, and SpaceInvaders [28–30]. We use the open-source code of Maxmin Q-learning (Lan & Pan [30]) and reuse its hyper-parameters, neural network settings, and update, training, and test rules. On Catcher and Pixelcopter the learning rate is chosen from [10^{-3}, 3×10^{-4}, 10^{-4}, 3×10^{-5}, 10^{-5}]; on LunarLander and SpaceInvaders it is chosen from [3×10^{-3}, 10^{-3}, 3×10^{-4}, 10^{-4}, 3×10^{-5}].

Fig. 3. Performance of DQN, DDQN, MaxminDQN, AveragedDQN, BRMeDQN, and BRMDQN on Catcher, Pixelcopter, LunarLander, and SpaceInvaders. Results are averaged over 20 runs on Catcher, Pixelcopter, and LunarLander. First row, plots (a, b, c): DQN, DDQN, AveragedDQN, BRMeDQN, and BRMDQN on Catcher, Pixelcopter, and LunarLander. Second row, plots (d, e, f): MaxminDQN, BRMeDQN, and BRMDQN on Catcher, Pixelcopter, and LunarLander. Third row: plot (g) shows DQN, DDQN, AveragedDQN, and BRMeDQN on SpaceInvaders; plot (h) shows MaxminDQN, BRMeDQN, and BRMDQN on SpaceInvaders; plot (i) shows AveragedDQN, BRMeDQN, and BRMDQN on SpaceInvaders.

We compare BRMDQN and BRMeDQN with DQN, DDQN, MaxminDQN, and AveragedDQN. For the latter four algorithms, M is selected from [2, 3, 4, 5, 6, 7, 8, 9]; for BRMDQN and BRMeDQN, M is selected from [3, 4, 5, 6, 7, 8]. The results are shown in Fig. 3. Figure 3(a–f) shows that BRMDQN performs better than DQN, DDQN, AveragedDQN, and BRMeDQN on Catcher, Pixelcopter, and LunarLander, while MaxminDQN performs better than BRMDQN on these three games. When the environments

become more complicated, as on SpaceInvaders, Fig. 3(g–i) shows that the classic DDQN performs better than DQN, and DQN performs better than BRMDQN, BRMeDQN, AveragedDQN, and MaxminDQN. Figure 3(h, i) shows that BRMeDQN and BRMDQN perform better than MaxminDQN and AveragedDQN on SpaceInvaders. MaxminDQN contains a min operator and therefore suffers from underestimation bias, so its performance degrades as the environments become more complex. Also, BRMeDQN performs better than BRMDQN on SpaceInvaders. BRMDQN and BRMeDQN converge as fast as the other algorithms, especially on Copter. Our algorithms' performance is stable on these four games, and they perform better than the other algorithms on most of them. One explanation is that BRMDQN and BRMeDQN reduce both underestimation and overestimation bias, which is also supported by Theorems 1 and 2. We conclude that BRMDQN and BRMeDQN can be good estimators for Q-learning on most of the four games.

Next, Table 1 gives the final-step averaged return of BRMDQN and BRMeDQN on Copter and Catcher, to show how M affects their performance. Table 2 shows the best parameters for the different algorithms on the benchmark environments, further confirming that M is related to the algorithms' performance. Table 3 reports the memory and time consumed by the different algorithms, and Table 4 shows how M affects the bias properties of BRMQL and BRMeQL on the MDP and Taxi-v3. Table 1 shows that at the last step of the games, BRMDQN and BRMeDQN reach different final averaged returns, and the ordering of the final averaged returns is not linearly correlated with the ordering of M. We conclude that the performance of BRMDQN and BRMeDQN depends on M and on the environment they interact with.

Table 1. Performance of the algorithms for different M

            Catcher                       Copter
            M   Final Averaged Return     M   Final Averaged Return
BRMDQN      3   47.355                    4   17.316
            5   51.184                    6   16.895
            7   43.615                    8   17.790
BRMeDQN     4   45.575                    3    6.755
            6   38.940                    5    8.545
            8   45.750                    7    9.810

Table 2 lists the best parameters for the different algorithms. It shows that the performance of each algorithm depends on the number of action-value functions M and on the learning rate. Every algorithm reaches its best performance at a certain M, and the best M is different for

each algorithm. Finally, the results of Table 2 and Fig. 3 confirm our earlier claim that the performance of BRMDQN and BRMeDQN on the benchmark environments depends on the conditions they meet and is related to M.

Table 2. Best parameters (number of Q-functions M and learning rate) of DQN, DDQN, AveragedDQN, MaxminDQN, BRMeDQN, and BRMDQN on Catcher, Copter, LunarLander, and SpaceInvaders

Table 3 reports the memory and time consumed by each algorithm over all episodes of each game (each algorithm is run 20 times per game, and memory and time are averaged over the 20 runs). Table 3 shows that BRMDQN's memory usage is almost the same as that of DQN, DDQN, and AveragedDQN on Catcher, Copter, and SpaceInvaders, while it is lower on LunarLander. BRMeDQN's memory usage is almost the same as that of DQN, DDQN, and AveragedDQN on Catcher and Copter, and lower on LunarLander. MaxminDQN uses more memory than DQN, DDQN, AveragedDQN, and BRMDQN on Copter, LunarLander, and SpaceInvaders, and more than BRMeDQN on Copter and LunarLander. The running times of BRMDQN and BRMeDQN have no advantage over those of DQN, DDQN, and AveragedDQN; however, BRMDQN is faster than MaxminDQN on all four games, and BRMeDQN is faster than MaxminDQN on Copter and SpaceInvaders. Even though MaxminDQN performs best on the first three games, it is memory- and time-consuming on all four. Our methods' time and memory are almost the same as those of DQN, DDQN, and AveragedDQN, and their performance improves as the complexity of the environment increases. All in all, each algorithm's memory and time depend on the environment, and performance is not linearly related to the memory and time consumed.


Table 3. Memory and time consumed by the different algorithms

Catcher        DQN      DDQN     AveragedDQN  MaxminDQN  BRMDQN   BRMeDQN
Memory         274.60   274.41   274.77       273.22     277.13   278.85
Time           123.10   119.81   197.77       178.89     261.79   117.46

Copter         DQN      DDQN     AveragedDQN  MaxminDQN  BRMDQN   BRMeDQN
Memory         289.23   288.80   288.84       330.89     291.03   293.05
Time           83.00    93.65    174.41       97.17      165.89   78.34

Lunarlander    DQN      DDQN     AveragedDQN  MaxminDQN  BRMDQN   BRMeDQN
Memory         426.65   426.72   421.24       458.32     396.05   397.76
Time           87.92    84.88    97.21        106.15     159.29   82.62

SpaceInvaders  DQN      DDQN     AveragedDQN  MaxminDQN  BRMDQN   BRMeDQN
Memory         1024.95  1031.01  1032.03      1061.27    1040.16  1064.25
Time           2602.45  2036.95  2002.73      1030.51    1123.96  1459.4

We then test the bias of our algorithms on the Taxi-v3 toy environment [31]. The Taxi-v3 environment (taxi-cab navigation, shown inset in Fig. 4) has four locations labelled by different letters. The agent's goal is to pick up a passenger and drop them off at the designated location. The agent receives a penalty of one point for every timestep, earns 20 points for a successful drop-off, and loses 10 points for illegal drop-off or pickup actions. In the Taxi-v3 environment, the learning rate is α_t = 0.1, ε = 0.1, and the discount factor is γ = 1. Figure 4 shows the bias comparison of the different algorithms. In Fig. 4(a), (b), (c), BRMeQL's bias lies between QL's and DQL's; in Fig. 4(d), (e), QL's bias lies between BRMeQL's and BRMQL's. This implies that BRMeQL does not suffer from overestimation bias, whereas BRMQL does. From Fig. 4 we conclude that BRMeQL and BRMQL reduce underestimation bias both theoretically and empirically; however, because of the inherent property of the max operation and its interaction with the sample size, BRMQL suffers from overestimation bias. Even so, its performance in Fig. 3 is good on most games. BRMeQL's bias is mild and it reduces both overestimation and underestimation bias effectively, yet BRMeDQN does not perform better than BRMDQN on most games. We therefore conclude that in practice it is hard to find a perfect estimator for Q-learning: the properties of the different algorithms depend on the sample size, the environment, and the algorithms themselves, so several aspects must be considered when choosing an estimator for Q-learning. Table 4 shows how the bias properties of our algorithms change with M. It shows that BRMQL suffers greater overestimation


Fig. 4. Frequency distribution histograms of the Q-value estimates, and their means, for the different methods over 40k episodes. The Q-value estimate is the final Q-value of each method in every episode, computed by averaging over the learned Q-values. (a) QL compared with DQL. (b) BRMeQL (M = 4) compared with DQL. (c) BRMeQL (M = 4) compared with QL. (d) BRMeQL (M = 4) compared with BRMQL (M = 4). (e) BRMQL (M = 4) compared with QL.

bias than QL, and its overestimation bias increases with M on both the MDP and Taxi-v3; however, as M grows the bias stabilises, i.e. the bias at the current M differs little from that at the previous M (e.g. M = 6 vs. M = 8). BRMeQL's bias lies between DQL's and QL's: on Taxi-v3 it increases with M but never becomes overestimation, while on the MDP it remains between DQL's and QL's and decreases with M. We conclude that the bias properties of the different algorithms depend on the environment they meet.


Table 4. Bias properties of BRMeQL and BRMQL

MDP
Number of Q-functions   BRMQL's bias         BRMeQL's bias
M = 4                   > QL                 > DQL, < QL
M = 6                   > BRMQL(M = 4)       > BRMeQL(M = 4), ≤ QL
M = 8                   ≥ BRMQL(M = 6)       ≥ BRMeQL(M = 4), < BRMeQL(M = 6)

Taxi-v3
Number of Q-functions   BRMQL's bias         BRMeQL's bias
M = 4                   > QL                 > DQL, < QL
M = 6                   > BRMQL(M = 4)       > BRMeQL(M = 4), < QL
M = 8                   ≥ BRMQL(M = 6)       > BRMeQL(M = 6), < QL

5 Conclusion and Future Work

Overestimation bias in Q-learning harms the learning process, especially when the agent interacts with stochastic environments, and various algorithms have been proposed to address it. In this paper, we introduced the less-underestimating methods BRMeQL and BRMQL to help Q-learning reduce overestimation bias. Our methods are parameterised by the number M of Q-functions. We mathematically proved that they suffer from underestimation bias and therefore help Q-learning reduce overestimation bias. On the Meta-chain MDP and the Taxi-v3 toy environment, we visualised the bias properties of BRMeQL and BRMQL, and the results support the related theorems. We showed, in tabular form, how the algorithms' performance depends on the parameters, that performance is not linearly related to the time and memory each algorithm consumes, and that time and memory differ across environments. Finally, we evaluated BRMDQN and BRMeDQN on four benchmark environments; the results show that their performance is more stable than that of the other algorithms. They also show that performance depends not only on the inherent properties of the algorithms themselves but also on the complexity of the environments they interact with, so it is hard to find a single perfect estimator for Q-learning. As future work, we plan to extend our methods to continuous reinforcement learning problems such as the MuJoCo environments, and to study the bias properties of members other than the max and the median.

Acknowledgments. The authors wish to thank the referees for their helpful comments.


References
1. Szepesvári, C.: Reinforcement Learning Algorithms for MDPs. Wiley Encyclopedia of Operations Research and Management Science (2011)
2. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction (2018)
3. Watkins, C.J., Dayan, P.: Q-learning. Mach. Learn. 8(3–4), 279–292 (1992)
4. Tsitsiklis, J.N.: Asynchronous stochastic approximation and Q-learning. Mach. Learn. 16(3), 185–202 (1994)
5. Jaakkola, T., Jordan, M.I., Singh, S.P.: On the convergence of stochastic iterative dynamic programming algorithms. Neural Comput. 6(6), 1185–1201 (2014)
6. Zhang, Y., Qu, G., Xu, P., Lin, Y., Chen, Z., Wierman, A.: Global convergence of localized policy iteration in networked multi-agent reinforcement learning. Proc. ACM Meas. Anal. Comput. Syst. 7(1), 13:1–13:51 (2023). https://doi.org/10.1145/3579443
7. Jahanshahi, H., et al.: A deep reinforcement learning approach for the meal delivery problem. Knowl.-Based Syst. 243, 108489 (2022)
8. de Moraes, L.D., et al.: Double deep reinforcement learning techniques for low dimensional sensing mapless navigation of terrestrial mobile robots (2023). CoRR abs/2301.11173. arXiv:2301.11173. https://doi.org/10.48550/arXiv.2301.11173
9. Brandao, B., de Lima, T.W., Soares, A., Melo, L.C., Máximo, M.R.O.A.: Multiagent reinforcement learning for strategic decision making and control in robotic soccer through self-play. IEEE Access 10, 72628–72642 (2022). https://doi.org/10.1109/ACCESS.2022.3189021
10. Sutisna, N., Ilmy, A.M.R., Syafalni, I., Mulyawan, R., Adiono, T.: FARANE-Q: fast parallel and pipeline Q-learning accelerator for configurable reinforcement learning SoC. IEEE Access 11, 144–161 (2023). https://doi.org/10.1109/ACCESS.2022.3232853
11. Thrun, S., Schwartz, A.: Issues in using function approximation for reinforcement learning. In: Proceedings of the Fourth Connectionist Models Summer School, Hillsdale, NJ, pp. 255–263 (1993)
12. Smith, J.E., Winkler, R.L.: The optimizer's curse: skepticism and post-decision surprise in decision analysis. Manage. Sci. 52(3), 311–322 (2006)
13. Hasselt, H.: Double Q-learning. Adv. Neural Inf. Process. Syst. 23, 2613–2621 (2010)
14. Devraj, A.M., Meyn, S.P.: Zap Q-learning. In: Guyon, I., et al. (eds.) Advances in Neural Information Processing Systems 30, Long Beach, CA, USA, pp. 2235–2244 (2017). https://proceedings.neurips.cc/paper/2017/hash/4671aeaf49c792689533b00664a5c3ef-Abstract.html
15. Cini, A., D'Eramo, C., Peters, J., Alippi, C.: Deep reinforcement learning with weighted Q-learning. CoRR abs/2003.09280 (2020). arXiv:2003.09280
16. Lale, S., Shi, Y., Qu, G., Azizzadenesheli, K., Wierman, A., Anandkumar, A.: KCRL: Krasovskii-constrained reinforcement learning with guaranteed stability in nonlinear dynamical systems. CoRR abs/2206.01704 (2022). arXiv:2206.01704. https://doi.org/10.48550/arXiv.2206.01704
17. Fujimoto, S., Hoof, H., Meger, D.: Addressing function approximation error in actor-critic methods. In: International Conference on Machine Learning, PMLR, pp. 1587–1596 (2018)
18. Lan, Q., Pan, Y., Fyshe, A., White, M.: Maxmin Q-learning: controlling the estimation bias of Q-learning. In: 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, 26–30 April 2020. OpenReview.net (2020). https://openreview.net/forum?id=Bkg0u3Etwr
19. Wei, W., Zhang, Y., Liang, J., Li, L., Li, Y.: Controlling underestimation bias in reinforcement learning via quasi-median operation. In: Proceedings of the 36th Conference on Innovative Applications of Artificial Intelligence, AAAI 2022, Virtual, 2–9 February 2022 (2022)
20. Chen, X., Wang, C., Zhou, Z., Ross, K.W.: Randomized ensembled double Q-learning: learning fast without a model. In: 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, 3–7 May 2021. OpenReview.net (2021). https://openreview.net/forum?id=AY8zfZm0tDd
21. Anschel, O., Baram, N., Shimkin, N.: Averaged-DQN: variance reduction and stabilization for deep reinforcement learning. In: International Conference on Machine Learning, pp. 176–185. PMLR (2017)
22. Wang, H., Lin, S., Zhang, J.: Adaptive ensemble Q-learning: minimizing estimation bias via error feedback. Adv. Neural Inf. Process. Syst. 34, 24778–24790 (2021)
23. Papadimitriou, D.: Monte Carlo bias correction in Q-learning. In: Goertzel, B., Iklé, M., Potapov, A., Ponomaryov, D. (eds.) AGI 2022. LNCS, vol. 13539, pp. 343–352. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-19907-3_33
24. Tan, T., Hong, X., Feng, L.: Q-learning with heterogeneous update strategy. Available at SSRN 4341963 (2023)
25. Van Hasselt, H.: Estimating the maximum expected value: an analysis of (nested) cross validation and the maximum sample average. arXiv preprint arXiv:1302.7175 (2013)
26. Bengio, Y., Grandvalet, Y.: No unbiased estimator of the variance of k-fold cross-validation. J. Mach. Learn. Res. 5, 1089–1105 (2004)
27. Berrar, D.: Cross-validation (2019)
28. Brockman, G., et al.: OpenAI Gym. arXiv preprint arXiv:1606.01540 (2016)
29. Tasfi, N.: PyGame Learning Environment. GitHub repository (2016)
30. Lan, Q.: A PyTorch reinforcement learning framework for exploring new ideas (2019). https://github.com/qlan3/Explorer
31. Dietterich, T.G.: An overview of MAXQ hierarchical reinforcement learning. In: Choueiry, B.Y., Walsh, T. (eds.) SARA 2000. LNCS (LNAI), vol. 1864, pp. 26–44. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-44914-0_2

A Neural Network Architecture for Accurate 4D Vehicle Pose Estimation from Monocular Images with Uncertainty Assessment

Tomasz Nowak and Piotr Skrzypczyński

Poznan University of Technology, Institute of Robotics and Machine Intelligence, ul. Piotrowo 3A, 60-965 Poznań, Poland
[email protected], [email protected]

Abstract. This paper proposes a new neural network architecture for estimating the four degrees of freedom poses of vehicles from monocular images in an uncontrolled environment. The neural network learns how to reconstruct 3D characteristic points of vehicles from image crops and coordinates of 2D keypoints estimated from these images. The 3D and 2D points are used to compute the vehicle pose by solving the Perspective-n-Point problem, while the uncertainty is propagated by applying the Unscented Transform. Our network is trained and tested on the ApolloCar3D dataset, and we introduce a novel method to automatically obtain approximate labels for 3D points in this dataset. Our system outperforms state-of-the-art pose estimation methods on the ApolloCar3D dataset, and unlike competitors, it implements a full pipeline of uncertainty propagation.

Keywords: Vehicle pose estimation · 3D scene understanding · Deep learning

1 Introduction

It is essential for autonomous cars to be able to predict the poses of different objects in the environment, as this allows these agents to localise themselves relative to other vehicles or road infrastructure. Although the pose of another vehicle can be accurately measured using 3D laser scanner data [12], it is practical to obtain sufficiently accurate estimates of the pose of a vehicle from a single-camera image [18]. Song et al. [25] demonstrated that it is difficult to estimate vehicle poses from monocular images in a traffic scenario, because the problem is ill-posed due to the lack of obvious constraints and because of unavoidable occlusions. In our recent research [21], we showed how a deep neural network derived from the HRNet [29], originally created for human pose estimation, can be used to

Research supported by Poznań University of Technology Internal Grant 0214/SBAD/0242.

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
B. Luo et al. (Eds.): ICONIP 2023, CCIS 1962, pp. 396–412, 2024. https://doi.org/10.1007/978-981-99-8132-8_30


accurately position a single camera in relation to a known road infrastructure object. In this paper, we introduce a novel deep learning architecture, also based on HRNet, which estimates 4D poses (3D Cartesian position and yaw angle) of vehicles in a realistic traffic scenario. This architecture estimates 2D keypoints on RGB image crops and then leverages these point features to estimate 3D coordinates of the characteristic points of a vehicle CAD model. The vehicle pose is computed by solving the Perspective-n-Point problem, given a set of n estimated 3D points of the vehicle model and their corresponding 2D keypoints in the image. Unfortunately, the dataset we use, ApolloCar3D [25], does not contain associations between the 3D coordinates of the characteristic points of different car models and the annotated 2D keypoints in the images. To overcome this problem, we also propose a new approach to automatically pre-process the dataset in order to obtain these associations, leveraging the known poses and 3D CAD models of the vehicles seen in the images. Knowing an estimate of the uncertainty of object pose estimates is crucial for the safe and reliable operation of autonomous cars. Therefore, we propose a two-step uncertainty estimation process. In the first step, we augment the architecture of the proposed neural network to estimate the uncertainty of the obtained keypoints. This uncertainty is represented by covariance matrices and is then propagated in the second step through the PnP algorithm using the unscented transform technique [22], which takes into account the non-linearity of the pose estimation algorithm being applied. Pose estimates with compatible covariance matrices are a unique feature among pose estimation algorithms that employ deep learning. We thoroughly evaluate our approach on the ApolloD3 benchmark, proposed in the same paper as the dataset we use for training, demonstrating that the results outperform state-of-the-art methods in terms of accuracy, while being accompanied by human-interpretable estimates of spatial uncertainty.

The remainder of this paper is organised as follows: the most relevant related works on pose, keypoints, and uncertainty estimation are briefly reviewed in Sect. 2. The structure of the proposed pose estimation system is detailed in Sect. 3, while the approach to supervised training of our neural network, including pre-processing of the dataset, is explained in Sect. 4. Next, techniques applied to estimate and propagate the uncertainty are presented in Sect. 5. Finally, experiments on the ApolloCar3D dataset and the obtained results are presented in Sect. 6, followed by the conclusions given in Sect. 7.

2 Related Work

2.1 Vehicle Pose Estimation

Vehicle pose estimation solutions are of interest to researchers mainly due to the development of autonomous cars. Many solutions to the problem of determining the 3D pose of a vehicle based on observations from depth sensors such as LiDAR have been developed [30,32]. The pose of a vehicle can also be accurately estimated from scene depth data obtained using stereo vision [14]. Estimating the


pose of a vehicle from a single monocular camera image is most difficult, due to the lack of depth data and the difficulty in extracting pose constraints imposed by the environment from the image [25]. Nowadays, the state-of-the-art in estimating vehicle pose from monocular images involves the use of deep learning techniques combined with geometric priors [5]. Two basic types of approaches can be distinguished: direct, without detection of feature points, and indirect, using the detection of vehicle feature points in images.

Older direct pose estimation systems often relied on external neural network models that generated partial solutions, such as 2D proposals in [20], which were then cropped and processed in another network to estimate 3D bounding boxes and their orientations. Also, Xu and Chen [31] used separate neural networks to predict a depth image of the scene, then converted to a point cloud, and to generate 2D bounding boxes used to sample from this point cloud. More recent approaches, like M3D-RPN [1], utilise a standalone 3D region proposal network leveraging the geometric relationship of 2D and 3D perspectives and allowing 3D bounding boxes to use features generated in the image space. Yet differently, [9] exploits class-specific shape priors by learning a low-dimensional shape-space from CAD models. This work uses a differentiable render-and-compare loss function which allows learning of the 3D shapes and poses with 2D supervision.

The indirect approaches typically align the 3D car pose using 2D–3D matching with 2D keypoints detected on images and the provided CAD models of vehicles [2], exploiting also geometric constraints, e.g. the co-planar constraints between neighbouring cars in [2]. The RTM3D [15] predicts the nine keypoints of a 3D bounding box in the image space, then applies the geometric relationship of 2D and 3D perspectives to recover the parameters of the bounding box, including its orientation. In indirect approaches to vehicle pose estimation an important problem is to localise the occluded keypoints. This problem is addressed by the Occlusion-Net [23], which predicts 2D and 3D locations of occluded keypoints for objects. One of the recent developments, the BAAM system [10], leverages various 2D primitives to reconstruct 3D object shapes considering the relevance between detected objects and vehicle shape priors. The approach in [17] is similar to ours in using estimated 2D and 3D points, but it directly regresses 20 semantic keypoints that define the 3D pose of a vehicle, whereas we extract more keypoints and compute poses indirectly, solving the PnP problem. In that work, fewer keypoints are detected and used by the PnP algorithm, and the evaluation is done only on the KITTI dataset. The same problem is considered in [24] and, like us, the authors aim to select the best points for pose estimation, but the results they show use ground truth data – either about depth or 2D points. Similarly to our solution, a reprojection loss is used to train the model in the indirect GSNet [7] system, which is also evaluated on the ApolloCar3D dataset. However, the authors focus more on vehicle shape estimation, which is out of the scope of our work.


2.2 Keypoints and Uncertainty

In terms of accurate and robust detection of keypoints in images, significant progress has recently been made in human pose estimation applications [26]. The High Resolution Network (HRNet) [29] is a leading solution for keypoint detection in human pose estimation. As this architecture ensures high resolution of the feature maps through the entire network processing pipeline, which results in accurate location of the keypoints, we have selected the HRNet as a backbone of the neural part of our vehicle pose estimation system. In our previous work [21], we showed that it is possible to adapt HRNet to the task of estimating the feature points of an object with a known CAD model from RGB images. In the same work, we also introduced a method for estimating the spatial uncertainty of feature points in the form of a covariance matrix, inspired by the algorithm for estimating the uncertainty of keypoints in a face recognition task [8]. While there are classical methods for the propagation of uncertainty in computer vision [4], the problem of estimating feature point uncertainty or object pose uncertainty in deep learning based systems has been addressed in relatively few research papers. Several versions of a camera pose estimation method that consider the uncertainty of point and line features while computing the pose are proposed in [27], but these methods do not produce a covariance matrix for the final pose estimate. The geometric (spatial) uncertainty of features in the PnP problem is also considered in [3]. Recently, approaches to pose estimation that consider spatial uncertainty were presented for deep learning-based human pose recognition [13] and general object pose determination from images [16,33]. As far as we know, there are no publications tackling the estimation of keypoint uncertainty and propagation of this uncertainty to the final vehicle pose for monocular pose estimation in the context of automotive applications.

3 Structure of the Pose Estimation System

The diagram of the whole processing pipeline is shown in Fig. 1, and the details of the heads' architecture are presented in Fig. 2. The pipeline accepts as input only an image crop containing the considered vehicle. It consists of several modules: the 2D Keypoint Estimation Head, the 3D Keypoint Estimation Head, the Keypoint Score Head (KSH) for evaluating the accuracy of individual point estimates, and the Uncertainty Estimation Head for estimating the uncertainty of 2D and 3D points.

3.1 Estimation of 2D Keypoints

The first module is a deep neural network that estimates the 2D coordinates of the car's characteristic points on the image. Our approach combines the HRNet48 [29] as a backbone with a head for the feature extraction and heatmap estimation of 66 distinctive points on the image. We utilise input image crops of 256×192 pixels, providing sufficient detail for precise and fast feature extraction. The output consists of 48 feature maps of size w × h; in our models w=64 and h=48. During training, we compute the loss function only for visible points,


Fig. 1. Architecture of the proposed vehicle pose estimation system

applying a visibility mask that defines which points are visible for each instance. This process ensures that the model focuses on relevant data and is not influenced by obscured or irrelevant points. We employ the unbiased encoding and decoding methods described in [6] for the preparation of ground-truth heatmaps for training and for the subsequent decoding of coordinates from the estimated heatmaps. This technique enhances prediction accuracy by reducing the quantization error.

Fig. 2. Architecture details of the implemented network heads. A Conv Block is a convolution layer followed by batch normalization and ReLU activation; a Linear Block is a linear layer followed by batch normalization and ReLU activation.

3.2 Estimation of 3D Keypoints

Estimation of 3D points requires the feature maps derived from the image crops, initially obtained for the 2D point estimation network, and the features extracted from the estimated 2D points by the Keypoint 2D Backbone. For the formation of features from the 2D points, a Multilayer Perceptron (MLP) consisting of seven layers is employed. This MLP takes the normalised 2D coordinates with


respect to the bounding box as input. The feature maps derived from the image and the 2D points are concatenated to create an input for the 3D point estimation head. This module estimates the 3D points in the canonical pose, which is constant regardless of the observed pose of the vehicle. The canonical pose implies that the coordinate system's origin is situated in the vehicle's geometric center, and the vehicle's front always faces the same direction. This consistent approach to 3D point estimation simplifies the problem, reducing computational complexity while increasing estimation accuracy. The 3D Keypoint Head estimates two separate feature maps. One of these corresponds to the X–Y plane, from which the x and y coordinates are estimated. The second feature map corresponds to the X–Z plane and serves to estimate the z coordinate. By independently addressing two orthogonal planes, this dual-map approach allows us to use a processing pipeline similar to that applied for the estimation of 2D keypoints. The 3D point estimation module's design allows it to seamlessly integrate with the 2D point estimation network by reusing feature maps extracted from the image. Note that our method does not need a mask from a pre-trained Mask R-CNN, which is used by the baseline methods considered in [25].

3.3 Selection of the Best Estimated Keypoints

In this section, we introduce a uniquely designed module that focuses on assessing the precision of the point estimations. From any point of view, a significant portion of characteristic points is obscured, making their precise localisation on the image challenging. Such inaccurately estimated points can easily distort pose estimation. To counteract this phenomenon, we have designed an additional head dedicated to evaluating the accuracy of each estimated point. The accuracy evaluation head accepts as input the image feature map, the estimated heatmaps, and the normalised coordinates of the estimated points. The image feature maps and heatmaps are processed through a sequence of six convolutional layers that outputs two feature vectors, each of length 3072. These feature vectors, along with the estimated point coordinates, are then concatenated and processed through a sequence of six linear layers. The last layer is a sigmoid activation layer, which limits the output values to a range from zero to one. The output of the head is a set of 66 values, each corresponding to the estimation precision score of a given 2D point on the image, hereinafter called KSH scores. During the training phase, as learning targets we used only two values: '1' for a correctly estimated point, and '0' for the opposite case. A point is considered correctly estimated (has a '1' label) if the estimation error normalised to the bounding box is less than 0.04. We also conducted experiments using the normalised estimation errors directly as the learning targets, but the achieved results were worse compared to binary targets generated by thresholding the error.

3.4 Training Process

The training process is organized into distinct stages, each stage focusing on a specific aspect of the network. Initially, during the first stage, we directed our


focus towards the 2D and 3D point heads and the backbone network. Following this, we select the best-performing model from this initial training phase. The model's performance is evaluated using the Percentage of Correct Keypoints (PCK) metric for the 2D keypoints and the Mean Per Joint Position Error (MPJPE) for the 3D point estimations. The PCK metric is defined as

PCK = \frac{P_{correct}}{P} \cdot 100,

where P is the total number of predicted points and P_correct counts only the keypoints whose coordinate estimation error, normalised to the bounding box, is smaller than 0.05. The MPJPE metric is defined as the mean of the Euclidean distances between the estimated points and the corresponding ground-truth points. Additionally, for the training of the 2D and 3D keypoint heads, we used a reprojection loss function. This function is an extension of the approach presented in [21] that maintains the geometric consistency of the estimated coordinates. It works by comparing the 3D points projected using the ground-truth transformation with the predictions of the 2D points, ensuring that the network's estimates of 2D and 3D points are consistent with each other and with real-world geometry. This loss is defined as:

L_{reprojection} = \sum_{i=1}^{n} \big\| \pi\big(T, \hat{p}^{3d}_i, K\big) - \hat{p}^{2d}_i \big\|_2^2 \qquad (1)

where π is the projection function, T is the ground-truth pose, \hat{p}^{3d}_i are the estimated 3D coordinates of the i-th characteristic point, K is the camera intrinsics matrix, and \hat{p}^{2d}_i are the estimated 2D coordinates of the i-th keypoint in the image. The application of this loss function significantly improves the accuracy of the point estimation heads. Once the best model is selected, its weights are frozen to preserve the best results for keypoint estimation. In the second stage, the focus shifts to the training of the Uncertainty Estimation Head and the Keypoint Score Head. By training the UEH and KSH components after the Keypoint 2D and 3D Heads, we ensure that the training process can leverage the well-tuned features provided by the backbone network and the point-generating heads. The loss functions used during training are:

L_{stage1} = w_{repr} \cdot L_{reprojection} + L_{heatmap3Dxy} + L_{heatmap3Dxz} + L_{heatmap2D} \qquad (2)
L_{stage2} = L_{uncertainty2D} + L_{uncertainty3D} + L_{keypoint\_score} \qquad (3)

where w_repr = 1e−7 and L_{heatmap3Dxy}, L_{heatmap3Dxz}, L_{heatmap2D}, and L_{keypoint_score} are Mean Squared Error loss functions.
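A minimal NumPy sketch of the reprojection loss in Eq. (1); the pinhole projection without a distortion model, and the variable names, are simplifying assumptions on our part.

```python
import numpy as np

def reprojection_loss(T, K, pts3d, pts2d):
    """T: 4x4 ground-truth pose, K: 3x3 camera intrinsics,
    pts3d: (n, 3) estimated canonical 3D points, pts2d: (n, 2) estimated keypoints."""
    pts_h = np.hstack([pts3d, np.ones((len(pts3d), 1))])   # homogeneous coordinates
    cam = (T @ pts_h.T)[:3]                                # points in the camera frame
    proj = K @ cam
    proj = (proj[:2] / proj[2]).T                          # perspective division -> (n, 2)
    return np.sum(np.linalg.norm(proj - pts2d, axis=1) ** 2)
```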

3.5 Pose Estimation

Given the 2D points in the image, their corresponding 3D points of the model, and the camera matrix K, various PnP algorithms can be applied to derive the pose. The 2D car characteristic points and the corresponding 3D points are estimated by our network. During our research, we used two approaches: the Efficient


Perspective-n-Point (EPNP) algorithm [11], and a dedicated procedure called Solve-PnP-BFGS implemented using the SciPy library [28]. For the input to the PnP algorithm, we select a subset of the N best-estimated points according to the KSH scores. In our experiments, we used N=17 as it gives the best results. The Solve-PnP-BFGS procedure works by minimizing the reprojection error using the BFGS optimization algorithm. The reprojection error is defined similarly to (1), but the ground-truth transformation T is replaced by the optimized transformation. The optimisation search space in the BFGS algorithm is constrained by bounds applied to the estimated translation. The optimisation process is carried out five times, with a different starting point randomly selected from the search space each time. The selected solution is the one with the lowest value of the cost function. This repetitive process reduces the possibility of the optimisation being stuck in a local minimum. To estimate the shape of a vehicle, we employ a predefined library consisting of all CAD models included in the dataset and a set of 3D characteristic points in canonical form for each model. This library serves as a reference during the shape estimation process, allowing us to capture the full mesh of the vehicles. During the inference stage, we compare the estimated 3D points with those in the library and select the CAD model that yields the smallest MPJPE. To get the most reliable results, before calculating the MPJPE we first remove the points expected to be the least accurately estimated according to the KSH score. The best results were achieved by removing points with a KSH score below the threshold s = 0.19.
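A sketch of the pose-refinement idea behind Solve-PnP-BFGS, written with SciPy. This is our own illustration of the approach: we use the bound-aware L-BFGS-B variant (plain BFGS in SciPy does not accept bounds), and the yaw-plus-translation parameterisation of the 4D pose is our own simplification.

```python
import numpy as np
from scipy.optimize import minimize

def solve_pose(K, pts3d, pts2d, t_bounds, n_restarts=5, seed=0):
    """Minimise the reprojection error over (yaw, tx, ty, tz).
    t_bounds: list of (low, high) bounds for tx, ty, tz."""
    rng = np.random.default_rng(seed)

    def residual(x):
        yaw, t = x[0], x[1:]
        c, s = np.cos(yaw), np.sin(yaw)
        R = np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])   # rotation about the vertical axis
        cam = pts3d @ R.T + t                              # canonical points -> camera frame
        proj = cam @ K.T
        proj = proj[:, :2] / proj[:, 2:3]
        return np.sum((proj - pts2d) ** 2)

    best = None
    for _ in range(n_restarts):                            # random restarts, keep the best
        x0 = np.concatenate([[rng.uniform(-np.pi, np.pi)],
                             [rng.uniform(lo, hi) for lo, hi in t_bounds]])
        res = minimize(residual, x0, method="L-BFGS-B",
                       bounds=[(-np.pi, np.pi)] + list(t_bounds))
        if best is None or res.fun < best.fun:
            best = res
    return best.x  # [yaw, tx, ty, tz]
```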

4 Dataset Preparation for Supervised Learning

For our experiments, we use the ApolloCar3D dataset introduced in [25]. This dataset comprises 5,277 images derived from traffic scenes, containing over 60,000 instances of vehicles. Each vehicle instance is defined by a set of 66 characteristic points, with annotations of only visible points. The dataset also provides a set of 34 CAD models of the vehicles appearing in the images. Ground-truth pose data, relative to the camera coordinates, are provided for each vehicle. A significant challenge with respect to our pose estimation task is the absence of a mapping between the 2D points in the images and the corresponding 3D points from the CAD models. Such a mapping (provided as labels) is necessary for supervised learning of 3D point detection. To overcome this problem, we employ the procedure illustrated in Fig. 3. This process begins by transforming the CAD model using the given translation and rotation. Subsequently, the parameters of the ray encompassing all 3D points, which project onto the image at the annotated keypoints for the considered vehicle, are established. For each face of the CAD model, we check if this ray intersects it using the Möller–Trumbore algorithm [19], subsequently determining the coordinates of the intersection point. In cases where multiple intersection points are discovered, the point closest to the camera is chosen. This strategy is in line with the dataset's approach of marking only non-occluded points. The final stage involves applying the inverse


of the ground truth rotation and translation to the intersection point, yielding the coordinates with respect to the car’s canonical pose. The point coordinates derived from this process are then fine-tuned to achieve a more precise match. For this fine-tuning, we employ the Nelder-Mead optimization algorithm on all instances of a particular car type in the training set. This algorithm adjusts the 3D coordinates to minimize the translation error, defined as the square of the distance between the ground truth translation and the translation estimated by the EPnP [11] method, taking into account the given camera parameters and the 2D point coordinates in the image.
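For reference, a standard Möller–Trumbore ray–triangle intersection test of the kind used in the pre-processing pipeline; this is a textbook formulation in our own code, not the authors' implementation.

```python
import numpy as np

def moller_trumbore(orig, direction, v0, v1, v2, eps=1e-9):
    """Return the ray parameter t of the intersection with triangle (v0, v1, v2),
    or None if the ray misses it. All inputs are length-3 arrays."""
    e1, e2 = v1 - v0, v2 - v0
    p = np.cross(direction, e2)
    det = np.dot(e1, p)
    if abs(det) < eps:                   # ray parallel to the triangle plane
        return None
    inv_det = 1.0 / det
    s = orig - v0
    u = np.dot(s, p) * inv_det
    if u < 0.0 or u > 1.0:
        return None
    q = np.cross(s, e1)
    v = np.dot(direction, q) * inv_det
    if v < 0.0 or u + v > 1.0:
        return None
    t = np.dot(e2, q) * inv_det
    return t if t > eps else None        # keep only intersections in front of the camera
```

Running such a test over all CAD faces and keeping the intersection with the smallest positive t matches the "closest to the camera" rule described above.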

Fig. 3. Pipeline for dataset pre-processing in order to obtain labelled 3D points for supervised learning

5 Uncertainty Estimation

5.1 Estimation of Keypoints Uncertainty

In this section, we discuss a module designed to estimate the uncertainty of the characteristic point estimates. The keypoint uncertainty estimation process is an extension of the approach presented in our previous paper [21]. However, our new method estimates both 2D and 3D point uncertainties, contributing to a comprehensive understanding of the estimation process's precision. For each of the 2D points, a 2×2 covariance matrix, Σ_2D, is estimated for the x and y coordinates. The Uncertainty Estimation Head (UEH) consists of a stack of 5 blocks built from a convolutional layer, ReLU, and batch normalization. The convolutional blocks are followed by one linear layer. The UEH takes as input the keypoint heatmaps and image feature maps and calculates three positive numbers, filling in the lower triangle of the 2D Cholesky factor matrix, L_2D. We ensure the positivity of the estimated numbers through the application of an Exponential Linear Unit (ELU) activation function and the addition of a constant to the output values. Then, by multiplying the matrix L_2D by L_2D^T, we acquire a covariance matrix Σ_2D. This approach ensures the generation of a positive semi-definite covariance matrix, a necessary characteristic for valid uncertainty estimation. During the training phase, we employ the Gaussian Log-Likelihood Loss function, as suggested in [8].


When it comes to the 3D points, we estimate the values found on the main diagonal of the 3×3 covariance matrix. Only the main diagonal elements are filled in the 3D Cholesky factor matrix, L_3D. The subsequent processing steps are analogous to those used in the 2D point uncertainty estimation, reinforcing consistency across the module's operation.

5.2 Propagation Using Unscented Transform

The estimated uncertainties of both 2D and 3D points can be propagated to the uncertainty of the estimated vehicle pose. To accomplish this, we employ the Unscented Transform method, as presented in [22]. The propagation of uncertainty of the n-dimensional input is carried out using 2n+1 sigma points χ_i, i = 0, ..., n, whose coordinates are determined by:

\chi_0 = m_x, \quad \chi_{2i-1} = m_x + \Big(\sqrt{(n+\lambda)\,C_x}\Big)_i, \quad \chi_{2i} = m_x - \Big(\sqrt{(n+\lambda)\,C_x}\Big)_i, \qquad (4)

for i = 1, ..., n, where m_x and C_x are the mean and covariance of the estimated points, λ = α²(n + k) − n is a scaling factor, and α and k are parameters influencing how far the sigma points lie from the mean. Then, by applying the nonlinear transformation, which is the PnP algorithm, we obtain a set of points from which the mean and covariance of the estimate after the nonlinear transformation are computed. In our implementation, the Unscented Transform parameters α and k [22] are set to 0.9 and 50, respectively. The propagation process for 2D and 3D points is executed independently. After the uncertainties have been individually propagated, the resulting covariance matrices of equal dimension are summed up, assuming statistical independence between the coordinates of the 3D and 2D keypoints.
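A small generic sketch of the sigma-point propagation of Eq. (4) (our own code): the function f stands in for the PnP solver and maps R^n to R^m. The paper specifies only α and k, so the standard UT weighting scheme with β = 2 used below is an assumption on our part.

```python
import numpy as np

def unscented_propagate(f, m_x, C_x, alpha=0.9, k=50.0):
    """Propagate mean m_x (n,) and covariance C_x (n, n) through f via 2n+1 sigma points.
    f must map an (n,) vector to an (m,) vector."""
    n = len(m_x)
    lam = alpha ** 2 * (n + k) - n
    L = np.linalg.cholesky((n + lam) * C_x)              # matrix square root of (n+lambda)*C_x
    sigma = [m_x] + [m_x + L[:, i] for i in range(n)] + [m_x - L[:, i] for i in range(n)]

    w_m = np.full(2 * n + 1, 1.0 / (2 * (n + lam)))      # mean weights
    w_m[0] = lam / (n + lam)
    w_c = w_m.copy()                                     # covariance weights (beta = 2 assumed)
    w_c[0] += 1.0 - alpha ** 2 + 2.0

    y = np.array([f(s) for s in sigma])                  # transformed sigma points
    m_y = (w_m[:, None] * y).sum(axis=0)
    d = y - m_y
    C_y = (w_c[:, None, None] * np.einsum('ij,ik->ijk', d, d)).sum(axis=0)
    return m_y, C_y
```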

6 Evaluation Results

Model evaluation was carried out on the ApolloCar3D dataset validation set. It contains 200 images according to the split made by the authors of the ApolloCar3D dataset. On an Nvidia 1080Ti GPU the proposed pipeline is capable of running at 20 FPS for the EPnP variant without uncertainty propagation and at 18 FPS with full uncertainty propagation.

6.1 Keypoints 2D

For the evaluation of the 2D keypoints estimation network, we used the Percentage of Correct Keypoints (PCK) metric, assuming three thresholds: 5 pixels, 10 pixels, and 15 pixels. The first two rows in Table 1 were calculated using as reference only visible vehicle points, i.e., those located in the ground truth annotations of the dataset. We provide metric values for all marked points and for those points that the network identified as having the highest KSH score. The results demonstrate that the network correctly assesses the accuracy of the point estimation and is capable of improving the results achieved on the PCK metric.
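For reference, the PCK values can be computed with a short helper such as the following sketch, which assumes predicted and ground-truth keypoints given as N×2 pixel arrays together with a validity mask.

```python
import numpy as np

def pck(pred, gt, valid, thresholds=(5.0, 10.0, 15.0)):
    """Percentage of Correct Keypoints: share of valid keypoints whose
    prediction lies within `threshold` pixels of the ground truth."""
    dist = np.linalg.norm(pred - gt, axis=-1)   # per-keypoint pixel error
    dist = dist[valid]                          # keep annotated points only
    return {t: 100.0 * float(np.mean(dist <= t)) for t in thresholds}
```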


The last two rows in Table 1 present analogous PCK metric values. The difference is that we used the projections of 3D points as ground truth annotations, using the ground truth translation and rotation of a given vehicle. This approach allows for an evaluation of the invisible points that is more relevant to real-world conditions, where we do not have a priori knowledge about the visibility of points. The presented results, especially those acquired for the points selected by the KSH score, provide sufficient accuracy for the 3D pose estimation using PnP algorithms.

Table 1. PCK metric of the 2D keypoint estimations compared to manually labeled visible points (top) and all 66 points acquired by projection of 3D points (bottom)

Threshold                     5 px   10 px   15 px
Visible points PCK            55.0   81.0    89.0
Selected visible points PCK   60.0   84.0    91.0
All points PCK                13.1   19.7    22.2
Selected points PCK           40.5   56.0    60.9

6.2 Keypoints 3D

In this section, we present the results related to the estimation of 3D points on vehicles using our proposed method. Module performance is evaluated using the Mean Per Joint Position Error (MPJPE) metric, which measures the average distance error of the estimated points. Our results reveal an average MPJPE value of 0.119 m for all points estimated by the network. When we consider only the N points with the highest KSH score, the error decreases to 0.105 m, which confirms that the KSH score promotes better keypoints. The most accurately estimated points are on the rear corner of the handle of the right front car door, with a mean error of 0.073 m, while the points of lowest accuracy (0.235 m on average) are located on the left corner of the rear bumper.

6.3 A3DP Metrics

We utilised the Absolute 3D Pose Error (A3DP-Abs) metric introduced in [25] to evaluate our results. This metric focuses on the absolute distances to objects, considering three key components: the estimated shape of the car, its position, and its rotation. The translation error metric is defined as:

c_trans = ‖t_gt − t̂‖_2 ≤ δ_t,   (5)

where t_gt denotes the ground truth translation, t̂ denotes the estimated translation, and δ_t is an acceptance threshold. The rotation error metric is defined as:

c_rot = arccos(|q_gt · q̂|) ≤ δ_rot,   (6)

where q_gt denotes the ground truth rotation quaternion, q̂ denotes the estimated rotation quaternion, and δ_rot is an acceptance threshold.
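A small helper illustrating how the two acceptance tests of Eqs. 5 and 6 can be evaluated for a single detection is sketched below; it assumes unit-normalised quaternions and is not the benchmark's official evaluation code.

```python
import numpy as np

def pose_accepted(t_est, t_gt, q_est, q_gt, delta_t, delta_rot):
    """Check the translation (Eq. 5) and rotation (Eq. 6) criteria."""
    c_trans = np.linalg.norm(t_gt - t_est) <= delta_t
    # |q_gt . q_est| handles the double cover of SO(3); clip guards arccos.
    dot = abs(float(np.dot(q_gt, q_est)))
    c_rot = np.arccos(np.clip(dot, -1.0, 1.0)) <= delta_rot
    return c_trans and c_rot

# Example: the strict criterion c-s uses delta_t = 1.4 m, delta_rot = pi/12.
```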


Similarly to the metrics proposed for the COCO dataset, the authors of ApolloCar3D proposed a set of metric thresholds from strict to loose. The translation thresholds were established from 2.8 m to 0.1 m in increments of 0.3 m, while the rotation thresholds range from π/6 to π/60 with steps of π/60. Beyond the 'mean' metric, which averages results across all thresholds, two single-threshold metrics were defined. The loose criterion (denoted as c−l) utilises [2.8, π/6] thresholds for translation and rotation, whereas the strict criterion (denoted as c−s) employs [1.4, π/12] as thresholds for these respective parameters. To evaluate the 3D shape reconstruction, a predicted mesh is rendered from 100 different perspectives. Intersection over Union (IoU) is computed between these renderings and the ground truth masks. The average of these IoU calculations is then used to evaluate the shape reconstruction. Table 2 shows our results and compares them against state-of-the-art methods. Note that for the algorithms proposed in [2,9] the implementations provided in [25] as baselines were used for a fair comparison.

Table 2. Comparison of results with the state-of-the-art methods on A3DP-Abs metrics

algorithm            mean   c − l   c − s
3D-RCNN [9]          16.4   29.7    19.8
DeepMANTA [2]        20.1   30.7    23.8
GSNet [7]            18.9   37.4    18.4
BAAM-Res2Net [10]    25.2   47.3    23.1
Ours EPnP            23.4   44.6    31.7
Ours BFGS            25.6   47.7    34.6

The results of the A3DP-Abs mean metric indicate that our system's performance is superior to the very recent BAAM [10] method, which demonstrates the proficiency of our solution in estimating 3D points accurately. A significant advantage of our network is its performance on the strict criterion c−s. This metric measures how well our module estimates 3D points under stringent conditions, a challenging task that demands a high degree of precision and reliability. Our system outperforms all existing state-of-the-art solutions on this metric by a large margin, highlighting the solution's excellence in providing highly accurate 3D characteristic points.

6.4 Uncertainty Assessment

In this section, we present results related to the uncertainty estimation of the 2D and 3D characteristic points, as well as to the uncertainty of the vehicle's pose. The first part of our evaluation process involved examining the percentage of point and vehicle translation estimations that fall within the ranges of 1, 2, and 3 standard deviations (σ). The results of this examination are shown in


Table 3. Percentage of estimations that fall within the ranges of one, two and three σ

        Keypoints 2D      Keypoints 3D            Pose translation
        x      y          x      y      z         x      y      z
1 σ     79.0   79.8       82.5   80.3   64.8      84.5   85.0   81.3
2 σ     93.1   93.8       94.2   94.1   80.9      94.9   95.6   93.7
3 σ     97.2   97.2       97.5   97.3   87.5      98.6   98.6   97.5

Fig. 4. Plot of the mean σ (standard deviation) depending on the observation distance

Table 3. We further examine the relationship between the mean value of σ and the vehicle's distance in Fig. 4. These findings suggest that as distance increases, the uncertainty in pose estimation also increases. This trend is to be expected, as distant objects inherently have lower resolution in the image, which tends to lead to higher uncertainties in the estimation. An additional noteworthy observation is that the uncertainty along the z-axis (depth direction) is greater than along the remaining two axes. This finding is justified, as it is often more challenging to determine the position along the axis perpendicular to the image plane. Due to the nature of monocular vision and image projection, depth information is less reliable and can result in higher uncertainties. Figure 5 presents six examples of predictions and the uncertainty of the estimated pose from a bird's eye view. The uncertainty of position (x, y) for each vehicle is represented by its uncertainty ellipse. Observe that cars that are partially occluded or are located further from the camera have larger uncertainty ellipses than the closer, fully visible cars.


Fig. 5. Visualizations of estimated pose and uncertainty. Circles show the estimated positions of cars, squares are ground truth positions of these cars, colour rays show ground truth yaw angles, grey fans show the estimated one-sigma yaw range. Ellipses visualise one sigma position uncertainty (Color figure online)

7 Conclusions

In this paper, we have introduced a comprehensive processing pipeline capable of estimating vehicle pose solely based on data from a single RGB camera. This innovative pipeline comprises a network for estimating 2D and 3D points, an


evaluation of the credibility of individual point estimations, and an optimisation algorithm estimating pose based on information derived from previous steps. Our system outperforms all previously reported models on the ApolloCar3D dataset and is capable of estimating the uncertainty of pose estimates. The unique ability to handle uncertainty has significant implications for real-world applications, offering an additional layer of validation and error mitigation in pose estimation and further decision making. We also introduce an unsupervised method for preparing a training dataset containing 3D point coordinates. This approach leverages 2D point annotations, CAD models, and vehicle poses, offering a novel methodology to generate reliable training data without the need for laborious and error-prone manual annotation. In conclusion, our approach represents a significant advancement in the field of vehicle pose estimation, providing a powerful, robust, and accurate pipeline for this complex task. Furthermore, the ability to estimate the uncertainty of pose estimations and the introduction of an unsupervised method for training dataset preparation pave the way for future advancements in this field. Further work will include the integration of geometric priors stemming from the scene structure and semantics, such as the coplanarity constraint applied in [25].

References 1. Brazil, G., Liu, X.: M3D-RPN: monocular 3D region proposal network for object detection. In: IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9286–9295 (2019) 2. Chabot, F., Chaouch, M., Rabarisoa, J., Teuliere, C., Chateau, T.: Deep MANTA: A coarse-to-fine many-task network for joint 2D and 3D vehicle analysis from monocular image. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2040–2049 (2017) 3. Ferraz, L., Binefa, X., Moreno-Noguer, F.: Leveraging feature uncertainty in the PNP problem. In: Proceedings of the British Machine Vision Conference (2014) 4. Haralick, R.M.: Propagating covariance in computer vision. Int. J. Pattern Recogn. Artif. Intell. 10(5), 561–572 (1996) 5. Hoque, S., Xu, S., Maiti, A., Wei, Y., Arafat, M.Y.: Deep learning for 6d pose estimation of objects - a case study for autonomous driving. Expert Syst. Appl. 223, 119838 (2023) 6. Huang, J., Zhu, Z., Guo, F.: The devil is in the details: delving into unbiased data processing for human pose estimation. arXiv:2008.07139 (2020) 7. Ke, L., Li, S., Sun, Y., Tai, Y.-W., Tang, C.-K.: GSNet: joint vehicle pose and shape reconstruction with geometrical and scene-aware supervision. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12360, pp. 515–532. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58555-6 31 8. Kumar, A., Marks, T.K., Mou, W., Feng, C., Liu, X.: UGLLI face alignment: Estimating uncertainty with gaussian log-likelihood loss. In: IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), pp. 778–782 (2019) 9. Kundu, A., Li, Y., Rehg, J.M.: 3D-RCNN: instance-level 3D object reconstruction via render-and-compare. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3559–3568 (2018)


10. Lee, H.J., Kim, H., Choi, S.M., Jeong, S.G., Koh, Y.J.: BAAM: monocular 3D pose and shape reconstruction with bi-contextual attention module and attention-guided modeling. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9011–9020, June 2023 11. Lepetit, V., Moreno-Noguer, F., Fua, P.: EPnP: an accurate O(n) solution to the PnP problem. Int. J. Comput. Vision 81, 155–166 (2009) 12. Li, B., Zhang, T., Xia, T.: Vehicle detection from 3D lidar using fully convolutional network. In: Hsu, D., Amato, N.M., Berman, S., Jacobs, S.A. (eds.) Robotics: Science and Systems XII. University of Michigan, Ann Arbor (2016) 13. Li, H., et al.: Pose-oriented transformer with uncertainty-guided refinement for 2D-to-3D human pose estimation (2023) 14. Li, P., Chen, X., Shen, S.: Stereo R-CNN based 3D object detection for autonomous driving. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7636–7644 (2019) 15. Li, P., Zhao, H., Liu, P., Cao, F.: RTM3D: real-time monocular 3D detection from object keypoints for autonomous driving. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12348, pp. 644–660. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58580-8_38 16. Liu, F., Hu, Y., Salzmann, M.: Linear-covariance loss for end-to-end learning of 6D pose estimation. CoRR abs/2303.11516 (2023) 17. López, J.G., Agudo, A., Moreno-Noguer, F.: Vehicle pose estimation via regression of semantic points of interest. In: 11th International Symposium on Image and Signal Processing and Analysis (ISPA), pp. 209–214 (2019) 18. Marti, E., de Miguel, M.A., Garcia, F., Perez, J.: A review of sensor technologies for perception in automated driving. IEEE Intell. Transp. Syst. Mag. 11(4), 94–108 (2019) 19. Möller, T., Trumbore, B.: Fast, minimum storage ray-triangle intersection. J. Graph. Tools 2(1), 21–28 (1997) 20. Mousavian, A., Anguelov, D., Flynn, J., Kosecka, J.: 3D bounding box estimation using deep learning and geometry. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5632–5640 (2017) 21. Nowak, T., Skrzypczyński, P.: Geometry-aware keypoint network: accurate prediction of point features in challenging scenario. In: 17th Conference on Computer Science and Intelligence Systems (FedCSIS), pp. 191–200 (2022) 22. Pérez, D.A., Gietler, H., Zangl, H.: Automatic uncertainty propagation based on the unscented transform. In: IEEE International Instrumentation and Measurement Technology Conference (I2MTC), pp. 1–6 (2020) 23. Reddy, N.D., Vo, M., Narasimhan, S.G.: Occlusion-Net: 2D/3D occluded keypoint localization using graph networks. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7318–7327 (2019) 24. Shi, J., Yang, H., Carlone, L.: Optimal pose and shape estimation for category-level 3D object perception. arXiv:2104.08383 (2021) 25. Song, X., et al.: ApolloCar3D: a large 3D car instance understanding benchmark for autonomous driving. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5447–5457 (2019) 26. Toshpulatov, M., Lee, W., Lee, S., Haghighian Roudsari, A.: Human pose, hand and mesh estimation using deep learning: a survey. J. Supercomput. 78(6), 7616–7654 (2022) 27. Vakhitov, A., Colomina, L.F., Agudo, A., Moreno-Noguer, F.: Uncertainty-aware camera pose estimation from points and lines. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4657–4666 (2021)


28. Virtanen, P., et al.: SciPy 1.0 contributors: SciPy 1.0: fundamental algorithms for scientific computing in python. Nat. Methods 17, 261–272 (2020). https://doi.org/ 10.1038/s41592-019-0686-2 29. Wang, J., et al.: Deep high-resolution representation learning for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 43, 3349–3364 (2021) 30. Wang, Q., Chen, J., Deng, J., Zhang, X.: 3D-CenterNet: 3D object detection network for point clouds with center estimation priority. Pattern Recogn. 115, 107884 (2021) 31. Xu, B., Chen, Z.: Multi-level fusion based 3D object detection from monocular images. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2345–2353 (2018) 32. Yang, B., Luo, W., Urtasun, R.: PIXOR: real-time 3D object detection from point clouds. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7652–7660 (2018) 33. Yang, H., Pavone, M.: Object pose estimation with statistical guarantees: conformal keypoint detection and geometric uncertainty propagation. CoRR abs/2303.12246 (2023)

PSO-Enabled Federated Learning for Detecting Ships in Supply Chain Management Y Supriya1 , Gautam Srivastava2(B) , K Dasaradharami Reddy1 , Gokul Yenduri1 , Nancy Victor1 , S Anusha3 , and Thippa Reddy Gadekallu1,4,5,6,7 1

School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India [email protected], [email protected], [email protected], [email protected], [email protected] 2 Department of Math and Computer Science, Brandon University, Brandon, Canada [email protected] 3 Department of Computer Science and Engineering, NBKR Institute of Science and Technology, Vidyanagar, Tirupathi, Andhra Pradesh, India [email protected] 4 Zhongda Group, Haiyan County, Jiaxing City, Zhejiang Province, China 5 Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon 6 Research and Development, Lovely Professional University, Phagwara, India 7 College of Information Science and Engineering, Jiaxing University, Jiaxing 314002, China

Abstract. Supply chain management plays a vital role in the efficient and reliable movement of goods across various platforms, which involves several entities and processes. Ships in the supply chain are very important for the global economy to be connected. Detecting ships and their related activities is of paramount importance to ensure successful logistics and security. To improve logistics planning, security, and risk management, a strong framework is required that offers an efficient and privacy-preserving solution for identifying ships in supply chain management. In this paper, we propose a novel approach called Particle Swarm Optimization-enabled Federated Learning (PSO-FL) for ship detection in supply chain management. The proposed PSO-FL framework leverages the advantages of both Federated Learning (FL) and Particle Swarm Optimization (PSO) to address the challenges of ship detection in supply chain management. We can train a ship identification model cooperatively using data from several supply chain stakeholders, including port authorities, shipping firms, and customs agencies, thanks to the distributed nature of FL. By improving the choice of appropriate participants for model training, the PSO algorithm improves FL performance. We conduct extensive experiments using real-world ship data that is gathered from various sources to evaluate the effectiveness of our PSO-FL approach. The results demonstrate that our framework achieves


superior ship detection accuracy of 94.88% compared to traditional centralized learning approaches and standalone FL methods. Furthermore, the PSO-FL framework demonstrates robustness, scalability, and privacy preservation, making it suitable for large-scale deployment in complex supply chain management scenarios.

Keywords: Federated Learning · Particle Swarm Optimization · Ships · Supply Chain Management · Industry 5.0

1 Introduction

Supply chain management is the coordination and management of the entire process of producing, acquiring, transforming, and distributing commodities and services from their place of origin to their point of consumption. Ensuring the efficient movement of goods and information along the supply chain includes the integration and management of numerous functions, such as production, inventory management, transportation, and warehousing [13]. In recent years, the emergence of Industry 5.0 has transformed the manufacturing and supply chain management landscape, with a focus on integrating advanced technologies such as artificial intelligence (AI), the Internet of Things (IoT), and robotics into industrial processes [11]. One of the key challenges in Industry 5.0 is the need to ensure safety, security, and efficiency in supply chain management, particularly with regard to the detection of ships [8]. The detection of ships is a crucial aspect of transportation in supply chain management. Ships are one of the primary means of transporting goods across long distances, and their timely detection is essential for ensuring that goods are delivered on time [22]. Several Machine Learning (ML) and Deep Learning (DL) techniques have been used widely in ship detection tasks as they can learn from large amounts of data and extract complex features. ML algorithms such as Support Vector Machines (SVM), Random Forests, or Logistic Regression and other techniques like Convolutional Neural Networks (CNN), Transfer Learning, and Recurrent Neural Networks (RNN) have been used in ship detection [24]. However, while ML and DL techniques showed good potential in ship detection, there are also several limitations and challenges, such as imbalanced data availability, large computational requirements, and the lack of a good framework that provides an effective and privacy-preserving solution for ship detection [20]. FL has emerged as a promising approach for ship detection, addressing the limitations of traditional methods thanks to its ability to address privacy concerns and leverage distributed data sources [2]. FL in ship detection also faces certain challenges, including communication bandwidth limitations, heterogeneity of data sources, model convergence issues, and coordination among participants [1,14,15,18]. PSO-enabled FL [10] in ship detection combines the optimization capabilities of PSO with the collaborative and privacy-preserving nature of FL, enabling


improved ship detection accuracy, faster convergence, and better utilization of distributed data sources [12]. It provides a promising approach for developing robust ship detection models in scenarios where data privacy and collaboration among multiple entities are essential.

1.1 Key Contributions

The key contributions of our work include the following:
• Proposal of a novel PSO-enabled FL approach: The paper introduces a unique approach that combines the advantages of FL and PSO for ship detection in supply chain management.
• Enhancement of ship detection performance: The proposed approach significantly improves ship detection performance compared to traditional centralized methods. The paper provides experimental results and performance metrics to demonstrate the effectiveness of the PSO-enabled FL approach.
• Consideration of data privacy and security: The paper addresses the security and privacy concerns by employing an FL framework that allows local models to be trained on edge devices without sharing sensitive data with a centralized server.

2 Related Work

This section briefly reviews related work. Ship detection in supply chain management plays a critical role in ensuring efficient and secure global trade operations. With the advent of advanced technologies and data-driven decision-making, researchers and practitioners have explored various techniques for ship detection, including ML and FL. In this section, we review the related work that focuses on how state-of-the-art techniques are used for ship detection in supply chain management.

Ship Detection in Supply Chain Management: El Mekkaoui et al. [3] focused on the problem of ship speed prediction within the context of Maritime Vessel Services operating in the Saint Lawrence Seaway area. The primary challenge lies in developing a real-time predictive model that can effectively account for varying routes and vessel types. To address this challenge, the study proposes a data-driven solution that leverages DL sequence methods and historical ship trip data to forecast ship speeds at different stages of a voyage. Faustina et al. [4] addressed the challenge of ship detection in synthetic aperture radar (SAR) imagery. Ship detection is a critical task in maritime surveillance and security, and the utilization of multi-temporal information can significantly enhance detection accuracy. A novel approach was proposed that leverages transfer learning techniques to adapt multi-temporal information for optimized ship detection. A SAR image dataset was used to train a DL model.


FL for Ship Detection: Zhang et al. [23] present an interesting and valuable contribution to the field of fault diagnosis in the IoT domain, specifically for ships. The paper addresses an important challenge of privacy preservation in FL, and the adaptive nature of the framework enhances its practical applicability. Conducting empirical evaluations and extending the discussion on related works would further enhance the paper’s contribution. Wang et al. in [19] present an innovative solution to address the challenge of privacy protection and efficient incumbent detection in spectrum-sharing scenarios. The paper introduces a well-designed FL framework and provides a comprehensive explanation of the proposed approach. Further empirical evaluations and comparisons would enhance the paper’s contribution and validate its effectiveness in practical spectrum-sharing applications. Teng et al. [17] introduced an innovative FL framework that incorporates prior knowledge and a bilateral segmentation network for accurate and efficient edge extraction. The paper presents a detailed description of the framework, supported by mathematical formulations and experimental evaluations. The proposed approach demonstrates superior performance compared to existing methods, making it a valuable contribution to the field of FL for image analysis tasks. PSO-enabled FL: Supriya et al. [16] introduced a novel framework that combines FL and PSO for efficient and accurate early detection of forest fires. The paper presents a comprehensive explanation of the proposed approach, supported by experimental evaluations and discussions on practical implications. Kandati et al. [7] proposed an FPS optimization algorithm that employs PSO to increase FL’s effectiveness and decrease the average amount of data the client communicates to the server. The suggested approach distributes the model’s score value from the server where it was trained. Based on the above literature, although ship detection in supply chain management has been extensively studied, the integration of FL and PSO in this domain remains relatively unexplored. In our research, we aim to bridge this gap by developing a PSO-enabled FL approach for ship detection in supply chain management. Our work contributes to the advancement of ship detection techniques in the context of Industry 5.0 and provides valuable insights for optimizing supply chain operations and ensuring secure maritime logistics.

3 Proposed Work

3.1 Dataset Collection and Preparation

This work uses an image dataset from Kaggle1 as was also used in [21]. Using this dataset, ships can be detected in the test images by classifying the images. The dataset contains 4000 images. Figure 1 depicts the sample images from the satellite imagery of the ship dataset. This dataset consists of two directories, no-ship, and ship. The no-ship directory has 3000 images, and the ship directory has 1000 images. 1

https://www.kaggle.com/datasets/apollo2506/satellite-imagery-of-ships.


Fig. 1. Sample images from the satellite imagery of ships dataset. The images are of two classes namely no-ship and ship.

3.2 Dataset Pre-processing

It is necessary to perform some data cleaning to improve the quality of the training and the outcomes. Consequently, we cropped the images and scaled them to a constant size of 150 × 150 pixels.

3.3 Dataset Partitioning

The train_test_split function splits the dataset. Setting test_size=0.2 allocates 20% of the data to the test set and the remaining 80% to the training set.
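A minimal sketch of this split using scikit-learn is given below; the placeholder arrays, the random seed, and the stratification by class are illustrative assumptions rather than settings reported in the paper.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Placeholder arrays standing in for the 4000 pre-processed 150x150 RGB images
# and their binary labels (1 = ship, 0 = no-ship).
X = np.zeros((4000, 150, 150, 3), dtype=np.float32)
y = np.concatenate([np.ones(1000, dtype=np.int64), np.zeros(3000, dtype=np.int64)])

# test_size=0.2 allocates 20% of the samples to the test set, 80% to training.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y)
```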

3.4 CNN

Our CNN model consists of four primary layers: two convolutional layers, each followed by a max-pooling layer. We utilize the ReLU activation function for its efficiency in dealing with the vanishing gradient problem. The model is compiled using the Adam optimizer with a learning rate of 0.01, and binary cross-entropy is used as the loss function. Table 1 shows the architecture of the CNN model.

3.5 Methodology

The primary objective of this study is to propose a method for the early detection of ships in supply chain management. The combination of FL and PSO makes such a model feasible. Adding more layers to CNN models is a common procedure to enhance their accuracy. This is known as a deep neural network. Training time for weight parameters increases as the number of layers increases. The cost of transmitting the client-learned model to the server increases. Thus, we propose a unified framework that uses PSO characteristics to relocate the


trained model, irrespective of its size, with the highest possible score (such as accuracy or loss).

Table 1. CNN Model Architecture

Layer          Type        Filters   Filter Size   Stride   Padding   Activation
Conv Layer 1   Conv2D      32        3×3           1        Same      ReLU
Pool Layer 1   MaxPool2D   N/A       2×2           2        Valid     N/A
Conv Layer 2   Conv2D      64        3×3           1        Same      ReLU
Pool Layer 2   MaxPool2D   N/A       2×2           2        Valid     N/A
Flatten        Flatten     N/A       N/A           N/A      N/A       N/A
FC Layer       Dense       64        N/A           N/A      N/A       ReLU
Output Layer   Dense       1         N/A           N/A      N/A       Sigmoid


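A minimal Keras sketch consistent with Table 1 and the hyper-parameters stated in Sect. 3.4 (Adam with learning rate 0.01, binary cross-entropy) is shown below; the three-channel 150×150 input shape follows the pre-processing step and is an assumption.

```python
from tensorflow import keras
from tensorflow.keras import layers

def build_cnn(input_shape=(150, 150, 3)):
    """Binary ship / no-ship classifier following the layout of Table 1."""
    model = keras.Sequential([
        layers.Conv2D(32, (3, 3), strides=1, padding="same",
                      activation="relu", input_shape=input_shape),
        layers.MaxPooling2D((2, 2), strides=2, padding="valid"),
        layers.Conv2D(64, (3, 3), strides=1, padding="same", activation="relu"),
        layers.MaxPooling2D((2, 2), strides=2, padding="valid"),
        layers.Flatten(),
        layers.Dense(64, activation="relu"),
        layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=0.01),
                  loss="binary_crossentropy", metrics=["accuracy"])
    return model
```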

Initially, we evaluate the proposed PSO-enabled FL scheme. PSO-enabled FL accepts model weights only from the top-scoring client rather than from all clients. Figure 2 shows how the process works. The client's loss value after training determines its score: the client with the lowest loss attains the highest score. In this instance, the loss value is only 4 bytes long. PSO-enabled FL finds the best model by applying the pb and gb variables, and it then updates the value of S for each element of the weighted array that represents the best model. The weights of PSO-enabled FL are updated using Eq. 1, given below.



Fig. 2. Particle swarm optimization-enabled Federated Learning


S_i^a = α · S_i^{a−1} + d_1 · rd_1 · (pb − S_i^{a−1}) + d_2 · rd_2 · (gb − S_i^{a−1}),   w_i^a = w_i^{a−1} + S^a   (1)



Table 2. Constants

Name of the constant   Value
Clients                10
Epochs                 30
Client-Epochs          5
Batch size             16
Learning rate          0.01

0.01

For each layer with weight w, S in CNN may be calculated using Eq. 1. The current step weight wa−1 is calculated by adding the previous step weight wa . According to 1, the acceleration constants for pb and gb are, respectively, d1 and d2 . Any random number between 0 and 1 can be substituted for the values of rd1 and rd2 . Algorithm 1 presents the basic idea of PSO-enabled FL. The PSO-based Algorithm 1 is an enhancement of the FedAvg Algorithm. Looking at Line 5, the client only sends the pb variables to the function ServerAggregation but not the weight w. Lines 6–8 carry out the process of finding the client with the lowest pb value in the collected data. After that, CNN will start using the UpdateClient function to implement the PSO. Lines 13 and 14 estimates the previously used variable S, the clients desired wpb value, and the wgb value transmitted to the server. This procedure is carried out again for each layer weight. In Line 15, we add S to the w we retrieved from the previous round to determine the w we will use in this round. Lines 16–18 are iterated as many times as client epoch E. Lines 20–23 of GetAptModel requested the server for the client model with the highest possible score. Table 2 depicts the hyper-parameter values used in the paper.

4

Results and Analysis

A summary of the tests performed to estimate and analyze the PSO-enabled FL architecture is provided in this section. In the following area, the performance of PSO-enabled FL on the ship dataset is studied. 4.1

Performance of the Proposed Approach

In this paragraph, the training and performance evaluation of the model is evaluated. Using samples from the ship dataset, the server model is initially

420

Y. Supriya et al.

trained. Clients are then given access to the server model. Typically, we test the model’s performance on 10 clients. The learning rate value that we chose was 0.01. The observations are selected at random for every client device in the dataset. Figure 3 shows the model’s successful performance in each round and is compared with the performance of the simple FL algorithm. 4.2

Evaluation Metrics

Several metrics are used to evaluate the performance of different ship detection models on the dataset, including recall rate (R), precision rate (P), F1-score and specificity. Eqs. 2, 3, 4, and 5 are the formulas used to calculate precision, recall, F1-score and specificity, respectively. Precision = Recall =

True Positives × 100 True Positives + False Positives

True Positives × 100 True Positives + False Negatives

F 1-score =

2 × Precision × Recall Precision + Recall

Algorithm 1. PSO-enabled FL algorithm 1: function ServerAggregation(ηN ) 2: initialize w0 , gbid, pb, gb, 3: for every iteration a = 1, 2, . . . do 4: for every parallel client t do 5: pb ← UpdateClient(t, wtgbid ) 6: if gb > pb then 7: gb ← pb 8: gbid ← t 9: wa+1 ← GetAptModel(gbid) 10: function UpdateClient(t, wagbid ) 11: initialize S, w,wpb , α, d1 , d2 12: β ← (divide pt into batches each of size B) 13: for every layer with weight r = 1, 2, . . . do 14: Sr ← α.Sr + dr .rd.(wpb − Sr ) + d2 .rd.(wagb − Sr ) 15: w ←w+S 16: for every epoch of client i from 1 to G do 17: for batch b ∈ B do 18: w ← w − ηδl(w; b) 19: return pb to the server 20: function GetAptModel(gid) 21: sends a request to the client (gbid) 22: receive w from Client 23: return w to the server

(2) (3) (4)

PSO-Enabled Federated Learning for Detecting Ships

421

True Negatives × 100 (5) True Negatives + False Positives where TP denotes the number of positive samples that are correctly identified, FP denotes the number of false positive negative samples, and FN denotes the number of false positive samples. Table 3 refers to the evaluation metrics values. Specificity =

Fig. 3. Accuracy and Loss values of our approach and FL algorithm

4.3

Performance Comparison with Other Models

To evaluate the accuracy of the proposed model, we compare it with the recent state-of-the-art algorithms in prior works. These include the work in [6] which proposes a CNN model to classify ships and achieve an accuracy of 86%. The authors in [5] propose automated ship detection and category recognition from high-resolution aerial images. The SLC-HRoI Detection (SHD) model showed an accuracy of 81.5% while the Rotation Dense Feature Pyramid Network (RDFPN) and Sequence Local Context-HRol Detection (SHD-HBB) had an accuracy of 76.2% and 69.5% respectively. The work in [9] used a micro aerial vehicle (MAV) Table 3. Evaluation metrics Round Number

F1 Score

Precision Recall

Recall

Accuracy of FL

1

0.927

0.921

0.932

0.923

0.804

2

0.886

0.825

0.956

0.875

0.888

3

0.923

0.938

0.908

0.913

0.890

4

0.899

0.951

0.853

0.895

0.912

5

0.909

0.886

0.932

0.903

0.871

6

0.892

0.909

0.876

0.885

0.920

7

0.937

0.951

0.924

0.937

0.892

8

0.935

0.947

0.924

0.931

0.931

9

0.951

0.967

0.936

0.954

0.894

422

Y. Supriya et al.

for the same purpose. In comparison to the earlier methods, our method achieved an accuracy of 94.88%. PSO can optimize the decision-making process. In the context of detecting ships, PSO helps FL in making decisions while selecting the clients for aggregation. PSO also can deal with ambiguous or uncertain data like unclear satellite images or inaccurate location data. This hybrid method can provide a more robust solution that can work well under different conditions and variations. PSO-FL also can provide more accurate results because of its optimization capabilities and the way it handles uncertainties. PSO is chosen over lightweight ML algorithms due to its robustness and adaptability. As a global optimization technique, PSO excels at handling problems with numerous local optima or those that are not differentiable or continuous, often stumbling blocks for traditional ML methods (Table 4). Table 4. Accuracy comparison of previous ship detection models Model

Accuracy

Sequence Local Context-HRol Detection(SHD) [5]

69.5%

Rotation Dense Feature Pyramid Network(RDFPN) [5] 76.2%

5

SLC-HRoI Detection(SHD-HBB) [5]

81.5%

CNN [5]

86%

Micro aerial vehicle(MAV) [9]

92%

Federated Learning

93.1%

Proposed work

94.88%

Conclusion

This work presents a novel approach, PSO-FL, for ship detection in supply chain management. By using the benefits of FL and PSO, our proposed work addresses the challenges of ship detection in complex supply chain scenarios. Our framework achieves superior ship detection accuracy of 94.88% when compared to the traditional centralized learning approach and regular FL methods. By using the collaborative nature of FL and the optimization capabilities of PSO, the PSO-FL approach optimizes client selection and model aggregation. This duo helps the framework to identify the most relevant and informative data sources for model training which results in higher accuracy. The PSO-FL framework is efficient in handling the uncertainty and ambiguity in ship detection tasks, particularly when dealing with unclear satellite images or imprecise location data. By offering a practical and private ship detection method, this research contributes to the field of supply chain management. The supply chain’s logistical planning, security, and risk management may all be improved with the use of the suggested PSO-FL framework. This work sets the path for further improvements

PSO-Enabled Federated Learning for Detecting Ships

423

in intelligent supply chain management systems by using distributed learning approaches and metaheuristic algorithms. Though PSO is effective in increasing the accuracy of prediction of ships, PSO has many disadvantages as well. PSO takes longer to calculate and suffers from early convergence in the beginning. When compared to larger datasets, PSO works effectively for smaller datasets. By substituting other naturally inspired algorithms for the PSO technique, our work can be made more effective. The future research objectives include investigating the use of PSO-FL in additional facets of supply chain management, such as cargo monitoring, anomaly detection, and predictive analytics. Additionally, examining the integration of other metaheuristic algorithms with FL can improve the effectiveness and performance of ship identification and associated supply chain operations. Overall, the PSO-FL framework presented in this paper demonstrates significant potential for improving ship detection capabilities and enhancing supply chain management processes. The utilization of distributed learning techniques combined with metaheuristic optimization contributes to the advancement of intelligent and efficient supply chain operations in an increasingly complex and interconnected world.

References 1. AbdulRahman, S., Tout, H., Mourad, A., Talhi, C.: Fedmccs: multicriteria client selection model for optimal IoT federated learning. IEEE Internet Things J. 8(6), 4723–4735 (2020) 2. AbdulRahman, S., Tout, H., Ould-Slimane, H., Mourad, A., Talhi, C., Guizani, M.: A survey on federated learning: the journey from centralized to distributed on-site learning and beyond. IEEE Internet Things J. 8(7), 5476–5497 (2020) 3. El Mekkaoui, S., Benabbou, L., Caron, S., Berrado, A.: Deep learning-based ship speed prediction for intelligent maritime traffic management. J. Marine Sci. Eng. 11(1), 191 (2023) 4. Faustina, A., Aravindhan, E., et al.: Adapting multi-temporal information for optimized ship detection from SAR image dataset using transfer learning application. In: Handbook of Research on Advanced Practical Approaches to Deepfake Detection and Applications, pp. 275–287. IGI Global (2023) 5. Feng, Y., Diao, W., Sun, X., Yan, M., Gao, X.: Towards automated ship detection and category recognition from high-resolution aerial images. Remote Sens. 11(16), 1901 (2019) 6. Gallego, A.J., Pertusa, A., Gil, P.: Automatic ship classification from optical aerial images with convolutional neural networks. Remote Sens. 10(4), 511 (2018) 7. Kandati, D.R., Gadekallu, T.R.: Federated learning approach for early detection of chest lesion caused by Covid-19 infection using particle swarm optimization. Electronics 12(3), 710 (2023) 8. Karmaker, C.L., et al.: Industry 5.0 challenges for post-pandemic supply chain sustainability in an emerging economy. Int. J. Prod. Econ. 258, 108806 (2023) 9. Lan, J., Wan, L.: Automatic ship target classification based on aerial images. In: International Conference on Optical Instruments and Technology: Optical Systems and Optoelectronic Instruments. vol. 7156, pp. 316–325. SPIE (2009) 10. Li, Y., Wang, X., Zeng, R., Donta, P.K., Murturi, I., Huang, M., Dustdar, S.: Federated domain generalization: a survey. arXiv preprint arXiv:2306.01334 (2023)

424

Y. Supriya et al.

11. Maddikunta, P.K.R., et al.: Industry 5.0: a survey on enabling technologies and potential applications. J. Indust. Inform. Integr. 26, 100257 (2022) 12. Minyard, E., Kolawole, S., Saxena, N.: Evolutionary federated learning using particle swarm optimization (2023) 13. Nayeri, S., Sazvar, Z., Heydari, J.: Towards a responsive supply chain based on the industry 5.0 dimensions: a novel decision-making method. Expert Syst. Appl. 213, 119267 (2023) 14. Rahman, S.A., Tout, H., Talhi, C., Mourad, A.: Internet of things intrusion detection: centralized, on-device, or federated learning? IEEE Netw. 34(6), 310–317 (2020) 15. Song, Y., Dong, G.: Federated target recognition for multi-radar sensor data security. IEEE Trans. Geosci. Remote Sens. (2023) 16. Supriya, Y., Gadekallu, T.R.: Particle swarm-based federated learning approach for early detection of forest fires. Sustainability 15(2), 964 (2023) 17. Teng, L., et al.: Flpk-bisenet: federated learning based on priori knowledge and bilateral segmentation network for image edge extraction. IEEE Trans. Netw. Serv. Manage. (2023) 18. Wahab, O.A., Mourad, A., Otrok, H., Taleb, T.: Federated machine learning: survey, multi-level classification, desirable criteria and future directions in communication and networking systems. IEEE Commun. Surv. Tutorials 23(2), 1342–1397 (2021) 19. Wang, N., Le, J., Li, W., Jiao, L., Li, Z., Zeng, K.: Privacy protection and efficient incumbent detection in spectrum sharing based on federated learning. In: 2020 IEEE Conference on Communications and Network Security (CNS), pp. 1–9. IEEE (2020) 20. Wang, Z., Wang, R., Ai, J., Zou, H., Li, J.: Global and local context-aware ship detector for high-resolution SAR images. IEEE Trans. Aerospace Electron. Syst. (2023) 21. Xie, X., Li, B., Wei, X.: Ship detection in multispectral satellite images under complex environment. Remote Sens. 12(5) (2020). https://doi.org/10.3390/ rs12050792,https://www.mdpi.com/2072-4292/12/5/792 22. Yasir, M., et al.: Ship detection based on deep learning using SAR imagery: a systematic literature review. Soft. Comput. 27(1), 63–84 (2023) 23. Zhang, Z., Guan, C., Chen, H., Yang, X., Gong, W., Yang, A.: Adaptive privacypreserving federated learning for fault diagnosis in internet of ships. IEEE Internet Things J. 9(9), 6844–6854 (2021) 24. Zhao, P., Ren, Y., Xiao, H.: An image-based ship detector with deep learning algorithms. In: Encyclopedia of Data Science and Machine Learning, pp. 2540– 2552. IGI Global (2023)

Federated Learning Using the Particle Swarm Optimization Model for the Early Detection of COVID-19 K. Dasaradharami Reddy1 , Gautam Srivastava2(B) , Yaodong Zhu7 , Y. Supriya1 , Gokul Yenduri1 , Nancy Victor1 , S. Anusha3 , and Thippa Reddy Gadekallu1,4,5,6,7 1

3

School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India {dasaradharami.k,gokul.yenduri,thippareddy.g}@vit.ac.in, [email protected], [email protected] 2 Department of Mathematics and Computer Science, Brandon University, Brandon, Canada [email protected] Department of Computer Science and Engineering, NBKR Institute of Science and Technology, Vidyanagar, Tirupathi, Andhra Pradesh, India 4 Zhongda Group, Haiyan County, Jiaxing, Zhejiang, China 5 Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon 6 Research and Development, Lovely Professional University, Phagwara, India 7 College of Information Science and Engineering, Jiaxing University, Jiaxing 314001, China

Abstract. The COVID-19 pandemic has created significant global health and socioeconomic challenges, which creates the need for efficient and effective early detection methods. Several traditional machine learning (ML) and deep learning (DL) approaches have been used in the detection of COVID-19. However, ML and DL strategies face challenges like transmission delays, a lack of computing power, communication delays, and privacy concerns. Federated Learning (FL), has emerged as a promising method for training models on decentralized data while ensuring privacy. In this paper, we present a novel FL framework for early detection of COVID-19 using the Particle Swarm Optimization (PSO) model. The proposed framework uses advantages of both FL and PSO. By employing the PSO technique the model aims to achieve faster convergence and improved performance. In order to validate the effectiveness of the proposed approach, we performed experiments using a COVID-19 image dataset which was collected from different healthcare institutions. The results indicate that our approach is more effective as it achieves a higher accuracy rate of 94.36% which is higher when compared to traditional centralized learning approaches. Furthermore, the FL framework

c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1962, pp. 425–436, 2024. https://doi.org/10.1007/978-981-99-8132-8_32

426

K. Dasaradharami Reddy et al. ensures data privacy and security by keeping sensitive patient information decentralized and only sharing aggregated model updates during the training process.

Keywords: Federated Learning COVID-19 detection

1

· Particle Swarm Optimization ·

Introduction

The rapid spread of COVID-19 has a significant impact on human health and life. The COVID-19 pandemic, now affecting millions of people around the world, has been ranked among the most catastrophic health problems in the last decade [13]. The COVID-19 virus produces respiratory particles that enter the air when an infected person sneezes, coughs, or speaks. The earliest possible detection of COVID-19 is critical to limiting its potential spread. Detecting COVID-19 outbreaks rapidly and accurately is becoming increasingly important. The World Health Organization (WHO) declared COVID-19 an acute public health emergency due to the severity of its outbreak. Many people were affected by this deadly outbreak in different ways. The COVID-19 virus has been detected globally through several tests [6]. Current methods for detecting COVID-19 infection in the human body are as follows: • 3D images are captured by computed tomography scans (CT scans) to detect COVID-19. • Reverse transcription polymerase chain reaction (RT-PCR) is a useful tool for detecting infectious Ribonucleic acid (RNA) from nasal swabs. • CT scan machines require more equipment, while chest X-rays (CXR) require less and are easier to transport. Furthermore, CXR tests are fast, taking about 15 seconds per individual. However, CT scans are not available in hospitals or many health centers. Furthermore, RT-PCR tests cannot be used in most hospitals to diagnose COVID-19 infections and this requires a lot of time. Due to its ease of availability, reliability, and accuracy, we used the COVID-19 infection chest X-ray image dataset in this study. Medical professionals can diagnose COVID-19 infection as early as possible if they possess enough epidemiological data, including protein composition (mainly RNA), serological findings, and diagnostic tests. Imaging data can also be used to determine the clinical significance of COVID-19 infections [26]. A drawback of healthcare and epidemic management is the difficulty in diagnosing COVID-19 infection early in clinical specimens. Early detection of diseases can prevent their spread [5]. Data scientists and artificial intelligence (AI) engineers can detect COVID19 infections. COVID-19 infection can be predicted based on chest X-rays using

FL for Early Detection of COVID-19

427

Deep Learning (DL) models [32]. A prediction model based on past instances can be developed using AI and DL which prevent the spread of disease further [30]. Therefore, Machine Learning (ML) models are required for detecting patients with COVID-19 infection or predicting their future transmission. Developing an accurate model would be impossible without sufficient patient data, but patient information is considered to be sensitive [19]. Therefore, it is necessary to create a model that reliably predicts outcomes without disclosing individual patient information. Federated Learning (FL) was first introduced to the world by Google in 2016. By using FL, we can build ML models from different datasets without disclosing any personal information [22]. Moreover, FL enables the client and server to communicate more efficiently, thus reducing communication costs [20]. There is less time spent waiting for the information to be delivered and received since client-side data is not forwarded to the server for training [8]. In addition to making use of enormous quantities of information locally, FL is also capable of doing precisely the same thing virtually [18]. However, in FL, time spent communicating takes longer than time spent computing. Reduced network connection time is necessary to increase FL’s efficiency. Research in recent years [24,25] has shown that FL may be used to reliably detect COVID-19 infection from chest Xray images. However, earlier investigations used FL’s default setup, which proves unsuccessful when client input is unclear and requires a lot of processing power to distribute updated models. The recommended research uses Particle Swarm Optimization (PSO), a technique that quickly updates models by using a distributed algorithm to determine the optimal solution [9]. Because PSO finds the best solution through a random process, many repetitions are necessary. PSO succeeds in adaptable and complicated environments like FL [14]. This motivates us to develop a novel approach, that involves the PSO in FL. 1.1

Key Contributions

Our work has made the following key contributions: • Proposal of a novel FPS optimization approach: Using FL and PSO together for chest X-ray (COVID-19 and Pneumonia) detection, this paper introduces a novel technique. • Enhancement of chest X-rays (COVID-19 and Pneumonia) detection performance: Comparing the proposed approach with traditional centralized methods, it significantly improves the detection of COVID-19 and Pneumonia on chest X-rays. In this paper, experimental results and performance metrics are presented as evidence of the effectiveness of the FPS optimization approach. • Consideration of data privacy and security: The paper addresses issues related to privacy and security through the use of an FL approach that enables training of local models on peripheral devices without sharing sensitive data.

428

2

K. Dasaradharami Reddy et al.

Related Work

The following section briefly describes related work. In recent research, FL has been shown to improve client communication to a great extent. FL’s unstable network environment causes many problems for mobile devices. There are several issues with this, including regular collapses, modifications to node groups, extra work for a centralized server, and significant latency improvements as the number of nodes grows in size. Communication between client and server in terms of data volume should be taken into account in specific circumstances. Recent studies [10,12,31] jointly select devices and design beamforming algorithms to solve the bandwidth bottleneck problem. Conventional FL methods encounter numerous difficulties, including limited capacity and resources in Internet of Things (IoT) networks [29], high communication overhead when transferring local updates to the server in more extensive networks [2], and occasionally being vulnerable to certain attacks like Byzantine attacks [1]. Recently, a significant amount of effort has been made to build algorithms using DL for the diagnosis of COVID-19 disease from chest X-ray images [3, 17]. Chowdhury et al. [7] and Narin et al. [21] developed convolutional neural networks (CNN)-based models for detecting COVID-19 patients from chest Xray images. The PSO algorithm simulates the behavior of animals swarming to solve convex and differentiable optimization problems [9]. Some recent efforts to improve ML performance have made use of PSO concepts. The PSO technique is used when Convolutional Neural Networks (CNN) are used to improve image recognition and categorization accuracy [27]. PSO and FL can be integrated to improve FL’s performance [23]. In FL, [23] data is learned, and in the PSO, the optimal hyperparameters are determined. A client model is selected using PSO for each round of global model updates. Fine-tuning FL parameters using the PSO algorithm maximizes its performance. Supriya et al. [28] present an extensive approach based on FL and a PSO algorithm that enables researchers to respond promptly to forest fires. Kandati et al. [14] suggested an FPS optimization algorithm that employs PSO to improve FL’s performance and decrease the quantity of data the client transmits to the server. Previous efforts to improve FL’s efficiency have focused on increasing communication with clients and optimizing its global operations. Our primary goal is to improve FL’s performance through the usage of PSO-based clientserver communication.

3

Proposed Work

There is a need to detect COVID-19 infection as early as possible. COVID-19 infection can be detected through different AI and DL models. However, we need a model which detects infection as early as possible without compromising privacy of patient data. This type of model is possible through the combination of FL and PSO. Adding more layers to a CNN model can improve its accuracy. Deep

FL for Early Detection of COVID-19

429

neural networks are used for this purpose. Weight parameters are influenced by layer depth in terms of training effort. The cost of transmitting the client-learned model to the server rises as the network distance between the devices increases. Consequently, we offer an integrated approach that uses PSO characteristics to relocate the trained model with the best score (for instance, accuracy or loss), no matter the model’s size. FPS Optimization, the proposed model, only allows model weights from the top-scoring client rather than from all clients. The process is illustrated in Fig. 1. Client training is targeted in such a way that concludes the client attaining the lowest possible loss value, resulting in the highest score. Only 4 bytes are used in this case to represent the loss value. By applying pb and gb to a weighted array, FPS Optimization determines the optimal model and then sets S for each of its weighted array members. gb←min(pb1,pb2,…..,pbk) gid←index of the client with gb

Client 1

pbtk

pbt2 wt+1

wt+1

Client 2

............

pbt1

wt+1←wtgid

wt+1

wtgid

Client K

request

Client gid

Fig. 1. FPS Optimization Process

The FPS optimization weights are updated using Eq. 1. The weight update appears as follows: Slt = α.Slt−1 + c1 .rand1 .(pb − Slt−1 ) + c2 .rand2 .(gb − Slt−1 ) wit = wit−1 + S t

(1)

Each layer of CNN has a weight S, which is expressed by Eq. 1. The current step weight wt is determined by adding S to the wt−1 prior step weight. In Eq. 1, α represents the inertia weight, c1 denotes the constant of acceleration for pb, and c2 denotes the constant of acceleration for gb. Rand1 and Rand2 are randomly generated numbers between 0 and 1.

430

K. Dasaradharami Reddy et al.

Based on Algorithm 1, the proposed framework aims to solve the problem. The FPS optimization given in Algorithm 1, is an enhancement of the FedAvg Algorithm. According to ServerAggregation, client pb values are only accepted on line 5, in contrast to traditional methods. Initializing variables w0 , gid, pb, gb is part of the ServerAggregation function. The pb value is updated using the UpdateClient function. It compares wgb with pb, and wgb is updated with the new pb value, while gid is updated with K, which is the client id. Among the available samples, lines 6–8 select the client with the lowest loss value (pb). The UpdateClient function is performed by CNN through the PSO. The variables S, w,wpb , α, c1 , c2 are initialized in the UpdateClient function. The batch sizes are divided by a constant called β. The particle velocity is updated for each layer. As the batch size increases, the weights are also updated. Lines 13–14 compute variable S along with the user-stored best value of wpb and the serverreceived wgb value. The procedure is repeated for each successive layer weight. To calculate the value of w for the current iteration, Line 15 adds variable S to w in the last iteration. Training should be continued through lines 16–18 until the client epoch E is reached. In lines 20–23, GetAptModel inquires about the server to retrieve the most effective client model. GetAptModel sends the request to the client using gid and receives w as confirmation. Algorithm 1. Federated PSO algorithm 1: function ServerAggregation(ηN ) 2: initialize w0 , gid, pb, gb, 3: for every iteration t = 1, 2, . . . do 4: for every parallel client k do 5: pb ← UpdateClient(k, wtgid ) 6: if gb > pb then 7: gb ← pb 8: gid ← k 9: wt+1 ← GetAptModel(gid) 10: function UpdateClient(k, wtgid ) 11: initialize S, w,wpb , α, c1 , c2 12: β ← (divide pk into batches each of size B) 13: for every layer with weight l = 1, 2, . . . do 14: Sl ← α.Sl + c1 .rand.(wpb − Sl ) + c2 .rand.(wtgb − Sl ) 15: w ←w+S 16: for every epoch of client i from 1 to E do 17: for batch b ∈ B do 18: w ← w − ηδl(w; b) 19: return pb to the server 20: function GetAptModel(gid) 21: request Client(gid) 22: will be acknowledged w from Client 23: return w to the server

3.1 Dataset Collection and Preparation

The dataset used in this work is taken from the Kaggle repository (https://www.kaggle.com/datasets/prashant268/chest-xray-covid19-Pneumonia).

Fig. 2. Sample images from the chest X-ray (COVID-19 and Pneumonia) dataset. The images are divided into three categories: COVID-19, Normal, and Pneumonia

The dataset contains 6432 images: the training folder has 5144 images and the testing folder has 1288 images. The images in each folder belong to three categories:
– COVID-19
– Normal
– Pneumonia
The purpose of this study is to use PSO to detect the onset of COVID-19 and Pneumonia from chest X-ray images. Figure 2 shows sample images from the chest X-ray (COVID-19 and Pneumonia) dataset.

3.2 Dataset Pre-processing

Images from various perspectives are included in the dataset. A dataset of this kind can be used to fine-tune a model to distinguish between images associated with COVID-19, Normal, and Pneumonia. To improve training and results, it is necessary to clean the dataset. Therefore, we pre-processed the images so that only content related to the identified problem was included in the images. Cropping and scaling were then performed on each image to maintain a constant size of 150 × 150 pixels. Chest X-rays are structured into testing and training directories. We used 80% of the data for training and 20% for testing.
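As a rough illustration of this pre-processing step, the sketch below resizes the images to 150 × 150 pixels and builds an 80/20 train/test split. The directory layout, folder names, and helper names are assumptions for illustration, not taken from the paper.

import os
from glob import glob
from PIL import Image
from sklearn.model_selection import train_test_split

CLASSES = ["Covid19", "Normal", "Pneumonia"]   # folder names assumed from the dataset description

def load_paths(root="chest_xray"):
    paths, labels = [], []
    for label, cls in enumerate(CLASSES):
        for p in glob(os.path.join(root, cls, "*")):
            paths.append(p)
            labels.append(label)
    return paths, labels

def preprocess(path, size=(150, 150)):
    # Crop/scale every image to a constant 150x150 resolution, as described above.
    return Image.open(path).convert("L").resize(size)

paths, labels = load_paths()
train_p, test_p, train_y, test_y = train_test_split(
    paths, labels, test_size=0.2, stratify=labels, random_state=0)
train_images = [preprocess(p) for p in train_p]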

4 Results and Analysis

Experimental results for FPS Optimization are summarized in this section. Finally, we will evaluate the performance of FPS Optimization and standard FL on the chest X-ray (COVID-19 and Pneumonia) dataset.

4.1 Performance of the Proposed Approach

The purpose of this subsection is to assess the effectiveness of the model and the overall approach. To train the server-side model, we begin with data from the chest X-ray (COVID-19 and Pneumonia) dataset. Servers are assigned to clients based on the client ratio; in our study, 10 clients were allocated at random, and the learning rate was set to 0.01. Data accuracy is ensured by randomly selecting observations for each client device from the dataset. Figure 3 shows the test results on the chest X-ray (COVID-19 and Pneumonia) dataset for each iteration; each graph is plotted in terms of test accuracy. FPS Optimization outperformed FedAvg across all 30 epochs, reaching a higher accuracy of 94.36%. As the data volume increases, data is transmitted from the server to the clients in parallel, which leads to higher accuracy when C is higher. However, FPS Optimization takes fewer iterations to converge.

4.2 Evaluation Metrics

Several metrics are used to evaluate the performance of different COVID-19 detection models on the dataset, including recall rate (R), precision rate (P), and F1-score:

Recall = \frac{TP}{TP + FN},   Precision = \frac{TP}{TP + FP},   F1\text{-}score = \frac{2 × Precision × Recall}{Precision + Recall},

where TP denotes the number of positive samples that are correctly identified, FP denotes the number of negative samples that are incorrectly identified as positive, and FN denotes the number of positive samples that are incorrectly identified as negative. Table 1 reports the values of these evaluation metrics.
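A minimal sketch of these metrics, assuming the predictions and ground-truth labels are available as integer class lists (the function name is illustrative):

def prf_for_class(y_true, y_pred, positive_class):
    # Treat the given class as "positive" and all other classes as "negative".
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive_class and p == positive_class)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != positive_class and p == positive_class)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive_class and p != positive_class)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

print(prf_for_class([0, 1, 2, 1], [0, 1, 1, 1], positive_class=1))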


Fig. 3. Comparison of the FPS Optimization and FedAvg Accuracy and Loss Results for the chest X-ray (COVID-19 and Pneumonia) dataset

Table 1. F1-score, Precision and Recall values

Round Number | Accuracy | Precision | Recall | F1-score
5  | 0.9359 | 0.9352 | 0.9356 | 0.9355
10 | 0.9417 | 0.9410 | 0.9414 | 0.9413
15 | 0.9320 | 0.9313 | 0.9317 | 0.9316
20 | 0.9436 | 0.9429 | 0.9433 | 0.9432
25 | 0.9320 | 0.9313 | 0.9317 | 0.9316
30 | 0.9436 | 0.9429 | 0.9433 | 0.9432

Table 2. Comparison of the proposed model with recent studies

Work | Accuracy | Dataset | Method
Khaloufi et al. [16] | 79% | COVID-19 symptom | ANN; an AI-enabled framework that uses a smartphone to diagnose a chest lesion caused by COVID-19 infection
Horry et al. [11] | 86% | X-ray, ultrasound, CT scan | VGG19
Ayan et al. [4] | 87% | Chest X-Ray images (Pneumonia) | VGG16
Kermany et al. [15] | 92.8% | Chest X-Ray images (Pneumonia) | Identifying medical diagnoses and treating diseases with image-based deep learning
Proposed work | 94.36% | Chest X-ray (COVID-19 and Pneumonia) dataset | FPS Optimization

4.3 Performance Comparison with Other Models

We evaluate the accuracy of the proposed model against state-of-the-art algorithms from prior works. The authors in [16] present a robust framework for early detection of COVID-19 using smartphone-embedded sensors together with advanced ML and data analytics techniques. The authors in [11] demonstrate that COVID-19 can be detected by DL models using X-ray images. Table 2 compares the methodologies. Compared to more traditional FL methods, FPS Optimization is a significant improvement. Because it is more robust, our method makes it easier to refine the hyperparameters for COVID-19 detection, which simplifies the diagnosis process. Our technique ensures that FL performance can be optimized by improving the client model's measurements before sending them to the server.

5 Conclusion

In this work, we presented a novel FL framework that utilizes PSO for the early detection of COVID-19. The proposed framework combines the benefits of both FL and PSO. One significant advantage of our framework is its ability to address the privacy concerns associated with centralized data storage and processing. By keeping sensitive patient information decentralized and sharing only aggregated model updates during training, our approach ensures data privacy while maintaining the benefits of collaborative learning. This aspect is crucial for maintaining public trust and facilitating large-scale deployment of intelligent healthcare systems. Our experimental results demonstrated that the recommended approach performed better than existing methods, with predictions on the COVID-19 infection dataset reaching an accuracy of 94.36%. PSO, however, is computationally demanding and can suffer from premature convergence early in training. It is better suited to smaller datasets: on larger datasets its convergence is slow and its search space becomes very wide. In future work, the PSO algorithm could be replaced with other nature-inspired algorithms to further improve the effectiveness of our approach.

Acknowledgements. This work is supported by Zhejiang Key Research and Development Project under Grant 2017C01043.

References

1. Agrawal, S., Chowdhuri, A., Sarkar, S., Selvanambi, R., Gadekallu, T.R.: Temporal weighted averaging for asynchronous federated intrusion detection systems. Comput. Intell. Neurosci. 2021, 5844728 (2021)
2. Agrawal, S., et al.: Federated learning for intrusion detection system: concepts, challenges and future directions. Comput. Commun. 195, 346–361 (2022)
3. AlMohimeed, A., Saleh, H., El-Rashidy, N., Saad, R.M., El-Sappagh, S., Mostafa, S.: Diagnosis of Covid-19 using chest X-ray images and disease symptoms based on stacking ensemble deep learning. Diagnostics 13(11), 1968 (2023)


4. Ayan, E., Ünver, H.M.: Diagnosis of pneumonia from chest X-ray images using deep learning. In: 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), pp. 1–5. IEEE (2019)
5. Bhapkar, H., Mahalle, P.N., Dey, N., Santosh, K.: Revisited COVID-19 mortality and recovery rates: are we missing recovery time period? J. Med. Syst. 44(12), 1–5 (2020)
6. Chowdhury, D., et al.: Federated learning based Covid-19 detection. Expert Syst. 40, e13173 (2022)
7. Chowdhury, M.E., et al.: Can AI help in screening viral and COVID-19 pneumonia? IEEE Access 8, 132665–132676 (2020)
8. Dasaradharami Reddy, K., Gadekallu, T.R., et al.: A comprehensive survey on federated learning techniques for healthcare informatics. Comput. Intell. Neurosci. 2023, 8393990 (2023)
9. Eberhart, R., Kennedy, J.: A new optimizer using particle swarm theory. In: MHS 1995. Proceedings of the Sixth International Symposium on Micro Machine and Human Science, pp. 39–43. IEEE (1995)
10. Fan, X., Wang, Y., Huo, Y., Tian, Z.: Joint optimization of communications and federated learning over the air. IEEE Trans. Wireless Commun. 21, 4434–4449 (2021)
11. Horry, M.J., et al.: COVID-19 detection through transfer learning using multimodal imaging data. IEEE Access 8, 149808–149824 (2020)
12. Jing, S., Xiao, C.: Federated learning via over-the-air computation with statistical channel state information. IEEE Trans. Wireless Commun. 21, 9351–9365 (2022)
13. Kandati, D.R., Gadekallu, T.R.: Genetic clustered federated learning for COVID-19 detection. Electronics 11(17), 2714 (2022). https://doi.org/10.3390/electronics11172714
14. Kandati, D.R., Gadekallu, T.R.: Federated learning approach for early detection of chest lesion caused by COVID-19 infection using particle swarm optimization. Electronics 12(3), 710 (2023)
15. Kermany, D.S., et al.: Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(5), 1122–1131 (2018)
16. Khaloufi, H., et al.: Deep learning based early detection framework for preliminary diagnosis of COVID-19 via onboard smartphone sensors. Sensors 21(20), 6853 (2021)
17. Khan, I.U., Aslam, N.: A deep-learning-based framework for automated diagnosis of COVID-19 using X-ray images. Information 11(9), 419 (2020)
18. Konečný, J., McMahan, B., Ramage, D.: Federated optimization: distributed optimization beyond the datacenter. arXiv:1511.03575 (2015)
19. Liu, B., Yan, B., Zhou, Y., Yang, Y., Zhang, Y.: Experiments of federated learning for COVID-19 chest X-ray images. arXiv:2007.05592 (2020)
20. McMahan, B., Moore, E., Ramage, D., Hampson, S., y Arcas, B.A.: Communication-efficient learning of deep networks from decentralized data. In: Artificial Intelligence and Statistics, pp. 1273–1282. PMLR (2017)
21. Narin, A., Kaya, C., Pamuk, Z.: Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks. Pattern Anal. Appl. 24, 1207–1220 (2021)
22. Pandya, S., et al.: Federated learning for smart cities: a comprehensive survey. Sustain. Energy Technol. Assess. 55, 102987 (2023)
23. Qolomany, B., Ahmad, K., Al-Fuqaha, A., Qadir, J.: Particle swarm optimized federated learning for industrial IoT and smart city services. In: GLOBECOM 2020–2020 IEEE Global Communications Conference, pp. 1–6. IEEE (2020)


24. Rahman, M.A., Hossain, M.S., Islam, M.S., Alrajeh, N.A., Muhammad, G.: Secure and provenance enhanced internet of health things framework: a blockchain managed federated learning approach. IEEE Access 8, 205071–205087 (2020)
25. Rieke, N., et al.: The future of digital health with federated learning. NPJ Digit. Med. 3(1), 1–7 (2020)
26. Santosh, K.: AI-driven tools for coronavirus outbreak: need of active learning and cross-population train/test models on multitudinal/multimodal data. J. Med. Syst. 44(5), 1–5 (2020)
27. Serizawa, T., Fujita, H.: Optimization of convolutional neural network using the linearly decreasing weight particle swarm optimization. arXiv:2001.05670 (2020)
28. Supriya, Y., Gadekallu, T.R.: Particle swarm-based federated learning approach for early detection of forest fires. Sustainability 15(2), 964 (2023)
29. Taheri, R., Shojafar, M., Alazab, M., Tafazolli, R.: Fed-IIoT: a robust federated malware detection architecture in industrial IoT. IEEE Trans. Industr. Inf. 17(12), 8442–8452 (2020)
30. Wieczorek, M., Silka, J., Woźniak, M.: Neural network powered COVID-19 spread forecasting model. Chaos, Solitons Fractals 140, 110203 (2020)
31. Yang, K., Jiang, T., Shi, Y., Ding, Z.: Federated learning based on over-the-air computation. In: ICC 2019–2019 IEEE International Conference on Communications (ICC), pp. 1–6. IEEE (2019)
32. Zhang, W., et al.: Dynamic-fusion-based federated learning for COVID-19 detection. IEEE Internet Things J. 8(21), 15884–15891 (2021)

Domain Generalization via Implicit Domain Augmentation

Zhijie Rao1,3, Qi Dong1,3, Chaoqi Chen2, Yue Huang1,3(B), and Xinghao Ding1,3

1 School of Informatics, Xiamen University, Xiamen, China
[email protected]
2 The University of Hong Kong, Hong Kong, China
3 Institute of Artificial Intelligent, Xiamen University, Xiamen, China

Abstract. Deep convolutional neural networks often suffer significant performance degradation when deployed to an unknown domain. To tackle this problem, domain generalization (DG) aims to generalize the model learned from source domains to an unseen target domain. Prior work mostly focused on obtaining robust cross-domain feature representations, but neglecting the generalization ability of the classifier. In this paper, we propose a novel approach named Implicit Domain Augmentation (IDA) for classifier regularization. Our motivation is to prompt the classifier to see more diverse domains and thus become more knowledgeable. Specifically, the styles of samples will be transferred and reequipped to original features. To obtain the direction of meaningful style transfer, we use the multivariate normal distribution to model the feature statistics. Then new styles are sampled from the distribution to simulate potential unknown domains. To efficiently implement IDA, we achieve domain augmentation implicitly by minimizing an upper bound of expected cross-entropy loss on the augmented feature set instead of generating new samples explicitly. As a plug-and-play technique, IDA can be easily applied to other DG methods and boost the performance, introducing negligible computational overhead. Experiments on several tasks demonstrate the effectiveness of our method.

Keywords: Domain generalization · Classifier regularization · Robust loss function · Transfer learning

1 Introduction

Despite the excellent achievement of convolutional neural networks in various vision tasks, severe performance degradation occurs when encountering the domain shift phenomenon, which is caused by differences in style, lighting, texture, etc.

The work was supported in part by the National Natural Science Foundation of China under Grant 82172033, U19B2031, 61971369, 52105126, 82272071, 62271430, and the Fundamental Research Funds for the Central Universities 20720230104.


To address this issue, domain generalization (DG) learns transferable knowledge on the source domains and thus generalizes to an unseen target domain. Most previous DG studies have focused on extracting robust feature representations from the source domains to cope with domain perturbations. Their methods are broadly categorized as domain augmentation [1,8,9,14,15] and learning strategies [3,11,16]. The former aims at making the model automatically summarize the essential laws of the objects from a large amount of augmented data. The latter forces the model to learn domain-invariant features through special learning strategies such as adversarial learning, meta-learning and self-supervised learning. Although these approaches achieved promising results, they did not pay enough attention to the generalization needs of the classifier.

A recent study [18] pointed out that tuning the classifier can effectively improve accuracy on out-of-distribution data when the features are well trained. This implies that a classifier trained only on the source domains is not sufficient for out-of-domain data. To enhance the robustness of the classifier, Li et al. [2] design multiple independent networks and introduce interactive training to eliminate the effect of domain shift. Dou et al. [12] use meta-learning and constrain label distribution consistency to strengthen the classifier. However, their methods require complex network modules and introduce significant additional computational costs, which hinder their practicality and versatility.

In this paper, we present a novel method named Implicit Domain Augmentation (IDA) to mitigate the aforementioned problems. The basic idea of IDA is to expand the perceptual horizon of the classifier by increasing the diversity of source domains. To instantiate this idea, we perform style transfer on the features to simulate the latent unknown domains. It has been known that feature statistics, i.e., mean and standard deviation, represent the style information of an image [19]. Therefore, style transfer is accomplished by changing the feature statistics. We model the feature statistics with a multivariate Gaussian distribution to reveal the direction of meaningful change. Meanwhile, we adopt a memory bank to estimate the variance of the distribution more accurately. Then new styles are sampled from the distribution and equipped with the original features to augment the source domains. To avoid explicitly generating new samples, we train the network by minimizing an upper bound of the expected cross-entropy loss on the augmented feature set. IDA is essentially a more robust loss function, which works in a plug-and-play manner.

The primary contributions of this paper are:
– We investigate the DG problem explicitly focusing on the generalization ability of the classifier, which has not been fully explored yet.
– A novel DG method named Implicit Domain Augmentation (IDA) enforces cross-domain robustness by adding domain-disturbed regularization to the classifier.
– The proposed IDA is a plug-and-play method. It can be easily applied to any other DG methods, which introduces negligible extra computational cost.


– The experimental results on category classification and instance retrieval verify the effectiveness of the proposed IDA.

2 Related Work

To cope with the degradation of model performance due to domain shift, domain generalization learns a sufficiently robust model on several source domains to generalize to an invisible target domain. Over the past decade, a plethora of approaches have been developed. The core ideas of these approaches can be summarized into two categories: 1) learning domain invariant features; 2) classifier regularization. Learning domain invariant features aims at making the model extract features that contain more essential information about the object and less redundant information that is not useful for recognition. The existing methods are mainly divided into domain augmentation [1,7–9,14,15] and learning strategies [3,4,10,11,16,17].

Domain Augmentation. Domain augmentation is a plain method that expects the model to automatically summarize the essential features of the object from a huge amount of data. To achieve this goal, Zhou et al. [7] design a generative network and force the model to output image-level samples that are distinct from the source domains by adversarial training. Yue et al. [8] use an auxiliary dataset to randomize the visual appearance style of the source domain images; pyramidal learning is then introduced to enhance domain consistency. In contrast, Zhou et al. [1] resort to feature-level augmentation to explore potential domains by performing mixup on the feature statistics. Further, Li et al. [9] extend the domain diversity using Gaussian modeling. Our approach is also based on the motivation of domain augmentation. However, the difference is that IDA is designed for classifier regularization and does not require the generation of explicit augmented samples, avoiding high additional computational costs.

Learning Strategies. Many studies resort to special learning strategies, such as adversarial learning and meta-learning, to guide the model to learn domain invariant features. For example, Matsuura et al. [4] assign pseudo-domain labels to samples by unsupervised clustering and then utilize adversarial learning to extract domain-invariant features. Motiian et al. [10] introduce contrastive learning to shorten the intra-class distance, enabling a tighter semantic feature distribution. Balaji et al. [3] propose a new meta-learning based regularization method. Carlucci et al. [11] introduce a self-supervised learning task, jigsaw puzzles, to help the model understand the picture content.


Fig. 1. An overview of IDA. The network consists of a feature extractor and a linear classifier. After the feature extractor maps images into deep feature space, we transfer the styles of the features to augment the source domains. The potential new samples form an augmented set. To avoid extra computational costs, the network is trained by minimizing an upper bound of expected cross-entropy loss on the augmented set.

Although feature learning effectively mitigates the problem of domain shift, the weak classifier may limit the performance. Classifier regularization aims to train a robust classifier, preventing it from overfitting on the source domains.

Consistency Constraints. Many studies regularize the classifier by constraining the consistency of outputs for samples from different domains. To name a few, Dou et al. [12] leverage the KL divergence to align the softmax outputs of similar samples from different source domains. Li et al. [2] design separate feature extractors and classifiers for each domain, and then reduce the sensitivity of the classifiers to domain perturbations by interactive training. In addition, other works [3,6,13] also use consistency constraint techniques in their methods. Despite the good results of these methods, customized auxiliary network modules and complex learning processes hinder their practicality and versatility. Instead, our method is a plug-and-play regularization method driven by domain augmentation. Simply replacing the cross-entropy loss with the proposed IDA can improve model performance.

3 Methodology

An overview of our method is depicted in Fig. 1. The network is decomposed into a feature extractor and a linear classifier. In the deep feature space, the styles of features are transferred to simulate unseen domains. The samples with new styles form an augmented set. Instead of explicitly generating samples, we optimize the network by minimizing an upper bound of the expected cross-entropy loss on the augmented set.

3.1 Preliminaries

A deep neural network consists of a feature extractor F_Ψ and a linear classifier T_Θ, where Ψ and Θ are their parameters respectively. x ∈ R^{L×H×W} is the feature extracted by F_Ψ, with L, H, W representing the channel dimension, height and width. The style of an image can be quantified as the feature statistics:

μ(x) = \frac{1}{HW} \sum_{h=1}^{H} \sum_{w=1}^{W} x,    (1)

σ(x) = \sqrt{\frac{1}{HW} \sum_{h=1}^{H} \sum_{w=1}^{W} (x − μ(x))^2},    (2)

where μ(x), σ(x) denote channel-wise mean and standard deviation respectively. AdaIN [19] achieved arbitrary style transfer by assigning the feature statistics of instance y to instance x:

AdaIN(x) = σ(y) \frac{x − μ(x)}{σ(x)} + μ(y).    (3)
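A minimal PyTorch-style sketch of these channel-wise statistics and of the AdaIN transfer in Eq. (3); the tensor layout (batch, channels, height, width) and the small epsilon for numerical stability are assumptions added for illustration.

import torch

def channel_stats(x, eps=1e-6):
    # x: (N, L, H, W) feature map; channel-wise mean/std as in Eqs. (1)-(2)
    mu = x.mean(dim=(2, 3), keepdim=True)
    sigma = ((x - mu) ** 2).mean(dim=(2, 3), keepdim=True).add(eps).sqrt()
    return mu, sigma

def adain(x, y):
    # Eq. (3): re-stylize x with the feature statistics of y
    mu_x, sigma_x = channel_stats(x)
    mu_y, sigma_y = channel_stats(y)
    return sigma_y * (x - mu_x) / sigma_x + mu_y

x, y = torch.randn(2, 64, 7, 7), torch.randn(2, 64, 7, 7)
out = adain(x, y)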

3.2 Style Augment

AdaIN [19] reveals that styles can be augmented by transferring the mean μ and the standard deviation σ of the feature channels. A key point is to find a reasonable transfer direction. Arbitrary transfer may lead to the destruction of semantic information and thus make the training process uncontrollable. To remedy this issue, we empirically use the variance to indicate the direction of transfer. Our motivation is to automatically summarize the variation pattern by counting the range of variation of style in the source domains. If there are S samples, the range of variation in style is expressed as

Σ(μ) = \frac{1}{S} \sum_{S} (μ(x) − E[μ(x)])^2,    (4)

Σ(σ) = \frac{1}{S} \sum_{S} (σ(x) − E[σ(x)])^2,    (5)

where E[·] is the mean value. Σ(μ) ∈ R^C and Σ(σ) ∈ R^C provide a reasonable range of variation, minimizing the risk of meaningless or harmful style transfer.

Memory Bank. A naive approach to calculate Σ(μ) and Σ(σ) is storing feature statistics of the total training dataset. It is resource-consuming and outdated features may lead to inaccurate results. Considering the practicality and accuracy, we adopt two memory banks to cache μ and σ of the latest S samples, i.e. S is the size of memory banks. When the banks are full, the data in which will be updated on a first-in-first-out basis.
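A rough sketch of such a memory bank for the style statistics, assuming a fixed-size first-in-first-out buffer; the class name and interface are illustrative, not the authors' code.

from collections import deque
import torch

class StyleBank:
    """FIFO cache of the channel-wise means (or stds) of the latest S samples."""
    def __init__(self, size=2000):
        self.buf = deque(maxlen=size)        # oldest entries are dropped automatically
    def push(self, stats):                   # stats: (N, C) per-sample channel statistics
        for s in stats:
            self.buf.append(s.detach())
    def variance(self):
        # Eq. (4)/(5): variance of the cached statistics over the S stored samples
        stacked = torch.stack(list(self.buf))            # (S, C)
        return ((stacked - stacked.mean(dim=0)) ** 2).mean(dim=0)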

3.3 Augmented Features

To instantiate the process of sample generation, we model the style with a multivariate Gaussian distribution whose variances are Σ(μ) and Σ(σ). New styles can then be obtained by sampling from this distribution:

β(x) ∼ N(μ(x), λΣ(μ)),    (6)

γ(x) ∼ N(σ(x), λΣ(σ)),    (7)

where β(x), γ(x) denote the new style representation and λ is a coefficient that controls the strength of augmentation. To alleviate the effect of the deviation of the variance estimate in the early training stage, we set λ = \frac{t}{T} λ_0, where t and T represent the current training epoch and the total number of epochs respectively; λ_0 is a constant and fixed to 1.0 in this paper.

We first de-stylize the original features to obtain the normalized features,

x̃ = \frac{x − μ(x)}{σ(x)},    (8)

and then equip the normalized features with the new styles to obtain the augmented features,

X = γ(x) x̃ + β(x).    (9)

The augmented features are fed into the classifier after being pooled and flattened.
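A minimal sketch of Eqs. (6)–(9), assuming sigma_mu and sigma_sigma are the variance estimates maintained by the memory banks above (names are illustrative). Note that this shows the explicit sampling that the paper avoids at training time; it is included only to make the augmentation itself concrete.

import torch

def augment_style(x, sigma_mu, sigma_sigma, lam):
    # x: (N, C, H, W) features; sigma_mu / sigma_sigma: (C,) variances from Eqs. (4)-(5)
    mu = x.mean(dim=(2, 3), keepdim=True)
    sigma = ((x - mu) ** 2).mean(dim=(2, 3), keepdim=True).sqrt() + 1e-6
    x_norm = (x - mu) / sigma                                                # Eq. (8): de-stylize
    beta = mu + (lam * sigma_mu).sqrt().view(1, -1, 1, 1) * torch.randn_like(mu)        # Eq. (6)
    gamma = sigma + (lam * sigma_sigma).sqrt().view(1, -1, 1, 1) * torch.randn_like(sigma)  # Eq. (7)
    return gamma * x_norm + beta                                             # Eq. (9): re-stylize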

3.4 Implicit Domain Augmentation

According to Eq. (9), augmented samples can be obtained by repeatedly sampling γ(x) and β(x). However, high computational costs are introduced when the number of augmented samples is large. To avoid this problem, following [5], we achieve augmentation implicitly through a novel loss function instead of explicitly generating samples. Specifically, suppose we sample M times; an augmented feature set \{(X_i^1, y_i), (X_i^2, y_i), ..., (X_i^M, y_i)\}_{i=1}^{N} is obtained, where y denotes the labels and N is the batch size. Then we feed the augmented feature set into the classifier T_Θ and compute the cross-entropy loss:

L_M(Ψ, W, b) = \frac{1}{N} \sum_{n=1}^{N} \frac{1}{M} \sum_{m=1}^{M} − \log \frac{e^{w_{y_n}^T X_n^m + b_{y_n}}}{\sum_{j=1}^{C} e^{w_j^T X_n^m + b_j}},    (10)

where W = [w_1, ..., w_C]^T ∈ R^{C×K} and b = [b_1, ..., b_C]^T ∈ R^C are the weight matrix and biases of the classifier, and C represents the number of classes. When M approaches infinity, the expectation of Eq. (10) can be computed:

\lim_{M→+∞} L_M(Ψ, W, b) = \frac{1}{N} \sum_{n=1}^{N} E_{X_n}\left[ − \log \frac{e^{w_{y_n}^T X_n + b_{y_n}}}{\sum_{j=1}^{C} e^{w_j^T X_n + b_j}} \right].    (11)

However, it is difficult to work out this result directly. Alternatively, we minimize an upper bound of Eq. (11) to train the network. Since the logarithmic function log(·) is concave, Jensen's inequality E[\log X] ≤ \log(E[X]) gives:

L_∞ = \frac{1}{N} \sum_{n=1}^{N} E_{X_n}\left[ \log \sum_{j=1}^{C} e^{(w_j^T − w_{y_n}^T) X_n + (b_j − b_{y_n})} \right]    (12)

    ≤ \frac{1}{N} \sum_{n=1}^{N} \log E_{X_n}\left[ \sum_{j=1}^{C} e^{(w_j^T − w_{y_n}^T) X_n + (b_j − b_{y_n})} \right]    (13)

    = \frac{1}{N} \sum_{n=1}^{N} \log\left( \sum_{j=1}^{C} e^{ξ + ξ'} \right),    (14)

where ξ = (w_j^T − w_{y_n}^T) x_n + (b_j − b_{y_n}) and ξ' = \frac{λ}{2} (w_j^T − w_{y_n}^T)(x̃_n^T Σ(σ) x̃_n + Σ(μ))(w_j − w_{y_n}). Equation (14) follows from the moment-generating function E[e^{ta}] = e^{tμ + \frac{1}{2} σ^2 t^2}, a ∼ N(μ, σ^2), since X_n in Eq. (14) can be seen as a Gaussian random variable, i.e., X_n ∼ N(x_n, x̃_n^T Σ(σ) x̃_n + Σ(μ)). So we have:

(w_j^T − w_{y_n}^T) X_n + (b_j − b_{y_n}) ∼ N\big((w_j^T − w_{y_n}^T) x_n + (b_j − b_{y_n}),\ λ (w_j^T − w_{y_n}^T)(x̃_n^T Σ(σ) x̃_n + Σ(μ))(w_j − w_{y_n})\big).

We can observe that ξ is the cross-entropy loss term and ξ' is the regularization term. The proposed IDA is therefore a surrogate of the cross-entropy loss that achieves domain augmentation implicitly.
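A rough PyTorch sketch of the surrogate loss in Eq. (14), assuming pooled per-sample features x, their de-stylized version x_tilde, the classifier weights/biases, and the memory-bank variances. This follows the ISDA-style implementation idea; interpreting the augmentation covariance as diagonal (channel-wise) and the tensor bookkeeping are assumptions, not the authors' code.

import torch
import torch.nn.functional as F

def ida_loss(x, x_tilde, labels, W, b, sigma_mu, sigma_sigma, lam):
    # x, x_tilde: (N, D); W: (C, D); b: (C,); sigma_*: (D,) bank variances
    logits = x @ W.t() + b                                    # plain logits w_j^T x + b_j
    w_y = W[labels]                                           # (N, D) weights of the true classes
    diff = W.unsqueeze(0) - w_y.unsqueeze(1)                  # (N, C, D): w_j - w_{y_n}
    cov = x_tilde.pow(2) * sigma_sigma + sigma_mu             # (N, D) diagonal covariance assumption
    xi_prime = 0.5 * lam * (diff.pow(2) * cov.unsqueeze(1)).sum(-1)   # (N, C); zero at j = y_n
    # Since xi'_{y_n} = 0, cross-entropy on the shifted logits equals Eq. (14) averaged over the batch.
    return F.cross_entropy(logits + xi_prime, labels)

labels = torch.randint(0, 7, (8,))
loss = ida_loss(torch.randn(8, 512), torch.randn(8, 512), labels,
                torch.randn(7, 512), torch.zeros(7),
                torch.rand(512), torch.rand(512), lam=1.0)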

4 Experiments

4.1 Generalization for Category Classification

Dataset and Setup. We use the standard benchmark PACS [20] to evaluate our method. PACS covers 7 categories with a total of 9991 images and consists of four domains, Photo, Art-painting, Cartoon and Sketch. We train 30 epochs with ResNet-18 and ResNet-50 [21] pretrained on the ImageNet. The learning rate is 0.001 and decayed by 0.1 at 80% of the total epochs. The batch size is set to 64. The optimizer is SGD with a momentum of 0.9 and weight decay of 5e−4. Source domains are split into a training set and a validation set in a ratio of 9 to 1. The size of the memory bank is 2000. Following prior studies [11,20], we use the leave-one-domain-out protocol which means one domain is selected as the target domain while the remaining domains are source domains for training.
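For reference, a minimal sketch of the optimizer and schedule described above (SGD with momentum 0.9 and weight decay 5e-4, learning rate 0.001 decayed by 0.1 at 80% of the 30 epochs); the model construction and data loading are omitted, and this is an illustrative configuration rather than the authors' released code.

import torch
import torchvision

model = torchvision.models.resnet18(pretrained=True)    # ImageNet-pretrained backbone
optimizer = torch.optim.SGD(model.parameters(), lr=0.001, momentum=0.9, weight_decay=5e-4)
total_epochs = 30
scheduler = torch.optim.lr_scheduler.MultiStepLR(
    optimizer, milestones=[int(0.8 * total_epochs)], gamma=0.1)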


Results. The results are reported in Table 1 and Table 2. The proposed IDA improves over the baseline method by 1.81% (ResNet-18) and 1.33% (ResNet-50) on average. Except for Photo as the target domain with the ResNet-18 architecture, our approach outperforms the baseline method on all domains. In particular, IDA significantly improves the classification accuracy for Sketch, which has a large domain gap with the source domains, demonstrating the strong generalization ability of our method. Furthermore, we plugged IDA into the SOTA DG method and boosted its performance. This demonstrates that our method is superior to the cross-entropy function, and that combining it with other methods can yield further improvements.

Table 1. Results of category classification task on PACS with ResNet-18. The random shuffle version of MixStyle is reported for fair comparisons.

Method | Art | Cartoon | Photo | Sketch | Avg
MASF [12] | 80.29 | 77.17 | 94.99 | 71.69 | 81.04
Epi-FCR [2] | 82.10 | 77.00 | 93.90 | 73.00 | 81.50
L2A-OT [7] | 83.30 | 78.20 | 96.20 | 73.60 | 82.80
JiGen [11] | 79.42 | 75.25 | 96.03 | 71.35 | 80.51
MetaReg [3] | 83.70 | 77.20 | 95.50 | 70.30 | 81.70
EISNet [17] | 81.89 | 76.44 | 95.93 | 74.33 | 82.15
MixStyle [1] | 82.30 | 79.00 | 96.30 | 73.80 | 82.80
DSU [9] | 83.60 | 79.60 | 95.80 | 77.60 | 84.10
SagNet [13] | 83.58 | 77.66 | 95.47 | 76.30 | 83.25
SFA-A [15] | 81.20 | 77.80 | 93.90 | 73.70 | 81.70
ISDA [5] | 79.05 | 74.91 | 95.73 | 73.24 | 80.73
ResNet-18 | 78.39 | 75.28 | 95.81 | 71.06 | 80.14
+ IDA | 80.71 | 76.61 | 95.71 | 74.77 | 81.95
FACT [6] | 85.37 | 78.38 | 95.15 | 79.15 | 84.51
+ IDA | 85.06 | 78.50 | 95.32 | 79.64 | 84.63

4.2 Generalization for Instance Retrieval

Dataset and Setup. Person re-identification (re-ID) aims to match people across disjoint camera views. Since each camera can be viewed as a distinct domain, person re-ID is essentially a cross-domain problem. To validate the generalization ability of the proposed IDA for instance retrieval, we use two common datasets, Market1501 [23] and Duke [22], for our experiments. One dataset is used for training while the other is used for testing. In this experiment, ResNet-50 is adopted as the backbone. We use the code based on [24].


Results. The results are shown in Table 3. It can be seen that IDA improves over the baseline by 0.8 (Market1501 → Duke) and 1.3 (Duke → Market1501) in mAP. We also compare with some common regularization methods, including RandomErase [25] and LabelSmoothing [26]. From the table, we can see that RandomErase is not beneficial for cross-domain generalization. LabelSmoothing brings very little improvement on Duke since it does not consider the domain shift problem. Instead, our method implicitly generates samples of potential domains, thus enabling the model to generalize to unknown domains.

Table 2. Results of category classification task on PACS with ResNet-50.

Method | Art | Cartoon | Photo | Sketch | Avg
MASF [12] | 82.89 | 80.49 | 95.01 | 72.29 | 82.67
MetaReg [3] | 87.20 | 79.20 | 97.60 | 70.30 | 83.60
EISNet [17] | 86.64 | 81.53 | 97.11 | 78.07 | 85.84
ResNet-50 | 86.72 | 78.56 | 97.62 | 76.24 | 84.79
+ IDA | 87.11 | 81.30 | 97.74 | 78.32 | 86.12
FACT [6] | 89.63 | 81.77 | 96.75 | 84.46 | 88.15
+ IDA | 89.73 | 82.78 | 96.82 | 85.55 | 88.72

Table 3. Results of instance retrieval task with ResNet-50.

Market1501 → Duke
Method | mAP | R1 | R5 | R10
ResNet-50 | 16.5 | 29.6 | 45.5 | 51.8
+ RandomErase [25] | 16.7 | 30.1 | 45.7 | 51.9
+ LabelSmoothing [26] | 16.6 | 30.5 | 45.3 | 52.3
+ IDA | 17.3 | 31.6 | 46.8 | 54.8

Duke → Market1501
Method | mAP | R1 | R5 | R10
ResNet-50 | 18.4 | 42.7 | 60.8 | 67.5
+ RandomErase [25] | 18.4 | 43.2 | 60.9 | 67.5
+ LabelSmoothing [26] | 19.0 | 44.1 | 61.8 | 69.1
+ IDA | 19.7 | 44.7 | 63.8 | 71.2

4.3 Further Analysis

Ablation Study. We conduct an extensive ablation analysis on PACS to study the effectiveness of each component in IDA. The results are shown in Table 4. The meanings of the main items are as follows: (1) w/o Σ(μ): β(x) is sampled from N(μ(x), 0) instead of N(μ(x), λΣ(μ)); (2) w/o Σ(σ): γ(x) is sampled from N(σ(x), 0) instead of N(σ(x), λΣ(σ)); (3) constant λ: λ is fixed to a constant instead of changing with the epoch. It can be observed that each component plays an active role.


Table 4. Ablation study results on PACS.

Method | Art | Cartoon | Photo | Sketch | Avg
ResNet-18 | 78.39 | 75.28 | 95.81 | 71.06 | 80.14
w/o Σ(μ) | 78.86 | 75.51 | 95.75 | 74.06 | 81.05
w/o Σ(σ) | 78.65 | 75.98 | 95.75 | 72.51 | 80.72
constant λ | 79.20 | 75.80 | 95.49 | 73.59 | 81.02
IDA | 80.71 | 76.61 | 95.71 | 74.77 | 81.95

Effect of Memory Bank Size. We perform an analysis to study the effect of the memory bank size. Figure 2 shows the results. We can see that the memory bank-based method outperforms the iterative method and the best results appear when the size is 2k or 4k. However, size and performance are not positively correlated: outdated features lead to greater statistical bias when the size is too large.

Fig. 2. Evaluation of the memory bank size. The horizontal axis represents the size of the memory bank, in thousands; a size of 0 represents using the iterative approach instead of a memory bank. (a) Results of the multi-source domain generalization task on PACS with ResNet-18. (b) Results of the instance retrieval task with ResNet-50. "D" and "M" denote Duke and Market1501 respectively.

5 Conclusion

In this paper, we propose a novel regularization technique called Implicit Domain Augmentation (IDA) for domain generalization (DG) to improve the robustness of the classifier. Our approach is based on the motivation of domain augmentation, which augments the source domain by generating valid diverse samples by performing style transfer in the feature space. To determine a meaningful transfer direction, we model the feature statistics with a Gaussian distribution and introduce memory banks to estimate the distribution variance. Finally, we utilize a cross-entropy expectation upper bound to optimize the network, avoiding the additional computational cost associated with explicit augmentation. Our approach can be easily applied to vanilla models and other DG methods to boost the performance. Experimental results in several tasks demonstrate the effectiveness of the proposed IDA.

References

1. Zhou, K., Yang, Y., Qiao, Y., Xiang, T.: Domain generalization with MixStyle. In: ICLR (2021)
2. Li, D., Zhang, J., Yang, Y., Liu, C., Song, Y.-Z., Hospedales, T.M.: Episodic training for domain generalization. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1446–1455 (2019)
3. Balaji, Y., Sankaranarayanan, S., Chellappa, R.: MetaReg: towards domain generalization using meta-regularization. Adv. Neural. Inf. Process. Syst. 31, 998–1008 (2018)
4. Matsuura, T., Harada, T.: Domain generalization using a mixture of multiple latent domains. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 07, pp. 11749–11756 (2020)
5. Wang, Y., Pan, X., Song, S., Zhang, H., Huang, G., Wu, C.: Implicit semantic data augmentation for deep networks. In: Advances in Neural Information Processing Systems, vol. 32, pp. 12635–12644 (2019)
6. Xu, Q., Zhang, R., Zhang, Y., Wang, Y., Tian, Q.: A Fourier-based framework for domain generalization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14383–14392 (2021)
7. Zhou, K., Yang, Y., Hospedales, T., Xiang, T.: Learning to generate novel domains for domain generalization. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12361, pp. 561–578. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58517-4_33
8. Yue, X., Zhang, Y., Zhao, S., Sangiovanni-Vincentelli, A., Keutzer, K., Gong, B.: Domain randomization and pyramid consistency: simulation-to-real generalization without accessing target domain data. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2100–2110 (2019)
9. Li, X., Dai, Y., Ge, Y., Liu, J., Shan, Y., Duan, L.: Uncertainty modeling for out-of-distribution generalization. In: International Conference on Learning Representations (2022). https://openreview.net/forum?id=6HN7LHyzGgC
10. Motiian, S., Piccirilli, M., Adjeroh, D.A., Doretto, G.: Unified deep supervised domain adaptation and generalization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5715–5725 (2017)
11. Carlucci, F.M., D'Innocente, A., Bucci, S., Caputo, B., Tommasi, T.: Domain generalization by solving jigsaw puzzles. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2229–2238 (2019)
12. Dou, Q., de Castro, D.C., Kamnitsas, K., Glocker, B.: Domain generalization via model-agnostic learning of semantic features. In: Advances in Neural Information Processing Systems, vol. 32, pp. 6450–6461 (2019)
13. Nam, H., Lee, H., Park, J., Yoon, W., Yoo, D.: Reducing domain gap by reducing style bias. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8690–8699 (2021)
14. Zhou, K., Yang, Y., Hospedales, T., Xiang, T.: Deep domain-adversarial image generation for domain generalisation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 07, pp. 13025–13032 (2020)


15. Li, P., Li, D., Li, W., Gong, S., Fu, Y., Hospedales, T.M.: A simple feature augmentation for domain generalization. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8886–8895 (2021)
16. Li, S., Song, S., Huang, G., Ding, Z., Wu, C.: Domain invariant and class discriminative feature learning for visual domain adaptation. IEEE Trans. Image Process. 27(9), 4260–4273 (2018)
17. Wang, S., Yu, L., Li, C., Fu, C.-W., Heng, P.-A.: Learning from extrinsic and intrinsic supervisions for domain generalization. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12354, pp. 159–176. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58545-7_10
18. Kumar, A., Raghunathan, A., Jones, R., Ma, T., Liang, P.: Fine-tuning can distort pretrained features and underperform out-of-distribution. arXiv:2202.10054 (2022)
19. Huang, X., Belongie, S.: Arbitrary style transfer in real-time with adaptive instance normalization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1501–1510 (2017)
20. Li, D., Yang, Y., Song, Y.-Z., Hospedales, T.M.: Deeper, broader and artier domain generalization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5542–5550 (2017)
21. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
22. Ristani, E., Solera, F., Zou, R., Cucchiara, R., Tomasi, C.: Performance measures and a data set for multi-target, multi-camera tracking. In: Hua, G., Jégou, H. (eds.) ECCV 2016. LNCS, vol. 9914, pp. 17–35. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-48881-3_2
23. Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J., Tian, Q.: Scalable person re-identification: a benchmark. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1116–1124 (2015)
24. Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2019)
25. Zhong, Z., Zheng, L., Kang, G., Li, S., Yang, Y.: Random erasing data augmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 07, pp. 13001–13008 (2020)
26. Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818–2826 (2016)

Unconstrained Feature Model and Its General Geometric Patterns in Federated Learning: Local Subspace Minority Collapse

Mingjia Shi, Yuhao Zhou, Qing Ye, and Jiancheng Lv(B)

College of Computer Science, Sichuan University, Chengdu 610065, China
{yeqing,lvjiancheng}@scu.edu.com

Abstract. Federated Learning is a decentralized approach to machine learning that enables multiple parties to collaborate in building a shared model without centralizing data, but it can face issues related to client drift and the heterogeneity of data. However, there is a noticeable absence of thorough analysis regarding the characteristics of client drift and data heterogeneity in FL within existing studies. In this paper, we reformulate FL using client-class sampling as an unconstrained feature model (UFM), and validate the soundness of the UFM in FL through theoretical proofs and experiments. Based on the model, we explore the potential information loss, the source of client drift, and general geometric patterns in FL, called local subspace minority collapse. Through theoretical deduction and experimental verification, we provide support for the soundness of the UFM and observe its predicted phenomenon, neural collapse.

Keywords: Federated Learning · Neural Network Theory · Neural Collapse · Information Bottleneck · Local Subspace Minority Collapse

1 Introduction

Federated Learning (FL) is an approach to machine learning that enables multiple parties, each holding private data, to collaborate in building a shared model without requiring data to be centrally stored or communicated [15]. This decentralized approach to training models allows for improved privacy and security while still providing the benefits of large-scale machine learning. By aggregating locally trained models across devices or servers, FL can result in more accurate and robust models that reflect the diversity of data across different devices or user populations. However, in classic FL settings, which involve private data protection and local training, client drift and data heterogeneity are inevitable issues [26,27]. Recently, insightful works have been proposed addressing issues of client drift and heterogeneous data in FL. To address the former, FedProx [13] uses a regularization term to limit how far local clients can drift from the global model


during local training. To further address the latter, many personalized FL methods have been proposed to find personalized models that fit local data, such as pFedMe [20,21], which, while using a regularization term, also re-models the local problem as a Moreau envelope. This provides further theoretical support compared to heuristic aggregation of personalized models. Furthermore, for large-scale training and larger models, Vit-FL [18] has been proposed to adapt to the needs of new DNN model structures [25]. However, these works do not deeply deconstruct the characteristics of client drift and data heterogeneity under FL.

In order to further promote the development of FL and meet its potential explainability needs, we introduce the theories of information and geometry into FL and further explore the FL-specific patterns of the client drift and data heterogeneity characteristics beyond classical deep learning (DL). In this paper, we use client-class sampling to reformulate FL as an unconstrained feature model (UFM) [16]. Based on this modeling, we theoretically prove that there is potential sampling information loss in FL and that the source of client drift is feature drift. We also demonstrate through experiments that client-class sampling information exists and that the neural collapse phenomenon predicted through the UFM does exist in FL, in a pattern similar to the one in DL, called local subspace minority collapse. This is encouraging and exciting because it validates the soundness of the UFM in FL, supporting the future and existing works [9,14] which presuppose that neural collapse exists in FL.

The contributions of this paper are as follows:
– We propose the first UFM in FL by extracting client-class sampling information, and validate the existence of this information in classic FL through experiments.
– Theoretically, we further discuss the potential information loss, the source of client drift, and the rationality of the UFM.
– Empirically, based on the proposed UFM, we observe and summarize the geometric characteristics of classical FL, called local subspace minority collapse and frame drift.

2 Preliminary and Related Work

Neural collapse (NC) is a geometric phenomenon observed in the final layer of DNNs, which occurs during the terminal phase of training (TPT) in DL. [17] For further research, we are trying to employ NC to support our analysis about FL. However, the prerequisites [10,11] for previous NC are vastly different from the assumptions and settings in FL (e.g., model aggregation, imbalanced and heterogeneous data). Therefore, it is necessary to study whether NC occurs in FL. If not, there might be a similar general pattern or phenomenon. Several geometric metrics have been introduced to identify the patterns and phenomenon. Encouragingly, these metrics characterize the relationships between features and classifiers, helping us find the general patterns. Moreover, we try to explain these observed results via Information Bottleneck (IB).


The NC phenomenon is that in the TPT, single-label supervised deep learning with a balanced dataset and sampling, the features of the last layer of each class collapse to the class-mean of the features (NC-1). Moreover, all the class-means collapse into an equiangular tight frame (ETF) with the global mean point as the center, which means that the angle between them is maximized (NC-2). For the classifier, the last layer linear classifiers match their class-means almost perfectly (NC-3) and are behaviorally equivalent to a nearest class-center decision rule (NC-4) [17]. One method motivated by NC proposes fixing a random simplex ETF geometric structure to serve as the local model for clients [9]. Another method motivated by NC proposes fixing the classifier as ETF similarly and finetuning the extractor and classifier alternatively [14]. These works motivated by NC do not focus on studying the NC phenomenon itself in FL. [9,14] These motivations presuppose that the NC phenomenon does exist in FL and performs in a similar way to how the NC-2 phenomenon occurs in FL settings. This highlights the necessity of exploring whether NC exists in FL. Additionally, we aim to investigate the geometry of FL in a broader field. The IB was introduced by [23,24] as a fundamental principle for understanding the behavior of deep neural networks. It shows that neural networks trained with gradient descent on a given dataset tend to converge to solutions that have a small number of highly informative features (the “bottleneck”), which capture the essence of the data and enable accurate prediction. This suggests that neural networks are able to learn efficient representations of complex data by discarding redundant or irrelevant information. IB divides the entire inference process into two parts: feature fitting and feature compression. In the fitting process, DNNs try to learn and discover the information and features hidden in the data. In the compression process, DNNs forget what they have learned. Furthermore, IB regards a DNN as a process of extracting information. Starting from the data, the amount of feature information is high in the lower layers of the network and decreases towards the higher ones. The pattern of the mutual information between features and labels during the training process of DNNs is to first increase and then decrease. 2.1

Normalized Gram Matrix and Generalized Coherence Estimate

Generalized coherence estimation (GCE) and normalized Gram matrix (NGM) are commonly used in the field of signal processing. [2,5,7,8,12,19] They contain information about the coherence between a set of vectors, where the value represents the strength of coherence between them. In this paper, we observe the class-wise and client-wise GCE statistics of classifiers, as well as the feature means in the FL process. xi ,xj  ˆ }i,j where The normalized Gram matrix is define as: G(X) := { ||xi ||||x j || X = [x1 , ..., xN ], each column xi is vector in a Hilbert space,  is the inner product and || · ||2 = ·, · is the norm induced by the inner product. The Gram matrix is G(X) := XT X.

452

M. Shi et al.

ˆ The GCE is define as: gce(X) := 1 − det(G(X)) The GCE captures the overall angular relationship between a set of basis vectors X, providing insight into how they are oriented with respect to the span space of the other vectors. The larger the value of GCE, the more reasonable it is for us to speculate that there is statistical independence and white Gaussian noise in this set of vectors. In terms of geometry, intuitively, NGM reflects the squared normalized volume of the high-dimensional parallelogram formed by this set of vectors, X ∈ Rd×N .

3 3.1

Reformulated FL: Unconstrained Feature Model Motivation of Reformulation

Previously, NC-related work discussed one or more special cases and proposed theoretical properties and phenomenological descriptions based on an UFM with different parameters and free feature solution spaces. However, the existence of NC phenomenon in FL is questionable because FL settings with non-IID data involve special settings, unbalanced sampling within clients, different distributions of similar data sources among clients, and client drift caused by limitations in data transmission. Traditional modeling based on client sampling cannot effectively describe the above three points. In client-side optimization, problems are often constructed using the expectation of loss on local data sampling or by directly using Empirical Risk Minimization (ERM). Meanwhile, in global optimization, problems are often constructed based on ERM or the expectation of loss on data sampling using client sampling. Moreover, even for IID FL, it is necessary to reconsider the combined effects of model aggregation solution space, client drift, and unbalanced sampling. The modeling approach based on client sampling and data sampling within clients cannot describe the relationship between the class mean of different features and the classifier in the NC phenomenon, so we need a new FL modeling. 3.2

Categories of FL

Our approach to FL categorization differs from other methods that are based on instance-attribute categories. [26,27] Specifically, we categorize FL into two types: class-homogenous clients and class-heterogeneous clients, which means Px|k = Px|k,i or not, where Px|k,i is the probability of unlabeled data x is sampled from k th -class of client ith (k ∈ [K], i ∈ [N ] for simplification.). For example, (x, k) can be used as a data sample for supervised learning, while x can be used as a data sample for unsupervised learning. 3.3

Deep Learning Formulation

General Formulation with Regularization. The population risk functional ˆ → R/R− is represented as: R:H

Unconstrained Feature Model and Local Subspace Minority Collapse in FL

453

 R(h) =

Rd

D(h(x), kx )P(dx)

ˆ is the function space (h ∈ H), ˆ k· is the P measurable label function where, H and D : RK ×RK → R/R− is a generic loss function that measuring the distance of its inputs. To parameterize, we set h(x; W, b, W − ) = W H(x; W − ) + b, where H : RD → F R is the feature extractor of the final layer of a DNN parameterized by W − . Thus, the population risk of classic classification formulation in DL is: min Ex,k [D(W H(x; W, b, W − ), k)] + λR R(W, b, W − )

W,b,W −

where λR R(·) is the regularization term of parameters. We highlight a commonly used trick here, weight decay (WD), as λR Rwd (·) = λ2· || · ||2F . Unconstrained Feature Models. The nonlinearity within DNNs and their inter-layer interactions pose many challenges to analyzing the behavior and performance of DNNs. Fortunately, the neural networks used in recent years are typically over-parameterized and can approximate almost any continuous function in the function space involved. Additionally, the phenomenon of NC only concerns the geometric properties of the final layer of the neural network. Given these two factors, an intuitive but effective way to analyze DNNs is to view the final features as free variables and include them as part of the optimization problem’s solution space (unconstrained feature model). [16] Next, the process from issues related to using WD in DL to UFM is discussed. As the Lagrangian dual, the regularization term λ2W ||W − ||2 is a relaxation of the constrain ||W − ||2 ≤ CW − (C· is any constant about ·). The DL problem with WD tunes into: min Ex,k [D(W H(x; W − ) + b, k)],

W,b,W −

s.t.||W ||2 ≤ CW , ||W − ||2 ≤ CW − , ||b||2 ≤ Cb .

(1)

To apply UFM, we equivalently write Equation (1) as: min Ex,k [D(W Hx + b, k)],

W,b,Hx

s.t.||W ||2 ≤ CW , ||b||2 ≤ Cb , Hx ∈ {H(x; W − ) : ||W − ||2 ≤ CW − }

(2)

For simplification, the same ansatz in [4] is made that the range of H(x; W − ) under the constraint ||W − ||2 ≤ CW − is approximately an ellipse in the sense that: {H(x; W − ) : ||W − ||2 ≤ CW − } ≈ {Hx : ||Hx ||2 ≤ CH }, which makes sense due to the self-duality of each classifier and feature, i.e., NC-3.

454

3.4

M. Shi et al.

FL Modeling with Client-Class Sampling

Quantities in Client-Class Sampling. The main relevant quantities are as follows: – Client-class sampling matrix: PN,K = {Pi,k }i∈[N ],k∈[K] ∈ RN ×K . – Local class sampling matrix: PK|N = {Pk|i }i∈[N ],k∈[K] ∈ RN ×K . – Client sampling vector: PN = [Pi ]i∈[N ] ∈ RN . Random Factor Extraction: Local Basis and Global Solution. To build a FL model that uses multi-model averaging as a global aggregation solution, we gradually extract the random factors of the problem. First, we extract the random factors of the client sampling, as shown below: ¯ = Ei Wi = WPN – W = {Wi }i∈[N ] is a set of basis (third-order tensor) and W is on the basis with coordinates PN . – B = {bi }i∈[N ] is a set of basis and ¯b = Ei bi = BPN is a point on the basis with coordinates PN . ¯ − = Ei W − = W − PN is a point on – W − = {Wi− }i∈[N ] is a set of basis and W i the basis with coordinates PN . – HN,K (·) = {H·,i := H(·; Wi− )}i∈[N ],k∈[K] , H for simplification. ¯ · := H(·; W ¯ − )}i∈[N ],k∈[K] , H ¯ for simplification. ¯ N,K (·) = {H – H To extract random factor of WD, we have: λW ¯ 2 λW ¯ − 2 λb ¯ 2 ||W || + ||W || + ||b|| 2 2 2 λW λW λb − 2 2 ||Wi || + ||Wi || + ||bi ||2 ], ≤ Ei [ 2 2 2

(3)

which is a upper bound of WD regularization, constructed by basis. Next, we extract the class-wise sampling on given client. First, we define the random element of class-wise sampling and its statistics, which corresponds to the local loss function by class for specific local loss. By combining client and on-client class sampling, we obtained the population risk of FL as shown below: ¯ := PN,K , LN,K (W, B, H) ¯ min L(W, B, H)

¯ W,B,H

(4)

¯ x + BPN , k)}i∈[N ],k∈[K] . By sampling ¯ := {Ex|i,k D(WPN H where LN,K (W, B, H) different classes on client side, we can return to the classical problem modeling based on client sampling in federated learning, as shown below: ¯ min PN , LN (W, B, H)

¯ W,B,H

where LN := {PK|i , Li,K }i∈[N ] , and M trxi,K (i ∈ [N ]) is the ith row of matrix M trxN,K .

Unconstrained Feature Model and Local Subspace Minority Collapse in FL

455

Client-Class Sampling FL with WD. Combining the above, UFM with client-class sampling and WD, we obtain the following optimization objective function of FL and its upper bound, as shown below (with convexity of || · ||2 ). ¯ 2 + λW ||WPN ||2 + λb ||BPN ||2 ¯ + λH ||H|| min L(W, B, H) ¯ 2 2 2 W,B,H λ λ ¯ x ||2 + W ||Wi ||2 + λb ||bi ||2 ] ¯ + E[ H ||H ≤ L(W, B, H) 2 2 2

(5)

¯ := Ex [H ¯ x := H(x; W ¯ )] is the expectation (mean) of global feawhere H ture. Note that the final layer regularization term of the global classifier λb λW 2 2 2 ||WPN || + 2 ||BPN || is upper-bounded with the expectation of client samλ λW pling Ei [ 2 ||Wi ||2 + 2b ||bi ||2 ], as part of Equation (3), which can be obtained in a local epoch. However, the feature regularization term cannot do so due to the expectation being of data sampling, thus failing to decouple the dependency of client sampling. Sampling Noise and Empirical Risk Minimization. To reduce computational costs and further analyze the actual optimization process, a common technique is to use stochastic gradient descent and data sampling for modeling, and convert the expected loss into empirical loss. To meet above, we divide the data sampling into three parts: client sampling (Ei ), class sampling (Ek|i ), and client-class source sampling (Ex|i,k ). The first two parts involve uniform sampling and are well-known. For the latter, we perform uniform sampling on the data in the given dataset, whether it is full or mini-batch. Note full data in FL system is d, the data on the ith client is di = {(x, k)} ˆ (ˆ ˆ ∈ di }. The empirical x : k = k, x, k) and the data of the k th class in di is di,k = {ˆ risk minimization (ERM) is as shown below: min

¯ ,¯ ¯x W b,H

 |di |  |di,k |  1 ¯H ¯ x + ¯b, k) + R] [D(W |d| |di | |di,k |

i∈[N ]

k∈[K]

x∈di,k

where R is the regularization term. This is equivalent to the classic problem formulation on data sampling, but client-class sampling and UFM are highlighted. 3.5

Observations and Insights

By emphasizing the different samplings and working in a Hilbert space with the norm induced by its inner product, we can predict certain phenomena that occur in the TPT through the modeling of the optimization problem. This also allows us to understand the properties of classifiers and unconstrained feature representations near the optimal solution. Note that, in order to discuss the problem more generally and fundamentally, we do not fix a specific loss function $D$, inner product $\langle\cdot,\cdot\rangle$, or norm $\|\cdot\|^2 = \langle\cdot,\cdot\rangle$.

Observation 1. The sampled loss function may lose the sampling information.


This is a pattern that should be relatively easy to notice, but it is often overlooked. In practical calculations, we usually define the inner product in Euclidean space as $\langle\xi,\cdot\rangle = \cos\angle(\xi,\cdot)\times\|\cdot\|\times\|\xi\|$. We may only want to reduce the norm $\|\cdot\|$ of the loss and not consider reducing the impact of $\cos\angle(\xi,\cdot)$, especially under uniform sampling. In DL, unlike FL, we do not consider the information loss between the sampling and the sampled loss, even when the loss is small and the angle has a significant impact. Nor do we care whether sampling causes the loss values and the probability distribution to become orthogonal, or whether it causes information loss leading to performance degradation. In FL, we should. A general consensus related to tensor geometry is that, in an $m$-dimensional space, complete information about an $n$-dimensional geometry requires an $m$-dimensional $n$th-order tensor. From a geometric perspective this makes sense: the vector of data sampling is a high-dimensional first-order tensor, and its high-dimensional and higher-order information is projected onto a one-dimensional space (a line), so only its numerical value is considered and not the angles of its components with respect to each other, which implies information loss. Note that, in Equation (4), the information of $P_{N,K}$ is second-order, while the classic client-sampling information $P_N$ is first-order. In FL, we should therefore consider preserving sampling information, especially when studying optimization and client-sampling strategies based on local loss functions. This appears to be an unavoidable, sampling-sensitive phenomenon: in the TPT, when the loss value is small, the objective and the sampling tend to be orthogonal, due to the sampling itself. The corresponding solutions are discussed in Sect. 6, and an empirical observation is given in Sect. 4.

Observation 2. Client drift originates from feature drift.

By categorizing the parameters as in UFM, we observe from the optimization process that the source of client drift lies in the final feature extractor rather than in the classifier, as shown in Theorem 1. With the correction made via the drift matrix in Theorem 1, we have
$$P_i\,\nabla_{\bar H_x} L(W,B,\bar H) = \big(\nabla_{W_i^-}\bar H_x\big)^\dagger\big(\nabla_{W_i^-} H_{x,i}\big)\,\nabla_{H_{x,i}} L(W,B,\bar H),$$

which represents the un-drifted gradient. If each local epoch consists of only one gradient step and this correction is applied, averaging aggregation would, in theory, not drift. However, the classic and idealized setting is to take $(\nabla_{W_i^-}\bar H_x)^\dagger(\nabla_{W_i^-}H_{x,i}) = I_m$, the identity matrix. This leads to the first client drift, and it provides a metric for evaluating client drift other than the local loss value. However, this metric requires an additional computational cost, and it becomes expensive when the DNN is deep and the feature dimension is high.

Observation 3. The feature mean can be controlled by the classifier.


When weight decay is used, an upper bound on the norm of the features near the minimum is the client-sampling expectation of the norm of the local classifiers. Note that the origin of the regularizer term for features in Eq. (5) depends on the assumption that they lie within an ellipse. This still differs somewhat from actual DNN training, as we typically adjust parameters through backpropagation rather than adjusting features directly. However, with Theorem 2, near the optimum of UFM, robust optimization can be obtained by controlling only the parameters of the final classifier.

Table 1. Feature means, organized by whether the summation is taken over classes (↑) and over clients (→): Global Mean, Local Mean, Global Class-Mean, Local Class-Mean.

3.6 Related Theorems

Theorem 1. (First Drift in Local Training) In FL in the form of Eq. (4), if averaging $\bar\cdot = \mathbb{E}_i\,\cdot_i$ is the client aggregation operator, the client drift caused by the partially local training originates from the loss gradient with respect to the local feature $H_{x,i}$. The drift matrix is $(\nabla_{W_i^-}\bar H_x)^\dagger(\nabla_{W_i^-}H_{x,i})$, where $\dagger$ denotes the generalized inverse operator.
$$\nabla_{W_i} L(W,B,\bar H) = P_i\,\nabla_{\bar W} L(W,B,\bar H)$$
$$\nabla_{b_i} L(W,B,\bar H) = P_i\,\nabla_{\bar b} L(W,B,\bar H)$$
$$\nabla_{H_{x,i}} L(W,B,\bar H) = P_i\,\nabla_{\bar H_x} L(W,B,\bar H).$$

Theorem 2. (Feature Mean Bound around Minimum) The feature norm satisfies the following inequality in the vicinity of the optimal value of UFM with WD:
$$\frac{\lambda_H}{\lambda_W}\|\bar H\|_F^2 \;\le\; \mathbb{E}\|W_i\|_F^2.$$

Theorem 3. (Feature Class-Mean Pair and Angle) The angle between the feature class-means of each pair of locally sampled classes satisfies
$$\kappa^{-1}\cos\angle\big(\bar H_{i,k_1}-\mathbb{E}_{k|i}\bar H_{i,k},\; \bar H_{i,k_2}-\mathbb{E}_{k|i}\bar H_{i,k}\big) = P_{1,2|i}\big\langle \bar H_{i,k_1}, \bar H_{i,k_2}\big\rangle - P_{1,2|i}\,\mathbb{E}_{k|1,2}\|\bar H_{i,k}\|^2 - P'_{1,2|i}\big\langle \bar H_{i,k_1}+\bar H_{i,k_2},\; \mathbb{E}'_{k|1,2,i}\bar H_{i,k}\big\rangle + \|\mathbb{E}_{k|i}\bar H_{i,k}\|^2,$$
where $P_{1,2|i}$ is the probability of being sampled within a given class-pair set $\{k_1,k_2\}$ (indexed as 1st and 2nd for distinction) on client $i$, $P'_{1,2|i}$ is the probability of not being sampled in the pair, and $\kappa^{-1} = \|\bar H_{i,k_1}-\mathbb{E}_{k|i}\bar H_{i,k}\|\times\|\bar H_{i,k_2}-\mathbb{E}_{k|i}\bar H_{i,k}\|$. Note that we have $\mathbb{E}_{k|i}\bar H_{i,k} = P'_{1,2|i}\,\mathbb{E}'_{k|1,2,i}\bar H_{i,k} + P_{1,2|i}\,\mathbb{E}_{k|1,2,i}\bar H_{i,k}$.

4 Experiments

4.1 Where's Client-Class Sampling Information Found in FL?

To verify the validity of the proposed modeling and to check whether the client-class sampling information can be observed in experiments, we conduct some explorations. Encouragingly, we find that many client-wise sets of vectors contain client-class sampling information, concerning both the classifiers and the features (Fig. 3 serves as the ground truth):
– The set of vectors from the global classifier $W P_N$ to each basis in $W$. The NGM $\hat G(W - W P_N \mathbf{1}_N^T)$ is shown in Fig. 1.
– For a given public class, the set of vectors from each feature mean bolded in Table 1 to the local class-mean of the given 0th class. Their NGMs are shown in Fig. 2.

Fig. 1. The visualization of $\hat G(W - W P_N \mathbf{1}_N^T)$ at the 50th, 200th and 2400th (final) global epoch on MNIST
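As a rough illustration of the normalized Gram matrix (NGM) statistic used above, the sketch below computes the Gram matrix of a set of client-wise vectors and normalizes it by the vector norms; the function name `normalized_gram_matrix`, the toy input, and the exact normalization are our own assumptions and may differ from the authors' implementation.

```python
import numpy as np

def normalized_gram_matrix(V):
    """NGM of a set of row vectors V (N x d): pairwise normalized inner products."""
    G = V @ V.T                                      # Gram matrix of inner products
    norms = np.linalg.norm(V, axis=1, keepdims=True)
    return G / (norms @ norms.T + 1e-12)             # divide out the norm products

# Example: vectors from the global classifier to each local classifier.
# W_local is a hypothetical (N, d) stack of flattened local classifier weights.
rng = np.random.default_rng(0)
W_local = rng.normal(size=(5, 8))
W_global = W_local.mean(axis=0)                      # uniform client sampling
print(normalized_gram_matrix(W_local - W_global).shape)   # (5, 5)
```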

4.2 Local Subspace Minority Collapse: The Geometric Pattern in FL

One common geometric phenomenon in DL is NC. However, the geometry of FL cannot be described simply with NC, because the classical assumptions in previous NC-related work may not hold in the FL setting. In this section, we observe the geometry of features and classifiers in FL under UFM and provide support for NC-motivated FL methods. The UFM feature-space visualization of 3 classes is shown in Fig. 5. In DL with imbalanced data, the NC phenomenon observed under the UFM model manifests as Minority Collapse (MC). In [4], imbalanced data is described by dividing the data into two groups of classes, with inter-group sampling imbalance and intra-group balance. During the TPT, within the group containing the minority classes, the angles between the classifiers of the individual classes are equal and decrease as the imbalance ratio increases. In FL, MC can be more complex because local data typically cannot be divided easily into two balanced groups. The observations of MC with FL characteristics are shown in Fig. 4.


Fig. 2. The visualization of the NGM of the vector sets mentioned in Table 1 at the 2400th (final) global epoch on MNIST, with the given 1st public class (from top to bottom)

Fig. 3. The visualization of the client-class sampling matrix $P_{N,K}^T$ and its NGM $\hat G(P_{N,K}^T)$

4.3 Settings

The DNN models mentioned in this paper:
– DNN: a 2-layer DNN with a 100-unit hidden layer.
– DNN-A-B: a 3-layer DNN whose first and second hidden layers have A and B units, respectively.
– CNN: we use CifarNet [6] as our CNN model in this paper.
The datasets mentioned in this paper:
– FEMNIST: the dataset in the LEAF benchmark [1], a 198-client, 62-class classification task with a 10.1% aggregation ratio by default.
– FMNIST & MNIST: the datasets in [22], 100-client, 10-class classification tasks with a 20% aggregation ratio by default.
– CIFAR-10: the dataset in [3], a 20-client, 10-class classification task with a 20% aggregation ratio by default.

5 Analyses

5.1 Local Subspace Minority Collapse

Fig. 4. The local subspace MC phenomenon in FL: the visualization of the classifier on the 0/32/42/49th clients in the DNN-MNIST setting

Fig. 5. Neural Collapse in FL: the UFM visualization on the task CNN/CIFAR-10, generated with kernel-PCA

As shown in Fig. 4, the majority classes on the 0/32/42/49th clients are respectively {0}/{2, 3, 4, 9}/{2, 3, 4}/{1, 9}, and the entire parameter space of the classifier has been divided (with blank parts) into two parts: the majority-class and the minority-class classifiers. Local subspace MC: in the latter stages of training, after each round of local fine-tuning, the angle between the majority-class classifiers increases, as does the angle between the minority-class classifiers. Additionally, the changes in the angles between the minority-class classifiers are more random than those between the majority-class classifiers under a fixed sampling matrix. One possible explanation for this randomness is that, during local training, the proportion of intrinsic information in the data increases due to feature compression, while the proportion of sampling information decreases as it is forgotten. Hence, when adjusting the classifiers, less attention is paid to the information from minority-class data, as it may contain less intrinsic information. Another possible explanation is that the loss weight of the minority classes in the local objective is smaller, leading to an inadequate reduction in their loss during the earlier training process. As a result, they enter the feature compression stage later or more slowly than the majority classes, which means the angle between them takes longer to open up.


Fig. 6. The client-wise maximum/median/minimum of the cosine between each pair consisting of the vector from the global to the local classifier and the vector from the global to the local class-mean on a given $i$th client, $\cos\angle\big((W_i - W P_N),\ [(\mathbb{E}_{x|i,k} - \mathbb{E}_{x|k})\bar H_{i,k}(x)]_{k\in[K]}\big)$

As shown in Fig. 6, the number of local iterations affects the global process: the overall global features enter the feature compression stage earlier, and the prerequisite of UFM, i.e., a sufficient feature-representation capability of the over-parameterized DNN, means that adding more parameters does not change the overall geometric properties of federated learning. The classifiers of absent classes are almost completely unpredictable, because they have never been trained.

5.2 The Lost Sampling Information

Based on the empirical observations in Fig. 1 and Fig. 2, both sets of vectors mentioned in Sect. 4.1 contain sampling information: the vector set from the global to the local models, as well as the set from each feature mean to each local class-mean. However, as global training progresses, the inter-client NGM of the local class feature means increasingly highlights the client-class sampling information, while the NGM of the local models loses this information more and more. A straightforward inference is that more and more non-sampling information is being introduced, which means more randomness is being introduced as well. Furthermore, following the explanation in Sect. 5.1, this randomness may come from intrinsic information in the data, or from the unpredictable optimization landscape and algorithmic stochastic noise during imbalanced training.

5.3 The Frame Drift

Continuing from the conclusion drawn in Sect. 5.1, the noise produced by local training on the minority-class classifiers is certainly amplified when the client sampling probability is used as the model aggregation weight. This aggregation strategy may be the very reason for the phenomenon of FD. Specifically, classic FL aggregates via client sampling as $\bar W \leftarrow \mathrm{Aggregate}_{N,K}(W) := W P_N$, but it should be
$$\mathrm{Aggregate}_{N,K}(W) := (W P_{N,K}), \qquad \bar W^k = \mathbb{E}_{i,k} W_i^k,$$
where $\cdot^k$ is the $k$th covariant basis, or row vector, of $\cdot$. Take a binary classification problem with two clients as an example: $P_{i=0,K} = [9999, 1]$ and $P_{i=1,K} = [1, 99]$. In this case the client proportions are $P_{i=0} = 10000/10100 \approx 99\%$ and $P_{i=1} = 100/10100 \approx 1\%$. This means that the noise contributed by the 0th client to the classifier of the 1st class is amplified by a factor of about 10,000, which can interfere with the aggregation of this classifier.
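The sketch below contrasts the two aggregation rules on the worked example above; the helper names (`aggregate_by_client`, `aggregate_by_client_class`) are ours, and weighting each class row by the column-normalized client-class matrix is one plausible reading of the rule, not necessarily the authors' exact implementation.

```python
import numpy as np

# Hypothetical local classifiers: W_local[i] has one row per class k (here 2 classes, 2 dims).
W_local = np.array([[[1.0, 0.0], [5.0, 5.0]],   # client 0: its class-1 row is mostly noise
                    [[0.9, 0.1], [0.0, 1.0]]])  # client 1: holds almost all class-1 data
counts = np.array([[9999.0, 1.0], [1.0, 99.0]]) # the two-client example from the text

def aggregate_by_client(W, counts):
    """Classic aggregation: every row of W_i weighted by the client share P_i."""
    P_N = counts.sum(axis=1) / counts.sum()
    return np.tensordot(P_N, W, axes=1)

def aggregate_by_client_class(W, counts):
    """Client-class aggregation: row k of W_i weighted by client i's share of class k."""
    share = counts / counts.sum(axis=0, keepdims=True)      # column-normalized client-class weights
    return np.einsum('ik,ikd->kd', share, W)

print(aggregate_by_client(W_local, counts)[1])        # class-1 row dominated by client 0's noise
print(aggregate_by_client_class(W_local, counts)[1])  # class-1 row dominated by client 1
```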

6 Inspiration and Discussions

As mentioned in Observation 1 in Sect. 3.5, at the end of FL training, due to the presence of the cosine factor, the distribution of the loss values loses the client-class sampling information, because the two tend to become orthogonal theoretically. If a strategy based on client-class sampling information is needed, one can avoid using differences between loss functions, or switch strategies at the end of training. It should be noted that, for cases with fewer local epochs or larger inter-class differences, it takes a long time to reach this "end of training" condition. By Theorem 3, on the client side, the features of classes with a lower probability of being sampled are more easily influenced by the features of the other classes, which spreads out the angle between them. Among classes with a large difference in sampling probability, the features of the classes with a higher sampling probability have a greater influence on the angle. We can use this to construct a strategy to adjust the MC phenomenon in FL. As mentioned in Sect. 5, based on the phenomenon of local subspace MC, the intrinsic information contained in the noise of the classifiers of local minority classes can be extracted, which should theoretically improve local training effectiveness. Based on the phenomenon of FD, as well as the analysis of its causes, we suggest using client-class sampling aggregation in FL.

7 Conclusion

In this paper, we discuss the geometric characteristics of FL both theoretically and empirically, providing fundamental insights and inspiration for future work. Theoretically, we propose an FL modeling based on client-class sampling, and a UFM based on this modeling. We introduce the theorem that the UFM model


is bounded near the minimum, allowing it to return to the FL optimization pattern based on regularization by splitting the global model into a client-sampling expectation, and to some extent improving the rationality of UFM in FL. Meanwhile, we explore the source of FL client drift. We rewrite the closed-form cosine of the feature-mean angle, decoupling the complex relationship between the angles and the class-sampling probabilities on a given client. Empirically, we find that the client-class sampling information inherent in classical FL supports our modeling; it gradually disappears between the classifiers on the client side and manifests itself in the features. We propose the general local and global geometric patterns of FL, which are respectively the local subspace MC and FD.

Acknowledgements. This work is supported by the Key Program of the National Science Foundation of China (Grant No. 61836006) from the College of Computer Science (Sichuan University) and the Engineering Research Center of Machine Learning and Industry Intelligence (Ministry of Education), Chengdu 610065, P. R. China.

References 1. Caldas, S., et al.: Leaf: A benchmark for federated settings. arXiv:1812.01097 (2018) 2. Cochran, D., Gish, H., Sinno, D.: A geometric approach to multiple-channel signal detection. IEEE Trans. Signal Process. 43(9), 2049–2057 (1995) 3. Dinh, C.T., Vu, T.T., Tran, N.H., Dao, M.N., Zhang, H.: Fedu: a unified framework for federated multi-task learning with laplacian regularization. arXiv:2102.07148 (2021) 4. Fang, C., He, H., Long, Q., Su, W.J.: Exploring deep neural networks via layerpeeled model: Minority collapse in imbalanced training. Proc. National Acad. Sci.118(43), e2103091118 (2021) 5. Gish, H., Cochran, D.: Generalized coherence (signal detection). In: ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing, pp. 2745– 2748 vol 5 (1988) 6. Hosang, J., Omran, M., Benenson, R., Schiele, B.: Taking a deeper look at pedestrians. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4073–4082. IEEE, Boston, MA, USA (2015), cIFARNET 7. Howard, S.D., Sirianunpiboon, S., Cochran, D.: Invariance of the distributions of normalized gram matrices. In: 2014 IEEE Workshop on Statistical Signal Processing (SSP), pp. 352–355 (2014) 8. Howard, S.D., Sirianunpiboon, S., Cochran, D.: The geometry of coherence and its application to cyclostationary time series. In: 2018 IEEE Statistical Signal Processing Workshop (SSP), pp. 348–352 (2018) 9. Huang, C., Xie, L., Yang, Y., Wang, W., Lin, B., Cai, D.: Neural collapse inspired federated learning with non-iid data (2023) 10. Hui, L., Belkin, M., Nakkiran, P.: Limitations of neural collapse for understanding generalization in deep learning (2022) 11. Kothapalli, V.: Neural collapse: A review on modelling principles and generalization (2023)


12. Li, G., Zhu, Z., Yang, D., Chang, L., Bai, H.: On projection matrix optimization for compressive sensing systems. IEEE Trans. Signal Process. 61(11), 2887–2898 (2013) 13. Li, T., Sahu, A.K., Zaheer, M., Sanjabi, M., Talwalkar, A., Smith, V.: Federated optimization in heterogeneous networks. Proc. Mach. Learn. Syst. 2, 429–450 (2020) 14. Li, Z., Shang, X., He, R., Lin, T., Wu, C.: No fear of classifier biases: Neural collapse inspired federated learning with synthetic and fixed classifier (2023) 15. McMahan, B., Moore, E., Ramage, D., Hampson, S., Arcas, B.A.y.: Communication-Efficient Learning of Deep Networks from Decentralized Data. In: Singh, A., Zhu, J. (eds.) Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. Proceedings of Machine Learning Research, vol. 54, pp. 1273–1282. PMLR (2017) 16. Mixon, D.G., Parshall, H., Pi, J.: Neural collapse with unconstrained features (2020) 17. Papyan, V., Han, X.Y., Donoho, D.L.: Prevalence of neural collapse during the terminal phase of deep learning training. Proc. Natl. Acad. Sci. 117(40), 24652– 24663 (2020) 18. Qu, L., et al.: Rethinking architecture design for tackling data heterogeneity in federated learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10061–10071 (2022) 19. Sastry, C.S., Oore, S.: Detecting out-of-distribution examples with Gram matrices. In: III, H.D., Singh, A. (eds.) Proceedings of the 37th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 119, pp. 8491–8501. PMLR (2020) 20. Shi, M., Zhou, Y., Ye, Q., Lv, J.: Personalized federated learning with hidden information on personalized prior (2022) 21. T Dinh, C., Tran, N., Nguyen, J.: Personalized federated learning with moreau envelopes. J. Adv. Neural Inform. Process. Syst. 33, 21394–21405 (2020) 22. T. Dinh, C., Tran, N., Nguyen, J.: Personalized federated learning with moreau envelopes. In: Advances in Neural Information Processing Systems, vol. 33, pp. 21394–21405. Curran Associates, Inc. (2020) 23. Tishby, N., Pereira, F.C., Bialek, W.: The information bottleneck method (2000) 24. Tishby, N., Zaslavsky, N.: Deep learning and the information bottleneck principle. In: 2015 IEEE Information Theory Workshop (ITW), pp. 1–5 (2015) 25. Vaswani, A., et al.: Attention is all you need. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (eds.) Advances in Neural Information Processing Systems. vol. 30. Curran Associates, Inc. (2017) 26. Zhang, C., Xie, Y., Bai, H., Yu, B., Li, W., Gao, Y.: A survey on federated learning. Knowl.-Based Syst. 216, 106775 (2021) 27. Zhu, H., Xu, J., Liu, S., Jin, Y.: Federated learning on non-iid data: a survey. Neurocomputing 465, 371–390 (2021)

Mutually Guided Dendritic Neural Models

Yanzi Feng1, Jian Wang2(B), Peng Ren1, and Sergey Ablameyko3,4

1 College of Oceanography and Space Informatics, China University of Petroleum (East China), Qingdao, China
2 College of Science, China University of Petroleum (East China), Qingdao, China
[email protected]
3 Belarusian State University, Minsk, Belarus
4 United Institute for Informatics Problems of NAS of Belarus, Minsk, Belarus

Abstract. With the explosive growth of Internet data, it has become quite easy to collect unlabeled training data in many practical machine learning applications, but it is relatively difficult to obtain a large amount of labeled training data. Based on the idea of semi-supervised learning, we propose the mutually guided dendritic neural models (MGDNM) framework, which can realize the expansion of labeled datasets. MGDNM utilizes two base classifiers that exchange data, so as to achieve complementary advantages. To simulate the problem of insufficient labeled data, we used 20% of each dataset as the training dataset. On this basis, we conducted experiments on three datasets (Iris, Breast Cancer and Glass). By calculating the accuracy and confusion matrices, the comparison shows that the classification performance of MGDNM is significantly better than that of the dendritic neural model (DNM), Support Vector Machine (SVM), Gaussian Naive Bayesian (GaussianNB) and Back Propagation Neural Network (BP). This shows that the MGDNM framework is effective and feasible.

Keywords: Mutual Guide · Dendritic Neural Model · Semi-Supervised Learning

1 Introduction

According to the learning model used, machine learning can be divided into supervised learning, semi-supervised learning and unsupervised learning [11]. Traditional supervised learning builds a model to predict new data by learning from a large amount of labeled training data [12]. Unsupervised learning discovers structure and patterns in the data by training with unlabeled data. To avoid the potentially significant resource requirements of obtaining labeled data, Gong et al. [5] proposed a novel adaptive autoencoder with redundancy control (AARC) as an unsupervised feature selector. Supervised learning and unsupervised learning have their own advantages and disadvantages: the former is more suitable for labeled data, while the latter is more suitable for unlabeled data.


Semi-supervised learning is a combination of supervised learning and unsupervised learning. It uses a small number of labeled data and a large number of unlabeled data to train. With the development of technology and the progress of the times, data in the internet is growing explosively. In many practical machine learning applications, collecting unlabeled training data has become quite easy, but obtaining a large amount of labeled training data has become relatively difficult. For instance, in medical image diagnosis and therapy, collecting and annotating a large number of medical images is a significant challenge, as it takes years to collect and requires a substantial workload and cost for radiologists to annotate [13]. Therefore, with limited labeled training data, semi-supervised learning attempts to use unlabeled data to help improve learning performance, which has attracted extensive attention and application in the past few years. At present, many semi-supervised learning methods have been applied to practice. Han et al. [6] proposed a Co-teaching network, in which two networks sample their small loss instances as useful knowledge in each small batch of data, and pass these useful instances on to their peer networks for further training. The error flow propagates in a zigzag shape, allowing for robust training of deep neural networks under noise supervision. Berthelot et al. [3] proposed MixMatch algorithm, which unifies multiple semi-supervised learning methods in a framework, and obtains more advanced results in many datasets and labeled data volumes. On the basis of MixMatch, Li et al. [9] proposed the DivideMix network, which uses a Gaussian mixture model (GMM) to split the training dataset into labeled and unlabeled sets. The trained data is not only used for another classifier, but also for the current classifier for training. Tai et al. [14] proposed a mutual guide framework, which used ELMs as the basic classifier to achieve hyperspectral image classification. From the aspect of data, it uses prior knowledge to enhance training data and uses extended sample to obtain reliable model. Liu et al. [10] concentrated on distributed classification with positive and unlabeled data, and introduce a distributed positive and unlabeled learning (PU learning) algorithm based on Semi-Supervised Support Vector Machine. Wang et al. [17] proposed the Ada-FedSemi system, which enhances the performance of DL models by utilizing on-device labeled data and in-cloud unlabeled data, based on the federated semi-supervised learning (FSSL) technology. Adedigba et al. [1] proposed data augmentation methods for mammograms, as there is a restricted quantity of medical images available, and a class imbalance problem in the data. Zhang et al. [18] proposed a new hyperspectral image classification method called DPCMF to address the issue of limited training samples. It utilizes dense pyramidal convolution and multi-feature fusion to extract and exploit spatial and spectral information, achieving significant improvements in classification accuracy compared to other methods. In this paper, we develop the mutually guided dendritic neural models (MGDNM) framework aimed at solving the problem of insufficient labeled samples. MGDNM uses two different dendritic neural models (DNM) to train. Each classifier is initialized with different parameters, and then iterate through the


exchange of newly labeled data between two classifiers to obtain the final model. We conducted experiments on three universal datasets using MGDNM framework and compared it with some classic algorithms. MGDNM is superior to classical algorithms in performance.

2 Methodology

2.1 Dendritic Neural Model

The Dendritic Neural Model (DNM) is a new type of single-neuron model that incorporates plastic dendritic morphology, inspired by the biological behavior of neurons in vivo [8]. The neural structure of the DNM consists of four layers: the synaptic layer, the dendritic layer, the membrane layer, and the cell body. Each layer has a different excitation function corresponding to a different neural function. Theoretical and empirical evidence shows that the DNM can achieve satisfactory performance in various practical applications, such as computer-aided medical diagnosis [7], business risk [15], financial time-series prediction [16], and so on.
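As a rough, hedged sketch of this four-layer structure, the code below uses a commonly cited formulation (sigmoid synapses, multiplicative dendritic branches, a summation membrane, and a sigmoid soma); the parameter values and the exact excitation functions in the works cited above may differ.

```python
import numpy as np

def dnm_forward(x, w, theta, k=5.0, k_soma=5.0, theta_soma=0.5):
    """One dendritic neuron: x has I inputs, w and theta are (I, J) for J dendritic branches."""
    synapse = 1.0 / (1.0 + np.exp(-k * (w * x[:, None] - theta)))    # synaptic layer, shape (I, J)
    dendrite = synapse.prod(axis=0)                                  # dendritic layer: product over inputs
    membrane = dendrite.sum()                                        # membrane layer: sum over branches
    return 1.0 / (1.0 + np.exp(-k_soma * (membrane - theta_soma)))   # cell body: output in (0, 1)

rng = np.random.default_rng(0)
x = rng.random(4)                                 # 4 input features
w, theta = rng.normal(size=(4, 3)), rng.normal(size=(4, 3))
print(dnm_forward(x, w, theta))
```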

2.2 Mutually Guided Dendritic Neural Models

We develop a mutually guided dendritic neural models (MGDNM) framework for classification, as shown in Fig. 1. First, we feed the labeled training data into two base classifiers with different initialization parameters, enabling them to learn different aspects of the training data. Then the unlabeled data is predicted by each classifier, and each classifier selects a certain number of high-confidence samples for labeling. The two classifiers exchange their selected high-confidence data; that is, one classifier puts its high-confidence data and labels into the training set of the other. As the number of iterations increases, the labeled training set is gradually expanded while the unlabeled data gradually shrinks, continuously improving the training effectiveness of the base classifiers. The mutual guidance between the two base classifiers terminates when the amount of guidance data is large enough to provide effective training.

Fig. 1. Mutually guided dendritic neural models framework


Let $L$ denote the labeled training dataset, $U$ the unlabeled dataset, and $Y_L$ the corresponding labels of $L$. Let $D_1$ and $D_2$ denote two base classifiers with different initialization parameters $\alpha_1$ and $\alpha_2$. The classification results of the two classifiers on the unlabeled data are as follows:
$$Y_1 = D_1(U_1, \alpha_1, \beta_1), \qquad (1)$$
$$Y_2 = D_2(U_2, \alpha_2, \beta_2), \qquad (2)$$
where $\beta_1$ and $\beta_2$ represent the training parameters of the base classifiers $D_1$ and $D_2$, respectively. At the $k$th iteration, training proceeds as follows:
$$\hat\beta_1^{(k)} = \arg\min_{\beta_1} o\big(Y_{L_1}^{(k-1)}, D_1(L_1^{(k-1)}, \alpha_1, \beta_1)\big), \qquad (3)$$
$$\hat\beta_2^{(k)} = \arg\min_{\beta_2} o\big(Y_{L_2}^{(k-1)}, D_2(L_2^{(k-1)}, \alpha_2, \beta_2)\big), \qquad (4)$$
where $\hat\beta_1^{(k)}$ and $\hat\beta_2^{(k)}$ are the optimal model parameters obtained during training, and $o(\cdot)$ is the error function between the classification result and the correct result. The unlabeled data is then classified according to the current model parameters $\hat\beta_1^{(k)}$ and $\hat\beta_2^{(k)}$. The results are shown in Eqs. (5) and (6):
$$Y_{1H}^{(k)} = \gamma\big(D_1(U_1^{(k)}, \alpha_1, \hat\beta_1^{(k)})\big), \qquad (5)$$
$$Y_{2H}^{(k)} = \gamma\big(D_2(U_2^{(k)}, \alpha_2, \hat\beta_2^{(k)})\big), \qquad (6)$$
where $\gamma(\cdot)$ is the sorting function, which keeps the data within the top $\gamma$ percent of confidence, and $Y_{1H}$ and $Y_{2H}$ are the classification results with high confidence. Let $X_{1H}$ and $X_{2H}$ denote the data in $U$ corresponding to $Y_{1H}$ and $Y_{2H}$, respectively. Finally, $X_{2H}$ is taken out of $U_2$ and put into the labeled dataset of classifier $D_1$, so the labeled training dataset of classifier $D_1$ is updated. Similarly, $X_{1H}$ is taken out of $U_1$ and put into the labeled dataset of classifier $D_2$, so the labeled training dataset of classifier $D_2$ is updated. The details are as follows:
$$L_1^{(k)} = L_1^{(k-1)} \cup X_{2H}^{(k)}, \qquad (7)$$
$$L_2^{(k)} = L_2^{(k-1)} \cup X_{1H}^{(k)}, \qquad (8)$$
$$Y_{L_1}^{(k)} = Y_{L_1}^{(k-1)} \cup Y_{2H}^{(k)}, \qquad (9)$$
$$Y_{L_2}^{(k)} = Y_{L_2}^{(k-1)} \cup Y_{1H}^{(k)}. \qquad (10)$$
The above completes one expansion of the labeled training dataset. In this way, the two classifiers are trained with the expanded dataset, and the mutual guidance stops when the labeled training dataset has been expanded sufficiently or the number of iterations reaches the specified number of rounds.
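As a rough illustration of this mutual-guidance loop (Eqs. (1)–(10)), the sketch below exchanges high-confidence pseudo-labels between two generic probabilistic classifiers; using scikit-learn's MLPClassifier in place of the two DNM base classifiers, and the specific hidden sizes and confidence ratio, are our own simplifications.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

def select_confident(clf, U, ratio):
    """Indices and pseudo-labels of the top-`ratio` most confident unlabeled samples."""
    proba = clf.predict_proba(U)
    n = max(1, int(ratio * len(U)))
    idx = np.argsort(proba.max(axis=1))[-n:]
    return idx, clf.classes_[proba[idx].argmax(axis=1)]

def mutual_guide(L, y, U, n_rounds=5, ratio=0.1):
    L1, y1, L2, y2 = L.copy(), y.copy(), L.copy(), y.copy()
    U1, U2 = U.copy(), U.copy()
    for _ in range(n_rounds):
        D1 = MLPClassifier(hidden_layer_sizes=(16,), max_iter=1000, random_state=1).fit(L1, y1)
        D2 = MLPClassifier(hidden_layer_sizes=(32,), max_iter=1000, random_state=2).fit(L2, y2)
        if len(U1) == 0 or len(U2) == 0:
            break
        idx1, yh1 = select_confident(D1, U1, ratio)   # X_1H, Y_1H: passed from D1 to D2
        idx2, yh2 = select_confident(D2, U2, ratio)   # X_2H, Y_2H: passed from D2 to D1
        L2, y2 = np.vstack([L2, U1[idx1]]), np.concatenate([y2, yh1])
        L1, y1 = np.vstack([L1, U2[idx2]]), np.concatenate([y1, yh2])
        U1, U2 = np.delete(U1, idx1, axis=0), np.delete(U2, idx2, axis=0)
    return D1, D2
```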

3 Experiments Setup

3.1 Datasets

In order to verify the effectiveness of MGDNM, we conducted experiments on three datasets: Iris, Breast Cancer and Glass. Table 1 summarizes the information on these datasets.

Table 1. Summary of the three classification datasets

Dataset        Classes  Features  Dataset Size
Iris           3        4         150
Breast Cancer  2        9         699
Glass          2        9         214

3.2 Evaluation Metrics

To evaluate the performance of our model, we used two metrics: accuracy and the confusion matrix. In machine learning, accuracy is often used to evaluate the performance of a model. It is calculated by dividing the number of correct predictions by the total number of predictions, so it measures how close the predicted values are to the actual values, i.e., how well the model can correctly predict the outcome of a given task. The confusion matrix is a popular tool in machine learning for evaluating the performance of a classification model. It summarizes the performance of a model by comparing its predicted outputs with the actual outputs. The confusion matrix is a square matrix whose number of rows and columns equals the number of classes in the dataset. It contains four categories: true positives (TP), the number of correctly predicted positive instances; false positives (FP), the number of incorrectly predicted positive instances; true negatives (TN), the number of correctly predicted negative instances; and false negatives (FN), the number of incorrectly predicted negative instances. In addition to evaluating the performance of a classification model, the confusion matrix can also be used to identify areas where the model needs improvement.
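For reference, both metrics can be computed directly with scikit-learn; the label arrays below are made-up placeholders.

```python
from sklearn.metrics import accuracy_score, confusion_matrix

y_true = [1, 0, 1, 1, 0, 1]   # hypothetical ground-truth labels
y_pred = [1, 0, 0, 1, 0, 1]   # hypothetical model predictions

print(accuracy_score(y_true, y_pred))    # fraction of correct predictions
print(confusion_matrix(y_true, y_pred))  # rows: true class, columns: predicted class
# For the binary case with labels {0, 1} the layout is [[TN, FP], [FN, TP]].
```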

3.3 Experimental Design

According to the principle of the MGDNM framework, the designed experiment is shown in Fig. 2.


Fig. 2. Experimental flow chart

To demonstrate the performance of MGDNM, we compare it with the DNMs (the two base classifiers DNM1 and DNM2), Support Vector Machine (SVM) [4], Gaussian Naive Bayesian (GaussianNB) and Back Propagation Neural Network (BP) [2]. These algorithms use the same training set, and their parameters are tuned to achieve their best results for comparison with MGDNM. Each dataset is randomly divided into three parts: an unlabeled dataset, a test dataset, and a labeled training dataset. For each dataset, 20% of the data is used as the training dataset, 25% as the test dataset, and 55% as the unlabeled dataset. The performance of each method is measured by accuracy and confusion matrices.
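A minimal sketch of the 20%/25%/55% split described above, using NumPy; the proportions follow the text, while the function name and shuffling seed are arbitrary choices of ours.

```python
import numpy as np

def split_dataset(X, y, seed=0):
    """Randomly split into 20% labeled train, 25% test, 55% unlabeled."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    n_train, n_test = int(0.20 * len(X)), int(0.25 * len(X))
    tr, te, un = np.split(idx, [n_train, n_train + n_test])
    return (X[tr], y[tr]), (X[te], y[te]), X[un]
```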

4 Results

4.1 Iris Dataset

For the Iris dataset, we split the data into two categories, resulting in a binary classification problem of Setosa versus Non-Setosa. The evaluation metrics obtained by the six algorithms on the Iris dataset are shown in Table 2. Despite the small training dataset, MGDNM still shows high accuracy on the Iris dataset. Compared with the DNMs, MGDNM improves the accuracy by nearly 20%. Moreover, the accuracy of MGDNM is also higher than that of the other three algorithms, SVM, GaussianNB and BP.

Table 2. Comparison of different methods on the Iris dataset

                       MGDNM   DNM1   DNM2   SVM    GaussianNB  BP
training accuracy (%)  100.00  70.00  70.00  66.67  93.33       66.67
test accuracy (%)      92.50   72.50  77.50  70.00  82.50       70.00


Observing both Table 2 and Fig. 3, it can be seen that MGDNM corrects some of the instances misclassified by the DNMs. As shown in Fig. 3, there are no false negatives in MGDNM, but 9 false negatives in DNM1 and 3 in DNM2. Correspondingly, the number of true positives in MGDNM increases compared to DNM1 and DNM2. This validates the improvement of MGDNM in the classification of positive instances. Similarly, MGDNM also improves over DNM1 and DNM2 in negative-instance classification. However, MGDNM has only one more true negative and one fewer false positive than DNM1, which suggests that the learning ability of MGDNM on negative instances is still limited. In addition, compared with the other three algorithms, SVM, GaussianNB and BP, MGDNM shows higher classification ability, which is consistent with the results in Table 2.

Fig. 3. Confusion matrix of different methods on Iris dataset

4.2 Breast Cancer Dataset

For the Breast Cancer dataset, the data is divided into benign and malignant types. The evaluation metrics obtained by the six algorithms on the Breast Cancer dataset are shown in Table 3. Although the training dataset is relatively small, each algorithm still achieves good classification results. Compared with the DNMs, MGDNM improves the accuracy by nearly 2%. Moreover, the accuracy of MGDNM is also higher than that of the other three algorithms, SVM, GaussianNB and BP.

Table 3. Comparison of different methods on the Breast Cancer dataset

                       MGDNM  DNM1   DNM2   SVM    GaussianNB  BP
training accuracy (%)  97.79  95.59  94.85  88.24  96.32       92.65
test accuracy (%)      96.17  93.99  92.90  86.34  93.44       95.08

Observing both Table 3 and Fig. 4, it can be seen that MGDNM corrects some of the instances misclassified by the DNMs. As shown in Fig. 4, there are 4 false negatives in MGDNM, but 8 false negatives in DNM1 and 10 in DNM2. Correspondingly, the number of true positives in MGDNM increases compared to DNM1 and DNM2. This validates the improvement of MGDNM in the classification of positive instances. However, MGDNM has the same number of false positives and true negatives as DNM1 and DNM2, which indicates that MGDNM does not improve over the two base classifiers in negative-instance classification. In general, the accuracy of MGDNM is improved. In addition, compared with the other three algorithms, SVM, GaussianNB and BP, MGDNM shows higher classification ability, which is consistent with the results in Table 3.

Fig. 4. Confusion matrix of different methods on Breast Cancer dataset

4.3 Glass Dataset

For the Glass dataset, we utilize nine characteristics to determine whether a sample is window glass or non-window glass. The evaluation metrics obtained by the six algorithms on the Glass dataset are shown in Table 4. Although the training dataset is relatively small, each algorithm still achieves good classification results. Compared with the DNMs, MGDNM improves the accuracy by nearly 2%. Moreover, the accuracy of MGDNM is also higher than that of the other three algorithms, SVM, GaussianNB and BP. Meanwhile, the training accuracy of BP is as high as that of MGDNM.

Table 4. Comparison of different methods on the Glass dataset

                       MGDNM   DNM1   DNM2   SVM    GaussianNB  BP
training accuracy (%)  100.00  97.62  97.62  92.86  90.48       100.00
test accuracy (%)      93.10   91.38  91.38  79.31  89.66       91.38

Fig. 5. Confusion matrix of different methods on Glass dataset

Observing both Table 4 and Fig. 5, it can be seen that MGDNM corrects some of the instances misclassified by the DNMs. As shown in Fig. 5, there are no false positives in MGDNM, but 1 false positive in DNM1 and 1 in DNM2. Correspondingly, the number of true negatives in MGDNM increases compared to DNM1 and DNM2. This validates the improvement of MGDNM in the classification of negative instances. However, MGDNM has the same number of false


negatives and true positives as DNM1 and DNM2, which indicates that MGDNM does not improve over the two base classifiers in positive-instance classification. In addition, compared with the other three algorithms, SVM, GaussianNB and BP, MGDNM shows higher classification ability, which is consistent with the results in Table 4. There are 40 true positives in MGDNM, but 41 in GaussianNB and 43 in BP, which indicates that MGDNM is still somewhat deficient in classifying positive instances. Of course, it can also be seen from Fig. 5 that MGDNM has an advantage in negative-instance classification. In general, the test accuracy of MGDNM is higher than that of the other algorithms, DNM, SVM, GaussianNB and BP.

5 Conclusion

In this paper, we proposed the MGDNM framework, which is based on two DNM base classifiers with different initialization parameters. The two base classifiers exchange newly labeled data and labels, and the training dataset is extended over many iterations, thereby improving classification performance. The MGDNM framework exploits the complementary advantages of the two base classifiers to improve classification ability. In our experiments, 20% of each dataset was selected as the training dataset. On this basis, MGDNM improved over DNM, SVM, GaussianNB and BP by 2% to 20%, which shows that MGDNM is effective and feasible. At the same time, the confusion matrices show that the classification ability of MGDNM has improved in several respects. Future work will mainly apply the MGDNM framework to more datasets, such as remote sensing images and other real-world scenarios. Meanwhile, MGDNM should be improved to enhance its data-exchange ability, and the two base classifiers can be replaced with other classifiers to verify the universality of the mutual-guide framework.

Acknowledgments. This work was supported in part by the National Key R&D Program of China under Grant 2019YFA0708700; in part by the National Natural Science Foundation of China under Grant 62173345; in part by the Fundamental Research Funds for the Central Universities under Grant 22CX03002A; in part by the China-CEEC Higher Education Institutions Consortium Program under Grant 2022151; in part by the Introduction Plan for High Talent Foreign Experts under Grant G2023152012L; and in part by the "The Belt and Road" Innovative Talents Exchange Foreign Experts Project under Grant DL2023152001L.

References 1. Adedigba, A.P., Adeshina, S.A., Aibinu, A.M.: Performance evaluation of deep learning models on mammogram classification using small dataset. Bioengineering 9(4), 161 (2022)


2. Bansal, A., Singhrova, A.: Performance analysis of supervised machine learning algorithms for diabetes and breast cancer dataset. In: 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), pp. 137–143. IEEE (2021) 3. Berthelot, D., Carlini, N., Goodfellow, I., Papernot, N., Oliver, A., Raffel, C.A.: Mixmatch: a holistic approach to semi-supervised learning. In: Advances in Neural Information Processing Systems 32 (2019) 4. Chandra, M.A., Bedi, S.: Survey on SVM and their application in image classification. Int. J. Inf. Technol. 13, 1–11 (2021) 5. Gong, X., Yu, L., Wang, J., Zhang, K., Bai, X., Pal, N.R.: Unsupervised feature selection via adaptive autoencoder with redundancy control. Neural Netw. 150, 87–101 (2022) 6. Han, B., et al.: Co-teaching: Robust training of deep neural networks with extremely noisy labels. In: Advances in Neural Information Processing Systems 31 (2018) 7. Ji, J., Dong, M., Lin, Q., Tan, K.C.: Noninvasive cuffless blood pressure estimation with dendritic neural regression. IEEE Trans. Cybern. 53(7), 4162–4174 (2023). https://doi.org/10.1109/TCYB.2022.3141380 8. Ji, J., Gao, S., Cheng, J., Tang, Z., Todo, Y.: An approximate logic neuron model with a dendritic structure. Neurocomputing 173, 1775–1783 (2016) 9. Li, J., Socher, R., Hoi, S.C.: Dividemix: learning with noisy labels as semisupervised learning. arXiv:2002.07394 (2020) 10. Liu, Y., Zhang, C.: Distributed semi-supervised learning with positive and unlabeled data. In: 2022 3rd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE), pp. 488–492. IEEE (2022) 11. Ning, X., et al.: A review of research on co-training. Concurrency and computation: practice and experience p. e6276 (2021) 12. Saravanan, R., Sujatha, P.: A state of art techniques on machine learning algorithms: A perspective of supervised learning approaches in data classification. In: 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), pp. 945–949 (2018). https://doi.org/10.1109/ICCONS.2018. 8663155 13. Suzuki, K.: Small data deep learning for lung cancer detection in ct. In: 2022 IEEE Eighth International Conference on Big Data Computing Service and Applications (BigDataService), pp. 114–118 (2022). https://doi.org/10.1109/ BigDataService55688.2022.00025 14. Tai, X., Li, M., Xiang, M., Ren, P.: A mutual guide framework for training hyperspectral image classifiers with small data. IEEE Trans. Geosci. Remote Sens. 60, 1–17 (2021) 15. Tang, Y., Ji, J., Zhu, Y., Gao, S., Tang, Z., Todo, Y., et al.: A differential evolutionoriented pruning neural network model for bankruptcy prediction. Complexity 2019 (2019) 16. Tang, Y., Song, Z., Zhu, Y., Hou, M., Tang, C., Ji, J.: Adopting a dendritic neural model for predicting stock price index movement. Expert Syst. Appl. 205, 117637 (2022) 17. Wang, L., Xu, Y., Xu, H., Liu, J., Wang, Z., Huang, L.: Enhancing federated learning with in-cloud unlabeled data. In: 2022 IEEE 38th International Conference on Data Engineering (ICDE), pp. 136–149. IEEE (2022) 18. Zhang, J., et al.: Hyperspectral image classification based on dense pyramidal convolution and multi-feature fusion. Remote Sens. 15(12), 2990 (2023)

Social-CVAE: Pedestrian Trajectory Prediction Using Conditional Variational Auto-Encoder

Baowen Xu1,2, Xuelei Wang1(B), Shuo Li1,2, Jingwei Li1,2, and Chengbao Liu1

1 Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
[email protected]
2 School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China

Keywords: Pedestrian trajectory prediction Auto-Encoder · Attention · RWKV · RNN

· Conditional Variational

This work is supported by National Key Research and Development Program of China [Grant 2022YFB3305401] and the National Nature Science Foundation of China [Grant 62003344]. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1962, pp. 476–489, 2024. https://doi.org/10.1007/978-981-99-8132-8_36

1 Introduction

Recently, pedestrian trajectory prediction has attracted widespread attention in various fields, including autonomous driving vehicles, service robots, intelligent transportation, and smart cities. By accurately predicting the positions of surrounding pedestrians, autonomous driving vehicles can avoid traffic accidents. Robots with social awareness need to predict human trajectories in order to better plan their cruising paths and provide more efficient services. Intelligent tracking and surveillance systems used in urban planning must understand how crowds interact in order to better manage city infrastructure. However, due to the complexity and uncertainty of human decision-making processes, accurately predicting the future trajectory of pedestrians still faces significant challenges. The trajectory prediction task can be divided into Deterministic Trajectory Prediction (DTP) task and Multimodal Trajectory Prediction (MTP) task. Deterministic Trajectory Prediction refers to the model providing only one deterministic prediction for each agent. However, human behavior is naturally multimodal and uncertain, meaning that given past trajectories and surrounding environmental information, an agent can have multiple seemingly reasonable trajectories in the future. Multimodal Trajectory Prediction requires the model to provide multiple acceptable solutions, which can more realistically simulate human behavior. The recently popular multimodal pedestrian trajectory prediction frameworks are noise-based models or deep generative models. Specifically, these models inject random noise into the model to generate multiple plausible paths. Noise-based trajectory prediction models can be categorized into frameworks such as Generative Adversarial Networks (GAN) [1], Conditional Variational Autoencoders (CVAE) [2–4], Normalizing Flows (NF) [5], and Denoising Diffusion Probabilistic Models (DDPM) [6]. The model we propose adopts Conditional Variational Autoencoder architecture and introduces a Social Conditional Variational Autoencoder (Social-CVAE) for multimodal pedestrian trajectory prediction. Specifically, Social-CVAE utilizes a conditional variational autoencoder architecture to learn the distribution of future trajectories for agents, conditioned on observed past trajectories through random latent variables. The encoder models the channel and time dimensions of the past trajectories successively to obtain latent variables. Subsequently, the endpoints of multi-modal trajectories are estimated. Finally, the multi-modal trajectories are predicted by combining the hidden states of forward propagation from the current position and the hidden states of backward propagation from the estimated trajectory endpoints. In summary, the contributions of this work are as follows, – We propose a novel framework based on Conditional Variational Autoencoder (CVAE) for pedestrian multi-modal trajectory prediction. This framework learns the distribution of future trajectories of the agent by utilizing random latent variables conditioned on observed past trajectories. Extensive experiments on the ETH/UCY benchmarks demonstrate state-of-the-art performance.


– In the encoder, channel attention and self-attention are introduced to model the channel dimension and the time dimension, respectively, to dynamically extract spatio-temporal features from observable past trajectories. A bidirectional decoder is designed, which first predicts the multi-modal trajectory target points. Then, the estimated trajectory target points are used as a condition for the backward decoder, while the current position serves as the starting point for the forward decoder. The gate recurrent unit (GRU) and Receptance Weighted Key Value (RWKV) are employed as the basic modules for the backward and forward decoders, respectively.

2 Related Work

2.1 Research on Social Interaction Model

In the early days, social forces [7] were used to model interactions between individuals, expressing the interactions through forces (repulsion or attraction). However, the social force model fails to accurately capture interactions because it relies on manually crafted features. Social-LSTM [8] represents the first application of recurrent neural networks to simulating interactions between individuals in crowded scenes, modeling the social interactions and potential conflicts among pedestrians. Each trajectory is modeled as an LSTM layer, and different LSTM layers share information through a social pooling layer to generate conflict-free trajectories. To address the deficiency of Social-LSTM in neglecting the currently important intentions of neighbors, SR-LSTM [9] reevaluates the contributions of adjacent pedestrians to the target through a social-perception information selection mechanism. Social-BiGAT [10] formulates pedestrian social interactions as a graph and assigns higher edge weights to more significant interactions. NSP [11] achieves multimodal trajectory prediction by incorporating an explicit physical model with learnable parameters within a deep neural network. In contrast to social force models and their variants, the key parameters of NSP's deterministic model can be learned from data instead of being manually selected and fixed.

2.2 Research on Multimodal Pedestrian Trajectory Prediction

Human behavior is inherently multimodal and uncertain, meaning that given past trajectories and surrounding environmental information, an agent can have multiple seemingly reasonable future trajectories. Multimodal trajectory prediction requires models to provide multiple acceptable solutions, which can more realistically simulate human behavior. Recently popular multimodal pedestrian trajectory prediction frameworks are based on noise models or deep generative models, specifically injecting random noise into the model to generate multiple plausible paths. Generative models-based prediction methods utilize core architectures such as Generative Adversarial Networks (GANs), Conditional Variational Autoencoders (CVAEs), and Denoising Diffusion Probabilistic Models (DDPM). GANbased methods utilize random noise as input to the model to generate stochastic


Fig. 1. Overview of Model Architecture.

results. Social-GAN [1], SoPhie [12], and Social-Ways [13] employ GAN architectures. Regarding social interactions, Social-GAN proposes a pooling network to aggregate the states of neighbors, while Social-Ways utilizes an attention mechanism to aggregate the states of neighbors. PECNet [14], Trajectron++ [15], Social-VAE [2], and BiTrap [4] use the CVAE architecture to directly predict trajectory distributions. MID [6] achieves multimodal trajectory prediction by employing a diffusion model that gradually reduces the prediction uncertainty through learning the reversal from deterministic ground truth trajectories to stochastic Gaussian noise.

3 Method

In this section, we first briefly introduce the data preparation for pedestrian trajectory prediction; then, the proposed model architecture and loss function are introduced in detail.

3.1 Data Preparation

The position of agent $i$ at timestamp $t$ is represented by $p_i^{(t)} = (x, y) \in \mathbb{R}^2$, where $x$ and $y$ are 2D spatial coordinates. The trajectory of agent $i$ from timestamp 1 to $T$ is defined as $P_i^{1:T} = \{p_i^{(1)}, p_i^{(2)}, \ldots, p_i^{(T)}\} \in \mathbb{R}^{T\times 2}$. The prediction task in this paper is to infer the future trajectory distribution of each agent in the scene based on the agent's past trajectory. Specifically, based on the joint observations of $T$ frames in the entire scene, inference is performed on the distribution of the future positions $Y_t = \{p^{(T+1)}, p^{(T+2)}, \ldots, p^{(T+T')}\}$ over $T'$ frames. This can be represented as the predictive distribution $p(\hat Y_t | X_t)$, where $X_t^i = \{p_i^{(t)},\ d_i^{(t)} = p_i^{(t)} - p_i^{(t-1)},\ a_i^{(t)} = d_i^{(t)} - d_i^{(t-1)}\} \in \mathbb{R}^6$ includes the position, velocity, and acceleration of agent $i$.
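A small sketch of this preprocessing step, assuming trajectories are stored as a NumPy array of 2D positions; padding the first difference with the first value is our own choice.

```python
import numpy as np

def make_state(positions):
    """positions: (T, 2) array of (x, y). Returns (T, 6) rows of [position, velocity, acceleration]."""
    d = np.diff(positions, axis=0, prepend=positions[:1])   # velocity d_t = p_t - p_{t-1}
    a = np.diff(d, axis=0, prepend=d[:1])                   # acceleration a_t = d_t - d_{t-1}
    return np.concatenate([positions, d, a], axis=1)

traj = np.array([[0.0, 0.0], [0.5, 0.1], [1.1, 0.3], [1.8, 0.6]])
print(make_state(traj).shape)   # (4, 6)
```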

Fig. 2. Description of the attention used.

Fig. 3. Description of RWKV and GRU.

3.2 Overall Model Architecture

To generate random predictions, we employ a variational auto-encoder architecture that achieves multimodal trajectory prediction by introducing random latent variables. Given the past trajectory information $X_t$ of the agent, we first estimate the target endpoint $\hat Y_t^{goal}$ and then predict the future trajectory $\hat Y_t$ of the agent. The proposed CVAE-based model is divided into a generative model for generating predicted trajectories and an inference model for posterior approximation. The detailed descriptions of the inference model and the generative model are presented in Sects. 3.3 and 3.4, respectively, and the overall architecture of the proposed model is illustrated in Fig. 1.

3.3 Inference Model

Let $Z$ represent the latent variable, and let $X_t$ and $Y_t$ represent the agent's historical trajectory and future trajectory, respectively. The inference model approximating the posterior $q$ on the latent variable can be described as follows. First, the feature encodings of the agent's historical and future trajectories are
$$H_x = \psi_x(X_t), \qquad (1)$$
$$H_y = \psi_y(Y_t), \qquad (2)$$
where $\psi_x$ and $\psi_y$ represent the networks that encode the historical and future trajectories, respectively. The posterior feature can be represented as
$$H_{xy} = \psi_{xy}(H_x, H_y), \qquad (3)$$
where $\psi_{xy}$ is an embedding neural network. The distribution followed by the posterior latent variable can be represented as
$$p(Z|X_t, Y_t) := q_\phi(Z|H_{xy}), \qquad (4)$$
where $\phi$ represents the parameters of the posterior neural network. The posterior can be obtained by sampling $Z \sim q_\phi(Z|H_{xy})$.

3.4 Generative Model

The generative model based on the CVAE architecture can be represented as

$$p(\hat{Y}_t \mid X_t) = \int_Z p(\hat{Y}_t \mid X_t, Z)\, p(Z \mid X_t)\, dZ = \int_Z p(\hat{Y}_t \mid X_t, \hat{Y}_t^{goal}, Z)\, p(\hat{Y}_t^{goal} \mid X_t, Z)\, p(Z \mid X_t)\, dZ. \tag{5}$$

The encoding $H_x$ of the agent's history $X_t$ is given by Eq. (1). The third term in the integral of Eq. (5) can be described as

$$p(Z \mid X_t) := p_\theta(Z \mid H_x), \tag{6}$$

where $\theta$ represents the parameters of the prior neural network. The prior can be obtained by sampling from $Z \sim p_\theta(Z \mid H_x)$. The second term in the integral of Eq. (5) can be described as

$$H_{xz} = \mathrm{Concat}(H_x, Z), \qquad p(\hat{Y}_t^{goal} \mid X_t, Z) := p_\epsilon(\hat{Y}_t^{goal} \mid H_{xz}), \tag{7}$$

where $\epsilon$ represents the network parameters of the trajectory endpoint decoder, and $\mathrm{Concat}(\cdot, \cdot)$ represents the concatenation operation. The first term in the integral of Eq. (5) implies sampling the future trajectory from the hidden features $H_{xz}$ and the estimated target endpoint $\hat{Y}_t^{goal}$:

$$\hat{Y}_t \sim p_\xi(\cdot \mid H_{xz}, \hat{Y}_t^{goal}), \tag{8}$$

where $\xi$ represents the decoder network parameters.

3.5 Observation Encoding

Trajectory data are inherently spatio-temporal. Therefore, we model the agent's trajectory separately in the channel dimension and the temporal dimension. Specifically, the feature extraction module includes channel interaction and temporal interaction. Channel interaction is composed of feed-forward neural networks and channel attention. Given the agent's history $X_t \in \mathbb{R}^{T \times 6}$, the channel interaction $X_{channel} \in \mathbb{R}^{T \times d_{model}}$ can be described as follows:

$$X_{channel} = X_t W_{channel} + b_{channel}, \qquad X_{channel} = f_{channel}(X_{channel}), \tag{9}$$

where $W_{channel} \in \mathbb{R}^{6 \times d_{model}}$ and $b_{channel} \in \mathbb{R}^{d_{model}}$ represent the parameters of the feed-forward neural network. $f_{channel}(\cdot)$ is the channel attention shown in Fig. 2(a), and the channel attention [16] can be expressed as follows:

$$s = \sigma\big(W_2\, \delta(W_1\, \mathrm{GAP}(X_{channel}))\big), \qquad X_{channel} = s \cdot X_{channel}, \tag{10}$$

where $\sigma$ and $\delta$ respectively represent the Sigmoid and ReLU activations, $W_1$ and $W_2$ represent fully-connected layers, and $\mathrm{GAP}(\cdot)$ represents global average pooling. The temporal interaction is composed of multi-head attention, as shown in Fig. 2(b). Given the output $X_{channel}$ of the channel interaction, the output $X_{temporal}$ of the temporal interaction is computed as follows. First, we follow the strategy of Fourier-feature positional encoding to preserve the temporal information of the historical positions:

$$PE(pos, 2i) = \sin\!\left(pos \cdot e^{-\frac{4i}{d_{model}}}\right), \qquad PE(pos, 2i+1) = \cos\!\left(pos \cdot e^{-\frac{4i}{d_{model}}}\right), \tag{11}$$

where $d_{model}$ represents the model embedding dimension, $i \in [0, d_{model})$ specifies the position in the model embedding dimension, and $pos \in [0, T)$ specifies the position in time. The output $X_{att} \in \mathbb{R}^{T \times d_{model}}$ of the multi-head attention can be described as follows:

$$X_{att} = X_{channel} + PE,$$
$$Q_i = X_{att} W_i^Q, \quad K_i = X_{att} W_i^K, \quad V_i = X_{att} W_i^V, \quad h_i = \mathrm{Attention}(Q_i, K_i, V_i), \tag{12}$$
$$X_{att} = \mathrm{Concat}[h_1, h_2, \ldots, h_i]\, W^o + X_{att},$$

where $W_i^Q$, $W_i^K$, $W_i^V$, and $W^o$ represent the embedding parameter matrices. The self-attention can be described as follows:

$$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{QK^T}{\sqrt{d}}\right)V, \tag{13}$$

where $Q$, $K$, and $V$ represent the query, key, and value, respectively, and $d$ represents the embedding dimension. Finally, reshaping $X_{att}$ from $\mathbb{R}^{T \times d_{model}}$ into $\mathbb{R}^{T \cdot d_{model}}$, we obtain the output $X_{temporal} \in \mathbb{R}^{d_{model}}$ of the temporal interaction through a feed-forward neural network:

$$X_{temporal} = X_{temporal} W_{temporal} + b_{temporal}, \tag{14}$$

where $W_{temporal} \in \mathbb{R}^{T \cdot d_{model} \times d_{model}}$ and $b_{temporal} \in \mathbb{R}^{d_{model}}$ represent the parameters of the feed-forward neural network.
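As a rough PyTorch sketch of the observation encoder in Eqs. (9)–(14) — layer names and the choice of global average pooling over the time dimension are assumptions for illustration, not the authors' released code — the channel interaction applies a linear projection followed by squeeze-and-excitation-style channel attention [16], and the temporal interaction adds the positional encoding of Eq. (11) before residual multi-head self-attention and a final flattening projection.

```python
import torch
import torch.nn as nn

class ObservationEncoder(nn.Module):
    def __init__(self, in_dim=6, d_model=256, T_obs=8, n_heads=4, reduction=4):
        super().__init__()
        self.proj = nn.Linear(in_dim, d_model)                    # Eq. (9) feed-forward
        self.se = nn.Sequential(                                  # Eq. (10) channel attention
            nn.Linear(d_model, d_model // reduction), nn.ReLU(),
            nn.Linear(d_model // reduction, d_model), nn.Sigmoid())
        self.mha = nn.MultiheadAttention(d_model, n_heads, batch_first=True)  # Eq. (12)
        self.out = nn.Linear(T_obs * d_model, d_model)            # Eq. (14)
        # Fourier-feature positional encoding, Eq. (11)
        pos = torch.arange(T_obs).unsqueeze(1).float()
        i = torch.arange(0, d_model, 2).float()
        pe = torch.zeros(T_obs, d_model)
        pe[:, 0::2] = torch.sin(pos * torch.exp(-4 * i / d_model))
        pe[:, 1::2] = torch.cos(pos * torch.exp(-4 * i / d_model))
        self.register_buffer("pe", pe)

    def forward(self, x):                                         # x: (B, T_obs, 6)
        x = self.proj(x)                                          # (B, T, d_model)
        s = self.se(x.mean(dim=1))                                # GAP over time, then gate
        x = x * s.unsqueeze(1)                                    # channel re-weighting
        x_att = x + self.pe                                       # add positional encoding
        h, _ = self.mha(x_att, x_att, x_att)
        x_att = h + x_att                                         # residual connection
        return self.out(x_att.flatten(start_dim=1))               # (B, d_model)
```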

3.6 Latent Variables Estimation

The trajectory prediction model based on the CVAE achieves multimodal trajectory prediction by introducing a latent variable $Z$. It assumes that $Z$ follows a Gaussian distribution, with a prior network $p_\theta(Z_p \mid H_x) \sim \mathcal{N}(\mu_p, \Sigma_p)$ and a recognition network $q_\phi(Z_q \mid H_{xy}) \sim \mathcal{N}(\mu_q, \Sigma_q)$. Both the prior and recognition networks are composed of three feed-forward neural networks. The reparameterization technique is then used to obtain samples of $Z$ from $q_\phi(\cdot \mid H_{xy})$ during training or from $p_\theta(\cdot \mid H_x)$ during testing. This process can be represented as:

$$Z = \mu + \epsilon \odot \Sigma, \qquad \epsilon \sim \mathcal{N}(0, I). \tag{15}$$
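A minimal sketch of the reparameterization step in Eq. (15), assuming the prior/recognition networks output a mean and a log-variance for a diagonal Gaussian (a common convention; the exact parameterization of $\Sigma$ is not specified above).

```python
import torch

def sample_latent(mu: torch.Tensor, logvar: torch.Tensor) -> torch.Tensor:
    # Z = mu + eps * sigma with eps ~ N(0, I); differentiable w.r.t. mu and logvar.
    std = torch.exp(0.5 * logvar)
    eps = torch.randn_like(std)
    return mu + eps * std
```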

3.7 Trajectory Decoding

The trajectory decoding first relies on the hidden features $H_{xz} = \mathrm{Concat}(H_x, Z)$ to obtain the estimated trajectory endpoint $\hat{Y}_t^{goal}$. The predicted trajectory endpoint is then used as input to the trajectory decoder to predict multimodal trajectories. Inspired by the bidirectional decoder in [4], the trajectory decoder of the proposed model is also bidirectional. The forward decoding utilizes Receptance Weighted Key Value (RWKV) [17] as the basic module, while the backward decoding employs a Recurrent Neural Network (RNN) as the basic module. These two components are shown in Fig. 3. The decoder can be described as follows:

$$H_t^f = \mathrm{RWKV}(H_{t-1}^f), \qquad H_t^b = \mathrm{RNN}(H_{t+1}^b, \hat{Y}_{t+1} W_b + b_b), \qquad \hat{Y}_t = \mathrm{Concat}[H_t^f, H_t^b]\, W_o + b_o, \tag{16}$$

where $\hat{Y}_{T'+1} = \hat{Y}_t^{goal}$. $H_t^f$ and $H_t^b$ represent the forward and backward hidden states, respectively. $\{W_b, b_b\}$ and $\{W_o, b_o\}$ represent the backward and output parameter matrices, respectively. The estimated trajectory endpoint can be represented as

$$\hat{Y}_t^{goal} = \phi_{goal}(H_{xz}), \tag{17}$$

where $\phi_{goal}$ represents the trajectory endpoint decoder network composed of a 3-layer feed-forward neural network. In this paper, RNN refers to GRU. The RWKV block consists of time-mixing and channel-mixing blocks with a recursive structure. The recursion is formulated as a linear interpolation between the current input and the input from the previous time step. Letting $x_t$ represent the current input, the time-mixing block can be described as

$$\begin{aligned}
r_t &= W_r \cdot (\mu_r x_t + (1-\mu_r) x_{t-1}), \\
k_t &= W_k \cdot (\mu_k x_t + (1-\mu_k) x_{t-1}), \\
v_t &= W_v \cdot (\mu_v x_t + (1-\mu_v) x_{t-1}), \\
wkv_t &= \frac{\sum_{i=1}^{t-1} e^{-(t-1-i)w + k_i} v_i + e^{u + k_t} v_t}{\sum_{i=1}^{t-1} e^{-(t-1-i)w + k_i} + e^{u + k_t}}, \\
o_t &= W_o \cdot \big(\sigma(r_t) \odot wkv_t\big).
\end{aligned} \tag{18}$$

The channel-mixing block is given by the following equation:

$$\begin{aligned}
r_t &= W_r \cdot (\mu_r x_t + (1-\mu_r) x_{t-1}), \\
k_t &= W_k \cdot (\mu_k x_t + (1-\mu_k) x_{t-1}), \\
o_t &= \sigma(r_t) \odot \big(W_v \cdot \max(k_t, 0)^2\big).
\end{aligned} \tag{19}$$
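The following is a simplified, hypothetical sketch of the bidirectional decoding loop of Eq. (16). For brevity, a GRUCell stands in for the RWKV time-mixing block used as the forward module in the paper, and all layer names and sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class BiDecoder(nn.Module):
    def __init__(self, hidden=256, horizon=12):
        super().__init__()
        self.horizon = horizon
        self.fwd_cell = nn.GRUCell(hidden, hidden)   # stand-in for the RWKV forward block
        self.bwd_cell = nn.GRUCell(hidden, hidden)   # backward GRU of Eq. (16)
        self.embed_goal = nn.Linear(2, hidden)       # Y_{t+1} W_b + b_b
        self.out = nn.Linear(2 * hidden, 2)          # Concat[H^f, H^b] W_o + b_o

    def forward(self, h_xz, goal):
        # h_xz: (B, hidden) fused history/latent features; goal: (B, 2) estimated endpoint.
        B, H = h_xz.shape
        dummy = torch.zeros(B, H, device=h_xz.device)
        # Forward pass: propagate the hidden state over the prediction horizon.
        h_f, fwd = h_xz, []
        for _ in range(self.horizon):
            h_f = self.fwd_cell(dummy, h_f)
            fwd.append(h_f)
        # Backward pass: start from the estimated endpoint and decode towards the present.
        h_b, y_next = h_xz, goal
        preds = [None] * self.horizon
        for t in reversed(range(self.horizon)):
            h_b = self.bwd_cell(self.embed_goal(y_next), h_b)
            y_next = self.out(torch.cat([fwd[t], h_b], dim=-1))
            preds[t] = y_next
        return torch.stack(preds, dim=1)             # (B, horizon, 2) relative positions
```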

3.8 Training

From Eq. (5), it can be observed that three components must be optimized: the latent variables, the trajectory endpoints, and the predicted trajectories. In this paper, the Kullback-Leibler (KL) divergence is used to optimize the prior and posterior distributions of the latent variables, while residual prediction and a best-of-many (BoM) $\ell_2$ loss are employed to optimize the trajectory endpoints and predicted trajectories. Assuming there are a total of $K$ modes in the predicted trajectory, where $\hat{Y}_t^{(i)}$ represents the $i$th mode of the predicted trajectory, the final loss can be described as:

$$\begin{aligned}
\mathcal{L}_{total} = \mathbb{E}_{\substack{\hat{Y}_t \sim p_\xi(\cdot \mid H_{xz}, \hat{Y}_t^{goal}) \\ \hat{Y}_t^{goal} \sim p_\epsilon(\cdot \mid H_{xz}) \\ Z \sim q_\phi(\cdot \mid H_{xy})}} \Big[ &\min_{i \in K} \big\| Y_t - X_t - \hat{Y}_t^{(i)} \big\|_2 + \min_{i \in K} \big\| Y_t^{goal} - X_t - \hat{Y}_t^{goal,(i)} \big\|_2 \Big] \\
&- D_{KL}\big[q_\phi(Z \mid H_{xy}) \,\|\, p_\theta(Z \mid H_x)\big],
\end{aligned} \tag{20}$$

where $\hat{Y}_t^{goal} \in \mathbb{R}^{K \times 2}$ and $\hat{Y}_t \in \mathbb{R}^{K \times T' \times 2}$ respectively represent the estimated trajectory endpoints and predicted trajectories relative to the current position $X_t$.
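A hedged sketch of the objective in Eq. (20), assuming diagonal-Gaussian prior and posterior parameters and $K$ sampled modes; tensor shapes are illustrative, and the KL term is added with the sign conventionally used when minimizing a CVAE objective.

```python
import torch

def social_cvae_loss(pred, pred_goal, gt, mu_q, logvar_q, mu_p, logvar_p):
    """Hypothetical shapes: pred (B, K, T, 2), pred_goal (B, K, 2), gt (B, T, 2),
    latent parameters (B, Z). All offsets are relative to the current position X_t."""
    # Best-of-many (BoM) l2 terms: keep only the best of the K sampled modes.
    traj_err = torch.norm(pred - gt.unsqueeze(1), dim=-1).mean(dim=-1)            # (B, K)
    goal_err = torch.norm(pred_goal - gt[:, -1, :].unsqueeze(1), dim=-1)          # (B, K)
    bom = traj_err.min(dim=1).values + goal_err.min(dim=1).values                 # (B,)
    # KL divergence between two diagonal Gaussians: KL[q_phi || p_theta].
    kl = 0.5 * (logvar_p - logvar_q
                + (logvar_q.exp() + (mu_q - mu_p) ** 2) / logvar_p.exp() - 1).sum(dim=-1)
    return (bom + kl).mean()
```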

4 Experimental Results and Analysis

4.1 Experimental Setup

Datasets. We evaluate our method on two publicly available datasets: ETH [18] and UCY [19]. These datasets include pedestrian interactions, non-linear trajectories, collision avoidance, standing, and group pedestrian trajectory coordinates. They also contain 5 unique outdoor environments recorded from a fixed top-down view. The ETH dataset consists of two subsets, ETH and Hotel, while the UCY dataset consists of three subsets: Zara1, Zara2, and Univ.

Table 1. ADE20/FDE20 of Pedestrian Trajectory Prediction.

| Models             | ETH       | Hotel     | Univ.     | Zara01    | Zara02    | Average   |
|--------------------|-----------|-----------|-----------|-----------|-----------|-----------|
| Social-STGCNN      | 0.64/1.11 | 0.49/0.85 | 0.44/0.79 | 0.34/0.53 | 0.30/0.48 | 0.44/0.75 |
| SGCN               | 0.63/1.03 | 0.32/0.55 | 0.37/0.70 | 0.29/0.53 | 0.25/0.45 | 0.37/0.65 |
| Social-GAN         | 0.63/1.09 | 0.46/0.98 | 0.56/1.18 | 0.33/0.67 | 0.31/0.64 | 0.70/1.52 |
| SoPhie             | 0.70/1.43 | 0.76/1.67 | 0.54/1.24 | 0.30/0.63 | 0.38/0.78 | 0.72/1.54 |
| Social-Ways        | 0.39/0.64 | 0.39/0.66 | 0.55/1.31 | 0.44/0.64 | 0.51/0.92 | 0.46/0.83 |
| PECNet             | 0.54/0.87 | 0.18/0.24 | 0.35/0.60 | 0.22/0.39 | 0.17/0.30 | 0.29/0.48 |
| Trajectron++       | 0.54/0.94 | 0.16/0.28 | 0.28/0.55 | 0.21/0.42 | 0.16/0.31 | 0.27/0.50 |
| Social-VAE         | 0.51/0.85 | 0.15/0.24 | 0.25/0.47 | 0.21/0.38 | 0.16/0.30 | 0.25/0.43 |
| Social-DualCVAE    | 0.66/1.26 | 0.42/0.78 | 0.40/0.77 | 0.31/0.56 | 0.27/0.45 | 0.41/0.76 |
| BiTrap             | 0.51/0.74 | 0.13/0.20 | 0.24/0.45 | 0.22/0.43 | 0.16/0.32 | 0.25/0.43 |
| Social-CVAE        | 0.47/0.74 | 0.14/0.23 | 0.24/0.43 | 0.21/0.38 | 0.16/0.31 | 0.24/0.42 |
| Social-CVAE + FPC  | 0.42/0.62 | 0.13/0.21 | 0.22/0.38 | 0.19/0.31 | 0.15/0.28 | 0.22/0.36 |

Similar to previous works [1,2,4], we employ leave-one-out cross-validation, training on four sets and testing on the remaining set. The observation/prediction window is set at 3.2/4.8 s (i.e., 8/12 frames).

Compared Methods. To validate the effectiveness of the proposed model, we compare it with the following baselines, including previous state-of-the-art multimodal prediction methods. GAN-based methods: Social-GAN [1], SoPhie [12], Social-Ways [13]. CVAE-based methods: PECNet [14], Trajectron++ [15], Social-VAE [2], Social-DualCVAE [3], and BiTrap [4]. Graph-based methods: Social-STGCNN [20], SGCN [21].


Implementation Details. All models are implemented in PyTorch and trained and tested on a single Nvidia 3090 40 GB GPU. Parameter optimization is performed using the Adam optimizer with a batch size of 128 and an initial learning rate of 3e-4. The total number of training epochs is 120. Metrics on the test set are computed with the model that performs best on the validation set. The hyper-parameters of the baseline methods are the same as those in the original papers. The hyper-parameter settings for the proposed model are as follows: the multi-head attention consists of 4 attention heads and one layer, with an embedding dimension of 256. The hidden size of both the GRU and RWKV in the decoder is 256.

4.2 Evaluation Indicator

We use commonly used metrics in the literature [1,3], namely Average Displacement Error (ADE) and Final Displacement Error (FDE), to evaluate the performance of the model. ADE represents the average $\ell_2$ distance between the predicted trajectory and the ground truth, while FDE measures the $\ell_2$ distance between the estimated trajectory's final point and the ground truth's final point. Assuming there are $N$ agents in the scene, where $Y_i^{(t),(k)}$ represents the position of agent $i$ at timestamp $t$ of the $k$th trajectory, the prediction task is to forecast $K$ plausible trajectories. Mathematically, $ADE_K$ and $FDE_K$ are represented as

$$ADE_K = \min_{k \in K} \frac{\sum_{t=1}^{T'} \sum_{i=1}^{N} \big\| \hat{Y}_i^{(t),(k)} - Y_i^{(t)} \big\|_2}{T' \cdot N}, \tag{21}$$

$$FDE_K = \min_{k \in K} \frac{\sum_{i=1}^{N} \big\| \hat{Y}_i^{(T'),(k)} - Y_i^{(T')} \big\|_2}{N}, \tag{22}$$

where $ADE_K$ represents the minimum average displacement error over $K$ samples, and $FDE_K$ represents the minimum final displacement error over $K$ samples.
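A small sketch of how the minimum-over-$K$ metrics of Eqs. (21) and (22) can be computed; the tensor layout is an assumption for illustration.

```python
import torch

def min_ade_fde(pred, gt):
    """pred: (K, N, T, 2) K sampled trajectories for N agents; gt: (N, T, 2).
    Returns (ADE_K, FDE_K) as in Eqs. (21)-(22)."""
    dist = torch.norm(pred - gt.unsqueeze(0), dim=-1)      # (K, N, T) per-step l2 errors
    ade = dist.mean(dim=(1, 2)).min().item()               # average over agents/time, best of K
    fde = dist[..., -1].mean(dim=1).min().item()           # final-step error, best of K
    return ade, fde
```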

4.3 Results and Analyses

Table 1 displays the $ADE_{20}$ and $FDE_{20}$, i.e., the best sample of Average Displacement Error (ADE) and Final Displacement Error (FDE) among 20 trajectories, for different models in various scenarios. The ADE/FDE results are reported in meters. The bold black font represents the state-of-the-art without the FPC condition in Social-CVAE, while the underline indicates the state-of-the-art among all models. In addition, we also introduce a variant with Final Position Clustering (FPC) [2] that enhances prediction diversity within a limited number of samples. Specifically, the number of predictions performed exceeds the required number of samples; K-means clustering is used to cluster the final positions of all predicted trajectories, and only the predicted trajectory whose final position is closest to each cluster center is retained. Finally, K predictions are generated as the final result. In this paper, the FPC setting is as follows: the model predicts 5 times, generating 100 (5 × 20) prediction samples, and the final 20 predictions are obtained through K-means clustering.

In the absence of FPC, the prediction results of the proposed Social-CVAE are on par with or even surpass the state-of-the-art baselines. Social-CVAE demonstrates an improvement of approximately 45.5% in terms of Average Displacement Error (ADE) and 44% in terms of Final Displacement Error (FDE) compared to Social-STGCNN. With the application of FPC, the improvements over Social-STGCNN increase to approximately 50% and 52%, respectively. This indicates that applying FPC can lead to predictions that better align with the ground truth in scenarios with a limited number of samples.
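A hedged sketch of the Final Position Clustering step described above, assuming scikit-learn's KMeans; the candidate count (e.g., 100 samples reduced to 20) follows the setting in the text, while function and variable names are illustrative.

```python
import numpy as np
from sklearn.cluster import KMeans

def final_position_clustering(candidates: np.ndarray, k: int = 20) -> np.ndarray:
    """candidates: (M, T, 2) with M > k predicted trajectories (e.g. M = 100 = 5 x 20).
    Cluster the final positions into k groups and keep, per cluster, the trajectory
    whose endpoint is closest to the cluster centre."""
    endpoints = candidates[:, -1, :]                          # (M, 2) final positions
    km = KMeans(n_clusters=k, n_init=10).fit(endpoints)
    kept = []
    for c in range(k):
        idx = np.where(km.labels_ == c)[0]
        d = np.linalg.norm(endpoints[idx] - km.cluster_centers_[c], axis=1)
        kept.append(candidates[idx[np.argmin(d)]])
    return np.stack(kept)                                     # (k, T, 2)
```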

5 Conclusion

In this paper, we propose Social-CVAE, a novel architecture based on a conditional variational autoencoder for multimodal pedestrian trajectory prediction. It utilizes attention-based mechanisms to extract pedestrian trajectory features from short-term observations and relies on a bidirectional decoder that uses the estimated trajectory endpoint as a condition to generate stochastic predictions of future trajectories. Predicting the next actions of an agent based on the surrounding environment would also be helpful. However, the pedestrian trajectory prediction datasets used for validation in this paper lack detailed environmental information; therefore, a module for extracting the surrounding environment was not considered in the model design. If more detailed environmental information were available, such as road conditions, traffic signals, and pedestrian density, it would contribute to improving the performance of the prediction model. In future work, we aim to further optimize the model's inference speed and prediction accuracy, enabling its deployment in practical applications such as intelligent cameras and robotic systems.

References

1. Gupta, A., Johnson, J., Fei-Fei, L., Savarese, S., Alahi, A.: Social GAN: socially acceptable trajectories with generative adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2255–2264 (2018)
2. Xu, P., Hayet, J.-B., Karamouzas, I.: SocialVAE: human trajectory prediction using timewise latents. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision – ECCV 2022, 17th European Conference, Tel Aviv, Israel, Proceedings, Part IV, vol. 13664, pp. 511–528. Springer, Heidelberg (2022). https://doi.org/10.1007/978-3-031-19772-7_30
3. Gao, J., Shi, X., Yu, J.J.: Social-DualCVAE: multimodal trajectory forecasting based on social interactions pattern aware and dual conditional variational auto-encoder. arXiv preprint arXiv:2202.03954 (2022)
4. Yao, Y., Atkins, E., Johnson-Roberson, M., Vasudevan, R., Du, X.: BiTraP: bi-directional pedestrian trajectory prediction with multi-modal goal estimation. IEEE Robot. Autom. Lett. 6(2), 1463–1470 (2021)
5. Liang, R., Li, Y., Zhou, J., Li, X.: STGlow: a flow-based generative framework with dual graphormer for pedestrian trajectory prediction. arXiv preprint arXiv:2211.11220 (2022)
6. Gu, T., Chen, G., Li, J., Lin, C., Rao, Y., Zhou, J., Lu, J.: Stochastic trajectory prediction via motion indeterminacy diffusion. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 17113–17122 (2022)
7. Helbing, D., Molnar, P.: Social force model for pedestrian dynamics. Phys. Rev. E 51(5), 4282 (1995)
8. Alahi, A., Goel, K., Ramanathan, V., Robicquet, A., Fei-Fei, L., Savarese, S.: Social LSTM: human trajectory prediction in crowded spaces. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 961–971 (2016)
9. Zhang, P., Yang, W., Zhang, P., Xue, J., Zheng, N.: SR-LSTM: state refinement for LSTM towards pedestrian trajectory prediction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12085–12094 (2019)
10. Kosaraju, V., Sadeghian, A., Martín-Martín, R., Reid, I., Rezatofighi, H., Savarese, S.: Social-BiGAT: multimodal trajectory forecasting using Bicycle-GAN and graph attention networks. Adv. Neural Inf. Process. Syst. 32, 1–10 (2019)
11. Yue, J., Manocha, D., Wang, H.: Human trajectory prediction via neural social physics. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision – ECCV 2022, 17th European Conference, Tel Aviv, Israel, 23–27 October 2022, Proceedings, Part XXXIV, vol. 13694, pp. 376–394. Springer, Heidelberg (2022). https://doi.org/10.1007/978-3-031-19830-4_22
12. Sadeghian, A., Kosaraju, V., Sadeghian, A., Hirose, N., Rezatofighi, H., Savarese, S.: SoPhie: an attentive GAN for predicting paths compliant to social and physical constraints. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1349–1358 (2019)
13. Amirian, J., Hayet, J.-B., Pettré, J.: Social Ways: learning multi-modal distributions of pedestrian trajectories with GANs. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (2019)
14. Mangalam, K., et al.: It is not the journey but the destination: endpoint conditioned trajectory prediction. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12347, pp. 759–776. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58536-5_45
15. Salzmann, T., Ivanovic, B., Chakravarty, P., Pavone, M.: Trajectron++: dynamically-feasible trajectory forecasting with heterogeneous data. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12363, pp. 683–700. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58523-5_40
16. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
17. Peng, B., et al.: RWKV: reinventing RNNs for the transformer era. arXiv preprint arXiv:2305.13048 (2023)
18. Pellegrini, S., Ess, A., Van Gool, L.: Improving data association by joint modeling of pedestrian trajectories and groupings. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6311, pp. 452–465. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15549-9_33
19. Leal-Taixé, L., Fenzi, M., Kuznetsova, A., Rosenhahn, B., Savarese, S.: Learning an image-based motion context for multiple people tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3542–3549 (2014)
20. Mohamed, A., Qian, K., Elhoseiny, M., Claudel, C.: Social-STGCNN: a social spatio-temporal graph convolutional neural network for human trajectory prediction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14424–14432 (2020)
21. Shi, L., et al.: SGCN: sparse graph convolution network for pedestrian trajectory prediction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8994–9003 (2021)

Distributed Training of Deep Neural Networks: Convergence and Case Study

Jacques M. Bahi(B), Raphaël Couturier, Joseph Azar, and Kevin Kana Nguimfack

Université de Franche-Comté, CNRS, Institut FEMTO-ST, F-25000 Besançon, France
{jacques.bahi,raphael.couturier,joseph.azar,kevin.kana_nguimfack}@univ-fcomte.fr

Abstract. Deep neural network training on a single machine has become increasingly difficult due to a lack of computational power. Fortunately, distributed training of neural networks can be performed with model and data parallelism and sub-network training. This paper introduces a mathematical framework to study the convergence of distributed asynchronous training of deep neural networks with a focus on sub-network training. This article also studies the convergence conditions in synchronous and asynchronous modes. An asynchronous and lock-free training version of the sub-network training is proposed to validate the theoretical study. Experiments were conducted on two well-known public datasets, namely Google Speech and MaFaulDa, using the Jean Zay supercomputer of GENCI. The results indicate that the proposed asynchronous sub-network training approach, with 64 GPUs, achieves faster convergence time and better generalization than the synchronous approach.

Keywords: Distributed Deep Learning · Asynchronous convergence · Subnet Training · Parallelism

1 Introduction

The work presented in this paper concerns distributed and parallel machine learning with a focus on asynchronous deep learning model training. Model parallelism and data parallelism are the two primary approaches to distributed training of neural networks [1,2]. In data parallelism, each server in a distributed environment receives a complete model replica but only a portion of the data. Locally on each server, a replica of the model is trained on a subset of the entire dataset [3]. Model parallelism involves splitting the model across multiple servers. Each server is responsible for processing a unique neural network portion, such as a layer or subnet. Data parallelism could be deployed on top of model parallelism to parallelize model training further. In a deep convolutional neural network, for instance, the convolution layer requires a large number of calculations but a small parameter coefficient W, whereas the fully connected layer requires a small number of calculations and a large parameter coefficient W. Consequently, data parallelism is appropriate for convolutional layers, while model parallelism is appropriate for fully connected layers.

Nevertheless, data and model parallelism approaches have numerous drawbacks. Data parallel approaches suffer from limited bandwidth since they must send the whole model to every location to synchronize the computation; a huge model with many parameters cannot meet this requirement. Model parallel approaches are not viable in most distribution settings. When data are shared across sites, model parallel computing requires that different model components can only be updated to reflect the data at a specific node. Fine-grained communication is required to keep these components in sync [4].

1.1 Objective

The study has two main objectives. Given a neural network and its partition into sub-networks, the first objective is to establish the mathematical relationship between deep learning on the complete network and deep learning on the partitioned sub-networks. The second is to use this mathematical model to clarify sufficient conditions that ensure effective convergence, after training of the sub-networks, towards the solution that would have been obtained by the complete neural network. The research question RQ is "whether, by training and combining smaller parts of a full neural network, the convergence of the global solution to the original problem is obtained. In other words, how to ensure that the weight parameters obtained by training the partitioned parts of the full network reconstitute the correct weight parameters that tune the full network."

The mathematical model we introduce is general; indeed, it covers both sub-network training and federated learning approaches, as well as their execution in both synchronous and asynchronous modes. To validate our model, we conduct synchronous and asynchronous implementations of sub-network training with a coordinator and several workers and compare their performance. Our implementation aims to validate the proposed mathematical model and its necessary conditions, rather than to offer a novel architecture for distributed deep learning. For this purpose, we present an implementation based on Independent Subnet Training (IST), an approach proposed by Yuan et al. [5]. The experiments we conducted¹ demonstrate how asynchronous distributed learning leads to a shorter waiting time for model training and a faster model updating cycle.

1.2 Methodology

We introduce a general mathematical model to describe the sequences generated² by sub-network partitioning-based approaches. The model considers the dependencies of the unknown variables (weight parameters) that the original neural network must tune. Consequently, different techniques can be modeled, including independent sub-network training and federated learning methods. Specifically, this mathematical model is based on a fixed-point function defined on an extended product space, see [6]: the sequences generated by the extended fixed-point function describe the aggregation of iterations produced by gradient descent algorithms applied to neural sub-networks. Using the fixed-point theorem, it is possible to study the numerical convergence of sub-network-based approaches running back-propagation algorithms towards the actual solution that would be obtained by the original neural network. The numerical convergence of sub-network partitioning algorithms in asynchronous mode is then examined. To this aim, we apply a classical convergence theorem of asynchronous iterations developed in [7], and in other papers such as [8,9], to the aforementioned fixed-point function. The convergence conditions and cautious implementation details are then specified.

The rest of the paper is organized as follows. The related work is presented in Sect. 2. Section 3 recalls preliminaries on successive approximations, the fixed-point principle, and their usefulness for modeling sequential, synchronous, and asynchronous iterations. The general mathematical model and the convergence study of the proposed algorithms are presented in Sect. 4: we first introduce the fixed-point mapping that models the behavior of a certain number of backpropagations on sub-networks of the original network, and then prove, thanks to Propositions 1 and 2, the convergence of synchronous and asynchronous sub-network training algorithms. Section 5 is devoted to the empirical validation of the results; the experimental use case is also presented. Section 6 discusses this paper's results and presents future work and perspectives, and Sect. 7 concludes the paper.

¹ Experiments are conducted on GENCI's Jean Zay supercomputer in France, which contains 64 GPUs.
² The generated sequences, or produced sequences, are in fact the sequence of iterations and therefore the computation of the weights at each update by the backpropagation algorithm.

2 Related Work

Stochastic gradient descent (SGD) uses a randomly chosen sample to update model parameters, providing benefits such as quick convergence and effectiveness. Nodes must communicate to update parameters during gradient calculation. However, waiting for all nodes to finish before updating can decrease computing speed if there is a significant speed difference between nodes. If the speed difference is small, synchronization algorithms work well, but if the difference is significant, the slowest node will reduce overall computing speed, negatively impacting performance. Asynchronous algorithms handle this situation most effectively [10]. Google's DistBelief system is an early implementation of async SGD. The DistBelief model allows for model parallelism [11]: different components of deep neural networks are processed by multiple computer processors. Each component is processed on a separate computer, and the DistBelief software combines the results. It is similar to how a computer's graphics card can be used to perform certain computations faster than the CPU alone. The processing time is decreased because each machine does not have to wait its turn to perform a portion of the computation, but can instead work on it simultaneously.

Async SGD has two main advantages: (1) it has the potential to achieve higher throughput across the distributed system, since workers can spend more time performing useful computations instead of waiting for the parameter averaging step to complete; (2) workers can integrate information from other workers (parameter updates) faster than with synchronization every N steps. However, a consequence of introducing asynchronous updates to the parameter vector is a new issue known as the stale gradient problem. A naive implementation of async SGD would result in a high gradient staleness value. For example, Gupta et al. [12] show that the stale value of the average gradient is equal to the number of workers: if there are M workers, these gradients are M steps late when applied to the global parameter vector. The real-world impact is that high gradient staleness values can greatly slow down network convergence or even prevent convergence for some configurations altogether. Earlier async SGD implementations (such as Google's DistBelief system) did not consider this effect. Ways to handle stale gradients include, among others, implementing "soft" synchronization rules such as in [13] and delaying the faster learner if necessary to ensure that the overall maximum delay is less than a threshold [14].

Researchers have proposed several schemes to parallelize stochastic gradient descent, but many of these computations are typically implemented using locks, leading to performance degradation. The authors of [15] propose a straightforward technique called Hogwild! to remove locks. Hogwild! is a scheme that enables lock-free parallel SGD implementations: individual processors are granted equal access to shared memory and can update memory components at will. Such a pattern of updates leads to high instruction-level parallelism on modern processors, resulting in significant performance gains over lock-based implementations. Hogwild! has been extended by the Dogwild! model in [16] to distributed-memory systems, where it still converges for deep learning problems; to reduce the interference effect of overwriting w at each step, the gradient ∇w from the training agents is transferred in place of w.

The empirical evaluation of this paper is inspired by the Independent Subnet Training (IST) model [5], which decomposes the neural network layers into a collection of subnets for the same goal by splitting the neurons across multiple locations. Before synchronization, each of these subnets is trained for one or more local stochastic gradient descent (SGD) rounds. IST improves on model parallelism approaches in some ways: given that subnets are trained independently during local updates, there is no need to synchronize them with each other. Still, IST has some of the same benefits as model parallelism methods. Since each machine only gets a small piece of the whole model, IST can be used to train models that are too big to fit in the RAM of a node or a device. This can be helpful when training large models on GPUs, which have less memory than CPUs. The following sections provide a proof of convergence for partitioning-based networks in synchronous and asynchronous modes. The empirical evaluation section considers the IST model as a use case and proposes an asynchronous implementation.

3 Preliminaries: Successive Approximations, Synchronous and Asynchronous Algorithms

In the sequel, $x_i^k$ denotes the $k$th iterate of the $i$th component of a multidimensional vector $x$. A numerical iterative method is usually described by a fixed-point application, say $T$. For example, given an optimization problem $\min_x f(x)$, the first challenge is to build an adequate $T$ such that the iterative sequence $x^{k+1} = T(x^k)$, $k \in \mathbb{N}$, converges to a fixed point $x^*$, solution of $\min_x f(x)$. Successive iterations generated by the numerical method are then described by:

$$\text{Given } x^0 = (x_1^0, \ldots, x_n^0)^T \in \mathbb{R}^n: \quad \text{for } k = 1, 2, \ldots \text{ until convergence, for } i = 1, \ldots, n: \quad x_i^{k+1} = T_i(x_1^k, \ldots, x_n^k). \tag{1}$$

Suppose that $x$ is partitioned into $m$ blocks $x_l \in \mathbb{R}^{n_l}$ so that $\sum_{l=1}^m n_l = n$, and that $m$ processors are respectively in charge of a block $x_l$ of components of $x$. Let us then define sets $S(k)$ describing the blocks of $x$ updated at iteration $k$; the other blocks are not updated at iteration $k$. Asynchronous algorithms, in which the communications between blocks are free with no synchronization between the processors in charge of the blocks, are described by the following iterations:

$$\text{Given } x^0 = (x_1^0, \ldots, x_m^0)^T \in \prod_{l=1}^m \mathbb{R}^{n_l}: \quad \text{for } k = 1, 2, \ldots \text{ until convergence, for } l = 1, \ldots, m: \quad x_l^{k+1} = \begin{cases} T_l\big(x_1^{\rho_1(k)}, \ldots, x_m^{\rho_m(k)}\big) & \text{if } l \in S(k), \\ x_l^k & \text{if } l \notin S(k). \end{cases} \tag{2}$$

Here $\rho_i(k)$ is the delay due to processor $i$ at the $k$th iteration. Note that with this formulation, synchronous per-block iterative algorithms are particular cases of asynchronous per-block iterative algorithms: one simply assumes that there is no delay between the processors, i.e., $\forall i, k:\ \rho_i(k) = k$, and that all the blocks are updated at each iteration, $\forall k:\ S(k) = \{1, \ldots, m\}$. In this paper, $T^{(\beta)}$ will denote the fixed-point mapping describing the gradient descent method or its practical version SGD with learning rate $\beta$. Under suitable conditions and construction, the fixed-point application $T$ describing the gradient descent method converges to a desired solution. In what follows, a fixed-point mapping $\mathcal{T}$ will also be introduced; this is a key function of our study on training deep neural networks. This function involves the computation of the weight parameters associated with network training, as well as combination matrices that model the aggregation of partial computations from sub-networks.
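To make formulations (1)–(2) concrete, here is a toy NumPy illustration (not from the paper) for a contractive affine map $T(x) = Ax + b$: the synchronous variant updates every component at each step, while the asynchronous variant updates a random subset of blocks using possibly outdated copies of the others; both converge to the same fixed point.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 8, 4                                           # n unknowns split into m blocks
A = rng.standard_normal((n, n))
A = 0.5 * A / np.abs(A).sum(axis=1, keepdims=True)    # ||A||_inf = 0.5 < 1: max-norm contraction
b = rng.standard_normal(n)
x_star = np.linalg.solve(np.eye(n) - A, b)            # unique fixed point of T(x) = A x + b
blocks = np.split(np.arange(n), m)

def T(x):
    return A @ x + b

x_sync = np.zeros(n)
x_async = np.zeros(n)
stale = [x_async.copy() for _ in range(m)]            # per-block, possibly delayed view of x

for k in range(200):
    x_sync = T(x_sync)                                # Eq. (1): all components updated
    S_k = rng.choice(m, size=rng.integers(1, m + 1), replace=False)   # active blocks, Eq. (2)
    for l in S_k:
        if rng.random() < 0.7:                        # occasionally refresh the stale view
            stale[l] = x_async.copy()
        x_async[blocks[l]] = T(stale[l])[blocks[l]]   # update block l from delayed data

print(np.linalg.norm(x_sync - x_star), np.linalg.norm(x_async - x_star))  # both tend to 0
```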

4 Convergence Study

4.1 Mathematical Modeling

Consider the general case of a feedforward neural network with some number of layers and neurons. Suppose that the number of interconnections between neurons in the network is $n$, so that the network is tuned by an $n$-dimensional weight vector. Consider a training set composed of inputs and targets, and a loss function $f$ (e.g., Mean Squared Error, Cross Entropy, etc.) that should be minimized on the training set in order to fit the inputs to their targets. To do so, let $x \in \mathbb{R}^n$ be the weight vector that the neural network must tune to minimize $f$. Suppose that the gradient descent algorithm is used to perform this minimization. The gradient descent algorithm performs the iterations:

$$x^{k+1} = x^k - \beta \nabla_x f(x^k), \quad k \in \mathbb{N}, \quad x^0 \in D(f). \tag{3}$$

Under the right conditions on the function $f$ and the learning rate $\beta$, the function $I - \nabla_x f$ is contractive on its definition domain $D(f)$ and the successive iterations (Eq. 3) converge to $x^*$ satisfying

$$x^* = \arg\min_x f(x), \quad x^* \in D(f). \tag{4}$$

Here $\nabla_x f(x) = \left(\frac{\partial f_i}{\partial x_j}\right)_{ij}$ is the Jacobian matrix consisting of the partial derivatives of $f$ with respect to the weight vector $x$ of the interconnections of the network. A backpropagation with a learning rate $\beta_i$ applied to the network consists in computing the successive iterations associated with the gradient descent algorithm:

$$x^{k+1} = x^k - \beta_i \nabla_x f(x^k), \quad k \in \mathbb{N}, \quad x^0 \text{ being the initialization weights.} \tag{5}$$

Let $T^{(\beta_i)}$ denote the fixed-point application describing the iterations (Eq. 5): $y = T^{(\beta_i)}(x) = x - \beta_i \nabla_x f(x)$. Consider $m$ backpropagation algorithms on the neural network Net with learning rates $\beta_i$, $i = 1, \ldots, m$. The learning rates $\beta_1, \ldots, \beta_m$ are chosen in such a way that the gradient descent algorithms converge to $x^*$, solution of (4), i.e.

$$T^{(\beta_i)}(x^*) = x^*, \quad \forall i \in \{1, \ldots, m\}. \tag{6}$$

For a network Net with an $n$-dimensional weight vector and $m$ backpropagation algorithms, let us introduce the following fixed-point mapping, which is called the aggregation fixed-point mapping:

$$\begin{aligned}
\mathcal{T} : (\mathbb{R}^n)^m &\longrightarrow (\mathbb{R}^n)^m \\
(x^1, \ldots, x^m) &\longmapsto (y^1, \ldots, y^m), \qquad y^l = T^{(\beta_l)}\!\left(\sum_{k=1}^m E_k x^k\right),
\end{aligned} \tag{7}$$

where the $E_k$ are diagonal matrices called weight matrices such that

$$\sum_{k=1}^m E_k = I. \tag{8}$$

$I$ is the identity matrix in $\mathbb{R}^{n \times n}$. Since the weight matrices are diagonal, the aggregation fixed-point mapping models the execution of $m$ backpropagations on $m$ sub-networks. It should be noted that condition (8) is essential for the formal convergence study.

Proposition 1 ($\mathcal{T}$ is a contractive function). The aggregation fixed-point mapping $\mathcal{T}$ is contractive with a contraction constant $\alpha$ satisfying $\alpha \leq \max_{l=1}^m (\alpha_l)$, where the $\alpha_l$ are the contraction constants of the $T^{(\beta_l)}$.

Proof. Let $y^l = T^{(\beta_l)}\big(\sum_{k=1}^m E_k x^k\big)$. From (6), each gradient descent converges to $x^*$, the solution of (4), so each fixed-point mapping $I - \beta \nabla_x f$ is contractive. For $l \in \{1, \ldots, m\}$, denote by $\alpha_l$ the contraction constant of $T^{(\beta_l)}$ with respect to the $\ell_2$ norm $\|x\|_2$. By (7) we have

$$\|y^l - x^*\|_2 = \Big\|T^{(\beta_l)}\Big(\textstyle\sum_{k=1}^m E_k x^k\Big) - T^{(\beta_l)}(x^*)\Big\|_2 \leq \alpha_l \Big\|\textstyle\sum_{k=1}^m E_k x^k - x^*\Big\|_2.$$

Since the $E_k$ are diagonal matrices and $\sum_{k=1}^m E_k = I$, we have

$$\Big\|\textstyle\sum_{k=1}^m E_k x^k - x^*\Big\|_2 = \Big\|\textstyle\sum_{k=1}^m E_k (x^k - x^*)\Big\|_2 \leq \max_{k=1}^m \|x^k - x^*\|_2.$$

Thus, $\|y^l - x^*\|_2 \leq \alpha_l \max_{k=1}^m \|x^k - x^*\|_2$, and hence $\max_{l=1}^m \|y^l - x^*\|_2 \leq (\max_{l=1}^m \alpha_l) \max_{k=1}^m \|x^k - x^*\|_2$. Defining the norm $\|(y^1, \ldots, y^m)\|_\infty = \max_{1 \leq l \leq m} \|y^l\|_2$, we obtain

$$\big\|\mathcal{T}(x^1, \ldots, x^m) - (x^*, \ldots, x^*)\big\|_\infty \leq \Big(\max_{l=1}^m \alpha_l\Big) \big\|(x^1, \ldots, x^m) - (x^*, \ldots, x^*)\big\|_\infty.$$

This proves the claimed result.

Proposition 2 (Asynchronous convergence of $\mathcal{T}$). The asynchronous iterations generated by the fixed-point mapping $\mathcal{T}$ converge to $(x^*, \ldots, x^*)^T$.

Proof. Proposition 1 implies that we are in the framework of a contractive fixed-point mapping with respect to a maximum norm on a product space. Indeed, $\mathcal{T}$ is a contractive mapping with respect to the maximum norm $\|\cdot\|_\infty$ on the product space $\prod_{i=1}^m \mathbb{R}^n = (\mathbb{R}^n)^m$. These are the required sufficient conditions for the convergence of asynchronous algorithms associated with $\mathcal{T}$; see [7]. Thus Proposition 2 is proved.

Remark 1. The aggregation fixed-point mapping models Independent Subnet Training: indeed, if the neurons are disjoint, then for all $i$, $(E_k)_{ii}$ is equal to either 0 or 1. It should be noted that we do not use the notion of compressed iterates and the assumptions related to the expectation of the compression operator $\mathcal{M}(\cdot)$ as in [5]; we prove the convergence of the iterate values instead of their convergence in expectation. To model federated-learning-like algorithms, the weight matrices become $(E_k)_{ii} = 1/M$ for all $i$.
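As a small numerical illustration (not from the paper's code) of the weight matrices $E_k$: for IST-style disjoint partitioning, each diagonal entry is 0 or 1 and every index is owned by exactly one sub-network, whereas for federated-learning-style aggregation every entry equals $1/M$; in both cases the matrices sum to the identity, as required by condition (8).

```python
import numpy as np

n, m = 6, 3
rng = np.random.default_rng(1)

# IST-style disjoint partition: each weight index is owned by exactly one sub-network.
owner = rng.integers(0, m, size=n)
E_ist = [np.diag((owner == k).astype(float)) for k in range(m)]
assert np.allclose(sum(E_ist), np.eye(n))          # condition (8)

# Federated-learning-style aggregation: every worker contributes 1/m to every weight.
E_fed = [np.eye(n) / m for _ in range(m)]
assert np.allclose(sum(E_fed), np.eye(n))

# Aggregation used inside the mapping: sum_k E_k x^k combines the workers' weight vectors.
xs = [rng.standard_normal(n) for _ in range(m)]
x_agg = sum(E @ x for E, x in zip(E_ist, xs))
print(x_agg)
```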

5 Empirical Validation

5.1 The Links Between the Mathematical Model and the Implementation

- To obtain asynchronous iterations produced by the aggregation of the computations of $m$ sub-networks, it is sufficient to consider that $T = \mathcal{T}$ in (2).
- Synchronous per-block computations correspond to $S(k) = \{1, \ldots, m\}$ and $\rho_i(k) = k$ for all $k$ (no delays, and all the blocks are updated at each iteration).
- A sub-network $l$ is defined by all nodes $j$ corresponding to the non-zero entries of $E_l$: $(E_l)_{jj} \neq 0$. The aggregation of the weight vectors computed by the $m$ sub-networks must therefore be done carefully so that $\sum_{k=1}^m E_k = I$ is satisfied.
- Even if, in the mathematical formulation, $\mathcal{T}$ is defined on the product space $\prod_{i=1}^m \mathbb{R}^n$, each sub-network $l$ updates at each iteration only its own components, because the weight matrices $E_l$ are diagonal and $(E_l)_{jj} = 0$ if node $j$ does not belong to sub-network $l$.
- To execute the computations in asynchronous mode, the convergence theory requires some conditions on the block-component updates and the delays between the processors. These are listed below.
  – For all $i \in \{1, \ldots, m\}$, the set $\{k \in \mathbb{N} : i \in S(k)\}$ is infinite. This simply means that any sub-network $i$ is guaranteed to update its computations, i.e., not to be permanently inactive.
  – The delays must "follow" the iterations; the mathematical formulation for this is $\lim_{k \to \infty} \rho_i(k) = \infty$ for all $i$.

For more details, see the literature on asynchronous iterations, e.g., [7,9,17–19]. It can be noticed that in a practical implementation these assumptions are realistic.

5.2 Use Case: Independent Subnet Training

As a use case, let us consider the IST model for this research. Note that the proposed mathematical model is general and could also cover dependent partitioning-based networks. The publicly available GitHub implementation³, the clarity of the code, and the recent date of publication were among the reasons to consider this model for our use case. In the original work, the authors suggest regularly resampling the subnets and training them for fewer iterations between resamplings. This paper proposes that the coordinator distribute subnets only at the beginning of training to prevent blockage. Similarly to the original IST model, the following constraints are respected:

– Input and output neurons are common to all subnets.
– Hidden neurons are randomly partitioned via uniform assignment to one of n possible compute nodes.
– The complete Neural Network (NN)'s weights are partitioned based on the number of activated neurons in each subnet.
– No collisions occur because the parameter partition is disjoint.

³ https://github.com/BinhangYuan/IST_Release.

Fig. 1. The architecture of a distributed asynchronous training based on the partitioning-based framework.

The training process of the asynchronous IST training is as follows: (1) The coordinator distributes the sub-networks to the workers at the beginning of the training process. Contrary to the original IST, no new group of subnets is constructed, to avoid synchronization. Note that during the training process the coordinator trains a subnet, not the full neural network. (2) Each cluster node (coordinator and workers) computes the loss of its local model based on its local data (i.e., forward pass). (3) Each cluster node computes the gradients based on the loss (i.e., backward pass). (4) Each worker sends its parameters to the coordinator; in our asynchronous version, this is done using PyTorch's isend method, which sends a tensor asynchronously. (5) Given that the partition is disjoint, the coordinator copies the parameters into the full neural network without collisions. The asynchronous version uses PyTorch's irecv method, which receives a tensor asynchronously.

The architecture and training procedure considered in this research (Fig. 1) are comparable to frameworks such as the parameter server with centralized or federated learning techniques [20]. The local models, however, are subnets of the full model and not replicas. Note that Fig. 1 illustrates a general partitioning-based framework described by the proposed mathematical model and not necessarily the particular IST model. To represent the async IST implemented by this paper, one could ignore the transmission of the delayed parameters from the coordinator to the workers after the first partition, given that each worker has an independent subnet that does not require the parameters of other workers. One of the challenges of asynchronous training is that asynchronous communication is rendered meaningless if one continues to allow communication between workers: this issue is solved by avoiding parameter exchange between subnets and avoiding synchronization during local updates or the coordinator's updates.
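A condensed, hypothetical sketch of the asynchronous exchange of steps (4)–(5) using torch.distributed's non-blocking isend/irecv; process-group initialization, the model definitions, and the index bookkeeping (buffers, param_slices) are simplified assumptions rather than the authors' code.

```python
import torch
import torch.distributed as dist

def worker_push(subnet, coordinator_rank=0):
    """Step (4): send the flattened subnet parameters without blocking local training."""
    flat = torch.cat([p.detach().flatten() for p in subnet.parameters()])
    return dist.isend(flat, dst=coordinator_rank)            # non-blocking; returns a Work handle

def coordinator_pull(buffers, param_slices, worker_ranks):
    """Step (5): receive each worker's flattened parameters asynchronously and copy
    them into the full model. The partition is disjoint, so copies cannot collide.
    `buffers[r]` is a pre-allocated tensor sized like worker r's subnet, and
    `param_slices[r]` maps (start, end) ranges of that buffer to full-model tensors."""
    reqs = {r: dist.irecv(buffers[r], src=r) for r in worker_ranks}
    for r, req in reqs.items():
        req.wait()                                            # or poll req.is_completed()
        with torch.no_grad():
            for start, end, target in param_slices[r]:
                target.copy_(buffers[r][start:end].view_as(target))
```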

5.3 Experiments

The Google Speech [21] and MaFaulDa⁴ [22] datasets were considered to demonstrate the advantage of asynchronous distributed training of neural networks over synchronous training. The Google Speech dataset is an audio dataset of spoken words that can be used to train and evaluate keyword detection systems. The objective is to classify 35 labeled keywords extracted from audio waveforms. The training set contains roughly 76,000 waveforms, and the testing set has around 19,000 waveforms. The MaFaulDa dataset is a publicly available dataset of vibration signals acquired from four different industrial sensors under normal and faulty conditions. This database is composed of 1951 multivariate time series and comprises six different states.

Similarly to [5], a three-layer MLP (multilayer perceptron) has been developed; the number of neurons in each layer is a parameter. Using subnets as in [5], the MLP is distributed across all processes and GPUs. To simplify, links between neurons are divided among all the nodes, and each link is controlled exclusively by a single node. Consequently, the coordinator node (which also participates in the training computation) and the worker nodes share link weights during parallel training. In contrast to [5], where all tasks are synchronized, an implementation with asynchronous tasks and iterations has been built; in this version, there is no synchronization following the initialization. The coordinator node has a heavier workload compared to other nodes in the implementation of the IST code: it is responsible for coordinating workers, performing tests with the test dataset, and sending and receiving the link weights of other nodes' neurons. Additionally, the coordinator is also responsible for a subnet (local gradient descent) in this implementation, leading to a higher amount of computation.

For the experiments, the French supercomputer Jean Zay of GENCI was used. This supercomputer contains more than 2,000 GPUs (most of them NVIDIA V100). The experiments compare the performance of three different versions of a neural network training algorithm: the old synchronous version (the training version implemented by [5]), the synchronous version with partitioning at the first iteration, and the asynchronous version with no synchronization between the workers. The main point that the experiments aim to prove is that the asynchronous method provides better testing accuracy in a shorter time and lower execution times compared to the synchronous methods.

⁴ Dataset: https://www02.smt.ufrj.br/~offshore/mfs/page_01.html, last visited: 20-10-2022.

Fig. 2. Experiments with the Google Speech dataset. (a) Testing curve for the synchronous and the asynchronous cases (limited to 100 epochs). (b) Testing curve for the synchronous case (limited to 1,000 epochs).

Table 1. Execution times for the Google Speech dataset.

| Algorithm                   | Number of epochs | Execution time (s) | Accuracy | Asynchronous acceleration |
|-----------------------------|------------------|--------------------|----------|---------------------------|
| Synchronous old version [5] | 100              | 531                | 0.56     | 1.08                      |
| Synchronous version         | 100              | 550                | 0.67     | 1.11                      |
| Synchronous version         | 1,000            | 5,501              | 0.80     | 11.18                     |
| Asynchronous version        | 100              | 492                | 0.84     | 1                         |

The training of the three models was executed ten times with 64 GPUs. A supercomputer with a specialized architecture was used, resulting in minimal variations in execution times. Testing accuracy was averaged and plotted to assess the convergence of the models. Results from Figs. 2a and 3a indicate that the asynchronous method outperforms the synchronous mode because worker nodes complete more iterations than the coordinator node (workers do not wait for the coordinator to synchronize at the end of each epoch), while in synchronous mode the number of iterations completed is equal to the number of epochs. The number of epochs was fixed at 100 for the Google Speech dataset and 200 for MaFaulDa based on several experiments, as those values better show the difference in performance between synchronous and asynchronous training. Additionally, the average testing accuracy for 1,000 epochs was computed only for the synchronous model with one partitioning at the beginning, in order to compare the performance of the asynchronous model with a smaller number of epochs against the synchronous model's accuracy after 1,000 epochs. Figures 2b and 3b demonstrate that the synchronous case has lower testing accuracy even with 1,000 epochs compared to the asynchronous case.

Table 1 shows the average execution times, rounded to the nearest second, of the three versions; the maximum accuracy is shown. The last column gives the asynchronous acceleration compared to the other versions. The asynchronous training was faster than the synchronous approaches on both the Google Speech and MaFaulDa datasets. The code used for the experiments is available⁵.

⁵ https://github.com/rcouturier/async_mlp.

Fig. 3. Experiments with the Mafaulda dataset. (a) Testing curve for the synchronous and the asynchronous cases (limited to 200 epochs). (b) Testing curve for the synchronous case (limited to 1,000 epochs).

6 Discussion and Perspectives

This paper aims to address numerical convergence in deep learning frameworks for large problems. To achieve this, the neural network is partitioned into subnets and back-propagation algorithms are applied to these subnets instead of the original network. The paper also raises questions about reconstituting the global solution from the partial solutions of the subnets and about the differences between synchronous and asynchronous algorithms during convergence.

The paper presents a mathematical model that describes the behavior of partitioning-based network methods in synchronous and asynchronous modes, and proves the convergence of such methods. To validate this approach, the specific case of Independent Subnet Training is considered, and the asynchronous mode is implemented and compared to the standard synchronous mode. The study concludes that the asynchronous mode has better performance in terms of execution time and accuracy. The presented mathematical model is general and describes various scenarios, including federated learning with workers possessing the same network architecture. Future work will investigate different concepts and distributed architectures to broaden the scope of the study.

7 Conclusion

This paper proposes a general model to investigate the convergence of distributed asynchronous training of deep neural networks. The model addresses the question of whether the partial solutions of sub-networks can be used to reconstitute the global solution. Two experiments based on the Independent Subnet Training model on different datasets are provided, demonstrating that the asynchronous aggregation of sub-networks is interesting for reducing execution time and increasing accuracy, even on a supercomputer. The study expects an even better performance from asynchronous aggregation algorithms in contexts with weak computation-time to communication-time ratios.

Acknowledgment. This work was partially supported by the EIPHI Graduate School (contract "ANR-17-EURE-0002"). This work was granted access to the AI resources of CINES under the allocation AD010613582 made by GENCI and also from the Mesocentre of Franche-Comté.

References

1. Ben-Nun, T., Hoefler, T.: Demystifying parallel and distributed deep learning: an in-depth concurrency analysis. ACM Comput. Surv. (CSUR) 52(4), 1–43 (2019)
2. Verbraeken, J., Wolting, M., Katzy, J., Kloppenburg, J., Verbelen, T., Rellermeyer, J.S.: A survey on distributed machine learning. ACM Comput. Surv. (CSUR) 53(2), 1–33 (2020)
3. Li, S., et al.: PyTorch distributed: experiences on accelerating data parallel training. arXiv preprint arXiv:2006.15704 (2020)
4. Podareanu, D., Codreanu, V., Sandra Aigner, T., van Leeuwen, G.C., Weinberg, V.: Best practice guide - deep learning. Partnership for Advanced Computing in Europe (PRACE), Technical Report, vol. 2 (2019)
5. Yuan, B., Wolfe, C.R., Dun, C., Tang, Y., Kyrillidis, A., Jermaine, C.: Distributed learning of fully connected neural networks using independent subnet training. Proc. VLDB Endow. 15(8), 1581–1590 (2022)
6. Bahi, J., Miellou, J.-C., Rhofir, K.: Asynchronous multisplitting methods for nonlinear fixed point problems. Numer. Algor. 15(3), 315–345 (1997)
7. El Tarazi, M.N.: Some convergence results for asynchronous algorithms. Numer. Math. 39(3), 325–340 (1982)
8. Baudet, G.: Asynchronous iterative methods for multiprocessors. J. Assoc. Comput. Mach. 25, 226–244 (1978)
9. Bahi, J.: Asynchronous iterative algorithms for nonexpansive linear systems. J. Parallel Distrib. Comput. 60(1), 92–112 (2000)
10. Lian, X., Zhang, W., Zhang, C., Liu, J.: Asynchronous decentralized parallel stochastic gradient descent. In: International Conference on Machine Learning, pp. 3043–3052. PMLR (2018)
11. Dean, J., et al.: Large scale distributed deep networks. Adv. Neural Inf. Process. Syst. 25, 1–9 (2012)
12. Gupta, S., Zhang, W., Wang, F.: Model accuracy and runtime tradeoff in distributed deep learning: a systematic study. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), pp. 171–180. IEEE (2016)
13. Zhang, W., Gupta, S., Lian, X., Liu, J.: Staleness-aware async-SGD for distributed deep learning. arXiv preprint arXiv:1511.05950 (2015)
14. Ho, Q., et al.: More effective distributed ML via a stale synchronous parallel parameter server. Adv. Neural Inf. Process. Syst. 26, 1–9 (2013)
15. Recht, B., Re, C., Wright, S., Niu, F.: Hogwild!: a lock-free approach to parallelizing stochastic gradient descent. Adv. Neural Inf. Process. Syst. 24, 1–9 (2011)
16. Noel, C., Osindero, S.: Dogwild! - distributed Hogwild for CPU & GPU. In: NIPS Workshop on Distributed Machine Learning and Matrix Computations, pp. 693–701 (2014)
17. Bertsekas, D.P., Tsitsiklis, J.N.: Parallel and Distributed Computation: Numerical Methods (2003)
18. Spiteri, P.: Parallel asynchronous algorithms: a survey. Adv. Eng. Softw. 149, 102896 (2020)
19. Frommer, A., Szyld, D.B.: On asynchronous iterations. J. Comput. Appl. Math. 123(1–2), 201–216 (2000)
20. Elbir, A.M., Coleri, S., Mishra, K.V.: Hybrid federated and centralized learning. In: 29th European Signal Processing Conference (EUSIPCO), pp. 1541–1545. IEEE (2021)
21. Warden, P.: Speech commands: a dataset for limited-vocabulary speech recognition. arXiv preprint arXiv:1804.03209 (2018)
22. Ribeiro, F.M., Marins, M.A., Netto, S.L., da Silva, E.A.: Rotating machinery fault diagnosis using similarity-based models. In: XXXV Simpósio Brasileiro de Telecomunicações e Processamento de Sinais - SBrT2017 (2017)

Chinese Medical Intent Recognition Based on Multi-feature Fusion

Xiliang Zhang1,2,3, Tong Zhang1,2,3, and Rong Yan1,2,3(B)

1 College of Computer Science, Inner Mongolia University, Hohhot 010021, China
2 Inner Mongolia Key Laboratory of Mongolian Information Processing Technology, Hohhot 010021, China
3 National and Local Joint Engineering Research Center of Intelligent Information Processing Technology for Mongolian, Hohhot 010021, China
[email protected]

Abstract. The increasing popularity of online query services has heightened the need for suitable methods to accurately understand the true query intention. Currently, most medical query intention recognition methods are based on deep learning. Because of the inadequate medical-domain corpus in the pre-training phase, these methods may fail to accurately extract text features built on medical domain knowledge. Moreover, they rely on a single technique to extract text information and cannot fully capture the query intention. To mitigate these issues, in this paper we propose a novel intent recognition model called EDCGA (ERNIE-Health + D-CNN + Bi-GRU + Attention). EDCGA obtains text representations from the word vectors of the pre-trained ERNIE-Health model and employs D-CNN to expand the receptive field for extracting local information features. Meanwhile, it combines Bi-GRU and an attention mechanism to extract global information and enhance the understanding of the intent. Extensive experimental results on multiple datasets demonstrate that our proposed model exhibits superior recognition performance compared to the baselines.

1

· Intent recognition · Feature fusion

Introduction

Accurately extracting useful information from a large number of Chinese medical texts is a valuable and challenging task, and it is of great significance to effectively utilize medical information and reduce medical costs. Currently, medical search engines primarily rely on keyword-based full-text searches, which only match one or multiple keywords and fail to capture the search intent of users accurately. It may affect the quality of search results [17]. Intent recognition in information retrieval [11] involves analyzing the search texts created by users to identify their search intent. In the medical field, intent recognition models need to delve deeper into understanding the specific requirements of users [17]. By incorporating intent information from search texts, medical search engines c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1962, pp. 504–515, 2024. https://doi.org/10.1007/978-981-99-8132-8_38


can significantly improve the accuracy of search results and enhance the overall search experience for users. In recent years, pre-trained models have gained increasing attention in NLP (Natural Language Processing) tasks in the medical domain. Zhang [23] utilizes a large volume of specialized medical corpus data to train MC-BERT, which surpasses the performance of BERT [6] in medical information retrieval and intent recognition tasks. Wang et al. [15] employ Baidu's ERNIE [13] knowledge-enhanced pre-trained model and medical knowledge enhancement techniques to train the ERNIE-Health pre-trained model. This model successfully captures professional medical knowledge and performs well in many NLP tasks in the medical field. However, there are still many challenges and issues in the intent recognition field. Firstly, the provided Chinese search text is often short and lacks contextual information, which makes it hard to fully express the intention of users, so the quality of the extracted semantic features is compromised [19]. Secondly, the search behaviors of users often do not adhere to strict formal language grammar structures, which leads to issues such as grammar errors, typos, and non-standard language usage in search queries [14,18]. This makes it difficult for search engines to understand and accurately respond to the search needs of users. In this paper, we propose a novel intent recognition model called EDCGA (ERNIE-Health+D-CNN+Bi-GRU+Attention). It employs D-CNN (Dilated Convolutional Neural Network) [20] with an enlarged receptive field to extract local feature information. Additionally, it combines Bi-GRU (Bidirectional Gated Recurrent Unit) and an attention mechanism to capture global feature information. The objective is to improve the quality of the text feature representation and achieve better recognition performance. Experimental results on multiple publicly available datasets demonstrate that the proposed model outperforms existing intent recognition models in the Chinese medical domain.

2 Related Work

2.1 D-CNN

D-CNN [20] is a specialized type of convolutional neural network. It utilizes dilated convolution kernels to extract features. The key characteristic of dilated convolution is the introduction of holes or gaps within the standard convolution mapping, to expand the receptive field and capture multi-scale feature information. Currently, dilated convolutional neural networks have been successfully applied in many fields, including speech recognition and generation [3], image denoising [16], and image semantic segmentation [10]. Compared to classical convolutional approaches, they have demonstrated excellent efficiency. In the field of NLP, dilated convolutional kernels are primarily applied in the context of text convolutional neural networks, such as TextCNN [7], to expand the receptive field of the model. Strubell et al. [12] first apply dilated convolution in NLP for NER (Named Entity Recognition) tasks. The IDCNN-CRF model effectively


identifies named entities by stacking dilated convolutional layers with shared parameters.
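To make the receptive-field effect of dilation concrete, the short PyTorch snippet below compares a standard and a dilated 1-D convolution over the same toy tensor. It is an illustrative sketch only (the tensor shape and channel counts are arbitrary), not code from the paper.

```python
import torch
import torch.nn as nn

# A toy batch of 8 "sentences", each with 50 token positions and 16 channels.
x = torch.randn(8, 16, 50)

standard = nn.Conv1d(in_channels=16, out_channels=16, kernel_size=3, padding=1)
dilated = nn.Conv1d(in_channels=16, out_channels=16, kernel_size=3, padding=2, dilation=2)

# Both keep the sequence length, but the dilated kernel spans 5 positions
# (receptive field = dilation * (kernel_size - 1) + 1) instead of 3.
print(standard(x).shape)  # torch.Size([8, 16, 50])
print(dilated(x).shape)   # torch.Size([8, 16, 50])
print("dilated receptive field:", 2 * (3 - 1) + 1)
```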

2.2 Bi-GRU

GRU [2] is a recurrent neural network unit used for sequence prediction. It is a simplified variant of LSTM [5]. Compared to LSTM, GRU has fewer parameters and a faster training speed, and shows comparable performance on NER and text classification tasks. Bi-GRU is a bidirectional recurrent neural network model. It is an enhanced version of the traditional GRU, in which the single GRU is replaced with a bidirectional one. It allows for capturing information from both the preceding and succeeding directions in a sequence, which is beneficial for better extraction of contextual semantic features.

2.3 Attention Mechanism

Attention mechanism is a commonly used technique in deep learning models. It allows the model to automatically focus on important parts, rather than considering the entire input as a whole. In the field of NLP, attention mechanism is often applied in encoder-decoder models and feature extraction modules. It computes a weight coefficient to determine the importance of each input word, thus assigning different levels of attention. Our proposed method focuses on improving the effectiveness of the model in capturing essential information from sentences while reducing sensitivity to noisy information.

3 Our Method

In this paper, we propose a novel intent recognition model called EDCGA (ERNIE-Health+D-CNN+Bi-GRU+Attention). As shown in Fig. 1, it consists of four main components: the text encoding layer, the feature extraction layer, the feature fusion layer, and the intent recognition layer. Specifically, (1) the text encoding layer utilizes the ERNIE-Health pre-trained model to map each word in the sentence to a continuous low-dimensional vector space; (2) the feature extraction layer employs both Bi-GRU with an attention mechanism and the dilated convolutional neural network to extract global and local feature information, respectively; (3) the feature fusion layer combines the global and local feature information through vector concatenation; (4) the intent recognition layer classifies the concatenated feature vectors using a fully connected layer, and the recognition results are obtained through softmax activation.

3.1 The Text Encoding Layer

The preprocessed text, denoted as x = {x_1, x_2, ..., x_n}, is input into ERNIE-Health for text encoding. Firstly, the masking mechanism of ERNIE-Health is applied to obtain word embeddings. Then, a Transformer bidirectional encoder is used for text encoding. The Transformer bidirectional encoder has the ability to capture semantic information and global relationships within the text. In the end, ERNIE-Health generates a vector representation H_E = {h_1, h_2, ..., h_n} that represents the semantic information of the entire text.

Fig. 1. The structure of EDCGA

3.2 The Feature Extraction Layer

The encoded word vectors H_E are separately fed into the local feature information module and the global feature information module to extract the corresponding feature vectors H_D and H_{GA}. The Bi-GRU model can effectively process sequential information in the text. The forward GRU captures preceding contextual information, and the backward GRU captures succeeding contextual information. The outputs of the forward and backward GRUs are concatenated to form the output H_G of Bi-GRU. The calculation process is illustrated by Eq. 1 to Eq. 5:

\overrightarrow{q_t} = \mathrm{GRU}(h_t, \overrightarrow{q_{t-1}}),   (1)
\overleftarrow{q_t} = \mathrm{GRU}(h_t, \overleftarrow{q_{t-1}}),   (2)
h_t = [\overrightarrow{q_t}, \overleftarrow{q_t}],   (3)
\mathrm{BiGRU} = [h_1, h_2, \dots, h_n],   (4)
H_G = \mathrm{BiGRU}(H_E),   (5)

where w_t and v_t respectively represent the weights of the forward hidden state \overrightarrow{q_t} and the backward hidden state \overleftarrow{q_t} at time t, and b_t represents the bias of the hidden state at time t. The output H_G of Bi-GRU is then processed by the attention mechanism, which assigns attention scores to each word in the input text. By assigning different weights to the corresponding word vectors, the entire model can focus on the key parts of the feature and reduce the impact of non-keywords on the text. We consider the resulting output H_{GA} as the global feature information of the sentence. The calculation process is illustrated by Eq. 6 to Eq. 9:

u_t = \tanh(W_t h_t),   (6)
\alpha_t = \frac{\exp(u_t)}{\sum_{k=1}^{n} \exp(u_k)},   (7)
\mathrm{Attention} = [\alpha_1 h_1, \alpha_2 h_2, \alpha_3 h_3, \dots, \alpha_n h_n],   (8)
H_{GA} = \mathrm{Attention}(H_G),   (9)

where W_t represents the weight matrix, u_t represents the hidden layer calculated from h_t through tanh, h_t represents the feature vector output by Bi-GRU at time t, and \alpha_t represents the attention weight of h_t. The combination of Bi-GRU and the attention mechanism in the global feature information extraction module enables the capture of long-distance contextual dependencies while preserving semantic coherence. Figure 2 illustrates the structure of the global feature information extraction module.

D-CNN injects holes into the standard convolutional mapping to expand the receptive field and capture multi-scale contextual information. Considering the characteristics of Chinese medical retrieval information, such as short length, few characters, and sparse features, in this work we utilize D-CNN to extract local feature information. The calculation process is illustrated by Eq. 10 to Eq. 13. Figure 3 shows the structure of the local feature information extraction module.

p_i = \mathrm{Conv}(H_E),   i = 3 \times l, 4 \times l, 5 \times l,   (10)
p_i = \mathrm{ReLU}(p_i),   i = 3 \times l, 4 \times l, 5 \times l,   (11)
p_i = \mathrm{Maxpool}(p_i),   i = 3 \times l, 4 \times l, 5 \times l,   (12)
H_D = [p_3, p_4, p_5],   (13)

where Conv() and Maxpool() represent the convolution operation and pooling operation respectively, i represents the size of the convolution kernel, and l represents the length of the sentence.
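The following PyTorch sketch mirrors Eqs. 1-13: a Bi-GRU followed by additive attention for the global branch, and dilated convolutions with max pooling for the local branch. Dimensions follow the settings reported later in Table 2 (embedding size 768, 256 filters/hidden units, dilation 2), but this is an illustrative re-implementation under stated assumptions, not the authors' released code; in particular, summing the attention-weighted vectors in Eq. 8-9 into a single sentence vector is our interpretation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GlobalFeatureModule(nn.Module):
    """Bi-GRU + additive attention (Eqs. 1-9)."""
    def __init__(self, embed_dim=768, hidden=256):
        super().__init__()
        self.bigru = nn.GRU(embed_dim, hidden, batch_first=True, bidirectional=True)
        self.w = nn.Linear(2 * hidden, 2 * hidden)       # W_t in Eq. 6
        self.u = nn.Linear(2 * hidden, 1, bias=False)    # scores the hidden representation

    def forward(self, h_e):                  # h_e: (batch, seq_len, embed_dim)
        h_g, _ = self.bigru(h_e)             # Eqs. 1-5 -> (batch, seq_len, 2*hidden)
        u = torch.tanh(self.w(h_g))          # Eq. 6
        alpha = torch.softmax(self.u(u), dim=1)          # Eq. 7 -> (batch, seq_len, 1)
        return (alpha * h_g).sum(dim=1)      # weighted combination of Eqs. 8-9 -> (batch, 2*hidden)

class LocalFeatureModule(nn.Module):
    """Dilated convolutions + max pooling (Eqs. 10-13)."""
    def __init__(self, embed_dim=768, num_filters=256, kernel_sizes=(3, 4, 5), dilation=2):
        super().__init__()
        self.convs = nn.ModuleList(
            [nn.Conv1d(embed_dim, num_filters, k, dilation=dilation) for k in kernel_sizes]
        )

    def forward(self, h_e):                  # h_e: (batch, seq_len, embed_dim)
        x = h_e.transpose(1, 2)              # Conv1d expects (batch, channels, seq_len)
        pooled = [F.relu(conv(x)).max(dim=2).values for conv in self.convs]  # Eqs. 10-12
        return torch.cat(pooled, dim=1)      # Eq. 13 -> (batch, 3 * num_filters)

h_e = torch.randn(4, 32, 768)                # stand-in for ERNIE-Health outputs
h_ga = GlobalFeatureModule()(h_e)            # global feature vector, shape (4, 512)
h_d = LocalFeatureModule()(h_e)              # local feature vector, shape (4, 768)
```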

Fig. 2. Global feature information extraction module

Fig. 3. Local feature information extraction module

3.3 The Feature Fusion Layer

D-CNN extracts text features through convolutional layers and retains only locally salient features by discarding irrelevant information through pooling layers. It can effectively capture local feature information but may not preserve semantic coherence. Bi-GRU with an attention mechanism can capture the contextual dependencies of the entire sentence and maintain semantic coherence. Since the two modules have complementary advantages, we concatenate and fuse the text features extracted by both modules to obtain a higher-quality feature representation, denoted as H. The calculation is shown in Eq. 14:

H = \mathrm{Concat}(H_D, H_{GA}),   (14)

where H_D represents the local feature information vector and H_{GA} represents the global feature information vector.

3.4 The Intent Recognition Layer

After sending the fused feature vector into the fully connected layer, we use the Softmax classifier to get the final output y; the calculation can be expressed as Eq. 15:

y = \max(\mathrm{Softmax}(wH + b)),   (15)

where w and b represent the weight matrix and the bias term, respectively.
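Continuing the sketch above, the fusion and recognition layers of Eqs. 14-15 reduce to a concatenation followed by a linear layer and softmax. This is again an illustrative sketch rather than the authors' implementation; the number of intent classes (11, as in KUAKE-QIC) and the dropout rate are placeholders, and in practice the cross-entropy loss would be applied to the logits during training.

```python
import torch
import torch.nn as nn

class FusionClassifier(nn.Module):
    """Concatenation (Eq. 14) + fully connected layer and softmax (Eq. 15)."""
    def __init__(self, local_dim=768, global_dim=512, num_classes=11, dropout=0.1):
        super().__init__()
        self.dropout = nn.Dropout(dropout)
        self.fc = nn.Linear(local_dim + global_dim, num_classes)

    def forward(self, h_d, h_ga):
        h = torch.cat([h_d, h_ga], dim=1)           # Eq. 14
        logits = self.fc(self.dropout(h))
        return torch.softmax(logits, dim=1)         # Eq. 15; argmax gives the predicted intent

probs = FusionClassifier()(torch.randn(4, 768), torch.randn(4, 512))
pred = probs.argmax(dim=1)                          # predicted intent labels for the batch
```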

4 Experiments

4.1 Datasets

In this paper, we use four publicly available datasets, KUAKE-QIC [22], CMID [1], CMedIC [23], and XUNFEI-QIC (https://challenge.xfyun.cn/topic/info?type=medical-search&option=stsj), to compare and verify our proposed method.


We first preprocess the raw corpus with an initial step that includes tokenization and the removal of stop words. We use the jieba tokenization tool (https://github.com/fxsjy/jieba) to tokenize the text data and use the Harbin Institute of Technology stop word list to remove meaningless words (a minimal code sketch of this step is given after Table 1). Table 1 gives the specific statistical information for each dataset.

Table 1. Information of the datasets.

dataset     | category | max length | avg length | train | val   | test
KUAKE-QIC   | 11       | 469        | 13         | 6,931 | 1,955 | 1,994
CMID        | 13       | 221        | 35         | 3,143 | 391   | 398
CMedIC      | 3        | 39         | 10         | 1,512 | 189   | 184
XUNFEI-QIC  | 10       | 153        | 21         | 1,699 | 212   | 213
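A minimal sketch of the preprocessing step described above (jieba tokenization plus stop-word filtering); the stop-word file name and the example query are placeholders, and the exact output depends on the stop-word list used.

```python
import jieba

def preprocess(text, stopword_path="hit_stopwords.txt"):
    """Tokenize a Chinese query with jieba and drop stop words."""
    with open(stopword_path, encoding="utf-8") as f:
        stopwords = {line.strip() for line in f if line.strip()}
    tokens = jieba.lcut(text)
    return [tok for tok in tokens if tok.strip() and tok not in stopwords]

print(preprocess("感冒了应该吃什么药"))  # e.g. ['感冒', '吃', '药'], depending on the stop-word list
```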

4.2 Experimental Settings

In this paper, we utilize the ERNIE-Health pre-trained model [15] to obtain word embedding representations. Our model is trained with a dropout rate of 0.1, and we optimize the cross-entropy loss using the Adam optimizer [8]. The experimental parameter settings are provided in Table 2.

Table 2. Experimental parameter settings.

parameter          | value
embedding size     | 768
filter sizes       | 3,4,5
num filters        | 256
dilation           | 2
Bi-GRU hidden size | 256
learning rate      | 0.00002
batch size         | 32

4.3 Baselines

To verify the effectiveness of the proposed model, this paper conducts comparative experiments with BERT [6], RoBERTa [9], ERNIE [13], MC-BERT [23], BERT-CNN [4], and BERT-LSTM [21].


4.4 Results

Table 3 shows the comparison results of our proposed method with other baselines. Here, we use the commonly used accuracy (acc) as the evaluation metric for intent recognition.

Table 3. Comparison results of different models on all datasets.

model     | KUAKE-QIC | CMID  | CMedIC | XUNFEI-QIC
BERT      | 84.40     | 79.65 | 95.65  | 56.81
RoBERTa   | 84.76     | 79.90 | 95.11  | 57.75
MC-BERT   | 84.81     | 80.40 | 96.20  | 53.05
ERNIE3.0  | 85.66     | 77.39 | 96.20  | 49.77
BERT-CNN  | 84.40     | 77.39 | 95.11  | 56.81
BERT-LSTM | 83.95     | 73.62 | 94.57  | 57.28
EDCGA     | 86.51     | 81.16 | 96.74  | 59.15

As shown in Table 3, our proposed EDCGA outperforms the baselines in terms of recognition performance and exhibits good stability on the four open-source datasets. In particular, on XUNFEI-QIC, it achieves a 1.4% higher accuracy than the best-performing RoBERTa. BERT-CNN and BERT-LSTM attempt to extract deep semantic features by employing feature extraction modules after text encoding. However, their performance falls short on specialized and feature-sparse medical intent recognition tasks. We attribute this limitation to the general-domain bias of BERT and the limited diversity of the feature extraction modules. Comparing the results of ERNIE and MC-BERT, it can also be observed that pre-training models with large-scale and domain-specific corpora can enhance recognition performance. However, due to their limited capability to extract multi-level semantic features, these methods rely heavily on specific datasets, leading to inconsistent performance and a lack of robustness. In contrast, the proposed EDCGA uses a feature fusion approach with word embeddings from the ERNIE-Health model trained on extensive medical-domain corpora. By incorporating multi-level text feature information, the EDCGA model achieves superior recognition performance.

4.5 Ablation Experiments

Table 4 presents the results of the ablation experiments on the four datasets, where -A, -D, and -G respectively indicate the removal of the attention mechanism, D-CNN, and Bi-GRU.

Table 4. Results of the ablation experiment.

model       | KUAKE-QIC | CMID  | CMedIC | XUNFEI-QIC
EDCGA-A-D-G | 86.21     | 76.38 | 96.20  | 57.75
EDCGA-G-A   | 86.06     | 78.89 | 96.20  | 50.23
EDCGA-A     | 86.21     | 80.40 | 96.20  | 55.87
EDCGA-G     | 86.32     | 80.38 | 95.65  | 56.18
EDCGA-D     | 86.26     | 79.15 | 95.65  | 56.34
EDCGA       | 86.51     | 81.16 | 96.74  | 59.15

As shown in Table 4, each component of the proposed EDCGA contributes to its overall performance to varying degrees. Specifically, when the local feature extraction module (D-CNN) is removed, the model experiences a decrease in recognition accuracy ranging from 0.25% to 2.81% across the datasets. This indicates that fusing this module can effectively extract local feature information from the text and contributes to the recognition performance of EDCGA. Similarly, when the attention mechanism is not used in the global feature extraction module, the model experiences a decrease in recognition accuracy ranging from 0.3% to 3.28% across the datasets. This suggests that incorporating the attention mechanism into the sentence-level features extracted by Bi-GRU can effectively highlight important parts of the sentences and capture deeper semantic features. In comparison to fine-tuning the standalone ERNIE-Health pre-trained model, using a single feature extraction module alone does not significantly enhance the recognition performance of the model. In fact, it may even weaken the performance of the pre-trained model when dealing with complex input texts. Furthermore, we conduct experiments on CMID, which contains a larger number of categories. For each intent category we compare the F1 scores of the model with the local feature extraction module removed, the model with the global feature extraction module removed, and the model using only the ERNIE-Health pre-trained model. The experimental results are illustrated in Fig. 4. This further validates the effectiveness of the proposed modules.

Fig. 4. The F1 score comparisons on CMID.

4.6 Selection of Parameters

To assess the impact of the number of hidden units in Bi-GRU and the number of convolutional kernels in D-CNN on the performance of the model, we conduct experiments on the CMID dataset. These experiments aim to determine the optimal hyperparameters, and the results are illustrated in Fig. 5. The horizontal axis represents the number of convolutional kernels in D-CNN and the number of hidden units in Bi-GRU; the vertical axis represents the accuracy under the different parameter settings.

Fig. 5. Performance comparisons with different parameters.

As shown in Fig. 5, it indicates that with other parameters unchanged, the model achieves the highest accuracy when the number of convolutional kernels in D-CNN and the number of hidden units in Bi-GRU are both set to 256. Specifically, when the quantities of convolutional kernels and hidden units are too small, the model fails to capture the rich semantic features in the text, resulting in poor recognition performance. Conversely, when the quantities of convolutional kernels and hidden units are too large, the model may extract excessive feature information, leading to overfitting during training and the inclusion of redundant noise features, thereby reducing the accuracy of the recognition results.

5 Conclusion

In this paper, we propose a novel recognition method called EDCGA for the task of intent recognition in the medical field. It utilizes word embeddings generated by the ERNIE-Health pre-trained model and integrates D-CNN, Bi-GRU, and an attention mechanism as the feature extraction layer. In performance comparisons with baseline models, EDCGA shows superior performance on four open-source medical intent recognition datasets. The effectiveness of each proposed component and of the selected parameters is validated through ablation experiments and parameter selection experiments. The model has promising application prospects in medical information retrieval systems and medical intelligent question-answering systems, and it contributes to the advancement of smart healthcare.

Acknowledgements. This research is jointly supported by the Natural Science Foundation of Inner Mongolia Autonomous Region (Grant No. 2023MS06023) and the Self-project Program of the Engineering Research Center of Ecological Big Data, Ministry of Education.

References 1. Chen, N., Su, X., Liu, T., Hao, Q., Wei, M.: A benchmark dataset and case study for chinese medical question intent classification. BMC Med. Inform. Decis. Mak. 20(3), 1–7 (2020) 2. Cho, K., van Merrienboer, B., Gulcehre, C., Bougares, F., Schwenk, H., Bengio, Y.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. In: Conference on Empirical Methods in Natural Language Processing, pp. 1724–1734 (2014) 3. Gabbasov, R., Paringer, R.: Influence of the receptive field size on accuracy and performance of a convolutional neural network. In: 2020 International Conference on Information Technology and Nanotechnology (ITNT), pp. 1–4. IEEE (2020) 4. He, C., Chen, S., Huang, S., Zhang, J., Song, X.: Using convolutional neural network with bert for intent determination. In: 2019 International Conference on Asian Language Processing (IALP), pp. 65–70 (2019) 5. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997) 6. Kenton, J.D.M.W.C., Toutanova, L.K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, pp. 4171–4186 (2019) 7. Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746–1751. Association for Computational Linguistics, Doha (2014) 8. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Proceedings of the International Conference on Learning Representations (2015) 9. Liu, Y., et al.: Roberta: a robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019) 10. Mehta, S., Rastegari, M., Caspi, A., Shapiro, L., Hajishirzi, H.: Espnet: efficient spatial pyramid of dilated convolutions for semantic segmentation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 552–568 (2018)


11. Park, K., Jee, H., Lee, T., Jung, S., Lim, H.: Automatic extraction of user’s search intention from web search logs. Multimedia Tools Appl. 61(1), 145–162 (2012) 12. Strubell, E., Verga, P., Belanger, D., McCallum, A.: Fast and accurate entity recognition with iterated dilated convolutions. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 2670–2680 (2017) 13. Sun, Y., et al.: Ernie 3.0: large-scale knowledge enhanced pre-training for language understanding and generation. arXiv preprint arXiv:2107.02137 (2021) 14. Wang, J., Wang, Z., Zhang, D., Yan, J.: Combining knowledge with deep convolutional neural networks for short text classification. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 2915–2921 (2017) 15. Wang, Q., et al.: Building Chinese biomedical language models via multi-level text discrimination. arXiv preprint arXiv:2110.07244 (2021) 16. Wang, Y., Wang, G., Chen, C., Pan, Z.: Multi-scale dilated convolution of convolutional neural network for image denoising. Multimedia Tools Appl. 78, 19945– 19960 (2019) 17. Wang, Y., Wang, S., Li, Y., Dou, D.: Recognizing medical search query intent by few-shot learning. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 502–512 (2022) 18. Wang, Y., Wang, S., Yao, Q., Dou, D.: Hierarchical heterogeneous graph representation learning for short text classification. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 3091–3101 (2021) 19. Wang, Y., Yao, Q., Kwok, J.T., Ni, L.M.: Generalizing from a few examples: a survey on few-shot learning. ACM Comput. Surv. 53(3), 1–34 (2020) 20. Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122 (2015) 21. Yu, Z., Hu, K.: Study on medical information classification of BERT-ATT-BILSTM model. IEEE J. Comput. Age 3(6), 1–4 (2020) 22. Zhang, N., et al.: CBLUE: a Chinese biomedical language understanding evaluation benchmark. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 7888–7915 (2022) 23. Zhang, N., Jia, Q., Yin, K., Dong, L., Gao, F., Hua, N.: Conceptualized representation learning for Chinese biomedical text mining. arXiv preprint arXiv:2008.10813 (2020)

Multi-mobile Object Motion Coordination with Reinforcement Learning

Shanhua Yuan, Sheng Han, Xiwen Jiang, Youfang Lin, and Kai Lv(B)

Beijing Key Laboratory of Traffic Data Analysis and Mining, School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
[email protected]

Abstract. Multi-mobile Object Motion Coordination (MOMC) refers to the task of controlling multiple moving objects so that each travels from its starting station to its terminal station. In this task, the objects are expected to complete their travel in a shorter amount of time with no collisions. Multi-mobile object motion coordination plays an important role in various application scenarios such as production, processing, warehousing, and logistics. The problem can be modeled as a Markov decision process and solved by reinforcement learning. The current main research methods suffer from long training time and poor policy stability, both of which decrease the reliability of practical applications. To address these problems, we introduce a State with Time (ST) model and a Dynamically Updated Reward (DUR) model. Experimental results show that the ST model enhances the stability of learned policies, that the DUR model improves training efficiency and the stability of the strategy, and that the motion coordination solutions obtained by our algorithm are superior to those of similar algorithms.

Keywords: Reinforcement learning · Motion coordination · Reward model

1 Introduction

The multi-mobile object motion planning problem is a fundamental research problem in multi-mobile object systems [1]. It refers to a specific scenario where multiple mobile objects need to move from their respective starting stations to their terminal stations. During task execution, it is necessary to ensure that there are no collisions between different mobile objects while ensuring the quality and time of the entire task. This problem can be divided into path planning [7,14,18] and motion coordination [4,12,19]. An important constraint of path planning is that there should be no collisions between mobile objects during the motion, while ensuring the quality and feasibility of the path.


Deep reinforcement learning [6,17] is a branch of artificial intelligence that learns and adjusts strategies by interacting with the environment. The core concept of reinforcement learning is to adjust the decision-making of the agent based on rewards and punishments, and to continuously optimize strategies through interaction with the environment. This makes reinforcement learning widely used in fields such as automation control, robotics, and intelligent games, as it can learn through trial and error, continuously optimizing strategies and improving the performance of the agent. Using reinforcement learning to solve the motion coordination problem for multiple mobile objects has also become a hot research topic. Hao et al. [5] modeled the motion coordination problem as a Markov decision process and designed a state-action space and a reward function model in deep reinforcement learning to solve the problem. Zhao et al. [20] proposed a new reward function model based on this work and improved the performance of the model on some tasks. However, in current research, the existing reward functions used to model the motion coordination problem for multiple mobile objects can lead to unstable training of the agent and an inability to converge to stable solutions. In this paper, we propose a State with Time (ST) model and a Dynamically Updated Reward (DUR) model to address these issues and improve the efficiency and stability of motion coordination tasks for multiple mobile objects.

2 Related Work

2.1 Traditional Solution Methods for Motion Coordination

Traditional methods for solving motion coordination for multiple mobile objects include the Longest Job First (LJF) algorithm and the mixed-integer programming coordination method [13]. Although the LJF algorithm is simple, it cannot guarantee the time and quality of motion. In the mixed-integer programming method, it is necessary to calculate whether two different mobile objects will collide based on the intersection of their displacement points on their respective paths. Based on the calculated path collision constraint relationships, each path will be divided into several path segments, as shown in Fig. 1.

Fig. 1. The constraint relationships between different paths.

However, as the number of mobile objects increases, the computational complexity of optimizing the inequality objective in the mixed-integer programming method becomes very large and the problem becomes difficult to solve.

2.2 Double Deep Q-Network

The DQN algorithm [10] has achieved good results in many deep reinforcement learning tasks, but the Q value estimation process of the DQN algorithm has been shown to sometimes result in overestimation of the Q value. In order to solve this problem, the Double Deep Q Network (DDQN) algorithm [16] improves the updating process of the Q network. Specifically, when choosing the next action with the maximum value, the DDQN algorithm finds the action with the maximum output value in the Q network, and then takes the output value of the target Q network corresponding to that action. The calculation of the target Q value in the DDQN algorithm is:

y_j = r_j + \gamma \hat{Q}\big(s_{j+1}, \mathrm{argmax}_{a'} Q(s_{j+1}, a', \theta); \theta^{-}\big).   (1)
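A schematic PyTorch sketch of the Double DQN target in Eq. 1: the online network selects the next action and the target network evaluates it. The networks here are stand-in linear layers and the discount factor and shapes are placeholders, so this is only an illustration of the update rule, not the paper's implementation.

```python
import torch
import torch.nn as nn

gamma = 0.99
q_net = nn.Linear(8, 4)      # stand-in online Q network: 8-dim state, 4 actions
q_target = nn.Linear(8, 4)   # stand-in target Q network (same architecture)

def ddqn_target(reward, next_state, done):
    """y_j = r_j + gamma * Q_target(s_{j+1}, argmax_a' Q(s_{j+1}, a'))  (Eq. 1)."""
    with torch.no_grad():
        best_action = q_net(next_state).argmax(dim=1, keepdim=True)           # chosen by the online net
        next_value = q_target(next_state).gather(1, best_action).squeeze(1)   # evaluated by the target net
    return reward + gamma * next_value * (1.0 - done)

y = ddqn_target(torch.tensor([1.0, 0.0]), torch.randn(2, 8), torch.tensor([0.0, 1.0]))
```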

2.3 Reinforcement Learning Methods for Solving Motion Coordination Problems

In the problem of motion coordination for multiple mobile objects, Hao et al. [5] first proposed using reinforcement learning to solve this problem and developed the Multi-Loss Double Deep Q Network (MLDDQN) algorithm and the Time-Dependent Rewards (TDR) model. Existing works in this area have used a centralized training framework [2,3,11], which requires a central controller to control all mobile objects. The central controller collects real-time state information from all mobile objects and makes unified scheduling decisions based on this information. Zhao et al. [20] proposed a motion coordination algorithm for multiple mobile objects based on this architecture, called the Partial Tolerant Collision Double Deep Q Network (PTCDDQN) algorithm. It is an improvement on the Double Deep Q Network (DDQN) algorithm [16] and uses a dynamic priority strategy (DPS) reward function model, which performs well in some coordination tasks.

2.4 Path Chessboard Graph

In this paper, the Path Chessboard Graph [8] is used as one of the basic environments for collision detection and model training, aiming at improving the efficiency of collision detection and speed up the training process of the model. The Path Chessboard Graph is an environment built upon collision detection and path planning algorithms. The path planning algorithm used in this paper is the third-order Bezier curve algorithm [9], and the generated path graph with intersection relationships is called a path network. The collision detection algorithm used is the oriented bounding box algorithm [15]. The Path Chessboard Graph algorithm can use the computer to calculate collision and non-collision intervals for each mobile object Ai , and finally generate the Path Chessboard Graph.
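For reference, a third-order (cubic) Bezier path like the ones used to build the path network can be sampled as below; the control points are arbitrary placeholders, and collision checking with oriented bounding boxes is not shown.

```python
import numpy as np

def cubic_bezier(p0, p1, p2, p3, n_points=50):
    """Sample a third-order Bezier curve defined by four 2-D control points."""
    t = np.linspace(0.0, 1.0, n_points)[:, None]
    return ((1 - t) ** 3 * p0 + 3 * (1 - t) ** 2 * t * p1
            + 3 * (1 - t) * t ** 2 * p2 + t ** 3 * p3)

path = cubic_bezier(np.array([0.0, 0.0]), np.array([1.0, 2.0]),
                    np.array([3.0, 2.0]), np.array([4.0, 0.0]))
print(path.shape)   # (50, 2): waypoints of one mobile object's path
```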

3 State with Time and Reward Model Algorithms for Motion Coordination

3.1 Centralized System

In a centralized system, a central agent commands the actions of all mobile objects. At the same time, the centralized architecture treats all mobile objects as a whole and abstracts them as a composite object, whose state information includes the position information o_i of each mobile object, forming a joint state s. The joint action a emitted by the composite object directs each moving object to perform an action. After all moving objects perform their actions, the environment transitions to the next global state s' and returns a reward value r to the agent. The framework of the centralized model is shown in Fig. 2.

Fig. 2. The framework of the centralized system.

3.2 State Setting

In terms of state setting, each moving object has its own position o_i, and the position information of each moving object is handed over to the central agent network to form a global joint state s, that is, s = ⟨o_1, o_2, ..., o_n⟩. The state o_i is composed of a quadruple ⟨ID, P, G, R⟩. The meaning of each variable in the quadruple is introduced below; a small code sketch follows the list.

– ID: the index of the node of the Path Chessboard Graph that the mobile object is currently on, i.e., which segment of the path it is in.
– P: the distance traveled by the mobile object on the node with index ID in the Path Chessboard Graph, initialized to 0.
– G: a flag indicating whether the mobile object has reached its destination position. If it has, the flag is set to True; otherwise, it is set to False.
– R: the remaining distance between the mobile object and its destination position.
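A small sketch of how the per-object state ⟨ID, P, G, R⟩ and the joint state could be represented in code. The field names follow the description above, while the flattening into a single feature vector for the central agent is an assumption for illustration.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class ObjectState:
    node_id: int      # ID: index of the path-chessboard node the object is on
    progress: float   # P: distance travelled on that node, initialised to 0
    done: bool        # G: True once the object has reached its destination
    remaining: float  # R: remaining distance to the destination

def joint_state(states: List[ObjectState]) -> List[float]:
    """Flatten o_1, ..., o_n into the joint state s fed to the central agent."""
    flat = []
    for o in states:
        flat.extend([float(o.node_id), o.progress, float(o.done), o.remaining])
    return flat

s = joint_state([ObjectState(2, 0.0, False, 5.4), ObjectState(7, 1.3, False, 2.0)])
```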

3.3 Action Setting

In terms of action setting, similar to the state setting, it is necessary to combine the actions of multiple objects into a joint action a. The joint action contains the action information a_i of each mobile object. That is, the joint action space can be represented as a_1 × a_2 × ... × a_n, where a_i is the executable action of each mobile object. In this problem, the agent can choose to move forward (a_i = 0) or wait (a_i = 1) for each object.

3.4 State with Time Model

The neural network used to fit the Q function Q(s, a) is a function of both the state and the action. When the reward value is related to time, the neural network lacks information about the task execution time t, which leads to the problem of inconsistent target Q values. To address this issue, this paper proposes the ST model: the dimension of time is added to the state transition process, so that the Q value function becomes a function of the state s, the action a, and the time t. In the process of collecting experience, when the agent transitions from state s to the next state s' by taking action a, the stored tuple becomes ⟨s, a, t, r, s'⟩, where t represents the time spent from the beginning of the task until now. In this way, the target Q values with the addition of the time dimension become those shown in Table 1.

Table 1. The target Q value function obtained with the ST model.

policy | Q value function | value
t1     | q(s1, 0, 0)      | 2·γ²
t1     | q(s2, 1, 0)      | 2·γ
t1     | q(s3, 2, 0)      | 2
t2     | q(s1, 0, 1)      | 3·γ
t2     | q(s3, 1, 0)      | 3

After adding the time dimension to the Q value expression, it can be seen that the problem of inconsistent Q values in the above two tables has been solved. The calculation of the target Q value after adding the ST model to the DDQN algorithm is:

y_j = r_j + \gamma \max_{a'} \hat{Q}(s_{j+1}, t_{j+1}, a'; \theta^{-}).   (2)

The loss function is:

L_i(\theta_i) = \mathbb{E}_{s,t,a \sim \rho(s,t,a)}\big[(y_i - Q(s, t, a; \theta))^2\big].   (3)

where ρ(s, t, a) is the probability distribution on the state s, task execution time t, and action a.
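A sketch of the ST idea in Eqs. 2-3: the elapsed task time t is appended to the state before it enters the Q network, and transitions are stored as ⟨s, a, t, r, s'⟩. The network, batch layout, and dimensions are illustrative assumptions rather than the paper's implementation.

```python
import torch
import torch.nn as nn

gamma, state_dim, n_actions = 0.99, 8, 2
q_net = nn.Linear(state_dim + 1, n_actions)      # Q(s, t, a): time is one extra input feature
q_target = nn.Linear(state_dim + 1, n_actions)   # target network with the same structure

def with_time(state, t):
    """Concatenate the elapsed time t onto the state vector."""
    return torch.cat([state, t.unsqueeze(1)], dim=1)

def st_target(reward, next_state, next_t, done):
    """y_j = r_j + gamma * max_a' Q_target(s_{j+1}, t_{j+1}, a')  (Eq. 2)."""
    with torch.no_grad():
        next_q = q_target(with_time(next_state, next_t)).max(dim=1).values
    return reward + gamma * next_q * (1.0 - done)

# One gradient step on the squared-error loss of Eq. 3 for a toy batch of <s, a, t, r, s'>.
s, a = torch.randn(4, state_dim), torch.randint(0, n_actions, (4, 1))
t, r = torch.zeros(4), torch.tensor([0.0, 0.0, 2.0, -1.0])
s_next, t_next, done = torch.randn(4, state_dim), torch.ones(4), torch.tensor([0.0, 0.0, 1.0, 1.0])
y = st_target(r, s_next, t_next, done)
q_sa = q_net(with_time(s, t)).gather(1, a).squeeze(1)
loss = ((y - q_sa) ** 2).mean()
loss.backward()
```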


Then, the effect of adding the ST model to the DDQN algorithm is tested in a validation experiment. The reward function is set as shown in formula 4 below:

Reward = { N − time, if finishing;  −1, if collision }.   (4)

N is a hyperparameter, which is set to 4 in the validation experiment. The experiment simulates a simple coordination task with 3 mobile objects, as shown in Fig. 4. The blue mobile object needs to reach the target position in two steps, while the yellow and red mobile objects each need to reach the target position in one step. If two vehicles pass through the intersection at the same time, a collision will occur. The results of the experiment are shown in Fig. 3.

Fig. 3. The results of validation experiment. (Color figure online)

In Fig. 3(a), the horizontal axis represents the number of training steps, and the vertical axis represents the algorithm's loss. The green curve represents the td-loss of the DDQN algorithm, while the blue curve represents the td-loss of the DDQN algorithm with the ST module added to the state. It can be seen that the algorithm converges to a lower td-loss after the ST module is added. In Fig. 3(b), the horizontal axis represents the number of tests, with a test conducted every 200 time steps, and the vertical axis represents the reward obtained by the agent when completing the task during the test. It can be seen that the algorithm with the ST module enables the agent to learn a stable coordination strategy, and the corresponding reward value for this strategy is 2, indicating that the coordination time is short and the coordination effect is good. The algorithm without the ST module cannot enable the agent to learn a stable coordination strategy, and the learned strategy keeps changing during the training process.

3.5 Dynamically Updated Reward Model

This paper proposes the DUR model to address the long training time and slow policy convergence of the DPS and TDR models. The DUR model dynamically updates the reward function based on the interaction between the agent and the environment, so as to better guide the agent's actions and improve the training efficiency and the policy convergence speed. The goal of the agent is to maximize the future cumulative expected reward. Suppose that in a motion coordination task, at time t, the state of the agent is s, the optimal coordination strategy after taking action a still requires coordination time T, and the reward obtained after completing the task is r. The reward is discounted at each time step by γ, so the expected reward return R(s, t, a) that the agent can expect after taking action a in state s is γ^T × r. This means that the expected reward return decreases as the coordination time T increases and increases as the reward r increases:

∂R/∂r > 0.   (5)

Therefore, in order to make it easier for the agent to distinguish among the advantages and disadvantages of different strategies, the differences in the reward values r that different coordination strategies can obtain when the task is completed should be expanded. However, since there are many different coordination strategies with different coordination times, if a unique reward value is given to each coordination strategy, the range of all reward values will be very large, which undoubtedly increases the burden of agent training. To address this issue, this paper proposes a dynamically updated reward function model for motion coordination tasks, which can speed up the training process while allowing the agent to more stably learn the best strategies explored. The specific reward function is shown in formula 6:

Reward = { 1/(1 + e^{−N/(α×T)}), if Goal & (T ≤ T_best);  δ, if Goal & (T > T_best) }.   (6)

– Goal & (T > T_best): if during training it is found that the current coordination time of the moving objects is already greater than the time spent by the best explored strategy, the agent will receive a reward value δ. The value of δ is less than 1/(1 + e^{−N/(α×T)}) and greater than or equal to −1, the end flag is set to True, and the current training round is terminated.
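A sketch of how the dynamically updated reward of formula 6 could be computed. Because the formula above is reconstructed from the surrounding description, the branch conditions, the intermediate-step reward of 0, and the concrete values of N, α, and δ are all assumptions for illustration.

```python
import math

def dur_reward(goal_reached, T, T_best, N=4.0, alpha=1.0, delta=-0.5):
    """Dynamically updated reward (formula 6, reconstructed from the text).

    Returns (reward, done). T is the current coordination time and T_best the best
    coordination time explored so far; delta lies in [-1, 1/(1 + e^(-N/(alpha*T)))).
    """
    if goal_reached and T <= T_best:
        return 1.0 / (1.0 + math.exp(-N / (alpha * T))), True
    if T > T_best:
        # Worse than the best explored strategy: small reward, end the episode early.
        return delta, True
    return 0.0, False   # intermediate step (assumed: no shaping reward)

reward, done = dur_reward(goal_reached=True, T=3.0, T_best=4.0)
```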

4 Experiments and Result Analysis

4.1 Experiment Platform

The experimental platform used in this paper is a multi-mobile object motion coordination simulation system developed using the PyQt5 framework. In this simulation system, the path planning algorithm will generate Bezier curve paths for each mobile task. At the same time, it will generate a path chessboard diagram based on the collision constraint relationships among the paths. The path chessboard diagram can quickly detect the section where the mobile object is located and accelerate judgment on whether collisions occur between mobile objects. The path chessboard diagram generated by the system for the motion coordination task in Fig. 4(a) is shown in Fig. 4(b).

Fig. 4. The example of path chessboard graph. (Color figure online)

4.2 Evaluation Metrics

The effectiveness of motion coordination can be evaluated by the success rate, the proportion of valid solutions, the model training time, and the MakeSpan (MS) during testing. In this paper, the LJF algorithm is used as the baseline. When the coordination solution obtained by the algorithm is better than the coordination solution obtained by the LJF algorithm, the solution is considered valid and a score is obtained. The maximum training time step in these experiments is 400,000, and the results are compared on this basis. In a motion coordination task, the coordination time can be represented as max(T_1, T_2, ..., T_n), where T_i represents the coordination end time of mobile object i. The scoring function is shown in formula 7: MS_{LJF} − T, T …

\mathrm{ReLU}'(x) = { 1, if x > 0;  0, otherwise }.   (2)

LeakyReLU differs from ReLU by assigning a non-zero slope to all negative values. Specifically, it divides the negative portion by a value greater than 1. We define the LeakyReLU function as follows:

\mathrm{LeakyReLU}(x) = { x, if x ≥ 0;  x/a, if x < 0 },   (3)

where a is a constant greater than 1, representing the slope of the function for negative values. When the input x is greater than or equal to 0, the output of the LeakyReLU function is x. Conversely, when x is less than 0, the output is x/a. During gradient back propagation, the derivative of LeakyReLU is:

\mathrm{LeakyReLU}'(x) = { 1, if x ≥ 0;  1/a, otherwise }.   (4)

Under the activation of ReLU and LeakyReLU, let us consider a network with Z activation layers, where H^z denotes the neurons of layer z. The output of layer z is

o^z = \{o^z_h, h \in H^z\},   (5)

o^z_h = (w^z_h)^T \hat{o}^{z-1} + b^z,   h \in H^z, z \le Z,   (6)

where b^z signifies the bias term of layer z, and w^z_h represents the weight of neuron h within layer z. Therefore, networks with ReLU and LeakyReLU activation can be seen as connected piecewise linear functions. As a result, every neuron in a network can be represented as a piecewise linear function of the input x \in R^d:

f^z_h(x) = (S^z_h)^T x + b^z_h,   h \in H^z,   (7)

where d is the dimension of the input, and the linear weight and bias of the current neuron are denoted by S^z_h and b^z_h, respectively. Thus, each arrangement of linear inequalities in ReLU and LeakyReLU networks forms convex regions. We define the linear regions of networks with ReLU and LeakyReLU activation as follows:

Definition 1 (adapted from [10]). Let Q be a network with input dimension d_{inp} and fix τ, a vector of trainable parameters for Q. Define

K_Q(\tau) := \{ x \in R^{d_{inp}} \mid \nabla Q(\cdot; \tau) \text{ is discontinuous at } x \}.   (8)

The linear regions of Q at τ are the connected parts of the input space without K_Q:

\mathrm{linear\ regions}(Q, \tau) = \{ x \mid x \in R^{d_{inp}}, x \notin K_Q(\tau) \}.   (9)
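Definition 1 characterizes linear regions via points where the network gradient is discontinuous. A common empirical proxy for counting them on a 2-D input domain is to count distinct sign patterns of the hidden pre-activations over a dense grid; the hedged sketch below uses that proxy with illustrative network sizes (and a LeakyReLU slope of 1/a with a = 100 as reported later), and is not the exact counting procedure of the paper.

```python
import torch
import torch.nn as nn

def count_linear_regions(model, grid_size=200, lo=-1.0, hi=1.0):
    """Approximate the number of linear regions of a 2-D-input piecewise linear
    network by counting distinct pre-activation sign patterns on a dense grid."""
    xs = torch.linspace(lo, hi, grid_size)
    grid = torch.cartesian_prod(xs, xs)                 # (grid_size**2, 2) input points
    patterns, h = [], grid
    with torch.no_grad():
        for layer in model:
            if isinstance(layer, (nn.ReLU, nn.LeakyReLU)):
                patterns.append(h > 0)                  # sign pattern entering this activation
            h = layer(h)
    pattern = torch.cat(patterns, dim=1).to(torch.uint8)
    return len(torch.unique(pattern, dim=0))            # number of distinct activation patterns

relu_net = nn.Sequential(nn.Linear(2, 16), nn.ReLU(),
                         nn.Linear(16, 16), nn.ReLU(),
                         nn.Linear(16, 2))
leaky_net = nn.Sequential(nn.Linear(2, 16), nn.LeakyReLU(negative_slope=0.01),
                          nn.Linear(16, 16), nn.LeakyReLU(negative_slope=0.01),
                          nn.Linear(16, 2))
print(count_linear_regions(relu_net), count_linear_regions(leaky_net))
```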

4 “Dying ReLU” Based on Piecewise Linear Functions

Theorem 1. Given that A is a ReLU network and D is a LeakyReLU network, under the conditions of consistent input distribution, initialization, network structure, and other parameters, the total numbers of continuous and piecewise-linear (CPWL) functions generated during training for A and D can be denoted as follows:

E[#{CPWL functions in A}] = β(1),   (10)
E[#{CPWL functions in D}] = γ(1).   (11)

Then, the relationship between the numbers of CPWL functions generated during training for networks A and D can be expressed as follows:

|β(1)| ≤ |γ(1)|.   (12)

Proof. Let C represent the loss function of the network, and let xi denote the output of the ith neuron in the j th layer. Suppose f (v) = max(0, v) is a ReLU (j) neuron, and vi is the linear input to the (j + 1)th layer. Then, according to the (j) chain rule, the derivative of the loss with respect to the weight wi connecting the j th and (j + 1)th layers is given by: ∂C (j)

∂wi

=

∂C (j+1)

∂xi

(j+1)

∂xi

(j)

.

(13)

∂wi

Referring to (13), the first term on the right-hand side can be recursively calculated. The second term on the right-hand side is the only part directly related (j) to wi and can be decomposed into:   (j) (j+1) (j) ∂f vi ∂xi ∂vi = (j) (j) (j) (14) ∂wi ∂v ∂wi  i  (j) (j) xi . = f  vi Therefore, in the case of ReLU networks, if the output of the previous layer within the network is consistently negative, the weights of the neuron will not be updated, rendering the neuron non-contributory to the learning process. Referring to (7), assuming that the number of neurons in the (j + 1)th layer of the ReLU network whose weights Szh are not updated is denoted as M , it follows that there are M neurons in the (j + 1)th layer of the ReLU network

534

X. Qi et al.

that are unable to form new CP W L functions. We can express the expected total number of CP W L functions generated in the (j + 1)th layer of the ReLU network as follows: E[#{ CP W L functions in the (j + 1)th layer with ReLU}] = λ(1).

(15)

However, LeakyReLU does not suffer from the issue of weights not being updated. Under the condition of consistent parameters, if we replace the activation function between the j th and (j+1)th layers of the network with LeakyReLU, the expected total number of CP W L functions generated in the (j + 1)th layer of the LeakyReLU network can be expressed as follows: E[#{ CP W L functions in the (j + 1)th layer with LeakyReLU}] = η(1). (16) Therefore, without considering the generation of duplicate CP W L functions in the network and the special case where the j th layer of the network has no negative output, we typically have the following expression: |λ(1)| ≤ |η(1)|.

(17)

Referring to (10) and (11), assuming that A is a ReLU network and D is a LeakyReLU network, under the conditions of consistent input distribution, initialization, network structure, and other parameters, we easily get (12).

5

Experiments

We employed the PyTorch framework for our experiments. Each neuron in the network is represented as a linear function based on the input. The linear function of each neuron can be obtained through forward propagation. The datasets used in the experiments consist of two-dimensional data, as two-dimensional input data is easier to visualize. We used three types of two-dimensional datasets. The first dataset consists of 500 samples uniformly generated within the range of –1 to 1, with randomly generated labels dividing the data into two classes: 0 and 1. The second dataset is a binary classification dataset make moons with 1000 samples. The third dataset is a five-class classification dataset make gaussian quantiles with 1000 samples. Figure 4 illustrates the distribution of the three input data types in the network. Regarding the network parameters and configurations, we utilized network structures with different numbers of Blocks, as shown in Fig. 1, to examine the expressive capacity of the linear regions for both ReLU and LeakyReLU networks. Additionally, we employed the Adam optimizer, set the batch size to 32, used the Cross Entropy loss function, set the learning rate to 0.001, and set the value of a in (3) to 100. Figure 5, Fig. 6, and Fig. 7 present the number of linear regions and the visualization results in ReLU and LeakyReLU networks trained on the three types of two-dimensional input samples for different epochs. Through these figures, we can visually observe a significant difference in the expressive capacity of linear regions between ReLU and LeakyReLU networks, under the same external

Comparative Analysis of the Linear Regions

535

Fig. 4. (a) Two-dimensional input dataset of 500 samples randomly generated within the range of −1 to +1, with randomly assigned labels for two classes. (b) Two-dimensional binary dataset make_moons comprising 1000 samples. (c) Two-dimensional five-class dataset make_gaussian_quantiles comprising 1000 samples.

Fig. 5. The number of linear regions and the visualization results of the networks trained for r epochs with input Fig. 4 (a), for r = 0, 10, 100, 500, 1000, 3000, 5000, 10000. Top row (with ReLU): The network utilizes three fully connected Block 1. Bottom row (with LeakyReLU): The network employs three fully connected Block 2. (a) k = 16. (b) k = 32. (c) k = 64.


Based on our analysis of the number of linear regions and the visualization results, considering the same input distribution, number of neurons, and training parameters, in the majority of cases (while acknowledging the possibility of rare occurrences of CPWL function duplication and linear region repetition during network training), LeakyReLU networks demonstrate a stronger ability to express linear regions than ReLU networks.

Fig. 6. The number of linear regions and the visualization results of the networks trained for r epochs with input Fig. 4 (b), for r = 0, 2, 4, 8, 10, 30, 50, 100. Top row (with ReLU): The network utilizes three fully connected Block 1. Bottom row (with LeakyReLU): The network employs three fully connected Block 2. (a) k = 16. (b) k = 32. (c) k = 64.


Fig. 7. The number of linear regions and the visualization results of the networks trained for r epochs with input Fig. 4 (c), for r = 0, 2, 4, 8, 10, 30, 50, 100. Top row (with ReLU): The network utilizes three fully connected Block 1. Bottom row (with LeakyReLU): The network employs three fully connected Block 2. (a) k = 16. (b) k = 32. (c) k = 64.

6 Conclusion

The actual expressive capacity of most CPWL networks is still not well understood. In this study, we investigated a key characteristic of ReLU and LeakyReLU networks: their ability to express a number of linear regions. This serves as a representative measure of the complexity of the functions they can effectively learn. We conducted a series of experiments to explore how the linear regions of ReLU and LeakyReLU networks evolve during the training process. This included analyzing the evolution of linear regions layer by layer in the network and examining the evolution of linear regions across different epochs. Furthermore, we provided an intuitive explanation of the "dying ReLU" problem based on the expressive capacity of CPWL functions within the network. Specifically, we found that under the same conditions of network input, network structures,


and training parameters, LeakyReLU networks generally exhibit a stronger ability to express linear regions compared to ReLU networks. We hope that our research can provide inspiration for the design of activation functions and contribute to the exploration and analysis of the behavior exhibited by CP W L functions within networks. Acknowledgements. This work was supported by the National Natural Science Foundation of China (Grant No. 61973334).

References 1. Arora, R., Basu, A., Mianjy, P., Mukherjee, A.: Understanding deep neural networks with rectified linear units. arXiv preprint arXiv:1611.01491 (2016) 2. Cai, Y., et al.: Yolobile: real-time object detection on mobile devices via compression-compilation co-design. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 955–963 (2021) 3. Chen, H., Wang, Y.G., Xiong, H.: Lower and upper bounds for numbers of linear regions of graph convolutional networks. arXiv preprint arXiv:2206.00228 (2022) 4. Chen, P., Zhuang, B., Shen, C.: FATNN: fast and accurate ternary neural networks. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 5219–5228 (2021) 5. Freeman, I., Roese-Koerner, L., Kummert, A.: EffNet: an efficient structure for convolutional neural networks. In: 2018 25th IEEE International Conference on Image Processing (ICIP), pp. 6–10. IEEE (2018) 6. Gao, Z., Chen, X., Xu, J., Yu, R., Zong, J.: A comprehensive vision-based model for commercial truck driver fatigue detection. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) Neural Information Processing. ICONIP 2022. LNCS, vol. 13625, pp. 182–193. Springer, Cham (2023). https://doi.org/10.1007/978-3031-30111-7 16 7. Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pp. 315–323. JMLR Workshop and Conference Proceedings (2011) 8. Goujon, A., Etemadi, A., Unser, M.: The role of depth, width, and activation complexity in the number of linear regions of neural networks. arXiv preprint arXiv:2206.08615 (2022) 9. Hanin, B., Rolnick, D.: Complexity of linear regions in deep networks. In: International Conference on Machine Learning, pp. 2596–2604. PMLR (2019) 10. Hanin, B., Rolnick, D.: Deep ReLU networks have surprisingly few activation patterns. In: Advances in Neural Information Processing Systems, vol. 32 (2019) 11. Hinz, P.: Using activation histograms to bound the number of affine regions in ReLU feed-forward neural networks. arXiv preprint arXiv:2103.17174 (2021) 12. Hu, Q., Zhang, H., Gao, F., Xing, C., An, J.: Analysis on the number of linear regions of piecewise linear neural networks. IEEE Trans. Neural Netw. Learn. Syst. 33(2), 644–653 (2020) 13. Humayun, A.I., Balestriero, R., Balakrishnan, G., Baraniuk, R.G.: SplineCam: exact visualization and characterization of deep network geometry and decision boundaries. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3789–3798 (2023)


14. Maas, A.L., Hannun, A.Y., Ng, A.Y., et al.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of International Conference on Machine Learning, vol. 30, p. 3. Atlanta, Georgia, USA (2013) 15. Mont´ ufar, G., Ren, Y., Zhang, L.: Sharp bounds for the number of regions of maxout networks and vertices of minkowski sums. SIAM J. Appl. Algebra Geom. 6(4), 618–649 (2022) 16. Nair, V., Hinton, G.E.: Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning (ICML-10), pp. 807–814 (2010) 17. Tseran, H., Montufar, G.F.: On the expected complexity of maxout networks. Adv. Neural. Inf. Process. Syst. 34, 28995–29008 (2021) 18. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: Scaled-yolov4: scaling cross stage partial network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13029–13038 (2021) 19. Wang, Y.: Estimation and comparison of linear regions for ReLU networks. In: International Joint Conference on Artificial Intelligence (2022) 20. Xiong, H., Huang, L., Yu, M., Liu, L., Zhu, F., Shao, L.: On the number of linear regions of convolutional neural networks. In: International Conference on Machine Learning, pp. 10514–10523. PMLR (2020) 21. Yang, L., et al.: CondenseNet v2: sparse feature reactivation for deep networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3569–3578 (2021)

Heterogeneous Graph Prototypical Networks for Few-Shot Node Classification Yunzhi Hao1,2(B) , Mengfan Wang1 , Xingen Wang1 , Tongya Zheng1 , Xinyu Wang1 , Wenqi Huang3 , and Chun Chen1 1

Zhejiang University, Hangzhou 310058, China {ericohyz,21951202,newroot,tyzheng,wangxinyu,chenc}@zju.edu.cn 2 Zhejiang University - China Southern Power Grid Joint Research Centre on AI, Hangzhou 310058, China 3 Digital Grid Research Institute, China Southern Power Grid, Guangzhou 510663, China [email protected] Abstract. The node classification task is one of the most significant applications in heterogeneous graph analysis, which is widely used for modeling multi-typed interactions. Meanwhile, Graph Neural Networks (GNNs) have aroused wide interest due to their remarkable effects on graph node classification. However, there are some challenges when applying GNNs to heterogeneous graph node classification: the cumbersome node labeling cost, and the heterogeneity of graphs. Existing GNNs require sufficient annotation while learning classifiers independently with node embeddings cannot exploit graph topology effectively. Recently, few-shot learning has achieved competitive results in homogeneous graphs to address the performance degradation in the label sparsity case. While heterogeneous graph few-shot learning is limited by the difficulties of extracting multiple semantics. To this end, we propose a novel Heterogeneous graph Prototypical Network (HPN) with two modules: Graph structural module generates node embeddings and semantics for meta-training by capturing heterogeneous structures. Meta-learning module produces prototypes with heterogeneous induced subgraphs for meta-training classes, which improves knowledge utilization compared with the traditional meta-learning. Experimental results on three realworld heterogeneous graphs demonstrate that HPN achieves outstanding performance and better stability. Keywords: Few-shot Learning · Heterogeneous Graph Node Classification · Meta-learning · Graph Convolutional Networks

1 Introduction

Heterogeneous graph analysis has practical significance and wide applications, since the interactions between real-world entities are often multi-typed [22,30], as in social graphs and citation graphs. Node classification [2,35] finds crucial applications across many domains, such as document categorization and protein classification [4]. Graph neural networks (GNNs) [12,15,26], owing to their strong performance on node classification in heterogeneous graphs, have gained increasing popularity [8,13,31]. However, the heterogeneous graph node classification task poses three main challenges: (1) the rich structural and semantic proximities, despite their importance, are difficult to aggregate as formatted auxiliary information; (2) model performance relies heavily on a large amount of annotated samples; (3) defining node labels requires prior domain knowledge, which makes the task cumbersome to deploy in practice [32].

For the first challenge, existing heterogeneous embedding methods [8,10] first aggregate heterogeneous structures and then apply the learned embeddings to downstream tasks. Such two-phase frameworks have an inherent deficiency: task-related information cannot be effectively exploited. Focusing on the latter two challenges, existing GNNs [28,33,34] for heterogeneous graph node classification usually follow a semi-supervised paradigm. However, they are prone to errors when fewer than 10% of the nodes are labeled, especially when labels are not available for all node classes.

Meta-learning, which aims to handle data deficiency by recognizing new categories from very few labeled samples [1], has yielded significant progress [24], and meta-learning methods on graphs show superior performance in few-shot node classification [6,37]. Meta-GNN [38] applies meta-learning to GNNs for the first time. GPN [7] generates effective class prototypes with high robustness through a network encoder and a node valuator. G-Meta [14] collects meta-gradients from local graphs to update network parameters. However, the aforementioned graph few-shot learning methods cannot deal with diverse semantics and can only be applied to homogeneous graphs. Very recently, several attempts have also been made to apply meta-learning to heterogeneous graphs. MetaHIN [18] proposes a novel semantic-enhanced task constructor and a co-adaptation meta-learner. HG-Meta [36] builds a graph encoder to aggregate heterogeneous information with meta-paths and leverages unlabelled information to alleviate the low-data problem.

However, there are three main difficulties in applying meta-learning to heterogeneous graphs. 1) Meta-training task sampling. Meta-learning first trains a model on sampled similar tasks and then tests it on new few-shot classes. In traditional scenarios, these sampled training tasks are assumed to be independent of each other; heterogeneous graph nodes, however, are naturally related and show diverse importance, and training on far-related nodes reduces efficiency. How to sample effective training tasks with global graph knowledge therefore has a significant impact on model performance. 2) Structural and semantic heterogeneities. The multiple semantics and structures in heterogeneous graphs are difficult for existing methods to capture as auxiliary knowledge during meta-training. How to effectively extract information from different types of neighbors remains a problem. 3) Model robustness. Noticeably, models with low robustness show a significant decline in effectiveness once the new class is difficult to classify.

The robustness analysis in Sect. 5 verifies that the performance of previous methods differs largely across different testing sets, which illustrates their limitations.

To address the above difficulties, we propose a novel Heterogeneous graph Prototypical Network (HPN) for the heterogeneous graph few-shot node classification task. The core of HPN is to generate effective prototypes with heterogeneous relational structures for multiple node classes. We first sample effective meta-training classes with global graph knowledge to avoid redundancy and make the model more adaptable to new classes. Then the graph structural module captures the multiple semantics to produce node representations and an attention matrix that quantifies the importance of different semantics. The meta-learning module is trained on the sampled tasks and generates practical prototypes from the induced heterogeneous subgraphs. In this way, HPN produces representative class prototypes with various kinds of auxiliary knowledge during meta-training and shows higher performance on new node classification tasks under few-shot settings. We summarize the main contributions as follows:

– We propose a novel graph-based meta-learning scheme, HPN, to tackle the heterogeneous few-shot node classification task.
– A graph structural module and a meta-learning module are proposed for learning representative prototypes by incorporating heterogeneous relational structures into each sampled class.
– Extensive experiments on three heterogeneous graph analysis tasks across various domains prove that our HPN significantly outperforms state-of-the-art methods in heterogeneous knowledge amalgamation.

2 Related Work

2.1 Heterogeneous Graph Node Classification

Node classification, one of the most significant graph tasks, aims at predicting missing node labels based on graph topology, semantics, and the labeled nodes [5,11,25]. Earlier methods generate node embeddings with meta-paths [8,21] and apply a discriminative classifier such as logistic regression on top of the embeddings. Recently, GNNs have gained great popularity; they address node classification in an end-to-end manner with more than 10% labeled nodes. HetGNN [34] can encode both structural and content heterogeneity. HAN [28] first applies node- and semantic-level attention mechanisms. Considering the limitations of pre-defined meta-paths, recent methods have started to generate meta-paths automatically. GTN [33] first performs graph convolution with the importance of all combinations of meta-paths within a length limit. HGT [13] proposes a mini-batch graph sampling method to learn personalized semantics, and ie-HGCN [31] performs object- and type-level aggregation and brings better interpretability. However, the above methods are designed for the semi-supervised node classification task and cannot handle new classes with severely limited labels.

2.2 Few-Shot Learning

Few-shot learning [20] trains classifiers with only a few labeled samples. There are three popular families of approaches: model-based, metric-based, and optimization-based methods [29]. Meta Networks [19] consist of three components: meta-knowledge acquisition, fast weight generation, and slow weight optimization. Prototypical Networks [23] use a nearest-neighbor classifier after learning the distances between batch samples and support samples. MAML [9] trains the parameters explicitly and yields good generalization performance with a small number of gradient steps. As for graph few-shot learning methods, Meta-GNN [38] applies meta-learning to GNNs for the first time and leverages MAML for the gradient updates. GPN [7] extracts meta-knowledge from attributed networks and further generates representative prototypes. MetaTNE [16] automatically learns prior knowledge from the relations between graph topology and classes. Very recently, several few-shot methods have started to consider meta-learning on heterogeneous graphs. MetaHIN [18] proposes a novel semantic-enhanced task constructor and a co-adaptation meta-learner. HG-Meta [36] aggregates heterogeneous information with meta-paths and leverages unlabelled information to alleviate the low-data problem. However, the above approaches cannot improve both efficiency and robustness due to the lack of semantic knowledge. In this work, we are motivated to develop a metric-based method with heterogeneous GNNs to fully extract both semantic and task-related knowledge.

3 Preliminaries

In this section, we give the definitions used in this paper to describe our heterogeneous graph few-shot learning method.

Heterogeneous Graph. A heterogeneous graph G = (V, E, A, X) contains a set of nodes V, a set of edges E, and an adjacency matrix set A. The type mapping functions φ : V → A and ϕ : E → R give the types of nodes and edges respectively, in which A and R are the predefined node and edge type sets, and |A| + |R| > 2. The features of the nodes V are denoted by X.

Heterogeneous Graph Few-Shot Node Classification. Given a heterogeneous graph G = (V, E, A, X) with a target node type for classification, the few-shot problem defines a training node set Dtrain belonging to the class set C1 and a testing node set Dtest belonging to the class set C2, with C1 ∩ C2 = ∅. In Dtest, only N nodes per class have labels available in the support set, where N is a small number. Our objective is to classify the unlabeled nodes in the testing set.

Prototypical Networks. Prototypical Networks [23] compute a prototype ck, an M-dimensional embedding with ck ∈ R^M, for each class k from a small support set Sk of N labeled samples. This is achieved with the following function:


Fig. 1. The overall framework of HPN. HPN first encodes the heterogeneous graph to produce embeddings and a semantic importance matrix M. Then HPN samples meta-training classes and generates prototypes with induced subgraphs.

\[ c_k = \frac{1}{|S_k|} \sum_{(x_i, y_i) \in S_k} f_\phi(x_i). \tag{1} \]

Prototypical Networks generate a distribution for a query sample x by applying a softmax over the distances to each class prototype:

\[ p_\phi(y = k \mid x) = \frac{\exp(-d(f_\phi(x), c_k))}{\sum_{k'=1}^{K} \exp(-d(f_\phi(x), c_{k'}))}, \tag{2} \]

where K denotes the number of training classes, and d is a distance metric. By minimizing the negative log-probability J(φ) = − log pφ (y = k|x) of the true class k, the model is iteratively updated until stable.
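As a concrete illustration of Eqs. (1)-(2), the following minimal sketch computes class prototypes and the distance-based class distribution for one episode. It assumes the encoder f_φ has already produced the embeddings (toy random tensors stand in for real node embeddings); it is not the authors' implementation.

```python
# Minimal sketch of Eqs. (1)-(2): class prototypes as support-set means and a
# softmax over negative Euclidean distances. Toy tensors replace real
# embeddings; f_phi is assumed to have been applied already.
import torch
import torch.nn.functional as F

def class_prototypes(support_emb, support_lbl, num_classes):
    # Eq. (1): mean embedding of the support samples of each class
    return torch.stack([support_emb[support_lbl == k].mean(dim=0)
                        for k in range(num_classes)])

def proto_log_probs(query_emb, prototypes):
    # Eq. (2): log softmax over negative distances to each prototype
    dists = torch.cdist(query_emb, prototypes)          # [num_query, K]
    return F.log_softmax(-dists, dim=1)

# toy 2-way 3-shot episode with 8-dimensional embeddings
support = torch.randn(6, 8)
labels = torch.tensor([0, 0, 0, 1, 1, 1])
query = torch.randn(4, 8)
protos = class_prototypes(support, labels, num_classes=2)
loss = F.nll_loss(proto_log_probs(query, protos), torch.tensor([0, 1, 0, 1]))
```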

4 Methodology

HPN aims to adapt both the heterogeneous structure and the semantic knowledge captured from several training sets to new tasks under few-shot learning settings. As illustrated in Fig. 1, HPN first extracts structural and semantic correlations with GNN layers; the meta-learning module then samples the training classes with a semantic importance matrix M. HPN generates a prototype for each training class with heterogeneous substructures.

4.1 Graph Structural Module

We design a graph structural module to aggregate heterogeneous structures into the node embeddings and to automatically extract multiple semantics into a semantic attention matrix that quantifies their importance. Inspired by ie-HGCN [31], we follow the graph projection architecture and project neighbors from heterogeneous domains into a new common semantic space before aggregating their information. For nodes Vq of node type q ∈ A, the projection of Vq and of its neighbors Vs of node type s ∈ A can be formulated as follows:

\[ Y^l_{q-q} = H^{l-1}_q \cdot W_{q-q}, \qquad Y^l_{s \to q} = H^{l-1}_s \cdot W_{s \to q}, \; s \in N_q, \tag{3} \]

where H^l_q is the representation of the nodes Vq of type q at layer l, and the first layer is defined as H^0_q = X_q, the original input features of Vq. N_q denotes the neighbor node types of Vq. Here W_{s→q} is a projection matrix from node type s to type q, and W_{q-q} is a self-projection matrix that projects H^{l-1}_q from the original feature domain into the new joint space at layer l. Y^l_{q-q} and Y^l_{s→q} are the projected hidden representations of the nodes Vq and of the neighbors Vs, respectively. After projecting all neighbor representations into the common semantic space, the neighbor information of Vq is aggregated with the adjacency matrix set A. Specifically, an attention mechanism is employed to automatically capture the importance of neighbors of different types for Vq. Briefly, the higher-level representations of nodes Vq are defined as follows:

\[ H^l_q = \sigma\Big( \sum_{s \in N_q} \alpha_{s \to q} \cdot Y^l_{s \to q} \cdot A_{s-q} + \alpha_{q-q} \cdot Y^l_{q-q} \Big), \tag{4} \]

where σ is the nonlinearity, α_{s→q} is the normalized attention coefficient from node type s to the current node type q, and A_{s-q} is the corresponding adjacency matrix from the set A between Vs and Vq. After several layers, HPN aggregates the structural and semantic dependencies of the heterogeneous graph. We define the final embeddings for nodes of all types as

\[ H = \{H^L_1, H^L_2, \ldots, H^L_{|A|}\}. \tag{5} \]

With the learned attention coefficients α_{s→q} for each type pair from the node type set A, HPN calculates the mean attention distribution in each layer. We then generate the semantic attention matrix M ∈ R^{|A|×|A|} containing the attention scores of each node type pair, based on which the importance scores of diverse meta-paths can be computed by matrix multiplication.
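To make the projection-and-attention scheme of Eqs. (3)-(4) concrete, here is a small sketch of one such layer. It is a simplified stand-in, not the released HPN code: dense toy adjacency matrices are assumed, and the attention coefficients are plain learnable scalars normalized by a softmax rather than the full ie-HGCN-style attention.

```python
# Simplified sketch of one heterogeneous layer in the spirit of Eqs. (3)-(4):
# type-specific projections followed by attention-weighted aggregation.
import torch
import torch.nn as nn

class HeteroLayer(nn.Module):
    def __init__(self, in_dims, out_dim, neighbor_types):
        super().__init__()
        # W_{q-q}: self-projection per type; W_{s->q}: one projection per (s, q) pair
        self.self_proj = nn.ModuleDict(
            {q: nn.Linear(d, out_dim, bias=False) for q, d in in_dims.items()})
        self.nbr_proj = nn.ModuleDict(
            {f"{s}->{q}": nn.Linear(in_dims[s], out_dim, bias=False)
             for q, nbrs in neighbor_types.items() for s in nbrs})
        # one attention logit per message source (self plus each neighbor type)
        self.att = nn.ParameterDict(
            {q: nn.Parameter(torch.zeros(len(nbrs) + 1))
             for q, nbrs in neighbor_types.items()})
        self.neighbor_types = neighbor_types

    def forward(self, H, A):
        # H: dict type -> [n_type, d_type]; A[(s, q)]: [n_q, n_s] adjacency
        out = {}
        for q, nbrs in self.neighbor_types.items():
            msgs = [self.self_proj[q](H[q])]                          # Y_{q-q}
            for s in nbrs:                                            # A_{s-q} aggregates Y_{s->q}
                msgs.append(A[(s, q)] @ self.nbr_proj[f"{s}->{q}"](H[s]))
            alpha = torch.softmax(self.att[q], dim=0)                 # normalized coefficients
            out[q] = torch.relu(sum(a * m for a, m in zip(alpha, msgs)))
        return out
```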

4.2 Meta-Learning Module

The meta-learning module simulates the few-shot setting by sampling a few training nodes from Dtrain. In brief, HPN samples N_C node classes from the training class set C1, where N_C = |C2|. Then HPN samples N labeled nodes from each class to mimic the testing node classification setting. Each meta-training task T_t is defined as follows:

\[ S_t = \{(v_1, y_1), (v_2, y_2), \ldots, (v_{N_C \times N}, y_{N_C \times N})\}, \]
\[ Q_t = \{(v'_1, y'_1), (v'_2, y'_2), \ldots, (v'_{N_C \times (D_k - N)}, y'_{N_C \times (D_k - N)})\}, \]
\[ T_t = S_t + Q_t, \tag{6} \]

where S_t is the support set with N labeled nodes per class, Q_t is the query set with D_k − N unlabeled nodes, and D_k is the number of nodes of class c_k. We repeat the above sampling steps T times to generate T meta-training tasks.

Meta-Sampling. Previous approaches randomly sample classes from the set C1 for meta-training, which makes the model performance highly dependent on the new test set. Worse, training on far-related node classes reduces classification efficiency. Therefore, we perform global-level sampling that selects, per episode, meta-training classes from C1 that are more related to the whole graph G, making sure that the model can be explicitly trained for any testing task. Specifically, this is achieved by calculating the correlation score between the nodes of the hub node set V_hub and the nodes of each training class c_k. Hub nodes are structurally important nodes that serve as the backbone of the graph and encode global graph information; they can be identified by various node centrality scores such as node degree or PageRank [3]. We use PageRank in our experiments as an example. With the hub node set V_hub and the adjacency matrix set A, we count the different types of neighbors of V_hub related by different meta-paths. Taking IMDB as an example, if movies m1 and m2 are directed by the same director d1, then m2 is one of m1's "director"-type neighbors. Neighbors of different types are assigned different importance by the semantic importance matrix M. With the number of neighbors from class c_k, the relevance score between c_k and the hub node set V_hub is computed as follows:

\[ \mathrm{score}(c_k, V_{hub}) = \sum_{n^k_s \in N_{V_{hub}}} M_s \cdot |n^k_s|, \tag{7} \]

where n^k_s denotes the s-type neighbors of V_hub that belong to class c_k, and M_s is the importance coefficient of type s from the semantic importance matrix M. Thus, class c_k is selected as a meta-training class from C1 with the following probability:

\[ \Pr(c_k) = \mathrm{softmax}(\mathrm{score}(c_k)) = \frac{\exp(\mathrm{score}(c_k))}{\sum_{c_j \in C_1} \exp(\mathrm{score}(c_j))}. \tag{8} \]

In this way, we sample global-level meta-training classes with probability Pr(c_k). To guarantee task coverage and a high sampling rate, we design a fixed-length list L_hist of size m = N_C × |A| that records the up-to-m classes visited previously during meta-sampling; class c_k is selected as the next meta-training class only if c_k is not in L_hist. We then randomly select S and Q for each selected node class to generate the meta-training tasks. The experimental results show that sampling global-level training classes via V_hub not only improves the model performance but also adjusts the model to testing tasks of different difficulties, which improves the model robustness.
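The following sketch illustrates the meta-sampling of Eqs. (7)-(8) under simplifying assumptions: the hub node set is taken as already fixed (for example, the top-5% PageRank nodes of the target type), and the per-class, per-type neighbor counts of the hub set are assumed precomputed; names such as `neighbor_counts` and `type_importance` are illustrative, not the paper's API.

```python
# Sketch of global-level meta-sampling: relevance scores as in Eq. (7) and
# softmax class selection as in Eq. (8), with an L_hist-style exclusion list.
import numpy as np

def class_scores(train_classes, neighbor_counts, type_importance):
    # Eq. (7): score(c_k, V_hub) = sum_s M_s * |n_s^k|
    return np.array([sum(type_importance[s] * neighbor_counts[c].get(s, 0)
                         for s in type_importance) for c in train_classes])

def sample_episode_classes(train_classes, scores, n_way, history, rng):
    # Eq. (8): softmax over scores; classes recorded in L_hist are masked out.
    # Assumes at least n_way classes remain outside `history`.
    mask = np.array([c not in history for c in train_classes], dtype=float)
    probs = np.exp(scores - scores.max()) * mask
    probs /= probs.sum()
    idx = rng.choice(len(train_classes), size=n_way, replace=False, p=probs)
    return [train_classes[i] for i in idx]

# toy usage
rng = np.random.default_rng(0)
classes = ["c0", "c1", "c2", "c3"]
counts = {c: {"author": i + 1, "paper": 2 * i} for i, c in enumerate(classes)}
scores = class_scores(classes, counts, {"author": 0.6, "paper": 0.4})
print(sample_episode_classes(classes, scores, n_way=2, history=["c0"], rng=rng))
```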

Heterogeneous Prototype Generation. After generating the meta-training tasks, we learn the class prototypes from the support set. Different from the prototypes on homogeneous graphs, which are computed as the mean vectors of the embeddings in the corresponding support set (as in GPN), a prototype on a heterogeneous graph should aggregate multiple structures and semantic information. Therefore, HPN first extracts a one-hop heterogeneous subgraph G_k for the nodes V_k in the support set S_k by extracting their first-order proximity in graph G. G_k contains multiple types of nodes and edges and can be regarded as the local relational network of the nodes V_k. For each node type q ∈ A related to the current class c_k, the type-level prototype is computed directly as

\[ c^q_k = \frac{1}{|V_q|} \sum_{v_q \in G_k} f_\phi(h_q), \tag{9} \]

where h_q, taken from the set H, is the embedding of node v_q from subgraph G_k, and f_φ(·) is the prototype computation function, usually an average aggregator. Through the semantic importance matrix M, we calculate the prototype of class c_k as a weighted average over the node types of the subgraph G_k. Formally, the prototype of c_k is defined as

\[ c_k = \sum_{q \in A} M_q \cdot c^q_k, \tag{10} \]

where M_q is the importance score of node type q with respect to the currently classified type. In this way, our heterogeneous prototype generation mechanism calculates class prototypes from both the support set S_k and its local relational network G_k, which aggregates multiple semantics and alleviates the low-data problem.
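A compact sketch of Eqs. (9)-(10) follows: type-level prototypes are averaged over the induced subgraph G_k and then combined with the semantic importance scores. The toy dictionaries stand in for the real subgraph embeddings and for the learned matrix M.

```python
# Sketch of the heterogeneous prototype of Eqs. (9)-(10): per-type mean
# embeddings weighted by the semantic importance scores M_q.
import torch

def hetero_prototype(subgraph_emb, type_importance):
    # subgraph_emb: dict node_type -> [n_nodes_of_type, d] embeddings from G_k
    proto = None
    for q, emb in subgraph_emb.items():
        type_proto = emb.mean(dim=0)                       # Eq. (9): c_k^q
        weighted = type_importance[q] * type_proto         # Eq. (10): M_q * c_k^q
        proto = weighted if proto is None else proto + weighted
    return proto

# toy usage for one class whose induced subgraph has "paper" and "author" nodes
g_k = {"paper": torch.randn(5, 8), "author": torch.randn(3, 8)}
c_k = hetero_prototype(g_k, {"paper": 0.7, "author": 0.3})
```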

Model Optimization. Once the class prototypes are generated for each meta-training task T_t, they act as predictors for the nodes in the query set Q'_t, assigning a probability p(c_k | v'_i) to every training class according to the distance between v'_i from Q'_t and each class prototype vector c_k:

\[ p(c_k \mid v'_i) = \mathrm{softmax}(-d(h'_i, c_k)) = \frac{\exp(-d(h'_i, c_k))}{\sum_{c_j} \exp(-d(h'_i, c_j))}, \tag{11} \]

where d(·, ·) is a distance function, usually the Euclidean distance.


By minimizing the classification loss L, HPN trains on each meta-training task T_t using the prediction for each node v'_i in Q'_t and its ground truth y'_i. In particular, the final classification loss L is defined as the average negative log-likelihood of assigning the correct labels:

\[ \mathcal{L} = \frac{1}{N_C \times (D_k - N)} \sum_{v'_i \in Q'} -\log p(y'_i \mid v'_i). \tag{12} \]

Compared with previous approaches, our combination of heterogeneous information and node labels provides an efficient and robust way to adapt to new classes.

4.3 Complexity Analysis

The proposed framework HPN consists of a graph structural module and a meta-learning module. The former efficiently performs heterogeneous graph convolution in each layer without iterative multiplication of adjacency matrices. The computational complexity of the graph convolution is O(|E| × d × d'), where d and d' are the original node feature size and the node embedding size, respectively. Since the adjacency matrices A of real graphs are often sparse and the number of neighbors of a node is typically very small compared to |V| and |E|, the time complexity of the attention computation is O(|V| + |E|). In practice, the time complexity of our proposed HPN is linear in the number of nodes and edges of the input heterogeneous graph G, which indicates the efficiency of the model.

5 Experiments

In this section, we first introduce three real-world heterogeneous evaluation graphs and nine related competitor models. Then we compare our HPN with state-of-the-art models on several few-shot classification tasks and present the detailed experimental results. Finally, we provide further robustness analysis and ablation studies to prove the efficacy of HPN.

5.1 Datasets

We evaluate HPN on three real-world heterogeneous datasets from two domains. Detailed statistics of the three datasets are summarized in Table 1.

Aminer (https://www.aminer.cn/citation). The academic heterogeneous graph Aminer consists of four types of nodes: 127,623 papers (P), 164,472 authors (A), 101 conferences (C), and 147,251 references (R). The target node set is classified into ten classes.

IMDB (https://grouplens.org/datasets/hetrec-2011/). A subset of the IMDB dataset in HetRec 2011 is extracted for evaluation. IMDB describes heterogeneous information about movies and contains 10,196 movies (M), 95,318 actors (A), and 4,059 directors (D). The target node set is classified into eighteen styles.

Douban (https://github.com/librahu/HIN-Datasets-for-Recommendation-and-NetworkEmbedding). Douban is another movie review network with four types of nodes: 9,754 movies (M), 6,013 actors (A), 2,310 directors (D), and 13,080 users (U). The target node set of movie nodes contains ten classes.

Table 1. Statistics of the three datasets. (src-dst) represents an edge type where src is the source node and dst is the destination node.

Dataset   (src-dst)   Num of src   Num of dst   Num of src-dst   Classes
Aminer    P-A         127623       164472       355072           10
          P-C         127623       101          127623
          P-R         127623       147251       392519
IMDB      M-A         10196        95318        231736           18
          M-D         10196        4059         10154
Douban    M-U         9754         13080        882298           10
          M-A         9754         6013         26346
          M-D         9754         2310         8751

For Aminer and Douban, we evaluate HPN on all combinations of two node classes as meta-testing classes. We do the same for IMDB, except that N_C = 3 for meta-testing due to its relatively large number of unique node classes.

5.2 Baselines

We compare our model with nine baselines, ranging from traditional graph embedding models to state-of-the-art graph few-shot node classification approaches.

Semi-Supervised Methods. GCN [15] is a GNN-based model for homogeneous graph node classification. metapath2vec [8] and HAN [28] are heterogeneous node classification models based on meta-paths. ie-HGCN [31] is a GNN-based semi-supervised heterogeneous node classification model that automatically evaluates all combinations of meta-paths within a length limit.

Meta-Learning Methods. Meta-GNN [38] first addresses the graph few-shot node classification task with optimization-based MAML. GPN [7] derives representative class prototypes for meta-training with a network encoder and a node valuator. AMM-GNN [27] uses an attention mechanism to transform the support characteristics for meta-testing tasks. G-Meta [14] collects meta-gradients from the local graphs surrounding the target nodes to update network parameters. MetaHIN [18] proposes a novel semantic-enhanced task constructor and a co-adaptation meta-learner for heterogeneous graphs.

Implementation Details. For the homogeneous graph classification methods, we apply them to the whole graph, neglecting the graph heterogeneity and treating all node and edge types equally. For the metapath-based heterogeneous methods metapath2vec and HAN, we use the pre-defined meta-paths PAP, PCP, and PRP on Aminer; MAM and MDM on IMDB; and MAM, MUM, and MDM on Douban. For the semi-supervised models, we follow [38] to adapt them to the graph few-shot node classification setting. For Meta-GNN, GPN, AMM-GNN, G-Meta, and MetaHIN, we follow the settings of the original papers. For the proposed HPN, we rank all nodes by their PageRank scores [3] in descending order and select the top 5% of nodes of the current type as hub nodes [17]. We select all possible meta-testing classes for each dataset and train the model over 200 episodes.

Table 2. Few-shot node classification results on the three datasets.

Methods        Aminer                                          IMDB                                            Douban
               5-shot        3-shot        1-shot              5-shot        3-shot        1-shot              5-shot        3-shot        1-shot
               ACC    F1     ACC    F1     ACC    F1           ACC    F1     ACC    F1     ACC    F1           ACC    F1     ACC    F1     ACC    F1
GCN            0.504  0.530  0.500  0.518  0.515  0.497        0.248  0.207  0.346  0.294  0.300  0.209        0.639  0.660  0.454  0.440  0.480  0.454
metapath2vec   0.529  0.505  0.481  0.433  0.448  0.359        0.376  0.399  0.362  0.377  0.353  0.311        0.792  0.834  0.628  0.658  0.518  0.501
HAN            0.574  0.497  0.572  0.478  0.560  0.452        0.264  0.268  0.324  0.296  0.298  0.224        0.842  0.838  0.682  0.703  0.665  0.680
ie-HGCN        0.585  0.502  0.577  0.499  0.581  0.513        0.335  0.360  0.317  0.303  0.311  0.269        0.821  0.825  0.701  0.698  0.679  0.691
Meta-GNN       0.503  0.432  0.507  0.400  0.499  0.403        0.361  0.297  0.345  0.271  0.343  0.232        0.501  0.472  0.510  0.507  0.507  0.489
GPN            0.478  0.374  0.503  0.405  0.485  0.429        0.292  0.245  0.345  0.272  0.338  0.289        0.775  0.806  0.601  0.596  0.527  0.508
AMM-GNN        0.511  0.450  0.527  0.445  0.497  0.435        0.374  0.312  0.351  0.302  0.349  0.297        0.793  0.815  0.632  0.606  0.534  0.521
G-Meta         0.475  0.363  0.498  0.382  0.465  0.363        0.353  0.301  0.350  0.299  0.346  0.285        0.782  0.807  0.608  0.600  0.529  0.512
MetaHIN        0.579  0.548  0.570  0.503  0.569  0.516        0.395  0.387  0.360  0.342  0.361  0.320        0.839  0.806  0.645  0.624  0.613  0.577
HPN            0.672  0.613  0.648  0.575  0.602  0.563        0.505  0.478  0.465  0.436  0.447  0.413        0.850  0.839  0.752  0.724  0.705  0.694
Impr           0.087  0.065  0.071  0.057  0.021  0.047        0.110  0.079  0.103  0.059  0.086  0.093        0.008  0.001  0.051  0.021  0.026  0.003
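The hub-node selection mentioned in the implementation details above (the top 5% of nodes by PageRank) can be sketched as follows; the toy networkx graph is only a homogeneous stand-in for the target-type node set.

```python
# Illustrative hub-node selection: rank nodes by PageRank and keep the top 5%.
import networkx as nx

def select_hub_nodes(graph, ratio=0.05):
    scores = nx.pagerank(graph)                           # node -> PageRank score
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:max(1, int(len(ranked) * ratio))]

hubs = select_hub_nodes(nx.karate_club_graph())
```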

5.3 Few-Shot Node Classification

To evaluate the performance of all the classification methods, we generate three few-shot tasks for each dataset: 5-shot, 3-shot, and 1-shot node classification, and report the Accuracy (ACC) and Micro-F1 (F1) of each method. The results on all few-shot node classification tasks over the three datasets are shown in Table 2. The proposed HPN consistently and significantly outperforms all the baselines. Generally speaking, the homogeneous graph few-shot approaches Meta-GNN, GPN, AMM-GNN, and G-Meta outperform the semi-supervised GCN, which can easily overfit with only a small number of labeled nodes. Clearly, adapting meta-learning into GNNs alleviates the problem caused by few samples to a certain degree and achieves better performance. It is noteworthy that semi-supervised heterogeneous GNNs such as HAN achieve higher ACC and F1-scores than graph few-shot methods such as AMM-GNN and GPN, and sometimes even than the heterogeneous method MetaHIN, demonstrating that the rich structural and semantic proximities in graphs are critical and beneficial to the node classification task. Compared with the best competitors, the 3-shot classification performance of HPN improves by 7.1%, 10.3%, and 5.1% in ACC on Aminer, IMDB, and Douban, respectively.

HAN achieves an effect similar to HPN on the 5-shot classification task of the Douban dataset, mainly because of the rich interactions between users and movies. However, the performance of HAN declines significantly on the sparse IMDB dataset due to the lack of meta-learning, while HPN can effectively extract graph structures and node labels as prior knowledge even when the graph is sparse. Notably, HPN reaps more improvement on the challenging 1-shot task thanks to its capability of adapting to new classes, which verifies the effectiveness of our heterogeneous prototype generation.

5.4 Robustness Analysis

As stated in Sect. 1, the performance of previous graph meta-learning methods is sensitive to the selection of testing classes. The meta-training task sampling of HPN aims to train a robust model that can handle different testing sets. Table 3 shows the standard deviation on the three datasets. Clearly, the performances of the previous approaches differ largely across different testing sets. In the experiments, we also notice that the performance of most methods fluctuates greatly when the experiment is conducted on different support sets, which shows that semi-supervised GNNs are highly dependent on the node selection. In contrast, HPN achieves a smaller std (< 0.05) on all three datasets, since the graph meta-sampling of HPN is able to select effective training classes from C1 by calculating the correlation score between Dtest and each class in C1.

Table 3. Robustness analysis. The standard deviation (std) is reported over all testing sets of the three datasets.

Dataset   GCN    metapath2vec  HAN    ie-HGCN  Meta-GNN  GPN    AMM-GNN  G-Meta  MetaHIN  HPN
Aminer    0.036  0.091         0.063  0.132    0.048     0.089  0.074    0.088   0.056    0.009
IMDB      0.138  0.067         0.062  0.076    0.054     0.133  0.147    0.097   0.062    0.009
Douban    0.246  0.182         0.177  0.160    0.070     0.153  0.115    0.162   0.154    0.032

[Fig. 2. Comparison of Variant Models. Three panels (Aminer, IMDB, Douban) plot ACC against the 5-shot, 3-shot, and 1-shot settings for HPN, HPN-ran, HPN-homo, and HPN-mean.]

5.5 Comparison of Variant Models

We conduct controlled experiments to verify the contribution of each component of our approach. We introduce three variant models based on HPN and report their performance on the three datasets; the parameters of the three variants are set to be the same as those of the proposed HPN. HPN-ran is a variant of HPN that employs random selection in the meta-training task sampling, without using the correlation score. HPN-homo is another simplified version of HPN that computes each class prototype as the mean vector of the nodes in the homogeneous support set. HPN-mean replaces the semantic attention during task sampling and prototype generation and only uses the adjacency matrix set A.

Figure 2 shows the ACC of the above models on the three few-shot classification tasks. We observe that HPN-ran falls far behind HPN and HPN-homo on the Aminer and Douban datasets, which proves that sampling meta-training classes with global-level knowledge through the correlation score is able to adapt the model to different testing sets more rapidly. HPN-ran performs even worse on the 1-shot node classification task, which indicates that graph meta-sampling is especially essential for more challenging tasks. HPN-homo outperforms the other variants in most cases and is only slightly worse than HPN. Without the heterogeneous subgraphs, the final class prototype contains only the implicit semantic information in the learned node embeddings, which lowers information utilization. In particular, the effectiveness of HPN-homo deteriorates rapidly in 1-shot node classification on the Aminer dataset, indicating the contribution of the induced subgraphs to alleviating the shortage of training samples. The performance of HPN-mean is clearly worse than that of HPN, which shows the superiority of the semantic importance matrix M learned by the graph structural module.

6 Conclusion

This paper formalizes the heterogeneous few-shot node classification problem and proposes a meta-learning scheme, termed Heterogeneous Graph Prototypical Networks (HPN), to solve it. Taking advantage of semantic and structural heterogeneities, we first devise a graph structural module for extracting representative node embeddings and a semantic importance matrix. A global-level meta-sampling mechanism is then designed to generate meta-training classes, taking the semantic importance matrix and the graph structure as input. Moreover, we extract heterogeneous relational subgraphs for the nodes in the support set to ensure the effectiveness of the learned class prototypes. Experimental results demonstrate that HPN significantly outperforms previous state-of-the-art methods on few-shot node classification tasks. Further ablation studies verify the effectiveness of the graph meta-sampling, the heterogeneous prototype generation, and the semantic importance matrix, respectively.


It is worth noting that our graph meta-sampling mechanism avoids sampling redundant training classes by calculating the relevance between the important hub nodes and each training class, which effectively improves the performance and robustness of the trained model. For future work, we will focus on exploring representative class descriptions for nodes and extend our approach to more challenging problems such as zero-shot node classification on heterogeneous graphs.

Acknowledgements. This work is funded by the National Key Research and Development Project (Grant No. 2022YFB2703100), the Starry Night Science Fund of Zhejiang University Shanghai Institute for Advanced Study (Grant No. SN-ZJU-SIAS-001), the Fundamental Research Funds for the Central Universities (2021FZZX001-23, 226-2023-00048), Shanghai Institute for Advanced Study of Zhejiang University, and the ZJU-Bangsun Joint Research Center.

References

1. Allen, K., Shelhamer, E., Shin, H., Tenenbaum, J.: Infinite mixture prototypes for few-shot learning. In: International Conference on Machine Learning, pp. 232–241. PMLR (2019)
2. Bhagat, S., Cormode, G., Muthukrishnan, S.: Node classification in social networks. In: Aggarwal, C. (ed.) Social Network Data Analytics. Springer, Boston, MA (2011). https://doi.org/10.1007/978-1-4419-8462-3_5
3. Bianchini, M., Gori, M., Scarselli, F.: Inside PageRank. ACM Trans. Internet Technol. (TOIT) 5(1), 92–128 (2005)
4. Borgwardt, K.M., Ong, C.S., Schönauer, S., Vishwanathan, S., Smola, A.J., Kriegel, H.P.: Protein function prediction via graph kernels. Bioinformatics 21(suppl 1), i47–i56 (2005)
5. Cao, S., Lu, W., Xu, Q.: GraRep: learning graph representations with global structural information. In: Proceedings of the 24th ACM International Conference on Information and Knowledge Management, pp. 891–900 (2015)
6. Chauhan, J., Nathani, D., Kaul, M.: Few-shot learning on graphs via super-classes based on graph spectral measures. arXiv preprint arXiv:2002.12815 (2020)
7. Ding, K., Wang, J., Li, J., Shu, K., Liu, C., Liu, H.: Graph prototypical networks for few-shot learning on attributed networks. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 295–304 (2020)
8. Dong, Y., Chawla, N.V., Swami, A.: metapath2vec: scalable representation learning for heterogeneous networks. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 135–144 (2017)
9. Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: International Conference on Machine Learning, pp. 1126–1135. PMLR (2017)
10. Fu, T.V., Lee, W.C., Lei, Z.: Hin2vec: explore meta-paths in heterogeneous information networks for representation learning. In: Proceedings of the 2017 ACM Conference on Information and Knowledge Management, pp. 1797–1806 (2017)
11. Grover, A., Leskovec, J.: node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 855–864 (2016)
12. Hamilton, W., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. In: Advances in Neural Information Processing Systems, pp. 1024–1034 (2017)


13. Hu, Z., Dong, Y., Wang, K., Sun, Y.: Heterogeneous graph transformer. In: Proceedings of The Web Conference 2020, pp. 2704–2710 (2020)
14. Huang, K., Zitnik, M.: Graph meta learning via local subgraphs. Adv. Neural. Inf. Process. Syst. 33, 5862–5874 (2020)
15. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016)
16. Lan, L., Wang, P., Du, X., Song, K., Tao, J., Guan, X.: Node classification on graphs with few-shot novel labels via meta transformed network embedding. arXiv preprint arXiv:2007.02914 (2020)
17. Liu, Z., Fang, Y., Liu, C., Hoi, S.C.: Relative and absolute location embedding for few-shot node classification on graph
18. Lu, Y., Fang, Y., Shi, C.: Meta-learning on heterogeneous information networks for cold-start recommendation. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1563–1573 (2020)
19. Munkhdalai, T., Yu, H.: Meta networks. In: International Conference on Machine Learning, pp. 2554–2563. PMLR (2017)
20. Ravi, S., Larochelle, H.: Optimization as a model for few-shot learning (2016)
21. Shi, C., Hu, B., Zhao, W.X., Philip, S.Y.: Heterogeneous information network embedding for recommendation. IEEE Trans. Knowl. Data Eng. 31(2), 357–370 (2018)
22. Shi, C., Li, Y., Zhang, J., Sun, Y., Philip, S.Y.: A survey of heterogeneous information network analysis. IEEE Trans. Knowl. Data Eng. 29(1), 17–37 (2016)
23. Snell, J., Swersky, K., Zemel, R.S.: Prototypical networks for few-shot learning. arXiv preprint arXiv:1703.05175 (2017)
24. Sung, F., Yang, Y., Zhang, L., Xiang, T., Torr, P.H., Hospedales, T.M.: Learning to compare: relation network for few-shot learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1199–1208 (2018)
25. Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: LINE: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077 (2015)
26. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017)
27. Wang, N., Luo, M., Ding, K., Zhang, L., Li, J., Zheng, Q.: Graph few-shot learning with attribute matching. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 1545–1554 (2020)
28. Wang, X., Ji, H., Shi, C., Wang, B., Ye, Y., Cui, P., Yu, P.S.: Heterogeneous graph attention network. In: The World Wide Web Conference, pp. 2022–2032 (2019)
29. Wang, Y., Yao, Q., Kwok, J.T., Ni, L.M.: Generalizing from a few examples: a survey on few-shot learning. ACM Comput. Surv. 53(3) (2020)
30. Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., Philip, S.Y.: A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems (2020)
31. Yang, Y., Guan, Z., Li, J., Huang, J., Zhao, W.: Interpretable and efficient heterogeneous graph convolutional network. arXiv preprint arXiv:2005.13183 (2020)
32. Yoon, S.W., Seo, J., Moon, J.: TapNet: neural network augmented with task-adaptive projection for few-shot learning. In: International Conference on Machine Learning, pp. 7115–7123. PMLR (2019)
33. Yun, S., Jeong, M., Kim, R., Kang, J., Kim, H.J.: Graph transformer networks. In: Advances in Neural Information Processing Systems, pp. 11983–11993 (2019)


34. Zhang, C., Song, D., Huang, C., Swami, A., Chawla, N.V.: Heterogeneous graph neural network. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 793–803 (2019)
35. Zhang, D., Yin, J., Zhu, X., Zhang, C.: Network representation learning: a survey. IEEE Transactions on Big Data 6(1), 3–28 (2018)
36. Zhang, Q., Wu, X., Yang, Q., Zhang, C., Zhang, X.: HG-Meta: graph meta-learning over heterogeneous graphs, pp. 397–405
37. Zhang, S., Zhou, Z., Huang, Z., Wei, Z.: Few-shot classification on graphs with structural regularized GCNs (2018)
38. Zhou, F., Cao, C., Zhang, K., Trajcevski, G., Zhong, T., Geng, J.: Meta-GNN: on few-shot node classification in graph meta-learning. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 2357–2360 (2019)

Improving Deep Learning Powered Auction Design

Shuyuan You1(B), Zhiqiang Zhuang1, Haiying Wu1, Kewen Wang2, and Zhe Wang2

1 Tianjin University, Tianjin 300354, China
{tjuysy,zhuang,haiyingwu}@tju.edu.cn
2 Griffith University, Brisbane, QLD 4111, Australia
{k.wang,zhe.wang}@griffith.edu.au

Abstract. Designing incentive-compatible and revenue-maximizing auctions is pivotal in mechanism design. Often referred to as optimal auction design, the area has seen little theoretical breakthrough since Myerson's seminal work in 1981. Leaving aside general combinatorial auctions, we do not even know the optimal auction for selling as few as two distinct items to more than one bidder. In recent years, the stagnation of theoretical progress has prompted many to use deep learning models to find near-optimal auction mechanisms. In this paper, we provide two general methods to improve such deep learning models. Firstly, we propose a new data sampling method that achieves better coverage and utilisation of the possible data. Secondly, we propose a more fine-grained neural network architecture. Unlike existing models, which output a single payment percentage for each bidder, the refined network outputs a separate payment percentage for each item. Such an item-wise approach captures the interaction among bidders at a granular level beyond previous models. We conducted comprehensive and in-depth experiments to test our methods and observed improvement in all tested models over their original design. Notably, we achieved state-of-the-art performance by applying our methods to an existing model.

Keywords: Auction · Mechanism Design · Deep Learning

1 Introduction

Designing revenue-maximizing and incentive-compatible auction mechanisms is an important topic in algorithmic game theory. The design task is commonly termed optimal auction design. Being able to identify the optimal auction in different settings is critical for many applications such as online advertising auctions [14] and spectrum auctions [5]. Myerson [15] solved the problem of optimal auction design for the single-item setting.

Unfortunately, forty years after the publication of Myerson's seminal work, designing optimal auctions for the multi-item setting is still largely an open problem. Throughout the years, scholars have obtained partial characterizations and approximations of the optimal auctions, and exact optimal auctions only in some very simple settings [1,2,4,11,16,22,23]. But even now we do not know the optimal auction for selling two distinct items to more than one bidder.

Due to the apparent difficulty of manually designing optimal auctions, many scholars have attempted to formulate the task as a learning problem and use machine learning techniques to train a model that approximates optimal auctions. As pioneers in applying deep learning to optimal auction design, Dütting et al. [9] model an auction as a multi-layer neural network named RegretNet. Because of the back-propagation and gradient descent process of neural networks, achieving strict incentive compatibility is extremely hard, if not impossible. For this reason, they relax the incentive compatibility constraint and quantify the violation of the constraint by the bidders' ex-post regret (i.e., the maximum utility increase achievable by misreporting). RegretNet achieves near-optimal results in several known settings and obtains new mechanisms for settings with no known analytical solution. Its design features, namely using regret as a relaxation of the incentive compatibility constraint, separating allocation and payment, and outputting a payment percentage rather than the actual payment, are adopted by most follow-up works.

Despite the acceptable performance of RegretNet and its follow-ups, we identify at least two aspects of their general design that can be improved. Firstly, existing models mostly sample their data, that is, the bidding profiles, from continuous uniform distributions over [0, 1]. This means the precision of the bids is on the scale of 10^-7 on the computer we used for testing. Such unnecessarily high precision gives rise to an astronomical number of possible bid profiles. On the one hand, the bids are unrealistic; on the other hand, the neural network is forced to distinguish bids that differ on a scale as small as 10^-7. In any real-world auction mechanism, such bids are treated the same, and when they are not, the tiny discrepancy may have a ripple effect that degrades the trained neural network model. A more sophisticated approach is to specify more accurately the precision of bids that is appropriate for the auction environment. We therefore propose a discrete distribution sampling method that makes the required precision explicit. Empirical evidence suggests that this method enables better coverage of the data, which leads to better performance than the existing sampling method.

Secondly, to calculate the payment, a single payment percentage is generated for each bidder. At first sight, this is a natural choice. Implicit in it, however, is that all allocated items, despite their differences, are charged the same payment percentage. This design therefore ignores the different levels of competition between different items; as a result, the neural network cannot fully model the delicate interaction among the bidders competing for items of varying desirability. We propose to refine the payment network in an item-wise manner, so that for each bidder the network generates a separate payment percentage for each item.

Empirical evidence confirms the superiority of this more fine-grained design of the payment network.

The aforementioned empirical evidence is gathered through comprehensive and in-depth experiments. We applied our methods to existing models on a wide range of auction settings and found that they consistently achieve higher returns and lower regret than the baseline models. In fact, when combining our methods with RegretFormer, we achieve state-of-the-art performance. It should be noted that our methods are not simply more advanced deep learning techniques; they are effective mostly because they better capture the essence of optimal auction design. Moreover, the methods are not restricted to the models we have tested: they are applicable to all auction models trained through neural networks, and we believe they should be a standard design choice for such models. Finally, we point out that in real-world spectrum auctions the bidding runs into billions, so a 1% increase in revenue means millions more in income for the seller; a revenue increase of a mere 1% therefore already makes a huge impact.

2 Related Works

RegretNet [9] is the pioneering model that uses neural networks to find near-optimal auction mechanisms. It approximates well all known optimal auctions and achieves acceptable performance in settings where the optimal auction is unknown. RegretNet encodes the allocation and payment rules of an auction mechanism as an allocation network and a payment network, respectively, both of which are fully connected feed-forward networks. After being fed a bidding profile, the allocation network outputs the probability of allocating each item to each bidder, and the payment network outputs a payment percentage for each bidder from which the payment can be calculated. Most follow-up works adopted this design.

ALGNet [20] is a refinement of RegretNet that streamlines the training process. Following [3], the authors propose to evaluate auction mechanisms by the score √P − √R, where P is the expected revenue and R the expected regret. This leads to a loss function that is time-independent and hyperparameter-free, which avoids the expensive hyperparameter search in RegretNet. Furthermore, the authors introduce a misreport network that computes the misreports maximizing the bidders' utility. This design avoids the inner optimization loop over each valuation profile in RegretNet. With these refinements, ALGNet yields comparable or better results than RegretNet while circumventing the need for an expensive hyper-parameter search and a tedious optimization task. Due to the hassle-free and efficient training process of ALGNet, we conduct our experiments on ALGNet rather than RegretNet.

RegretFormer [12] is one of the latest neural network models. For the auction settings tested in [12], RegretFormer either outperforms existing models or produces comparable performance. The authors propose a new network architecture that involves attention layers, and a new loss function with the facility to pre-define a regret budget. The input and output of RegretFormer are the same as those of RegretNet. An exchangeable layer transforms each bid into a vector containing information about the other bids, and the main innovation is item-wise and participant-wise self-attention layers over the feature vector of each bid. Our experiments confirm that combining our methods with RegretFormer achieves state-of-the-art performance.

After RegretNet, many other follow-up works have made valuable contributions. Feng et al. [10] extended RegretNet to deal with private budget constraints. Kuo et al. [13] introduced the notion of fairness. Peri et al. [17] introduced human preferences into auction design and encode them with exemplars of desirable allocations. Rahme et al. [19] proposed a network architecture capable of recovering permutation-equivariant optimal mechanisms. Curry et al. [7] revisited the problem of learning affine maximizer auctions and proposed an architecture that is perfectly strategyproof. Shen et al. [21] and Dütting et al. [9] focus on architectures that are perfectly strategyproof but restricted to one bidder. Curry et al. [6] made several modifications to the RegretNet architecture to obtain a more accurate regret. Qin et al. [18] articulated the advantages of permutation-equivariant neural networks. Duan et al. [8] incorporated public contextual information about bidders and items into the auction learning framework.

3 Auction Design Through Deep Learning

In this paper, we consider auctions with n bidders N = {1, 2, ..., n} and m items M = {1, 2, ..., m}. Each bidder i ∈ N has a valuation function v_i : 2^M → R_{≥0}; for simplicity, we write v_i({j}) as v_i(j). We focus on additive valuation functions, that is, a bidder i's valuation for a set S ⊆ M of items is the sum of the valuations for the items in S: formally, v_i(S) = Σ_{j∈S} v_i(j) for all S ⊆ M. Bidder i's valuation function is drawn from a distribution F_i over the possible valuation functions V_i, and we represent the function by the vector of its values. We denote the set of all possible valuation profiles by V = Π_{i=1}^{n} V_i (Cartesian product), and v = (v_1, v_2, ..., v_n) ∈ V is a valuation profile. We write F = (F_1, F_2, ..., F_n) for the corresponding distributions. The auctioneer knows F but not the actual valuation functions.

An auction mechanism (g, p) is a combination of allocation rules g_i : V → [0, 1]^M and payment rules p_i : V → R_{≥0} for 1 ≤ i ≤ n. Given a bid profile b = (b_1, b_2, ..., b_n) ∈ V and a valuation v_i, bidder i's utility is u_i(v_i, b) = v_i(g_i(b)) − p_i(b). Bidders may misreport their valuations in order to maximize utility. For a valuation profile v, we write v_{−i} for the valuation profile without v_i, and similarly for b_{−i} and V_{−i}. An auction (g, p) is dominant strategy incentive compatible (DSIC) if a bidder's utility is maximized by reporting truthfully no matter what the others do, that is, u_i(v_i, (v_i, b_{−i})) ≥ u_i(v_i, b) for all i, v_i, and b. An auction is individually rational (IR) if reporting truthfully always results in non-negative utility, that is, u_i(v_i, (v_i, b_{−i})) ≥ 0 for all v_i and b_{−i}. In a DSIC auction, since reporting truthfully maximizes utility, the revenue of the auction on the valuation profile v is Σ_{i=1}^{n} p_i(v). The purpose of optimal auction design is to identify a DSIC and IR auction that maximizes the expected revenue.


Due to the difficulty of designing optimal auctions manually, many have attempted to find approximately optimal auctions through deep learning. Given a class of auctions (g^ω, p^ω) parameterized by ω ∈ R^d and a valuation distribution F, the goal is to identify an auction that minimizes the negated expected revenue −E_F[Σ_{i=1}^{n} p^ω_i(v)] while satisfying DSIC and IR as much as possible. For the DSIC part, following [9], the common practice is to use the expected ex-post regret as a quantifiable relaxation. The regret measures to what extent an auction mechanism violates the DSIC property. Specifically, a bidder's regret is the maximum utility increase through misreporting while keeping the other bids fixed. Formally, the expected regret of bidder i under the parameterisation ω is

\[ rgt_i(\omega) = \mathbb{E}\Big[\max_{v'_i \in V_i} u^\omega_i\big(v_i, (v'_i, v_{-i})\big) - u^\omega_i\big(v_i, (v_i, v_{-i})\big)\Big]. \]

The expected ex-post regret is approximated by the empirical one as follows:

\[ \widehat{rgt}_i(\omega) = \frac{1}{L} \sum_{\ell=1}^{L} \max_{v'_i \in V_i} \Big( u^\omega_i\big(v^{(\ell)}_i, (v'_i, v^{(\ell)}_{-i})\big) - u^\omega_i\big(v^{(\ell)}_i, v^{(\ell)}\big) \Big). \]
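As an illustration of these quantities, the sketch below evaluates a bidder's utility and an empirical regret estimate for an additive auction, searching over a finite grid of candidate misreports. The `mechanism` callable is a placeholder for a learned (g^ω, p^ω) and is an assumption of this sketch, not part of the paper.

```python
# Sketch of u_i and the empirical ex-post regret for an additive auction.
# `mechanism(bids)` is assumed to return (alloc [n, m], pay [n]).
import numpy as np

def utility(i, true_vals, bids, mechanism):
    alloc, pay = mechanism(bids)
    return float(alloc[i] @ true_vals[i]) - float(pay[i])

def empirical_regret(i, profiles, mechanism, misreport_grid):
    # mean over sampled profiles of max_{v'_i} u_i(v_i,(v'_i,v_-i)) - u_i(v_i,v)
    regrets = []
    for v in profiles:                      # v: [n, m] true valuation profile
        truthful = utility(i, v, v, mechanism)
        best = truthful
        for v_mis in misreport_grid:        # candidate misreports for bidder i
            b = v.copy()
            b[i] = v_mis
            best = max(best, utility(i, v, b, mechanism))
        regrets.append(best - truthful)
    return float(np.mean(regrets))
```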

The learning problem for optimal auction design then boils down to minimizing the empirical negated revenue subject to the empirical ex-post regret being zero:

\[ \min_{\omega \in \mathbb{R}^d} \; -\frac{1}{L} \sum_{\ell=1}^{L} \sum_{i=1}^{n} p^\omega_i\big(v^{(\ell)}\big) \quad \text{s.t.} \quad \widehat{rgt}_i(\omega) = 0, \;\; \forall i \in \{1, \ldots, n\}. \]
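The sketch below evaluates this empirical objective: the negated mean revenue over L sampled profiles together with the per-bidder empirical regrets. How the zero-regret constraint is enforced during training (for example through penalty or Lagrangian terms) depends on the specific model and is deliberately left out.

```python
# Evaluate the empirical training objective: negated mean revenue plus the
# per-bidder regret terms that the constraint drives towards zero.
import numpy as np

def empirical_objective(profiles, mechanism, regret_fn):
    # profiles: [L, n, m]; mechanism(b) -> (alloc [n, m], pay [n])
    neg_revenue = -float(np.mean([mechanism(v)[1].sum() for v in profiles]))
    n_bidders = profiles[0].shape[0]
    regrets = [regret_fn(i, profiles, mechanism) for i in range(n_bidders)]
    return neg_revenue, regrets
```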

4 Methods

In this section, we introduce and elaborate on our methods to improve deep learning powered auction design. The first method is about a new data sampling process and the second one is about a new architecture of the payment network. We want to emphasize that these methods are not only applicable to RegretNet, ALGNet and RegretFormer, but to all auction mechanisms based on neural networks that compute the payment through a payment percentage.

4.1 Discrete Distribution Sampling

Our first method concerns how best to sample the bidders' valuations for training the neural network. We show empirically that, instead of a continuous distribution, sampling over a discrete distribution (with an appropriate support) consistently leads to more revenue and less regret. We term the method discrete distribution sampling (DDS). Specifically, DDS is the process of discretizing a continuous distribution into a discrete one and then sampling data over the discrete distribution. For example, given a continuous uniform distribution over the interval [0, 1], we can discretize it into a discrete one with the support {0.0, 0.1, ..., 1.0}: instead of an infinite number of possible values, sampling over the discrete distribution gives 11 possible values.

Apart from the empirical evidence, we can appreciate the superiority of DDS from two further perspectives. Firstly, it models a bidder's valuations in real-world auctions more accurately. While buying something, people care less about the lower-value digits of the price than the higher ones. If a valuation is sampled from a continuous distribution, the precision of the sampled value is too high to be realistic. For example, if a bidder values an auctioned item at around a million dollars, the bidder could not care less whether it is a million dollars and 20 cents or a million dollars and 40 cents. It is neither practical nor reasonable to differentiate bidders whose valuations differ by a few cents; they ought to be treated the same, and when they are not, the minor differences may have a negative effect during training. DDS avoids such issues by specifying the appropriate precision of values through the support of a discrete distribution.

Secondly, DDS is more effective from the training perspective. In our experiments, the smallest number the computer can represent is roughly 10^-7, so sampling from a continuous distribution over [0, 1] leads to about 10^7 possible values for each bidder and each auctioned item. This is a huge number of values, a large portion of which are redundant or irrelevant: in a real-world auction, valuation differences on the scale of 10^-7 make no difference to the auction outcome or the bidders' payoffs. Alternatively, sampling from a discrete distribution over {0.0, 0.1, ..., 1.0} leads to merely 11 possible values for each bidder and each item. DDS therefore effectively reduces the space of possible values while preserving most of the essential ones, the ones that matter to the bidders and the seller. Given an upper bound on the number of sampled data points that the computational resources allow, and considering the ratio of the number of sampled data points to the number of all possible ones, DDS gives a much higher ratio and thus provides much better coverage of the possible data.

Now that we have explained and justified the inner mechanism of DDS at an intuitive level, we elaborate on how to use DDS in practice, and more specifically, how to determine the appropriate support of the discrete distribution. Obviously, the values in the support have to be evenly spaced to eliminate bias towards any particular value, so the next question is how large the spacing between values should be. The larger the spacing, the fewer possible values there are, which means, for a fixed number of sampled data points, a better ratio of data coverage. Intuitively, when the auction setting is more complex, that is, has more bidders and items, the corresponding optimal mechanism is also more complex; hence we need a more sophisticated neural network to approximate it, and the training of such a network clearly requires better coverage of the data. Our experiments verify this intuition: for the best performance, the size of the support should be inversely proportional to the complexity of the auction. Essentially, this makes the size of the support another hyper-parameter for neural network training.
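A minimal sketch of DDS follows: bid profiles are drawn from an evenly spaced grid rather than a continuous U[0, 1], with the grid step acting as the extra hyper-parameter discussed above. The function name and defaults are illustrative, not taken from the paper's code.

```python
# Discrete distribution sampling (DDS) sketch: draw valuations from an evenly
# spaced grid over [0, 1] instead of a continuous uniform distribution.
import numpy as np

def sample_bid_profiles(n_profiles, n_bidders, n_items, step=0.1, seed=0):
    rng = np.random.default_rng(seed)
    support = np.round(np.arange(0.0, 1.0 + 1e-9, step), 10)  # {0.0, 0.1, ..., 1.0}
    return rng.choice(support, size=(n_profiles, n_bidders, n_items))

profiles = sample_bid_profiles(n_profiles=4, n_bidders=3, n_items=10)
```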

4.2 Item-Wise Payment Network

Following RegretNet, most deep-learning-powered auction approaches opt for a design that consists of an allocation network and a payment network, handling the allocation and the payment of an auction respectively. Our second method refines the payment network to make it more fine-grained. To ensure IR, existing payment networks output a payment percentage $\tilde{p}_i$ for each bidder $i$, from which $i$'s payment $p_i$ is determined as

$$p_i = \tilde{p}_i \sum_{j=1}^{m} z_{ij} b_{ij},$$

where $z_{ij}$ is the probability of allocating item $j$ to bidder $i$ and $b_{ij}$ is $i$'s valuation of item $j$. IR is guaranteed because a bidder's payment is always a fraction of its expected gain. This is a simple yet effective design, but we can do better: instead of a single payment percentage for each bidder, why not itemise the percentage for each auctioned item? We refine the payment network to output a payment percentage $\tilde{p}_{ij}$ for each bidder $i$ and each item $j$, such that bidder $i$'s payment is

$$p_i = \sum_{j=1}^{m} \tilde{p}_{ij} z_{ij} b_{ij},$$

where $\sum_{j=1}^{m} \tilde{p}_{ij} \le 1$ for all $i$. We refer to such a payment network as an item-wise payment network (IPN). It is easy to see that IPN guarantees IR as the payment never exceeds the expected gain. We show empirically that IPN consistently leads to higher revenue and lower regret.

Actually, it is not hard to be convinced of the superiority of IPN even without the empirical evidence. To begin with, IPN has more representation power, which means it is capable of representing fine details of an auction mechanism that the original design cannot. Thus, given sufficient training data and computational resources, we expect performance at least on par with the original design, if not better. The increase in representation power puts IPN in a better position, but more importantly, IPN captures more accurately the interaction between the bidders competing for each item. In the original design, a bidder is charged a bundled price for all allocated items, so implicitly the payment percentage for each of these items is the same. Clearly, this design ignores the differences between the items and the different levels of competition for them. By specifying an item-wise payment for each allocated item, IPN takes such differences and competition into account while determining the auction outcome, ultimately leading to a better approximation of the optimal mechanism.
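The following is a minimal NumPy sketch contrasting the bundled payment rule with the item-wise one defined above. The function names and shapes are our own illustration, not the authors' implementation; the constraint that each bidder's item-wise fractions sum to at most 1 is what keeps the payment below the expected gain.

```python
import numpy as np

def bundled_payment(p_tilde, z, b):
    """Original design: one payment fraction per bidder.

    p_tilde: (n,) payment fractions in [0, 1]
    z:       (n, m) allocation probabilities
    b:       (n, m) bids/valuations
    """
    return p_tilde * (z * b).sum(axis=1)

def itemwise_payment(p_tilde_ij, z, b):
    """IPN: one payment fraction per bidder-item pair.

    p_tilde_ij: (n, m) fractions with p_tilde_ij.sum(axis=1) <= 1,
    which keeps each payment below the bidder's expected gain (IR).
    """
    return (p_tilde_ij * z * b).sum(axis=1)

n, m = 3, 10
rng = np.random.default_rng(0)
z, b = rng.random((n, m)), rng.random((n, m))
print(bundled_payment(np.full(n, 0.5), z, b))
print(itemwise_payment(np.full((n, m), 1.0 / m), z, b))
```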

5 Experiments

To verify the effectiveness of our methods, we apply them to ALGNet and RegretFormer. From here forward, we represent an auction setting as n ∗ m, where n is the number of bidders and m is the number of items. We observe improved or comparable performance across all tested auction settings for both ALGNet and RegretFormer.

5.1 ALGNet

We first test our methods on ALGNet, which is referred to as the baseline model in this section. As mentioned, we picked ALGNet instead of RegretNet for its more streamlined and efficient training process. Since existing models already achieve excellent results in some simple auction environments, we focus on the auction settings of 3 ∗ 10 and 5 ∗ 10, with valuations sampled from the uniform distribution over [0, 1]. To ensure fairness, we add the code for our methods to the code provided in [20] while keeping the original code, hyperparameters and random seed unchanged. We present the main experimental results in Table 1, which shows the changes in revenue and regret with or without our methods. Since ALGNet is trained by the evaluation metric $\sqrt{P} - \sqrt{R}$, for fairness we also report how the $\sqrt{P} - \sqrt{R}$ score changes.

Table 1. Results for applying DDS and IPN separately and simultaneously to ALGNet for the settings of 3 ∗ 10 and 5 ∗ 10.

Setting | Model     | Revenue         | Regret                        | $\sqrt{P} - \sqrt{R}$
3 ∗ 10  | Baseline  | 5.562 (±0.0308) | 1.93 ∗ 10^-3 (±0.33 ∗ 10^-3)  | 5.20888
3 ∗ 10  | DDS       | 5.580 (±0.0229) | 1.60 ∗ 10^-3 (±0.21 ∗ 10^-3)  | 5.25749
3 ∗ 10  | IPN       | 5.717 (±0.0418) | 1.44 ∗ 10^-3 (±0.08 ∗ 10^-3)  | 5.40701
3 ∗ 10  | IPN & DDS | 5.731 (±0.0097) | 1.43 ∗ 10^-3 (±0.08 ∗ 10^-3)  | 5.42169
5 ∗ 10  | Baseline  | 6.781 (±0.0504) | 3.85 ∗ 10^-3 (±0.43 ∗ 10^-3)  | 6.07766
5 ∗ 10  | DDS       | 6.747 (±0.0276) | 2.85 ∗ 10^-3 (±0.33 ∗ 10^-3)  | 6.14111
5 ∗ 10  | IPN       | 6.852 (±0.0369) | 3.95 ∗ 10^-3 (±1.01 ∗ 10^-3)  | 6.13601
5 ∗ 10  | IPN & DDS | 6.881 (±0.0480) | 3.59 ∗ 10^-3 (±0.45 ∗ 10^-3)  | 6.19558

We begin with the auction setting of 3 ∗ 10. Applying only DDS to the baseline, we found a 17% decrease in regret, a similar revenue, and a 0.93% increase in the $\sqrt{P} - \sqrt{R}$ score. Applying only IPN to the baseline, we found a 2.79% increase in revenue, a 25% decrease in regret, and a 3.80% increase in the $\sqrt{P} - \sqrt{R}$ score. Applying both DDS and IPN to the baseline, we found a 3.04% increase in revenue, a 26% decrease in regret, and a 4.08% increase in the $\sqrt{P} - \sqrt{R}$ score.

Moving on to the setting of 5 ∗ 10, applying only DDS to the baseline, we found a 26% decrease in regret, a similar revenue, and a 1.04% increase in the $\sqrt{P} - \sqrt{R}$ score. Applying only IPN to the baseline, we found a 2.80% increase in revenue, a similar regret, and a 0.96% increase in the $\sqrt{P} - \sqrt{R}$ score. Applying both DDS and IPN to the baseline, we found a 1.47% increase in revenue, a 7% decrease in regret, and a 1.94% increase in the $\sqrt{P} - \sqrt{R}$ score.

It is clear from the results that both DDS and IPN improve the performance of ALGNet in terms of more revenue and less regret, and we achieve the best results when both methods are used. Looked at separately, the results show that the main effect of DDS is to reduce regret, while the main effect of IPN is to improve revenue.

As we mentioned when introducing DDS, the spacing between sampled values is a hyperparameter that has to be tuned for the best results. The results in Table 1 are obtained with the best hyperparameter setting for DDS. We now dig into the tuning process to show how it works and, more importantly, how the spacing affects the end results; in fact, it can have a negative effect if it is not tuned properly.

In Fig. 1, we plot the $\sqrt{P} - \sqrt{R}$ score against the spacing used in DDS for the auction setting of 5 ∗ 10, both with and without IPN. For example, a spacing of 0.1 means the support of the discrete distribution is {0, 0.1, 0.2, . . . , 1}. We also indicate the $\sqrt{P} - \sqrt{R}$ score of ALGNet by a grey dotted line. The plot shows that the $\sqrt{P} - \sqrt{R}$ score is at its peak when the spacing is 0.1, after which increasing the spacing decreases the score. When the spacing is increased to 0.15, using DDS alone actually results in a worse score than that of the baseline.

Fig. 1. The $\sqrt{P} - \sqrt{R}$ score against the spacing between consecutive sampling values for the setting of 5 ∗ 10.

Similarly, in Fig. 2 we plot the curves for the auction setting of 3 ∗ 10. In this setting, applying DDS, with or without IPN, achieves better scores than the baseline. We also notice that, in this simpler setting, the spacing used in DDS has only a minor effect on performance.

We have analysed the effect of the DDS spacing through the $\sqrt{P} - \sqrt{R}$ score because we need a single measure of overall performance covering both revenue and regret, and the $\sqrt{P} - \sqrt{R}$ score has been verified to be such a measure in ALGNet and in our experiments reported in Table 1.


Fig. 2. The $\sqrt{P} - \sqrt{R}$ score against the spacing between consecutive sampling values for the setting of 3 ∗ 10.

So, for DDS we have to tune the spacing for the best performance. Do we have to do such tuning for IPN as well? In IPN we propose to give a payment percentage for each item, whereas the original design gives a single percentage for all items. What about intermediate settings? What happens if, say, we give a percentage for every pair of items? To answer this, we conducted experiments for both the 3 ∗ 10 and 5 ∗ 10 settings, with and without DDS, in which we output 5 payment percentages for each bidder, that is, one percentage for every two items rather than for every single item. We report the results in Table 2; for ease of comparison, we copied some results from Table 1. We find that as we increase the number of payment percentages, the revenue keeps increasing, and the regret and the $\sqrt{P} - \sqrt{R}$ score also improve in most settings. The results indicate that, in general, increasing the granularity leads to better performance. In conclusion, IPN needs no tuning because it is already as granular as it gets.

5.2 RegretFormer

We also test our methods on RegretFormer which, to the best of our knowledge, surpasses other models in terms of revenue for many auction settings. Again, to ensure fairness, we did not adjust the hyperparameters or the random seed of the original paper and made minimal changes to the original code. Throughout our experiments, the regret budget is set to $R_{max} = 10^{-3}$. In the original paper, RegretFormer is tested both in settings in which the optimal mechanism is known and in settings in which it is unknown. Due to its extremely high computational cost in training and testing, RegretFormer is tested in relatively simple auction settings, the most complex being 3 ∗ 10. We repeat most of these tests after incorporating DDS and IPN into the original model.


Table 2. Results for applying IPN with varying numbers of payment percentages.

Setting | Model    | Number | Revenue         | Regret                        | $\sqrt{P} - \sqrt{R}$
3 ∗ 10  | Baseline | 1      | 5.562 (±0.0308) | 1.93 ∗ 10^-3 (±0.33 ∗ 10^-3)  | 5.20888
3 ∗ 10  | Baseline | 5      | 5.687 (±0.0353) | 2.12 ∗ 10^-3 (±0.26 ∗ 10^-3)  | 5.31299
3 ∗ 10  | Baseline | 10     | 5.717 (±0.0418) | 1.44 ∗ 10^-3 (±0.08 ∗ 10^-3)  | 5.40701
3 ∗ 10  | DDS      | 1      | 5.587 (±0.0465) | 1.65 ∗ 10^-3 (±0.59 ∗ 10^-3)  | 5.25935
3 ∗ 10  | DDS      | 5      | 5.646 (±0.0412) | 1.74 ∗ 10^-3 (±0.11 ∗ 10^-3)  | 5.30787
3 ∗ 10  | DDS      | 10     | 5.731 (±0.0097) | 1.43 ∗ 10^-3 (±0.08 ∗ 10^-3)  | 5.42169
5 ∗ 10  | Baseline | 1      | 6.781 (±0.0504) | 3.85 ∗ 10^-3 (±0.43 ∗ 10^-3)  | 6.07766
5 ∗ 10  | Baseline | 5      | 6.790 (±0.0355) | 3.26 ∗ 10^-3 (±0.36 ∗ 10^-3)  | 6.14093
5 ∗ 10  | Baseline | 10     | 6.852 (±0.0369) | 3.95 ∗ 10^-3 (±1.01 ∗ 10^-3)  | 6.13601
5 ∗ 10  | DDS      | 1      | 6.747 (±0.0276) | 2.85 ∗ 10^-3 (±0.33 ∗ 10^-3)  | 6.14111
5 ∗ 10  | DDS      | 5      | 6.829 (±0.0990) | 3.45 ∗ 10^-3 (±0.86 ∗ 10^-3)  | 6.15909
5 ∗ 10  | DDS      | 10     | 6.881 (±0.0480) | 3.59 ∗ 10^-3 (±0.45 ∗ 10^-3)  | 6.19558

We begin with the following 1 ∗ 2 settings, for which analytical solutions exist:

(A) A single bidder whose valuations of the two items, v1 and v2, are drawn from a continuous uniform distribution over [0, 1].
(B) A single bidder whose valuations of the two items, v1 and v2, are drawn from continuous uniform distributions over [4, 16] and [4, 7] respectively.

Table 3. Applying DDS and IPN to RegretFormer for the auction settings (A) and (B).

Setting | Optimal Revenue | RegretFormer Revenue | RegretFormer Regret | DDS & IPN Revenue | DDS & IPN Regret
(A)     | 0.550           | 0.571                | 0.00075             | 0.572             | 0.00069
(B)     | 9.781           | 10.056               | 0.0099              | 10.071            | 0.0080

The results are reported in Table 3. We find that our methods reduce the regret by 8% in setting (A) and by 19% in setting (B) while achieving similar revenue. Since these settings have known optimal solutions, we also compare the models through allocation heatmaps, as done in [12]. The heatmaps plotted in Fig. 3 show that our allocation rules are slightly closer to the optimal allocation than those of RegretFormer; we mark the areas where our allocation rules are better with black squares. We conjecture that the improvement arises because we effectively reduce the regret of the model.

Moving on to settings in which the optimal auction is unknown, namely 2 ∗ 2, 2 ∗ 3, 2 ∗ 5, and 3 ∗ 10, we report the results in Table 4. We find that our methods significantly reduce regret in most settings: by 11% in 2 ∗ 2, 29% in 2 ∗ 3 and 22% in 3 ∗ 10. We also improve the revenue in all settings, especially in the 3 ∗ 10 setting, where we obtain a 1% increase.


Fig. 3. The heatmaps represent the probability of item allocation in a 1 ∗ 2 setting: (a) and (c) correspond to applying our methods in setting (A), whereas (b) and (d) correspond to the original model in setting (A); (e) and (g) correspond to applying our methods in setting (B), whereas (f) and (h) correspond to the original model in setting (B). The solid regions describe the probability of the bidder obtaining item 1 under different valuation inputs. The optimal auctions are described by the regions separated by the dashed black lines, with the numbers in black giving the optimal allocation probability in each region.

Table 4. Applying DDS and IPN to RegretFormer for the auction settings of 2 ∗ 2, 2 ∗ 3, 2 ∗ 5, and 3 ∗ 10.

Setting | Model     | Revenue         | Regret
2 ∗ 2   | Baseline  | 0.908 (±0.011)  | 0.00054
2 ∗ 2   | DDS & IPN | 0.912 (±0.004)  | 0.00048
2 ∗ 3   | Baseline  | 1.416 (±0.009)  | 0.00089
2 ∗ 3   | DDS & IPN | 1.422 (±0.0164) | 0.00063
2 ∗ 5   | Baseline  | 2.453 (±0.0330) | 0.00102
2 ∗ 5   | DDS & IPN | 2.474 (±0.0203) | 0.00109
3 ∗ 10  | Baseline  | 6.121 (±0.0190) | 0.00179
3 ∗ 10  | DDS & IPN | 6.184 (±0.0130) | 0.00139

In fact, for these settings, the revenue and regret achieved by combining our methods with RegretFormer represent state-of-the-art performance. We attribute the reduction in regret mainly to DDS and the revenue improvement mainly to IPN.

6 Conclusion

In this paper, we propose two refinements to neural network models that aim to approximate an optimal auction. We first looked into the data sampling process, for which existing models unanimously draw their data from continuous distributions. By sampling through discrete distributions, the proposed DDS method gives us control over the precision of the sampled data, so that we can tune the level of data abstraction to the auction setting at hand for better performance. We then targeted the conventional design of existing models, which produces a single payment percentage for calculating payments. By giving an item-wise payment percentage, our IPN method is capable of better capturing the valuation differences between items and the different levels of competition for them. We conducted thorough experiments on ALGNet and RegretFormer to evaluate the effectiveness of our methods, and the results indicate stable and consistent improvement across all auction settings. The empirical evidence suggests that DDS and IPN should be in the toolbox of anyone aiming to design optimal auctions through deep learning. So far we have tested our methods mostly for additive valuation functions. We believe our IPN method would be even more effective when the valuation functions are more complex and intertwined; for future work, we plan to test the methods on such valuation functions.

Acknowledgements. This work was partially supported by the National Natural Science Foundation of China (NSFC) (61976153).

References

1. Alaei, S., Fu, H., Haghpanah, N., Hartline, J.D., Malekian, A.: Bayesian optimal auctions via multi- to single-agent reduction. In: Proceedings of the 13th ACM Conference on Electronic Commerce, EC 2012, Valencia, Spain, 4–8 June 2012, p. 17 (2012)
2. Babaioff, M., Immorlica, N., Lucier, B., Weinberg, S.M.: A simple and approximately optimal mechanism for an additive buyer. J. ACM (JACM) 67(4), 1–40 (2020)
3. Balcan, M.F., Blum, A., Hartline, J., Mansour, Y.: Mechanism design via machine learning. In: 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2005), pp. 605–614 (2005)
4. Cai, Y., Zhao, M.: Simple mechanisms for subadditive buyers via duality. In: Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, pp. 170–183 (2017)
5. Cramton, P.: Spectrum auction design. Rev. Ind. Organ. 42, 161–190 (2013)
6. Curry, M., Chiang, P.Y., Goldstein, T., Dickerson, J.: Certifying strategyproof auction networks. Adv. Neural. Inf. Process. Syst. 33, 4987–4998 (2020)
7. Curry, M., Sandholm, T., Dickerson, J.: Differentiable economics for randomized affine maximizer auctions. arXiv preprint arXiv:2202.02872 (2022)
8. Duan, Z., et al.: A context-integrated transformer-based neural network for auction design. In: International Conference on Machine Learning, pp. 5609–5626. PMLR (2022)
9. Dütting, P., Feng, Z., Narasimhan, H., Parkes, D., Ravindranath, S.S.: Optimal auctions through deep learning. In: International Conference on Machine Learning, pp. 1706–1715 (2019)
10. Feng, Z., Narasimhan, H., Parkes, D.C.: Deep learning for revenue-optimal auctions with budgets. In: Proceedings of the 17th International Conference on Autonomous Agents and Multiagent Systems, pp. 354–362 (2018)
11. Hart, S., Nisan, N.: Approximate revenue maximization with multiple items. J. Econ. Theor. 172, 313–347 (2017)
12. Ivanov, D., Safiulin, I., Filippov, I., Balabaeva, K.: Optimal-ER auctions through attention. In: NeurIPS (2022)
13. Kuo, K., et al.: ProportionNet: balancing fairness and revenue for auction design with deep learning. arXiv preprint arXiv:2010.06398 (2020)
14. Liu, X., et al.: Neural auction: end-to-end learning of auction mechanisms for e-commerce advertising. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 3354–3364 (2021)
15. Myerson, R.B.: Optimal auction design. Math. Oper. Res. 6(1), 58–73 (1981)
16. Pavlov, G.: Optimal mechanism for selling two goods. BE J. Theor. Econ. 11(1) (2011)
17. Peri, N., Curry, M., Dooley, S., Dickerson, J.: PreferenceNet: encoding human preferences in auction design with deep learning. Adv. Neural. Inf. Process. Syst. 34, 17532–17542 (2021)
18. Qin, T., He, F., Shi, D., Huang, W., Tao, D.: Benefits of permutation-equivariance in auction mechanisms. In: Advances in Neural Information Processing Systems (2022)
19. Rahme, J., Jelassi, S., Bruna, J., Weinberg, S.M.: A permutation-equivariant neural network architecture for auction design. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 5664–5672 (2021)
20. Rahme, J., Jelassi, S., Weinberg, S.M.: Auction learning as a two-player game. In: 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, 3–7 May 2021 (2021)
21. Shen, W., Tang, P., Zuo, S.: Automated mechanism design via neural networks. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 215–223 (2019)
22. Wang, Z., Tang, P.: Optimal mechanisms with simple menus. In: Proceedings of the Fifteenth ACM Conference on Economics and Computation, pp. 227–240 (2014)
23. Yao, A.C.C.: Dominant-strategy versus Bayesian multi-item auctions: maximum revenue determination and comparison. In: Proceedings of the 2017 ACM Conference on Economics and Computation, pp. 3–20 (2017)

Link Prediction with Simple Path-Aware Graph Neural Networks

Tuo Xu and Lei Zou(B)

Peking University, Beijing, China
{doujzc,zoulei}@pku.edu.cn

Abstract. Graph Neural Networks (GNNs) excel at node classification and graph classification, but are relatively weak at link prediction due to their limited expressiveness. Recently, two popular GNN variants, namely higher-order GNNs and the labeling trick, have been proposed to address the limitations of GNNs. Compared with plain GNNs, these variants provably capture inter-node patterns such as common neighbors, which facilitates link prediction. However, we notice that these methods suffer from two critical problems. First, their algorithmic complexity is impractical for large graphs. Second, we prove that although these methods can identify paths between target nodes, they cannot identify simple paths, which are fundamental in graph theory. To overcome these deficiencies, we systematically study the common advantages of previous link prediction GNNs and propose a novel GNN framework that retains these advantages while remaining simple and efficient. Extensive experiments show the effectiveness of our method.

Keywords: Link prediction · Graph neural networks · Simple paths

1 Introduction

Link prediction is a fundamental task in graph machine learning, and has been applied to various problems including knowledge graph completion (KGC) [25], recommender systems [15], drug discovery [11], influence detection [24], etc. Representative methods for link prediction include traditional heuristics such as Common Neighbors, Adamic-Adar [2] and Resource Allocation [43], knowledge graph embeddings such as TransE [5] and RotatE [30], and Graph Neural Networks (GNNs) [13,14]. Among them, GNNs are the fastest-growing deep learning models for learning graph representations. However, plain GNNs are relatively weak at link prediction. An example is GAE [13]: given the target link (s, t), GAE first runs a GNN to collect node representations and then predicts the link (s, t) by aggregating the representations of the nodes s and t individually. This approach has been proven insufficient for identifying even simple link properties such as common neighbors, due to the automorphic node problem [42].


Various GNN frameworks have been proposed to overcome this problem, among which the most popular are the labeling trick [40] and higher-order GNNs [21]. Given a graph G and the target link (s, t), labeling trick methods first add additional labels to s and t to mark them differently from the rest of the nodes. Consider the graph (a) in Fig. 1. To predict the link (v1, v2), we first tag the nodes v1 and v2, resulting in the graph (b). Similarly, to predict the link (v1, v3), we tag the nodes v1 and v3, resulting in the graph (c). After that, we run a GNN on the induced graph and obtain the representation of (s, t) by aggregating the node representations of s and t. Different from the labeling trick, higher-order GNNs mimic the k-Weisfeiler-Lehman (k-WL) [21] hierarchy. They use node tuples as basic message passing units, which makes them significantly more expressive than ordinary GNNs, but also results in higher algorithmic complexity.

Although previous works have extensively studied the expressive power of GNNs for link prediction, we find that two critical problems remain. First, these GNN variants suffer from much higher algorithmic complexity: compared with ordinary GCNs [14], they usually require much more time for training and inference. Second, although these GNN variants are provably more expressive than GCNs, they overlook one simple and fundamental pattern in graph theory: they are unaware of simple paths.

In this paper, we try to settle these problems within one framework. We first investigate previous GNN frameworks from the viewpoint of paths, and show that, compared with ordinary GCNs (GAEs), the labeling trick and higher-order GNNs are both aware of paths but unaware of simple paths. Then, we propose a novel GNN framework called Simple Path-aware GNN (SPGNN), which shares the common advantages of previous GNN variants, is much more efficient, and is able to distinguish simple paths. To summarize, our contributions are:

– We conduct a comprehensive analysis of previous link prediction GNN architectures through the awareness of paths. We find that none of the GNN variants is able to distinguish simple paths.
– Based on this observation, we further propose a novel GNN framework that is able to distinguish both paths and simple paths, while being as efficient as GCNs [14] and performing well on link prediction tasks.

2 Related Work

Graph Neural Networks. GNNs are widely applied to graph-related machine learning tasks. Bruna et al. [6] first extend the concept of convolutional networks to graphs. Defferrard et al. [8] remove the expensive Laplacian eigendecomposition by using Chebyshev polynomials. After that, Kipf et al. [14] propose graph convolutional networks (GCNs), which further simplify graph convolutions with a redefined message-passing scheme; at each layer, GCN propagates node representations through a normalized adjacency matrix S. Based on GCNs, Chen et al. [7] propose a more efficient GCN variant based on sampling. Wu et al. [34] remove all the non-linear parts of GCNs and obtain an extremely simple variant, namely SGC, which surprisingly shows strong performance compared with non-linear GCNs. Our method is a GNN instance specially designed for link prediction tasks.

Higher-Order GNNs. Most GNNs focus on graph classification and node classification tasks and learn representations for each node (i.e., they are node-level GNNs), so their discriminative power is often restricted to the 1-Weisfeiler-Lehman (1-WL) test [35]. Noticing the similarity between the GNN message-passing mechanism and WL tests, Morris et al. [21] propose a GNN model to parameterize the 2-WL and 3-WL tests. Similarly, Maron et al. [16] propose to parameterize the 2-FWL test with GNNs. These GNNs learn higher-order representations, i.e., representations for node tuples, which makes them inherently suitable for link prediction. However, previous higher-order GNNs suffer from much higher time and space complexity. Our method, although not an instance of higher-order GNNs, is partly inspired by their advantages.

Labeling Trick and Its Variants. To improve GNNs' performance on link prediction, Zhang et al. [40] propose SEAL, which uses a technique called the labeling trick to predict links with plain GNNs. GraIL [31] extends SEAL to heterogeneous graphs such as knowledge graphs. Zhang et al. [42] further summarize these GNN variants into a unified framework and prove its expressive power. These methods tag both the source node and the target node to be different from the rest of the nodes, thereby enhancing the GNN's discriminative power. However, they also incur higher algorithmic complexity and rely heavily on subgraph extraction to reduce the running time. Another series of works, NBFNet [44], ID-GNN [38], etc., only tag the source node and learn representations for the target nodes. Compared with the original labeling trick, these methods are more efficient and are able to learn full-graph representations. Our method, although not an instance of labeling trick methods, is partly inspired by their advantages.

Other Link Prediction Methods. Other link prediction methods include rule-based methods and knowledge graph embedding. Representative rule-based methods include earlier works MDIE [22], AMIE [10], etc., and recent neural-enhanced rule learning methods such as NeuralLP [37], DRUM [27] and RNNLogic [26]. These methods generally learn probabilistic logic rules to predict the existence of the target links. Knowledge graph embedding methods learn a distributed representation for each node and each type of edge in the graph by minimizing the distance between connected nodes; representative methods include TransE [5], DistMult [36], RotatE [30] and many other works that propose new scoring functions [9,32]. Our method is less related to these methods.

3 Preliminaries

This paper considers the problem of link prediction. Given an (incomplete) graph G = (V, E) with node set V and edge set E, we are asked to predict whether there exists an edge between any node pair (u, v).


Fig. 1. An illustration of the labeling trick methods.

Message Passing Neural Networks. Message passing neural networks (MPNNs) are a dominant class of GNNs. Let $h_u^{(l)}$ be the representation of node $u$ at layer $l$. MPNNs compute the representations at layer $l+1$ as

$$h_u^{(l+1)} = \phi\left(h_u^{(l)}, \psi\left(\{h_v^{(l)} \mid v \in N(u)\}\right)\right),$$

where $N(u)$ is the set of neighbors of $u$, and $\phi$ and $\psi$ are trainable functions. At the input layer, $h_u^{(0)}$ is initialized as the input node feature.

Graph Auto Encoders. Graph auto encoders (GAEs) [13] are perhaps the simplest method for link prediction with GNNs. To predict the link (u, v), GAEs first run a GNN to compute node representations. Then, the existence of (u, v) is computed as $p(u, v) = f(h_u, h_v)$, where $f$ is a trainable function and $h_u$, $h_v$ are the node representations of $u$, $v$ computed by the GNN. GAEs are proven to be problematic for link prediction [42]: for example, they cannot distinguish the link (v1, v2) from (v1, v3) in Fig. 1(a), because every node shares the same node representation; this is called the automorphism problem.

Labeling Trick. Noticing the defects of GAEs, Zhang et al. [42] propose the labeling trick. For clarity, we briefly introduce the node labeling for an ordered node pair. Given a graph G = (V_G, E_G) and the target link (v1, v2), we add a new label l_u to each node u in G, where l_{v1} = 1, l_{v2} = 2 and l_u = 0 for the rest of the nodes. It is proven that this approach avoids the automorphism problem of GAEs and performs better in link prediction. There are also variants of the labeling trick, among which the most popular one is used in NBFNet [44]; the only difference is that NBFNet tags only the source node v1 differently from the rest of the nodes.

Higher-Order GNNs. Higher-order GNNs mimic the k-WL paradigm [21]. We briefly introduce the Edge Transformer [4] here. Given a graph G = (V, E), it first sets up a representation $h_{uv}^{(0)}$ for each node pair (u, v) based on the input node features and the edges; there are $N^2$ such representations in total, where $N$ is the number of nodes in G. At each layer, it computes

$$h_{uv}^{(l+1)} = \phi\left(h_{uv}^{(l)}, \psi\left(\{(h_{uw}^{(l)}, h_{wv}^{(l)}) \mid w \in V\}\right)\right),$$

where $\phi$ and $\psi$ are trainable functions. At the output layer $L$, $h_{uv}^{(L)}$ is used to compute the link property of (u, v). Higher-order GNNs can also avoid the automorphism problem of GAEs.
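As a concrete illustration of the MPNN update above, the following is a minimal NumPy sketch of a single message-passing layer with sum aggregation. The function name and weight shapes are our own illustration, not any paper's implementation.

```python
import numpy as np

def mpnn_layer(h, adj, w_self, w_neigh):
    """One message-passing layer with sum aggregation.

    h:        (N, d) node representations at layer l
    adj:      (N, N) 0/1 adjacency matrix
    w_self:   (d, d') weight applied to each node's own representation
    w_neigh:  (d, d') weight applied to the aggregated neighbor messages
    """
    neigh_sum = adj @ h                      # sum of neighbor representations
    return np.tanh(h @ w_self + neigh_sum @ w_neigh)

# Toy usage on a 4-node path graph with 2-dimensional features.
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [0, 0, 1, 0]], dtype=float)
rng = np.random.default_rng(0)
h = rng.random((4, 2))
h = mpnn_layer(h, adj, rng.random((2, 2)), rng.random((2, 2)))
```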

4 Methods

In this section we provide a thorough investigation of popular GNN variants in the link prediction literature, including GAE [13], labeling trick methods (SEAL [40], GraIL [31]), variants of labeling trick methods (NBFNet [44], ID-GNN [38]), and higher-order GNNs [4,17,20,21]. Surprisingly, we find that none of these works is capable of detecting simple paths between the target nodes. We then propose an efficient GNN architecture which overcomes this issue.

4.1 GAEs Are Not Good Enough: Awareness of Paths

Empirically, GAE methods perform poorly on link prediction tasks compared with the other GNN variants (labeling trick, higher-order GNNs). We first try to dissect the core functionality that enables these variants to surpass GAE. It is known [42] that the automorphic node problem is the most severe problem hindering GAEs from obtaining useful link representations. For example, in Fig. 1 all nodes are isomorphic to each other, so GAEs cannot distinguish the link (v1, v2) from (v1, v3). However, in most situations we can avoid this problem by simply considering the paths between the target nodes: in Fig. 1 the paths between (v1, v2) and those between (v1, v3) are clearly different. This inspires us to investigate whether the GNN variants can capture path information and thus avoid the automorphic node problem. Our results are as follows.

GAEs are Unaware of Paths. Figure 1 provides a clear counterexample where GAEs fail to distinguish between (v1, v2) and (v1, v3).

Other GNN Variants are Aware of Paths. We formally state this result in the following theorem.

Theorem 1. Given any graphs $G = (V_G, E_G)$ and $H = (V_H, E_H)$, suppose $(v_1, v_2) \in V_G^2$ and $(w_1, w_2) \in V_H^2$ are the target links. Then all the other GNN variants with sufficient layers, including the labeling trick, variants of the labeling trick, and higher-order GNNs, can distinguish $(v_1, v_2)$ from $(w_1, w_2)$ if there exists some positive integer $d$ such that the number of paths of length $d$ from $v_1$ to $v_2$ differs from the number of paths of length $d$ from $w_1$ to $w_2$.

Proof. We first consider the labeling trick and its variants. Among these variants the partial labeling trick is the least expressive one [44], so we focus on it. Given $G = (V_G, E_G)$ and the target link $(v_1, v_2)$, the partial labeling trick first assigns a unique label to $v_1$ and then applies an MPNN on $G$. We refer to the additional label as IsSource, where $\mathrm{IsSource}(v_1) = 1$ and $0$ otherwise. The procedure of counting paths can be described as

$$h_u^{(l+1)} = \sum_{w \in N(u)} h_w^{(l)}, \qquad h_u^{(0)} = \mathrm{IsSource}(u),$$

where $h_u^{(l)}$ corresponds to the number of paths of length $l$ from $v_1$ to $u$. Obviously, this procedure is within the standard MPNN framework and thus can be computed by the partial labeling trick.

Next we consider higher-order GNNs, specifically the variants that simulate the 3-WL [21] and 2-FWL [4] tests. Since 3-WL and 2-FWL have equivalent expressiveness, we focus on the 2-FWL variant. Similarly as before, the procedure of counting paths can be described as

$$h_{uv}^{(l+1)} = \sum_{w \in V_G} h_{uw}^{(l)} h_{wv}^{(0)},$$

where $h_{uv}^{(0)} = 1$ if $(u, v) \in E_G$ and $0$ otherwise. This procedure counts the paths of length $l + 1$ from $u$ to $v$ at layer $l$, and is within the 2-FWL framework, thus can be computed by higher-order GNNs.

With Theorem 1 we conclude that all these GNN variants except GAEs are aware of paths. Considering their performance on various link prediction benchmarks, we may assume that the awareness of paths is the key factor that helps these variants learn better link representations.
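The path-counting recursion used in the proof can be spelled out in a few lines of NumPy. This is a minimal sketch of the partial-labeling-trick view only (the function name and variables are ours); note that, as in the theorem, "paths" here are counted as walks.

```python
import numpy as np

def count_paths_from_source(adj, source, num_layers):
    """Count length-l walks from `source` to every node, for l = 1..num_layers.

    Implements h^{(l+1)}_u = sum_{w in N(u)} h^{(l)}_w with h^{(0)} = IsSource,
    so after l steps h_u equals the number of length-l walks from the source to u.
    """
    h = np.zeros(adj.shape[0])
    h[source] = 1.0                      # IsSource indicator
    counts = []
    for _ in range(num_layers):
        h = adj @ h                      # one sum-aggregation step
        counts.append(h.copy())
    return counts

# Triangle graph: e.g. two length-2 walks lead from node 0 back to itself.
adj = np.array([[0, 1, 1], [1, 0, 1], [1, 1, 0]], dtype=float)
print(count_paths_from_source(adj, source=0, num_layers=3))
```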

4.2 What Existing GNNs Have Overlooked: Awareness of Simple Paths

With the discussion above, a natural question is whether these GNN variants are also aware of simple paths. Simple paths are a fundamental concept in graph theory and play an important role in many graph-related problems, so one might simply assume that these GNN variants are also aware of simple paths. However, we find that this is not true: in fact, none of these variants can capture simple path patterns.

Fig. 2. Illustration of Rook’s 4 × 4 and Shrikhande graph.

Constructing Counterexamples. A well-known pair of non-isomorphic graphs that is hard to distinguish is the Shrikhande graph [28] and the Rook's 4 × 4 graph [19], shown in Fig. 2(a), (b). These graphs are strongly regular and cannot be distinguished by the 3-WL test; therefore, higher-order GNNs cannot distinguish between them. We augment each of the two graphs by adding one node that is connected to all of its nodes, resulting in the graphs (c) and (d) respectively. Obviously, (c) and (d) are non-isomorphic. However, the labeling trick methods and their variants, as well as higher-order GNNs (which are within the 3-WL hierarchy), cannot distinguish (v1, v2) from (w1, w2).


Fig. 3. Illustration of the SPGNN framework. $h_u$, $h_v$ are the node representations of nodes u, v computed in the node feature propagation procedure. $SI_{uv}$ is the structural representation of (u, v) computed in the structural feature propagation procedure, which is aware of simple paths.

Generally, such strongly regular graphs are believed to be very hard to distinguish. However, we find that (c) and (d) can easily be distinguished by considering the simple paths between (v1, v2) and (w1, w2) respectively. That is, if we list the numbers of simple paths from v1 to v2 and from w1 to w2 according to the path lengths, these numbers are clearly different (there are 13991683 simple paths from v1 to v2 but 14029327 from w1 to w2). This indicates that none of the previous GNN variants is able to capture simple paths.
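For reference, the quantity being compared above, the number of simple paths between two nodes, can be computed by a straightforward backtracking search. The following is a minimal sketch (function name ours, exponential in the worst case, so intended only for small graphs such as the counterexamples here); networkx is assumed to be available, but any adjacency structure would do.

```python
import networkx as nx

def count_simple_paths(graph, source, target, max_nodes=None):
    """Count simple paths (no repeated nodes) from source to target by DFS."""
    count = 0
    stack = [(source, {source})]
    while stack:
        node, visited = stack.pop()
        if node == target:
            count += 1
            continue
        if max_nodes is not None and len(visited) >= max_nodes:
            continue  # optional cap on the number of nodes per path
        for nbr in graph.neighbors(node):
            if nbr not in visited:
                stack.append((nbr, visited | {nbr}))
    return count

# Toy check on a 4-cycle: exactly two simple paths between opposite corners.
g = nx.cycle_graph(4)
print(count_simple_paths(g, 0, 2))  # -> 2
```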

4.3 Designing Efficient GNNs for Link Prediction

In this section we aim to design a more efficient GNN model for link prediction. The discussion above provides two intuitions: (1) the awareness of paths is important for link prediction, and (2) existing GNN variants overlook simple paths, which might further help improve link prediction performance. Based on these intuitions we propose a novel GNN framework, namely the Simple Path-aware GNN (SPGNN), which absorbs information about both paths and simple paths into link prediction while being extremely efficient. The overall framework of SPGNN is illustrated in Fig. 3. Let $X \in \mathbb{R}^{N \times d}$ be the input node feature matrix, where $N$ is the number of nodes and $d$ is the feature dimension, and let $A \in \mathbb{R}^{N \times N}$ be the adjacency matrix. SPGNN contains two parts.

Node Feature Propagation. Denote by $H^{(l)} \in \mathbb{R}^{N \times d}$ the node representation matrix at layer $l$, with its $i$-th row being the representation $h_i^{(l)}$ of the $i$-th node, and let $H^{(0)} = X$. At layer $l + 1$, we compute

$$H^{(l+1)} = \mathrm{GNNLayer}\left(H^{(l)}\right).$$

Structural Propagation. Given the target link (u, v), consider the path counting algorithm: the number of paths of length $l$ between any node pair can be read off from $A^l$. However, this does not take node degree information into account. Inspired by GCNs, we generalize their convolutional operator to compute our structural representations. Given the node degree matrix $D$, the normalized adjacency matrix is $S = D^{-\frac{1}{2}} A D^{-\frac{1}{2}}$. We then use $S$ to compute the structural representations. We first compute the powers $(S, S^2, \ldots, S^l)$ to generalize path counting to the convolutional domain. After that, we also generalize simple path counting: we define

$$SP_{uv}^{(l)} = \sum_{(u, w_1, \ldots, w_{l-1}, v) \in \mathrm{SimplePath}(u, v)} S_{u w_1} S_{w_1 w_2} \cdots S_{w_{l-1} v},$$

where $l$ is the length of the path and $S_{ij}$ is the $(i, j)$-th entry of the normalized adjacency matrix $S$. Obviously, $SP_{uv}^{(l)}$ generalizes the counting of simple paths. We then compute the structural representation of (u, v) as

$$SI_{uv} = \left[S_{uv}, (S^2)_{uv}, \ldots, (S^l)_{uv}, SP_{uv}^{(1)}, \ldots, SP_{uv}^{(l)}\right].$$
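The following is a minimal NumPy sketch of the structural representation $SI_{uv}$ as defined above. The function names are ours, and the simple-path terms are enumerated by brute-force DFS purely to illustrate the definition; it is not the precomputation strategy used in practice.

```python
import numpy as np

def normalized_adjacency(adj):
    """S = D^{-1/2} A D^{-1/2} for a symmetric 0/1 adjacency matrix A."""
    deg = adj.sum(axis=1)
    d_inv_sqrt = np.where(deg > 0, deg ** -0.5, 0.0)
    return adj * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def structural_representation(adj, u, v, max_len=3):
    """SI_uv = [S_uv, (S^2)_uv, ..., SP^(1)_uv, ..., SP^(max_len)_uv]."""
    S = normalized_adjacency(adj)
    power_feats, Sk = [], np.eye(adj.shape[0])
    for _ in range(max_len):
        Sk = Sk @ S
        power_feats.append(Sk[u, v])

    sp_feats = [0.0] * max_len
    stack = [(u, [u], 1.0)]          # (current node, visited path, product of S entries)
    while stack:
        node, path, weight = stack.pop()
        if node == v and len(path) > 1:
            sp_feats[len(path) - 2] += weight   # a simple path of length len(path)-1
            continue
        if len(path) - 1 >= max_len:
            continue
        for nbr in range(adj.shape[0]):
            if adj[node, nbr] and nbr not in path:
                stack.append((nbr, path + [nbr], weight * S[node, nbr]))
    return np.array(power_feats + sp_feats)
```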

Obviously, all the operations here are sparse, and the structural representations can be efficiently precomputed before training.

Link Predictor. Both of the above propagation steps are precomputed before predicting links, so SPGNN can be trained and run efficiently. At the output layer $L$, we compute the prediction for the link (u, v) as

$$p(u, v) = f\left(h_u^{(L)}, h_v^{(L)}, SI_{uv}\right),$$

where $f$ is a multi-layer perceptron. With the above method, we can efficiently compute link representations while capturing the simple path information between the target nodes.

Time Complexity. An advantage of our proposed method is that, compared with previous GNN-based link prediction methods, it is more efficient. We compare the time complexity of our method with SEAL [40] (a labeling trick method), NBFNet [44] (a variant of the labeling trick), and Edge Transformer [4] (a higher-order GNN). We assume we want to compute the representation of one link (u, v), given a graph with N nodes and E edges. The complexities are summarized in Table 1; for clarity, they are stated only with respect to the size of the graph.

Table 1. Time complexities. We assume SGC [34] as the backend GNN model of our method, which is consistent with the experiments.

Complexity | SEAL | NBFNet   | Edge Transformer | SPGNN
Preprocess | O(1) | O(1)     | O(1)             | O(E)
Inference  | O(E) | O(E + N) | O(N^3)           | O(1)

578

T. Xu and L. Zou

the GNN on the relabeled graph, making them less efficient. Compared with these method, our method only needs to run the GNN once. The resulting node representations can then be efficiently utilized to predict any links of the graph, while considering the inter-node patterns between the target links. Table 2. Dataset statistics. Dataset Cora Citeseer Pubmed USAir Celegans Ecoli

PB

#Nodes 2708 3327

18717

332

297

1805

1222

#Edges 5278 4676

44327

2126

2148

14660 16714

Table 3. Main results. Best results are bold, and secondary results are underlined. Method

Cora

Citeseer

Pubmed

USAir

Celegans

Ecoli

PB

Metric

Hit@100

Hit@100

Hit@100

AUC

AUC

AUC

AUC

CN

33.92 ± 0.46

29.79 ± 0.90

23.13 ± 0.15

92.80 ± 1.22

85.13 ± 1.61

93.71 ± 0.39

92.04 ± 0.35

AA

39.85 ± 1.34

35.19 ± 1.33

27.38 ± 0.11

95.06 ± 1.03

86.95 ± 1.40

95.36 ± 0.34 92.36 ± 0.34

RA

41.07 ± 0.48

33.56 ± 0.17

27.03 ± 0.35

95.77 ± 0.92

87.49 ± 1.41

95.95 ± 0.35

92.46 ± 0.37

GAE

66.79 ± 1.65

67.08 ± 2.94

53.02 ± 1.39

89.28 ± 1.99

81.80 ± 2.18

90.81 ± 0.63

90.70 ± 0.53

SEAL NBFNet

81.71 ± 1.30 71.65 ± 2.27

83.89 ± 2.15 74.07 ± 1.75

75.54 ± 1.32 96.62 ± 1.22 58.73 ± 1.99 –

90.30 ± 1.35 –

97.64 ± 0.22 94.72 ± 0.46 – –

Neo-GNN 80.42 ± 1.31

84.67 ± 2.16

73.93 ± 1.19





SPGNN

5

88.63 ± 1.23 93.20 ± 1.25 74.14 ± 0.72



97.28 ± 1.17 92.51 ± 1.23 96.14 ± 0.29

– 94.84 ± 0.37

Experiments

Datasets. We consider two series of link prediction benchmarks. We report results for the most widely used Planetoid citation networks Cora [18], Citeseer [29] and Pubmed [23]. These datasets contain node features. We randomly remove 20% existing links as positive test data. Following a standard manner of learning-based link prediction, we randomly sample the same number of nonexistent links (unconnected node pairs) as negative testing data. We use the remaining 80% existing links as well as the same number of additionally sampled nonexistent links to construct the training data. Dataset statistics are summarized in Table 2. We also conduct experiments on the widely used USAir [3], C.ele [33], Ecoli [41] and PB [1]. Different from the Planetoid datasets, these datasets contain no node features and thus we need to predict the existence of links based solely on graph structures. For these datasets we randomly remove 10% existing links from each dataset as positive testing data. Baselines. We report results for three heuristics that are successful for link prediction, Common Neighbors (CN), Adamic-Adar (AA) [2], Resource Allocation (RA) [43]; a basic link prediction GNN model GAE [13]; a series of state-ofthe-art link prediction GNNs considered in this paper SEAL [40], Neo-GNN [39]


and NBFNet [44]. We select commonly used metrics in link prediction literature AUC and Hits@100 [42] as our evaluation metrics. Implementation Details. We choose SGC [34] as our basic GNN layer. We select the number of layers to be 3 for all datasets. We select the hidden dimension to be 1024. The predictor f is implemented as a 2-layer MLP. If the dataset contains no node features we simply omit the node feature propagation step. We use the Adam [12] optimizer with learning rate 10−4 . The lengths of paths and simple paths are hyperparameters selected from {0, 1, 2, 3} based on the validation set. Results. Main results are in Table 3. SPGNN achieves the best results in 5 of the 7 datasets, and the secondary results in the remaining 2 datasets. We can see that these results align with our previous analysis: Cora, Citeseer and Pubmed datasets contain real-valued noisy node features, thus traditional heuristics CN, AA and RA perform poorly. GAE methods are weak in link prediction and cannot capture path information, therefore perform poorly in USAir, Celegans, Ecoli and PB datasets which contains no node features. SEAL is more expressive than GAEs and can identify paths between the target node pairs, which makes it the closest competitor. Compared with SEAL, our method is able to consider simple paths, therefore achieve better performance in most of the datasets. This implies the effectiveness of our structural propagation procedure which involves information about both paths and simple paths of the target links. Runtimes. We compare SPGNN with SEAL, the closest competitor. We report wall times for both SEAL static (subgraphs are pre-generated) and SEAL dynamic (subgraphs are generated on the fly) on the PubMed dataset. The times are summarized in Table 4. SEAL dynamic needs to extract subgraphs during training, therefore is much slower than other variants. SEAL static first extract subgraphs before training, thus requires a preprocess procedure. Because SEAL needs to run the GNN on different subgraphs for predicting different links in training data, it still needs longer time for training. Compared with SEAL, GCN is much faster because we only need to run it once for predicting all target links in training data. Compared with GCN, SPGNN (with the utilization of SGC as the feature propagation model) precomputes both the structural representations and node representations before training, therefore requires an explicit preprocess procedure. This also makes SPGNN much faster compared with SEAL and GCN. Ablation Studies. We aim to verify the two assumptions in this paper, i.e. (1) the awareness of paths is of vital importance in link prediction and (2) the awareness of simple paths can further improve the performance. We perform the ablation studies on Cora and Citeseer dataset and the results are in Table 5.


Table 4. Wall times (in seconds) for one epoch of training.

time (s)   | SEAL dyn | SEAL stat | GCN | SPGNN
preprocess | 0        | 630       | 0   | 79
train      | 70       | 30        | 4   | 0.43

Table 5. Ablation studies. GAE: without paths. P: paths. SP: simple paths.

Dataset  | GAE   | P     | P+SP
Cora     | 66.79 | 86.08 | 88.63
Citeseer | 67.08 | 91.02 | 93.20

6 Conclusion

In this paper, we systematically investigate the expressive power of link prediction GNNs from the perspective of paths, and justify the superior performance of the labeling trick and higher-order GNNs by showing their awareness of paths, which GAEs lack. We then identify a fundamental property that all previous link prediction GNNs have overlooked: the awareness of simple paths. The power and limits of existing GNNs inspire us to design a novel GNN framework that provably captures both path and simple-path patterns while being more efficient than existing labeling trick and higher-order GNNs. We test the performance and efficiency of our model on various common link prediction benchmarks.

References 1. Ackland, R., et al.: Mapping the us political blogosphere: are conservative bloggers more prominent? In: BlogTalk Downunder 2005 Conference, Sydney (2005) 2. Adamic, L.A., Adar, E.: Friends and neighbors on the web. Soc. Netw. 25, 211–230 (2003) 3. Batagelj, V., Mrvar, A.: (2006). http://vlado.fmf.uni-lj.si/pub/networks/data/ 4. Bergen, L., O’Donnell, T.J., Bahdanau, D.: Systematic generalization with edge transformers. In: NeurIPS (2021) 5. Bordes, A., Usunier, N., Garc´ıa-Dur´ an, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: NIPS (2013) 6. Bruna, J., Zaremba, W., Szlam, A.D., LeCun, Y.: Spectral networks and locally connected networks on graphs. CoRR abs/1312.6203 (2013) 7. Chen, J., Ma, T., Xiao, C.: FastGCN: fast learning with graph convolutional networks via importance sampling. ArXiv abs/1801.10247 (2018) 8. Defferrard, M., Bresson, X., Vandergheynst, P.: Convolutional neural networks on graphs with fast localized spectral filtering. In: NIPS (2016) 9. Dettmers, T., Minervini, P., Stenetorp, P., Riedel, S.: Convolutional 2d knowledge graph embeddings. In: AAAI (2018) 10. Gal´ arraga, L., Teflioudi, C., Hose, K., Suchanek, F.M.: Fast rule mining in ontological knowledge bases with AMIE+. VLDB J. 24, 707–730 (2015) 11. Ioannidis, V.N., Zheng, D., Karypis, G.: Few-shot link prediction via graph neural networks for COVID-19 drug-repurposing. ArXiv abs/2007.10261 (2020) 12. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)


13. Kipf, T., Welling, M.: Variational graph auto-encoders. In: NeurIPS Workshop (2016) 14. Kipf, T., Welling, M.: Semi-supervised classification with graph convolutional networks. In: ICLR (2017) 15. Koren, Y., Bell, R.M., Volinsky, C.: Matrix factorization techniques for recommender systems. Computer 42, 30–37 (2009) 16. Maron, H., Ben-Hamu, H., Serviansky, H., Lipman, Y.: Provably powerful graph networks. ArXiv abs/1905.11136 (2019) 17. Maron, H., Ben-Hamu, H., Serviansky, H., Lipman, Y.: Provably powerful graph networks. In: NeurIPS (2019) 18. McCallum, A., Nigam, K., Rennie, J.D.M., Seymore, K.: Automating the construction of internet portals with machine learning. Inf. Retrieval 3, 127–163 (2000) 19. Moon, J.W.: On the line-graph of the complete bigraph. Ann. Math. Stat. 34, 664–667 (1963) 20. Morris, C., Rattan, G., Mutzel, P.: Weisfeiler and leman go sparse: towards scalable higher-order graph embeddings. In: NeurIPS (2020) 21. Morris, C., et al.: Weisfeiler and leman go neural: higher-order graph neural networks. In: AAAI (2019) 22. Muggleton, S.: Inverse entailment and progol. N. Gener. Comput. 13, 245–286 (2009) 23. Namata, G., London, B., Getoor, L., Huang, B., Edu, U.: Query-driven active surveying for collective classification. In: 10th International Workshop on Mining and Learning with Graphs, vol. 8, p. 1 (2012) 24. Nguyen, T., Phung, D.Q., Adams, B., Venkatesh, S.: Towards discovery of influence and personality traits through social link prediction. AAAI (2021) 25. Nickel, M., Murphy, K.P., Tresp, V., Gabrilovich, E.: A review of relational machine learning for knowledge graphs. Proc. IEEE 104, 11–33 (2015) 26. Qu, M., Chen, J., Xhonneux, L.P., Bengio, Y., Tang, J.: RNNLogic: learning logic rules for reasoning on knowledge graphs. In: ICLR (2021) 27. Rockt¨ aschel, T., Riedel, S.: End-to-end differentiable proving. In: NeurIPS, pp. 3791–3803 (2017) 28. Shrikhande, S.S.: The uniqueness of the l2 association scheme. Ann. Math. Stat., 781–798 (1959) 29. Sen, P., Namata, G., Bilgic, M., Getoor, L., Gallagher, B., Eliassi-Rad, T.: Collective classification in network data. AI Mag. 29, 93 (2008) 30. Sun, Z., Deng, Z., Nie, J.Y., Tang, J.: Rotate: knowledge graph embedding by relational rotation in complex space. ArXiv abs/1902.10197 (2018) 31. Teru, K.K., Denis, E., Hamilton, W.L.: Inductive relation prediction by subgraph reasoning. In: ICML (2019) ´ Bouchard, G.: Complex embed32. Trouillon, T., Welbl, J., Riedel, S., Gaussier, E., dings for simple link prediction. In: ICML (2016) 33. Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998) 34. Wu, F., Zhang, T., de Souza, A.H., Fifty, C., Yu, T., Weinberger, K.Q.: Simplifying graph convolutional networks. In: ICML (2019) 35. Xu, K., Hu, W., Leskovec, J., Jegelka, S.: How powerful are graph neural networks? ArXiv abs/1810.00826 (2018) 36. Yang, B., tau Yih, W., He, X., Gao, J., Deng, L.: Embedding entities and relations for learning and inference in knowledge bases. CoRR abs/1412.6575 (2015) 37. Yang, F., Yang, Z., Cohen, W.W.: Differentiable learning of logical rules for knowledge base completion. In: NeurIPS (2017)


38. You, J., Gomes-Selman, J.M., Ying, R., Leskovec, J.: Identity-aware graph neural networks. In: AAAI (2021) 39. Yun, S., Kim, S., Lee, J., Kang, J., Kim, H.J.: Neo-GNNs: neighborhood overlapaware graph neural networks for link prediction. In: NeurIPS (2021) 40. Zhang, M., Chen, Y.: Link prediction based on graph neural networks. In: NeurIPS (2018) 41. Zhang, M., Cui, Z., Jiang, S., Chen, Y.: Beyond link prediction: predicting hyperlinks in adjacency space. In: AAAI, vol. 32 (2018) 42. Zhang, M., Li, P., Xia, Y., Wang, K., Jin, L.: Labeling trick: a theory of using graph neural networks for multi-node representation learning. In: NeurIPS (2020) 43. Zhou, T., L¨ u, L., Zhang, Y.C.: Predicting missing links via local information. Eur. Phys. J. B 71, 623–630 (2009) 44. Zhu, Z., Zhang, Z., Xhonneux, L.P., Tang, J.: Neural bellman-ford networks: a general graph neural network framework for link prediction. In: NeurIPS (2021)

Restore Translation Using Equivariant Neural Networks

Yihan Wang1,3, Lijia Yu2,3, and Xiao-Shan Gao1,3(B)

1 Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
[email protected]
2 SKLCS, Institute of Software, Chinese Academy of Sciences, Beijing, China
3 University of Chinese Academy of Sciences, Beijing, China

Abstract. Invariance to spatial transformations such as translations is a desirable property and a basic design principle for classification neural networks. However, the commonly used convolutional neural networks (CNNs) are actually very sensitive to even small translations. There is a large body of work that achieves exact or approximate transformation invariance by designing transformation-invariant models or by assessing the transformations; these works usually modify the standard CNNs and harm the performance on standard datasets. In this paper, rather than modifying the classifier, we propose a pre-classifier restorer that recovers translated inputs to the original ones, which can then be fed into any classifier for the same dataset. The restorer is based on a theoretical result which gives a sufficient and necessary condition for an affine operator to be translation-equivariant on a tensor space.

1

· Translation restorer

Introduction

Deep convolutional neural networks (CNNs) had outperformed humans in many computer vision tasks [9,12]. One of the key ideas in designing the CNNs is that the convolution layer is equivariant with respect to translations, which was emphasized both in the earlier work [5] and the modern CNN [12]. However, the commonly used components, such as pooling [7] and dropout [19,20], which help the network to extract features and generalize, actually make CNNs not equivariant to even small translations, as pointed out in [1,3]. As a comprehensive evaluation, Fig. 1 shows that two classification CNNs suffer the accuracy reductions of more than 11% and 59% respectively on CIFAR-10 and MNIST, when the inputs are horizontally and vertically translated at most 3 pixels. Invariance to spatial transformations, including translations, rotations and scaling, is a desirable property for classification neural networks and the past few decades have witnessed thriving explorations on this topic. In general, there exist three ways to achieve exact or approximate invariance. The first is to design transformation-invariant neural network structures [2,6,8,10,15,16,18,21]. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1962, pp. 583–603, 2024. https://doi.org/10.1007/978-981-99-8132-8_44

584

Y. Wang et al.

Fig. 1. The accuracy reduction after vertical and horizontal translations. The translation scope is [−3, 3] pixels. Left: LeNet-5 on MNIST; Right: VGG-16 on CIFAR-10.

The second is to assess and approximate transformations via a learnable module [4,11] and then use the approximation to reduce the transformed inputs to "standard" ones. The third is data augmentation [1,3,17], adding various transformations of the samples to the original dataset. The ad-hoc architectures that achieve invariance often bring extra parameters and harm the network performance on standard datasets. Moreover, the various designs with different purposes are not compatible with each other. Data augmentation is not a scalable method, since the invariance gained from a certain augmentation protocol does not generalize to other transformations [1]. Including learnable modules such as the Spatial Transformer, all three approaches require training the classifier from scratch and fail to endow existing trained networks with some invariance. It was indicated in [1] that "the problem of insuring invariance to small image transformations in neural networks while preserving high accuracy remains unsolved."

In this paper, rather than designing any in-classifier component to make the classifier invariant to some transformation, we propose a pre-classifier restorer to restore translated inputs to the original ones. The invariance is achieved by feeding the restored inputs into any following classifier. Our restorer depends only on the dataset instead of the classifier. Namely, the training processes of the restorer and the classifier are separate, and a restorer is universal to any classifier trained on the same dataset. We split the whole restoration into two stages, transformation estimation and inverse transformation, see Fig. 2. In the first stage, we expect that standard inputs lead to standard outputs and that the outputs of translated inputs reflect the translations. Naturally, what we need is a strictly translation-equivariant neural network. In Sect. 3, we investigate, from a theoretical perspective, the sufficient and necessary condition to construct a strictly equivariant affine operator on a tensor space. The condition results in the circular filters, see Definition 4, as the fundamental module of a strictly translation-equivariant neural network. We give the canonical architecture of translation-equivariant networks, see Eq. (2). In Sect. 4, details of the restorer are presented. We define a translation estimator, the core component of a restorer, as a strictly translation-equivariant neural network that guarantees the first component of every output on a dataset to be the largest component, see Definition 5. For a translated input, due to the strict equivariance, the largest component of the output reflects the translation. Thus we can translate it inversely in the second stage and obtain the original image. Though the restorer is independent of the following classifier, it indeed depends on the dataset. Given a dataset satisfying some reasonable conditions, i.e. an aperiodic finite dataset, see Definition 6, we prove the existence of a translation estimator, i.e. a restorer, with the canonical architecture for this dataset. Moreover, rotations can be viewed as translations by converting Cartesian coordinates to polar coordinates, and a rotation restorer arises in a similar way. In Sect. 5, experiments on MNIST, 3D-MNIST and CIFAR-10 show that our restorers not only visually restore the translated inputs but also largely eliminate the accuracy reduction phenomenon.

2 Related Works

As a generalization of convolutional neural networks, group-equivariant convolutional neural networks [2,6] exploited symmetries to endow networks with invariance to some group actions, such as the combination of translations and rotations by certain angles. The warped convolutions [10] converted some other spatial transformations into translations and thus obtained equivariance to these spatial transformations. Scale-invariance [8,15,21] was injected into networks by some ad-hoc components. Random transformations [16] of feature maps were introduced in order to prevent the dependencies of network outputs on specific poses of inputs. Similarly, probabilistic max pooling [18] of the hidden units over the set of transformations improved the invariance of networks in unsupervised learning. Moreover, local covariant feature detecting methods [14,22] were proposed to address the problem of extracting viewpoint-invariant features from images. Another approach to achieving invariance is "shiftable" down-sampling [13], in which any original pixel can be linearly interpolated from the pixels on the sampling grid. This "shiftable" down-sampling exists if and only if the sampling frequency is at least twice the highest frequency of the unsampled signal. The Spatial Transformer [4,11], as a learnable module, produces a predictive transformation for each input image and then spatially transforms the input to a canonical pose to simplify the inference in the subsequent layers. Our restorers give input-specific transformations as well and adjust the input to alleviate the poor invariance of the following classifiers. Although the Spatial Transformers and our restorers are both learnable modules, the training of the former depends not only on the data but also on the subsequent layers, while the latter is independent of the subsequent classifiers.

3 Equivariant Neural Networks

Though objects in nature have continuous properties, once captured and converted to digital signals, these properties are represented by real tensors. In this section, we study the equivariance of operators on a tensor space.

3.1 Equivariance in Tensor Space

Assume that a map $\tilde{x}: \mathbb{R}^d \to D$ stands for a property of some $d$-dimensional object, where $D \subseteq \mathbb{R}$. Sampling $\tilde{x}$ over an $(n_1, n_2, \cdots, n_d)$-grid results in a tensor $x$ in a tensor space
$$H := D^{n_1} \otimes D^{n_2} \otimes \cdots \otimes D^{n_d}. \qquad (1)$$
We denote $[n] = \{0, 1, \ldots, n-1\}$ for $n \in \mathbb{Z}^+$ and assume $k \bmod n \in [n]$ for $k \in \mathbb{Z}$. For an index $I = (i_1, i_2, \cdots, i_d) \in \prod_{i=1}^{d}[n_i]$ and $x \in H$, denote $x[I]$ to be the element of $x$ with subscript $(i_1, i_2, \cdots, i_d)$. For convenience, we extend the index of $H$ to $I = (i_1, i_2, \cdots, i_d) \in \mathbb{Z}^d$ by defining $x[I] = x[i_1 \bmod n_1, \cdots, i_d \bmod n_d]$.

Definition 1 (Translation). A translation $T^M: H \to H$ with $M \in \mathbb{Z}^d$ is an invertible linear operator such that for all $I \in \mathbb{Z}^d$ and $x \in H$, $T^M(x)[I] = x[I - M]$. The inverse of $T^M$ is clearly $T^{-M}$.

Definition 2 (Equivariance). A map $w: H \to H$ is called equivariant with respect to translations if for all $x \in H$ and $M \in \mathbb{Z}^d$, $T^M(w(x)) = w(T^M(x))$.

Definition 3 (Vectorization). A tensor $x$ can be vectorized to $X \in \vec{H} = D^N$ with $N = n_1 n_2 \cdots n_d$ such that $X[\delta(I)] := x[I]$, where
$$\delta(I) := (i_1 \bmod n_1)\, n_2 n_3 \cdots n_d + (i_2 \bmod n_2)\, n_3 n_4 \cdots n_d + \cdots + (i_d \bmod n_d),$$
and we denote $X = \vec{x}$. Moreover, the translation $T^M$ is vectorized as $T^M(X) := \overrightarrow{T^M(x)}$.
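To make Definitions 1-3 concrete, here is a tiny numerical illustration of our own, assuming a 4 × 4 grid; `np.roll` realizes the circular translation $T^M$.

```python
import numpy as np

# Tiny illustration of Definitions 1-3 on an assumed 4 x 4 grid.
x = np.arange(16).reshape(4, 4)
M = (1, 2)
tMx = np.roll(x, shift=M, axis=(0, 1))          # T^M(x)[I] = x[I - M], indices taken circularly
assert np.array_equal(np.roll(tMx, shift=(-1, -2), axis=(0, 1)), x)  # T^{-M} inverts T^M

X = x.reshape(-1)                                # vectorization of Definition 3
i1, i2 = 1, 3
assert X[(i1 % 4) * 4 + (i2 % 4)] == x[i1, i2]   # X[delta(I)] = x[I]
```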

3.2 Equivariant Operators

When $D = \mathbb{R}$, the tensor space $H$ is a Hilbert space by defining the inner product as $x \cdot z := \vec{x} \cdot \vec{z}$, which is the inner product in the vector space $\vec{H}$. In the rest of this section, we assume $D = \mathbb{R}$.

According to Riesz's representation theorem, there is a bijection between the continuous linear operator space and the tensor space. That is, a continuous linear operator $v: H \to \mathbb{R}$ can be viewed as a tensor $v \in H$ satisfying $v(x) = v \cdot x$. Now we can translate $v$ by $T^M$ and obtain $T^M(v): H \to \mathbb{R}$ such that $T^M(v)(x) = T^M(v) \cdot x$.

We consider a continuous linear operator $w: H \to H$. For $I \in \mathbb{Z}^d$ and $x \in H$, denote $w_I(x) = w(x)[I]$. Then $w_I: H \to \mathbb{R}$ is a continuous linear operator. An affine operator $\alpha: H \to H$ differs from a continuous linear operator $w$ by a bias tensor $c$ such that $\alpha(x) = w(x) + c$ for all $x \in H$.

Theorem 1. Let $\alpha(x) = w(x) + c: H \to H$ be an affine operator. Then, $\alpha$ is equivariant with respect to translations if and only if for all $M \in \mathbb{Z}^d$,
$$w_M = T^M(w_0) \quad \text{and} \quad c \propto \mathbf{1},$$
where $0$ is the zero vector in $\mathbb{Z}^d$ and $c \propto \mathbf{1}$ means that $c$ is a constant tensor, that is, all of its entries are the same.

Proof of Theorem 1 is given in Appendix A. Recall that $\vec{H} = \mathbb{R}^N$ is the vectorization of $H$ and $T^M$ also translates vectors in $\vec{H}$. Each continuous linear operator on $H$ corresponds to a matrix in $\mathbb{R}^{N \times N}$ and each bias operator corresponds to a vector in $\mathbb{R}^N$. Now we consider the translation equivariance in the vector space.

Definition 4 (Circular filter). Let $W = (W_0, W_1, \cdots, W_{N-1})^{\mathsf{T}}$ be a matrix in $\mathbb{R}^{N \times N}$. $W$ is called a circular filter if $W_{\delta(M)} = T^M(W_0)$ for all $M \in \mathbb{Z}^d$.

As the vector version of Theorem 1, we have

Corollary 1. Let $A: \mathbb{R}^N \to \mathbb{R}^N$ be an affine transformation such that $A(X) = W \cdot X + C$, in which $W \in \mathbb{R}^{N \times N}$, $C \in \mathbb{R}^N$. Then, $A$ is equivariant with respect to translations in the sense that for all $M \in \mathbb{Z}^d$,
$$A(T^M(X)) = T^M(A(X)),$$
if and only if $W$ is a circular filter and $C \propto \mathbf{1}$.

This affine transformation is very similar to the commonly used convolutional layers [5,12] in terms of shared parameters and similar convolutional operations. But the strict equivariance calls for the same in-size and out-size, and circular convolutions, which are usually violated by CNNs.

3.3 Equivariant Neural Networks

To compose a strictly translation-equivariant network, the spatial sizes of the input and output in each layer must be the same, and thus down-samplings are not allowed. Though Corollary 1 provides the fundamental component of a strictly translation-equivariant network, different compositions of this component lead to various equivariant networks. Here we give the canonical architecture. We construct the strictly translation-equivariant network $F$ with $L$ layers as
$$F(X) = F_L \circ F_{L-1} \circ \cdots \circ F_1(X). \qquad (2)$$
The $l$-th layer $F_l$ has $n_l$ channels, and for an input $X \in \mathbb{R}^{n_{l-1} \times N}$ we have
$$F_l(X) = \sigma(W[l] \cdot X + C[l]) \in \mathbb{R}^{n_l \times N}, \qquad (3)$$
where $W[l] = (W^1[l], \cdots, W^{n_l}[l]) \in \mathbb{R}^{n_l \times n_{l-1} \times N \times N}$, $C[l] = (C^1[l] \cdot \mathbf{1}, \cdots, C^{n_l}[l] \cdot \mathbf{1})$, $W^k[l] = (W^{k,1}[l], \cdots, W^{k,n_{l-1}}[l]) \in \mathbb{R}^{n_{l-1} \times N \times N}$, $C^k[l] = (C^{k,1}[l], \cdots, C^{k,n_{l-1}}[l]) \in \mathbb{R}^{n_{l-1}}$, $\sigma$ is the activation, $W^{k,r}[l] \in \mathbb{R}^{N \times N}$ are circular filters, $C^{k,r}[l] \in \mathbb{R}$ are constant biases for $k = 1, \cdots, n_l$ and $r = 1, \cdots, n_{l-1}$, the $\cdot$ denotes the inner product, and $\mathbf{1}$ is the vector whose components are all 1.
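As a concrete illustration of Definition 4 and of one layer of Eq. (3), the following minimal NumPy sketch (variable names and sizes are ours, for illustration only) builds a circular filter from its first row and checks that the resulting layer commutes with circular translations.

```python
import numpy as np

def circular_filter(w0):
    """N x N circular filter whose rows are the circular shifts of w0 (Definition 4)."""
    return np.stack([np.roll(w0, m) for m in range(len(w0))])

def equivariant_layer(x, w0, c):
    """One strictly translation-equivariant layer: ReLU(W x + c 1) with W a circular filter
    and c a constant (scalar) bias, as required by Corollary 1."""
    return np.maximum(circular_filter(w0) @ x + c, 0.0)

# Sanity check: translating the input translates the output identically.
rng = np.random.default_rng(0)
x, w0, c = rng.random(8), rng.random(8), 0.1
shift = 3
assert np.allclose(
    equivariant_layer(np.roll(x, shift), w0, c),
    np.roll(equivariant_layer(x, w0, c), shift),
)
```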

4 Translation Restorer

4.1 Method

In Sect. 3.3, we propose a strictly equivariant neural network architecture (2) such that any translation on the input will be reflected in the output. Generally speaking, once the outputs of an equivariant network on a dataset have some spatial structure, this structure shifts consistently as the input shifts. Thus, the translation parameter of a shifted input can be solved from its output. Finally, we can restore the input via the inverse translation. Figure 2 shows how a restorer works. The whole restoration process splits into two stages, translation estimation and inverse translation. We first define the translation estimator, which outputs a consistent and special structure on a dataset.

Definition 5. Let $\mathcal{D} \subset D^{P \times N}$ be a dataset with $P$ channels. Then a translation-equivariant network $F: \mathbb{R}^{P \times N} \to \mathbb{R}^N$ is said to be a translation estimator for $\mathcal{D}$ if
$$F(X)[0] = \max_{i=0}^{N-1} F(X)[i],$$

where F (X)[i] is the i-th component of F (X).


Fig. 2. The pre-classifier translation restorer. For shifted data $T^M(x)$ as the input, the translation estimator obtains the translation $M$ and restores the original data $T^{-M}(T^M(x)) = x$, which is fed into a pre-trained classifier.

Given such a translation estimator for a dataset $\mathcal{D}$ and a shifted input $X' = T^M(X)$ for some $X \in \mathcal{D}$, we propagate $X'$ through $F$ and get the output $F(X') \in \mathbb{R}^N$. Since the first component of $F(X)$ is the largest, the location of the largest component of $F(X')$ is exactly the translation parameter:
$$\delta(M) = \operatorname*{argmax}_{i=0,\dots,N-1} F(X')[i].$$
Then we can restore $X = T^{-M}(X')$ by inverse translation. The restored inputs can be fed to any classifier trained on the dataset $\mathcal{D}$.
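The two-stage restoration can be sketched in a few lines of Python; `estimator` and `grid_shape` are placeholders for a trained translation estimator and the spatial grid $(n_1, \cdots, n_d)$, so this is an illustrative sketch rather than the exact implementation.

```python
import numpy as np

def restore(x, estimator, grid_shape):
    """Stage 1: read off delta(M) as the argmax of the estimator's output.
    Stage 2: undo the shift with the inverse circular translation T^{-M}."""
    out = estimator(x)                                    # flat output of length N = prod(grid_shape)
    M = np.unravel_index(int(np.argmax(out)), grid_shape)  # location of the largest component
    axes = tuple(range(len(grid_shape)))
    return np.roll(x, shift=tuple(-m for m in M), axis=axes)  # apply T^{-M}
```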

4.2 Existence of the Restorer

In this section, we show the existence of restorers, that is, of the translation estimator. Note that our restorer is independent of the following classifier but dependent on the dataset. For translation, if a dataset contains both an image and a translated version of it, the estimator must be confused. We introduce aperiodic datasets to exclude such cases.

Definition 6 (Aperiodic dataset). Let $\mathcal{D} \subset D^{P \times N}$ be a finite dataset with $P$ channels. We call $\mathcal{D}$ an aperiodic dataset if $0 \notin \mathcal{D}$ and
$$T^M(X) = X' \iff M = 0 \text{ and } X = X',$$
for $M \in \mathbb{Z}^{d+1}$ and $X, X' \in \mathcal{D}$. Here $d$ is the spatial dimension and $M$ additionally decides the translation in the channel dimension.

Let $\mathcal{D}$ be an aperiodic dataset. Given that $D = [2^{Q+1}]$, which is the case in image classification, we prove the existence of the translation estimator for such an aperiodic dataset. The proof consists of two steps. The data are first mapped to their binary decompositions through a translation-equivariant network as Eq. (2), and then the existence of the translation restorer in the form of Eq. (2) is proved for binary data.
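For intuition, the aperiodicity condition of Definition 6 can be checked by brute force on a small dataset; the sketch below is our own helper, covers spatial shifts only (the channel-dimension shift of the definition is omitted), and returns False whenever some non-trivial circular shift of one sample reproduces a sample of the dataset.

```python
import numpy as np
from itertools import product

def is_aperiodic(samples):
    """Brute-force check of the shift condition in Definition 6 for a list of
    equally shaped arrays (spatial circular shifts only)."""
    for a in samples:
        for m in product(*(range(n) for n in a.shape)):
            if m == (0,) * a.ndim:
                continue                      # the trivial shift is always allowed
            shifted = np.roll(a, shift=m, axis=tuple(range(a.ndim)))
            if any(np.array_equal(shifted, b) for b in samples):
                return False
    return True
```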


Let $D = [2^{Q+1}]$ and $B = \{0, 1\}$. We denote $\eta: D \to B^Q$ to be the binary decomposition, such as $\eta(2) = (0, 1, 0)$ and $\eta(3) = (1, 0, 1)$. We perform the binary decomposition on $X \in D^{P \times N}$ element-wise and obtain $\eta(X) \in B^{G \times N}$, where $G = PQ$ is the number of channels in the binary representation. A dataset $\mathcal{D} \subseteq D^{P \times N}$ can be decomposed into $\mathcal{B} \subset B^{G \times N}$. Note that the dataset $\mathcal{D}$ is aperiodic if and only if its binary decomposition $\mathcal{B}$ is aperiodic.

The following Lemma 1 demonstrates the existence of a translation-equivariant network which coincides with the binary decomposition $\eta$ on $[2^{Q+1}]^{P \times N}$. Proof details are placed in Appendix B.

Lemma 1. Let $\eta: [2^{Q+1}] \to B$ be the binary decomposition. There exists a $(2Q+2)$-layer network $F$ in the form of Eq. (2) with ReLU activations and width at most $(Q+1)N$ such that for $X \in [2^{Q+1}]^{P \times N}$, $F(X) = \eta(X)$.

The following Lemma 2 demonstrates the existence of a 2-layer translation restorer for an aperiodic binary dataset. Proof details are placed in Appendix C.

Lemma 2. Let $\mathcal{B} = \{Z_s \mid s = 1, 2, \cdots, S\} \subset B^{G \times N}$ be an aperiodic binary dataset. Then there exists a 2-layer network $F$ in the form of Eq. (2) with ReLU activations and width at most $SN$ such that for all $s = 1, 2, \cdots, S$,
$$F(Z_s)[0] = \max_{i=0}^{N-1} F(Z_s)[i].$$

Given a $(2Q+2)$-layer network $F'$ obtained from Lemma 1 and a 2-layer network $F''$ obtained from Lemma 2, we stack them and have $F = F'' \circ F'$, which is exactly a translation restorer. We thus have proved the following theorem.

Theorem 2. Let $\mathcal{D} = \{X_s \mid s = 1, 2, \cdots, S\} \subset [2^{Q+1}]^{P \times N}$ be an aperiodic dataset. Then there exists a network $F: \mathbb{R}^{P \times N} \to \mathbb{R}^N$ in the form of Eq. (2) with ReLU activations such that for $s = 1, 2, \cdots, S$,
$$F(X_s)[0] = \max_{i=0}^{N-1} F(X_s)[i],$$

of which the depth is at most $2Q+4$ and the width is at most $\max(SN, (Q+1)N)$. Namely, this network is a translation restorer for $\mathcal{D}$.

5 Experiments

The core component of the restorer is the translation estimator, which outputs the translation parameter of the shifted inputs. We use the architecture described in Eq. (2) with $L = 6$, $n_l = 1$ for $l = 1, \cdots, L$, and ReLU activations. The training procedure aims at maximizing the first component of the outputs. Thus the max component of the output indicates the input shift. The experimental settings are given in Appendix D. We report four sets of experiments below.


Fig. 3. The restorers for MNIST and CIFAR-10.

2D Images. We train translation restorers for MNIST and CIFAR-10. MNIST images are resized to 32 × 32 and CIFAR-10 images are padded with 4 blank pixels at the edges. In Fig. 3, the left column is the original images, the middle column is the randomly shifted images, and the right column is the restored images. On both datasets, images are randomly shifted vertically and horizontally by at most 1/4 of their size. The shift is a circular shift where pixels shifted out of the figure appear on the other end. We can see that the shifted images are disorganized but the restored images closely resemble the original images. To evaluate the restoration performance of pretrained restorers, we train classifiers and test them on randomly shifted images and on restored ones, and the results are given in Table 1. When images are not shifted, the restorers lead to only 0.3% and 0.03% accuracy reduction on the two datasets. Nevertheless, even if the translation scope is 1, restorers improve the accuracy. Moreover, no matter how the images are shifted, the restorer can repair them to the same status and result in the same classification accuracy, namely 98.59% and 88.18%, while the accuracies drop significantly without the restorer; the larger the range of translation, the more obvious the restoration effect.

Different Architectures. Our proposed restorer is an independent module that can be placed before any classifier. It is scalable to different architectures used by the subsequent classifier.


Table 1. Restoration performance on MNIST and CIFAR-10. Images are randomly shifted within the translation scope ranging from 0 to 8 in both vertical and horizontal directions. We use LeNet-5 on MNIST and ResNet-18 on CIFAR-10. "Res." and "Trans." stand for restorer and translation respectively.

| Dataset | Res.\Trans. | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|---|---|
| MNIST | w/o | 98.89 | 98.21 | 95.41 | 87.07 | 76.61 | 62.9 | 51.33 | 41.1 | 35.7 |
| MNIST | w/ | 98.59 | 98.59 | 98.59 | 98.59 | 98.59 | 98.59 | 98.59 | 98.59 | 98.59 |
| MNIST | Effect | −0.3 | +0.38 | +3.18 | +11.52 | +21.98 | +35.69 | +47.26 | +57.49 | +62.89 |
| CIFAR-10 | w/o | 88.21 | 86.58 | 85.9 | 83.65 | 82.16 | 80.46 | 79.37 | 77.71 | 76.01 |
| CIFAR-10 | w/ | 88.18 | 88.18 | 88.18 | 88.18 | 88.18 | 88.18 | 88.18 | 88.18 | 88.18 |
| CIFAR-10 | Effect | −0.03 | +1.6 | +2.28 | +4.53 | +6.02 | +7.72 | +8.81 | +10.47 | +12.17 |

In Table 2, we evaluate the restoration performance on popular architectures including VGG-16, ResNet-18, DenseNet-121, and MobileNet v2. Translated images (w/ Trans.) are randomly shifted within scope 4 in both vertical and horizontal directions. The reduction of accuracy on original images is no more than 0.04%, and the restorer improves the accuracy on shifted images by 1.66%–6.02%.

Table 2. Restoration performance on different architectures and CIFAR-10.

| Res.\Trans. | VGG-16 (w/o) | VGG-16 (w/) | ResNet-18 (w/o) | ResNet-18 (w/) | DenseNet-121 (w/o) | DenseNet-121 (w/) | MobileNet v2 (w/o) | MobileNet v2 (w/) |
|---|---|---|---|---|---|---|---|---|
| w/o | 89.27 | 83.40 | 88.21 | 82.16 | 92.14 | 90.46 | 88.10 | 83.36 |
| w/ | 89.23 | 89.23 | 88.18 | 88.18 | 92.12 | 92.12 | 88.09 | 88.09 |
| Effect | −0.04 | +5.83 | −0.03 | +6.02 | −0.02 | +1.66 | −0.01 | +4.73 |

Translation Augmentation. Training with translation augmentation is another approach to improving the translational invariance of a model. However, translation augmentation is limited to a certain scope and thus cannot ensure effectiveness for test images shifted out of that scope. In Fig. 4, we compare the restoration performance on models not trained with translation augmentation (dashed lines) and models trained with translation augmentation (solid lines). The augmentation scope is 10% of the image size, that is, 3 pixels for MNIST and 4 pixels for CIFAR-10. Translation augmentation indeed improves the translational invariance of the classifier on images shifted within the augmentation scope. However, when the shift is beyond the augmentation scope, the accuracy begins to degrade. In such a case, the pre-classifier restorer is still able to calibrate the shift and improve the accuracy of the classifier trained with augmentation.


Fig. 4. Restoration performance on classifiers trained with or without translation augmentation. The models are LeNet-5 for MNIST and VGG-16 for CIFAR-10. "res." and "aug." stand for restorer and augmentation, respectively.

3D Voxelization Images. 3D-MNIST contains 3D point clouds generated from images of MNIST. The voxelization of the point clouds yields grayscale 3D tensors. Figure 5 visualizes the restoration on 3D-MNIST. In the middle of each subfigure, the 3-dimensional digit is shifted in a fixed direction. This fixed direction is detected by the translation estimator, and the restored digit is shown on the right.

Fig. 5. The restorer on 3D-MNIST. In each sub-figure, the left is the original digit, the middle is the shifted digit, and the right is the restored digit.

6 Conclusion

This paper contributes to equivariant neural networks in two aspects. Theoretically, we give the sufficient and necessary conditions for an affine operator $Wx + b$ to be translation-equivariant, that is, $Wx + b$ is translation-equivariant on a tensor space if and only if $W$ has the high-dimensional convolution structure and $b$ is a constant tensor. It is well known that if $W$ has the convolution structure, then $Wx$ is equivariant to translations [5,9], and this is one of the basic principles behind the design of CNNs. Our work gives new insights into the convolutional structure used in CNNs in that the convolution structure is also the necessary condition and hence the most general structure for translational equivariance. Practically, we propose the translation restorer to recover the original images from translated ones. The restorer can be combined with any classifier to alleviate the performance reduction problem for translated images. As a limitation, training a restorer on a large dataset such as ImageNet is still computationally difficult.

A Proof of Theorem 1

We first prove a lemma.

Lemma 3. Let $v: H \to \mathbb{R}$ be a continuous linear operator. We have $v(T^M(x)) = T^{-M}(v)(x)$ for all $x \in H$ and all $M \in \mathbb{Z}^d$.

Proof. A continuous linear operator $v$ can be viewed as a tensor in $H$. We have
$$
v(T^M(x)) = v \cdot T^M(x)
= \sum_{I \in \prod_{i=1}^{d}[n_i]} v[I] \cdot T^M(x)[I]
= \sum_{I \in \prod_{i=1}^{d}[n_i]} v[I] \cdot x[I-M]
= \sum_{I \in \prod_{i=1}^{d}[n_i]} v[I+M] \cdot x[I]
= \sum_{I \in \prod_{i=1}^{d}[n_i]} T^{-M}(v)[I] \cdot x[I]
= T^{-M}(v) \cdot x
= T^{-M}(v)(x).
$$

Theorem 1. Let $\alpha(x) = w(x) + c: H \to H$ be an affine operator. Then, $\alpha$ is equivariant with respect to translations if and only if for all $M \in \mathbb{Z}^d$,
$$w_M = T^M(w_0) \quad \text{and} \quad c \propto \mathbf{1},$$
where $0$ is the zero vector in $\mathbb{Z}^d$ and $c \propto \mathbf{1}$ means that $c$ is a constant tensor, that is, all of its entries are the same.

Proof. Here we denote the index by $I = (i_1, i_2, \cdots, i_d)$. On the one hand,
$$T^M(\alpha(x))[I] = T^M(w(x))[I] + T^M(c)[I] = w(x)[I-M] + c[I-M] = w_{I-M}(x) + c[I-M].$$
On the other hand,
$$\alpha(T^M(x))[I] = w(T^M(x))[I] + c[I] = w_I(T^M(x)) + c[I] = T^{-M}(w_I)(x) + c[I],$$
in which the last equation is from Lemma 3.

Sufficiency. Assume for all $M \in \mathbb{Z}^d$, $w_M = T^M(w_0)$ and $c \propto \mathbf{1}$. We have
$$T^{-M}(w_I) = T^{-M}(T^I(w_0)) = T^{I-M}(w_0) = w_{I-M}, \qquad c[I-M] = c[0] = c[I],$$
$$T^M(\alpha(x))[I] = \alpha(T^M(x))[I].$$
Thus, $T^M(\alpha(x)) = \alpha(T^M(x))$.

Necessity. Assume $\alpha$ is equivariant with respect to translations in the sense that $T^M(\alpha(x)) = \alpha(T^M(x))$. We have
$$w_{I-M}(x) - T^{-M}(w_I)(x) = c[I] - c[I-M].$$
Fix the index $I = 0$ and obtain that for all $M \in \mathbb{Z}^d$,
$$w_M(x) - T^M(w_0)(x) = c[0] - c[M].$$
Recall that a continuous linear operator can be viewed as a tensor, where the operation is the inner product in the tensor space. We have
$$c[0] - c[M] = (w_M - T^M(w_0)) \cdot x = \overrightarrow{w_M - T^M(w_0)} \cdot \vec{x}.$$
For each fixed $M$, the left side is a constant scalar and the right side is a linear function of the vector $\vec{x} \in \vec{H}$. Thus, both sides are equal to zero and we have
$$c[0] = c[M], \qquad \overrightarrow{w_M} = \overrightarrow{T^M(w_0)}.$$
That is, for all $M \in \mathbb{Z}^d$, $c \propto \mathbf{1}$ and $w_M = T^M(w_0)$.

B Proof of Lemma 1

We first prove a lemma.

Lemma 4. Let $\eta: [2^{Q+1}] \to B$ be the binary decomposition. There exists a $(2Q+2)$-layer network $f: \mathbb{R} \to \mathbb{R}^Q$ with ReLU activations and width at most $Q+1$ such that for $x \in [2^{Q+1}]$, $f(x) = \eta(x)$.

Proof. We decompose $x \in [2^{Q+1}]$ as $x = x_0 + 2x_1 + \cdots + 2^Q x_Q$. Then for $q = 0, \cdots, Q$, we have
$$x_q = \sigma\big(1 - \sigma(2^q + 2^{q+1}x_{q+1} + \cdots + 2^Q x_Q - x)\big).$$
Thus, for $q = 0, \cdots, Q-1$, we construct the pair of affine-plus-ReLU layers
$$f_{2q+1}: (x, x_Q, \cdots, x_{Q-q+1})^{\mathsf T} \mapsto \sigma\begin{pmatrix} x \\ x_Q \\ \vdots \\ x_{Q-q+1} \\ 2^{Q-q} - x + 2^Q x_Q + \cdots + 2^{Q-q+1} x_{Q-q+1} \end{pmatrix} \in \mathbb{R}^{q+2},$$
whose weight matrix stacks identity rows on top of the row $(-1, 2^Q, \cdots, 2^{Q-q+1})$ with bias $2^{Q-q}$ in the last entry, and
$$f_{2q+2}: (x, x_Q, \cdots, x_{Q-q+1}, u)^{\mathsf T} \mapsto \sigma\begin{pmatrix} x \\ x_Q \\ \vdots \\ x_{Q-q+1} \\ 1 - u \end{pmatrix} \in \mathbb{R}^{q+2},$$
which keeps the first $q+1$ components and, by the identity above, turns the last one into $\sigma(1-u) = x_{Q-q}$. The last two layers are constructed analogously:
$$f_{2Q+1}: (x, x_Q, \cdots, x_1)^{\mathsf T} \mapsto \sigma\begin{pmatrix} 1 - x + 2^Q x_Q + \cdots + 2 x_1 \\ x_Q \\ \vdots \\ x_1 \end{pmatrix} \in \mathbb{R}^{Q+1}, \qquad
f_{2Q+2}: (u, x_Q, \cdots, x_1)^{\mathsf T} \mapsto \sigma\begin{pmatrix} 1 - u \\ x_Q \\ \vdots \\ x_1 \end{pmatrix} \in \mathbb{R}^{Q+1},$$
so that the first component becomes $\sigma(1 - \sigma(1 - x_0)) = x_0$.

Let $f = f_{2Q+2} \circ \cdots \circ f_1$. For $x \in [2^{Q+1}]$ with $x = x_0 + 2x_1 + \cdots + 2^Q x_Q$, we have
$$f(x) = (x_0, x_Q, \cdots, x_1)^{\mathsf T}.$$
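The bit-extraction identity at the heart of this construction can be verified numerically; the following quick self-check (our own, with Q = 3) is only an illustration.

```python
import numpy as np

# Check x_q = relu(1 - relu(2^q + 2^{q+1} x_{q+1} + ... + 2^Q x_Q - x)) for all x in [2^(Q+1)).
relu = lambda v: np.maximum(v, 0)
Q = 3
for x in range(2 ** (Q + 1)):
    bits = [(x >> q) & 1 for q in range(Q + 1)]
    for q in range(Q + 1):
        higher = sum(2 ** j * bits[j] for j in range(q + 1, Q + 1))
        assert relu(1 - relu(2 ** q + higher - x)) == bits[q]
```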

Lemma 1. Let $\eta: [2^{Q+1}] \to B$ be the binary decomposition. There exists a $(2Q+2)$-layer network $F$ in the form of Eq. (2) with ReLU activations and width at most $(Q+1)N$ such that for $X \in [2^{Q+1}]^{P \times N}$, $F(X) = \eta(X)$.

Proof. From Lemma 4, there exists a network $f$ such that for $x \in [2^{Q+1}]$, $f(x) = \eta(x)$. We denote the $l$-th layer of $f$ by $f_l$ for $l = 1, \cdots, 2Q+2$. Without loss of generality, we assume that for $z \in \mathbb{R}^{K_{l-1}}$,
$$f_l(z) = \sigma(w_l \cdot z + b_l),$$
where $\sigma$ is the ReLU activation, $w_l \in \mathbb{R}^{K_l \times K_{l-1}}$ and $b_l \in \mathbb{R}^{K_l}$. Now we construct a $(2Q+2)$-layer network $F$ in the form of Eq. (2). For $l = 1, \cdots, 2Q+2$, let $n_l = K_l \times P$. We construct $F_l$ in the form of Eq. (3) as

$$W^{k_l \times p,\, k_{l-1} \times r}[l] = \begin{cases} \operatorname{diag}(w_l[k_l, k_{l-1}]) & \text{if } p = r, \\ 0 & \text{otherwise,} \end{cases} \qquad C^{k_l \times p,\, k_{l-1} \times r}[l] = \begin{cases} \dfrac{b_l[k_l]}{K_{l-1}} & \text{if } p = r, \\ 0 & \text{otherwise,} \end{cases}$$
for $k_l = 1, \cdots, K_l$, $k_{l-1} = 1, \cdots, K_{l-1}$ and $p, r = 1, \cdots, P$.


We can verify that for $X \in \mathbb{R}^{K_{l-1} \times P \times N}$, $k_l = 1, \cdots, K_l$, $p = 1, \cdots, P$ and $i = 0, \cdots, N-1$,
$$
\begin{aligned}
F_l(X)[k_l, p, i] &= \sigma\big(W^{k_l \times p}[l] \cdot X + C^{k_l \times p}[l] \cdot \mathbf{1}\big)[i] \\
&= \sigma\Big(\sum_{k_{l-1}=1}^{K_{l-1}} W^{k_l \times p,\, k_{l-1} \times p} \cdot X[k_{l-1}, p, :] + C^{k_l \times p,\, k_{l-1} \times p}\Big)[i] \\
&= \sigma\Big(\sum_{k_{l-1}=1}^{K_{l-1}} \operatorname{diag}(w_l[k_l, k_{l-1}]) \cdot X[k_{l-1}, p, :] + \frac{b_l[k_l]}{K_{l-1}}\Big)[i] \\
&= \sigma\Big(b_l[k_l] + \sum_{k_{l-1}=1}^{K_{l-1}} w_l[k_l, k_{l-1}] \cdot X[k_{l-1}, p, i]\Big) \\
&= \sigma\big(w_l[k_l, :] \cdot X[:, p, i] + b_l[k_l]\big) = f_l(X[:, p, i])[k_l].
\end{aligned}
$$
That is, $F_l(X)[:, p, i] = f_l(X[:, p, i])$. Thus, for $X \in \mathbb{R}^{P \times N}$ and $p = 1, \cdots, P$, $i = 0, \cdots, N-1$,
$$F(X)[:, p, i] = f_{2Q+2}\big(F_{2Q+1} \circ \cdots \circ F_1(X)[:, p, i]\big) = \cdots = f_{2Q+2} \circ \cdots \circ f_1(X[p, i]) = f(X[p, i]).$$
For $Z \in [2^{Q+1}]^{P \times N}$, $F(Z) = \eta(Z)$.

C Proof of Lemma 2

Assume a network $F = F_2 \circ F_1$ in the form of Eq. (2) with ReLU activations and $n_0 = G$, $n_1 = S$, $n_2 = 1$ satisfies that for $X \in B^{G \times N}$,
$$F(X) = \frac{1}{S} \sum_{s=1}^{S} \sigma\big(W^s[1] \cdot X + C^s[1] \cdot \mathbf{1}\big). \qquad (4)$$
Here, the weights and biases in $F_2$ degenerate as
$$W[2] = W^1[2] = \Big(\frac{1}{S}I, \cdots, \frac{1}{S}I\Big), \qquad C[2] = 0.$$
For convenience, in the rest of this section we simplify $W^s[1], C^s[1]$ to $W^s, C^s$. The following result is well known.


Lemma 5. Let $\mathcal{B} = \{Z_s \mid s = 1, 2, \cdots, S\} \subset B^{G \times N}$ be a $G$-channel aperiodic binary dataset, and let $\|Z\|$ be the $L_2$-norm of $Z$.
1) $T^0(Z_s) \cdot Z_s = \|Z_s\|^2$.
2) $T^M(Z_s) \cdot Z_t \le \|Z_s\|^2 \le (GN)^2$ for any $M \in \mathbb{Z}^d$.
3) For any $M \in \mathbb{Z}^d$ with $M \bmod (n_1, n_2, \cdots, n_d) \ne 0$, $T^M(Z_s) \cdot Z_s \le \|Z_s\|^2 - 1$.
4) If $Z_s \ne Z_t$, $T^M(Z_s) \cdot Z_t \le \|Z_t\|^2 - 1$ for any $M \in \mathbb{Z}^d$.
5) If $\|Z_s\| > \|Z_t\|$, $\|Z_s\| \ge \sqrt{\|Z_t\|^2 + 1} \ge \|Z_t\| + \frac{1}{2GN}$.

The $i$-th component of $F(Z)$ in Eq. (4) is
$$F(Z)[i] = \frac{1}{S} \sum_{k=1}^{S} \sigma\big(W_i^k \cdot Z + C^k \cdot \mathbf{1}\big),$$
where $W_i^k = (W_i^{k,1}, \cdots, W_i^{k,G}) \in \mathbb{R}^{G \times N}$ and $W_i^{k,r}$ is the $i$-th row of $W^{k,r}$. Recall that each circular filter $W^{k,r} \in \mathbb{R}^{N \times N}$ in Eq. (4) is determined by its first row $W_0^{k,r} \in \mathbb{R}^N$ with $W_{\delta(M)}^{k,r} = T^M(W_0^{k,r})$, and the biases $C^{k,r}$ are actually scalars.

Lemma 6. Let $\mathcal{B} = \{Z_s \mid s = 1, 2, \cdots, S\} \subset B^{G \times N}$ be a $G$-channel aperiodic binary dataset. Endow $\mathcal{B}$ with an order such that $s \ge t \iff \|Z_s\| \ge \|Z_t\|$. Construct the filters and biases in Eq. (4) as
$$W_0^{s,r} = \frac{Z_s^r}{\|Z_s\|}, \qquad C^{s,r} = \frac{1}{G(2GN+1)} - \frac{\|Z_{s-1}\|}{G},$$
for $s = 1, \cdots, S$, $r = 1, \cdots, G$, and set $\|Z_0\| = \frac{1}{2GN+1}$. Then,
a) if $t < s$, then $\sigma(W_i^s \cdot Z_t + C^s \cdot \mathbf{1}) = 0$, $i = 0, 1, \cdots, GN-1$;
b) if $t = s$, then $\sigma(W_0^s \cdot Z_s + C^s \cdot \mathbf{1}) - \sigma(W_i^s \cdot Z_s + C^s \cdot \mathbf{1}) > \frac{1}{2GN+1}$, $i = 1, 2, \cdots, GN-1$;
c) if $t > s$, then $\sigma(W_i^s \cdot Z_t + C^s \cdot \mathbf{1}) < GN$.

s s M Wδ(M ) · Zt + C · 1 = T (Zs ) · Zt / Zs +

600

Y. Wang et al.

If Zs = Zt , T M (Zs ) · Zt / Zs ≤ Zt − s s Wδ(M ) · Zt + C · 1 ≤

1 , Zt

1 1 − < 0. 2GN + 1 Zt

If Zs > Zt , T M (Zs ) · Zt / Zs ≤

Zt 2 1 , Zt + 2GN

Zt 2 1 − Zt + 1 2GN + 1 Zt + 2GN 1 1 − = 2GN + 1 2GN + 1/ Zt < 0.

s s Wδ(M ) · Zt + C · 1 ≤

Thus, for all M ∈ Zd , s s σ(Wδ(M ) · Zt + C · 1) = 0.

b) We have 1 , 2GN + 1 1 s s M Wδ(M − Zs−1 ) · Zs + C · 1 = T (Zs ) · Zs / Zs + 2GN + 1 Zs 2 − 1 1 ≤ + − Zs−1 Zs 2GN + 1 1 1 = Zs − Zs−1 + − . 2GN + 1 Zs W0s · Zs + C s · 1 = Zs − Zs−1 +

Since Zs − Zs−1 ≥ 0 and

1 1 − < 0, 2GN + 1 Zs

we have σ(Zs  − Zs−1  +

1 1 1 1 ) − σ(Zs  − Zs−1  + − )≥ . 2GN + 1 2GN + 1 Zs  2GN + 1

Restore Translation Using Equivariant Neural Networks

601

c) Assuming t > s, we have s s M Wδ(M ) · Zt + C · 1 = T (Zs ) · Zt / Zs +

≤ Zs −

1 − Zs−1 2GN + 1

1 1 + − Zs−1 Zs 2GN + 1

< Zs ≤ GN, s s σ(Wδ(M ) · Zt + C · 1) < GN.

Lemma 2. Let $\mathcal{B} = \{Z_s \mid s = 1, 2, \cdots, S\} \subset B^{G \times N}$ be an aperiodic binary dataset. Then there exists a 2-layer network $F$ in the form of Eq. (2) with ReLU activations and width at most $SN$ such that for all $s = 1, 2, \cdots, S$,
$$F(Z_s)[0] = \max_{i=0}^{N-1} F(Z_s)[i].$$

Proof. Without loss of generality, we assign an order to the dataset such that $s \ge t \iff \|Z_s\| \ge \|Z_t\|$. We set $\alpha \ge 1 + GN + 2G^2N^2$ and construct $F$ as in Eq. (4) such that
$$W_0^{s,r} = \frac{\alpha^{s-1} Z_s^r}{\|Z_s\|}, \qquad C^{s,r} = \frac{\alpha^{s-1}}{G(2GN+1)} - \frac{\alpha^{s-1}\|Z_{s-1}\|}{G},$$
for $s = 1, \cdots, S$, $r = 1, \cdots, G$, and set $\|Z_0\| = \frac{1}{2GN+1}$. From Lemma 6, we have for $i = 1, 2, \cdots, GN-1$,
$$
\begin{aligned}
S\big(F(Z_t)[0] - F(Z_t)[i]\big) &= \sum_{s=1}^{S} \alpha^{s-1}\big[\sigma(W_0^s \cdot Z_t + C^s) - \sigma(W_i^s \cdot Z_t + C^s)\big] \\
&= \sum_{s=1}^{t} \alpha^{s-1}\big[\sigma(W_0^s \cdot Z_t + C^s) - \sigma(W_i^s \cdot Z_t + C^s)\big] \\
&> \frac{\alpha^{t-1}}{2GN+1} + \sum_{s=1}^{t-1} \alpha^{s-1}\big[\sigma(W_0^s \cdot Z_t + C^s) - \sigma(W_i^s \cdot Z_t + C^s)\big] \\
&> \frac{\alpha^{t-1}}{2GN+1} - GN \sum_{s=1}^{t-1} \alpha^{s-1} \\
&= \frac{\alpha^{t-1}}{2GN+1} - \frac{GN(1-\alpha^{t-1})}{1-\alpha} \\
&= \frac{(\alpha - 2G^2N^2 - GN - 1)\,\alpha^{t-1} + 2G^2N^2 + GN}{(2GN+1)(\alpha-1)} \\
&> 0.
\end{aligned}
$$

D Experimental Settings

For CIFAR-10, we constantly pad 4 pixels with value 0 around the images. For MNIST, we resize the images to 32 × 32. For 3D-MNIST, we voxelize the dataset and constantly pad 8 pixels with value 0 around the volumes. We leverage restorers with 6 layers. In each layer, we use a sparse circular filter, e.g., with kernel size 9. Each layer outputs only one channel and has no bias parameter.
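For reference, a translation estimator with these settings might look roughly like the following PyTorch sketch; the 3 × 3 kernel (our reading of "kernel size 9" for 2D inputs), the channel counts, and all names are assumptions for illustration, not the exact implementation used here.

```python
import torch
import torch.nn as nn

class TranslationEstimator(nn.Module):
    """Six circular-convolution layers, one output channel each, no bias, ReLU activations.
    Circular padding keeps every layer strictly translation-equivariant."""
    def __init__(self, in_channels=1, depth=6):
        super().__init__()
        layers = []
        for l in range(depth):
            layers += [
                nn.Conv2d(in_channels if l == 0 else 1, 1, kernel_size=3, padding=1,
                          padding_mode="circular", bias=False),
                nn.ReLU(),
            ]
        self.net = nn.Sequential(*layers)

    def forward(self, x):                 # x: (batch, channels, H, W)
        return self.net(x).flatten(1)     # flat output; its argmax encodes delta(M)
```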

References

1. Azulay, A., Weiss, Y.: Why do deep convolutional networks generalize so poorly to small image transformations? J. Mach. Learn. Res. 20, 1–25 (2019)
2. Cohen, T., Welling, M.: Group equivariant convolutional networks. In: International Conference on Machine Learning, pp. 2990–2999. PMLR (2016)
3. Engstrom, L., Tran, B., Tsipras, D., Schmidt, L., Madry, A.: A rotation and a translation suffice: fooling CNNs with simple transformations. arXiv preprint (2018)
4. Esteves, C., Allen-Blanchette, C., Zhou, X., Daniilidis, K.: Polar transformer networks. arXiv preprint arXiv:1709.01889 (2017)
5. Fukushima, K.: Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol. Cybern. 36, 193–202 (1980)
6. Gens, R., Domingos, P.M.: Deep symmetry networks. Adv. Neural Inf. Process. Syst. 27 (2014)
7. Gholamalinezhad, H., Khosravi, H.: Pooling methods in deep neural networks, a review. arXiv preprint arXiv:2009.07485 (2020)
8. Ghosh, R., Gupta, A.K.: Scale steerable filters for locally scale-invariant convolutional neural networks. arXiv preprint arXiv:1906.03861 (2019)
9. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
10. Henriques, J.F., Vedaldi, A.: Warped convolutions: efficient invariance to spatial transformations. In: International Conference on Machine Learning, pp. 1461–1469. PMLR (2017)
11. Jaderberg, M., Simonyan, K., Zisserman, A., et al.: Spatial transformer networks. Adv. Neural Inf. Process. Syst. 28 (2015)
12. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
13. Lenc, K., Vedaldi, A.: Understanding image representations by measuring their equivariance and equivalence. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 991–999 (2015)
14. Lenc, K., Vedaldi, A.: Learning covariant feature detectors. In: Hua, G., Jégou, H. (eds.) ECCV 2016. LNCS, vol. 9915, pp. 100–117. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-49409-8_11
15. Marcos, D., Kellenberger, B., Lobry, S., Tuia, D.: Scale equivariance in CNNs with vector fields. arXiv preprint arXiv:1807.11783 (2018)
16. Shen, X., Tian, X., He, A., Sun, S., Tao, D.: Transform-invariant convolutional neural networks for image classification and search. In: Proceedings of the 24th ACM International Conference on Multimedia, pp. 1345–1354 (2016)


17. Shorten, C., Khoshgoftaar, T.M.: A survey on image data augmentation for deep learning. J. Big Data 6(1), 1–48 (2019)
18. Sohn, K., Lee, H.: Learning invariant representations with local transformations. arXiv preprint arXiv:1206.6418 (2012)
19. Srivastava, N.: Improving neural networks with dropout. Univ. Toronto 182(566), 7 (2013)
20. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929–1958 (2014)
21. Xu, Y., Xiao, T., Zhang, J., Yang, K., Zhang, Z.: Scale-invariant convolutional neural networks. arXiv preprint arXiv:1411.6369 (2014)
22. Zhang, X., Yu, F.X., Karaman, S., Chang, S.F.: Learning discriminative and transformation covariant local feature detectors. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6818–6826 (2017)

SDPSAT: Syntactic Dependency Parsing Structure-Guided Semi-Autoregressive Machine Translation

Xinran Chen, Yuran Zhao, Jianming Guo, Sufeng Duan, and Gongshen Liu(B)

School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China
{jasminechen123,zyr527,jsguojianming,1140339019dsf,lgshen}@sjtu.edu.cn

Abstract. The advent of non-autoregressive machine translation (NAT) significantly accelerates decoding compared with autoregressive machine translation (AT), while bringing about a performance decrease. Semi-autoregressive neural machine translation (SAT), as a compromise, enjoys the merits of both autoregressive and non-autoregressive decoding. However, current SAT methods face the challenges of information-limited initialization and rigorous termination. This paper develops a layer-and-length-based syntactic labeling method and introduces a syntactic dependency parsing structure-guided two-stage semi-autoregressive translation (SDPSAT) structure, which addresses the above challenges with a syntax-based initialization and termination. Additionally, we also present a Mixed Training strategy to shrink exposure bias. Experiments on six widely-used datasets reveal that our SDPSAT surpasses traditional SAT models with reduced word repetition and achieves competitive results with the AT baseline at a 2× ∼ 3× speedup.

Keywords: Non-autoregressive · Machine translation · Syntactic dependency parsing

1 Introduction

While autoregressive neural machine translation (AT) maintains cutting-edge performance, its applications in large-scale and real-time scenarios are severely restricted by the slow inference speed [3,4]. In contrast, non-autoregressive (NAT) models, based on the independence hypothesis, significantly increase inference speed through parallel decoding but experience reduced performance [4,5,10]. As a compromise between AT and NAT, semi-autoregressive (SAT) models [15,21] utilize both autoregressive and non-autoregressive properties in their decoding, in which SAT models not only capture target-side dependencies more effectively than NAT models, but also enhance translation efficiency beyond the AT model [15,21,22]. The primitive equal-length segmented SAT models in [15], which decode sentence segments non-autoregressively and generate words within those segments
autoregressively, adopt the independence assumption and encounter the multi-modality problem, i.e., the multi-modal distribution of target translations is difficult to capture [4]. To handle the multi-modality errors in the primitive SAT with equal-length segmentation, [15] further proposes to expose the model to some unequal-length segmented samples in training and to remove duplicate segments in inference. However, its mixed segmentation criterion is not clear enough, and its additional operation to remove repetitive segments is not straightforward. Section 5 shows detailed translation examples. Different from [15], we attribute the multi-modality problem of SAT to two limitations: (i) information-limited initialization: during decoding, the SAT decoder, which is initialized by a sequence of [BOS], lacks instructions on the subsequent prediction; (ii) rigorous termination: for SAT, dividing sentences into equal-length segments is the simplest and most time-efficient choice [15], but it leads SAT to learn a rigid equal-length termination pattern. The multi-modality problem leads to repeated or absent tokens. For example, in the translation "[BOS] There are lots of [BOS] of flowers outside.", the first segment decodes the final word "of" to keep the same length as the second segment, which starts decoding from "of", and this results in repetition. The same factors also contribute to token missing, as in "[BOS] There are lots [BOS] flowers outside.", where "of" is omitted. Therefore, making initialization more informative and termination more reasonable in SAT is worth exploring.

In this work, we present a Syntactic Dependency Parsing structure-guided Semi-Autoregressive Translation model (SDPSAT) to overcome the two limitations above. In a syntactic dependency parsing tree, each branch corresponds to one sentence segment. As decoding branches from the root, the corresponding sentence segments are decoded in parallel, which fits well with the globally non-autoregressive and locally autoregressive decoding scheme in SAT. Inspired by the decoding of the syntactic dependency parsing tree and of SAT, we design layer-and-length-based tree transformation and traversal techniques to generate the syntactic structure labels. Specifically, the syntactic labels in the syntactic tree serve two primary functions. Firstly, the syntactic structure labels are used as prediction guides, which offer content guidance for improved initialization during subsequent inference. Secondly, the syntactic structure labels also determine the termination criteria, in which sentences are divided into semantically consecutive segments throughout the training process. Additionally, we provide a Mixed Training technique to shrink the exposure bias between training and inference. Our SDPSAT delivers the best translation quality within the SAT group and achieves comparable translation quality with AT baselines on six benchmarks with a 2× ∼ 3× speedup. Furthermore, SDPSAT achieves a low word repetition rate of about 0.30%–0.60%. The following is a summary of our contributions: (i) We design a layer-and-length-based syntactic labeling method to integrate the syntactic dependency parsing structure into SAT, which allows for a more flexible termination and a better initialization for the translation decoder. (ii) We employ a Mixed Training strategy to mitigate the discrepancy between training and inference, ultimately enhancing translation performance.


(iii) According to experimental results on six widely-used datasets, SDPSAT not only expedites decoding but also improves translation quality with reduced repetition compared to NAT competitors.

2 Related Works

NAT accelerates machine translation inference by adopting the independence hypothesis. However, NAT faces a serious multi-modality problem [3,4], and numerous approaches have been presented to deal with it. Based on the decoding strategy, existing NAT works mainly fall into three categories: iterative NAT, fully NAT, and SAT [22]. Iterative NAT refines the translation in multiple steps at the cost of inference speed [3,5,6]. Fully NAT maintains a speed advantage by decoding in a single round and handles the multi-modality problem with latent variables [1,10,16,27], new training objectives [8,23], and novel model architectures [26]. SAT [15,21] combines the properties of fully NAT and AT in decoding. [21] first proposes Semi-NAT with a globally AT but locally NAT decoding style, while [15] designs Semi-NAT with globally NAT but locally AT decoding methods and recovers from the repetition errors. The decoding pattern of our model aligns with that of [15].

As a fundamental technology for natural language processing (NLP), syntactic parsing analyzes the syntactic dependency between sentence components [2,7,25]. Previous researchers have introduced syntactic information to enhance NAT. [9] integrated syntactic tags at the word embedding level for NAT. [23] designed a novel training objective to eliminate the issue of syntactic multi-modality. [1] enhanced NAT with constituency parsing information. However, integrating syntactic dependency parsing into SAT is still uncharted territory. Different from the prior studies, we first present an innovative two-stage syntactic dependency parsing-based SAT framework. Our model is closely related to [1], but we differ in both the modeling of syntactic structure and the decoding scheme: (i) they introduce constituency labels with chunk size, while we focus on dependency parsing and design a layer-and-length-based traversal to obtain syntactic labels with depth, which shrinks the syntactic vocabulary and is more conducive to model learning; (ii) they adopt a single-round Mask-Predict decoding with length-predetermined constituency, while we insist on Semi-NAT decoding to better comprehend the target-side dependency within segments and enjoy a more flexible termination, which ultimately resolves the multi-modality problem.

3 Methodology

This section introduces SDPSAT in detail, including the layer-and-length-based method to obtain dependency labels from the syntactic dependency tree, the two-stage decoding of SDPSAT, and the Mixed Training strategy.

3.1 Layer-and-Length-Based Syntactic Labeling

We design the layer-and-length-based method to get the syntactically annotated sentence before training, and the syntactic labels are used to supervise the training of the parse decoder. Figure 1 gives an illustration of the whole process.

[Figure 1 panels: (1) Syntactic Dependency Parsing (Tree A); (2) Tree Transformation; (3) Depth-first Traversal (Tree B), shown with L = 1 and with L = 2, M = 2.]

Fig. 1. Performing layer-and-length-based syntactic labeling for the input sentence. The layer and length denote the deepest dividing layer L and the maximum segment size M of the dependency parsing tree, respectively.

Dependency Parsing. Before tree transformation and traversal, we conduct syntactic dependency parsing. Each sentence is parsed into several tuples, which can be expressed formally as "relation (governor, dependent)", where the governor represents the header, the dependent is the modifier of the header, and the relation represents the syntactic dependency between the header and the dependent. Inspired by [7], we treat syntactic labels (relation) and words (governor, dependent) equally as tree nodes and represent each word node with its token and positional index (i.e., It-1 in Fig. 1). The syntactic parsing tree is depicted in Tree A of Fig. 1. We use the stanza toolkit [13] to parse the target sentences, which provides leading-edge syntactic dependency parsing pre-trained models for 66 languages with highly accurate performance.
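The tuples described above can be extracted roughly as in the following sketch; the pipeline options and the example sentence are our own, and the exact Stanza configuration used for the experiments is not specified here.

```python
import stanza

# One-time model download is assumed: stanza.download("en")
nlp = stanza.Pipeline(lang="en", processors="tokenize,pos,lemma,depparse")
doc = nlp("It is a lovely day.")

for sent in doc.sentences:
    for word in sent.words:
        # governor: the head word (or ROOT when head == 0); dependent: the word itself
        governor = sent.words[word.head - 1].text if word.head > 0 else "ROOT"
        print(f"{word.deprel}({governor}-{word.head}, {word.text}-{word.id})")
```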

Tree Transformation and Traversal. The word and label nodes alternate in the original syntactic tree branches, making the tree structure complicated. To match our proposed two-stage SAT decoding, we simplify the tree based on the deepest dividing layer L and the maximum segment size M. The label nodes deeper than L will be ignored in the traversal for simplification. If an (L−1)-layer label node (note that L > 1) has M or more leaves, it will be further divided according to its subsequent L-layer label nodes. Restricted by the longest sentence segment, the SAT decoding can be accelerated with a larger L and an appropriate M.

Then we detail the tree transformation and traversal process. First, we move the word nodes to leaf nodes and keep the sentence sequential information for the subsequent traversal, following [7]. In particular, if the positional index of a label's child is greater than that of its parent, the label node is adjusted to the right side of its parent, and vice versa. Second, based on L and M, we conduct a depth-first traversal on the transformed tree to get the annotated sequence. Specifically, when L > 1, we append the layer number of label nodes after the label to distinguish the layer (e.g., [nsubj:1], [amod:2]). Tree B in Fig. 1 shows the process, and a simplified traversal sketch is given below.
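A minimal sketch of the depth-first labeling, assuming a simple node structure (`label` on relation nodes, `token` on word leaves, `children` ordered left to right); the M-based further division is omitted here and all names are illustrative.

```python
class Node:
    def __init__(self, label=None, token=None, children=None):
        self.label = label            # set on relation (label) nodes
        self.token = token            # set on word leaves
        self.children = children or []

def traverse(node, L, depth=1, out=None):
    """Depth-first traversal of the transformed tree: word tokens are always emitted,
    label nodes deeper than L are dropped, and the layer index is appended when L > 1."""
    out = [] if out is None else out
    if node.token is not None:
        out.append(node.token)
        return out
    if depth <= L:
        out.append(f"[{node.label}:{depth}]" if L > 1 else f"[{node.label}]")
    for child in node.children:
        traverse(child, L, depth + 1, out)
    return out
```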

Fig. 2. An encoder and two decoders constitute SDPSAT. The translation decoder translates semi-autoregressively based on the syntactic structure given by the parse decoder. The parse decoder can be either AT or NAT. We display the NAT one here.

3.2 Two-Stage Decoding

As depicted in Fig. 2, the first SDPSAT decoding stage is to use the parse decoder to predict the syntactic structure of the target, and the second stage is to utilize the translation decoder to take the previous syntactic structure as the initialization and generate segments simultaneously.

Stage 1: Syntactic Labels Prediction. In this stage, we predict the syntactic labels of the transformed syntactic dependency parsing tree, which serve as syntactic structure hints for the next decoding stage. We design two feasible decoding schemes for this stage:

(i) Fully NAT. We adopt Mask-Predict [3] to predict the label nodes of the transformed tree. Following the setting of [3], we employ a length predictor in the encoder to determine the length of the sentence before prediction. With the source sentence X, the probability distribution of the corresponding syntactic label sequence Z is:
$$P(Z|X) = P(n|X) \prod_{t=1}^{n} P(z_t|X), \qquad (1)$$

where n refers to the length of the syntactic label sequence. During training, we mask a specific percentage of tokens randomly and calculate the loss of the observed tokens, following [3]. As for inference, the parse decoder receives a sequence of [MASK] as input and generates the syntactic label sequence.
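Under Eq. (1), the score of a candidate label sequence decomposes into a length term and position-wise independent label terms; a rough PyTorch-style log-probability sketch (tensor shapes and names are our assumptions, for illustration only) looks as follows.

```python
import torch

def parse_logprob(length_logits, label_logits, labels):
    """log P(Z|X) = log P(n|X) + sum_t log P(z_t|X) for the NAT parse decoder of Eq. (1).
    length_logits: (max_len,), label_logits: (n, vocab), labels: (n,) -- illustrative shapes."""
    n = labels.shape[0]
    log_p_n = torch.log_softmax(length_logits, dim=-1)[n]
    log_p_z = torch.log_softmax(label_logits, dim=-1)
    token_logprobs = log_p_z.gather(-1, labels.unsqueeze(-1)).squeeze(-1)
    return log_p_n + token_logprobs.sum()
```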


(ii) Fully AT. A light fully-AT decoder, which only contains two layers, is also introduced to generate the syntactic label sequence, and its conditional probability can be expressed as:
$$P(Z|X) = \prod_{t=1}^{n} P(z_t \mid z_{<t}, X)$$