Neural Information Processing: 30th International Conference, ICONIP 2023, Changsha, China, November 20–23, 2023, Proceedings, Part II (Lecture Notes in Computer Science, 14448) [1 ed.] 981998081X, 9789819980819

The six-volume set LNCS 14447–14452 constitutes the refereed proceedings of the 30th International Conference on Neural Information Processing, ICONIP 2023, held in Changsha, China, during November 20–23, 2023.


English · 609 pages · 2023



Table of contents :
Preface
Organization
Contents – Part II
Theory and Algorithms
Distributed Nash Equilibrium Seeking of Noncooperative Games with Communication Constraints and Matrix Weights
1 Introduction
2 Problem Formulation
3 Main Results
3.1 Distributed Nash Equilibrium Seeking Algorithm Under Intermittent Communication
3.2 Convergence Analysis
4 Numerical Example
5 Conclusion
References
Accelerated Genetic Algorithm with Population Control for Energy-Aware Virtual Machine Placement in Data Centers
1 Introduction
2 Insights into GA Computation
3 Adaptive Population Control
3.1 Basic Architecture
3.2 Managing the Population Size in GA
3.3 Updating Systematical Parameters Dynamically
3.4 Population Control Scheme
4 Simulation Experiments
4.1 Experimental Design
4.2 Results for Large-Scaled Data Center
4.3 Results for Medium-Scaled Data Center
4.4 Additional Computation Demand in Each Generation
5 Conclusion
References
A Framework of Large-Scale Peer-to-Peer Learning System
1 Introduction
2 Preliminaries
2.1 P2PL Paradigm
2.2 Secure Multi-party Computation
2.3 Asynchronous Federated Learning
3 P2PL System
3.1 The Architecture of P2PLSys
3.2 The Overall Process of P2PLSys
3.3 The Modules of the Server
3.4 The Modules of Peer Nodes
4 Experiments
4.1 Experimental Design
4.2 Dataset and Neural Network Architectures
4.3 MNIST Classification
4.4 CIFAR10 Classification
5 Conclusion
References
Optimizing 3D UAV Path Planning: A Multi-strategy Enhanced Beluga Whale Optimizer
1 Introduction
2 Related Work
2.1 3D Path Planning Model for UAVs
2.2 Beluga Whale Optimization
3 Multi-strategy Enhanced Beluga Whale Optimizer
3.1 Elitist Group Genetic Strategy
3.2 Adaptive Gauss Variational Operator
3.3 Random Opposition-Based Learning Strategy
4 Validation Experiments
5 OGGBWO Algorithm for 3D UAV Path Planning
6 Conclusion
References
Interactive Attention-Based Graph Transformer for Multi-intersection Traffic Signal Control
1 Introduction
2 Related Work
2.1 Traffic Signal Control Based on DRL
2.2 Traffic Signal Control Based on GRL
3 Problem Description
4 Methodology
4.1 Framework of GTLight
4.2 Node Initialization Module
4.3 Graph Transformer Network Module
4.4 Phase-Timing Optimization Module
5 Experiments
5.1 Experimental Setup
5.2 Comparison Algorithms
5.3 Evaluation Indicator
5.4 Experimental Results
5.5 Study of GTLight
6 Conclusion
References
PatchFinger: A Model Fingerprinting Scheme Based on Adversarial Patch
1 Introduction
2 Related Work
2.1 Model Watermarking
2.2 Model Fingerprinting
3 Problem Definition
3.1 Threat Model
3.2 Fingerprinting DNN Models
3.3 Adversarial Techniques
4 Method
4.1 Patch Injection
4.2 Fingerprint Generation
5 Experiments
5.1 Experimental Setup
5.2 Performance Metrics
5.3 Overall Results of Baseline Testing for PatchFinger
5.4 Specific Results of Adversarial Techniques
5.5 Average Accuracy
6 Conclusion
References
Attribution of Adversarial Attacks via Multi-task Learning
1 Introduction
2 Related Work
3 Methodology
3.1 Multi-task Adversarial Attribution (MTAA)
3.2 Uncertainty Weighted Losses
4 Experiments
4.1 Experimental Setup
4.2 Attribution Experimental Results
4.3 Scalability of MTAA
4.4 Ablation Study
5 Conclusion
References
A Reinforcement Learning Method for Generating Class Integration Test Orders Considering Dynamic Couplings
1 Introduction
2 Related Work
3 Method
3.1 Dynamic Coupling
3.2 Stubbing Complexity
3.3 CITO Generation Based on Reinforcement Learning
4 Experiments
4.1 Experimental Settings
4.2 Results and Analysis
5 Conclusions
References
A Novel Machine Learning Model Using CNN-LSTM Parallel Networks for Predicting Ship Fuel Consumption
1 Introduction
2 Related Study
3 Methodology
3.1 Overview of SFC Prediction
3.2 Modeling of SFC Prediction
4 Experiments
4.1 Data Processing
4.2 Numerical Experiments
4.3 Parameter Tuning and Optimization
5 Conclusions
References
Two-Stage Attention Model to Solve Large-Scale Traveling Salesman Problems
1 Introduction
2 Related Works
2.1 Learn-to-Construct
2.2 Learn-to-Improve
3 Methods
3.1 Two-Stage Attention Model
3.2 Solution Construction
3.3 Improving Strategy
4 Experiments
4.1 Hyper-parameters and Baselines
4.2 Datasets Setup
4.3 Comparative Study
4.4 Ablation Study
5 Conclusions
References
Learning Primitive-Aware Discriminative Representations for Few-Shot Learning
1 Introduction
2 Related Work
3 Method
3.1 Description and Formulation of Few-Shot Learning
3.2 Primitive Mining Network
3.3 Correlation Reasoning Network
3.4 Loss and Train
4 Experiments
4.1 Dataset Description
4.2 Implementation Details
4.3 General Few-Shot Classification Results
4.4 Fine-Grained Few-Shot Classification Results
4.5 Qualitative Analysis
4.6 Ablation Study
5 Conclusion
References
Time-Series Forecasting Through Contrastive Learning with a Two-Dimensional Self-attention Mechanism
1 Introduction
2 Related Works
2.1 Contrastive Learning
2.2 Self-attention Mechanism
2.3 Data Pruning
3 Methodology
3.1 LSTM Adaptive Pruning Method
3.2 Two-Dimensional Feature Extraction Module
4 Experiments
4.1 Experimental Setup
4.2 The Detail of Dataset
4.3 Experimental Results
4.4 Ablation Study
5 Conclusions
References
Task Scheduling with Multi-strategy Improved Sparrow Search Algorithm in Cloud Datacenters
1 Introduction
2 Problem Description
2.1 Encoder and Decoder
2.2 Objective Function
2.3 Fitness Function
3 Proposed Algorithm
3.1 Strategy 1: PWLCM Mapping
3.2 Strategy 2: Discoverer Strategy
3.3 Strategy 3: Joiner Strategy
4 Simulation Results
4.1 Parameter Setting
4.2 Numerical Analysis
5 Conclusion
References
Advanced State-Aware Traffic Light Optimization Control with Deep Q-Network
1 Introduction
2 Related Works
3 Algorithm Model
3.1 State
3.2 Action
3.3 Reward
4 Traffic Signal Control Based on 3DRQN-AM
4.1 Network Model and Algorithm for 3DRQN-AM
5 Experiment
5.1 Simulation Environment and Hyperparameter Setting
5.2 Experimental Evaluation and Analysis of Results
6 Summary
References
Impulsive Accelerated Reinforcement Learning for H∞ Control
1 Introduction
2 System Description and Preliminaries
3 Critic Dynamics
3.1 Critic Dynamic via Impulsive Momentum-Based Control
4 Simulation Results
5 Conclusion
References
MRRC: Multi-agent Reinforcement Learning with Rectification Capability in Cooperative Tasks
1 Introduction
2 Background
2.1 Dec-POMDP
2.2 MERL and IRAT
2.3 MADDPG, QMIX and FACMAC
3 Method
3.1 Individual Reward Rectification
3.2 Mix Network Rectification
3.3 Discrete Action Rectification
4 Experiments and Results
4.1 Predator-Prey
4.2 More Challenging SMAC
5 Conclusion
References
Latent Causal Dynamics Model for Model-Based Reinforcement Learning
1 Introduction
2 Preliminary
2.1 Reinforcement Learning
2.2 Structural Causal Model
3 Latent Causal Dynamics Model Learning
3.1 Latent Causal Dynamics Learning
3.2 Policy Learning
4 Experiments
4.1 Environments and Baselines
4.2 Q1: Performance Comparison
4.3 Q2: Will the Learned Representations Be Useful for Our Dynamics Model?
4.4 Q3: Is the Learned Causal Structure Between Representations Helpful for Our Method?
5 Conclusion
References
Gradient Coupled Flow: Performance Boosting on Network Pruning by Utilizing Implicit Loss Decrease
1 Introduction
2 Related Work
3 Gradient Coupled Flow
3.1 Implicit Loss Decrease on Data to Be Trained
3.2 A Criterion Sensitive to Gradient Coupled Flow
3.3 A Flattened Variant of GCS to Get Subdivided Coupled Flows
4 Experiments
4.1 Performance of GCS in Image Classification
4.2 Evaluate Model Generalization with Implicit Loss and Weight Reduction
5 Conclusion
References
Motif-SocialRec: A Multi-channel Interactive Semantic Extraction Model for Social Recommendation
1 Introduction
2 Related Work
2.1 Social Recommendation
2.2 Motif
2.3 Self-supervised Learning
3 Model
3.1 Preliminaries
3.2 Overall Framework
3.3 Motif Information Extraction
3.4 Representation Generation
3.5 Preference Prediction
3.6 Optimization Model
4 Experiments and Results
4.1 Experimental Settings
4.2 Recommendation Performance (RQ1)
4.3 Parameter Analysis (RQ2)
5 Conclusion and Future Work
References
Dual Channel Graph Neural Network Enhanced by External Affective Knowledge for Aspect Level Sentiment Analysis
1 Introduction
2 Related Work
2.1 Early Machine Learning Methods
2.2 Neural Network Methods
2.3 Pre-trained Models
2.4 Graph Neural Network
2.5 External Commonsense Knowledge
3 Methodology
3.1 Problem Definition
3.2 Model Overview
3.3 Input Module
3.4 Knowledge-Enhanced Semantic Channel
3.5 Knowledge-Enhanced Semantic Channel
3.6 Handling the Conflict of Different Word Segmentation Methods
3.7 Channel Fusion
3.8 Classification Model
3.9 Loss Function
4 Experiments
4.1 Data Sets
4.2 Implementation Details
4.3 Baselines
4.4 Results Comparison
4.5 Ablation Experiment
4.6 Case Study
5 Conclusion
References
New Predefined-Time Stability Theorem and Applications to the Fuzzy Stochastic Memristive Neural Networks with Impulsive Effects
1 Introduction
2 Problem Statement and Preliminaries
3 Main Results
3.1 Predefined-Time Stability
3.2 Predefined-Time Synchronization
4 Numerical Examples
5 Conclusion
References
FE-YOLOv5: Improved YOLOv5 Network for Multi-scale Drone-Captured Scene Detection
1 Introduction
2 Related Works
2.1 Object Detection in Drone-Captured Scenes
2.2 Multi-scale Feature Fusion
3 Proposed Method
3.1 The FE-YOLOv5 Network Framework
3.2 Detection Head for Tiny Objects
3.3 MSF-BiFPN
3.4 Adaptive Attention Module (AAM)
3.5 Feature Enhancement Module (FEM)
3.6 Normalized Wasserstein Distance (NWD)
4 Experiments and Analysis
4.1 Experimental Setting and Evaluation Criteria
4.2 Ablation Studies
4.3 Comparative Experiments
5 Conclusion
References
An Improved NSGA-II for UAV Path Planning
1 Introduction
1.1 Research Background
2 Review of Prior Work
3 Methodology
3.1 NSGA-II Algorithm
3.2 Improvements to the Algorithm
4 Results
5 Conclusion
References
Reimagining China-US Relations Prediction: A Multi-modal, Knowledge-Driven Approach with KDSCINet
1 Introduction
2 Related Work
2.1 Statistical Models
2.2 Data-Driven Models
2.3 Knowledge-Driven Models
3 The KDSCINet Framework
3.1 Overview
3.2 CU-GDELT
3.3 MKGformer-Based Embedding Layer
3.4 Attention-Based SCINet
3.5 Complexity Analysis
4 Experiments
4.1 Baselines and Settings
4.2 Result Analysis
5 Conclusion
References
A Graph Convolution Neural Network for User-Group Aided Personalized Session-Based Recommendation
1 Introduction
2 Related Works
3 Method
3.1 Preliminaries
3.2 User-Item Group Graph Based GCN
3.3 Prediction and Loss Function
4 Experiments
4.1 Dataset
4.2 Evaluation Metrics
4.3 Implementation Details
4.4 Ablation Studies
4.5 Comparison with Other Models
4.6 The Best Number of Groups
5 Conclusion
References
Disentangling Node Metric Factors for Temporal Link Prediction
1 Introduction
2 Related Work
3 Method
3.1 Problem Statement
3.2 Temporal Link Prediction Branch
3.3 Node Metric Disentanglement Branch
3.4 Attention-Based Fusion Module
4 Experiment
4.1 Experimental Setup
4.2 Overall Performance
4.3 Ablation Study
4.4 Performance on Different Node Groups
4.5 Parameter Sensitivity
5 Conclusion
References
Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning
1 Introduction
2 Related Work
3 Notation
4 Algorithm
4.1 Framework of PQmix and Intrinsic Reward
4.2 Implementation
5 Experiments
5.1 Environment
5.2 Baselines
5.3 Comparison Experiment in SMAC Environment
5.4 Robustness Evaluation of PQmix Algorithm
6 Conclusion
References
SLAM: A Lightweight Spatial Location Attention Module for Object Detection
1 Introduction
2 Related Work
2.1 Fully Convolutional Object Detector
2.2 Attention Mechanisms
3 Spatial Location Attention Module
3.1 Attention Module Review and Comparison
3.2 The Generation of Spatial Location Attention
3.3 Exemplars: SLAM_C3 and SLAM_C2f
4 Experiments
4.1 Experiment Setup
4.2 Ablation Studies
4.3 Compare with Other Attention Methods
4.4 Network Visualization with Grad-CAM
4.5 Compare with Other Models
5 Conclusion
References
A Novel Interaction Convolutional Network Based on Dependency Trees for Aspect-Level Sentiment Analysis
1 Introduction
2 Related Work
2.1 Dependency Trees
2.2 Graph Convolutional Networks
3 Methodology
3.1 Input and Encoding Layer
3.2 Attention Mechanism Layer
3.3 Dependency Trees Layer
3.4 Graph Convolution Network Layer
3.5 Interactive Network Layer
3.6 Output Layer
4 Experiments
4.1 Datasets
4.2 Parameters Settings
4.3 Model Comparison
4.4 Results Analysis of Comparison Experiments
4.5 Number of Att-GCN Layers
4.6 Ablation Experiment of ASAI-DT Model
5 Conclusion
References
Efficient Collaboration via Interaction Information in Multi-agent System
1 Introduction
2 Background
2.1 Dec-POMDP
2.2 Centralized Training with Decentralized Execution
2.3 Hypergraph Convolution Network
3 CIIMH
3.1 Agent Perceiving
3.2 Interaction Learning
3.3 Overall Optimization Objective
4 Experiment
4.1 Environments and Settings
4.2 Experimental Results and Analysis
5 Conclusion
References
A Deep Graph Matching-Based Method for Trajectory Association in Vessel Traffic Surveillance
1 Introduction
2 Related Works
2.1 AIS and Video Data Fusion
2.2 Deep Graph Matching
3 Preliminaries
3.1 Target Trajectory Graph
3.2 Target Trajectory Graph Matching
4 The Proposed Method
4.1 Attentional Graph Neural Network
4.2 Optimal Matching Layer
5 Experiments
5.1 Comparison Experiments on the FVessel Dataset
5.2 Experiments on the Synthetic Dataset
6 Conclusion
References
Few-Shot Anomaly Detection in Text with Deviation Learning
1 Introduction
2 Related Work
3 FATE: The Proposed Approach
3.1 Problem Statement
3.2 The Proposed Framework
4 Implementation Details
4.1 Datasets
4.2 Model Training and Hyperparameters
4.3 Baselines and Evaluation Metrics
5 Results and Analysis
6 Concluding Remarks
References
MOC: Multi-modal Sentiment Analysis via Optimal Transport and Contrastive Interactions
1 Introduction
2 Related Work
3 Preliminary of Optimal Transport
4 Method
4.1 Optimal Transport
4.2 Double-Modal Contrastive Learning
5 Experiments
5.1 Datasets and Implementation Details
5.2 Baselines
5.3 Overall Results
5.4 Ablation Study
6 Analysis
6.1 Hyper-parameter Heatmap
6.2 Visualization
7 Conclusion
References
Two-Phase Semantic Retrieval for Explainable Multi-Hop Question Answering
1 Introduction
2 Related Work
3 Methodology
3.1 Problem Definition
3.2 Overview
3.3 Rationale-Aware Dense Retrieval
3.4 Entity-Aware Sparse Retrieval
3.5 Training Objective
4 Experiment
4.1 Datasets
4.2 Baselines
4.3 Settings
4.4 Result Analysis
4.5 Ablation
4.6 Explainability
5 Conclusion
References
Efficient Spiking Neural Architecture Search with Mixed Neuron Models and Variable Thresholds
1 Introduction
2 Related Work
3 Preliminaries
4 Methodology
4.1 Search Space
4.2 Evaluation Strategy
4.3 Search Algorithm
5 Experiments
5.1 Experimental Setups
5.2 Performance Evaluation
5.3 Ablation Experiments
6 Conclusion
References
Towards Scalable Feature Selection: An Evolutionary Multitask Algorithm Assisted by Transfer Learning Based Co-surrogate
1 Introduction
2 Preliminaries: Basics of Multitask Optimization
3 Proposed CosEMT Framework
3.1 Co-surrogate Model for Instance-Based Transfer Learning
3.2 Dynamic Resource Allocation Strategy
3.3 Mechanism of Surrogate Model Update
3.4 Summary of cosEMT Framework
4 Experimental Studies
4.1 Datasets for Experiments
4.2 Experimental Configuration
4.3 Results of Comparative Experiments
5 Conclusion
References
CAS-NN: A Robust Cascade Neural Network Without Compromising Clean Accuracy
1 Introduction
2 Related Work
3 Preliminaries
4 Robust Cascade Neural Network
4.1 Confidence Calibration
4.2 Training of CAS-NN
5 Adaptive Attack Against CAS-NN
6 Experiments
6.1 Comparisons with Prominent Adversarial Training Algorithms
6.2 Performance of Confidence Calibration
6.3 Margin Along the Initial Adversarial Direction
6.4 Analysis of the Hyper-parameter
7 Conclusion
References
Multi-scale Information Fusion Combined with Residual Attention for Text Detection
1 Introduction
2 Method
2.1 Network Structure
2.2 Multi-scale Information Fusion Structure
2.3 Residual Attention Module
2.4 Loss Function
3 Analysis of Experimental Results
3.1 Dataset
3.2 Experimental Details
3.3 Ablation Experiments and Analysis
3.4 Comparison with State-of-the-Arts
3.5 Analysis of Visual Results
3.6 Limitations
4 Conclusion
References
Encrypted-SNN: A Privacy-Preserving Method for Converting Artificial Neural Networks to Spiking Neural Networks
1 Introduction
2 Background
2.1 Spiking Neural Networks
2.2 Privacy Protection
3 Encrypted-SNN
3.1 Privacy-Preserving Methods for ANN and SNN
3.2 ANN-SNN Conversion
4 Experiments and Results
4.1 Experiments Setup
4.2 Experiments Results
5 Conclusion
References
PoShapley-BCFL: A Fair and Robust Decentralized Federated Learning Based on Blockchain and the Proof of Shapley-Value
1 Introduction
2 Related Work and Preliminaries
2.1 Related Work
2.2 Preliminaries
3 The Algorithm Design for PoShapley-BCFL
3.1 The Designs of PoShapley Algorithm
3.2 Fair and Robust Aggregation
4 Experimental Results and Evaluations
4.1 Experimental Settings
4.2 Experimental Results Analysis
5 Conclusion
References
Small-World Echo State Networks for Nonlinear Time-Series Prediction
1 Introduction
2 Small-World Echo State Networks
2.1 Topology of the Reservoir Layer
2.2 Reservoir Weights Tuning
2.3 Readout Weights Training
3 Numerical Verification
4 Conclusion
References
Preserving Potential Neighbors for Low-Degree Nodes via Reweighting in Link Prediction
1 Introduction
2 Related Work
3 Preliminaries
4 Proposed Method
4.1 Theoretical Analysis
4.2 Harmonic Weighting
5 Experiments
5.1 Experimental Setup
5.2 Experimental Results
5.3 Visualization
6 Conclusion and Future Work
References
6D Object Pose Estimation with Attention Aware Bi-gated Fusion
1 Introduction
2 Related Work
2.1 End-to-End One-Stage Approach
2.2 Keypoint-Based Two-Stage Approach
3 Method
3.1 Overview
3.2 Self-attention Mechanism Module
3.3 Gate Fusion Module
4 Experimental Results and Discussion
4.1 Comparison with the State-of-the-art Methods on Two Benchmark Datasets
4.2 Ablation Study
5 Conclusion
References
Author Index

LNCS 14448

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li (Eds.)

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part II

Lecture Notes in Computer Science

Founding Editors
Gerhard Goos
Juris Hartmanis

Editorial Board Members
Elisa Bertino, Purdue University, West Lafayette, IN, USA
Wen Gao, Peking University, Beijing, China
Bernhard Steffen, TU Dortmund University, Dortmund, Germany
Moti Yung, Columbia University, New York, NY, USA

14448

The series Lecture Notes in Computer Science (LNCS), including its subseries Lecture Notes in Artificial Intelligence (LNAI) and Lecture Notes in Bioinformatics (LNBI), has established itself as a medium for the publication of new developments in computer science and information technology research, teaching, and education. LNCS enjoys close cooperation with the computer science R & D community, the series counts many renowned academics among its volume editors and paper authors, and collaborates with prestigious societies. Its mission is to serve this international community by providing an invaluable service, mainly focused on the publication of conference and workshop proceedings and postproceedings. LNCS commenced publication in 1973.

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li Editors

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part II

Editors Biao Luo Central South University Changsha, China

Long Cheng Chinese Academy of Sciences Beijing, China

Zheng-Guang Wu Zhejiang University Hangzhou, China

Hongyi Li Guangdong University of Technology Guangzhou, China

Chaojie Li UNSW Sydney Sydney, NSW, Australia

ISSN 0302-9743; ISSN 1611-3349 (electronic)
Lecture Notes in Computer Science
ISBN 978-981-99-8081-9; ISBN 978-981-99-8082-6 (eBook)
https://doi.org/10.1007/978-981-99-8082-6

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.

The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd. The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore.

Paper in this product is recyclable.

Preface

Welcome to the 30th International Conference on Neural Information Processing (ICONIP2023) of the Asia-Pacific Neural Network Society (APNNS), held in Changsha, China, November 20–23, 2023.

The mission of the Asia-Pacific Neural Network Society is to promote active interactions among researchers, scientists, and industry professionals who are working in neural networks and related fields in the Asia-Pacific region. APNNS has Governing Board Members from 13 countries/regions – Australia, China, Hong Kong, India, Japan, Malaysia, New Zealand, Singapore, South Korea, Qatar, Taiwan, Thailand, and Turkey. The society's flagship annual conference is the International Conference of Neural Information Processing (ICONIP).

The ICONIP conference aims to provide a leading international forum for researchers, scientists, and industry professionals who are working in neuroscience, neural networks, deep learning, and related fields to share their new ideas, progress, and achievements. ICONIP2023 received 1274 papers, of which 256 papers were accepted for publication in Lecture Notes in Computer Science (LNCS), representing an acceptance rate of 20.09% and reflecting the increasingly high quality of research in neural networks and related areas. The conference focused on four main areas, i.e., "Theory and Algorithms", "Cognitive Neurosciences", "Human-Centered Computing", and "Applications". All the submissions were rigorously reviewed by the conference Program Committee (PC), comprising 258 PC members, and they ensured that every paper had at least two high-quality single-blind reviews. In fact, 5270 reviews were provided by 2145 reviewers. On average, each paper received 4.14 reviews.

We would like to take this opportunity to thank all the authors for submitting their papers to our conference, and our great appreciation goes to the Program Committee members and the reviewers who devoted their time and effort to our rigorous peer-review process; their insightful reviews and timely feedback ensured the high quality of the papers accepted for publication. We hope you enjoyed the research program at the conference.

October 2023

Biao Luo Long Cheng Zheng-Guang Wu Hongyi Li Chaojie Li

Organization

Honorary Chair
Weihua Gui (Central South University, China)

Advisory Chairs
Jonathan Chan (King Mongkut’s University of Technology Thonburi, Thailand)
Zeng-Guang Hou (Chinese Academy of Sciences, China)
Nikola Kasabov (Auckland University of Technology, New Zealand)
Derong Liu (Southern University of Science and Technology, China)
Seiichi Ozawa (Kobe University, Japan)
Kevin Wong (Murdoch University, Australia)

General Chairs
Tingwen Huang (Texas A&M University at Qatar, Qatar)
Chunhua Yang (Central South University, China)

Program Chairs
Biao Luo (Central South University, China)
Long Cheng (Chinese Academy of Sciences, China)
Zheng-Guang Wu (Zhejiang University, China)
Hongyi Li (Guangdong University of Technology, China)
Chaojie Li (University of New South Wales, Australia)

Technical Chairs
Xing He (Southwest University, China)
Keke Huang (Central South University, China)
Huaqing Li (Southwest University, China)
Qi Zhou (Guangdong University of Technology, China)


Local Arrangement Chairs
Wenfeng Hu (Central South University, China)
Bei Sun (Central South University, China)

Finance Chairs
Fanbiao Li (Central South University, China)
Hayaru Shouno (University of Electro-Communications, Japan)
Xiaojun Zhou (Central South University, China)

Special Session Chairs
Hongjing Liang (University of Electronic Science and Technology, China)
Paul S. Pang (Federation University, Australia)
Qiankun Song (Chongqing Jiaotong University, China)
Lin Xiao (Hunan Normal University, China)

Tutorial Chairs
Min Liu (Hunan University, China)
M. Tanveer (Indian Institute of Technology Indore, India)
Guanghui Wen (Southeast University, China)

Publicity Chairs
Sabri Arik (Istanbul University-Cerrahpaşa, Turkey)
Sung-Bae Cho (Yonsei University, South Korea)
Maryam Doborjeh (Auckland University of Technology, New Zealand)
El-Sayed M. El-Alfy (King Fahd University of Petroleum and Minerals, Saudi Arabia)
Ashish Ghosh (Indian Statistical Institute, India)
Chuandong Li (Southwest University, China)
Weng Kin Lai (Tunku Abdul Rahman University of Management & Technology, Malaysia)
Chu Kiong Loo (University of Malaya, Malaysia)
Qinmin Yang (Zhejiang University, China)
Zhigang Zeng (Huazhong University of Science and Technology, China)

Publication Chairs
Zhiwen Chen (Central South University, China)
Andrew Chi-Sing Leung (City University of Hong Kong, China)
Xin Wang (Southwest University, China)
Xiaofeng Yuan (Central South University, China)

Secretaries
Yun Feng (Hunan University, China)
Bingchuan Wang (Central South University, China)

Webmasters
Tianmeng Hu (Central South University, China)
Xianzhe Liu (Xiangtan University, China)

Program Committee Rohit Agarwal Hasin Ahmed Harith Al-Sahaf Brad Alexander Mashaan Alshammari Sabri Arik Ravneet Singh Arora Zeyar Aung Monowar Bhuyan Jingguo Bi Xu Bin Marcin Blachnik Paul Black


UiT The Arctic University of Norway, Norway Gauhati University, India Victoria University of Wellington, New Zealand University of Adelaide, Australia Independent Researcher, Saudi Arabia Istanbul University, Turkey Block Inc., USA Khalifa University of Science and Technology, UAE Umeå University, Sweden Beijing University of Posts and Telecommunications, China Northwestern Polytechnical University, China Silesian University of Technology, Poland Federation University, Australia


Anoop C. S. Ning Cai Siripinyo Chantamunee Hangjun Che Wei-Wei Che Huabin Chen Jinpeng Chen Ke-Jia Chen Lv Chen Qiuyuan Chen Wei-Neng Chen Yufei Chen Long Cheng Yongli Cheng Sung-Bae Cho Ruikai Cui Jianhua Dai Tao Dai Yuxin Ding Bo Dong Shanling Dong Sidong Feng Yuming Feng Yun Feng Junjie Fu Yanggeng Fu Ninnart Fuengfusin Thippa Reddy Gadekallu Ruobin Gao Tom Gedeon Kam Meng Goh Zbigniew Gomolka Shengrong Gong Xiaodong Gu Zhihao Gu Changlu Guo Weixin Han

Govt. Engineering College, India Beijing University of Posts and Telecommunications, China Walailak University, Thailand City University of Hong Kong, China Qingdao University, China Nanchang University, China Beijing University of Posts & Telecommunications, China Nanjing University of Posts and Telecommunications, China Shandong Normal University, China Tencent Technology, China South China University of Technology, China Tongji University, China Institute of Automation, China Fuzhou University, China Yonsei University, South Korea Australian National University, Australia Hunan Normal University, China Tsinghua University, China Harbin Institute of Technology, China Xi’an Jiaotong University, China Zhejiang University, China Monash University, Australia Chongqing Three Gorges University, China Hunan University, China Southeast University, China Fuzhou University, China Kyushu Institute of Technology, Japan VIT University, India Nanyang Technological University, Singapore Curtin University, Australia Tunku Abdul Rahman University of Management and Technology, Malaysia University of Rzeszow, Poland Changshu Institute of Technology, China Fudan University, China Shanghai Jiao Tong University, China Budapest University of Technology and Economics, Hungary Northwestern Polytechnical University, China


Xing He Akira Hirose Yin Hongwei Md Zakir Hossain Zengguang Hou Lu Hu Zeke Zexi Hu He Huang Junjian Huang Kaizhu Huang David Iclanzan Radu Tudor Ionescu Asim Iqbal Syed Islam Kazunori Iwata Junkai Ji Yi Ji Canghong Jin Xiaoyang Kang Mutsumi Kimura Masahiro Kohjima Damian Kordos Marek Kraft Lov Kumar Weng Kin Lai Xinyi Le Bin Li Hongfei Li Houcheng Li Huaqing Li Jianfeng Li Jun Li Kan Li Peifeng Li Wenye Li Xiangyu Li Yantao Li Yaoman Li Yinlin Li Yuan Li

Southwest University, China University of Tokyo, Japan Huzhou Normal University, China Curtin University, Australia Chinese Academy of Sciences, China Jiangsu University, China University of Sydney, Australia Soochow University, China Chongqing University of Education, China Duke Kunshan University, China Sapientia University, Romania University of Bucharest, Romania Cornell University, USA Edith Cowan University, Australia Hiroshima City University, Japan Shenzhen University, China Soochow University, China Zhejiang University, China Fudan University, China Ryukoku University, Japan NTT, Japan Rzeszow University of Technology, Poland Pozna´n University of Technology, Poland NIT Kurukshetra, India Tunku Abdul Rahman University of Management & Technology, Malaysia Shanghai Jiao Tong University, China University of Science and Technology of China, China Xinjiang University, China Chinese Academy of Sciences, China Southwest University, China Southwest University, China Nanjing Normal University, China Beijing Institute of Technology, China Soochow University, China Chinese University of Hong Kong, China Beijing Jiaotong University, China Chongqing University, China Chinese University of Hong Kong, China Chinese Academy of Sciences, China Academy of Military Science, China


Yun Li Zhidong Li Zhixin Li Zhongyi Li Ziqiang Li Xianghong Lin Yang Lin Huawen Liu Jian-Wei Liu Jun Liu Junxiu Liu Tommy Liu Wen Liu Yan Liu Yang Liu Yaozhong Liu Yong Liu Yubao Liu Yunlong Liu Zhe Liu Zhen Liu Zhi-Yong Liu Ma Lizhuang Chu-Kiong Loo Vasco Lopes Hongtao Lu Wenpeng Lu Biao Luo Ye Luo Jiancheng Lv Yuezu Lv Huifang Ma Jinwen Ma Jyoti Maggu Adnan Mahmood Mufti Mahmud Krishanu Maity Srimanta Mandal Wang Manning

Nanjing University of Posts and Telecommunications, China University of Technology Sydney, Australia Guangxi Normal University, China Beihang University, China University of Tokyo, Japan Northwest Normal University, China University of Sydney, Australia Zhejiang Normal University, China China University of Petroleum, China Chengdu University of Information Technology, China Guangxi Normal University, China Australian National University, Australia Chinese University of Hong Kong, China Taikang Insurance Group, China Guangdong University of Technology, China Australian National University, Australia Heilongjiang University, China Sun Yat-sen University, China Xiamen University, China Jiangsu University, China Chinese Academy of Sciences, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China University of Malaya, Malaysia Universidade da Beira Interior, Portugal Shanghai Jiao Tong University, China Qilu University of Technology, China Central South University, China Tongji University, China Sichuan University, China Beijing Institute of Technology, China Northwest Normal University, China Peking University, China Thapar Institute of Engineering and Technology Patiala, India Macquarie University, Australia University of Padova, Italy Indian Institute of Technology Patna, India DA-IICT, India Fudan University, China


Piotr Milczarski Malek Mouhoub Nankun Mu Wenlong Ni Anupiya Nugaliyadde Toshiaki Omori Babatunde Onasanya Manisha Padala Sarbani Palit Paul Pang Rasmita Panigrahi Kitsuchart Pasupa Dipanjyoti Paul Hu Peng Kebin Peng Dawid Połap Zhong Qian Sitian Qin Toshimichi Saito Fumiaki Saitoh Naoyuki Sato Chandni Saxena Jiaxing Shang Lin Shang Jie Shao Yin Sheng Liu Sheng-Lan Hayaru Shouno Gautam Srivastava Jianbo Su Jianhua Su Xiangdong Su Daiki Suehiro Basem Suleiman Ning Sun Shiliang Sun Chunyu Tan Gouhei Tanaka Maolin Tang


Lodz University of Technology, Poland University of Regina, Canada Chongqing University, China Jiangxi Normal University, China Murdoch University, Australia Kobe University, Japan University of Ibadan, Nigeria Indian Institute of Science, India Indian Statistical Institute, India Federation University, Australia Giet University, India King Mongkut’s Institute of Technology Ladkrabang, Thailand Ohio State University, USA Jiujiang University, China University of Texas at San Antonio, USA Silesian University of Technology, Poland Soochow University, China Harbin Institute of Technology at Weihai, China Hosei University, Japan Chiba Institute of Technology, Japan Future University Hakodate, Japan Chinese University of Hong Kong, China Chongqing University, China Nanjing University, China University of Science and Technology of China, China Huazhong University of Science and Technology, China Dalian University of Technology, China University of Electro-Communications, Japan Brandon University, Canada Shanghai Jiao Tong University, China Institute of Automation, China Inner Mongolia University, China Kyushu University, Japan University of New South Wales, Australia Shandong Normal University, China East China Normal University, China Anhui University, China University of Tokyo, Japan Queensland University of Technology, Australia


Shu Tian Shikui Tu Nancy Victor Petra Vidnerová Shanchuan Wan Tao Wan Ying Wan Bangjun Wang Hao Wang Huamin Wang Hui Wang Huiwei Wang Jianzong Wang Lei Wang Lin Wang Shi Lin Wang Wei Wang Weiqun Wang Xiaoyu Wang Xin Wang Xin Wang Yan Wang Yan Wang Yonghua Wang Yongyu Wang Zhenhua Wang Zi-Peng Wang Hongxi Wei Guanghui Wen Guoguang Wen Ka-Chun Wong Anna Wróblewska Fengge Wu Ji Wu Wei Wu Yue Wu Likun Xia Lin Xiao

University of Science and Technology Beijing, China Shanghai Jiao Tong University, China Vellore Institute of Technology, India Institute of Computer Science, Czech Republic University of Tokyo, Japan Beihang University, China Southeast University, China Soochow University, China Shanghai University, China Southwest University, China Nanchang Institute of Technology, China Southwest University, China Ping An Technology, China National University of Defense Technology, China University of Jinan, China Shanghai Jiao Tong University, China Shenzhen MSU-BIT University, China Chinese Academy of Sciences, China Tokyo Institute of Technology, Japan Southwest University, China Southwest University, China Chinese Academy of Sciences, China Sichuan University, China Guangdong University of Technology, China JD Logistics, China Northwest A&F University, China Beijing University of Technology, China Inner Mongolia University, China Southeast University, China Beijing Jiaotong University, China City University of Hong Kong, China Warsaw University of Technology, Poland Institute of Software, Chinese Academy of Sciences, China Tsinghua University, China Inner Mongolia University, China Shanghai Jiao Tong University, China Capital Normal University, China Hunan Normal University, China


Qiang Xiao Hao Xiong Dongpo Xu Hua Xu Jianhua Xu Xinyue Xu Yong Xu Ngo Xuan Bach Hao Xue Yang Xujun Haitian Yang Jie Yang Minghao Yang Peipei Yang Zhiyuan Yang Wangshu Yao Ming Yin Qiang Yu Wenxin Yu Yun-Hao Yuan Xiaodong Yue Paweł Zawistowski Hui Zeng Wang Zengyunwang Daren Zha Zhi-Hui Zhan Baojie Zhang Canlong Zhang Guixuan Zhang Jianming Zhang Li Zhang Wei Zhang Wenbing Zhang Xiang Zhang Xiaofang Zhang Xiaowang Zhang


Huazhong University of Science and Technology, China Macquarie University, Australia Northeast Normal University, China Tsinghua University, China Nanjing Normal University, China Hong Kong University of Science and Technology, China Beijing Institute of Technology, China Posts and Telecommunications Institute of Technology, Vietnam University of New South Wales, Australia Chongqing Jiaotong University, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China Chinese Academy of Sciences, China Chinese Academy of Science, China City University of Hong Kong, China Soochow University, China Guangdong University of Technology, China Tianjin University, China Southwest University of Science and Technology, China Yangzhou University, China Shanghai University, China Warsaw University of Technology, Poland Southwest University of Science and Technology, China Hunan First Normal University, China Institute of Information Engineering, China South China University of Technology, China Chongqing Three Gorges University, China Guangxi Normal University, China Chinese Academy of Science, China Changsha University of Science and Technology, China Soochow University, China Southwest University, China Yangzhou University, China National University of Defense Technology, China Soochow University, China Tianjin University, China


Xinglong Zhang Dongdong Zhao Xiang Zhao Xu Zhao Liping Zheng Yan Zheng Baojiang Zhong Guoqiang Zhong Jialing Zhou Wenan Zhou Xiao-Hu Zhou Xinyu Zhou Quanxin Zhu Yuanheng Zhu Xiaotian Zhuang Dongsheng Zou

National University of Defense Technology, China Wuhan University of Technology, China National University of Defense Technology, China Shanghai Jiao Tong University, China Hefei University of Technology, China Kyushu University, Japan Soochow University, China Ocean University of China, China Nanjing University of Science and Technology, China PCN&CAD Center, China Institute of Automation, China Jiangxi Normal University, China Nanjing Normal University, China Chinese Academy of Sciences, China JD Logistics, China Chongqing University, China

Contents – Part II

Theory and Algorithms

Distributed Nash Equilibrium Seeking of Noncooperative Games with Communication Constraints and Matrix Weights . . . 3
Shuoshuo Zhang, Jianxiang Ren, Xiao Fang, and Tingwen Huang

Accelerated Genetic Algorithm with Population Control for Energy-Aware Virtual Machine Placement in Data Centers . . . 14
Zhe Ding, Yu-Chu Tian, Maolin Tang, You-Gan Wang, Zu-Guo Yu, Jiong Jin, and Weizhe Zhang

A Framework of Large-Scale Peer-to-Peer Learning System . . . 27
Yongkang Luo, Peiyi Han, Wenjian Luo, Shaocong Xue, Kesheng Chen, and Linqi Song

Optimizing 3D UAV Path Planning: A Multi-strategy Enhanced Beluga Whale Optimizer . . . 42
Chen Ye, Wentao Wang, Shaoping Zhang, and Peng Shao

Interactive Attention-Based Graph Transformer for Multi-intersection Traffic Signal Control . . . 55
Yining Lv, Nianwen Ning, Hengji Li, Li Wang, Yanyu Zhang, and Yi Zhou

PatchFinger: A Model Fingerprinting Scheme Based on Adversarial Patch . . . 68
Bo Zeng, Kunhao Lai, Jianpeng Ke, Fangchao Yu, and Lina Wang

Attribution of Adversarial Attacks via Multi-task Learning . . . 81
Zhongyi Guo, Keji Han, Yao Ge, Yun Li, and Wei Ji

A Reinforcement Learning Method for Generating Class Integration Test Orders Considering Dynamic Couplings . . . 95
Yanru Ding, Yanmei Zhang, Guan Yuan, Yingjie Li, Shujuan Jiang, and Wei Dai

A Novel Machine Learning Model Using CNN-LSTM Parallel Networks for Predicting Ship Fuel Consumption . . . 108
Xinyu Li, Yi Zuo, Tieshan Li, and C. L. Philip Chen


Two-Stage Attention Model to Solve Large-Scale Traveling Salesman Problems . . . 119
Qi He, Feng Wang, and Jingge Song

Learning Primitive-Aware Discriminative Representations for Few-Shot Learning . . . 131
Jianpeng Yang, Yuhang Niu, Xuemei Xie, and Guangming Shi

Time-Series Forecasting Through Contrastive Learning with a Two-Dimensional Self-attention Mechanism . . . 147
Linling Jiang, Fan Zhang, Mingli Zhang, and Caiming Zhang

Task Scheduling with Multi-strategy Improved Sparrow Search Algorithm in Cloud Datacenters . . . 166
Yao Liu, Wenlong Ni, Yang Bi, Lingyue Lai, Xinyu Zhou, and Hua Chen

Advanced State-Aware Traffic Light Optimization Control with Deep Q-Network . . . 178
Wenlong Ni, Zehong Li, Peng Wang, and Chuanzhaung Li

Impulsive Accelerated Reinforcement Learning for H∞ Control . . . 191
Yan Wu, Shixian Luo, and Yan Jiang

MRRC: Multi-agent Reinforcement Learning with Rectification Capability in Cooperative Tasks . . . 204
Sheng Yu, Wei Zhu, Shuhong Liu, Zhengwen Gong, and Haoran Chen

Latent Causal Dynamics Model for Model-Based Reinforcement Learning . . . 219
Zhifeng Hao, Haipeng Zhu, Wei Chen, and Ruichu Cai

Gradient Coupled Flow: Performance Boosting on Network Pruning by Utilizing Implicit Loss Decrease . . . 231
Jiaying Wu, Xiatao Kang, Jingying Xiao, and Jiayi Yao

Motif-SocialRec: A Multi-channel Interactive Semantic Extraction Model for Social Recommendation . . . 244
Hangyuan Du, Yuan Liu, Wenjian Wang, and Liang Bai

Dual Channel Graph Neural Network Enhanced by External Affective Knowledge for Aspect Level Sentiment Analysis . . . 257
Hu Jin, Qifei Zhang, Xiubo Liang, Yulin Zhou, and Wenjuan Li

New Predefined-Time Stability Theorem and Applications to the Fuzzy Stochastic Memristive Neural Networks with Impulsive Effects . . . 275
Hui Zhao, Lei Zhou, Qingjie Wang, Sijie Niu, Xizhan Gao, and Xiju Zong


FE-YOLOv5: Improved YOLOv5 Network for Multi-scale Drone-Captured Scene Detection . . . 290
Chen Zhao, Zhe Yan, Zhiyan Dong, Dingkang Yang, and Lihua Zhang

An Improved NSGA-II for UAV Path Planning . . . 305
Wei Hang Tan, Weng Kin Lai, Pak Hen Chen, Lee Choo Tay, and Sheng Siang Lee

Reimagining China-US Relations Prediction: A Multi-modal, Knowledge-Driven Approach with KDSCINet . . . 317
Rui Zhou, Jialin Hao, Ying Zou, Yushi Zhu, Chi Zhang, and Fusheng Jin

A Graph Convolution Neural Network for User-Group Aided Personalized Session-Based Recommendation . . . 332
Hui Wang, Hexiang Bai, Jun Huo, and Minhu Yang

Disentangling Node Metric Factors for Temporal Link Prediction . . . 346
Tianli Zhang, Tongya Zheng, Yuanyu Wan, Ying Li, and Wenqi Huang

Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning . . . 358
Yanqiang Zhang, Dawei Feng, and Bo Ding

SLAM: A Lightweight Spatial Location Attention Module for Object Detection . . . 373
Changda Liu, Yunfeng Xu, and Jiakui Zhong

A Novel Interaction Convolutional Network Based on Dependency Trees for Aspect-Level Sentiment Analysis . . . 388
Lei Mao, Jianxia Chen, Shi Dong, Liang Xiao, Haoying Si, Shu Li, and Xinyun Wu

Efficient Collaboration via Interaction Information in Multi-agent System . . . 401
Meilong Shi, Quan Liu, and Zhigang Huang

A Deep Graph Matching-Based Method for Trajectory Association in Vessel Traffic Surveillance . . . 413
Yuchen Lu, Xiangkai Zhang, Xu Yang, Pin Lv, Liguo Sun, Ryan Wen Liu, and Yisheng Lv

Few-Shot Anomaly Detection in Text with Deviation Learning . . . 425
Anindya Sundar Das, Aravind Ajay, Sriparna Saha, and Monowar Bhuyan


MOC: Multi-modal Sentiment Analysis via Optimal Transport and Contrastive Interactions . . . 439
Yi Li, Qingmeng Zhu, Hao He, Ziyin Gu, and Changwen Zheng

Two-Phase Semantic Retrieval for Explainable Multi-Hop Question Answering . . . 452
Qin Wang, Jianzhou Feng, Ganlin Xu, and Lei Huang

Efficient Spiking Neural Architecture Search with Mixed Neuron Models and Variable Thresholds . . . 466
Zaipeng Xie, Ziang Liu, Peng Chen, and Jianan Zhang

Towards Scalable Feature Selection: An Evolutionary Multitask Algorithm Assisted by Transfer Learning Based Co-surrogate . . . 482
Liangjiang Lin, Zefeng Chen, and Yuren Zhou

CAS-NN: A Robust Cascade Neural Network Without Compromising Clean Accuracy . . . 494
Zhuohuang Chen, Zhimin He, Yan Zhou, Patrick P. K. Chan, Fei Zhang, and Haozhen Situ

Multi-scale Information Fusion Combined with Residual Attention for Text Detection . . . 506
Wenxiu Zhao and Changlei Dongye

Encrypted-SNN: A Privacy-Preserving Method for Converting Artificial Neural Networks to Spiking Neural Networks . . . 519
Xiwen Luo, Qiang Fu, Sheng Qin, and Kaiyang Wang

PoShapley-BCFL: A Fair and Robust Decentralized Federated Learning Based on Blockchain and the Proof of Shapley-Value . . . 531
Ziwen Cheng, Yi Liu, Chao Wu, Yongqi Pan, Liushun Zhao, and Cheng Zhu

Small-World Echo State Networks for Nonlinear Time-Series Prediction . . . 550
Shu Mo, Kai Hu, Weibing Li, and Yongping Pan

Preserving Potential Neighbors for Low-Degree Nodes via Reweighting in Link Prediction . . . 561
Ziwei Li, Yucan Zhou, Haihui Fan, Xiaoyan Gu, Bo Li, and Dan Meng

6D Object Pose Estimation with Attention Aware Bi-gated Fusion . . . 573
Laichao Wang, Weiding Lu, Yuan Tian, Yong Guan, Zhenzhou Shao, and Zhiping Shi

Author Index . . . 587

Theory and Algorithms

Distributed Nash Equilibrium Seeking of Noncooperative Games with Communication Constraints and Matrix Weights

Shuoshuo Zhang¹, Jianxiang Ren¹, Xiao Fang¹(B), and Tingwen Huang²

¹ School of Mathematics, Southeast University, Nanjing 211189, China
{zhangshuoshuo,renjianxiang,fangxiao}@seu.edu.cn
² Department of Science, Texas A&M University at Qatar, Doha, Qatar
[email protected]

Abstract. Distributed Nash equilibrium seeking is investigated in this paper for a class of multi-agent systems under intermittent communication and matrix-weighted communication graphs. Different from most of the existing works on distributed Nash equilibrium seeking of noncooperative games where the players (agents) can communicate continuously over time, the players considered in this paper are assumed to exchange information only with their neighbors during some disconnected time intervals while the underlying communication graph is matrix-weighted. A distributed Nash equilibrium seeking algorithm integrating gradient strategy and leader-following consensus protocol is proposed for the noncooperative games with intermittent communication and matrix-weighted communication graphs. The effect of the average intermittent communication rate on the convergence of the distributed Nash equilibrium seeking algorithm is analyzed, and a lower bound of the average intermittent communication rate that ensures the convergence of the algorithm is given. The convergence of the algorithm is established by means of Lyapunov stability theory. Simulations are presented to verify the proposed distributed Nash equilibrium seeking algorithm.

Keywords: Nash equilibrium · Noncooperative game · Intermittent communication · Multi-agent system

1 Introduction

In recent years, there has been increasing interest in the distributed coordination control and decision-making of multi-agent systems [1–3]. The advancement of distributed coordination and autonomous decision-making technology for multi-agent systems has emerged as a core focus in the field of artificial intelligence.

This work was supported in part by the National Key Research and Development Program of China under Grant No. 2022YFA1004702 and in part by the General Joint Fund of the Equipment Advance Research Program of the Ministry of Education under Grant No. 8091B022114.

In practice, agents in systems may encounter conflicts of interest, and game theory provides feasible solutions to address these issues. By utilizing sophisticated mathematical models, game theory provides a theoretical foundation for decision-making for agents situated within environments characterized by conflict and confrontation. As networked games continue to develop, distributed Nash equilibrium seeking of noncooperative games has a wide range of applications, including economic dispatching in electric power systems and cooperative task assignment in multi-unmanned systems.

Recently, significant efforts have been devoted to the study of Nash equilibrium seeking methods in noncooperative games for multi-agent systems, and many profound results have been established [4–9]. A gradient method was used for finding differential Nash equilibria in continuous-time games in [4]. Distributed Nash equilibrium seeking of multi-cluster games with a consistency constraint was studied in [5], where networked noncooperative games and distributed optimization were put in a unified framework. In [6], a study was conducted on the online solution of team games by combining cooperative control, game theory, and reinforcement learning, and a cooperative policy iteration algorithm was proposed for graphical games, which guarantees convergence to the cooperative Nash equilibrium when all agents simultaneously update their policies. For a two-network zero-sum game under an undirected communication topology, a distributed saddle-point strategy was synthesized and its convergence to the Nash equilibrium was established for a class of functions with strict concavity-convexity and local Lipschitz continuity in [7]. For the distributed aggregative game problem, two types of algorithms, namely a synchronous and an asynchronous distributed algorithm, were proposed and their convergence to the Nash equilibrium was established in [8]. In [9], a continuous-time distributed Nash equilibrium seeking algorithm was developed by virtue of a leader-following consensus protocol and a gradient strategy. Furthermore, the work of [9] was extended in [10], where the communication graph of the players is switching.

It is important to note that many of the findings regarding multi-agent systems' pursuit of Nash equilibrium are based on the common assumption that agents continuously exchange information. The assumption that each agent can share information with its neighbors without any communication restrictions is often insufficient in real-world scenarios. In some cases, agents can only communicate with their neighbors during disconnected time intervals due to potential communication restrictions and cyber-attacks [11]. [12] provided a solution to intermittent communication in wireless sensor networks. [13] studied the resource allocation problem under limited bandwidth constraints in communication. In [14], a new consensus protocol was developed that utilizes globally synchronized intermittent local information feedback; this protocol ensures that the states of the agents achieve consensus at an exponential rate. However, for the noncooperative game problems of multi-agent systems under intermittent communication, distributed Nash equilibrium seeking has not been studied yet.

Matrix weights have been extensively used in network analysis and optimization problems. An efficient matrix-weighted low-rank approximation algorithm was proposed in [15], which can be used to process large-scale data. [16] introduced the concept of a matrix-weighted graph as a means to study the synchronization problem of linear systems; the coupling between two systems is characterized by different output matrices in this approach. [17] studied the distributed robust optimization of networked agent systems where the communication graph of the agents randomly switches among some matrix-weighted and connected graphs.

In this paper, we consider the distributed Nash equilibrium seeking problem of multi-agent systems under intermittent communication and matrix-weighted communication graphs. By using Lyapunov stability theory, it is proved that the states of the multi-agent system converge to the Nash equilibrium under the proposed algorithm provided that the average intermittent communication rate is greater than a threshold. The theoretical result is demonstrated through simulations.

Notations. Let $\mathbb{R}^{N \times N}$ be the $N \times N$-dimensional real matrix space and $\mathbb{R}^N$ be the $N$-dimensional real vector space. The symbol $\mathbb{N}$ denotes the set of natural numbers. Let $I_N$ ($0_{N \times N}$) be the $N \times N$-dimensional identity (zero) matrix and $\mathbf{1}_N$ ($\mathbf{0}_N$) be the $N$-dimensional column vector with all elements being 1 (0). For a symmetric matrix $M \in \mathbb{R}^{N \times N}$, $\lambda_{\max}(M)$ and $\lambda_{\min}(M)$ represent the maximum and minimum eigenvalues of $M$. $A > 0$ indicates that the matrix $A \in \mathbb{R}^{n \times n}$ is positive definite. $\otimes$ is the Kronecker product.

2 Problem Formulation

Consider a noncooperative game involving $N$ players (or agents). The set of players is $\mathcal{N} = \{1, \ldots, N\}$. Let $x = [x_1^T, \ldots, x_N^T]^T \in \mathbb{R}^{Nn}$ denote all players' actions, with $x_i \in \mathbb{R}^n$ being the action of player $i$. The payoff function of player $i$ is denoted by $f_i(x)$. The objective of player $i$ is as follows:
$$\max_{x_i \in \Omega_i} f_i(x) = f_i(x_i, x_{-i}), \quad \forall i \in \mathcal{N}, \quad (1)$$
where $x_{-i} = [x_1^T, x_2^T, \ldots, x_{i-1}^T, x_{i+1}^T, \ldots, x_N^T]^T$ and $\Omega_i$ is the constraint set of the action of player $i$. This paper considers the case where the players' actions are unconstrained, that is, $\Omega_i = \mathbb{R}^n$, $\forall i \in \mathcal{N}$.

An action profile $x^* = (x_i^*, x_{-i}^*)$ is called a Nash equilibrium if [18]
$$f_i(x_i^*, x_{-i}^*) \geq f_i(x_i, x_{-i}^*), \quad \forall i \in \mathcal{N}. \quad (2)$$

The communication graph of the players is denoted by the undirected graph $\mathcal{G} = \{\mathcal{N}, \mathcal{E}, \mathcal{A}\}$, where $\mathcal{E}$ denotes the set of edges and $\mathcal{A} = \{\Gamma_{ij} \in \mathbb{R}^{n \times n} \,|\, \Gamma_{ij} = \Gamma_{ij}^T > 0, \ \sum_j \Gamma_{ij} = I_n\}$ denotes the set of matrix weights. Communication is achievable between players $i$ and $j$ if $(i, j) \in \mathcal{E}$. The matrix-weighted Laplacian matrix $L = [l_{ij}]_{Nn \times Nn}$ of $\mathcal{G}$ is defined as

6

S. Zhang et al.

⎧ ⎨ −Γij , N  lij = Γik , ⎩

i = j,

k=1,k=i

i = j.

(3)

Lemma 1 [10]. There is at least one eigenvalue 0 for the Laplacian matrix L of undirected graph G, and all other eigenvalues of L are positive. 0 is a simple eigenvalue of Laplacian matrix L if and only if the undirected graph G = {N , E, A} is connected. Moreover, we consider the noncooperative game with intermittent communication, that is, players only communicate with their neighbors during some disconnected time intervals. Let T represent the set of time intervals during which the players can communicate with each other, and T¯ represent the set of time intervals during which the players cannot communicate with each other. Note that T ∪ T¯ = [0, ∞). We assume that there are non-overlapping time intervals [tk , tk+1 ), k ∈ N, t0 = 0, and ∪ [tk , tk+1 ) = [0, ∞) = T ∪ T¯. It can be k

reasonably speculated that the convergence to Nash equilibrium of distributed seeking algorithms under intermittent communication will be affected by the ratio of communication time to total time, so we define ζk as the lebesgue measure of {t|t ∈ [tk , tk+1 ) ∩ T }, and ρk = tk+1 − tk . Then, the average intermittent communication rate of time interval [tk , tk+1 ) is given by ak =

ζk . ρk

(4)

The following assumptions are made regarding communication graph and payoff functions. Assumption 1. G is connected. Assumption 2. The players’ payoff functions fi (x), ∀i ∈ N , are C 2 with respect to x. And for each player i ∈ N , the payoff function fi (xi , x−i ) is concave in xi , for given x−i .

T T ∂f (x) T ∂fN (x) T 1 (x) 2 Assumption 3. Define G(x) = ∂f∂x , , ..., . Then there is ∂x ∂x 1 2 N a constant η > 0 such that ∀x, z ∈ RN n ,

(x − z)T (G(x) − G(z)) ≤ −η x − z 2 .

(5)

Remark 1. By Assumption 2 and Assumption 3, it can be obtained that the Nash equilibrium of the noncooperative game exists and is unique [8,19]. Remark 2. By Assumption 2, the following properties about G(x) can be obtained: (1)

G(x∗ ) = 0N n , ∗

where x is the Nash equilibrium.

(6)

Distributed Nash Equilibrium Seeking Algorithm

7

(2) ∀x, y ∈ RN n , there is some Lipschitz constant lg > 0 such that G(x) − G(y) ≤ lg x − y , and lg depends on the Lipschitz coefficients of

3 3.1

(7)

∂fi (x) ∂xi ,

i ∈ N.

Main Results Distributed Nash Equilibrium Seeking Algorithm Under Intermittent Communication

Motivated by [9,14] and [17], a distributed Nash equilibrium seeking algorithm adapted to intermittent communication and matrix weights will be presented in this subsection. In a non-cooperative game, each player’s payoff is influenced by the choices made by other players and they do not have access to information about players who are not their immediate neighbors. Therefore, to seek Nash equilibrium in a distributed manner, let each player make a local estimation on the global action vector x. Specifically, let yij ∈ Rn denote player i’s estimation on player j’s   T T T T , yi2 , ..., yiN ∈ RN n denote player i’s estimation on the action, and yi = yi1 global action vector x. Then player i updates its action as follows: x˙ i = ki

∂fi (yi ) , ∂xi

(8)

where ki = δ k¯i where k¯i > 0 is fixed parameter and δ > 0 is a small parameter to be designed. Inspired by the consensus-based method proposed in [9] and considering that there are time intervals when players cannot communicate with each other, design the following update law for yij , i, j ∈ N ,  y˙ ij = −

N 

 Γik (yij − ykj ) + Γij (yij − xj ) , t ∈ T ,

k=1

(9)

y˙ ij = 0n , t ∈ T¯, where Γij , i, j ∈ N is the matrix weight between player i and player j. Remark 3. If T = [0, ∞), then players can communicate with each other all the time. 3.2

Convergence Analysis

The subsequent analysis demonstrates that the Nash equilibrium seeking strategy, defined in (8) and (9), leads to the convergence of players’ actions to the Nash equilibrium.

8

S. Zhang et al.

For convenience, define T T y = [y1T , y2T , ..., yN ] , J(y) =



∂f1 ∂f2 ∂fN (y1 )T , (y2 )T , ..., (yN )T ∂x1 ∂x2 ∂xN

T

,

T  , h1 (x) = − (Γ11 x1 )T , (Γ12 x2 )T , ..., (Γ1N xN )T , (Γ21 x1 )T , (Γ22 x2 )T , ..., (ΓN N xN )T

B0 = diag{Γij }, i, j ∈ N , k¯ = diag{k¯i In }, i ∈ N , H = L ⊗ IN + B0 . where diag{Γij } (diag{k¯i In }) is a block diagonal matrix with the diagonal elements are respectively Γ11 , . . . , Γ1N , Γ21 , . . . , ΓN N (k¯1 In , . . . , k¯N In ). Then, the merged form of (8) is ¯ x˙ = δ kJ(y), (10) and the merged form of (9) is y˙ = −Hy − h1 (x), t ∈ T , y˙ = 0N 2 n , t ∈ T¯,

(11)

where L is the Laplacian matrix. According to Assumption 1, Lemma 1 and the definition of B0 , it can be obtained that H is a positive definite matrix, implying that −H is Hurwitz. Then there are symmetric positive definite matrices Q1 , P1 such that (12) Q1 = P1 H + H T P1 . Remark 4. By Assumption 2, it can be obtained that ∀x, y ∈ RN some constant l > 0 such that J(x) − J(y) ≤ l x − y ,

2

n

, there is (13)

where l can be regarded as a local Lipschitz constant that depends on the Lipsi (x) , i ∈ N. chitz coefficients of ∂f∂x i Theorem 1. Suppose that Assumptions 1-3 hold, and players update their actions according to (8) and (9). Assuming the existence of a sequence of time intervals [tk , tk+1 ), k ∈ N, t0 = 0 which are uniformly bounded and nonoverlapping. Then the Nash equilibrium x∗ is exponentially stable under the proposed Nash equilibrium seeking algorithm (8) and (9) if the following conditions are satisfied: (1) 0 0 (l1g > 0) is Lipschitz y ( G(x) − G(x∗ ) ≤ l1g ¯ x ). constant satisfying that J(y) − J(y q ) ≤ l1 ¯ Define a Lyapunov function candidate as c T ¯−1 ¯ k x V = x ¯ + (1 − c)¯ y T P1 y¯, (16) 2 ∗

q

where c ∈ (0, 1) is a constant and P1 is defined as (12). 2 2 Define z = [¯ xT , y¯T ]T . Then γ2 z ≤ V ≤ γ1 z , where γ1 = c c −1 −1 ¯ ¯ max{ 2 λmax (k ), (1 − c)λmax (P1 )}, γ2 = min{ 2 λmin (k ), (1 − c)λmin (P1 )}. For t ∈ [tk , tk+1 ) ∩ T , k ∈ N, differentiating V along the trajectories of (8) and (9) gives V˙ = δc¯ xT J(y) + (1 − c)y¯˙ T P1 y¯ + (1 − c)¯ y T P1 y¯˙ xT (J(y) − G(x)) = δc¯ xT G(x) + δc¯ T

+ (1 − c) (−H y¯ − y˙ q ) P1 y¯ + (1 − c)¯ y T P1 (−H y¯ − y˙ q ) xT (J(y) − J(y q ))) = δc¯ xT (G(x) − G(x∗ )) + δc¯ T

+ (1 − c) (−H y¯ − y˙ q ) P1 y¯ + (1 − c)¯ y T P1 (−H y¯ − y˙ q ) 2

≤δcl1 ¯ x ¯ y − δcη ¯ x − (1 − c)¯ y T Q1 y¯ − 2(1 − c)¯ y T P1 y˙ q ,

(17)

10

S. Zhang et al.

where G(x∗ ) = 0N n and G(x) = J(y q ) are used in the derivation of the third equality of (17), Assumption 3 and (12) are used in the derivation of the last inequality of (17) and l1 > 0 is Lipschitz constant satisfying that y . By the positive-definiteness of the matrix Q1 and (15), J(y) − J(y q ) ≤ l1 ¯ one can further get that 2 V˙ ≤ − δcη ¯ x + δcl1 ¯ x ¯ y T T

dx ∂(y q ) 2 ) − (1 − c)λmin (Q1 ) ¯ y − 2(1 − c)¯ y T P1 ( ∂x dt √   2 k  P1 ) ¯ ≤ − δcη ¯ x + δ(cl1 + 2 N (1 − c)l1g ¯ x ¯ y √   2 2 k  P1 ¯ y − (1 − c)λmin (Q1 ) ¯ y . + 2δ N (1 − c)l1 ¯

(18)

    √ √ k  P1 , r2 = 2 N (1 − c)l1 ¯ k  P1 , B1 = Define r1 = cl1 + 2 N (1 − c)l1g ¯   r1 cη −2 . By condition (1), we get that B1 is positive definite. − r21 −r2 + (1−c)λδmin (Q1 ) 2 (B1 ) Then V˙ ≤ −δλmin (B1 ) z ≤ − δλmin V. γ1 For t ∈ [tk , tk+1 ) ∩ T¯, k ∈ N, differentiating V along the trajectories of (8) and (9) gives V˙ = δc¯ xT J(y) + (1 − c)y¯˙ T P1 y¯ + (1 − c)¯ y T P1 y¯˙ = δc¯ xT G(x) + δc¯ xT (J(y) − G(x)) T

+ (1 − c) (−y˙ q ) P1 y¯ + (1 − c)¯ y T P1 (−y˙ q ) xT (J(y) − J(y q ))) = δc¯ xT (G(x) − G(x∗ )) + δc¯

(19)

T

+ (1 − c) (−y˙ q ) P1 y¯ + (1 − c)¯ y T P1 (−y˙ q ) 2

≤ − δcη ¯ x + δcl1 ¯ x ¯ y − 2(1 − c)¯ y T P1 y˙ q . Similar to (18) gives √   2 k  P1 ) ¯ V˙ ≤ − δcη ¯ x + δ(cl1 + 2 N (1 − c)l1g ¯ x ¯ y (20) √   2  ¯ y . + 2δ N (1 − c)l1 k P1 ¯   −cη r21 2 Define B2 = . Obviously, λmax (B2 ) > 0, then V˙ ≤ δλmax (B2 ) z ≤ r1 2 r2 δλmax (B2 ) V. γ2 Given the analysis above, we obtain that V (t1 )  V (0)e−δΔ0 , Δ0 = λmin (B1 ) ζ0 − λmaxγ2(B2 ) (ρ0 − ζ0 ). According to condition (2), we have Δ0 > 0. γ1 By recursion, for any integer k ∈ N, V (tk+1 )  V (0)e−

k

j=0

δΔj

,

(21)

1) ζj − λmaxγ2(B2 ) (ρj − ζj ) , j = 0, 1, 2, . . . , k. According to where Δj = λminγ(B 1 condition (2), we have Δj > 0. For any given t > 0, there exists a positive integer s such that ts+1 < t  ts+2 . Furthermore, since [tk , tk+1 ) , k ∈ N is uniformly

Distributed Nash Equilibrium Seeking Algorithm

11

bounded and non-overlapping, we may let κ = mini∈N Δi > 0, ρmax = maxi∈N ρi . Thus, it follows that V (t)  V (ts+1 ) eρmax δ  eρmax δ

λmax (B2 ) γ2

 eρmax δ

λmax (B2 ) γ2

 eκδ+ρmax δ

λmax (B2 ) γ2

V (0)e−

s

j=0

δΔj

(22)

V (0)e−δ(s+1)κ

λmax (B2 ) γ2

κδ

V (0)e− ρmax t , λmax (B2 )

that is V (t)  K0 e−K1 t V (0), ∀t > 0, where K0 = eκδ+ρmax δ γ2 2 2 2 κδ . Recall that γ2 z ≤ V ≤ γ1 z , then we have z(t) ≤ K1 = ρmax  K 1 2 K0 −K1 t V (0) ≤ γγ12 K0 e−K1 t z(0) , then z(t) ≤ γγ12 K0 e− 2 t z(0) . γ2 e T

and ≤

V γ2

T T

∗ Let Mr (t) = [(x − x∗ ) , (y − y q (x∗ )) ] , where y q (x ⊗ x∗ , we have  ) = 1N K 1 Mr (t) ≤ z(t) + y q (x) − y q (x∗ ) ≤ R1 z(t) ≤ R1 γγ12 K0 e− 2 t z(0)  K1 + y q (x∗ ) − y q (x(0)) ) ≤ ≤ R1 γγ12 K0 e− 2 t ( Mr (0)  K1 R2 γγ12 K0 e− 2 t Mr (0) , for some positive constants R1 and R2 . Hence, the

conclusion is derived.

4

Numerical Example

In this section, a noncooperative game of 10 players is considered. The communication topology for the players is depicted in Fig. 1 and matrix weights are given by  1 (i, j) ∈ E, 2 I2 , Γij = (23) otherwise. 02×2 , The players’ payoff functions are N

fi (x) = −

1 1 T 2 (xi − xj ) Γij (xi − xj ) − xi + 4 , ∀i ∈ N , 2 j=1 2

(24)

where xi ∈ R2 . Through theoretical analysis, the Nash equilibrium is x∗i = [−4, −4]T , ∀i ∈ N . The assumptions of Theorem 1 are all satisfied. According to Theorem 1, when parameter δ and average intermittent communication rate ak satisfy condition (1) and condition (2) respectively, Nash equilibrium is exponentially stable. The simulation is performed in the time interval [0, 10), which is divided into equal parts of length δ and the time intervals that players can communicate with each other are randomly selected from all of the sub–intervals based on the given average intermittent communication rate. Denote the state error ei (t) = xi (t) − x∗i , i ∈ N . The trajectories of the state errors are plotted in Fig. 2 (ak = 0, ∀k ∈ N), Fig. 3 (ak = 0.01, ∀k ∈ N), Fig. 4 (ak = 0.6, ∀k ∈ N).

12

S. Zhang et al.

Fig. 1. Communication topology for the players in the numerical examples.

Fig. 2. Plot of ei (t), i ∈ N when ak = 0, ∀k ∈ N

Fig. 3. Plot of ei (t), i ∈ N , when ak = 0.01, ∀k ∈ N

Fig. 4. Plot of ei (t), i ∈ N , when ak = 0.6, ∀k ∈ N

When the average intermittent communication rate is 0, all the errors diverge to infinity eventually, which indicates the actions of the players diverge. When the average intermittent communication rate is 0.01, the errors zigzag down to zero, which indicates the actions of the players converge to the Nash equilibrium in a zigzag way, while when the average intermittent communication rate is 0.6, the errors decay to zero monotonically, which indicates the actions of the players converge to the Nash equilibrium. These simulations verify theoretical results.

5

Conclusion

Distributed Nash equilibrium seeking of noncooperative games under intermittent communication and matrix-weighted communication graph is considered in this paper. A communication threshold is proposed to make sure the states of players can converge to Nash equilibrium. Through Lyapunov method, it is theoretically proved that the states of players exponentially converge to Nash

Distributed Nash Equilibrium Seeking Algorithm

13

equilibrium under the proposed algorithm when parameter δ and average intermittent communication rate ak satisfy certain conditions. Finally, theoretical results are validated through numerical simulations.

References 1. Wen, G., Yu, W., Lv, Y., Wang, P.: Cooperative Control of Complex Network Systems with Dynamic Topologies, 1st edn. CRC Press, Boca Raton (2021) 2. Zhou, J., Wen, G., Lv, Y., Yang, T., Chen, G.: Distributed resource allocation over multiple interacting coalitions: a game-theoretic approach. arXiv:2210.02919 (2022) 3. Zhou, J., Lv, Y., Wen, C., Wen, G.: Solving specified-time distributed optimization problem via sampled-data-based algorithm. IEEE Trans. Netw. Sci. Eng. 9(4), 2747–2758 (2022) 4. Ratliff, L., Burden, S., Sastry, S.: On the characterization of local Nash equilibria in continuous games. IEEE Trans. Autom. Control 61(8), 2301–2307 (2016) 5. Zhou, J., Lv, Y., Wen, G., Lü, J., Zheng, D.: Distributed Nash equilibrium seeking in consistency-constrained multi-coalition games. IEEE Trans. Cybern. 53(6), 3675–3687 (2023) 6. Vamvoudakis, K., Lewis, F., Hudas, G.: Multi-agent differential graphical games: online adaptive learning solution for synchronization with optimality. Automatica 48(8), 1598–1611 (2012) 7. Gharesifard, B., Cortes, J.: Distributed convergence to Nash equilibria in twonetwork zero-sum games. Automatica 49(6), 1683–1692 (2013) 8. Koshal, J., Nedić, A., Shanbhag, U.V.: Distributed algorithms for aggregative games on graphs. Oper. Res. 64(3), 680–704 (2016) 9. Ye, M., Hu, G.: Distributed Nash equilibrium seeking by a consensus based approach. IEEE Trans. Autom. Control 62(9), 4811–4818 (2017) 10. Ye, M., Hu, G.: Distributed Nash equilibrium seeking in multiagent games under switching communication topologies. IEEE Trans. Cybern. 48(11), 3208–3217 (2018) 11. Wen, G., Wang, P., Lv, Y., Chen, G., Zhou, J.: Secure consensus of multi-agent systems under denial-of-service attacks. Asian J. Control 25(2), 695–709 (2023) 12. Kobo, H.I., Abu-Mahfouz, A.M., Hancke, G.P.: A survey on software-defined wireless sensor networks: challenges and design requirements. IEEE Access 5, 1872– 1899 (2017) 13. Guo, H., Wang, H.: A resource allocation algorithm for multimedia communication systems with limited bandwidth. Int. J. Commun. Syst. 22(4), 399–414 (2009) 14. Wen, G., Duan, Z., Yu, W., Chen, G.: Consensus in multiagent systems with communication constraints. Int. J. Robust Nonlinear Control 22(2), 170–182 (2012) 15. Krahmer, F., Ward, R., Wang, Y.: Efficient matrix weighted low-rank approximation. IEEE Trans. Signal Process. 64(3), 749–762 (2016) 16. Tuna, S.E.: Synchronization under matrix-weighted Laplacian. Automatica 73, 76–81 (2016) 17. Wen, G., Zheng, W., Wan, Y.: Distributed robust optimization for networked agent systems with unknown nonlinearities. IEEE Trans. Autom. Control 68, 5230–5244 (2022). https://doi.org/10.1109/TAC.2022.3216965 18. Nash, J.: Non-cooperative games. Ann. Math. 54(4), 286–295 (1951) 19. Rosen, J.B.: Existence and uniqueness of equilibrium points for concave N-person games. Econometrica 33(3), 520–534 (1965)

Accelerated Genetic Algorithm with Population Control for Energy-Aware Virtual Machine Placement in Data Centers Zhe Ding1 , Yu-Chu Tian1(B) , Maolin Tang1 , You-Gan Wang2 , Zu-Guo Yu3 , Jiong Jin4 , and Weizhe Zhang5,6 1

School of Computer Science, Queensland University of Technology, Brisbane, QLD 4001, Australia [email protected], {y.tian,m.tang}@qut.edu.au 2 Institute for Learning Sciences and Teacher Education, Australian Catholic University, Brisbane, QLD 4000, Australia [email protected] 3 Key Laboratory of Intelligent Computing and Information Processing of the Ministry of Education of China, and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan 411105, China [email protected] 4 School of Science, Computing and Engineering Technologies, Swinburne University of Technology, Melbourne, VIC 3122, Australia [email protected] 5 School of Cyberspace Science, Harbin Institute of Technology, Harbin 150001, China [email protected] 6 The Cyberspace Security Research Center, Peng Cheng Laboratory, Shenzhen, China

Abstract. Energy efficiency is crucial for the operation and management of cloud data centers, which are the foundation of cloud computing. Virtual machine (VM) placement plays a vital role in improving energy efficiency in data centers. The genetic algorithm (GA) has been extensively studied for solving the VM placement problem due to its ability to provide high-quality solutions. However, GA’s high computational demands limit further improvement in energy efficiency, where a fast and lightweight solution is required. This paper presents an adaptive population control scheme that enhances gene diversity through population control, adaptive mutation rate, and accelerated termination. Experimental results show that our scheme achieves a 17% faster acceleration and 49% fewer generations compared to the standard GA for energyefficient VM placement in large-scale data centers.

Keywords: Data center algorithm · population

· energy efficiency · virtual machine · genetic

c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14448, pp. 14–26, 2024. https://doi.org/10.1007/978-981-99-8082-6_2

Accelerated GA for Energy-Aware Virtual Machine Placement

1

15

Introduction

Data centers are essential for supporting the growing demand for cloud services worldwide. However, this increased reliance on data centers has led to a surge in electricity consumption. In the United States alone, data centers consumed 78 billion kWh of energy in 2020, necessitating the construction of 50 additional large power plants each year to meet the demand [5]. Therefore, improving the energy efficiency of data centers has become crucial. Among various factors, the placement of virtual machines (VMs) onto physical machines (PMs) plays a significant role in enhancing energy efficiency for large-scale data centers [1]. Improved VM placement strategies have shown energy savings of over 20% in recent reports [1], which motivates our research of power-aware VM placement techniques to further reduce energy consumption. The First Fit Decreasing (FFD) and Best Fit Decreasing (BFD) algorithms are commonly used heuristics for VM placement. FFD effectively handles general bin-packing problems and has been adopted for energy optimization in VM placement [6]. BFD, another heuristic algorithm, also addresses bin-packing problems based on resource utilization ratios. The Genetic Algorithm (GA) has been explored as meta-heuristics for optimized resource management in data centers [10]. It offers higher-quality solutions than FFD or BFD, albeit with increased execution time, due to searching a larger solution space. However, GA suffers from slow execution [4], making its computation, such as crossover and mutation, time-consuming for energy-efficient data centers. Thus, there is a need to accelerate GA for real-world data center applications. Efforts have been made to tune GA parameters for improved performance [9]. An adaptive GA (AGA) has been developed, optimizing parameters such as population size, crossover rate, mutation rate, generation gap, scaling window, and selection strategy [9]. Another recent work focuses on improving GA’s computation for virtual resource management in data centers [8], using the concept of decrease-and-conquer and an initial feasible solution derived from FFD. However, all these studies primarily use on static parameters in GA. This paper presents our progress in accelerating GA for energy-efficient VM placement in data centers. Our main contribution is an adaptive population control scheme that enhances gene diversity. It is achieved through dynamic population control, adaptive mutation rates, and accelerated termination.

2

Insights into GA Computation

In our previous study of the computation for GA’s fitness function (FF), we have the following equation: T f it =

1 2 ∗ Npop ∗ Ngen ∗ Nic- f it 1 2 ∗ f ∗ ucpu

(1)

16

Z. Ding et al.

where T f it stands for computation time for a single FF, f for CPU frequency, uC PU for CPU utilization, Npop for population size, Ngen for the amount of executed generations (iterations), and Nic- f it 1 for the amount of CPU instructions for a single FF computation. 1 is constant, which is According to Eq. (1), the value of the term 2∗ f ∗u c pu determined by the system configuration. Thus, Npop , Ngen and N f it 1 are dominant factors for the fitness computation. On the other hand, if we are able to calculate and save the fitness value of a VM placement plan as soon as it is generated, we do not need to calculate it again during tournament. Thus, in the ith generation, we have N f it−new and Nic− f it 1−new as the new calculation of N f it 1 and Nic- f it 1 , respectively: N f it−new =

N gen

Npop−i

(2)

i=1

where Npop−i stands for the population size of the ith generation. Then, considering both Eqs. (2) and (1), we have: Nic− f it 1−new =

f ∗ T f it−new ∗ ucpu  Ngen i=1 Npop−i

(3)

From Eq. (3), T f it−new can be expressed as: T f it−new =

N gen 1 ∗ Nic− f it 1−new ∗ Npop−i f ∗ ucpu i=1

(4)

According to Eqs. (1) and (4), T f it−new < T f it when any Npop−i is never far greater than Npop .

3 3.1

Adaptive Population Control Basic Architecture

Enriching GA’s gene diversity has several advantages, such as a higher chance of achieving greater optimality and avoiding local-optimal traps. These advantages significantly enhance the final results of the GA, as confirmed by our experimental findings presented later in this paper. However, as the population size grows, practical drawbacks arise, particularly increased computational demand. Therefore, we have two objectives for the population size in our GA: 1) Enlarge the population size to enhance gene diversity and achieve optimal solutions, and 2) Limit the population size before the computational demand becomes excessively high. To fulfill these objectives, we introduce the concept of Environmental Carrying Capacity (ECC) into our GA to control its population. ECC represents the relationship between convergence and population size in our GA.

Accelerated GA for Energy-Aware Virtual Machine Placement

17

To illustrate the relationship of ECC, consider the following example phases. When the GA starts, it is still far from reaching the optimal solution or convergence. In other words, the population has not reached the limit of ECC in terms of evolution or group size. At this stage, the population is about to grow and evolve. As the GA gets closer to the optimal solution and convergence, the population approaches the limit of ECC. Evolution becomes more challenging, and the group size stops growing. Finally, when the GA terminates with the optimal solution, the population, or civilization, has surpassed ECC. As a result, the population decreases with minimal evolution. These example phases are illustrated in Fig. 1.

Fig. 1. The architecture of our population control scheme inspired by ECC. The three ‘phases’ are automatically merged by our scheme in GA, not manually separated.

3.2

Managing the Population Size in GA

To quantify ECC in our GA, we use a parameter g to represent the quotient of current generation ( fc ) divided by the fitness value of previous generation ( fp ): g = fc / fp

(5)

Then, we use it to control the population of our GA. In the ith generation, current population size (Npop−i ) is designed to gain a chance to grow or decrease, which is determined by ECC or g. Firstly, we determine the amount of individuals (as VM-placement plans), which is about to populate ( stands for the ceiling integer): (6) Npop−i add = Npop−(i−1) /gi  − Npop−(i−1) where gi refers to the g value in ith generation. Since gi is positive and its maximum is 1, Npop−i add must be an natural number. Secondly, for each individual that is about to join population, the chance is to be determined if this individual is really about to be born (like the chance against fetal mortality): (7) P(Npop−i gr ow ) = fc / f0 where f0 stands for the fitness value of the very first generation. In addition, when an individual confirms to be born by Eq. (7), it will be granted an additional mutation process to further enrich gene diversity. Thirdly, if the population has not evolved at all, there is a chance that GA may eliminate a worst plan from the population: P(Npop−i

decr ease )

= Rm−i ∗ lg(Nnig )

(8)

18

Z. Ding et al.

where Rm−i stands for the mutation rate in the ith generation, and Nnig stands for the number of successive generations where population has not evolved since last successful evolution. Combining Eqs. (6)–(8), we finally have: Npop−i = Npop−(i−1) + Npop−i

gr ow

− Npop−i

decr ease

(9)

Npop−i stands for the population size for the next generation. A new group of young individuals are now ready to their life in our GA. 3.3

Updating Systematical Parameters Dynamically

Updating Mutation Rate Dynamically. Among GA’s genetic operators, mutation is the operator which directly affect genes in every individual. It forces one individual to update its gene, ‘randomly’. Thus, mutation may enrich the gene diversity by mutating gene into something which never appears in the population. For example, in our previous experiments, GA works up to 1% greater with 1.5% more mutation rate. In addition, notice that P(Npop−i decr ease ) is determined by the mutation rate in current generation (as Rm−i ). Therefore, we have designed an mutation rate which dynamically updates while the population evolving. We have Rm−i updated dynamically as follows: Rm−i = Rm−(i−1) ∗ [1 + (1 − gi ) ∗ lg(i)]

(10)

In Eq. (10), i is monotonously increasing, with lg(i) following O(log n) trend, and (1 − gi ) is always positive. Thus, Rm−i is monotonously increasing as well. Updating the Condition for Termination Dynamically. GA’s termination is often judged by Nnig , which is introduced in Eq. (8). When Nnig grows beyond a given threshold Nt−base , GA automatically terminates and outputs its final solution. However, Nnig refers to the generations with no improvement. Thus, if Nt is adjusted and reduced, less generations will be wasted by judging GA’s termination. Therefore, we design Nt−i , which dynamically updates along with GA’s generations, to reduce the ’junk time’ from GA. Nt−i is relevantly inferior to Nt−base . Nt−i is designed as follow: 2 Nt−i = Nt−base /Npop−i

3.4

(11)

Population Control Scheme

Our population control scheme is summarized in Algorithm 1. Initially, Algorithm 1 collects fc , fp , and Npop−(i−1) . Then, in Line 1, it calculates g as the fundamental parameter for subsequent processing. Lines 2 to 7 manage the populating process, determining Npop−i−add as the populating size. The process copies

Accelerated GA for Energy-Aware Virtual Machine Placement

19

and mutates the best individual to finalize the populating step. Following populating, Lines 8 and 9 handle the depopulating process, removing the worst individual when Nnig > 0. Line 10 updates the mutation rate Rm−i . Line 11 updates Nt−i , the accelerating termination parameter in our GA. Finally, Algorithm 1 outputs the updated population for the next generation. To provide an intuitive representation of the population-control scheme in our GA, we execute a sample GA run. Figure 2 illustrates the evolution of Npop and Rm in this GA run. This sample GA run follows our experimental configuration simulating a large-scale data center, with the pattern being set as P5.

Algorithm 1: Population control for our new GA

1 2 3 4 5 6 7 8 9 10 11 12

Input: Current population Output: Population for the next generation Initialize: Get fc , fp , and Npop−(i−1) Calculate g by Eq. (5); Calculate Npop−i−add by Eq. (6); foreach Npop−i−add do if Random chance by Eq. (7) then Find and copy the best plan in current generation; Mutate the copied plan; Add the copied plan into population; if Nnig > 0 and Random chance by Eq. (8) then Remove the worst plan from population; Update Update return

Rm−i by Eq. (10); Nt−i by Eq. (11); Population for the next generation;

Fig. 2. A large-scale GA run, which is integrated with Algorithm 1 and P5.

20

4 4.1

Z. Ding et al.

Simulation Experiments Experimental Design

Our experiments aim to evaluate the effectiveness of the presented alternative features of GA, especially for energy-efficient VM placing problems in data centers. All 5 features presented will be tested: improved data structure in Eq. (4), enlarging population size in Eqs. (6) and (7), limiting population size in Eq. (8), adjusting mutation rate in Eq. (10), and accelerating termination in Equation (11). Six patterns are designed for experiments, namely P0–P5, as listed in Table 1. P0 is the original GA. Google’s Cluster-Usage Traces [7] are used as the input data set. As the data set is huge, only a small part of the logs is extracted from the traces for demonstration. Our Data configurations are tabulated in Table 2. The mediumscale data input represents a private, university-level cloud data center, running about two thousand VMs in 500 to 600 PMs. The large-scale scenario simulates a large, public cloud data center, running thousands of VMs in over 1, 000 PMs. Table 1. Pattern configuration. “ ” indicates that the this pattern has been integrated with corresponding feature, “·” means no this pattern. Feature

Pattern P0 P1 P2 P3 P4 P5

Improved data structure

·

Adding population size











·

·









Reducing population size ·

·

·







Dynamic mutation rate

·

·

·

·





Faster termination

·

·

·

·

·



Table 2. The input configuration from Google’s data set. Scale

Input configuration Start time Initial job ID Initial task ID #Items

Medium scale 3.717 ∗ 1011 6289303978

424

2242

1.8 ∗ 109

429

5365

Large scale

4028922835

Accelerated GA for Energy-Aware Virtual Machine Placement

21

Algorithm 2: Task assignment [2,3]

1 2 3 4 5

Input: All tasks (applications) Output: A plan of application assignments to VMs Initialize: An empty VM set foreach task in all given tasks do Assign this task to a VM of proper size; Append this VM into the VM set; Convert the VM set to the assignment plan; return an application assignment plan;

It is worth mentioning that Google’s cluster traces only record tasks, not VMs. To use Google’s data set for VM-placement research, it is necessary to assign these tasks to VMs prior to placing VMs to PMs. The task-assignment Algorithm 2 is a replicate of the work in [2,3]. It assigns each incoming task to a VM of an appropriate proper size. The VMs in our experiments are designed with fixed sizes, which are specified in Table 3. The use of fixed VM sizes is a common practice in data center management, as in Amazon’s cloud. In addition, the FF computation from our previous study is given in Algorithm 3. Experiments in this paper are integrated with this FF to initially accelerate corresponding GAs. After all tasks are assigned into VMs, we execute our GA for VM placement to PMs. Our GA implies tournament selection strategy. In addition, we use an First-Fit solution as one individual in the GA’s first generation for better initialization [8].

Algorithm 3: FF computation of fitness Napm

1 2 3

Input: A VM-placement plan, including the corresponding PM list Output: Fitness value Napm (as an indicator of E) Initialize: An empty PM counter Napm foreach Planned VM do Update corresponding active PM status;

5

foreach PM on the PM list do if PM utilization  0 then Napm ← Napm + 1;

6

return Napm as this plan’s fitness value;

4

Table 3. Six types of VMs with normalized CPU capacity. CPU type

Huge Large Medium Normal Small Tiny

CPU capacity 0.45

0.30

0.15

0.10

0.045 0.015

22

Z. Ding et al.

Our simulation experiments are conducted on a desktop computer. The computer is equipped with Intel Core 2 Q6700, 3.4 GHz CPU, and 16 GB DDR4 2666 MHz RAM. It runs Windows 10 Professional operating system. The input of our experiments is a comma separated-value (CSV) file extracted from Google’s Cluster-usage Traces. The outputs are a placement plan, the energy consumption of the data center, the number of GA generations, GA execution time, and the average execution time for each GA generation. 4.2

Results for Large-Scaled Data Center

In terms of #Generation, as shown in Fig. 3 (c), P0 and P1 require similar generations because they have the same genetic operators. P2–P5 require much fewer generations than P0 and P1 due to the population control features. Nevertheless, P5 costs least generations, empowered by an accelerated termination. P3 costs slightly more generations due to the fact that limiting population size alone does not help convergence. On #PMs, as shown in Fig. 3 (a), all 6 patterns show minor differences. This is due to the fact that GA has been very close to the optimal, and even a 0.1% improvement on average will be significant. Regarding the computation time, significant differences are observed among the patterns, as depicted in Fig. 3 (b). P1 demonstrates faster performance compared to P0, due to our improved data structure. P2 exhibits the slowest execution time due to its large population size. However, by limiting the population

Fig. 3. Box plots of experimental results for a large-scaled data center. The configurations of P0–P5 are presented in Table 1.

Accelerated GA for Energy-Aware Virtual Machine Placement

23

size, P3 achieves considerably faster results than P2. It is worth noting that as the mutation rate increases, P4 and P5 experience slower execution times due to the additional computational overhead associated with mutation in each generation. Notably, P5 outperforms P4 in terms of speed, owing to its accelerated termination mechanism. Overall, P1, P3, and P5 remain faster than P0. The numerical comparisons presented in Table 4 support these observations. They indicate that with the incorporation of all our enhancements, P5 achieves a speedup of 17% compared to P0, with 49% fewer generations and a 0.28% improvement in energy savings. Furthermore, P4 achieves the highest energy savings while maintaining a similar computation time as P0, with 38% fewer generations. Lastly, by solely improving the data structure, P1 achieves a speedup of 29% compared to P0. Table 4. Experimental results for the large-scaled data center. All values are calculated as percentage compared with P0 on average. Feature

4.3

Pattern P1 P2

P3

P4

P5

Computation time −29% 40%

−28% 3%

−17%

#Generations

−1%

−30% −38%

−49%

#PMs

0.01% −0.37% 0.17% −0.43% −0.28%

−37%

Results for Medium-Scaled Data Center

The following figures and tables indicates our experimental results in our medium-scaled data center. From the perspective of the number of generations (#Generation), as depicted in Fig. 4 (c), P0 and P1 require a similar number of generations since P1 uses the same genetic operators as P0. In contrast, P2 to P5 demonstrate significantly fewer generations compared to P0 and P1, benefiting from our population control features. However, P4 and P5 require slightly more generations. This is attributed to the fact that within the medium-scale input, population size and gene diversity have a greater impact on convergence compared to just accelerating termination. Regarding #PMs, as shown in Fig. 4 (a), all six patterns exhibit minor differences. This is because the GA has been very close to the optimal solution, and even a 0.1% improvement in the average value becomes significant. In terms of computation time, as illustrated in Fig. 4 (b), significant differences can be observed among the patterns. P1 demonstrates faster performance than P0, due to our improved data structure. P2 and P3 exhibit the slowest execution times due to their large population sizes. However, as the mutation rate increases, P4 and P5 become faster due to the accelerated convergence mentioned earlier. Overall, P1, P4, and P5 remain faster than P0.

24

Z. Ding et al.

Fig. 4. Box plots of experimental results for a large-scaled data center. Configuration of P0–P5 is presented in Table 1.

The numerical comparisons shown in Table 5 provide further insights. It can be observed that with the incorporation of all our enhancing features, P5 achieves the same computational speed as P0, while requiring 21% fewer generations and achieving a 0.3% improvement in energy savings. Moreover, P3 achieves the best energy savings but requires a significantly longer execution time. Table 5. Experimental results for the medium-scaled data center. All values are calculated as percentage compared with P0 on average. Feature

Pattern P1 P2

P3

P4

P5

Computation time −1.5% 114%

158%

−13%

0.1%

#Generations

−0.3% −47%

−38%

−24%

−21%

#PMs

0.2%

−0.27% −0.32% −0.26% −0.3%

Accelerated GA for Energy-Aware Virtual Machine Placement

25

Table 6. Average execution time in each generation for the large-scaled data center. All values are calculated as percentage compared with P0 on average. Scale Large scale

Pattern P1 P2

−28% 121% 3%

Medium scale −1%

4.4

P3

P4

P5

66%

63%

−37% −30% −38% −49%

Additional Computation Demand in Each Generation

In addition to the experimental perspectives discussed earlier (such as Napm , Ng , and T), we also examine the computational demand in each generation of our tested GAs. Several factors can influence the computation demand in each generation, including data structure, population size, and mutation rate. Here we provide a deeper understanding of GA’s computation and discusses potential solutions to further enhance GA’s convergence. Table 6 presents the average duration of each generation for both scales, categorized by our six patterns. It compares the average duration of generations for P1 to P5 with that of P0. P1 exhibits a faster computation time than P0 in each generation due to its improved data structure. On the other hand, P2 to P5 are slower, in different degrees, primarily due to our population control scheme. The introduction of additional individuals or a higher mutation rate can result in increased computational demand for GAs. Lastly, it is worth noting that increasing the computational demand in GA does not necessarily guarantee better results. For instance, let us consider P2 as an example. It takes 40% longer than P4 or P5 to execute. However, as shown in Table 4, P2 does not achieve significant improvements in terms of energy perspective (Napm ), which is the primary optimization objective in this paper. This highlights the significance of our approaches to depopulate, increase mutation rate, and accelerate termination in order to achieve the desired results.

5

Conclusion

A population-controlling scheme has been presented in this paper for GA computation of energy-efficient VM placement in large-scale cloud data centers. While improving the quality of solution in terms of energy savings in data centers, our approach is shown to run 17% faster with 49% fewer generations compared to the standard GA for VM placement. The GA acceleration is achieved through the integration of adaptive population control, mutation rate adjustment, and accelerated termination in a unified framework. Heuristics developed from extensive experiments have been provided for managing population scale, mutation rate, and termination configuration, resulting in significant enhancements in computational efficiency.

26

Z. Ding et al.

Acknowledgements. This work was supported in part by the Australian Research Council (ARC) through the Discovery Project Scheme under Grant DP220100580 and Grant DP160104292, and the Industrial Transformation Training Centres Scheme under Grant IC190100020.

References 1. Alharbi, F., Tian, Y.C., Tang, M., Zhang, W.Z., Peng, C., Fei, M.: An ant colony system for energy-efficient dynamic virtual machine placement in data centers. Expert Syst. Appl. 120, 228–238 (2019) 2. Ding, Z., Tian, Y.C., Tang, M.: Efficient fitness function computation of genetic algorithm in virtual machine placement for greener data centers. In: 2018 IEEE 16th International Conference on Industrial Informatics (INDIN), pp. 181–186, Porto, Portugal, 18–20 July 2018 3. Ding, Z., Tian, Y.C., Tang, M., Li, Y., Wang, Y.G., Zhou, C.: Profile-guided threephase virtual resource management for energy efficiency of data centers. IEEE Trans. Industr. Electron. 67(3), 2460–2468 (2020) 4. Elsayed, S., Sarker, R., Coello Coello, C.A.: Fuzzy rule-based design of evolutionary algorithm for optimization. IEEE Trans. Cybern. 49(1), 301–314 (2019) 5. Kumar, S., Pandey, M.: Energy aware resource management for cloud data centers. Int. J. Comput. Sci. Inf. Secur. 14(7), 844 (2016) 6. Liu, Z., Xiang, Y., Qu, X.: Towards optimal CPU frequency and different workload for multi-objective VM allocation. In: 2015 12th Annual IEEE Consumer Communications and Networking Conference (CCNC), pp. 367–372 (2015) 7. Reiss, C., Wilkes, J., Hellerstein, J.L.: Google cluster-usage traces: format+ schema. Google Inc., White Paper, pp. 1–14 (2011) 8. Sonkiln, C., Tang, M., Tian, Y.C.: A decrease-and-conquer genetic algorithm for energy efficient virtual machine placement in data centers. In: IEEE 15th International Conference on Industrial Informatics (INDIN 2017). Eden, Germany, 24–26 July 2017 9. Srinivas, M., Patnaik, L.M.: Adaptive probabilities of crossover and mutation in genetic algorithms. IEEE Trans. Syst. Man Cybern. 24(4), 656–667 (1994) 10. Wu, G., Tang, M., Tian, Y.-C., Li, W.: Energy-efficient virtual machine placement in data centers by genetic algorithm. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds.) ICONIP 2012. LNCS, vol. 7665, pp. 315–323. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-34487-9_39

A Framework of Large-Scale Peer-to-Peer Learning System Yongkang Luo1 , Peiyi Han1,2(B) , Wenjian Luo1,2 , Shaocong Xue1 , Kesheng Chen1 , and Linqi Song3 1

Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies, School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518055, Guangdong, China [email protected] 2 Peng Cheng Laboratory, Shenzhen 518055, Guangdong, China 3 Department of Computer Science, City University of Hong Kong, Hong Kong, China

Abstract. Federated learning (FL) is a distributed machine learning paradigm in which numerous clients train a model dispatched by a central server while retaining the training data locally. Nonetheless, the failure of the central server can disrupt the training framework. Peer-to-peer approaches enhance the robustness of system as all clients directly interact with other clients without a server. However, a downside of these peer-to-peer approaches is their low efficiency. Communication among a large number of clients is significantly costly, and the synchronous learning framework becomes unworkable in the presence of stragglers. In this paper, we propose a semi-asynchronous peer-to-peer learning system (P2PLSys) suitable for large-scale clients. This system features a server that manages all clients but does not participate in model aggregation. The server distributes a partial client list to selected clients that have completed local training for local model aggregation. Subsequently, clients adjust their own models based on staleness and communicate through a secure multi-party computation protocol for secure aggregation. Through our experiments, we demonstrate the effectiveness of P2PLSys for image classification problems, achieving a similar performance level to classical FL algorithms and centralized training.

Keywords: Federated learning Peer-to-peer learning system

· Semi-asynchronous learning ·

This study is supported by the National Key R&D Program of China (Grant No. 2022YFB3102100), Shenzhen Fundamental Research Program (Grant No. JCYJ20220818102414030), the Major Key Project of PCL (Grant No. PCL2022A03, PCL2023AS7-1), Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies (No. 2022B1212010005), Shenzhen Science and Technology Program (Grant No. ZDSYS20210623091809029, RCBS20221008093131089). c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14448, pp. 27–41, 2024. https://doi.org/10.1007/978-981-99-8082-6_3

28

Y. Luo et al.

1

Introduction

Deep neural networks (DNNs) with exceptional performance typically require a large amount of available data. Consequently, obtaining sufficient data for highperformance deep learning networks is a critical challenge. Traditional training approaches are centralized, involving the collection of data from numerous applications. However, with the introduction of privacy constraints and regulations, data sharing has become increasingly restricted. Federated Learning (FL) is a promising paradigm in distributed machine learning (ML) that facilitates learning without sharing the actual datasets [1]. FL establishes a distributed framework that consists of a centralized server and numerous clients, thus overcoming data sharing limitations while still supporting efficient learning [2,3]. The server needs to perform multiple operations such as client selection, update collection, model aggregation and global model broadcast [4]. Each client trains the model locally using their own private dataset for a certain number of epochs. Subsequently, the server selects specific clients to aggregate their updated models and broadcasts the aggregated model to all clients. The core idea of FL is to “bring the code to the data, instead of the data to the code” [5], ensuring privacy while maintaining effective learning within the distributed framework. FL can be classified into asynchronous and synchronous frameworks [6]. The synchronous FL approach is often inefficient, as it requires the server to wait for all clients to complete local training before model aggregation. Due to device heterogeneity and network instability, the presence of stragglers is often unavoidable, which significantly impairs algorithm performance [7]. To mitigate the influence of stragglers, asynchronous FL (AFL) has been proposed [8]. Specifically, the server aggregates local models from clients immediately without waiting for all clients to complete their processes. However, stale local models based on outdated global models must be adjusted before aggregation to balance staleness tolerance and convergence rate [7,9,10]. In addition, semiasynchronous FL is introduced as a hybrid of synchronous FL and AFL [11], where the server stores the local models and aggregates them periodically. FL workflows can generally be divided into two categories: centralized aggregation and peer-to-peer approaches [12]. In centralized topology, a server coordinates all training nodes, aggregating and distributing models to and from the peer nodes. However, the central server can often be unreliable, and a server failure could lead to the entire federated learning system collapsing. Decentralized federated learning, which does not require a central server, is an area worth investigating. In decentralized FL, there is no centralized server, and instead, each peer node aggregates the model by connecting to one or more nodes. A serverless, peer-to-peer federated learning framework has been proposed to address these concerns [13]. Swarm Learning (SL), which operates without a centralized server and involves selecting a leader node to coordinate all other nodes, has also gained significant research interest [14].

A Framework of Large-Scale Peer-to-Peer Learning System

29

In both FL and SL, model parameters or gradients are directly transmitted, implying that the client’s model is visible to the server or leader client. This visibility raises privacy concerns due to the potential presence of malicious or honestbut-curious clients [15]. To address this issue, participants can jointly determine a common model by averaging all associated model weights without sharing the actual weight values [16]. The secure multi-party computation protocol is utilized to calculate the aggregated model in a peer-to-peer manner, resulting in secure aggregation that upholds the privacy of each client’s model [17]. Compared to traditional centralized FL, decentralized FL offers improved robustness. However, the communication between peer nodes can be costly, particularly when the number of nodes is large in decentralized FL. For instance, the piecewise secure multi-party summation protocol can significantly increase communication costs and demand more network resources in a P2PL paradigm [18]. Additionally, synchronous frameworks are acceptable when the number of peer nodes is small. However, when dealing with a large number of peer nodes, the emergence of lagging nodes can greatly reduce training efficiency. Therefore, it is necessary to design an efficient large-scale P2PL system framework that features reduced memory requirements, more efficient communication protocols, and asynchronous training. Especially, the server cannot be used to store and aggregate all parameters when we are training a large model. Instead, in P2PLSys, the server only plays a management and scheduler role. GPUs can communication and exchange model information with each other. The P2PLSys algorithm can become a more practical solution for various machine learning scenarios. In this work, we propose P2PLSys, a semi-asynchronous learning framework designed to mitigate the effects of stragglers in large-scale peer nodes. Specifically, a central server is utilized to record essential information, the IDs of nodes, in order to reduce memory pressure on clients. However, the central server does not store any model information and is only responsible for scheduling clients. When a subset of clients has completed training, the corresponding nodes’ IDs are downloaded from the clients and local aggregation. Models and data remain stored within the nodes, and model aggregation is restricted to clients. We employ an efficient secure multi-party protocol for secure aggregation. The main contributions of this paper are listed as follows. – We propose a P2PLSys framework for large-scale clients. P2PLSys is a semiasynchronous framework designed to improve the training efficiency of peerto-peer approaches. Furthermore, we propose a model adjustment method based on staleness and a secure aggregation method based on secure multiparty computation protocols. – Our experiments demonstrate high accuracy and efficiency in the models of each node under practical settings, showcasing the promise of our proposed framework in addressing the challenges of large-scale, decentralized learning. The remainder of this paper is organized as follows. In Sect. 2, we introduce some related work. Then, P2PLSys is described in detail including the architecture and overall process in Sect. 3. Meanwhile, we introduce the modules of the

30

Y. Luo et al.

server and peer nodes. In Sect. 4, we verify the performance of P2PLSys in two datasets. Finally, Sect. 5 comes to a conclusion about this paper.

2

Preliminaries

In this section, we begin by briefly introducing the basic concept of P2PL paradigm. Next, we present a simple secure multi-party computation protocol for secure aggregation. Finally, we describe weight adjustment methods used in asynchronous FL, as we apply a similar idea in our P2PLSys framework. 2.1

P2PL Paradigm

The P2PL paradigm is employed to achieve truly decentralized learning without a server or leader node different from FL or SL. Suppose there are K peer nodes in the P2PL paradigm. Once the local training of all peer nodes is completed, the K peer nodes aggregate their models by communicating each other directly. Every node can communicate with all other nodes to get their model information. However, it dangerous and inefficient. For secure aggregation, the secure multiparty computation protocol is always utilized in P2PL paradigm [16,19,20]. The secure multi-party computation protocols can protect the model information, ensuring that models remain invisible among all peer nodes. However, the most pressing issue to be addressed in the P2PL paradigm is its inefficiency. 2.2

Secure Multi-party Computation



C1 ④

② ⑤

C3

C2 ③

Fig. 1. The basic secure multi-party summation

Secure multi-party computation aims to compute an arbitrary function on private data across multiple participants while keeping their data hidden [21]. We introduce a simple protocol illustrated in Fig. 1. Each client maintains their own values s1, s2 and s3. They would like to calculate the sum of these values without revealing their own values to other clients. First, client C1 generates a random value r and sends the sum of r and s1 to client C2. C2 adds its

A Framework of Large-Scale Peer-to-Peer Learning System

31

value s2 to the sum and sends it to the next client. C1 calculates the sum of all clients’ values by subtracting r from the summation value received from client C3. Lastly, the total sum is sent to all other clients. The secure multi-party computing protocols for rings are efficient. Although this type of protocol may be vulnerable to client-side collusion, the large number of clients in the P2PLSys environment and the uncertainty of clients at each exchange make such risks tolerable. We introduce another secure multi-party computation protocol. Each peer node splits their model into K segments using a random segments generation method. Then, each of the K peer nodes sends K − 1 segments to the other peer nodes separately. Next, every node sums their own segment along with the K − 1 segments received from others. Finally, each node obtains the aggregated model after receiving the sums from all other peer nodes [18]. These protocols are deemed acceptable in the P2PLSys setting. 2.3

Asynchronous Federated Learning

In AFL, the most significant challenge is the impact of stragglers, whose models exhibit staleness. To reduce the impact of staleness, weighted aggregation methods are commonly employed. We briefly introduce some weight adjustment methods utilized in AFL [8]. wt = (1 − αt )wt−1 + αt wnew ,

(1)

where wt−1 is the global model parameters of the t − 1 epoch, wnew is the new local model parameters from a client and αt is a staleness parameter. αt = α × s(t − τ )

(2)

where α is a mixing hyperparameter, τ is the time stamp of the local model from the client, t is the current time and the crucial problem is to design the weighting function s(t − τ ). [8] proposed a polynomial and hinge weighting function to measure the staleness. The polynomial strategy is shown as follow. sa (t − τ ) = (t − τ + 1)−a The hinge strategy is computed as Eq. 4.  1, sa (t − τ ) = 1

if t − τ S + D + Rk (3) − − − − → ⎪ ⎪ i i ⎪ Tk Nj Nj+1 = (S + D + Rk ) − dk , if D + Rk < dk ≤ S + D + Rk ⎪ ⎪ ⎩ ⎩ ∞, if dk ≤ D + Rk Altitude Constraint. Flight altitude constraints are essential to increase the efficiency of UAVs in performing their tasks. By limiting the flight altitude of the UAV to a given range, the altitude Hji is kept at an average altitude. The altitude cost associated

Optimizing 3D UAV Path Planning

45

The altitude cost associated with waypoint $P_j^i$ is calculated as Eq. (4). The smaller the difference between the flight altitude and the average altitude, the lower the altitude cost. In contrast, when the UAV flies into a region where the height constraint is violated, the altitude cost is considered infinite:

$$F_3(X_i) = \sum_{j=1}^{n} H_j^i, \qquad H_j^i = \begin{cases} \left| h_j^i - \dfrac{h_{max}+h_{min}}{2} \right|, & \text{if } h_{min} \le h_j^i \le h_{max} \\ \infty, & \text{otherwise,} \end{cases} \tag{4}$$

where $h_j^i$ indicates the flight altitude of the UAV above the ground, and $h_{min}$ and $h_{max}$ represent the minimum and maximum altitudes.

Smoothness Constraint. Reducing the UAV's turning and climbing rates is essential for generating feasible paths. Figure 2 shows a three-dimensional diagram of the UAV flying from node $N_j^i$ via $N_{j+1}^i$ to $N_{j+2}^i$. $N_j^{\prime i}$, $N_{j+1}^{\prime i}$ and $N_{j+2}^{\prime i}$ are the projection points of these three path nodes onto the XOY plane. The projection vectors $\overrightarrow{N_j^{\prime i} N_{j+1}^{\prime i}}$ and $\overrightarrow{N_{j+1}^{\prime i} N_{j+2}^{\prime i}}$ form the turning angle $\phi_j^i$, which can be calculated using the following equation:

$$\phi_j^i = \arctan\!\left( \frac{\left| \overrightarrow{N_j^{\prime i} N_{j+1}^{\prime i}} \times \overrightarrow{N_{j+1}^{\prime i} N_{j+2}^{\prime i}} \right|}{\overrightarrow{N_j^{\prime i} N_{j+1}^{\prime i}} \cdot \overrightarrow{N_{j+1}^{\prime i} N_{j+2}^{\prime i}}} \right) \tag{5}$$

where the projection $\overrightarrow{N_j^{\prime i} N_{j+1}^{\prime i}}$ is obtained from the unit vector $\vec{k}$ in the z-axis direction and the path segment $\overrightarrow{N_j^i N_{j+1}^i}$ by:

$$\overrightarrow{N_j^{\prime i} N_{j+1}^{\prime i}} = \vec{k} \times \left( \overrightarrow{N_j^i N_{j+1}^i} \times \vec{k} \right) \tag{6}$$

The climbing angle $\psi_j^i$, between the path segment $\overrightarrow{N_j^i N_{j+1}^i}$ and its projection $\overrightarrow{N_j^{\prime i} N_{j+1}^{\prime i}}$, reflects the climb rate of the UAV. It is computed by:

$$\psi_j^i = \arctan\!\left( \frac{z_{j+1}^i - z_j^i}{\left| \overrightarrow{N_j^{\prime i} N_{j+1}^{\prime i}} \right|} \right) \tag{7}$$

The smooth cost is then given by the following equation:

$$F_4(X_i) = a_1 \sum_{j=1}^{n-2} \phi_j^i + a_2 \sum_{j=1}^{n-1} \left| \psi_j^i - \psi_{j-1}^i \right| \tag{8}$$

where $a_1$ and $a_2$ are respectively the penalty coefficients of the turning and climbing angles.
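A compact sketch of the altitude cost F3 (Eq. 4) and smoothness cost F4 (Eq. 8) for a list of 3D waypoints; the variable names and the default altitude limits are illustrative assumptions, not the paper's implementation:

import numpy as np

def altitude_cost(path, h_min=100.0, h_max=200.0):
    """F3: penalise deviation from the mid-altitude, infinite outside [h_min, h_max]."""
    h_mid = 0.5 * (h_min + h_max)
    cost = 0.0
    for x, y, z in path:
        if h_min <= z <= h_max:
            cost += abs(z - h_mid)
        else:
            return np.inf
    return cost

def smoothness_cost(path, a1=1.0, a2=1.0):
    """F4: penalise turning angles (Eq. 5) and changes of climbing angle (Eq. 7)."""
    p = np.asarray(path, dtype=float)
    seg = p[1:] - p[:-1]                    # path segments N_j -> N_{j+1}
    proj = seg.copy()
    proj[:, 2] = 0.0                        # projections onto the XOY plane (Eq. 6)
    turn, climb = [], []
    for j in range(len(seg)):
        climb.append(np.arctan2(seg[j, 2], np.linalg.norm(proj[j]) + 1e-12))
        if j < len(seg) - 1:
            cross = np.linalg.norm(np.cross(proj[j], proj[j + 1]))
            dot = np.dot(proj[j], proj[j + 1])
            turn.append(np.arctan2(cross, dot))
    climb_change = sum(abs(climb[j] - climb[j - 1]) for j in range(1, len(climb)))
    return a1 * sum(turn) + a2 * climb_change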


The 3D UAV path planning method is based on a spherical coordinate system, following the principle that each feasible path $X_i$ is encoded as a set of vectors from the start point to the end point. Each vector describes the movement of the UAV as it flies from one path node to the next. In the spherical coordinate system, each vector is composed of three components, $\rho \in (0, path\_length)$, $\theta \in (-\pi/2, \pi/2)$ and $\phi \in (-\pi, \pi)$, which represent the flight magnitude, elevation angle and azimuth angle of the UAV, respectively. A feasible flight path $i$ with $N$ waypoints can therefore be represented by the following $3N$-dimensional spherical vector:

$$(\rho_{i1}, \psi_{i1}, \phi_{i1}, \rho_{i2}, \psi_{i2}, \phi_{i2}, \cdots, \rho_{iN}, \psi_{iN}, \phi_{iN}), \quad N = n - 2 \tag{9}$$

Fig. 2. UAV flight path map in 3D coordinate system.
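A brief sketch of decoding one spherical-vector individual (Eq. 9) into Cartesian waypoints, treating each (rho, theta, phi) triple as a displacement from the previous node; this is one common convention for spherical-vector path encodings and the start point and helper name are illustrative:

import numpy as np

def decode_spherical_path(omega, start):
    """Turn a 3N-dimensional vector (rho, theta, phi, ...) into N Cartesian waypoints."""
    pos = np.asarray(start, dtype=float)
    waypoints = []
    for rho, theta, phi in np.asarray(omega, dtype=float).reshape(-1, 3):
        step = rho * np.array([np.cos(theta) * np.cos(phi),   # x displacement
                               np.cos(theta) * np.sin(phi),   # y displacement
                               np.sin(theta)])                # z displacement
        pos = pos + step
        waypoints.append(pos.copy())
    return np.array(waypoints)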

2.2 Beluga Whale Optimization

The Beluga Whale Optimization (BWO) algorithm models the three phases of exploration, exploitation, and whale fall within its mathematical framework, as described in the literature [8]. The balance factor $B_f$ and whale fall probability $W_f$ are introduced to select the appropriate population renewal method at each iteration. They are calculated as follows:

$$B_f = B_0 \cdot \left(1 - \frac{It}{2\, MaxIt}\right) \tag{10}$$

$$W_f = 0.1 - 0.05 \cdot \frac{It}{MaxIt} \tag{11}$$

where $B_0$ is a random value in the range (0,1) that changes in each iteration, $It$ is the current number of iterations, and $MaxIt$ is the maximum number of iterations. When $B_f > 0.5$, the algorithm is in the exploration phase; when $B_f \le 0.5$, the algorithm is in the exploitation phase; when $B_f < W_f$, the algorithm is in the whale fall phase. As the number of iterations increases, the fluctuation range of $B_f$ narrows from (0,1) to (0,0.5). After the transition between the exploration and exploitation phases, the decision regarding whale fall behavior is determined by comparing the whale fall probability $W_f$ with the balance factor $B_f$: when $W_f$ exceeds $B_f$, the algorithm enters the whale fall phase. The detailed mathematical model for each stage is described as follows:


Exploration Phase. This phase draws inspiration from belugas swimming in pairs. The new position $X_{i,j}^{t+1}$ of the $i$-th beluga whale on the $j$-th dimension is generated by the following equation:

$$\begin{cases} X_{i,j}^{t+1} = X_{i,p_j}^{t} + \left(X_{r,p_1}^{t} - X_{i,p_j}^{t}\right)(1 + r_1)\sin(2\pi r_2), & j = \text{even} \\ X_{i,j}^{t+1} = X_{i,p_j}^{t} + \left(X_{r,p_1}^{t} - X_{i,p_j}^{t}\right)(1 + r_1)\cos(2\pi r_2), & j = \text{odd} \end{cases} \tag{12}$$

where $t$ is the current iteration; $p_j$ $(j = 1, 2, \cdots, d)$ is a random dimension index selected from the $d$ dimensions; $X_{i,p_j}^{t}$ and $X_{r,p_1}^{t}$ are the current positions of the $i$-th and $r$-th beluga whales on the $p_j$-th dimension, where $r$ is a randomly selected beluga whale; $r_1$ and $r_2$ are random numbers in (0,1), which are applied to enhance the randomness in this phase.

Exploitation Phase. The exploitation phase of the BWO algorithm incorporates behavioral characteristics inspired by the foraging behavior of belugas. To improve global convergence during this phase, the algorithm introduces the Lévy flight strategy into the prey predation model. The following equations generate a random number that obeys the Lévy distribution:

$$L_f = 0.05 \times \frac{\mu \cdot \sigma}{|\nu|^{1/\beta}} \tag{13}$$

$$\sigma = \left( \frac{\Gamma(1+\beta) \times \sin(\pi\beta/2)}{\Gamma\!\left[(1+\beta)/2\right] \times \beta \times 2^{(\beta-1)/2}} \right)^{1/\beta} \tag{14}$$

where $\mu$ and $\nu$ are random numbers obeying the normal distribution, and $\beta$ is usually set to 1.5. The mathematical model of a beluga whale using the Lévy flight strategy to hunt its prey is:

$$X_i^{t+1} = r_3 \cdot X_{best}^{t} - r_4 \cdot X_i^{t} + S \cdot L_f \cdot \left(X_r^{t} - X_i^{t}\right) \tag{15}$$

where $r_3$ and $r_4$ are random numbers in (0,1); $L_f$ is a random number obeying the Lévy distribution; and $S$ denotes the random jump strength, which measures the intensity of the Lévy flight and is described by the following equation:

$$S = 2\, r_4 \left(1 - \frac{It}{MaxIt}\right) \tag{16}$$

Whale Fall Phase. To ensure that the population size remains constant, the position update formula is devised based on the beluga's current position $X_i^t$ and the falling step length $X_{step}^t$:

$$X_i^{t+1} = r_5 \cdot X_i^t - r_6 \cdot X_r^t + r_7 \cdot X_{step}^t \tag{17}$$

$$X_{step}^t = (Ub - Lb)\, e^{\frac{-C \cdot It}{MaxIt}} \tag{18}$$

where $r_5$, $r_6$ and $r_7$ are random numbers in (0,1); $Lb$ and $Ub$ are the lower and upper bounds, respectively; $X_r^t$ is the position of an individual $r$ chosen at random from the current population; and $C$ is the step factor, related to the population size $N$ and the whale fall probability $W_f$ mentioned above, calculated as follows:

$$C = 2N \times W_f \tag{19}$$
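A condensed sketch of one BWO generation (Eqs. 10-19) for a population matrix X of shape (N, d); the phase selection follows the description above, but the update of Eq. (12) is applied in a simplified, vectorized form, so this is an illustration rather than the authors' implementation:

import math
import numpy as np

def levy(beta=1.5):
    """Random number obeying a Lévy distribution (Eqs. 13-14)."""
    sigma = (math.gamma(1 + beta) * math.sin(math.pi * beta / 2) /
             (math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))) ** (1 / beta)
    mu, nu = np.random.normal(size=2)
    return 0.05 * mu * sigma / abs(nu) ** (1 / beta)

def bwo_step(X, x_best, it, max_it, lb, ub):
    N, d = X.shape
    Bf = np.random.rand(N) * (1 - it / (2 * max_it))      # Eq. (10), B0 random in (0,1)
    Wf = 0.1 - 0.05 * it / max_it                         # Eq. (11)
    for i in range(N):
        r = np.random.randint(N)                          # a randomly selected whale
        if Bf[i] > 0.5:                                   # exploration, simplified Eq. (12)
            r1, r2 = np.random.rand(2)
            X[i] = X[i] + (X[r] - X[i]) * (1 + r1) * np.sin(2 * np.pi * r2)
        else:                                             # exploitation, Eq. (15)
            r3, r4 = np.random.rand(2)
            S = 2 * r4 * (1 - it / max_it)                # Eq. (16)
            X[i] = r3 * x_best - r4 * X[i] + S * levy() * (X[r] - X[i])
        if Bf[i] < Wf:                                    # whale fall, Eqs. (17)-(19)
            r5, r6, r7 = np.random.rand(3)
            C = 2 * N * Wf
            x_step = (ub - lb) * math.exp(-C * it / max_it)
            X[i] = r5 * X[i] - r6 * X[r] + r7 * x_step
    return np.clip(X, lb, ub)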


3 Multi-strategy Enhanced Beluga Whale Optimizer

3.1 Elitist Group Genetic Strategy

The genetic algorithm (GA) [12] is known for its ability to quickly identify the range in which the global optimum may reside. In response to BWO's premature convergence, the genetic operators and the BWO-specific strategies are employed alternately with a certain probability to update the population: a random number in (0,1) is generated, and the BWO strategy is executed when rand < 0.5; otherwise, the genetic operator is applied. This allows optimal use of GA's strong global search capability. The three whales with the best fitness values in the population are selected as the elite group $g_{i1}, g_{i2}, g_{i3}$ and compared with the remaining whales $g_{j1}, \cdots, g_{jm}$. If whale $i$ has a better fitness value than whale $j$, whale $i$ is genetically crossed with whale $j$. Each crossover produces two new individuals $NewX_j^i$ and $NewX_j^{i+1}$, and the whale genetic crossover is given by:

$$NewX_j^i = (r_8 + 1) \cdot X_j^{g_i} + (1 - r_9) \cdot X_j^{g_j}, \quad i \in [1, n],\ j \in [1, d] \tag{20}$$

$$NewX_j^{i+1} = (r_8 + 1) \cdot X_j^{g_j} + (1 - r_9) \cdot X_j^{g_i}, \quad i \in [1, n],\ j \in [1, d] \tag{21}$$

where $X_j^{g_i}$ and $X_j^{g_j}$ denote gene $j$ of $X^{g_i}$ and $X^{g_j}$, respectively, and $r_8$ and $r_9$ are random numbers in the range [0,2]. Otherwise, whale $i$ and whale $j$ are genetically mutated. Each mutation also produces two new whale individuals $NewX_j^i$ and $NewX_j^{i+1}$. The whale gene mutation is given by:

$$NewX_j^i = X_j^{g_i} + \delta \cdot \gamma, \quad i \in [1, n],\ j \in [1, d] \tag{22}$$

$$NewX_j^{i+1} = X_j^{g_j} + \delta \cdot \gamma, \quad i \in [1, n],\ j \in [1, d] \tag{23}$$

$$\delta = 0.1 \cdot (Ub - Lb) \tag{24}$$

where $\gamma$ is a random value drawn from a normal distribution.
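An illustrative sketch of how new individuals could be produced with the elitist-group crossover and mutation of Eqs. (20)-(24); the array shapes and the minimisation convention (lower fitness is better) are assumptions:

import numpy as np

def elitist_genetic_step(X, fitness, lb, ub):
    """Cross the three best whales with worse ones (Eqs. 20-21), otherwise mutate (Eqs. 22-24)."""
    order = np.argsort(fitness)                 # lower fitness = better
    elites = order[:3]
    delta = 0.1 * (ub - lb)                     # Eq. (24)
    offspring = []
    for j in order[3:]:
        e = elites[np.random.randint(3)]        # pick one elite whale g_i
        gi, gj = X[e], X[j]
        if fitness[e] < fitness[j]:             # elite is better -> crossover
            r8, r9 = np.random.uniform(0, 2, size=2)
            child1 = (r8 + 1) * gi + (1 - r9) * gj   # Eq. (20)
            child2 = (r8 + 1) * gj + (1 - r9) * gi   # Eq. (21)
        else:                                   # otherwise -> Gaussian gene mutation
            gamma_ = np.random.normal(size=gi.shape)
            child1 = gi + delta * gamma_             # Eq. (22)
            child2 = gj + delta * gamma_             # Eq. (23)
        offspring.extend([child1, child2])
    return np.clip(np.array(offspring), lb, ub)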

3.2 Adaptive Gauss Variational Operator

The adaptive Gauss variational operator [13] is introduced in the exploration phase. The Gauss variation employs a Gaussian distribution to perturb the position, and the probability density function of the Gaussian distribution is shown in Eq. (25):

$$Gauss: \quad f(x) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}} \tag{25}$$

where $\mu$ denotes the expected value and $\sigma^2$ denotes the variance. When $\mu = 0$ and $\sigma = 1$, the equation denotes the standard normal distribution.


The beluga whale swimming equation is improved by introducing the Gaussian variation, as follows:

$$X(t+1) = X(t)\left(1 + k \cdot Gauss(0, 1)\right) \tag{26}$$

where $X(t)$ refers to the position before mutation, $X(t+1)$ denotes the position after mutation, and $k$ is a step-size factor in the mutation operator that coordinates the exploration and exploitation capabilities, calculated by:

$$\begin{cases} k = \left(1 - \dfrac{It}{MaxIt}\right) \times a \\[6pt] a = 2 - \dfrac{2\, It}{MaxIt} \end{cases} \tag{27}$$

It can be seen from Eq. (27) that $k$ keeps shrinking dynamically as the evolutionary iterations increase. The step-size factor therefore allows the algorithm to generate large perturbations in the early part of the search, while the perturbations become smaller in the later part. This provides a favorable premise for effectively balancing the global and local search capabilities of the BWO algorithm.

3.3 Random Opposition-Based Learning Strategy

Opposition-based Learning (OBL) is an improvement strategy in the field of swarm intelligence optimization proposed by Tizhoosh et al. [14] in 2005. Central to this approach is the computation of the opposite solution, predicated upon the current feasible solution in the population. By comparing the objective function values of the current and opposite solutions, the superior one proceeds to the next iteration. The mathematical expression for OBL is as follows:

Theorem 1. Let $P(x_1, x_2, \cdots, x_D)$ be a point in $D$-dimensional space, where $x_i \in [a_i, b_i]$ and $i = 1, 2, \cdots, D$. The opposite point $\hat{P}(\hat{x}_1, \hat{x}_2, \cdots, \hat{x}_D)$ of $P$ is defined by:

$$\hat{x}_i = a_i + b_i - x_i \tag{28}$$

While the OBL strategy generates an opposite solution that can extend the search, the distance between the current solution and its opposite solution is fixed and lacks randomness. If both solutions deviate from the global optimum, local optima may still be encountered. To tackle this, the random opposition-based learning (ROBL) strategy is proposed. This approach adds randomness to the opposite-solution calculation, as depicted in the formula below:

$$x_i^{*} = a_i + b_i - r_{10} \times \hat{x}_i \tag{29}$$

where $r_{10}$ denotes a random number drawn from (0,1). Incorporating the ROBL strategy at the end of each iteration markedly enhances population diversity. The fitness of the opposition solutions generated by the ROBL strategy is compared with that of the original solutions, and an opposition solution replaces the original one if it is superior.
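A short sketch of the adaptive Gaussian perturbation (Eq. 26, using the step-size schedule as reconstructed in Eq. 27) and the ROBL update (Eq. 29); the greedy replacement assumes a minimisation objective f:

import numpy as np

def adaptive_gauss(x, it, max_it):
    """Eq. (26): perturb a position with a step factor that shrinks over iterations."""
    a = 2 - 2 * it / max_it
    k = (1 - it / max_it) * a                 # Eq. (27): large early steps, small late steps
    return x * (1 + k * np.random.normal(size=x.shape))

def robl(x, lb, ub):
    """Eq. (29): opposition of a solution with a random scale factor in (0,1)."""
    return lb + ub - np.random.rand(*x.shape) * x

def keep_better(x, candidate, f):
    """Greedy selection: keep the candidate only if it improves the objective."""
    return candidate if f(candidate) < f(x) else x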


4 Validation Experiments

In this section, the CEC2022 test functions [15] are utilized to comprehensively investigate the overall performance of OGGBWO. F1 is a unimodal function based on a rotation matrix, which is used to measure the exploitation capability and convergence of the algorithm. F2–F5 are multimodal functions based on translation and rotation matrices, which have a large number of local optima and test the local search capabilities. F6–F8 and F9–F12 are hybrid and composite functions, respectively; their complex constructions present a formidable challenge for algorithms searching for the global optimum. We selected five representative algorithms as comparison algorithms: the whale optimization algorithm (WOA) [16], the sand cat swarm optimization algorithm (SCSO) [17], the peafowl optimization algorithm (POA) [18], the dung beetle optimizer (DBO) [19] and BWO. They are all metaheuristics that have emerged in recent years. To enable a comprehensive comparison of algorithm performance, the parameters of all algorithms are set according to the original literature [8, 16–19]. To avoid chance errors, each algorithm is run 30 times independently on every test function, and the mean and standard deviation of the results are recorded. Each run terminates when the maximum number of iterations (set to 1000) is reached. Table 1 summarizes the performance of each algorithm on CEC2022; OGGBWO obtains the best mean value on every function.

Table 1. CEC2022 test function values for each algorithm.

Function  Metric  WOA       SCSO      POA       DBO       BWO       OGGBWO
F1        Mean    2.75e+04  3.53e+04  8.02e+03  2.08e+04  6.08e+04  1.07e+03
          Std     1.23e+04  1.24e+04  2.76e+03  7.05e+03  1.92e+04  4.87e+02
F2        Mean    5.74e+02  1.47e+03  6.43e+02  4.92e+02  2.07e+03  4.68e+02
          Std     5.47e+01  3.15e+02  1.14e+02  5.95e+01  2.83e+02  2.43e+01
F3        Mean    6.65e+02  6.83e+02  6.49e+02  6.37e+02  6.79e+02  6.03e+02
          Std     1.04e+01  1.07e+01  1.05e+01  1.55e+01  7.32e+00  1.47e+00
F4        Mean    9.28e+02  9.66e+02  8.83e+02  9.03e+02  9.72e+02  8.63e+02
          Std     3.60e+01  1.03e+01  1.31e+01  2.80e+01  8.06e+00  1.77e+01
F5        Mean    3.83e+03  3.28e+03  2.20e+03  1.98e+03  3.64e+03  1.66e+03
          Std     8.66e+02  4.32e+02  2.23e+02  5.93e+02  2.38e+02  5.64e+02
F6        Mean    9.43e+05  3.21e+08  1.02e+06  3.92e+05  1.11e+09  4.93e+03
          Std     1.52e+06  2.75e+08  4.77e+06  1.18e+06  4.05e+08  3.61e+03
F7        Mean    2.19e+03  2.21e+03  2.11e+03  2.12e+03  2.22e+03  2.04e+03
          Std     4.55e+01  2.63e+01  4.08e+01  5.71e+01  3.66e+01  1.62e+01
F8        Mean    2.29e+03  2.44e+03  2.28e+03  2.30e+03  2.28e+03  2.23e+03
          Std     5.61e+01  1.22e+02  6.56e+01  8.08e+01  1.76e+01  2.22e+01
F9        Mean    2.56e+03  2.75e+03  2.56e+03  2.51e+03  3.02e+03  2.48e+03
          Std     3.60e+01  1.29e+02  3.87e+01  3.17e+01  1.38e+02  1.40e-01
F10       Mean    4.45e+03  4.45e+03  3.89e+03  3.09e+03  3.48e+03  2.65e+03
          Std     1.18e+03  1.91e+03  8.88e+02  1.02e+03  8.43e+02  1.98e+02
F11       Mean    3.37e+03  7.61e+03  4.68e+03  3.16e+03  7.99e+03  2.95e+03
          Std     1.92e+02  5.90e+02  8.15e+02  5.04e+02  4.09e+02  9.44e+01
F12       Mean    3.28e+03  3.55e+03  3.25e+03  3.25e+03  3.54e+03  3.20e+03
          Std     5.87e+01  1.40e+02  5.25e+01  5.45e+01  7.23e+01  3.02e+01

5 OGGBWO Algorithm for 3D UAV Path Planning

To gauge the efficacy of the OGGBWO algorithm proposed in this study on a real-world problem, we employ it to obtain viable paths for UAVs in 3D terrain environments. A total of four different terrain scenarios are designed in this paper, with progressively increasing complexity. The obstacle information for each terrain is listed in Table 2. OGGBWO and the other comparative algorithms all search with the spherical vector-based planning method during optimization: each candidate path is encoded as a group of vectors consisting of magnitude, elevation and azimuth angles, the configuration space is searched, and the flight path with the lowest total cost function is planned. The model parameters of the 3D UAV path planning problem and the experimental parameters are shown in Table 3. The population size of each metaheuristic algorithm is set to 100 and the maximum number of iterations is set to 200. To avoid chance influencing the experimental results, the comparison experiment is repeated 30 times for each terrain scenario. The mean and standard deviation of the results are recorded in Table 4.

Table 2. Obstacle data in the four terrain scenarios (obstacle center coordinates and radii).

Obstacle coordinates   Obstacle radius
(500, 500, 100)        80
(300, 450, 150)        80
(700, 400, 150)        100
(700, 450, 150)        80
(710, 220, 100)        60
(500, 450, 150)        80
(700, 265, 80)         70
(650, 520, 150)        70
(200, 200, 100)        60
(400, 500, 150)        80
(400, 200, 80)         70
(500, 350, 150)        70
(530, 350, 150)        80
(710, 680, 80)         80
(400, 500, 180)        70
(600, 200, 150)        80
(580, 700, 130)        70
(800, 300, 100)        50
(620, 550, 150)        50
(770, 400, 80)         80
(420, 700, 80)         50

Table 3. Experimental parameters for UAV path planning.

Parameter                                                Value
Number of path nodes n                                   12
Weight parameters of the smoothness cost function        a1 = 1, a2 = 1
Weighting parameters of each cost function in Eq. (1)    b1 = 5, b2 = 1, b3 = 10, b4 = 1
UAV flight height limitation                             100 m–200 m
Drone diameter                                           1 m
Safe distance between UAV and obstacle                   1 m

Table 4. Cost function values for each algorithm in the terrain scenarios.

Scenario  Metric  WOA        SCSO       POA        DBO        BWO        OGGBWO
1         Mean    4992.1576  6820.8589  4695.2355  4758.7134  4690.5099  4685.3127
          Std     254.5019   269.0055   11.6933    83.4159    4.5391     3.6273
2         Mean    5287.1284  7011.1156  4797.0948  4926.8160  4778.5108  4736.0970
          Std     235.7259   421.0403   41.0735    80.8828    27.1687    6.9502
3         Mean    5961.3775  7727.0729  5223.2256  5304.9985  5302.2075  5211.7422
          Std     450.5779   427.0001   81.9074    169.2191   94.6612    172.1102
4         Mean    5606.4208  7771.7513  4892.5040  5310.0170  4901.6182  4742.7510
          Std     497.7995   483.7660   266.7570   267.2909   207.6780   169.1825

From the data in Table 4, the flight paths generated by the OGGBWO algorithm have the smallest fitness function values in all four scenarios. This indicates that the proposed OGGBWO algorithm can find a path for the UAV with the lowest flight cost, which not only has a relatively short distance but also satisfies constraints such as the altitude restriction during flight. The universality of the OGGBWO algorithm allows it to find a better flight path in both simple and complex scenarios. In scenario 1, the difference between the OGGBWO algorithm and the second-ranked BWO algorithm is approximately 42 units. In the complex scenario 4, the advantage of the OGGBWO algorithm shows more clearly, with a difference of approximately 150 units between OGGBWO and the second-ranked POA algorithm. From a stability point of view, the standard deviation of the OGGBWO algorithm is the smallest in scenarios 1, 2 and 4. In scenario 3, the OGGBWO algorithm ranks third in terms of stability and is not far from the POA algorithm, which has the best standard deviation there. To compare the performance of OGGBWO with the other algorithms in finding the optimal path more clearly, the flight trajectories of the UAV avoiding the obstacles were plotted.


Figure 3(a)–(d) and 3(e)–(h) show the top views and the corresponding 3D diagrams, respectively, where the solid circle and the pentagram represent the flight start and end points.

Fig. 3. Top view of the planned path for each algorithm and the corresponding 3D view.

6 Conclusion

To address the problem that the beluga whale optimizer tends to fall into local optima and converges slowly when solving the UAV path planning problem, this paper combines three strategies and proposes a new algorithm, OGGBWO. First, the CEC2022 test functions are used as a benchmark to measure the advantages and disadvantages of the OGGBWO algorithm relative to existing algorithms and to evaluate its generalization capability and effectiveness on different problems. The experimental results show that OGGBWO has more robust and stable optimization performance than WOA, SCSO, POA, DBO and BWO. In addition, OGGBWO is thoroughly compared with the other algorithms in benchmark scenarios of four different complexities. The results of the comparison experiments show that OGGBWO always minimizes the total cost function of the UAV flight and achieves the best-quality path in all cases. Our future work will focus on designing precise flight constraints for specific application scenarios, such as UAV irrigation. We will further improve the optimization performance and applicability of OGGBWO, evaluating it under different benchmark functions.

Acknowledgements. This research is supported by the National Natural Science Foundation of China (No. 71863018 and No. 71403112), Jiangxi Provincial Social Science Planning Project (No. 21GL12) and Technology Plan Projects of Jiangxi Provincial Education Department (No. GJJ200424).

References

1. Moshref-Javadi, M., Winkenbach, M.: Applications and research avenues for drone-based models in logistics: a classification and review. Expert Syst. Appl. 177, 114854 (2021)
2. Abbas, A., et al.: Drones in plant disease assessment, efficient monitoring, and detection: a way forward to smart agriculture. Agronomy 13(6), 1524 (2023)


3. McRae, J.N., Gay, C.J., Nielsen, B.M., Hunt, A.P.: Using an unmanned aircraft system (Drone) to conduct a complex high altitude search and rescue operation: a case study. Wilderness Environ. Med. 30(3), 287–290 (2019)
4. Bhagat, S., Sujit, P.B.: UAV target tracking in urban environments using deep reinforcement learning. In: 2020 International Conference on Unmanned Aircraft Systems (ICUAS), pp. 694–701. IEEE (2020)
5. McLain, T.W., Beard, R.W.: Coordination variables, coordination functions, and cooperative timing missions. J. Guid. Control. Dyn. 28(1), 150–161 (2005)
6. Song, P.C., Pan, J.S., Chu, S.C.: A parallel compact cuckoo search algorithm for three-dimensional path planning. Appl. Soft Comput. 94, 106443 (2020)
7. Phung, M.D., Ha, Q.P.: Safety-enhanced UAV path planning with spherical vector-based particle swarm optimization. Appl. Soft Comput. 107, 107376 (2021)
8. Zhong, C., Li, G., Meng, Z.: Beluga whale optimization: a novel nature-inspired metaheuristic algorithm. Knowl.-Based Syst. 251, 109215 (2022)
9. Alsoruji, G.S., Sadoun, A.M., Abd Elaziz, M., Al-Betar, M.A., Abdallah, A.W., Fathy, A.: On the prediction of the mechanical properties of ultrafine grain Al-TiO2 nanocomposites using a modified long-short term memory model with beluga whale optimizer. J. Market. Res. 23, 4075–4088 (2023)
10. Hassan, M.H., Kamel, S., Jurado, F., Ebeed, M., Elnaggar, M.F.: Economic load dispatch solution of large-scale power systems using an enhanced beluga whale optimizer. Alex. Eng. J. 72, 573–591 (2023)
11. Raj, M.G., Pani, S.K.: Hybrid feature selection and BWTDO enabled DeepCNN-TL for intrusion detection in fuzzy cloud computing. Soft Comput. 1–20 (2023). https://doi.org/10.1007/s00500-023-08573-3
12. Goldberg, D.E., Holland, J.H.: Genetic algorithms and machine learning. Mach. Learn. 3(2), 95–99 (1988)
13. Salgotra, R., Singh, U.: Application of mutation operators to flower pollination algorithm. Expert Syst. Appl. 79, 112–129 (2017)
14. Tizhoosh, H.R.: Opposition-based learning: a new scheme for machine intelligence. In: International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06), vol. 1, pp. 695–701. IEEE (2005)
15. Biedrzycki, R., Arabas, J., Warchulski, E.: A version of NL-SHADE-RSP algorithm with midpoint for CEC 2022 single objective bound constrained problems. In: 2022 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2022)
16. Mirjalili, S., Lewis, A.: The whale optimization algorithm. Adv. Eng. Softw. 95, 51–67 (2016)
17. Seyyedabbasi, A., Kiani, F.: Sand cat swarm optimization: a nature-inspired algorithm to solve global optimization problems. Eng. Comput. 39, 1–25 (2022)
18. Wang, J., et al.: Novel phasianidae inspired peafowl (Pavo muticus/cristatus) optimization algorithm: design, evaluation, and SOFC models parameter estimation. Sustain. Energy Technol. Assess. 50, 101825 (2022)
19. Xue, J., Shen, B.: Dung beetle optimizer: a new meta-heuristic algorithm for global optimization. J. Supercomput. 79(7), 7305–7336 (2023)

Interactive Attention-Based Graph Transformer for Multi-intersection Traffic Signal Control

Yining Lv1,2, Nianwen Ning1,2(B), Hengji Li1,2, Li Wang1,2, Yanyu Zhang1,2, and Yi Zhou1,2

1 School of Artificial Intelligence, Henan University, Zhengzhou 450046, China
{lyn,lihengji,wangli123,zyy,zhouyi}@henu.edu.cn
2 International Joint Research Laboratory for Cooperative Vehicular Networks of Henan, Zhengzhou 450046, China
[email protected]

Abstract. With the exponential growth in the number of motor vehicles, urban traffic congestion has become a pressing issue, and traffic signal control plays a pivotal role in alleviating it. When modeling multiple intersections, most studies focus on communication among intersections within a region and rarely consider cross-regional intersections. To address this limitation, we construct an interactive attention-based graph transformer network for traffic signal control (GTLight). Specifically, the model captures correlations between cross-regional intersections using an interactive attention mechanism. In addition, the model designs a phase-timing optimization algorithm to solve the problem of Q-value overestimation in signal timing strategies. We validate the effectiveness of GTLight on different traffic datasets. Compared to a recent graph-based reinforcement learning method, the average travel time is improved by 28.16%, 26.56%, 25.79%, 26.46%, and 19.59%, respectively.

Keywords: Traffic signal control · Cross-regional intersections · Graph transformer network · Interactive attention mechanism · Phase-timing

1 Introduction

The explosion of traffic data brings us into the era of big data. According to statistics, nearly 99 hours per capita and $88 billion in total are lost every year in the United States due to traffic congestion [1]. Real-time signal control is an effective way to alleviate traffic congestion and reduce traffic flow pressure. Early researchers applied statistical models to study traffic signal timing strategies, but these approaches impose severe labor burdens. Adaptive traffic control systems (ATCS), such as SCOOT [2] and SCATS [3], can adjust the signal timing based on current traffic conditions; however, these algorithms have high computational complexity. Data-based traffic signal control schemes are implemented by developing data analysis techniques. In [4], the maximum throughput control (MTC) method maximizes capacity by dynamically and greedily


adjusting the signal timing. Reinforcement learning (RL) based traffic signal control methods, such as [5–7], can predict real-time traffic status dynamically. To enhance the learning capability of RL, many researchers combine deep neural networks (DNNs) with RL. The algorithm in [8], for example, utilizes deep reinforcement learning (DRL) for traffic signal control at a single intersection and is kept stable by a target network and an experience replay mechanism. When modeling multi-intersection scenarios, researchers assign an agent to each intersection, as in [9–11]. These algorithms overcome the scalability problems of single-agent approaches and improve the accuracy of traffic signal control.

However, the existing algorithms have several limitations. First, higher-order spatial dependence: collaboration of the target intersection only with its neighboring intersections can limit the effectiveness of the control strategy, so the agent needs to sense the traffic in higher-order neighborhoods; most existing methods do not consider cooperation between cross-regional intersections. Second, the globally optimal strategy: treating neighboring intersections with different characteristics equally can result in a lack of coordination between upstream and downstream controllers, so the globally optimal strategy cannot be achieved.

Inspired by Vaswani et al. [12], we combine the transformer with graph neural networks (GNNs) to create an interactive attention-based graph transformer network for traffic signal control (GTLight). It considers the correlation between cross-regional intersections and solves the higher-order spatial dependency problem. Moreover, the model designs a phase-timing optimization algorithm to address the issue of Q-value overestimation. Unlike the original transformer, our attention mechanism is interactive: it is used to distinguish the impact of different intersections. The main contributions of this paper are summarized as follows:

• We construct GTLight, which aggregates the global state information of the road network. It learns multiple dependency patterns through an interactive attention mechanism and captures different relevant information from various potential subspaces, solving the problem of higher-order spatial dependency between intersections.
• A phase-timing optimization algorithm is devised. It incorporates the idea of a dual network structure: a new DNN is added to select the optimal action, while the original network evaluates the Q-value of the subsequent actions. Decoupling the Q-values used for action selection and evaluation solves the Q-value overestimation problem.
• Different experiments are conducted on synthetic and real-world datasets. The results show that GTLight outperforms the baseline algorithms in average travel time and achieves faster convergence.

The rest of the paper is organized as follows. Section 2 reviews the existing traffic signal control methods. Section 3 describes the graph reinforcement learning (GRL) setup. Section 4 introduces the specific implementation details of GTLight. Section 5 presents the experimental analysis. Section 6 summarizes the work and future perspectives of this study.

2 Related Work

2.1 Traffic Signal Control Based on DRL

DRL has been applied in various intelligence fields, and most researchers have used DRL to optimize traffic signal timing strategies. [13] solely focused on the state information of individual agents without interaction with other agents. [14] proposed the multi-agent A2C (MA2C) algorithm for traffic signal control, which takes into account the first-order neighboring state information of each agent. In [15], the authors introduced neighboring agents that share local state and traffic information to control multiple intersections using Q-learning networks. [16] integrated RL and game theory, where an agent plays with its neighbors and learns the best response strategy. Maximum pressure was employed as a reward for dynamic network traffic signal control in the study conducted by [17]. These methods directly connect neighboring agents and treat the impacts of adjacent intersections equally. However, traffic conditions are constantly changing, and each adjacent intersection has a different impact on the target intersection. Therefore, all adjacent intersections cannot be considered at the same level.

2.2 Traffic Signal Control Based on GRL

In recent years, researchers have applied GNNs [18] to traffic signal control to overcome the limitations of DRL. [19] used a variational learning method to model complex dependencies between different variables using probabilistic GNNs. [20] created a traffic light adjacency graph by considering the spatial relationships of the road network. [21] used graph convolutional neural networks (GCNNs) to model and predict traffic states. [22,23] utilized graph attention networks (GAT) to aggregate features of adjacent intersections. [24] proposed a meta-learning spatial-temporal GAT that enables the model to learn traffic signal control tasks through weight updates. These models have demonstrated the significance of incorporating varying degrees of influence from neighboring intersections. However, they cannot model features beyond the agent's field of perception. The spatial attributes of the road network are divided into regional and cross-regional spaces, and traffic flows in different regions are interrelated, so information on traffic conditions across the whole road network needs to be considered.

3 Problem Description

The multi-intersection traffic signal control problem is modeled from the perspective of GRL. Specifically, the state features of the target intersection are first obtained. Then, the spatial dependency is captured using GNNs. Finally, the signal timing strategy is obtained by DRL. The process is modeled as an MDP ⟨S, O, A, P, R, π, γ⟩.

State Space S and Observation Space O. Each agent i observes a portion of the global state s ∈ S as its observation o ∈ O. The observed state at time t


is denoted as $o_i^t = [n, p]$, where $n$ represents the number of vehicles in each lane and $p$ is the current phase value.

Action A. Agent $i$ makes its next decision by choosing an action $a_i$ from the set of available actions $A_i$. We use eight variable signal phases with the same action set for each agent, i.e., $A = \{a_1, a_2, a_3, a_4, a_5, a_6, a_7, a_8\}$.

Reward R. It is the immediate reward that the agent receives from the environment at each time $t$. The reward of the model is defined as $r_i^t = -\sum_{l} q_{i,l}^t$, where $q_{i,l}^t$ denotes the queue length on the incoming lane $l$ at time $t$.

Transfer Probability P. It portrays the dynamic characteristics of the environment. $p(s^{t+1}\,|\,s^t, a^t)$ denotes the probability of moving from state $s^t$ to the next state $s^{t+1}$ when $a^t$ is taken.

Policy π. It represents the mapping from states to actions.

Discount Factor γ. It is used to moderate the near- and far-term impact and takes a value in the (0, 1] range.
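A toy sketch of how one agent's observation and reward could be assembled from lane-level measurements; the data structures are illustrative and do not use the CityFlow API:

def build_observation(lane_vehicle_counts, current_phase):
    """o_i^t = [n, p]: vehicle count per incoming lane plus the current phase value."""
    return list(lane_vehicle_counts) + [current_phase]

def reward(lane_queue_lengths):
    """r_i^t = -sum_l q_{i,l}^t: negative total queue length on the incoming lanes."""
    return -sum(lane_queue_lengths)

# Example: 4 incoming lanes, current phase 2, queues of 3/0/5/1 vehicles -> reward -9
obs = build_observation([12, 4, 7, 9], current_phase=2)
r = reward([3, 0, 5, 1])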

4 Methodology

4.1 Framework of GTLight

Fig. 1. Framework of GTLight.

The framework of GTLight is shown in Fig. 1. It is divided into two main parts: a multi-intersection cooperation process and a phase-timing optimization algorithm. Specifically, the agent acquires information on the observed state of each intersection and performs node initialization. Then, the spatial structure information of the road network is learned using the graph transformer network.


Finally, the signal timing strategy is determined using the phase-timing optimization algorithm. We describe these modules in detail below.

4.2 Node Initialization Module

At time $t$, we obtain the original observation state information $o_i^t$ from intersection $i$. It is encoded by a multi-layer perceptron (MLP) layer with a rectified linear unit (ReLU) [25]. Then, the initial representation of the node is obtained, as shown in Eq. (1):

$$h_i = Dense(o_i^t) = \sigma(o_i^t W_e + b_e), \tag{1}$$

where $Dense$ is a fully connected layer, $o_i^t \in \mathbb{R}^d$ is the state observation of intersection $i$ at time $t$, $W_e \in \mathbb{R}^{d \times m}$ is a weight matrix, and $b_e \in \mathbb{R}^m$ is a bias vector. The hidden state $h_i \in \mathbb{R}^m$ denotes the current traffic state at the $i$-th intersection.

4.3 Graph Transformer Network Module

The module integrates the state features of adjacent intersections and extracts higher-order hidden states. The multi-head interactive attention mechanism calculation process is shown in Fig. 2.

Fig. 2. Computational process of multi-head interactive attention mechanism.

First, three potential subspaces are trained for each node based on the vectors generated from the current observation state: $Q$ is the query subspace, $K$ is the key subspace and $V$ is the value subspace, as shown in Eq. (2):

$$Q = h_i W_q, \quad K = h_j W_k, \quad V = h_j W_v, \tag{2}$$

where Wq ∈ Rm×n , Wk ∈ Rm×n , Wv ∈ Rm×n are the weight matrices of Q, K, V respectively. j ∈ Ni is a neighboring intersection of target intersection i. The function of the three trainable weight matrices is to extract features.


The attention weight of each pair consisting of the target intersection $i$ and a neighboring intersection $j$ is calculated, and the state information of the target intersection is updated as follows:

$$h_i = Attention(Q, K, V) = softmax(Q K^{T})\, V, \tag{3}$$

where $Attention(\cdot)$ denotes the attention weights between each pair of nodes; the aim is to differentiate the importance of the intersections. Multi-head interactive attention is used to capture information from different subspaces:

$$head_h = Attention(Q W_h^Q, K W_h^K, V W_h^V), \tag{4}$$

$$MultiHeadAttention = Concat(head_1, \ldots, head_H)\, W^O, \tag{5}$$

where $W_h^Q$, $W_h^K$, $W_h^V$ are the learnable weight matrices of the $h$-th attention head and $W^O$ is the dimensionality-reduction projection. Multiple dependency patterns can be learned with the multi-head interactive attention mechanism, capturing richer information about the intersections. The model uses two residual connections to ease the training of the multi-layer network, as shown in Eq. (6):

$$z_i = h_i + MultiHeadAttention(h_i), \tag{6}$$

where $z_i$ is the output of the residual layer, which enhances the fit of the model. The proposed model then uses a feed-forward network (FFN) and batch normalization to improve the representation of the nodes, as shown in Eq. (7):

$$h_i' = BN(FFN(BN(z_i))), \tag{7}$$

where $h_i'$ is the final updated node state feature.
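A NumPy sketch of the interactive attention update in Eqs. (2)-(6); the shapes, the weight lists and the omission of the FFN/batch-normalisation step of Eq. (7) are illustrative simplifications:

import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def interactive_attention(h_i, h_neighbors, Wq, Wk, Wv):
    """Eqs. (2)-(3): the target intersection queries its neighbours."""
    Q = h_i @ Wq               # (1, n)
    K = h_neighbors @ Wk       # (|N_i|, n)
    V = h_neighbors @ Wv       # (|N_i|, n)
    attn = softmax(Q @ K.T)    # importance of each neighbouring intersection
    return attn @ V            # updated target state

def multi_head(h_i, h_neighbors, heads, Wo):
    """Eqs. (4)-(6): concatenate H heads, project with W_O, add the residual connection."""
    out = np.concatenate([interactive_attention(h_i, h_neighbors, *w) for w in heads], axis=-1)
    return h_i + out @ Wo      # z_i = h_i + MultiHeadAttention(h_i)

# heads = [(Wq_h, Wk_h, Wv_h), ...]; Eq. (7) would further apply BN(FFN(BN(z_i))).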

4.4 Phase-Timing Optimization Module

Predicting Q-values from the road network state features $h_i'$ of the target intersections with a deep Q-network (DQN) can achieve good results. However, this method sometimes overestimates the value of actions and fails to converge to the optimal value function. Therefore, a new DNN is added to the DQN in the phase-timing optimization module. We input the next state into this network to obtain the action with the maximum Q-value; the purpose is to reduce the correlation of the Q-network during iteration. The original network evaluates the Q-value of the subsequent actions. The stability of the algorithm is improved by decoupling the Q-values used for action selection and evaluation, which addresses the Q-value overestimation. The Q-value is calculated as follows:

$$Q = \varphi(h_i'), \tag{8}$$

where $\varphi$ is a two-layer fully connected network with ReLU activation.


To prevent overestimation of the Q-value, a new objective is defined as:

$$y_t = r_{t+1} + \gamma\, Q\big(s_{t+1}, \arg\max_{a'} Q'(s_{t+1}, a'; \theta'_t);\ \theta_t\big), \tag{9}$$

where $\theta'$ and $\theta$ are the parameters used for action selection and evaluation, respectively. The loss function for optimizing the current policy is:

$$L(\theta) = \frac{1}{T} \sum_{t=1}^{T} \sum_{i=1}^{N} \big(y_t - Q(s_t, a_t; \theta_t)\big)^2, \tag{10}$$

where $s_t = \{o_1^t, o_2^t, \ldots, o_i^t\}$, $T$ represents the total simulation time per episode, and $N$ denotes the number of intersections. The training procedure of GTLight is presented in Algorithm 1.

Algorithm 1: GTLight Training Process
Input: Set of target intersections I_N and adjacency matrix A;
Output: Optimized parameter θ;
1: Initialize network parameters θ, green time T_g and simulation time T;
2: for epoch = 1 to max-epochs do
3:   Reset the environment;
4:   for t = 1, 2, ..., T/T_g do
5:     Encode the observed state as hidden representation h_i;
6:     Learn the spatial dependency h_i' by Eq. (7);
7:     Compute the Q-value of agent i at time t by Eq. (8);
8:     Select a random action a_t or a_t = argmax_a Q_i^t(s_t, a);
9:     Take action a_t, receive reward r_t and the next state o_i^{t+1};
10:    Store the transition (o_t, a_t, r_t, o_{t+1}) in replay buffer D;
11:  end for
12:  for each target intersection I_i in I_N do
13:    Sample a random mini-batch of S samples from D;
14:    Update the evaluation network by Eq. (10);
15:    Update the target parameters θ' = (1 − β)θ' + βθ;
16:  end for
17: end for
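A minimal NumPy sketch of the decoupled target in Eq. (9) and the squared loss in Eq. (10), assuming two callables q_select and q_eval that return Q-value vectors over the action set; the names are illustrative, not GTLight's actual interfaces:

import numpy as np

def ddqn_target(r_next, s_next, q_select, q_eval, gamma=0.8):
    """Eq. (9): the new network picks the action, the original network evaluates it."""
    a_star = int(np.argmax(q_select(s_next)))        # argmax_a' Q'(s_{t+1}, a'; theta')
    return r_next + gamma * q_eval(s_next)[a_star]   # Q(s_{t+1}, a*; theta)

def td_loss(batch, q_select, q_eval, gamma=0.8):
    """Eq. (10): mean squared error between targets and current Q-values."""
    errors = []
    for (s, a, r, s_next) in batch:
        y = ddqn_target(r, s_next, q_select, q_eval, gamma)
        errors.append((y - q_eval(s)[a]) ** 2)
    return float(np.mean(errors))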

5 Experiments

We validate the effectiveness of GTLight using different datasets and use the CityFlow traffic simulator (http://cityflow-project.github.io) to build the traffic environment. It can simulate the movement of each vehicle and supports large-scale traffic signal control.

5.1 Experimental Setup

Parameter Setting. The green, yellow, and red signal times are set to 10 s, 3 s, and 2 s, respectively. The simulation duration is 3600 s per round for 100 episodes, and each setting is trained 100 times. The training batch size is 20, the discount factor is 0.8, the sample size is 1000, and the experience replay buffer size is 10000. The optimization algorithm is RMSprop [26].

Synthetic Dataset. The dataset is synthesized by analyzing real-world traffic in Hangzhou and Jinan. It consists of 16 four-way intersections with 300 m road segments. We test with four configurations (config1: peak arrival rate of 0.388; config2: slow arrival rate of 0.388; config3: peak arrival rate of 0.416; config4: slow arrival rate of 0.416). The left-turn rate is set to 10%, the straight-through rate to 60%, and the right-turn rate to 30%.

Real-World Dataset. The dataset covers 16 intersections in Gudang Street, Hangzhou, China, captured by intersection cameras. We take the number of vehicles passing through the intersections as the traffic flow. The duration is one hour.

5.2 Comparison Algorithms

To validate the effectiveness of GTLight, we compare it with different baseline methods. Each model is learned without any pre-trained parameters to ensure fairness.

Traditional traffic signal control methods. FixedTime [27]: the model cycles traffic signals in a fixed phase sequence. SOTL [28]: the model manually adjusts the threshold on the number of waiting vehicles to change the signal phase. These baselines are chosen to verify that data analysis techniques improve the effectiveness of our model.

Traffic signal control based on DRL. PressLight [17]: the model uses the DQN algorithm with pressure as the reward. LitLight [13]: the model relates RL to classical transportation theory. These baselines are chosen to verify that the graph structure enhances the ability of the model to capture information.

Traffic signal control based on GRL. CoLight [29]: the model uses GAT to communicate between intersections. SwarmCoLight [30]: the model is a hierarchical and decentralized multi-agent GRL method. These baselines are chosen to verify the effect of considering higher-order road network information on model performance.

5.3 Evaluation Indicator

We use the average travel time as an evaluation indicator for the model. It is the average time in seconds spent by all vehicles entering and exiting each intersection in each round. Lower time cost represents better performance.

5.4 Experimental Results

Overall Analysis. We compare the proposed method and the baseline methods on the synthetic and real-world datasets. The performance of each model in terms of average travel time is shown in Table 1. We train the models on an Ubuntu 20.04 LTS server; our code is available on GitHub (https://github.com/ddlyn/GTLight). We obtain the following intuitive observations:

Table 1. Average travel time (s) of each method on the synthetic (config1–config4) and real-world (Hangzhou) datasets.

Methods                      config1   config2   config3   config4   Hangzhou
Traditional  FixedTime        923.67    875.35    826.03    902.53    806.28
             SOTL            1466.06   1511.38   1508.10   1460.93   1287.22
DRL          PressLight       745.37    718.18    704.28    775.04    584.79
             LitLight         404.86    433.62    489.01    539.24    480.66
GRL          CoLight          514.60    483.80    525.11    504.79    521.34
             SwarmCoLight     466.80    503.17    491.03    533.99    536.45
Ours         GTLight          335.34    369.54    364.37    392.70    431.35


have a training process. As can be seen from the figure, GTLight achieves faster convergence and better performance than the other methods. Specifically, PressLight uses only the decentralized DQN algorithm, which suffers from instability. Therefore, it is difficult to achieve optimal performance even after a long training period. Although LitLight shows good learning performance, it does not consider the relevant information of downstream intersections. Compared with CoLight and SwarmCoLight, GTLight combines the idea of a two-layer network to improve stability. It also considers information on the state of cross-regional intersections to obtain optimal strategies more quickly.

Fig. 3. Average travel time for different models with different datasets.

Ablation Experiment. The ablation study of GTLight on different datasets is shown in Fig. 4. Our GTLight consistently outperforms all variants of the model. Specifically, GTLight-GD uses the traditional attention mechanism and DQN. GTLight-G uses the traditional attention mechanism and the phase-timing optimization algorithm. GTLight-D uses the graph transformer network and DQN. The traditional attention mechanism focuses only on a small area of neighboring intersections and does not consider the correlation of traffic flow between intersections in different areas. In addition, the optimal action of the DQN algorithm is chosen based on the parameters of the target Q-network, which leads to overestimation. In our GTLight, the graph transformer network focuses on the traffic flow at intersections in different regions, and the optimal action selection is based on the parameters of the currently updated Q-network, so the overestimation is reduced to a certain extent.


Fig. 4. Different variants of the model.


Fig. 5. Effect of different numbers.

5.5 Study of GTLight

Effect of Attention Head Number. We test the effect of the number of attention heads on GTLight using the real-world dataset, as shown in Fig. 5. The model achieves the shortest travel time with five attention heads. Once the number of attention heads is sufficient, the model already pays adequate attention to the node information; more than five heads may introduce noise and degrade performance.

Effect of Graph Transformer Layer Number. We use the real-world dataset to evaluate the effect of the number of graph transformer layers on GTLight, as shown in Fig. 5. The traditional transformer model contains n layers of the network, but training becomes more challenging as the network depth increases; while normalization techniques can alleviate this problem, difficulties still exist. The experimental results show that our model performs well with one or three layers, with one layer performing best.

6 Conclusion

In this paper, a GTLight model is constructed for multi-intersection traffic signal control. It uses a graph transformer network to solve the higher-order spatial dependence problem. In addition, a phase-timing optimization algorithm is designed to solve the overestimation of the Q-value. We conduct performance, ablation, and parametric experiments on different datasets. Experiments have shown that GTLight performs well on the evaluation metrics. The proposed model has practical implications for improving urban road traffic conditions and enhancing the travel experience for residents. In our further research, we aim to improve the safety and disturbance resistance of the model. Acknowledgements. This work was supported by National Natural Science Foundation of China (No. 62176088), the Key Science and Technology Research Project of


Henan Province of China (Grant No. 22102210067, 222102210022), and the Program for Science and Technology Development of Henan Province (No. 212102210412 and 202102310198).

References

1. Fan, Z., Harper, C.D.: Congestion and environmental impacts of short car trip replacement with micromobility modes. Transp. Res. Part D: Transp. Environ. 103, 103173 (2022)
2. Robertson, D.I., Bretherton, R.D.: Optimizing networks of traffic signals in real time-the SCOOT method. IEEE Trans. Veh. Technol. 40(1), 11–15 (1991)
3. Luk, J.: Two traffic-responsive area traffic control methods: SCAT and SCOOT. Traff. Eng. Control 25(1) (1984)
4. Liao, X.C., Qiu, W.J., Wei, F.F., Chen, W.N.: Combining traffic assignment and traffic signal control for online traffic flow optimization. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) ICONIP 2022. CCIS, vol. 1793, pp. 150–163. Springer, Singapore (2023). https://doi.org/10.1007/978-981-99-1645-0_13
5. Noaeen, M., et al.: Reinforcement learning in urban network traffic signal control: a systematic literature review. Exp. Syst. Appl. 199, 116830 (2022)
6. Wei, H., Zheng, G., Yao, H., Li, Z.: Intellilight: a reinforcement learning approach for intelligent traffic light control. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2496–2505 (2018)
7. Kuang, L., Zheng, J., Li, K., Gao, H.: Intelligent traffic signal control based on reinforcement learning with state reduction for smart cities. ACM Trans. Internet Technol. 21(4), 1–24 (2021)
8. Li, L., Lv, Y., Wang, F.Y.: Traffic signal timing via deep reinforcement learning. IEEE/CAA J. Automat. Sinica 3(3), 247–254 (2016)
9. Wu, T., et al.: Multi-agent deep reinforcement learning for urban traffic light control in vehicular networks. IEEE Trans. Veh. Technol. 69(8), 8243–8256 (2020)
10. Ying, Z., Cao, S., Liu, X., Ma, Z., Ma, J., Deng, R.H.: PrivacySignal: privacy-preserving traffic signal control for intelligent transportation system. IEEE Trans. Intell. Transp. Syst. 23(9), 16290–16303 (2022)
11. Tan, T., Bao, F., Deng, Y., Jin, A., Dai, Q., Wang, J.: Cooperative deep reinforcement learning for large-scale traffic grid signal control. IEEE Trans. Cybernet. 50(6), 2687–2700 (2020)
12. Vaswani, A., et al.: Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 6000–6010 (2017)
13. Zheng, G., et al.: Diagnosing reinforcement learning for traffic signal control. arXiv preprint arXiv:1905.04716 (2019)
14. Chu, T., Wang, J., Codecà, L., Li, Z.: Multi-agent deep reinforcement learning for large-scale traffic signal control. IEEE Trans. Intell. Transp. Syst. 21(3), 1086–1095 (2020)
15. Arel, I., Liu, C., Urbanik, T., Kohls, A.G.: Reinforcement learning-based multiagent system for network traffic signal control. IET Intel. Transp. Syst. 4(2), 128–135 (2010)


16. El-Tantawy, S., Abdulhai, B., Abdelgawad, H.: Multiagent reinforcement learning for integrated network of adaptive traffic signal controllers (MARLIN-ATSC): methodology and large-scale application on downtown Toronto. IEEE Trans. Intell. Transp. Syst. 14(3), 1140–1150 (2013)
17. Wei, H., et al.: PressLight: learning max pressure control to coordinate traffic signals in arterial network. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1290–1298 (2019)
18. Zhang, M., Wu, S., Yu, X., Liu, Q., Wang, L.: Dynamic graph neural networks for sequential recommendation. IEEE Trans. Knowl. Data Eng. 35(5), 4741–4753 (2023)
19. Zhong, T., Xu, Z., Zhou, F.: Probabilistic graph neural networks for traffic signal control. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4085–4089 (2021)
20. Wang, Y., Xu, T., Niu, X., Tan, C., Chen, E., Xiong, H.: STMARL: a spatio-temporal multi-agent reinforcement learning approach for cooperative traffic light control. IEEE Trans. Mob. Comput. 21(6), 2228–2242 (2022)
21. Zeng, Z.: GraphLight: graph-based reinforcement learning for traffic signal control. In: Proceedings of the IEEE International Conference on Computer and Communication Systems, pp. 645–650 (2021)
22. He, L., Li, Q., Wu, L., Wang, M., Li, J., Wu, D.: A spatial-temporal graph attention network for multi-intersection traffic light control. In: Proceedings of the IEEE International Joint Conference on Neural Networks, pp. 1–8 (2021)
23. Wu, L., Wang, M., Wu, D., Wu, J.: DynSTGAT: dynamic spatial-temporal graph attention network for traffic signal control. In: Proceedings of the 30th ACM International Conference on Information and Knowledge Management, pp. 2150–2159 (2021)
24. Wang, M., Wu, L., Li, M., Wu, D., Shi, X., Ma, C.: Meta-learning based spatial-temporal graph attention network for traffic signal control. Knowl. Based Syst. 250, 109166 (2022)
25. Xu, B., Wang, N., Chen, T., Li, M.: Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853 (2015)
26. Zou, F., Shen, L., Jie, Z., Zhang, W., Liu, W.: A sufficient condition for convergences of ADAM and RMSPROP. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11119–11127 (2019)
27. Koonce, P., Rodegerdts, L.: Traffic signal timing manual. Tech. rep., United States Federal Highway Administration (2008)
28. Cools, S.B., Gershenson, C., D'Hooghe, B.: Self-organizing traffic lights: a realistic simulation. In: Advances in Applied Self-Organizing Systems, pp. 45–55 (2013)
29. Wei, H., et al.: CoLight: learning network-level cooperation for traffic signal control. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 1913–1922 (2019)
30. Yi, Y., Li, G., Wang, Y., Lu, Z.: Learning to share in multi-agent reinforcement learning. In: Proceedings of the 36th Annual Conference on Neural Information Processing Systems, pp. 1–13 (2022)

PatchFinger: A Model Fingerprinting Scheme Based on Adversarial Patch

Bo Zeng, Kunhao Lai, Jianpeng Ke, Fangchao Yu, and Lina Wang(B)

Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering, Wuhan University, Wuhan, China
{bobozen,kh.lai,kejianpeng,fangchao,lnwang}@whu.edu.cn

Abstract. As deep neural networks (DNNs) gain great popularity and importance, protecting their intellectual property is a constant topic. Previous model watermarking schemes based on backdoors require explicit embedding of the backdoor, which changes the model's structure and parameters. Model fingerprinting based on adversarial examples does not require any modification of the model, but it is limited by the characteristics of the original task and is not versatile enough. We find that an adversarial patch can be regarded as an inherent backdoor that makes the model output a specific category once injected. Inspired by this, we propose PatchFinger, a model fingerprinting scheme based on an adversarial patch, which is applied to original samples through a specific fusion method to serve as a model fingerprint. As a model fingerprinting scheme, PatchFinger does not sacrifice the accuracy of the source model, and the characteristics of the adversarial patch make it flexible and highly robust. Experimental results show that PatchFinger achieves an ARUC value of 0.936 in a series of tests on the Tiny-ImageNet dataset, which exceeds the baseline by 19%. In terms of average query accuracy, PatchFinger achieves 97.04%, outperforming the methods tested.

Keywords: Deep Neural Network · Intellectual Property Protection · Model Fingerprinting · Adversarial Patch

1 Introduction

In the past decades, DNNs have shown superior performance in computer vision [8], intelligent healthcare [6], autonomous driving [4] and other fields. However, obtaining a high-quality DNN requires allocating expensive resources and considerable time to the training process. Unfortunately, there has been an increasing number of cases where trained models are illegally copied and misappropriated [22]. These illegal activities pose severe threats to the intellectual property of the creators. Therefore, protecting the intellectual property rights of DNNs should be taken into urgent consideration.

In order to protect the intellectual property of DNNs, there are currently two mainstream methods: model watermarking and model fingerprinting.


Model watermarking requires embedding a watermark into the model during training. The current methods that achieve good results are based on backdoors, but they inevitably sacrifice the accuracy of the model and cannot protect a model that has already been trained. Unlike model watermarking, model fingerprinting binds the model to its owner based on the model's internal attributes. After attackers steal a model, they may try to destroy the fingerprint to hinder ownership verification, and some model fingerprinting methods [3,14] that use adversarial samples as fingerprints perform poorly under such conditions. The primary challenge with watermarking technology is its modification of the model during deployment, whereas task-specific fingerprinting technologies are not robust enough to handle different scenarios adequately.

To solve the above issues, we design a model fingerprinting scheme named PatchFinger based on the adversarial patch [2]. We discover that an adversarial patch has a backdoor-like property: once inserted, it makes the model output a designated label. Moreover, due to its innate characteristics, an adversarial patch is robust to different attacks. Inspired by this, we utilize the adversarial patch in a model fingerprinting scheme, achieving the same effect as a backdoor-based watermarking scheme without compromising the accuracy of the model. The patch is versatile and adaptable: it can be added to any original image and maintains its adversarial properties under different backgrounds, lighting, and rotations, working well under various transformations.

Specifically, PatchFinger consists of two phases: patch injection and fingerprint generation. In the patch injection phase, a specific fusion method is used to inject the patch into the original image, generating a patched image for the subsequent phase. In the fingerprint generation phase, all models are divided into positive and negative models: the patched image is recognized as a specific category by all positive models but as the original category by all negative models. By optimizing the losses of the positive and negative models, as well as the restriction of the adversarial patch, query images that can be used for fingerprinting are selected. To summarize, we mainly make the following contributions:

– We propose PatchFinger, which, to our best knowledge, first introduces the adversarial patch to model fingerprinting. It combines the advantages of adversarial patches and fingerprints, resulting in high robustness without altering the model.
– In the fingerprint generation algorithm, we involve a mask to control the embedding position when injecting the adversarial patch, rather than directly rotating or scaling, making it more flexible and better adapted to images.
– The experimental results show that PatchFinger has strong robustness against attacks such as model modification, model extraction, and input modification. Compared with previous methods, PatchFinger achieves an ARUC value of 0.936 and an average query accuracy of 97.04%, outperforming the baseline.


2 Related Work

2.1 Model Watermarking

DNN model watermarking can be classified into two types: parameter-based methods [13,21] and query-based methods [1,12,24]. Parameter-based DNN watermarking alters the weight distribution, compromising the accuracy of the model. Compared to parameter-based schemes, query-based ones are known as dynamic watermarking techniques. Le Merrer et al. [12] first utilized dynamic watermarking, using adversarial samples of the DNN model to embed the watermark. However, Adi et al. [1] claim that such methods heavily depend on the transferability of adversarial samples across different architectures. Zhang et al. [24] embed the watermark by adding information to the backdoor sample patterns in the watermarking dataset.

2.2 Model Fingerprinting

Unlike model watermarking techniques, model fingerprinting leaves the source model unchanged. Jia et al. propose PoL [10], which extracts model fingerprints from the static properties of the model. However, Zhang et al. [16] point out that PoL cannot resist the fingerprint forgery attack, and they use adversarial samples to overcome it. Besides the static properties of the model, some methods [3,14] construct fingerprints dynamically by building query sets as model fingerprints. Cao et al. [3] claim that each DNN model has a unique decision boundary, which can be exploited by model fingerprinting to obtain particular labels. Lukas et al. [14] indicate that IPGuard is not robust to model extraction attacks and introduce conferrable adversarial examples to strengthen their scheme. To improve fingerprint recognition, MetaV [18] uses model preparation techniques to generate test cases and compare fingerprints, and MetaFinger [23] proposes DNN enhancement, increasing the diversity of trained models by randomly injecting Gaussian noise into their parameters. In our approach, in addition to using advanced fingerprinting methods, we are the first to use adversarial patches as fingerprints, which are highly robust against various attacks.

3 Problem Definition

3.1 Threat Model

The threat model considered in this paper includes a model owner who trains and deploys the source model, and an attacker who steals the source model through illegal copying. The objective of the model owner is to verify whether a suspicious model is a copy of the source model and to correctly identify stolen models, while recognizing other unrelated models from trusted third parties as negative models.


The owner has white-box access to the source model (such as the model parameters and training data) and black-box access to the suspicious model. The attacker's goal is to evade ownership verification, i.e., to keep the suspicious model close to the source model in performance while avoiding detection as a stolen model. The attacker is assumed to have access to some data drawn from the same distribution as the training data and can modify the model. The attacker can also use techniques that modify or reject abnormal query examples.

3.2 Fingerprinting DNN Models

DNN model fingerprinting can be divided into two stages: fingerprint extraction and fingerprint verification.

Fingerprint Extraction. Given a DNN model m, the owner obtains a fingerprint F_m through a function F_m = Extract(m).

Fingerprint Verification. For a suspicious DNN model m_s, the verification function Verify(F_m, m_s) outputs 0 or 1, where 1 indicates that the model is illegally obtained.
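To make the two stages concrete, a hypothetical sketch of this interface is given below; the function names extract/verify and the matching threshold are illustrative assumptions, not the paper's implementation.

```python
def extract(source_model, queries):
    # Fingerprint extraction: pair each query input with the source model's predicted label.
    return [(x, source_model(x)) for x in queries]

def verify(fingerprint, suspect_model, threshold=0.9):
    # Fingerprint verification: output 1 (stolen) if the suspect reproduces enough fingerprint labels.
    matches = sum(int(suspect_model(x) == y) for x, y in fingerprint)
    return int(matches >= threshold * len(fingerprint))
```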

3.3 Adversarial Techniques

Based on the above threat model, DNN models may be subjected to different types of attacks. Common attacks include model modifications, model extractions, and input modifications. Below, we provide a brief explanation of the attack types; the specific settings are detailed in Sect. 5.1.

Model Modification. Model modifications convert the source model into a stolen model whose functionality is as close to the original as possible, including fine-tuning, weight pruning, weight noising, adversarial training, and so on.

Model Extraction. Model extractions recover the internal information or functionality of the source model without directly accessing it, including distillation, knockoff, and so on.

Input Modification. Input modifications alter the inputs to resist adversarial samples, and they can also be used to block the verification of model ownership.

4 Method

As shown in Fig. 1, the overall process consists of two parts: patch injection and fingerprint generation. As shown on the left side of the figure, in the patch injection part, the adversarial patch has the same size as the original image; unlike the traditional method of flipping, scaling, and locating, we select a mask to determine the embedding rule. As shown on the right side, during fingerprint generation, the samples embedded with the adversarial patch are designated as a specific category, and the final model fingerprint images are generated after optimization using positive and negative models. The specific details are as follows:


Fig. 1. PatchFinger framework based on adversarial patch.

4.1 Patch Injection

Consider the previous patch injection method in [2]:

x' = A(x, Δ, l, t)    (1)

where A(·) is the function that injects the patch into the original image, x is the original image, Δ is the patch, l is the patch embedding position, and t is the operation of flipping or scaling the patch. Only simple transformations are performed on the patch, so a patch embedded in this way is limited by its own shape and needs to be generated in advance. To overcome these limitations, we modify the above definition; the modified formulation is shown in Eq. 2.

x' = A(x, Δ, m)    (2)

x'_{i,j,e} = (1 − m_{i,j}) · x_{i,j,e} + m_{i,j} · Δ_{i,j,e}    (3)

where x and Δ are the same as before, and m is a two-dimensional mask matrix that determines the blending proportion of the patch. In Eq. 3, i, j, and e index the columns, rows, and channels of the image, respectively. The mask values lie in the range [0, 1]: when m_{i,j} = 1, the pixel at that position is completely covered by the adversarial patch, and when m_{i,j} = 0, it remains unchanged. Using a continuous mask not only makes the mask specific to the dataset but also integrates the adversarial patch more smoothly into the original image. By injecting the patch, the image x' with the embedded adversarial patch is obtained.
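The mask-based fusion of Eqs. (2)-(3) can be sketched in a few lines of PyTorch; the tensor shapes and the clamping of the mask to [0, 1] are assumptions for illustration, not the authors' code.

```python
import torch

def inject_patch(x: torch.Tensor, delta: torch.Tensor, m: torch.Tensor) -> torch.Tensor:
    """Blend an adversarial patch `delta` into an image batch `x` of shape (N, C, H, W).

    `m` is a 2-D mask of shape (H, W) with values in [0, 1]; 1 means the pixel is fully
    replaced by the patch, 0 leaves the original pixel unchanged (Eq. 3)."""
    mask = m.clamp(0.0, 1.0).view(1, 1, *m.shape)     # broadcast over batch and channels
    return (1.0 - mask) * x + mask * delta

# Example with a 32x32 RGB batch, a random patch and a random mask.
x = torch.rand(4, 3, 32, 32)
delta = torch.rand(3, 32, 32, requires_grad=True)
m = torch.rand(32, 32, requires_grad=True)
x_patched = inject_patch(x, delta, m)
```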

4.2 Fingerprint Generation

After obtaining the patched samples, we optimize them to reduce the final loss and obtain fingerprint samples. Algorithm 1 describes the process of generating the fingerprint samples. First, the generated samples are transformed

Algorithm 1. Fingerprint generation
Input: Adversarial patch Δ, mask m, input image X, original label Y_true, patch label Y_target, learning rate η, iteration number N_e, loss control factor λ, positive example models M_p, negative example models M_n
Output: Query set Q
1: Δ = random.uniform()
2: m = random.uniform()
3: for e = 1 to N_e do
4:   X' = A(X, Δ, m)
5:   X' = transform(X')
6:   L_pos = (1/|M_p|) Σ_{i=1}^{|M_p|} CW(M_p^i(X'), Y_target)
7:   L_neg = (1/|M_n|) Σ_{i=1}^{|M_n|} CW(M_n^i(X'), Y_true)
8:   Loss = L_pos + L_neg + λ|m|
9:   Δ = Δ − η∇Loss
10:  m = m − η∇Loss
11: end for
12: Q = screen(X, Y_true, Y_target, M_p, M_n)
13: return Q

before being fed into the models to reduce overfitting (line 5). Then the three losses in Eq. 4 are calculated (lines 6–8):

Loss = L_{pos} + L_{neg} + λ|m|    (4)

The first part is the loss over all positive models. For a patched sample with the specified label Y_target, the goal of every positive model is to predict the label of the image X' as Y_target, as shown in Eq. 5:

L_{pos} = \frac{1}{|M_p|} \sum_{i=1}^{|M_p|} CW(M_p^i(X'), Y_{target})    (5)

The second part is the loss over all negative models, which should recognize the patched sample as a normal image. Their goal is therefore to predict the label of the image X' as Y_true, the original correct label of the sample, as shown in Eq. 6:

L_{neg} = \frac{1}{|M_n|} \sum_{i=1}^{|M_n|} CW(M_n^i(X'), Y_{true})    (6)

where CW(·) denotes the Carlini-Wagner loss [5]. The third part is the constraint on the adversarial patch: the patch should be as small as possible, that is, it should modify only a limited part of the image. We use the L1 norm of the mask m, weighted by λ, to constrain the size of the patch; the smaller the λ, the weaker the constraint on the mask m and the larger the patch, resulting in a higher success rate for the specified classification. We use the Adam optimizer to solve the above problem. Finally, the samples are filtered (line 12), since not all samples are correctly classified by the positive and negative models. We only select the images for which all positive models output Y_target and all negative models output Y_true, and add them to the query set.
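A hedged sketch of one optimization step of Algorithm 1 (lines 4-10) is shown below, using a standard CW-style margin loss; the model lists, tensor shapes and loss scaling are illustrative assumptions rather than the paper's exact implementation.

```python
import torch

def cw_loss(logits: torch.Tensor, target: torch.Tensor, kappa: float = 0.0) -> torch.Tensor:
    # Carlini-Wagner margin loss: push the target logit above the best competing logit.
    one_hot = torch.nn.functional.one_hot(target, logits.size(-1)).bool()
    target_logit = logits[one_hot]
    other_best = logits.masked_fill(one_hot, float("-inf")).max(dim=-1).values
    return torch.clamp(other_best - target_logit + kappa, min=0.0).mean()

def total_loss(x, delta, m, y_true, y_target, pos_models, neg_models, lam=150.0):
    mask = m.clamp(0.0, 1.0).view(1, 1, *m.shape)
    x_p = (1.0 - mask) * x + mask * delta                                   # Eqs. (2)-(3)
    l_pos = sum(cw_loss(f(x_p), y_target) for f in pos_models) / len(pos_models)  # Eq. (5)
    l_neg = sum(cw_loss(f(x_p), y_true) for f in neg_models) / len(neg_models)    # Eq. (6)
    return l_pos + l_neg + lam * m.abs().sum()                              # Eq. (4), L1 norm of the mask

# One optimizer step over the patch and mask (the paper reports using Adam):
# opt = torch.optim.Adam([delta, m], lr=1e-3)
# opt.zero_grad(); total_loss(x, delta, m, y_true, y_target, pos_models, neg_models).backward(); opt.step()
```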

5 Experiments

5.1 Experimental Setup

Dataset. In this paper, 50 categories are randomly selected from Tiny-ImageNet [11], and the data are split into two parts for the positive and negative models. The details of the attacks on the models are shown in Table 1. The positive models use the ResNet18 [8] structure as the source model, and 6 types of attacks are employed to generate 48 different models (including the source model). The negative models also use the ResNet18 structure as the source model, but are trained on different data and undergo the same attacks.

Experimental Environment. All experiments are conducted on a Windows server with an Intel(R) Core(TM) i7-6700K CPU and a 12 GB NVIDIA TITAN X GPU. Unless otherwise specified, we set the number of fingerprint samples for both PatchFinger and the baselines to 100 for a fair comparison. The learning rate η in Algorithm 1 is set to 0.001 and the number of iterations to 1000. The λ that constrains the patch size is set to 150.


Table 1. Construction of positive and negative models.

Types | Attacks | Num | Production process
Positive Models | Training | 1 | Train a ResNet18 model [8] as the source model
 | Fine-tuning | 4 | Fine-tuning with each mode in [1]
 | Weight Pruning | 9 | Prune the model weights with p = 0.1, 0.2, ..., 0.9 [7], and then fine-tune
 | Weight Noising | 9 | Perturb the model weights with Gaussian noise [23], and then fine-tune
 | Adversarial Training | 9 | Generate adversarial examples for r = 0.1, 0.2, ..., 0.9 of the fine-tuning data [14], and fine-tune with the augmented datasets
 | Distillation | 8 | Distill [9] a model for each architecture
 | Knockoff | 8 | Knockoff [17] each architecture to steal models
Negative Models | The same as the above | 48 | Execute the same attacks as for the positive models to generate mirrored negative models

5.2 Performance Metrics

This section summarizes the evaluation metrics commonly used to assess DNN intellectual property protection methods.

• Fidelity. The fingerprint must not affect the accuracy of the source model.
• Effectiveness. The verification method can correctly identify an illegal model.
• Robustness. The fingerprint should withstand various attacks.
• Uniqueness. The verification method should not produce false positives for models that are not illegal.

PatchFinger is compared with three other advanced schemes, namely IPGuard [3], MetaV [18], and MetaFinger [23], on the Tiny-ImageNet dataset. All compared methods maintain accuracy without loss and achieve the desired level of effectiveness; we therefore focus on robustness and uniqueness.

5.3 Overall Results of Baseline Testing for PatchFinger

We compare PatchFinger with the three other model fingerprinting approaches on multiple models using the ARUC metric proposed in [3]. Figure 2 shows the ARUC values of all methods, where ρ denotes the matching-rate threshold. A higher ARUC means that the method possesses both robustness and uniqueness. PatchFinger achieves the highest ARUC value (0.936), surpassing the second-ranked MetaFinger by 19% (0.153).

5.4 Specific Results of Adversarial Techniques

This section reports the query accuracy of each method under the different attacks listed in Table 1.


Fig. 2. ARUC values of all methods on the settings in Table 1

Table 2 shows the query accuracy of the four methods under the four fine-tuning modes of [3] on Tiny-ImageNet. All methods achieve almost 100% query accuracy when the model is only fine-tuned (FTLL, FTAL), but the performance under RTLL and RTAL is worse. Unlike the other schemes, PatchFinger maintains its query accuracy under all fine-tuning modes.

Table 2. Query accuracy of all methods under different fine-tuning modes.

Methods | FTLL | FTAL | RTLL | RTAL
IPGuard | 100.00% | 93.00% | 66.00% | 61.00%
MetaV | 100.00% | 100.00% | 0 | 0
MetaFinger | 99.00% | 99.00% | 89.00% | 84.00%
PatchFinger | 100.00% | 100.00% | 100.00% | 100.00%

Figure 3 shows the results for the other 5 kinds of attacks. From Fig. 3(a) and (b), we can see that our scheme is extremely robust against weight pruning and weight noising, with almost no loss of query accuracy. Figure 3(c) and (d) show the query accuracy of the positive and negative models after adding adversarial samples to the training data at different ratios. Overall, PatchFinger and MetaFinger are more stable at a good performance level, with no sharp changes. When the ratio is less than 0.5, PatchFinger outperforms MetaFinger on both positive and negative models, while their performances are similar otherwise.

Fig. 3. Query accuracy under different attacks

Figure 3(e), (f), (g) and (h) show the experimental results for the model extraction attacks, where MobileNetV2 [19], ResNet18 [8], ResNet34, ResNet50, ShuffleNetV2 [15], VGG13_BN [20], VGG16_BN and VGG19_BN are used as model structures, denoted (MS1, ..., MS8) in turn. We only select ResNet18, ResNet34, VGG13_BN, and VGG16_BN as training models in PatchFinger, but its performance on the other structures is still good. Except for IPGuard, the other three methods are task-independent. Model extraction attacks change the model substantially, so the query accuracy of IPGuard is very low. The other three methods all perform well on the positive models, but both MetaFinger and MetaV perform worse than PatchFinger on the negative models.

Input modifications are simple and effective ways to defend against adversarial samples and can also be used to hinder the verification of model ownership by pre-processing the inputs. We test all approaches on Tiny-ImageNet under a range of modification attacks, including Gaussian blur, Gaussian noise, universal noise, horizontal flipping, random translation, random cropping, bit depth reduction, JPEG compression and R&P, denoted (IM1, ..., IM9) in turn. Figure 3(i) shows that almost all methods experience some performance drop, but PatchFinger still performs best in all cases, with a lowest query accuracy of 90%.

5.5 Average Accuracy

In summary, Table 3 reports the average query accuracy of our proposed method PatchFinger and the three comparison methods over all the aforementioned attacks. PatchFinger shows the best performance on the positive models, as evidenced by its 97.04% query accuracy, demonstrating strong robustness. IPGuard has the lowest query accuracy on the negative models, but its accuracy on the positive models is not comparable to ours. The two other schemes perform similarly to ours on the positive models, but our scheme has far lower query accuracy on the negative models, demonstrating good uniqueness.

Table 3. The average query accuracy.

Average Accuracy | IPGuard | MetaV | MetaFinger | PatchFinger
Positive models | 43.75% | 91.39% | 96.02% | 97.04%
Negative models | 0.56% | 21.92% | 21.08% | 7.96%

6 Conclusion

We propose PatchFinger, a model fingerprinting scheme based on the adversarial patch. PatchFinger achieves the same effect as backdoor-based model watermarking schemes without compromising model accuracy, and the inherent characteristics of the adversarial patch make the scheme highly robust. PatchFinger comprises two stages: patch injection and fingerprint generation. The patch injection stage inserts an adversarial patch into the original image, and the fingerprint generation stage optimizes the loss to select a query set that meets the final conditions. Experimental results demonstrate that PatchFinger exhibits strong robustness when exposed to various attacks, and its query accuracy significantly outperforms similar fingerprinting schemes. Moreover, the scheme has very low query accuracy on unrelated models, exhibiting good uniqueness as well.

Acknowledgements. This work was supported in part by the National Natural Science Foundation of China under No. 62372334, and in part by the National Key Research and Development Program of China under No. 2020YFB1805400.

References 1. Adi, Y., Baum, C., Cisse, M., Pinkas, B., Keshet, J.: Turning your weakness into a strength: watermarking deep neural networks by backdooring. In: USENIX Security Symposium, pp. 1615–1631 (2018) 2. Brown, T.B., Mané, D., Roy, A., Abadi, M., Gilmer, J.: Adversarial patch. arXiv preprint arXiv:1712.09665 (2017) 3. Cao, X., Jia, J., Gong, N.Z.: IPGuard: protecting intellectual property of deep neural networks via fingerprinting the classification boundary. In: Proceedings of the 2021 ACM Asia Conference on Computer and Communications Security, pp. 14–25 (2021) 4. Cao, Y., et al.: Adversarial sensor attack on LiDAR-based perception in autonomous driving. In: Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security, pp. 2267–2281 (2019) 5. Carlini, N., Wagner, D.: Towards evaluating the robustness of neural networks. In: 2017 IEEE Symposium on Security and Privacy (SP), pp. 39–57. IEEE (2017) 6. Esteva, A., et al.: Dermatologist-level classification of skin cancer with deep neural networks. Nature 542(7639), 115–118 (2017) 7. Han, S., Pool, J., Tran, J., Dally, W.: Learning both weights and connections for efficient neural network. In: Advances in Neural Information Processing Systems, vol. 28 (2015) 8. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 9. Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015) 10. Jia, H., et al.: Proof-of-learning: Definitions and practice. In: 2021 IEEE Symposium on Security and Privacy (SP), pp. 1039–1056. IEEE (2021) 11. Le, Y., Yang, X.: Tiny ImageNet visual recognition challenge. CS231n 7(7), 3 (2015) 12. Le Merrer, E., Perez, P., Trédan, G.: Adversarial frontier stitching for remote neural network watermarking. Neural Comput. Appl. 32, 9233–9244 (2020) 13. Liu, H., Weng, Z., Zhu, Y.: Watermarking deep neural networks with greedy residuals. In: ICML, pp. 6978–6988 (2021) 14. Lukas, N., Zhang, Y., Kerschbaum, F.: Deep neural network fingerprinting by conferrable adversarial examples. arXiv preprint arXiv:1912.00888 (2019) 15. Ma, N., Zhang, X., Zheng, H.-T., Sun, J.: ShuffleNet V2: practical guidelines for efficient CNN architecture design. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 122–138. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_8


16. Maini, P., Yaghini, M., Papernot, N.: Dataset inference: ownership resolution in machine learning. arXiv preprint arXiv:2104.10706 (2021) 17. Orekondy, T., Schiele, B., Fritz, M.: Knockoff Nets: stealing functionality of blackbox models. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4954–4963 (2019) 18. Pan, X., Yan, Y., Zhang, M., Yang, M.: MetaV: a meta-verifier approach to taskagnostic model fingerprinting. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 1327–1336 (2022) 19. Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L.C.: MobileNetV2: inverted residuals and linear bottlenecks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4510–4520 (2018) 20. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) 21. Wang, T., Kerschbaum, F.: RIGA: covert and robust white-box watermarking of deep neural networks. In: Proceedings of the Web Conference 2021, pp. 993–1004 (2021) 22. Yan, M., Fletcher, C., Torrellas, J.: Cache telepathy: leveraging shared resource attacks to learn DNN architectures. In: USENIX Security Symposium (2020) 23. Yang, K., Wang, R., Wang, L.: MetaFinger: fingerprinting the deep neural networks with meta-training. In: 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 (2022) 24. Zhang, J., et al.: Protecting intellectual property of deep neural networks with watermarking. In: Proceedings of the 2018 on Asia Conference on Computer and Communications Security, pp. 159–172 (2018)

Attribution of Adversarial Attacks via Multi-task Learning

Zhongyi Guo1, Keji Han1, Yao Ge1, Yun Li1,2(B), and Wei Ji3

1 School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
{1221045710,1016041119,2020070131}@njupt.edu.cn
2 Jiangsu Key Laboratory of Big Data Security and Intelligent Processing, Nanjing 210023, China
[email protected]
3 School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
[email protected]

Abstract. Deep neural networks (DNNs) can be easily fooled by adversarial examples when attackers add imperceptible perturbations to original examples during the inference phase. Many works focus on adversarial detection and adversarial training to defend against adversarial attacks, but few explore the tool-chains behind adversarial examples, a problem called the Adversarial Attribution Problem (AAP). In this paper, AAP is defined as the recognition of three signatures, i.e., attack algorithm, victim model and hyperparameter. Existing works transfer AAP into a single-label classification task and ignore the relationship among the three signatures. In fact, there exists an owner-member relationship between attack algorithm and hyperparameter, which means that hyperparameter recognition relies on the result of attack algorithm classification. Besides, hyperparameter values are continuous, so hyperparameter recognition should be regarded as a regression task. As a result, AAP should be considered a multi-task learning problem rather than a single-label classification or single-task learning problem. To deal with these problems, we propose a multi-task learning framework named Multi-Task Adversarial Attribution (MTAA) to recognize the three signatures simultaneously. It takes the relationship between the attack algorithm and the corresponding hyperparameter into account and uses an uncertainty weighted loss to adjust the weights of the three recognition tasks. The experimental results on MNIST and ImageNet show the feasibility and scalability of the proposed framework.

Keywords: Deep neural network · Adversarial attack attribution · Multi-task learning

1 Introduction

Fig. 1. Overview of Adversarial Attribution.

In the past two decades, deep neural networks (DNNs) have shown outstanding performance across various tasks in computer vision, such as image classification [1] and object detection [2]. Nevertheless, DNNs have been shown to be easily fooled by adversarial examples [3], which are crafted during the inference phase by adding imperceptible perturbations to examples. Classic adversarial attack algorithms include the Fast Gradient Sign Method (FGSM) [4], Projected Gradient Descent (PGD) [5] and Carlini & Wagner (C&W) [6]. Extensive efforts focus on adversarial detection [7] and adversarial training [8], while few works study the Adversarial Attribution Problem (AAP), an important part of Reverse Engineering of Deceptions (RED) [9–11]. According to the Defense Advanced Research Projects Agency (DARPA), RED aims at "developing techniques that automatically reverse engineer the tool-chains behind attacks such as multimedia falsification, adversarial machine learning attacks, or other information deception attacks." With the same purpose as RED, as shown in Fig. 1, AAP concentrates on better understanding the hidden signatures used to generate adversarial examples, i.e., the attack algorithm, the victim model and the hyperparameter, which then aid attribution to a particular threat group, a critical step in formulating a response and holding the attackers accountable. Adversarial attribution can help defenders seize clues about the originator of the attack and their goals, and provides insight into the most effective defense against the corresponding attacks [9].

The following works make a preliminary exploration of AAP. Ref. [13] investigates the attribution of attack types with fewer training samples via self-supervised learning. Ref. [14] explores the attributability of attack algorithm, victim model, hyperparameter and norm using a self-built 11-layer neural network; however, it considers these four signatures separately, ignores the relationship among them, and only conducts experiments on small-scale datasets such as MNIST and CIFAR-10, so the data lack diversity. Ref. [15] explores the attribution of attack algorithm and victim model using a ResNet50 feature extractor plus a Multilayer Perceptron (MLP) classifier. Ref. [16] explores the attribution of attack algorithm and hyperparameter using ResNet50 as the backbone. Unfortunately, Refs. [13,15,16] only study the recognition of one or two signatures, and Ref. [14] considers hyperparameter values at large intervals, e.g., 0.03, 0.1, 0.2


for the maximum perturbation ε in FGSM L∞ and 0.01, 0.1, 1.0 for the confidence κ in C&W L2. The above works therefore lack completeness in signature recognition. What is more, all of these works transfer AAP into a single-label classification problem, i.e., they combine Attack Algorithm+Victim Model+Hyperparameter into one label, such as PGD+ResNet18+10/255. When the number of attack algorithms and victim models increases, a combination explosion problem appears: the rapid growth of Attack Algorithm+Victim Model+Hyperparameter classes makes single-label classification difficult. Last but not least, the relationship among these signatures is neglected in these works. Overall, a unified and extensible framework for adversarial attribution is needed to deal with more signatures and alleviate the combination explosion issue.

In this paper, to solve AAP, we explore the relationship among the three signatures and propose a multi-task learning framework called MTAA. MTAA contains a perturbation extraction module, an adversarial-only extraction module, and a classification and regression module. It takes the relationship between the attack algorithm and the corresponding hyperparameter into account and uses an uncertainty weighted loss to adjust the weights of the three recognition tasks. We summarize the main contributions as follows:

– To solve the Adversarial Attribution Problem (AAP), we propose a Multi-Task Adversarial Attribution (MTAA) method. MTAA recognizes the three signatures simultaneously in one unified model, explores the relationship among them and alleviates the combination explosion problem.
– A feature-level adversarial perturbation extractor is proposed to improve the performance of MTAA.
– The experimental comparison among MTAA, the single-label baseline and the single-task baseline illustrates the scalability and computational efficiency of MTAA.

2 Related Work

Adversarial attacks were first discovered by Szegedy [3], who revealed a vulnerability of deep learning models: attackers can manipulate their predictions by adding visually imperceptible perturbations to images. Since then, a large number of adversarial attack algorithms have emerged. The most representative among them are gradient-based attacks such as the one-shot FGSM [4] and the iterative PGD [5], as well as optimization-based attacks such as C&W [6]. FGSM [4] crafts an adversarial example using the sign of the gradient with respect to the ground-truth label and can be formulated as:

x' = x + ε · sign(∇_x ℓ(h(x; θ), u))    (1)

where x is a clean example, u is its label and x' is the adversarial example. h(·) is the victim model with parameters θ, ℓ(·) is the loss function, ∇_x(·) is the gradient with respect to x, and sign(·) is the sign function. ε is the hyperparameter that controls the attack intensity.
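A minimal PyTorch-style sketch of the FGSM update in Eq. (1) is given below; the cross-entropy loss and the [0, 1] pixel range are assumptions made for illustration.

```python
import torch
import torch.nn.functional as F

def fgsm(model: torch.nn.Module, x: torch.Tensor, y: torch.Tensor, eps: float) -> torch.Tensor:
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)                    # l(h(x; theta), u)
    grad = torch.autograd.grad(loss, x)[0]
    return (x + eps * grad.sign()).clamp(0, 1).detach()    # x' = x + eps * sign(grad)
```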


Fig. 2. Overall architecture of Multi-Task Adversarial Attribution.

PGD [5] can be seen as the iterative version of FGSM and is formulated as:

x^{t+1} = Clip_{x,ε}( x^t + α · sign(∇_x ℓ(h(x^t; θ), u)) )    (2)

where x^t is the adversarial example at step t and u is its label. h(·) is the victim model with parameters θ, ℓ(·) is the loss function, Clip_{x,ε}(·) clips the result to attack intensity ε around x, and α is the step size of each attack iteration.

C&W [6] computes the adversarial perturbation by solving the following optimization problem:

min ‖ρ‖_p + e · a(x + ρ),   s.t.  x + ρ ∈ [0, 1]^m    (3)

where ρ is the perturbation to be optimized and e is a suitably chosen constant. a(·) is an objective function satisfying h(x + ρ) = l if and only if a(x + ρ) ≤ 0, where h(·) is the victim model and l is the target label.
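For comparison, a hedged sketch of the iterative PGD update in Eq. (2) follows; it assumes a cross-entropy loss and realizes Clip_{x,ε} as a projection onto the L∞ ball of radius ε around x.

```python
import torch
import torch.nn.functional as F

def pgd(model, x, y, eps: float, alpha: float, steps: int) -> torch.Tensor:
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)  # Clip_{x,eps}
    return x_adv.detach()
```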

3 Methodology

Section 3.1 presents the architecture of the Multi-Task Adversarial Attribution (MTAA) framework, and Sect. 3.2 presents the overall loss function weighted by homoscedastic uncertainty.

3.1 Multi-task Adversarial Attribution (MTAA)

Overall Architecture. As shown in Fig. 2, the architecture of the multi-task learning framework for AAP can be divided into three parts: (1) a perturbation extraction module, (2) an adversarial-only extraction module, and (3) a classification and regression module.

Fig. 3. The MSE and MAE between feature maps of clean and adversarial examples on ImageNet. The ‘conv’ means different conv blocks of ResNet101.

Perturbation Extraction Module. The perturbation extraction module leverages information from both clean and adversarial examples. We use an Auto-Encoder (AE) as the perturbation extractor. An AE learns effective representations of a set of data in an unsupervised manner: with an encoder R and a decoder G, the AE minimizes the reconstruction error ‖x − G(R(x))‖_2^2 for each input sample x. However, learning the background information is unhelpful for adversarial attribution, and it has been shown in [18] that building a flow density estimator on latent representations (feature maps) works better than on the raw image. On the other hand, we find that the difference between the feature maps of original images and adversarial examples becomes larger in deeper layers, as shown in Fig. 3. We therefore add a ResNet101 feature extractor (or another DNN) before the AE to obtain latent representations (feature maps, fm for short) of the adversarial and corresponding clean examples, which helps the AE learn the difference between them. We optimize the AE by minimizing the loss function:

L_{MSE1} = \frac{1}{n} \sum_{j=1}^{n} \| x_j^{'fm} − G(R(x_j^{'fm})) − x_j^{fm} \|^2    (4)

The objective of (4) is to make the AE output G(R(x_j^{'fm})) equal to the feature-level perturbation x_j^{'fm} − x_j^{fm}, where x_j^{'fm} denotes the feature maps of the j-th adversarial example, x_j^{fm} those of the corresponding clean example, and n is the total number of examples. We train the AE to learn the manifold of adversarial perturbations as an augmented feature.

Adversarial-Only Extraction Module. The adversarial-only extraction module takes only adversarial examples as input. Adversarial examples first pass through global shared layers, which use ResNet101's feature extractor (or another DNN) to learn a representation shared by the three signatures. Task-specific layers then learn task-specific representations, one branch for attack algorithm and hyperparameter and one for victim model; the task-specific layers are the final feature extraction layers of ResNet101. We also apply high-low feature fusion when learning the representation for the attack and hyperparameter signatures: feature maps from a layer near the output and a layer near the input of ResNet101 are concatenated. Low-level features have small receptive fields and capture partial, detailed information, which helps recognize L∞ attacks because the L∞ constraint bounds the perturbation of each pixel; high-level features have large receptive fields and capture integral, rich information, which helps recognize L2 attacks because the L2 constraint bounds the perturbation over all pixels. Finally, the feature vectors learned by the task-specific layers are sent to the classification and regression module.
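A hedged sketch of the perturbation extraction module described above is given below: a frozen ResNet backbone produces feature maps, and an auto-encoder G(R(·)) is trained with the objective of Eq. (4). The auto-encoder architecture and layer sizes are assumptions for illustration.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet101

# Frozen feature extractor (the paper uses a pretrained ResNet101; weights omitted here).
backbone = nn.Sequential(*list(resnet101(weights=None).children())[:-2]).eval()
for p in backbone.parameters():
    p.requires_grad_(False)

class PerturbationAE(nn.Module):
    def __init__(self, channels: int = 2048):
        super().__init__()
        self.encoder = nn.Sequential(nn.Conv2d(channels, 256, 1), nn.ReLU())  # R(.)
        self.decoder = nn.Conv2d(256, channels, 1)                            # G(.)

    def forward(self, fm: torch.Tensor) -> torch.Tensor:
        return self.decoder(self.encoder(fm))

ae = PerturbationAE()

def perturbation_loss(x_adv: torch.Tensor, x_clean: torch.Tensor) -> torch.Tensor:
    fm_adv, fm_clean = backbone(x_adv), backbone(x_clean)        # latent feature maps
    return ((ae(fm_adv) - (fm_adv - fm_clean)) ** 2).mean()      # Eq. (4), MSE form
```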

Classification and Regression Module. We define attack algorithm and victim model attribution as two classification tasks. Each classification layer is a fully connected layer, and the two classification tasks are optimized by minimizing two cross-entropy losses:

L_{CE1} = -\frac{1}{n} \sum_{j} \sum_{b_1=1}^{m_1} Q_j^{b_1} \log P_j^{b_1}    (5)

L_{CE2} = -\frac{1}{n} \sum_{j} \sum_{b_2=1}^{m_2} Q_j^{b_2} \log P_j^{b_2}    (6)

where L_{CE1} and L_{CE2} are the losses of the attack algorithm and victim model classification tasks, respectively. The number of adversarial examples x_j is n, and Q_j^{b} is an indicator that equals 1 if x_j's label is b. m_1 and m_2 are the numbers of attack algorithm and victim model labels, respectively, and P_j^{b} is the softmax estimate of the probability that x_j belongs to label b.

We define hyperparameter attribution as a mixed regression task. Because of the dependency between attack algorithm and hyperparameter, we concatenate the output of the attack algorithm classifier with the extracted features as the input of the hyperparameter regression. Formally, the regression problem is expressed as:

Y = vτ + zγ + δ    (7)

where Y is the predicted value of the hyperparameter, v is the feature extracted by the task-specific layers for attack and hyperparameter together with the perturbation extraction module, τ is the weight of v, z is the logits of the attack classification layer, γ is the weight of z, and δ is stochastic noise with mean 0 and variance σ that accounts for the measurement error of the data itself. We optimize this regression task by minimizing the mean square error (MSE) loss:

L_{MSE2} = \frac{1}{m_3} \sum_{j}^{m_3} ( \hat{Y}_j − Y_j )^2    (8)

where \hat{Y}_j is the estimated value of x_j's hyperparameter, Y_j is the ground truth, and m_3 is the number of hyperparameter values.
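The classification and regression module can be sketched as follows: two linear classifiers plus a mixed regression head that concatenates the attack-classifier logits with the task features, following Eq. (7). Feature dimensions and layer sizes are assumptions for illustration, not the authors' exact code.

```python
import torch
import torch.nn as nn

class ClsRegHeads(nn.Module):
    def __init__(self, feat_dim: int = 2048, n_attacks: int = 3, n_victims: int = 5):
        super().__init__()
        self.attack_cls = nn.Linear(feat_dim, n_attacks)      # attack algorithm classifier
        self.victim_cls = nn.Linear(feat_dim, n_victims)      # victim model classifier
        self.hyper_reg = nn.Linear(feat_dim + n_attacks, 1)   # Y = v*tau + z*gamma (+ noise)

    def forward(self, v_attack: torch.Tensor, v_victim: torch.Tensor):
        z = self.attack_cls(v_attack)                         # attack logits z
        y_victim = self.victim_cls(v_victim)
        y_hyper = self.hyper_reg(torch.cat([v_attack, z], dim=-1))  # features concatenated with logits
        return z, y_victim, y_hyper.squeeze(-1)
```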

3.2 Uncertainty Weighted Losses

Inspired by [19], the performance of a multi-task learning model relies strongly on the weights assigned to the different tasks. We therefore adopt the standard uncertainty weighted losses, which leverage homoscedastic uncertainty: an uncertainty that does not depend on the input data but on the task itself. Following the steps of maximum likelihood inference, we first describe the probabilistic models of the regression task and the classification tasks as (9) and (10), respectively:

p(y | f^W(x), σ_1) = N(f^W(x), σ_1^2)    (9)

p(y | f^W(x), σ_2) = Softmax( \frac{1}{σ_2^2} f^W(x) )    (10)

where f^W(x) is the output of a multi-task learning model with parameters W and input x. N(·) is the Gaussian likelihood with an observation noise parameter σ_1 that captures how much noise is present in the outputs. The classification likelihood is scaled by σ_2^2 so that the model logits follow a Boltzmann distribution through the softmax function. The second step is factorizing over the outputs, as illustrated in Eq. (11):

p(y_1, ..., y_T | f^W(x)) = p(y_1 | f^W(x)) ··· p(y_T | f^W(x))    (11)

where y_1, ..., y_T are the model's outputs for T tasks, i.e., attack algorithm and victim model classification as well as hyperparameter regression in AAP. The third step is taking the log of the likelihood function to conduct maximum likelihood inference. The log-likelihoods of the regression task and the classification tasks are given in Eqs. (12) and (13), respectively:

\log p(y | f^W(x), σ_1) ∝ −\frac{1}{2σ_1^2} \| y − f^W(x) \|^2 − \log σ_1    (12)

\log p(y = c | f^W(x), σ_2) = \frac{1}{σ_2^2} f_c^W(x) − \log \sum_{c'} \exp\Big( \frac{1}{σ_2^2} f_{c'}^W(x) \Big)    (13)

where f_c^W(x) is the c-th component of the vector f^W(x). In our case, adding the uncertainty weighted losses to the overall loss function and following maximum likelihood inference, the combined loss function can be written as:

L(W, σ_1, σ_2, σ_3)
  = −\log p(y_1, y_2 = c, y_3 = c | f^W(x))
  = −\log \big[ N(y_1; f^W(x), σ_1^2) · Softmax(y_2 = c; f^W(x), σ_2) · Softmax(y_3 = c; f^W(x), σ_3) \big]
  = \frac{1}{2σ_1^2} \| y_1 − f^W(x) \|^2 + \log σ_1 − \log p(y_2 = c | f^W(x), σ_2) − \log p(y_3 = c | f^W(x), σ_3)
  = \frac{1}{2σ_1^2} L_1(W) + \frac{1}{σ_2^2} L_2(W) + \frac{1}{σ_3^2} L_3(W) + \log σ_1
    + \log \frac{ \sum_{c'} \exp( \frac{1}{σ_2^2} f_{c'}^W(x) ) }{ \big( \sum_{c'} \exp f_{c'}^W(x) \big)^{1/σ_2^2} }
    + \log \frac{ \sum_{c'} \exp( \frac{1}{σ_3^2} f_{c'}^W(x) ) }{ \big( \sum_{c'} \exp f_{c'}^W(x) \big)^{1/σ_3^2} }
  ≈ \frac{1}{2σ_1^2} L_1(W) + \frac{1}{σ_2^2} L_2(W) + \frac{1}{σ_3^2} L_3(W) + \log σ_1 + \log σ_2 + \log σ_3    (14)–(21)

where L_1(W) = \| y_1 − f^W(x) \|^2 is the MSE loss of the hyperparameter regression task, L_2(W) = −\log Softmax(y_2, f^W(x)) is the cross-entropy loss of the attack algorithm classification task, and L_3(W) = −\log Softmax(y_3, f^W(x)) is the cross-entropy loss of the victim model classification task. The third step of the derivation uses the approximations in (12) and (13). The combined loss function is minimized by optimizing the parameters W, σ_1, σ_2 and σ_3. To simplify the optimization objective, the explicit simplifying assumption \frac{1}{σ_2^2} \sum_{c'} \exp\big( \frac{1}{σ_2^2} f_{c'}^W(x) \big) ≈ \big( \sum_{c'} \exp f_{c'}^W(x) \big)^{1/σ_2^2} is used in the last approximate transition. σ_1, σ_2 and σ_3 measure the uncertainty of each task: a higher scale value leads to a lower contribution of the corresponding loss. The three scales are regularized by the last three log terms in the formula, which penalize the objective when the values grow too large.
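A hedged PyTorch sketch of the approximate objective in Eq. (21) is shown below, using the common learnable log-σ parameterization of homoscedastic uncertainty; it is not the authors' exact code.

```python
import torch
import torch.nn as nn

class UncertaintyWeightedLoss(nn.Module):
    def __init__(self):
        super().__init__()
        self.log_sigma = nn.Parameter(torch.zeros(3))   # log sigma_1, log sigma_2, log sigma_3

    def forward(self, l_reg: torch.Tensor, l_cls1: torch.Tensor, l_cls2: torch.Tensor):
        s1, s2, s3 = self.log_sigma
        return (0.5 * torch.exp(-2 * s1) * l_reg + s1    # L1/(2*sigma_1^2) + log sigma_1
                + torch.exp(-2 * s2) * l_cls1 + s2       # L2/sigma_2^2   + log sigma_2
                + torch.exp(-2 * s3) * l_cls2 + s3)      # L3/sigma_3^2   + log sigma_3
```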

4 Experiments

4.1 Experimental Setup

Dataset: The attribution scenario we consider is shown in Table 1. ε in FGSM and PGD is the maximum perturbation. For PGD, α is the step size and step is the number of attack iterations. For C&W, κ is the attack confidence, C is the box-constraint parameter and step is the number of attack iterations. For the hyperparameter regression task, we focus on recognizing the maximum perturbation ε for FGSM and PGD, ranging from 10/255 to 200/255 with step size 10/255, and the confidence κ for C&W, ranging from 5% to 100% with step size 5%. We use the shuffled MNIST training set (55000 images) and the ImageNet validation set (50000 images), together with the attack toolbox CleverHans [12], to generate adversarial examples. For each Attack Algorithm+Victim Model+Hyperparameter class, we generate 160 examples for training and 20 examples for testing.

Table 1. The attribution scenario of adversarial attribution.

Attack Algorithm | Hyperparameter | Victim Model
FGSM (L∞) | ε: 10/255–200/255 (10/255) | InceptionV3, ResNet18, ResNet50, VGG16, VGG19
PGD (L2) | ε: 10/255–200/255 (10/255), α: 10/255, step: 100 | InceptionV3, ResNet18, ResNet50, VGG16, VGG19
C&W (L2) | κ: 5%–100% (5%), C: 50, step: 500 | InceptionV3, ResNet18, ResNet50, VGG16, VGG19

Implementation Details: We use PyTorch for training and inference on a machine with 2 Intel Xeon Platinum 8255C 2.50 GHz 32-core CPUs, 43 GB of memory and 2 NVIDIA RTX3090 GPUs. For the perturbation extraction module, we use a pretrained ResNet50 and ResNet101 to extract feature maps for MNIST and ImageNet, respectively, and an Auto-Encoder to extract the perturbation. For the adversarial-only extraction module, we use a pretrained ResNet50/ResNet101 as the global shared layers and two of ResNet50's/ResNet101's last bottlenecks as task-specific layers for MNIST and ImageNet, respectively. Adam [17] is employed with a cosine annealing learning rate schedule, initial learning rate β = 0.001, weight decay π = 0.001 and mini-batch size 512. In addition, to unify the measurement scale of FGSM's and PGD's ε and C&W's κ in the hyperparameter regression task, we magnify the attack intensity labels for FGSM and PGD by 255 times.

Evaluation Metrics: Accuracy is used to evaluate the attack algorithm and victim model classification tasks, and the Root Mean Square Error (RMSE) is used to evaluate the hyperparameter regression task. We measure the multi-task learning performance Δ_MTL as in [20], i.e., the multi-task performance of a model f is the average per-task change in performance w.r.t. the single-task baseline B:

Δ_{MTL} = \frac{1}{T} \sum_{k=1}^{T} (−1)^{o_k} ( M_{f,k} − M_{B,k} ) / M_{B,k}    (22)

where T is the total number of tasks, M_{f,k} is the value of metric M for task k on the multi-task model f, and M_{B,k} is its value on the single-task model B, i.e., accuracy for the two classification tasks and RMSE for the regression task. o_k = 1 if a lower value means better performance for the metric of task k, and 0 otherwise. The single-task performance is measured with a fully converged model that uses the same backbone network to perform only that task. In addition to performance, we also consider the model resources, i.e., the number of parameters and FLOPS, when comparing MTAA.
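A small sketch of the Δ_MTL computation in Eq. (22) is given below; the example numbers are hypothetical and only illustrate the sign convention.

```python
def delta_mtl(multi, single, lower_is_better):
    terms = []
    for m_f, m_b, lower in zip(multi, single, lower_is_better):
        sign = -1.0 if lower else 1.0              # (-1)^{o_k}
        terms.append(sign * (m_f - m_b) / m_b)
    return sum(terms) / len(terms)

# Hypothetical example: two accuracies (higher is better) and one RMSE (lower is better).
print(delta_mtl([99.0, 97.0, 6.8], [98.5, 96.0, 7.3], [False, False, True]))
```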

4.2 Attribution Experimental Results

In this section, we compare the performance of MTAA with the corresponding single-task baselines, i.e., training an individual DNN for each of the three tasks on the same backbone. The results are shown in Tables 2 and 3. Note that [14] and [15] treat AAP as a single-label classification task, i.e., they combine attack algorithm, victim model and hyperparameter into one label; to compare MTAA with these two works, we reproduce their backbones and treat them as single-task baselines. For the FGSM and PGD attacks, the RMSE is actually 6.04/255 on MNIST and 6.79/255 on ImageNet, because the labels of ε are magnified by 255 times to unify the measurement scale. The multi-task learning performance Δ_MTL reaches 1.17% and 3.93% on MNIST and ImageNet, respectively, which means MTAA improves over the single-task baseline. For the two classification tasks, MTAA achieves 100% and 99.88% accuracy on MNIST, and 99.78% and 97.84% accuracy on ImageNet. For the regression task, the RMSE of MTAA is better than the single-task baseline on both datasets, which means our framework captures the relationship between the attack algorithm and the corresponding hyperparameter. Besides, MTAA requires fewer resources, i.e., fewer parameters and FLOPS. Therefore, AAP should be considered a multi-task learning problem rather than a single-task learning problem.

Table 2. Results (%/RMSE) of [14] and [15] as single-task baselines and MTAA on MNIST.

 | Backbone | Model | FLOPS (G) | Params (M) | Attack Algorithm (%)↑ | Victim Model (%)↑ | Hyperparameter (RMSE)↓ | ΔMTL (%)↑
Single-Task | Self-built | [14] | – | – | 95.36 | 93.47 | 9.64 | +0.00
Single-Task | ResNet-50 | [15] | 0.97 | 71 | 100 | 99.81 | 6.46 | +0.00
MTL | ResNet-50 | MTAA | 0.77 | 48 | 100 | 99.88 | 6.04 | +1.17

Table 3. Results (%/RMSE) of [14, 15] and ResNet101 as single-task baselines and MTAA on ImageNet.

 | Backbone | Model | FLOPS (G) | Params (M) | Attack Algorithm (%)↑ | Victim Model (%)↑ | Hyperparameter (RMSE)↓ | ΔMTL (%)↑
Single-Task | Self-built | [14] | – | – | 88.54 | 83.22 | 12.97 | +0.00
Single-Task | ResNet-50 | [15] | 12 | 71 | 97.43 | 93.25 | 7.93 | +0.00
Single-Task | ResNet-101 | – | 24 | 128 | 98.68 | 94.72 | 7.33 | +0.00
MTL | ResNet-50 | MTAA | 9 | 48 | 99.68 | 96.95 | 7.32 | +4.66
MTL | ResNet-101 | MTAA | 21 | 108 | 99.78 | 97.84 | 6.79 | +3.93

4.3 Scalability of MTAA

We highlight the scalability of MTAA with respect to the attribution scenario. [14] and [15] treat AAP as a single-label classification task, so as the number of attack algorithms and victim models increases, a combination explosion problem appears: the rapid growth of Attack Algorithm+Victim Model+Hyperparameter classes makes single-label classification difficult. To examine this problem, we conduct experiments on 2 attack algorithms, 3 victim models and 20 hyperparameter values, and on 3 attack algorithms, 5 victim models and 20 hyperparameter values, with the hyperparameter settings in Table 1. For the single-label classification baseline, this yields 2 × 3 × 20 = 120 classes and 3 × 5 × 20 = 300 classes, respectively. The results are shown in Tables 4 and 5. Note that MTAA uses ResNet50 and ResNet101 as the backbone on MNIST and ImageNet, respectively. To ensure a fair comparison, we add a 'ResNet101' column in Table 5 so that the backbone of the single-label classifier matches that of MTAA. The accuracy of the single-label classifiers ([14] and [15]) drops dramatically as the number of attack algorithms and victim models increases. Interestingly, the single-label classifiers' accuracy reaches its minimum when facing PGD and C&W; we believe this is because the hyperparameters of PGD and C&W are harder to recognize than that of FGSM. In contrast, MTAA performs stably on the two classification tasks as the number of attack algorithms and victim models increases, and its regression performance fluctuates only within a small range. Therefore, AAP should be considered a multi-task learning problem rather than a single-label classification problem.

Table 4. Results (%/RMSE) for 2 attack algorithms, 3 victim models and 3 attack algorithms, 5 victim models on MNIST.

Attack Algorithms | Victim Models | Single-Label [14] | Single-Label [15] | MTAA: Attack Alg. | MTAA: Victim Model | MTAA: Hyperparameter
FGSM, PGD, C&W | InceptionV3, ResNet18, ResNet50, VGG16, VGG19 | 57.53 | 64.15 | 100 | 99.88 | 6.04
FGSM, PGD | InceptionV3, ResNet18, VGG16 | 83.74 | 92.21 | 100 | 100 | 3.72
FGSM, C&W | InceptionV3, ResNet18, VGG16 | 55.76 | 68.33 | 100 | 99.96 | 5.71
FGSM, PGD | InceptionV3, ResNet50, VGG19 | 85.54 | 91.71 | 100 | 100 | 2.06
PGD, C&W | InceptionV3, ResNet50, VGG19 | 50.06 | 56.62 | 100 | 99.96 | 7.02

4.4 Ablation Study

To validate the effectiveness of different components of the adversarial-only extraction module in MTAA, such as the global shared layers (GSL), the weighting of the losses and the task-specific layers (TSL), we conduct ablation experiments; the results


Table 5. Results (%/RMSE) for 2 attack algorithms, 3 victim models and 3 attack algorithms, 5 victim models on ImageNet.

Attack Algorithms | Victim Models | Single-Label [14] | Single-Label [15] | Single-Label ResNet101 | MTAA: Attack Alg. | MTAA: Victim Model | MTAA: Hyperparameter
FGSM, PGD, C&W | InceptionV3, ResNet18, ResNet50, VGG16, VGG19 | 50.71 | 59.73 | 61.22 | 99.78 | 97.84 | 6.79
FGSM, PGD | InceptionV3, ResNet18, VGG16 | 71.26 | 78.53 | 82.96 | 99.97 | 98.93 | 5.96
FGSM, C&W | InceptionV3, ResNet18, VGG16 | 49.98 | 59.42 | 62.33 | 99.97 | 98.24 | 7.76
FGSM, PGD | InceptionV3, ResNet50, VGG19 | 63.76 | 71.32 | 84.07 | 99.93 | 98.69 | 6.02
PGD, C&W | InceptionV3, ResNet50, VGG19 | 39.98 | 47.21 | 48.04 | 99.97 | 98.21 | 8.01

are shown in Tables 6 and 7. From the second row of Tables 6 and 7, we can see that a suitable weight balancing method greatly influences AAP performance, especially for victim model classification. The results in the third row of both tables show that deploying TSL for the different tasks is beneficial, since it further extracts task-specific features. According to the fourth row of both tables, a deeper network for the GSL (ResNet18 to ResNet50 for MNIST and ResNet50 to ResNet101 for ImageNet) performs better because it has a stronger feature extraction capability. The last row of both tables reports the ablation of the perturbation extractor (PE) module, indicating that adding PE improves the attribution performance for AAP.

Table 6. Results (%/RMSE) of the ablation study on different model architectures of MTAA on MNIST.

Architecture of MTAA | Attack Algorithm | Victim Model | Hyperparameter
ResNet18+simple add loss | 98.92 | 84.72 | 7.88
ResNet18+Uncertainty loss weight | 99.54 | 97.84 | 7.21
ResNet18+Uncertainty loss weight+TSL | 99.79 | 98.37 | 7.02
ResNet50+Uncertainty loss weight+TSL | 99.94 | 99.26 | 6.42
ResNet50+Uncertainty loss weight+TSL+PE | 100 | 99.88 | 6.04


Table 7. Results (%/RMSE) of the ablation study on different model architectures of MTAA on ImageNet.

Architecture of MTAA | Attack Algorithm | Victim Model | Hyperparameter
ResNet50+simple add loss | 98.12 | 70.82 | 8.7
ResNet50+Uncertainty loss weight | 99.3 | 95.98 | 7.9
ResNet50+Uncertainty loss weight+TSL | 99.48 | 96.1 | 7.81
ResNet101+Uncertainty loss weight+TSL | 99.61 | 97.13 | 7.25
ResNet101+Uncertainty loss weight+TSL+PE | 99.78 | 97.84 | 6.79

5 Conclusion

The Adversarial Attribution Problem (AAP) is a vital part of Reverse Engineering of Deceptions: recognizing the signatures behind adversarial examples, such as the attack algorithm, the victim model and the hyperparameter. Since existing works neglect the relationship among these signatures and lack a scalable, unified framework, we propose a multi-task learning framework with an uncertainty weighted loss to solve this problem efficiently and scalably. The experimental results on MNIST and ImageNet show that MTAA achieves state-of-the-art performance.

Acknowledgements. This work was partially supported by the National Natural Science Foundation of China (No. 61772284).

References 1. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016) 2. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR, pp. 580–587 (2014) 3. Szegedy, C., et al.: Intriguing properties of neural networks. In: ICLR (2014) 4. Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: ICLR (2015) 5. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. In: ICLR (2018) 6. Carlini, N., Wagner, D, Towards evaluating the robustness of neural networks. In: S&P, pp. 39–57 (2017). https://doi.org/10.1109/SP.2017.49 7. Qin, Y., Frosst, N., Sabour, S., Raffel, C., Cottrell, G., Hinton, G.: Detecting and diagnosing adversarial images with class-conditional capsule reconstructions. In: ICLR (2020) 8. Tramèr, F., Kurakin, A., Papernot, N., Goodfellow, I., Boneh, D., McDaniel, P.: Ensemble adversarial training: attacks and defenses. In: ICLR (2018) 9. DARPA Artificial Intelligence Exploration (AIE) Opportunity and DARPA-PA19-02-09 and Reverse Engineering of Deceptions (RED) 10. Gong, Y., et al.: Reverse engineering of imperceptible adversarial image perturbations. In: ICLR (2022)


11. Thaker, D., Giampouras, P., Vidal, R.: Reverse Engineering p attacks: a blocksparse optimization approach with recovery guarantees. In: ICML, pp. 21253–21271 (2022) 12. Papernot, N., et al.: Technical report on the CleverHans v2.1.0 adversarial examples library. arXiv preprint arXiv:1610.00768 (2016) 13. Moayeri, M., Feizi, S, Sample efficient detection and classification of adversarial attacks via self-supervised embeddings. In: ICCV, pp. 7677–7686 (2021) 14. Dotter, M., Xie, S., Manville, K., Harguess, J., Busho, C.: Rodriguez, M.: Adversarial attack attribution: discovering attributable signals in adversarial ML attacks. In: RSEML Workshop at AAAI (2021) 15. Li, Y.: Supervised Classification on Deep Neural Network Attack Toolchains. Northeastern University (2021) 16. Nicholson, D.A., Emanuele, V.: Reverse engineering adversarial attacks with fingerprints from adversarial examples. arXiv preprint arXiv:2301.13869 (2023) 17. Kingma, D. P., Ba, J, Adam: a method for stochastic optimization. In: ICLR (2015) 18. Zhang, H., Li, A., Guo, J., Guo, Y.: Hybrid models for open set recognition. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12348, pp. 102–117. Springer, Cham (2020). https://doi.org/10.1007/978-3-03058580-8_7 19. Kendall, A., Gal, Y., Cipolla, R.: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: CVPR, pp. 7482–7491 (2018) 20. Maninis, K.K., Radosavovic, I., Kokkinos, I.: Attentive single-tasking of multiple tasks. In: CVPR, pp. 1851–1860 (2019)

A Reinforcement Learning Method for Generating Class Integration Test Orders Considering Dynamic Couplings

Yanru Ding1,2, Yanmei Zhang1, Guan Yuan1(B), Yingjie Li1, Shujuan Jiang1, and Wei Dai2

1 School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China
{yrding,ymzhang,yuanguan,yingjieli,shjjiang}@cumt.edu.cn
2 Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou 221116, China
[email protected]

Abstract. In recent years, with the rapid development of artificial intelligence, reinforcement learning has made significant progress in various fields. However, there are still some challenges when applying reinforcement learning to problems in software engineering. The generation of class integration test orders is a key challenge in object-oriented program integration testing. Previous research mainly focused on static couplings and neglected dynamic couplings, leading to inaccurate cost measurement of class integration test orders. In this paper, we propose a reinforcement learning method to generate class integration test orders that considers dynamic couplings. First, the concept of dynamic couplings generated by polymorphism is introduced, and a strategy for measuring the stubbing complexity of simulating dynamic dependencies is proposed. Then, we combine this new stubbing complexity with a reinforcement learning method to generate class integration test orders and achieve the optimal result with minimal overall stubbing complexity. Comprehensive experiments show that the proposed approach outperforms other methods in measuring the cost of generating class integration test orders.

Keywords: Class integration test order · Stubbing complexity · Dynamic coupling · Reinforcement learning

1 Introduction

In recent years, with the rapid development of artificial intelligence, Reinforcement Learning (RL) has made significant progress in various fields, such as games [1], robotics, computer vision [2], and software testing optimization [3], especially in solving sequential decision-making problems [4]. The Class Integration Test Order (CITO) generation problem is a key decision-making problem in integration testing, which is an important link in software engineering. Researchers have successfully applied RL to this problem [5,6], but some challenges still need to be addressed.


When generating CITOs, stubs are constructed to emulate the dependencies between non-integrated and integrated classes [7]. Constructing a stub requires an understanding of the related classes, including their member functions and call relations. If a stub is too simplistic to replicate the required call behavior, it postpones failure detection and leads to errors [8]. Minimizing the cost of constructing stubs therefore demands rational class integration test order generation. Initially, the number of stubs was used to evaluate the stubbing cost, yet distinct stubs simulate different methods or attributes, and their costs vary. Relying on inter-class dependencies, Briand et al. [9] adopted stubbing complexity to evaluate the stubbing cost. However, prior stubbing complexity only modeled static dependencies [5,6,9,10] and neglected the dynamic dependencies induced by polymorphism, an inherent feature of object-oriented programs [10,11]. Both static and dynamic dependencies should be leveraged as coupling information.

Several studies [10–16] have explored the effect of dynamic dependencies on CITO generation, yet they have limitations. Zhang et al. [14] treated the stubbing complexity of associations and dynamic dependencies as equal, and [10–15] solely minimized the number of stubs. These treatments can be inaccurate, since the dynamic binding between classes may not merely entail association but aggregation too [11]. Meng et al. [16] differentiated dynamic couplings (inter-class call activities) from intra-class calls. None of the previous methods considers the stubbing cost of dynamic dependencies, so the computed stubbing complexity is inaccurate.

In this study, we propose a new RL-based method to generate CITOs along with a redesigned stubbing complexity measurement strategy, which evaluates the impact of dynamic couplings on CITOs. We evaluate our method on five programs and find that dynamic couplings significantly affect CITO generation. Specifically, the contributions of this paper are as follows:

– We propose a stubbing cost measurement that considers dynamic couplings. This measurement is a novel contribution to the CITO generation problem.
– We apply the new stubbing cost measurement in an RL algorithm, enabling the generation of CITOs with minimum overall stubbing cost.
– Experimental results show that the proposed method enhances the completeness of stubbing cost evaluation and minimizes the cost as much as possible.

The rest of this paper is organized as follows. Section 2 introduces the related work. Our method is presented in Sect. 3. Experiments and result analysis follow in Sect. 4. Section 5 summarizes the full text.

2

Related Work

According to techniques, the existing CITO generation methods can be categorized into three types: graph-based, search-based, and RL-based methods. The graph-based method proposed by Kung et al. [17] was the first method that identified the CITO problem during integration testing. They solved it by a graph-based method, which uses the Object Relationship Diagram (ORD) to depict the inter-class dependencies of the program under test. By identifying

Reinforcement Learning for Class Integration Test Order Generation

97

and breaking strongly connected components, this method obtains CITOs. To minimize the number of stubs constructed, researchers assigned values to the dependencies and removed the ones with lower costs to reduce the stubbing cost [18,19]. Briand et al. [20] demonstrated that it is inaccurate to rely solely on quantity but ignore the internal couplings. Instead, they proposed an indicator called stubbing complexity, which accounted for the cost of simulating different dependencies by analyzing and distinguishing method calls and attributes of inter-class relationships. The search-based method introduced by Briand et al. [9] solved the CITO generation problem by regarding it as a multi-objective optimization problem (MOP). Making use of the stubbing complexity as guidance information, Briand et al. [9] developed a search-based strategy to generate CITOs based on Genetic Algorithm (GA). Encoding the class first and then designing a population evolution strategy around the fitness function, this method finds the optimal particle that corresponds to the optimal CITO. Other search-based methods, such as Random Interaction Algorithm (RIA) [21] and Particle Swarm Optimization (PSO) [22], have also achieved good results by combining the stubbing complexity with fitness to guide the population’s evolution. Recently, with the rise of artificial intelligence, RL-based methods have emerged and have been found to yield better results when applied to solving the CITO problem. Czibula et al. [5] were the first to apply the RL algorithm to address this problem. They trained an agent using the Q-learning algorithm and recorded the agent’s behavioral paths as CITOs, resulting in a reduction in the number of required stubs. Building upon this work, an RL-based method for generating CITOs was proposed with the aim of reducing overall stubbing complexity [6]. This method decreases the stubbing cost by constructing less expensive stubs and confirms the applicability of applying RL to the CITO generation problem.

3

Method

In this section, we introduce a new method for measuring dynamic couplings, which helps calculate the stubbing complexity in simulating dynamic dependencies. We then incorporate this strategy into an RL algorithm to generate CITOs. 3.1

Dynamic Coupling

Object-oriented programming employs polymorphism, wherein a reference variable’s method invocation and bound content can vary based on different static dependencies. As an object-oriented programming language, Java formats polymorphism by interface implementation, base class inheritance, and method overriding, resulting in dynamic dependencies [23]. Specifically, dynamic dependencies can be categorized into the following cases: (1) Interface implementation (Fig. 1(a)): When class A implements interface I, it must implement all the abstract methods of the interface. If class B calls

98

Y. Ding et al.

Fig. 1. The formation of dynamic dependencies

class A, a dynamic dependency is formed between classes B and A when class B declares an I interface variable and instantiates it with class A. (2) Abstract class inheritance (Fig. 1(b)): If class A is an abstract class and class B inherits from class A while implementing the abstract methods, a dynamic dependency is formed between classes C and B when class C declares a class A variable and instantiates it with class B, and class C calls the abstract method overridden by class B. (3) Invocation of virtual methods (Fig. 1(c)): If class A is a concrete class and class B inherits from class A while overriding methods, a dynamic dependency is formed between classes C and B when class C declares an A class variable and instantiates it with class B, and class C calls the overridden method. Multiple inheritance can also raise dynamic dependencies. In Fig. 1(c), when class D inherits from class B and overrides a method from the ancestor class A, we consider that classes C, B, and D all form dynamic dependencies. Considering the potential redundancy and recommended limit of three levels in multi-level inheritance code, we solely focused on the impact of two-level inheritance. 3.2

Stubbing Complexity

The stubbing complexity, which is represented by SCplx(i, j), is used to measure the cost of constructing a stub for class i to simulate the behavior of class j that i called. The original SCplx(i, j) [9] is calculated by A(i, j) and M (i, j), where A(i, j) is the number of attributes accessed by i from j, M (i, j) is determined by the number of distinct parameters transferred from j to i. This method only considers static coupling. We represent the complexity of the dynamic dependency as D(i, j), which is determined by the number of methods that are implemented through interface implementation, abstract class inheritance, and invocation of virtual methods between classes i and j. We add dynamic coupling to improve the calculation of SCplx(i, j), as shown in Eq. (1), where A(i, j), M (i, j), and D(i, j) represent the normalization results of A(i, j), M (i, j), and D(i, j), and WA , WM , and WD indicate their weights. 2

2

2

1

SCplx(i, j) = [WA × A(i, j) + WM × M (i, j) + WD × D(i, j) ] 2

(1)

Reinforcement Learning for Class Integration Test Order Generation

99

Previous weights Wf = 1/num(f ), where f ∈ (A, M, D) and num(f ) equals the number of indicators, which is 3. Considering the differences between different indicators, we use the entropy weight method [24] to measure their weights. The process to calculate the weight of different indicators is as follows: To begin, a m∗3 evaluation matrix is established to represent inter-class relationships of a program, where m represents the program containing m classes. The matrix R is calculated by Eq. (2), where rij represents the attribute, method, and dynamic coupling complexity of class i when j = 1, 2, 3. R = (rij )m×3 (i = 1, 2, ..., m; j = 1, 2, 3)

(2)

Next, the proportion of the evaluation value for each indicator, represented by pij , is calculated by Eq. (3), which is then used to calculate the information entropy ej for the jth indicator as shown in Eq. (4). The constant K is determined by Eq. (5), where m is the number of classes. rij pij = m (i = 1, 2, ..., m; j = 1, 2, 3)

(3)

i=1

ej = −K

m 

pij ln pij (i = 1, 2, ..., m; j = 1, 2, 3)

(4)

i=1

K=

1 ln m

(5)

Finally, the weight of each indicator is denoted by Wj , where j corresponds to the weight of attribute, method, and dynamic coupling complexity. The calculation method is shown in Eq. (6). Wj = 2

1 − ej

j=1 (1

− ej )

(j = 1, 2, 3)

(6)

When we use o to represent a CITO, OCplx(o) to represent the overall stubbing complexity, and N to represent the set of stubs to be constructed, the overall stubbing complexity for generating CITOs can be obtained as follows: OCplx(o) =



SCplx(i, j)

(7)

(i,j)∈N

3.3

CITO Generation Based on Reinforcement Learning

In RL, the agent interacts with the environment to obtain a maximize reward [4]. When the agent picks action at from state st at time t, it will get a reward r that reflects action outcomes, boosting or reducing trends. Then, the environment shifts to st+1 , and the agent’ learn persists until obtains a maximal gain.

100

Y. Ding et al.

Assuming a program has n classes, each state offers n actions. Actions lead to distinct states, yielding n options per state. This creates n potential successor states, and max possible states are (nn+1 −1)/(n−1). If the agent traverses n+1 states sequentially, from initial to f th (f being final), and takes n actions, sf is treated as the terminal state. In this procedure, the state transition function is shown in Eq. (8) [5], where δ(si ) represents the union of all possible states in the ith group, and s ∈ δ(s) is called a neighbor of s. Using σ = (σ0 , σ1 , ..., σn ) for state path initial-final sf , where σ0 = s1 and σk+1 ∈ δ(σk ) ∀ {k ∈ 1, ..., n − 1}. Agent’s n actions on this path are aσ , forming action history. If path has no repeated actions, aσ becomes an alternative class order for integration and testing. δ(si ) =

n 

  nn − 1 {sn∗i−n+1+k} ∀i ∈ 1, ..., ∀k ∈ {1, ..., n − 1} n−1

(8)

k=1

In RL, rewards drive agent exploration [4]. Unfavorable learning yields minimum reward as a penalty. For instance, duplicate action class prompts reward −∞, denoting avoidance. Upon non-repeated final state arrival, an alternate action path is found. If its cost beats prior orders, CITO gets a higher reward. Merging this history curbs overall stubbing complexity, reducing expenses. To reduce CITOs’ total stubbing cost, the reward function in RL is remodeled, coupled with stubbing complexity calculation (Eq. (9)), where c is constant, OCplx(aσ ) is stubbing complexity of actions by agent aσ via state path σ:   −∞ (9) r(σ) = M ax − c × OCplx(aσ ) In this section, CITOs are generated via Q-learning, aiming to find optimal Q-values through Bellman equation resolution. It selects the highest Q-value action for the optimal state, iteratively uplifting policy by maintaining new value superiority. Q-value follows Eq. (10) [4], with α (learning rate), r (action a reward in state s), s (next state), a (next action), and γ (discount factor). Q(s, a) = Q(s, a) + α(r + γmaxQ(s , a ) − Q(s, a))

(10)

To escape local optima, -Greedy mechanism [5] adapts as: * Action with highest Q-value chosen with 1 −  probability. * Action with smallest n1 /n2 ratio selected with 2 probability, where n1 is stubs needed to add action, and n2 is avoidable stubs. * Random action selection. As the agent explores infinite state-action pairs, Q-values converge to optimum [25]. Accordingly, the action history associated with the path learned by the agent will converge to an extent, and the optimal CITO will be obtained.

Reinforcement Learning for Class Integration Test Order Generation

4

101

Experiments

4.1

Experimental Settings

Parameters. We configure Q-learning algorithm parameters as follows: Learning rate α initiates at 0.9, decrementing to 0.01 in training for future rewards focus. Discount rate γ, influencing future rewards on the present state, is set at 0.9, prioritizing future rewards. Exploration rate  at 0.8 balances exploration and exploitation. Constant c set to 100 as a balanced value within the stubbing complexity range. When the agent’s explored CITO has minimal stubbing cost, reward value M ax (set at 1000) is assigned. After 2 million training iterations, enabling better comparison with previous methods [5], optimal CITO is determined via action sequence yielding the highest overall reward value. Datasets. We select five commonly used open-source programs from the Software artifact Infrastructure Repository (SIR)1 . To obtain information on coupling, we employ Soot, a Java program analysis framework that identifies and analyzes inter-class dependencies in bytecode files. Table 1 presents program details, including the program name, description, number of classes (Cla.), dependencies (Dep.), number of cycles (Cir.), and lines of codes (LOC ). Table 1. Description of subject programs Programs

Description

Cla Dep Cir LOC

ANT

A Java based build tool 25

83 654 4093

email_spl

Email tool

63

38 2276

DNS

Domain Name System

16 6710

39 61

266

notepad_spl Source code editor

65

142 227 2419

Jboss

91

Application server

83

1

5252

We conduct experiments on an H3C server equipped with an Intel(R) Xeon (R) Gold 5120 CPU @ 2.20GHz, 24 cores, and 48 GB RAM, using JDK 1.8. 4.2

Results and Analysis

Effect of Considering Dynamic Couplings on the Generated CITOs. We employ the algorithms described in Seciton 3.3 to generate CITOs, both with and without considering dynamic couplings, and then compare these orders to assess the impact of dynamic couplings on CITOs. Firstly, we perform a static analysis on five programs and distinguish static and dynamic dependencies. The analysis results are presented in Table 2, where 1

http://sir.unl.edu/portal/index.html.

102

Y. Ding et al.

“Dep.” ’ and “Cir.” ’ represent the dependencies and circles after considering dynamic couplings. The information in this table indicates that, except for notepad_spl, the consideration of dynamic couplings leads to increased interclass dependencies or cycles. Notepad_spl does not have dynamic dependencies, whereas DNS and jboss have fewer and more dynamic dependencies, respectively, that do not form cycles. Thus, we categorize the tested programs into three groups: ANT and email_spl, which have multiple dynamic dependencies causing cycles, DNS and jboss with dynamic dependencies that do not cause cycles, and notepad_spl, which has no dynamic dependencies. Table 2. Changes in programs after considering dynamic couplings Programs

Cla Dep Dep.’ Cir

Cir.’

ANT

25

654

10651

83 112

email_spl

39 50

57

5

38

DNS

61 266 461

16

16

jboss

91 106 108

notepad_spl

65

98

1

98

227

1 227

Then, we employ the RL algorithm to generate CITOs and compare the results with and without considering dynamic couplings. The generated CITOs and constructed stubs for ANT are presented in Tables 3, where “Class Order” indicates the CITO of ANT excluding dynamic couplings and “Class Order” indicates the CITO considering dynamic couplings. “St” indicates the number of stubs simulating static dependencies, “Dy” indicates the number of stubs simulating dynamic dependencies, and “Sum” indicates the total number of stubs. Without considering dynamic couplings, 11 stubs are constructed, while 15 stubs required for dynamic dependencies are disregarded, totaling 26 stubs are needed. When considering dynamic couplings, 14 stubs are constructed for static dependencies and 6 for dynamic dependencies, totaling 20 stubs are needed. Table 3. ANT without and with the consideration of dynamic couplings. Cass Order 11 2 3 22 4

16 8

14 15 18 20 21 1

9

0

23 5 19 17 6

7

10 12 13 24 Sum

St Dy

1 0

0 0

0 2

0 0

0 1

0 0

0 0

0 0

0 1 1 0 0 2

0 0

0 0

4 7

1 0

1 0

0 0

1 1

0 1 0 0

0 0

0 0

0 2

0 0

0 0

11 15

Cass Order’ 2

3 4 11 21 8

14 0

16 15 18 20 23 22 19 1

5 13 9

24 17 6

7

10 12 Sum

St Dy

1 0 0 0 0 0

0 0

1 0

0 0 0 0

0 0

0 0

0 0

1 0

2 0

0 0

1 0

3 5

1 0

1 0

2 1

0 0

1 0

0 0

0 0

0 0

0 0

0 0

14 6

For DNS, both methods construct 11 stubs, but without considering dynamic couplings, the actual number of stubs to break 97 dynamic dependencies is

Reinforcement Learning for Class Integration Test Order Generation

103

neglected. For email_spl, five static stubs are required without considering dynamic stubs, and four static and one dynamic stubs are needed while considering dynamic couplings. For jboss, without considering dynamic couplings requires four stubs, while considering them requires three. However, the former neglects one dynamic dependency’s stubbing cost. Lastly, notepad_spl requires 47 and 46 stubs for both methods, respectively, to address static dependencies. In conclusion, not accounting for dynamic couplings significantly impacts generated CITOs, especially for programs with extensive dynamic couplings. Effect of Considering Dynamic Couplings on the Stubbing Cost of CITOs. We conducted experiments using “RL” without considering dynamic couplings, and using “RL” while considering dynamic couplings. Each of the five test programs was executed ten times, and the averages for the following metrics are shown in Table 4: the number of constructed stubs (“Stubs”), attribute coupling complexity (“ACost”), method coupling complexity (“MCost”), dynamic coupling complexity (“DCost”), and overall stubbing complexity (“OCplx”) of the generated CITOs. The last column, labeled as “T/T”, represents the ratio of the average execution time without to that with dynamic dependencies. Firstly, regarding Stubs, we observe that “RL” generates more stubs than the other systems, except for notepad_spl, which lacks dynamic dependencies. This disparity arises because these systems possess varying degrees of dynamic dependencies, whereas “RL” disregards the statistics on the number of stubs required for dynamic dependencies. Accounting for dynamic dependencies complements the necessary stubbing cost, while minimizing the total CITOs cost by constructing fewer stubs. For instance, in the case of ANT, “RL” incurs higher ACost, MCost, and OCplx than the conventional method “RL”, which disregards dynamic dependencies. In fact, on average, ANT requires 15 stubs for simulating dynamic dependencies, which involves 63 dynamic dependencies. This simulation necessitates a dynamic coupling complexity of 8.99. Using the weight value considering dynamic dependency for calculation, the overall stubbing complexity becomes 7.73, much higher than the proposed overall stubbing complexity of 5.60, which belongs to “RL” considering dynamic couplings. This indicates that the proposed method is more efficient in reducing overall CITOs’ costs. In terms of runtime, we have observed that the time difference between the methods that consider dynamic couplings and those that only consider static coupling is insignificant. Performance of Our RL Method Combined with Dynamic Couplings. To evaluate our RL method’s performance, we contrast it with two dynamiccoupling-aware graph-based techniques (test-level method [14] and Class-HITS method [15]), and three search-based methods that do not consider dynamic couplings. (GA [9], RIA [21], and PSO methods [22]). Notably, as per our knowledge, all existing search-based methods didn’t account for dynamic couplings.

104

Y. Ding et al.

Table 4. Comparison of results without and with the consideration of dynamic couplings. Programs

RL RL’ T/T’ Stubs ACost MCost OCplx Stubs ACost MCost DCost OCplx

ANT

11

2.58

1.5

2.4

19

4.46

3.24

4.33

5.6

0.96

Email-spl

5

0.2

0.39

0.38

6

0.18

0.4

1

1.01

0.97

DNS

10

1.92

1.47

2.02

10

2.5

1.81

0

1.02

1.01

jboss

4

1.5

0.44

0.86

4

1.5

0.44

0

0.35

0.99

5.35

2.86

5.19

46

5.15

2.87

0

5.05

0.98

notepad_spl 46

Test-level method [14] and Class-HITS method [15] conducted experiments on ANT and DNS. A comparison between these methods and ours for ANT and DNS is shown in Table 5. “Stubs” indicates the number of stubs, “OCplx” depicts the overall stubbing complexity of CITOs, “N uma ” represents attributes need to be simulated, and “N uma ” signifies methods need to be simulated. Table 5. Comparison between graph-based methods and our proposed method Programs Methods

Stubs OCplx Num_a Num_m

ANT

Method [14] 23 Method [15] 30 Ours 19

3.84 3.65 5.52

87 65 142

305 187 45

DNS

Method [14] 6 Method [15] 6 Ours 11

1.47 1.47 1.01

11 11 30

19 19 18

For ANT, our method yields fewer stubs but higher construction complexity. This arises as prior graph-based methods overlooked varying stubbing costs for dynamic dependencies. They mainly addressed dynamic dependencies from associations and inheritance, neglecting polymorphic invocation coupling measurement. Programs like ANT, with multiple dynamic dependencies forming cycles, require breaking more dynamic dependencies to eliminate all cycles. Thus, ignoring dynamic coupling calculation results in substantial cost disparity. For DNS, we constructed a large number of stubs with lower overall complexity. Compared to ANT, DNS requires simulating fewer attributes and methods, resulting in a smaller difference between ANT s. Moreover, DNS does not result in redundant cycles due to dynamic dependencies. Although our approach considers the cost of constructing stubs for dynamic couplings in DNS, it still incurs less overall stubbing costs compared to previous methods. Search-based methods are prevalent for CITOs but frequently overlook dynamic dependencies. We examined three notable search-based methods: GA

Reinforcement Learning for Class Integration Test Order Generation

105

[9], PSO [22], and RIA [21]. These methods employ enhanced fitness functions guiding population evolution toward lower-cost CITOs. Our study improved fitness function calculation, incorporating dynamic dependency measurements for evolution guidance. Results combining static and dynamic dependencies are in Table 6. The left term “St” indicates static dependencies consideration, while right “Dy” signifies dynamic dependencies inclusion. Table 6. Comparison between search-based methods and our proposed method Programs ANT

St Dy Methods Runtime Stubs ACost MCost OCplx Methods Runtime Stubs ACost MCost DCost OCplx

GA PSO RIA RL DNS GA PSO RIA RL Email-spl GA PSO RIA RL jboss GA PSO RIA RL notepad_spl GA PSO RIA RL

137 2777 1447 29204 3264 9763 92381 181266 384 4121 8657 97994 2745 7761 368927 278292 2309 6751 108441 181847

17 14 33 11 42 37 114 10 16 17 32 5 19 16 52 4 55 57 70 46

2.24 1.46 9.65 2.58 5.97 5.2 31.23 1.92 0.4 0.37 1.57 0.2 8.7 7.55 23.6 1.5 3.65 2.65 21.55 5.05

2.91 2.35 6.56 1.5 5.17 5.56 18.03 1.47 0.76 0.69 2.9 0.39 1.83 1.56 6.73 0.44 2.06 2.04 2.66 2.83

2.79 2.04 9.38 2.4 6.58 6.23 28.82 2.02 0.73 0.68 2.63 0.38 4.3 4 12.97 0.86 3.82 3.19 16.6 4.99

GA’ PSO’ RIA’ RL’ GA’ PSO’ RIA’ RL’ GA’ PSO’ RIA’ RL’ GA’ PSO’ RIA’ RL’ GA’ PSO’ RIA’ RL’

175 4262 1891 30518 4333 18185 129038 186586 423 3939 12289 96967 3254 7732 458631 280250 2556 6787 122292 178516

31 30 45 19 106 75 199 10 16 18 33 6 20 14 57 4 55 55 75 46

2.05 6.94 10.16 4.46 9.03 20.18 29.42 2.5 0.36 0.32 1.29 0.18 8.9 6.85 26.2 1.5 3.55 2.75 20.6 5.15

2.61 4.48 6.43 3.24 6.45 9.62 16.53 1.81 0.92 1.25 2.72 0.4 1.89 1.21 6.77 0.44 1.92 2.05 2.49 2.87

10.63 1.01 8.26 4.33 21.12 7.1 40.02 0 0.9 0 0.6 1 0.5 0 0.8 0 0 0 0 0

10.22 4.33 12.03 5.6 22.82 13.15 46.6 1.02 1.2 0.62 1.96 1.01 2.26 1.47 6.39 0.35 3.7 3.26 15.93 5.05

Considering dynamic couplings, GA’s runtime is only 1.06–1.32 times longer than without them. RL’s longer runtime stems from a large number of iterations (2 million) and extensive Q-value storage, though memory clears in later iterations, slightly lengthening runtime. In contrast, GA performs 100 iterations, using an old-new switch to trim memory usage and cleanup time. In the future, we aim to cut runtime while minimizing stubbing costs. Regarding Stubs, “RL” and “RL” require fewer than other methods. For ACost and MCost, “RL” generates CITOs with the lowest ACost for three out of five programs and the lowest MCost for four programs. The overall stubbing complexities of three programs are the lowest, followed by the PSO method. Hence, “RL” is most effective in simulating static dependencies. Considering dynamic couplings, “RL” outperforms others in three programs’ ACost and MCost, except notepad_spl. Others’ results vary, but RL’s remain consistent. For ANT and email_spl, the PSO method performs best. Despite this fact that for ANT, PSO and PSO’ calculate 16 and 12 additional stubs compared to our methods “RL” and “RL”, respectively, increasing the overall stubbing complexity by 2.29. On the other hand, for email_spl, the overall stubbing com-

106

Y. Ding et al.

plexity is reduced by 0.06 since the method assigns different weights to dynamic dependencies based on their complexity. For DNS and jboss, our method “RL” significantly outperforms other methods in terms of results, especially for programs with multiple dependencies that do not construct redundant cycles. In these cases, our method can avoid constructing redundant stubs that simulate dynamic dependencies, ensuring a balanced simulation of static and dynamic dependencies. Based on these results, our approach, which incorporates the stubbing cost measurement of dynamic couplings, exhibits several advantages when compared to other methods. Specifically, it effectively balances the various types of dependencies and generates CITOs with minimal stubbing cost.

5

Conclusions

This paper presents a novel RL method to generate CITOs combined with a new measurement of the complexity involved in constructing stubs for dynamic couplings. To achieve this, we first redefine dynamic dependencies to provide a more comprehensive description of inter-class relationships and measure dynamic couplings. Then, we integrate this measurement strategy of dynamic and static coupling to propose a new method for calculating the overall stubbing complexity. Finally, we apply this new overall stubbing complexity calculation method to the RL algorithm to generate CITOs. The experimental results demonstrate that our method can measure the stubbing complexity of CITOs more accurately and minimize the overall stubbing cost of CITOs as much as possible. In the future, we will extend our method to more real programs to better explore the impact of dynamic couplings on the generation of CITOs. Acknowledgements. This work is supported by “the Fundamental Research Funds for the Central Universities” under grant No. 2022XSCX18.

References 1. Vinyals, O., et al.: Grandmaster level in starcraft ii using multi-agent reinforcement learning. Nature 575(7782), 350–354 (2019) 2. Luong, N.C., et al.: Applications of deep reinforcement learning in communications and networking: a survey. IEEE Commun. Surv. Tutor. 21(4), 3133–3174 (2019) 3. Spieker, H., Gotlieb, A., Marijan, D., Mossige, M.: Reinforcement learning for automatic test case prioritization and selection in continuous integration. In: ACM SIGSOFT International Symposium on Software Testing and Analysis, pp. 12–22 (2017) 4. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT press, Massachusetts (2018) 5. Czibula, G., Czibula, I.G., Marian, Z.: An effective approach for determining the class integration test order using reinforcement learning. Appl. Soft Comput. 65, 517–530 (2018) 6. Ding, Y., Zhang, Y., Shujuan, J., Yuan, G., Wang, R., Qian, J.: Generation method of class integration test order based on reinforcement learning (in Chinese). J. Softw. 33(5), 1674–1698 (2022)

Reinforcement Learning for Class Integration Test Order Generation

107

7. Stopford, B.: Test-oriented languages: is it time for a new era? In: International Conference on Software Testing, Verification and Validation Workshops, pp. 444– 449 (2011) 8. Hashim, N.L., Schmidt, H.W., Ramakrishnan, S.: Test order for class-based integration testing of java applications. In: International Conference on Quality Software, pp. 11–18 (2005) 9. Briand, L.C., Feng, J., Labiche, Y.: Using genetic algorithms and coupling measures to devise optimal integration test orders. In: International Conference on Software Engineering and Knowledge Engineering, pp. 43–50 (2002) 10. Labiche, Y., Thévenod-Fosse, P., Waeselynck, H., Durand, M.H.: Testing levels for object-oriented software. In: International Conference on Software Engineering, pp. 136–145 (2000) 11. Malloy, B.A., Clarke, P.J., Lloyd, E.L.: A parameterized cost model to order classes for class-based testing of c++ applications. In: International Symposium on Software Reliability Engineering, pp. 353–364 (2003) 12. Kraft, N.A., Lloyd, E.L., Malloy, B.A., Clarke, P.J.: The implementation of an extensible system for comparison and visualization of class ordering methodologies. J. Syst. Softw. 79(8), 1092–1109 (2006) 13. Bansal, P., Sabharwal, S., Sidhu, P.: An investigation of strategies for finding test order during integration testing of object oriented applications. In: International Conference on Methods and Models in Computer Science, pp. 1–8 (2009) 14. Zhang, Y., Jiang, S., Yuan, G., Ju, X., Zhang, H.: An approach of class integration test order determination based on test levels. Software: Pract. Exper. 45(5), 657– 687 (2015) 15. Wang, Y., Zhu, Z., Yu, H., Yang, B.: Risk analysis on multi-granular flow network for software integration testing (in Chinese). IEEE Trans. Circuits-II 65(8), 1059– 1063 (2017) 16. Meng, F., Wang, Y., Yu, H., Zhu, Z.: Devising optimal integration test orders using cost-benefit analysis. Front. Inform. Tech. El. 23(5), 692–714 (2022) 17. Kung, D.C., Gao, J., Hsia, P., Lin, J., Toyoshima, Y.: Class firewall, test order, and regression testing of object-oriented programs. J. Object-Oriented Prog. 8(2), 51–65 (1995) 18. Tai, K.C., Daniels, F.J.: Test order for inter-class integration testing of objectoriented software. In: International Computer Software and Applications Conference, pp. 602–607 (1997) 19. Le Traon, Y., Jéron, T., Jézéquel, J.M., Morel, P.: Efficient object-oriented integration and regression testing. IEEE Trans. Reliab. 49(1), 12–25 (2000) 20. Briand, L.C., Labiche, Y., Wang, Y.: Revisiting strategies for ordering class integration testing in the presence of dependency cycles. In: International Symposium on Software Reliability Engineering, pp. 287–296 (2001) 21. Wang, Z., Li, B., Wang, L., Li, Q.: An effective approach for automatic generation of class integration test order. In: Computer Software and Applications Conference, pp. 680–681 (2011) 22. Zhang, Y., Jiang, S., Ding, Y., Yuan, G., Liu, J., Lu, D., Qian, J.: Generating optimal class integration test orders using genetic algorithms. Int. J. Softw. Eng. Know. 32(06), 871–892 (2022) 23. Ogihara, M.: Interfaces, inheritance, and polymorphism, pp. 427–455. Springer International Publishing, Cham (2018) 24. He, D., Xu, J., Chen, X.: Information-theoretic-entropy based weight aggregation method in multiple-attribute group decision-making. Entropy 18(6), 171 (2016) 25. Watkins, C.J.C.H., Dayan, P.: Q-learning. Mach. Learn. 8(3–4), 279–292 (1992)

A Novel Machine Learning Model Using CNN-LSTM Parallel Networks for Predicting Ship Fuel Consumption Xinyu Li1 , Yi Zuo1(B) , Tieshan Li2 , and C. L. Philip Chen3 1 Dalian Maritime University, Dalian 116000, China

{lxy503,zuo}@dlmu.edu.cn

2 University of Electronic Science and Technology of China, Chengdu 610000, China 3 South China University of Technology, Guangzhou 510000, China

[email protected]

Abstract. With continuous increasing of carbon emission, prediction of ship fuel consumption is gaining significance in reduction of energy consumption and emissions for ships. This paper proposes a novel model of parallel network by combining convolutional neural network and long short-term memory (CNN-LSTM). The proposed model integrates three advantages. The CNN part of proposed model can extract spatial features, the LSTM part of proposed model can capture temporal relationships, and the parallel structure of proposed model can obtain feature fusion from both of CNN and LSTM based on multi-source data. Experimental outcomes reveal that CNN-LSTM parallel networks can obtain best results of MAE and RMSE, which outperformed single LSTM, single CNN and other neural networks with decreasing of 48.06%, 64.06% and 48.56% in MAE, and 35.71%, 58.25% and 37.85% in RMSE. Keywords: Ship Fuel Consumption · Neural Network · CNN · LSTM · Parallel Networks · Prediction

1 Introduction Shipping is one of the most significant sectors to promoting the growth of oversea trade and the development of global transportation [1]. With increasing of maritime transportation, the accompanying carbon emissions and environmental pollution should be also brought to attention [2]. Shipping urgently needs to operate with improved cost efficiency due to the rising oil prices and the cost of ship fuel accounting for a significant portion of running expenses [3]. It is required to achieve reduction of fuel consumption and environmental impact, and then promote the development of shipping sector in the direction of low carbon and low pollution. Prediction of ship fuel consumption (SFC) has been widely accepted as an important research topic [4]. One of the most popular methods is to learn prediction models based on artificial neural networks (ANN) [5]. Farag et al. proposed an ANN-based model to predict SFC © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024 B. Luo et al. (Eds.): ICONIP 2023, LNCS 14448, pp. 108–118, 2024. https://doi.org/10.1007/978-981-99-8082-6_9

A Novel Machine Learning Model Using CNN-LSTM Parallel Networks

109

[6]. Then, a novel machine learning model of combining ANNs was proposed SFC prediction [7]. Here, long short-term memory (LSTM) [8] and convolutional neural network (CNN) [9] were used under background of liquefied petroleum gas (LPG) carrier as a case ship. Based on the comprehensive consideration of the time series and nonlinear characteristics of SFC data, this paper utilizes different ANNs and mechanism of parallel learning to achieve multi-source data fusion [10]. The proposed model integrates three advantages. The CNN part of proposed model is used as first layer to extract spatial features from input data, the LSTM part of proposed model is used as second layer to capture temporal relationships from the same input data, and the parallel structure of proposed model can obtain feature fusion from both layers of CNN and LSTM based on multisource data. We also design three modules as data acquisition, data processing and SFC prediction. In data acquisition, multi-source SFC data are gathered from four sources. In data processing, we perform operations such as data cleaning, feature selection and data normalization on the collected data. In SFC prediction, the proposed CNN-LSTM parallel networks is compared with single LSTM, single CNN and other ANNs. The results indicate that our model can obtain the best MAE and RMSE for SFC prediction in the actual voyage than other baseline methods. Therefore, the proposed model can significantly enhance energy efficiency of the ship and reduce operating expenses and emissions. The remaining part of the article can be structured in following way. Section 2 gives a concise summary of relevant research of SFC prediction methods. Section 3 introduces the data processing and modeling techniques used for SFC prediction. Section 4 presents a comparative experiment and a subsequent discussion of the results. Finally, Sect. 5 provides the conclusion.

2 Related Study Early studies have been carried out on prediction of SFC using statistical methods. Medina et al. provided semi-empirical formulas to predict fuel efficiency of the case ship considering the influence of the wind and wave correlation on Beaufort scale [11]. Bialystocki et al. offered a solution to obtain an accurate SFC and speed curve by statistically analyzing the data related to SFC [12]. Then, many machine learning techniques were also included to enhance prediction accuracy of SFC. ANN is widely used in SFC prediction [13]. Peng et al. developed five ANN methods to predict SFC [14]. Panapakidis et al. combined traditional ANN and deep neural networks to predict SFC [15]. Recently, LSTM and CNN have also attracted much attention from academia and industry [16, 17]. Nevertheless, both CNN and LSTM have their limitations of accurately predicting issues on multi-source and time-series SFC data [18]. Therefore, this article developed a novel machine learning model with CNN-LSTM parallel networks by combining ANN, LSTM and CNN models and conducts a comparative study of SFC prediction with the basic three models.

110

X. Li et al.

3 Methodology 3.1 Overview of SFC Prediction SFC prediction of this paper is illustrated in Fig. 1. Firstly, to consider both complexity of the own engine power system and maritime and meteorological conditions during navigation, multi-source data collected in this paper that affect SFC cover cabin log data, sensor data, as well as available oceanographic and meteorological data [19]. Due to factors such as sensor failure, human error, etc., there may be outliers in the collected data. Moreover, the raw data is a complex multivariate time series, this article implements data analysis and processing through three procedures, namely data cleaning, features selection, and data normalization.

Fig. 1. Overview of SFC prediction process.

3.2 Modeling of SFC Prediction The constructed CNN-LSTM parallel network for SFC prediction is obtained by combining three models of CNN, LSTM and ANN through the concatenate function, depicted in Fig. 2. The blue dashed box indicates CNN component of the model, which is composed of trainable multi-layer architectures such as convolutional layer, pooling layer, and full connected layer. The convolution layer extracts local features through convolution operation. The procedure can be represented by Eq. (1). Yc denotes output of convolution layer, f indicates activation function, Wc denotes weights, ⊗ indicates convolution operator, Xc denotes the input multi-source data of SFC, and b denotes the bias values. Yc = f (Wc Xc + bc )

(1)

After the convolution layer, a maximum pooling layer is connected to realize the dimensionality reduction and further feature extraction by summarizing the features acquired through the process of convolution. The procedure is expressed by Eq. (2), where YP indicates the pooling layer output, and Pool max denotes the maximum pooling function. The characteristics of input SFC data acquired after pooling procedure

A Novel Machine Learning Model Using CNN-LSTM Parallel Networks

111

Fig. 2. Modeling of SFC based on CNN-LSTM parallel networks.

are ultimately aggregated by the full connection layer to complete the output of CNN component. Yp = Poolmax (Yc )

(2)

The orange dashed box in Fig. 2 indicates the LSTM component in the SFC prediction model. Compared with CNN, LSTM considers the correlation between SFC-related data in different time series, which help to improve the reliability of model prediction. From Fig. 2 it is evident that specific composition of the LSTM cell consists of input gate ik , forget gate fk , and output gate ok . The specific calculation process is shown in Eqs. (3)– (5). For input gate ik , the activation value hk−1 of previous moment hidden layer and the current input value xk are used to identify additional information that can be incorporated into cell state. The forget gate fk decides which information need to be eliminated from the cell state ck−1 . In addition, the output gate ok controls what information from hk−1 , xk , ck is passed to the hidden layer state hk . Eqs. (6), (7) represent the solution process of ck and hk , respectively [8].     (3) ik = σ Wi hk−1 , xk + bi     fk = σ Wf hk−1 , xk + bf

(4)

    ok = σ Wo hk−1 , xk + bo

(5)

    ck = fk ck−1 + ik tanh We hk−1 , xk + be

(6)

hk = ok tanh(ck )

(7)

where xk denotes the input data at moment k, σ is the activation function, Wi , Wf , Wo and We indicate the weight matrices, bi , bf , bo and be denote the bias values. The structure of CNN-LSTM parallel networks can achieve effective fusion, which not only takes advantage of CNN ability to capture spatial characteristics of SFC input

112

X. Li et al.

data, but also inherits the superiority of LSTM ability to capture the correlation between a series and a sequence, as well as long- and short-term characteristics of SFC input data. As shown in the dotted green line in Fig. 2, the fully connected neural network, a type of ANN, is connected behind the CNN and LSTM, respectively. Then the extracted features are fused by concatenate function. Finally, the fully connected neural network is employed to process data after concatenate function fusion to acquire forecast outputs. The learning ability of network can be improved by adjusting layer count of CNN and LSTM. Furthermore, every CNN layer is connected behind a maximum pooling layer to limit parameter count. To avoid overfitting, a dropout layer is added after every LSTM layer [10].

4 Experiments 4.1 Data Processing Data Source. The data utilized in this article come from an LPG carrier with capacity of 54340 DWT. And its Length Overall and Molded Width are 225 m and 37 m, respectively [20]. The raw dataset was collected continuously for a period of 7 months, including 28 features and 18499 data points. By calculating the number of count, mean, standard deviation, extreme value and quartile of each feature variable in the dataset, the basic statistical analysis is obtained as shown in Table 1.

Table 1. Statistical analysis of source data. Feature

Mean

Std

Min

25%

50%

75%

Max

78.0

17.4

−1.0

74.9

85.3

86.5

90.0

1399.3

426.5

−1.0

1135.7

1563.4

1697.4

2000.0

SOG (knots)

14.2

3.4

0.0

13.5

15.2

16.1

19.2

AT (°C)

34.9

7.7

2.9

33.5

35.7

37.2

150.0

Trim (m)

2.0

1.2

0.5

0.7

2.6

3.0

4.8

RPM (r/min) SFC (mt)

Data Cleaning. First remove the obvious outliers in the original data according to the actual sailing conditions, such as the minimum values of RPM, SFC and SOG in Table 1. Taking into account the practical situation of navigation, a method of setting thresholds on SOG and RPM was employed to clean the data. From the perspective of data analysis, two reasonable threshold ranges are set corresponding to 60–90 r/min and 7–20 kn respectively. Data points for RPM and SOG outside the threshold ranges are

A Novel Machine Learning Model Using CNN-LSTM Parallel Networks

113

then removed. Simultaneously, the data points related to other features corresponding to the same time outside this range are also filtered out. Feature Selection. In terms of data type, certain features that correspond to data that cannot be precisely determined within a specific range will be removed [21], such as propeller slip, wind sea waves, swell and wind. In addition, the features related to the following two cases also need to be screened out. (1) The same features collected from different sources, such as course over ground provided by electronic chart display and information system and measured by ship automation system. (2) Features that can be converted to each other according to the formula, such as wind speed can be calculated from Uwind, Vwind and wind direction. After the above selection of features, 15 feature variables are retained as input and SFC as output, as shown in Table 2.

Table 2. The abbreviations of SFC feature variables. No

Feature variables

Abbr

No

Feature variables

Abbr

1

Ship ME total FC

SFC

9

Uwind

Uwind

2

Revolution per minute

RPM

10

Vwind

Vwind

3

Speed over ground

SOG

11

Significant wave height

SWH

4

Course over ground

COG

12

Wave direction

WD

5

Ambient air temperature

AT

13

Wave period

WP

6

Sea water temperature

SWT

14

Sea direction

SD

7

Trim

Trim

15

Sea current speed

SCS

8

List

List

Data Normalization. The Min-Max normalization is applied for scaling data. Let xiMin and xiMax denote the lower and upper bounds of the feature variable. To acquire the value xi , the approach transforms the raw value xi of feature variable to the range between 0 and 1, as demonstrated by the following formula. xi =

xi −xi Min  xi −xi Max Min

(8)

After fine filtering of data, data points count is decreased to 17287. Finally, the normalized data is segmented into 70% training, 10% validation and 20% test set employing a randomly sampling approach.

4.2 Numerical Experiments Evaluation Indicator. This study requires an evaluation of the prognostic performance of SFC model, therefore the mean absolute error (MAE), root mean square error (RMSE) and R2 are applied as depicted by Eqs. (9), (10) and (11). Where yi is predicted SFC

114

X. Li et al.

value, yi denotes actual SFC value, yi indicate average value and m signifies test data points count. MAE =

1 m

 RMSE =

m    i=1

1 m

R2 = 1 −

 yi − yi 

m  2  yi − yi

(9)

(10)

i=1 m i

m i

(yi −yi )2 (yi −yi )2

(11)

Comparison with SFC Methods. To objectively evaluate the predictive capabilities of various SFC models, simulation training using default parameters in the Python programming language. The parameter settings for the CNN, ANN, LSTM, and CNN-LSTM parallel networks are shown in Table 3. Additionally, the time step for all models were set to 4 based on our experience. The R2 values for different models are discussed as shown in Fig. 3. Table 3. Parameter settings of models. Prediction model

Parameter setting

CNN

Quantity of convolution layers = 2; Quantity of convolution kernel = (4, 8); Activation function = Sigmoid

ANN

Quantity of hidden layers = 2; Quantity of neurons = (50, 40); Activation function = Relu

LSTM

Quantity of hidden layers = 2; Quantity of neurons = (50, 40); Activation function = Tanh

CNN-LSTM parallel networks

Synthesize the parameters of all the above three single models

Figure 3 compares the test results of predicted SFC values for each model with actual SFC values. Consequently, the prediction result of CNN-LSTM parallel networks is the closest to reality, with the highest R2 reaching 0.9702. Therefore, the proposed model can predict SFC more accurately. 4.3 Parameter Tuning and Optimization For above four SFC prediction models, the optimal setting for every parameter is established by evaluating the MAE and RMSE of test set. The outcomes of each model are averaged over 100 runs to avoid overtraining and validate generalizations. Early stopping approach was adopted to terminate training when the training loss was less than 50 epochs to avoid overfitting. For CNN prediction models, the impact of the quantity of convolution layers and kernel on results were discussed. The range of quantity of convolution layers between

A Novel Machine Learning Model Using CNN-LSTM Parallel Networks

115

=

Fig. 3. Prediction accuracy of different models.

1 and 3 was compared, and the number of convolution kernel in every layer was set to 4, 8, 16 respectively; For ANN and LSTM prediction models, time step, the quantity of hidden layers and neurons were discussed. The quantity of hidden layers is adjusted from 1 to 5, and the quantity of neurons contained in every layer is 50, 40, 40, 20, 20, in order. For CNN-LSTM parallel networks prediction model, all parameters involved in the above models need to be adjusted. Specifically, the number of convolution layers and kernel was first explored, and parameter optimization method adopted was the same as that in CNN model. Multiple experiments show that when the number of convolutional kernels is 16, the effect is the best with MAE and RMSE reaching 0.0161 and 0.0243. Then the corresponding parameters of LSTM part are discussed under the optimal parameters of CNN component. In addition, for all of the above models, the optimization process involves setting time step parameters within the range of 1–10. Table 4 presents a summary of the experimental results. Table 4 statistics the maximum, average and minimum RMSE values of every model. The optimal parameter combination (number of convolution layers, quantity of hidden layers, time step) corresponding to the minimum RMSE is also attached. It can be seen more clearly that the predictive capability of CNN-LSTM parallel networks is superior to the other three single models and has a significant improvement in prediction performance. The minimum MAE of CNN-LSTM parallel networks can reach 0.0161, which is 48.06%, 64.06% and 48.56% higher than that of CNN, ANN and LSTM. The minimum RMSE of CNN-LSTM parallel networks can reach 0.0243, which is 35.71%,

116

X. Li et al. Table 4. The result of parameter tuning and optimization.

Model

Max MAE

RMSE

MAE

Average RMSE

MAE

Min RMSE

Optimal parameter

CNN

0.1817

0.1996

0.1311

0.1471

0.0310

0.0378

(1, 0, 4)

ANN

0.1162

0.1453

0.0778

0.0918

0.0448

0.0582

(0, 2, 3)

LSTM

0.1374

0.1522

0.0595

0.0686

0.0313

0.0391

(0, 4, 6)

CNN-LSTM parallel networks

0.1166

0.1243

0.0558

0.0672

0.0161

0.0243

(3, 5, 4)

Fig. 4. The structure of CNN-LSTM Parallel Networks.

58.25% and 37.85% higher than that of CNN, ANN and LSTM. The optimal parameter in this case is (3, 5, 4) and the network architecture as depicted in Fig. 4. It can be found from Fig. 4 that the input and output dimensions of each part of the proposed model. The path on the left is CNN and the path on the right is LSTM. The data processed by the two paths

A Novel Machine Learning Model Using CNN-LSTM Parallel Networks

117

are represented in the form of vectors, and the final vector representation is obtained by concatenation. Finally output through the full connection layer of ANN. It turns out that the variant outperforms the single CNN, ANN and LSTM. During the parameter tuning process, the minimum MAE and RMSE of each model and its corresponding variance are shown in Fig. 5. The observation shows that the optimal parameter combination has the highest accuracy and the overall prediction is relatively stable. Therefore, CNN-LSTM parallel networks model is the best choice for SFC prediction in actual sailing.

Fig. 5. The result of parameter tuning and optimization.

5 Conclusions To strengthen the study of capturing both short and long term characteristics and time series properties of SFC data, and improve prediction accuracy, this paper aims to identify a suitable combination of single CNN, ANN, and LSTM models to enhance the current SFC prediction models. Consequently, a novel machine learning model with CNNLSTM parallel networks for SFC prediction is developed. The proposed model integrates advantages of three single models, such as based on the comprehensive consideration of the time series and nonlinear characteristics of SFC data through CNN and LSTM, and utilizing ANN and mechanism of parallel learning to achieve multi-source data fusion. Additionally, based on the multi-source SFC data of LPG, the validity of CNN-LSTM parallel networks model is verified. In comparison to single CNN, ANN and LSTM, CNN-LSTM parallel networks has the smallest RMSE, which decreased by 5.81%, 58.25% and 37.85%, respectively. CNN-LSTM parallel networks not only realizes the optimal fitting effect of real SFC, but further evidences the combined model exhibits superior performance. In future work, we plan to further refine the proposed model by incorporating attention mechanisms to obtain better results. Acknowledgments. Partial support for this work was provided by the National Natural Science Foundation of China (grant nos. 52131101, 51939001), the LiaoNing Revitalization Talents Program (grant no. XLYC1807046) and the Science and Technology Fund for Distinguished Young Scholars of Dalian (grant no. 2021RJ08).

118

X. Li et al.

References 1. Ailong, F., Jian, Y., Liu, Y.: A review of ship fuel consumption models. Ocean Eng. 264, 112405 (2022) 2. Wang, S., Psaraftis, H.N., Qi, J.: Paradox of international maritime organization’s carbon intensity indicator. Commun. Transp. Res. 1, 100005 (2021) 3. Wan, Z., El Makhloufi, A., Chen, Y.: Decarbonizing the international shipping industry: solutions and policy recommendations. Mar. Pollut. Bull. 126, 428–435 (2018) 4. Ran, Y., Wang, S., Harilaos, N.: Data analytics for fuel consumption management in maritime transportation: status and perspectives. Transp. Res. Part E: Logist. Transp. Rev. 155, 102489 (2021) 5. Zhu, Y., Zuo, Y., Li, T.: Modeling of ship fuel consumption based on multisource and heterogeneous data: case study of passenger ship. J. Mar. Sci. Eng. 9(3), 273 (2021) 6. Farag, Y., Ölçer, A.I.: The development of a ship performance model in varying operating conditions based on ANN and regression techniques. Ocean Eng. 198, 106972 (2020) 7. Hinton, G.E., Ruslan, R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006) 8. Greff, K., Srivastava, R.K., Koutník, J.: LSTM: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 28(10), 2222–2232 (2016) 9. Shomron, G., Weiser, U.: Spatial correlation and value prediction in convolutional neural networks. IEEE Comput. Archit. Lett. 18(1), 10–13 (2019) 10. Guo, J.: A CNN-Bi_LSTM parallel network approach for train travel time prediction. Knowl.Based Syst. 256, 109796 (2022) 11. Medina, J.R., Molines, J., González-Escrivá, J.A.: Bunker consumption of containerships considering sailing speed and wind conditions. Transp. Res. Part D: Transp. Environ. 87, 102494 (2020) 12. Bialystocki, N., Konovessis, D.: On the estimation of ship’s fuel consumption and speed curve: a statistical approach. J. Ocean Eng. Sci. 1(2), 157–166 (2016) 13. Li, X., Zhu, Y., Zuo, Y.: Prediction of ship fuel consumption based on broad learning system. In: International Conference on Security, Pattern Analysis, and Cybernetics, pp. 54–58. IEEE, Guangzhou, China (2019) 14. Peng, Y., Liu, H., Li, X.: Machine learning method for energy consumption prediction of ships in port considering green ports. J. Clean. Prod. 264, 121564 (2020) 15. Panapakidis, I., Sourtzi, V.M., Dagoumas, A.: Forecasting the fuel consumption of passenger ships with a combination of shallow and deep learning. Electronics 9(5), 776 (2020) 16. Kim, J., Oh, S., Kim, H.: Learning Traffic as Images: tutorial on time series prediction using 1D-CNN and BiLSTM: a case example of peak electricity demand and system marginal price prediction. Eng. Appl. Artif. Intell. 126, 106817 (2023) 17. Zhu, Y., Zuo, Y., Li, T.: Predicting ship fuel consumption based on LSTM neural network. In: 7th International Conference on Information, Cybernetics, and Computational Social Systems, pp. 310–313. IEEE, Guangzhou, China (2020) 18. Ke, J., Zheng, H., Yang, H.: Short-term forecasting of passenger demand under on-demand ride services: a spatio-temporal deep learning approach. Transp. Res. Part C: Emerg. Technol. 85, 591–608 (2017) 19. Li, X., Zuo, Y., Jiang, J.: Application of regression analysis using broad learning system for time-series forecast of ship fuel consumption. Sustainability 15(1), 380 (2023) 20. Vorkapi´c, A., Radonja, R., Martinˇci´c-Ipši´c, S.: Predicting seagoing ship energy efficiency from the operational data. Sensors 21(8), 2832 (2021) 21. 
Jiang, J., Zuo, Y.: Prediction of ship trajectory in nearby port waters based on attention mechanism model. Sustainability 15(9), 7435 (2023)

Two-Stage Attention Model to Solve Large-Scale Traveling Salesman Problems Qi He1 , Feng Wang1(B) , and Jingge Song2 1

School of Computer Science, Wuhan University, Wuhan, China {heqi_049,fengwang}@whu.edu.cn 2 Hongyi Honor College, Wuhan University, Wuhan, China [email protected]

Abstract. The Traveling Salesman Problem (TSP) widely exists in realworld scenarios. Various methods, such as exact methods, heuristic methods, and deep learning-based methods, can solve TSPs efficiently. However, as the size of the problems increases, these methods become increasingly time-consuming due to the high complexity of large-scale TSPs. This paper proposes a two-stage attention model (TSAM) that incorporates the divide-and-conquer strategy and attention model to solve largescale TSPs efficiently. Experimental results demonstrate that TSAM can rapidly produce promising solutions for TSP instances ranging from 500 to 10,000 nodes.

Keywords: Large-Scale TSP Mechanism

1

· Reinforcement Learning · Attention

Introduction

The Traveling Salesman Problem (TSP) is a classical problem in the field of Combinatorial Optimization (CO), and it has practical applications in various domains, including genetic sequencing, route planning, and electronic device layout. Given a set of cities with their positions, a salesman aims to visit each city exactly once and return to the depot. The goal of TSP is to minimize the total distance of the visited tour. Since TSPs widely exist in the real world, researchers are dedicated to designing effective methods to solve them. In the past few decades, exact methods, such as Concorde solver [1,2], and heuristic methods, such as the Lin-Kernighan heuristic (LKH) [3,4], have been proposed for TSPs. Although these methods achieve significant performance, they still face several challenges. Since they require specifically crafted rules and heavily depend on expert knowledge, they are not flexible to be generalized to different problems. Moreover, due to the demand for numerous iterations, they consume a lot of time to find satisfactory solutions, especially when dealing with large-scale problems. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14448, pp. 119–130, 2024. https://doi.org/10.1007/978-981-99-8082-6_10

120

Q. He et al.

In the past ten years, deep learning has achieved significant success in many fields, such as computer vision and natural language processing. Researchers have also proposed many deep learning-based (DL-based) methods to solve CO problems. These methods can partially address the challenges faced by traditional methods. They can be categorized into two main classes: learn-to-construct methods and learn-to-improve methods. The former train a deep neural network (DNN) to construct a solution from scratch step by step, while the latter train the DNN to iteratively improve the current solution. Due to their offline training nature, DL-based methods can save a lot of time during the inference process. Additionally, models trained by these methods can be easily modified to solve various types of CO problems, including TSP, CVRP, and Knapsack Problems. However, DL-based methods require training different models for problems of various sizes, and the training process can be very time-consuming as the problem scale increases. Additionally, extracting features from long sequence inputs with existing techniques is challenging, which leads to a drastic decrease in terms of solution quality when the number of cities exceeds 500. Furthermore, since DNNs are primarily trained on GPUs and GPU memory is often limited, machines may run out of memory when training DNNs for large-scale problems. To address these challenges, we propose a two-stage attention model (TSAM) that combines two different attention models using a divide-and-conquer strategy to solve large-scale TSPs. The primary contributions of our paper are as follows. Firstly, two distinct end-to-end models are designed and trained to solve the large-scale TSP hierarchically. One model follows the architecture proposed by Kool et al. [5] to solve TSPs, while the other model is specifically modified to address open-loop TSPs with fixed source and target nodes. Additionally, an efficient approach is developed to connect the sub-paths generated from the second stage that minimizes the distance between each subpath. As a result, this strategy generates a complete solution for large-scale TSPs. Furthermore, a novel strategy is introduced to enhance the quality of the solution. This approach randomly samples one solution into multiple sub-tours and subsequently reconstruct these sub-tours to build a new solution. It is worthwhile to mention that the traditional POPMUSIC algorithm [6] shares a similar concept with ours, which leverages a divide-and-conquer method to solve large-scale TSPs.

2 Related Works

Existing methods for solving TSPs can be classified as exact methods, heuristic methods, and DL-based methods. In this section, we provide a concise overview of DL-based methods, which can be further categorized as learn-to-construct methods and learn-to-improve methods.

2.1 Learn-to-Construct

Vinyals et al. [7] introduced the Pointer Network (PN) to solve the TSP for the first time. The PN utilizes an encoder-decoder architecture that consists of several LSTM layers. The encoder computes the embedding of each node, and the decoder generates the probability distribution of the nodes to be visited at each time step. Some researchers have discovered the efficiency of reinforcement learning in solving the TSP. Bello et al. [8] suggested training the PN with the asynchronous advantage actor-critic (A3C) [9] without supervised solution labels. Additionally, they designed an active search strategy to further enhance the solution quality.

The success of the Transformer architecture has inspired researchers to employ it to solve the TSP. Deudon et al. [10] and Kool et al. [5] concurrently proposed an attention model (AM) that leverages the Transformer architecture introduced by Vaswani et al. [11]. Kool et al. trained the network using the REINFORCE algorithm with a rollout baseline instead of the actor-critic, which decreases the number of parameters and reduces gradient variance. Other researchers have attempted to leverage intrinsic problem characteristics to improve the performance of the AM. Kwon et al. [12] proposed the POMO model with a shared baseline that exploits the solution symmetry of the TSP. The model first samples trajectories with multiple nodes as starting nodes and utilizes the average reward of these trajectories as the baseline. In addition, they leveraged an instance augmentation [13] to further enhance the solution quality. Subsequently, Kim et al. [14] designed a novel loss function to exploit the symmetry in the TSP.

Combining a Graph Neural Network (GNN) with supervised learning enables generating probabilities for each edge to be included in the optimal solution. Joshi et al. [15] leveraged a Graph Convolutional Network (GCN) to represent the features of the graph. This GCN-based network generates a heatmap indicating the probabilities of edges being part of the optimal tour. A beam search strategy is then employed to construct a promising solution based on the heatmap.

2.2 Learn-to-Improve

The process of local search can be learned using the PN. Wu et al. [16], Costa et al. [17], and Ma et al. [18] leveraged the Transformer [11] architecture to rewrite the solution through iterative improvement. The network learns to generate the probability distribution of node pairs to be rewritten by a specific local operator. Besides, Chen et al. [19], Gao et al. [20], and Lu et al. [21] designed PN-based models to solve the CVRP and achieve satisfactory solution quality.

The GNN has the potential to significantly improve the effectiveness of heuristic methods. Some researchers have explored the integration of GNNs into existing methods, such as the LKH algorithm, by replacing the traditional α-value with the output of a GNN. Xin et al. [22] introduced the NeuroLKH model, which trains a Sparse Graph Network (SGN) to enhance the α-value employed in the LKH algorithm. By training the SGN, the model generates edge scores and node penalties that provide valuable guidance to the LKH algorithm, ultimately improving its performance. Besides, Kool et al. [23] employed a GNN to guide a dynamic programming algorithm.

Notably, Fu et al. [24] leveraged the GCN-based model introduced by Joshi et al. [15] and Monte Carlo Tree Search (MCTS) [25] to solve large-scale TSPs and achieved promising performance on 10,000-node instances. They designed a sampling-and-merging strategy that leverages the concept of divide-and-conquer to construct a heatmap. The MCTS algorithm learns to improve a feasible solution based on the k-opt algorithm and the heatmap.

3 Methods

In this section, we will introduce the TSAM method in three subsections. Firstly, as presented in Subsect. 3.1, we train the two-stage attention model to solve TSP and open-loop TSP separately. Secondly, as shown in Subsect. 3.2, we devise a strategy for constructing complete solutions. Thirdly, as shown in Subsect. 3.3, we develop a sampling strategy to improve the quality of the solution. The pipeline of our method is illustrated in Fig. 1.

Fig. 1. The pipeline of two-stage attention model.

3.1 Two-Stage Attention Model

The large-scale TSP is divided into two sub-problems: a TSP and a set of open-loop TSPs. The models for solving the TSP are introduced by Kool et al. [5] and Kwon et al. [12]. In this section, we focus on the method for solving the open-loop TSP and give a detailed description of the model.


Open-Loop TSP. The open-loop TSP is a variant of the TSP where the salesman is required to visit a set of cities without returning to the starting city. In this problem, the salesman begins the tour at a designated city, travels to each city exactly once, and concludes the tour at a different specified city.

Attention Model. The AM consists of an encoder and a decoder. The encoder can be regarded as a Graph Attention Network (GAT) [26] that contains several Multi-head Attention (MHA) layers and Feed-forward (FF) layers to calculate the embeddings of all nodes. The node embeddings represent information about each node and its relevance to all neighbors. The self-attention mechanism, introduced by Vaswani et al. [11], is the essential element of the MHA. The self-attention is calculated as follows:

$$Q_i^m = W_Q^m X_i, \quad K_i^m = W_K^m X_i, \quad V_i^m = W_V^m X_i \qquad (1)$$

$$\mathrm{MHA}(Q, K, V) = \mathrm{concat}\Big(\mathrm{softmax}\Big(\frac{Q^m {K^m}^T}{\sqrt{d_k}}\Big) V^m\Big) W_O \qquad (2)$$

where $m \in \{1, 2, \ldots, M\}$ indexes the heads of the MHA. A skip-connection [27] and batch normalization [28] are added to alleviate gradient vanishing and explosion:

$$\hat{h}_i = \mathrm{BN}\big(X_i + \mathrm{MHA}(Q_i, K_i, V_i)\big), \quad h_i = \mathrm{BN}\big(\hat{h}_i + \mathrm{FF}(\hat{h}_i)\big) \qquad (3)$$

The decoder uses the attention mechanism to compute the attention weights between the node embeddings and the context embedding, then generates the solution sequence using a greedy strategy. In the decoder, the original context embedding for solving the TSP contains the embedding of the graph, the first visited node $\pi_1$, and the last (previous) visited node $\pi_{t-1}$; $v^1$ and $v^f$ are used as input placeholders when t = 1:

$$h_{context} = \begin{cases} [h_{graph}, h_{\pi_1}, h_{\pi_{t-1}}], & t > 1 \\ [h_{graph}, v^1, v^f], & t = 1 \end{cases} \qquad (4)$$

where $h_{graph} = \bar{h} = \frac{1}{N}\sum_{i=1}^{N} h_i$.

The context embedding in the decoder is modified for solving the open-loop TSP by adding the embeddings of the source and target nodes:

$$h_{context} = \begin{cases} [h_{graph}, h_{source}, h_{\pi_1}, h_{\pi_{t-1}}, h_{target}], & t > 1 \\ [h_{graph}, h_{source}, v^1, v^f, h_{target}], & t = 1 \end{cases} \qquad (5)$$

Training Scheme. Our method follows the REINFORCE algorithm with the shared baseline introduced by Kwon et al. [12]. First, the algorithm samples N − 2 trajectories with distinct starting nodes after visiting the source node. Then, it calculates the average reward across all these trajectories and employs it as the shared baseline $\bar{b}(s)$. Given the trajectories $\{\tau^1, \tau^2, \ldots, \tau^{N-2}\}$, the objective J is optimized using gradient ascent:


$$\nabla_\theta J(\theta) = \mathbb{E}_{p_\theta(\tau^i \mid s)}\big[(R(\tau^i) - \bar{b}(s))\,\nabla_\theta \log p_\theta(\tau^i \mid s)\big] \qquad (6)$$

The shared baseline is calculated as $\bar{b}(s) = \frac{1}{N-2}\sum_{i=1}^{N-2} R(\tau^i)$.
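As a concrete illustration of Eq. (6), the following minimal PyTorch sketch (our own, not the authors' released code; tensor shapes and the function name are assumptions) computes the policy-gradient loss with the shared baseline, where the reward of a trajectory is its negative tour length:

```python
import torch

def reinforce_shared_baseline_loss(log_probs, rewards):
    """REINFORCE loss with a shared baseline (sketch of Eq. 6).

    log_probs: (B, N-2) sum of log-probabilities of each sampled trajectory.
    rewards:   (B, N-2) negative tour length of each trajectory.
    """
    baseline = rewards.mean(dim=1, keepdim=True)    # shared baseline b̄(s) per instance
    advantage = rewards - baseline                  # R(τ^i) − b̄(s)
    # Gradient ascent on J(θ) is implemented as gradient descent on −J(θ);
    # the advantage is detached so gradients only flow through log-probs.
    return -(advantage.detach() * log_probs).mean()
```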

3.2 Solution Construction

A hierarchical architecture is devised to construct the final solution for the large-scale TSP. This architecture consists of clustering, graph normalization, endpoint selection, and tour construction. Algorithm 1 depicts the whole process of solution construction.

Algorithm 1. Solution Construction

Input: coordinates of cities V = {v1, v2, ..., vN}, number of clusters K.
Output: solution π = {π1, π2, ..., πN}.
1: Centroids ← Clustering(V, K).
2: Tour ← AM1(Centroids).
3: Subsets ← EndpointsSelection(Centroids, Tour, K).
4: π ← {}.
5: for i = 1 to K do
6:   if len(Subset_i) ≤ 2 then
7:     add Subset_i to π.
8:   else
9:     Subpath ← AM2(Subset_i).
10:    add Subpath to π.
11:  end if
12: end for
13: return π
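To make the control flow of Algorithm 1 concrete, here is a minimal Python sketch under our own assumptions: the callables `solve_tsp_am`, `solve_open_loop_am`, and `select_endpoints` stand in for the first-stage AM, the second-stage open-loop AM, and Algorithm 2, and are not taken from the paper's code.

```python
import numpy as np
from sklearn.cluster import KMeans

def construct_solution(cities, k, solve_tsp_am, solve_open_loop_am, select_endpoints):
    """Hierarchical solution construction (sketch of Algorithm 1).

    cities: (N, 2) array of coordinates; k: number of clusters.
    """
    kmeans = KMeans(n_clusters=k, n_init=10).fit(cities)
    labels, centroids = kmeans.labels_, kmeans.cluster_centers_

    centroid_tour = solve_tsp_am(centroids)                  # stage 1: TSP over centroids
    subsets = [np.where(labels == c)[0] for c in centroid_tour]
    subsets = select_endpoints(cities, subsets)              # Algorithm 2: fix first/last node

    solution = []
    for subset in subsets:
        if len(subset) <= 2:                                 # too small for the open-loop AM
            solution.extend(np.asarray(subset).tolist())
        else:                                                # stage 2: open-loop TSP in cluster
            solution.extend(solve_open_loop_am(cities[subset], subset))
    return solution
```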

Clustering. The k-means algorithm is employed to partition the nodes into multiple clusters. The algorithm begins by randomly initializing K centroids and then computes the distance between each node and the centroids. Subsequently, it assigns each node to its nearest cluster. The centroids are updated by averaging the coordinates of the nodes assigned to each cluster until convergence is reached. To avoid empty sub-paths, a random point is reassigned to each empty cluster.

Graph Normalization. Since the model is trained on randomly generated instances with a uniform distribution ranging from 0 to 1, normalization is crucial for each cluster to generate promising sub-solutions. The coordinates of the cities in each cluster can be transformed from $(x_i, y_i)$ to $(x_i^{new}, y_i^{new})$ through Min-Max Normalization:

$$x_i^{new} = \frac{x_i - x_{min}}{x_{max} - x_{min}}, \qquad y_i^{new} = \frac{y_i - y_{min}}{y_{max} - y_{min}} \qquad (7)$$

where $x_{min} = \min_{i \in C} x_i$, $x_{max} = \max_{i \in C} x_i$, $y_{min} = \min_{i \in C} y_i$, $y_{max} = \max_{i \in C} y_i$.
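A brief numpy sketch of the per-cluster Min-Max normalization in Eq. (7); this is our own illustration, and the function name and the degenerate-cluster guard are assumptions:

```python
import numpy as np

def normalize_cluster(coords):
    """Min-Max normalize one cluster's coordinates to the unit square (Eq. 7)."""
    mins = coords.min(axis=0)
    maxs = coords.max(axis=0)
    span = np.where(maxs - mins > 0, maxs - mins, 1.0)  # avoid division by zero
    return (coords - mins) / span
```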


Endpoints Selection. To construct the complete solution, the decision on which two endpoints to connect between consecutive clusters is crucial. For endpoint selection, our method starts by calculating the distances between every pair of points in the two clusters. Subsequently, the two points that are closest to each other are chosen as the endpoints of the two clusters. Algorithm 2 describes the method of endpoint selection.

Algorithm 2. Endpoints Selection
Input: initial subsets of nodes Subsets, tour of centroids Tour, number of clusters K.
Output: subsets with selected endpoints Subsets = {Subset_1, ..., Subset_K}.
1: sort Subsets by Tour.
2: for i = 1 to K do
3:   j ← (i + 1) mod K.
4:   calculate the distance matrix D_ij of nodes between Subset_i and Subset_j.
5:   find the node pair {a, b} with the minimum value of D_ij.
6:   move a to the end of Subset_i.
7:   move b to the beginning of Subset_j.
8: end for
9: return Subsets
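A minimal Python sketch of Algorithm 2, again under our own assumptions about data layout (subsets are lists of city indices already sorted by the centroid tour):

```python
import numpy as np

def select_endpoints(cities, subsets):
    """Endpoint selection between consecutive clusters (sketch of Algorithm 2)."""
    k = len(subsets)
    subsets = [list(s) for s in subsets]
    for i in range(k):
        j = (i + 1) % k
        a_pts, b_pts = cities[subsets[i]], cities[subsets[j]]
        # pairwise distance matrix between the two clusters
        d = np.linalg.norm(a_pts[:, None, :] - b_pts[None, :, :], axis=-1)
        ai, bj = np.unravel_index(np.argmin(d), d.shape)
        # move a to the end of Subset_i and b to the beginning of Subset_j
        subsets[i].append(subsets[i].pop(ai))
        subsets[j].insert(0, subsets[j].pop(bj))
    return [np.asarray(s) for s in subsets]
```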

Tour Construction. After obtaining the tour of the centroids, the endpoints of each cluster, and the solutions of the open-loop TSPs, our method connects the sub-paths to construct a complete solution for the large-scale TSP. It is worthwhile to note that the AM for the open-loop TSP is applicable only to input instances with more than two nodes, as the source and target nodes must be included. For clusters containing no more than two nodes, the nodes are directly added to the final solution.

3.3 Improving Strategy

Initially, a complete tour is divided into sub-tours of equal size by randomly selecting a starting index. Subsequently, these sub-tours are reconstructed using the model trained for the open-loop TSP. Finally, sub-tours of the original solution are replaced with their reconstructed counterparts whenever the latter are shorter. This process can be repeated for a specified number of iterations. By employing this approach, incorrect orderings within the sub-tours are effectively rectified, resulting in a shorter final tour.
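The following Python sketch illustrates one possible realization of this improving loop. It is our own interpretation under stated assumptions: `reconstruct_subpath` is a placeholder for the open-loop AM, and the partition size and acceptance rule are simplifications.

```python
import numpy as np

def open_len(path, cities):
    """Length of an open (non-returning) path."""
    pts = cities[path]
    return np.linalg.norm(np.diff(pts, axis=0), axis=1).sum()

def improve_tour(tour, cities, reconstruct_subpath, num_subtours=4, iterations=20):
    """Sampling-and-reconstruction improving strategy (sketch)."""
    def closed_len(t):
        pts = cities[t]
        return np.linalg.norm(pts - np.roll(pts, -1, axis=0), axis=1).sum()

    tour, n = list(tour), len(tour)
    size = n // num_subtours
    for _ in range(iterations):
        start = np.random.randint(n)
        rolled = tour[start:] + tour[:start]             # random starting index
        new_tour = []
        for s in range(0, n, size):
            sub = rolled[s:s + size]
            if len(sub) > 2:
                cand = reconstruct_subpath(sub, cities)  # endpoints kept fixed
                # keep the reconstructed sub-tour only if it is shorter
                sub = cand if open_len(cand, cities) < open_len(sub, cities) else sub
            new_tour.extend(sub)
        if closed_len(new_tour) < closed_len(tour):
            tour = new_tour
    return tour
```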

4 Experiments

Our experiments were conducted on TSP instances with 500, 1000, 2000, 5000, and 10,000 nodes. The results were compared with several learning-based and non-learning methods. To ensure fair comparisons, all methods were executed on a single system comprising an Intel Xeon E5-2640v4 2.4 GHz processor with 20 cores and an Nvidia Tesla V100 GPU with 16 GB of memory.

4.1 Hyper-parameters and Baselines

Comparative experiments were conducted to evaluate the effectiveness of our TSAM method against various baselines, including the Concorde solver [1], the LKH3 solver [4], AM [5], POMO [12], and Att-GCN+MCTS [24]. For the learning-based methods, the hyper-parameters were set according to their respective papers, and the pre-trained models available online were directly utilized to verify their performance. Since the learn-to-improve method requires multiple iterations, which slows the inference process, Att-GCN+MCTS was run in parallel with a batch size of 8 to ensure fair comparisons, while the other learning-based methods were run sequentially. It should be noted that when n = 10000, the number of iterations of the LKH algorithm is set to 1 in order to reduce runtime. Additionally, the beam search width of the AM algorithm is set to 10 in order to prevent out-of-memory issues. The hyper-parameters of TSAM followed those of POMO. Our TSAM model was trained solely on TSP and open-loop TSP instances with 20, 50, and 100 nodes. Furthermore, a customized combination of parameters was developed to efficiently solve problems of different scales. In addition, to balance solution quality and time consumption, the number of iterations for k-means in TSAM is set to 100.

4.2 Datasets Setup

The training set contains batches of 2D-Euclidean TSP instances randomly generated under a uniform distribution ranging from 0 to 1. To ensure fair comparisons, the testing sets were customized to evaluate the performance of all methods. Specifically, we generated 128 instances for the TSP with n = 500 and 1000, and 16 instances for the TSP with n = 2000, 5000, and 10000, respectively.

4.3 Comparative Study

Table 1 shows the absolute results and the relative optimality gap, compared to the Concorde solver, for TSAM and the baselines on 128 TSP instances with n = 500 and 1000. Table 2 presents the absolute results and the relative optimality gap compared to the LKH3 solver on 16 TSP instances with n = 2000, 5000, and 10000. Len. denotes the length of the tour, Gap is defined as (Len. − Opt.)/Opt. × 100%, and Time indicates the total runtime over all instances. To prevent out-of-memory issues, we excluded POMO with ×8 augmentation and AM with sampling in Table 2, and reduced the beam search width of AM to 10 when n = 10000. As illustrated in Table 1, the Concorde solver generates optimal solutions for the TSP with 500 and 1000 nodes, while Att-GCN+MCTS achieves the shortest tours compared to the other DL-based methods. Among all the DL-based methods, TSAM exhibits slightly lower solution quality than Att-GCN+MCTS, but it significantly reduces the runtime, amounting to only one-fifth of the runtime of Att-GCN+MCTS. Table 2 reveals that the LKH3 solver produces the reference solutions for the TSP with 2000, 5000, and 10,000 nodes, while Att-GCN+MCTS


generates the best solutions among all DL-based methods. TSAM achieves the shortest runtime among all methods, taking only 1% of the runtime of LKH3 while maintaining an optimality gap of approximately 12%. Additionally, the improving strategy reduces the gap to about 8% without a significant increase in runtime.

It is worthwhile to note that this paper does not aim to directly surpass exact and heuristic solvers in terms of solution quality. Instead, our focus is on efficiently obtaining promising solutions. Although Att-GCN+MCTS, LKH3, and Concorde outperform TSAM in terms of solution quality, TSAM is tens to hundreds of times faster. The experimental results show that TSAM can achieve solutions with a small optimality gap while maintaining a minute-level runtime.

Table 1. Our results w.r.t. the baselines, tested on 128 instances respectively with n = 500, 1000. The gap % is w.r.t. the Concorde across all methods.

Method          | Type      | n=500 Len. | Gap      | Time      | n=1000 Len. | Gap      | Time
Concorde        | Exact     | 16.0298    | 0.0000%  | 45.87 min | 22.0078     | 0.0000%  | 14.74 h
LKH3 (10 runs)  | Heuristic | 16.4891    | 2.8656%  | 21.64 min | 23.1205     | 5.0560%  | 2.02 h
AM (sampling)   | Construct | 22.5755    | 40.8351% | 3.23 min  | 42.7149     | 94.0896% | 11.17 min
AM (beam 100)   | Construct | 19.4722    | 21.4754% | 1.68 min  | 29.9425     | 36.0542% | 4.85 min
POMO (×8 aug.)  | Construct | 20.1439    | 25.6656% | 1.60 min  | 32.5202     | 47.7666% | 10.43 min
Att-GCN+MCTS    | Improve   | 16.8917    | 5.3770%  | 11.73 min | 23.8957     | 8.5781%  | 24.11 min
TSAM            | Construct | 18.6336    | 16.2435% | 2.41 min  | 26.2303     | 19.1862% | 4.71 min
TSAM (impr.20)  | Construct | 17.9521    | 11.9920% | 3.14 min  | 25.3531     | 15.2005% | 5.63 min

Table 2. Our results w.r.t. the baselines, tested on 16 instances respectively with n = 2000, 5000, 10000. The gap % is w.r.t. the LKH3 across all methods.

Method          | Type      | n=2000 Len. | Gap      | Time     | n=5000 Len. | Gap      | Time      | n=10000 Len. | Gap      | Time
LKH3 (10 runs)  | Heuristic | 32.3674     | 0.0000%  | 1.35 h   | 50.9608     | 0.0000%  | 10.85 h   | 71.7799      | 0.0000%  | 8.47 h*
AM (beam 100)   | Construct | 45.9695     | 42.0241% | 2.30 min | 82.2582     | 61.4149% | 20.97 min | 130.8284     | 82.2632% | 16.87 min*
POMO (no aug.)  | Construct | 50.9726     | 57.4811% | 1.47 min | 88.3271     | 73.3238% | 19.50 min | –            | –        | –
Att-GCN+MCTS    | Improve   | 33.5982     | 3.8024%  | 6.44 min | 53.0587     | 4.1168%  | 21.46 min | 74.6946      | 4.0605%  | 1.53 h
TSAM            | Construct | 36.5634     | 12.9636% | 49.18 s  | 57.4219     | 12.6788% | 1.47 min  | 80.4318      | 12.0533% | 2.59 min
TSAM (impr.50)  | Construct | 34.9274     | 7.9092%  | 1.50 min | 55.0023     | 7.9306%  | 3.63 min  | 78.2354      | 8.9935%  | 13.05 min

* For n = 10000, the LKH3's number of iterations is set to 1, and the beam size of AM is set to 10.

4.4 Ablation Study

The improving strategy can further enhance the solution quality of TSAM. However, as the number of iterations increases, the runtime will also increase.


Therefore, we conduct experiments to compare the effects of the improving strategy with no iteration, 20 iterations, and 50 iterations. According to the results in Table 3, we can select the appropriate number of iterations for TSAM. When n = 500 and 1000, Impr.20 provides promising solutions with a shorter runtime compared to Impr.50. When n = 2000, 5000, and 10000, Impr.50 achieves smaller gaps than Impr.20, and the runtime remains acceptable.

Table 3. The gaps when using the improving strategy with different numbers of iterations.

Size  | No impr. | Impr.20 | Impr.50
500   | 16.24%   | 11.99%  | 11.97%
1000  | 19.19%   | 15.20%  | 15.07%
2000  | 12.96%   | 8.14%   | 7.91%
5000  | 12.68%   | 8.48%   | 7.93%
10000 | 12.05%   | 9.88%   | 8.99%

5 Conclusions

In this paper, we present a two-stage attention model designed to address large-scale TSPs. The model contains two stages: the first stage solves a smaller-scale TSP, while the second stage tackles a group of open-loop TSPs. We then devise an effective approach to construct the final solution. Finally, we apply an improving strategy to enhance the solution quality. Experimental results demonstrate that TSAM can generate promising solutions without a significant increase in runtime as the problem scale increases, which makes the method well-suited for real-time applications in real-world scenarios. In the future, we will work on further enhancing solution quality when solving large-scale TSPs. Furthermore, we will dedicate more effort to exploring how to adapt TSAM to various CO problems.

Acknowledgment. This work is supported by the National Natural Science Foundation of China [Grant Nos. 62173258, 61773296].

References

1. Applegate, D., Bixby, R., Chvatal, V., Cook, W.: Concorde TSP solver (2006)
2. Applegate, D.L., et al.: Certification of an optimal TSP tour through 85,900 cities. In: Operations Research Letters, pp. 11–15 (2009)
3. Helsgaun, K.: General k-opt submoves for the Lin-Kernighan TSP heuristic. Math. Program. Comput. 1, 119–163 (2009). https://doi.org/10.1007/s12532-009-0004-6


4. Helsgaun, K.: An Extension of the Lin-Kernighan-Helsgaun TSP Solver for Constrained Traveling Salesman and Vehicle Routing Problems. Roskilde University, Roskilde, pp. 1–60 (2017)
5. Kool, W., van Hoof, H., Welling, M.: Attention, learn to solve routing problems! In: Proceedings of the 7th International Conference on Learning Representations, pp. 1–25 (2019)
6. Taillard, É.D., Helsgaun, K.: POPMUSIC for the travelling salesman problem. Eur. J. Oper. Res. 272, 420–429 (2019)
7. Vinyals, O., Fortunato, M., Jaitly, N.: Pointer networks. In: Proceedings of the 28th International Conference on Neural Information Processing Systems, pp. 2692–2700 (2015)
8. Bello, I., Pham, H., Le, Q.V., Norouzi, M., Bengio, S.: Neural combinatorial optimization with reinforcement learning. arXiv preprint arXiv:1611.09940 (2016)
9. Mnih, V., et al.: Asynchronous methods for deep reinforcement learning. In: International Conference on Machine Learning, pp. 1928–1937 (2016)
10. Deudon, M., Cournut, P., Lacoste, A., Adulyasak, Y., Rousseau, L.-M.: Learning heuristics for the TSP by policy gradient. In: van Hoeve, W.-J. (ed.) CPAIOR 2018. LNCS, vol. 10848, pp. 170–181. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93031-2_12
11. Vaswani, A., et al.: Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 6000–6010 (2017)
12. Kwon, Y.D., Choo, J., Kim, B., Yoon, I., Gwon, Y., Min, S.: POMO: policy optimization with multiple optima for reinforcement learning. In: Proceedings of the 34th Conference on Neural Information Processing Systems, pp. 21188–21198 (2020)
13. Gidaris, S., Singh, P., Komodakis, N.: Unsupervised representation learning by predicting image rotations. arXiv preprint arXiv:1803.07728 (2018)
14. Kim, M., Park, J., Park, J.: Sym-NCO: leveraging symmetricity for neural combinatorial optimization. In: Proceedings of the 36th Conference on Neural Information Processing Systems, pp. 1936–1949 (2022)
15. Joshi, C.K., Laurent, T., Bresson, X.: An efficient graph convolutional network technique for the travelling salesman problem. arXiv preprint arXiv:1906.01227 (2019)
16. Wu, Y., Song, W., Cao, Z., Zhang, J., Lim, A.: Learning improvement heuristics for solving routing problems. IEEE Trans. Neural Netw. Learn. Syst. 33(9), 5057–5069 (2021)
17. d O Costa, P.R., Rhuggenaath, J., Zhang, Y., Akcay, A.: Learning 2-opt heuristics for the traveling salesman problem via deep reinforcement learning. In: Asian Conference on Machine Learning, pp. 465–480 (2020)
18. Ma, Y., Li, J., Cao, Z., Song, W., Zhang, L., Chen, Z., Tang, J.: Learning to iteratively solve routing problems with dual-aspect collaborative transformer. In: Proceedings of the 35th International Conference on Neural Information Processing Systems, pp. 11096–11107 (2021)
19. Chen, X., Tian, Y.: Learning to perform local rewriting for combinatorial optimization. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems, pp. 6281–6292 (2019)
20. Gao, L., Chen, M., Chen, Q., Luo, G., Zhu, N., Liu, Z.: Learn to design the heuristics for vehicle routing problem. arXiv preprint arXiv:2002.08539 (2020)


21. Lu, H., Zhang, X., Yang, S.: A learning-based iterative method for solving vehicle routing problems. In: Proceedings of the 8th International Conference on Learning Representations, pp. 1–15 (2020)
22. Xin, L., Song, W., Cao, Z., Zhang, J.: NeuroLKH: combining deep learning model with Lin-Kernighan-Helsgaun heuristic for solving the traveling salesman problem. In: Proceedings of the 35th International Conference on Neural Information Processing Systems, pp. 7472–7483 (2021)
23. Kool, W., van Hoof, H., Gromicho, J., Welling, M.: Deep policy dynamic programming for vehicle routing problems. In: Proceedings of Integration of Constraint Programming, Artificial Intelligence, and Operations Research: 19th International Conference, pp. 190–213 (2022)
24. Fu, Z.-H., Qiu, K.-B., Zha, H.: Generalize a small pre-trained model to arbitrarily large TSP instances. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 7474–7482 (2021)
25. Chaslot, G.M.J.-B.C.: Monte-Carlo tree search (2010)
26. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks. In: Proceedings of the 6th International Conference on Learning Representations, pp. 1–12 (2018)
27. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
28. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the International Conference on Machine Learning, pp. 448–456 (2015)

Learning Primitive-Aware Discriminative Representations for Few-Shot Learning

Jianpeng Yang1, Yuhang Niu1, Xuemei Xie1,2, and Guangming Shi1(B)

1 School of Artificial Intelligence, Xidian University, Xi'an, China
[email protected], [email protected]
2 Guangzhou Institute of Technology, Xidian University, Xi'an, China
[email protected]

Abstract. Few-shot Learning (FSL) aims to learn a classifier that can be easily adapted to recognize novel classes with only a few labeled examples. Recently, some works on FSL have yielded promising classification performance, where an image-level feature is used to calculate the similarity among samples for classification. However, the image-level feature ignores abundant fine-grained and structural information of objects that could be transferable and consistent between seen and unseen classes. How can humans easily identify novel classes with only several samples? Some studies from cognitive science argue that humans recognize novel categories based on primitives. Although base and novel categories are non-overlapping, they share some primitives in common. Inspired by the above research, we propose a Primitive Mining and Reasoning Network (PMRN) to learn primitive-aware representations based on a metric-based FSL model. Concretely, we first add a Self-supervision Jigsaw task (SSJ) for the feature extractor in parallel, guiding the model to encode visual patterns corresponding to object parts into feature channels. Moreover, to mine discriminative representations, an Adaptive Channel Grouping (ACG) method is applied to cluster and weight spatially and semantically related visual patterns to generate a set of visual primitives. To further enhance the discriminability and transferability of the primitives, we propose a visual primitive Correlation Reasoning Network (CRN) based on a Graph Convolutional Network to learn abundant structural information and the internal correlation among primitives. Finally, a primitive-level metric is conducted for classification in a meta-task based on the episodic training strategy. Extensive experiments show that our method achieves state-of-the-art results on miniImageNet and Caltech-UCSD Birds.

Keywords: Few-shot Learning · Visual Primitive · Graph Convolution · Episodic Training

(a) The preprint of this paper: Yang, J., Niu, Y., Xie, X., Shi, G.: Learning Primitive-aware Discriminative Representations for Few-shot Learning. arXiv preprint arXiv:2208.09717 (2022). https://arxiv.org/abs/2208.09717


1 Introduction

In recent years, deep learning (DL) has achieved tremendous success in various recognition tasks with abundant labeled data. Training supervised models efficiently requires large numbers of annotated samples, which are expensive and time-consuming to obtain. Therefore, how to recognize novel classes with few labeled samples has attracted increasing attention. To reduce the reliance on human annotation, Few-shot Learning (FSL) has been proposed and studied widely [1–3]; it aims to learn a classifier that can be rapidly adapted to novel classes given just several labeled images per class. FSL attempts to transfer knowledge acquired from base classes with sufficient labeled data to novel classes with only several examples.

Humans can rapidly classify an object into one of several novel classes by recognizing the differences between them. Inspired by this ability, a series of few-shot learning methods adopt metric-based algorithms [2, 3], which learn a global image-level representation in an appropriate feature space and directly calculate the distances between the query and support images for classification. Nevertheless, most of them measure similarity on image-level features, or on features after a pooling operation, which destroys the structure of objects and ignores local clues.

Revisiting the process by which humans recognize new concepts or objects, humans first learn primitives from plenty of known classes and then apply them to identify novel ones. In practice, primitives are viewed as object parts, or regions capturing the compositional structure of the examples [9], whose boundaries are vague and unclear. Research on the interpretability of deep networks shows that CNN feature channels often correspond to certain visual patterns. Inspired by the above studies, CPDE [9] selects a single channel of the image feature as a primitive via a soft composition mechanism. However, this method learns primitives by directly feeding them to a general classifier composed of fully connected (FC) layers and softmax, which leaves the primitives with poor generalization in the few-shot scenario. In contrast, we compose primitives by selecting feature channels related to object parts rather than a single channel, aggregating visual patterns that are coherent in semantics and spatial location. Moreover, to the best of our knowledge, existing local-representation-based methods [5–8, 10] lose sight of the internal correlation among local representations and the structural information of objects. Therefore, we propose to learn and encode the internal correlation among visual primitives to improve the discriminability of the primitive-aware representation.

In this paper, we propose a Primitive Mining and Reasoning Network (PMRN) to learn discriminative and transferable primitive-aware representations for a metric-based FSL model based on the episodic training mechanism. The main contributions of our work are summarized as follows:

• We develop a Primitive Mining Network (PMN) to learn visual primitives. Firstly, this network guides the feature extractor to encode visual patterns related to object parts into feature channels by adding a special Self-supervision Jigsaw (SSJ) task in parallel. Then, it weights and clusters feature channels that are consistent in semantics and spatial location to generate a set of visual primitives through an Adaptive Channel Grouping (ACG) module.


• To capture structural information and the internal correlation among primitives, we propose a Correlation Reasoning Network (CRN) that jointly learns the semantic and spatial dependency among primitives by constructing a graph on visual primitives and conducting a convolutional reasoning operation.
• We design a Task-specific Weight method aimed at measuring the importance of primitives in a metric-based few-shot task, which adaptively generates task-specific weights for weighting the primitive-level metric-based classification. To enhance the algorithm's generalization across tasks, all the modules are embedded into meta-tasks to simulate the few-shot scenario via the episodic training mechanism.
• We conduct experiments on miniImageNet and Caltech-UCSD Birds, and the results reveal the effectiveness of our methodology.

2 Related Work

Few-Shot Learning (FSL). Most recent literature on few-shot learning involves two types of methods: metric-based and meta-learning-based methods.

The meta-learning-based methods [1, 26] optimize a meta-learner using the learning-to-learn paradigm, which can rapidly adapt to novel classes with just a few samples. An external memory module is employed by [26] to communicate with an LSTM-based meta-learner to update weights. MAML [1] and its variants aim to learn a better parameter initialization that can be quickly adapted to a novel task.

The metric-based methods [2–4] learn feature representations for input samples in an appropriate embedding space, where the similarity between images of different classes is calculated through diverse distance metrics. The application of metric-based methods to few-shot learning was first proposed by Koch [27], which aims to generalize representations to novel classes through a Siamese Neural Network. MatchingNet [2] utilizes the episodic training strategy and selects cosine similarity as the metric to solve the FSL problem. ProtoNet [3] regards the mean value of each class's embeddings as the prototype and calculates the Euclidean distance between support and query samples for classification. The proposed PMRN belongs to the metric-based methods, but our method adaptively mines and exploits potential local representations related to object parts, which are more discriminative and transferable between the base and novel categories than the image-level representations utilized by the above methods.

Dense Local Feature Based FSL. In contrast to the previous methods, some FSL works [5–8] focus on local representations and try to exploit the discriminative ability of local patches. Specifically, a local patch is taken to be each spatial grid cell in the feature map, and all the patch-level distances are aggregated as the result. DN4 [5] introduces the Naive-Bayes Nearest Neighbor into FSL and computes image-level similarity via a k-nearest-neighbor search over local patches. ATL-Net [7] proposes to adaptively select important semantic patches via an episodic attention mechanism. DeepEMD [6] conducts a many-to-many matching among local patches via the earth mover's distance. MCL [8] considers the mutual affiliations between the query and support features to thoroughly affiliate two disjoint sets of dense local features.


Self-Supervised Learning (SSL). Manual labels are expensive and time-consuming to collect in practice. SSL aims at learning representations from the structural information of the object itself, without labels. Recently, some works [9, 11] have introduced SSL methods, such as predicting the rotation or the relative position, into FSL. In this paper, we propose to encourage the backbone to learn visual patterns related to object parts by using an SSL loss as a regularizer.

3 Method

In this section, we first introduce the general settings of few-shot learning. Then we introduce our proposed Primitive Mining and Reasoning Network (PMRN) concretely, which is composed of the Primitive Mining Network (PMN) in Sect. 3.2 and the visual primitive Correlation Reasoning Network (CRN) in Sect. 3.3.

3.1 Description and Formulation of Few-Shot Learning

In the few-shot learning scenario, three sets of data are provided: a support set S, a query set Q, and an auxiliary set B. The support set S contains N novel classes, and each class has K samples. FSL aims at recognizing an unlabeled query sample q ∈ Q as one of the N novel classes of S, and we call such a task an N-way K-shot task. However, the support set S only has several labeled samples per class, so an auxiliary set B is employed to train the model through the episodic training mechanism for learning transferable knowledge. In episodic training, the auxiliary set B is randomly divided into many N-way K-shot tasks T (also called episodes), where each T contains an auxiliary support set BS and an auxiliary query set BQ. Each BQ contains M samples per class. During the training process, the model is forced to conduct hundreds of tasks T to simulate the test scenario, in order to obtain generalization ability across tasks and learn transferable knowledge that can be used in new N-way K-shot tasks. Note that the auxiliary set B contains abundant labeled classes and samples, but its label space is disjoint from those of the sets S and Q.

3.2 Primitive Mining Network

As introduced in Sect. 1, our proposed visual primitives show their discriminability and transferability in few-shot learning. We therefore replace the general local representation with visual primitives in the metric-based classification and design a Primitive Mining Network (PMN) to learn visual primitives. The details of the model are shown in Fig. 1.

Self-Supervision Jigsaw Task. To assist the primitive mining operation, we add a special self-supervision jigsaw task loss to the FSL model as a regularizer, which encourages the model to recognize the relative locations of image patches by learning visual patterns corresponding to object parts. Note that the self-supervision auxiliary task does not need any extra part annotation. Concretely, we first divide an input image x ∈ B = {(xi, yi)}ni into h · w patches along rows and columns. Afterwards, these patches are permuted randomly as the input xp, and we take the index of the permutation as the target label yp.


Fig. 1. The architecture of Primitive Mining Network (PMN). It is composed of a Self-supervision Jigsaw (SSJ) task and an Adaptive Channel Grouping (ACG) module.

The training goal is to predict the index of the permutation for the patches of an image. All permuted patches are fed into the feature extractor f(·) to obtain h · w features, and we concatenate the permuted features. Finally, an FC layer for classification with a cross-entropy loss can be utilized to train the model. Denoting the concatenated features as $f^p$, the classification loss can be formulated as the negative log-probability:

$$\mathcal{L}_{ssl} = -\sum_{x \in B} \log p(y^p \mid f^p) \qquad (1)$$

where $f^p \in \mathbb{R}^{h \cdot w \cdot d}$, $h \cdot w$ is the number of patches, and d is the dimension of the feature map. In practice, h · w is set as 3 × 3 and the number of permutation indices reaches 9!.
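The following numpy sketch illustrates how a jigsaw training sample could be built; it is our own illustration, not the authors' implementation (function name and the choice to use the raw permutation as the label are assumptions; in practice a fixed permutation set whose index serves as the class label is common).

```python
import numpy as np

def make_jigsaw_sample(image, grid=3, rng=np.random):
    """Build one jigsaw self-supervision sample (sketch of the SSJ task).

    image: (H, W, C) array whose height and width are divisible by `grid`.
    Returns the shuffled patches and the permutation used as the target.
    """
    h, w, _ = image.shape
    ph, pw = h // grid, w // grid
    patches = [image[i * ph:(i + 1) * ph, j * pw:(j + 1) * pw]
               for i in range(grid) for j in range(grid)]
    order = rng.permutation(grid * grid)      # random permutation of the 9 patches
    shuffled = [patches[k] for k in order]
    return shuffled, order
```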

Adaptive Channel Grouping Module. The ACG module adaptively synthesizes a set of visual primitives from the feature maps by clustering and weighting spatially and semantically consistent visual patterns encoded in the feature channels.

In each episode, all the images from the support and query sets $B_S = \{(x_j^s, y_j^s)\}, j \in [1, NK]$ and $B_Q = \{(x_j^q, y_j^q)\}, j \in [1, NM]$ are fed into the feature extractor f(·) to obtain a collection of features $\Omega = \{f_i^s\}_{i}^{NK} \cup \{f_i^q\}_{i}^{NM}$, where the dimension of each feature is H × W × C, and H, W, and C indicate the height, width, and number of feature channels. Afterwards, a set of parallel channel grouping operations is employed to weight and cluster the feature into k groups of feature channels to generate k primitives $P = \{p_1, p_2, \ldots, p_k\}$ for each sample in Ω; these operations are denoted as $O = \{o_1, o_2, \ldots, o_k\}$. Each channel grouping operation $o_i$ is responsible for generating the set of weights for one primitive:

$$D_i = o_i(f), \quad i \in [1, k] \qquad (2)$$

where $f \in \Omega$, and $D_i$ is a set of weights $\{d_1^i, d_2^i, \cdots, d_C^i\}$ for the generation of the i-th primitive $p_i$. After that, we separately employ each set of weights to cluster all the channels into k groups of spatially and semantically consistent channels and obtain k primitive masks as follows:

$$m_i = \sigma\Big(\sum_{j=1}^{C} d_j^i \cdot a_j\Big), \quad i \in [1, k] \qquad (3)$$

where $a_j$ is the j-th channel map of a feature $f \in \Omega$, and the multiplication here is conducted between the scalar $d_j^i$ and the matrix $a_j$. We use the sigmoid function σ(·) on the weighted channel maps to generate the i-th primitive mask $m_i \in \mathbb{R}^{H \cdot W}$, which covers the activation region belonging to the i-th primitive $p_i$. Finally, each feature $f \in \mathbb{R}^{H \cdot W \cdot C}$ is filtered through the k primitive-level attention masks along the channel dimension to acquire the initial primitives:

$$p_i = f \otimes m_i, \quad i \in [1, k] \qquad (4)$$

where ⊗ denotes the element-wise multiplication between $m_i$ and each channel of f. The diversity of primitives can further provide rich information and contribute to robust recognition, especially in occlusion cases. To avoid duplication of the learned primitives, we utilize a channel grouping diversity loss inspired by [10], formulated as:

$$\mathcal{L}_{div} = \sum_{i=1}^{k} \sum_{j=1, j \neq i}^{k} \delta\big(g(p_i), g(p_j)\big) \qquad (5)$$

where δ(·) denotes the cosine similarity between primitives from a sample, and g(·) is the global average pooling operation over the spatial dimensions.
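The following PyTorch sketch shows one way Eqs. (2)–(5) could be realized. The module and argument names are our own assumptions, and the choice to compute the channel weights from a globally pooled feature with a linear head is an illustrative design, not the authors' released code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdaptiveChannelGrouping(nn.Module):
    """Sketch of the ACG module: k parallel channel-grouping heads (Eqs. 2-4)."""

    def __init__(self, channels, k=4):
        super().__init__()
        # each head o_i maps the pooled feature to C channel weights (assumed design)
        self.heads = nn.ModuleList([nn.Linear(channels, channels) for _ in range(k)])

    def forward(self, feat):                                   # feat: (B, C, H, W)
        pooled = F.adaptive_avg_pool2d(feat, 1).flatten(1)     # (B, C)
        primitives, masks = [], []
        for head in self.heads:
            d = head(pooled)                                   # channel weights D_i, (B, C)
            # Eq. (3): weighted sum of channel maps, squashed by a sigmoid
            m = torch.sigmoid((d[:, :, None, None] * feat).sum(dim=1, keepdim=True))
            masks.append(m)
            primitives.append(feat * m)                        # Eq. (4): p_i = f ⊗ m_i
        return primitives, masks

def diversity_loss(primitives):
    """Eq. (5): pairwise cosine similarity between spatially pooled primitives."""
    vecs = [F.adaptive_avg_pool2d(p, 1).flatten(1) for p in primitives]  # g(p_i)
    loss = 0.0
    for i in range(len(vecs)):
        for j in range(len(vecs)):
            if i != j:
                loss = loss + F.cosine_similarity(vecs[i], vecs[j], dim=1).mean()
    return loss
```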

3.3 Correlation Reasoning Network

To further improve the discriminability and transferability of the visual primitives across tasks, we propose a visual primitive Correlation Reasoning Network (CRN), which constructs a graph structure on the visual primitives and designs a special graph convolutional network for reasoning about the internal correlation among primitives and the structural information. In addition, a Task-specific Weight (TSW) method is applied to measure the importance of primitives in the primitive-level metric-based classification.

Visual Primitive Graph Structure. Given the initial visual primitives $P = \{p_1, p_2, \ldots, p_k\}$ obtained through the PMN, we first construct a graph on the visual primitives. The illustration of the visual primitive graph structure is shown in Fig. 2. In detail, a global average pooling operation is applied to the primitives $P = \{p_1, p_2, \ldots, p_k\}$ to get k primitive feature vectors as node embeddings $N = \{n_1, n_2, \ldots, n_k\}$, $n_i \in \mathbb{R}^C$. Then, the similarity matrix of the node embeddings is calculated as the adjacency matrix of the graph, which encodes the semantic and spatial correlation between visual primitives. The normalized embedded Gaussian function is adopted to calculate the similarity of two nodes as follows:

$$S(n_i, n_j) = \frac{e^{\theta(n_i)\,\omega(n_j)^T}}{\sum_{j=1}^{k} e^{\theta(n_i)\,\omega(n_j)^T}} \qquad (6)$$


where N is the total number of nodes corresponding to primitives. The dot product is used to measure the similarity of two nodes in an embedding space. To sum up, the visual primitive graph can be formulated as ξ(N, ϑ), in which ϑ denotes the connections between nodes.

Fig. 2. The illustration of the visual primitive graph structure. The primitive vectors are embedded into the nodes of the graph, and the internal correlation among primitives is regarded as the adjacency matrix.

Adaptive Graph Convolution Layer. According to the characteristics of the visual primitive graph, a special adaptive graph convolutional network is designed to update the visual primitive graph and reason about the semantic and spatial correlation between visual primitives. Specifically, given an input $v \in \mathbb{R}^{k \cdot c}$, where k is the number of nodes and c is the dimension of each node embedding, we first project it to dimension $k \times c_e$ with two projection functions, θ(·) and ϕ(·), which are implemented with 1 × 1 convolutional layers. The two projected feature vectors are rearranged and reshaped into a $k \times c_e$ matrix and a $c_e \times k$ matrix, respectively, which are multiplied to obtain a $k \times k$ similarity matrix C. The element $C_{ij}$ of C represents the similarity of nodes $n_i$ and $n_j$, and we conduct a softmax operation along the rows of the matrix for normalization. Based on Eq. (6), the adjacency matrix C can be calculated as follows:

$$C = \sigma\big((v w_\theta^T) \otimes (w_\varphi v^T)\big) \qquad (7)$$

where σ is the softmax operation, ⊗ is matrix multiplication, and $w_\varphi$ and $w_\theta$ are the parameters of ϕ(·) and θ(·), respectively. The adaptive graph convolutional layer takes the node features $v \in \mathbb{R}^{k \cdot c}$ and the adjacency matrix $C \in \mathbb{R}^{k \cdot k}$ as inputs. A single graph convolution layer can be formulated as:

$$v^{*} = G(v, C) = \varepsilon(C v w_\omega) \qquad (8)$$



where $w_\omega \in \mathbb{R}^{c \cdot c}$ is the learned weight matrix of a 1 × 1 convolution operation, and ε(·) is the ReLU function. The complete adaptive graph convolutional network on the visual primitives is constructed by stacking multiple graph convolution layers.
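A minimal PyTorch sketch of one such layer, realizing Eqs. (7) and (8), is given below. It is our own illustration: we use linear layers in place of 1 × 1 convolutions (equivalent for per-node features), and the class and argument names are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AdaptiveGraphConvLayer(nn.Module):
    """Sketch of one adaptive graph convolution layer (Eqs. 7-8).

    Node features v: (B, k, c). The adjacency matrix is built from the nodes
    themselves via two learned projections, then used to mix node features.
    """

    def __init__(self, c, c_e=None):
        super().__init__()
        c_e = c_e or max(c // 2, 1)
        self.theta = nn.Linear(c, c_e, bias=False)   # θ(·)
        self.phi = nn.Linear(c, c_e, bias=False)     # ϕ(·)
        self.w_omega = nn.Linear(c, c, bias=False)   # w_ω

    def forward(self, v):                            # v: (B, k, c)
        # Eq. (7): similarity between projected nodes, row-wise softmax
        sim = torch.matmul(self.theta(v), self.phi(v).transpose(1, 2))  # (B, k, k)
        adj = F.softmax(sim, dim=-1)
        # Eq. (8): v* = ReLU(C v w_ω)
        return F.relu(self.w_omega(torch.matmul(adj, v)))
```

Stacking several of these layers gives the complete reasoning network described above.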


Task-Specific Weight Method. For each sample in a task T, a set of transferable and discriminative visual primitives $P = \{p_1, p_2, \ldots, p_k\}$ can be produced through the PMN and the CRN. Intuitively, the importance of different primitives in a task differs, so we propose a task-specific weight (TSW) method to measure the importance of visual primitives in a task and to calculate the primitive-level similarity between the support and query sets.

For each query sample $x^q \in B_Q$, we can extract a set of query primitives $P^q = \{p_1^q, p_2^q, \ldots, p_k^q\}$ and classify it into one of the N support classes by calculating the similarity between them. For the n-th support class, we can get M sets of primitives $P^{s,j} = \{p_1^{s,j}, p_2^{s,j}, \ldots, p_k^{s,j}\}_{j=1}^{M}$ from its M samples and average them to get a primitive-level representation for each class:

$$P^s = \frac{1}{M} \sum_{j=1}^{M} P^{s,j} \qquad (9)$$

After that, each support class also has a set of support primitives, which can be denoted as $P^s = \{p_1^s, p_2^s, \ldots, p_k^s\}$. In a traditional few-shot model, the final similarity between a query sample $x^q \in B_Q$ and the c-th support class can be calculated simply by summing the primitive-level similarities:

$$I(x^q, c) = \sum_{i=1}^{k} \varphi\big(g(p_i^q), g(p_i^s)\big) \qquad (10)$$

where ϕ(·) is the metric function, implemented as cosine similarity in this paper, and g(·) is the global average pooling operation. As analyzed above, the contributions of different primitives are discrepant, so equally aggregating the primitive-level similarities makes little sense. Therefore, we design a task-specific weight module to adaptively assign an appropriate weight to each primitive pair $\{p_i^q, p_i^s\}_{i=1}^{k}$. Concretely, each primitive in a pair $(p_i^q, p_i^s)$ is compressed into a map to obtain a pair of maps $(m_i^q, m_i^s)$. Then we concatenate them along the channel dimension as follows:

$$m_i = m_i^q \oplus m_i^s, \quad m_i \in \mathbb{R}^{H \cdot W \cdot 2} \qquad (11)$$

After that, a task-specific attention feature $F_a \in \mathbb{R}^{A \cdot H \cdot W}$ is constructed by further concatenating all pairs of maps $\{m_i^q, m_i^s\}_{i=1}^{k}$. The number of channels of $F_a$ is A, with A = 2k. In contrast to a general image-level representation, the channels of $F_a$ correspond to primitive pairs. Thus, we use a weight generator G(·) to measure the importance of the different pairs of primitives by learning the importance of the different channels of $F_a$. In this way, a set of task-specific weights $W \in \mathbb{R}^{k}$, one for each pair of primitives, can be generated as follows:

$$W = \mathrm{sigmoid}\big(\gamma(g(F_a)),\ \gamma(m(F_a))\big) \qquad (12)$$

where g(·) and m(·) are global average pooling and max pooling, respectively, and γ(·) consists of two consecutive 1 × 1 convolution layers and a sigmoid function. A large number of task-specific attention features $F_a$ are used to train the weight generator G(·) across tasks, and it adaptively assigns higher weights to significant primitives. Therefore, the final similarity is adjusted to the weighted sum of primitive-level similarities, and Eq. (10) becomes:

$$I(x^q, c) = \sum_{i=1}^{k} W_i\, \varphi\big(g(p_i^q), g(p_i^s)\big) \qquad (13)$$
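For clarity, the weighted primitive-level metric of Eq. (13) can be sketched as follows in PyTorch; this is our own illustration and omits the weight generator itself, taking its output as given (function name and tensor layout are assumptions).

```python
import torch
import torch.nn.functional as F

def weighted_primitive_similarity(query_prims, support_prims, weights):
    """Eq. (13): task-specific weighted sum of primitive-level similarities.

    query_prims / support_prims: lists of k tensors of shape (C, H, W).
    weights: tensor of shape (k,) produced by the weight generator G(·).
    """
    sims = []
    for pq, ps in zip(query_prims, support_prims):
        vq = pq.mean(dim=(1, 2))          # g(p_i^q): global average pooling
        vs = ps.mean(dim=(1, 2))          # g(p_i^s)
        sims.append(F.cosine_similarity(vq, vs, dim=0))
    return (weights * torch.stack(sims)).sum()
```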


3.4 Loss and Train

For few-shot learning, the final similarity I(·) between a query sample $x^q$ and class c is calculated as the weighted sum of primitive-level similarities. Hence, the probability that each query sample $x^q \in Q = \{(x_j^q, y_j^q)\}, j = 1, 2, \cdots, NM$ is classified into class c is formulated as:

$$p(y_c \mid x^q) = \frac{\exp\big(I(x^q, c)\big)}{\sum_{c^s=1}^{N} \exp\big(I(x^q, c^s)\big)} \qquad (14)$$

We compute the classification probability using a softmax operation, and the cross-entropy loss is selected as the few-shot learning loss:

$$\mathcal{L}_{cls} = -\sum_{x^q \in Q} \log p(y^q \mid x^q) \qquad (15)$$

Consisting of $\mathcal{L}_{cls}$, $\mathcal{L}_{ssl}$ and $\mathcal{L}_{div}$, the loss of our proposed PMRN can be defined as:

$$\mathcal{L} = \mathcal{L}_{cls} + \lambda \mathcal{L}_{div} + \alpha \mathcal{L}_{ssl} \qquad (16)$$

where the hyper-parameters λ and α control the importance of the diversity loss and the self-supervision loss, respectively.
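Assembling Eq. (16) is a one-liner; the default weights below follow the values reported in Sect. 4.2, and the function name is our own.

```python
def pmrn_total_loss(cls_loss, div_loss, ssl_loss, lam=0.4, alpha=1.0):
    """Eq. (16): total training loss; default λ=0.4 and α=1.0 as in Sect. 4.2."""
    return cls_loss + lam * div_loss + alpha * ssl_loss
```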

4 Experiments

In this section, we first introduce the datasets involved in our experiments and then present some key implementation details. Afterwards, we compare our method with the state-of-the-art methods on a general few-shot learning dataset and on a fine-grained few-shot learning dataset, respectively. Finally, we conduct qualitative analysis and report ablation experiments to validate each module in our network.

4.1 Dataset Description

To evaluate the performance of our proposed PMRN, we conduct extensive experiments on a widely used general few-shot learning dataset and a fine-grained dataset. miniImageNet [28] consists of 100 classes with 600 images per class selected from ILSVRC-2012 [30]. We split the classes into 64, 16, and 20 classes as the train set, validation set, and test set, respectively. Caltech-UCSD Birds-200-2011 [29] contains 11,788 images from 200 bird classes. Following the splits in [12], we divide them into 100/50/50 classes for train/val/test, and each image is first cropped to a human-annotated bounding box.

4.2 Implementation Details

PMRN adopts the widely used ResNet-18 [13] as the backbone of our feature extractor f(·) and removes its last pooling layer. PMRN is learned by the episodic training mechanism, and each episode consists of an N-way K-shot task in which 16 query samples


are provided for each class. Specifically, there are 5 support images and 80 query images per episode in the 5-way 1-shot setting, and 25 support images and 80 query images in the 5-way 5-shot setting. We train the network with ADAM [14] at a learning rate of 0.001. Note that the number of episodes is 100,000 for the 5-way 5-shot setting and 300,000 for the 5-way 1-shot setting. Our model is implemented in PyTorch based on the few-shot learning codebase described in [11]. In the testing stage, we randomly sample 1000 episodes from the test set and use the top-1 mean accuracy as the evaluation criterion. We report the final mean accuracy with 95% confidence intervals. All the modules of PMRN are trained from scratch in an end-to-end manner and do not need fine-tuning in the test stage. For the self-supervision task, we first randomly crop the original image to a 255 × 255 region with random scaling between [0.5, 1.0]. Then we split it into 3 × 3 regions, which yields nine random patches of size 64 × 64. The number of primitives k is set as 4 on all the datasets, and the default values of the hyper-parameters λ and α are set as 0.4 and 1.0, respectively. All experiments were performed on two TITAN RTX GPUs.

4.3 General Few-Shot Classification Results

We show the experimental results of PMRN for the general few-shot learning task on miniImageNet. Table 1 shows the comparison of our proposed PMRN with general few-shot learning methods, including local-feature-based methods and the state-of-the-art methods.

Comparison with Local Feature Based Methods. Because our method belongs to the metric-based few-shot learning branch based on local representations, we first compare it with some popular metric-based methods that exploit local representations. The detailed results are shown in Table 1.

Comparison with the State-of-the-Arts. We compare our PMRN with some state-of-the-art methods on miniImageNet. As Table 1 shows, our proposed PMRN achieves new state-of-the-art performance in both settings (5-way 1-shot and 5-way 5-shot). Compared with the best previous method, HCT [20], we achieve a remarkable 5.92% performance gain in the 5-way 1-shot setting and a 2.84% performance gain in the 5-way 5-shot setting. These results indicate the improvement brought by PMRN in general few-shot learning. In contrast to HCT and EASY3-R [21], PMRN systematically designs a special network and mechanism to learn discriminative visual primitives as local representations for few-shot learning, rather than simply stacking networks or increasing the depth of the backbone.

4.4 Fine-Grained Few-Shot Classification Results

To further demonstrate the effectiveness of PMRN, we conduct extensive experiments on the fine-grained dataset for the fine-grained few-shot learning task. As shown in Table 2, PMRN also achieves new state-of-the-art performance. Compared with the best previous method on Caltech-UCSD Birds-200-2011, PMRN obtains a 3.89% accuracy gain under the 5-way 1-shot setting and a 1.64% accuracy gain under the 5-way 5-shot setting.


Table 1. Comparison of our method with the state-of-the-art methods. Few-shot classification (%) results with 95% confidence intervals on miniImageNet.

Method                 | Backbone       | miniImageNet 5-way 1-shot | 5-way 5-shot
MatchingNet [2, 6, 15] | ResNet-12      | 65.64 | 78.72
RelationNet [4, 8]     | ResNet-12      | 60.97 | 75.32
ProtoNet [3, 8]        | ResNet-12      | 62.67 | 77.88
CAN [16]               | ResNet-12      | 63.85 | 79.44
DN4 [5]                | ResNet-12      | 65.35 | 81.10
DeepEMD [6]            | ResNet-12      | 65.91 | 82.41
ATL-Net [7]            | ConvNet        | 54.30 | 73.22
DSN [18]               | ResNet-12      | 62.64 | 78.83
FRN [17]               | ResNet-12      | 66.45 | 82.83
CPDE [9]               | ResNet-18      | 65.55 | 80.66
TPMN [10]              | ResNet-12      | 67.64 | 83.44
CTM [19]               | ResNet-18      | 64.12 | 80.51
MCL [8]                | ResNet-12      | 67.85 | 84.47
EASY3-R [21]           | 3xResNet-12    | 71.75 | 87.15
HCT [20]               | 3xTransformers | 74.62 | 89.19
PMRN (ours)            | ResNet-18      | 80.54 | 92.03

It is obvious that our proposed PMRN achieves a larger performance gain on the fine-grained dataset than on the general dataset. For the fine-grained few-shot learning task, fine-grained information and clues could be more important due to the small inter-class differences and large intra-class differences.

4.5 Qualitative Analysis

To further evaluate the effectiveness of PMRN and illustrate its mechanism, we conduct extensive qualitative analysis.

Local Representations Related to Structural Clues. As shown in Fig. 3, the activation regions of the visual primitives roughly correspond to certain structural parts or semantic regions of the object, such as the legs and abdomen of birds. For the same class, such as the support sample and query sample (a), the activation regions of the visual primitives are consistent with the object structure to a certain extent, which shows that the same visual primitives tend to cover the same structural parts or semantic regions. The above analysis illustrates that the PMN can adaptively mine and generate visual primitives corresponding to parts or structures of objects, which constitute more discriminative local representations, and it possesses a certain interpretability and transferability in semantics and spatial location.


Table 2. Comparison of our method with the state-of-the-art few-shot learning methods on the fine-grained dataset. The results with 95% confidence intervals on Caltech-UCSD Birds.

Method                 | Backbone  | Caltech-UCSD Birds 5-way 1-shot | 5-way 5-shot
MatchNet [2, 11]       | ResNet-18 | 73.49 | 84.45
MatchNet [2, 6, 12]    | ResNet-12 | 71.87 | 85.08
RelationNet [4, 11]    | ResNet-18 | 68.58 | 84.05
RelationNet [4, 11]    | ResNet-34 | 66.20 | 82.30
ProtoNet [3, 11]       | ResNet-18 | 72.99 | 86.64
MAML [1, 13]           | ResNet-18 | 68.42 | 83.47
SCA+MAML++ [22]        | DenseNet  | 70.33 | 85.47
Baseline [11]          | ResNet-18 | 65.51 | 82.85
Baseline++ [11]        | ResNet-18 | 67.02 | 83.58
S2M2 [23]              | ResNet-18 | 71.43 | 85.55
DeepEMD [6]            | ResNet-12 | 75.65 | 88.69
DEML [24]              | ResNet-50 | 67.28 | 83.47
ATL-Net [7]            | ConvNet   | 60.91 | 77.05
Cosine classifier [11] | ResNet-18 | 72.22 | 86.41
DSN [18]               | ResNet-12 | 80.80 | 91.19
FRN [17]               | ResNet-12 | 83.16 | 92.59
CPDE [10]              | ResNet-18 | 80.11 | 89.28
CFA [25]               | ResNet-18 | 73.90 | 86.80
MCL [8]                | ResNet-12 | 85.63 | 93.18
PMRN (ours)            | ResNet-18 | 89.25 | 94.82

For different classes, such as query sample (b) and the support sample in Fig. 3, although they have huge differences in structure, the visual primitives still roughly cover similar structural parts or semantic regions; for example, the visual primitive activating the neck and head of a bird also covers the neck and head of a dog. The above analysis reflects the transferability and generalization of visual primitives across tasks. It also shows that visual primitives based on the episodic training mechanism can adapt well to the few-shot scenario.

Inter Correlation Among Visual Primitives. For visual primitives with poor interpretability and discriminability produced by the PMN, the CRN makes them more natural with respect to structural parts and more distinguishable in spatial regions. Because visual primitives are related to object parts, the strong internal correlation in structure and semantics needs to be mined.


Fig. 3. The visual primitive visualization of a support sample and two query samples, where query sample (a) and support sample are of the same class. Note that visual primitives are produced by PMN.

It can be concluded that the spatial and structural information among visual primitives is constrained and optimized by constructing the visual primitive graph and reasoning with the graph convolutional network, so that some confusing visual primitives can be rectified.

4.6 Ablation Study

To assess the effectiveness of each module in PMRN, we conduct detailed ablation studies on the miniImageNet dataset. We first introduce our baseline as the reference for the validation of the other modules. Specifically, we use ResNet-18 as the backbone, and the episodic training mechanism is used to classify each query sample into one of the N support classes in an N-way K-shot task, as in ProtoNet [3]. The main difference is that we change the similarity function from Euclidean distance to cosine similarity. As shown in Table 3, we add the various modules to the baseline respectively to verify the effectiveness of each module.

Compared to the baseline, our proposed ACG module improves the accuracy by 2.51% in the 1-shot setting and 4.06% in the 5-shot setting. This large improvement shows that primitive-level representations possess better discriminative power. Based on the primitive-level representations produced by the ACG module, we utilize the CRN and the task-specific weight module (TSW) separately to enhance the model. With the application of the CRN module, a further improvement of 2.87% in the 5-shot setting indicates that considering the internal correlation among primitives is necessary. Meanwhile, TSW achieves large accuracy gains of 5.25% and 7.52% in the 5-shot and 1-shot settings, respectively, on top of the ACG module, which reveals that our proposed task-specific weight generation mechanism can indeed select both transferable and discriminative primitives across tasks. It is worth noting that the combination of CRN and TSW acquires a more remarkable performance improvement than any single addition of them: the 9.86% and 13.09% performance gains in the 5-shot and 1-shot settings demonstrate that they can be mutually optimized and enhanced in the training stage.


Table 3. Ablation results on miniImageNet in 5-way 1-shot and 5-way 5-shot settings for the proposed PMRN.

Method                      5-way 1-shot   5-way 5-shot
baseline                    61.51          75.32
+ SSJ                       62.04          76.12
+ ACG                       64.02          79.38
+ ACG + CRN                 65.57          82.25
+ ACG + TSW                 71.54          85.21
+ ACG + CRN + TSW           77.13          89.24
+ SSJ + ACG                 79.26          90.02
+ SSJ + ACG + CRN + TSW     80.54          92.03

5 Conclusion

Inspired by the rapid recognition ability of humans, we investigate visual primitives as local representations for metric-based few-shot learning. In this paper, we propose a Primitive Mining and Reasoning Network (PMRN) for few-shot learning, which is composed of a Primitive Mining Network (PMN) and a visual primitive Correlation Reasoning Network (CRN). PMN adaptively generates primitive-aware representations by mining and clustering visual patterns related to object parts under the episodic training strategy. The visual primitives are then used to conduct primitive-level metric learning for classification. CRN constructs a dedicated visual primitive graph and applies a graph convolution network to reason about the internal correlation among visual primitives. Based on qualitative and quantitative analysis, the visual primitives in PMRN show remarkable transferable and discriminative power across tasks and domains. Extensive experiments demonstrate the effectiveness of our method.

Acknowledgements. This research was financially and technically supported by the Guangzhou Key Research and Development Program (202206030003) and the Guangzhou Key Laboratory of Scene Understanding and Intelligent Interaction (No. 202201000001).

References 1. Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep net-works. In: International Conference on Machine Learning. PMLR, pp. 1126–1135 (2017) 2. Vinyals, O., Blundell, C., Lillicrap, T., et al.: Matching networks for one shot learning. In: Advances in Neural Information Processing Systems, vol. 29 (2016) 3. Snell, J., Swersky, K., Zemel, R.: Prototypical networks for few-shot learning. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 4. Sung, F., Yang, Y., Zhang, L., et al.: Learning to compare: relation network for few-shot learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1199–1208 (2018)


5. Li, W., Wang, L., Xu, J., et al.: Revisiting local descriptor based image-to-class measure for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7260–7268 (2019) 6. Zhang, C., Cai, Y., Lin, G., et al.: DeepEMD: differentiable earth mover’s distance for few-shot learning. IEEE Trans. Pattern Anal. Mach. Intell. (2022) 7. Dong, C., Li, W., Huo, J., et al.: Learning task-aware local representations for few-shot learning. In: Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence, pp. 716–722 (2021) 8. Liu, Y., Zhang, W., Xiang, C., et al.: Learning to affiliate: mutual centralized learning for few-shot classification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14411–14420 (2022) 9. Zhou, B., Khosla, A., Lapedriza, A., et al.: Learning deep features for discriminative localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2921–2929 (2016) 10. Wu, J., Zhang, T., Zhang, Y., et al.: Task-aware part mining network for few-shot learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8433–8442 (2021) 11. Ashok, A., Aekula, H.: When does self-supervision improve few-shot learning?-A reproducibility report. In: ML Reproducibility Challenge (Fall Edition) (2021) 12. Zheng, H., Fu, J., Mei, T., et al.: Learning multi-attention convolutional neural network for fine-grained image recognition. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5209–5217 (2017) 13. He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 14. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv pre-print arXiv: 1412.6980 (2014) 15. Ye, H.J., Hu, H., Zhan, D.C., et al.: Few-shot learning via embedding adaptation with set-toset functions. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8808–8817 (2020) 16. Hou, R., Chang, H., Ma, B., et al.: Cross attention network for few-shot classification. In: Advances in Neural Information Processing Systems, 32 (2019) 17. Wertheimer, D., Tang, L., Hariharan, B.: Few-shot classification with feature map reconstruction networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8012–8021 (2021) 18. Simon, C., Koniusz, P., Nock, R., et al.: Adaptive subspaces for few-shot learn-ing. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4136–4145 (2020) 19. Li, H., Eigen, D., Dodge, S., et al.: Finding task-relevant features for few-shot learning by cate-gory traversal. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1–10 (2019) 20. He, Y., Liang, W., Zhao, D., et al.: Attribute surrogates learning and spectral tokens pooling in transformers for few-shot learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9119–9129 (2022) 21. Bendou, Y., Hu, Y., Lafargue, R., et al.: Easy—ensemble augmented-shot-Y-shaped learning: state-of-the-art few-shot classification with simple components. J. Imaging 8(7), 179 (2022) 22. Antoniou, A., Storkey, A.J.: Learning to learn by self-critique. In: Advances in Neural Information Processing Systems, 32 (2019) 23. 
Mangla, P., Kumari, N., Sinha, A., et al.: Charting the right manifold: manifold mixup for few-shot learning. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2218–2227 (2020)


24. Zhou, F., Wu, B., Li, Z.: Deep meta learning: learning to learn in the concept space. arXiv preprint arXiv:1802.03596 (2018) 25. Hu, P., Sun, X., Saenko, K., et al.: Weakly-supervised compositional featureaggregation for few-shot recognition. arXiv preprint arXiv:1906.04833 (2019) 26. Santoro, A., Bartunov, S., Botvinick, M., et al.: Meta-learning with memory-augmented neural networks. In: International Conference on Machine Learning. PMLR, pp. 1842–1850 (2016) 27. Koch, G., Zemel, R., Salakhutdinov, R.: Siamese neural networks for one-shot image recognition. In: ICML Deep Learning Workshop, vol. 2(1) (2015) 28. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017) 29. Welinder, P., et al.: Caltechucsd birds 200 (2010) 30. Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vision 115(3), 211–252 (2015). https://doi.org/10.1007/s11263-015-0816-y

Time-Series Forecasting Through Contrastive Learning with a Two-Dimensional Self-attention Mechanism

Linling Jiang1, Fan Zhang1,4(B), Mingli Zhang1,2,4, and Caiming Zhang3

1 School of Computer Science and Technology, Shandong Technology and Business University, Yantai 264005, Shandong, China
[email protected]
2 McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, McGill University, Montreal, QC H3A 2B4, Canada
3 Shandong University, Jinan 250100, Shandong, China
4 Shandong Future Intelligent Financial Engineering Laboratory, Yantai 264005, China

Abstract. Contrastive learning methods have impressive capabilities in time-series representation; however, challenges in capturing contextual consistency and extracting features that meet the requirements of representation learning remain. To address these problems, this study proposed a time-series prediction contrastive learning model based on a two-dimensional self-attention mechanism. The main innovations of this model were as follows: First, long short-term memory (LSTM) adaptive pruning was used to form two subsequences with overlapping parts to provide robust context representation for each timestamp. Second, the model extracted sequence data features in both global and local dimensions. In the channel dimension, the model encoded sequence data using a combination of a self-attention mechanism and dilated convolution to extract key features for capturing long-term trends and periodic changes in data. In the spatial dimension, the model adopted a sliding-window self-attention mechanism to encode sequence data, thereby improving its perceptual ability for local features. Finally, the model introduced a self-correlation attention mechanism that converted the similarity calculation from the real domain to the frequency domain through a Fourier transform, better capturing the periodicity and trends in the data. The experimental results showed that the proposed model outperformed existing models in multiple time-series prediction tasks, demonstrating its effectiveness and feasibility in time-series prediction tasks.

Keywords: Time-series prediction · Contrastive learning · Adaptive pruning · Self-attention mechanism · Representation learning


1 Introduction

Long-term forecasting involves the prediction of time-series data with long-term dependencies. It plays a crucial role in the fields of finance [19], economics [3], meteorology [2], healthcare [34], and transportation [22]. Long-term forecasting can help people anticipate future trends, formulate reasonable plans and policies, and assist in decision-making. At the same time, long-term time-series forecasting can help discover patterns and regularities in historical data, thereby providing a better understanding of the nature of time-series data. Moreover, long-term time-series forecasting based on contrastive learning [29] has recently become an emerging research trend [25]. Unlike traditional supervised learning methods [14], contrastive learning attempts to learn the similarities and differences between two or more related sequences [26] and uses this information to make predictions, thereby improving a model's generalization ability and robustness. This method can improve the model's understanding of the data and achieve better results.

Feature extraction is a key step in contrastive learning, and considerable progress has been made in the development of the deformable convolutional network (DCN) [4]. DCN is a classic time-series prediction model based on the convolutional neural network (CNN) [1] that uses dilated convolution [27] to expand the receptive field of the CNN, thereby capturing long-term temporal dependencies. However, this method still has limitations. DCN learns local information in a sequence through convolutional operations, making it difficult to handle global information, and its ability to model complex nonlinear sequences is limited. Therefore, this study models the time-series synchronously in the channel and spatial dimensions, allowing the model to focus on global and local information simultaneously and thereby providing stronger feature extraction capabilities.

In long-term forecasting in fields such as power [10], transportation, and economics, the relevant networks, including the recurrent neural network (RNN) [30] and LSTM [28], typically require the time-series to be sampled at regular intervals. However, in practical applications, time-series are often irregular, limiting their use in real-world applications. Additionally, the recurrent structure used by RNN and LSTM models can make it difficult to capture long-term dependencies when the sequence length is very long, leading to a decrease in prediction accuracy. The Transformer [21], however, is a model built on self-attention mechanisms, and its powerful global modeling ability shows great potential for time-series forecasting, attracting increasing attention [33]. Building on the existing research, this study directly models a time-series using self-attention mechanisms in the channel dimension, thereby guiding the model to focus on global features. Additionally, a sliding self-attention mechanism is constructed in the spatial dimension by setting a fixed-size window to enhance local information. The sliding attention mechanism considers only the correlation between the positions within the window and other positions, which ensures that local information is unaffected by the entire sequence. This makes it easier for the model to capture local features and patterns. Moreover, the stride of the sliding window can be adjusted, and the sliding attention mechanism controls the degree of overlap between adjacent windows, affecting the continuity and smoothness of the computation and thereby enhancing the model's robustness and generalization performance.

Contrastive learning improves the generalization ability of a model through similarity comparisons between data. Typically, two samples are used as inputs, one as an anchor and the other as a positive sample, while a negative sample that is dissimilar to the anchor is selected from the dataset. The key is to map similar samples to nearby spatial positions and dissimilar samples to distant spatial positions [31]. Consequently, the selection of positive and negative sample pairs is critical. For example, U-representation [16] can generate positive samples from different time series with the same timestamp, the representation of a time-series being as close as possible to the subsequence from which it was selected; TNEN [5] can generate positive samples through temporal proximity to enhance the local smoothness of the representation; and URL-MTS can generate positive samples through data augmentation, with the expectation that the model learns representations that are invariant to such changes. However, these strategies are not well suited to time-series analysis. TS2Vec uses random pruning to obtain two overlapping segments of the time-series as positive and negative samples, but it cannot guarantee that the time span of the positive samples is within a reasonable range. To address these problems, this study proposes the use of an LSTM model to adaptively learn two reasonable segments of a time-series as positive and negative samples. The LSTM model can adaptively learn long-term dependencies in sequence data and relationships between multiple variables, enabling it to better capture the differences between positive and negative samples [11]. With the adaptive feature-learning ability of the LSTM model, the uncertainty of feature selection can be avoided, providing strong support for the subsequent training of the model.

The contributions of this study are as follows:

– A dual-dimensional feature extraction module that learns representations of time-series in both the channel and spatial dimensions is proposed. The module improves the model's understanding of and predictive ability on time-series data, thereby enhancing its effectiveness and robustness.
– The LSTM model is used to adaptively prune the input sequence to obtain positive and negative samples of suitable lengths. The LSTM model can adaptively learn long-term dependencies in a sequence, thereby avoiding the uncertainty and randomness caused by traditional random pruning.
– A self-correlation attention mechanism is introduced into the model, and a Fourier transform is used to convert the similarity calculation from the real domain to the frequency domain, thus better capturing the periodicity and trends in the data.
– Extensive experiments show that the proposed model exhibits excellent performance in long-term time-series prediction, outperforming current state-of-the-art methods. Moreover, under different evaluation settings, the proposed model exhibits a more stable predictive performance, further validating its practicality and reliability.

2 Related Works

2.1 Contrastive Learning

Contrastive learning is an unsupervised learning method aimed at obtaining effective feature representations by learning similarities and differences between samples. Based on this concept, a discriminative representation learning framework can be constructed by determining similar and dissimilar positive and negative samples using the designed model structure, while a contrastive loss function is used to maximize the similarity of the positive samples and minimize the similarity of the negative samples in order to optimize the model parameters. This method promotes better differentiation of sample feature representations, thereby achieving clustering [23].

With the rapid development of deep learning technology, researchers have proposed contrastive deep-learning-based methods. For example, the Siamese CNN model [6] can simultaneously process image and text inputs and learn their similarities through contrastive learning. The deep ranking model [24] uses contrastive learning to learn the relative order and similarity between images. Contrastive structured world models [12] can learn the structural representation of objects from visual inputs and understand the similarities between objects through contrastive learning. These methods typically use deep learning models, such as CNNs or RNNs, to learn representations of time-series data and have achieved good results in tasks such as time-series prediction, anomaly detection, and classification.

This study applies contrastive learning methods to long-term time-series prediction and proposes a new model structure for representation learning of time-series data. The model uses different dimensions to synchronously model the data and learn the similarities between time-series data, and it further increases the complexity of the model, which helps the model better fit the data and improves its prediction accuracy.
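To make the loss-based formulation above concrete, the following is a minimal PyTorch sketch of a pairwise contrastive (InfoNCE-style) objective. It is illustrative only: the function name, the temperature value, and the batch layout are our assumptions and are not taken from the cited works.

    import torch
    import torch.nn.functional as F

    def info_nce_loss(anchor, positive, negatives, temperature=0.1):
        # anchor, positive: (B, D); negatives: (B, K, D)
        anchor = F.normalize(anchor, dim=-1)
        positive = F.normalize(positive, dim=-1)
        negatives = F.normalize(negatives, dim=-1)
        pos_sim = (anchor * positive).sum(dim=-1, keepdim=True) / temperature    # (B, 1)
        neg_sim = torch.einsum('bd,bkd->bk', anchor, negatives) / temperature    # (B, K)
        logits = torch.cat([pos_sim, neg_sim], dim=1)                            # (B, 1 + K)
        # the positive example sits at index 0 of every row of logits
        labels = torch.zeros(anchor.size(0), dtype=torch.long, device=anchor.device)
        return F.cross_entropy(logits, labels)

Minimizing this loss pulls each anchor toward its positive sample and pushes it away from the negatives, which is the behaviour described in the paragraph above.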

2.2 Self-attention Mechanism

The self-attention mechanism is a widely used attention mechanism in deep learning that can be used to model correlations between sequential data. It allows the model to dynamically calculate the similarity between representations of different positions in the sequence and uses these similarities as weights for information-weighted combinations to better capture long-term sequence dependencies.


Fig. 1. Three traditional time-series data pruning methods and the LSTM adaptive pruning.

For sequential data, the self-attention mechanism exhibits powerful capabilities. For example, in the transformer model, the multi-head self-attention mechanism can divide the input into several groups and perform self-attention calculations for each group separately, which can capture the dependency relationships between different focus points in the input and improve the model's expressive power [32]. The Swin transformer [15] uses a local self-attention mechanism to reduce complexity and constructs hierarchical feature maps by merging image patches. The Conformer [18] uses a self-attention mechanism to learn global interactions and convolutional modules to capture local features of relative offsets, combining these two modules. Consequently, these models have achieved advanced performance in various tasks, inspiring our study. In the process of time-series modeling, the modeling of both local and global relationships is particularly important and can help the model better capture information in the sequence and improve its predictive performance. This study considered both local and global contexts and modelled them from different perspectives through a self-attention mechanism to effectively use the input information.

2.3 Data Pruning

The construction of positive and negative sample pairs is critical in contrastive learning. As illustrated in Fig. 1, the traditional construction methods mainly include the following:


(1) Subsequence consistency [9]: encourages the representation of the time-series to be close to that of its sampled subsequences.
(2) Temporal consistency [20]: enforces the local smoothness of the representation by selecting adjacent segments as positive samples.
(3) Contextual consistency: for any input time-series, contextual representations are formed by randomly sampling two overlapping time intervals, where the overlapping portion serves as a positive sample pair (a sketch of this sampling scheme is given after this list).

These strategies are based on strong assumptions regarding the data distribution and are not suitable for time-series data. Subsequence consistency assumes that the sampled subsequences represent the features of the entire sequence; however, in some cases a sampled subsequence may not fully represent the characteristics of the sequence, resulting in lost or inaccurate representations. Similarly, temporal consistency may excessively emphasize local smoothness, resulting in an inadequate representation of global features. To address these problems, this paper proposes a method that uses LSTM adaptive pruning to generate positive and negative sample pairs.
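As a reference point for the contextual-consistency strategy in item (3), the sketch below samples two random, overlapping crops of a series with NumPy; the minimum crop length and the order in which the boundaries are drawn are our assumptions.

    import numpy as np

    def sample_overlapping_crops(x, min_len=64):
        # Sample two crops [a1, b1) and [a2, b2) with a1 <= a2 < b1 <= b2,
        # so that both crops and their overlap [a2, b1) are at least min_len long.
        T = len(x)
        a1 = np.random.randint(0, T - min_len)
        b2 = np.random.randint(a1 + min_len, T + 1)
        a2 = np.random.randint(a1, b2 - min_len + 1)
        b1 = np.random.randint(a2 + min_len, b2 + 1)
        return x[a1:b1], x[a2:b2], (a2, b1)   # the overlap region is x[a2:b1]

The overlapping segment can then be used as the positive pair, while non-overlapping timestamps (or other series) supply negatives.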

Fig. 2. The time-series data is first processed through LSTM adaptive pruning, before being parallelized across the channel dimension, including the self-attention mechanism and dilated convolution, and across the spatial dimension, including the sliding-window attention mechanism. The contrastive loss is then calculated, and the predicted values are obtained by the trained regression function.

3 Methodology

Here, we discuss the problem of predicting long time-series: given n observations of a time-series, the goal is to predict the next m observations. Compared with traditional supervised learning methods, contrastive learning methods can learn better feature representations from unlabeled data, thereby improving prediction accuracy and robustness. However, long time-series often contain complex temporal relationships and contextual information, which make it difficult to capture contextual consistency and to extract features that meet the requirements of representation learning. To address this, we propose a dual-dimensional feature extraction model based on contrastive learning, as shown in Fig. 2. The time-series data are first subjected to LSTM adaptive pruning to obtain two subsequences. These two subsequences are then fed in parallel into the channel and spatial dimensions of the model for representation learning and feature extraction. Finally, a linear regression model is trained using the L2-norm penalty; it takes the last timestamp of the input time-series as the starting point and directly predicts the m future observations. Section 3.1 introduces the LSTM adaptive pruning module, which cuts out two overlapping subsequences, and Sect. 3.2 details the dual-dimensional feature extraction module, which extracts features in different dimensions to characterize the time-series more accurately and improve prediction accuracy.
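The forecasting head described above can be sketched with scikit-learn's Ridge regression, assuming (as is common in this line of work) that the regressor is fed the learned representation at the last input timestamp; the function name and the penalty strength are illustrative.

    import numpy as np
    from sklearn.linear_model import Ridge

    def fit_forecast_head(last_reprs, future_values, alpha=0.1):
        # last_reprs:    (N, D) representation of the last timestamp for N training windows
        # future_values: (N, m) the m observations that follow each window
        head = Ridge(alpha=alpha)   # alpha is the L2 penalty strength (assumed hyperparameter)
        head.fit(last_reprs, future_values)
        return head

    # usage sketch:
    # head = fit_forecast_head(train_reprs, train_targets)
    # y_hat = head.predict(test_reprs)   # (N_test, m) forecasts

Because Ridge supports multi-output targets, all m horizons are predicted directly from a single fitted head.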

3.1 LSTM Adaptive Pruning Method

The LSTM model has a memory unit and gate mechanisms that store and update past information and control the flow of information, respectively, avoiding information loss during subsequent selection. During data pruning, the LSTM model is used to model the sequence data, and the optimal segmentation point is selected based on the output of the model. The sequence data are then divided into subsequences, each containing the key information required by the model. This reduces the bias between each subsequence and the global representation, making it more representative.

To achieve this, we designed a subsequence-generation module based on an LSTM model. Specifically, the module first maps the input sequence to a hidden-state sequence using the LSTM model and then maps the hidden-state sequence to an output sequence through a fully connected layer. To generate two subsequences, the model compresses each element of the output sequence into the range 0 to 1 using the sigmoid function and selects a position at which to divide the sequence into two subsequences. The LSTM model computes the probability of each position being a splitting point and marks positions with probability values below a threshold as invalid. The model then selects the average of all valid positions as the splitting point and divides the input sequence into a first and a second half. The module also has a hyperparameter, the overlap size, which controls the size of the overlapping area between the two subsequences.

The input sequence is denoted as x ∈ R^{B×T×D}, where B denotes the batch size, T denotes the sequence length, and D denotes the feature dimension at each time step. The forward computation of the module can be expressed as follows:

    p_t = FC(LSTM(x_t)),   p_t[: t/2 − k],   p_t[t/2 + k :]        (1)

    i = argmax(mean(p_t, −1))                                       (2)

    s_1 = x_{1:i+1},   s_2 = x_{i−k:}                               (3)

where p_t denotes the corresponding segmentation probability, i denotes the position of the segmentation point, s_1 and s_2 denote the subsequences resulting from the segmentation, mean denotes an averaging function, [: t/2 − k] covers all elements from the beginning of the sequence to position t/2 − k, and [t/2 + k :] covers all elements from position t/2 + k to the end of the sequence.
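A minimal PyTorch sketch of this subsequence-generation module is given below. It assumes that the slices p_t[: t/2 − k] and p_t[t/2 + k :] in Eq. (1) are treated as invalid split positions (so the chosen split stays near the middle of the sequence and the two halves overlap by roughly 2k steps); the hidden size, the batch-averaging of scores, and the non-differentiable argmax selection are also our assumptions.

    import torch
    import torch.nn as nn

    class AdaptiveTruncation(nn.Module):
        # Sketch of Eqs. (1)-(3): split x (B, T, D) into s1 = x[:, :i+1] and s2 = x[:, i-k:].
        def __init__(self, d_in, d_hidden=64, k=8):
            super().__init__()
            self.lstm = nn.LSTM(d_in, d_hidden, batch_first=True)
            self.fc = nn.Linear(d_hidden, 1)
            self.k = k

        def forward(self, x):                                   # x: (B, T, D), assume T >= 4k
            h, _ = self.lstm(x)                                 # (B, T, H)
            p = torch.sigmoid(self.fc(h)).squeeze(-1)           # (B, T) split probabilities, Eq. (1)
            T, k = x.size(1), self.k
            band = p[:, T // 2 - k : T // 2 + k]                # only the central band is a valid split
            i = T // 2 - k + band.mean(dim=0).argmax().item()   # Eq. (2), batch-averaged scores
            s1, s2 = x[:, : i + 1], x[:, i - k :]               # Eq. (3), overlap of about 2k + 1 steps
            return s1, s2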

3.2 Two-Dimensional Feature Extraction Module

Feature extraction is the process of transforming an input time-series into a fixed-dimensional vector representation that can be used to measure the similarity between different time-series [36]. As a key step in contrastive learning, feature extraction directly affects the subsequent similarity measurement and comparison. The goal is to convert the original time-series into a vector representation containing its important features. This vector should have a certain degree of invariance, so that the same features yield the same representation in different time-series. By comparing the feature representations of different time-series, their similarity can be determined and used for contrastive learning.

Contrastive predictive coding (CPC) [17] is a contrastive learning method based on autoencoders that learns the representation of a time-series by predicting its future subsequences. However, CPC has obvious shortcomings, such as the need to segment the time-series and performance degradation over long time-series. By contrast, the multiscale convolutional siamese network (MC-Siamese) [7] is a contrastive learning method designed for multiscale time-series. It uses multiple convolutional layers to extract features from the input time-series and multiple similarity measures to compare feature representations at different scales, but it requires tuning many hyperparameters to optimize model performance. To address these problems, we introduce the self-attention mechanism of the transformer into contrastive learning, leveraging its powerful global perception ability to model long time-series more effectively and thereby avoiding the segmentation problem in CPC. This method combines the autocorrelation attention mechanism and the multi-head mechanism of the transformer model to capture the overall structure of the time-series.

In recent years, transformer-based methods have achieved remarkable results in long-term sequence prediction tasks; however, they do not model local features in a targeted way, as CNN structures do. This study focuses on applying the transformer model to time-series prediction tasks to improve prediction accuracy and efficiency. To address both local features and global correlations in time-series data, we propose a new method that uses the small receptive fields of convolution to mitigate the information-redundancy problem in the global modeling of the transformer, thereby improving the prediction performance and generalization ability of the model. The method combines the sliding-window approach with a self-attention mechanism to capture local features. It was extensively validated on multiple time-series prediction tasks, and the results show that it achieves excellent performance in terms of both prediction accuracy and computational efficiency.

Fig. 3. Structural diagram of the self-correlation attention mechanism (the input's query, key, and value branches are transformed by the Fourier transform, combined, passed through time-delay aggregation and the inverse Fourier transform, and merged with the input through feature fusion to produce the output).

The proposed method encodes the sequence data in the channel dimension using a combination of a self-attention mechanism and dilated convolution to extract the key features of the sequence. The self-attention mechanism models the global dependencies in the data, that is, the correlation between different time steps, capturing long-term trends and periodic changes in the time-series. Dilated convolution expands the receptive field to extract more contextual information. Combining the two allows global features to be learned more effectively and improves long-term prediction performance.

In the self-attention mechanism, similarity calculations between query and key vectors are usually performed in the real domain using a dot product or other similarity metrics. However, this approach cannot capture complex relationships in the data, such as the periodicity and trends of time-series. The Fourier transform decomposes a signal into a series of frequency components represented by sine and cosine functions; by comparing the spectral differences of signals at different frequencies, signal similarity can be assessed more reliably. We therefore introduce the Fourier transform to convert the similarity calculation from the time domain to the frequency domain, using an autocorrelation attention mechanism to capture the periodicity and trend characteristics of the data more effectively. Figure 3 shows how the subsequences obtained through LSTM adaptive pruning are transformed into the frequency domain using a Fourier transform to capture data features; an inverse Fourier transform is then applied to convert the signal back to the time domain and merge the frequency components into the original signal. The experimental results show that replacing the standard self-attention mechanism with the autocorrelation attention mechanism greatly improves the model's performance in long-term prediction.
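A compact sketch of the frequency-domain similarity computation is shown below; it uses the Wiener-Khinchin identity to obtain lag-wise correlation scores and omits the time-delay aggregation step drawn in Fig. 3. The tensor shapes and function names are our assumptions.

    import torch

    def autocorrelation_scores(q, k):
        # q, k: (B, T, D). Returns per-lag correlation scores of shape (B, T, D);
        # corr = IFFT(FFT(q) * conj(FFT(k))) replaces the dot-product similarity
        # of ordinary self-attention with a frequency-domain comparison.
        q_f = torch.fft.rfft(q, dim=1)
        k_f = torch.fft.rfft(k, dim=1)
        corr = torch.fft.irfft(q_f * torch.conj(k_f), n=q.size(1), dim=1)
        return corr   # large values at lag tau indicate strong period-tau similarity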

greatly improve a model’s performance in long-term prediction. Additionally, an extended CNN structure with residual connections and dilated convolution can be used to extract context representations for each timestamp. Each residual block contains two 1-D convolution layers—the dilation factor of the lth block being 2∧ l—providing a larger receptive field for different layers to further enhance the performance. The overall computational process for the channel can be expressed as follows: ⎧ ⎨z = SameP adConv(x, Cin , Cout , k, d) x = GELU (z) (4) ⎩ r = P (x)  Conv1d(Cin , Cout , 1) Cin = Cout (5) P = otherwise where x denotes the intermediate feature map after processing using the GELU activation function, r denotes the residual connection input after undergoing channel transformation through 1 × 1 convolution, SameP adConv denotes the dilated convolution operation on the input feature map after padding, and Conv1d denotes the 1 × 1 convolution operation. In the spatial dimension, the module uses a sliding-window self-attention mechanism to encode sequence data. The sliding-window autocorrelation attention mechanism helps the model learn local features in the sequence data, thereby improving its generalization ability and accuracy. The algorithm for the slidingwindow self-attention mechanism is introduced in the pseudocode to illustrate the computation process in the spatial dimension. Assuming there is a time-series x of length n, and each time step has a feature vector xi of dimension d, then using a sliding window of size k to process x, we can calculate the self-attention representation for each time step. The pseudocode for the sliding-window self-attention mechanism algorithm is shown in Table 1. The formula for calculating the attention scores can be expressed as follows: score = qiT kj

(6)
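Under the reconstruction of Eqs. (4)-(5) above, one possible PyTorch implementation of a single residual block is sketched below; the two stacked convolutions per block follow the description in the text, while the kernel size, activation placement, and padding rule are assumptions. When blocks are stacked, the dilation of the l-th block would be set to 2^l.

    import torch.nn as nn

    class SamePadConv(nn.Module):
        # 1-D convolution that keeps the sequence length ("same" padding, odd kernel sizes).
        def __init__(self, c_in, c_out, kernel_size, dilation):
            super().__init__()
            padding = (kernel_size - 1) * dilation // 2
            self.conv = nn.Conv1d(c_in, c_out, kernel_size, padding=padding, dilation=dilation)

        def forward(self, x):            # x: (B, C, T)
            return self.conv(x)

    class DilatedResidualBlock(nn.Module):
        # Sketch of Eqs. (4)-(5): two dilated convolutions with GELU, plus a 1x1 projection P
        # on the residual path when the channel count changes.
        def __init__(self, c_in, c_out, kernel_size=3, dilation=1):
            super().__init__()
            self.conv1 = SamePadConv(c_in, c_out, kernel_size, dilation)
            self.conv2 = SamePadConv(c_out, c_out, kernel_size, dilation)
            self.act = nn.GELU()
            self.proj = nn.Conv1d(c_in, c_out, 1) if c_in != c_out else nn.Identity()  # Eq. (5)

        def forward(self, x):
            z = self.act(self.conv1(x))
            z = self.act(self.conv2(z))
            return z + self.proj(x)      # residual connection r = P(x)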

In the spatial dimension, the module uses a sliding-window self-attention mechanism to encode the sequence data. The sliding-window attention mechanism helps the model learn local features in the sequence, thereby improving its generalization ability and accuracy. Assume a time-series x of length n in which each time step has a feature vector x_i of dimension d; using a sliding window of size k to process x, we can calculate the self-attention representation of each time step. The pseudocode for the sliding-window self-attention algorithm is shown in Table 1. The attention scores are calculated as

    score_{i,j} = q_i^T k_j                                                (6)

and the attention weights as

    α_{i,j} = exp(score_{i,j}) / Σ_{j=1}^{k} exp(score_{i,j})              (7)

where q_i = W_q x_i is the query vector, k_j = W_k x_j is the key vector, and v_j = W_v x_j is the value vector of the input sequence.


Table 1. The sliding-window self-attention mechanism algorithm.

    Input: time-series x, sliding-window size k
    for i = 1 to n do
        // compute attention scores
        scores = []
        for j = max(1, i-k+1) to i do
            score = dot_product(Wq * x_i, Wk * x_j)
            scores.append(score)
        // compute attention weights
        attention = softmax(scores)
        // compute attention representation
        self_attention = 0
        for j = max(1, i-k+1) to i do
            self_attention += attention[j-i+k-1] * (Wv * x_j)
        output[i] = self_attention

Finally, the encodings from the channel and spatial dimensions are added with different weights to obtain the final representation of the sequence data. Overall, the model combines the autocorrelation attention mechanism, dilated convolution, and the sliding-window self-attention mechanism, fully utilizing the global and local information in the sequence data and improving the model's performance and generalization ability.

Table 2. Evaluation metrics.

Metric                       Method
Mean Square Error (MSE)      MSE = (1/n) Σ_{i=1}^{n} (y_i − ŷ_i)^2
Mean Absolute Error (MAE)    MAE = (1/n) Σ_{i=1}^{n} |y_i − ŷ_i|
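For completeness, the two metrics in Table 2 can be computed directly as follows (NumPy sketch):

    import numpy as np

    def mse(y, y_hat):
        return np.mean((y - y_hat) ** 2)

    def mae(y, y_hat):
        return np.mean(np.abs(y - y_hat))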

4 Experiments

4.1 Experimental Setup

In this section, the experimental results are presented to demonstrate the performance of the proposed model. The experiments were conducted on five datasets, of which the first four were real-world industrial datasets and the fifth was a commonly used benchmark dataset. Single-variate time-series long-term predictions were performed for each dataset. The evaluation metrics are listed in Table 2.

4.2 Details of the Datasets

Electricity (https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014): The Electricity dataset is a widely used benchmark in the time-series domain for evaluating the performance of time-series prediction algorithms. It recorded hourly electricity consumption data from 2004 to 2005, including 321 days of data with 24 data points per day. Electricity datasets have become standard practice for evaluating the performance of time-series prediction algorithms and have been widely used in various time-series prediction tasks.

Table 3. Comparison with other unsupervised representation learning methods (each cell reports MSE/MAE).

Dataset      Horizon   Ours          TS2Vec        SimTS         CoST          TNC
ETTh1        96        0.086/0.223   0.101/0.254   0.089/0.231   0.092/0.237   0.143/0.299
             192       0.093/0.233   0.123/0.295   0.094/0.235   0.104/0.245   0.159/0.321
             336       0.101/0.245   0.160/0.316   0.100/0.239   0.112/0.258   0.179/0.345
             720       0.125/0.278   0.179/0.345   0.126/0.277   0.148/0.306   0.235/0.408
ETTh2        96        0.148/0.299   0.164/0.300   0.149/0.301   0.168/0.313   0.111/0.259
             192       0.172/0.328   0.198/0.355   0.193/0.342   0.190/0.340   0.199/0.355
             336       0.180/0.338   0.205/0.364   0.199/0.354   0.206/0.360   0.207/0.366
             720       0.204/0.369   0.208/0.371   0.212/0.370   0.214/0.371   0.207/0.370
ETTm1        96        0.038/0.148   0.045/0.162   0.041/0.143   0.038/0.147   0.054/0.178
             192       0.054/0.185   0.077/0.224   0.085/0.190   0.070/0.191   0.086/0.225
             288       0.076/0.212   0.095/0.235   0.098/0.207   0.077/0.209   0.142/0.290
             672       0.102/0.241   0.142/0.290   0.117/0.242   0.113/0.257   0.136/0.290
ETTm2        96        0.082/0.222   0.089/0.225   0.068/0.189   0.072/0.196   0.094/0.229
             192       0.056/0.176   0.091/0.241   0.071/0.240   0.069/0.239   0.083/0.274
             288       0.126/0.276   0.161/0.306   0.160/0.272   0.153/0.307   0.155/0.09
             672       0.163/0.318   0.201/0.351   0.249/0.334   0.183/0.329   0.197/0.352
Electricity  96        0.269/0.386   0.332/0.346   0.284/0.299   0.396/0.382   0.454/0.501
             192       0.312/0.407   0.379/0.401   0.325/0.370   0.423/0.494   0.562/0.572
             336       0.329/0.427   0.565/0.535   0.523/0.522   0.674/0.655   0.875/0.835
             720       0.417/0.486   0.861/0.902   0.847/0.832   0.975/0.982   1.132/0.987

Electric Power Transformer Temperature (ETT, https://github.com/liyaguang/ETDataset): The ETT dataset provides two consecutive years of power data from two different regions of the same province in China. It includes two variants with different granularities: hourly data (ETTh1 and ETTh2) and 15-min data (ETTm1 and ETTm2). Each data point contains eight features, including the recording date, the predicted "oil temperature," and six different types of external load values. The training, validation, and test splits cover 12, 4, and 4 months, respectively. This dataset can be used for the research and evaluation of power-load forecasting tasks and has become an important benchmark in this field.

4.3 Experimental Results

Baseline: The proposed model was compared with several advanced methods developed in recent years, divided into two categories: time-series prediction using contrastive learning (SimTS [35], TS2Vec, TNC [20], and CoST [25]) and end-to-end time-series prediction methods (FEDformer [38], Autoformer [8], Informer [37], and LogTrans [13]). Each method treats the prediction as a single variable in a time-series. Tables 3 and 4 present the results of these time-series prediction methods on the five datasets.

Table 4. Comparison with other end-to-end time-series prediction methods (each cell reports MSE/MAE).

Dataset      Horizon   Ours          FEDformer     Autoformer    Informer      LogTrans
ETTh1        96        0.086/0.223   0.079/0.215   0.071/0.206   0.193/0.377   0.283/0.468
             192       0.093/0.233   0.104/0.245   0.114/0.262   0.217/0.395   0.234/0.409
             336       0.101/0.245   0.119/0.270   0.107/0.258   0.202/0.381   0.386/0.546
             720       0.125/0.278   0.142/0.299   0.126/0.283   0.183/0.355   0.475/0.628
ETTh2        96        0.148/0.299   0.128/0.271   0.153/0.306   0.213/0.373   0.217/0.379
             192       0.172/0.328   0.182/0.330   0.204/0.351   0.227/0.387   0.281/0.429
             336       0.180/0.338   0.231/0.378   0.246/0.389   0.242/0.401   0.293/0.437
             720       0.204/0.369   0.278/0.420   0.268/0.409   0.291/0.439   0.218/0.387
ETTm1        96        0.038/0.148   0.033/0.140   0.056/0.183   0.109/0.277   0.049/0.171
             192       0.054/0.185   0.058/0.186   0.081/0.216   0.151/0.310   0.157/0.317
             288       0.076/0.212   0.084/0.232   0.081/0.225   0.089/0.237   –/–
             672       0.102/0.241   0.113/0.287   0.109/0.250   0.117/0.293   –/–
ETTm2        96        0.082/0.222   0.067/0.198   0.065/0.189   0.088/0.225   0.075/0.208
             192       0.056/0.176   0.110/0.245   0.118/0.256   0.132/0.283   0.129/0.275
             288       0.126/0.276   0.134/0.293   0.131/0.289   0.151/0.304   –/–
             672       0.163/0.318   0.194/0.357   0.179/1.337   0.204/0.385   –/–
Electricity  96        0.269/0.386   0.262/0.378   0.341/0.438   0.258/0.368   0.288/0.393
             192       0.312/0.407   0.316/0.410   0.345/0.428   0.285/0.388   0.432/0.483
             336       0.329/0.427   0.361/0.445   0.406/0.470   0.336/0.423   0.430/0.483
             720       0.417/0.486   0.448/0.501   0.545/0.581   0.607/0.599   0.491/0.531


Table 3 compares the results of the proposed method with advanced unsupervised learning models developed in recent years. The data show that the proposed method achieves advanced performance in most cases, particularly in long-term prediction. For example, when the prediction length is 672, the proposed method achieves improvements of 18.9%, 34.5%, 10.9%, and 17.3% compared with TS2Vec, SimTS, CoST, and TNC, respectively, on the ETTm2 dataset. For the other datasets, the proposed method also achieves considerable performance improvements at longer prediction lengths. For shorter prediction lengths, the improvements over the other methods are smaller. This is because the proposed self-attention mechanism models global dependencies and therefore predicts long-term trends more accurately.

Table 4 compares the proposed method with end-to-end time-series prediction models commonly used in recent years; the results show that the proposed model achieves competitive prediction accuracy. Especially when predicting time-series with lengths of 672 and 720, the improvement is significant. The dual-dimensional module can better capture the correlation between features through embedding and interaction in the feature dimension, thus predicting the future trend of the time-series more accurately.

A sensitivity analysis of several adjustable parameters was also conducted.

(1) When using the LSTM model for subsequence generation, the length of the common subsequence must be specified in advance to ensure that the two subsequences overlap. We studied the effect of this length; the results in Fig. 4 indicate that when the overlap window is too short, the model cannot obtain rich neighboring-sample information, limiting its ability to learn the representation of the time-series. When the overlap window is too long, the model focuses too much on positive samples, leading to overfitting or weak generalization. The experiments show that the predictive ability of the model is optimal when the overlap window is approximately 200.

(2) When fusing the features extracted from the channel and spatial dimensions, different weights are assigned to the two dimensions, with the weight of the channel dimension set to alpha and the weight of the spatial dimension set to 1 − alpha. A sensitivity analysis over alpha was conducted on the ETTh1 and ETTh2 datasets, with the results shown in Fig. 5. When the channel-dimension weight is either too large or too small, the effect is not ideal; the best performance is achieved when it is set to approximately 0.6. In long-term prediction tasks, the global information obtained from the channel dimension is more important for trend prediction.

4.4 Ablation Study

Numerous ablation experiments were conducted to verify the effectiveness of each component of the proposed model. Specifically, using the proposed model as the baseline, various components were removed and the remaining model was compared with the baseline to analyze the importance of each module based on the experimental results.

Fig. 4. Effects of different overlapping lengths in the LSTM subsequence-consistency pruning method (panels for prediction lengths H = 24, 96, 288, and 672). When the overlapping window is around 200, the predictive ability of the model reaches its optimal level.

Fig. 5. Sensitivity of the channel-dimension weight alpha on the ETTh1 and ETTh2 datasets (MSE for prediction lengths H = 192, 336, and 720). The best effect is achieved when the weight of the channel dimension is set to around 0.6.

Table 5. Ablation experiments of the LSTM subsequence generation module.

Prediction length                    96      192     336     720
Subsequence consistency      MSE     0.102   0.121   0.135   0.154
                             MAE     0.275   0.298   0.299   0.341
Temporal consistency         MSE     0.098   0.117   0.128   0.150
                             MAE     0.271   0.288   0.290   0.339
Contextual consistency       MSE     0.092   0.105   0.112   0.142
                             MAE     0.246   0.269   0.272   0.307
LSTM adaptive trunc (ours)   MSE     0.086   0.093   0.101   0.128
                             MAE     0.223   0.233   0.245   0.284


Table 6. Ablation experiments of the dual-dimensional feature extraction module.

Prediction length             96      192     336     720
Channel dimension     MSE     0.108   0.115   0.126   0.142
                      MAE     0.267   0.286   0.298   0.315
Spatial dimension     MSE     0.104   0.107   0.119   0.136
                      MAE     0.271   0.285   0.293   0.303
Ours                  MSE     0.086   0.093   0.101   0.128
                      MAE     0.223   0.233   0.245   0.284

Analysis of the effectiveness of the LSTM adaptive pruning module: The LSTM adaptive pruning module is a sequence-data pruning method based on the LSTM model. It adaptively divides a long sequence into two subsequences to form positive and negative sample pairs for contrastive learning. Here, the LSTM adaptive pruning module is compared with subsequence consistency, temporal consistency, and random-pruning (contextual) consistency on the ETTh1 dataset. The experimental results are listed in Table 5. Using the LSTM model for adaptive pruning improves the performance by approximately 18%, 20%, and 10% compared with the subsequence-consistency, temporal-consistency, and random-pruning consistency methods, respectively. Overall, the LSTM adaptive pruning module has unique advantages for processing sequence data and can provide better subsequence-generation results. In the future, applications of the module to tasks such as sequence classification, sequence generation, and sequence completion can be further explored. Additionally, more advanced adaptive pruning methods, such as attention-based adaptive pruning, can be studied to improve the efficiency and accuracy of sequence data processing.

Analysis of the effectiveness of the dual-dimensional feature extraction module: The dual-dimensional feature extraction module is a new feature extraction method that aims to extract feature information simultaneously from multiple dimensions for a better data representation. We conducted an effectiveness analysis by comparing it with other feature extraction settings. Three scenarios are presented for comparison. The first scenario uses only the autocorrelation attention mechanism and dilated convolution to extract features in the channel dimension; it does not consider information in the spatial dimension. The second scenario uses only the sliding-window attention mechanism to extract features in the spatial dimension; it does not consider information in the channel dimension. The third scenario synchronously models the channel and spatial dimensions and adds their results with different weights assigned to each dimension; it considers feature extraction in both dimensions and combines them with a specific weight-allocation strategy. The experimental results for the ETTh1 and ETTh2 datasets are presented in Table 6. Overall, the dual-dimensional feature extraction module demonstrates good feature extraction ability across multiple datasets. It simultaneously considers feature extraction in both the channel and spatial dimensions and comprehensively describes the data; through a specific weight-allocation strategy, it then combines the feature information from both dimensions to improve the efficiency and accuracy of feature extraction.

5 Conclusions

This study proposed a dual-dimensional feature extraction framework based on contrastive learning that extracts features from both the channel and spatial dimensions of time-series data. In the channel dimension, the self-attention mechanism is used to perform global correlation on the data, and convolution is then used to further extract sequence features, thus improving the accuracy of the model's learning in the channel dimension. In the spatial dimension, the sliding-window attention mechanism adaptively weighs the data in each window, complementing the local features ignored by global representation learning in the channel dimension and thereby learning more comprehensive time-series features. Dual-dimensional synchronous modeling enables the model to achieve a good balance between global and local information, improving its feature extraction ability and achieving better performance. To better construct positive and negative pairs in contrastive learning, we designed an LSTM-based data-pruning strategy that uses the LSTM's traversing learning ability on time-series data to adaptively select subsequence pairs that are more beneficial for subsequent model training. Numerous experiments demonstrated the effectiveness of the proposed modeling method for long-term prediction tasks. In the future, we will further investigate the application of the self-attention mechanism and convolution in contrastive learning, seeking more effective fusion methods to promote the development of time-series prediction tasks.

Acknowledgements. This work is supported by the National Natural Science Foundation of China (62272281), the Special Funds for Taishan Scholars Project (tsqn202306274), and the Youth Innovation Technology Project of Higher School in Shandong Province (2019KJN042).

References 1. Albawi, S., Mohammed, T.A., Al-Zawi, S.: Understanding of a convolutional neural network. In: International Conference on Engineering and Technology (ICET), pp. 1–6. IEEE (2017) 2. Angryk, R.A., et al.: Multivariate time series dataset for space weather data analytics. Sci. Data 7(1), 227 (2020) 3. Beck, N., Katz, J.N.: Modeling dynamics in time-series-cross-section political economy data. Ann. Rev. Polit. Sci. 14, 331–352 (2011)


4. B¨ orjesson, L., Singull, M.: Forecasting financial time series through causal and dilated convolutional neural networks. Entropy 22(10), 1094 (2020) 5. Bose, A.J., Ling, H., Cao, Y.: Adversarial contrastive estimation. In: International Conference on Machine Learning (2018) 6. Bromley, J., Guyon, I., LeCun, Y., S¨ ackinger, E., Shah, R.: Signature verification using a “siamese” time delay neural network. Adv. Neural Inf. Process. Syst. 6 (1993) 7. Chen, H., Wu, C., Du, B., Zhang, L.: Deep siamese multi-scale convolutional network for change detection in multi-temporal VHR images. In: 2019 10th International Workshop on the Analysis of Multitemporal Remote Sensing Images (MultiTemp), pp. 1–4. IEEE (2019) 8. Chen, M., Peng, H., Fu, J., Ling, H.: Autoformer: searching transformers for visual recognition. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 12270–12280 (2021) 9. Franceschi, J.Y., Dieuleveut, A., Jaggi, M.: Unsupervised scalable representation learning for multivariate time series. Adv. Neural Inf. Process. Syst. 32, 1–11 (2019) 10. Gasparin, A., Lukovic, S., Alippi, C.: Deep learning for time series forecasting: the electric load case. CAAI Trans. Intell. Technol. 7(1), 1–25 (2022) 11. Graves, A., Graves, A.: Supervised Sequence Labelling. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-24797-2 2 12. Kipf, T., Van der Pol, E., Welling, M.: Contrastive learning of structured world models. In: International Conference on Learning Representations (2019) 13. Li, S., et al.: Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting. Adv. Neural Inf. Process. Syst. 32, 1–11 (2019) 14. Liu, X., Guo, J., Wang, H., Zhang, F.: Prediction of stock market index based on ISSA-BP neural network. Expert Syst. Appl. 204, 117604 (2022) 15. Liu, Z., et al.: Swin transformer: hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021) 16. Ma, C., Wen, J., Bengio, Y.: Universal successor representations for transfer reinforcement learning. In: International Conference on Learning Representations (2018) 17. Oord, A.v.d., Li, Y., Vinyals, O.: Representation learning with contrastive predictive coding. In: Conference on Neural Information Processing Systems (2018) 18. Peng, Z., et al.: Conformer: local features coupling global representations for visual recognition. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 367–376 (2021) 19. Sezer, O.B., Gudelek, M.U., Ozbayoglu, A.M.: Financial time series forecasting with deep learning: a systematic literature review: 2005–2019. Appl. Soft Comput. 90, 106181 (2020) 20. Tonekaboni, S., Eytan, D., Goldenberg, A.: Unsupervised representation learning for time series with temporal neighborhood coding. In: International Conference on Learning Representations (2021) 21. Vaswani, A., et al.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30, 1–11 (2017) 22. Vlahogianni, E.I., Karlaftis, M.G.: Testing and comparing neural network and statistical approaches for predicting transportation time series. Transp. Res. Rec. 2399(1), 9–22 (2013) 23. Wang, J., Zhou, F., Wen, S., Liu, X., Lin, Y.: Deep metric learning with angular loss. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2593–2601 (2017)


24. Wang, J., et al.: Learning fine-grained image similarity with deep ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1386–1393 (2014) 25. Woo, G., Liu, C., Sahoo, D., Kumar, A., Hoi, S.: Cost: contrastive learning of disentangled seasonal-trend representations for time series forecasting. In: International Conference on Learning Representations (2022) 26. Wu, Z., Xiong, Y., Yu, S.X., Lin, D.: Unsupervised feature learning via nonparametric instance discrimination. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3733–3742 (2018) 27. Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. In: International Conference on Learning Representations (2016) 28. Yu, Y., Si, X., Hu, C., Zhang, J.: A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 31(7), 1235–1270 (2019) 29. Yue, Z., et al.: Ts2vec: towards universal representation of time series. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 8980–8987 (2022) 30. Zaremba, W., Sutskever, I., Vinyals, O.: Recurrent neural network regularization. In: International Conference on Learning Representations (2014) 31. Zhang, D., Han, J., Zhang, Y.: Supervision by fusion: towards unsupervised learning of deep salient object detector. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 4048–4056 (2017) 32. Zhang, F., Chen, G., Wang, H., Li, J., Zhang, C.: Multi-scale video super-resolution transformer with polynomial approximation. IEEE Trans. Circ. Syst. Video Technol. 33, 4496–4506 (2023). https://doi.org/10.1109/TCSVT.2023.3278131 33. Zhang, F., Guo, T., Wang, H.: DFNET: decomposition fusion model for long sequence time-series forecasting. Knowl.-Based Syst. 277, 110794 (2023) 34. Zhang, J., Nawata, K.: Multi-step prediction for influenza outbreak by an adjusted long short-term memory. Epidemiol. Infect. 146(7), 809–816 (2018) 35. Zheng, X., Chen, X., Sch¨ urch, M., Mollaysa, A., Allam, A., Krauthammer, M.: SimTS: rethinking contrastive representation learning for time series forecasting. In: International Joint Conference on Artificial Intelligence (2023) 36. Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A.: Learning deep features for discriminative localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2921–2929 (2016) 37. Zhou, H., et al.: Informer: beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 11106–11115 (2021) 38. Zhou, T., Ma, Z., Wen, Q., Wang, X., Sun, L., Jin, R.: Fedformer: frequency enhanced decomposed transformer for long-term series forecasting. In: International Conference on Machine Learning, pp. 27268–27286 (2022)

Task Scheduling with Multi-strategy Improved Sparrow Search Algorithm in Cloud Datacenters

Yao Liu, Wenlong Ni(B), Yang Bi, Lingyue Lai, Xinyu Zhou, and Hua Chen

School of Computer and Information Engineering, Jiangxi Normal University, Nanchang 330022, China
{liuy,wni,byang,laily,xyzhou,huachen}@jxnu.edu.cn

Abstract. How to efficiently schedule tasks is a central concern of cloud computing. Combining the characteristics of task scheduling in the cloud computing environment, a multi-strategy improved sparrow search algorithm (MISSA) that takes into account task completion time, task completion cost and load balancing is proposed. First, initializing the population with a piecewise linear chaotic map (PWLCM) enhances the dispersion of individuals. Then, the global search phase of the marine predator algorithm (MPA) is incorporated to increase the scope of the search space. A dynamic adjustment factor introduced into the joiner strengthens the search ability of the algorithm in the early stage and its convergence ability in the late stage. Finally, a greedy strategy is used to update the joiner's position so that the information of the optimal and worst solutions can be used to guide the next generation of position updates. Using CloudSim for simulation, the experimental results show that the proposed algorithm achieves a shorter task completion time and a more balanced system load. Compared with ant colony optimization (ACO), the MPA, and the sparrow search algorithm (SSA), MISSA improves the integrated fitness function values by 20%, 22%, and 17%, confirming the feasibility of the proposed algorithm.

Keywords: Sparrow Search Algorithm · Marine Predator Algorithm · Greedy Strategy · Cloud Computing · Multi-Objective Task Scheduling

1 Introduction

Modern society has entered the era of big data. The traditional computing model can no longer meet the processing needs of big data, and cloud computing has risen as an internet-based model. It provides on-demand access to a configurable pool of resources that can be provisioned and released quickly with very little administration or cloud-provider cooperation [1]. The task scheduling problem has been a hot issue in cloud computing research, and its core lies in how to reasonably manage and schedule user tasks and computing resources. In recent years, numerous studies have used heuristic algorithms to solve the task scheduling problem. The load balancing ant colony optimization algorithm (LB-ACO) proposed in [2] improves load balancing over other algorithms, but does not address the pheromone scarcity and slow convergence at the beginning of the iteration. The work in [3] combines cuckoo search (CS) and particle swarm optimization (PSO) to reduce task completion time, cost and violation rate, but does not consider the load balancing problem of virtual machines (VMs). The SSA is a new population optimization method proposed by Xue in 2020 [4], and it is now widely used in different fields: Abdulhammed [5] used SSA to improve load balancing for cloud computing IoT, and Qiu [6] applied an improved SSA to data compression for edge computing. These studies show that such optimization algorithms can significantly improve the accuracy and efficiency of their respective objectives.

In this paper, MISSA is proposed. To address the uneven population initialization of the original algorithm, PWLCM is introduced to increase the dispersion of the initial population. To counter the discoverer's tendency to converge toward zero, the global search of the marine predator algorithm is used to extend the discoverer's search space. To remedy slow convergence, a dynamic adjustment factor is introduced to strengthen convergence in the late iterations, and a greedy learning strategy is added to retain better-adapted individuals. Compared with the SSA, the main contributions of this paper are as follows:

• The PWLCM is introduced to increase the dispersion of individuals in the initial population.
• The global search method of the marine predator algorithm is introduced into the discoverer's update mechanism to expand the search space.
• A greedy strategy is added to obtain the relatively optimal position, and a dynamic adjustment factor improves the convergence speed.

The experiments simulate the task scheduling problem in cloud computing with the CloudSim software and compare MISSA with ACO, MPA and SSA, showing that MISSA outperforms the other algorithms in solution time and solution quality. The rest of this paper is organized as follows. Section 2 describes cloud computing task scheduling, the problem encoding, and the fitness function. Section 3 describes the proposed improvements to the sparrow search algorithm. Section 4 evaluates MISSA and the other algorithms through CloudSim simulations and analyzes the results. Finally, Sect. 5 presents conclusions and future work.

2 Problem Description

Distributed computing is the authorized access to a public pool of assets through an Internet service. With the number of cloud clients growing daily, task scheduling becomes a necessary issue to manage [7]. Task scheduling in cloud computing aims to improve the quality of service to users through a reasonable scheme of assigning tasks to nodes so that the total task completion cost is minimized. The task scheduling problem can be described as follows: assign n tasks to be executed on m VMs (where m is the number of VMs). The cloud task list is represented as CloudletList = {t_1, t_2, ..., t_n}, where t_i (i = 1, 2, ..., n) denotes task i; the VM list is represented as VMList = {vm_1, vm_2, ..., vm_m}, where vm_j (j = 1, 2, ..., m) denotes the j-th VM. A task can only be assigned to run on one VM. The scheduling algorithm generates an optimal resource allocation matrix T at the end of the iteration as the scheduling solution. Each element of T is denoted t_{i,j} (i = 1, 2, ..., n; j = 1, 2, ..., m), with value

    t_{i,j} = { 0, if task i is not assigned to vm_j,          (1)
              { 1, if task i is assigned to vm_j.

2.1 Encoder and Decoder

In an experiment simulating a cloud environment, the population consisting of d sparrows can be expressed in the following form:

    X = [ x_1^1  x_1^2  ···  x_1^n
          x_2^1  x_2^2  ···  x_2^n
          ···    ···    ···  ···
          x_d^1  x_d^2  ···  x_d^n ],          (2)

where the position of each individual sparrow is used as an allocation policy. Specifically, if the position of individual i during the iteration is x_i = {x_i^1, x_i^2, ..., x_i^n}, the corresponding task deployment scheme is d(x_i) = {|x_i^1|, |x_i^2|, ..., |x_i^n|}. Suppose we need to assign 5 tasks to 3 VMs numbered {0, 1, 2}, and the position of the i-th individual in the population at a certain time is {0.334, 1.75, 1.63, 0.83, 2.67}; using the SVP from [8], this is encoded as {0, 1, 1, 0, 2}. This assignment scheme indicates that the first task is assigned to the first VM, the second task to the second VM, ..., and the fifth task to the third VM.
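To make the encoding step concrete, the following minimal Python sketch maps a continuous position vector to VM indices. It assumes the truncation behaviour implied by the worked example above; the exact SVP rule of [8] may differ in details, and the function name and interface are illustrative.

    import numpy as np

    def decode_position(position, num_vms):
        """Map a sparrow's continuous position vector to a VM assignment.

        Each coordinate is truncated to an integer VM index and clipped to
        the valid range [0, num_vms - 1], matching the worked example."""
        idx = np.floor(np.abs(position)).astype(int)
        return np.clip(idx, 0, num_vms - 1)

    # Example from the text: 5 tasks, 3 VMs numbered {0, 1, 2}
    print(decode_position(np.array([0.334, 1.75, 1.63, 0.83, 2.67]), 3))
    # -> [0 1 1 0 2]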

2.2 Objective Function

The optimization objectives in this paper are to minimize (1) the task completion time and (2) the total execution cost, and to maximize (3) the average load factor. Task completion time, total execution cost and average load factor are therefore defined as follows.

(1) Task Completion Time. In the cloud task scheduling environment, the ETC (Expected Time to Complete) matrix is used to calculate the time required for all tasks to run to completion on the different VMs. The task completion time is that of the VM with the longest execution time:

    T_max = max_{j∈[1,m]} Σ_{i=1}^{k} ETC_{ij},          (3)

where k denotes the number of tasks executed on vm_j.

(2) Total Cost of Task Execution. The total cost of task execution is defined as the sum of the costs of all completed tasks:

    C_s = Σ_{i=1}^{n} ETC_{ij} · Cost_j,          (4)

where Cost_j denotes the cost per unit time of executing tasks on vm_j.

(3) Average System Load Ratio. The higher the average load factor of the system, the more fully the resources are utilized. The average load factor of the system is the mean of the load factors of the individual VMs:

    L_s = (1/m) Σ_{j=1}^{m} T_j / T_max,          (5)

where T_j denotes the total time that vm_j spends executing tasks and m is the total number of VMs.

2.3 Fitness Function

In the search algorithm of this paper, the lower the value of the fitness function, the better the optimization result. We use the linear weighting method to transform the multi-objective problem of Sect. 2.2 into a single-objective problem, as follows. The time, cost and load factor are first normalized using the constraint-handling method of [9]; the task completion time, total execution cost and load factor are treated as shown in Eq. (6):

    f_T = (T_max − T_Min) / (T_Max − T_Min),  f_C = (C_s − C_Min) / (C_Max − C_Min),  f_L = (l_r − l_Min) / (l_Max − l_Min),          (6)

where f_T, f_C and f_L are the time, cost and average load factor metrics after constraint processing; T_Min and T_Max denote the minimum and maximum task completion time; C_Min and C_Max denote the minimum and maximum total execution cost; and l_r, l_Max and l_Min are the inverse of L_s and the maximum and minimum load factors, respectively. The fitness function of this paper is then set as shown in Eq. (7):

    f = ω_1 · f_T + ω_2 · f_C + ω_3 · f_L.          (7)
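As a concrete illustration, the sketch below evaluates Eqs. (3)–(7) for one candidate schedule. The function signature, the way the normalisation bounds are passed in, and all variable names are ours, not the paper's.

    import numpy as np

    def evaluate(assign, etc, cost_per_time, w=(1/3, 1/3, 1/3), bounds=None):
        """Fitness of one schedule.  assign[i] = VM index of task i,
        etc[i, j] = expected time of task i on VM j.  `bounds` holds
        (T_min, T_max, C_min, C_max, l_min, l_max) tracked over the population."""
        n, m = etc.shape
        vm_time = np.zeros(m)
        total_cost = 0.0
        for i, j in enumerate(assign):
            vm_time[j] += etc[i, j]
            total_cost += etc[i, j] * cost_per_time[j]
        t_max = vm_time.max()                        # Eq. (3)
        load = (vm_time / t_max).mean()              # Eq. (5)
        Tmin, Tmax, Cmin, Cmax, lmin, lmax = bounds
        f_t = (t_max - Tmin) / (Tmax - Tmin)         # Eq. (6)
        f_c = (total_cost - Cmin) / (Cmax - Cmin)
        f_l = (1.0 / load - lmin) / (lmax - lmin)    # l_r is the inverse of L_s
        return w[0] * f_t + w[1] * f_c + w[2] * f_l  # Eq. (7)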

3 Proposed Algorithm

The SSA has been used in many scenarios, such as [10–12]. For cloud computing task scheduling, the SSA is improved as follows.

3.1 Strategy 1: PWLCM Mapping

The PWLCM mapping is used in the search algorithm to initialize the population. It has a simple mathematical form, with the characteristics of randomness, convenience, overall stability and local instability, and the generated sequence has good statistical properties [13], which satisfy the initial assignment of cloud computing tasks. The PWLCM is defined as follows:

    x(t+1) = { m · x(t) / p,                    0 ≤ x(t) < p,
             { m · (x(t) − p) / (m/2 − p),      p ≤ x(t) < m/2,          (8)
             { m · (m − p − x(t)) / (m/2 − p),  m/2 ≤ x(t) < m − p,
             { m · (m − x(t)) / p,              m − p ≤ x(t) < m,

where x(t) denotes the t-th dimension of the sparrow's position, m is the number of virtual nodes, and p is the control parameter that determines the non-overlapping parts of the distribution; p is taken as 0.4m in this experiment.
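A minimal Python sketch of this initialization is given below. The starting value x0 and the loop structure are illustrative choices, not taken from the paper.

    import numpy as np

    def pwlcm_init(pop_size, dim, m, p=None, x0=0.37):
        """Initialise a pop_size x dim population with the PWLCM of Eq. (8).
        m is the number of virtual nodes; p defaults to 0.4*m as in the text."""
        p = 0.4 * m if p is None else p
        x = x0 * m                        # starting point in [0, m); illustrative
        pop = np.empty((pop_size, dim))
        for i in range(pop_size):
            for t in range(dim):
                if x < p:
                    x = m * x / p
                elif x < m / 2:
                    x = m * (x - p) / (m / 2 - p)
                elif x < m - p:
                    x = m * (m - p - x) / (m / 2 - p)
                else:
                    x = m * (m - x) / p
                pop[i, t] = x
        return pop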

Fig. 1. Initialization population

Fig. 2. Frequency of distribution

Figure 1 shows the population distribution after chaotic mapping with PWLCM, and Fig. 2 shows the frequencies of the distribution, where m is taken as 100. It can be seen that the population distribution is random, the distributions do not overlap, and the frequencies of the distributions are similar in each interval, proving that the initial population distribution is uniform.

3.2 Strategy 2: Discoverer Strategy

The search formula of the first (global search) stage of the MPA [14] is introduced to improve the position update strategy of the discoverer in the SSA, as shown in Eq. (9):

    X_{i,j}(t+1) = { X_{i,j}(t) + P · R · (X_best(t) − β · X_{i,j}(t)),  R_2 < ST,          (9)
                   { X_{i,j}(t) + Q · L,                                 R_2 ≥ ST,

where X_{i,j}(t) denotes the j-th dimension of the position of sparrow i at iteration t, P is a constant equal to 0.5, R is a vector of uniformly distributed random numbers between 0 and 1 with dimension d, X_best(t) is the position of the best individual in the population, β is a random number obeying a normal distribution, Q is a random number obeying a normal distribution, L is an all-ones 1 × d matrix, ST is a constant in [0.5, 1] indicating the safety value, and R_2 is a random number in [0, 1] indicating the warning value. Adding the random number and the position of the best individual removes the original formula's tendency to over-converge toward zero, gives the discoverer more search space during the iterations, and makes full use of the position of the optimal solution through this communication.
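A minimal sketch of this update in Python is shown below; the function interface and the way the random quantities are drawn per individual are illustrative assumptions.

    import numpy as np

    def discoverer_update(X, best, ST=0.7, P=0.5):
        """MPA-style discoverer update of Eq. (9) (illustrative sketch).
        X is the (n_discoverers x d) block of discoverer positions and
        `best` the current best individual."""
        n, d = X.shape
        out = X.copy()
        R2 = np.random.rand()                 # warning value in [0, 1]
        for i in range(n):
            if R2 < ST:
                R = np.random.rand(d)         # uniform vector in [0, 1]^d
                beta = np.random.randn()      # normally distributed scalar
                out[i] = X[i] + P * R * (best - beta * X[i])
            else:
                Q = np.random.randn()
                out[i] = X[i] + Q * np.ones(d)   # L is the all-ones vector
        return out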

3.3 Strategy 3: Joiner Strategy

The joiner's strategy tends to converge toward the optimal solution, so it is updated using the following two methods.

(1) Dynamic Adjustment Factor. As can be seen from the joiner's position update, when i > n/2 and the number of individuals in the population is too large, i becomes so large that the joiner's position converges to a minimal value; changing i² to i expands the search range of the population. When i ≤ n/2, the sparrow position is randomly updated into the neighborhood of the best individual; when the number of tasks to be scheduled at one time is too large, A⁺ becomes too small and the deviation from the best individual keeps decreasing after each dimensional update, so that the population falls into a local optimum. The update formula for the joiners is therefore adjusted as in Eq. (10):

    X_{i,j}(t+1) = { Q · exp((X_worst(t) − X_{i,j}(t)) / i),                 if i > n/2,          (10)
                   { X_p(t+1) + |X_{i,j}(t) − X_p(t+1)| · A⁺ · L · ω,        otherwise,

where X_worst is the position of the worst individual, A⁺ = Aᵀ(AAᵀ)⁻¹, A is a 1 × d matrix with elements randomly assigned to 1 or −1, and X_p is the position of the current best discoverer. An adjustment factor ω is introduced, defined as

    ω = (T − t) · n / (4T),          (11)

where t and T are the current and total numbers of iterations. The adjustment factor is large at the beginning of the iteration to facilitate global search; as the number of iterations increases, the local search capability of the algorithm is enhanced to improve convergence accuracy.

(2) Greedy Strategy. Based on [15,16], this paper adopts a greedy strategy, whose idea is to retain the better solution at every step. When i > n/2, the sparrows in the population that have found food and the hungriest sparrows direct the movements of the current generation of sparrows. The position of this class of sparrows is updated as shown in Eq. (12):

    X_{i,j}(t+1) = X_{i,j}(t) + a · |X_best(t) − X_worst(t)|.          (12)

The greedy strategy is then given by Eq. (13):

    X_{i,j}(t+1) = { Q · exp((X_worst(t) − X_{i,j}(t)) / i),        if f_e ≤ f_s,          (13)
                   { X_{i,j}(t) + a · |X_best(t) − X_worst(t)|,     if f_e > f_s,

where a is a random number in [0, 1], f_e is the fitness obtained by the original joiner position update for i > n/2, and f_s is the fitness of the individual after the learning strategy.
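The following sketch combines the adjusted joiner update, the adjustment factor and the greedy selection. The indexing convention, the handling of i in the denominator, and the function interface are illustrative assumptions rather than the paper's exact implementation.

    import numpy as np

    def joiner_update(X, fitness_fn, X_best, X_worst, X_p, t, T):
        """Joiner update of Eqs. (10)-(13) (illustrative sketch).
        X is the (n x d) block of joiner positions, X_p the best discoverer,
        and fitness_fn evaluates a position (lower is better)."""
        n, d = X.shape
        omega = (T - t) * n / (4 * T)                          # Eq. (11)
        out = X.copy()
        for i in range(n):
            rank = i + 1                                       # 1-based index in Eqs. (10)/(13)
            Q = np.random.randn()
            if rank > n / 2:
                cand = Q * np.exp((X_worst - X[i]) / rank)     # first case of Eq. (10)
                a = np.random.rand()
                learned = X[i] + a * np.abs(X_best - X_worst)  # Eq. (12)
                # Eq. (13): keep whichever position has the better (lower) fitness
                out[i] = cand if fitness_fn(cand) <= fitness_fn(learned) else learned
            else:
                A = np.random.choice([-1.0, 1.0], size=(1, d))
                A_plus = A.T @ np.linalg.inv(A @ A.T)          # A+ = A^T (A A^T)^-1
                scale = float(np.abs(X[i] - X_p) @ A_plus)
                out[i] = X_p + scale * np.ones(d) * omega      # second case of Eq. (10)
        return out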

4 Simulation Results

In this paper, we use the CloudSim [17] platform to simulate the cloud environment. The performance of MISSA is verified by comparing ACO, MPA, SSA, and MISSA. We set up 100 tasks and 1000 tasks to simulate both the small and the large task-size cases.

4.1 Parameter Setting

In order to combine task completion time, cost and average system load factor, we set ω_1 = ω_2 = ω_3 = 1/3 in Eq. (7). Table 1 shows the experimental parameters of the cloud environment, Table 2 shows the computing cost of the VMs, and Table 3 shows the parameters of the algorithms.

4.2 Numerical Analysis

In the simulation experiments, the task sizes and the performance of the VMs are generated randomly, and each algorithm is run multiple times with its average value taken for analysis. (1) The experimental results for 100 tasks are shown in Fig. 3 and Fig. 4. From the local view of Fig. 3, the starting point of the MISSA iteration is lower than that of the other algorithms, which is the result of using chaotic-map initialization to enhance the dispersion of individuals.


Table 1. Experimental parameter settings

    Parameter                     Values
    Task size/kb                  500–5000
    Number of tasks               100, 1000
    Number of VMs                 20
    VM Processing Speed/(MB/s)    500–2000
    VM Memory (RAM)/MB            512
    VM Bandwidth/(MB/s)           1000

Table 2. VM computing cost settings

    Performance/1000 MIPS         Price
    [500, 1000)                   0.03
    [1000, 1500]                  0.027
    (1500, 2000]                  0.025

Table 3. Algorithm parameter settings

    Algorithm    Parameters
    ACO          α = 2.5, β = 3.2, ρ = 0.5, γ = 1.33, r = 0.2
    MPA          FADS = 0.2, P = 0.5
    SSA          ST = 0.7, PD = 0.3*pop, SD = 0.2*pop
    MISSA        ST = 0.7, PD = 0.3*pop, SD = 0.2*pop

The iteration curves of ACO and MPA tend to flatten early in the iteration, while MISSA is more capable of jumping out of local optima. In the middle of the iteration, MISSA finds better scheduling results than the other algorithms within a relatively small number of iterations. In the late iteration, compared with ACO, MPA, and SSA, MISSA converges faster and finds the global optimum more easily. Overall, MISSA converges slightly faster than the other algorithms for small-scale tasks, and its iterative results are better. Figure 4 shows the optimization of each algorithm for total task completion cost, task completion time, and average system load factor under small-scale tasks. From the figure, we can see that ACO is better than SSA in terms of time, but its task cost is slightly higher, its average system load ratio is lower, and resources are not fully utilized. Compared with ACO, MPA, and SSA, MISSA achieves better results, which proves that the MISSA algorithm has significant advantages for small-scale tasks.


Fig. 3. Convergence curve of the algorithm at the scale of 100 tasks

Fig. 4. Optimization of each objective for a task size of 100

(2) The experimental results for 1000 tasks are shown in Fig. 5 and Fig. 6. From Fig. 5, it can be seen that the convergence of MISSA improves further as the number of tasks increases, and MISSA also outperforms the other algorithms in convergence accuracy.

Fig. 5. Convergence curve of the algorithm at the scale of 1000 tasks

Figure 6 shows the optimization of each algorithm under the large-scale task, and MISSA outperforms the ACO, MPA, and SSA algorithms in all three metrics.


Fig. 6. Optimization of each objective for a task size of 1000

Combining the convergence curves of the fitness functions for the two task scales, MISSA improves by 20%, 22%, and 17% compared to ACO, MPA, and SSA, respectively. This shows that MISSA outperforms the other algorithms at both task scales. (3) Comparison of the algorithms for different virtual machine counts. To compare the effect of the number of VMs on the stability of ACO, MPA, SSA and MISSA, the fitness function values of these algorithms are compared for 10 and 50 virtual machines and for different numbers of tasks. All parameters are the same as in Table 1 except for the number of VMs, yielding the experimental results shown in Fig. 7.

Fig. 7. Value of fitness function for different task sets

From Fig. 7, it can be seen that the fitness value of each algorithm decreases with the number of tasks and increases with the number of VMs. As the number of tasks and the number of VMs rise, the capability of the MISSA algorithm does not diminish, and the fitness function values of MISSA decrease by 18% to 20%, 19% to 22%, and 15% to 17% compared to the ACO, MPA, and SSA algorithms, respectively. This proves that the algorithm remains stable in VM operating environments of different scales.

5 Conclusion

In this paper, we improved the SSA with multiple strategies. Initializing the population with the PWLCM chaotic map enhances the dispersion of individuals. The global search phase of the MPA is introduced to enhance the search capability of the discoverer. The dynamic adjustment factor introduced into the joiner enhances the global search ability in the early stage and the convergence ability in the late stage of the algorithm, and the greedy strategy preserves the better solutions. The simulation results show that the MISSA proposed in this paper converges faster and with higher accuracy than the standard SSA, has a better ability to jump out of local optima, and produces better scheduling strategies than ACO, MPA, and SSA. Therefore, the proposed MISSA can better solve the task scheduling problem in cloud computing. In this paper, we consider the task completion time, the cost and the average system load. In future studies, the energy consumption of the physical machines as well as the memory of the VMs will be considered in the task scheduling process.

References

1. Keshanchi, B., Souri, A., Navimipour, N.J.: An improved genetic algorithm for task scheduling in the cloud environments using the priority queues: formal verification, simulation, and statistical testing. J. Syst. Softw. 124, 1–21 (2017)
2. Gupta, A., Garg, R.: Load balancing based task scheduling with ACO in cloud computing. In: 2017 International Conference on Computer and Applications (ICCA), pp. 174–179. IEEE (2017)
3. Prem Jacob, T., Pradeep, K.: A multi-objective optimal task scheduling in cloud environment using cuckoo particle swarm optimization. Wirel. Pers. Commun. 109, 315–331 (2019)
4. Xue, J., Shen, B.: A novel swarm intelligence optimization approach: sparrow search algorithm. Syst. Sci. Control Eng. 8(1), 22–34 (2020)
5. Abdulhammed, O.Y.: Load balancing of IoT tasks in the cloud computing by using sparrow search algorithm. J. Supercomput. 78(3), 3266–3287 (2022)
6. Qiu, S., Li, A.: Application of chaos mutation adaptive sparrow search algorithm in edge data compression. Sensors 22(14), 5425 (2022)
7. Arunarani, A.R., Manjula, D., Sugumaran, V.: Task scheduling techniques in cloud computing: a literature survey. Futur. Gener. Comput. Syst. 91, 407–415 (2019)
8. Alguliyev, R.M., Imamverdiyev, Y.N., Abdullayeva, F.J.: PSO-based load balancing method in cloud computing. Autom. Control. Comput. Sci. 53, 45–55 (2019)
9. Woldesenbet, Y.G., Yen, G.G., Tessema, B.G.: Constraint handling in multiobjective evolutionary optimization. IEEE Trans. Evol. Comput. 13(3), 514–525 (2009)
10. Zhang, Z., He, R., Yang, K.: A bioinspired path planning approach for mobile robots based on improved sparrow search algorithm. Adv. Manuf. 10(1), 114–130 (2022)
11. Tuerxun, W., Chang, X., Hongyu, G., Zhijie, J., Huajian, Z.: Fault diagnosis of wind turbines based on a support vector machine optimized by the sparrow search algorithm. IEEE Access 9, 69307–69315 (2021)
12. Liu, T., Yuan, Z., Wu, L., Badami, B.: Optimal brain tumor diagnosis based on deep learning and balanced sparrow search algorithm. Int. J. Imaging Syst. Technol. 31(4), 1921–1935 (2021)
13. Luo, Y., Zhou, R., Liu, J., Cao, Y., Ding, X.: A parallel image encryption algorithm based on the piecewise linear chaotic map and hyper-chaotic map. Nonlinear Dyn. 93, 1165–1181 (2018)
14. Faramarzi, A., Heidarinejad, M., Mirjalili, S., Gandomi, A.H.: Marine predators algorithm: a nature-inspired metaheuristic. Expert Syst. Appl. 152, 113377 (2020)
15. Yan, S., Yang, P., Zhu, D., Zheng, W., Wu, F.: Improved sparrow search algorithm based on iterative local search. Comput. Intell. Neurosci. 2021 (2021)
16. Wang, Z., Huang, X., Zhu, D.: A multistrategy-integrated learning sparrow search algorithm and optimization of engineering problems. Comput. Intell. Neurosci. 2022 (2022)
17. Calheiros, R.N., Ranjan, R., Beloglazov, A., De Rose, C.A., Buyya, R.: CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw. Pract. Exp. 41(1), 23–50 (2011)

Advanced State-Aware Traffic Light Optimization Control with Deep Q-Network

Wenlong Ni1,2(B), Zehong Li2, Peng Wang2, and Chuanzhaung Li2

1 School of Computer Information Engineering, JiangXi Normal University, NanChang, China
[email protected]
2 School of Digital Industry, JiangXi Normal University, ShangRao, China
{Zehong.Li,peng.wang,lichuanzhuang}@jxnu.edu.cn

Abstract. Conventional traffic light control (TLC) systems cannot effectively regulate traffic conditions dynamically and in real time as cities grow. This study proposes the Dueling Double Deep Recurrent Q-Network with Attention Mechanism (3DRQN-AM) method for TLC. The proposed method is based on the Deep Q-Network and employs a target network, double learning and a dueling network to boost learning efficiency. A Long Short-Term Memory (LSTM) network is introduced to integrate the past states along the vehicles' motion trajectories with the current state for better decision-making. Meanwhile, an attention mechanism is introduced to help the neural network automatically focus on crucial state components and improve its capacity to represent the state. In the experiments, the proposed method is compared with the Dueling Double Deep Q-Network with Attention Mechanism (3DQN-AM), Dueling Double Deep Recurrent Q-Network (3DRQN), Dueling Double Deep Q-Network (3DQN), and Fixed-Time 3DRQN-AM (FT-3DRQN-AM) signal control methods. Under typical traffic flow, the technique presented in this work lowers the average waiting time by about 46.2%, 53.3%, 85.1%, and 30.0%, respectively, and the average queue length by about 41.9%, 44.6%, 76.0%, and 21.7%, respectively. Under peak traffic conditions, the average waiting time is decreased by around 20.8%, 32.1%, 36.7%, and 38.7%, respectively, while the average queue is decreased by roughly 2.8%, 2.8%, 21.3%, and 44.9%.

Keywords: Traffic Signal Control · Attention Mechanism · Deep Reinforcement Learning · Variable Time Interval · Multi-indicator Optimization

1 Introduction

The construction of current roads in cities cannot meet the travel needs of modern society, resulting in increased travel time, unnecessary fuel consumption, and environmental pollution [1]. There is an urgent need to find a way to alleviate the current traffic congestion. Literature [2,3] proposes a predefined fixed-duration scheme, which does not efficiently handle dynamic traffic flows in real time, leading to inefficient traffic signal control and, in some cases, greater traffic congestion. With the wave of artificial intelligence, Deep Learning (DL) and Reinforcement Learning (RL) are being applied to solve various engineering problems, and traffic signal optimization is one of the hotspots of current research. Deep RL can imitate a traffic police officer directing traffic: advanced sensors and internet technology are used to obtain real-time data such as vehicle speed and position, and the RL agent observes the current traffic situation and learns how to adjust the traffic light control strategy according to the rewards given by the environment, as shown in Fig. 1. The choice of action directly determines whether traffic becomes congested, yet most studies set the phase duration to a fixed value, which may cause unnecessary delays to vehicles [4,5,9,12]. In this paper, we propose a variable phase duration: each phase corresponds to multiple candidate durations, and the agent can choose the most appropriate phase action and phase duration according to the traffic state. Literature [6,7] assumes that traffic states are fully observable and uses a Convolutional Neural Network (CNN) to obtain the main features of the traffic state. However, this assumption is not applicable in practice, so the case where sensor errors lead to incompletely observed states must be considered. In this paper, we exploit the fact that vehicle trajectories are continuous and persistent and process the traffic state with a recurrent network, enabling the LSTM to effectively capture the long-term dependencies in the sequence of vehicle states and compensate for missing states caused by the sensor noise and errors that are inevitable in real environments [8]. In addition, an attention mechanism is introduced to increase the focus on key state components in the sequence and suppress unimportant features [11]. In summary, the main contributions of this paper are as follows:

– A deep reinforcement learning signal control algorithm based on an attention mechanism (3DRQN-AM) is proposed. The neural network pays additional attention to the sequential changes of vehicle trajectories and compensates for incomplete state acquisition due to sensor faults.
– Variable time intervals are used to define the phase duration, so that the agent can choose the most suitable action and phase duration according to the traffic status.
– Normal and peak traffic flows are simulated on a synthetic dataset to compare various benchmark algorithms and verify the effectiveness and rationality of the proposed algorithm.

2 Related Works

In early research, dynamic control of adaptive traffic lights mainly used fuzzy logic and linear programming [8] because of limitations in computational power and in data collection and transmission. These methods generally use limited information when modeling road traffic and therefore have limitations in large-scale applications. To solve this problem, real-time traffic signal control has become an important topic in this field. Long et al. [9] proposed a self-organizing control model with road capacity as the constraint and minimum expected traffic flow delay as the goal to achieve real-time adaptive control of traffic signal phase timing; experiments showed that its control effect is better than the Sydney Coordinated Adaptive Traffic Control System in the same situation. With the application of DL and RL to traffic signals, many traffic signal optimization methods based on deep learning have emerged. Although such designs take the waste of phase duration into account, choosing a fixed time interval for the same traffic light phase offers no real flexibility and tends to slow the convergence of the algorithm. Yu et al. [11] proposed the minimum pressure difference principle, which treats the traffic flow at an intersection as a fluid and traffic congestion as the resistance of that fluid; by adjusting the signal control time, the drag force on the traffic fluid is reduced. Klein et al. [12] used discrete traffic state encoding (DTSE) to transform the specific vehicle positions and speeds in an intersection into a two-dimensional position-velocity matrix, which effectively reduces the amount of input information and thus the computational complexity of the algorithm. Huang et al. [14] proposed a model based on a 3DQN, which combines a dueling network, Double Q-learning and a target network with the DQN to mitigate overestimation and improve data usage efficiency, and verified that the 3DQN model outperforms the DQN model. Kodama et al. [15] proposed a traffic signal control system based on deep reinforcement learning that emphasizes reinforcing successful experiences: it combines successful experiences in a multi-agent environment without information transfer between intersections and uses a dual-objective algorithm to reduce the impact of the simultaneous learning problem; the results show that, compared to a conventional traffic light control system using deep RL, the traffic light waiting time is significantly reduced. Most of the above studies do not take the instability of real traffic conditions (e.g., sensor failures) into account. In this paper, we propose a new technical direction, replacing the CNN with LSTM-Attention, so that the agent can also select actions based on historical experience. In addition, the attention mechanism helps the system focus on key traffic flow information such as the vehicle density near the intersection and vehicle trajectories, thereby providing a more accurate understanding of the current traffic situation.

3 Algorithm Model

In this section, we define the state, action, and reward used in the 3DRQN-AM algorithm and provide a detailed explanation of these three crucial factors in reinforcement learning.

3.1 State

In this paper, the traffic state s is represented by the vehicle positions, speeds, and the current traffic light phase and its duration. To represent the intersection state more accurately, this paper adopts the DTSE method to grid the inbound road in each direction, as shown in Fig. 2. Specifically, each inbound lane is divided into a number of grid cells whose width equals the lane width and whose length equals the sum of the vehicle length and the minimum clearance. If the center of a vehicle lies in a cell, that cell is assigned 1; otherwise it is assigned 0. All cells thus form a vehicle position matrix whose number of rows equals the number of lanes and whose number of columns equals the number of cells per lane. Similarly, a speed matrix is obtained by assigning each vehicle's normalized speed to the corresponding cell. In addition, the current signal state is also an essential factor affecting intersection traffic efficiency [16]. The traffic light phase is encoded as a vector of 1 (green) and 0 (red); for example, a signal state of [1, 0, 0, 0, 15] means that the north-south direction is green and the current phase has lasted 15 s.
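The sketch below builds these state matrices from raw vehicle observations. The observation format, parameter names and the way cells are indexed are illustrative assumptions rather than the paper's exact implementation.

    import numpy as np

    def dtse_state(vehicles, num_lanes, cells_per_lane, cell_len, v_max):
        """Build the DTSE position/speed matrices described above (sketch).
        `vehicles` is an iterable of (lane_index, distance_to_stop_line, speed)."""
        pos = np.zeros((num_lanes, cells_per_lane))
        vel = np.zeros((num_lanes, cells_per_lane))
        for lane, dist, speed in vehicles:
            cell = int(dist // cell_len)
            if 0 <= cell < cells_per_lane:
                pos[lane, cell] = 1.0               # vehicle centre falls in this cell
                vel[lane, cell] = speed / v_max     # normalised speed
        return pos, vel

    # current signal: phase one-hot (4 phases) plus elapsed phase duration in seconds
    phase_state = np.array([1, 0, 0, 0, 15], dtype=float)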

Fig. 1. Intersection signal control model.

Fig. 2. Traffic status information setting.

3.2 Action

Inspired by [10], this paper considers traffic light phases with variable time intervals. Four signal phases are set up: the first phase is north-south straight, right and left turns; the second phase is north-south left turn; the third phase is east-west straight, right and left turns; and the fourth phase is east-west left turn. Three optional durations, 10, 15, and 20 s, are set for each phase, forming the action space A = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]. To prevent overly long waiting times in the other directions, the same action cannot be selected more than three consecutive times, and a 3-s yellow light is set between red and green. To verify that the variable duration used in this paper's algorithm works better than a predefined fixed duration, the algorithm of this paper is compared with the FT-3DRQN-AM algorithm, in which there are only 4 phases, each with a fixed duration of 15 s; otherwise the setup is the same as for the 3DRQN-AM algorithm.
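As a small illustration of the 4 × 3 action space, the snippet below maps an action index to a (phase, duration) pair; the phase labels and the index ordering are illustrative assumptions.

    # 4 phases x 3 candidate durations -> 12 discrete actions (sketch)
    PHASES = ["NS-straight", "NS-left", "EW-straight", "EW-left"]
    DURATIONS = [10, 15, 20]  # seconds

    def decode_action(a):
        """Map an action index in A = [0, ..., 11] to (phase, green duration)."""
        phase, dur = divmod(a, len(DURATIONS))
        return PHASES[phase], DURATIONS[dur]

    # e.g. action 7 -> ("EW-straight", 15); a 3-s yellow is inserted between phases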

3.3 Reward

In this paper, the number of vehicle stops and the average speed are used to jointly define the reward function, as shown in Eq. (1):

    r_t = k_1 (h_{t−1} − h_t) + k_2 s_t,          (1)

where k_1 and k_2 are weight coefficients; after several experiments, k_1 is taken as −1 and k_2 as −0.25. h_t is the number of vehicles stopped at the intersection at time t, h_{t−1} is the number of vehicles stopped at time t−1, and s_t is the average speed of all vehicles at time t. If h_{t−1} − h_t > 0, the traffic has become smoother after executing the action at time t; otherwise it has become more congested. Using multiple indicators to design the reward function helps the agent make better decisions and improves the convergence speed of the algorithm.
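For reference, Eq. (1) translates directly into a one-line function; the default coefficients below are the values reported in the text, and the argument names are ours.

    def reward(h_prev, h_now, mean_speed, k1=-1.0, k2=-0.25):
        """Reward of Eq. (1): r_t = k1 * (h_{t-1} - h_t) + k2 * s_t.
        h_prev, h_now count the stopped vehicles at t-1 and t, and
        mean_speed is the average speed of all vehicles at time t."""
        return k1 * (h_prev - h_now) + k2 * mean_speed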

4 Traffic Signal Control Based on 3DRQN-AM

Because of sensor transmission errors in real traffic environments, it is difficult for the agent to obtain complete traffic information. In this paper, a novel algorithm named 3DRQN-AM is designed; its network structure is shown in Fig. 5. We compensate for state errors by exploiting the fact that vehicle motion along a traffic lane is continuous and persistent: as shown in Fig. 3, the lane in which a vehicle travels and features such as its position and speed at times t−2 and t−1 are consistent with those at time t, i.e., the vehicle state has a certain temporal order. We therefore use a recurrent network to process these temporally related states. An attention mechanism is also introduced, in which changes of the vehicle movement direction close to the intersection are given greater weight, so that the agent automatically attends to the vital state components and suppresses unimportant features, thus improving the expressive ability of the network.

4.1 Network Model and Algorithm for 3DRQN-AM

In this paper, the deep neural network model of the algorithm and its associated parameters are shown in Fig. 5. The model mainly includes an SENet module, an LSTM module, and a dueling neural network. Among them, the LSTM [17] is able to capture the vehicle history state information, and its computation is as follows: let x_t, h_t and C_t be the input, the hidden (control) state and the cell state at time step t.

Advanced State-Aware Traffic Light Optimization Control

Fig. 3. Presence values of vehicle trajectories at intersections.

183

Fig. 4. Algorithm flow of 3DRQN-AM.

Given an input sequence (x1 , x2 ,... xn ), LSTM computes h-sequence (h1 , h2 ,... ht ) and c-sequence (C1 , C2 ,... Cn ) as follows: ft = σ(Wf · [ht−1 , xt ] + bf )

(2)

it = σ(Wt · [ht−1 , x] + bt )

(3)

ct = tanh(WC · [ht−1 , x] + bC )

(4)

Ct = ft ∗ Ct−1 + it ∗ ct

(5)

ot = σ(Wo · [ht−1 , xt ] + bo )

(6)

ht = ot ∗ tanh(Ct )

(7)

Fig. 5. The architecture of 3DRQN-AM.

Where σ is a logistic sigmoid function, * is an element-wise multiplication, and Ct is the state of the cell that needs to be updated. W and b denote the weight matrix and bias, respectively, and (Wf , bf ), (Wt , bt ), (WC , bC ), and (Wo , bo ) correspond to the weight matrix and bias of the forgetting gate, the input gate, the state unit, and the output gate, respectively. ft , it and ot are

184

W. Ni et al.

the activation functions corresponding to the output of the gate. ht denotes the output of the LSTM cell at the current moment in time. SENet [18] is an efficient channel attention network. Introducing SENet on the LSTM layer can focus on the information features that are more significant to the current task among the numerous input information, suppress the attention to other unimportant information, and improve the efficiency and accuracy of task processing. As shown in Fig. 5, the attention layer mainly contains two operations, excitation and scale. In the excitation operation, the two fully-connected layers are first dimensionally reduced and then dimensionally increased to explicitly model the correlation between states. The first fully-connected layer compresses the 128 channels into 32 channels to reduce the computational effort (ReLU activation function), and the second fully-connected layer restores the 128 channels. The second fully-connected layer restores 128 channels (sigmoid activation function) to obtain normalized (0–1) state weights w: vec(w) = σ(F2 × δ(F1 × vec(h)))

(8)

where F1 denotes the fully connected layer parameter for downscaling; F2 denotes the fully connected layer parameter of the ascending dimension; σ is the sigmoid function; δ is the ReLU function; vec is the vectorization operator. The scale operation updates the original state matrix h with the state weight matrix w obtained from excitation to obtain the attention input state sam . sam = w  h

(9)

In the traditional DQN, different state behaviors correspond to different value functions Q(s, a), but in fact, in some states, the magnitude of the value function is independent of the action (such as the absence of a car in the traffic environment) and is mainly related to the state value. In order to better express the value function [14], a competitive network architecture is introduced to decompose the Q-value into state value V (s; θ) and action advantage A(s, a; θ), which is calculated by the formula 10. 1  A(s, a ; θ)) (10) Q(s, a ; θ) = V (s; θ) + (A(s, a; θ) − |A|  a

Traditional DQN use the same network to update the parameters each time, resulting in unstable estimated Q-value. In this paper, two networks with the same structure are established by introducing a target network. The target network is updated by copying the parameters of the reality network to the target network at each step. This ensures that the parameters of the target network are stable over time. If the same network characteristics are used for selecting and evaluating actions for calculating the Q-value, this may lead to an overestimation of the target value [13]. Therefore, the reality network is used to select actions and the target network is used for action evaluation. The formula for this is as follows. Qtarget = r + γQ(s , arg max(s , a; θ); θ )

(11)

Advanced State-Aware Traffic Light Optimization Control

185

In this paper, the algorithm uses two neural networks for updating the parameters of the neural network. As shown in Fig. 4. In this paper, the algorithm, by observing the state s, the after taking action a through the behavioral strategy π =p(a|s), at each time step t with γ as the decay coefficient to obtain the maximum gain r, and use a neural network to approximate the true Q-value, where the reality network is denoted by and the goal network is denoted by, where θi and θ i to denote the network parameters at the ith iteration, and through a loss function that reduce the error between Q reality and Q target to train the network, the error function is: 





2

Li (θi ) = Es,a [(r + Υ Q(si , arg max Q(si , ai ; θi ); θi  )) − Q(si , ai ; θi )] .

(12)

In the above equation, s represents the current state and s represents the next state of the current state. θ denotes the realistic network parameters and θ denotes the target network parameters.

5

Experiment

In this section, we introduce the experimental setup, hyperparameter settings, experimental evaluation, and experimental results. We then compare four benchmark algorithms in simulation experiments that simulate normal traffic flow and peak traffic flow to assess the effectiveness of the algorithms in this paper. 5.1

Simulation Environment and Hyperparameter Setting

The simulation scenario in this paper is a single intersection as shown in Fig. 1. The intersection is connected to four 500-m-long road sections, each with two incoming lanes and two outgoing lanes, respectively. The dataset used in this paper is a synthetic dataset where vehicle generation is simulated according to the Weber distribution. The vehicle length is 5 m, acceleration is 1 m/s2 and deceleration is 4.5 m/s2 . Two iterative training methods are used on the model. The first method converges iterations by using the same traffic file as the algorithm’s training traffic file. The second method creates a distinct traffic file for each round by setting a random number seed and tying it to the number of iterative rounds. The hyperparameters were derived according to references [14–16] and combined with extensive experiments: The learning rate is 0.001, the batch size is 32, the experience pool size is 3200, the discount factor is 0.75, the number of training rounds is 300, and the training iterations per round are 3600. 5.2

Experimental Evaluation and Analysis of Results

In order to verify the effectiveness of the proposed 3DRQN-AM algorithm. The paper is designed to simulate normal traffic flow in a single intersection, with a vehicle arrival rate of 500 veh/h (veh/h denotes the number of vehicles passing

186

W. Ni et al.

per hour) and peak traffic flow, with a vehicle arrival rate of 1500 veh/h for each road. The algorithm is compared with four benchmark algorithms to analyze the simulation results in terms of three metrics: average cumulative reward, average queue length, and average waiting time. 5.2.1

Training Results

Figure 6 and Fig. 7 show the effect of the algorithm in this paper and the shift of loss value. As can be seen from the graphs, in the pre-training phase, the algorithm of this paper uses ε-greedy strategy for random exploration, which results in large fluctuation of reward value. With the gradual increase of the number of events, the ε-greedy strategy for random exploration decreases, the number of choices for the current basic estimated Q-value increases, and the reward value and loss value gradually stabilize.

Fig. 6. Convergence of 3DRQN-AM algorithm.

5.2.2

Fig. 7. Change of loss value for 3DRQN-AM.

Performance Comparison of Traffic Signal Control Strategies

(1) Average Cumulative Rewards: From Fig. 8 and Table 1, in normal traffic flow this paper’s algorithm converges faster and fluctuates smaller compared to 3DQN-AM, 3DRQN, but the reward value does not modify significantly after

Fig. 8. Variation of average cumulative rewards under different traffic flows

Advanced State-Aware Traffic Light Optimization Control

187

final stabilization. Compared with 3DQN, FT-3DRQN-AM, the reward value of this paper’s algorithm is significantly improved. In peak traffic flow this paper’s algorithm has no significant change in convergence compared to 3DQN-AM, 3DRQN, and FT-3DRQN-AM but has significant improvement over 3DQN. (2) Average Vehicle Queue Length: According to Fig. 9a and Table 1, under typical traffic conditions, 3DRQN-AM shortens the average queue by around 41.9%, 44.6%, 76.0% and 21.7% when compared to 3DQN-AM, 3DRQN, 3DQN and FT-3DRQN-AM respectively. The findings for average queue length under peak traffic flow from Fig. 9b and Table 1 demonstrate that 3DRQN-AM lowers around 2.8%, 2.8%, 21.3% and 44.9% compared to 3DQN-AM, 3DRQN, 3DQN and FT-3DRQN-AM respectively.

Fig. 9. Variation of the average queue length under different traffic flows

(3) Average Waiting Time: Fig. 10 illustrates how the addition of an attention mechanism or LSTM based on the 3DQN algorithm might shorten the average waiting period. In comparison to the four benchmark algorithms, the method in this study has the lowest average vehicle waiting time and the smallest variation in both regular and peak traffic flows. The average waiting time of 3DRQNAM is decreased by around 46.2%, 53.3%, 85.1% and 30.0% when compared to 3DQN-AM, DRQN, 3DQN and FT-3DRQN-AM respectively, in the situation of typical traffic flow. As can be seen from Fig. 10a and Table 1. As can be shown from Fig. 10b and Table 1, 3DRQN-AM has a lower average waiting time during peak traffic flow than 3DQN-AM, DRQN, and 3DQN, respectively, by 20.8%, 32.1%, 36.7% and 38.7%.

188

W. Ni et al.

Fig. 10. Variation of the average queue length under different traffic flows Table 1. Average value of each algorithm index under different traffic volumes. Algorithm

Normal traffic flow

Peak traffic flow

rewards

queue length/m

waiting time/s

rewards

queue length/m

waiting time/s

3DRQN-AM

–108

36

7

–104

70

19

3DQN-AM

–112

62

13

–104

72

24

3DRQN

–110

65

15

–119

72

28

3DQN

–158

150

47

–139

89

30

FT-3DRQN-AM

–141

46

10

–106

127

31

6

Summary

(1) We suggest the use of LSTM network to improve the problem of incomplete acquisition of traffic status caused by sensor fault by utilizing the continuous and persistent characteristics of vehicle motion direction in order to address the issue of incomplete or inaccurate acquisition of traffic status by deep reinforcement learning control traffic signals. In addition, the attention mechanism is included to enhance the intelligent body’s automatic attention to significant state components (such as changes in the direction of travel of the vehicle). (2) Comparative experiments were conducted at a single intersection establish with normal and peak traffic flows. The experiments demonstrate that 3DRQN-AM is applicable to different traffic environments and can effectively improve the efficiency of traffic intersections, and verify that the algorithm outperforms 3DQN-AM, 3DRQN, 3DQN and FT-3DRQN-AM. (3) The next research will focus on timing methods for multiple intersection traffic signals, joint timing mechanisms between intersections, and methods for handling complex traffic state information.

Advanced State-Aware Traffic Light Optimization Control

189

References 1. Xu, L., Kun, W., Pengfei, L., Miaoyu, X.: Deep learning short-term traffic flow prediction based on lane changing behavior recognition. In: 2020 2nd International Conference on Information Technology and Computer Application (ITCA), pp. 760–763 (2020). https://doi.org/10.1109/ITCA52113.2020.00163 2. Tran-Van, N.Y., Nguyerr, X.H., Le, K.H.: Towards smart traffic lights based on deep learning and traffic flow information. In: 2022 9th NAFOSTED Conference on Information and Computer Science (NICS), pp. 1–6 (2022). https://doi.org/10. 1109/NICS56915.2022.10013375 3. Maduka, N.C., Ajibade, I.I., Bello, M.: Modelling and optimization of smart traffic light control system. In: 2022 IEEE Nigeria 4th International Conference on Disruptive Technologies for Sustainable Development (NIGERCON), pp. 1–5 (2022). https://doi.org/10.1109/NIGERCON54645.2022.9803073 4. Abhishek, A., Nayak, P., Hegde, K.P., Lakshmi Prasad, A., Nagegowda, K.S.: Smart traffic light controller using deep reinforcement learning. In: 2022 3rd International Conference for Emerging Technology (INCET), pp. 1–5 (2022). https:// doi.org/10.1109/INCET54531.2022.9824501 5. Raeisi, M., Mahboob, A.S.: Intelligent control of urban intersection traffic light based on reinforcement learning algorithm. In: 2021 26th International Computer Conference, Computer Society of Iran (CSICC), pp. 1–5 (2021). https://doi.org/ 10.1109/CSICC52343.2021.9420622 6. Wu, T., Kong, F., Peng, P., Fan, Z.: Road intersection model based reward function design in deep q-learning network for traffic light control. In: 2020 5th International Conference on Robotics and Automation Engineering (ICRAE), pp. 182–186 (2020). https://doi.org/10.1109/ICRAE50850.2020.9310858 7. Yang, D., Seo, S.W.: Traffic light detection using attention-guided continuous conditional random fields. In: 2022 International Conference on Electronics, Information, and Communication (ICEIC), pp. 1–3 (2022). https://doi.org/10.1109/ ICEIC54506.2022.9748468 8. Huang, L., Huang, H., Xiao, J., Lv, X.: Research on adaptive adjustment method of intelligent traffic light based on real-time traffic flow detection. In: 2022 5th International Conference on Artificial Intelligence and Big Data (ICAIBD), pp. 644–647 (2022). https://doi.org/10.1109/ICAIBD55127.2022.9820545 9. Long, G., Wang, A., Jiang, T.: Traffic signal self-organizing control with road capacity constraints. IEEE Trans. Intell. Transp. Syst. 23(10), 18502–18511 (2022). https://doi.org/10.1109/TITS.2022.3152060 10. Zhu, Y., Cai, M., Schwarz, C., Junchao, L., Shaoping, X.: Intelligent traffic light via policy-based deep reinforcement learning. Int. J. Intell. Transport. Syst. Res. 20, 734–744 (2022). https://doi.org/10.1007/s13177-022-00321-5 11. Yu, P., Luo, J.: Minimize pressure difference traffic signal control based on deep reinforcement learning. In: 2022 41st Chinese Control Conference (CCC), pp. 5493– 5498 (2022). https://doi.org/10.23919/CCC55666.2022.9901790 12. Klein, N., Pr¨ unte, J.: A new deep reinforcement learning algorithm for the online stochastic profitable tour problem. In: 2022 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), pp. 0635–0639 (2022). https://doi.org/10.1109/IEEM55944.2022.9989933 13. Gu, J., Fang, Y., Sheng, Z., Wen, P.: Double deep q-network with a dual-agent for traffic signal control. Appl. Sci. 10, 1622 (2020). https://doi.org/10.3390/ app10051622

190

W. Ni et al.

14. Huang, J., et al.: Application of deep reinforcement learning in optimization of traffic signal control. In: 2021 IEEE 23rd International Conference on High Performance Computing & Communications, pp. 1958–1964 (2021). https://doi.org/ 10.1109/HPCC-DSS-SmartCity-DependSys53884.2021.00293 15. Kodama, N., Harada, T., Miyazaki, K.: Traffic signal control system using deep reinforcement learning with emphasis on reinforcing successful experiences. IEEE Access 10, 128943–128950 (2022). https://doi.org/10.1109/ACCESS.2022.3225431 16. Xu, J., Li, L., Zhu, R., Lv, P.: Communication information fusion based multiagent reinforcement learning for adaptive traffic light control. In: 2023 3rd International Conference on Neural Networks, Information and Communication Engineering (NNICE), pp. 488–492 (2023). https://doi.org/10.1109/NNICE58320.2023. 10105755 17. Choe, C.J., Baek, S., Woon, B., Kong, S.H.: Deep q learning with LSTM for traffic light control. In: 2018 24th Asia-Pacific Conference on Communications (APCC), pp. 331–336 (2018). https://doi.org/10.1109/APCC.2018.8633520 18. Zhang, X., Liang, X., Zhiyuli, A., Zhang, S., Xu, R., Wu, B.: At-lstm: an attentionbased lstm model for financial time series prediction. In: IOP Conference Series: Materials Science and Engineering, vol. 569, no. 5, pp. 052037 (2019). https://doi. org/10.1088/1757-899X/569/5/052037

Impulsive Accelerated Reinforcement Learning for H∞ Control Yan Wu, Shixian Luo(B) , and Yan Jiang School of Electrical Engineering, Guangxi University, Nanning 530004, People’s Republic of China shixian [email protected], [email protected]

Abstract. This paper revisits reinforcement learning for H∞ control of affine nonlinear systems with partially unknown dynamics. By incorporating an impulsive momentum-based control into the conventional critic neural network, an impulsive accelerated reinforcement learning algorithm with a restart mechanism is proposed to improve the convergence speed and transient performance compared to traditional gradient descent-based techniques or continuously accelerated gradient methods. Moreover, by utilizing the quasi-periodic Lyapunov function method, sufficient condition for input-to-state stability with respect to approximation errors of the closed-loop system is established. A numerical example with comparisons is provided to illustrate the theoretical results. Keywords: Reinforcement Learning · Impulsive systems · H∞ control

1 Introduction H∞ control has attracted extensive research attention due to its ability to handle external disturbances. It achieves robust stability and performance by optimizing controller design to minimize the sensitivity of the system to uncertainties and disturbances. Many important works about H∞ control have been reported in [1–5]. Remarkably, most existing results on H∞ control require complete system parameter information. Accurate dynamic models of practical systems are difficult to obtain, which is the key challenge to solving model-based control problems. Recently, the socalled model-free reinforcement learning algorithms based on data-assisted feedback control techniques have been proposed and successfully applied to various real-life scenarios, such as autonomous driving [6], cyber-physical systems [7], and industrial robots [8]. However, compared to model-based H∞ control, there have been fewer achievements in model-free H∞ control, perhaps because striking a balance between the performance and robustness of the system in the model-free case is a challenging problem. Some scholars attempted to solve this challenge. For instance, the works This work is supported by the National Natural Science Foundation of China under Grant 62003104, the Guangxi Natural Science Foundation under Grant 2022GXNSFBA035649, the Guangxi Science and Technology Planning Project under Grant AD23026217, and the Guangxi University Natural Science and Technological Innovation Development Multiplication Plan Project under Grant 2023BZRC018. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14448, pp. 191–203, 2024. https://doi.org/10.1007/978-981-99-8082-6_15

192

Y. Wu et al.

[9, 10] proposed model-free algorithms to solve H∞ control problems for a class of discrete-time and continuous-time systems, respectively. In addition, the complexity of H∞ control algorithms usually leads to slow convergence, which is a drawback of reinforcement learning. To overcome the limitation of the convergence, some accelerated reinforcement learning algorithms have been proposed. For instance, Wilson et al. [11] introduced auxiliary variables to accelerate reinforcement learning using a continuoustime system; Pertsch et al. [12] proposed an accelerated reinforcement learning algorithm using an acquired skill prior; the works of [13, 14] proposed a hybrid acceleration algorithm based on hybrid systems framework. It should be mentioned that these accelerated algorithms lack robustness when applied to systems with disturbance. Moreover, to the best of our knowledge, there are currently no accelerated reinforcement learning algorithms for solving H∞ control problems. Motivated by the abovementioned analysis, we revisit the H∞ control problems in the reinforcement learning framework. Following the works [2, 16], we use the critic neural network to approximate the optimal value function, leveraging sufficiently rich recorded data and current output data to find the solution to the Hamilton-JacobiBellman (HJB) equation in real time. The main contributions are as follows: i) An accelerated reinforcement learning algorithm based on the impulsive control framework is proposed for H∞ control of affine nonlinear systems. The accelerated reinforcement learning algorithm introduces an impulsive system restarting the gradient at the impulses to regulate and (or) improve the stability and transient properties of the accelerated time-varying optimal dynamics. In addition, the impulsive system includes several adjustable variables, which makes the closedloop system more flexible and optimizable. ii) Utilizing the quasi-periodic Lyapunov function method, sufficient condition for input-to-state stability with respect to approximation errors of the closed-loop system in the impulsive control framework is established. The remainder of this paper is organized as follows. In Sect. 2, the notations and the dynamic model are introduced. Section 3 introduces the impulsive momentum-based critic dynamics and analyzes its stability. Section 4 presents the simulation results. Finally, the conclusion is drawn in Sect. 5.

2 System Description and Preliminaries Notation. Rn and Rm×n represent the set of real n-vector and m × n matrices, respectively. The nnotation P  0( 0) means that the matrix P is positive (semi) definite. ·p = [ i=1 |xi |p ]1/p , 1 ≤ p < ∞, denotes the p-norm of a vector. Given A ∈ Rn×n , we use λmax (A)(λmin (A)) to represent maximun eigenvalue (minimum eigenvalue). For any M  0, xM defines as xT M x. The gradient of a scalar-valued function V with respect to a vector-valued variable x is defined as a row vector and is denoted by ∇V . A function γ : R≥0 → R≥0 is said to be of class-K∞ , if it is continuous, zero at zero, and strictly increasing. Given a compact set A ⊂ Rn and a vector z ∈ Rn one uses |z|A = mins∈A z − s2 to represent the minimum distance of z to A.

Impulsive Accelerated Reinforcement Learning for H∞ Control

Consider the following affine nonlinear systems  x(t) ˙ = f (x) + g(x)u + h(x)v , z(t) = k(x)

193

(1)

where x ∈ Rn , u ∈ Rmu and v ∈ Rmv represent the system state, control input, and the disturbance, respectively; f : Rn → Rn with f (0) = 0. g(x), h(x), and k(x) are known continuous vector or matrix functions of appropriate dimensions. This paper has two main objectives: 1) use a neural network to find the state feedback control law u∗ such that the system (1) is asymptotically stable, and has L2 -gain less than or equal to γ¯ , that is,  ∞  ∞ 2 2 2 (z(t)2 + u(t)Ru )dt ≤ γ¯ v(t)22 dt, (2) 0 mu

0

where 0 ≺ Ru ∈ R , and γ¯ > 0 is some prescribed level of disturbance attenuation; 2) propose an impulsive momentum-based control strategy to accelerate the convergence speed of critic weights and improve transient performance of the closed-loop system. According to the objective function (2), the value function is given by  ∞ (z(t)22 + u(t)2Ru − γ¯ 2 v(t)22 )dt. V (x0 )  minmax u

v

0

Define the Hamiltonian function as H(x, u, v, ∇V T (x))  z22 + u2Ru − γ¯ 2 v22 + ∇V (x)F (x, u, v),

(3)

where F (x, u, v) = f (x) + g(x)u + h(x)v. Applying the stationarity conditions to the Eq. (3), one has  ∂H(x,u,v,∇V T (x)) = 0 ⇒ u∗ (x) = − 12 Ru−1 g T (x)∇V T (x), ∂u T ∂H(x,u,v,∇V (x)) = 0 ⇒ v ∗ (x) = 12 γ¯ −2 hT (x)∇V T (x). ∂v On the other hand, the V (x) satisfies the HJB equation [18] ∂V (x) = −H(x, u∗ , v ∗ , ∇V T (x)), ∀x ∈ Rn . ∂t Since

∂V (x) ∂t

= 0, one obtains ∇V (x)F (x, u∗ , v ∗ ) + z22 + u2Ru − γ¯ 2 v22 = 0.

(4)

Unfortunately, obtaining an explicit expression for V (x) using Eq. (4) and subsequently solving the optimal control u∗ and the worst disturbance v ∗ is often a challenging problem. The next section will utilize a critic neural network and Eq. (4) to estimate u∗ and v∗ .

194

Y. Wu et al.

3 Critic Dynamics To utilize Eq. (4) and according to the Weierstrass higher order approximation theorem [19], one can use a neural network to approximate V (x) on a compact set D ⊂ Rn , and then obtain ∇V (x).  V (x) = W ∗T φ(x) + (x), x ∈ D (5) ∇V (x) = W ∗T ∇φ(x) + ∇(x) , ¯,φ:D→ where W ∗ ∈ RN is an ideal constant weight vector satisfying W ∗ 2 ≤ W N R is a vector of basis functions, and (x) is an approximation error. The optimal control law and the worst disturbance can be approximated as ⎧ 1 ⎪ ⎨ u∗ (x) = − Ru−1 g T (x)[∇φT (x)W ∗ + ∇T (x)], 2 ⎪ ⎩ v ∗ (x) = 1 γ¯ −2 hT (x)[∇φT (x)W ∗ + ∇T (x)]. x ∈ D 2

(6a) (6b)

Substituting (5) into (3), one gets the approximate HJB equation H(x, u∗ , v ∗ , ∇φT (x)W ∗ ) =W ∗T ∇φ(x)F (x, u∗ , v ∗ ) + z22 + u∗ 2Ru − γ¯ 2 v ∗ 22 =HJB , x ∈ D (7) where HJB  −∇(x)F (x, u∗ , v ∗ ) is the residual error. Due to the ideal weights W ∗ being unknown, one defines a critic neural network ˆ as with estimates W ˆ T φ(x), x ∈ D Vˆ (x) = W

(8)

and the approximate control and disturbance policies become ⎧ 1 ⎪ ˆ, ⎨u ˆ(x) = − Ru−1 g T (x)∇φT (x)W 2 ⎪ ⎩ vˆ(x) = 1 γ¯ −2 hT (x)∇φT (x)W ˆ.x∈D 2

(9a) (9b)

Substituting (8), (9a), and (9b) into (3), one can get the approximate HJB equation ˆ ˆ ) =W ˆ T ∇φ(x)F (x, u H(x, u ˆ, vˆ, ∇φT (x)W ˆ, vˆ) + z22 + ˆ u2Ru − γ¯ 2 ˆ v 22 , x ∈ D

(10)

ˆ . To further We will employ Eq. (10) to the updating dynamics of the critic parameter W analysis, one provides an assumption that is accompanied by a sufficient “richness” in recorded data.

Impulsive Accelerated Reinforcement Learning for H∞ Control

195

Assumption 1. [14]. Let {ϕ(xi , u ˆ(xi ), vˆ(xi ))}ki=1 be a sequence of recorded data, and define ⎧ k ⎪ ¯ i, u ˆ(xi ), vˆ(xi ))ϕ¯T (xi , u ˆ(xi ), vˆ(xi )), ⎨ Λ = i=1 ϕ(x ϕ(x,u,v) ϕ(x, ¯ u, v) = 1+ϕT (x,u,v)ϕ(x,u,v) , ⎪ ⎩ ϕ(x, u, v) = ∇φ(x)F (x, u, v). Then there exists λ > 0 such that Λ  λI. Remark 1. Acquiring exploration signals that meet the requirements of the standard persistence of excitation attributes can be challenging in practical scenarios. However, Assumption 1 is relatively more lenient, which is comparatively easier to satisfy. One may refer [20] for more details. ¯ φ, ¯ HJB , dφ, ¯, and d such that Assumption 2. There exists positive constants g¯, h, ¯ φ(x)2 ≤ φ, ¯ HJB 2 ≤ HJB , g(x)2 ≤ g¯, h(x)2 ≤ h, ∇φ(x)2 ≤ dφ, (x)2 ≤ ¯, ∇(x)2 ≤ d, hold for all x ∈ D. Now, we will define the Hamiltonian estimation error for the current time and for the recorded data at 0 ≤ t1 < · · · < ti < t, respectively. ˆ ˆ (t)) − H(x, u∗ (x), v ∗ (x), ∇V ) e(t) =H(x(t), u ˆ(t), vˆ(t), ∇T φ(x)W ˆ T (t)ϕ(x(t), u ˆ(t), vˆ(t)) + z(t)2 + ˆ u(t)2 − γ¯ 2 ˆ v (t)2 , =W 2

Ru

2

ˆ ˆ (t)) − H(xi , u∗ (xi ), v ∗ (xi ), ∇V ) ˆ(ti ), vˆ(ti ), ∇T φ(xi )W e(ti , t) = H(x(t i ), u ˆ T (t)ϕ(x(ti ), u ˆ(ti ), vˆ(ti )) + z(ti )22 + ˆ u(ti )2R − γ¯ 2 ˆ v (ti )22 , =W u

Next, the total error for Hamiltonian estimates is defined as follows ˆ (t)) = E(W

1 (e(t))2 2 (1 + ϕT (x(t), u ˆ(t), vˆ(t))ϕ(x(t), u ˆ(t), vˆ(t)))2 k

+

(e(ti , t))2 1

, T 2 i=1 (1 + ϕ (xi , u ˆ(ti ), vˆ(ti ))ϕ(xi , u ˆ(ti ), vˆ(ti )))2

(11)

ˆ (t) as follows then, one computes the gradient of (11) with respect to W ˆ (t)) =ϕ(x(t), ˆ (t) ∇E(W ¯ u ˆ(t), vˆ(t))ϕ¯T (x(t), u ˆ(t), vˆ(t))W ϕ(x(t), u ˆ(t), vˆ(t))[z(t)22 + ˆ u(t)2Ru − γ¯ 2 ˆ v (t)22 ] (12) Υ2 k

ϕ(xi , u ˆ(ti ), vˆ(ti ))[z(ti )22 + ˆ u(ti )2Ru − γ¯ 2 ˆ v (ti )22 ] ˆ (t) + + ΛW , 2 Υi i=1 +

196

Y. Wu et al.

where Υ = 1 + ϕT (x, u ˆ, vˆ)ϕ(x, u ˆ, vˆ), Υi = 1 + ϕT (xi , u ˆ(ti ), vˆ(ti ))ϕ(xi , u ˆ(ti ), vˆ(ti )), Λ and ϕ¯ are defined in Assumption 1. 3.1

Critic Dynamic via Impulsive Momentum-Based Control

ˆ to W ∗ , we propose an impulsive To increase the speed of convergence of W momentum-based control strategy inspired by the literature [13–15, 17]. Specifically, we consider the following impulsive dynamics system ⎧ 1 ˆ (t)) ⎪ (ξ − W ⎪ ⎪ ˙ = t−tk +T0 , t ∈ (tk , tk+1 ) ⎨ y(t) ˆ (t)) −α∇E(W (13)

 ˆ (t− ) ⎪ W ⎪ + k ⎪ ) = , y(t , k ∈ N ⎩ k 0 ˆ (t− ) W k ˆ T , ξ T ]T , where y = [W ˆ T , ξ T ]T , ξ is auxiliary variables; with initial condition y0 = [W 0 0 α > 0 is the learning rate; tk = kT , k ∈ N0 , are known as impulsive instants; T > 0 is the impulsive period; T0 > 0 is a tunable parameter; y(t+ k ) = lims→0+ y(tk + s) − ˆ at and y(tk ) = lims→0+ y(tk − s). It is clear that the auxiliary variable ξ is reset to W ˆ impulsive instants, while W is unaffected. The following theorem provides the sufficient condition for stability and convergence of impulsive system (13). Theorem 1. Given a critic neural network with N -dimensional basis functions φ(x) and a compact set D ⊂ Rn . Suppose Assumptions 1 and 2 are satisfied. If use the controller u ˆ (defined in (9a)) with disturbance vˆ (defined in (9b)) on the plant (1), and the following conditions hold T >

1 1 , and T + T0 < 2αλ αϕ¯T (x, u ˆ, vˆ)ϕ(x, ¯ u ˆ, vˆ)

(14)

then there exist positive constants ρ1 , μ1 , and class-K∞ functions γ1i , i ∈ 1, 2 such that ˆ (t) − W ∗ 2 ≤ρ1 e−μ1 t |y0 |A + γ11 (HJB ) + γ12 (d), t ≥ 0. W

(15)

ˆ T , ξ T ]T ∈ R2N | W ˆ = W ∗, ξ = W ˆ }. where A := {[W Proof. According to (7), we have ˆ + z22 + u∗ 2R − γ¯ 2 v ∗ 22 = HJB . ϕT (x, u∗ (x), v ∗ (x))W u

(16)

Impulsive Accelerated Reinforcement Learning for H∞ Control

197

Then, substituting (16) into (12), the equation (12) can be rewritten as ˆ (t)) =Θ(W ˆ − W ∗) + ∇E(W

k

ˆi , vˆi )HJB (xi ) ϕ(x, u ˆ, vˆ)HJB (x) ϕ(xi , u + Υ2 Υi2 i=1

v − v ∗ ))]T W ∗ ϕ(x, u ˆ, vˆ)[∇φ(x)(g(x)(ˆ u − u∗ ) + h(x)(ˆ 2 Υ ϕ(x, u ˆ, vˆ)[ˆ u2Ru − u∗ 2Ru − γ¯ 2 ˆ v 22 + γ¯ 2 v ∗ 22 ] + Υ2 k 2

ϕ(xi , u ˆi , vˆi )[ˆ ui Ru − u∗i 2Ru − γ¯ 2 ˆ vi 22 + γ¯ 2 vi∗ 22 ] + Υi2 i=1 +

+

k

ϕ(xi , u ˆi , vˆi )[∇φ(xi )(g(xi )(ˆ ui − u∗ ) + h(xi )(ˆ vi − v ∗ ))]T W ∗

Υi2

i=1

i

i

ˆ − W ∗ ) + μ (x, u =Θ(W ˆ, vˆ) + μi (xi , u ˆi , vˆi ) + ζ i (x, u ˆ, vˆ),

(17)

u,ˆ v) where Θ = ϕ(x, ¯ u ˆ, vˆ)ϕ¯T (x, u ˆ, vˆ) + Λ, μ (x, u ˆ, vˆ) = ϕ(x,ˆ [W ∗T ∇φ(x)(g(x)(ˆ u− Υ2 v − v ∗ )) + ˆ u2Ru − u∗ 2Ru − γ¯ 2 ˆ v 22 + γ¯ 2 v ∗ 22 ], μi (xi , u ˆi , vˆi ) = u∗ ) + h(x)(ˆ k ϕ(xi ,ˆ ui ,ˆ vi ) [W ∗T ∇φ(xi )(g(xi )(ˆ ui −u∗i )+h(xi )(ˆ vi −vi∗ ))+ˆ ui 2Ru −u∗i 2Ru − i=1 { Υi2 k HJB (xi ) vi 22 + γ¯ 2 vi∗ 22 ]}, ζ i (x, u ˆ, vˆ) = ϕ(x,ˆu,ˆvΥ)2HJB (x) + i=1 ϕ(xi ,ˆui ,ˆvΥi ) . γ¯ 2 ˆ 2 i

According to Assumption 2, combining  ϕ(x,u,v) 2 ≤ δ, (6a) and (6b), we obtain Υ2 μ (x, u ˆ, vˆ) =

ϕ(x, u ˆ, vˆ) [(ˆ u − u∗ )T Ru (ˆ u − u∗ ) − (ˆ v − v ∗ )T γ¯ 2 I(ˆ v − v∗ ) Υ2 − ∇(x)g(x)(ˆ u − u∗ ) − ∇(x)h(x)(ˆ v − v ∗ )]

ˆ, vˆ)2 ≤δ(λmax (Ru )ˆ u − u∗ 22 + γ¯ 2 ˆ v − v ∗ 22 ⇒ μ (x, u ¯ v − v ∗ 2 ). + d¯ g ˆ u − u∗ 2 + dhˆ

(18)

Similarly, we derive ⎧ k

⎪ ⎪ ⎪ μ (x , u ˆ , v ˆ ) ≤ δ {(λmax (Ru )ˆ ui − u∗i 22 ⎪ ⎨ i i i i 2 i=1

⎪ ¯ vi − v ∗ 2 )}, ⎪ +¯ γ 2 ˆ vi − vi∗ 22 + d¯ g ˆ ui − u∗i 2 + dhˆ ⎪ i ⎪ ⎩ i ζ (x, u ˆ, vˆ)2 ≤ δ(1 + k)HJB .

(19a) (19b)

Consider the following candidate Lyapunov function V (t, y) =

ˆ 2 ξ − W ∗ 22 ξ − W 2 + 2 2 ˆ − W ∗ )T Λ(W ˆ − W ∗ ). + α(t − tk + T0 )(W

(20)

198

Y. Wu et al.

Then, compute the derivative of (20) and then substitute the result into Eqs. (13) and (17). One has 1 ˆ 2 + 2α(W ˆ − W ∗ )T Λ(ξ − W ∗ ) − α(ξ − W ˆ )T ∇E(W ˆ) ξ − W 2 t − tk + T0 ˆ ) + α(W ˆ − W ∗ )T Λ(W ˆ − W ∗) − α(ξ − W ∗ )T ∇E(W

V˙ (t, y) = −

1 ˆ 2 + (ξ − W ˆ )T (2αΛ − 2αΘ)(W ˆ − W ∗) ξ − W 2 t − tk + T0 ˆ − W ∗ ) − 2α(ξ − W ˆ )T (μ + μ + ζ i ) ˆ − W ∗ )T (αΘ − αΛ)(W − (W i

=−

ˆ − W ∗ )T (μ + μ + ζ i ) − α(W i   ˆ) (ξ − W i T ∗ T ˆ T ˆ ) (W ˆ − W ) ]Δ = − [(ξ − W ˆ − W ∗ ) − 2α(ξ − W ) (μ + μi + ζ ) (W ˆ − W ∗ )T (μ + μ + ζ i ), − α(W i

 αϕ¯ϕ¯T . αϕ¯ϕ¯ αϕ¯ϕ¯T According to Schur’s Complement Lemma, Δ  0 if and only if

where Δ =

1 t−tk +T0 I T

¯ ϕ¯ϕ¯T  0 αϕ¯ϕ¯T − αϕ¯ϕ¯T (t − tk + T0 )αϕ¯ϕ¯T  0 ⇔ (α − α2 (t − tk + T0 )ϕ¯T ϕ) Moreover, if (14) holds, one can guarantee Δ  0. Based on a + b ≤ 2(a2 + b2 ), θ ∈ (0, 1), (18), (19a) and (19b), we have

 ˆ ˆ − W ∗ )T ] (ξ − W ) ˆ )T (W V˙ (t, y) ≤ − λmin (Δ)[(ξ − W ˆ − W ∗) (W ˆ − W ∗ 2 μ + μ + ζ i 2 ˆ 2 μ + μ + ζ i  + 2αW + 2αξ − W i i √ 2 2 ≤ − (1 − θ)λmin (Δ)|y|A − θλmin (Δ)|y|A + 2 2αμ + μi + ζ i 2 |y|A √ 2 2αμ + μi + ζ i 2 2 . (21) ≤ − (1 − θ)λmin (Δ)|y|A , ∀ |y|A ≥ θλmin (Δ) On the other hand, according to Assumption 1 and (20), the relationship between the candidate Lyapunov function V (t, y) at impulse instants is given as follows ˆ − W ∗ 2 W 2 ˆ − W ∗ )T Λ(W ˆ − W ∗) − αT (W 2 ˆ 2 + ξ − W ∗ 2 ξ − W 2 2 − 2 ˆ 2 + ξ − W ∗ 2 ξ − W 1 − 2αT λ ˆ 2 2 W − W ∗ 22 − . ≤ 2 2

V (t+ , y + ) − V (t− , y − ) =

If (14) holds, one obtains V (t+ , y + ) − V (t− , y − ) ≤ 0. Moreover, using (6a), (6b), (9a) and (9b), one obtains  ˆ − W ∗ ||2 + d], g [dφ||W ||ˆ u − u∗ ||2 ≤ 12 λmax (Ru−1 )¯ (22) 1 ¯ ∗ ∗ ˆ − W ||2 + d]. ||ˆ v − v ||2 = 2¯γ 2 h[dφ||W

Impulsive Accelerated Reinforcement Learning for H∞ Control

199

According to [14, Th.1], and deflate using (22), we derive ˆ (t) − W ∗ ||2 ≤ρe−μt |y0 |A + γ1 (||ˆ ||W u(t) − u∗ (t)||2 ) v (t) − v ∗ ||2 ) + γ3 (HJB ) + γ4 (d) + γ2 (||ˆ ≤ρ1 e−μ1 t |y0 |A + γ11 (HJB ) + γ12 (d), where ρ, ρ1 , μ and μ1 are positive constants, γi , i ∈ 1, 4 and γ1i , i ∈ 1, 2 are class-K∞ functions. The proof of Theorem 1 is completed. Remark 2. The traditional gradient descent method in neural networks experiences a reduced rate of gradient change in the mid and late stages, resulting in a slower update speed for critic weights. However, impulsive momentum-based accelerated critic architecture restarts the gradients at impulsive instants, inducing an acceleration flow that effectively enhances the update speed of critic weights. Specifically, the main source of acceleration is achieved by incorporating momentum into the gradient-based dynamics, these dynamics to be viewed as the continuous-time counterpart of Nesterov’s accelerated optimization algorithm, which minimizes smooth convex functions at a rate of O(1/t2 ) (see [15, 17] for a detailed description). ˆ T, According to system (1) and impulsive system (13), define Z = [xT , W n+2N T T T n+2N ∗ ˆ ,ξ ] ∈ R ˆ = W ,ξ = , AZ = {[x, W | x = 0, W ξ ] , Z ∈ DZ ⊂ R ˆ }, the closed-loop system is given as W ⎧ ⎡ ⎤ f (x) + g(x)ˆ u(t) + h(x)ˆ v (t) ⎪ ⎪ ⎪ 1 ⎪ ˆ ˙ ⎦ , t ∈ (tk , tk+1 ) ⎪ =⎣ ⎪ Z(t) t−tk +T0 (ξ − W (t)) ⎪ ⎨ ˆ (t)) −α∇E(W ⎤ ⎡ (23) x(t− ⎪ k) ⎪ ⎪ ⎪ ˆ (t− ) ⎦ , k ∈ N0 ⎣W ⎪ Z(t+ ⎪ k k)= ⎪ ⎩ ˆ W (t− k) T T

ˆ T , ξ T ]T . with initial condition Z0 = [xT0 , W 0 0 Next, the stability analysis of the closed-loop system will be carried out. Theorem 2. Given a critic neural network with N -dimensional basis functions φ(x) and a compact set Z ∈ DZ ⊂ Rn+2N . Suppose Assumptions 1 and 2 are satisfied. If the tunable parameters α, T0 , T satisfy (14) and using the approximate optimal controller u ˆ (defined in (9a)) with worst-case disturbance vˆ (defined in (9b)) on the plant (1), then there exist positive constants ρ2 , μ2 , and class-K∞ functions γ2i , i ∈ 1, 2 such that |Z(t)|AZ ≤ ρ2 e−μ2 t |Z0 |Az + γ21 (HJB ) + γ22 (d), t ≥ 0. Proof. Consider the following candidate Lyapunov function V (t, Z) = V (x) + V (t, y).

(24)

200

Y. Wu et al.

According to (22) and [21, Th. 1], then consider the derivative of V (x), we have V˙ (x) =∇V (x)[f (x) + g(x)ˆ u + h(x)ˆ v] ∗ =∇V (x)[f (x) + g(x)u + h(x)v ∗ + g(x)(ˆ u − u∗ ) + h(x)ˆ v − v∗ ] 1 ¯ ¯ 2 γ¯ −2 )(dφW ˆ − W ∗ 2 + d) − cV (x), dφ + d)(¯ g 2 λmax (Ru−1 ) + h ≤ (W 2 ¯ dφ + d)(¯ where c > 0, there exists a number ε, such that 12 (W g 2 λmax (Ru−1 ) 2 −2 ∗ ∗ ¯ ˆ ˆ +h γ¯ )(dφW − W 2 + d) ≤ εW − W 2 . Hence ˆ − W ∗ 2 ≤ −cV (x) + εyA . V˙ (x) ≤ −cV (x) + εW

(25)

Using (21) and (25), we obtain V (t, Z) =V (x) + V (t, y) ≤ − (1 − θ)λmin (Δ)|y|2A − cV (x) − θλmin (Δ)|y|2A √ + (ε + 2 2αμ + μi + ζ i 2 )|y|A √ ε + 2 2αμ + μi + ζ i 2 2 . ≤ − (1 − θ)λmin (Δ)|y|A − cV (x), ∀ |y|A ≥ θλmin (Δ) On the other hand, due to x do not changing at impulse instants, we have V (t+ , Z + ) − V (t− , Z − ) = V (t+ , y + ) − V (t− , y − ) ≤ 0. Furthermore, according to Theorem 1 and [14, Th.1], one has ˆ − W ∗ ||2 ) + γ2 (HJB ) + γ3 (d) |Z|AZ ≤¯ ρe−¯μt |Z0 |AZ + γ1 (||W ≤ρ2 e−μ2 t |Z0 |AZ + γ21 (HJB ) + γ22 (d), where ρ¯, ρ2 , μ ¯ and μ2 are positive constants, γi , i ∈ 1, 3 and γ2i , i ∈ 1, 2 are class-K∞ functions. The proof of Theorem 2 is completed.

4 Simulation Results Consider the following affine nonlinear system [21],  x(t) ˙ = f (x) + g(x)u + h(x)v , z(t) = x where f (x) = [tanh(x1 + x2 )x2 , − tanh(x1 + x2 )x2 ]T , g(x) = I2 , h(x) = −I2 , and v(t) = 2e−0.1t cos(t). Select the impulsive period T = 0.5s, learning rate α = 60, and T0 = 0.01s, such that (14) hold. Then, other parameters are chosen as Ru = 12 I2 , γ¯ = 5 and ˆ 0 are randomly initialized within the interval [−1, 1], x0 = [0, 0]T . The initial weights W the basis function φ(x) = [x21 , x1 x2 , x22 ]T . To ensure that the collected data satisfies Assumption 1, a detection signal lasting 0.5 s is added to (9a).

Impulsive Accelerated Reinforcement Learning for H∞ Control

201

Figure 1(a) illustrates the state response process of the affine nonlinear system (1), indicating its convergence to the origin at approximately 60 s. Moreover, Fig. 1(b) depicts the state trajectory of the impulsive system (13). Figure 2 presents a comparison among the impulsive-based critic neural network, the continuous accelerated critic neural network [11], and the traditional critic neural network. Results showcase the impulsive-

Fig. 1. (a) State response of system (1); (b) The trajectory of the impulsive period system (13).

ˆ 1 for the three methods, (b) Convergence Fig. 2. (a) Convergence process of critics’ weight W ˆ 2 for the three methods, (c) Convergence process of critics’ weight process of critics’ weight W ˆ 3 for the three methods, (d) The curve of r(t). W

202

Y. Wu et al.

based critic neural network achieving the swiftest convergence to steady-state, followed by the continuous accelerated variant, and lastly, the traditional one. To demonstrate the correlation between L2 -gain and time, define the following ratio of disturbance atten  t ( z(τ ) 2 + u(τ ) 2 )dτ  12 Ru t 2 . Figure 2(d) illustrates the curve of r(t), uation as r(t) = 0 v(τ ) 2 dτ 0

2

where it converges to 3.293 (< γ¯ = 5) as time increases, which implies that (2) is satisfied. Additionally, Eqs. (9a)–(10) reveal the dependency of the control law u ˆ on γ¯ , leading to varying control performance for different γ¯ . Further insights on selecting an appropriate γ¯ to balance robustness and control performance are discussed in [5].

5 Conclusion An impulsive accelerated reinforcement learning algorithm has been proposed for solving the H∞ control problem of a class of nonlinear systems with partially unknown dynamics. Compared with the traditional gradient descent method or the continuous accelerated gradient methods, the proposed algorithm provides several adjustable variables, effectively improving the convergence speed and transient performance. The stability of the closed-loop system has been analyzed by the quasi-periodic Lyapunov function method. Potential extensions include exploring similar accelerated training techniques for the actor neural network.

References 1. Yang, X., Xu, M., Wei, Q.: Adaptive dynamic programming for nonlinear-constrained H∞ control. IEEE Trans. Syst. Man Cybern. Syst. 53(7), 4393–4403 (2023) 2. Luo, B., Wu, H.N., Huang, T.: Off-policy reinforcement learning for H∞ control design. IEEE Trans. Cybern. 45(1), 65–76 (2014) 3. Wang, D., Mu, C.X., Liu, D.R., Ma, H.W.: On mixed data and event driven design for adaptive-critic-based nonlinear H∞ control. IEEE Trans. Neural Networks Learn. Syst. 29(4), 993–1005 (2017) 4. Kiumarsi, B., Vamvoudakis, K.G., Modares, H., Lewis, F.L.: Optimal and autonomous control using reinforcement learning: a survey. IEEE Trans. Neural Networks Learn. Syst. 29(6), 2042–2062 (2017) 5. Bas¸ar, T., Bernhard, P.: H∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach. Springer, Boston (2008). https://doi.org/10.1007/978-0-81764757-5 6. Liang, X., Liu, Y., Chen, T., Liu, M., Yang, Q.: Federated transfer reinforcement learning for autonomous driving. In: IFederated and Transfer Learning, pp. 357–371 (2022) 7. Stanly, J.J., Priyadarsini, M.J.P., Parameshachari, B.D., Karimi, H.R., Gurumoorthy, S.: Deep Q-network with reinforcement learning for fault detection in cyber-physical systems. J. Circuits, Syst. Comput. 31(9), 2250158 (2022) 8. Liu, Q., Liu, Z., Xiong, B., Xu, W., Liu, Y.: Deep reinforcement learning-based safe interaction for industrial human-robot collaboration using intrinsic reward function. Adv. Eng. Inform. 49, 101360 (2021) 9. Yang, Y., Wan, Y., Zhu, J., Lewis, F.L.: H∞ tracking control for linear discrete-time systems: model-free Q-learning designs. IEEE Control Syst. Lett. 5(1), 175–180 (2020)

Impulsive Accelerated Reinforcement Learning for H∞ Control

203

10. Vamvoudakis, K.G., Ferraz, H.: Event-triggered H∞ control for unknown continuous-time linear systems using Q-learning. In: 2016 IEEE 55th Conference on Decision and Control (CDC), pp. 1376–1381 (2016) 11. Wilson, A.C., Recht, B., Jordan, M.I.: A Lyapunov analysis of accelerated methods in optimization. J. Mach. Learn. Res. 22(1), 5040–5073 (2021) 12. Pertsch, K., Lee, Y., Lim, J.: Accelerating reinforcement learning with learned skill priors. In: Conference on Robot Learning, pp. 188–204 (2021) 13. Poveda, J.I., Li, N.: Robust hybrid zero-order optimization algorithms with acceleration via averaging in time. Automatica, pp. 109361 (2021) 14. Ochoa, D.E., Poveda, J.I.: Accelerated continuous-time approximate dynamic programming via data-assisted hybrid control. IFAC-PapersOnLine 55(12), 561–566 (2022) 15. Su, W., Boyd, S., Candes, E.: A differential equation for modeling Nesterov’s accelerated gradient method: theory and insights. J. Mach. Learn. Res. 17(153), 1–43 (2016) 16. Wang, D., He, H.B., Liu, D.R.: Adaptive critic nonlinear robust control: a survey. IEEE Trans. Cybern. 47(10), 3429–3451 (2017) 17. Brendan, O.D., Candes, E.: Adaptive restart for accelerated gradient schemes. Found. Comput. Math. 15, 715–732 (2015) 18. Kamalapurkar, R., Walters, P., Rosenfeld, J., Dixon, W.: Reinforcement Learning for Optimal Feedback Control. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-78384-0 19. Hornik, K., Stinchcombe, M., White, H.: Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural Networks 3(5), 551–560 (1990) 20. Chowdhary, G., Johnson, E.: Concurrent learning for convergence in adaptive control without persistency of excitation. In: 49th IEEE Conference on Decision and Control (CDC), pp. 3674–3679 (2010) 21. Kokolakis, N.M.T., Vamvoudakis, K.G.: Safety-aware pursuit-evasion games in unknown environments using gaussian processes and finite-time convergent reinforcement learning. IEEE Trans. Neural Networks Learn. Syst. (2022). https://doi.org/10.1109/TNNLS.2022. 3203977

MRRC: Multi-agent Reinforcement Learning with Rectification Capability in Cooperative Tasks Sheng Yu, Wei Zhu(B) , Shuhong Liu, Zhengwen Gong, and Haoran Chen School of Information and Communication, National University of Defense Technology, Wuhan 430014, China [email protected], [email protected] Abstract. Motivated by the centralised training with decentralised execution (CTDE) paradigm, multi-agent reinforcement learning (MARL) algorithms have made significant strides in addressing cooperative tasks. However, the challenges of sparse environmental rewards and limited scalability have impeded further advancements in MARL. In response, MRRC, a novel actor-critic-based approach is proposed. MRRC tackles the sparse reward problem by equipping each agent with both an individual policy and a cooperative policy, harnessing the benefits of the individual policy’s rapid convergence and the cooperative policy’s global optimality. To enhance scalability, MRRC employs a monotonic mix network to rectify the state-action value function Q for each agent, yielding the joint value function Qtot to facilitate global updates of the entire critic network. Additionally, the Gumbel-Softmax technique is introduced to rectify discrete actions, enabling MRRC to handle discrete tasks effectively. By comparing MRRC with advanced baseline algorithms in the “Predator-Prey” and challenging “SMAC” environments, as well as conducting ablation experiments, the superior performance of MRRC is demonstrated in this study. The experimental results reveal the efficacy of MRRC in reward-sparse environments and its ability to scale well with increasing numbers of agents. Keywords: Multi-agent reinforcement learning · Cooperative task Individual reward rectification · Monotonic mix function

1

·

Introduction

In recent years, artificial intelligence (AI) technologies have been extensively applied in various domains such as medical services [3,15], multiplayer games [13,34], and image processing [4,25]. Within these domains, effectively handling multi-agent cooperative tasks has emerged as a crucial challenge. The advent of the centralised training with decentralised execution (CTDE) [11] paradigm has significantly advanced the field of multi-agent reinforcement learning (MARL) in addressing such tasks through centralized agent training. However, challenges arise due to the conditions under which the environment provides rewards to agents c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, LNCS 14448, pp. 204–218, 2024. https://doi.org/10.1007/978-981-99-8082-6_16

Multi-agent Reinforcement Learning with Rectification Capability

205

[19]. For instance, in games like Go, it is difficult to determine the winner until the game’s completes, making it arduous for agents to obtain rewards during gameplay. Consequently, this hampers the convergence of the algorithm, increases the cost of sample acquisition, and diminishes the utilization of sample [1]. To tackle the sparse reward problem, researchers have proposed various methods, including reward function reconstruction [10,20] and multi-objective learning [7,23]. While reconstructing the reward function can partially alleviate the slow update issue caused by sparse rewards, it typically applies exclusively to specific tasks. Moreover, migrating to a new environment necessitates designing a novel reward function. Additionally, reconstructed reward functions occasionally misguide agents and require constant debugging to provide meaningful guidance, which reduces efficiency [5]. On the other hand, multi-objective learning involves augmenting the original learning goal, enabling agents to receive rewards upon reaching certain milestones. This approach expedites training in the initial stages, and the agent’s policy improves through accumulated successful experiences. However, these additional rewards introduce significant errors that accumulate over time, ultimately impeding the attainment of a globally optimal solution [16]. As the number of agents participating in cooperative tasks grows, several MARL algorithms, including FOP [35], QPLEX [28], and MADDPG [12], encounter challenges related to dimensionality. These challenges, commonly referred to as dimensional catastrophes, hinder the efficient operation of the algorithms. To enhance the scalability of algorithms within the CTDE framework, researchers have proposed various methodologies, including value-based approaches [21,26,31] and actor-critic methods [32,36]. The value-based approach typically revolves around Q-values updated by agents. It often employs a mix network that combines the Q-values of each agent resulting in a joint value function Qtot with global nature. This approach improves scalability to some extent. Nonetheless, in this approach, the agent needs to recalculate the value function and select the next action at each moment t, leading to a significant reduction in operational efficiency. In contrast, algorithms based on the actorcritic framework utilize policy gradients for updates. By incorporating a global state value function, these algorithms can achieve greater scalability. However, policy gradients are normally employed in tasks with continuous action spaces, which imposes certain limitations [2]. A novel method, Multi-agent Reinforcement learning with Rectification Capability in cooperative tasks (MRRC), is proposed in this study to address the challenges of sparse reward and scalability in multi-agent cooperative tasks. The MRRC method incorporates both individual and cooperative policies for each agent, with the individual policy aiming for individual rewards and the cooperative policy targeting cooperative rewards. By employing an actor network, the individual reward rectifies the policy and produces the action a, while the critic network generates Q-values based on this action. These Q-values are further adjusted by the mix network to form a global value function Qtot . The loss value is calculated using Qtot , and the MRRC’s network is updated accordingly.

206

S. Yu et al.

MRRC incorporates a discrete action corrector to handle discrete actions and estimate the policy gradient. The key contributions of this study are as follows: – A novel MARL method based on the actor-critic framework is proposed, specifically designed to tackle multi-agent cooperative tasks. – An individual reward rectification module is introduced which equips each agent with two policies: a individual policy and a cooperative policy. By combining the benefits of rapid convergence from the individual policy and the global optimal solution from the cooperative policy, the actor network efficiently outputs the optimal next action, effectively addressing the challenges posed by sparse rewards. – The mix network rectification module and the discrete action rectification module are introduced in this study, which significantly enhance the scalability of the algorithm. The linear combination of the value functions Q from all agents forms the global joint value function Qtot , which proves to be effective, even in situations with changing environments.

2 2.1

Background Dec-POMDP

In the pursuit of solving fully cooperative multi-agent tasks, it is often necessary to decompose the task and model it as a decentralized partially observable Markov decision process (Dec-POMDP) [17]. This modeling approach is normally represented by a one-tuple, denoted as G =< S, A, P, r, γ, N, Ω, O >. Among the tuple N ≡ {1, 2, ..., n} represents the set of all agents, S represents the set of environment states, and the observation oi ∈ Ω is obtained from the observation kernel O(s, i). The discount factor is denoted as γ ∈ [0, 1). At time step t, each agent i selects an action ai ∈ A based on its observation oi and the collective actions of all agents form a joint action denoted as a. This joint action influences the environment, leading to a change in the state and the generation of a reward value r = R(s, a). Each agent then transitions to the next state s through the state transition function P (s |s, a) : S × A → [0, 1]. Based on their respective observation values, each agent maintains its own action-observation history, denoted as τi ∈ T ≡ {Ω × A}. The overarching objective throughout this process is to discover a joint policy that maximizes the joint action-value function Q(st , at ) = E[Rt |st , at ]. In tasks where actions are continuous, agents utilize a continuous policy μ, while discrete actions correspond to a discrete policy π. 2.2

MERL and IRAT

MERL [14] and IRAT [30] are both approaches aimed at addressing the sparse reward problem in multi-agent tasks. They utilize gradient-based optimizers to train agents with higher reward potential, ultimately maximizing the overall reward.

Multi-agent Reinforcement Learning with Rectification Capability

207

MERL takes a hierarchical approach and employs two optimization processes to handle different objectives. Initially, an evolutionary algorithm is utilized to neuroevolve the entire population of agents, focusing on maximizing the collective reward in scenarios with sparse rewards. Subsequently, policy gradients are incorporated into the agent population for training. This involves information transfer between agents and the population, and culminates in the optimization of the overall policy using the acquired rewards. In contrast, IRAT utilizes a team policy to guide the optimization of agentspecific policies. For a specific agent i, the optimization goal is to maximize ∞ its individual rewards, denoted as J(θi ) = E [ t=0 γ t rit ]. Since the team policy in IRAT represents a joint policy, the actions outputted by each agent i are executed using the respective individual policies πi . The objective for each agent i is to maximize the following function:  E[max(J CLIP (θi ), J IRAT (θi ))], σit ≤ 1 , (1) J(θi ) = E[min(J CLIP (θi ), J IRAT (θi ))], σit > 1 where σit is the coefficient of similarity. J IRAT (θi ) = E [clip (σit (θi ) , 1 − ξ, 1 +ξ) Ati ] is a cooperation-oriented goal and J CLIP (θi ) = E[min(ηit Ati , clip(ηit , 1 − , 1 + )Ati )] is a goal reward of a single agent. 2.3

MADDPG, QMIX and FACMAC

MADDPG [12], QMIX [21], and FACMAC [18] are all algorithms employed in the CTDE paradigm to address cooperative tasks involving multiple agents. QMIX is a value-based algorithm, while MADDPG and FACMAC are actorcritic framework-based algorithms. In MADDPG, each agent possesses its own independent actor and critic, enabling autonomous learning. For agent i, a joint action value function Qμi (s, a1 , ..., an ; φi ) is maintained, and the critic network is updated by minimizing the following loss function: LOSS(φi ) = ED [(y i − Qμi (s, a1 , ..., an ; φi ))2 ],

(2)

  − where y i = ri +γQμi (s , a1 , ..., an ; φ− i ),ai = μi (τi ; θi ) and ri represent the reward   for agent i, {a1 , ..., an } denotes the set of actions taken by other agents obtained from the replay buffer D, and φ− i represents the parameter of the target critic for agent i. In contrast, the QMIX algorithm builds upon Q-learning and utilizes a linear, monotonic mix function to compose the global value function Qtot , expressed as:

Qtot (τ , a, s; φ, ω) = fω (s, Q1 (τ1 , a1 ; φ1 ) , . . . , Qn (τn , an ; φn )) ,

(3)

where ω represents the parameter of the monotonic mixture function f . Taking advantage of both MADDPG and QMIX, the FACMAC algorithm is introduced. It incorporates mix functions within the actor-critic framework. Specifically, the Q-values of each agent is combined with nonlinear functions to obtain the joint action value function Qμtot (τ , a, s; φ, ω) =

208

S. Yu et al. n

gω (s, {Qμi i (τi , ai ; φi )}i=1 ), parameters φ and φi represent the parameters of the joint value function Q and Qμi i for agent i, respectively. The parameter of the nonlinear function is represented by ω. The critic network is updated by minimizing the following loss function: LOSS(φ, ω) = ED [(y tot − Qμtot (τ , a, s; φ, ω))2 ],

(4)

where y tot = r + γQμtot (τ  , μ(τ  ; θ − ), s ; φ− , ω − ), ω − , θ − , and φ− denote the parameters of the mix function, the target actor network, and the critic network, respectively.

3

Method

This section describes the components and processes of MRRC, and the overall framework of MRRC is shown in Fig. 1.

Fig. 1. Overview of the proposed method MRRC. (a) The overall process of individual reward rectification. (b) Sampling of continuous policy and policy gradients using discrete action rectification. (c) Computation of the joint value function Qtot using mix network

3.1

Individual Reward Rectification

By leveraging the widespread implementation of the CTDE paradigm in MARL algorithms, researchers have achieved remarkable advancements in addressing collaborative multi-agent tasks, including QMIX [21] and MADDPG [12]. Nevertheless, such tasks often provide cooperative rewards exclusively upon accomplishing specific objectives. As the complexity of the environment escalates, agents encounter difficulties in attaining rewards within a designated timeframe,

Multi-agent Reinforcement Learning with Rectification Capability

209

leading to sparse reward predicaments which yield sluggish policy convergence [33]. To tackle this issue, the integration of individual reward rectification into the actor-critic framework is proposed in this study. Individual reward rectification involves utilizing each agent’s personal reward to rectify cooperative reward, thereby mitigating the sparsity inherent in the reward environment. Unlike IRAT, this approach necessitates the acquisition of two policies per agentan individual policy and a cooperative policy-rendering it more compatible with the actor-critic framework and significantly enhancing the agent’s learning efficiency. The exact process of Individual Reward Rectification is shown in Fig. 2. Op mal policy Subop mal policy Conversion func on Learning path using coopera ve reward Learning path using individual reward ...

Learning path using individual reward rec fica on Time spent on the three learning paths

Reward

Prefer to use coopera ve reward to learn Ini al state

Prefer to use individual reward to learn

Fig. 2. Example of individual reward rectification

In Fig. 2, the far-right blue line represents the cooperative policy, which relies on sparse cooperative rewards for learning, resulting in a slow learning process. Conversely, the far-left orange line represents the individual policy, which utilizes dense individual rewards for learning, leading to a faster learning process but with a tendency to converge towards local optima. In order to leverage the strengths of both approaches while mitigating their drawbacks, we introduce individual reward rectification. This approach combines the benefits of dense individual rewards for rapid learning in the initial stage, and subsequently relies on the cooperative policy to attain the optimal reward position in the later stage. Regarding the concept of “two policies per agent”, specifically, agent i is required to acquire an individual policy π ˆi with parameter θˆi and a cooperaencompass maximizing the tive policy πi with parameter θi . Their objectives ˆ θˆi ) = E [∞ γ t rˆt ] and the cumulative coopcumulative individual reward R( i t=0 ∞ erative reward R(θi ) = E [ t=0 γ t rit ], respectively. It is noteworthy that these two policies have mutually influencing effects. The process of individual reward rectification employs cumulative increasing KL regularization [27] and cumulative decreasing KL regularization to strike a balance between the two policies, ultimately leading to the optimal action produced by the individual policy. The

210

S. Yu et al.

cumulative decreasing KL regularization is employed to learn the actions performed by the individual policy, while the cumulative increasing KL regularization refines and samples the actions of the cooperative policy. The cumulative increasing KL regularization does not exert its influence during the initial stages of learning for agent i. Consequently, the actions acquired by agent i tend to learn individual policies. This accelerates the learning efficiency and assists in escaping the sparse reward state in the early stages of learning. Subsequently, the role of the regularizer employed to learn cooperative policies gains prominence in the later stages of learning. Agent i becomes inclined to acquire cooperative policies associated with substantial reward values, thus eliminating the drawback of suboptimal solutions yielded by solely learning individual policies. Based on the aforementioned analysis, the current challenge focuses on determining the optimal balance between the influence of the individual and the cooperative policies, and the difference between the two should not be sufficient to generate conflicts between the policies. Similar to IRAT, a similarity coefficient is utilized to ascertain the equilibrium point, denoted as δ. The similarity coefficient is defined as follows: π ˆ (ati |τit , θˆi ) , (5) δit (θi , θˆi ) = π(ati |τit , θi ) where δit represents the similarity coefficient of agent i at time t and δit ∈ [1 − x, 1 + x], therein x serving as the limiting factor. Examining the expression of the similarity coefficient reveals a distinction from IRAT. In this research, each agent corresponds to an individual policy π ˆ and a cooperative policy π, which facilitates the computation of the similarity coefficient, enhances operational efficiency, and adapts more effectively to the actor-critic framework within the CTDE paradigm. In the initial execution of individual reward rectification, significant emphasis is placed on the individual reward, necessitating the utilization of a decreasing KL regularizer to amplify the effectiveness of the individual policy. The target ˆ θˆi ) is represented as follows: reward for the individual policy R( ˆi ˆ θˆi ) = E[Iδ≥1 max(δit (θi , θˆi ), δˆt (θˆi , θi ))A R( i ˆi + Iδ Stop κ . where Stop κ represents that the mask satisfies the threshold of sparsity κ under S(θ). As mentioned in Sect. 3.1, GCS is actually weights that are sensitive to changes in the loss of sample common characteristics, in contrast, those weights that are sensitive to sample individual characteristics are preferentially removed. From an implicit loss perspective, a weight metric that is sensitive to gradient coupled flow is to capture those weights that are most sensitive to performance improvements at initialization time. See Algorithm 1 for the specific gradient coupled sensitive pruning method. 3.3

A Flattened Variant of GCS to Get Subdivided Coupled Flows

The weights chosen are sensitive to the gradient coupled flow, meaning that the compressed model will preserve as much information as possible about the gradient coupled flow. To better exploit the coupled loss, we propose a variant of GCS called GCS-Group to obtain richer coupled flows from pairwise combinations. The partition subset is extended to partition p subsets Dpb = {(xp , yp )}, and there are g 1 , g 2 , · · · , g p accordingly. As the sensitivity with regard to an element

Gradient Coupled Flow

237

of {g  i g j | i = j, i, j ∈ p} has been derived from Eq. (4), an integrated criterion is formed by summing multiple groups of gradient coupled flows. At present, we set p = 5, and use the Euclidean distance to define a unified measure (Algorithm  2  p p  2): Sgroup (θ) = i=1 j=i+1 S θ, g i g j . Theoretically, GCS-Group has more capacity to achieve richer coupled information than the original one, as inherent features shared by samples are more likely embodied by means of pairwise combinations of subdivided groups. However, the trade-off between distinguishability and sensitiveness is required to be further considered, especially at extremely high compression ratios.

4

Experiments

Our experiments verify the effectiveness of GCS from multiple perspectives such as image classification task performance, visualization, and analysis of implicit loss. In the image classification task, a variety of recent pruning methods before training are investigated for comparisons: (i) Single-shot pruning of SNIP [15] and GraSP [29] with corresponding layer sparse quota random pruning. (ii) Iterative pruning SynFlow [26] and dynamic iterative [12,16]. In addition, implicit loss analysis reveal some interesting phenomena. 4.1

Performance of GCS in Image Classification

We use VGG and ResNet architectures to analyze the performance of GCS on image classification tasks with different pruning ratios. Specifically, we choose the full networks with L2 regularization as the baseline. First, the compression efficiency of GCS single pruning on VGG19, ResNet18, ResNet32, and LeNet5 is investigated on datasets such as CIFAR10, CIFAR100, Tiny-Imagenet, MNIST, and Fashion-MNIST. Then we apply GCS to iterative pruning before training to examine the performance of GCS. Single-Shot Pruning. The performance comparison on VGG19, ResNet18, and LeNet5 is shown in Fig. 3. Note that GCS outperforms SNIP and GraSP in most cases. GCS and SNIP are closer to the baseline at low compression ratios, while at high compression ratios above 99%, GCS is on par with GraSP, which also proves that GCS still maintains strong trainability with little generalization loss. Due to the relatively compact structure of ResNet, the residual structure strengthens the stability of the sparse network, and the advantage of GCS is not apparent, but it is still slightly higher than other single pruning. From the perspective of random pruning under the corresponding layer sparse quota, GCS-Random is much lower than GCS at a compression rate above 95%. The performance is severely affected after random shuffling, which indicates that the weights selected by GCS are more representative and cannot be replaced randomly. More experimental data are in Table 1 and Table 2. GCS is evaluated on Tiny-ImageNet dataset with VGG19 and ResNet32, and compared with SNIP and GraSP the pruning rates of 90%, 95%, 98% respectively,

238

J. Wu et al.

which is shown in Table 3. As expected, the performance of GCS on TinyImageNet is similar to that on CIFAR10/100, and it slightly outperforms other methods overall. With regards to more complex datasets, the generalization boost brought by GCS further demonstrates the advantages of gradient coupled flow. Iterative Pruning. Compared with single pruning, progressive iterative pruning can obtain a sparse network with a higher compression rate and more stability. We compare SNIP, GraSP, and GCS with SynFLow using the iterative pruning method in FORCE [12], and the experimental results are shown in Fig. 4. Table 1. Performance comparison of pruned VGG19 and ResNet32 on CIFAR10/100. The bolded number is the one with the highest accuracy among SNIP, GraSP, and GCS. The # indicates that the model failed to converge. Dataset

CIFAR10

CIFAR100

VGG19

Acc: 94.20%

Acc: 74.16%

Pruning ratio 95%

98%

99%

95%

98%

99%

SNIP GraSP GCS

93.71 92.22 # 71.95 56.23 # 93.07 92.26 91.57 71.62 70.15 66.72 93.66 92.95 91.84 72.50 70.23 67.84

ResNet32

Acc: 94.80%

Pruning ratio 95% SNIP GraSP GCS

98%

Acc: 74.83% 99%

95%

98%

99%

91.51 88.36 85.12 65.03 52.19 36.04 91.77 89.53 85.52 66.95 58.55 49.10 92.08 89.96 85.60 67.28 58.62 48.59

Table 2. Performance comparison of pruned LeNet5 on MNIST and Fashion-MNIST. Dataset

MNIST

LeNet5

Acc: 99.40%

Pruning ratio 90%

Fashion-MNIST 95%

Acc: 91.98% 98%

99%

99.5% 90%

95%

98%

99%

99.5% 75.17

SNIP

99.35 99.17

98.99

98.49

96.06

90.61

89.56

87.55

84.23

GraSP

99.10

99.17

99.06

98.62

96.19 89.69

89.60

86.60

85.91 79.11

GCS

99.33

99.20 99.13 98.79 95.92

90.83 89.78 88.39 85.59

77.86

Table 3. Performance comparison of pruned VGG19 and ResNet32 on Tiny-ImageNet. Network

VGG19: 63.29%

Pruning ratio 90% SNIP GraSP GCS

95%

98%

ResNet32: 63.86% 90%

95%

98%

61.07 59.27 48.95 51.56 40.41 24.81 60.26 59.53 56.54 54.84 48.45 37.25 61.12 59.69 57.35 55.37 49.66 35.89

Gradient Coupled Flow

239

It is an obvious phenomenon that the difference between various algorithms is minimal at low compression ratios. As the compression ratio increases, the performance of SynFLow remains better when the compression ratio is exceptionally high (the remaining rate is less than 10−3 ), which is an advantage of inter-layer equalization. Nevertheless, it is worth noting that on ResNet, the negative impact caused by invalid retention [28] in sparse networks is small, and profit from the sensitivity of weights to implicit losses, GCS-Iterative is better than SynFlow on more complex datasets.

Fig. 3. Performance comparison of single pruning. Rank (solid line) sorts the selection weights according to the metrics, and Random (dotted line) prunes the selection weights under the corresponding layer sparse quota. The grey horizontal line represents the baseline, and the shaded area represents the range of deviations from replicate experiments.

4.2

Fig. 4. Performance comparison of iterative pruning. The grey horizontal line represents the baseline. The shaded area indicates the range of bias for replicate experiments.

Evaluate Model Generalization with Implicit Loss and Weight Reduction

The subdivision coupled flow mentioned in Sect. 3.1 can well explain the intrinsic reasons for the performance improvement achieved by methods such as smallbatch training and L2 penalty. We argue that the implicit loss decline and

240

J. Wu et al.

weight change are the key factors affecting model generalization. We then further explore to quantitatively characterize generalization by utilizing the product of the cumulative implicit loss and the change of weight norms:  T GCM = (θ T  − θ 0 ) ΔLc (Djb ) dt (5) 0

where T denotes the entire training time, and the cumulative implicit loss in integral form is constructed by relaxing the training rounds. Notes that calculations of the implicit loss between any two random batches are required to be performed before and after each round twice, which makes it impractical for application use in terms of computational and storage overhead. Alternatively, we take two randomly selected batches of data Dib , Djb before each training round and use ΔLc (Djb ) = L(θ − ∇θ L(Dib ); Djb ) − L(θ; Djb ) as an approximate expression of the implicit loss at that time.

Fig. 5. Linear relationship between GCM and test accuracy. Experiments are performed with VGG19 on CIFAR100/10 (corresponding to the left and right figures respectively), and the red dashed line shows a first-order linear fit. (Color figure online)

We calculate the corresponding GCM for the mainstream algorithms such as SNIP, GraSP, and our GCS, GCS-Group. In order to avoid low stability due to one single experiment, we repeat it twice for all algorithms as shown in Fig. 5. Regardless of the existing work or our proposed GCS, there is a relatively obvious linear relationship between GCM and test set accuracy. Moreover, the fluctuation of performance is highly synchronized with the sum of the implicit loss as well as the compression ratio. Specifically, GCM does provide a practical scheme to describe model stability and can be utilized to estimate in advance whether a model has excellent generalization ability or not.

5

Conclusion

In this work, we propose the concept of gradient coupled flow to describe loss implicit decrease of the data to be trained caused by one-batch training during each round. Based on the gradient coupled flow, a metric sensitive to gradient

Gradient Coupled Flow

241

coupled flow (GCS) is proposed for single or iterative pruning before training and has successfully achieved excellent performance. In the follow-up research, we will try to use gradient coupled between network structures to achieve extreme compression ratio or obtain structured reduced networks. In addition, we also try to improve the generalization by NLP models based on the gradient coupled flow. The comprehensive correlation between subdivided coupled flow and model generalization is also an exciting direction that deserves further research.


Motif-SocialRec: A Multi-channel Interactive Semantic Extraction Model for Social Recommendation

Hangyuan Du1,2(B), Yuan Liu1,2, Wenjian Wang1,2, and Liang Bai2,3

1 School of Computer and Information Technology, Shanxi University, Taiyuan 030006, Shanxi, China
2 Ministry of Education, Key Laboratory of Computational Intelligence & Chinese Information Processing (Shanxi University), Taiyuan 030006, Shanxi, China
[email protected]
3 Institute of Intelligent Information Processing, Shanxi University, Taiyuan 030006, Shanxi, China

Abstract. To capture complex interaction semantics beyond pairwise relationships for social recommendation, a novel recommendation model, namely Motif-SocialRec, is proposed from the perspective of motifs. It efficiently describes interaction patterns from multiple channels with different motifs. In the model, we depict a series of local structures by motifs, which can describe the high-level interactive semantics in the fused network from three views. By employing a hypergraph convolution network, representations that preserve potential semantic patterns can be learned. Additionally, we enhance the learned representations by establishing self-supervised learning tasks on different scales to further explore the inherent characteristics of the network. Finally, a joint optimization model is constructed by integrating the primary and auxiliary tasks to produce recommendation predictions. Results of extensive experiments on four real-world datasets show that Motif-SocialRec significantly outperforms baselines in terms of different evaluation metrics.

Keywords: Social recommendation · Motif · Multi-channel hypergraph convolution · Self-supervised learning

1 Introduction

In recent studies on recommendation systems, one idea is generally recognized: individual behaviors are driven by both intrinsic and social factors [1,2]. Consequently, social relationships between users are introduced into recommendation systems to enhance decision-making, which is called social recommendation (SocialRec). However, incorporating social relationships makes the entire


heterogeneous interactive network more complex, in terms of both structure and characteristics. Thus, how to efficiently extract valuable semantic patterns about user preference becomes an urgent problem in social recommendation.

Graph neural networks (GNNs) are widely used to learn node representations in social recommendation models. The information aggregation mechanism of GNNs fits the constraint imposed by social relationships, namely that correlated users tend to share common preferences in the interaction network. GNNs aggregate local information recursively, relying on the neighborhood structure to update node representations. However, such a message propagation mechanism lacks the ability to model the complex interaction patterns contained in high-level local structures of the network. In social recommendation, these local structures are formed under the influence of social connections and behavior interactions, and they carry abundant valuable information for capturing user preference. Consequently, how to model high-level local structures and learn representations conditioned on them is essential for producing reliable recommendation results.

Motif, as a particular local structure in a network, provides a powerful modeling tool to address the aforementioned problem [15]. In this work, we use motifs to depict the high-level local structures in the network resulting from the fusion of social relationships and the user-item interaction network. Specifically, we extract three types of motifs to reflect different local connection modes in the network driven by diverse semantic patterns. By treating motifs as carriers of information propagation, we utilize a hypergraph convolution network to extract different interactive semantics during representation learning in a multi-channel framework. Furthermore, we employ the self-supervised learning (SSL) paradigm to enhance the learned representations by establishing contrastive tasks on multiple scales. With this design, user preference can be effectively recognized, thus promoting reliable recommendation prediction.

In brief, the main contributions of this paper can be summarized as follows:
• We extract a series of motifs to capture high-level interactive semantics in the information network.
• We propose a novel social recommendation model, namely Motif-SocialRec.
• Extensive experiments are conducted on four real-world datasets, demonstrating the superior performance of Motif-SocialRec.

2 Related Work

2.1 Social Recommendation

With the booming development of GNNs, various GNN-based SocialRec models have emerged in recent years, whose performance largely depends on the specific GNN encoders they adopt. Based on their GNN encoders, these models can be organized as follows: DiffNet [4] and GNNRec [6] employ the Graph Convolutional Network (GCN) to learn user and item representations; GraphRec [7], DiffNet++ [8], and SGHAN [9] use the Graph Attention Network (GAT) to recognize each user's


preference on items or their influence on friends; DGRec [10] and GTN [11] utilize the Graph Recurrent Neural Network (GRNN) to study both dynamic user interests and context-dependent social influences for online recommendation; CGL [12] and DcRec [13] learn user and item representations by linearly propagating them on the user-item interaction network based on LightGCN [5].

2.2 Motif

Motif, as a simple building-block structure, was first introduced in [14]. It refers to a specific local connection mode that occurs frequently in complex networks and can be found in many complex systems modeled as networks. Meanwhile, from the perspective of information processing, motifs that describe different physical meanings may play similar roles in defining universal characteristics of networks. Motif has been proved to be an effective basic building factor for analyzing formation rules in most networks. In this paper, high-level interactive semantics between users and items are extracted and encoded from multiple channels under different motifs.

2.3 Self-supervised Learning

SSL is a popular learning paradigm for capturing implicit patterns or knowledge from abundant input data through well-designed pretext tasks, without using manual labels. Driven by its successful application in computer vision and natural language processing, employing SSL to address the data sparsity problem and noise susceptibility in recommendation systems holds bright prospects. In this work, we perform cross-scale contrastive tasks to enhance representations by exploring inherent characteristics of the interaction patterns conditioned on different motifs.

3 Model

3.1 Preliminaries

For recommendation with user relations, let $U = \{u_1, u_2, \cdots, u_{|U|}\}$ be a set of users and $I = \{i_1, i_2, \cdots, i_{|I|}\}$ be a set of items. Let $R \in \mathbb{R}^{|U| \times |I|}$ be a binary user-item interaction matrix, where $R_{u,i} = 1$ indicates that user $u$ has an interaction with item $i$, and otherwise $R_{u,i} = 0$. In addition, directed social relations are available, denoted by an asymmetric matrix $S \in \mathbb{R}^{|U| \times |U|}$, where $S_{j,k} = 1$ if user $u_j$ is related to user $u_k$, and otherwise $S_{j,k} = 0$. Given the user-item interaction matrix $R$ and the social relationship matrix $S$, the social recommendation task aims to predict whether user $u$ has a potential interest in an item $i$ he has never interacted with. Finally, the top-N items are recommended to the user based on the predicted preference scores in descending order.

In a general graph, each edge can only connect two nodes, so only pairwise relationships can be reflected. However, many real-world relationships are not limited to two nodes, such as the stable triangular relationships in social networks and the joint purchase relationships in e-commerce networks. In a hypergraph, this limitation is removed: an edge evolves into a hyperedge, and a hyperedge can connect multiple nodes at the same time. A hypergraph can be denoted as $G = (V, E)$, where $V$ is the set of nodes and $E$ is the set of hyperedges. The matrix $A \in \{0, 1\}^{|V| \times |E|}$ records the incidence relationships between nodes and hyperedges, where each element indicates whether hyperedge $e$ contains node $v$:

$$a(v, e) = \begin{cases} 1, & \text{if } v \in e \\ 0, & \text{if } v \notin e \end{cases} \qquad (1)$$

The degree of a node in the hypergraph is the number of hyperedges that contain it, i.e., $d(v) = \sum_{e \in E} a(v, e)$. Since a hyperedge is no longer limited to connecting only two nodes, its degree is defined as the number of nodes it connects, i.e., $d(e) = \sum_{v \in V} a(v, e)$. The degrees of hyperedges and nodes in the hypergraph $G$ are recorded in the diagonal matrices $D_e \in \mathbb{R}^{|E| \times |E|}$ and $D_v \in \mathbb{R}^{|V| \times |V|}$ formed by $d(e)$ and $d(v)$, respectively.
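As a concrete illustration of these definitions, the sketch below (our own NumPy example under assumed toy data, not part of the original paper) builds an incidence matrix and derives the degree quantities d(v), d(e), D_v, and D_e used by the hypergraph convolution later on.

```python
import numpy as np

# Toy hypergraph: 4 nodes, 2 hyperedges.
# a(v, e) = 1 iff node v belongs to hyperedge e, as in Eq. (1).
A = np.array([
    [1, 0],   # v0 in e0
    [1, 1],   # v1 in e0 and e1
    [0, 1],   # v2 in e1
    [1, 1],   # v3 in e0 and e1
])

# Node degrees d(v) = sum_e a(v, e); hyperedge degrees d(e) = sum_v a(v, e).
d_v = A.sum(axis=1)
d_e = A.sum(axis=0)

# Diagonal degree matrices D_v and D_e.
D_v = np.diag(d_v)
D_e = np.diag(d_e)

print(d_v)  # [1 2 1 2]
print(d_e)  # [3 3]
```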

3.2 Overall Framework

The overall framework of Motif-SocialRec is illustrated in Fig. 1, which consists of three components:
• Motif information extraction module. This module fuses the social network information and the user-item interaction information to construct a new heterogeneous network, and then extracts the high-level local structures of the heterogeneous network under different motifs.
• Representation generation module. In this module, the information of each motif is encoded by a hypergraph convolution network in order to learn user representations. Item representations are then acquired by leveraging user-item interactions.
• Preference prediction module. To enhance the learned representations, an SSL task is performed in this module by establishing contrastive views across the micro-, meso-, and macro-scales. Model predictions are then produced, jointly driven by the recommendation task and the SSL task.

Fig. 1. Overall framework of Motif-SocialRec.

3.3 Motif Information Extraction

To preserve specific social connections between users and their corresponding interaction behaviors, three types of motifs, namely social motifs, behavior motifs, and joint motifs, are extracted from the fused heterogeneous network to model intricate relationships among users and their behaviors. A social motif organizes a group of interconnected users, in which explicit social relationships are reflected. A behavior motif models interaction patterns without the constraint of social relationships, in which users are correlated by a common item they both interact with. A joint motif describes a reinforced local connection mode obtained by incorporating social information and interaction information, where some potential relationships can be produced. These three types of motifs reflect the aggregation of user interests under social relationship constraints and thereby uncover their behavior preferences.

Fig. 2. Motifs extracted from the fused network.

Specifically, we construct eight fundamental motifs in total that cover the above three types, as shown in Fig. 2. These motifs can be divided into two groups: 3-node motifs (M1, M4, and M6) and 4-node motifs (M2, M3, M5, M7, and M8). In Fig. 2, edges with different colors represent social relationships and interaction behaviors, respectively. In social motifs and joint motifs, the pink edge between two users u_j and u_k denotes three specific directed edges, u_j → u_k, u_j ← u_k, and u_j ↔ u_k, corresponding to different social relationships between users. In behavior motifs and joint motifs, the yellow edge u → i represents the interaction behavior between user u and item i.

For 3-node motifs, three representative local structures can be extracted, illustrated as M1, M4, and M6 in Fig. 2. These stable triangular relationships commonly exist in complex networks and can be viewed as fundamental elements constituting various structures. Compared with 3-node motifs, the construction of 4-node motifs is more complicated:
• To model cyclic local structures composed of four nodes, we extend the 3-node motif M1 by introducing another node and edge to capture more semantic relationships for the object user within its neighborhood, and design a butterfly-shaped motif (M2) and a star-shaped motif (M3), respectively.
• For behavior motifs, the 4-node motif (M5) can be established by augmenting the 3-node motif (M4) with another user who interacts with the same item.


• 4-node joint motifs can be built from the corresponding social motifs by incorporating interaction relationships. Specifically, we replace one arbitrary node in M2 or M3 whose degree equals or exceeds 2 with an item node, and substitute the behavior edge u → i for the previous social edge linked with the removed user. We thereby obtain another butterfly-shaped motif M7 and a star-shaped motif M8.

By combining the above 3-node motifs and 4-node motifs, we can model any complex connection patterns and local structures in the network. Following the principle proposed in [17], we induce several adjacency matrices to reorganize the network based on these fundamental motifs. The elements of these matrices are calculated from the co-occurrence of two nodes $i$ and $j$ in a certain motif:

$$[A_{M_m}]_{i,j} = \begin{cases} 1, & \text{if nodes } i \text{ and } j \text{ co-occur in the motif } M_m \\ 0, & \text{otherwise} \end{cases} \qquad (2)$$

where $A_{M_m}$ denotes the adjacency matrix defined under the motif $M_m$, and $M_m$ $(m = 1, \ldots, 8)$ represents one of the eight motifs illustrated in Fig. 2. Further, these matrices are integrated from three views, i.e., the adjacency matrix conditioned on social motifs $A_S = \sum_{m=1}^{3} A_{M_m}$, on behavior motifs $A_B = \sum_{m=4}^{5} A_{M_m}$, and on joint motifs $A_J = \sum_{m=6}^{8} A_{M_m}$, respectively. A small sketch of this construction is given below.
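The following is an illustrative (and partly hypothetical) NumPy implementation of Eq. (2) and the view-level aggregation; it assumes motif instances have already been enumerated, a step the paper does not detail.

```python
import numpy as np

def motif_adjacency(motif_instances, num_nodes):
    """Eq. (2): entry (i, j) is 1 if nodes i and j co-occur in at least
    one instance of the given motif."""
    A = np.zeros((num_nodes, num_nodes), dtype=np.float32)
    for nodes in motif_instances:          # each instance is a tuple of node ids
        for i in nodes:
            for j in nodes:
                if i != j:
                    A[i, j] = 1.0
    return A

# Hypothetical pre-extracted instances for each of the eight motifs M1..M8.
n = 6
instances = {m: [] for m in range(1, 9)}
instances[1] = [(0, 1, 2)]                 # a social triangle (M1)
instances[4] = [(1, 3, 5)]                 # a behavior motif around an item node (M4)
A_m = {m: motif_adjacency(inst, n) for m, inst in instances.items()}

# View-level matrices: social (M1-M3), behavior (M4-M5), joint (M6-M8).
A_S = sum(A_m[m] for m in (1, 2, 3))
A_B = sum(A_m[m] for m in (4, 5))
A_J = sum(A_m[m] for m in (6, 7, 8))
```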

3.4 Representation Generation

In the above design, the network is in fact transformed into a hypergraph conditioned on the corresponding type of motifs, and the connection relationships preserved in the hypergraph focus on a certain aspect of the inherent interaction patterns. To comprehensively capture these patterns, we construct a multi-channel model to learn user representations from the three views in parallel, in which motifs are employed as the carriers of message propagation to gather high-level interactive semantics. In each channel, the information of the corresponding motifs is encoded by the hypergraph convolutional neural network (HGNN) proposed in [18]:

$$H_k^{(l+1)} = \tilde{D}_k^{-1} A_k H_k^{(l)} \qquad (3)$$

where $H_k \in \{H_S, H_B, H_J\}$ denotes the learned representations under the corresponding type of motifs, and $\tilde{D}_k$ represents the degree matrix of $A_k \in \{A_S, A_B, A_J\}$. To overcome the over-smoothing problem that occurs in information aggregation, the outputs of all layers of the HGNN are averaged as the representation of each channel:

$$\bar{H}_k = \frac{1}{L+1} \sum_{l=0}^{L} H_k^{(l)} \qquad (4)$$

where $L$ is the number of layers of the HGNN and $\bar{H}_k$ denotes the output of the HGNN in channel $k$. Then, an attention mechanism is utilized to adaptively aggregate the motif-based representations learned from the different views, in order to obtain comprehensive user representations. For each user $u$, attention coefficients $(\alpha_S, \alpha_B, \alpha_J)$ are generated to describe the weights of the user representation under the three motif views, by the following attention function:

$$\alpha_k = f_{att}(\bar{h}_k) = \frac{\exp\!\big(\gamma^{T} \cdot W_{att} \bar{h}_k\big)}{\sum_{k' \in \{S, B, J\}} \exp\!\big(\gamma^{T} \cdot W_{att} \bar{h}_{k'}\big)} \qquad (5)$$

where $\gamma \in \mathbb{R}^{d}$ and $W_{att} \in \mathbb{R}^{d \times d}$ are learnable parameters of the attention function. Finally, the comprehensive user representation conditioned on motifs is calculated as:

$$h^{*} = \sum_{k \in \{S, B, J\}} \alpha_k \bar{h}_k \qquad (6)$$

On this basis, the final item representation is produced as follows:

$$E^{(l+1)} = D^{-1} R^{T} H^{*(l)} \qquad (7)$$

where $R$ is the interaction matrix between users and items, and $D$ is the degree matrix of $R$.
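To make the multi-channel computation concrete, here is a minimal PyTorch sketch of Eqs. (3)-(7). It is our own illustration under assumed shapes and random inputs, not the authors' code; in particular, the weight matrices and the way the three channels share inputs are stand-ins.

```python
import torch
import torch.nn.functional as F

def hgnn_channel(A, H0, num_layers=2):
    """Eqs. (3)-(4): parameter-free propagation H^{(l+1)} = D^{-1} A H^{(l)},
    averaged over layers 0..L to obtain \bar{H}_k."""
    deg = A.sum(dim=1).clamp(min=1.0)
    outputs, H = [H0], H0
    for _ in range(num_layers):
        H = (A @ H) / deg.unsqueeze(1)
        outputs.append(H)
    return torch.stack(outputs, dim=0).mean(dim=0)

def attention_fuse(H_views, gamma, W_att):
    """Eqs. (5)-(6): per-user softmax over the three channel representations."""
    scores = torch.einsum('d,knd->kn', gamma, H_views @ W_att.T)   # gamma^T (W h)
    alpha = F.softmax(scores, dim=0).unsqueeze(-1)                 # (3, n_users, 1)
    return (alpha * H_views).sum(dim=0)                            # H*

def item_representation(R, H_star):
    """Eq. (7): items aggregate the users that interacted with them."""
    deg_i = R.sum(dim=0).clamp(min=1.0)
    return (R.t() @ H_star) / deg_i.unsqueeze(1)

# Hypothetical shapes: 5 users, 4 items, embedding size 8.
n_u, n_i, d = 5, 4, 8
A_S = torch.rand(n_u, n_u).round()
H0 = torch.randn(n_u, d)
R = torch.rand(n_u, n_i).round()
H_bar = hgnn_channel(A_S, H0)
H_star = attention_fuse(torch.stack([H_bar, H_bar, H_bar]),
                        torch.randn(d), torch.randn(d, d))
E = item_representation(R, H_star)
```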

3.5 Preference Prediction

In the preference prediction module, we devise a joint learning model in which producing recommendation predictions based on the learned representations is the primary task, and a multi-scale contrastive mechanism serves as the auxiliary task to boost recommendation performance. For the primary task, the BPR loss function is employed to optimize the model:

$$L_P = \sum_{i \in I(u),\, o \notin I(u)} -\log \sigma\big(\hat{r}_{u,i} - \hat{r}_{u,o}\big) + \lambda \|\Theta\|_2^2 \qquad (8)$$

where $I(u)$ represents the set of items that user $u$ has interacted with, $\Theta$ denotes the set of trainable parameters of the model, and $\lambda$ is the regularization coefficient. During training, an optimization tuple is formed by three elements: a user $u$, a randomly sampled positive item $i \in I(u)$, and a negative item $o \notin I(u)$. $\hat{r}_{u,i}$ represents the recommendation score between $u$ and $i$, produced from their representations:

$$\hat{r}_{u,i} = h_u^{*T} e_i \qquad (9)$$
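A minimal PyTorch sketch of this pairwise objective (Eqs. (8)-(9)), written under assumed tensor shapes and batching; it is illustrative rather than the authors' implementation.

```python
import torch
import torch.nn.functional as F

def bpr_loss(h_user, e_items, pos, neg, params, reg=0.005):
    """Eqs. (8)-(9): BPR pairwise loss with L2 regularization.
    h_user: (B, d) user vectors h*_u; e_items: (n_items, d) item vectors;
    pos/neg: (B,) indices of sampled positive / negative items."""
    r_pos = (h_user * e_items[pos]).sum(dim=1)     # \hat r_{u,i} = h*_u^T e_i
    r_neg = (h_user * e_items[neg]).sum(dim=1)
    loss = -F.logsigmoid(r_pos - r_neg).sum()
    l2 = sum((p ** 2).sum() for p in params)
    return loss + reg * l2
```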

To preserve the inherent characteristics of the network distribution conditioned on different motifs, a multi-scale SSL model is constructed to enhance the user representations generated under each motif. We abstract user representations into three scales to perform contrastive learning tasks, i.e., the micro-scale, meso-scale, and macro-scale. On the micro-scale, the representation of each user node learned by the HGNN under each type of motif is of interest. On the meso-scale, representations are gathered within the first-order connected motifs for each user node. On the macro-scale, the graph-level representation conditioned on the corresponding motifs is obtained by applying a global readout function.


To make the SSL task more effective, we first enhance the non-linear approximation ability and stability of the model with the self-gating activation function in Eq. (10):

$$\hat{H}_k = g_k(\bar{H}_k) = \bar{H}_k \odot \sigma\big(\bar{H}_k W_k^{g} + b_k\big) \qquad (10)$$

where $W_k^{g} \in \mathbb{R}^{d \times d}$ and $b_k \in \mathbb{R}^{d}$ are learnable parameters, and $\sigma(\cdot)$ is the sigmoid activation function. Then, we establish a two-level contrastive task for each type of motif. The first level contrasts the micro-scale with the meso-scale, and the second level contrasts the meso-scale with the macro-scale. Therefore, we define the SSL loss function as:

$$L_A = -\sum_{k} \sum_{u \in U} \Big\{ \log \sigma\big[f_d(\hat{h}_k, z_k) - f_d(\hat{h}_k, \tilde{z}_k)\big] + \log \sigma\big[f_d(z_k, y_k) - f_d(\tilde{z}_k, y_k)\big] \Big\} \qquad (11)$$

where $f_d(\cdot)$ is the discriminative function that measures the consistency between two representations, and the user representation on the meso-scale is obtained with the readout function:

$$z_k = \mathrm{READOUT}\big(\hat{H}_k, a_k\big) = \frac{a_k \hat{H}_k}{\mathrm{sum}(a_k)} \qquad (12)$$

In Eq. (12), $a_k$ represents the row vector of the matrix $A_k$ corresponding to the target user, and $\mathrm{sum}(a_k)$ denotes the total number of connections in the meso-scale network. $\tilde{z}_k$ represents the negative sample created by performing a shuffle disturbance on $z_k$. Finally, $y_k$ is the user representation on the macro-scale, obtained with the average-pooling function in Eq. (13):

$$y_k = \mathrm{AveragePooling}\big(\hat{H}_k\big) \qquad (13)$$

3.6 Optimization Model

We integrate the primary task and the auxiliary task to optimize the model:

$$L = L_P + \beta L_A \qquad (14)$$

where β is the adjustment coefficient of the SSL loss.
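The sketch below illustrates the self-gating and the multi-scale readouts of Eqs. (10)-(13) together with the joint objective of Eq. (14). It is a simplified, assumption-laden PyTorch example (random toy tensors, no discriminator network) rather than the authors' implementation.

```python
import torch

def self_gating(H_bar, W_g, b):
    """Eq. (10): \hat H_k = \bar H_k ⊙ σ(\bar H_k W_g + b)."""
    return H_bar * torch.sigmoid(H_bar @ W_g + b)

def meso_readout(H_hat, a):
    """Eq. (12): z_k = (a_k \hat H_k) / sum(a_k) for one target user,
    where a is that user's row of the motif adjacency matrix A_k."""
    return (a @ H_hat) / a.sum().clamp(min=1.0)

def macro_readout(H_hat):
    """Eq. (13): graph-level representation via average pooling."""
    return H_hat.mean(dim=0)

# Hypothetical shapes: 5 users, embedding dimension 8.
H_bar = torch.randn(5, 8)
W_g, b = torch.randn(8, 8), torch.zeros(8)
H_hat = self_gating(H_bar, W_g, b)
z = meso_readout(H_hat, torch.tensor([0., 1., 1., 0., 1.]))
y = macro_readout(H_hat)

# Joint objective of Eq. (14): L = L_P + beta * L_A (losses computed elsewhere).
beta = 0.2
# total_loss = recommendation_loss + beta * ssl_loss
```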

4 Experiments and Results

To answer the following two research questions, we conduct a series of experiments on four real-world datasets:
• RQ1: How does the proposed Motif-SocialRec model perform compared with SOTA recommendation models?
• RQ2: How do the hyperparameters of Motif-SocialRec affect its recommendation performance?

4.1 Experimental Settings

Datasets. In the experiments, we select four public datasets, i.e., Douban, FilmTrust, LastFM, and Yelp [8,12], which consist of user ratings for items and directed social connections. Their statistics are listed in Table 1.

Table 1. Statistics of the datasets used in experiments.

Dataset     #Users   #Items   #Ratings   #Relation   Density
Douban      2848     39,586   894,887    35,770      0.79%
LastFM      1892     17,632   92,834     25,434      0.28%
FilmTrust   1508     2071     35,500     1854        1.14%
Yelp        19,539   21,266   450,884    864,157     0.11%

Baselines. In the experiments, we compare Motif-SocialRec with several SOTA recommendation models, which can be divided into three categories.

• The first category consists of two recommendation models based on matrix factorization (MF): BPR [19] and SBPR [20].
• The second category includes three GNN-based recommendation models: NGCF [3], DiffNet [4], and LightGCN [5].
• The third category consists of two recommendation models employing SSL auxiliary tasks: S²-MHCN [15] and SEPT [16].

Metrics. To evaluate the top-N recommendation results of the different models, we select three commonly used metrics for recommendation systems [4]: Precision, Recall, and Normalized Discounted Cumulative Gain (NDCG).

Model Settings. For each model in the experiments, the optimal number of training iterations is selected by grid search within the range [10, 100]. The embedding dimension d is set to 50, and the batch size is set to 128. For the baselines, model parameters are set according to their authors' recommendations. In the proposed model, the number of network layers L is set to 2, and the regularization coefficient and the SSL loss coefficient are set to 0.005 and 0.2, respectively.

4.2 Recommendation Performance (RQ1)

In this section, we validate the superiority of Motif-SocialRec over the baselines through performance comparison experiments. The experimental results are shown in Table 2, where the best performance is highlighted in bold and the second-best performance is underlined. On the whole, Motif-SocialRec exhibits the best performance on the different datasets. By further analysing and comparing, we have the following findings:


Table 2. Performance results obtained on the complete datasets.

Dataset    Metric  BPR     SBPR    NGCF    DiffNet  LightGCN  S²-MHCN  SEPT    Motif-SocialRec  Improve
Douban     P@10    0.1628  0.1720  0.2056  0.1753   0.2203    0.2114   0.2326  0.2367           1.76%
           R@10    0.0380  0.0374  0.0428  0.0427   0.0510    0.0463   0.0570  0.0571           0.18%
           N@10    0.1763  0.1903  0.2044  0.1946   0.2439    0.2343   0.2602  0.2624           0.85%
FilmTrust  P@10    0.2663  0.2616  0.2615  0.2640   0.2666    0.2781   0.2848  0.2883           1.23%
           R@10    0.4936  0.4744  0.4840  0.4974   0.5094    0.5401   0.5499  0.5581           1.49%
           N@10    0.5101  0.5242  0.5267  0.5598   0.5713    0.6052   0.6045  0.6113           1.01%
LastFM     P@10    0.1095  0.1580  0.1654  0.1520   0.1644    0.1984   0.2012  0.2060           2.39%
           R@10    0.1121  0.1604  0.1679  0.1543   0.1669    0.2012   0.2042  0.2090           2.35%
           N@10    0.1372  0.1969  0.2018  0.1856   0.2039    0.2459   0.2471  0.2533           2.51%
Yelp       P@10    0.0140  0.0174  0.0213  0.0193   0.0183    0.0290   0.0285  0.0299           3.10%
           R@10    0.0357  0.0425  0.0546  0.0512   0.0497    0.0758   0.0725  0.0767           1.19%
           N@10    0.0252  0.0336  0.0416  0.0383   0.0358    0.0599   0.0574  0.0607           1.34%

• The GNN-based models generally outperform the MF-based models. Models that can incorporate social relationships, such as SBPR, DiffNet, S²-MHCN, SEPT, and Motif-SocialRec, achieve better performance than the other models.
• Models that enhance representation learning with an SSL auxiliary task, i.e., Motif-SocialRec, SEPT, and S²-MHCN, outperform the other models. It is worth noting that Motif-SocialRec obtains better recommendation results than SEPT and S²-MHCN, indicating that motifs can effectively describe high-level semantic patterns.

4.3 Parameter Analysis (RQ2)

In this section, we test the impact of hyperparameters on the performance of the top three models, i.e., Motif-SocialRec, SEPT, and S²-MHCN. Figure 3 illustrates the recommendation results of the three models under different numbers of layers L. Figure 4 presents the performance of Motif-SocialRec with different values of β evaluated by the three metrics.

Fig. 3. Results of three models under different numbers of layers.


Fig. 4. Impact of parameter β on performances of three models.

• The results in Fig. 3 indicate that Motif-SocialRec achieves optimal performance with two layers on the different datasets, and it outperforms the other two models in most situations. These results testify that the proposed model can produce satisfying recommendation results with just a simple network structure.
• From Fig. 4, it can be observed that the recommendation results of Motif-SocialRec in terms of the three metrics are sensitive to β on all four datasets. These results demonstrate that the inherent patterns seized by the SSL paradigm can, to some extent, reinforce the representations learned by the encoder.

5 Conclusion and Future Work

In this work, we propose a novel model, Motif-SocialRec, for the recommendation problem constrained by social relationships. In the model, a multi-channel hypergraph convolution network is constructed to learn appropriate representations by utilizing motifs as the units of interaction modelling and message propagation, which can effectively capture high-level semantics in the interaction network. With the help of the contrastive learning paradigm, we further enhance recommendation by exploring latent user preferences at multiple scales. Extensive experiments demonstrate that the proposed model can improve the accuracy of recommendation results.

Acknowledgements. The authors are very grateful to the anonymous reviewers and editors. Their helpful comments and constructive suggestions helped us to significantly improve this work. We also wish to thank the authors of the compared algorithms for sharing their codes. This work was supported by the National Natural Science Foundation of China (U21A20513, 62076154, 62022052), and the Key R&D Program of Shanxi Province (202202020101003).

References 1. Tang, J., Hu, X., Gao, H., et al.: Exploiting local and global social context for recommendation. In: Proceedings of International Joint Conference on Artificial Intelligence, pp. 2712–2718. World Scientific, Chiyoda City, Tokyo (2013)


2. Gao, C., Zheng, Y., Li, N., et al.: A survey of graph neural networks for recommender systems: challenges, methods, and directions. ACM Trans. Recommender Syst. 1(1), 1–51 (2023) 3. Wang, X., He, X., Wang, M., et al.: Neural graph collaborative filtering. In: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 165–174. Association for Computing Machinery, New York, NY, United States (2019) 4. Wu, L., Sun, P., Fu, Y., et al.: A neural influence diffusion model for social recommendation. In: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 235–244. Association for Computing Machinery, New York, NY, United States (2019) 5. He, X., Deng, K., Wang, X., et al.: LightGCN: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 639–648. Association for Computing Machinery, New York, NY, United States (2020) 6. Liu, C., Li, Y., Lin, H., et al.: GNNRec: gated graph neural network for sessionbased social recommendation model. J. Intell. Inf. Syst. 60(1), 137–156 (2023) 7. Bai, T., Zhang, Y., Wu, B., et al.: Temporal graph neural networks for social recommendation. In:2020 IEEE International Conference on Big Data, pp. 898– 903. Institute of Electrical and Electronics Engineers, Piscataway, NJ (2020) 8. Wu, L., Li, J., Sun, P., et al.: Diffnet++: a neural influence and interest diffusion network for social recommendation. IEEE Trans. Knowl. Data Eng. 34(10), 4753– 4766 (2020) 9. Wei, C., Fan, Y., Zhang, J.: Time-aware service recommendation with socialpowered graph hierarchical attention network. IEEE Trans. Serv. Comput. 16(3), 2229–2240 (2022) 10. Song, W., Xiao, Z., Wang, Y., et al.: Session-based social recommendation via dynamic graph attention networks. In: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, pp. 555–563. Association for Computing Machinery, New York, NY, United States (2019) 11. Hoang, T.L., Pham, T.D., Ta, V.C.: Improving graph convolutional networks with transformer layer in social-based items recommendation. In: Proceedings of the 13th International Conference on Knowledge and Systems Engineering, pp. 1–6. Institute of Electrical and Electronics Engineers, Piscataway, NJ (2021) 12. Zhang, Y., Huang, J., Li, M., et al.: Contrastive graph learning for social recommendation. Front. Phys. 10, 35 (2022) 13. Wu, J., Fan, W., Chen, J., et al.: Disentangled contrastive learning for social recommendation. In: Proceedings of the 31st ACM International Conference on Information and Knowledge Management, pp. 4570–4574. Association for Computing Machinery New York, NY, United States (2022) 14. Milo, R., Shen-Orr, S., Itzkovitz, S., et al.: Network motifs: simple building blocks of complex networks. Science 298(5594), 824–827 (2002) 15. Yu, J., Yin, H., Li, J., et al.: Self-supervised multi-channel hypergraph convolutional network for social recommendation. In: Proceedings of the Web Conference, pp. 413–424. Association for Computing Machinery, New York, NY, United States (2021) 16. Yu, J., Yin, H., Gao, M., et al.: Socially-aware self-supervised tri-training for recommendation. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 2084–2092. Association for Computing Machinery, New York, NY, United States (2021)


17. Zhao, H., Xu, X., Song, Y., et al.: Ranking users in social networks with motifbased pagerank. IEEE Trans. Knowl. Data Eng. 33(5), 2179–2192 (2019) 18. Feng, Y., You, H., Zhang, Z., et al.: Hypergraph neural networks. In: Proceedings of the 33rd Association for the Advancement of Artificial Intelligence, vol. 33, pp. 3558–3565. (2019) 19. Rendle, S., Freudenthaler, C., Gantner, Z., et al.: BPR: Bayesian personalized ranking from Implicit Feedback. UAI, 452–461 (2012) 20. Zhao, T., McAuley, J., King, I.: Leveraging social connections to improve personalized ranking for collaborative filtering. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, pp. 261–270. Association for Computing Machinery, New York, NY, United States (2014)

Dual Channel Graph Neural Network Enhanced by External Affective Knowledge for Aspect Level Sentiment Analysis

Hu Jin1, Qifei Zhang1(B), Xiubo Liang1, Yulin Zhou2, and Wenjuan Li3

1 School of Software, Zhejiang University, Ningbo, China

{jinhu,cstzhangqf,liangxb}@zju.edu.cn

2 Ningbo Innovation Center, Zhejiang University, Ningbo, China

[email protected]

3 School of Engineering, Hangzhou Normal University, Hangzhou, China

[email protected]

Abstract. Aspect-level sentiment analysis is a prominent technology in natural language processing (NLP) that analyzes the sentiment polarity of target words in a text. Despite its long history of development, current methods still have some shortcomings. Mainly, they lack the integration of external affective knowledge, which is crucial for allocating attention to aspect-related words in syntactic and semantic information processing. Additionally, the synergy between syntactic and semantic information is often neglected, with most approaches focusing on only one dimension. To address these issues, we propose a knowledge-enhanced dual-channel graph neural network. Our model incorporates external affective knowledge into both the semantic and syntactic channels in different ways, then utilizes a dynamic attention mechanism to fuse information from these channels. We conducted experiments on the SemEval 2014, 2015, and 2016 datasets, and the results showed significant improvements compared to existing methods. Our approach bridges the gaps in current techniques and enhances performance in aspect-level sentiment analysis.

Keywords: Aspect Level Sentiment Analysis · Graph Neural Network · External Knowledge

1 Introduction

Aspect-level sentiment analysis is a fine-grained technique that analyzes the sentiment polarity of target words in a text. It provides more detailed and accurate sentiment information compared to traditional methods. This technology has applications in improving service quality, organizing feedback, and offering personalized recommendations.

Aspect-level sentiment analysis is still a hot topic in the field of NLP. In recent years, researchers have proposed various novel models and methods, gradually improving its accuracy. However, after conducting a survey of existing methods, we found that most of them make limited use of external affective knowledge and rarely utilize


both semantic and syntactic information simultaneously; they often overlook either the semantic dimension [1] or the syntactic dimension [2]. Additionally, few studies consider the correlation between the semantic and syntactic dimensions. Inspired by [3], a dual-channel model can effectively combine the information from both the semantic and syntactic channels, resulting in a synergistic fusion of complementary insights. Furthermore, it leverages the interplay between these channels to exploit the underlying relationships. Inspired by [4], external knowledge is an intuitive and effective source of information that can help reduce attention allocation errors. In the field of sentiment analysis, external affective knowledge can help the model pay more attention to target-related words and sentiment-related words. Therefore, we propose a new dual-channel graph neural network enhanced by external affective knowledge, called DCGK. The contributions of this paper can be summarized as follows:
• We utilize a dual-channel graph neural network model that includes both semantic and syntactic information and investigates their correlation through a dynamic attention mechanism. This approach allows for a more comprehensive analysis of information from different dimensions.
• To address the limitations of current semantic networks in considering target-related words, we integrate external affective knowledge into the semantic channel. This integration is achieved by embedding and attention mechanisms. The external knowledge provides sentiment information for each word and relations between words.
• To overcome the lack of commonsense sentiment information in current syntactic networks, we integrate external affective knowledge into the syntactic channel. This integration is achieved by using a sentiment dictionary and preprocessing the adjacency matrix. This approach enables the model to focus on strongly emotional words.
• We use an attention mechanism for dynamic fusion of the above two channels. This method not only complements the information from the semantic and syntactic channels but also uncovers the inherent relationship between the two channels.

We conducted multiple experiments on the SemEval 2014, 2015, and 2016 datasets [5–7], and the results show that our model achieves significant performance on most of these datasets.

2 Related Work

2.1 Early Machine Learning Methods

In the early stages, researchers primarily relied on hand-crafted features, such as word frequency [8], data analysis [9], and rule-based approaches [10]. The models employed included Naïve Bayes and support vector machines (SVM) [7, 11, 12]. However, the effectiveness of these methods heavily relied on feature engineering, which demanded substantial labor costs.


2.2 Neural Network Methods

Neural networks have significantly reduced labor costs by automating feature learning. Researchers often encode sentences into low-dimensional word vectors, such as word2vec [13] and GloVe [14], which are then fed into a neural network. The model then learns various implicit features automatically. Tang et al. [15] introduced TD-LSTM, which utilizes a combination of word embeddings and target embeddings to capture the contextual meaning of words in relation to the target. Tang et al. [16] developed MemNet, which incorporates a multi-hop attention mechanism to focus on the words in the context that are more relevant to the target word. Wang et al. proposed ATAE-LSTM, based on an attention mechanism, allowing it to handle different parts of sentences when different aspects are involved. Ma et al. [17] presented IAN, which performs interactive learning of target and context attention and generates the corresponding attention representations based on this interaction.

2.3 Pre-trained Models

In recent years, the rise of pre-trained language models has revolutionized various NLP tasks. In particular, models like BERT [18] have shown remarkable advancements in the field. Song et al. [19] applied BERT to encode both the target word and its corresponding context. Sun et al. [20] proposed a method that utilizes auxiliary sentences to transform aspect-level sentiment analysis tasks into sentence-pair classification tasks. Li et al. [21] adopted the BERT model and global labels to accomplish aspect-level sentiment analysis in an "end-to-end" manner.

2.4 Graph Neural Network

Sun et al. [22] introduced a parsing tree-based convolutional network, incorporating Graph Convolutional Networks (GCN) to simulate the syntactic dependency structure of sentences. Zhang et al. [2] utilized syntactic information and word dependencies to construct a graph structure, leveraging GCN to simulate this structure. Liang et al. [23] proposed an interactive GCN model that extracts emotional features of specific target words by iteratively capturing relationships between aspect words and different aspects in the context. Wang et al. [24] devised a Relational Graph Attention Network (R-GAT). By pruning and reshaping the parsing tree, they obtained a tree structure that prioritizes attention to the target word. A novel GAT model was proposed to encode dependencies and establish semantic relationships between target words and related words.

2.5 External Commonsense Knowledge

External commonsense knowledge is an intuitive and effective auxiliary tool in many NLP works [4].


SenticNet [25] is a knowledge base that provides semantic, emotional, and polar information related to language concepts. Each word in SenticNet represents a sentiment polarity ranging from -1 to 1. SenticNet has shown promising results in the field of sentiment representation learning [26, 27]. Ma et al. [28] improved the gating unit of LSTM to integrate external knowledge. SenticNet is embedded into the neural network as word embeddings, effectively extracting features at both the target word level and sentence level. Li et al. [29] proposed a novel graph structure model that incorporates relationship information extracted from external knowledge graphs. Liang et al. [1] enhanced the parsing tree by incorporating SenticNet’s affective knowledge, thus integrating emotional information in the process of generating the graph structure. AffectiveSpace [30] is a mapping of SenticNet onto low-dimensional continuous vectors, creating a vector space model. By reducing the dimensionality of emotional commonsense knowledge, this model can summarize semantic features associated with concepts, enabling intuitive clustering of concepts based on their semantic and emotional relevance. This emotional intuition allows for the inference of emotions and polarity conveyed by multi-word expressions, thus achieving effective conceptual-level analysis.

3 Methodology

3.1 Problem Definition

Given a sentence $S = \{w_1, \ldots, w_i, \ldots, w_n\}$ and a target word $T = \{w_i, w_{i+1}, \ldots, w_{i+m}\}$ within this sentence, the goal is to determine the associated sentiment polarity $P \in \{\text{Positive}, \text{Neutral}, \text{Negative}\}$.

3.2 Model Overview

Our proposed model, as depicted in Fig. 1, consists of two channels and five modules: the semantic channel, the syntactic channel, the encoding input module, the knowledge-enhanced semantic module, the knowledge-enhanced syntactic module, the attention fusion module, and the classifier module.

The semantic channel and the syntactic channel serve as the two fundamental components of our model. The encoding input module processes the input text and prepares it for further analysis. The knowledge-enhanced semantic module and the knowledge-enhanced syntactic module integrate external affective knowledge into the semantic and syntactic channels, thereby enhancing the model's understanding of target-related words and sentiment-related words. To leverage the combined benefits of both syntactic and semantic information, the attention fusion module employs a dynamic attention mechanism to merge the information from both channels. This fusion also enables the model to capture the synergistic effects of these two dimensions. Finally, the classifier module utilizes the fused information to classify the sentiment polarity of aspect words.


Fig. 1. Model Overview.

3.3 Input Module

The input module serves as the entry point of our model. We leverage BERT [18] as the upstream network, owing to its impressive encoding ability. We arrange the input in the following format:

[CLS] + sentence + [SEP] + aspect + [SEP]   (1)

where [CLS] and [SEP] are special tokens: [CLS] represents the classification result of the "next sentence prediction" BERT pre-training task, while [SEP] separates the first and the second sentence. For example, given the input sentence "The pancakes were delicious, but the service was terrible" and the target word "service", the input is formatted as "[CLS] The pancakes were delicious, but the service was terrible [SEP] service [SEP]". This sequence is then fed into BERT, and a vectorized result is obtained:

$$O = \mathrm{BERT}(W) \qquad (2)$$

After obtaining the initial encoding result $O$, we apply a masking algorithm to extract the part between [CLS] and the first [SEP]:

$$H^C = \mathrm{MASK}(O) \qquad (3)$$

$H^C$ represents the encoding result of the original sentence, with special consideration given to the aspect target word placed after the first [SEP]. By focusing on the target aspect word, BERT can more effectively capture the information related to the specific aspect being analyzed. We then use $H^C$ as the input to the downstream semantic and syntactic channels.
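A minimal sketch of this input pipeline, assuming the HuggingFace transformers implementation of BERT (an assumption on our part; the paper does not name a specific library):

```python
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
bert = BertModel.from_pretrained("bert-base-uncased")

sentence = "The pancakes were delicious, but the service was terrible"
aspect = "service"

# Passing a sentence pair yields "[CLS] sentence [SEP] aspect [SEP]", as in Eq. (1).
enc = tokenizer(sentence, aspect, return_tensors="pt")
out = bert(**enc)                        # O = BERT(W), Eq. (2)
O = out.last_hidden_state                # (1, seq_len, 768)
pooler = out.pooler_output               # later used as h_pooler

# Eq. (3): keep only the first segment (the sentence tokens), zeroing the rest.
segment_mask = (enc["token_type_ids"] == 0).unsqueeze(-1)
H_C = O * segment_mask                   # H^C, shared input of both channels
```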


3.4 Knowledge-Enhanced Semantic Channel

Knowledge Embedding: Previous studies [31, 32] have shown that the self-attention mechanism of BERT often focuses only on local context information, while in actual sentiment analysis tasks the opinion words corresponding to target words may appear in distant contexts. To overcome this limitation, we propose adding knowledge graph information to the model by utilizing the conceptual relationship information provided by external knowledge. This approach enables the model to consider relevant opinion words in the global context when calculating attention scores.

We use AffectiveSpace [30] to obtain the embedding of each word. AffectiveSpace is a 100-dimensional vector space that includes 200,000 concepts, forming a vast affective knowledge graph. Principal component analysis (PCA) is applied to the matrix representation of AffectNet [33], a semantic network that links commonsense concepts to semantic and emotional features (Table 1). The result of this process is AffectiveSpace.

Table 1. AffectNet Samples

AffectNet   IsA-pet   KindOf-food   Arises-joy   ...
dog         0.981     0.000         0.789        ...
cupcake     0.000     0.922         0.910        ...
songbird    0.672     0.000         0.862        ...

For the result of the encoding input module, $H^C \in \mathbb{R}^{n \times d_h}$, we use AffectiveSpace to encode it again and obtain the AffectiveSpace encoding result:

$$H^A = \mathrm{AffectiveSpace}\big(H^C\big) \qquad (4)$$

where $H^A \in \mathbb{R}^{n \times 100}$. Then $H^A$ and $H^C$ are concatenated to obtain the vectorized representation of the semantic channel input:

$$H^{sem} = \big[H^A ; H^C\big] \qquad (5)$$

where ";" represents the concatenation operation and $H^{sem} \in \mathbb{R}^{n \times (d_h + 100)}$. For words that have no corresponding embedding in AffectiveSpace, we assign a zero vector $\mathbf{0} \in \mathbb{R}^{100}$. Because we concatenate the AffectiveSpace encoding result with the original word embeddings to form the new vector used for attention calculation, an all-zero AffectiveSpace encoding simply means that the attention contribution from AffectiveSpace is disregarded for that word.

Graph Attention Network: The Graph Attention Network (GAT) is a deep learning model designed to update the information of each node in a graph by aggregating the information of its adjacent nodes. This is achieved through a layer-by-layer process,


where each layer updates the node information based on the attention matrix. By stacking multiple GAT layers, information can be propagated to distant nodes in the graph, allowing for more comprehensive and accurate modeling of complex relationships within the graph.

In the GAT, each node in the graph corresponds to a token in the sentence. The input of the GAT is a set of vectors, denoted $O_{GAT} = \{g_1, g_2, \ldots, g_n\}$, which is consistent with the semantic channel input $H^{sem} = \{h_1, h_2, \ldots, h_n\}$. The set of neighbor nodes of a given node $i$ is denoted $N(i)$. The operation of a GAT layer can be expressed as follows:

$$g_i^{l} = \big\Vert_{k=1}^{K} \sum_{j \in N(i)} \alpha_{i,j}^{l,k} w_k^{l} g_j^{l-1} \qquad (6)$$

where $\Vert$ represents the concatenation operation, $K$ is the number of attention heads, $w_k^{l} \in \mathbb{R}^{d_{head} \times d_h}$ represents the $k$-th trainable matrix in the $l$-th layer, $g_j^{l-1} \in \mathbb{R}^{d_h}$ is the vectorized representation of the $j$-th node in the $(l-1)$-th layer, and $\alpha_{i,j}^{l,k} \in \mathbb{R}^{n \times n}$ is the attention matrix obtained as follows:

$$\alpha_{i,j}^{l,k} = \frac{\exp\!\big(s(g_i^{l-1}, g_j^{l-1})\big)}{\sum_{q \in N(i)} \exp\!\big(s(g_i^{l-1}, g_q^{l-1})\big)} \qquad (7)$$

where $s(\cdot)$ is the dot-product function, an effective function for calculating attention scores, computed as:

$$s\big(g_i^{l-1}, g_j^{l-1}\big) = \frac{\big(W_Q^{l,k} g_i^{l-1}\big)^{\top} \big(W_K^{l,k} g_j^{l-1}\big)}{\sqrt{d_{head}}} \qquad (8)$$

where $W_Q$ and $W_K \in \mathbb{R}^{d_k \times d}$ are trainable parameter matrices used to project the node vectors into a lower-dimensional space before computing the dot product.
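The sketch below is one possible PyTorch realization of the dot-product GAT layer of Eqs. (6)-(8). Layer sizes, the self-loops expected in the adjacency mask, and the sharing of projections across heads are our assumptions, not details given in the paper.

```python
import torch
import torch.nn.functional as F

class DotProductGAT(torch.nn.Module):
    """Illustrative multi-head GAT layer: dot-product attention over the
    neighbourhood N(i), with concatenation over K heads (Eqs. (6)-(8))."""
    def __init__(self, d_in, num_heads):
        super().__init__()
        assert d_in % num_heads == 0
        self.K, self.d_head = num_heads, d_in // num_heads
        self.W_q = torch.nn.Linear(d_in, d_in, bias=False)   # W_Q (all heads)
        self.W_k = torch.nn.Linear(d_in, d_in, bias=False)   # W_K
        self.W_v = torch.nn.Linear(d_in, d_in, bias=False)   # w_k^l

    def forward(self, g, adj):
        # g: (n, d_in) node states g^{l-1}; adj: (n, n), 1 where j ∈ N(i).
        # adj should include self-loops so no node has an empty neighbourhood.
        n = g.size(0)
        q = self.W_q(g).view(n, self.K, self.d_head)
        k = self.W_k(g).view(n, self.K, self.d_head)
        v = self.W_v(g).view(n, self.K, self.d_head)
        # Eq. (8): scaled dot-product score per head -> (K, n, n).
        scores = torch.einsum('ikd,jkd->kij', q, k) / self.d_head ** 0.5
        scores = scores.masked_fill(adj.unsqueeze(0) == 0, float('-inf'))
        alpha = F.softmax(scores, dim=-1)                    # Eq. (7)
        out = torch.einsum('kij,jkd->ikd', alpha, v)         # Eq. (6), per head
        return out.reshape(n, -1)                            # concatenate heads

layer = DotProductGAT(d_in=868, num_heads=4)                 # 768 + 100 = dim of H^sem
```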

3.5 Knowledge-Enhanced Syntactic Channel

Referring to [35], the understanding of syntactic structure in the last layer of the BERT output is poor, so we design an additional syntactic channel as a supplement.

Dependency Tree: Following [2, 22], we utilize the parsed syntax tree of the sentence $S = \{w_1, w_2, \ldots, w_n\}$ to construct a dependency graph structure $G = (V, E)$, where $V$ is the vertex set and $E$ is the edge set. Each token in sentence $S$ is considered a vertex, and the dependency relationship between nodes is represented as an edge, thus constructing a basic dependency graph. This graph structure serves as the basic reference for the computation of the graph convolution network. The dependency relationships between nodes are generated by Spacy (https://spacy.io/), an NLP library based on Python.


Fig. 2. Dependency Tree.

Figure 2 shows the parsing tree visualization of the sentence "The pancakes were delicious, but the service was terrible", where the arrows represent dependencies. The graph structure derived from the dependency tree is represented by an adjacency matrix. Each value in the adjacency matrix $D \in \mathbb{R}^{n \times n}$ satisfies the following relationship:

$$D_{i,j} = \begin{cases} 1, & \text{if } w_j \text{ depends on } w_i \\ 1, & \text{if } i = j \\ 0, & \text{otherwise} \end{cases} \qquad (9)$$

Here we add self-link edges, corresponding to the case $i = j$, to ensure that each token retains its own information during the forward computation. The dependency tree generated by Spacy is actually directed, and unlike the approach in [17], we retain its directed form.

External Affective Knowledge: To enhance the impact of emotional words related to the target, we integrate sentiment dictionary information from SenticNet into the adjacency matrix of the GCN. SenticNet [25] assigns each word a sentiment score ranging from −1 to 1: a score close to −1 indicates negative sentiment, a score close to 1 indicates positive sentiment, and a score of 0 indicates neutral sentiment. Table 2 presents some of the words in SenticNet along with their corresponding sentiment scores.

Table 2. SenticNet Samples

Word      SenticNet Score
good      0.849
bad       −0.800
smile     0.997
cry       −0.670
healthy   0.769

We assign the sentiment relationship between two dependent words $w_i$ and $w_j$ in the sentence based on SenticNet as follows:

$$T_{i,j} = \begin{cases} 1, & \text{if } w_j \text{ depends on } w_i \text{ and } w_i \in \text{aspect} \\ 0, & \text{otherwise} \end{cases} \qquad (10)$$


The final adjacency matrix can be expressed as $A = D + T$, where $+$ denotes element-wise addition of the corresponding matrices.

Graph Convolution Network: We use the output $H^C$ of the input module as the input of the first layer of the graph convolution network, and represent the graph structure with the adjacency matrix $A$ described above. The input of each GCN layer is the output of the previous layer. In the GCN, the convolution operation updates the information of each node based on the information of its neighboring nodes; by stacking $L$ layers, each node can aggregate information from nodes up to $L - 1$ hops away. The update of each node can be described as follows:

$$h_i^{l} = \sigma\Big(\sum_{j} A_{i,j} W^{l} h_j^{l-1} + b^{l}\Big) \qquad (11)$$

where $h_i^{l} \in \mathbb{R}^{d_h}$ is the output of the $i$-th node in the $l$-th layer, $A_{i,j}$ is the corresponding entry of the adjacency matrix $A$, and $W^{l} \in \mathbb{R}^{d_h \times d_h}$ is the trainable parameter matrix shared in the $l$-th layer, whose function is similar to a convolution kernel in a CNN. $b^{l} \in \mathbb{R}^{d_h}$ is the bias, and $\sigma(\cdot)$ denotes the activation function.
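To make the construction concrete, here is a small sketch (our own, with an assumed aspect position and ReLU as the activation) that builds the directed dependency adjacency of Eq. (9), the aspect-aware term of Eq. (10), and one GCN update of Eq. (11) with spaCy and NumPy. The paper additionally preprocesses this matrix with SenticNet scores; the lookup itself is omitted here.

```python
import numpy as np
import spacy

nlp = spacy.load("en_core_web_sm")

def dependency_adjacency(sentence, aspect_indices):
    """Directed adjacency D of Eq. (9) plus the aspect-aware term T of Eq. (10)."""
    doc = nlp(sentence)
    n = len(doc)
    D = np.eye(n, dtype=np.float32)            # self-links (i == j)
    T = np.zeros((n, n), dtype=np.float32)
    for tok in doc:
        if tok.head.i != tok.i:                # tok depends on its head
            D[tok.head.i, tok.i] = 1.0
            if tok.head.i in aspect_indices:
                T[tok.head.i, tok.i] = 1.0     # Eq. (10)
    return D + T                               # A = D + T

def gcn_layer(A, H, W, b):
    """Eq. (11): h_i^l = sigma(sum_j A_ij W h_j^{l-1} + b), ReLU assumed."""
    return np.maximum(A @ H @ W + b, 0.0)

A = dependency_adjacency(
    "The pancakes were delicious, but the service was terrible",
    aspect_indices={7})                        # assumed token position of "service"
H0 = np.random.randn(A.shape[0], 768)          # stand-in for H^C
h1 = gcn_layer(A, H0, np.random.randn(768, 768) * 0.01, np.zeros(768))
```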

3.6 Handling the Conflict of Different Word Segmentation Methods The word segmentation results obtained from Spacy differ from the WordPiece strategy used in BERT word segmentation. For instance, for the same input “Dieters don’t stick to salads or indulge in vegetarian platters”, the word segmentation results obtained from Spacy and BERT WordPiece segmentation are different, as shown in Table 3. Table 3. Different segmentation results of Spacy and WordPiece strategy

            Results
Spacy       ['dieters', 'do', "n't", 'stick', 'to', 'salads', 'or', 'indulge', 'in', 'vegetarian', 'platters']
WordPiece   ['dieter', '##s', 'don', "'", 't', 'stick', 'to', 'salad', '##s', 'or', 'ind', '##ul', '##ge', 'in', 'vegetarian', 'platt', '##ers']

As a result of the different word segmentation strategies, the output H^C of the input module cannot be directly matched with the words in the parsing tree used by the two channels. Therefore, it is necessary to map the two segmentation results onto each other. We utilize pytokenizations (https://github.com/explosion/tokenizations) to perform the mapping. After processing, we obtain the mapping between the two segmentations; Fig. 3 presents a visual diagram of this mapping. The vector representation of each Spacy token is obtained simply by averaging the WordPiece vectors mapped to it.


Fig. 3. Segment Results Mapping
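A small sketch of this sub-word pooling step, assuming the `get_alignments` helper of the tokenizations package (an assumption about its API on our part):

```python
import numpy as np
import tokenizations   # the pytokenizations package referenced above

def pool_wordpiece_to_spacy(spacy_tokens, wp_tokens, wp_vectors):
    """Average the WordPiece vectors that map onto each spaCy token."""
    spacy2wp, _ = tokenizations.get_alignments(spacy_tokens, wp_tokens)
    pooled = []
    for wp_ids in spacy2wp:
        if wp_ids:                              # aligned sub-words exist
            pooled.append(wp_vectors[wp_ids].mean(axis=0))
        else:                                   # no alignment: fall back to zeros
            pooled.append(np.zeros(wp_vectors.shape[1], dtype=wp_vectors.dtype))
    return np.stack(pooled)                     # (n_spacy_tokens, hidden_size)
```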

3.7 Channel Fusion

Referring to [3], one of the key advantages of fusing the results of the semantic and syntactic channels is the ability to capture a more comprehensive representation of the input text. After obtaining the outputs of the two channels, the fusion is carried out with an attention mechanism. Let $H^{syn} = \{h_1, h_2, \ldots, h_n\}$ be the output of the syntactic channel and $H^{sem} = \{h_1, h_2, \ldots, h_n\}$ the output of the semantic channel. We take the syntactic output $H^{syn}$ as the query matrix $Q$ and the semantic output $H^{sem}$ as the key matrix $K$. By multiplying $Q$ with the transpose of $K$, we obtain the attention score matrix:

$$\alpha_{i,j}^{k} = \frac{\exp\!\big(s(h_i, h_j)\big)}{\sum_{q=1}^{n} \exp\!\big(s(h_i, h_q)\big)} \qquad (12)$$

where $s(\cdot)$ denotes the dot-product attention function, computed as:

$$s(h_i, h_j) = \frac{\big(W_Q^{k} h_i\big)^{\top} \big(W_K^{k} h_j\big)}{\sqrt{d_{head}}} \qquad (13)$$

where $k$ is the attention head index, $head$ is the total number of attention heads, and $W_Q$ and $W_K$ are trainable parameters of the $k$-th head. In addition, $H^{syn}$ needs to be masked so that only the aspect-word part is used to query the $K$ matrix. The masking process is as follows:

$$\alpha_{i,j} = \begin{cases} 0, & \text{if } i < \tau \\ \alpha_{i,j}, & \text{if } \tau \le i \le \tau + k \\ 0, & \text{if } i > \tau + k \end{cases} \qquad (14)$$

$$h_i = \big\Vert_{k=1}^{K} \sum_{j} \alpha_{i,j}^{k} h_j \qquad (15)$$

$$h_{merge} = \frac{\sum_{i=0}^{n} h_i}{k + 1} \qquad (16)$$

where $h_{\tau}, \ldots, h_{\tau+k}$ is the aspect word, which may contain multiple tokens. We then have the masked result $H^{merge} = \{0, 0, \ldots, h_{\tau}, \ldots, h_{\tau+k}, \ldots, 0\}$, where $h_{merge} \in \mathbb{R}^{d_h}$. Given the impressive performance of BERT on next-sentence prediction tasks, we incorporate the pooler output of [CLS] as an additional component to complement


the sentiment judgement. Specifically, we employ a gated network to fuse $h_{pooler}$ and $h_{merge}$. The gating mechanism is implemented as follows:

$$\alpha = W\big[h_{pooler} ; h_{merge}\big] \qquad (17)$$

$$h_{out} = \alpha \times h_{pooler} + (1 - \alpha) \times h_{merge} \qquad (18)$$

where $W \in \mathbb{R}^{2d_h \times 1}$ is a trainable parameter.

3.8 Classification Model

This is a classification problem with three possible outcomes. To classify the final sentiment, we employ a fully connected network followed by a softmax activation function:

$$y = \mathrm{softmax}\big(W h_{out} + b\big) \qquad (19)$$

where $W \in \mathbb{R}^{3 \times d_h}$ and $b \in \mathbb{R}^{3}$.

3.9 Loss Function

As the objective function for optimization, we select the cross-entropy loss, which measures the dissimilarity between the predicted and the correct sentiment distributions. Additionally, we incorporate L2 regularization to mitigate overfitting. The loss function is structured as follows:

$$L = -\sum_{i=1}^{T} \sum_{j=1}^{C} y_i^{j} \log p_i^{j} + \lambda \|\Theta\|_2 \qquad (20)$$

In Eq. (20), $y_i$ represents the correct sentiment distribution, $p_i$ the predicted sentiment distribution, $T$ the training set, $C$ the number of sentiment classes, $\lambda$ the L2 regularization coefficient, and $\Theta$ the trainable parameters.
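The following sketch puts Eqs. (17)-(20) together in PyTorch. The sigmoid on the gate and the exact shapes are our assumptions, since the paper leaves them implicit; it is not the authors' code.

```python
import torch
import torch.nn.functional as F

d = 768
W_gate = torch.nn.Linear(2 * d, 1)       # W in Eq. (17); sigmoid keeps the gate in [0, 1]
W_cls = torch.nn.Linear(d, 3)            # W and b in Eq. (19)

def gated_fusion(h_pooler, h_merge):
    """Eqs. (17)-(18): scalar gate over the [CLS] pooler output and h_merge."""
    alpha = torch.sigmoid(W_gate(torch.cat([h_pooler, h_merge], dim=-1)))
    return alpha * h_pooler + (1 - alpha) * h_merge

def classify(h_out):
    """Eq. (19): three-way sentiment distribution."""
    return F.softmax(W_cls(h_out), dim=-1)

def objective(pred, target, params, lam=0.001):
    """Eq. (20): cross-entropy over the batch plus L2 regularization."""
    ce = -(target * torch.log(pred + 1e-12)).sum()
    return ce + lam * sum((p ** 2).sum() for p in params)
```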

4 Experiments

4.1 Data Sets

We conducted experiments and analysis on four publicly available datasets: the restaurant and laptop datasets from SemEval 2014 Task 4 [5], the restaurant dataset from SemEval 2015 Task 12 [6], and the restaurant dataset from SemEval 2016 Task 5 [7]. Each sample in these datasets consists of a review, several aspect words, and their corresponding sentiment polarities. For instance, consider the sentence: "The tech guy then said the service center does not do 1-to-1 exchange, and I have to direct my concern to the 'sales' team, which is the retail shop where I bought my netbook from." In this example there are three aspect words, "service center", "sales team", and "tech guy", with the sentiment polarities "negative", "negative", and "neutral", respectively. The distribution of the datasets is shown in Table 4.

Table 4. Distribution of Datasets

            Positive          Neutral           Negative
Datasets    Train    Test     Train    Test     Train    Test
Rest14      2164     728      637      196      807      196
Lap14       994      341      464      169      870      128
Rest15      1178     439      150      35       382      328
Rest16      1620     597      88       38       709      190

4.2 Implementation Details

We utilize the basic BERT model, specifically Bert-Base-Uncased, as the initial word embedding input. The dimension of the word embeddings is 768, and the BERT model has 12 attention heads. For the semantic channel, we employ AffectiveSpace encodings with a dimension of 100 and stack two Graph Attention Network (GAT) layers. In the syntactic channel, we stack two Graph Convolutional Network (GCN) layers. The learning rate is set to 0.00002 and the batch size to 32. We employ the Adam optimizer with a decay rate of 0.001. To prevent overfitting, we apply L2 regularization with a weight of 0.001. Furthermore, the parameters W and b of all neural network layers are randomly initialized from a uniform distribution.

4.3 Baselines

We selected the following models as comparisons:
• ATAE-LSTM [34]: ATAE-LSTM is based on LSTM and can focus on different contexts according to different aspect words.
• CDT [22]: GCN is used to process the syntactic information of the dependency tree, whose structure is simulated by the GCN.
• R-GAT [24]: R-GAT focuses on the complexity of sentence structures and multi-aspect words. The model constructs a dependency tree with aspect words as roots by reconstructing and pruning the parse tree, and then uses this tree as the basis of the GAT operation.
• BERT [18]: fine-tuned BERT.
• BERT-SPC [19]: The ABSA task is converted into a sentence-pair classification task through the input '[CLS] + sentence + [SEP] + aspect + [SEP]', in the form of BERT's 'next sentence prediction' task, using the pooler result as output.
• AEN-BERT [19]: Considering the cost of obtaining labeled datasets, AEN-BERT proposes an adversarial network to generate data similar to real reviews, thereby expanding the dataset and making the model more robust.
• R-GAT-BERT [24]: R-GAT-BERT is the R-GAT model with the Bi-LSTM replaced by BERT.


• DualGCN + BERT [3]: This model takes into account the complementarity of syntactic structure and semantic association, uses the syntactic network SynGCN and the semantic network SemGCN, and finally integrates them through a BiAffine module.
• Sentic GCN-BERT [1]: This model takes into account the importance of external affective knowledge, uses SenticNet as the source of injected affective knowledge, and uses the sentiment dictionary to preprocess the adjacency matrix of the GCN.
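As promised in Sect. 4.2, the following is a minimal, hypothetical configuration sketch of the training setup described there. It is illustrative only: the dictionary keys and the stand-in module are our own names, not the paper's code.

import torch

config = {
    "bert_model": "bert-base-uncased",  # 768-d embeddings, 12 attention heads
    "affectivespace_dim": 100,          # semantic-channel knowledge embedding
    "num_gat_layers": 2,                # semantic channel (GAT)
    "num_gcn_layers": 2,                # syntactic channel (GCN)
    "learning_rate": 2e-5,
    "batch_size": 32,
    "weight_decay": 1e-3,               # Adam decay rate reported in Sect. 4.2
    "l2_coeff": 1e-3,                   # λ in Eq. (20)
}

# Stand-in module so the sketch runs; the real model is the dual-channel DCGK network.
model = torch.nn.Linear(768, 3)
optimizer = torch.optim.Adam(model.parameters(),
                             lr=config["learning_rate"],
                             weight_decay=config["weight_decay"])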

Table 5. Results Comparison

Group | Models          | Rest14 Acc | Rest14 F1 | Lap14 Acc | Lap14 F1 | Rest15 Acc | Rest15 F1 | Rest16 Acc | Rest16 F1
RNN   | ATAE-LSTM       | 77.20 | 67.02 | 68.70 | 63.93 | 78.48 | 60.53 | 83.77 | 61.71
Graph | CDT             | 82.30 | 74.02 | –     | –     | 77.19 | 72.99 | 85.58 | 69.93
Graph | R-GAT           | 83.30 | 76.08 | 77.42 | 73.76 | 80.83 | 64.17 | 88.92 | 70.89
BERT  | BERT            | 84.11 | 76.68 | 77.59 | 73.28 | 83.48 | 66.18 | 90.10 | 74.16
BERT  | BERT-SPC        | 84.32 | 77.15 | 78.54 | 75.26 | –     | –     | –     | –
BERT  | AEN-BERT        | 84.29 | 77.22 | 76.96 | 73.67 | –     | –     | –     | –
BERT  | R-GAT-BERT      | 86.60 | 81.35 | 78.21 | 74.07 | 83.22 | 69.73 | 89.71 | 76.62
BERT  | DualGCN + BERT  | 87.13 | 81.16 | 77.40 | 76.02 | –     | –     | –     | –
BERT  | Sentic GCN-BERT | 86.92 | 81.03 | 82.12 | 79.05 | 85.32 | 71.28 | 91.97 | 79.56
Ours  | DCGK-BERT       | 87.70 | 82.10 | 81.81 | 78.41 | 85.72 | 71.70 | 92.21 | 80.21

Table 6. Ablation Results

Model Branch | Rest14 Acc | Rest14 F1 | Lap14 Acc | Lap14 F1 | Rest15 Acc | Rest15 F1 | Rest16 Acc | Rest16 F1
DCGK w/o AS  | 86.20 (↓1.50) | 80.67 (↓1.43) | 80.80 (↓1.01) | 77.84 (↓0.57) | 84.51 (↓1.21) | 70.11 (↓1.59) | 90.12 (↓2.09) | 78.77 (↓1.44)
DCGK w/o SN  | 85.79 (↓1.91) | 79.95 (↓2.15) | 80.92 (↓0.89) | 77.96 (↓0.45) | 84.83 (↓0.89) | 69.98 (↓1.72) | 90.79 (↓1.42) | 78.89 (↓1.32)
DCGK         | 87.70 | 82.10 | 81.81 | 78.41 | 85.72 | 71.70 | 92.21 | 80.21

4.4 Results Comparison

The results are presented in Table 5. Our model outperforms the compared models on the Rest14, Rest15, and Rest16 datasets. In particular, the Rest14 dataset exhibits a significant improvement, while the Rest15 and Rest16 datasets show smaller improvements.


However, our model's performance on the Lap14 dataset is slightly below the best reported results. We assume that this discrepancy can be attributed to the inherent class imbalance in the dataset. Additionally, the syntactic information in the Lap14 dataset may carry more weight, so a model that focuses solely on enhancing the syntactic channel achieves more notable improvements on this particular dataset.

4.5 Ablation Experiment

To evaluate the significance of each knowledge enhancement module in our proposed DCGK model, we conducted an ablation experiment, and the results are presented in Table 6. Specifically, we focused on the importance of the knowledge-enhanced semantic module (primarily AffectiveSpace) and the knowledge-enhanced syntactic module (primarily SenticNet). To examine the role of these modules, we designed two different model branches: DCGK w/o AS and DCGK w/o SN. From the experimental results displayed in Table 6, we observe that both the knowledge-enhanced semantic module and the knowledge-enhanced syntactic module have an impact on accuracy (Acc) and F1 score. Moreover, the impact of the semantic module is slightly more pronounced.

4.6 Case Study

In this section, we examined the influence of external knowledge on model attention allocation. We take the review “The pancakes were delicious, but the service was terrible.” as the case. The goal is to predict the sentiment polarity of the aspect word “service.” Figure 4 illustrates three attention distribution visualizations, corresponding to the cases when the semantic enhancement module is removed, when the syntactic enhancement module is removed, and when the complete network is used. A darker color indicates a higher level of attention allocated in the forward encoding of the aspect word. Note that the model has several attention heads, so we display the average of the attention values from all attention heads.
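The head-averaging step can be sketched as follows. This is a minimal illustration assuming the attention weights are available as a PyTorch tensor; the tensor and function names are hypothetical, not the paper's.

import torch

# attn: (num_heads, seq_len, seq_len) attention weights for one sentence,
# aspect_idx: position of the aspect token ("service") in the sequence.
def aspect_attention_heatmap(attn: torch.Tensor, aspect_idx: int) -> torch.Tensor:
    """Average over heads, then read off the attention row of the aspect token."""
    mean_attn = attn.mean(dim=0)   # (seq_len, seq_len), head-averaged
    return mean_attn[aspect_idx]   # attention from the aspect to every token

# Example with random weights for a 10-token sentence and 12 heads.
heatmap = aspect_attention_heatmap(torch.softmax(torch.randn(12, 10, 10), dim=-1), aspect_idx=7)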

Fig. 4. Attention Visualization (three heat-map panels over the sentence “The pancakes were delicious, but the service was terrible”: DCGK w/o AS, DCGK w/o SN, and DCGK with both modules).

When the semantic enhancement module is removed, it can be observed that some unrelated words receive attention. For example, in the figure, “delicious” is wrongly given too much attention, for it should be describing “pancakes”. This misallocation of attention to irrelevant words may lead to errors in sentiment polarity prediction. However,


with the semantic enhancement module and the integration of external knowledge, the relevance between “service” and “delicious” becomes less significant. Consequently, the model reduces its attention to “delicious”, avoiding the misunderstanding caused by words related to another target. Additionally, the semantic enhancement module gives more attention to relevant words. By using external knowledge to guide attention allocation, the model can focus on relevant words in distant contexts. When the syntactic enhancement module is removed, the model fails to allocate sufficient attention to the crucial opinion words associated with “service”. By integrating external affective knowledge and including the sentiment polarity of words in the GCN's adjacency matrix, we can assign higher weights to opinion words with strong emotions. This helps the model pay more attention to sentiment-related words. By combining both modules, the model considers not only sentiment-related words but also target-related words, leading to more accurate associations. As depicted in Fig. 4, the model accurately focuses on the keyword “terrible” and is not influenced by the sentiment word “delicious”, which corresponds to another target word.

5 Conclusion

This paper proposes a novel aspect-level sentiment analysis model named DCGK. We utilize a dual-channel graph neural network combination approach, integrating external affective knowledge into both channels, and finally fuse them with an attention mechanism. Our motivation stems from several shortcomings of current aspect-level sentiment analysis methods. We found that most of them make limited use of external affective knowledge and rarely utilize both semantic and syntactic information simultaneously; they often overlook either the semantic dimension or the syntactic dimension. Also, few studies consider the complementarity and inner correlation between the semantic and syntactic dimensions. Our model has achieved significant performance on various datasets. We can also learn from the ablation experiment that external knowledge plays an important role in improving the accuracy of attention allocation. Additionally, we observed weaker performance on the Lap14 dataset and assume that the lack of sufficient syntactic information could be a contributing factor. In future research, we plan to explore the incorporation of additional syntactic information, such as dependency edge types and part-of-speech tags, to further improve the performance of the model.

References 1. Liang, B., Su, H., Gui, L., Cambria, E., Xu, R.: Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks. Knowl.-Based Syst. 235, 107643 (2022). https://doi.org/10.1016/j.knosys.2021.107643 2. Zhang, C., Li, Q., Song, D.: Aspect-based sentiment classification with aspect-specific graph convolutional networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) , pp. 4567–4577. Association for Computational Linguistics, Hong Kong, China (2019). https://doi.org/10.18653/v1/D19-1464


3. Li, R., Chen, H., Feng, F., Ma, Z., Wang, X., Hovy, E.: Dual graph convolutional networks for aspect-based sentiment analysis, p. 11 (2011) 4. Xie, Y., Pu, P.: How commonsense knowledge helps with natural language tasks: a survey of recent resources and methodologies. arXiv, 10 August 2021. https://doi.org/10.48550/arXiv. 2108.04674 5. Pontiki, M., Galanis, D., Pavlopoulos, J., Papageorgiou, H., Androutsopoulos, I., Manandhar, S.: SemEval-2014 task 4: aspect based sentiment analysis (2016) 6. Pontiki, M., Galanis, D., Papageorgiou, H., Manandhar, S., Androutsopoulos, I.: SemEval2015 task 12: aspect based sentiment analysis. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), June 2015, pp. 486–495. Association for Computational Linguistics, Denver, Colorado (2015). https://doi.org/10.18653/v1/S15-2082 7. Pontiki, M., et al.: SemEval-2016 task 5: aspect based sentiment analysis, p. 12 (2016) 8. Brun, C., Popa, D.N., Roux, C.: XRCE: hybrid classification for aspect-based sentiment analysis. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pp. 838–842. Association for Computational Linguistics Dublin, Ireland (2014). https://doi.org/10.3115/v1/S14-2149 9. Vo, D.-T., Zhang, Y.: Target-dependent Twitter sentiment classification with rich automatic features, p. 7 (2015) 10. Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: Proceedings of the International Conference on Web Search and Web Data Mining - WSDM ‘08, p. 231, Palo Alto, California, USA. ACM Press (2008). https://doi.org/10.1145/1341531.134 1561 11. Liu, B., Blasch, E., Chen, Y., Shen, D., Chen, G.: Scalable sentiment classification for Big Data analysis using Naive Bayes Classifier. In: 2013 IEEE International Conference on Big Data, Silicon Valley, CA, USA. IEEE, October 2013, pp. 99–104 (2013). https://doi.org/10. 1109/BigData.2013.6691740 12. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: Sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - EMNLP ’02, pp. 79–86. Association for Computational Linguistics (2002). https://doi.org/10.3115/1118693.1118704 13. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv, 06 September 2013. http://arxiv.org/abs/1301.3781. Accessed 08 December 2022 14. Pennington, J., Socher, R., Manning, C.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543. Association for Computational Linguistics, Doha, Qatar (2014). https://doi.org/10.3115/v1/D14-1162 15. Tang, D., Qin, B., Feng, X., Liu, T.: Effective LSTMs for target-dependent sentiment classification, p. 10 (2015) 16. Tang, D., Qin, B., Liu, T.: Aspect level sentiment classification with deep memory network. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 214–224. Association for Computational Linguistics, Austin, Texas (2016). https://doi. org/10.18653/v1/D16-1021 17. Ma, D., Li, S., Zhang, X., Wang, H.: Interactive attention networks for aspect-level sentiment classification. arXiv, 04 September 2017. http://arxiv.org/abs/1709.00893. Accessed 14 Dec 2022 18. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv, 24 May 2019. 
http://arxiv.org/abs/1810.04805. Accessed 14 Dec 2022 19. Song, Y., Wang, J., Jiang, T., Liu, Z., Rao, Y.: Attentional encoder network for targeted sentiment classification, pp. 93–103 (2019). https://doi.org/10.1007/978-3-030-30490-4_9


20. Sun, C., Huang, L., Qiu, X.: Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentence (2019) 21. Li, X., Bing, L., Zhang, W., Lam, W.: Exploiting BERT for end-to-end aspect-based sentiment analysis. In: Proceedings of the 5th Workshop on Noisy User-Generated Text (W-NUT 2019), pp. 34–41. Association for Computational Linguistics, Hong Kong, China (2019). https://doi. org/10.18653/v1/D19-5505 22. Sun, K., Zhang, R., Mensah, S., Mao, Y., Liu, X.: Aspect-level sentiment analysis via convolution over dependency tree. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5678–5687. Association for Computational Linguistics, Hong Kong, China (2019). https://doi.org/10.18653/v1/D19-1569 23. Liang, B., Yin, R., Gui, L., Du, J., Xu, R.: Jointly learning aspect-focused and inter-aspect relations with graph convolutional networks for aspect sentiment analysis. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 150–161. International Committee on Computational Linguistics, Barcelona, Spain (Online) (2020). https://doi.org/ 10.18653/v1/2020.coling-main.13 24. Wang, K., Shen, W., Yang, Y., Quan, X., Wang, R.: Relational graph attention network for aspect-based sentiment analysis. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3229–3238. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.acl-main.295 25. Cambria, E., Poria, S., Hazarika, D., Kwok, K.: SenticNet 5: discovering conceptual primitives for sentiment analysis by means of context embeddings. In: AAAI, vol. 32, no. 1, April 2018. https://doi.org/10.1609/aaai.v32i1.11559 26. Yang, P., Li, L., Luo, F., Liu, T., Sun, X.: Enhancing topic-to-essay generation with external commonsense knowledge. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2002–2012. Association for Computational Linguistics, Florence, Italy (2019). doi: https://doi.org/10.18653/v1/P19-1193 27. Li, Y., Pan, Q., Yang, T., Wang, S., Tang, J., Cambria, E.: Learning word representations for sentiment analysis. Cogn. Comput. 9(6), 843–851 (2017). https://doi.org/10.1007/s12559017-9492-2 28. Ma, Y., Peng, H., Cambria, E.: Targeted aspect-based sentiment analysis via embedding commonsense knowledge into an attentive LSTM. In: AAAI, vol. 32, no. 1, April 2018. https://doi.org/10.1609/aaai.v32i1.12048 29. Li, Y., Sun, X., Wang, M.: Embedding extra knowledge and a dependency tree based on a graph attention network for aspect-based sentiment analysis. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE, Shenzhen, China July 2021. https:// doi.org/10.1109/IJCNN52387.2021.9533695 30. Cambria, E., Fu, J., Bisio, F., Poria, S.: AffectiveSpace 2: enabling affective intuition for concept-level sentiment analysis. In: AAAI, vol. 29, no. 1, February 2015. https://doi.org/10. 1609/aaai.v29i1.9230 31. Goldberg, Y.: Assessing BERT’s syntactic abilities. arXiv, 16 January 2019. http://arxiv.org/ abs/1901.05287. Accessed 29 June 2023 32. Htut, P.M., Phang, J., Bordia, S., Bowman, S.R.: Do attention heads in BERT track syntactic dependencies? arXiv, 27 November 2019. http://arxiv.org/abs/1911.12246. Accessed 29 June 2023 33. Cambria, E., Hussain, A.: Sentic Computing. SpringerBriefs in Cognitive Computation, vol. 2. Springer, Dordrecht (2012). 
https://doi.org/10.1007/978-94-007-5070-8


34. Wang, Y., Huang, M., Zhu, X., Zhao, L.: Attention-based LSTM for aspect-level sentiment classification. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 606–615. Association for Computational Linguistics, Austin, Texas (2016). https://doi.org/10.18653/v1/D16-1058 35. Jawahar, G., Sagot, B., Seddah, D.: What Does BERT Learn about the Structure of Language? In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, pp. 3651–3657 (2019). https://doi.org/10.18653/v1/P19-1356

New Predefined-Time Stability Theorem and Applications to the Fuzzy Stochastic Memristive Neural Networks with Impulsive Effects Hui Zhao1 , Lei Zhou1 , Qingjie Wang1,2(B) , Sijie Niu1 , Xizhan Gao1 , and Xiju Zong1 1

Shandong Provincial Key Laboratory of Network Based Intelligent Computing, School of Information Science and Engineering, University of Jinan, Jinan 250022, China [email protected] 2 College of Information Science and Engineering, Northeastern University, Shenyang 110819, China

Abstract. The paper mainly investigates the issue of achieving predefined-time synchronization for fuzzy memristive neural networks with both impulsive effects and stochastic disturbances. Firstly, because existing predefined-time stability theorems can hardly be applied to systems with impulsive effects, a new predefined-time stability theorem is proposed to solve the stability problem of such systems. The theorem is flexible and can guide impulsive stochastic fuzzy memristive neural network models to achieve predefined-time synchronization. Secondly, since the sign function can easily cause the chattering phenomenon and thereby degrade synchronization performance, a novel and effective feedback controller without the sign function is designed to eliminate this chattering. In addition, the paper handles the combined influence of fuzzy logic, memristive state dependence, and stochastic disturbance, and gives effective conditions ensuring that the two stochastic systems achieve predefined-time synchronization. Finally, the effectiveness of the proposed theoretical results is demonstrated in detail through a numerical simulation.

Keywords: Predefined-time Synchronization · Stochastic Disturbance · Fuzzy Neural Networks · Impulsive Effects

1 Introduction

In recent decades, neural networks have been extensively studied for their wide applications in information processing and pattern recognition [1–3]. The models of neural networks include inertial neural networks, reaction diffusion, memristor neural networks and so on. In particular, since the memristor neural network


can better simulate the structure and function of the human brain, the synchronization of neural networks based on memristors has attracted the attention of many researchers [3–5].

In practice, systems are often required to achieve synchronization within a finite time. In order to achieve synchronization as soon as possible, Kamenkov proposed the concept of finite-time stability [6]. The settling time of finite-time synchronization depends on the initial state of the system [7–9], but in many systems with unknown initial states, finite-time synchronization is no longer applicable. Fixed-time stability was therefore proposed and applied to fixed-time synchronization without initial-value dependence [10]. In recent years, many research results on fixed-time synchronization have been obtained [11–13]. Although fixed-time synchronization has high performance, it is difficult to control the synchronization time within the range we need. Predefined-time synchronization is a more flexible approach than fixed-time synchronization: the synchronization time is a parameter in the controller and can be set freely. Recently, predefined-time synchronization has attracted extensive attention from researchers [14–16].

In practical applications, network nodes are inevitably subject to stochastic disturbances during the process of transmitting signals. This may result in the loss of information in the transmitted signal and significantly impact the neural network's behavior. Therefore, to study and simulate real neural networks, the influence of stochastic disturbances must be considered when establishing neural network models. In addition, for many real networks, the state of nodes is often subject to instantaneous disturbances caused by factors such as frequency disturbances and burst noise, resulting in the state of nodes undergoing a sudden change at certain moments, i.e., forming the impulsive effect. There have been many papers on the predefined-time synchronization of neural networks under the influence of stochastic disturbances or impulsive effects [17–19]. However, few papers have investigated the problem of predefined-time synchronization of neural networks under the combined influence of both stochastic disturbances and impulsive effects.

In mathematical modeling of practical problems, unfavorable factors such as fuzziness and uncertainty are often encountered. Fuzzy logic theory, which can approximate any nonlinear function, has become a universal approximation method. This theory was first applied to cellular neural networks in references [20,21], playing an important role in handling uncertainty and fuzziness. In recent years, synchronization of various fuzzy neural networks has received widespread attention [22,23]. However, due to the complexity of predefined-time synchronization of memristive fuzzy neural networks, few related papers have been published.

Inspired by the above discussions, this paper investigates the problem of achieving predefined-time synchronization of fuzzy memristive neural networks under both impulsive effects and stochastic disturbances. The main contributions of this paper are summarized as follows:


(1) A new predefined-time stability theorem has been proposed, which is flexible and applicable to impulsive systems, filling a gap in existing research.
(2) A simple and effective controller is designed. The controller does not use the sign function, thereby avoiding chattering. By adding a user-defined preset parameter Tc to the controller, the stability time of the system can be flexibly adjusted.
(3) A new criterion for achieving predefined-time synchronization of stochastic memristive neural networks with impulsive effects has been proposed.

The organization of this paper is as follows: In Sect. 2, the fuzzy stochastic memristive neural network model with impulsive effects and relevant preliminary knowledge are introduced. Section 3 reports the main results of this paper. Section 4 designs numerical simulation experiments and presents the experimental results. Finally, in Sect. 5, the conclusions are drawn.

2 Problem Statement and Preliminaries

Consider a class of fuzzy stochastic memristive neural networks with impulsive effects

\[
\begin{cases}
dx_i(t) = \Big[-d_i x_i(t) + \sum_{j=1}^{n} a_{ij}(x_i(t))\hat f_j(x_j(t)) + \bigwedge_{j=1}^{n} b_{ij}\hat f_j(x_j(t)) + \bigvee_{j=1}^{n} \hat b_{ij}\hat f_j(x_j(t)) + I_i\Big]dt + \hat g_i(t, x_i(t))\,d\omega(t), & t \ge 0,\ t \ne t_k,\\
\Delta x_i(t_k) = \mu_k x_i(t_k^-), & t = t_k,\ k \in \mathbb{N},
\end{cases} \qquad (1)
\]

where $i = 1, \ldots, n$, $x_i(t)$ is the state of the $i$th neuron at time $t$, $d_i > 0$ represents the self-inhibition of the $i$th neuron, $b_{ij}$ and $\hat b_{ij}$ are elements of the fuzzy feedback MIN template and fuzzy feedback MAX template, $\bigwedge$ and $\bigvee$ are the fuzzy AND and fuzzy OR operations, $\hat f_j(\cdot)$ is the activation function, $I_i$ stands for the external input, $\hat g_i(t, x_i(t)) \in \mathbb{R}^n$ denotes the noise intensity function, and $\omega$ is an $n$-dimensional Brownian motion defined on $(\Omega, \mathcal{F}, \mathcal{P})$. $\mu_k$ is the strength of the impulses at time $t_k$. The sequence $\{t_1, t_2, t_3, \ldots\}$ denotes the impulsive instants and $0 \le t_1 < t_2 < \cdots < t_k < \cdots$ ($t_k \to \infty$ as $k \to \infty$), $\Delta x_i(t_k) = x_i(t_k^+) - x_i(t_k^-)$, $x_i(t_k) = x_i(t_k^+) = \lim_{t \to t_k^+} x_i(t)$, $x_i(t_k^-) = \lim_{t \to t_k^-} x_i(t)$. $a_{ij}(x_i(t))$ represents the memristor connection weight and is defined as

\[
a_{ij}(x_i(t)) =
\begin{cases}
\grave a_{ij}, & |x_i(t)| \le T_i,\\
\acute a_{ij}, & |x_i(t)| > T_i,
\end{cases}
\]

in which the switching jump $T_i > 0$ and $\grave a_{ij}$, $\acute a_{ij}$ are known constants. Let $a^m_{ij} = \max\{|\grave a_{ij}|, |\acute a_{ij}|\}$. System (1) is considered as the driving system; the corresponding response system is described as:


\[
\begin{cases}
dy_i(t) = \Big[-d_i y_i(t) + \sum_{j=1}^{n} a_{ij}(y_i(t))\hat f_j(y_j(t)) + \bigwedge_{j=1}^{n} b_{ij}\hat f_j(y_j(t)) + \bigvee_{j=1}^{n} \hat b_{ij}\hat f_j(y_j(t)) + u_i(t) + I_i\Big]dt + \hat g_i(t, y_i(t))\,d\omega(t), & t \ge 0,\ t \ne t_k,\\
\Delta y_i(t_k) = \mu_k y_i(t_k^-), & t = t_k,\ k \in \mathbb{N},
\end{cases} \qquad (2)
\]

where $i = 1, \ldots, n$, $y_i(t)$ denotes the state of the $i$th neuron of the response system at time $t$, and $u_i(t)$ represents the appropriate control input to be designed, which will be given later.

Next, let the convex closure hull of a set $E$ be denoted as $\overline{co}\{E\}$. The set-valued mapping is as follows:

\[
K[a_{ij}(x_i(t))] =
\begin{cases}
\grave a_{ij}, & |x_i(t)| < T_i,\\
\overline{co}\{\grave a_{ij}, \acute a_{ij}\}, & |x_i(t)| = T_i,\\
\acute a_{ij}, & |x_i(t)| > T_i.
\end{cases}
\]

Based on the theories of set-valued maps and differential inclusions, system (1) can be described by the following differential inclusion:

\[
\begin{cases}
dx_i(t) \in \Big[-d_i x_i(t) + \sum_{j=1}^{n} K[a_{ij}(x_i(t))]\hat f_j(x_j(t)) + \bigwedge_{j=1}^{n} b_{ij}\hat f_j(x_j(t)) + \bigvee_{j=1}^{n} \hat b_{ij}\hat f_j(x_j(t)) + I_i\Big]dt + \hat g_i(t, x_i(t))\,d\omega(t), & t \ge 0,\ t \ne t_k,\\
\Delta x_i(t_k) = \mu_k x_i(t_k^-), & t = t_k,\ k \in \mathbb{N}.
\end{cases} \qquad (3)
\]

Equivalently, there exist $\check a_{ij} \in K[a_{ij}(x_i(t))]$ and $\tilde a_{ij} \in K[a_{ij}(y_i(t))]$ such that

\[
\begin{cases}
dx_i(t) = \Big[-d_i x_i(t) + \sum_{j=1}^{n} \check a_{ij}\hat f_j(x_j(t)) + \bigwedge_{j=1}^{n} b_{ij}\hat f_j(x_j(t)) + \bigvee_{j=1}^{n} \hat b_{ij}\hat f_j(x_j(t)) + I_i\Big]dt + \hat g_i(t, x_i(t))\,d\omega(t), & t \ge 0,\ t \ne t_k,\\
\Delta x_i(t_k) = \mu_k x_i(t_k^-), & t = t_k,\ k \in \mathbb{N}.
\end{cases} \qquad (4)
\]

Similarly, system (2) can be rewritten as

\[
\begin{cases}
dy_i(t) = \Big[-d_i y_i(t) + \sum_{j=1}^{n} \tilde a_{ij}\hat f_j(y_j(t)) + \bigwedge_{j=1}^{n} b_{ij}\hat f_j(y_j(t)) + \bigvee_{j=1}^{n} \hat b_{ij}\hat f_j(y_j(t)) + u_i(t) + I_i\Big]dt + \hat g_i(t, y_i(t))\,d\omega(t), & t \ge 0,\ t \ne t_k,\\
\Delta y_i(t_k) = \mu_k y_i(t_k^-), & t = t_k,\ k \in \mathbb{N}.
\end{cases} \qquad (5)
\]


Let $e_i(t) = y_i(t) - x_i(t)$ be the synchronization error. Based on systems (4) and (5), the error system can be derived as follows:

\[
\begin{cases}
de_i(t) = \Big[-d_i e_i(t) + \sum_{j=1}^{n}\big(\tilde a_{ij}\hat f_j(y_j(t)) - \check a_{ij}\hat f_j(x_j(t))\big) + \bigwedge_{j=1}^{n} b_{ij} f_j(e_j(t)) + \bigvee_{j=1}^{n} \hat b_{ij} f_j(e_j(t)) + u_i(t)\Big]dt + g_i(t, e_i(t))\,d\omega(t), & t \ge 0,\ t \ne t_k,\\
\Delta e_i(t_k) = \mu_k e_i(t_k^-), & t = t_k,\ k \in \mathbb{N},
\end{cases} \qquad (6)
\]

where $f_j(e_j(t)) = \hat f_j(y_j(t)) - \hat f_j(x_j(t))$ and $g_i(t, e_i(t)) = \hat g_i(t, y_i(t)) - \hat g_i(t, x_i(t))$.

Assumption 1. An impulsive sequence $\zeta = \{t_k, k \in \mathbb{N}^+\}$ belongs to $\Omega \triangleq \{t_k, k \in \mathbb{N}^+ \mid t_1 < t_2 < \cdots < t_k < \cdots,\ \lim_{k\to+\infty} t_k = +\infty\}$ and has an average impulsive interval $\tau_a$. Assume that there exists a positive constant $N_0$ such that
\[
\frac{t-s}{\tau_a} - N_0 \le N_\zeta(s, t) \le \frac{t-s}{\tau_a} + N_0,
\]
where $N_\zeta(s, t)$ is the number of impulses of the impulsive sequence $\zeta$ in the time interval $(s, t)$.

Assumption 2. For each $j$, the activation function $\hat f_j$ is bounded and Lipschitz continuous: there exists a positive constant $l_j$ such that $|\hat f_j(y_j) - \hat f_j(x_j)| \le l_j |e_j(t)|$.

Assumption 3. Suppose the noise intensity function $g_i(t, e_i(t))$ satisfies the uniform Lipschitz condition, and there exists a positive constant $\lambda_i$ such that $\mathrm{trace}[g_i^T(t, e_i(t)) g_i(t, e_i(t))] \le \lambda_i e_i^T(t) e_i(t)$.

Definition 1 ([24]). If the system (6) is globally stochastically fixed-time stable and there exists a system parameter $T_c$ such that the settling-time function satisfies $T(e_0, \omega) \le T_c$, then the system (6) is said to be globally stochastically predefined-time stable.

Lemma 1 ([20]). Suppose $b_{ij}, \hat b_{ij}, x_j, y_j \in \mathbb{R}$ and $f_j: \mathbb{R} \to \mathbb{R}$ are continuous functions. Then
\[
\Big|\bigwedge_{j=1}^{n} b_{ij} f_j(y_j) - \bigwedge_{j=1}^{n} b_{ij} f_j(x_j)\Big| \le \sum_{j=1}^{n} |b_{ij}|\,|f_j(y_j) - f_j(x_j)|, \qquad
\Big|\bigvee_{j=1}^{n} \hat b_{ij} f_j(y_j) - \bigvee_{j=1}^{n} \hat b_{ij} f_j(x_j)\Big| \le \sum_{j=1}^{n} |\hat b_{ij}|\,|f_j(y_j) - f_j(x_j)|.
\]

Lemma 2 ([25]). Under Assumption 2, for any $i, j = 1, 2, \cdots, n$, if $\hat f_j(\pm T_j) = 0$, then $|\tilde a_{ij}\hat f_j(y_j(t)) - \check a_{ij}\hat f_j(x_j(t))| \le a^m_{ij} l_j |e_j(t)|$.


Lemma 3 ([26]). If $x_1, x_2, \ldots, x_n \ge 0$, $0 < p < 1$, and $q > 1$, then
\[
\sum_{i=1}^{n} x_i^{p} \ge \Big(\sum_{i=1}^{n} x_i\Big)^{p}, \qquad
\sum_{i=1}^{n} x_i^{q} \ge n^{1-q}\Big(\sum_{i=1}^{n} x_i\Big)^{q}.
\]

3 Main Results

3.1 Predefined-Time Stability

Theorem 1. Suppose that Assumption 1 holds. If there exists a positive definite and radially unbounded function $V(\cdot): \mathbb{R}^n \to \mathbb{R}^+ \cup \{0\}$ such that the following two conditions hold:

1. When $t \ne t_k$, there exist $\xi_1, \xi_2, \xi_3, T_c > 0$, $0 < \alpha < 1$, $\beta > 1$, $W_1 = \dfrac{2}{\rho^{2N_0(1-\alpha)}\,\xi_1(1-\alpha)}$, $W_2 = \dfrac{2}{\rho^{N_0(\beta-1)}\,\xi_2(\beta-1)}$, satisfying
\[
\mathcal{L}V(x, t) \le -\frac{W_1}{T_c}\xi_1 V^{\alpha}(x, t) - \frac{W_2}{T_c}\xi_2 V^{\beta}(x, t) - \xi_3 V(x, t).
\]
2. When $t = t_k$, there exists $0 < \rho \le 1$ such that $V(x, t_k) \le \rho V(x, t_k^-)$.

Then the system (6) is globally stochastically predefined-time stable within the time $T_c$.

Proof. For simplicity, denote $V(x, t)$ by $V(t)$. According to the Itô formula,
\[
dV(t) = \mathcal{L}V(t)\,dt + \sum_{i=1}^{n} V_{x_i}(t)\, g_i(t, e_i(t))\,d\omega(t),
\]
where $V_{x_i}(t) = \partial V(t)/\partial x_i$, for $t \in [t_{k-1}, t_k)$. Taking the mathematical expectation $\mathbb{E}$ on both sides, we can get
\[
\frac{d\mathbb{E}V(t)}{dt} = \mathbb{E}\mathcal{L}V(t).
\]
When $t \ne t_k$, it can be obtained that
\[
\frac{d\mathbb{E}V(t)}{dt} \le
\begin{cases}
-\dfrac{W_1}{T_c}\xi_1 \mathbb{E}V^{\alpha}(t) - \xi_3 \mathbb{E}V(t), & 0 < V(t) < 1,\ t \ne t_k,\\[4pt]
-\dfrac{W_2}{T_c}\xi_2 \mathbb{E}V^{\beta}(t) - \xi_3 \mathbb{E}V(t), & V(t) \ge 1,\ t \ne t_k,\\[4pt]
0, & V(t) = 0,\ t \ne t_k.
\end{cases} \qquad (7)
\]
When $t = t_k$, $\mathbb{E}V(t_k) \le \rho\, \mathbb{E}V(t_k^-)$.

Now consider the following auxiliary equation:
\[
\begin{cases}
\dot r(t) =
\begin{cases}
-\dfrac{W_1}{T_c}\xi_1 r^{\alpha}(t) - \xi_3 r(t), & 0 < r(t) < 1,\ t \ne t_k,\\[4pt]
-\dfrac{W_2}{T_c}\xi_2 r^{\beta}(t) - \xi_3 r(t), & r(t) \ge 1,\ t \ne t_k,\\[4pt]
0, & r(t) = 0,\ t \ne t_k,
\end{cases}\\
r(t_k) = \rho\, r(t_k^-), \quad t = t_k,\\
r(0) = r_0 = V(0).
\end{cases} \qquad (8)
\]

Compared Eqs. (7) and (8), we can see that 0 ≤ EV (t) ≤ r(t), t ≥ 0. Consequently, if there exists T > 0 such that r(t) ≡ 0, for t > T , then EV (t) ≡ 0, for t > T. Hence, the stability of Eq. (8) implies stability of the zero solution of Eq. (7). Furthermore, the predefined-time stability of the system (6) can be obtained. It is obtained from Eq. (8) that r(t) is strictly decreasing on [0, +∞), if 0 < ρ ≤ 1. If r0 > 1 and there exist constants T1 and T2 such that r(t) reaches 1 at time T1 and 0 from 1 at time T2 . Then r(t) can goes to 0 in the time T1 + T2 . In order to get T1 and T2 , we divide the proof into two cases: 0 < ρ < 1 and ρ = 1. Case 1: 0 < ρ < 1. When r(t) ≥ 1, let η(t) = r1−β (t). It can be deduced that η → 1 when r → 1 and η → 0 when r → +∞. Then, we can get ⎧ W2 ⎪ ⎪ η(t) ˙ = ξ3 (β − 1)η(t) + ξ2 (β − 1), 0 < η(t) ≤ 1, t = tk , ⎪ ⎨ Tc (9) η(tk ) = ρ1 η(t− t = tk , ⎪ k ), ⎪ ⎪ ⎩ η(0) = η0 = r01−β , where ρ1 = ρ1−β > 1, we derive from Eq. (9) that N (0,t)

η(t) = eξ3 (β−1)t ρ1 ζ

η(0) +

W2 ξ2 (β − 1) Tc

t 0

N (s,t)

eξ3 (β−1)(t−s) ρ1 ζ

ds.

(10)

N (0,0)

Since η(0) = ρ1 ζ η(0) = η0 < 1, limt→+∞ η(t) = +∞ and η(t) is monotonously increasing when t ≥ 0, there exists T1 such that limt→T1 η(t) = 1 and 0 < η(t) < 1 for 0 < t < T1 . According to Eq. (10), we can get N (0,t)

eξ3 (β−1)t ρ1 ζ

η(0) +

N (0,t)

Since eξ3 (β−1)t ρ1 ζ

0

t

W2 ξ2 (β − 1) Tc

t 0

t−s

η(0) ≥ 0 and ρ1τa t−s

N (s,t)

eξ3 (β−1)(t−s) ρ1 ζ

−N0

eξ3 (β−1)(t−s) ρ1τa ds ≤

ds = 1.

(11)

≤ ρNζ (s,t) , we have

0 Tc ρN 1 . W2 ξ2 (β − 1)

(12)


Combined with inequality ln(1 + x) ≤ x, x ≥ −1, solve the Eq. (12), we have t≤

0 0 Tc ρN τa Tc ρN 1 (lnρ1 + τa ξ3 (β − 1)) 1 )≤ . ln(1 + ξ3 τa (β − 1) + lnρ1 W2 ξ2 τa (β − 1) W2 ξ2 (β − 1)

Substituting ρ1 = ρ1−β and W2 = ρN0 (β−1)2ξ (β−1) , we get T1 = t ≤ T2c . Thus, 2 when t → T1 , we can get η(t) → 1 implies r(t) → 1. Next, we need to estimate the time T2 such that r(t) reaches 0 from 1. We suppose η(t) = r1−α (t), when 0 < r(t) < 1. It follows that η(t) → 1 when r(t) → 1 and η → 0 when r(t) → 0, therefore ⎧ W1 ⎪ ⎪ ˙ = ξ3 (α − 1)η(t) + ξ1 (α − 1), 0 < η(t) < 1, t = tk , ⎪ ⎨ η(t) Tc (13) η(tk ) = ρ2 η(t− ⎪ k ), t = tk , ⎪ ⎪ ⎩ η(T1 ) = r1−α (T1 ) = 1, where ρ2 = ρ1−α implies ρ2 ∈ (0, 1). Similar to Eq. (10), one has, from Eq. (13), that

t W1 N (s,t) ξ3 (α−1)(t−T1 ) Nζ (T1 ,t) ρ2 η(T1 ) − ξ1 (1 − α) eξ3 (α−1)(t−s) ρ2 ζ ds. η(t) = e Tc T1 It can be concluded that η(t) is decreasing on [T1 , +∞]. Since 0 < ρ2 < 1, t−s

one has ρ2τa obtain

+N0

N (s,t)

≤ ρ2 ζ

t−s

≤ ρ2τa

t−T1

−N0

. Let η(T1 ) = 1, η(t) = 0, then we can

 t W1 N (s,t) ξ1 (1 − α) eξ3 (α−1)(t−s) ρ2 ζ ds, Tc T1  t t−s +N0 W1 − ξ1 (1 − α) eξ3 (α−1)(t−s) ρ2τa ds, Tc T1

N (T1 ,t)

0 = η(t) = eξ3 (α−1)(t−T1 ) ρ2 ζ ≤ eξ3 (α−1)(t−T1 ) ρ2 τa

−N0

t−T1

0 + = eξ3 (α−1)(t−T1 ) ρ2 τa (ρ−N 2



0 0 W1 ξ1 ρN W1 ξ1 ρN 2 (1 − α)τa 2 (1 − α)τa )− . Tc (ξ3 (1 − α)τa − lnρ2 ) Tc (ξ3 (1 − α)τa − lnρ2 )

(14) t−T1

N0 (1−α)τ

W ξ ρ

N0 (1−α)τ

W ξ ρ

1 2 a 1 2 a 0 Let h(t) = eξ3 (α−1)(t−T1 ) ρ2 τa (ρ−N + Tc (ξ13 (1−α)τ )− Tc (ξ13 (1−α)τ . 2 a −lnρ2 ) a −lnρ2 ) ˙ Since h(T1 ) > 0, h(+∞) < 0 and h(t) < 0, there exists a unique time t > 0 such that h(t) = 0. Then we have t−T1

0 eξ3 (α−1)(t−T1 ) ρ2 τa (ρ−N + 2

0 0 W 1 ξ 1 ρN W 1 ξ 1 ρN 2 (1 − α)τa 2 (1 − α)τa )− = 0. Tc (ξ3 (1 − α)τa − lnρ2 ) Tc (ξ3 (1 − α)τa − lnρ2 )

By solving the above inequality in combination with inequality ln(1 + x) ≤ x, x ≥ −1, we can obtain t − T1 =

τa Tc (ξ3 τa (1 − α) − lnρ2 ) Tc ln(1 + )≤ . 2N0 2N0 (1 − α)ξ3 τa − lnρ2 W1 ρ2 ξ1 (1 − α)τa W1 ρ2 ξ1 (1 − α)


Substitute ρ2 = ρ1−α and W1 = Tc 2 . Hence, when T1 + T 2 ≤ T c .

2 , ρ2N0 (1−α) ξ1 (1−α)


we can get T2 = t − T1 ≤

0 < ρ < 1, we can conclude that r(t) ≡ 0 within the time

2 2 , W2 = ξ2 (β−1) . Case 2: ρ = 1. It is easy to observe ρ1 = ρ2 = 1, W1 = ξ1 (1−α) When r(t) ≥ 1, applying the same analytical method as case 1, we can obtain  T1 ≤ T2c . when 0 < r(t) < 1, according to Eq. (14), we can get 

0 = eξ3 (α−1)(t−T1 ) −

W1 ξ1 (1 − α) Tc

t 

eξ3 (α−1)(t−s) ds.

T1

Solving above equation gives t − T1 = T2 =

Tc ξ3 Tc 1 Tc ln(1 + = . )≤ ξ3 (1 − α) W1 ξ1 W1 ξ1 (1 − α) 2 



Therefore, when $\rho = 1$, we can obtain $r(t) \equiv 0$ within the time $T_1 + T_2 \le T_c$. Based on the above discussion, it can be concluded that $r(t) \equiv 0$ when $t \ge T_c$. That is to say, the system (6) is globally stochastically predefined-time stable within the time $T_c$. The proof is completed.

3.2 Predefined-Time Synchronization

In order to achieve predefined-time synchronization between systems (1) and (2), we design a non-chattering state feedback controller
\[
u_i(t) = -k_{1i} e_i(t) - \frac{W_1}{T_c} k_{2i}\, e_i(t)^{\frac{s}{m}} - \frac{W_2}{T_c} k_{3i}\, e_i(t)^{\frac{p}{q}}, \qquad (15)
\]
where $i = 1, 2, \ldots, n$, $T_c$ is an adjustable constant, $W_1 = \dfrac{2}{\rho^{2N_0(1-\alpha)}\,\xi_1(1-\alpha)}$, $W_2 = \dfrac{2}{\rho^{N_0(\beta-1)}\,\xi_2(\beta-1)}$, $\alpha = \dfrac{s+m}{2m}$, $\beta = \dfrac{p+q}{2q}$, $\xi_1 = \min_i\{2k_{2i}\}$, $\xi_2 = \min_i\{2k_{3i}\}\, n^{\frac{q-p}{2q}}$, $s, m, p, q$ are odd numbers satisfying $s < m$ and $p > q$, $k_{1i}$ is a constant to be determined, and $k_{2i}$ and $k_{3i}$ are positive constants.
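As an illustrative sketch (our own code, not from the paper), the controller (15) can be implemented componentwise as follows. The odd exponent ratios s/m and p/q must be evaluated as signed real roots so that negative errors are handled correctly; the example call reuses the gain and weight values reported in the numerical example of Sect. 4.

import numpy as np

def signed_power(e, num, den):
    """Real-valued e^(num/den) for odd num, den: sign(e) * |e|^(num/den)."""
    return np.sign(e) * np.abs(e) ** (num / den)

def controller(e, k1, k2, k3, W1, W2, Tc, s=3, m=5, p=3, q=1):
    """Componentwise feedback u_i(t) of Eq. (15); arguments are NumPy arrays or scalars."""
    e = np.asarray(e, dtype=float)
    return (-k1 * e
            - (W1 / Tc) * k2 * signed_power(e, s, m)
            - (W2 / Tc) * k3 * signed_power(e, p, q))

# Example with the values used later in the paper's simulation (Tc = 1).
u = controller(np.array([-1.6, 0.8]), k1=15.0, k2=1.0, k3=1.0, W1=5.9772, W2=3.125, Tc=1.0)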

k1i > −di +

 1 1  m ˆ (aij + |bij | + |ˆbij |)lj + (am ji + |bji | + |bji |)li + λi . 2 j=1 2

Then, under the action of the controller (15), the systems (1) and (2) can achieve globally stochastic predefined-time synchronization within the time Tc .

284

H. Zhao et al.

Proof. The Lyapunov function is constructed as V (e(t)) = o s formula, we can get V (t) as V (e(t)). When t = tk , by Itˆ LV (t) = 2

n 

Denote

j=1

n 

bij fj (ej (t)) +

j=1



2 i=1 ei (t).

 n  ei (t) − di ei (t) + (˜ aij fˆj (yj (t)) − a ˇij fˆj (xj (t)))

i=1

+

n

n 

ˆbij fj (ej (t)) − k1i ei (t) − W1 k2i ei (t) ms Tc j=1

(16)

  n p W2 k3i ei (t) q + trace[giT (t, ei (t))gi (t, ei (t))]. Tc i=1

By Assumption 3, we get the following inequality n 

trace[giT (t, ei (t))gi (t, ei (t))] ≤

i=1

n 

λi eTi (t)ei (t) =

i=1

n 

λi e2i (t).

(17)

i=1

Based on Lemma 1, we can get 2

n 

ei (t)

i=1

2

n 

ei (t)

i=1

n 

bij fj (ej (t)) ≤

j=1 n 

n  n  (|bij |lj + |bji |li )|ei (t)|2 , i=1 j=1

(18)

n n   ˆbij fj (ej (t)) ≤ (|ˆbij |lj + |ˆbji |li )|ei (t)|2 .

j=1

i=1 j=1

Similarly, according to Lemma 2, we have 2

n 

ei (t)

i=1

n 

(˜ aij fˆj (yj (t)) − a ˇij fˆj (xj (t))) ≤

j=1

n  n 

2 2 am ij lj (|ei (t)| + |ej (t)| ),

i=1 j=1

=

n  n 

m 2 (am ij lj + aji li )|ei (t)| .

i=1 j=1

(19) Substituting Eqs. (17)-(19) into Eq. (16), we derive that LV (t) ≤ −

n   i=1

2di − 

n  

m ˆ ˆ (am ij + |bij | + |bij |)lj + (aji + |bji | + |bji |)li



j=1

− λi + 2k1i e2i (t) −

n n p+q s+m W1  W2  2k2i (e2i (t)) 2m − 2k3i (e2i (t)) 2q . Tc i=1 Tc i=1

 n  m ˆ ˆ Let Λi = 2di − j=1 (am ij + |bij | + |bij |)lj + (aji + |bji | + |bji |)li − λi + 2k1i .   1 n m ˆ ˆ since k1i > −di + 12 j=1 (am ij + |bij | + |bij |)lj + (aji + |bji | + |bji |)li + 2 λi , Λi > 0. Then we have

New Predefined-Time Stability Theorem and Applications

LV (t) ≤ −

n 

Λi e2i (t) −

i=1

≤ − min{Λi } i



n 

285

n n p+q s+m W1  W2  2k2i (e2i (t)) 2m − 2k3i (e2i (t)) 2q , Tc i=1 Tc i=1

e2i (t) −

i=1

n  s+m W1 min{2k2i } (e2i (t)) 2m Tc i i=1

n  p+q W2 min{2k3i } (e2i (t)) 2q . Tc i i=1

Since s < m, p > q, we get 0 < derive that LV (t) ≤ − min{Λi } i



n 

s+m 2m

e2i (t) −

i=1

< 1, p+q 2q > 1. From Lemma c3, we

n  s+m W1 min{k2i }( e2i (t)) 2m Tc i i=1

n q−p  p+q W2 min{k3i }n 2q ( e2i (t)) 2q . Tc i i=1

Let ξ1 = min{2k2i }, ξ2 = min{2k3i }n i

i

q−p 2q

, ξ3 = min{Λi }, we can get

n 

i

n 

n  p+q s+m W1 W2 2 2 2q 2m LV (t) ≤ − ξ1 ( ei (t)) − ξ2 ( ei (t)) − ξ3 e2i (t), Tc Tc i=1 i=1 i=1

=−

W1 ξ1 V Tc

s+m 2m

(t) −

W2 ξ2 V Tc

p+q 2q

(t) − ξ3 V (t).

When t = tk , we have V (tk ) =

n  i=1

e2i (tk ) =

n  i=1

e2 (t+ k)=

n  i=1

− 2 (1 + μk )2 e2i (t− k ) = (1 + μk ) V (tk ).

Let 0 < (1 + μk )2 = ρk ≤ 1. According to Theorem 1, the systems (1) and (2) can achieve globally stochastic predefined-time synchronization within the time Tc . The proof is completed.

4 Numerical Examples

Consider system (1) and system (2) as the driving system and response system, respectively. The system parameters are as follows:
\[
a_{11}(x_1(t)) = \begin{cases} 2, & |x_1(t)| \le 1,\\ 1.9, & |x_1(t)| > 1,\end{cases} \qquad
a_{12}(x_1(t)) = \begin{cases} -3, & |x_1(t)| \le 1,\\ -4, & |x_1(t)| > 1,\end{cases}
\]
\[
a_{21}(x_2(t)) = \begin{cases} -2, & |x_2(t)| \le 1,\\ -2.1, & |x_2(t)| > 1,\end{cases} \qquad
a_{22}(x_2(t)) = \begin{cases} 1, & |x_2(t)| \le 1,\\ 1.8, & |x_2(t)| > 1,\end{cases}
\]
\[
\{b_{ij}\} = \begin{pmatrix} 0.5 & 0.6\\ 0.9 & 0.3 \end{pmatrix}, \qquad
\{\hat b_{ij}\} = \begin{pmatrix} 4.2 & -3.5\\ 6.1 & 1.2 \end{pmatrix}, \qquad i = 1, 2,\ j = 1, 2.
\]

The initial values are $x(0) = [1.2, 0.3]^T$ and $y(0) = [-0.4, 1.1]^T$, $d_1 = 0.8$, $d_2 = 0.9$, $\hat f_j(\cdot) = \tanh(\cdot)$, $\mu_k = -0.4$, $\hat g(t, x(t)) = (\hat g_1(t, x_1(t)), \hat g_2(t, x_2(t)))^T = 0.4\,\mathrm{diag}(x_1(t), x_2(t))$, $\hat g(t, y(t)) = (\hat g_1(t, y_1(t)), \hat g_2(t, y_2(t)))^T = 0.4\,\mathrm{diag}(y_1(t), y_2(t))$, $t_{k+1} - t_k = 0.9$, $k \in \mathbb{N}^+$, $I_1 = I_2 = 0$. Then we have $N_0 = 1$, $l_1 = l_2 = 1$, $\lambda_1 = \lambda_2 = 0.16$, $\rho = 0.36$. Figure 1 shows the state trajectories of system (1). Figure 2 shows the synchronization error trajectories without the controller. It can be seen from Fig. 2 that system (1) and system (2) cannot achieve synchronization.

Fig. 1. State trajectories xi (t)(i = 1, 2) of drive system (1) with initial values x(0) = [1.2, 0.3]T .

Fig. 2. Synchronization error trajectories of systems (1) and (2) without controller.

Fig. 3. (a) represents synchronization error trajectories of Tc = 1 with controller (15). (b) represents synchronization error trajectories of Tc = 0.1 with controller (15).

Now, we choose the controller parameters $s = 3$, $m = 5$, $p = 3$, $q = 1$, $k_{11} = k_{12} = 15$, $k_{21} = k_{22} = k_{31} = k_{32} = 1$. It is easy to calculate that $\xi_1 = 2$, $\xi_2 = 1$, $W_1 = 5.9772$, $W_2 = 3.125$, and that $k_{11}$ and $k_{12}$ satisfy the conditions of Theorem 2. Figure 3


shows the state error trajectories at $T_c = 1$ and $T_c = 0.1$ under the action of controller (15), respectively. The error trajectories converge to zero within the predefined times, which illustrates the validity of Theorem 2. Then, consider five groups of two-dimensional fuzzy stochastic memristive neural networks with impulsive effects. The initial values of the drive-response systems are initialized randomly. To facilitate observation, we constrain the range of the random initialization: $x_i(0), y_i(0) \in [-10, 10]$, $i = 1, 2$. The other parameter values remain unchanged. The synchronization error trajectories of the drive-response systems with random initial values are shown in Fig. 4. It can be observed that the five groups of drive-response systems with random initial values achieve synchronization within the time $T_c$. Therefore, the simulations verify the validity of the theoretical results.

Fig. 4. (a) represents error trajectory of drive-response system with random initial values when Tc = 1. (b) represents error trajectory of drive-response system with random initial values when Tc = 0.1.

5 Conclusion

This paper investigates the problem of predefined-time synchronization in fuzzy stochastic memristive neural networks with impulsive effects. A new theorem for the predefined-time stability of systems with impulsive effects is obtained using some simple inequality techniques. Based on this theorem and a non-chattering control method, a simple feedback controller is designed to prevent the chattering phenomenon caused by the sign function, and a new criterion for achieving predefined-time synchronization in impulsive fuzzy stochastic memristive neural networks is presented. Finally, the effectiveness of the theoretical results is verified through simulation experiments. In the future, we will consider new solutions to reduce the degree of scaling in the derivation process.

Acknowledgements. This work is supported by the National Natural Science Foundation of China (Grant Nos. 62103165, 62101213), and the Natural Science Foundation of Shandong Province (Grant No. ZR2022ZD01).


References 1. Liu, J., Shu, L., Chen, Q., Zhong, S.: Fixed-time synchronization criteria of fuzzy inertial neural networks via Lyapunov functions with indefinite derivatives and its application to image encryption. Fuzzy Sets Syst. 459, 22–42 (2023) 2. Shanmugam, L., Mani, P., Rajan, R., Joo, Y.H.: Adaptive synchronization of reaction-diffusion neural networks and its application to secure communication. IEEE Trans. Cybern. 50(3), 911–922 (2018) 3. Zhou, C., Wang, C., Yao, W., Lin, H.: Observer-based synchronization of memristive neural networks under DoS attacks and actuator saturation and its application to image encryption. Appl. Math. Comput. 425, 127080 (2022) 4. Yao, W., Wang, C., Sun, Y., Gong, S., Lin, H.: Event-triggered control for robust exponential synchronization of inertial memristive neural networks under parameter disturbance. Neural Netw. 164, 67–80 (2023) 5. Zhao, M., Li, H.L., Zhang, L., Hu, C., Jiang, H.: Quasi-synchronization of discretetime fractional-order quaternion-valued memristive neural networks with time delays and uncertain parameters. Appl. Math. Comput. 453, 128095 (2023) 6. Kamenkov, G.: On stability of motion over a finite interval of time. J. Appl. Math. Mech. 17(2), 529–540 (1953) 7. Gao, L., Cai, Y.: Finite-time stability of time-delay switched systems with delayed impulse effects. Circuits Syst. Signal Process. 35, 3135–3151 (2016) 8. Abdurahman, A., Jiang, H., Teng, Z.: Finite-time synchronization for memristorbased neural networks with time-varying delays. Neural Netw. 69, 20–28 (2015) 9. Yang, X., Lu, J.: Finite-time synchronization of coupled networks with Markovian topology and impulsive effects. IEEE Trans. Autom. Control 61(8), 2256–2261 (2015) 10. Polyakov, A.: Nonlinear feedback design for fixed-time stabilization of linear control systems. IEEE Trans. Autom. Control 57(8), 2106–2110 (2011) 11. Fang, J., Zhang, Y., Liu, P., Sun, J.: Fixed-time output synchronization of coupled neural networks with output coupling and impulsive effects. Neural Comput. Appl. 33(24), 17647–17658 (2021). https://doi.org/10.1007/s00521-021-06349-0 12. Chen, C., Li, L., Peng, H., Yang, Y., Mi, L., Zhao, H.: A new fixed-time stability theorem and its application to the fixed-time synchronization of neural networks. Neural Netw. 123, 412–419 (2020) 13. Kong, F., Zhu, Q., Huang, T.: New fixed-time stability lemmas and applications to the discontinuous fuzzy inertial neural networks. IEEE Trans. Fuzzy Syst. 29(12), 3711–3722 (2020) 14. Sánchez-Torres, J. D., Gómez-Gutiérrez, D., López, E., Loukianov, A. G.: A class of predefined-time stable dynamical systems. IMA Journal of Mathematical Control and Information 35(Supplement_1), i1–i29 (2018) 15. Assali, E.A.: Predefined-time synchronization of chaotic systems with different dimensions and applications. Chaos, Solitons Fractals 147, 110988 (2021) 16. Chen, C., Mi, L., Liu, Z., Qiu, B., Zhao, H., Xu, L.: Predefined-time synchronization of competitive neural networks. Neural Netw. 142, 492–499 (2021) 17. Wan, P., Sun, D., Zhao, M.: Finite-time and fixed-time anti-synchronization of Markovian neural networks with stochastic disturbances via switching control. Neural Netw. 123, 1–11 (2020) 18. Li, L., Xu, R., Gan, Q., Lin, J.: A switching control for finite-time synchronization of memristor-based BAM neural networks with stochastic disturbances. Nonlinear Anal.: Modelling Control 25(6), 958–979 (2020)


19. Fan, H., Xiao, Y., Shi, K., Wen, H., Zhao, Y.: μ-synchronization of coupled neural networks with hybrid delayed and non-delayed impulsive effects. Chaos, Solitons Fractals 173, 113620 (2023) 20. Yang, T., Yang, L.B., Wu, C.W., Chua, L.O.: Fuzzy cellular neural networks: theory. In: 1996 Fourth IEEE International Workshop on Cellular Neural Networks and Their Applications Proceedings (CNNA-96), pp. 181–186 (1996) 21. Yang, T., Yang, L.B., Wu, C.W., Chua, L.O.: Fuzzy cellular neural networks: applications. In: 1996 Fourth IEEE International Workshop on Cellular Neural Networks and Their Applications Proceedings (CNNA-96), pp. 225–230 (1996) 22. Sadik, H., Abdurahman, A., Tohti, R.: Fixed-time synchronization of reactiondiffusion fuzzy neural networks with stochastic perturbations. Mathematics 11(6), 1493 (2023) 23. Liu, Y., Zhang, G., Hu, J.: Fixed-time stabilization and synchronization for fuzzy inertial neural networks with bounded distributed delays and discontinuous activation functions. Neurocomputing 495, 86–96 (2022) 24. Sánchez-Torres, J.D., Sanchez, E.N., Lou kianov, A.G.: A discontinuous recurrent neural network with predefined time convergence for solution of linear programming. In: 2014 IEEE Symposium on Swarm Intelligence, pp. 1–5 (2014) 25. Chen, J., Zeng, Z., Jiang, P.: Global Mittag-Leffler stability and synchronization of memristor-based fractional-order neural networks. Neural Netw. 51, 1–8 (2014) 26. Hardy, G.H., Littlewood, J.E., Pólya, G.: Inequalities. Cambridge University Press, Cambridge Mathematical Library (1934)

FE-YOLOv5: Improved YOLOv5 Network for Multi-scale Drone-Captured Scene Detection Chen Zhao1,2,3 , Zhe Yan1,2,3 , Zhiyan Dong1,2,3(B) , Dingkang Yang1,2,3 , and Lihua Zhang1,2,3,4 1 2

Academy for Engineering and Technology, Fudan University, Shanghai, China {21210860101,22210860063}@m.fudan.edu.cn Engineering Research Center of AI and Robotics, Ministry of Education, Beijing, China 3 Institute of AI and Robotics, Fudan University, Shanghai, China {dongzhiyan,dkyang20,lihuazhang}@fudan.edu.cn 4 Jilin Provincial Key Laboratory of Intelligence Science and Engineering, Changchun, China

Abstract. Due to the different angles and heights of UAV shooting, the shooting environment is complex and the shooting targets are mostly small, so the target detection task in drone-captured scenes is still challenging. In this study, we present a highly precise technique for identifying objects in scenes captured by drones, which we refer to as FE-YOLOv5. First, to optimize cross-scale feature fusion and maximize the utilization of shallow feature information, we propose a novel feature pyramid model called MSF-BiFPN as our primary approach. Furthermore, to improve the fusion of features at different scales and boost their representational power, our approach introduces an adaptive attention module. Moreover, we propose a novel feature enhancement module that effectively strengthens high-level features before feature fusion. This module minimizes feature loss during the fusion process, ultimately resulting in enhanced detection accuracy. Finally, the normalized Wasserstein distance serves as a new metric for enhancing the model's sensitivity and accuracy in detecting small targets. The experimental results of FE-YOLOv5 on the VisDrone dataset show that mAP 0.5 increased by 7.8% and mAP 0.5:0.95 increased by 5.7%. At the same time, the training results of the model at 960 × 960 image resolution are better than those of the current YOLO series models, among which mAP 0.5 can reach 56.3%. Based on the experiments conducted, the FE-YOLOv5 model effectively enhances the accuracy of object detection in UAV capture scenes.

Keywords: Adaptive attention · Feature enhancement · Multi-scale targets · YOLOv5

1 Introduction

Recently, as the Unmanned Aerial Vehicle (UAV) field continues to evolve, drones equipped with cameras have been widely used in agricultural inspection, urban


inspection [1,2], aerial photography, ecological protection [3], etc. Hence, automatic and efficient object detection is increasingly important in drone-captured scene parsing.

Fig. 1. The intuitive scenario shows three important problems encountered in object detection in the UAV scene. The scenarios in (a), (b), and (c) respectively illustrate the problems of complex background, large target scale variation, and high density in drone-captured scenes.

With the advent of convolutional neural networks (CNNs), numerous approaches have emerged in the field of target detection, including the two-stage detectors RCNN [4], Fast RCNN [5], Faster RCNN [6], etc., and the one-stage detectors SSD [7], RetinaNet [8], YOLO [9], etc. These methods have achieved good detection results on benchmark datasets such as MS COCO [10] and PASCAL VOC [11], and have promoted the development of object detection technology. Nevertheless, directly applying these methods to drone-captured scenes makes it difficult to achieve precise results, because most of the previous methods are designed to deal with images of natural scenes. Different from target detection in natural scene images, object detection in the UAV scene has the following problems; some cases in Fig. 1 intuitively illustrate the difficulties encountered. Figure 1(a) shows that there are many complex and disturbing backgrounds in drone-captured images, including heavy occlusion by buildings and dark environments at night. Figure 1(b) shows that, due to the different flying heights of the UAV, the scale of objects varies greatly; for example, nearby vehicles are large, while distant vehicles and pedestrians are small. Finally, Fig. 1(c) shows that there are high-density


distributions of objects in drone-captured scenes and the problem of mutual occlusion. The above three problems all make the target detection task in the UAV scene difficult. This paper follows the one-stage design and proposes the FE-YOLOv5 network to solve the problems mentioned above. Like the baseline network, we use CSPDarknet53 [12,13] as the backbone of the FE-YOLOv5 network. In the neck part, we propose a new weighted bidirectional feature pyramid network named MSF-BiFPN to achieve more efficient cross-scale feature fusion and fuller utilization of shallow features. In the head part, we add a new detection head to detect smaller targets. To enhance the integration of multi-scale feature information and enhance feature representation capabilities, we propose a novel Adaptive Attention Module (AAM). Additionally, a Feature Enhancement Module (FEM) is proposed to minimize feature loss and enhance the quality of feature information. By replacing Intersection over Union (IoU) with the Normalized Wasserstein Distance (NWD) [16] as a new evaluation indicator, the model becomes more sensitive to small targets, leading to improved performance of the FE-YOLOv5 model. The main contributions of this article are summarized as follows (a sketch of the NWD metric is given after this list):

– We propose MSF-BiFPN, which fuses more shallow features based on BiFPN while achieving more effective cross-scale feature fusion.
– To enhance the performance of the model, we incorporate a smaller detection head and introduce the Normalized Wasserstein Distance (NWD) as an alternative evaluation metric. This modification aims to improve the model's sensitivity towards smaller objects and ultimately enhance the accuracy of object detection.
– We propose a new Adaptive Attention Module (AAM), which extracts channel attention and spatial attention separately by fusing receptive fields of different sizes in the feature map, thereby enhancing the representation ability of features while achieving multi-scale feature fusion.
– We propose a new Feature Enhancement Module (FEM), which enhances feature information by fusing the contextual information of receptive fields of different sizes in the feature map and then replaces upsampling operations with sub-pixel convolutions to reduce feature loss.
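The NWD metric referenced above comes from [16]; the following is our illustrative sketch of its usual formulation, in which each box is modeled as a 2-D Gaussian. The normalizing constant C is a dataset-dependent hyperparameter and the value below is only an assumed example, not a value from this paper.

import numpy as np

def nwd(box_a, box_b, C=12.8):
    """Normalized Wasserstein distance between two boxes given as (cx, cy, w, h).

    Each box is modeled as a Gaussian N([cx, cy], diag(w^2/4, h^2/4)); the squared
    2-Wasserstein distance then reduces to a plain Euclidean distance, and
    NWD = exp(-sqrt(W2) / C) maps it into (0, 1], like an IoU-style similarity.
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    w2 = (cxa - cxb) ** 2 + (cya - cyb) ** 2 + ((wa - wb) / 2) ** 2 + ((ha - hb) / 2) ** 2
    return np.exp(-np.sqrt(w2) / C)

# Example: two small, slightly offset boxes still receive a meaningful similarity score.
print(nwd((10, 10, 8, 8), (12, 11, 8, 8)))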

2 Related Works

2.1 Object Detection in Drone-Captured Scenes

With the widespread application of drones and the advancement of target detection technology, more and more researchers have devoted themselves to the target detection task in drone-captured scenes to solve the various problems in this task, thereby improving detection accuracy and letting drones serve us better. Among them, Que et al. [17] addressed the noise caused by the complex environment in the UAV scene by adopting a strategy of mixed training on noisy and noise-free datasets, which allows the model to maintain high detection accuracy even on


noisy images. Literature [18] proposed a new model based on YOLOv5: ViT-YOLO. This work proposes an improved Transformer-based MHSA-Darknet backbone network, which improves model performance by retaining sufficient global context information and utilizing a multi-head attention mechanism to extract more features. Meanwhile, test-time augmentation (TTA) and weighted box fusion (WBF) [19] are integrated into the model to enhance its performance and robustness.

2.2 Multi-scale Feature Fusion

In the scene captured by the UAV, the scale of the targets changes drastically, so effectively processing multi-scale features is also one of the difficulties of object detection in this scenario. A significant contribution to the field was made by Literature [21], which introduced the Feature Pyramid Network (FPN). This framework effectively addresses the problem of feature information loss, resulting in improved accuracy in detection tasks. By merging high-level features with rich semantic information and shallow features with abundant spatial information, FPN successfully enhances the utilization of feature information. Following that, PANet [15] introduced bidirectional fusion on top of FPN, incorporating both top-down and bottom-up paths. This approach enhanced the utilization of features to a greater extent. EfficientDet [14] proposed BiFPN, which enhances the representation ability of features through effective bidirectional cross-scale connections and adds a weight to the fusion of each scale's features through weighted feature fusion, thereby adjusting the contribution of each scale. Literature [22] proposes a channel-enhanced feature pyramid network (CE-FPN). This network proposes a novel fusion approach, building on the concept of sub-pixel convolution [23].

3 Proposed Method

YOLOv5 [11] is currently one of the most widely used and accurate one-stage detectors, so we choose YOLOv5 as the benchmark network. YOLOv5 includes five different models: YOLOv5n, YOLOv5s, YOLOv5m, YOLOv5l, and YOLOv5x. From the training results on the VisDrone2021 dataset [24], although YOLOv5x performs best among the five models, its training computation cost is higher than that of the other four models. At the same time, the results of YOLOv5l are only slightly lower than those of YOLOv5x, but its parameter count is only about half that of YOLOv5x. Therefore, considering the performance-overhead trade-off, we choose YOLOv5l as our final baseline.

3.1 The FE-YOLOv5 Network Framework

The fundamental architecture of YOLOv5 consists of Input, Backbone, Neck, and Head components. The Backbone section primarily comprises Convolution modules, CSP modules, and SPPF [25] modules, and Path Aggregation Network


Fig. 2. The architecture of the FE-YOLOv5. The C3 module is composed of 3 convolution modules and a CSP Bottleneck. Add2 and Add3 refer to two and three-weighted feature maps to perform the Add operation. The lower right corner of the figure is the architecture of MSF-BiFPN. Among them, p3, p4, p5, and p6 respectively refer to four feature maps of different sizes. In addition, the red line indicates the shallow feature fusion supplemented based on BiFPN. (Color figure online)

(PANet) [15] is an integral part of the model's neck section. In the proposed FE-YOLOv5 model, we aim to enhance object detection in drone-captured scenarios. The model employs a backbone network to extract features from input images, producing feature maps of varying sizes. These feature maps are then processed in the model's neck, where feature fusion and enhancement techniques are applied. As a result, three feature maps (P3, P4, and P5) are generated. Finally, the model's head performs target detection on these feature maps, completing the detection process and generating the desired output. Figure 2 illustrates the framework of the FE-YOLOv5 model, which incorporates improvements specifically targeted at the difficulties posed by object detection in drone-captured scenarios.

3.2 Detection Head for Tiny Objects

We analyzed the images taken by drones and found that many small targets, such as pedestrians and bicycles, are challenging to detect. We therefore added an extra detection layer to the original YOLOv5 model to better detect these small targets. As shown in Fig. 2, this added detection head operates on shallow, higher-resolution feature maps, whose smaller receptive fields make the features more sensitive to small objects and give them better detection capabilities. Compared to the original model, using four detection heads effectively reduces the influence of changes in object scale and, in turn, enhances the model's ability to detect small targets.
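The effect of adding a higher-resolution head can be seen from the grid sizes each detection scale produces. The snippet below assumes a 960 × 960 input and a stride-4 placement for the extra head, which is a common choice in YOLOv5-style models but not confirmed by the paper; strides 8, 16, and 32 are the usual YOLOv5 detection strides.

```python
# Grid sizes produced by four detection heads for a 960x960 input.
# The stride-4 head is an assumed placement for the extra small-object head.
input_size = 960
strides = [4, 8, 16, 32]

for s in strides:
    cells = input_size // s
    print(f"stride {s:>2}: {cells} x {cells} grid ({cells * cells} cells)")
```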


3.3 MSF-BiFPN

In this section, we propose MSF-BiFPN, a bidirectional feature pyramid network capable of integrating more shallow features, which builds on BiFPN [14]. Compared to PANet [15], BiFPN uses a weighted feature fusion method that assigns different weights to the inputs during fusion by learning the contribution of each input feature. Because feature maps of different sizes have different receptive field sizes, they contribute differently to the fusion process for objects of different sizes. For instance, shallow high-resolution feature maps possess smaller receptive fields, making them more sensitive to small targets, so in small object detection the role of shallow feature maps is more significant. Therefore, we propose MSF-BiFPN based on BiFPN. The structure of MSF-BiFPN is shown in Fig. 2. By reusing the shallow feature information of the P3 and P4 layers multiple times, it can extract the features of small targets more effectively while achieving more effective cross-scale feature fusion.
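For reference, the weighted fusion that BiFPN (and hence MSF-BiFPN) relies on can be sketched as follows; this follows the fast normalized fusion form from EfficientDet [14], with tensor sizes chosen only for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WeightedAdd(nn.Module):
    """Fast normalized weighted fusion of n same-sized feature maps."""

    def __init__(self, n_inputs, eps=1e-4):
        super().__init__()
        self.weights = nn.Parameter(torch.ones(n_inputs))
        self.eps = eps

    def forward(self, feats):
        # ReLU keeps each learned weight non-negative.
        w = F.relu(self.weights)
        w = w / (w.sum() + self.eps)          # normalize the contributions
        return sum(w[i] * f for i, f in enumerate(feats))

if __name__ == "__main__":
    add3 = WeightedAdd(n_inputs=3)            # e.g. the "Add3" node in Fig. 2
    maps = [torch.randn(1, 256, 60, 60) for _ in range(3)]
    print(add3(maps).shape)  # torch.Size([1, 256, 60, 60])
```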

3.4 Adaptive Attention Module (AAM)

Due to the reduction of channels during feature extraction, contextual information is lost. To overcome this limitation, we introduce an innovative Adaptive Attention Module (AAM). This module leverages the fusion of feature information across various receptive fields in the feature map. Through the extraction of both channel attention and spatial attention, we achieve multi-scale feature fusion and improve the expressive power of the features.

Fig. 3. The architecture of AAM.

The specific structure of AAM is shown in Fig. 3. Firstly, the input feature map is subjected to a series of three max pooling operations, each using a 4 × 4 pooling kernel with a stride of 1 and a padding of 2 pixels. Afterward, the original feature map is combined with the feature maps obtained from the three pooling operations. By adopting this approach, the fusion of contextual features from various scales is achieved.


Subsequently, the fused feature map is fed into two separate branches: one branch extracts channel attention maps, while the other extracts spatial attention maps. The mapping results of the two branches are combined by an element-wise sum, yielding a fused 3D attention map. The fused feature map is then multiplied element-wise with this 3D attention map, and the result is added, channel by channel, to the 1 × 1 convolution of the original feature map, thus aggregating contextual information to obtain the final feature map. The final result carries feature information of different scales, which not only realizes the fusion of features across scales but also enhances the representation ability of the features.

Fig. 4. (a) is the architecture of channel attention module and (b) is the architecture of spatial attention module.

The channel attention module [26] in AAM is shown in Fig. 4(a). The first half of the AAM module generates an enhanced feature map by fusing feature information under different receptive fields. Every channel within the feature map corresponds to a particular feature response, which emphasizes the specific features that the channel concentrates on. As a result, the enhanced feature map contains channels associated with distinct receptive field sizes. Since the scale of targets in drone-captured scenes changes drastically, the channel attention module is needed to assign different weights to the channels corresponding to receptive fields of different sizes, so that the detection task can be completed more accurately. The spatial attention module [20] of AAM is shown in Fig. 4(b). The difference between the two is that the spatial attention module focuses on where the effective information is on the feature map rather than what it is. Due to factors such as excessive noise, low light at night, challenging terrain, and interference from nearby buildings, drone-captured scenes often make accurate detection difficult. With the spatial attention module, however, the model can concentrate on the locations of objects within the area and improve the overall detection performance.
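A simplified, hedged sketch of an AAM-style block is given below. It follows the description above (stacked max pooling for context, an SE-style channel branch, a CBAM-style spatial branch, and re-injection of the original features through a 1 × 1 convolution), but several details are assumptions: size-preserving 3 × 3 pooling replaces the 4 × 4 setting so that shapes stay aligned, and element-wise summation is used wherever the paper only says the maps are "combined".

```python
import torch
import torch.nn as nn

class AAMSketch(nn.Module):
    """Simplified Adaptive Attention Module: context pooling + channel/spatial attention."""

    def __init__(self, channels, reduction=16):
        super().__init__()
        # Stacked max-pooling ops enlarge the receptive field; 3x3/stride-1
        # pooling is used here so the pooled maps keep the input resolution.
        self.pool = nn.MaxPool2d(kernel_size=3, stride=1, padding=1)
        # Channel attention branch (SE-style).
        self.channel_att = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1),
            nn.Sigmoid(),
        )
        # Spatial attention branch (CBAM-style): 7x7 conv over channel-wise mean/max.
        self.spatial_att = nn.Sequential(
            nn.Conv2d(2, 1, kernel_size=7, padding=3),
            nn.Sigmoid(),
        )
        self.proj = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, x):
        # Fuse context from several receptive fields (element-wise sum assumed).
        p1 = self.pool(x)
        p2 = self.pool(p1)
        p3 = self.pool(p2)
        fused = x + p1 + p2 + p3
        # Channel branch gives (B, C, 1, 1); spatial branch gives (B, 1, H, W).
        ca = self.channel_att(fused)
        sa = self.spatial_att(
            torch.cat([fused.mean(dim=1, keepdim=True),
                       fused.max(dim=1, keepdim=True).values], dim=1))
        att = ca + sa                       # broadcast sum -> (B, C, H, W)
        return fused * att + self.proj(x)   # re-inject the original features

if __name__ == "__main__":
    x = torch.randn(1, 64, 40, 40)
    print(AAMSketch(64)(x).shape)  # torch.Size([1, 64, 40, 40])
```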


3.5 Feature Enhancement Module (FEM)

During feature fusion, semantic information may be lost when high-level low-resolution feature maps are transformed into low-level high-resolution feature maps. To address this issue, we draw inspiration from the CE-FPN [22] model and incorporate sub-pixel convolution [23]. Our proposed FEM module enhances the features that require upsampling by merging contextual information from various receptive fields in the feature map. This results in improved accuracy for multi-scale target detection.

Fig. 5. The architecture of FEM.

As shown in Fig. 5, the FEM module has two branches. The first branch obtains receptive fields of different sizes through dilated convolutions [27] with different rates while extracting local information from the feature maps. First, the feature map (2W × 2H × 8C) undergoes dilated convolutions with rates of 3, 5, and 7 to obtain three feature maps of the same size (2W × 2H × 4C); these three feature maps are then passed through the ReLU function and aggregated by element-wise summation. This fusion combines local information from receptive fields of various sizes and generates a feature map with dimensions of 2W × 2H × 4C. Next, this feature map is upsampled by a factor of two through sub-pixel convolution, yielding a feature map of size 4W × 4H × C. The primary purpose of the second branch is to acquire global information from the feature map. In this branch, the feature map (2W × 2H × 8C) undergoes a global max pooling and a global average pooling, producing two global descriptors of size 1 × 1 × 8C. These two descriptors are added and then passed through the Sigmoid function and a 1 × 1 convolutional layer to obtain a 1 × 1 × C feature map. Finally, the 1 × 1 × C feature map from the second branch is expanded to 4W × 4H × C through broadcasting and summed element-wise with the feature map from the first branch; the result is passed through a 1 × 1 convolutional layer to obtain the enhanced upsampling result of the initial feature.
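The two-branch structure can be sketched as follows; the 3 × 3 kernel size of the dilated convolutions and the order of the Sigmoid and 1 × 1 convolution in the global branch are assumptions, since the description above only fixes the dilation rates and the tensor shapes.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FEMSketch(nn.Module):
    """Simplified Feature Enhancement Module: dilated local branch + global branch."""

    def __init__(self, in_ch):              # in_ch plays the role of 8C
        super().__init__()
        mid = in_ch // 2                     # 4C
        out = in_ch // 8                     # C
        # Branch 1: parallel dilated convolutions (3x3 kernel is an assumption).
        self.dil = nn.ModuleList(
            [nn.Conv2d(in_ch, mid, 3, padding=r, dilation=r) for r in (3, 5, 7)])
        self.shuffle = nn.PixelShuffle(2)    # sub-pixel conv: 4C -> C, 2x upsample
        # Branch 2: squeeze the global descriptor down to C channels.
        self.squeeze = nn.Conv2d(in_ch, out, 1)
        self.fuse = nn.Conv2d(out, out, 1)

    def forward(self, x):                    # x: (B, 8C, 2H, 2W)
        local = sum(F.relu(conv(x)) for conv in self.dil)              # (B, 4C, 2H, 2W)
        local = self.shuffle(local)                                    # (B, C, 4H, 4W)
        g = F.adaptive_max_pool2d(x, 1) + F.adaptive_avg_pool2d(x, 1)  # (B, 8C, 1, 1)
        g = torch.sigmoid(self.squeeze(g))                             # (B, C, 1, 1)
        return self.fuse(local + g)          # broadcast add, then 1x1 convolution

if __name__ == "__main__":
    x = torch.randn(1, 64, 20, 20)           # 8C = 64 -> C = 8
    print(FEMSketch(64)(x).shape)            # torch.Size([1, 8, 40, 40])
```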

3.6 Normalized Wasserstein Distance (NWD)

It is pointed out in NWD [16] that IoU [28] is highly sensitive to small location offsets for the boxes of small objects. The method proposed by NWD is, first, to remodel each box as a two-dimensional Gaussian distribution, which better matches the characteristics of small targets, and to replace the IoU between the ground-truth box and the prediction box with the similarity between the two distributions; secondly, the Normalized Wasserstein Distance (NWD) is used as a new evaluation metric to quantify the similarity between the two distributions. The NWD loss is defined as follows:

W2²(Na, Nb) = ‖[cxa, cya, wa/2, ha/2]ᵀ − [cxb, cyb, wb/2, hb/2]ᵀ‖₂²,    (1)

NWD(Na, Nb) = exp(−√(W2²(Na, Nb)) / C),    (2)

LNWD = 1 − NWD(Na, Nb),    (3)

where A = (xa, ya, wa, ha) is the prediction box, B = (xb, yb, wb, hb) is the ground-truth box, and C is a constant closely related to the dataset. However, we found during the experiments that when only NWD is used for the loss calculation, model fitting becomes significantly slower, so we combine the NWD loss with the IoU loss as follows:

L = (1 − NWD(Na, Nb)) · α + (1 − IoU) · (1 − α),    (4)

IoU = |A ∩ B| / |A ∪ B|,    (5)

where α is a custom coefficient; in this paper α = 0.6.
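A minimal sketch of the combined loss in Eqs. (1)–(5) is shown below; the (cx, cy, w, h) box format and the placeholder value of the dataset-dependent constant C are assumptions.

```python
import torch

def nwd(box_a, box_b, c=12.8):
    """Normalized Wasserstein Distance for (cx, cy, w, h) boxes, per Eqs. (1)-(2).
    The constant c is dataset dependent; 12.8 is only a placeholder."""
    pa = torch.stack([box_a[..., 0], box_a[..., 1], box_a[..., 2] / 2, box_a[..., 3] / 2], dim=-1)
    pb = torch.stack([box_b[..., 0], box_b[..., 1], box_b[..., 2] / 2, box_b[..., 3] / 2], dim=-1)
    w2 = ((pa - pb) ** 2).sum(dim=-1)            # squared 2-Wasserstein distance
    return torch.exp(-torch.sqrt(w2) / c)

def iou(box_a, box_b):
    """IoU for axis-aligned (cx, cy, w, h) boxes, per Eq. (5)."""
    ax1, ay1 = box_a[..., 0] - box_a[..., 2] / 2, box_a[..., 1] - box_a[..., 3] / 2
    ax2, ay2 = box_a[..., 0] + box_a[..., 2] / 2, box_a[..., 1] + box_a[..., 3] / 2
    bx1, by1 = box_b[..., 0] - box_b[..., 2] / 2, box_b[..., 1] - box_b[..., 3] / 2
    bx2, by2 = box_b[..., 0] + box_b[..., 2] / 2, box_b[..., 1] + box_b[..., 3] / 2
    iw = (torch.min(ax2, bx2) - torch.max(ax1, bx1)).clamp(min=0)
    ih = (torch.min(ay2, by2) - torch.max(ay1, by1)).clamp(min=0)
    inter = iw * ih
    union = box_a[..., 2] * box_a[..., 3] + box_b[..., 2] * box_b[..., 3] - inter
    return inter / union.clamp(min=1e-7)

def combined_box_loss(pred, target, alpha=0.6):
    """Eq. (4): blend the NWD and IoU losses with weight alpha."""
    return alpha * (1 - nwd(pred, target)) + (1 - alpha) * (1 - iou(pred, target))

if __name__ == "__main__":
    pred = torch.tensor([[50.0, 50.0, 10.0, 12.0]])
    gt = torch.tensor([[52.0, 49.0, 11.0, 10.0]])
    print(combined_box_loss(pred, gt))
```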

4 Experiments and Analysis

We assess the model’s performance using the VisDrone2021 dataset [24], which exclusively includes images captured by drones. This dataset comprises ten distinct categories of images with a grand total of 2.6 million labels. Among the 10,209 static images in the dataset, 6,471 are allocated for training, 3,190 for testing, and 548 for validation purposes. 4.1

4.1 Experimental Setting and Evaluation Criteria

This paper uses the Ubuntu 20.04 operating system, with an experimental environment consisting of CUDA 11.3, Python 3.8, and PyTorch 1.11.0. All models are trained and assessed with the same hyperparameters on NVIDIA RTX 3090 GPUs. Specifically, the number of epochs, batch size, and image size are set to 200, 8, and 960 × 960, respectively. In addition, we choose Precision, mAP 0.5 (mean Average Precision), mAP 0.5:0.95, Params, and FLOPs as the evaluation indicators of the model.


4.2 Ablation Studies

In order to illustrate the improved accuracy of target detection in drone-captured scenes with our proposed method, we conducted ablation experiments on the VisDrone2021 dataset. The outcomes of these experiments are presented in Table 1, highlighting the improved detection accuracy achieved by our method.

Table 1. Ablation study

     Models                   mAP 0.5(%)   mAP 0.5:0.95(%)   FLOPs(G)
A    YOLOv5l (Baseline)       48.5         29.5              107.8
B    +New detection head      50.6         31.0              132.6
C    +MSF-BiFPN               51.5         31.5              137.8
D    +AAM                     53.4         32.3              155.4
E    +FEM                     55.6         34.4              232.4
F    Ours                     56.3         35.2              232.4

Table 1 shows the ablation results of adding each module to the YOLOv5l baseline. The mAP 0.5 and mAP 0.5:0.95 of the YOLOv5l baseline on the VisDrone dataset are 48.5% and 29.5%, respectively. The results of experiment B show that adding an additional detection head increases mAP 0.5 and mAP 0.5:0.95 by 2.1% and 1.5%, respectively, which improves the detection accuracy for small targets. In experiment C, on top of the small-target detection head, the PANet neck of the YOLOv5 model was replaced by MSF-BiFPN; compared with B, mAP 0.5 and mAP 0.5:0.95 increased by 0.9% and 0.5%, respectively, suggesting that MSF-BiFPN leads to enhanced cross-scale feature fusion and better model performance. The model of experiment D adds our proposed AAM module on top of the model of experiment C, and the results improve by 1.9% and 0.8%, respectively, suggesting that the AAM module is capable of integrating features of various scales and thereby enhancing detection performance. Compared with experiment D, mAP 0.5 and mAP 0.5:0.95 in experiment E improve by 2.2% and 2.1%, respectively, indicating that the FEM module can enhance features, reduce feature loss, and optimize the model. Experiment F combines all improvements, including NWD; the final model improves by 7.8% and 5.7% on mAP 0.5 and mAP 0.5:0.95, respectively, which is the best performance among all the models. Although integrating all these enhancements substantially increases the computational requirements of the model, both mAP 0.5 and mAP 0.5:0.95 are greatly improved, improving the effect of target detection in drone-captured scenes.

Table 2. Comparative experiments

Models          mAP 0.5(%)   mAP 0.5:0.95(%)   Params(M)   FLOPs(G)   FPS
YOLOv5l         48.5         29.5              46.2        107.8      76.9
DETR            20.5         8.4               60.0        253.0      10.7
Faster R-CNN    35.7         20.1              60.0        246.0      20.2
YOLOv5x         50.3         30.9              86.3        204.8      47.9
YOLOv6          47.6         29.1              58.5        144.0      64.9
YOLOv7          56.3         33.3              71.3        189.9      48.7
YOLOv8          54.0         33.9              68.1        257.4      27.8
Ours            56.3         35.2              86.2        232.4      28.2

4.3 Comparative Experiments

This comparative experiment aims to demonstrate the superiority of the FE-YOLOv5 model over other recent detection models and to quantitatively analyze target detection results in drone-captured scenes. FE-YOLOv5 is compared with the baseline YOLOv5l, with YOLOv5x, YOLOv6 [29], YOLOv7 [30], and YOLOv8, with the classic two-stage model Faster R-CNN [6], and with the widely discussed DETR [31]. The YOLOv6, YOLOv7, and YOLOv8 models all use their largest network structures; likewise, Faster R-CNN uses the Faster R-CNN-R101-FPN model and DETR uses the DETR-DC5-R101 model, both the largest in their respective families. The epoch, batch size, and image size of all models in the comparative experiment were set to 200, 8, and 960 × 960, respectively, and the results can be observed in Table 2. As can be seen from Table 2, the FLOPs of our model are 232.4G, which is much higher than the baseline YOLOv5l and YOLOv6, somewhat higher than YOLOv5x and YOLOv7, and lower than Faster R-CNN, DETR, and YOLOv8. In terms of performance, our method reaches 56.3% mAP 0.5 on the VisDrone2021 dataset, the same as YOLOv7, while its mAP 0.5:0.95 reaches 35.2%, which is 1.9 points higher than YOLOv7 and 1.3 points higher than the second-best result from YOLOv8, making it the highest among all compared models. These findings show that our proposed method improves clearly over YOLOv5l. Although the amount of computation has increased, it is not much different from that of the largest network structures of the current YOLO series models, and the increased cost does mean that real-time performance may not be optimal. However, at an equivalent computational level, the improved model demonstrates superior target detection capability in UAV shooting


scenarios. Additionally, our proposed model has great practical significance for improving object detection accuracy in UAV scenarios.

Fig. 6. Comparison chart of test results. (a) (b) (c) in the figure respectively represent the three problems encountered in target detection in the drone-captured scene, that is, the complex environment, dense distribution of targets, and large changes in target scale.

To demonstrate the detection capability of FE-YOLOv5 in real scenarios, we test the baseline (YOLOv5l), YOLOv5x, and FE-YOLOv5 on representative images from the VisDrone2021 dataset; the detection results are shown in Fig. 6. The three groups of pictures (a), (b), and (c) in Fig. 6 represent three problems encountered in target detection in drone-captured scenes: Fig. 6(a) shows the complex background caused by the dark night environment and the surrounding buildings, while Fig. 6(b) and Fig. 6(c) show, respectively, the high-density distribution of cars on the road and drastic changes in target scale. As highlighted by the red circles in Fig. 6, FE-YOLOv5 can not only detect small targets missed by both YOLOv5l and YOLOv5x but also improve detection accuracy.

5 Conclusion

In this paper, we present the FE-YOLOv5 algorithm as a promising approach for object detection in UAV shooting situations. Our main goal is to improve the accuracy of detecting complex backgrounds with high-density, multi-scale


objects. Compared to the existing YOLO models, our proposed model achieves superior detection accuracy. The experimental results demonstrate that the enhanced model outperforms the largest models of the current YOLO series in detection performance. At an input resolution of 960 × 960, the mAP 0.5 and mAP 0.5:0.95 of the FE-YOLOv5 algorithm reach 56.3% and 35.2%, respectively, which are 7.8 and 5.7 percentage points higher than the baseline. However, the performance gain comes with an increased computational load, which reduces the real-time performance of the detection process. Hence, our future research will investigate more lightweight and efficient models, as well as more advanced detection models, to achieve better target detection performance in scenes captured by drones.

References 1. Audebert, N., Le Saux, B., Lefèvre, S.: Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks. ISPRS J. Photogramm. Remote. Sens. 140, 20–32 (2018) 2. Gu, J., Su, T., Wang, Q., et al.: Multiple moving targets surveillance based on a cooperative network for multi-UAV. IEEE Commun. Mag. 56(4), 82–89 (2018) 3. Hird, J.N., Montaghi, A., McDermid, G.J., et al.: Use of unmanned aerial vehicles for monitoring the recovery of forest vegetation on petroleum well sites. Remote Sensing 9(5), 413 (2017) 4. Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014) 5. Girshick R. Fast r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2015: 1440–1448 6. Ren S, He K, Girshick R, et al. Faster r-cnn: towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28 (2015) 7. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https:// doi.org/10.1007/978-3-319-46448-0_2 8. Redmon, J., Divvala, S., Girshick, R., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016) 9. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48 10. Everingham, M., Van Gool, L., Williams, C.K.I., et al.: The pascal visual object classes (voc) challenge. Int. J. Comput. Vision 88, 303–338 (2010) 11. Jocher, G., Stoken, A., Borovec, J., et al.: ultralytics/yolov5: v5. 0-YOLOv5-P6 1280 models, AWS, Supervise. ly and YouTube integrations. Zenodo (2021) 12. Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)


13. Wang, C.Y., Liao, H.Y.M., Wu, Y.H., et al.: CSPNet: a new backbone that can enhance learning capability of CNN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 390–391 (2020) 14. Tan, M., Pang, R., Le, Q.V.: Efficientdet: scalable and efficient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020) 15. Liu, S., Qi, L., Qin, H., et al.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759–8768 (2018) 16. Wang, J., Xu, C., Yang, W., et al.: A normalized Gaussian Wasserstein distance for tiny object detection. arXiv preprint arXiv:2110.13389 (2021) 17. Que, J.F., Peng, H.F., Xiong, J.Y.: Low altitude, slow speed and small size object detection improvement in noise conditions based on mixed training. J. Phys. Conf. Ser. IOP Publishing 1169(1), 012029 (2019) 18. Zhang, Z., Lu, X., Cao, G., et al.: ViT-YOLO: transformer-based YOLO for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2799–2808 (2021) 19. Solovyev, R., Wang, W., Gabruseva, T.: Weighted boxes fusion: ensembling boxes from different object detection models. Image Vis. Comput. 107, 104117 (2021) 20. Woo, S., Park, J., Lee, J.Y., et al.: Cbam: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3–19 (2018) 21. Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017) 22. Luo, Y., Cao, X., Zhang, J., et al.: CE-FPN: enhancing channel information for object detection. Multimed. Tools Appl. 81(21), 30685–30704 (2022) 23. Shi, W., Caballero, J., Huszár, F., et al.: Real-time single image and video superresolution using an efficient sub-pixel convolutional neural network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1874– 1883 (2016) 24. Zhu, P., Wen, L., Bian, X., et al.: Vision meets drones: a challenge. arXiv preprint arXiv:1804.07437 (2018) 25. He, K., Zhang, X., Ren, S., et al.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015) 26. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018) 27. Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122 (2015) 28. Rezatofighi, H., Tsoi, N., Gwak, J.Y., et al.: Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 658–666 (2019) 29. Li, C., Li, L., Jiang, H., et al.: YOLOv6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022)


30. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7464– 7475 (2023) 31. Zhu, X., Su, W., Lu, L., et al.: Deformable detr: deformable transformers for endto-end object detection. arXiv preprint arXiv:2010.04159 (2020)

An Improved NSGA-II for UAV Path Planning Wei Hang Tan2 , Weng Kin Lai1(B) , Pak Hen Chen2 , Lee Choo Tay1 , and Sheng Siang Lee3 1 Faculty of Engineering and Technology, Centre for Multimodal Signal Processing, Tunku

Abdul Rahman University of Management and Technology, Kuala Lumpur, Malaysia [email protected] 2 Department of Electrical and Electronics Engineering, Faculty of Engineering and Technology, Tunku Abdul Rahman University of Management and Technology, Kuala Lumpur, Malaysia 3 Aonic (HQ), Taman Perindustrian UEP, Subang Jaya, Selangor, Malaysia

Abstract. Palm oil is an edible vegetable oil that can be used in a wide range of products across different industries, ranging from food and beverages, personal care and cosmetics, animal feed, and industrial products to biofuel. The palm oil industry contributes slightly less than 4% of Malaysia's overall GDP, and Malaysia is the world's second-largest producer and exporter of palm oil. In Malaysia, it has been estimated that there are around 500,000 plantation workers in the palm oil industry. In addition to securing a sufficient and steady supply of such usually low-skilled workers, there are also issues related to the limits of the human body in performing tough physical work. As a result, UAVs may be utilized to support some of the processes in the palm oil business. However, the power of the batteries used in these UAVs is finite before they need to be recharged. Hence, the flight path of the UAV should be optimally computed for it to be able to cover the area it is assigned. In this paper, an improved Non-Dominated Sorting Genetic Algorithm II (NSGA-II) was developed to compute the optimal flight path of UAVs, which also takes the turning angle and elevation into account. Enhancements to the algorithm are made by improving the selection, crossover, and mutation operations of the genetic algorithm, which helps to improve the convergence and diversity of the algorithm besides avoiding getting trapped in local optima. In the majority of the tests, the improved NSGA-II was able to generate paths that are better than those identified by the human expert. Moreover, the proposed improved NSGA-II algorithm was able to compute good paths in less than the threshold of 10 min.

Keywords: Path planning · UAVs · metaheuristics · genetic algorithm · smart agriculture

1 Introduction

1.1 Research Background

Malaysia is the second largest producer and exporter of palm oil in the world [1]. In 2020, Malaysia's palm oil industry's export revenue contributed approximately 4% of Malaysia's total GDP [1]. To produce high-quality palm oil fruit, many aspects of its


value chain, from planting to harvesting and milling, have to be carefully managed. It is a laborious and taxing process, especially as many aspects are still done manually, without the help of machines and automation. As a result, it is not uncommon for palm oil plantations to have a large labour force. In Malaysia, it has been estimated that there are around 500,000 plantation workers, the majority of whom are low-skilled foreign workers [2, 3]. During the COVID-19 pandemic, travel restrictions prevented the entry of migrant labour, which led to a severe shortage of plantation workers and, consequently, huge financial losses to the palm oil sector [4]. Furthermore, humans cannot maintain consistent productivity due to the physical limitations of the human body as well as judgment calls; humans are known to be inconsistent when it comes to quality assessment in some aspects of the palm oil production value chain [2]. Moreover, new workers need to be adequately trained, which is both cost and time inefficient. Lastly, workers' salaries also need to be adjusted upwards periodically to motivate better productivity [2]. Therefore, UAVs may be used to assist the palm oil industry in managing some aspects of the growth of the palm oil trees in the plantations. This can involve aerial dispensing of insecticides and fertilizer, detecting unhealthy trees, etc. [5, 6]. Unfortunately, UAVs have a finite battery life, as they cannot fly indefinitely over large tracts of land before stopping to recharge [7]. Therefore, proper path planning is required for better utilization of these UAVs when they are used to provide various services to the palm oil industry. When planning such flights, the path planner needs to consider the overall distance flown, the turning angle, the maximum battery life, and the flight elevation; a good flight path must account for all these aspects to determine its quality. The batteries used for the UAVs in this paper can only sustain a flight time that does not exceed 10 min.

2 Review of Prior Work

There has been some prior related work involving the use of swarm intelligence for UAV path planning. One of the earlier works, by XiaoWei Jiang, Qiang Zhou, and Ying Ye, investigated how using UAV systems may lower a logistics company's operating costs while increasing transport efficiency [8]. In this study, the authors solved a vehicle routing problem with time windows (VRPTW) task-assignment model with several constraints, including weight coefficients, time-window constraints, UAV constraints, and others, by using an improved PSO method that can handle complicated combinatorial optimization problems. The performance of the PSO and GA algorithms in handling the UAV task assignment was also compared. On the other hand, Z. Wang, G. Liu, and A. Li [9, 10] used a hybrid algorithm combining simulated annealing and particle swarm optimization (SAPSO) to solve three-dimensional path planning for UAVs. They deployed Simulated Annealing (SA) to offset the limitation of PSO, which frequently converges to local optima. The proposed approach was able to offer shorter paths than the traditional PSO algorithm. Furthermore, Agnihotri and Gupta [11] used a hybrid algorithm combining Particle Swarm Optimization with a Genetic Algorithm to solve the routing of wireless sensor networks (WSN). Their PSO-GA method combines the benefits of each approach, resulting in a fast convergence rate and a low


likelihood of falling into local optima. As a result, over the generations, the PSO-GA algorithm can gradually increase the number of good individuals. Huang et al. also studied this hybrid algorithm, replacing the traditional PSO algorithm with improvements for solving UAV multi-target path planning [10]. Their approach is able to balance the global search with the local search of the particles to produce results that are superior in terms of accuracy and convergence speed. Zhao et al. [12] investigated the use of Ant Colony Optimization (ACO) to enhance the effectiveness of the prevalent cost-minimization model in the cold chain logistics distribution process. In determining the best path for the vehicles, Zhao created an improved version of ACO with multiple objective functions. They were able to achieve lower delivery costs and carbon emissions as well as improved customer satisfaction. Their approach demonstrated that ACO can be used to address such difficult problems, but adjustments have to be made, as the basic ACO can become stuck in local optima. NSGA-II is a modified version of NSGA [13] that was first introduced by Deb et al. [14] for solving constrained multi-objective problems. The differences between the two versions are that NSGA-II is faster, has a better sorting algorithm, includes elitism, which protects already-found Pareto-optimal solutions from being deleted, and uses an explicit diversity-preserving mechanism. The complexity of NSGA-II is at most O(MN²), while the complexity of NSGA is O(MN³), where M is the number of objectives and N is the population size. NSGA-II incorporates the standard GA operators (selection, crossover, and mutation) with non-dominated sorting and a new fitness value, the "crowding distance", which is assigned in order to measure the density of solutions surrounding a particular solution. Shuai et al. [15] improved the NSGA-II algorithm by improving the local and global search capability of the GA, designing crossover and mutation operators to solve the multiple travelling salesman problem so that each salesman could visit every assigned city while covering the shortest possible distance and keeping the difference in distance between salesmen as small as possible. They used 6 different computational test conditions to analyze the effectiveness of the proposed algorithm and compared the results with 4 other algorithms. They showed that their proposed NSGA-II can increase the effectiveness of deriving the dominant front while providing results with a good amount of variation. Panda [16] investigated the performance of GA, PSO, and SA in solving the Traveling Salesman Problem (TSP) with different numbers of nodes. PSO showed the best performance in finding the shortest distance between nodes compared with either GA or SA, but it takes a longer execution time. Another similar piece of research, by Lathee et al. [17], compared the performance of GA and ACO in solving the TSP; their results showed that GA requires less execution time to find the shortest path than ACO, but with weaker performance. UAVs deployed in palm oil estates to spray pesticides, fungicides, or fertilizers, or simply to identify diseased trees, can help improve productivity, but an optimal flight path is important for efficient use of their battery power. From the review of prior work, NSGA-II has been shown to be efficient and effective in finding optimal paths; hence it is used to compute the optimal paths over the palm oil estates in this study. Nevertheless, as there are now more constraints to be fulfilled, some improvements to the basic NSGA-II are necessary.


3 Methodology

3.1 NSGA-II Algorithm

The NSGA-II algorithm is a population-based meta-heuristic algorithm that explores the solution space to determine a set of non-dominated solutions. It is a well-known and widely accepted way to obtain suitable solutions to difficult multi-objective problems. The main objective of the NSGA-II algorithm is to locate a collection of solutions organized along fronts using the Pareto dominance principle [18]. The Pareto dominance concept compares multiple candidate solutions, finds the solutions that dominate others, and classifies the candidates into different ranks (where 1 is the highest, 2 the next highest, and so on down to the lowest). For two solutions A and B, solution A dominates solution B if either of the following rules is satisfied: 1. All objectives of solution A perform better than those of solution B. 2. At least one objective of solution A performs better than that of solution B, and the remaining objectives of solution A perform at least as well as those of solution B. If one objective of solution A is weaker than that of solution B but another is better, then A and B are mutually non-dominated [18]. Once all the solutions have been categorized according to the above criteria, the best dominant solutions are used to create the next population, which ensures the convergence of the NSGA-II algorithm. After the non-dominated sorting, NSGA-II also applies a crowding distance sorting step, which estimates the density of candidate solutions around a particular solution by taking into account the mean distance between its two neighbouring solutions; the solution with the larger mean distance between its two neighbours is selected [18].

General Framework. The flowchart of NSGA-II with the improved genetic algorithm is shown in Fig. 1(a), and the process of generating the new population Pt+1 is shown in Fig. 1(b). There are two parts to the selection operator: non-dominated sorting and crowding distance sorting. First, the population Pt is randomly generated. The generated population Pt is used by the Genetic Algorithm to produce the improved population Qt, and the current population Pc then becomes Pt + Qt [14]. To produce the boundary sets R = r1, r2, r3, ..., the current population Pc is put through the non-dominated sorting process (which categorizes the current population Pc into several ranks; the highest rank, i.e. rank 1, is the non-dominated subset of the current population). The boundary sets are added to the new population Pt+1 in order of rank, starting from the highest rank (1), then the next highest (2), and so on down to the lowest rank. If the size of a boundary set r is larger than the space remaining in Pt+1, that boundary set is passed through crowding distance sorting to calculate the mean distance between the two neighbouring solutions on the same non-dominated front, and the solutions with higher values are selected and added to Pt+1 [15]. The solutions in Pt+1 are used for comparison in the following iterations to improve the quality of the solutions, and they are updated whenever better solutions are found. After all iterations are finished, the best solution found is the final solution.
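A minimal sketch of the two sorting ingredients described above, assuming all three objectives (distance, turning angle, elevation) are minimized:

```python
def dominates(a, b):
    """True if objective vector a Pareto-dominates b (all objectives minimized)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def crowding_distance(front):
    """Crowding distance of each solution in one non-dominated front."""
    n, m = len(front), len(front[0])
    dist = [0.0] * n
    for k in range(m):                                   # one objective at a time
        order = sorted(range(n), key=lambda i: front[i][k])
        dist[order[0]] = dist[order[-1]] = float("inf")  # boundary solutions always kept
        span = front[order[-1]][k] - front[order[0]][k] or 1.0
        for j in range(1, n - 1):
            dist[order[j]] += (front[order[j + 1]][k] - front[order[j - 1]][k]) / span
    return dist

if __name__ == "__main__":
    # Objectives: (distance, turning angle, elevation) - smaller is better.
    a, b = (100.0, 30.0, 5.0), (120.0, 40.0, 5.0)
    print(dominates(a, b))                # True: a is no worse everywhere, better somewhere
    front = [(100, 30, 5), (90, 45, 6), (110, 25, 4)]
    print(crowding_distance(front))
```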

Fig. 1. NSGA-II process of generating new populations: (a) flow chart of NSGA-II; (b) generating the new population Pt+1.

For NSGA-II, the temperature is decreased at each iteration and its final value is then used in the mutation operator. Furthermore, a randomly generated number is checked against pChange to decide whether the weight vectors will be changed. At the end of the iterations, the Pareto-dominated set, which is the final solution, is determined. The


Pareto-dominated set is checked to see whether any segments of a path intersect, using the method that identifies the orientation of two cross-product vectors; this leaves only the paths that do not have any intersecting segments. Finally, the voltage, calculated from Eq. (1), is used to find the lowest-voltage path in the Pareto-dominated set.

voltage = 0.0000007·f(D) + 0.00056·f(θ) + 0.021·f(E)    (1)

where f(D) is the total length of the path, f(θ) is the total turning angle of the path, and f(E) is the total elevation of the path. To avoid any one feature of the path from monopolising the solutions, each feature is multiplied by a weighting factor; the values of these weighting factors were found after extensive testing.

3.2 Improvements to the Algorithm

Crossover Operator. A Comprehensive Sequential Constructive Crossover (CSCX) operator is used in the algorithm to improve the coverage rate and the quality of the algorithm in finding the best solution for multi-objective path optimization. CSCX combines the Reverse Greedy Sequential Constructive Crossover (RGSCX) and Greedy Sequential Constructive Crossover (GSCX) operators [19] to form two new offspring. For the GSCX operator, the cheaper of the two candidate next nodes is chosen and appended to the incomplete offspring chromosome. For example, if P1 is {1, 6, 3, ...} and P2 is {1, 5, 7, 6, 8, ...}, the first gene and the next gene of P1 are 1 and 6 respectively, and the distance between them is 9; for P2, the first two genes are 1 and 5, with a distance of 13. As the distance from 1 to 6 in P1 is less than the distance from 1 to 5 in P2, the pair (1–6) is accepted and the new chromosome becomes {1, 6}. After that, the next candidate node from P1 becomes 3, while it is 8 for P2. This is repeated until the new chromosome is complete. The RGSCX operator is the reverse of GSCX: it constructs the offspring in reverse order, starting from the last gene and working towards the first. In this research, three objectives need to be considered: the distance, the turning angle, and the elevation. Therefore, when choosing the next node, the fitness function shown in Eq. (2) is used.

F = wd·d + wz·z + wθ·θ    (2)
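The greedy construction step of GSCX can be sketched as follows; for brevity, plain Euclidean distance between made-up node coordinates stands in for the full weighted fitness of Eq. (2), and the fallback rule used when a parent's successor is already visited is an assumption.

```python
import math

def gscx(parent1, parent2, coords):
    """Greedy sequential constructive crossover (simplified sketch).

    At each step the next unvisited node proposed by each parent is compared
    and the cheaper one (Euclidean distance here, instead of the weighted
    fitness of Eq. (2)) is appended to the offspring.
    """
    def dist(a, b):
        (x1, y1), (x2, y2) = coords[a], coords[b]
        return math.hypot(x1 - x2, y1 - y2)

    def next_unvisited(parent, current, visited):
        # Node that follows `current` in the parent, skipping visited ones;
        # fall back to the first unvisited node in parent order.
        idx = parent.index(current)
        for node in parent[idx + 1:] + parent[:idx]:
            if node not in visited:
                return node
        return None

    child = [parent1[0]]
    visited = {parent1[0]}
    while len(child) < len(parent1):
        cur = child[-1]
        candidates = [c for c in (next_unvisited(parent1, cur, visited),
                                  next_unvisited(parent2, cur, visited)) if c is not None]
        nxt = min(candidates, key=lambda n: dist(cur, n))
        child.append(nxt)
        visited.add(nxt)
    return child

if __name__ == "__main__":
    coords = {1: (0, 0), 2: (1, 0), 3: (2, 1), 5: (0, 2), 6: (1, 1), 7: (2, 0), 8: (3, 1)}
    p1 = [1, 6, 3, 5, 2, 7, 8]
    p2 = [1, 5, 7, 6, 8, 2, 3]
    print(gscx(p1, p2, coords))
```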

The values of wd, wz and wθ change only if a randomly generated value is greater than pChange at the end of an iteration, in order to improve the quality of the solutions. Equations (3), (4) and (5) describe how the weights of distance, turning angle, and elevation change at each such iteration: wd (the weight of distance) is decreased, while the weights of the turning angle and elevation are increased. With this, the algorithm is able to find a path without any segment intersections while reducing the turning angle and elevation of the path at the same time.

wd+1 = wd × β    (3)

wz+1 = wz / mega    (4)

wθ+1 = wθ / mega    (5)

where β is the decrement factor for the weight of distance and mega is the increment factor used for both the turning angle and the elevation.

Mutation Operator. Three mutation methods, namely SWAP, REVERSE and INSERTION, were investigated. Each method has its own probability of being selected, but together the probabilities must sum to 1, i.e. Swap + Insertion + Reverse = 1. The mutation method is chosen by Roulette Wheel Selection, which randomizes the selection to improve diversity. The SWAP mutation randomly selects two nodes and swaps their locations to form a new chromosome. In the REVERSE mutation, two nodes are randomly selected and their arrangement is then reversed in the new chromosome. In the last method, INSERTION, two nodes are randomly selected and the first node is inserted next to the second node. The mutation here is similar to simulated annealing in that it has an adaptive acceptance that depends on the temperature. There are two ways for the new chromosome to replace the current chromosome: either the fitness of the new chromosome is better than that of the current chromosome, or the P value calculated from Eq. (7) is larger than or equal to a randomly generated value. This improves the convergence rate while improving the diversity of the technique in finding good solutions.

δ = (fitnessnew − fitnesscurrent) / fitnesscurrent    (6)

P = e^(δ/temperature)    (7)

temperaturenew = α × temperaturecurrent    (8)

In order to find the value of P, Eq. (6) is used to find δ, which is then used in Eq. (7). The temperature is decreased after each iteration as per Eq. (8), where α is the temperature decrement factor.

Selection Operator. The tournament selection method is used to select the better individuals to produce a better next generation (in this research the equation used to calculate the fitness is the same as Eq. (2), as three objectives are considered). First, the tournament size N is selected. N = 2 is used here, which means that two individuals are randomly chosen from the population to be compared each time, and the individual with the better fitness is chosen. In the event that both have the same fitness, the selected individual is chosen randomly instead [20].
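The three mutation moves and the annealing-style acceptance of Eqs. (6)–(8) can be sketched as below; the selection probabilities are placeholders, and because the sign convention of Eq. (7) depends on whether fitness is maximized or minimized, the sketch assumes minimization and therefore uses exp(−δ/temperature).

```python
import math
import random

def mutate(route, method):
    """SWAP, REVERSE or INSERTION mutation applied to a copy of the route."""
    r = route[:]
    i, j = sorted(random.sample(range(len(r)), 2))
    if method == "swap":
        r[i], r[j] = r[j], r[i]
    elif method == "reverse":
        r[i:j + 1] = reversed(r[i:j + 1])
    else:  # insertion: remove the first node and place it next to the second
        r.insert(j, r.pop(i))
    return r

def anneal_accept(fit_new, fit_cur, temperature):
    """Adaptive acceptance in the spirit of Eqs. (6)-(7); fitness minimized here."""
    if fit_new < fit_cur:
        return True
    delta = (fit_new - fit_cur) / fit_cur          # Eq. (6)
    return math.exp(-delta / temperature) >= random.random()

if __name__ == "__main__":
    random.seed(0)
    probs = {"swap": 0.4, "insertion": 0.3, "reverse": 0.3}   # placeholder, must sum to 1
    method = random.choices(list(probs), weights=list(probs.values()))[0]
    route = [1, 6, 3, 5, 2, 7, 8]
    print(method, mutate(route, method))
    print(anneal_accept(fit_new=105.0, fit_cur=100.0, temperature=0.5))
```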


4 Results

The new NSGA-II was used to test 202 maps that came from the three large areas shown in Fig. 2. Notice that M2 has the smallest number of subsections at 50, while M1 has the largest with 90; M3 has a total of 62 subsections, making a total of 202 individual maps from these three estates. The paths for each of these subsections have been manually identified by trained operators, and those results are compared with the paths computed by NSGA-II.

Fig. 2. Three estates of M1, M2 and M3: (a) M1 (90 subsections); (b) M2 (50 subsections); (c) M3 (62 subsections).

An important criterion for the paths is that they need to be "smooth", which basically means that the paths should not have any intersections. Figure 3 illustrates a path that is not smooth (Fig. 3(a)) and one that has no intersections and is therefore smooth (Fig. 3(b)).

Fig. 3. Two different types of paths: (a) intersecting paths; (b) non-intersecting paths.

After executing the 202 maps with the proposed algorithm, smooth paths were obtained for 201 of the 202 maps (99.5%), and 199 of these 201 smooth paths (99.0%) have a lower power usage than the manually generated ones. Furthermore, the execution time of the proposed algorithm to find the optimal solution for an individual map is below 10 min; this was measured on an Intel® Core™ i5-10500 CPU @ 3.10 GHz. Table 1 shows the execution times for a sample of the maps (F1–F10) from M2. Table 2 shows that the most common factor behind the reduction in voltage is paths having smaller length, angle, and elevation than those generated manually. Furthermore, it also shows that the improvements obtained on multiple objectives are larger than the improvements obtained on single objectives. Lastly, this also illustrates


Table 1. Execution time of the proposed algorithm for M2

       Execution time           Execution time
F1     8 mins 59.2 secs    F6   8 mins 53.3 secs
F2     9 mins 12.0 secs    F7   8 mins 54 secs
F3     8 mins 6.6 secs     F8   9 mins 7.0 secs
F4     8 mins 53.2 secs    F9   8 mins 45.1 secs
F5     8 mins 31.9 secs    F10  9 mins 3.1 secs

that the proposed algorithm can find smooth paths with smaller voltages compared to those generated by the human experts.

Table 2. The factors that reduce the voltage of the path

Factors                                Estimate Range of Voltage Improvement   No. of Maps
Smaller length, angle, and elevation   0.7040                                  120
Smaller length and angle               0.3626                                  26
Smaller length and elevation           0.2677                                  8
Smaller elevation and angle            0.3325                                  34
Smaller length                         0.1065                                  3
Smaller angle                          0.1944                                  8
Smaller elevation                      -                                       0

The results from testing three randomly chosen samples are summarized in Tables 3 and 4. They show that the voltage of the paths generated by the proposed algorithm is smaller than that of the paths generated manually, i.e. the proposed algorithm was able to generate paths with smaller voltages.

Table 3. Results from three randomly chosen samples

         NSGA-II                                    Manually generated
         voltage   length    angle   elevation      voltage   length    angle   elevation
M1 F16   6.3614    6057499   2224    41.71          6.4535    5944496   2667    38.04
M1 F17   6.8115    5827678   2716    57.66          7.2437    5522551   3536    66.56
M2 F46   7.1851    6798781   2734    42.62          7.8188    6651666   3584    55.04

Furthermore, the limitations of the proposed algorithm were also investigated. Additional tests were run with different numbers of nodes, one set for each of the 9 ranges shown in Table 5. As the number of nodes in each data set is less than 90, the maps with larger numbers of nodes had to be created by combining

Table 4. Percentage changes from the three examples

         voltage change (%)   length change (%)   angle change (%)   elevation change (%)
M1 F16   -1.43                1.90                -16.62             9.65
M1 F17   -5.98                5.53                -23.18             -13.36
M2 F46   -8.10                2.21                -23.72             -22.57

an additional appropriate number of nodes from neighbouring areas in M1 (Fig. 2(a)). For each of the remaining 8 ranges with 91 to 160 nodes, the new maps were generated by combining data from F1 with an appropriate number of additional nodes from its neighbours in M1. Each data set was tested 10 times to determine the consistency of the proposed algorithm in solving these problems.

Table 5. Results for different numbers of nodes

Number of nodes   Number of non-intersecting paths from 10 runs (minimum number)
40–80             10
81–90             8
91–100            4
101–110           2
111–120           2
121–130           2
131–140           1
141–150           2
151–160           0

Table 5 shows that when the number of nodes increases to the top range (151–160), the proposed algorithm is unable to find a path without intersecting segments in any of the 10 runs. Moreover, it can clearly be seen that the proposed algorithm has no more than a 20% chance of producing a smooth path when the number of nodes increases beyond 103, up to 143. For 93 nodes, however, the proposed NSGA-II has a better chance, at 40%, of generating a smooth path, and the results become significantly better for areas with smaller numbers of nodes to traverse. In conclusion, the performance of the proposed algorithm tapers off for the same set of hyperparameters as the number of nodes increases. Tuning these hyperparameters can help ensure that the algorithm is able to produce paths that satisfy all of its multiple objectives for large numbers of nodes. Nevertheless, the maximum number of nodes among the 202 cases tested here is only 85. This reinforces the conclusion that the proposed NSGA-II is able to generate paths with better fitness, which is a combination of the total path distance, turning angle and elevation, as its performance only starts to deteriorate significantly when the number of nodes exceeds 90.


5 Conclusion

This paper proposed a new NSGA-II to generate the flight paths of UAVs used to undertake a range of services in large palm oil plantations, where the paths must fulfil multiple objectives. In addition to the standard total distance travelled, a path must minimize the number of sharp angular turns as well as changes in elevation, which together are measured in terms of the power consumption, i.e. the voltage. Results from extensive testing on over 200 maps showed that the method is able to find good flight paths with a lower voltage than the manually generated paths for 98.5% of the test set. Furthermore, the time required by the proposed algorithm to generate these paths is within the 10-min threshold set by the end users. However, the proposed NSGA-II has some limitations, as it did not perform very well in finding the best solution when the number of nodes in the problem exceeds 90. With such metaheuristics, the quality of the solutions is sensitive to the initial solution generated; hence, intelligent initialization of the solution can lead to both better and faster solutions. Secondly, optimal values for the population size, mutation rate, and crossover rate can affect the local and global search of the algorithm, as it is well known that these have a significant effect on the genetic algorithm on which NSGA-II is based. Furthermore, the values of the weight vectors, the probability of the weight vectors changing, the initial temperature, and the decrement and increment factors can also affect the quality of the solutions. Prior work has also shown that including a local search to help the algorithm escape local optima can make a big difference to the quality of the results.

Acknowledgements. WH Tan, WK Lai, PH Chen and LC Tay are grateful to TAR UMT for both financial and material support in the work reported here.

References 1. Palm Oil - What is it and why is it important? - Green Campus Initiative - University of Maine. https://umaine.edu/gci/2021/10/25/palm-oil-what-is-it-and-why-is-it-import ant/. Accessed 28 Apr 2023 2. Migrant workers in oil palm plantations deserve better treatment—Tenaganita | Malay Mail. https://www.malaymail.com/news/what-you-think/2018/09/25/migrant-workers-inoil-palm-plantations-deserve-better-treatment-tenaganita/1676208. Accessed 28 Apr 2023 3. Protecting Local Labour Rights in the Palm Oil Sector - Roundtable on Sustainable Palm Oil (RSPO). https://rspo.org/protecting-local-labour-rights-in-the-palm-oil-sector/. Accessed 28 Apr 2023 4. Palm oil industry in Malaysia - statistics & facts | Statista. https://www.statista.com/topics/ 5814/palm-oil-industry-in-malaysia/#topicHeader__wrapper. Accessed 28 Apr 2023 5. How Are Drones Used in Oil Palm Plantations. https://www.droneacademy-asia.com/post/ how-are-drones-used-in-oil-palm-plantations. Accessed 28 Apr 2023 6. The Pros And Cons Of Drones In Agriculture - DroneGuru. https://www.droneguru.net/thepros-and-cons-of-drones-in-agriculture/. Accessed 28 Apr 2023 7. 10 Limitations Of Drones | Grind Drone. https://grinddrone.com/features/10-limitations-ofdrones. Accessed 28 Apr 2023


8. Jiang, X., Zhou, Q., Ye, Y.: Method of task assignment for UAV based on particle swarm optimization in logistics (2017). https://doi.org/10.1145/3059336.3059337 9. Wang, Z., Liu, G., Li, A.: Three-dimensional path planning of UVAs based on simulated annealing and particle swarm optimization hybrid algorithm. In: Proceedings of 2021 IEEE 3rd International Conference on Civil Aviation Safety and Information Technology, ICCASIT 2021, pp. 522–525 (2021). https://doi.org/10.1109/ICCASIT53235.2021.9633423 10. Huang, Q., Sheng, Z., Fang, Y., Li, J.: A simulated annealing-particle swarm optimization algorithm for UAV multi-target path planning. In: 2022 2nd International Conference on Consumer Electronics and Computer Engineering, ICCECE 2022, pp. 906–910 (2022). https:// doi.org/10.1109/ICCECE54139.2022.9712678 11. Agnihotri, A., Gupta, I.K.: A hybrid PSO-GA algorithm for routing in wireless sensor network. In: 2018 4th International Conference on Recent Advances in Information Technology (RAIT), Dhanbad, India, pp. 1–6 (2018). https://doi.org/10.1109/RAIT.2018.8389082 12. Zhao, B., Gui, H., Li, H., Xue, J.: Cold chain logistics path optimization via improved multiobjective ant colony algorithm. IEEE Access 8, 142977–142995 (2020). https://doi.org/10. 1109/ACCESS.2020.3013951 13. Deb, K.: An efficient constraint handling method for genetic algorithm. Comput. Methods Appl. Mech. Eng. 186, 311–338 (2000). https://doi.org/10.1016/S0045-7825(99)00389-8 14. Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002). https://doi.org/10. 1109/4235.996017 15. Shuai, Y., Yunfeng, S., Kai, Z.: An effective method for solving multiple travelling salesman problem based on NSGA-II. Syst. Sci. Control Eng. 7(2), 121–129 (2019). https://doi.org/10. 1080/21642583.2019.1674220 16. Panda, M.: Performance comparison of genetic algorithm, particle swarm optimization and simulated annealing applied to TSP. Int. J. Appl. Eng. Res. 13(9), 6808–6816 (2018). http:// www.ripublication.com. Accessed 07 Sept 2022 17. Lathee, M.G., Zhan, G.H., Salam, Z.A.B.A.: Thinakaranthinakaran, R.: Comparative analysis of ant colony optimization and genetic algorithm on solving symmetrical travelling salesman problem. J. Adv. Res. Dyn. Control Syst. 12(7 Special Issue), 2629–2635 (2020). https://doi. org/10.5373/JARDCS/V12SP7/20202399 18. Bolaños, R.I., Echeverry, M.G., Escobar, J.W.: A multiobjective non-dominated sorting genetic algorithm (NSGA-II) for the multiple traveling salesman problem. Decis. Sci. Lett. 4(4), 559–568 (2015). https://doi.org/10.5267/J.DSL.2015.5.003 19. Ata Al-Furhud, M., Hussain Ahmed, Z.: Genetic algorithms for the multiple travelling salesman problem. IJACSA: Int. J. Adv. Comput. Sci. Appl. 11(7), 2020 (2022). www.ijacsa.the sai.org. Accessed 07 Sept 2022 20. Shi, L., Li, Z.: An improved pareto genetic algorithm for multi-objective TSP. In: 5th International Conference on Natural Computation, ICNC 2009, vol. 4, pp. 585–588 (2009). https:// doi.org/10.1109/ICNC.2009.510

Reimagining China-US Relations Prediction: A Multi-modal, Knowledge-Driven Approach with KDSCINet Rui Zhou(B) , Jialin Hao, Ying Zou, Yushi Zhu, Chi Zhang, and Fusheng Jin Beijing Institute of Technology, Beijing, China {zhourui,hjlin,1120203433,1120202344,1120200837,jfs21cn}@bit.edu.cn

Abstract. Statistical models and data-driven models have achieved remarkable results in international relation forecasting. However, most of these models have several common drawbacks: (i) they rely on large amounts of expert knowledge, limiting the objectivity, applicability, usability, interpretability and sustainability of the models, and (ii) they can only use structured unimodal data or cannot make full use of multimodal data. To address these two problems, we propose a Knowledge-Driven neural network architecture that conducts Sample Convolution and Interaction, named KDSCINet, for China-US relation forecasting. Firstly, we filter events pertaining to China-US relations from the GDELT database. Then, we extract text descriptions and images from news articles and utilize the fine-tuned pre-trained model MKGformer to obtain embeddings. Finally, we connect the textual and image embeddings of each event with the structured event value in the GDELT database through a multi-head attention mechanism to generate time series data, which is then fed into KDSCINet for China-US relation forecasting. Our approach enhances prediction accuracy by establishing a knowledge-driven temporal forecasting model that combines structured data, textual data and image data. Experiments demonstrate that KDSCINet can (i) outperform state-of-the-art methods on the time series forecasting problem in the area of international relation forecasting, and (ii) improve forecasting performance through the use of multimodal knowledge.

Keywords: Knowledge-driven · China-US relation · Multimodal data · Time-series forecasting

1 Introduction

International relations have a significant impact on the economy, politics, and security [13,16,26]. Factors such as power, interdependence, ideology, and conflicts play a crucial role in shaping international relations [8,9]. Forecasting


Fig. 1. The illustration of the multimodal knowledge supporting time series forecasting.

changes in international relations is therefore highly valuable. Existing forecasting methods, including statistical modeling and natural language processing with text mining techniques [14,21,22], have shown promise in this area. However, these methods often rely heavily on expert knowledge, which can be influenced by data quality and structural characteristics. Moreover, they may suffer from limited domain knowledge and model generalization capabilities, necessitating a large amount of labeled data. Motivated by the above, this paper introduces CU-GDELT, a comprehensive multimodal knowledge graph that incorporates diverse and abundant images into the textual entities of GDELT. Furthermore, we propose a novel neural network architecture called KDSCINet (Knowledge-Driven neural network architecture that conducts Sample Convolution and Interaction) to tackle the challenge of international relation forecasting. We select the Sample Convolution and Interaction Network (SCINet) [17] as our framework of choice, as it outperforms other deep neural network models for time series forecasting, including Transformer-based models [24] and temporal convolutional networks [1]. To tackle the prediction challenge, we leverage structured data from GDELT (https://www.gdeltproject.org/) and gather corresponding image and textual data. After preprocessing, we construct a multimodal knowledge graph named CU-GDELT. We individually vectorize the structured data, textual data and image data, and then concatenate them. These embeddings are input into our KDSCINet, where an attention mechanism aggregates the knowledge and a SCINet-based forecasting model is employed. The KDSCINet approach, depicted in Fig. 1, demonstrates how multimodal knowledge can assist in forecasting the development of event values. When we observe an image showing a meeting between leaders of China and the United States, it implies positive progress in China-US relations. Similarly, when we read news about "China envoy building on Iowa legacy", it reinforces the same



idea. Our model can capture the correlation between multimodal knowledge and structured data, leading to more accurate China-US relation forecasting. To the best of our knowledge, the main contributions of this paper are as follows:
– KDSCINet is the first model to utilize the Sample Convolution and Interaction Network to forecast international relations, integrating structured data, textual data and image data. It is a hierarchical time series forecasting framework with a multi-head attention-based input layer, which effectively combines knowledge within the same time period, resulting in improved performance in forecasting international relations.
– We present CU-GDELT, a comprehensive multimodal knowledge graph that incorporates various data modalities. The integration of structured data, textual data and image data enhances the richness and depth of the knowledge graph, further contributing to the accuracy of our forecasting.
Experimental results show that KDSCINet outperforms other state-of-the-art models in forecasting the development of China-US relations. We attribute this improvement to the following factors: (i) multimodal knowledge enriches the contextual information, enhancing the model's understanding of temporal data and improving prediction accuracy; (ii) the multi-head attention mechanism effectively integrates knowledge from the multimodal graph by learning different weight distributions.

2 Related Work

In this section, we review the existing literature on forecasting international relations and discuss the limitations of both traditional statistical models and more recent data-driven models. We then introduce the concept of knowledge-driven models and highlight their potential to overcome the challenges posed by previous approaches.

2.1 Statistical Models

Extensive research has focused on forecasting international relations, initially relying on traditional statistical models and rule-based approaches using historical data and expert knowledge. However, these early efforts [19,20] failed to materialize due to the multitude of factors involved and the limitations of statistical modeling and computational statistics. The Logit model, commonly used for binary classification [14], has limitations in international relations forecasting. It helps identify significant factors and quantify their impacts on specific events, but it struggles with incomplete and inaccurate data, which can distort its forecasts. Additionally, the Logit model fails to capture the complexity and nonlinear nature of international relations. The Probit model, also used for international relations analysis and prediction [22], falls short in accounting for interdependencies implied by the theory, making interpretation of parameter estimation challenging.

2.2 Data-Driven Models

To address the limitations of statistical models, data-driven models utilizing natural language processing (NLP) and sentiment analysis have been introduced. In the study by [2], sentiment analysis of social media data was employed to establish potential connections between country relationships and the level of cooperation among them. In another study by [22], NLP and sentiment analysis were used to classify sentiment and extract keywords from news texts, enabling the forecasting of geopolitical risk changes. However, both have limitations due to data quality, coverage, and domain-specific context. [15] utilizes the AvgTone variable from the GDELT dataset as structured data and applies a sliding-window time series method to forecast international material cooperation and diplomatic cooperation. The results demonstrate the predictive capability of data-driven forecasting models in specific contexts. However, the method is limited by the small size of the dataset and the reliance on a single variable for prediction. Therefore, further research is needed to expand the dataset size and incorporate multimodal data to enhance the accuracy and reliability of the model.

2.3 Knowledge-Driven Models

Efforts in the field of international relations forecasting have been limited by the subjectivity of approaches relying solely on news sources or the objectivity of focusing solely on evaluation indicators. Incorporating knowledge has been demonstrated to be crucial for capturing the inconsistent evolution of streaming data and improving accuracy in forecasting [3]. Integrating knowledge graphs (KG) into the learning process of event embeddings enables the encoding of valuable background knowledge [6]. Utilizing Multimodal Knowledge Graphs (MKG) and Multimodal Graph Convolution Networks (MGCN) has proven effective in establishing reliable image-text connections and achieving state-of-the-art performance in image-text retrieval [7]. Enhancements through knowledge integration, such as the combination of RNN-based networks with Key-Value Memory Networks (KV-MN), have been proposed for sequential recommendation systems [12]. Deep knowledge-aware networks (DKN) that incorporate KG have shown promise in news recommendation [25]. In the domain of stock trend forecasting and explanation, a Knowledge-Driven Temporal Convolutional Network that integrates structured tuples from financial news and leverages background knowledge from KG has been developed [5]. Traditional statistical models and current deep models have failed to achieve promising results in international relations forecasting due to limited expressiveness and the lack of high-quality datasets. To overcome these challenges, we introduce KDSCINet, a novel framework, along with the CU-GDELT multimodal knowledge graph.

3 The KDSCINet Framework

In this section, we first present the CU-GDELT multimodal knowledge graph in Sect. 3.2. Then we present the overall framework of KDSCINet, which is a


Fig. 2. The overall architecture of KDSCINet framework.

general framework that can be applied to a wide range of knowledge-driven temporal modeling and forecasting tasks; here we specifically apply it to forecast China-US relations. To facilitate understanding, we introduce its detailed implementation, including the MKGformer-based embedding layer in Sect. 3.3 and the attention-based SCINet in Sect. 3.4.

3.1 Overview

The overview of the KDSCINet architecture is shown in Fig. 2. After constructing a knowledge graph based on the GDELT database, we utilize MKGformer to extract multimodal embeddings from the text descriptions and images of entities, then concatenate them with structured event values, creating an entity representation that is rich in multimodal information. Then, we feed these representations into an attention block to combine entities that share the same time features. Finally, the combined vectors are input into a SCINet-based model for China-US relation forecasting.

3.2 CU-GDELT

To explicitly represent the implicit relations between the image and the text, we build CU-GDELT, as shown in Fig. 2(a).


Data Collection. Our study utilizes data from the GDELT 2.0 global events database, which contains a comprehensive collection of worldwide news reports. Specifically, we utilize data from the year 2022 at 15-minute intervals, obtaining a total of 34,953 time periods of event data comprising 36,720,483 news records (2.21 GB compressed). In forecasting China-US relations, we mainly leverage the "GoldsteinScale" field within the dataset. This field serves as an indicator of the potential impact of events on both countries. To complement the structured data, we extract news texts and images corresponding to specific entities based on the URL field stored for each record in GDELT. The collected data is then preprocessed to create the multimodal knowledge graph, CU-GDELT, which is utilized by our KDSCINet model.
Data Preprocessing and Formatting. We select and clean the original GDELT data, and crawl text and image data using different strategies to reduce noise and improve data quality (a minimal filtering sketch is given at the end of this subsection):
1. Time-series Event Data X. When processing the raw GDELT data, we examine the "Actor1Code" and "Actor2Code" fields and only retain events whose actor pair is ("CHN", "USA") or ("USA", "CHN"). After applying this filtering, we obtain an event value dataset consisting of value records collected at 15-minute intervals, covering the timespan from January 1, 2022 to December 31, 2022.
2. Textual News Data T. The textual news dataset consists of historical news headlines extracted from GDELT. After completing the screening process, we acquire 150,968 pieces of textual news data.
3. News Image Data I. When utilizing a crawler to fetch images, we filter out images with dimensions below 200×200 pixels and screen the image attributes to exclude irrelevant images such as logos and news thumbnails. With these filtering criteria, we obtain a total of 5,107 images for our news image data.
Construction of CU-GDELT. We link image data and textual data with the other entities as entities in a triple structure, stored in triple form.
Schema. We use a bottom-up approach to build the schema, first defining specific concepts together with their attributes and relationships related to the analysis of China-US geopolitical events, and then clustering concepts with highly overlapping attributes or relations. In CU-GDELT, five concepts are established: Event, Actor, EventAction, EventGeography, and MultimodalData. After that, we define the entities, attributes and relationships in each concept of the ontology; the schema includes 15 different relations.
Data. The data layer of CU-GDELT is mainly used to store and manage the large amount of data obtained in the data collection and preprocessing stages. Except


for the keyword attribute in the concept of MultimodalData, the attribute data of the other concepts are mainly obtained through Data Collection combined with Data Preprocessing and Formatting; the corresponding fields in the structured data are obtained by entity extraction. For the keyword attribute of MultimodalData, the obtained EventTitle attribute data is processed with the TextRank algorithm, and the extraction result is used as the keyword attribute data. Compared to previous studies that primarily relied on the basic GDELT dataset for forecasting, such as using GDELT data for macroeconomic forecasting [23] and investigating violence-related issues in Iraq [10], our research takes a more comprehensive approach. We delve deeper into the extensive information contained within the GDELT dataset by incorporating textual data and images, thereby expanding the range of available data modalities.
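The event filtering described in step 1 above can be expressed as a short script. The following is only a minimal sketch under the assumption that the GDELT export has been loaded into a pandas DataFrame using the standard GDELT column names (Actor1Code, Actor2Code, GoldsteinScale, SQLDATE); the file name is hypothetical.

```python
# Sketch: keep only China-US events and aggregate the GoldsteinScale value.
import pandas as pd

def filter_china_us_events(df: pd.DataFrame) -> pd.DataFrame:
    """Keep only events whose actor pair is (CHN, USA) or (USA, CHN)."""
    valid_pairs = {("CHN", "USA"), ("USA", "CHN")}
    pair = df[["Actor1Code", "Actor2Code"]].apply(tuple, axis=1)
    return df[pair.isin(valid_pairs)]

# events = pd.read_csv("gdelt_2022.tsv", sep="\t")        # hypothetical file name
# cn_us = filter_china_us_events(events)
# goldstein = cn_us.groupby("SQLDATE")["GoldsteinScale"].mean()
```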

3.3 MKGformer-Based Embedding Layer

Hybrid Transformer Architecture. MKGformer is a unified multimodal knowledge graph completion framework based on the transformer network architecture [4]. The overall framework is depicted in Fig. 2(b). The hybrid transformer architecture of MKGformer mainly includes three stacked modules: (i) a visual encoder (V-Encoder), (ii) a textual encoder (T-Encoder), and (iii) an upper multimodal encoder (M-Encoder). We denote the number of V-Encoder layers as $L_V$, the number of T-Encoder layers as $L_T$, and the number of M-Encoder layers as $L_M$.
Multimodal Entity Modeling. Unlike previous approaches that merely concatenate or fuse specific visual and textual features of entities, we fully leverage the "masked language modeling" (MLM) ability of pre-trained transformers to model multimodal representations of the entities in the knowledge graph that integrate image and text. We input the entity description $d_{e_i} = (w_1, \ldots, w_n)$ and its corresponding multiple images $I_{e_i} = \{I^1, I^2, \ldots, I^o\}$ from CU-GDELT to the visual side of the hybrid transformer architecture, and convert the textual-side input sequence as follows:

$T_{e_i} = [\mathrm{CLS}]\; d_{e_i}\; [\mathrm{SEP}]\ \text{is the description of}\ [\mathrm{MASK}]\; [\mathrm{SEP}]$   (1)

We extend the word embedding layer of BERT to treat each token embedding as the corresponding multimodal representation $E_{e_i}$ of the $i$-th entity $e_i$. Then we train MKGformer to predict the [MASK] token in the multimodal entity embedding $E_{e_i}$ using the cross-entropy loss for classification:

$\mathcal{L}_{\text{link}} = -\log p(e_i \mid T_{e_i})$   (2)

Additionally, in MKGformer, the entire model, except for the newly added multimodal entity embedding parameters, is frozen. This is done to guide the MKGformer model in selectively fusing textual and visual information into the multimodal entity embeddings.
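A compact way to realize this setup is sketched below. The encoder interface and all names are hypothetical placeholders; only the general pattern (a new trainable entity-embedding table, a frozen backbone, and the cross-entropy link loss of Eq. (2)) follows the description above.

```python
# Sketch: frozen multimodal backbone + trainable entity embeddings + link loss.
import torch
import torch.nn as nn
import torch.nn.functional as F

class EntityLinkHead(nn.Module):
    def __init__(self, mkgformer: nn.Module, num_entities: int, dim: int = 768):
        super().__init__()
        self.encoder = mkgformer                     # hypothetical encoder module
        self.entity_emb = nn.Embedding(num_entities, dim)  # newly added, trainable
        for p in self.encoder.parameters():          # freeze everything else
            p.requires_grad = False

    def forward(self, text_tokens, images, entity_ids):
        # assumed to return the hidden state at the [MASK] position, shape (B, dim)
        h_mask = self.encoder(text_tokens, images)
        logits = h_mask @ self.entity_emb.weight.t()       # (B, num_entities)
        return F.cross_entropy(logits, entity_ids)         # L_link of Eq. (2)
```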


Multimodal Information Extraction. For the task of multimodal information extraction, we fine-tune the input sequence of the T-Encoder to better capture the textual description information of the entities. The input sequence format is transformed as follows:

$T_{e_i} = [\mathrm{CLS}]\; e_i\; [\mathrm{SEP}]\; d_{e_i}\; [\mathrm{SEP}]$   (3)

Given the presentation format of the entities' textual data in our dataset, we apply mean pooling over the token embeddings to obtain the multimodal embedding representation $E_{e_i}$ of each entity. Subsequently, we combine it with the entity's event value $Ev_{e_i}$ extracted from the GDELT dataset, resulting in the multimodal information representation of the entity, $I_{e_i} = \{E_{e_i}, Ev_{e_i}\}$.
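A minimal sketch of this pooling-and-concatenation step is shown below; the dimensions and function name are illustrative, not taken from the released implementation.

```python
# Sketch: mean-pool token embeddings and append the structured event value.
import torch

def entity_representation(token_emb: torch.Tensor, event_value: torch.Tensor):
    """token_emb: (num_tokens, d) from the T-Encoder; event_value: scalar tensor."""
    e_entity = token_emb.mean(dim=0)                   # E_{e_i}, shape (d,)
    return torch.cat([e_entity, event_value.view(1)])  # I_{e_i}, shape (d + 1,)
```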

3.4 Attention-Based SCINet

We refer to the SCINet architecture proposed by [17]. To handle multi-entity data obtained from multi-channel concatenation, we enhance the existing SCINet architecture by adding an attention block, as shown in Fig. 2(c). We utilize a multi-head attention mechanism to combine the data and obtain time series data as input. The resulting time series data, obtained through the attention mechanism, is subsequently fed into the SCINet model for further processing.
Attention Block. The attention block in Fig. 2(c) is the input layer of SCINet. It combines the input multi-channel time series $F \in \mathbb{R}^{M \times N}$ into a time series $F'$ of length $M$ (where $M$ is the length of the time series and $N$ is the dimension of the embedding after multi-channel splicing at the same time step) through a multi-head attention mechanism [24], allowing the model to learn different behaviors based on the same attention mechanism. These different behaviors are then combined as knowledge to capture dependencies among the vectors. Formally, the attention function maps a sequence of queries $Q = \{Q_1, \ldots, Q_N\}$ and a set of key-value pairs $\{K, V\} = \{(K_1, V_1), \ldots, (K_M, V_M)\}$ to outputs, where $Q \in \mathbb{R}^{J \times d}$ and $K, V \in \mathbb{R}^{M \times d}$. More specifically, the multi-head attention model first transforms $Q$, $K$ and $V$ into $H$ subspaces with different, learnable linear projections, namely:

$Q_h, K_h, V_h = Q W_h^Q,\; K W_h^K,\; V W_h^V$   (4)

where $\{Q_h, K_h, V_h\}$ are respectively the query, key, and value representations of the $h$-th head, and $\{W_h^Q, W_h^K, W_h^V\} \in \mathbb{R}^{d \times \frac{d}{H}}$ denote the parameter matrices associated with the $h$-th head, where $d$ represents the dimension of the model hidden states. Furthermore, $H$ attention functions are applied in parallel to produce the output states $\{O_1, \ldots, O_H\}$, among which:

$O_h = A_h V_h \quad \text{with} \quad A_h = \mathrm{softmax}\!\left(\frac{Q_h K_h^{\top}}{\sqrt{d_k}}\right)$   (5)


Here, $A_h$ is the attention distribution produced by the $h$-th attention head. Finally, the output states are concatenated to produce the final state:

$\text{Concat}: \hat{O} = [O_1, \ldots, O_H], \qquad \text{Linear}: O = \hat{O} W^O$   (6)

where $O \in \mathbb{R}^{J \times d}$ denotes the final output states and $W^O \in \mathbb{R}^{d \times d}$ is a trainable matrix. The conventional multi-head attention uses a straightforward concatenation and linear mapping to aggregate the output representations of multiple attention heads. By performing the aforementioned operations, we merge the input features into temporal features, transforming the multi-channel time series $F$ into $F'$, which is then fed into SCINet for forecasting.
SCI-Block. The SCI-Block in Fig. 2(d) is the basic module of SCINet; it decomposes the input feature $F'$ into two sub-features $F'_{odd}$ and $F'_{even}$ through the operations of splitting and interactive learning.
SCINet. The entire structure of SCINet is a binary tree, shown in Fig. 2(e). In each SCI-Block, the time series is divided into two parts. As the depth of the binary tree increases, the "temporal resolution" of the time series decreases. In this way, both short-term and long-term dependencies in the time series are captured. After passing through $L$ layers of SCI-Blocks, the time series is reassembled to generate a new sequence, which is then added to the original sequence through residual connections [11] for forecasting. Finally, a fully connected layer is applied to obtain the forecasting results or features.
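The attention block of Eqs. (4)–(6) can be sketched in a few lines. One plausible reading of the aggregation, under the assumption that the per-timestep entity embeddings are pooled by a single learned query vector, is the following (class and variable names and shapes are illustrative):

```python
# Sketch: multi-head attention pooling of N per-timestep vectors into one feature.
import torch
import torch.nn as nn

class AttentionPool(nn.Module):
    def __init__(self, d: int, heads: int):
        super().__init__()
        assert d % heads == 0
        self.h, self.dk = heads, d // heads
        self.wq = nn.Linear(d, d, bias=False)          # W^Q of Eq. (4)
        self.wk = nn.Linear(d, d, bias=False)          # W^K
        self.wv = nn.Linear(d, d, bias=False)          # W^V
        self.wo = nn.Linear(d, d, bias=False)          # W^O of Eq. (6)
        self.query = nn.Parameter(torch.randn(1, d))   # learned pooling query

    def forward(self, x):                              # x: (N, d) entities of one step
        q = self.wq(self.query).view(1, self.h, self.dk).transpose(0, 1)   # (h, 1, dk)
        k = self.wk(x).view(-1, self.h, self.dk).transpose(0, 1)           # (h, N, dk)
        v = self.wv(x).view(-1, self.h, self.dk).transpose(0, 1)           # (h, N, dk)
        att = torch.softmax(q @ k.transpose(1, 2) / self.dk ** 0.5, dim=-1)  # Eq. (5)
        out = (att @ v).transpose(0, 1).reshape(1, -1)                     # concat heads
        return self.wo(out).squeeze(0)                 # (d,) pooled feature, Eq. (6)
```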

3.5 Complexity Analysis

The time complexity of each component in the KDSCINet model can be determined based on several factors, including the size of the look-back window denoted as $T$, the number of entities per day denoted as $M$, the entity embedding dimension denoted as $d$, the length of the entity sequence denoted as $n$, the number of entity image blocks denoted as $p$, and the number of attention heads denoted as $H$. The breakdown of time complexity for each component is as follows. Within the T-Encoder, with $T \times M$ entities, the time complexity for processing each entity is $O(n^2 \cdot d)$; thus, the overall time complexity is $O(T \cdot M \cdot n^2 \cdot d)$. Similarly, in the V-Encoder, the time complexity remains $O(T \cdot M \cdot n^2 \cdot d)$. The M-Encoder comprises the PGI module and the CAF module. In the PGI module, employing a multi-head mechanism necessitates calculating both the text head and the visual head, resulting in a time complexity of $O(T \cdot M \cdot (m + p)^2 \cdot d)$. In the CAF module, the time complexity is $O(T \cdot M \cdot (mpd + np + nd))$. In our fine-tuned model, the entity embedding dimension is set to 768, the length of the entity sequence is extended to 32, and each image is divided into 50 patches. Summing up, the total time complexity of the embedding layer is $O(T \cdot M)$.


In the attention-based SCINet, the number of entities per day is denoted as $D$, and the input length is $T \times D$. After combining the event value, the dimension becomes $N$, resulting in a time complexity of $O((T \cdot D)^2 \cdot N)$. In our model, we set $D$ to be the maximum number of entities in a day and pad with zeros for time periods with fewer than $D$ entities. Additionally, we set $N$ to 770. Therefore, the time complexity of the attention layer is $O(T^2)$. The worst-case time complexity of SCINet is $O(T \log T)$ [17]. To sum up, the time complexity of KDSCINet is $O(T^2)$, which is similar to that of vanilla Transformer-based solutions, $O(T^2)$.

4 Experiments

In this section, we first illustrate the baselines and settings of our experiments. Next, we show quantitative comparisons with state-of-the-art models for time series forecasting on CU-GDELT. Additionally, we present comprehensive experiments to evaluate the effectiveness of multimodal knowledge and the different components of KDSCINet.

4.1 Baselines and Settings

We consider the baseline model variations shown in Table 1. In the first column, the prefix TB means textual embeddings and IB means image embeddings. The difference between TBIB-SCINet and TBIB-KDSCINet is that in TBIB-SCINet we replace the multi-head attention layer with the average pooling layer used in [5]. To ensure a fair comparison, we perform a grid search across all essential hyperparameters using the held-out validation set of the dataset, and use Mean Absolute Error (MAE) and Mean Squared Error (MSE) for model comparison.

Table 1. Baseline models with different inputs. The prefix TB means textual embeddings and IB means image embeddings. TBIB-SCINet uses an average layer as its input layer, while TB-KDSCINet and TBIB-KDSCINet use an attention block as their input layer.

  Model                  Processed training data
  TCN [1]                Structured data (X)
  Informer [28]          Structured data (X)
  Pyraformer [18]        Structured data (X)
  Autoformer [27]        Structured data (X)
  SCINet [17]            Structured data (X)
  TB-KDSCINet (ours)     Structured data + Textual data (X + T)
  TBIB-KDSCINet (ours)   Structured data + Textual data + Image data (X + T + I)
  TBIB-SCINet (ours)     Structured data + Textual data + Image data (X + T + I)
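For reference, the two comparison metrics mentioned above are computed as follows (a straightforward NumPy formulation):

```python
# Sketch: the two error metrics used throughout Sect. 4.
import numpy as np

def mse(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    return float(np.mean((y_true - y_pred) ** 2))

def mae(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    return float(np.mean(np.abs(y_true - y_pred)))
```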

4.2 Result Analysis

The performance of KDSCINet is examined across four aspects: (i) evaluation of the optimized pre-trained model MKGformer, (ii) evaluation of the basic SCINet architecture, (iii) assessment of the influence of different model inputs with SCINet, and (iv) examination of the impact of different input layers added to SCINet.
Multimodal and Unimodal Inputs for MKGformer. To verify that multimodal information enhances the entity embedding representation for our dataset, we compare the effects of unimodal (textual data only) and multimodal (textual + image data) inputs on the task of predicting masked entities during the pre-training of MKGformer, as shown in Table 2. The experimental results show that the multimodal model surpasses the unimodal model on all evaluation indicators, indicating the multimodal model's superior ability in entity embedding representation and prediction accuracy.

Table 2. Comparison of multimodal and unimodal inputs in the task of predicting masked entities during the pre-training of MKGformer.

  Modal                                    Hits@1   Hits@3   Hits@10   MR
  Unimodal (Textual data)                  0.3472   0.7062   0.9103    88.7
  Multimodal (Textual data + Image data)   0.3581   0.7175   0.9167    82.91

Basic Evaluation for SCINet. To verify the claim that the generic SCINet architecture can outperform other traditional forecasting models, we compare its performance with Transformer-based methods [18,27,28] and TCN [1], since they are more popular in recent long-term TSF research. As can be seen from Table 3, SCINet achieves state-of-the-art performance in most forecasting length settings. Overall, SCINet yields an average improvement of 13.78% on MSE and 14.54% on MAE under the 24, 48 and 168 horizons. This is attributed to the fact that SCINet can better capture both short-term (local temporal dynamics) and long-term (trend, seasonality) temporal dependencies. Consequently, we select SCINet as our fundamental forecasting model in this paper. It is important to note that all experiments reported in this paragraph only utilize structured event values as input.

Table 3. International relation prediction performance over the CU-GDELT multimodal knowledge graph with different basic prediction models. IMP shows the improvement of SCINet over the best model.

  Model            Metric   Horizon 24   Horizon 48   Horizon 168
  TCN [1]          MSE      1.3270       1.5798       1.5901
                   MAE      0.9599       1.1540       1.0752
  Informer [28]    MSE      0.9408       0.9852       1.1749
                   MAE      0.7306       0.7752       0.7820
  Pyraformer [18]  MSE      0.9465       0.9643       1.0354
                   MAE      0.7216       0.7503       0.7737
  Autoformer [27]  MSE      0.9585       0.9634       0.9775
                   MAE      0.7371       0.7371       0.7412
  SCINet [17]      MSE      0.9294       0.9302       0.9351
                   MAE      0.7189       0.7232       0.7267
  IMP              MSE      11.81%       10.62%       18.91%
                   MAE      9.51%        18.11%       16.06%

Different Model Inputs with KDSCINet. To validate the effectiveness of integrating structured data, textual data and image data in predicting international relations, we compare the performance of models with different inputs, as presented in Tables 4 and 5. Overall, TB-KDSCINet, which incorporates only structured data and textual data, demonstrates an average improvement of 43.89% on MSE compared to the original SCINet, which relies solely on structured data. Moreover, TBIB-KDSCINet, which incorporates structured data, textual data and image data, achieves an average improvement of 48.61% on MSE compared to the original SCINet, surpassing TB-KDSCINet by an average of 8.45%. These results indicate that using multimodal data contributes to enhancing forecasting performance. In summary, these findings substantiate the validity of integrating multimodal knowledge and structured event values as inputs to the model.

Table 4. International relation prediction performance over the CU-GDELT knowledge graph with different inputs. SCINet takes structured data as input, TB-KDSCINet takes structured data and textual data as input, and TBIB-KDSCINet takes structured data, textual data and image data as input.

  Model                  Metric   Horizon 24   Horizon 48   Horizon 168
  SCINet [17]            MSE      0.9294       0.9302       0.9351
                         MAE      0.7189       0.7232       0.7267
  TB-KDSCINet (ours)     MSE      0.4627       0.5328       0.5727
                         MAE      0.4319       0.4918       0.5331
  TBIB-KDSCINet (ours)   MSE      0.4213       0.4846       0.5304
                         MAE      0.3819       0.4339       0.4956

Table 5. Average improvement of KDSCINet in MSE under different inputs. TB-KDSCINet takes structured data and textual data as input and TBIB-KDSCINet takes structured data, textual data and image data as input.

  Comparison \ Other     SCINet   TB-KDSCINet (ours)   TBIB-KDSCINet (ours)
  TB-KDSCINet (ours)     43.89%   –                    –
  TBIB-KDSCINet (ours)   48.61%   8.45%                –

Different Input Layer with SCINet. To assess the impact of replacing the average layer with an attention block, we compare the prediction performance of models utilizing different input layers, as shown in Table 6. As observed, TBIB-KDSCINet, which incorporates the multi-head attention mechanism in the input layer, outperforms TBIB-SCINet, which employs an average layer [5], by an average of 36.56% on MSE and 27.82% on MAE. We attribute the significant improvement of KDSCINet to the following factors: (i) the multi-head attention mechanism can capture diverse relationships; (ii) it can enhance representation learning; (iii) it can handle varying importance.

Table 6. International relation prediction performance over the CU-GDELT dataset with different input layers. TBIB-SCINet uses an average layer as the input layer and TBIB-KDSCINet uses an attention block.

  Model                  Metric   Horizon 24   Horizon 48   Horizon 168
  TBIB-SCINet (ours)     MSE      0.7223       0.7667       0.7713
                         MAE      0.5351       0.5638       0.5653
  TBIB-KDSCINet (ours)   MSE      0.4213       0.4846       0.5304
                         MAE      0.3819       0.4339       0.4956
  IMP                    MSE      41.67%       36.79%       31.23%
                         MAE      40.10%       29.93%       14.06%

5 Conclusion

This article introduces KDSCINet, a novel Knowledge-Driven neural network architecture for China-US relation forecasting. We create the CU-GDELT multimodal knowledge graph to capture implicit relations between images and text. By integrating the MKGformer-based embedding layer, we generate embedded representations of entities and relations. These representations, combined with


structured event values, yield informative temporal features. Through a multi-head attention mechanism, our model fuses features and extracts valuable information from entity embeddings, focusing on diverse subspaces and relations. SCINet is then utilized to forecast future event value trends. Extensive experiments validate our approach, showcasing its effectiveness and superior performance. Future prospects include extending our method to tasks like stock trend prediction and other time series forecasting tasks. Exploiting the multimodal knowledge relations in CU-GDELT for enhanced performance and interpretability is another important direction. Moreover, the integration of entity linking technology holds the potential to further enrich the available multimodal information.
Acknowledgements. This study was funded by the National Natural Science Foundation of China (No. 62272045).

References

1. Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018)
2. Chambers, N., et al.: Identifying political sentiment between nation states with social media. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 65–75 (2015)
3. Chen, J., Lécué, F., Pan, J., Chen, H.: Learning from ontology streams with semantic concept drift. In: Twenty-Sixth International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization (2017)
4. Chen, X., et al.: Hybrid transformer with multi-level fusion for multimodal knowledge graph completion. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 904–915 (2022)
5. Deng, S., Zhang, N., Zhang, W., Chen, J., Pan, J.Z., Chen, H.: Knowledge-driven stock trend prediction and explanation via temporal convolutional network. In: Companion Proceedings of the 2019 World Wide Web Conference, pp. 678–685 (2019)
6. Ding, X., Zhang, Y., Liu, T., Duan, J.: Knowledge-driven event embedding for stock prediction. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2133–2142 (2016)
7. Feng, D., He, X., Peng, Y.: MKVSE: multimodal knowledge enhanced visual-semantic embedding for image-text retrieval. ACM Trans. Multimedia Comput. Commun. Appl. 19, 1–21 (2023)
8. Gilpin, R.: War and Change in World Politics. Cambridge University Press (1981)
9. Goldstein, J., Keohane, R.O.: Ideas and foreign policy: an analytical framework. In: Ideas and Foreign Policy, pp. 3–30. Cornell University Press (2019)
10. González, M., Alférez, G.H.: Application of data science to discover violence-related issues in Iraq. arXiv preprint arXiv:2006.07980 (2020)
11. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
12. Huang, J., Zhao, W.X., Dou, H., Wen, J.R., Chang, E.Y.: Improving sequential recommendation with knowledge-enhanced memory networks. In: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 505–514 (2018)
13. Keohane, R.O., Nye Jr., J.S.: Power and interdependence. Survival 15(4), 158–165 (1973)
14. King, G., Zeng, L.: Explaining rare events in international relations. Int. Organ. 55(3), 693–715 (2001). https://doi.org/10.1162/00208180152507597
15. Kocyigit, T.: Forecasting the nature of country relations using sentiment analysis. Ph.D. thesis, Tilburg University (2020)
16. Lake, D.A.: Hierarchy in International Relations. Cornell University Press (2011)
17. Liu, M., et al.: SCINet: time series modeling and forecasting with sample convolution and interaction. Adv. Neural Inf. Process. Syst. 35, 5816–5828 (2022)
18. Liu, S., et al.: Pyraformer: low-complexity pyramidal attention for long-range time series modeling and forecasting. In: International Conference on Learning Representations (2021)
19. Metternich, N., Gleditsch, K., Dworschak, C.: Forecasting in international relations (2016). https://doi.org/10.1093/obo/9780199743292-0179
20. Organski, A.F.K.: Population dynamics and international violence: propositions, insights and evidence by Nazli Choucri (Lexington, Mass.: D.C. Heath and Company, 1974, pp. 281). Am. Polit. Sci. Rev. 71(2), 814–816 (1977)
21. Shukla, D., Unger, S.: Sentiment analysis of international relations with artificial intelligence. Athens J. Sci. 9, 1–16 (2022)
22. Signorino, C.S.: Strategic interaction and the statistical analysis of international conflict. Am. Polit. Sci. Rev. 93(2), 279–297 (1999). https://doi.org/10.2307/2585396
23. Tilly, S., Ebner, M., Livan, G.: Macroeconomic forecasting through news, emotions and narrative. Expert Syst. Appl. 175, 114760 (2021)
24. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
25. Wang, H., Zhang, F., Xie, X., Guo, M.: DKN: deep knowledge-aware network for news recommendation. In: Proceedings of the 2018 World Wide Web Conference, pp. 1835–1844 (2018)
26. Wendt, A.: Social Theory of International Politics, vol. 67. Cambridge University Press (1999)
27. Wu, H., Xu, J., Wang, J., Long, M.: Autoformer: decomposition transformers with auto-correlation for long-term series forecasting. Adv. Neural Inf. Process. Syst. 34, 22419–22430 (2021)
28. Zhou, H., et al.: Informer: beyond efficient transformer for long sequence time-series forecasting. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 11106–11115 (2021)

A Graph Convolution Neural Network for User-Group Aided Personalized Session-Based Recommendation

Hui Wang, Hexiang Bai(B), Jun Huo, and Minhu Yang

Key Laboratory of Computational Intelligence and Chinese Information Processing, Ministry of Education, School of Computer and Information Technology, Shanxi University, Taiyuan 030006, China
[email protected]

Abstract. Session-based recommendation systems aim to predict the next user interaction based on the items with which the user interacts in the current session. Currently, graph neural network-based models have been widely used and proven more effective than others. However, these session-based models mainly focus on the user-item and item-item relations in historical sessions while ignoring information shared by similar users. To address the above issues, a new graph-based representation, the User-item Group Graph, which considers not only user-item and item-item but also user-user relations, is developed to take advantage of natural sequential relations shared by similar users. A new personalized session-based recommendation model is developed based on this representation. It first generates groups according to user-related historical item sequences and then uses a user group preference recognition module to capture and balance group-item preferences and user-item preferences. Comparison experiments show that the proposed model outperforms other state-of-the-art models when similar users are effectively grouped. This indicates that grouping similar users can help find deep preferences shared by users from the same group and is instructive in finding the most appropriate next item for the current user. Keywords: Recommendation system · Session-based recommendation · Graph convolution neural network

1 Introduction

As an important branch of recommendation systems, session-based recommendation systems (SBR) mainly focus on generating the most possible next item for the active session. SBR has been widely used in online service platforms [18]. Since no profiles and long-term behavior records of users are available, the interactions of restricted length within the active session are the only information available [3]. Currently, the effective inference of the correct next item only using these interactions has become a challenging task and attracts a great deal of attention.


Generally, the relations among multiple sequential items are complex in SBR. Most related works attempt to model these relations from different perspectives. Traditionally, the transition probability was used to model these relations under the framework of Markov Chains [14,15,24]. However, the relations hidden in user sessions consist not only of direct relations, which can be modeled using transition probability, but also of indirect high-order relations and relations concerning hidden variables. To address this issue, many researchers have attempted to model these complex relations with the help of Recurrent Neural Networks (RNNs) [2,4,7,13] or CNNs [16]. These models have been proven more effective than transition probability-based models. Nevertheless, user preferences always change over time and can hardly be effectively captured by RNN or CNN. In recent years, more and more SBRs have been designed based on Graph Neural Networks (GNN) [10,11,19], whose topology structure is well suited for the representation and inference of complex relations among items. Generally, there are three major learning strategies for updating GNN-based networks: Gated Graph Neural Networks (GGNN) [5], GAT [17], and GCN [20]. GGNN focuses on the input and output edges of the current node, and GAT mainly uses the information from the immediate neighboring nodes. Different from the above two updating strategies, GCN can learn high-order neighboring node relations in a single update and has been widely adopted by most recent GNN-based SBRs. Currently, besides taking within-session information into account, some GCN-based approaches [10] attempt to introduce the user's historical sessions into the network to provide personalized recommendations. These methods need all available sessions related to a user to infer the next item for the current session. In the early stages, some models, such as H-RNN [12] and A-PGNN [23], collected all historical sessions of the current user to help predict the next item. However, these models ignored the general preference for the natural sequence of items that is shared by different users. For example, almost every user is going to order a mouse after buying a laptop. This is a general preference shared by almost all users, even if the current user did not buy any laptops in historical sessions. To fully capture these natural sequential relations among items across all users' historical sessions, HG-GNN [10] first proposed using a heterogeneous global graph neural network (HGNN) to encode all user and item nodes. In this way, the global item transition patterns were embedded in generating a personalized SBR. Although the HGNN representation is much more informative than previous models, it is more vulnerable to the well-known over-smoothing effect in GCN due to its complex graph structure. In HG-GNN, each user node and item node not only has connections to other nodes in the current session or user-specific historical sessions but is also connected to items from the historical sessions of other users. Therefore, the local graph of each node is much more complex than before. As a result, after several updates, the node-specific personalized information will be smoothed by global item-transition pattern information. This will in turn lead to poor performance.


To address this issue, a new graph-based representation of the interactions among users and items, the User-item Group Graph (UGG), is developed in this paper, since in practical applications most natural sequential relations only take effect within a group of users rather than across all users. For example, boys may prefer to buy beer after a football ticket, while girls may prefer lipstick after a handbag. Grouping users with similar item sequences is sufficient for modeling natural sequences of items and is effective in reducing graph redundancy at the same time. The UGG creates several sub-graphs for different groups of users with similar item sequences, and each sub-graph is constructed in the same way as in HG-GNN. Comparison and ablation experiments have been conducted on three real-world datasets to validate the effectiveness of grouping similar users during recommendation as well as its limitations. In summary, the main contributions of this work are as follows:
1) A new user-item group graph representation is proposed for reducing redundancy in modeling natural item relations. This new representation conforms more closely to the reality of SBR than HG-GNN and alleviates the over-smoothing issue, as the graph convolution is only performed within the respective subgraph.
2) A hierarchical gating mechanism is proposed to effectively capture user preferences by modeling group-item relations besides the user-item and item-item relations.

2 Related Works

Deep Learning-Based Methods. As deep learning methods have become increasingly popular, researchers have turned to RNNs to effectively model sequential data patterns, achieving remarkable outcomes in recommender systems. GRU4REC [2] broke new ground in session recommendation by introducing the application of RNNs. In contrast to conventional session-based recommendation models that overlook distant historical behavioral information, GRU4REC adeptly models users' behavioral sequences and effectively captures long-term dependencies. NARM [4] captures the user's main purpose in an active session by incorporating an attention mechanism with an RNN.
GNN-Based Methods. Graph neural networks (GNNs) have gained considerable attention in recent years as they offer a powerful way to model complex relations among items. SR-GNN [21] leverages the potential of GNNs by considering items in a session sequence as nodes. It constructs a session graph based on the sequence of clicked items and employs a gated graph neural network to propagate information across the session graph. Based on SR-GNN, GC-SAN [22] uses a self-attention mechanism to capture long-range dependencies between items in a session. SGNN-HN [9] adds extra nodes to the session graph to connect all non-adjacent items. For the same graph structure, there may be different session sequences, and LESSR [1] preserves information about the order of the edges


in the session graph. GCE-GNN [19] proposes a global graph-enhanced node representation that takes into account pairwise transitions of similar items from all historical sessions. However, these methods assume that the user is anonymous, treat multiple sessions of the current user as unrelated sessions, and therefore do not take the user's historical sessions into account.
Personalized Session-Based Recommendation. Most personalized session-based recommendation methods take the historical sessions of the current user into account. H-RNN [12] designs a hierarchical RNN mechanism with session-level and user-level components to extract users' interest preferences over time. Similar to the way SR-GNN constructs a session graph, A-PGNN [23] involves all sessions of the current user in the construction of the graph, further strengthening the connection between different sessions of the current user. Valuable item transitions exist not only in the current user's sessions but also in the interactions of other users. HG-GNN [10] therefore proposes to construct a global heterogeneous graph to exploit the historical sessions of all users, further extending the scope of item transition relationships. However, these methods ignore the item sequence information shared by similar users. Grouping similar users is expected to further improve the performance of personalized session-based recommendation.

3 Method

3.1 Preliminaries

Assume U and V are the sets of all users and all items, respectively. For any u ∈ U and v ∈ V, denote e_u ∈ R^d and e_v ∈ R^d as the initial user and item embedding vectors, where d is the dimension of the embedding. Each user u generally has multiple historical sessions, and the interactions in the j-th session are denoted as S_{u,j} = {v_{u,j,1}, ..., v_{u,j,n}}, where v_{u,j,l} ∈ V denotes the l-th interaction item of user u and n is the length of S_{u,j}. All historical session interactions of u constitute a set S_u = {S_{u,1}, S_{u,2}, S_{u,3}, ...}. The difference between different GNN-based SBRs lies in the way users and items are connected into a graph. Take HG-GNN as an example. User u and item v have an edge between them if and only if v ∈ S_{u,j} and S_{u,j} ∈ S_u. Given any user u, all items in each session S_{u,j} are connected with each other by an edge. The weight of the edge between v_{u,j,i} and v_{u,j,i+1} is the number of times these two items are observed in this exact order across all sessions, while the weights of the other edges are calculated from their co-occurrence frequency [10]. For the current session C_u of user u with t items, the personalized SBR model is designed to predict the next possible items based on S_u and the current session C_u = {v_{u,c,1}, ..., v_{u,c,t}}. The model scores each candidate item v ∈ V as ŷ = (ŷ_1, ŷ_2, ..., ŷ_{|V|}), where |V| is the number of items; finally, the top-k most possible items are recommended to u. For example, HG-GNN uses three modules to score all items. It first converts the graph representation into


embedding representations of users and items through the HGNN module, then aggregates item relation information with user preferences with the help of the gating mechanism in the personalized session encoder module, and finally scores all items using a softmax function.
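The sequential edge weights described above can be accumulated with a simple counter; the sketch below is illustrative and assumes that sessions are plain lists of item ids.

```python
# Sketch: weight of the directed edge (v_i -> v_{i+1}) is the number of times
# this strict succession is observed across all historical sessions.
from collections import Counter

def sequential_edge_weights(all_sessions):
    """all_sessions: iterable of item-id lists, one list per session."""
    weights = Counter()
    for session in all_sessions:
        for a, b in zip(session, session[1:]):
            weights[(a, b)] += 1
    return weights

# Example: weights[("laptop", "mouse")] counts how often a mouse follows a laptop.
```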

3.2 User-Item Group Graph Based GCN

To make the best of the information shared by similar users, a new SBR model, User-item Group Graph based GCN (UGG-GCN), is proposed in this section. Different from other models, a User-item Group Graph Generation module (UGG-gen) is used to group similar users using the connections between users and items in the preprocessing stage. The general structure of UGG-GCN, which is similar to HG-GNN, is shown in Fig. 1. First of all, the model converts all the subgraphs into their embedding representations through several HGNN modules. Next, the item information from different subgraphs is aggregated through the Cross-group item information merging (CIM) to merge shared item information from different user-item groups. Subsequently, a User Group Preference Recognition (UGPR) module extracts user and group preferences on item relations through a gating mechanism. Finally, the score for each item is assigned using a softmax function in terms of the user and group preferences.

Fig. 1. Overall framework of the model.

User-Item Group Graph Generation. In personalized SBR, grouping similar users is instructive for effectively extracting user preferences. Grouping similar users avoids most conflicts between item sequences, which in turn alleviates the over-smoothing effect of the HGNN module. Inspired by the user grouping method for bipartite graphs [6], a subgraph generation module is used to group users and generate a subgraph for each group based on the graph representation of users and item sequences in HG-GNN. As shown in Fig. 2, this module first constructs a feature for each user from its ID and the corresponding item sequences using F_u = σ(W_1(e_u^(0) + e_u^(1)) + b_1), where σ is the activation function, W_1 ∈ R^{d×d} and b_1 ∈ R^d are the weighting matrix and bias,


respectively, e_u^(0) is the embedding of the user ID, and e_u^(1) = Σ_{v∈N_u} e_v^(0) / |N_v|. Here N_u is the union of the item sets in all historical sessions related to u, |N_v| is the number of neighbor nodes of item v, and e_v^(0) is the embedding of item v. Next, similar to [6], users are grouped using a two-layer neural network. In this way, each user is grouped into one and only one group. After dividing users into k groups, the users as well as their item sequences are reorganized into k HGNN graphs, one for each group, namely the User-item Group Graph. Each node may eventually have multiple embeddings; in the following sections, nodes with multiple embeddings are represented by the average of these embeddings. Furthermore, a higher number of constructed subgraphs leads to longer processing time.

Fig. 2. User-item group graph generation module.
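A sketch of this grouping step is given below. The hard argmax assignment and the ReLU hidden layer are simplifying assumptions; the exact grouping network of [6] may differ.

```python
# Sketch: F_u = sigma(W1 (e_u^(0) + e_u^(1)) + b1), then a small two-layer
# network assigns each user to exactly one of k groups.
import torch
import torch.nn as nn

class UserGrouper(nn.Module):
    def __init__(self, d: int, k: int):
        super().__init__()
        self.feat = nn.Sequential(nn.Linear(d, d), nn.Sigmoid())            # W1, b1, sigma
        self.assign = nn.Sequential(nn.Linear(d, d), nn.ReLU(), nn.Linear(d, k))

    def forward(self, e_user: torch.Tensor, item_neighbor_emb: torch.Tensor):
        """e_user: (B, d) user-id embeddings; item_neighbor_emb: (B, d) aggregated
        embeddings of the items the user interacted with (e_u^(1))."""
        f_u = self.feat(e_user + item_neighbor_emb)   # F_u
        logits = self.assign(f_u)                     # (B, k)
        return logits.argmax(dim=-1)                  # hard group id per user
```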

Cross-Group Item Information Merging. Although dividing users into k groups makes it possible to learn node embeddings that take group preferences into consideration, it brings two new issues to be addressed in item recommendation. One is the possible conflicts among item sequences from different subgraphs, and the other is the absence of global item sequence information in the embedding of item nodes in each subgraph. Therefore, besides learning item embeddings from the item sequences within each subgraph, it is important to merge cross-group item information for finding rational recommendations. Generally, denote e_{vs}^(0) as the initial embedding vector of item v in the s-th subgraph learned from UGG-gen, and e_{vs}^(m) as the result of m GCN convolutions of e_{vs}^(0). Following the idea from [6], the information of item v from the different subgraphs is merged into one embedding representation of v using e_v^(m) = Σ_{s=1}^{k} e_{vs}^(m).
User Group Preference Recognition. Before scoring each item in UGG-GCN, it is important to take full advantage of user and group preferences. To this end, a gating system with three sub-modules, i.e., the UGPR module, is developed. These sub-modules are designed to capture group-item preferences, extract user-item preferences, and balance the contributions of these two preferences. Because the embeddings of both users and items learned in the previous modules carry group information, the group preferences are passed to both submodules by nature. The whole module is shown in Fig. 3. Similar to HG-GNN, the group-item preference capturing submodule


Fig. 3. The overview of user group preference recognition.

(see the bottom part of Fig. 3) begins with concatenating the item embedding with its location information via e'_v^(m) = W_c [e_v^(m) || l_v], where the position embedding l_v ∈ R^d [10] and W_c ∈ R^{d×2d} are trainable parameters. As e'_v^(m) is constructed from the subgraphs of different groups, it carries not only item sequence information but also group information, compared with HG-GNN. Therefore, e'_v^(m) partly captures the group-item preference. Next, a linear combination of the e'_v^(m) of every item in the current session forms the session embedding S_g with group-item information: S_g = (1/|C_u|) Σ_{v∈C_u} e'_v^(m). Subsequently, an attention mechanism is used to select important items v ∈ C_u with group information for making the best prediction. The importance of item v is calculated as α_v = softmax_v(ω_0^T σ(W_2 e'_v^(m) + W_3 S_g + b_2)), where σ(·) is the activation function and W_2, W_3 ∈ R^{d×d}, ω_0 ∈ R^d, and b_2 ∈ R^d are trainable parameters. Finally, the current session group-item preference is G_p = Σ_{v∈C_u} α_v e'_v^(m).
The user-item preference extracting submodule (see the top part of Fig. 3) aims at extracting user-related item features for the current session. Following the idea of [8], this submodule fuses the item embedding e_v^(m) and the user embedding e_u^(m) to find the linear combination of users and items that best fits the correct recommendation, i.e., the user-item preference e_v^F(m) = e_v^(m) · σ(W_{g1} e_v^(m) + W_{g2} e_u^(m) + b_g), where W_{g1} ∈ R^{d×d}, W_{g2} ∈ R^{d×d}, and b_g ∈ R^d are trainable parameters and · is the element-wise product. Similar to the group-item capturing submodule, the learned user-item preference e_v^F(m) and e_u^(m) undergo an attention module to help extract salient user-item features for finding the most possible recommendations, using e_v^I = e_v^F(m) · σ(W_{g3} e_v^F(m) + e_u^(m)T W_{g4}), where W_{g3} ∈ R^{d×d} and W_{g4} ∈ R^d are learnable parameters. Finally, a linear combination of e_v^I over the items in the current session is used as the session user-item preference U_p = Σ_{v∈C_u} β_v e_v^I,

where β_v = softmax_v(ω_1^T σ(W_4 e_v^I + W_5 e_u^(m) + b_3)) and W_4, W_5 ∈ R^{d×d}, ω_1 ∈ R^d, and b_3 ∈ R^d are trainable parameters.
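Both readouts above follow the same additive-attention pattern. A compact sketch of the group-item variant (α_v and G_p) is shown below, assuming a sigmoid activation for σ; class and variable names and shapes are illustrative.

```python
# Sketch: additive-attention readout of a session into the group-item preference G_p.
import torch
import torch.nn as nn

class GroupItemReadout(nn.Module):
    def __init__(self, d: int):
        super().__init__()
        self.w2 = nn.Linear(d, d)                 # W_2, b_2
        self.w3 = nn.Linear(d, d, bias=False)     # W_3
        self.omega = nn.Linear(d, 1, bias=False)  # omega_0

    def forward(self, e_items: torch.Tensor):     # (t, d) item embeddings of one session
        s_g = e_items.mean(dim=0, keepdim=True)   # session mean S_g
        scores = self.omega(torch.sigmoid(self.w2(e_items) + self.w3(s_g)))  # (t, 1)
        alpha = torch.softmax(scores, dim=0)      # attention weights alpha_v
        return (alpha * e_items).sum(dim=0)       # G_p, shape (d,)
```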


In the end, the contribution balancing submodule (see the right part of Fig. 3) uses a gating mechanism to merge the user and group preferences on the next items. It first concatenates the current session group-item preference G_p and the current session user-item preference U_p as D_ug = [U_p || G_p]; then σ(W_s D_ug) and 1 − σ(W_s D_ug) are used as the weights of U_p and G_p, respectively, where W_s ∈ R^{d×2d} is a trainable parameter, i.e., S_ug = σ(W_s D_ug) · U_p + (1 − σ(W_s D_ug)) · G_p.
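This gate is a standard sigmoid-weighted mixture; a minimal PyTorch sketch with illustrative names is:

```python
# Sketch: S_ug = g * U_p + (1 - g) * G_p with g = sigmoid(W_s [U_p || G_p]).
import torch
import torch.nn as nn

class PreferenceGate(nn.Module):
    def __init__(self, d: int):
        super().__init__()
        self.ws = nn.Linear(2 * d, d, bias=False)   # W_s in R^{d x 2d}

    def forward(self, u_p: torch.Tensor, g_p: torch.Tensor):
        g = torch.sigmoid(self.ws(torch.cat([u_p, g_p], dim=-1)))
        return g * u_p + (1.0 - g) * g_p            # S_ug
```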

3.3 Prediction and Loss Function

After obtaining the final session embedding S_ug, it is possible to calculate the probability distribution of the next item. The recommendation probability, denoted as ŷ, can be computed for the candidate items of the current session:

$\hat{y}_i = \mathrm{softmax}\!\left(S_{ug}^{\top} e_{v_i}^{(0)}\right)$   (1)

where ŷ_i ∈ ŷ represents the likelihood that the user will click on item v_i ∈ V in the next interaction. The optimization objective function can be formulated as the cross-entropy loss:

$\mathcal{L}(\hat{y}) = -\sum_{i=1}^{|V|} \left[\, y_i \log(\hat{y}_i) + (1 - y_i)\log(1 - \hat{y}_i) \,\right]$   (2)

where y ∈ R^{|V|} is a one-hot vector of the ground truth.
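Equations (1) and (2) translate directly into a few lines of PyTorch; the snippet below is a sketch with illustrative tensor names and an epsilon added for numerical stability.

```python
# Sketch: score every candidate item against the session embedding and compute the loss.
import torch

def score_and_loss(s_ug: torch.Tensor, item_emb: torch.Tensor, target: torch.Tensor):
    """s_ug: (d,); item_emb: (|V|, d) initial item embeddings; target: one-hot (|V|,)."""
    y_hat = torch.softmax(item_emb @ s_ug, dim=0)                        # Eq. (1)
    eps = 1e-12
    loss = -(target * torch.log(y_hat + eps)
             + (1 - target) * torch.log(1 - y_hat + eps)).sum()          # Eq. (2)
    return y_hat, loss
```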

4 Experiments

4.1 Dataset

UGG-GCN is validated against other models on three publicly available datasets. The first dataset, Last.fm, contains historical song-list records of users, in which all records outside the 40,000 most popular artists and groups are removed, and the records of the same user within 8 hours constitute a session. The second dataset, Xing, is constituted by the interactions of 770,000 users with job postings over 80 days. The third dataset, Reddit, is composed of tuples of a user name, a subreddit where the user comments on a thread, and a timestamp of the interaction. Similar to [10,12], items with fewer than 3 occurrences are removed, and only sessions with length greater than three are preserved. In all experiments, 80% of the sessions are used as training data and the remaining sessions are used as test data.

4.2 Evaluation Metrics

Following [19,21], Hit Ratio (HR@k) and Mean Reciprocal Rank (MRR@k) are used to evaluate the recommendation performance in the experiments, where k represents the number of most possible recommended items adopted in accuracy


assessment. No matter which metric is used, a correct recommendation means that the ground-truth next item appears in the top k recommended items. HR@k is the ratio of correctly recommended items in the test dataset, while MRR@k is the average reciprocal rank of the correct items within the top k predicted items. The larger these two indices are, the better the recommendation results. In all experiments, both accuracy measures are reported for k = 5 and k = 10.
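One common way to compute these two measures for a batch of test sessions is sketched below (NumPy, illustrative function name; values are returned as percentages to match the tables).

```python
# Sketch: HR@k and MRR@k for a batch of predictions.
import numpy as np

def hr_mrr_at_k(scores: np.ndarray, targets: np.ndarray, k: int = 10):
    """scores: (num_sessions, num_items); targets: (num_sessions,) ground-truth item ids."""
    topk = np.argsort(-scores, axis=1)[:, :k]      # indices of the k highest-scored items
    hits, rr = [], []
    for row, t in zip(topk, targets):
        pos = np.where(row == t)[0]
        hits.append(len(pos) > 0)                  # hit if the true item is in the top k
        rr.append(1.0 / (pos[0] + 1) if len(pos) else 0.0)
    return float(np.mean(hits)) * 100, float(np.mean(rr)) * 100
```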

4.3 Implementation Details

UGG-GCN is implemented using PyTorch 1.7.1 and DGL. The Adam optimizer is adopted in the training phase. The embedding dimension of items and users is set to 256. The number of subgraphs and the number of graph convolutional layers are both chosen from {2, 3, 4, 5, 6, 7}. Each experiment is trained for 20 epochs with a batch size of 648. The initial learning rate and the dropout rate are set to 0.01 and 0.5, respectively. All experiments are carried out on a computer with an NVIDIA RTX 3090 GPU.

4.4 Ablation Studies

The Last.fm dataset contains only 992 users, which is less than 1/10 of the users in the other two datasets. This may result in inadequate information being available for recommendation in each subgraph after grouping. Therefore, this dataset is not suitable for evaluating the effectiveness of the different modules of UGG-GCN. As a result, the Xing and Reddit datasets are used to demonstrate the effectiveness of the UGG-gen and UGPR modules in finding the best next item. HR@10 and MRR@10 are used to compare the accuracy. The experimental results for the different components are shown in Table 1.

Table 1. Ablation studies on two datasets.

Methods     Xing               Reddit
            HR@10   MRR@10     HR@10   MRR@10
HG-GNN      20.30   12.79      60.51   36.89
+UGG-gen    22.13   13.79      61.48   37.93
+UGPR       21.66   13.13      61.19   37.83
+ALL        22.32   14.33      62.06   38.41

Effectiveness of UGG-Gen. The use of the UGG-gen module increases the accuracy measures by at least 1% on the Xing dataset and 0.97% on the Reddit dataset, and by at most 1.83% on the Xing dataset and 1.04% on the Reddit dataset. All these results suggest the effectiveness of the UGG-gen module, which groups similar users and shares item sequence information within each subgraph. This is instructive in embedding group preference on item sequences, which in turn improves the recommendation accuracy.


Effectiveness of UGPR. The HR@10 and MRR@10 increase by 1.36% and 0.34%, respectively, on the Xing dataset; on the Reddit dataset, the HR@10 and MRR@10 increase by 0.68% and 0.94%, respectively. This indicates that the UGPR module is superior to the personalized session encoder in HG-GNN. This is because the UGPR module encodes the user preference on items and on item sequences with two individual submodules, which is more effective for merging user preference and item sequence information than processing both kinds of information simultaneously in one module. In addition, cooperating with the UGG-gen module makes it possible to add group preference information on items and item sequences into the UGPR module, as the user and item embeddings from the UGG-gen module carry the corresponding group information. Therefore, as shown in the last row of Table 1, the recommendation accuracy is highest when both modules are used together.

4.5 Comparison with Other Models

UGG-GCN is compared with six popular SBR models on all three datasets. These models can be roughly divided into two groups. The first group includes three models with no personal preference, i.e., SR-GNN [21], LESSR [1], and GCE-GNN [19]. The remaining three models, H-RNN [12], A-PGNN [23], and HG-GNN [10], all take user preference into account. In the comparison, HR@5, HR@10, MRR@5, and MRR@10 are used to evaluate the performance of all seven models. Tables 2 and 3 show the comparison results on these datasets. UGG-GCN achieves the best performance on both the Xing and Reddit datasets. Compared to the non-personalized models, results from UGG-GCN increase by a minimum of 10.45% and a maximum of 19.68% on the Reddit dataset. On the Xing dataset, the minimum and maximum increments of accuracy are 1.46% and 5.99%, respectively. Compared with personalized recommendation models, UGG-GCN improves by a maximum of 8.62% and 8.65% on the Reddit and Xing datasets, respectively. In comparison with the most recent model, HG-GNN, the recommendation accuracy still increases by at least 1.4% on both datasets.

Table 2. Comparison with other SBR methods (HR@k).

Method      Reddit            Xing              Last.fm
            HR@5    HR@10     HR@5    HR@10     HR@5    HR@10
SR-GNN      34.96   42.38     13.38   16.71     11.89   16.90
LESSR       36.03   43.27     14.84   16.77     12.96   17.88
GCE-GNN     36.30   45.16     16.98   20.86     12.83   18.28
H-RNN       44.76   53.44     10.72   14.36     10.92   15.83
A-PGNN      49.10   58.23     14.23   17.01     12.10   17.13
HG-GNN      51.08   60.51     17.25   20.30     13.09   19.39
UGG-GCN     52.48   62.06     19.37   22.32     12.95   18.70

Table 3. Comparison with other SBR methods (MRR@k).

Method      Reddit            Xing              Last.fm
            MRR@5   MRR@10    MRR@5   MRR@10    MRR@5   MRR@10
SR-GNN      25.90   26.88     8.95    9.39      7.23    7.85
LESSR       26.45   27.41     11.98   12.13     8.24    8.82
GCE-GNN     26.65   27.70     11.14   11.65     7.60    8.32
H-RNN       32.13   33.29     7.22    7.74      6.71    7.39
A-PGNN      33.54   34.62     10.26   10.58     7.37    8.01
HG-GNN      35.46   36.89     12.23   12.79     7.35    8.18
UGG-GCN     37.10   38.41     13.84   14.33     7.51    8.27

These results show that grouping users by item sequences is conducive to extracting shared item sequence preferences from similar users. This group preference sheds light on deep user preferences which are neglected by other methods. Therefore, the grouping of users contributes to finding better next items for the current user and can increase the recommendation accuracy. However, UGG-GCN relies on an effective grouping of users. When the number of users is not large enough to generate rational groups for the recommendation, the recommendation accuracy may decrease rather than increase. An example is shown in the comparison experiments on the Last.fm dataset, in which the number of users is less than 1/10 of that of the Xing and Reddit datasets. According to the results shown in Tables 2 and 3, introducing user preference does not ensure more accurate results there. The MRR@5 and MRR@10 results suggest that the non-personalized models LESSR and GCE-GNN outperform all other models. This is because the limited number of users cannot support effective modeling of user preference; similarly, it also limits the correct modeling of the group information hidden in the data. As a result, UGG-GCN is also inferior to the non-personalized models on this dataset. In addition, compared with HG-GNN, UGG-GCN gains better performance on the MRR@k metrics but worse performance on the HR@k metrics. This also indicates that ineffective grouping cannot ensure better performance.

4.6 The Best Number of Groups

The number of subgraphs is an important parameter for generating an effective grouping of users. A sensitivity analysis is performed on all three datasets to find the best number of groups. During the experiment, the number of groups is set to two through seven, and all the other parameters are the same as in the previous experiments. HR@10 is used as the accuracy measure. The trend of the accuracy over the number of subgraphs in each dataset is shown in Fig. 4. In the Last.fm dataset, as stated before, it is hard to obtain effective user groups with a limited number of users, and the prediction accuracy generally declines as the number of groups increases. However, when there is a sufficient number of users, for example in the Xing and Reddit datasets, the prediction accuracy increases and reaches its maximum when the number of groups is three. When the number of groups keeps increasing, the prediction accuracy decreases steadily. The reason for the same best number of groups is that these two datasets contain a comparable number of users: 11,479 for the Xing dataset and 18,271 for the Reddit dataset. Compared with Last.fm, these two datasets have a much larger number of users, resulting in a larger optimal number of groups. This suggests that as the number of users increases, the optimal number of groups should also increase. In our future work, we will further explore and analyze this issue using more datasets.

Fig. 4. Effectiveness of number of groups.

5 Conclusion

Based on the fact that grouping users is instructive in revealing deep user-related item sequence preferences, a new SBR model, UGG-GCN, is proposed to help predict the most probable next item for the current session. In UGG-GCN, users with similar item sequences are grouped and constitute several subgraphs. Next, the UGPR module attempts to capture group-item preferences, extract user-item preferences, and balance the contributions of these two preferences based on the user and item embeddings learned from the subgraphs. Comparison experiments show that UGG-GCN outperforms other personalized or non-personalized models as long as the group information is effectively captured.

Acknowledgements. The work is supported by the National Natural Science Foundation of China (No. 41871286) and the 1331 Engineering Project of Shanxi Province, China.

References 1. Chen, T., Wong, R.C.W.: Handling information loss of graph neural networks for session-based recommendation. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1172–1180 (2020). https://doi.org/10.1145/3394486.3403170


2. Hidasi, B., Karatzoglou, A., Baltrunas, L., Tikk, D.: Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939 (2015). https://doi.org/10.48550/arXiv.1511.06939 3. Lai, S., Meng, E., Zhang, F., Li, C., Wang, B., Sun, A.: An attribute-driven mirror graph network for session-based recommendation. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1674–1683 (2022). https://doi.org/10.1145/3477495.3531935 4. Li, J., Ren, P., Chen, Z., Ren, Z., Lian, T., Ma, J.: Neural attentive session-based recommendation. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 1419–1428 (2017). https://doi.org/10.1145/ 3132847.3132926 5. Li, Y., Tarlow, D., Brockschmidt, M., Zemel, R.: Gated graph sequence neural networks. arXiv preprint arXiv:1511.05493 (2015). https://doi.org/10.48550/arXiv. 1511.05493 6. Liu, F., Cheng, Z., Zhu, L., Gao, Z., Nie, L.: Interest-aware message-passing gcn for recommendation. In: Proceedings of the Web Conference 2021, pp. 1296–1305 (2021). https://doi.org/10.1145/3442381.3449986 7. Liu, Q., Zeng, Y., Mokhosi, R., Zhang, H.: Stamp: short-term attention/memory priority model for session-based recommendation. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1831–1839 (2018). https://doi.org/10.1145/3219819.3219950 8. Ma, C., Kang, P., Liu, X.: Hierarchical gating networks for sequential recommendation. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 825–833 (2019). https://doi.org/10. 1145/3292500.3330984 9. Pan, Z., Cai, F., Chen, W., Chen, H., De Rijke, M.: Star graph neural networks for session-based recommendation. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 1195–1204 (2020). https://doi.org/10.1145/3340531.3412014 10. Pang, Y., et al.: Heterogeneous global graph neural networks for personalized session-based recommendation. In: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, pp. 775–783 (2022). https://doi.org/ 10.1145/3488560.3498505 11. Qiu, R., Li, J., Huang, Z., Yin, H.: Rethinking the item order in session-based recommendation with graph neural networks. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 579– 588 (2019). https://doi.org/10.1145/3357384.3358010 12. Quadrana, M., Karatzoglou, A., Hidasi, B., Cremonesi, P.: Personalizing sessionbased recommendations with hierarchical recurrent neural networks. In: Proceedings of the Eleventh ACM Conference on Recommender Systems, pp. 130–137 (2017). https://doi.org/10.1145/3109859.3109896 13. Ren, P., Chen, Z., Li, J., Ren, Z., Ma, J., De Rijke, M.: RepeatNet: a repeat aware neural recommendation machine for session-based recommendation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 4806–4813 (2019). https://doi.org/10.1609/aaai.v33i01.33014806 14. Rendle, S., Freudenthaler, C., Schmidt-Thieme, L.: Factorizing personalized markov chains for next-basket recommendation. In: Proceedings of the 19th International Conference on World Wide Web, pp. 811–820 (2010). https://doi.org/10. 1145/1772690.1772773 15. Shani, G., Brafman, R.I., Heckerman, D.: An MDP-based recommender system. arXiv e-prints arXiv:1301.0600 (2012)


16. Tang, J., Wang, K.: Personalized top-n sequential recommendation via convolutional sequence embedding. In: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, pp. 565–573 (2018). https://doi.org/ 10.1145/3159652.3159656 17. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017) 18. Wang, S., Cao, L., Wang, Y., Sheng, Q.Z., Orgun, M.A., Lian, D.: A survey on session-based recommender systems. ACM Comput. Surv. (CSUR) 54(7), 1–38 (2021). https://doi.org/10.1145/3465401 19. Wang, Z., Wei, W., Cong, G., Li, X.L., Mao, X.L., Qiu, M.: Global context enhanced graph neural networks for session-based recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 169–178 (2020). https://doi.org/10.1145/ 3397271.3401142 20. Wu, F., Zhang, T., Holanda de Souza A., Jr., Fifty, C., Yu, T., Weinberger, K.Q.: Simplifying graph convolutional networks. arXiv e-prints arXiv:1902.07153 (2019) 21. Wu, S., Tang, Y., Zhu, Y., Wang, L., Xie, X., Tan, T.: Session-based recommendation with graph neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 346–353 (2019). https://doi.org/10.1609/aaai. v33i01.3301346 22. Xu, C., et al.: Graph contextualized self-attention network for session-based recommendation. In: IJCAI International Joint Conference on Artificial Intelligence, p. 3940 (2019). https://doi.org/10.24963/ijcai.2019/547 23. Zhang, M., Wu, S., Gao, M., Jiang, X., Xu, K., Wang, L.: Personalized graph neural networks with attention mechanism for session-aware recommendation. IEEE Trans. Knowl. Data Eng. 34(8), 3946–3957 (2020). https://doi.org/10.1109/ TKDE.2020.3031329 24. Zimdars, A., Chickering, D.M., Meek, C.: Using temporal data for making recommendations. arXiv preprint arXiv:1301.2320 (2013)

Disentangling Node Metric Factors for Temporal Link Prediction

Tianli Zhang1,2, Tongya Zheng1,3,4, Yuanyu Wan1,3,4(B), Ying Li5, and Wenqi Huang6

1 Zhejiang University, Hangzhou 310027, China
{zhangtianli,tyzheng,wanyy}@zju.edu.cn
2 Zhejiang University - China Southern Power Grid Joint Research Centre on AI, Hangzhou 310058, China
3 ZJU-Bangsun Joint Research Center, Hangzhou, China
4 Shanghai Institute for Advanced Study of Zhejiang University, Hangzhou, China
5 Zhejiang Bangsun Technology Co. Ltd., Hangzhou 310012, China
[email protected]
6 Digital Grid Research Institute, China Southern Power Grid, Guangzhou 510663, China
[email protected]

Abstract. Temporal Link Prediction (TLP), as one of the highly concerned tasks in graph mining, requires predicting the future link probability based on historical interactions. On the one hand, traditional methods based on node metrics, such as Common Neighbor, achieve satisfactory performance in the TLP task. On the other hand, node metrics overly focus on the global impact of nodes while neglecting the personalization of different node pairs, which can sometimes mislead link prediction results. However, mainstream TLP methods follow the standard paradigm of learning node embedding, entangling favorable and harmful node metric factors in the representation, reducing the model's robustness. In this paper, we propose a plug-and-play plugin called Node Metric Disentanglement, which can apply to most TLP methods and boost their performance. It explicitly accounts for node metrics and disentangles them from the embedding representations generated by TLP methods. We adopt the attention mechanism to reasonably select information conducive to the TLP task and integrate it into the node embedding. Experiments on various state-of-the-art methods and dynamic graphs verify the effectiveness and universality of our NMD plugin.

Keywords: Dynamic Graphs · Temporal Link Prediction · Disentangled Representation Learning · Node Metric

1 Introduction

Graph data, particularly dynamic graphs, have grown exponentially in recent years, arousing widespread attention from researchers and practitioners [25,27,28]. Dynamic graphs can represent the changeable interactions of nodes with


time elapsing in real-graph scenarios, such as social networks [23], academic networks [31], transaction networks [16,17], etc. One of the most challenging issues related to dynamic graphs is temporal link prediction (TLP), which aims to estimate the likelihood of paired nodes being connected in the future [6]. It helps to track the dynamic characteristics and reveal the evolutionary patterns of the system. Some applications of TLP tasks include analyzing community clusters in social networks [5], revealing author collaboration trends in academic networks [15], and predicting commercial intercourse in transaction networks [18]. Traditional link prediction methods utilize the property metrics of nodes to predict links, such as similarity-based algorithms using degree or common neighbors [19]. Preferential Attachment [3] explicitly considers the degree centrality of nodes. Several heuristic algorithms like the Jaccard Coefficient [2], Adamic Adar Index [1], and Resource Allocation index [37] consider the common neighbors to predict a connection. On the one hand, node metrics reflect the global impact and promote link prediction in specific scenarios. For example, a new manuscript may prefer papers with more citations in their references. On the other hand, node metrics neglect the personalization of different node pairs, such as music application users preferring niche songs over popular songs on the list. It is crucial to handle node metrics information in link prediction tasks properly. The interaction of many complex factors, including node metrics, usually drives temporal link changes in dynamic graph systems. Existing state-of-the-art methods, including GCN-GRU [29], EvolveGCN [25], DySAT [28], etc., predict temporal link existence by learning node embedding from the ego network based on various specific Graph Neural Networks. However, these TLP methods mix different factors into a shared embedding representation space, which may introduce harmful node metrics to temporal link prediction tasks. Numerous works in multiple fields [21,34] show that disentangling various information factors in embedding is conducive to better representation learning and model generalization. It inspires us to disentangle node metrics from embedding representations and integrate information conducive to link prediction into the embedding by a fusion module. In this paper, we aim to design a disentanglement plugin that can be applied to most TLP methods, disentangling node metric factors and improving their performance on different dynamic graphs. To this end, we propose a dual-branch framework consisting of a temporal link prediction (TLP) branch and a node metric disentanglement (NMD) branch. The former describes a general structural-temporal framework for existing TLP methods, which can generate node representations based on historical graph snapshots. The latter uses multiple optional types of node metrics as factors, such as degree centrality, closeness centrality [8], PageRank [24], etc. We introduce a similarity decoupling loss to disentangle the node representations of different time slots under multiple metrics. It helps two branch focus on edge-level personalization and node-level global functionality, respectively. The TLP branch is optimized for an edge-level classification objective, and the NMD branch is optimized for a node-level regression objective. Finally, an attention-based fusion module is adopted for the two sets of historical embedding obtained from the


two branches, integrating information beneficial to the TLP task and exploring fine-grained representations. Our contributions are summarized as follows:
– To the best of our knowledge, we are the first to consider the positive and negative effects of node metric factors in representation learning on the temporal link prediction task.
– We propose a Node Metric Disentanglement (NMD) plugin that can disentangle node metrics in most TLP methods and generate fine-grained and more beneficial representations by the attention mechanism.
– Experiments on five dynamic graphs validate the universal improvement of our NMD plugin for existing SOTA methods. Elaborate ablation studies further verify the effectiveness of different components.

2 Related Work

Temporal Link Prediction. Some heuristics strategies based on graph topology are proposed to solve link prediction tasks, such as Katz [10], singular value decomposition (SVD) [7], and non-negative matrix factorization (NMF) [36]. Learning low-dimensional embedding on dynamic graphs is an emerging topic under investigation [4]. Dyngraph2Vec [9] uses recurrent neural networks (RNN) to learn each vertex’s complex transformation. Inspired by the progress of Graph Neural Networks (GNNs) [12], EvolveGCN [25] trains RNN parameters at each time step to dynamically update GCN parameters. STGSN [22] introduces the attention mechanism to model social networks’ spatial and temporal dynamicity. Disentangled Representation Learning. [20] proposes a disentangled multichannel convolutional layer and devises a neighborhood routing mechanism to disentangle the underlying factors behind a graph. [33] establishes a set of intent-aware graphs and chunked representations, disentangling representations of users and items at the granularity of user intents. [21] proposes disentangled variational autoencoder and a beam-search strategy to explicitly models the separation of macro and micro factors. [34] utilizes a mechanism that incorporates the micro-and macro-disentanglement in knowledge graphs. Node Metrics. Centrality is a classical measure of nodes in complex networks, including k-shell decomposition [13], closeness centrality [8], or betweenness centrality [8]. [32] uses the concept of diversity entropy to describe the relative frequency of access received by nodes and study the internal and external accessibility of nodes. [14] defines dynamic influence as the leading left eigenvector of a characteristic matrix that encodes the interaction between graph topology and dynamics. Adamic Adar index [1] is a similarity measure that assigns higher importance to common neighbors with lower degrees in a graph representation.

3 Method

3.1 Problem Statement

Most dynamic networks can be described as a weighted temporal graph $G(V, E)$ with a node set $V = \{v_i\}_{i=1}^{N}$ and an edge set $E \subseteq |V| \times |V|$, where $N = |V|$ is the number of unique vertices. An edge between nodes $v_i$ and $v_j$ with weight $w_{ij} \in \mathbb{R}$ at time step $t \in \mathbb{R}^{+}$ is written as $e_{ij} = (v_i, v_j, t, w_{ij}) \in E$. A graph $G$ can be split into a series of snapshots $\mathcal{G} = \{G_1, \ldots, G_T\}$. We transform $E_t$ into an adjacency matrix $A_t \in \mathbb{R}^{N \times N}$ according to the rule $(A_t)_{ij} = w_{ij}$ if the corresponding edge exists in $E_t$, and $0$ otherwise. The $d$-dimensional node feature matrix in $G_t$ is defined as $X_t \in \mathbb{R}^{N \times d}$.
The Temporal Link Prediction (TLP) task predicts the future link status based on historical snapshots. Formally, given $\delta$ snapshots $\{A_\tau, X_\tau\}_{\tau=t-\delta}^{t-1}$ at time $t$, our goal is to learn a function $f(\cdot)$ that predicts the adjacency matrix $A_t$:

$$\{A_\tau, X_\tau\}_{\tau=t-\delta}^{t-1} \xrightarrow{f(\cdot)} A_t, \quad t \in \{1+\delta, \ldots, T\} \qquad (1)$$
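As a small illustration of this setup (hypothetical helper; it assumes an edge list of (i, j, t, w) tuples and evenly sized time windows), the snapshot adjacency matrices could be built as follows:

```python
import numpy as np

def build_snapshots(edges, num_nodes, num_snapshots, t_max):
    """Split a temporal edge list into adjacency matrices A_1..A_T (hypothetical helper)."""
    adj = np.zeros((num_snapshots, num_nodes, num_nodes), dtype=np.float32)
    window = t_max / num_snapshots
    for i, j, t, w in edges:
        k = min(int(t // window), num_snapshots - 1)  # snapshot index for timestamp t
        adj[k, i, j] = w                               # (A_k)_{ij} = w_ij
    return adj

# Example: 4 nodes, 3 snapshots, edges as (src, dst, time, weight)
edges = [(0, 1, 0.5, 1.0), (1, 2, 1.2, 2.0), (2, 3, 2.7, 1.0)]
A = build_snapshots(edges, num_nodes=4, num_snapshots=3, t_max=3.0)
```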

3.2 Temporal Link Prediction Branch

The general paradigm of TLP methods is a sequential combination of a multi-layer structural encoder, a temporal encoder, and a link predictor [27]. For each historical snapshot $G_\tau$ at time $t$, the $l$-th layer of the structural encoder takes the adjacency matrix $A_\tau$ and the $(l-1)$-th output embedding $Z_\tau^{l-1}$ as input:

$$Z_\tau^{l} = \mathrm{StructuralEncoder}^{l}(A_\tau, Z_\tau^{l-1}), \quad l = 1, \ldots, L, \qquad (2)$$

where $L$ is the number of layers, and the initial embedding matrix comes from the raw node features, i.e., $Z_\tau^{0} = X_\tau$. Then a temporal encoder combines the $L$-th layer embeddings of the $\delta$ historical snapshots $\{Z_\tau^{L}\}_{\tau=t-\delta}^{t-1}$ and generates the current snapshot's node embedding $Z_t$ for downstream tasks:

$$Z_t = \mathrm{TemporalEncoder}(Z_{t-1}^{L}, \ldots, Z_{t-\delta}^{L}). \qquad (3)$$

The TLP task can be regarded as a binary classification of positive and negative edges. For an edge $e_{ij}$ with label $y = (A_t)_{ij} \in \{0, 1\}$, we use the following classification layer to obtain the prediction probability $p_{ij}^{t} \in [0, 1]$:

$$p_{ij}^{t} = \sigma\big(\mathrm{MLP}(z_i^{t} \,\|\, z_j^{t})\big), \qquad (4)$$

where $\sigma$ is the softmax function, MLP is a multi-layer perceptron, and $\|$ represents the concatenation operation. $z_i^{t} = (Z_t)_i \in \mathbb{R}^{d_1}$ is the $d_1$-dimensional embedding of node $v_i$, i.e., the $i$-th row of the embedding matrix $Z_t$. The TLP branch calculates the link prediction loss at time $t$ by the cross-entropy (CE):

$$L_{TLP}^{t} = -K \sum_{e_{ij} \in E_t^{+}} \log p_{ij}^{t} - \sum_{e_{ij} \in E_t^{-}} \log\big(1 - p_{ij}^{t}\big), \qquad (5)$$

where $E_t^{+}$ and $E_t^{-}$ are the sets of positive and negative edges, respectively, and $K$ balances the losses of $E_t^{+}$ and $E_t^{-}$, usually set to the ratio of $|E_t^{+}|$ and $|E_t^{-}|$.
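A minimal PyTorch sketch of this paradigm, with hypothetical module choices (a simple message-passing structural encoder, a GRU temporal encoder, an MLP link scorer with a sigmoid output); it is not the specific encoder of any particular baseline:

```python
import torch
import torch.nn as nn

class TLPBranch(nn.Module):
    """Structural encoder -> temporal encoder -> link predictor (Eqs. 2-4), sketched."""

    def __init__(self, in_dim: int, hid_dim: int, num_layers: int = 2):
        super().__init__()
        self.struct_layers = nn.ModuleList(
            [nn.Linear(in_dim if l == 0 else hid_dim, hid_dim) for l in range(num_layers)]
        )
        self.temporal = nn.GRU(hid_dim, hid_dim, batch_first=True)
        self.scorer = nn.Sequential(nn.Linear(2 * hid_dim, hid_dim), nn.ReLU(), nn.Linear(hid_dim, 1))

    def encode_snapshot(self, adj: torch.Tensor, x: torch.Tensor) -> torch.Tensor:
        z = x
        for layer in self.struct_layers:
            z = torch.relu(adj @ layer(z))           # simple message passing as the structural encoder
        return z                                     # Z_tau^L

    def forward(self, adjs, feats, edge_index):
        # adjs, feats: lists of delta historical snapshots; edge_index: (2, E) node pairs to score
        z_seq = torch.stack([self.encode_snapshot(a, x) for a, x in zip(adjs, feats)], dim=1)
        z_t, _ = self.temporal(z_seq)                # (N, delta, d); the last step gives Z_t
        z_t = z_t[:, -1]
        pair = torch.cat([z_t[edge_index[0]], z_t[edge_index[1]]], dim=-1)
        return torch.sigmoid(self.scorer(pair)).squeeze(-1)   # p_ij in [0, 1]
```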

3.3 Node Metric Disentanglement Branch

As shown in Fig. 1, the proposed node metric disentanglement (NMD) branch includes a structural encoder with the same architecture as the TLP branch,


Fig. 1. Framework of our NMD plugin based on the structural-temporal paradigm. TLP Branch, applicable to any TLP method, predicts the link status at time t based on L t−1 δ snapshots {Aτ , Xτ }t−1 τ =t−δ . NMD Branch generates node-level embedding {Hτ }τ =t−δ m t by predicting temporal node metrics {dτ }τ =t−δ . It disentangles the node representations of different time slots by the decoupling loss LtDIS and integrates beneficial factors into {ZτL }t−1 τ =t−δ by an attention-based fusion module.

multiple MLP predictors, and an attention-based fusion module. It uses multiple self-selectable node metrics as decoupling factors, such as degree centrality, closeness centrality [8], PageRank [24], etc. Formally, given a node metric $m \in \mathcal{M}$, where $\mathcal{M}$ is a set of node metrics selected according to different contexts, we calculate the node metric vector $d_\tau^{m} \in \mathbb{R}^{|V|}$ for each historical snapshot at time $t$ and obtain the set of node metrics $\{d_\tau^{m}\}_{\tau=t-\delta}^{t}$. Then, we perform regression optimization on multiple temporal node metrics to explicitly model metric-aware representations. Similar to the process described in Eq. 2, an $L$-layer structural encoder generates the representation of nodes in each historical snapshot $G_\tau$ at time $t$:

$$H_\tau^{l} = \mathrm{StructEncoder}^{l}(A_\tau, H_\tau^{l-1}), \quad l = 1, \ldots, L, \qquad (6)$$

where $H_\tau^{l}$ is the $l$-th layer embedding matrix, and $H_\tau^{0}$ is initialized with the raw feature matrix $X_\tau$. Our NMD branch tries to predict the node metric $d_{\tau+1}^{m}$ of the next snapshot $G_{\tau+1}$ from each historical embedding $H_\tau^{L}$:

$$\hat{d}_{\tau+1}^{m} = \mathrm{MLP}^{m}(H_\tau^{L}), \quad \tau = t-\delta, \ldots, t, \qquad (7)$$

where $\mathrm{MLP}^{m}$ is a multi-layer perceptron predictor, which projects $H_\tau^{L}$ into the representation spaces of different node metrics. Naturally, we calculate the average MSE (Mean Squared Error) between the ground-truth metric $d_{\tau+1}^{m}$ and the predicted metric $\hat{d}_{\tau+1}^{m}$ over the period $[t-\delta, t]$:

$$L_{NMD}^{t} = \frac{1}{|\mathcal{M}|}\sum_{m \in \mathcal{M}} \frac{1}{\delta} \sum_{\tau=t-\delta}^{t-1} \mathrm{MSE}\big(d_{\tau+1}^{m}, \hat{d}_{\tau+1}^{m}\big), \qquad (8)$$


where $L_{NMD}^{t}$ is our NMD branch loss at time $t$, $\mathcal{M}$ is the set of optional node metric types, and $\delta$ is the number of historical snapshots. The above prediction strategy fully utilizes the information of the multiple historical snapshots, encouraging each historical embedding to establish a non-linear relationship with the node metric of the following snapshot. Through this regression optimization, our NMD branch builds a fine-grained metric-aware embedding $\{H_\tau^{L}\}_{\tau=t-\delta}^{t-1}$ that can represent multiple metric factors of different periods.
The TLP branch's embedding is mixed with many complex factors, including node metrics. Considering that node metrics may mislead the link prediction results, we devise a decoupling loss to disentangle them from $\{Z_\tau^{L}\}_{\tau=t-\delta}^{t-1}$. It can use statistical measures such as distance correlation or mutual information as a regularizer to encourage specialization in the representation space. Distance correlation can characterize the independence of any two paired vectors in their linear and nonlinear relationships. Here we calculate the decoupling loss $L_{DIS}^{t}$ by the commonly-used cosine similarity:

$$L_{DIS}^{t} = \sum_{\tau=t-\delta}^{t-1} \sum_{i=1}^{N} \frac{z_i^{\tau} \cdot h_i^{\tau}}{\|z_i^{\tau}\| \, \|h_i^{\tau}\|} \qquad (9)$$

where $z_i^{\tau} = (Z_\tau)_i \in \mathbb{R}^{d_1}$, $h_i^{\tau} = (H_\tau)_i \in \mathbb{R}^{d_1}$, and $\|\cdot\|$ is the vector norm. Under the guidance of $L_{DIS}^{t}$, the representations $\{Z_\tau^{L}\}_{\tau=t-\delta}^{t-1}$ and $\{H_\tau^{L}\}_{\tau=t-\delta}^{t-1}$ are deconstructed and refined: the former focuses more on the personalized aggregation information of a node's multi-hop neighbors, while the latter pays more attention to the global impact of the node itself. If node-level and edge-level factors are highly intertwined without any processing, the inconsistency between the TLP task and the metric regression task will lead to a seesaw phenomenon [30], where negatively correlated information degrades the representation ability and thereby damages performance. In short, the TLP branch and the NMD branch are devoted to generating representations that share as little mutual information as possible, achieving the disentanglement of the two levels of information.
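A compact sketch of the two NMD-branch losses (Eqs. 8 and 9), assuming the embeddings and ground-truth metric vectors are already stacked into tensors with the hypothetical shapes noted in the comments:

```python
import torch
import torch.nn.functional as F

def nmd_loss(pred_metrics: torch.Tensor, true_metrics: torch.Tensor) -> torch.Tensor:
    """Eq. (8): average MSE over |M| metrics and delta snapshots.
    pred_metrics, true_metrics: (|M|, delta, N) predicted and ground-truth metric values."""
    return F.mse_loss(pred_metrics, true_metrics, reduction="mean")

def decoupling_loss(z: torch.Tensor, h: torch.Tensor) -> torch.Tensor:
    """Eq. (9): cosine similarity between TLP embeddings Z and metric-aware embeddings H.
    z, h: (delta, N, d1) per-snapshot node embeddings from the two branches."""
    return F.cosine_similarity(z, h, dim=-1).sum()
```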

3.4 Attention-Based Fusion Module

In this section, we integrate the beneficial information of the metric-aware representation $\{H_\tau^{L}\}_{\tau=t-\delta}^{t-1}$ into the TLP embedding $\{Z_\tau^{L}\}_{\tau=t-\delta}^{t-1}$, as the latter only retains non-metric information after disentangling. We employ the attention mechanism to learn the fusion mode between $\{Z_\tau^{L}\}_{\tau=t-\delta}^{t-1}$ and $\{H_\tau^{L}\}_{\tau=t-\delta}^{t-1}$, which can discard the damaging information and capture the meaningful information. For each historical snapshot $G_\tau$ at time $t$, we first learn the attention vectors $a^{Z}, a^{H} \in \mathbb{R}^{N \times 1}$ of the $N$ nodes:

$$a^{Z} = \tanh(Z_\tau^{L} W)\,q, \qquad a^{H} = \tanh(H_\tau^{L} W)\,q, \qquad (10)$$

where $\tanh$ is an activation function, and $W \in \mathbb{R}^{d_1 \times d_2}$ and $q \in \mathbb{R}^{d_2 \times 1}$ are learnable weights. For node $v_i \in V$, we normalize its two attention coefficients $a_i^{Z}, a_i^{H}$ into a 2-dimensional probability $p_i \in \mathbb{R}^{2}$ by the softmax function:

$$p_i = \mathrm{softmax}\big([a_i^{Z}, a_i^{H}]\big), \quad i = 1, \ldots, N, \qquad (11)$$

Then we linearly combine $v_i$'s two embeddings, $z_i^{\tau} = (Z_\tau)_i \in \mathbb{R}^{d_1}$ and $h_i^{\tau} = (H_\tau)_i \in \mathbb{R}^{d_1}$, with the probability $p_i = [p_{i0}, p_{i1}]$:

$$\tilde{z}_i^{\tau} = p_{i0}\, z_i^{\tau} + p_{i1}\, h_i^{\tau}, \quad i = 1, \ldots, N, \qquad (12)$$

where $\tilde{z}_i^{\tau}$ is $v_i$'s embedding after fusion, which Eq. 4 uses to predict the probability of temporal links. Finally, our method collaboratively infers temporal links and node metrics and jointly optimizes the loss functions of the two branches to simulate network evolution in an interdependent manner. The total loss is summarized as follows:

$$L = \sum_{t=1+\delta}^{T} L_{TLP}^{t} + \alpha L_{NMD}^{t} + \beta L_{DIS}^{t}, \qquad (13)$$

where $\delta$ is the number of snapshots, $L_{TLP}^{t}$ and $L_{NMD}^{t}$ are the TLP and NMD branch losses, $L_{DIS}^{t}$ is the decoupling loss, and $\alpha$ and $\beta$ are trade-off weights.
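A small sketch of the attention-based fusion step (Eqs. 10-12), with hypothetical tensor shapes and parameter names:

```python
import torch
import torch.nn as nn

class AttentionFusion(nn.Module):
    """Per-node soft selection between the TLP embedding Z and the metric-aware embedding H."""

    def __init__(self, d1: int, d2: int):
        super().__init__()
        self.W = nn.Parameter(torch.randn(d1, d2) * 0.01)
        self.q = nn.Parameter(torch.randn(d2, 1) * 0.01)

    def forward(self, z: torch.Tensor, h: torch.Tensor) -> torch.Tensor:
        # z, h: (N, d1) node embeddings of one snapshot from the two branches
        a_z = torch.tanh(z @ self.W) @ self.q                    # (N, 1) attention score for Z
        a_h = torch.tanh(h @ self.W) @ self.q                    # (N, 1) attention score for H
        p = torch.softmax(torch.cat([a_z, a_h], dim=1), dim=1)   # (N, 2) mixing weights
        return p[:, :1] * z + p[:, 1:] * h                       # fused embedding for the link predictor
```

Sharing $W$ and $q$ between the two branches, as in Eq. 10, keeps the two attention scores comparable before the softmax normalization.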

4 Experiment

4.1 Experimental Setup

Datasets and Baselines. We conducted experiments on five dynamic graphs, including UCI [23], BC-Alpha [16,17], BC-OTC [16,17], DBLP [31], and APS (https://journals.aps.org/datasets). Aiming to evaluate our NMD plugin's broad applicability, the compared baselines include three different types of TLP methods: one RNN-based method (Dyngraph2Vec [9]), two GCN-based methods (GCN-GRU [29], EvolveGCN [25]), and three attention-based methods (STGSN [22], DySAT [28], HTGN [35]).
Training Details. The commonly-used deep learning framework PyTorch [26] is adopted to implement all our experiments. We split the datasets into training, validation, and testing sets with ratios 7:1:2 chronologically. Following the general setting of link prediction tasks, we randomly select 100 negative samples for each edge in each snapshot during data loading. The model architectures are fixed: the structural encoder of every method has 2 layers, and the hidden layer dimension is 128. For a fair comparison, the downstream classifier for all methods is a trainable two-layer perceptron. The Adam optimizer [11] and an early-stopping strategy are adopted for model training.

4.2 Overall Performance

Table 1 shows the performance comparison and improvement of the original TLP methods and our method on five dynamic graphs. The default node metric in the experiments is degree centrality.

Table 1. AUC (%) performance of the original temporal link prediction methods (Base) and our method (Ours) on five dynamic graphs. The Gain (%) row reports the mean gain percentage of the six baselines. The reported results are the average scores on all timestamps of the testing set.

Case        UCI                        BC-Alpha                   BC-OTC                     DBLP                       APS
            Base        Ours           Base        Ours           Base        Ours           Base        Ours           Base        Ours
Dyn2Vec     83.62±1.03  89.08±1.12     83.18±1.00  87.56±1.01     75.93±1.01  84.24±1.06     72.31±1.22  74.75±1.26     71.71±0.18  76.24±0.21
GCN-GRU     86.73±1.10  87.55±1.11     85.43±1.26  91.57±1.43     76.06±1.10  85.06±1.41     73.33±1.02  74.73±1.02     73.17±0.14  78.61±0.17
EvolveGCN   87.09±1.18  89.95±1.12     82.37±1.29  84.82±1.46     81.05±1.34  82.65±1.29     75.06±0.10  77.02±0.10     72.35±0.12  77.25±0.13
STGSN       88.04±1.06  90.76±1.02     80.48±1.03  83.79±1.03     78.85±1.09  81.92±1.13     78.39±0.09  79.62±0.08     71.14±0.09  77.91±0.11
DySAT       87.51±0.04  89.61±1.14     82.19±1.16  85.96±1.33     78.24±1.15  83.19±1.25     73.89±1.08  75.60±0.21     72.14±0.14  78.77±0.24
HTGN        88.60±0.15  91.36±1.16     90.84±1.18  94.31±1.32     80.97±1.17  84.94±1.24     73.44±0.12  78.05±0.26     73.62±0.19  79.97±0.29
Gain (%)    +2.79                      +3.92                      +5.15                      +2.22                      +5.77

Our NMD plugin achieves significant and consistent improvements over the six baselines, with mean gains of 2.79%, 3.92%, 5.15%, 2.22%, and 5.77% on the five datasets, respectively. Part of the reason for our success is a more fine-grained embedding representation than the baselines, which fully exploits the advantages of node metric factors through the decoupling loss and the attention-based fusion module. It reveals that our work is conducive to promoting the development of link prediction and recommender systems.

Table 2. AUC(%) scores and performance improvement of three loss terms and three node metrics on UCI and BC-Alpha, where DG, CN, and PG are abbreviations for degree centrality, closeness centrality, and PageRank, respectively.

Setting                              UCI                                               BC-Alpha
                                     Dyn2Vec         GCN-GRU         DySAT             Dyn2Vec         GCN-GRU         DySAT
L_TLP only                           83.62           86.73           87.51             83.18           85.43           82.19
L_TLP + L_NMD (DG)                   86.46 (+2.84)   83.34 (−3.39)   88.38 (+0.87)     86.66 (+3.48)   89.57 (+4.14)   83.43 (+1.24)
L_TLP + L_NMD + L_DIS (DG)           89.08 (+5.46)   87.55 (+0.82)   89.61 (+2.10)     87.56 (+4.38)   91.57 (+6.14)   85.96 (+3.77)
All losses, other metric settings    88.65 (+5.03)   87.36 (+0.63)   89.44 (+1.93)     89.03 (+5.85)   92.05 (+6.62)   83.79 (+1.60)
                                     89.48 (+5.86)   88.32 (+1.59)   90.76 (+3.25)     84.68 (+1.50)   90.47 (+5.04)   85.56 (+3.37)
                                     90.65 (+7.03)   88.23 (+1.50)   91.41 (+3.90)     89.13 (+5.95)   93.79 (+8.36)   87.98 (+5.79)
                                     90.08 (+6.46)   89.86 (+3.13)   92.14 (+4.63)     89.71 (+6.53)   93.73 (+8.30)   89.06 (+6.87)
                                     90.16 (+6.45)   90.06 (+3.33)   93.33 (+5.82)     90.29 (+7.11)   94.18 (+8.75)   89.57 (+7.38)

4.3 Ablation Study

To intuitively understand the effect of each part of our NMD plugin, we implement an ablation experiment covering the three loss terms and three kinds of node metrics. The former includes $L_{TLP}^{t}$, $L_{NMD}^{t}$, and $L_{DIS}^{t}$; the latter includes degree centrality (DG), closeness centrality (CN), and PageRank (PG). Table 2 shows the AUC scores and performance improvement under different experimental settings on the UCI and BC-Alpha datasets.
Using only $L_{NMD}^{t}$ without $L_{DIS}^{t}$ may lead to performance degradation, such as a decrease of 3.39% for the GCN-GRU baseline on the UCI dataset. It indicates a seesaw phenomenon [30] between the TLP and NMD tasks, i.e., a negative correlation between the edge-level and node-level tasks, affecting the quality of the fused embedding. Hence we decouple the global properties of the nodes from the edge-level embedding of the TLP branch by the decoupling loss $L_{DIS}^{t}$, which brings a 0.82% improvement for the GCN-GRU baseline on the UCI dataset. It emphasizes the importance of disentangling node metric factors. In addition, the performance of different node metrics is very similar due to the general consistency of node attribute information in graph data. From the results, we find that our plugin achieves greater relative improvement as the number of node metric factors increases. We conjecture that disentangling more node metrics can obtain a more fine-grained embedding with better representation ability.

Fig. 2. AUC comparison over the degree centrality of node groups on UCI and BC-Alpha datasets, where the background histograms indicate the number of nodes (#Nodes) involved in each group. The dotted line and the solid line represent the TLP baselines and our improved methods, respectively.

4.4 Performance on Different Node Groups

To carefully explore the effectiveness of our plugin on nodes with different metric values, we divide the node set into five groups according to the degree centrality of the nodes. Figure 2 displays the AUC performance on the five node groups with different degree levels on the UCI and BC-Alpha datasets. For the two GCN-based methods (GCN-GRU and EvolveGCN), the performance improvement for medium- to high-degree nodes is more significant than for those with low degree, whereas for the two attention-based methods (STGSN and DySAT), the performance of all five groups of nodes improves significantly. One possible reason is that the attention mechanism is better at capturing the correlation between node embeddings and metrics in the historical link states, promoting the generation of high-quality representations. In short, our plugin consistently promotes the representation learning of node groups across different metric distribution ranges.


Fig. 3. Parameter sensitivity of six baselines based on our plugin. Methods are indexed in the following order: Dyn2vec, GCN-GRU, EvolveGCN, STGSN, DySAT, and HTGN.

4.5 Parameter Sensitivity

Our parameter sensitivity experiment covers the number of historical snapshots (#Snapshots) and the trade-off weights α and β. The default settings of the three hyper-parameters are 5, 0.50, and 1e-4, respectively. Figure 3 shows the performance comparison of different values over the six baselines with our plugin on the UCI dataset. Firstly, most methods benefit from more historical snapshots because they provide more temporal information. Secondly, the trade-off weight α, defined in Eq. 13, controls the relative importance of the edge-level TLP loss and the node-level NMD loss in the dual-branch framework. Performance is better with higher α, which shows that the NMD plugin achieves the expected effect, namely learning fine-grained node embeddings. Finally, the performance for different values of β is similar, which reflects that the decoupling loss can steadily improve the diversity and specialization of the embedding representations.

5 Conclusion

In this paper, we point out the benefits and harms of node metric factors in temporal link prediction representations. The proposed NMD plugin disentangles node metrics from the node embeddings generated by most TLP methods. We further devise an attention-based module to explore fine-grained and high-quality embedding representations. In the future, we will extend our auxiliary components to predict continuous-valued timestamped links and explore the problem of mutual promotion and restriction between node-level and edge-level tasks.

Acknowledgements. This work is funded by the National Key Research and Development Project (Grant No: 2022YFB2703100), the Starry Night Science Fund of Zhejiang University Shanghai Institute for Advanced Study (Grant No. SN-ZJU-SIAS-001), and the Fundamental Research Funds for the Central Universities (2021FZZX001-23, 226-2023-00048).


References 1. Adamic, L.A., Adar, E.: Friends and neighbors on the web. Soc. Netw. 25(3), 211–230 (2003) 2. Ahmad, I., Akhtar, M.U., Noor, S., Shahnaz, A.: Missing link prediction using common neighbor and centrality based parameterized algorithm. Sci. Rep. 10(1), 1–9 (2020) 3. Barabási, A.L., Albert, R.: Emergence of scaling in random networks. Science 286(5439), 509–512 (1999) 4. Barros, C.D., Mendonça, M.R., Vieira, A.B., Ziviani, A.: A survey on embedding dynamic graphs. ACM Comput. Surv. (CSUR) 55(1), 1–37 (2021) 5. Bedi, P., Sharma, C.: Community detection in social networks. Wiley Interdiscip. Rev. Data Mining Knowl. Discov. 6(3), 115–135 (2016) 6. Divakaran, A., Mohan, A.: Temporal link prediction: a survey. N. Gener. Comput. 38, 213–258 (2020) 7. Dunlavy, D.M., Kolda, T.G., Acar, E.: Temporal link prediction using matrix and tensor factorizations. ACM Trans. Knowl. Discov. Data (TKDD) 5(2), 1–27 (2011) 8. Freeman, L.C.: Centrality in social networks conceptual clarification. Soc. Netw. 1(3), 215–239 (1978) 9. Goyal, P., Chhetri, S.R., Canedo, A.M.: Capturing network dynamics using dynamic graph representation learning. US Patent App. 16/550,771 (2020) 10. Katz, L.: A new status index derived from sociometric analysis. Psychometrika 18(1), 39–43 (1953) 11. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014) 12. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016) 13. Kitsak, M., et al.: Identification of influential spreaders in complex networks. Nat. Phys. 6(11), 888–893 (2010) 14. Klemm, K., Serrano, M., Eguíluz, V.M., Miguel, M.S.: A measure of individual role in collective dynamics. Sci. Rep. 2(1), 1–8 (2012) 15. Kong, X., Shi, Y., Yu, S., Liu, J., Xia, F.: Academic social networks: modeling, analysis, mining and applications. J. Netw. Comput. Appl. 132, 86–103 (2019) 16. Kumar, S., Hooi, B., Makhija, D., Kumar, M., Faloutsos, C., Subrahmanian, V.: Rev2: fraudulent user prediction in rating platforms. In: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, pp. 333–341. ACM (2018) 17. Kumar, S., Spezzano, F., Subrahmanian, V., Faloutsos, C.: Edge weight prediction in weighted signed networks. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), pp. 221–230. IEEE (2016) 18. Lin, D., Wu, J., Xuan, Q., Chi, K.T.: Ethereum transaction tracking: inferring evolution of transaction networks via link prediction. Phys. A 600, 127504 (2022) 19. Lorrain, F., White, H.C.: Structural equivalence of individuals in social networks. J. Math. Sociol. 1(1), 49–80 (1971) 20. Ma, J., Cui, P., Kuang, K., Wang, X., Zhu, W.: Disentangled graph convolutional networks. In: International Conference on Machine Learning, pp. 4212–4221. PMLR (2019) 21. Ma, J., Zhou, C., Cui, P., Yang, H., Zhu, W.: Learning disentangled representations for recommendation. In: Advances in Neural Information Processing Systems, vol. 32 (2019)


22. Min, S., Gao, Z., Peng, J., Wang, L., Qin, K., Fang, B.: Stgsn-a spatial-temporal graph neural network framework for time-evolving social networks. Knowl.-Based Syst. 214, 106746 (2021) 23. Opsahl, T., Panzarasa, P.: Clustering in weighted networks. Soc. Netw. 31(2), 155–163 (2009) 24. Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bring order to the web. Technical report, Stanford University (1998) 25. Pareja, A., et al.: Evolvegcn: evolving graph convolutional networks for dynamic graphs. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 5363–5370 (2020) 26. Paszke, A., et al.: Pytorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems, vol. 32 (2019) 27. Qin, M., Yeung, D.Y.: Temporal link prediction: a unified framework, taxonomy, and review. arXiv preprint arXiv:2210.08765 (2022) 28. Sankar, A., Wu, Y., Gou, L., Zhang, W., Yang, H.: Dysat: deep neural representation learning on dynamic graphs via self-attention networks. In: Proceedings of the 13th International Conference on Web Search and Data Mining, pp. 519–527 (2020) 29. Seo, Y., Defferrard, M., Vandergheynst, P., Bresson, X.: Structured sequence modeling with graph convolutional recurrent networks. In: Cheng, L., Leung, A.C.S., Ozawa, S. (eds.) ICONIP 2018. LNCS, vol. 11301, pp. 362–373. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04167-0_33 30. Tang, H., Liu, J., Zhao, M., Gong, X.: Progressive layered extraction (PLE): a novel multi-task learning (MTL) model for personalized recommendations. In: Proceedings of the 14th ACM Conference on Recommender Systems, pp. 269–278 (2020) 31. Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L., Su, Z.: Arnetminer: extraction and mining of academic social networks. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 990–998 (2008) 32. Travençolo, B.A.N., Costa, L.D.F.: Accessibility in complex networks. Phys. Lett. A 373(1), 89–95 (2008) 33. Wang, X., Jin, H., Zhang, A., He, X., Xu, T., Chua, T.S.: Disentangled graph collaborative filtering. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1001–1010 (2020) 34. Wu, J., et al.: Disenkgat: knowledge graph embedding with disentangled graph attention network. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management, pp. 2140–2149 (2021) 35. Yang, M., Zhou, M., Kalander, M., Huang, Z., King, I.: Discrete-time temporal network embedding via implicit hierarchical learning in hyperbolic space. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 1975–1985 (2021) 36. Yu, W., Aggarwal, C.C., Wang, W.: Temporally factorized network modeling for evolutionary network analysis. In: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, pp. 455–464 (2017) 37. Zhou, T., Lü, L., Zhang, Y.C.: Predicting missing links via local information. Eur. Phys. J. B 71, 623–630 (2009)

Action Prediction for Cooperative Exploration in Multi-agent Reinforcement Learning

Yanqiang Zhang, Dawei Feng(B), and Bo Ding

National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, Changsha 410073, China
{zhangyq1119,dingbo}@nudt.edu.cn, [email protected]

Abstract. Multi-agent reinforcement learning methods have shown significant progress; however, they continue to exhibit exploration problems in complex and challenging environments. Although current research has introduced several exploration-enhanced methods for multi-agent reinforcement learning to address this issue, these methods still suffer from inefficient exploration and low performance on challenging tasks that necessitate complex cooperation among agents. This paper proposes the prediction-action Qmix (PQmix) method, an action prediction-based multi-agent intrinsic reward construction approach. The PQmix method employs the joint local observation of agents and the next joint local observation after executing actions to predict the real joint action of the agents. The method uses the action prediction error as the intrinsic reward to measure the novelty of the joint state and encourages agents to actively explore the action and state spaces of the environment. We compare PQmix with strong baselines on a MARL benchmark to validate it. The experimental results demonstrate that PQmix outperforms state-of-the-art algorithms on the StarCraft Multi-Agent Challenge (SMAC). Finally, the stability of the method is verified by experiments.

Keywords: Multi-agent Systems · Reinforcement Learning · Intrinsic Reward

1 Introduction

Cooperative multi-agent reinforcement learning (MARL) is required to solve many real-world control problems [2,15,20]. MARL faces two main challenges: scalability, which poses multiple difficulties for multi-agent algorithms as the number of agents increases; and local observation, where communication constraints make it necessary for agents to execute policies exclusively based on their individual observations. Existing algorithms tackle the above challenges through methods such as parameter sharing and centralized training with distributed execution (CTDE).


Based on the above methods, researchers have developed various classical MARL algorithms, including the policy-based MADDPG [11] and the value function-based VDN [23], Qmix [18], QTRAN [21], and QPLEX [25]. These approaches have shown progress and effective results in multi-agent tasks [10,11]. Although MARL has been successful in numerous tasks, the above methods often employ ε-greedy exploration of the environment and cannot efficiently solve complex and difficult cooperative tasks [21,25]. Researchers have conducted several studies in the field of multi-agent exploration and have proposed some classical exploration methods. For example, EDTI [27] uses an information gain approach to quantify the mutual influence among agents in order to encourage agents to explore the action and state spaces; even though the method shows good results, it requires calculating the interactions among agents, making it less scalable as the number of agents increases. Another approach, MAVEN [12], introduces a hierarchically controlled latent space, mixes value function-based and policy-based approaches, and lets the hierarchical policy control the latent variables that determine the actions of the value function-based agents; this approach is not efficient in complex tasks with large joint state spaces. In addition, LIIR [4] parameterizes an intrinsic reward function for each agent to guide diverse agent behavior so as to fully explore the environment, but it is validated on simple tasks and does not work well in more complex task environments. A relatively novel and effective multi-agent exploration method is RODE [26], in which each agent makes decisions by first choosing a role and then selecting a policy based on that role to improve the exploration ability of the agents. Despite the effectiveness of current multi-agent exploration methods, there is still the problem of poor performance in complex and difficult cooperative tasks.
To address the above problem, based on the classical Qmix method, this paper proposes an intrinsic reward method based on action prediction, the PQmix method, to improve the ability of agents to handle complex cooperative tasks. The method includes a prediction model that takes the joint local observation and the next joint local observation of the agents as input and outputs the predicted joint action. The method uses the prediction error between the predicted joint action and the actual joint action as the intrinsic reward to encourage agents to explore the environment. The PQmix algorithm offers several advantages over previous approaches. It is scalable, with the number of agents having little impact on its computational resource consumption and performance. In addition, the algorithm actively encourages agents to explore the action and state spaces in environments whose extrinsic rewards alone may not lead to optimal performance. The algorithm's curiosity module, based on action prediction, evaluates the novelty of the joint state and adds an intrinsic reward for novel regions of the action and state spaces, thus enabling agents to discover better cooperative strategies and maximize their cumulative rewards.
In this paper, we compare the PQmix algorithm with other classical and effective methods on several complex cooperative tasks in the SMAC environment [19]. Further, we visualize the convergence and final performance of the


PQmix method and verify the significant effectiveness and advantages of the method through experimental results. Additionally, this paper experimentally verifies the algorithm’s robustness under various circumstances by testing different parameter combinations.

2 Related Work

In the field of single-agent reinforcement learning, exploration methods have been well investigated in current research. A summary of such methods has been provided in the paper [7]. For example, as opposed to using count-based methods to assess the novelty of states [1], there are some studies that have attempted to use pseudo-count methods [24] to calculate intrinsic rewards to assess the novelty of states, and these methods have better relative to the former assessment ability; in addition, there are methods that use the prediction error of the prediction network as the intrinsic reward such as the ICM algorithm [16], which learns a forward model and a backward model, the former model serves to take the current state and the action of the agent to predict the next state and use that prediction error as the intrinsic reward, and the latter model serves to reduce the environmental random factors on the prediction. In addition to these, there are other methods for the exploration problem, such as feature control [3], and decision states [5], Bayesian networks [6], which uses the reduction of the uncertainty of the Bayesian network weights as the intrinsic reward to explore the environment. Although exploration approaches have been very successful in the singleagent domain, multi-agent reinforcement learning exploration approaches are still under development. The LIIR approach [4] updates and guides the agents’ algorithms by learning intra-personal rewards to improve the agents’ exploration capabilities. Jaques [9] uses the interactions among agents as intrinsic rewards to encourage agents to perform behaviors that can influence other agents. The paper [27] proposed the EDTI and EITI algorithms, which use mutual information (MI) to measure the influence among agents on the transfer function and reward constructs respectively, to encourage the agents to fully explore the environment. An effective method RODE [26] improves agent exploration by giving roles to agents in order to assign specific strategies. These methods have achieved better results in complex tasks, but the exploration efficiency and performance of the above methods are reduced when faced with complex cooperative tasks, which is still one of the serious challenges of multi-agent exploration approaches. The PQmix method is based on the ICM method for the training of the inverse model and proposes a novel exploration in the MARL domain method as a way to improve the exploration ability of agents.

3 Notation

First, we model the multi-agent cooperation task as a Dec-POMDP [13], defined as the tuple $G = \langle N, S, A, P, R, \Omega, O, n, \gamma \rangle$, where $N$ is the set of $n$ agents, $S$ is the joint state space of the environment, and $A$ is the joint action space. $A_i$ is the action space of agent $i$, and $A := A_1 \times \cdots \times A_n$. $P$ denotes the state transition probability from a state $s \in S$ to any state $s'$ under the joint action $a$, written $P : S \times A \times S \to [0, 1]$. $R$ defines the team reward received on the transition from $(s, a)$ to $s'$, written $R : S \times A \times S \to \mathbb{R}$. $\gamma \in [0, 1)$ is the discount factor. The Dec-POMDP is partially observable, i.e., at each time step agent $i \in N$ only receives an observation $o_i \in \Omega$ drawn from the observation function $O(s, i)$; the joint local observation is defined as $o \equiv [o_i]_{i=1}^{n}$. In addition, each agent has an action-observation history $\tau_i \in T \equiv (\Omega \times A)^{*} \times \Omega$ and constructs its individual policy so as to jointly maximize the team reward. Each agent chooses an action $a_i \in A_i$ to form a joint action $a \equiv [a_i]_{i=1}^{n} \in A$ that interacts with the environment, generating the team reward $r = R(s, a)$ and the next state $s'$ according to the state transition probability $P(s' \mid s, a)$. The objective is to find a joint policy $\pi$ that maximizes the joint value function $V^{\pi}(s) = \mathbb{E}\big[\sum_{t=0}^{\infty} \gamma^{t} r_t \mid s_0 = s, \pi\big]$, or equivalently the joint action-value function $Q^{\pi}(s, a) = r(s, a) + \gamma \, \mathbb{E}_{s'}[V^{\pi}(s')]$.

The centralized training with decentralized execution (CTDE) paradigm is widely used in multi-agent deep reinforcement learning [14]: the agents' policies are trained centrally with access to global information, while during sampling each agent selects actions based only on its local observation history and its own policy. During training, the goal is to learn the optimal joint action-value function $Q_{tot}(s, a) = r(s, a) + \gamma \, \mathbb{E}_{s'}[\max_{a'} Q_{tot}(s', a')]$. Due to partial observability, this paper uses $Q_{tot}(\tau, a; \theta)$ instead of $Q_{tot}(s, a; \theta)$, where $\tau \in T$. The Q-value network is then trained to minimize the following expected TD error:

$L(\theta) = \mathbb{E}_{(\tau, a, r, \tau') \sim D}\Big[\big(r + \gamma V(\tau'; \theta^{-}) - Q_{tot}(\tau, a; \theta)\big)^{2}\Big]$,   (1)

where $D$ is the replay buffer and $\theta^{-}$ denotes the parameters of the target network; $\theta$ are the parameters of the training network that samples interactively from the environment, and the target-network parameters are periodically copied from the training network. $V(\tau'; \theta^{-})$ is the expected future return used in the TD target. Each individual agent has access only to its local action and observation history, so the joint Q-value function has to be inferred from the individual Q-value functions $Q_i(\tau_i, a_i)$; the relationship between the individual Q-value functions $Q_i(\tau_i, a_i)$ and the joint Q-value function $Q_{tot}$ is given by a factorization structure [18,23].
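For concreteness, the TD objective in Eq. (1) can be written compactly in code. The following PyTorch-style sketch is illustrative only and is not the authors' implementation; `qtot_net`, `qtot_target`, and the batch keys are hypothetical names, and treating $Q_{tot}$ as a module that outputs one value per joint action is a simplification of a mixing-network architecture.

```python
import torch
import torch.nn.functional as F

def qtot_td_loss(qtot_net, qtot_target, batch, gamma=0.99):
    """Sketch of the expected TD error of Eq. (1) over a replay batch.

    batch["tau"], batch["next_tau"]: joint histories; batch["actions"]: (B, 1)
    indices of the taken joint actions; batch["reward"]: (B,) team rewards.
    """
    # Q_tot(tau, a; theta) for the joint actions actually taken
    q_taken = qtot_net(batch["tau"]).gather(-1, batch["actions"]).squeeze(-1)

    with torch.no_grad():
        # V(tau'; theta^-) realised as max_a' Q_tot(tau', a'; theta^-)
        v_next = qtot_target(batch["next_tau"]).max(dim=-1).values
        target = batch["reward"] + gamma * v_next

    return F.mse_loss(q_taken, target)
```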

4 Algorithm

This section describes the framework of the algorithm, the intrinsic reward, and the implementation of PQmix. The algorithm uses the prediction error of joint actions as an intrinsic reward to assess the novelty of the joint state, thereby guiding agents to explore the environment adequately. After fully exploring the environment and collecting rich trajectory information, the algorithm facilitates the learning of states and actions with high expected team reward.

Fig. 1. The algorithm architecture of PQmix.

4.1 Framework of PQmix and Intrinsic Reward

Figure 1(a) shows the value function module, an example of CTDE (centralized training with decentralized execution). In this approach, local agents execute policies based on their own Q-value functions, which take locally observed trajectories as input and are updated by a centralized module with global training information. The value function decomposition approach is chosen as the basis of this algorithm due to its higher robustness and effectiveness in our experiments. This section explores how to measure the novelty of the joint state of the agents in order to encourage full exploration of the environment. The current joint local observation $(o_1, \cdots, o_n)$ and the next joint local observation $(o_1', \cdots, o_n')$ of the agents are connected by the joint action $(a_1, \cdots, a_n)$ taken by the agents. The ICM method models the environment dynamics by predicting the action between two states. Based on these concepts, this paper uses two consecutive joint local observations to predict the joint action that led to the change in the agents' local observations. The environmental dynamics of the multi-agent task are learned from the error of this prediction, and the novelty of the joint state is evaluated accordingly.

Figure 1 depicts the PQmix curiosity module combined with a value function-based decomposition approach. The curiosity module comprises a predictor model that evaluates the novelty of the joint state and a distance function, and it is trained with samples from the replay buffer. The prediction model takes the current joint local observation $o$, the next joint local observation $o'$, and the real joint action $a$ as inputs, and the predictor is updated by the error between the predicted joint action $\hat{a} = [\hat{a}_i]_{i=1}^{n}$ and the actual joint action. The distance function computes the distance between $\hat{a}$ and $a$, such as $L_2$ or $L_1$. In this paper, the model is trained by minimizing the mean squared error (MSE) of this distance:

$L_{\phi} = \frac{1}{N} \sum_{j=1}^{N} \frac{1}{n} \sum_{i=1}^{n} \big(\hat{a}_{j,i} - a_{j,i}\big)^{2}$,   (2)

where $N$ is the number of samples drawn from the replay buffer and $\phi$ are the parameters of the predictor model. The intrinsic reward is defined as

$r_{t}^{int} = \frac{1}{n} \sum_{i=1}^{n} \big(\hat{a}_{t,i} - a_{t,i}\big)^{2}$.   (3)

The intrinsic reward $r^{int}$ is combined with the extrinsic reward $r^{ext}$ to form the combined reward $r_t = r_{t}^{ext} + c \, r_{t}^{int}$, where $c$ is a linear annealing parameter that gradually decreases over the training period. This encourages extensive exploration early in training, while reducing the encouragement for environmental exploration as training progresses so as to maximize the agents' cumulative reward. The independence of the curiosity module provides an added advantage: the architecture of the PQmix method can be applied to many multi-agent algorithms that use CTDE. Specifically, the generic function $f$ in Fig. 1 can represent specific value decomposition structures such as VDN [23], Qmix [18], and QPLEX [25]; this paper uses Qmix as the value function decomposition method. The curiosity module, based on action prediction, allows for diverse and effective exploration of the environment by introducing the curiosity-driven prediction error into a general multi-agent reinforcement learning (MARL) algorithm.
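The curiosity module of Eqs. (2)–(3) and the annealed reward combination can be sketched as follows. This is a minimal illustration rather than the authors' code: the predictor architecture, the one-hot encoding of joint actions, and the averaging over action dimensions are assumptions made for the sketch.

```python
import torch
import torch.nn as nn

class JointActionPredictor(nn.Module):
    """Predicts the joint action from two consecutive joint local observations."""

    def __init__(self, obs_dim, n_agents, n_actions, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2 * n_agents * obs_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, n_agents * n_actions),
        )
        self.n_agents, self.n_actions = n_agents, n_actions

    def forward(self, joint_obs, joint_obs_next):
        x = torch.cat([joint_obs, joint_obs_next], dim=-1)
        return self.net(x).view(-1, self.n_agents, self.n_actions)

def curiosity_terms(pred_model, joint_obs, joint_obs_next, joint_actions_onehot):
    """Returns the predictor loss L_phi (Eq. 2) and the per-sample intrinsic
    reward r^int (Eq. 3), both based on the squared prediction error."""
    pred = pred_model(joint_obs, joint_obs_next)             # (B, n, |A_i|)
    sq_err = (pred - joint_actions_onehot).pow(2).mean(-1)   # per-agent error
    r_int = sq_err.mean(-1)                                  # Eq. (3), per sample
    loss_phi = r_int.mean()                                  # Eq. (2), batch mean
    return loss_phi, r_int.detach()

def annealed_c(step, c0, total_steps):
    # Linear annealing of the intrinsic-reward weight c used in r = r_ext + c * r_int.
    return c0 * max(0.0, 1.0 - step / total_steps)
```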

4.2 Implementation

The PQmix method is built on top of the classical Qmix algorithm: the connection between the individual Q-values and the global Q-value is established by a mixing network, and the loss function of $Q_{tot}$ is

$L(\theta) = \sum_{i=1}^{n} \big(y_{i}^{tot} - Q_{tot}(\tau, a, s; \theta)\big)^{2}$,   (4)

where $\theta$ are the parameters of $Q_{tot}$, $y^{tot} = r + \gamma \max_{a'} Q_{tot}(\tau', a', s'; \theta^{-})$, $\theta^{-}$ are the parameters of the target Q network, $\gamma$ is the discount factor, and $r$ is the sum of the extrinsic and intrinsic rewards, controlled by linear annealing. The overall loss is

$L = L(\theta) + b \cdot L(\phi)$,   (5)


where $b$ is a hyper-parameter that controls the update magnitude of the predictor model; different combinations of the parameters $b$ and $c$ are set depending on the environmental task. The PQmix method uses Adam as the optimizer during training:

$\theta, \phi = \mathrm{Adam}(\theta, \phi, \nabla L, \eta)$.   (6)

The algorithm uses TD($\lambda$) in place of the objective value function $y^{tot}$, i.e., $y_{t}^{tot} = \lambda \gamma y_{t+1}^{tot} + \big(r_t + (1 - \lambda)\gamma Q_{tot}(\tau', a', s'; \theta^{-})\big)$, where $\lambda$ is set to 0.6. The learning rate $\eta$ of both the value function framework and the curiosity module is 0.001, and the number of samples randomly drawn from the replay buffer at each update (the batch size) is 128.
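Equations (4)–(6) and the TD($\lambda$) target can be combined into a single update step, sketched below under assumed names (`qtot_net`, `pred_model`); it is not the authors' implementation.

```python
import torch

def make_optimizer(qtot_net, pred_model, lr=1e-3):
    # Single Adam optimizer over the value-network parameters (theta)
    # and the predictor parameters (phi), as in Eq. (6).
    return torch.optim.Adam(
        list(qtot_net.parameters()) + list(pred_model.parameters()), lr=lr
    )

def update(optimizer, loss_theta, loss_phi, b):
    # Combined objective of Eq. (5): L = L(theta) + b * L(phi).
    loss = loss_theta + b * loss_phi
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

def td_lambda_targets(rewards, q_next, lam=0.6, gamma=0.99):
    """Backward recursion for the TD(lambda) target used in place of y_tot:
    y_t = r_t + (1 - lam) * gamma * q_next[t] + lam * gamma * y_{t+1}."""
    horizon = rewards.shape[0]
    y = torch.zeros_like(rewards)
    running = q_next[-1]              # bootstrap value for the final step
    for t in reversed(range(horizon)):
        running = rewards[t] + gamma * ((1 - lam) * q_next[t] + lam * running)
        y[t] = running
    return y
```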

5 Experiments

In this section, experiments are conducted to evaluate the PQmix algorithm on several difficult SMAC tasks. We compare PQmix with classical and effective multi-agent reinforcement learning approaches, and we additionally evaluate the robustness of the PQmix method.

5.1 Environment

The SMAC environment provides a diverse and differentiated set of agents, each with different actions, and it allows for complex cooperation among agents. In this paper, we evaluate the PQmix approach on the SMAC benchmark, where a multi-agent algorithm controls a set of units of mixed types that must cooperate to defeat another set of enemy units of mixed types controlled by fixed heuristic rules. The experiments use SMAC version 4.6 with adversary difficulty 7 (the default difficulty of the SMAC environment). As in most methods, this paper uses the default environment settings of SMAC. Each agent can observe its own state and, within its observation range, the statistics of other units, such as their health, position, and type. Agents can only attack enemies within firing range. During a battle, agents receive rewards both when the battle is won and when an enemy unit is killed or damaged. Each task has a step limit, so an episode may end without a victory or a defeat. The 2c_vs_64zg and corridor tasks are shown in Fig. 2. 2c_vs_64zg (Fig. 2(a)) is an asymmetric confrontation between different unit types, in which we use 2 Colossi against 64 Zerglings, which are fast but have lower health and damage. corridor (Fig. 2(b)) is a difficult confrontation between different unit types, in which we use 6 Zealots against 24 Zerglings.
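For readers who want to reproduce the environment setup, the following sketch shows how a SMAC map such as 2c_vs_64zg is typically instantiated with the publicly available smac package and driven with random actions; it mirrors the package's documented usage pattern rather than the authors' training code, and the difficulty string matches the default setting mentioned above.

```python
import numpy as np
from smac.env import StarCraft2Env

env = StarCraft2Env(map_name="2c_vs_64zg", difficulty="7")
info = env.get_env_info()
n_agents, n_actions = info["n_agents"], info["n_actions"]

env.reset()
terminated, episode_reward = False, 0.0
while not terminated:
    actions = []
    for agent_id in range(n_agents):
        # Pick a random action among the currently available ones.
        avail = np.nonzero(env.get_avail_agent_actions(agent_id))[0]
        actions.append(np.random.choice(avail))
    reward, terminated, _ = env.step(actions)
    episode_reward += reward
env.close()
print("episode reward:", episode_reward)
```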

5.2 Baselines

We compare the PQmix method with several classical and state-of-the-art methods on a range of difficult multi-agent tasks. Qmix [18] is a classical and effective multi-agent reinforcement learning method that uses a mixing network to establish the relationship between the individual Q-values and $Q_{tot}$. In addition, the article [8] improves Qmix by adding implementation tricks related to performance, so that it achieves state-of-the-art results in most environments; we take this Finetuned-Qmix (denoted Qmix) as one of the comparison baselines. QTRAN [21] and WQmix [17] improve the expressiveness of Qmix using true Q-value networks and theoretical constraints, respectively; for WQmix we use the more effective centrally weighted variant, i.e., CWQmix. The article [28] proposes the DOP algorithm, which combines a policy-based method (MADDPG) with Qmix. LICA [29] is a policy-based multi-agent algorithm that uses a centralized critic network and a hypernetwork to make full use of state information when making judgments for individual agents; it also uses an improved adaptive entropy regularization that dynamically rescales the entropy-term gradient to improve the agents' exploration ability. The Vmix method [22] addresses the problem that Qmix does not work well in parallel training and extends the idea of value function decomposition to a policy-based approach, specifically by combining A2C with Qmix and extending the monotonicity constraint to the critic value network. In addition, we compare our method with the advanced role-based exploration method RODE [26].

Fig. 2. Example tasks of the SMAC environment.

5.3 Comparison Experiment in SMAC Environment

In this section, we evaluate the performance of the PQmix algorithm and of the other algorithms on challenging SMAC tasks. Each algorithm is trained with six random seeds on each task. For the PQmix algorithm, we tested hyper-parameter combinations drawn from c = [0.005, 0.002, 0.001, 0.0008, 0.0005, 0.0002, 0.0001] and b = [1, 0.1, 0.01]; the best combination for each task is shown in Table 1. We use these ranges because an excessively high value of c makes the agents focus on exploration rather than on solving the task, while too small a value of c fails to reflect the role of intrinsic rewards; similarly, an excessively high value of b prevents the curiosity module from converging, while too small a value of b makes it fall into a local optimum.


In Fig. 3, the horizontal axis is the number of training steps and the vertical axis is the average win rate. The solid line is the mean win rate, and the shaded region around it shows its variance. As the results in Fig. 3 show, on the one hand, PQmix outperforms QTRAN, DOP, Vmix, and LICA, which fail to learn useful information in the more challenging tasks such as corridor. On the other hand, PQmix, Qmix, CWQmix, and RODE all perform well in difficult environments; however, PQmix outperforms all other methods on all eight difficult tasks that require complex cooperation, particularly 3s_vs_5z. On 2c_vs_64zg, RODE achieves good results early in training, but its performance is unstable and its final convergence performance is lower than that of our method. On corridor, PQmix exhibits better convergence speed and effectiveness than the other methods. On 5m_vs_6m, 6h_vs_8z, 8m_vs_9m, 10m_vs_11m, and MMM2, PQmix significantly outperforms all other methods, especially on 5m_vs_6m and 10m_vs_11m, where its final convergence performance is much better than the others. Across all tasks, PQmix exhibits lower variance during training, indicating stable performance across seeds and challenging tasks. These results indicate that the curiosity module proposed in this paper improves the exploration capability of Qmix, bringing Qmix's performance close to, or even beyond, that of other advanced methods. In summary, the PQmix algorithm shows good exploration ability, better robustness, faster convergence, and excellent average convergence performance compared with other strong methods.

Table 1. Important parameter settings for PQmix in the SMAC tasks.

Task            c        b
2c_vs_64zg      0.0001   0.1
3s_vs_5z        0.002    0.1
5m_vs_6m        0.002    0.1
6h_vs_8z        0.002    1
8m_vs_9m        0.005    0.01
10m_vs_11m      0.002    1
corridor        0.0001   1
MMM2            0.0008   0.1
3s5z_vs_3s6z    0.002    1

Fig. 3. Performance evaluation of PQmix. Compared with Qmix (Finetuned-Qmix), QTRAN, WQmix, DOP, LICA, Vmix, and RODE, PQmix makes great progress in sample efficiency and performance.

In summary, possible reasons for the effectiveness of the PQmix method on the above difficult tasks are as follows. First, the curiosity module proposed in this paper assesses the novelty of the joint state accurately and effectively. Second, states without external rewards may still be beneficial and worth exploring; for example, in the corridor task, agents are required to lure enemies and finally defeat them through cooperation. The PQmix method encourages agents to explore actions and states that yield no external reward during training, which enhances cooperation among the agents and improves their performance. Third, the PQmix method uses the joint action and joint local observation; when the number of agents increases, such as in the 10m_vs_11m task, which requires the algorithm to control 10 agents, the performance of the PQmix method is still better than that of the other methods, and the consumption of computational resources does not increase significantly.

Fig. 4. Performance comparison with other methods on the extremely hard 3s5z_vs_3s6z task; PQmix performs well.

In addition, we compare the PQmix method with other methods on the extremely difficult 3s5z_vs_3s6z task. The task is very hard and involves a mixed-type confrontation, so completing it requires strong exploration and cooperation capabilities from the agent algorithm. To address this, the network of the curiosity module of PQmix is slightly enlarged to improve its ability to assess joint-state novelty for this task. As shown in Fig. 4, many classical methods are less effective. RODE, Qmix, QTRAN, and PQmix perform well, and PQmix is the only method that exceeds an 80% win rate. In conclusion, PQmix is superior in performance to the other methods on the complex cooperative 3s5z_vs_3s6z task.

5.4 Robustness Evaluation of PQmix Algorithm

This section analyzes the robustness of the PQmix algorithm when some important hyper-parameters are changed. The two most important parameters in the PQmix method are c and b: the weight that combines intrinsic and extrinsic rewards, used to adjust the effect of intrinsic rewards on exploration, and the weight of the curiosity-module loss, respectively. The difficult 8m_vs_9m task is chosen to evaluate the robustness of the PQmix method, and six seeds are used for each curve in the two figures shown in Fig. 5. Specifically, as shown in Fig. 5(a), the parameter b is varied over the three values [1, 0.1, 0.01] with the parameter c fixed at 0.005, from which it can be seen that the method is not very sensitive to b when c is fixed. In Fig. 5(b), we fix b = 0.01 and vary the intrinsic reward parameter c over [0.005, 0.002, 0.001, 0.0008, 0.0005, 0.0002, 0.0001], which shows that the PQmix method is more sensitive to the intrinsic reward parameter. In summary, across the parameter combinations in the figure, the PQmix method is more sensitive to changes in the intrinsic reward parameter c, while changes in b have less influence on performance, so the method has some robustness.

Fig. 5. The left figure fixes the parameter c and the right figure fixes the parameter b; robustness evaluation of PQmix on the 8m_vs_9m task.

Table 2. The average performance results of PQmix with different parameter combinations after training 10M timesteps on the 8m_vs_9m task.

c        b = 1   b = 0.1   b = 0.01
0.0001   0.73    0.76      0.67
0.0002   0.77    0.48      0.63
0.0005   0.68    0.72      0.59
0.0008   0.76    0.63      0.72
0.001    0.58    0.75      0.71
0.002    0.76    0.78      0.77
0.005    0.79    0.78      0.81

To further analyze the robustness of the algorithm when the parameters are changed, we ran several additional experiments with other parameter combinations and compared the average performance after 10M timesteps of training on the 8m_vs_9m task. As shown in Table 2, we combine the values of b and c from the sets mentioned above and record the average performance under each combination, with six seeds trained on 8m_vs_9m for each parameter setting. From the table, we conclude that the effect of changes in b on the robustness of the algorithm is small for suitable settings of the intrinsic reward parameter c; for example, the algorithm is stable when c is set to 0.005 or 0.002, while the average win rate after training varies more with b under the other settings. The analysis also shows that an appropriate value of c improves the robustness of the algorithm. Finally, the average win rate of the PQmix method exceeds 80% on the 8m_vs_9m task with the combination b = 0.01, c = 0.005, with which the method performs best. We conducted extensive experiments with different parameter combinations on various complex tasks and obtained the well-performing hyper-parameter combinations shown in Table 1.

6 Conclusion

This paper introduces the PQmix method, which utilizes action prediction to improve the exploration efficiency of a multi-agent algorithm through a curiosity-driven module. PQmix modifies the classical Qmix method by adding a curiosity module that measures the novelty of the joint state and uses the error of action prediction as an intrinsic reward to encourage agents to fully explore the joint state space. Moreover, we evaluate the exploration and cooperation capabilities of the PQmix method in the SMAC environment. The PQmix method is compared with other classical methods on nine complex cooperative tasks, and the results show that it has a significant advantage over the others in terms of final performance and convergence speed. In addition, this paper measures the robustness of the PQmix method on harder tasks and verifies that the method has some robustness. In the future, we will experiment with more novel ways of measuring the novelty of joint states and will add diversity rewards among agents to encourage different agents to perform differentiated actions without increasing resource consumption.

Acknowledgements. This work is partially supported by the major Science and Technology Innovation 2030 "New Generation Artificial Intelligence" project 2020AAA0104803.

References

1. Bellemare, M., Srinivasan, S., Ostrovski, G., Schaul, T., Saxton, D., Munos, R.: Unifying count-based exploration and intrinsic motivation. In: Advances in Neural Information Processing Systems, vol. 29 (2016)


2. Buşoniu, L., Babuška, R., De Schutter, B.: Multi-agent reinforcement learning: an overview. In: Srinivasan, D., Jain, L.C. (eds.) Innovations in Multi-agent Systems and Applications-1, pp. 183–221. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-14435-6_7
3. Dilokthanakul, N., Kaplanis, C., Pawlowski, N., Shanahan, M.: Feature control as intrinsic motivation for hierarchical reinforcement learning. IEEE Trans. Neural Netw. Learn. Syst. 30(11), 3409–3418 (2019)
4. Du, Y., Han, L., Fang, M., Liu, J., Dai, T., Tao, D.: LIIR: learning individual intrinsic reward in multi-agent reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
5. Goyal, A., et al.: Infobot: transfer and exploration via the information bottleneck. arXiv preprint arXiv:1901.10902 (2019)
6. Graves, A.: Practical variational inference for neural networks. In: Advances in Neural Information Processing Systems, vol. 24 (2011)
7. Hao, J., et al.: Exploration in deep reinforcement learning: from single-agent to multiagent domain. IEEE Trans. Neural Netw. Learn. Syst. (2023)
8. Hu, J., Jiang, S., Harding, S.A., Wu, H., Liao, S.W.: Rethinking the implementation tricks and monotonicity constraint in cooperative multi-agent reinforcement learning (2021)
9. Jaques, N., et al.: Social influence as intrinsic motivation for multi-agent deep reinforcement learning. In: International Conference on Machine Learning, pp. 3040–3049. PMLR (2019)
10. Liang, L., Ye, H., Li, G.Y.: Spectrum sharing in vehicular networks based on multiagent reinforcement learning. IEEE J. Sel. Areas Commun. 37(10), 2282–2292 (2019)
11. Lowe, R., Wu, Y.I., Tamar, A., Harb, J., Pieter Abbeel, O., Mordatch, I.: Multiagent actor-critic for mixed cooperative-competitive environments. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
12. Mahajan, A., Rashid, T., Samvelyan, M., Whiteson, S.: Maven: multi-agent variational exploration. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
13. Oliehoek, F.A., Amato, C.: A Concise Introduction to Decentralized POMDPs. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-28929-8
14. Oliehoek, F.A., Spaan, M.T., Vlassis, N.: Optimal and approximate Q-value functions for decentralized POMDPs. J. Artif. Intell. Res. 32, 289–353 (2008)
15. Oroojlooy, A., Hajinezhad, D.: A review of cooperative multi-agent deep reinforcement learning. Appl. Intell. 1–46 (2022)
16. Pathak, D., Agrawal, P., Efros, A.A., Darrell, T.: Curiosity-driven exploration by self-supervised prediction. In: International Conference on Machine Learning, pp. 2778–2787. PMLR (2017)
17. Rashid, T., Farquhar, G., Peng, B., Whiteson, S.: Weighted QMIX: expanding monotonic value function factorisation for deep multi-agent reinforcement learning. Adv. Neural. Inf. Process. Syst. 33, 10199–10210 (2020)
18. Rashid, T., Samvelyan, M., Schroeder, C., Farquhar, G., Foerster, J., Whiteson, S.: QMIX: monotonic value function factorisation for deep multi-agent reinforcement learning. In: International Conference on Machine Learning, pp. 4295–4304. PMLR (2018)
19. Samvelyan, M., et al.: The starcraft multi-agent challenge. arXiv preprint arXiv:1902.04043 (2019)
20. Shalev-Shwartz, S., Shammah, S., Shashua, A.: Safe, multi-agent, reinforcement learning for autonomous driving. arXiv preprint arXiv:1610.03295 (2016)


21. Son, K., Kim, D., Kang, W.J., Hostallero, D.E., Yi, Y.: Qtran: learning to factorize with transformation for cooperative multi-agent reinforcement learning. In: International Conference on Machine Learning, pp. 5887–5896. PMLR (2019)
22. Su, J., Adams, S., Beling, P.: Value-decomposition multi-agent actor-critics. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 11352–11360 (2021)
23. Sunehag, P., et al.: Value-decomposition networks for cooperative multi-agent learning. arXiv preprint arXiv:1706.05296 (2017)
24. Tang, H., et al.: #Exploration: a study of count-based exploration for deep reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
25. Wang, J., Ren, Z., Liu, T., Yu, Y., Zhang, C.: Qplex: duplex dueling multi-agent Q-learning. arXiv preprint arXiv:2008.01062 (2020)
26. Wang, T., Gupta, T., Peng, B., Mahajan, A., Whiteson, S., Zhang, C.: Rode: learning roles to decompose multi-agent tasks. In: Proceedings of the International Conference on Learning Representations. OpenReview (2021)
27. Wang, T., Wang, J., Wu, Y., Zhang, C.: Influence-based multi-agent exploration. arXiv preprint arXiv:1910.05512 (2019)
28. Wang, Y., Han, B., Wang, T., Dong, H., Zhang, C.: Off-policy multi-agent decomposed policy gradients. arXiv preprint arXiv:2007.12322 (2020)
29. Zhou, M., Liu, Z., Sui, P., Li, Y., Chung, Y.Y.: Learning implicit credit assignment for cooperative multi-agent reinforcement learning. Adv. Neural. Inf. Process. Syst. 33, 11853–11864 (2020)

SLAM: A Lightweight Spatial Location Attention Module for Object Detection

Changda Liu, Yunfeng Xu(B), and Jiakui Zhong

Hebei University of Science and Technology, Shijiazhuang 050000, Hebei, China
[email protected]

Abstract. To address the shortcomings of current object detection models, including large numbers of parameters, inaccurate localization of target bounding boxes, and ineffective detection, this paper proposes a lightweight spatial location attention module (SLAM). By learning the spatial location information in the input feature map, SLAM adaptively adjusts the attention weights of the location information in the feature map and greatly improves the feature representation capability of the network. First, the SLAM module obtains the spatial distribution of the input feature map in the horizontal, vertical, and channel directions through average pooling and max pooling operations; it then generates the corresponding location attention weights with convolution and activation functions, and finally produces the weighted feature map by aggregating the features along the three spatial directions. Extensive experiments show that the SLAM module improves the detection performance of the model on the MS COCO dataset and the PASCAL VOC 2012 dataset with almost no additional computational overhead.

Keywords: Attention mechanism · Spatial position · Object detection

1 Introduction

Object detection is one of the core topics in the computer vision field and is the basis of many other vision tasks [19,32]. It aims to find and localize specific targets in images, integrating cutting-edge techniques from fields such as image processing, pattern recognition, feature extraction, and deep learning, which makes it a very challenging task. Deep learning techniques are favored by researchers for their powerful feature extraction capabilities. Convolutional neural networks (CNNs), as a commonly used method in deep learning, have been widely applied in object detection. Significant progress has been made in CNN-based object detection algorithms, which have replaced traditional object detection methods [4]. These algorithms have been industrially applied in various fields such as security, industry, and automotive-assisted driving [5,13]. In practical applications of object detection, very few algorithms achieve both speed and accuracy, for the two are often in conflict. In general, the larger the number of parameters a model has, the better it performs, but the longer its inference time and the higher the computational power required of the device. Moreover, deep learning-based object detection methods typically employ deeper network structures and more complex feature fusion techniques to extract more expressive deep features and enhance the model's representation capability. As a result, most existing CNN-based object detection algorithms suffer from bloated network structures, excessively large parameter sizes, slow inference speeds, and difficulties in achieving real-time detection. Therefore, finding a model with a suitable balance between quality and computational requirements, particularly for devices with limited computing power, remains a challenging task.

In this paper, we propose the Spatial Location Attention Module (SLAM), a novel network module that enhances the accuracy of the model while incurring minimal computational cost. SLAM effectively incorporates spatial position information into the feature map and dynamically adjusts the significance of each position, thereby improving the localization and recognition of objects of interest. Our approach is flexible and lightweight, making it easily transferable to other CNNs. To achieve this, we employ average pooling and max pooling to aggregate the input feature map across the horizontal, vertical, and channel dimensions, resulting in three attention maps with directional sensitivity. These attention maps are then multiplied with the input feature map, generating a feature map enriched with spatial position information.

Fig. 1. Performance of different attention methods on the COCO object detection dataset. The results in (a) are based on YOLOv5s and those in (b) on YOLOv8s. The Y-axis is mAP. Clearly, our approach performs better.


Extensive experiments on the MS COCO dataset [16] and the PASCAL VOC 2012 dataset [3] demonstrate the advantages of our proposed attention module over others. The networks incorporating our attention module show improved performance with a negligible increase in parameters and computational complexity. As shown in Fig. 1, the network with SLAM achieves higher accuracy than the baseline network and outperforms other attention modules. Moreover, visualizations of the feature maps generated by different attention methods, produced with Grad-CAM [25], show that the network with SLAM focuses better on the target objects than the baseline network and the other attention-module networks. The main contributions of this article are as follows: (1) we propose a lightweight Spatial Location Attention Module (SLAM) that can be flexibly integrated into CNNs to enhance network performance; (2) we validate the effectiveness of each component of SLAM through ablation experiments; (3) experimental results demonstrate that inserting SLAM at different network positions improves detection performance on both the MS COCO and PASCAL VOC 2012 datasets.

2 Related Work

2.1 Fully Convolutional Object Detector

The R-CNN series of algorithms [8–10,24] are object detection algorithms built on region proposals. They generate a series of candidate bounding boxes (i.e., proposal boxes) using an RPN network; the candidate boxes are then classified and regressed to determine the presence and location of target objects within the image. The RPN is a deep learning-based proposal generation network that uses convolutional operations to generate candidate boxes on feature maps; these candidate boxes are then mapped to the corresponding feature maps using an ROI (Region of Interest) Pooling layer. The R-CNN series has demonstrated excellent detection performance, but it suffers from slow processing speed and struggles to meet the demands of real-time detection. In contrast, the YOLO (You Only Look Once) series of algorithms [1,7,14,21–23,28,31] are end-to-end object detectors that directly classify and regress over the entire image with a single neural network, resulting in fast detection. YOLOv5 adopts an anchor-based approach: anchor boxes are generated on the input image, and a classifier and regressor predict whether each anchor box contains an object and adjust its position and size to better match the actual object location. YOLOv5 also employs a coupled head structure that integrates features from varying scales, leading to enhanced detection accuracy. YOLOv8, on the other hand, employs an anchor-free approach, eliminating the need to generate anchor boxes in advance; instead, it predicts densely on the feature map, avoiding anchor-box generation and filtering, which reduces both computational complexity and memory usage. YOLOv8 also incorporates the integral representation proposed with the Distribution Focal Loss [15] into its regression branch, which better handles changes in target scale and aspect ratio and improves detection robustness. Furthermore, YOLOv8 adopts a decoupled head structure that removes the objectness prediction branch, simplifying the model while increasing detection speed. The YOLO series offers a diverse range of applications, features simple structures, and effectively fulfills real-time detection requirements. Consequently, we select YOLOv5 and YOLOv8 as our base models.

2.2 Attention Mechanisms

It is widely acknowledged that attention plays an important role in the human visual system [26]. By focusing solely on important target areas within a scene and allocating more attention to these regions, humans can extract crucial information while disregarding irrelevant data; through this mechanism, valuable information is quickly filtered out from a large amount of data. In computer vision, attention mechanisms likewise play a crucial role in various tasks [6,18,27]. They quantify the importance of information through learned weights, allowing a model to focus on essential information and disregard unimportant details, thereby enhancing its efficiency. Hu et al. [12] proposed the Squeeze-and-Excitation Network (SENet): the SE module performs a squeeze operation on the feature map to obtain channel-wise global features and then an excitation operation on these global features, allowing the model to learn the importance of different channels and enhancing the representational power of the network's basic modules. Wang et al. [29] highlighted the importance of learning effective channel attention while avoiding dimensionality reduction and fostering appropriate cross-channel interaction; to this end they proposed ECA-Net, which uses only a one-dimensional convolution to compute channel attention. Chen et al. [2] introduced SCA, a mechanism that integrates both spatial and channel attention to learn weights for each spatial position and channel, which are then used to obtain an adaptive feature representation of the input feature map. Woo et al. [30] proposed CBAM, which combines channel and spatial attention mechanisms: the CBAM module sequentially infers attention weights along the channel and spatial dimensions and applies them to the input feature map, producing refined, adaptive features that improve the model's accuracy. Closest to our work, Hou et al. proposed the CA module [11], which decomposes channel attention by applying global average pooling along two spatial directions and aggregating features through one-dimensional encoding. However, experiments show that incorporating max pooling is crucial for acquiring more fine-grained channel attention, and the authors overlooked the importance of attention in the spatial direction. To overcome these limitations, SLAM simultaneously aggregates features along all three spatial directions using both average pooling and max pooling. Experimental results demonstrate the outstanding performance of SLAM on object detection tasks on both the MS COCO dataset and the PASCAL VOC dataset.

3 Spatial Location Attention Module

SLAM can be considered a computational unit that enhances the network's focus on important feature information. Given an input feature map $F \in \mathbb{R}^{C \times H \times W}$, the SLAM module sequentially calculates the attention feature maps in the horizontal direction, denoted $M_W \in \mathbb{R}^{C \times H \times 1}$, in the vertical direction, $M_H \in \mathbb{R}^{C \times 1 \times W}$, and in the channel direction, $M_S \in \mathbb{R}^{1 \times H \times W}$. The complete process can be summarized as

$F' = F \otimes M_W(F) \otimes M_H(F) \otimes M_S(F)$,   (1)

where $\otimes$ denotes element-wise multiplication between the attention feature maps and the input feature map; during the multiplication the attention feature maps are broadcast (copied) appropriately, and $F'$ is the final output.

3.1 Attention Module Review and Comparison

CBAM is an attention module that combines spatial and channel attention, learning important features along both the channel and spatial dimensions by applying a channel attention module and a spatial attention module sequentially. As shown in Fig. 2, the channel attention of CBAM applies max pooling and average pooling to the input feature map $F \in \mathbb{R}^{C \times H \times W}$, followed by fully connected layers and activation functions, to generate a 1D channel attention map $M_C \in \mathbb{R}^{C \times 1 \times 1}$; $M_C$ is multiplied element-wise with $F$ to obtain the feature map $F'$. The spatial attention module then applies max pooling and average pooling to $F'$, followed by a convolution and activation function, to generate a spatial attention map $M_S \in \mathbb{R}^{1 \times H \times W}$; $M_S$ is multiplied element-wise with $F'$ to obtain $F''$. The entire attention computation can be summarized as

$F' = M_C(F) \otimes F, \quad F'' = M_S(F') \otimes F'$,   (2)

where $\otimes$ denotes element-wise multiplication and $F''$ is the final output. CBAM considers attention in both the spatial and channel dimensions but overlooks positional information, resulting in poorer capture of long-range dependencies.


Fig. 2. CBAM’s process and organization.
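As a concrete reference point for this comparison, a minimal PyTorch sketch of a CBAM-style block is given below. It follows Eq. (2); as in the original CBAM design, the spatial branch concatenates the pooled maps before the 7×7 convolution, and the reduction ratio and kernel size are illustrative defaults rather than values taken from this paper.

```python
import torch
import torch.nn as nn

class CBAMSketch(nn.Module):
    """Channel attention M_C followed by spatial attention M_S (Eq. (2))."""

    def __init__(self, channels, reduction=16, kernel_size=7):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(),
            nn.Linear(channels // reduction, channels),
        )
        self.spatial_conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, f):                         # f: (B, C, H, W)
        # Channel attention: shared MLP over avg- and max-pooled descriptors.
        avg = self.mlp(f.mean(dim=(2, 3)))
        mx = self.mlp(f.amax(dim=(2, 3)))
        m_c = torch.sigmoid(avg + mx).view(f.size(0), -1, 1, 1)
        f1 = m_c * f                              # F' = M_C(F) * F

        # Spatial attention: 7x7 conv over channel-wise avg and max maps.
        avg_map = f1.mean(dim=1, keepdim=True)
        max_map = f1.amax(dim=1, keepdim=True)
        m_s = torch.sigmoid(self.spatial_conv(torch.cat([avg_map, max_map], dim=1)))
        return m_s * f1                           # F'' = M_S(F') * F'
```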

CA addresses the loss of positional information caused by 2D global pooling by embedding positional information into channel attention. As shown in Fig. 3, the CA module splits the input feature map $F \in \mathbb{R}^{C \times H \times W}$ along the width and height directions and performs global average pooling separately, yielding a width-wise attention map $M_W \in \mathbb{R}^{C \times H \times 1}$ and a height-wise attention map $M_H \in \mathbb{R}^{C \times 1 \times W}$. These maps then pass through convolutions and activation functions to obtain the refined width-wise attention map $M_W' \in \mathbb{R}^{C \times H \times 1}$ and height-wise attention map $M_H' \in \mathbb{R}^{C \times 1 \times W}$. The feature map $F$ is multiplied element-wise with $M_W'$ and $M_H'$ to obtain the final output feature map $F'$. The entire process can be summarized as

$F' = F \otimes M_W'(F) \otimes M_H'(F)$,   (3)

where $\otimes$ denotes element-wise multiplication and $F'$ is the final output. CA incorporates positional information into channel attention to capture long-range dependencies, but it overlooks spatial attention and uses only average pooling for channel attention, lacking the max pooling that could infer more precise channel attention. We now introduce a new attention module that considers both channel and spatial positional relationships and embeds spatial positional information into the attention.

Fig. 3. CA’s process and organization.

3.2 The Generation of Spatial Location Attention

Global pooling is used to encode spatial information globally. We adopt both average pooling and max pooling operations to aggregate the spatial information of the feature map. Average pooling is utilized to preserve the overall characteristics of the input data and the global information in the image. On the other hand, max pooling is employed to infer more fine-grained attention.

Fig. 4. SLAM’s process and organization.

Our spatial positional attention generation involves three primary steps, as illustrated in Fig. 4. For the input feature map $F \in \mathbb{R}^{C \times H \times W}$, the first step applies average pooling and max pooling along the vertical direction, resulting in two attention maps $M_{avg}^{H} \in \mathbb{R}^{C \times 1 \times W}$ and $M_{max}^{H} \in \mathbb{R}^{C \times 1 \times W}$. The two maps are summed and passed through a fully connected layer followed by an activation function to generate the vertical channel attention map $M_H \in \mathbb{R}^{C \times 1 \times W}$:

$M_H(F) = \sigma\big(MLP(AvgPool_H(F) + MaxPool_H(F))\big) = \sigma\big(W_1(W_0(F_{avg}^{H} + F_{max}^{H}))\big)$,   (4)

where $\sigma$ denotes the sigmoid function and $W_0 \in \mathbb{R}^{C/r \times C}$ and $W_1 \in \mathbb{R}^{C \times C/r}$ are the weights of the MLP.

In the second step, average pooling and max pooling are performed along the horizontal direction of the feature map, generating two attention maps $M_{avg}^{W} \in \mathbb{R}^{C \times H \times 1}$ and $M_{max}^{W} \in \mathbb{R}^{C \times H \times 1}$. These are added together and passed through a fully connected layer and an activation function to generate the horizontal channel attention map $M_W \in \mathbb{R}^{C \times H \times 1}$:

$M_W(F) = \sigma\big(MLP(AvgPool_W(F) + MaxPool_W(F))\big) = \sigma\big(W_1(W_0(F_{avg}^{W} + F_{max}^{W}))\big)$,   (5)

where $\sigma$ denotes the sigmoid function and $W_0 \in \mathbb{R}^{C/r \times C}$ and $W_1 \in \mathbb{R}^{C \times C/r}$ are the weights of the MLP.

In the third step, we aggregate channel information using average pooling and max pooling, obtaining two attention maps $M_{avg}^{S} \in \mathbb{R}^{1 \times H \times W}$ and $M_{max}^{S} \in \mathbb{R}^{1 \times H \times W}$. These two maps are concatenated along the channel dimension and passed through a convolution and activation function to generate the spatial attention map $M_S \in \mathbb{R}^{1 \times H \times W}$:

$M_S(F) = \sigma\big(f^{7 \times 7}(AvgPool_S(F) + MaxPool_S(F))\big) = \sigma\big(f^{7 \times 7}(F_{avg}^{S} + F_{max}^{S})\big)$,   (6)

where $\sigma$ denotes the sigmoid function and $f^{7 \times 7}$ represents a convolution with a kernel size of $7 \times 7$.

Finally, the attention maps $M_H$, $M_W$, $M_S$ and the input feature map $F$ are multiplied together to obtain the final output feature map $F'$. The full generation of spatial location attention can be summarized as

$F' = F \otimes \sigma\big(W_1(W_0(F_{avg}^{H} + F_{max}^{H}))\big) \otimes \sigma\big(W_1(W_0(F_{avg}^{W} + F_{max}^{W}))\big) \otimes \sigma\big(f^{7 \times 7}(F_{avg}^{S} + F_{max}^{S})\big)$,   (7)

where $\sigma$ denotes the sigmoid function, $W_0 \in \mathbb{R}^{C/r \times C}$ and $W_1 \in \mathbb{R}^{C \times C/r}$ are the weights of the MLP, $f^{7 \times 7}$ represents a convolution with a kernel size of $7 \times 7$, and $F'$ is the final output.

What sets our attention module apart from others is that it incorporates spatial position information into the feature map, allowing adaptive adjustment of the importance of each position. This enables more precise localization of the regions of interest and facilitates better recognition by the network; detailed demonstrations are provided in the experimental section.
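The three steps above can be put together in a short PyTorch sketch. The paper does not provide code, so details such as realizing the shared MLP as 1×1 convolutions applied along the pooled axis, and concatenating (rather than summing) the channel-pooled maps in the spatial branch as the text describes, are implementation assumptions.

```python
import torch
import torch.nn as nn

class SLAMSketch(nn.Module):
    """A sketch of the SLAM module (Eqs. (4)-(7))."""

    def __init__(self, channels, reduction=16, kernel_size=7):
        super().__init__()
        # Shared MLP (W0, W1) realised as channel-wise 1x1 convolutions.
        self.mlp = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(),
            nn.Conv2d(channels // reduction, channels, 1),
        )
        self.spatial_conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, f):                                   # f: (B, C, H, W)
        # Eq. (4): pool over the vertical (H) axis -> (B, C, 1, W)
        m_h = torch.sigmoid(
            self.mlp(f.mean(dim=2, keepdim=True) + f.amax(dim=2, keepdim=True))
        )
        # Eq. (5): pool over the horizontal (W) axis -> (B, C, H, 1)
        m_w = torch.sigmoid(
            self.mlp(f.mean(dim=3, keepdim=True) + f.amax(dim=3, keepdim=True))
        )
        # Eq. (6): pool over the channel axis -> (B, 1, H, W), concat, 7x7 conv
        s = torch.cat([f.mean(dim=1, keepdim=True), f.amax(dim=1, keepdim=True)], dim=1)
        m_s = torch.sigmoid(self.spatial_conv(s))
        # Eq. (7): broadcasted element-wise product of the three attention maps
        return f * m_w * m_h * m_s
```

Because the three attention maps are simply broadcast against the input, the extra parameters come only from the small shared MLP and a single 7×7 convolution, which is consistent with the negligible overhead reported in Sect. 4.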

3.3 Exemplars: SLAM_C3 and SLAM_C2f

SLAM, as a plug-and-play attention mechanism module, can be easily applied to most convolutional neural networks. In our experiments, we seamlessly integrated the SLAM module into the C3 module of YOLOv5 and the C2f module of YOLOv8, resulting in the SLAM_C3 and SLAM_C2f modules, as shown in Fig. 5.


Fig. 5. We have depicted the processes of the SLAM_C3 and SLAM_C2f modules in the diagram, along with the detailed components of each module.
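To illustrate the plug-and-play claim, the sketch below attaches the SLAMSketch block from Sect. 3.2 to the output of a C3-style block. Here `c3_block` is only a stand-in for the YOLOv5 C3 module, and the exact insertion points used for SLAM_C3 and SLAM_C2f are those shown in Fig. 5, not this simplified wrapper.

```python
import torch.nn as nn

class SLAM_C3Sketch(nn.Module):
    """Illustrative composition: SLAM applied to the output of a C3-style block."""

    def __init__(self, c3_block, out_channels):
        super().__init__()
        self.c3 = c3_block                    # placeholder for the YOLOv5 C3 module
        self.attn = SLAMSketch(out_channels)  # sketch from Sect. 3.2

    def forward(self, x):
        return self.attn(self.c3(x))
```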

4 Experiments

In this section, we first describe the parameter settings of the experiments. Then we conduct a series of ablation experiments to demonstrate the contribution of each component of the proposed SLAM to network performance. Next, we compare the attention method proposed in this paper with other existing attention methods. Finally, we compare the YOLOv5s_SLAM network with other state-of-the-art object detection networks.

4.1 Experiment Setup

We implement all experiments with the PyTorch framework [20] and train the models with the SGD optimizer. The initial learning rate is set to 0.01, momentum to 0.937, and weight decay to 0.0005. Training is performed on 8 NVIDIA GeForce RTX 3090 GPUs. The input image size is 640 × 640 and the training batch size is 256. For YOLOv8, the MS COCO experiments are run for 500 epochs, while the other experiments are trained for 300 epochs.
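For concreteness, these optimizer settings correspond to the following PyTorch call; `model` is a placeholder for the detector being trained.

```python
import torch

def build_optimizer(model):
    # SGD with the hyper-parameters described above.
    return torch.optim.SGD(
        model.parameters(),
        lr=0.01,            # initial learning rate
        momentum=0.937,
        weight_decay=0.0005,
    )
```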

4.2 Ablation Studies

We conducted a series of ablation experiments, with the results shown in Table 1. The experiments demonstrate that using both average pooling and max pooling yields more precise attention, and that combining attention in the horizontal, vertical, and channel directions gives the best results. First, we compared three variants of horizontal-and-vertical (H&W) attention: average pooling, max pooling, and the simultaneous use of both pooling methods; when both pooling operations are used, the MLP parameters are shared. We then combined the attention in the horizontal, vertical, and channel directions to create our final attention module, named SLAM.

Table 1. The results of ablation experiments.

Model                   Param.(M)  FLOPs.(B)  mAP0.5:0.95 (%)  mAP0.5 (%)
YOLOv5s                 7.1        15.9       46.4             66.1
+ HW_mean               7.2        16.1       46.4             66.5
+ HW_max                7.2        16.1       46.6             66.4
+ HW_mean&max           7.2        16.1       46.8             66.9
+ HWS_mean&max (SLAM)   7.2        16.1       47.2             67.0

From the data in Table 1, we observe that both average pooling and max pooling in the horizontal and vertical directions improve the model's performance, and that using both pooling methods simultaneously yields even better results. Furthermore, with a comparable number of learnable parameters and comparable computational cost, SLAM allows the model to learn information from different positions in the feature map: it captures spatial context, enhances spatial position awareness, and improves model performance.

4.3 Comparison with Other Attention Methods

We conducted extensive experiments on the MS COCO dataset and the PASCAL VOC 2012 dataset to compare the performance of SLAM, CBAM, and CA on object detection tasks. The results demonstrate that our proposed attention method has distinct advantages over the others.

MS COCO Object Detection. We conducted experiments on the COCO 2017 dataset, which consists of 118,287 training images and 5,000 validation images. The evaluation metrics include precision (P), recall (R), and mean average precision (mAP). The experimental results are shown in Table 2.

Table 2. Object detection on the MS COCO validation set.

Model     Param.(M)  FLOPs.(B)  P     R     mAP0.5:0.95 (%)  mAP0.5 (%)
YOLOv5s   7.2        16.5       64.0  51.3  37.2             56.0
+ CBAM    8.0        17.5       67.4  52.2  37.9             57.6
+ CA      7.3        16.6       68.2  52.8  38.6             58.3
+ SLAM    7.3        16.7       68.1  54.2  39.8             59.7
YOLOv8s   11.2       28.6       68.2  56.3  44.9             61.3
+ CBAM    11.9       29.2       68.4  56.0  44.7             61.6
+ CA      11.2       28.8       69.4  55.3  45.0             61.9
+ SLAM    11.2       28.7       69.2  56.2  45.3             62.3

Table 2 presents the results of the YOLOv5s and YOLOv8s networks with different attention methods added, evaluated on the COCO 2017 validation set. Clearly, the performance of the networks improves when SLAM is added compared to the baseline, while the number of model parameters remains almost unchanged, indicating the effectiveness of our proposed method. Compared with the other attention methods, our SLAM performs better, with the highest mAP values. In particular, when SLAM is added to YOLOv5s, mAP0.5:0.95 increases by 2.6% to 39.8% and mAP0.5 increases by 3.7%, indicating a significant improvement in detection accuracy. Adding SLAM to YOLOv8s also results in a slight improvement in detection accuracy, with a 1.0% increase in mAP0.5.

PASCAL VOC 2012 Object Detection. We also conducted experiments on the PASCAL VOC 2012 dataset. In this experiment, we redefined the dataset split with a ratio of 8:2 for training to validation, yielding 13,700 training images and 3,425 validation images. The experimental results are shown in Table 3. We can clearly see that the parameter overhead introduced by adding SLAM to the network is negligible, while the network's performance is enhanced. This indicates that the improved network performance is not caused by an increase in model parameters, but rather by the effectiveness of our SLAM method in refining features.

Table 3. Object detection on the PASCAL VOC 2012 validation set.

R

mAP0.5:0.95 (%) mAP0.5 (%)

YOLOv5s + CBAM + CA + SLAM

7.1 7.8 7.2 7.2

15.9 16.8 16.1 16.1

73.2 68.8 71.7 72.7

61.1 64.2 63.1 61.8

46.4 46.9 47.0 47.2

66.1 66.1 66.9 67.0

YOLOv8s + CBAM + CA + SLAM

11.1 11.9 11.2 11.2

28.5 29.1 28.6 28.6

69.7 69.1 69.3 71.7

60.9 61.5 61.5 61.3

49.3 49.7 49.9 50.7

65.3 65.5 65.9 66.4

The experiments on both the COCO and Pascal VOC datasets demonstrate that the attention method proposed in this paper performs better compared to other attention methods.

384

4.4

C. Liu et al.

Network Visualization with Grad-CAM

To make the results more intuitive, we applied Grad-CAM to models with different attention methods using images from the MS COCO validation set. GradCAM is a method for visualizing convolutional neural networks predictions, which effectively highlights the regions of interest in the input image that the network focuses on. We compared the visualization results of YOLOv5s_SLAM, YOLOv5s_CBAM, YOLOv5s_CA, and YOLOv5s networks, as shown in Fig. 6.

Fig. 6. Grad-CAM visualization results.

In Fig. 6, we can observe that compared to CBAM and CA, our SLAM can better locate the objects of interest. In other words, the SLAM-integrated network can better identify which regions in the input image are most important for the classification decisions in object detection. 4.5

Compare with Other Models

To further demonstrate the superiority of SLAM proposed in this paper, we compared it with YOLOv5s and other state-of-the-art object detection methods. We replaced the C3 module in the YOLOv5s network with the SLAM_C3 module, as shown in Fig. 5. The rest of the experimental settings followed the original files. The experimental results of each network on the COCO 2017 validation set are listed in Table 4. As shown in Table 4, YOLOv5s_SLAM significantly improves the object detection results compared to the baseline network YOLOv5s. It also performs competitively when compared to other networks with similar or slightly higher parameters and computational complexities. It demonstrates good performance in terms of mAP0.5 and mAP0.5:0.95 metrics, especially excelling in mAP0.5 . In the comparison of the mAP0.5:0.95 metric, we can see that YOLOv6-Tiny achieves slightly better results, but it has more than twice the parameters and computational complexity compared to our network. Overall, the YOLOv5s_SLAM model achieves excellent detection results while ensuring speed and accuracy.

SLAM

385

Table 4. Comparison with state-of-the-art detectors on the COCO 2017 validation set.

Model         Param.(M)  FLOPs.(B)  mAP0.5:0.95 (%)  mAP0.5 (%)
SSD [17]      44.1       387.0      26.8             46.5
YOLOv3        61.9       156.6      33.0             57.9
YOLOv4-S      8.3        21.2       38.9             57.7
YOLOv5-S      7.1        16.1       37.2             56.0
YOLOX-S       9.6        26.8       39.6             57.7
YOLOv6-Tiny   15.0       36.7       40.3             56.6
YOLOv7-Tiny   6.2        13.9       37.5             52.5
ours          7.3        16.7       39.8             59.7

5 Conclusion

In this paper, we propose a lightweight spatial location attention module that can significantly improve network performance with almost no additional overhead. The method adaptively adjusts the importance of each location in the feature map by learning the spatial location information of the input feature map, so as to better locate which regions of the input image are most important for the classification decisions of object detection. We have conducted extensive experiments on different datasets against other commonly used attention methods and confirmed the effectiveness of our SLAM. Our method also consumes fewer computational resources than other attention methods, which makes it suitable for computationally constrained environments such as mobile devices or embedded systems. In summary, the proposed spatial location attention module is an effective, lightweight attention mechanism that improves network performance on object detection tasks. We believe this approach can find broader applications in future computer vision research and deliver better results in practical applications.

Acknowledgements. This paper is funded by the Supported Projects of Key R&D Programs in Hebei Province (No. 21373802D) and the Artificial Intelligence Collaborative Education Project of the Ministry of Education (201801003011). The GPU server used in this paper is jointly funded by Shijiazhuang Wusuo Network Technology Co., LTD and Hebei Rouzun Technology Co., LTD.

References

1. Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
2. Chen, L., et al.: SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5659–5667 (2017)


3. Everingham, M., Eslami, S.A., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes challenge: a retrospective. Int. J. Comput. Vision 111, 98–136 (2015)
4. Zhou, F.Y., Jin, L.P., Dong, J.: Review of convolutional neural networks. J. Comput. Sci. 40(06), 1229–1251 (2017)
5. Feng, D., et al.: Deep multi-modal object detection and semantic segmentation for autonomous driving: datasets, methods, and challenges. IEEE Trans. Intell. Transp. Syst. 22(3), 1341–1360 (2020)
6. Fu, J., et al.: Dual attention network for scene segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3146–3154 (2019)
7. Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: Yolox: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021)
8. Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
9. Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
10. He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
11. Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)
12. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
13. Li, B., Ouyang, W., Sheng, L., Zeng, X., Wang, X.: GS3D: an efficient 3D object detection framework for autonomous driving. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1019–1028 (2019)
14. Li, C., et al.: Yolov6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022)
15. Li, X., et al.: Generalized focal loss: learning qualified and distributed bounding boxes for dense object detection. Adv. Neural. Inf. Process. Syst. 33, 21002–21012 (2020)
16. Lin, T.Y., et al.: Microsoft coco: common objects in context (2014). arXiv preprint arXiv:1405.0312 (2019)
17. Liu, W., et al.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
18. Liu, Y., Shao, Z., Hoffmann, N.: Global attention mechanism: retain information to enhance channel-spatial interactions. arXiv preprint arXiv:2112.05561 (2021)
19. Lu, H., Zhang, Q.: Application of deep convolutional neural networks in computer vision. Data Acquisition Process. 31(01), 1–17 (2016)
20. Paszke, A., et al.: Pytorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
21. Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
22. Redmon, J., Farhadi, A.: Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)


23. Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
24. Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 28 (2015)
25. Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-cam: visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 618–626 (2017)
26. Tsotsos, J.K.: A Computational Perspective on Visual Attention. MIT Press, Cambridge (2021)
27. Wang, W., Shen, J.: A review of visual attention detection. J. Softw. 30(02), 416–439 (2019)
28. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: Yolov7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7464–7475 (2023)
29. Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., Hu, Q.: ECA-net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11534–11542 (2020)
30. Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
31. Xu, S., et al.: PP-YOLOE: an evolved version of yolo (2022)
32. Yin, H., Chen, B., Chai, Y., Liu, Z.: A review of vision-based target detection and tracking. Acta Automatica Sinica 42(10), 1466–1489 (2016)

A Novel Interaction Convolutional Network Based on Dependency Trees for Aspect-Level Sentiment Analysis

Lei Mao1(B), Jianxia Chen1, Shi Dong2, Liang Xiao1, Haoying Si1, Shu Li1, and Xinyun Wu1

1 School of Computer Science, Hubei University of Technology, Wuhan, China
[email protected]
2 School of Computer Science and Technology, Zhoukou Normal University, Zhoukou, China

Abstract. Aspect-based sentiment analysis aims to identify the sentiment polarity of a given aspect word in a sentence. Due to the complexity of sentences in texts, models based on graph neural networks still have difficulty accurately capturing the relationship between aspect words and opinion words in sentences, which limits classification accuracy. To solve this problem, the paper proposes a novel Aspect-level Sentiment Analysis model based on an Interactive convolutional network with dependency trees, named ASAI-DT in short. In particular, the ASAI-DT model first extracts the aspect word representation from the sentence representation trained by the Bi-GRU model. Meanwhile, the self-attention scores of both the sentence and the aspect representation are calculated separately by the self-attention mechanism, in order to reduce the attention to irrelevant information. Afterward, the proposed model constructs subtrees of the dependency trees for the words, and the attention weight scores of the aspect representations are integrated into the subtrees. Therefore, the acquired comprehensive information about the aspect words is processed by the graph convolutional network to maximize the retention of valid information and minimize the interference of noise. Finally, the effective information can be preserved more completely in the integrated information through the interactive network. Through a large number of experiments on various datasets, the proposed ASAI-DT model shows both effectiveness and accuracy in aspect sentiment analysis, outperforming many aspect-based sentiment analysis models.

Keywords: Aspect-based Sentiment Analysis · Dependency Trees · Attention Mechanism · Graph Convolutional Network · Interactive Network

1 Introduction

Aspect-based sentiment analysis (ABSA) is a critical sentiment analysis task that aims to capture the sentiment polarity of a given aspect word in a sentence, where the sentiment polarity falls into three categories: positive, negative, and neutral. For example, there is a sentence, "The service is pretty good, but the food tastes bad.".
In this sentence, the aspect word "service" has a positive sentiment polarity; for the aspect word "food", however, the sentiment polarity is negative. In other words, different aspects in a sentence need to be analyzed to distinguish the various sentiment polarities of different aspect words.

One of the key points of ABSA is to establish a relationship between the aspect words and the opinion words. Recently, with the rapid development of graph convolutional networks (GCNs), ABSA approaches usually combine GCNs with the syntactic structure of sentences to solve the relation problems between the aspects and the opinions [1-3]. For example, the AS-GCN [1] model integrated syntactic information into the GCNs, and the CDT [2] model incorporated the syntactic information into the word embeddings of the GCN [3] to improve the learned representation of aspects. However, these models still have an issue: since the words adjacent to an aspect word are given the same weight score, the models are unable to distinguish between adjacent words, resulting in noisy data that affects the models' accuracy.

From the above analysis, this paper proposes a novel Aspect-level Sentiment Analysis Interaction model based on the Dependency Tree, named ASAI-DT in short, which can effectively eliminate the noise of irrelevant sentiment words. In particular, the ASAI-DT model first extracts the aspect word representation from the sentence representation trained by the Bi-GRU model. Meanwhile, the self-attention scores of both the sentence and the aspect representation are calculated separately by the self-attention mechanism [4], which reduces the attention to irrelevant information. Afterward, according to the grammatical distance from the aspect word, the proposed model constructs subtrees of the dependency trees (DTs), and the attention weight scores of the aspect representations are integrated into the subtrees. Therefore, the acquired comprehensive information about the aspect words is processed by the GCN to maximize the retention of valid information and minimize the interference of noise. Finally, the effective information can be preserved more completely in the integrated information through the interactive network [5], in order to improve the proposed model's performance.

Therefore, the paper describes the main contributions as follows:
• Combine the DTs with the attention weight scores of the aspect representation to eliminate the noise of irrelevant sentiment words.
• Utilize an interactive network to minimize the influence of the information lost via convolutions and of the scattered sentence information.
• The results of experiments demonstrate that the ASAI-DT model outperforms baselines on the public datasets.

2 Related Work

2.1 Dependency Trees

Dependency parsing can be utilized to expose the syntactic structure of a sentence via dependency analysis. Dependency syntax can parse a sentence into a dependency syntax tree, named DT in short, to indicate the syntactic relationships between semantically related words. Utilizing DTs in ABSA tasks is a popular trend in recent research.

For example, Dong [6] proposed an adaptive recurrent neural network (AdaRNN) that can transfer the word sentiment to the target words according to the syntactic information.
Li [7] obtained the lexical properties of words through the DTs, but ignored the semantic information of the sentences themselves. The CDT [2] directly integrated the DT information into the word embeddings to enrich the information of the word embeddings. Zhang [8] obtained more contextual and grammatical information about the aspect words by reconstructing the original DTs generated from the sentences. However, these models are affected by the noisy data from irrelevant words, which decreases their performance. Inspired by the CDT model [2], the syntactic information of the DTs has been utilized in the proposed model. Different from the CDT model, however, this paper splits the syntactic information according to the syntactic distance to improve the efficiency of the syntactic information.

2.2 Graph Convolutional Networks

Since GCNs are effective feature extractors that continuously learn feature information from the local topology of the previous layer to the next layer, GCN-based attention mechanisms have achieved excellent results in the area of ABSA. For example, a new graph attention network (GAT) [9] was proposed to not only enrich the semantic information based on the syntactic information but also capture the semantic association of words. Huo [10] proposed the weighted relational graph network model (ERGAT) to obtain different types of dependencies of sentences. However, these models cannot make full use of semantic information. Therefore, GCN models based on DTs have been developed to solve the above problem. For example, Li [11] proposed the DualGCN model to complement syntactic and semantic knowledge relevance. However, the DualGCN model cannot utilize the syntactic information fully. Jiang [12] proposed a dependency weighting model based on a GAT by weighting the obtained DT structure; however, the model ignores the contextual information. Pang [13] improved the dependency relations between the sentence representations by dividing different dependency sub-trees. Wang [14] utilized two GCNs to encode the dependency edges and dependency labels separately. However, the interaction process cannot eliminate the irrelevant word data well.

3 Methodology

As shown in Fig. 1, the ASAI-DT model is made up of five layers, including the input and encoding layer, the attention mechanism layer, the dependency tree layer, the interactive network layer, and the GCN layer.

Fig. 1. ASAI-DT model architecture.

3.1 Input and Encoding Layer

The input and encoding layer maps sentences to vector representations, which are input into the Bi-GRU model to preserve the order of each word vector. Take a sentence s presented as $s = \{w_1, w_2, \ldots, a_1, \ldots, a_m, \ldots, w_n\}$, where $\{w_1, w_2, \ldots, w_n\}$ are the context words and $\{a_1, \ldots, a_m\}$ are the aspect words. The input layer passes the sentence s to the encoding layer. The word embedding x of a sentence is defined as $x = \{x_1, x_2, \ldots, x_{a+1}, \ldots, x_{a+m}, \ldots, x_n\}$, where $x_t$ represents the word embedding representation. The paper obtains the hidden state vector $H = \{h_1, h_2, \ldots, h_{a+1}, \ldots, h_{a+m}, \ldots, h_n\}$ through the Bi-GRU, where $h_t \in \mathbb{R}^{2d}$ is the hidden state at time step t and d is the hidden state dimension of a one-way GRU. The processing of the word embedding representation x by the GRU unit is expressed by the following formulas (1-3):

$\overrightarrow{h_t} = \overrightarrow{GRU}(x_t)$  (1)

$\overleftarrow{h_t} = \overleftarrow{GRU}(x_t)$  (2)

$h_t = [\overrightarrow{h_t}; \overleftarrow{h_t}]$  (3)

Afterward, the hidden state vector c of the aspect word context is extracted. The c can be obtained by the following formula (4):

$c = \mathrm{unmask}(H)$  (4)

where the unmask function sets the aspect word vector representations in the hidden state to 0. The hidden state vector of the context is defined as $C = \{h_1, h_2, \ldots, 0_{a+1}, \ldots, 0_{a+m}, \ldots, h_n\}$, $h_i \in \mathbb{R}^{2d}$.

3.2 Attention Mechanism Layer

The attention mechanism layer produces both the self-attention score ($S_{att}$) of the sentence representation and the aspect-attention score ($A_{att}$) of the aspect word representation. Therefore, the attention mechanism layer adopts two kinds of attention scores: one is the aspect word attention score, the other is the self-attention score of the sentence, as shown in Fig. 2.

Fig. 2. Flowchart of the dot-product attention mechanism.

Figure 2 gives the flowchart of the dot-product attention mechanism, which has a key K, a query Q, and a value V, where $d_w$ denotes the dimension of the word representation in the sentence representation. The calculation process utilizes the following formula (5):

$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{QK^{T}}{\sqrt{d_w}}\right)V$  (5)

Aspect Word Attention Score. Firstly, this paper obtains the hidden state $H_a$ of the aspect words from the hidden state H of the sentence by the following formula (6):

$H_a = \mathrm{mask}(H)$  (6)

The mask function obtains the aspect word representation by setting the word representations that are not aspect words to 0, i.e., $H_a = \{0, 0, \ldots, h_{a+1}, \ldots, h_{a+m}, \ldots, 0\}$, where $h_{a+i} \in \mathbb{R}^{2d}$. Afterward, $H_a$ is assigned to Q, and H to K and V. Finally, through the Attention(Q, K, V) obtained from formula (5), the paper obtains the aspect-attention score of the aspect word representation. The calculation process is defined as the following formula (7):

$A_{att} = \mathrm{Attention}(H_a, H, H)$  (7)

Self-attention Score Calculation. Based on the calculation of Attention(Q, K, V), the proposed model can obtain the self-attention score of the sentence representation. The calculation process is defined as the following formula (8):

$S_{att} = \mathrm{Attention}(H, H, H)$  (8)
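To make the attention computation above concrete, the following is a minimal PyTorch sketch of formulas (5)-(8); it is an illustrative reconstruction rather than the authors' released code, and the tensor names (hidden, aspect_mask) and shapes are assumptions.

import torch
import torch.nn.functional as F

def dot_product_attention(query, key, value, d_w):
    # formula (5): softmax(Q K^T / sqrt(d_w)) V
    scores = query @ key.transpose(-2, -1) / d_w ** 0.5
    return F.softmax(scores, dim=-1) @ value

def attention_scores(hidden, aspect_mask):
    # hidden: (batch, seq_len, 2d) Bi-GRU states H
    # aspect_mask: (batch, seq_len), 1.0 at aspect-word positions, 0.0 elsewhere
    d_w = hidden.size(-1)
    h_aspect = hidden * aspect_mask.unsqueeze(-1)                 # mask(H), formula (6)
    a_att = dot_product_attention(h_aspect, hidden, hidden, d_w)  # A_att, formula (7)
    s_att = dot_product_attention(hidden, hidden, hidden, d_w)    # S_att, formula (8)
    return a_att, s_att

# toy usage
H = torch.randn(2, 10, 600)
mask = torch.zeros(2, 10)
mask[:, 3:5] = 1.0
A_att, S_att = attention_scores(H, mask)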

3.3 Dependency Trees Layer

The dependency tree layer aims to capture the sentence's syntactic knowledge according to the sentence's dependency tree structure. To capture the latent syntactic information, the proposed model needs a grammar parser to extract the dependencies of each word. A sentence can be formed into a DT as shown in Fig. 3.

Fig. 3. The formation of the DT.

To get the most direct word-dependent information with respect to the aspect words, the paper defines the distance between two connected nodes as a score of 1 unit, and all syntactic distances between the aspect words and the remaining nodes are obtained by traversing the DT. Therefore, Fig. 3 shows the grammatical distance between the aspect words "service" or "food" and the other words in the sentence. The distance between "service" and itself is zero, the distance between "the" and "is" is 1, and so on. By dividing the words in the sentence according to the dependency distance, the influence of irrelevant words on the aspect words is effectively reduced. In this way, the local DT of the aspect word "service" can be obtained, which generates a syntactic information graph M about the sentence based on the syntactic information of the sentence. Afterward, the aspect words are partitioned according to their grammatical distance from the other words in the sentence to form local DTs {M1, M2, M3}, where M1 and M2 are generated by partitioning according to syntactic distances of 1 and 2, respectively. M3 is generated by uniformly grouping all words whose grammatical distance is greater than or equal to 3, because if a distance is too long it will lose the dependency of the sentence itself.

3.4 Graph Convolution Network Layer

As shown in Fig. 4, the GCN layer develops a GCN-based attention mechanism model, named Att-GCN in short, to extract the effective information from the hidden state representation and the syntactic graph representation of sentences. The extracted knowledge is added to the attention score calculated by the attention layer to enhance the acquisition of the word representations associated with aspects. The integration process is defined as the following formulas (9-10):

$H^{d} = GCN(H_0, D)$  (9)

$A^{d} = H^{d} \otimes A$  (10)

where:
• $H_0$ is the hidden state sentence representation.
• D presents the acquired syntax information representation.
• A is the corresponding attention score.

During the calculation process, the hidden state representation H and the syntactic graph representation M are input into the Att-GCN model. Integrating with the self-attention score $S_{att}$ of the sentence s, an output $A_M \in \mathbb{R}^{2d}$ is obtained. Moreover, this paper transfers H and {M1, M2, M3} into the Att-GCN model, which is integrated with $A_{att}$ to obtain the outputs $\{A_{M1}, A_{M2}, A_{M3}\} \in \mathbb{R}^{2d}$.
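The following sketch illustrates one way the syntactic graphs and the Att-GCN integration of formulas (9)-(10) could be realized: dependency distances to the aspect word are computed by breadth-first search, the distance-based subgraphs {M1, M2, M3} are built as adjacency matrices, and a single degree-normalized GCN step is combined element-wise with an attention score. The grouping rule for M3 and the normalization details are assumptions, not the authors' exact implementation.

import collections
import torch

def distance_subgraphs(edges, aspect_idx, n):
    # edges: (head, dependent) pairs from a dependency parser; nodes are word indices
    adj = collections.defaultdict(list)
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    dist = {aspect_idx: 0}
    queue = collections.deque([aspect_idx])
    while queue:                                   # BFS over the undirected DT
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    def subgraph(keep):
        a = torch.eye(n)                           # self-loops
        for u, v in edges:
            if keep(dist.get(u, n)) and keep(dist.get(v, n)):
                a[u, v] = a[v, u] = 1.0
        return a
    M  = subgraph(lambda d: True)                  # full syntactic graph
    M1 = subgraph(lambda d: d <= 1)                # syntactic distance 1
    M2 = subgraph(lambda d: d <= 2)                # syntactic distance 2
    M3 = subgraph(lambda d: d == 0 or d >= 3)      # aspect word plus more distant words (assumed grouping)
    return M, M1, M2, M3

def att_gcn(H, D, W, att_score):
    # formula (9): one degree-normalized GCN step over graph D
    deg = D.sum(-1, keepdim=True).clamp(min=1.0)
    H_d = torch.relu((D @ H) / deg @ W)
    # formula (10): element-wise integration with the attention score
    return H_d * att_score

# toy usage: H is the (n, 2d) Bi-GRU output, att_score is A_att or S_att
M, M1, M2, M3 = distance_subgraphs([(1, 0), (1, 2), (2, 3)], aspect_idx=0, n=4)
A_M = att_gcn(torch.randn(4, 600), M, torch.randn(600, 600), torch.randn(4, 600))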


The interactive network integrates the obtained effective information. To fuse the extracted effective information well, this paper adopts a simple and effective interactive network structure [5], as shown in Fig. 5. The formula (11) of the network is defined as follows:

$X_{l+1} = X_0 X_l^{T} W_l + b_l + X_l$  (11)

where:
• $X_l \in \mathbb{R}^{d}$ is the input of the l-th layer of the interaction network.
• $X_{l+1} \in \mathbb{R}^{d}$ is the output of layer l + 1.
• $W_l \in \mathbb{R}^{d}$ is the weight of layer l.
• $b_l \in \mathbb{R}^{d}$ is the bias of layer l.

Fig. 4. Att-GCN structure diagram.

Fig. 5. Interactive network structure.
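Formula (11) has the same form as a cross layer of the deep & cross network in [5]. A minimal PyTorch sketch of such a layer is shown below; treating X_0 as the fixed first-layer input and the parameter shapes are assumptions drawn from the formula.

import torch
import torch.nn as nn

class InteractionLayer(nn.Module):
    # one step of X_{l+1} = X_0 (X_l^T W_l) + b_l + X_l, formula (11)
    def __init__(self, dim):
        super().__init__()
        self.w = nn.Parameter(torch.randn(dim) * 0.01)
        self.b = nn.Parameter(torch.zeros(dim))

    def forward(self, x0, xl):
        scale = (xl * self.w).sum(dim=-1, keepdim=True)  # scalar X_l^T W_l per sample
        return x0 * scale + self.b + xl                  # residual term keeps the original signal

# stacking two interaction layers
layer1, layer2 = InteractionLayer(600), InteractionLayer(600)
x0 = torch.randn(4, 600)
x2 = layer2(x0, layer1(x0, x0))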

In the GCN layer, this paper sets up a multi-layer Att-GCN for the convolution operation. An interactive operation is performed in the middle of the convolutional layers. The $A_M$ and $\{A_{M1}, A_{M2}, A_{M3}\} \in \mathbb{R}^{2d}$ obtained by the Att-GCN convolution are interacted with H. The interaction process is defined as the following formulas (12-13):

$X^{M}_{l+1} = H A_{M}^{T} W_l + b_l + A_{M}$  (12)

$X^{\{M1,M2,M3\}}_{l+1} = H A_{\{M1,M2,M3\}}^{T} W_l + b_l + A_{\{M1,M2,M3\}}$  (13)

where:
• $X^{M}_{l+1}$ is the input of the next layer of convolution.
• $\{X^{M1}_{l+1}, X^{M2}_{l+1}, X^{M3}_{l+1}\}$ are the inputs to the next convolution layer.

After the multi-layer convolution, the paper obtains the outputs $A^{M}_{l+1}$ and $\{A^{M1}_{l+1}, A^{M2}_{l+1}, A^{M3}_{l+1}\}$, which are spliced together to be $A_{asp}$. The stitching process is defined as the following formulas (14-15):

$A_{self} = A^{M}_{l+1}$  (14)

$A_{asp} = A^{M1}_{l+1} \oplus A^{M2}_{l+1} \oplus A^{M3}_{l+1}$  (15)

After splicing, the paper obtains an output $A_{asp} \in \mathbb{R}^{6d}$ about the aspect word representation. $A_{asp}$ not only reduces the influence of noisy data, but also obtains the word representation of the sentence most related to the aspect representation.


3.5 Interactive Network Layer

Afterward, $A_{self} \in \mathbb{R}^{2d}$ and $A_{asp} \in \mathbb{R}^{6d}$ are averaged by an average pooling layer. Meanwhile, the hidden state vector $C \in \mathbb{R}^{2d}$ is subjected to the same average pooling process. The processing procedure is defined as the following formula (16):

$y = \mathrm{AveragePooling}(x)$  (16)

where:
• x is the vector representation of the input.
• y is the vector output after the average pooling operation.

In this paper, $A_{self}$, $A_{asp}$ and C are input to the average pooling layer; afterward, $A^{o}_{self}$, $A^{o}_{asp}$ and $C^{o}$ are output to the interactive processing. To reduce the loss of effective information, the paper sends the average pooled data $C^{o}$ resulting from the context hidden state vector C, the average pooled representation $A^{o}_{self}$ of the sentence, and the average pooled representation $A^{o}_{asp}$ of the aspect words to the interaction network together. The process of interaction is defined as the following formulas (17-18):

$X_C = C^{o} (A^{o}_{self})^{T} + A^{o}_{self}$  (17)

$X_{asp} = A^{o}_{asp} (A^{o}_{self})^{T} + A^{o}_{self}$  (18)

where:
• $X_C \in \mathbb{R}^{a}$ is the output of the sentence representation and the context representation.
• $X_{asp} \in \mathbb{R}^{a}$ is the output of the sentence representation and the aspect word representation.

3.6 Output Layer

In this layer, the two obtained outputs $X_C$ and $X_{asp}$ are first spliced and sent to the classifier. The output processes are shown in formulas (19-20):

$o = [X_C, X_{asp}]$  (19)

$o' = \mathrm{softmax}(\mathrm{Linear}(o))$  (20)

where:
• $o \in \mathbb{R}^{2a}$ is the output after splicing.
• $o' \in \mathbb{R}^{out}$ is the final output.
• out is the final output classification number.

The proposed model utilizes the cross-entropy loss function with L2 regularization as the loss function in formula (21):

$H(p, q) = -\sum_{x} p(x)\log q(x) + \lambda \|\theta\|^{2}$  (21)

where:
• p(x) is the true sample distribution.
• q(x) is the sample prediction distribution.
• λ is the L2 regularization weight and θ denotes the model parameters.
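Formulas (19)-(21) amount to a standard concatenate-and-classify head trained with cross-entropy and L2 regularization; a small hedged sketch is given below, where applying the L2 term through the optimizer's weight decay is an assumption.

import torch
import torch.nn as nn

class OutputLayer(nn.Module):
    def __init__(self, in_dim, num_classes):
        super().__init__()
        self.linear = nn.Linear(2 * in_dim, num_classes)

    def forward(self, x_c, x_asp):
        o = torch.cat([x_c, x_asp], dim=-1)   # formula (19): o = [X_C, X_asp]
        return self.linear(o)                 # logits; the softmax of formula (20) is folded into the loss

head = OutputLayer(in_dim=300, num_classes=3)
logits = head(torch.randn(8, 300), torch.randn(8, 300))
labels = torch.randint(0, 3, (8,))
loss = nn.CrossEntropyLoss()(logits, labels)                                 # formula (21), cross-entropy term
optimizer = torch.optim.Adam(head.parameters(), lr=1e-3, weight_decay=1e-4)  # lambda * ||theta||^2 via weight decay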


4 Experiments

4.1 Datasets

To evaluate the experiments, this paper utilizes three public datasets, including the Twitter dataset [6] and the Lap14 and Rest14 datasets of the SemEval 2014 Task [15]. The details of the datasets are shown in Table 1.

4.2 Parameters Settings

In this paper, the word embeddings of the datasets are initialized as 300-dimensional word vectors by GloVe [16]. The paper utilizes the Adam optimizer [17]. The parameter settings are shown in Table 2. In addition, the paper utilizes the accuracy and Macro-F1 to evaluate the models' performance.

Table 1. Datasets statistics

Datasets   Positive (Train / Test)   Neutral (Train / Test)   Negative (Train / Test)
Rest14     2164 / 728                637 / 196                807 / 196
Lap14      994 / 341                 464 / 169                870 / 128
Twitter    1561 / 173                3127 / 346               1560 / 173

Table 2. Hyper-parameter settings

Hyper-parameter   Description                Value
dropout rate      Word embedding layer       0.5
batch_size        Mini-batch size            32
r                 Initial learning rate      0.001
de                Embedding layer size       300
dh                Hidden layer size          300
l2                L2-regularization weight   0.0001

4.3 Model Comparison

The ASAI-DT model has been compared with the following baseline models.
• AS-GCN [1] utilizes GCNs to learn the word dependencies.
• CDT [2] utilizes a DT to get representations of sentence features.
• SK-GCN [18] develops two strategies using GCNs to exploit syntactic dependency trees and commonsense knowledge.
• BiGCN [19] hierarchically models syntactic and lexical graphs for ABSA.
• kumaGCN [20] utilizes the dependency graph to get syntactic features (Table 3).

Table 3. Comparison result of models (all values in %)

Models     Twitter (Accuracy / Macro-F1)   Lap14 (Accuracy / Macro-F1)   Rest14 (Accuracy / Macro-F1)
SK-GCN     71.97 / 70.22                   73.20 / 69.18                 80.36 / 70.43
AS-GCN     72.15 / 70.40                   75.55 / 71.05                 80.96 / 72.21
CDT        74.66 / 73.66                   77.19 / 72.99                 82.30 / 74.02
BiGCN      74.16 / 73.35                   74.59 / 71.84                 81.97 / 73.48
kumaGCN    72.45 / 70.77                   76.12 / 72.42                 81.41 / 73.64
ASAI-DT    76.16 / 74.74                   77.43 / 73.46                 83.04 / 76.33

4.4 Results Analysis of Comparison Experiments

It can be observed that the Accuracy (%) and Macro-F1 (%) scores of the ASAI-DT model on the three public datasets are higher than those of the baseline models. This shows that the model in this paper is effective in dealing with aspect-level sentiment analysis.

4.5 Number of Att-GCN Layers

In this paper, the influence of the number of Att-GCN layers is analyzed and tested on the three datasets Lap14, Rest14 and Twitter, as shown in Fig. 6.

Fig. 6. Att-GCN layer number test on three datasets

As shown in Fig. 6, the number of Att-GCN layers in the experimental settings ranges from 3 through 10. As the number of layers increases, the accuracy of the ASAI-DT model decreases, and the model accuracy is highest when the number of Att-GCN layers is set to 3. These results demonstrate that increasing the number of Att-GCN layers leads to over-fitting of the proposed ASAI-DT model.

Table 4. Ablation experiment of ASAI-DT model (all values in %)

Models                    Twitter (Accuracy / Macro-F1)   Lap14 (Accuracy / Macro-F1)   Rest14 (Accuracy / Macro-F1)
w/o aspect-attention      73.84 / 72.39                   75.86 / 71.24                 80.36 / 71.36
w/o dependency subtree    73.41 / 71.37                   75.54 / 72.20                 79.73 / 69.19
w/o interactive network   74.42 / 72.99                   76.18 / 71.90                 81.07 / 71.99
ASAI-DT                   76.16 / 74.74                   77.43 / 73.46                 83.04 / 76.33

4.6 Ablation Experiment of ASAI-DT Model

The ablation experimental results of the ASAI-DT are shown in Table 4. The paper first removes the aspect-level attention sub-model (w/o aspect-attention). After removing the aspect-attention between the Att-GCN layers, the accuracy drops by 2.32%, 1.57% and 2.68% on the Twitter, Lap14 and Rest14 datasets, respectively. When the paper removes the corresponding dependency subtree structure (w/o dependency subtree), the accuracy decreases by 2.75%, 1.89% and 3.31%, respectively. Because the aspect-attention is integrated with the Att-GCN through the dependency subtrees, the corresponding aspect-attention is removed as well when the dependency subtree structure is removed. From the above analysis, the aspect-attention is critical to the Att-GCN layer, since it can strengthen the connection of words related to the aspect words. Meanwhile, according to the structure of the subtrees, the acquisition of effective information is concentrated on the local information of the aspect words. In addition, the attention mechanism and the use of dependency subtrees in the process of acquiring syntactic information also reduce the influence of data unrelated to the aspect words. For the ablation experiments of the corresponding interactive network (w/o interactive network), after removing the interaction network between the Att-GCN layers, the accuracy drops by 1.74%, 1.25% and 1.97% on the three datasets, respectively. This demonstrates that the interactive network benefits the integration of sentence information.

5 Conclusion

The proposed ASAI-DT model utilizes a GCN-based aspect word attention mechanism, sub-dependency trees, and an interactive network to solve ABSA tasks. In particular, the dependencies captured by the attention mechanism and the syntactic information provided by the DT can accurately capture the relationship between aspects and their contexts. Therefore, both the semantic information conveyed by the sentence and the obtained syntactic information are effectively extracted and conveyed to classify the sentiment of words. The results of experiments demonstrate that the ASAI-DT outperforms popular baselines on public datasets.

Acknowledgment. This work is supported by National Natural Science Foundation of China (61902116).


References 1. Zhang, C., et al.: Aspect-based sentiment classification with aspect-specific graph convolutional networks (2019) 2. Sun, K., et al.: Aspect-level sentiment analysis via convolution over dependency tree. In: Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (2019) 3. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks (2016) 4. Vaswani, A., et al.: Attention is all you need. arXiv (2017) 5. Wang, R., et al.: Deep & cross network for ad click predictions. In: ADKDD’17. ACM (2017) 6. Dong, L., et al.: Adaptive recursive neural network for target-dependent twitter sentiment classification. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (volume 2: Short Papers) (2014) 7. Li, Y., Sun, X., Wang, M.: Embedding extra knowledge and A dependency tree based on A graph attention network for aspect-based sentiment analysis. In: 2021 International Joint Conference on Neural Networks (IJCNN). IEEE (2021) 8. Zhang, J., Sun, X., Li, Y.: Mining syntactic relationships via recursion and wandering on A dependency tree for aspect-based sentiment analysis. In: 2022 International Joint Conference on Neural Networks (IJCNN). IEEE (2022) 9. Yu, W., et al.: Aspect-level sentiment analysis based on graph attention fusion networks. In: 2022 IEEE 24th Int Conf on High Performance Computing and Communications; 8th International Conference on Data Science and Systems; 20th International Conference on Smart City; 8th International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application (HPCC/DSS/SmartCity/DependSys). IEEE (2022) 10. Huo, Y., Jiang, D., Sahli, H.: Aspect-based sentiment analysis with weighted relational graph attention network. In: Companion Publication of the 2021 International Conference on Multimodal Interaction (2021) 11. Li, R., et al.: Dual graph convolutional networks for aspect-based sentiment analysis. In: The 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (2021) 12. Jiang, T., et al.: Aspect-based sentiment analysis with dependency relation weighted graph attention. Information 14(3), 185 (2023) 13. Pang, S., Yan, Z., Huang, W., Tang, B., Dai, A., Xue, Y.: Highway-based local graph convolution network for aspect based sentiment analysis. In: Wang, L., Feng, Y., Hong, Y., He, R. (eds.) NLPCC 2021. LNCS (LNAI), vol. 13028, pp. 544–556. Springer, Cham (2021). https:// doi.org/10.1007/978-3-030-88480-2_43 14. Wang, Y., et al.: Aspect-based sentiment analysis with dependency relation graph convolutional network. In: 2022 International Conference on Asian Language Processing (IALP). IEEE (2022) 15. Pontiki, M., et al.: Semeval-2014 task 4: aspect based sentiment analysis. In: SemEval, vol. 2014, p. 27 (2014) 16. Pennington, J., et al.: Glove: global vectors for word representation. In: Conference on Empirical Methods in Natural Language Processing (2014) 17. Kingma, D., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014) 18. Zhou, J., et al.: SK-GCN: modeling syntax and knowledge via graph convolutional network for aspect-level sentiment classification. Knowl.-Based Syst. 205, 106292 (2020)


19. Zhang, M., Qian, T.: Convolution over hierarchical syntactic and lexical graphs for aspect level sentiment analysis. In: Empirical Methods in Natural Language Processing Association for Computational Linguistics (2020) 20. Chen, C., et al.: Inducing target-specific latent structures for aspect sentiment classification. In: Empirical Methods in Natural Language Processing Association for Computational Linguistics (2020)

Efficient Collaboration via Interaction Information in Multi-agent System

Meilong Shi1, Quan Liu1,2(B), and Zhigang Huang1

1 School of Computer and Technology, Soochow University, Suzhou 215006, Jiangsu, China
[email protected]
2 Provincial Key Laboratory for Computer Information Processing Technology, Soochow University, Suzhou 215006, Jiangsu, China

Abstract. Cooperative multi-agent reinforcement learning (CMARL) has shown promise in solving real-world scenarios. The interaction information between agents contains rich global information, which is easily neglected after perceiving other agents' behavior. To tackle this problem, we propose Collaboration Interaction Information Modelling via Hypergraph (CIIMH), which first perceives the behavior of other agents by mutual information optimization and then constructs the dynamic interaction information via a hypergraph. The perceived behavioral features of other agents are further aggregated in the hypergraph convolutional network to obtain interaction information. We compare our method with three existing baselines on StarCraft II micromanagement tasks (SMAC), Level-based Foraging (LBF), and Hallway. Empirical results show that our method outperforms the baseline methods on all maps.

Keywords: Multi-Agent Reinforcement Learning · Collaboration Modelling · Interaction Information · Hypergraph

Supported by National Natural Science Foundation of China (61772355, 61702055, 61876217, 62176175). Project funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).

1 Introduction

Real-world reinforcement learning tasks often involve multiple agents and are formulated as Cooperative Multi-Agent Reinforcement Learning (CMARL), such as autonomous driving [3,7], traffic light control [9,22] and online visual games [17,24]. CMARL aims to learn efficient policies that control multiple agents and maximize the cumulative reward from the given task. The mainstream CMARL architecture is centralized training with decentralized execution (CTDE), in which each agent can only observe its own local information during the decentralized execution phase. Therefore, perceiving other agents' behavior is critical to modeling global collaboration in the centralized
training phase. However, the interaction information between agents contains rich global information, which is easily overlooked after perceiving other agents’ behavior. To capture interaction information, we propose Collaboration Interaction Information Modelling via Hypergraph (CIIMH). Each CIIMH agent first learns to perceive other agents’ behavior and further captures rich interaction information by graph neural network (GNN). However, there is no prior knowledge of agents’ interaction to construct ordinary graphs. Moreover, the interaction between agents changes dynamically over time, which is difficult for ordinary graphs to learn. To this end, we consider HyperGraph Convolution Network (HGCN) [6], whose GNN is based on hypergraphs. Different from the ordinary full-connected edges in Fig. 1(b), one hyperedge in Fig. 1(c) can connect multiple vertices, which naturally represents the complex and dynamic relationship between agents in Fig. 1(a).

Fig. 1. The illustration of original agent observation in Hallway, ordinary graph, and hypergraph. The interaction information in (a) requires six ordinary edges, but only one hyperedge.

In summary, the contributions of this paper are as follows: (1) We propose a novel CMARL method, which learns the interaction information in the multi-agent system and then leverages the learned information for the cooperation modeling of agents. (2) We design various model variants and provide ablation studies, demonstrating that the interaction information learned by CIIMH can improve the performance of the model. (3) We conduct elaborate experiments, and the results illustrate that our proposed method achieves an average improvement of 22.49%, which proves that CIIMH is superior to the baselines by utilizing the interaction information.

2 Background

2.1 Dec-POMDP

In this paper, we consider formulating a fully cooperative MARL task as a Dec-POMDP [12] $\langle I, S, A, P, R, O, \Omega, n, \gamma\rangle$. Here, $I \equiv \{1, 2, \ldots, n\}$ is the agent set with n agents and $s \in S$ is the state of the environment. At each timestep, each agent i chooses an action from its action space $A^i$, generating a joint action $a \in \prod_{i=1}^{n} A^i$, receiving a shared reward $r = R(s, a)$, and transiting to the next state $s'$ with transition probability $P(s'|s, a)$. Additionally, each agent i observes the state s by its own local partial observation $o^i \in \Omega$ according to the individual observation function $O(s, i)$. The action-observation history of each agent i forms its trajectory $\tau_i \in T \equiv (\Omega \times A)$. The policy of agent i is denoted as $\pi^i(a^i|o^i): T \times A^i \rightarrow [0, 1]$. Dec-POMDP maximizes the expected cumulative discounted reward $\mathbb{E}_{\tau\sim\pi}\left[\sum_{t=0}^{\infty}\gamma^{t} r_t \mid s_0\right]$, where π is the joint policy, τ is the joint action-observation trajectory, and $\gamma \in [0, 1)$ is the discount factor.

2.2 Centralized Training with Decentralized Execution

Centralized Training with Decentralized Execution (CTDE) is commonly used in CMARL tasks. All agents can access the joint action-observation trajectory during the centralized training phase. During the decentralized execution phase, each agent is only able to make decisions based on its local observations. One mainstream approach to implementing CTDE is value decomposition, which decomposes the total Q value Q_tot into the individual Q values Q_i of each agent i. This idea was first implemented in VDN [16], which assumes the joint value comes from the sum of the individual values. On this basis, QMIX [14] leverages a monotone network to realize a nonlinear mapping from the individual value functions to the joint value function. QPLEX [19] proposed to decompose the Q value into a non-positive advantage function, and MAVEN is an exploration-oriented value decomposition method [11].

2.3 Hypergraph Convolution Network

A hypergraph [4] can be defined as $G = (V, E)$, where $V = \{v_1, \ldots, v_N\}$ denotes the set of vertices, $E = \{\varepsilon_1, \ldots, \varepsilon_M\}$ denotes the set of hyperedges, and N and M are the numbers of vertices and hyperedges, respectively. For a vertex $v_i \in V$ and a hyperedge $\varepsilon_j \in E$, the binary incidence matrix value $h_{i,j}$ denotes whether $v_i$ is connected by $\varepsilon_j$. A diagonal matrix $W \in \mathbb{R}^{M \times M}$ consists of the non-negative weight w of each hyperedge. Besides, the degrees of vertices and hyperedges are defined as $d(v_i) = \sum_{\varepsilon_j \in E} w_j h_{i,j}$ and $d(\varepsilon_j) = \sum_{v_i \in V} h_{i,j}$, respectively, which in turn form the vertex diagonal degree matrix $D_v$ and the hyperedge diagonal degree matrix $D_e$, respectively. To reduce the computational complexity, we follow [6], which derives the spectral convolution as

$x^{(l+1)} = D_v^{-1/2} H W D_e^{-1} H^{\top} D_v^{-1/2} x^{(l)} P^{(l)},$  (1)

where W and P denote the learnable weight matrices. There have been studies using hypergraphs for CMARL: HGCN-MIX [2] uses a hypergraph to implement value decomposition, and HGAC [23] leverages hypergraphs in an actor-critic method. As a value-based method, our method differs from them in that we incorporate hypergraphs into the learning of interaction information after perceiving agents.
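A minimal, illustrative PyTorch sketch of the spectral hypergraph convolution in Eq. (1) is given below; it is not the reference HGNN implementation, and dense matrices, a ReLU nonlinearity, and learnable hyperedge weights are assumptions.

import torch
import torch.nn as nn

class HGCNLayer(nn.Module):
    # x^{(l+1)} = D_v^{-1/2} H W D_e^{-1} H^T D_v^{-1/2} x^{(l)} P^{(l)}   (Eq. 1)
    def __init__(self, in_dim, out_dim, num_edges):
        super().__init__()
        self.P = nn.Linear(in_dim, out_dim, bias=False)   # learnable projection P
        self.w = nn.Parameter(torch.ones(num_edges))      # diagonal hyperedge weights W

    def forward(self, x, H):
        # x: (N, in_dim) vertex features; H: (N, M) incidence matrix
        W = torch.diag(self.w)
        d_v = (H * self.w).sum(dim=1).clamp(min=1e-6)     # d(v_i) = sum_j w_j h_{i,j}
        d_e = H.sum(dim=0).clamp(min=1e-6)                # d(e_j) = sum_i h_{i,j}
        Dv = torch.diag(d_v.pow(-0.5))
        De = torch.diag(d_e.pow(-1.0))
        agg = Dv @ H @ W @ De @ H.t() @ Dv @ x
        return torch.relu(self.P(agg))

# a two-layer stack, as used later in Eq. (10)
layer1, layer2 = HGCNLayer(64, 64, num_edges=8), HGCNLayer(64, 64, num_edges=8)
x, H = torch.randn(5, 64), torch.rand(5, 8)
z_glo = layer2(layer1(x, H), H)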

3 CIIMH

This section introduces the CIIMH training process, and its overall process is shown in Fig. 2. For decentralized execution, we first use the GRU unit [5] to preprocess the original observation $o_i^t$ and obtain the current trajectory $\tau_i^t$. The agent network (Fig. 2(b)) of each CIIMH agent contains an agent perceiving network (Fig. 2(a)) and an interaction learning network (Fig. 2(c)). The CIIMH agent perceives the behavior of other agents and samples n perception variations. The learned perception variations are combined with interaction information to generate a global representation, which is ultimately used to obtain a Q value via an MLP.

Fig. 2. The overall architecture of CIIMH. (a) Agent perceiving (b) Agent network (c) Interaction learning

3.1 Agent Perceiving

We hope that the agent perceiving module (Fig. 2(a)) can perceive other agents' behavior from its local trajectory $\tau_i^t$. To this end, n multivariate Gaussian distributions are used to sample n corresponding perception variations $\{z^t_{i\to 1}, z^t_{i\to 2}, \ldots, z^t_{i\to n}\}$, where each perception variation $z^t_{i\to j}$ represents the perception from i to j, and the means and variances of the Gaussian distributions are encoded from the current trajectory $\tau_i^t$ in parallel:

$[\mu^{t,i\to j}, \sigma^{t,i\to j}] = MLP(\tau^t_i), \quad z^t_{i\to j} \sim \mathcal{N}(\mu^{t,i\to j}, \sigma^{t,i\to j}),$  (2)

where t is the current moment, and for agent $j \in \{1, \ldots, n\}$, $\mu^{t,i\to j}$ and $\sigma^{t,i\to j}$ are the mean and variance of the Gaussian distribution encoded from $\tau_i^t$, respectively.

We desire the learned perception variation $z^t_{i\to j}$ to perceive the action selection of each specific agent j. Hence, the goal of agent perceiving is to maximize the mutual information between the action $a^t_j$ taken by the corresponding agent j and the perception variation $z^t_{i\to j}$ of the Gaussian distribution conditioned on agent i's trajectory. Unfortunately, it is difficult to optimize mutual information objectives directly. To optimize the mutual information objective, Theorem 1 is derived from the information bottleneck [1].

Theorem 1. For any agents $i, j \in \{1, \ldots, n\}$, the mutual information $I(z^t_{i\to j}, a^t_j \mid \tau^t_i)$ has an evidence lower bound

$I(z^t_{i\to j}, a^t_j \mid \tau^t_i) \ge \mathbb{E}_{\mathcal{D}}\left[-D_{KL}\left(p(z^t_{i\to j} \mid \tau^t_i) \,\|\, q_\xi(z^t_{i\to j} \mid \tau^t_i, a^t_j)\right)\right],$  (3)

where $z^t_{i\to j}$ denotes i's perception variation to j, $a^t_j$ is the action j takes, $q_\xi$ is the variational approximator, and all the distributions p and $q_\xi$ are sampled from the replay buffer $\mathcal{D}$.

Proof. According to the definition of mutual information, we have

$I(z^t_{i\to j}, a^t_j \mid \tau^t_i) = \mathbb{E}_{z^t_{i\to j}, a^t_j, \tau^t_i}\left[\log \frac{p(z^t_{i\to j} \mid a^t_j, \tau^t_i)}{p(z^t_{i\to j} \mid \tau^t_i)}\right]$
$= \mathbb{E}_{z^t_{i\to j}, a^t_j, \tau^t_i}\left[\log \frac{q_\xi(z^t_{i\to j} \mid a^t_j, \tau^t_i)}{p(z^t_{i\to j} \mid \tau^t_i)}\right] + \mathbb{E}_{a^t_j, \tau^t_i}\left[D_{KL}\left(p(z^t_{i\to j} \mid a^t_j, \tau^t_i) \,\|\, q_\xi(z^t_{i\to j} \mid a^t_j, \tau^t_i)\right)\right]$
$\ge \mathbb{E}_{z^t_{i\to j}, a^t_j, \tau^t_i}\left[\log \frac{q_\xi(z^t_{i\to j} \mid a^t_j, \tau^t_i)}{p(z^t_{i\to j} \mid \tau^t_i)}\right]$
$= \mathbb{E}_{\mathcal{D}}\left[-D_{KL}\left(p(z^t_{i\to j} \mid \tau^t_i) \,\|\, q_\xi(z^t_{i\to j} \mid \tau^t_i, a^t_j)\right)\right],$  (4)

proof completed.

According to Theorem 1, the evidence lower bound of the mutual information is obtained, so we derive the loss function of the agent perceiving module as

$\mathcal{L}_{perc} = \sum_{i \ne j} \mathbb{E}_{\mathcal{D}}\left[D_{KL}\left(p(z^t_{i\to j} \mid \tau^t_i) \,\|\, q_\xi(z^t_{i\to j} \mid \tau^t_i, a^t_j)\right)\right].$  (5)
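An illustrative sketch of the agent perceiving module for a single ordered pair (i, j) is shown below, covering the Gaussian encoding of formula (2) and the KL-based loss of formula (5); the linear encoders, the one-hot action conditioning of the variational approximator q_ξ, and the reparameterized sampling are our assumptions about one straightforward realization.

import torch
import torch.nn as nn
import torch.distributions as D

class AgentPerceiving(nn.Module):
    # perceives one other agent j from the local trajectory tau_i (Eqs. 2 and 5)
    def __init__(self, traj_dim, z_dim, n_actions):
        super().__init__()
        self.prior = nn.Linear(traj_dim, 2 * z_dim)             # p(z_{i->j} | tau_i)
        self.q_xi = nn.Linear(traj_dim + n_actions, 2 * z_dim)  # q_xi(z_{i->j} | tau_i, a_j)

    def sample(self, tau):
        mu, log_std = self.prior(tau).chunk(2, dim=-1)
        p = D.Normal(mu, log_std.exp())
        return p.rsample(), p                                   # perception variation z_{i->j}^t

    def perceiving_loss(self, tau, p, a_j_onehot):
        mu_q, log_std_q = self.q_xi(torch.cat([tau, a_j_onehot], dim=-1)).chunk(2, dim=-1)
        q = D.Normal(mu_q, log_std_q.exp())
        return D.kl_divergence(p, q).sum(dim=-1).mean()         # one (i, j) term of Eq. (5)

# usage: sum the loss over all ordered pairs i != j and weight it by lambda_perc later
module = AgentPerceiving(traj_dim=64, z_dim=16, n_actions=5)
tau = torch.randn(32, 64)
z, p = module.sample(tau)
a_onehot = torch.eye(5)[torch.randint(0, 5, (32,))]
loss_ij = module.perceiving_loss(tau, p, a_onehot)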

3.2 Interaction Learning

Although the learned perception variations can perceive the behavior of other agents, we hope they can further capture rich interaction information, rather than being directly input into the Q network. To this end, we construct a hypergraph, where the nodes and hyperedges represent the agents and the interaction relationships between agents, respectively. The perception variations are then fed to the HGCN for learning the interaction information on the constructed hypergraph.

Constructing Hypergraph. Since interaction information in multi-agent tasks varies over time, we dynamically construct an incidence matrix for the hypergraph. To represent different levels of interaction, we use a real-valued incidence matrix to describe the relationship between agents. First, we construct the initial vertex features $\rho^i_v$ in the hypergraph using the local trajectory $\tau_i$ of each agent, which is defined as

$\rho^i_v = \mathrm{Softmax}(MLP(\tau_i)).$  (6)

In order to efficiently learn interaction information between agents, we use cosine similarity to represent the interaction between the features of two agents:

$h^{sim}_{i,j} = \begin{cases} \mathrm{Softmax}(\mathrm{cosine}(\rho^i_v, \rho^j_v)) & \text{if } i \ne j \\ 1 & \text{if } i = j \end{cases}.$  (7)

Besides, we define a diagonal matrix to prevent the incidence matrix from changing too quickly. Its value is the average of $h^{sim}_{i,j}$ of the current agent i:

$h^{avg}_i = \frac{\sum_{j=1}^{m} h^{sim}_{i,j}}{m},$  (8)

where $h^{avg}_i$ is updated with an interval of time T, and we keep the incidence matrix from changing too quickly by keeping $h^{avg}_i$ constant over the period. Therefore, the incidence matrix H is denoted as

$H = \Bigg[\; \underbrace{\begin{matrix} h^{sim}_{1,1} & \cdots & h^{sim}_{1,m} \\ h^{sim}_{2,1} & \cdots & h^{sim}_{2,m} \\ \vdots & \ddots & \vdots \\ h^{sim}_{n,1} & \cdots & h^{sim}_{n,m} \end{matrix}}_{\text{similarity}} \;\; \underbrace{\begin{matrix} h^{avg}_{1} & \cdots & 0 \\ 0 & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & h^{avg}_{n} \end{matrix}}_{\text{average}} \;\Bigg].$  (9)
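One possible way to assemble the real-valued incidence matrix of formulas (6)-(9) is sketched below; the softmax axis, the inclusion of the diagonal in the row average, and the periodic refresh of the average block are simplifying assumptions.

import torch
import torch.nn.functional as F

def build_incidence(rho_v, h_avg=None, step=0, interval=20):
    # rho_v: (n, d) vertex features from Eq. (6)
    n = rho_v.size(0)
    sim = F.cosine_similarity(rho_v.unsqueeze(1), rho_v.unsqueeze(0), dim=-1)  # (n, n) pairwise cosine
    h_sim = F.softmax(sim, dim=-1)
    eye = torch.eye(n)
    h_sim = h_sim * (1.0 - eye) + eye                  # h_sim[i, i] = 1, Eq. (7)
    if h_avg is None or step % interval == 0:          # refresh the slowly-changing block, Eq. (8)
        h_avg = h_sim.mean(dim=1)
    H = torch.cat([h_sim, torch.diag(h_avg)], dim=1)   # Eq. (9): [similarity | average]
    return H, h_avg

rho = F.softmax(torch.randn(5, 32), dim=-1)
H, h_avg = build_incidence(rho)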

HGCN. After the construction of the hypergraph, the HGCN can be used to extract interaction information from the hypergraph. Recalling the HGCN formula in Eq. (1), each perception variation $z^t_{i\to j}$ is fed to a two-layer HGCN with hypergraph H:

$z^{glo,t}_{i\to 1}, z^{glo,t}_{i\to 2}, \ldots, z^{glo,t}_{i\to n} = HGCN(H, HGCN(H, z^t_{i\to 1}, z^t_{i\to 2}, \ldots, z^t_{i\to n})).$  (10)

After that, the interaction information is aggregated by hyperedge. In other words, the global perception variations $z^{glo,t}_{i\to j}$ contain both the agents' local and interaction information. To better integrate local features, we utilize the attention mechanism [18]:

$q_i = W_q \rho^i_v, \quad k_{i,j} = W_k z^{glo,t}_{i\to j}, \quad v_{i,j} = W_v z^{glo,t}_{i\to j},$  (11)

where the query $q_i$ is computed from the local vertex features $\rho^i_v$, the key and value are computed from the global perception variation $z^{glo,t}_{i\to j}$, and $W_q$, $W_k$, $W_v$ are the attention parameters. Finally, we combine all the global perception variations of i to get the ultimate collaboration variation $\rho^i_{coll}$:

$\rho^i_{coll} = \sum_{j \ne i} f_{att}(q_i, k_{i,j}, v_{i,j}),$  (12)

where $f_{att}$ is the attention function in [18]. And we get the agent i's Q value $Q(\tau_i, \rho^i_{coll}, \cdot)$.
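A possible realization of the attention-based aggregation in formulas (11)-(12) is sketched below for a single agent i; using scaled dot-product attention as f_att and the tensor shapes are assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class CollaborationAggregator(nn.Module):
    def __init__(self, feat_dim, z_dim, out_dim):
        super().__init__()
        self.Wq = nn.Linear(feat_dim, out_dim, bias=False)
        self.Wk = nn.Linear(z_dim, out_dim, bias=False)
        self.Wv = nn.Linear(z_dim, out_dim, bias=False)

    def forward(self, rho_v_i, z_glo_i):
        # rho_v_i: (feat_dim,) local vertex features of agent i
        # z_glo_i: (n - 1, z_dim) global perception variations towards the other agents
        q = self.Wq(rho_v_i)                                  # query, Eq. (11)
        k, v = self.Wk(z_glo_i), self.Wv(z_glo_i)             # keys and values, Eq. (11)
        att = F.softmax(k @ q / q.size(-1) ** 0.5, dim=0)     # weights over j != i
        return (att.unsqueeze(-1) * v).sum(dim=0)             # rho_coll^i, Eq. (12)

agg = CollaborationAggregator(feat_dim=32, z_dim=16, out_dim=64)
rho_coll = agg(torch.randn(32), torch.randn(4, 16))
# rho_coll^i is fed, together with the trajectory encoding, into the MLP producing Q(tau_i, rho_coll^i, .)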

3.3 Overall Optimization Objective

As a value decomposition method based on the CTDE paradigm, the proposed model can be trained by minimizing the following TD loss:

$\mathcal{L}_{td} = \frac{1}{2}\left(y_{td} - Q_{tot}(\boldsymbol{\tau}, \boldsymbol{\rho}_{coll}, \boldsymbol{a})\right)^{2},$  (13)

where the TD target is $y_{td} = r + \gamma \max_{\boldsymbol{a}'} Q_{tot}(\boldsymbol{\tau}', \boldsymbol{\rho}'_{coll}, \boldsymbol{a}')$. We leverage QMIX [14] as the value decomposition method to get the total Q value $Q_{tot}$ from the individual Q values $Q_i$, $i \in \{1, \ldots, n\}$, and QMIX can also be replaced by other mixing networks. Combining the $\mathcal{L}_{perc}$ and $\mathcal{L}_{td}$ losses, the overall target is defined as

$\mathcal{L}_{overall} = \mathcal{L}_{td} + \lambda_{perc}\mathcal{L}_{perc},$  (14)

where $\lambda_{perc}$ is the adjustable factor for $\mathcal{L}_{perc}$.
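Putting formulas (13)-(14) together, one update step could look like the sketch below; mixer stands for any value-decomposition network such as QMIX, and all variable names are assumptions.

import torch

def overall_loss(q_taken, q_next_max, state, next_state, reward, done,
                 mixer, target_mixer, l_perc, gamma=0.99, lambda_perc=0.20):
    # q_taken: per-agent Q values of the chosen actions; q_next_max: per-agent max Q at t+1
    q_tot = mixer(q_taken, state)
    with torch.no_grad():
        y_td = reward + gamma * (1.0 - done) * target_mixer(q_next_max, next_state)  # TD target
    l_td = 0.5 * (y_td - q_tot).pow(2).mean()       # Eq. (13)
    return l_td + lambda_perc * l_perc              # Eq. (14)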

4 Experiment

4.1 Environments and Settings

Environments. The three experimental environments used in this paper are briefly described as follows: SMAC [15]: SMAC contains a series of StarCraft II games, where agents controlled by our algorithm fight against enemy agents controlled by the built-in game AI. The game ends when all the agents on either side are killed, and the game is won only when the enemy agents are all killed. Hallway [20]: Multiple agents are randomly initialized in different hallways and can only observe their current location. The game is considered to be won only when all agents reach the destination simultaneously. To increase difficulty, we set three hallway lengths: 2, 6, and 10. LBF [13]: LBF is a foraging grid game, where each agent and food is initialized with a level value. The agents combine for foraging, and the group level is the sum of the levels of the agents in the group. The group can forage for smaller levels of food. In this experiment, four agents are set to forage in a grid environment of 10 × 10 and 16 × 16, the food is set to two portions, and the observation range is set to two grids.

Table 1. The experimental parameters in this paper.

Parameter Name    Value      Parameter Name                   Value
optimizer         RMSprop    ε-greedy start value             1
replay buffer     5000       ε-greedy end value               0.05
batch             32         average matrix update interval   20
learning rate     0.0005     Lperc discount factor            0.20

Settings. To demonstrate the validity of CIIMH, we choose several common CTDE algorithms, including QMIX [14], QPLEX [19], and MAVEN [11], as baseline methods. To ensure fairness, the parameters used in all the algorithms in this experiment are consistent with the standard library PyMARL [15]. We independently run each algorithm with five different seeds and test the winning rate every 10,000 steps during the training process. The specific parameter values are shown in Table 1, where the ε-greedy strategy decays from 1 to 0.05 at a rate of 0.00002 per time step.

4.2 Experimental Results and Analysis

Comparative Experiment. To validate the effect, we conduct the comparative experiment on SMAC and Hallway. According to Fig. 3, CIIMH achieves the best performance in all six maps.

Fig. 3. Comparisons with Baselines on SMAC and Hallway

It is worth noting that the relative improvement of CIIMH is not clear on the hard task 5m vs 6m, and QPLEX can achieve more than 60% of the winning
rate within 0.75 million timesteps. We point out that this is because the task contains only five marines and the number of enemies is similar to the number of teammates. In this case, the five agents might win by taking opposing actions; for example, each agent attacks an enemy in a different direction. Therefore, the 5m vs 6m agents only need weak collaboration, so the interaction information is less meaningful.

Table 2. Comparative experiment results on SMAC and Hallway in the last 0.1 million timesteps

Algorithm   MMM     5m vs 6m   8m vs 9m   10m vs 11m   MMM2    Hallway
CIIMH/%     98.01   78.91      91.91      95.02        87.94   97.01
MAVEN/%     75.46   47.54      60.32      54.83        31.46   23.55
QMIX/%      96.04   66.50      70.86      80.96        63.10   3.34
QPLEX/%     91.50   71.95      52.84      16.35        22.98   50.14

In other SMAC maps, CIIMH’s advantages are revealed. Table 2 exhibits the average test win rate of the last 0.1 million time steps in each experiment. CIIMH has achieved a winning rate of more than 95% on the hard maps MMM and 10m vs 11m, 87.94% on the super hard map MMM2. The final test win rate is better than the three baseline methods.

Fig. 4. Ablation experiment results

Ablation Experiment. To demonstrate the effect of each component of CIIMH, a variety of model variants have been developed to conduct the ablation studies. For the interaction information (II) module, we only remove the II module of CIIMH and directly input the perception variations into the Q network, aiming to observe CIIMH's performance after eliminating the II. The ablation experiment shows that after removing the II, CIIMH is less effective and unstable. Specifically, LBF-16 × 16 (Fig. 4(a)) decreased by 29.07%, and the relatively easier LBF-10 × 10 (Fig. 4(b)) by 5.82%, which proves the necessity of the II for CIIMH. For the other components in CIIMH, including the attention mechanism and
the average matrix, we removed them to observe the effect on 2s3z (Fig. 4 (c)). As expected, the algorithm’s performance degrades. Furthermore, we explored the effect of different mixing networks on CIIMH in 1c3s5z (Fig. 4 (d)), and the results demonstrated that QMIX performs better than other mixing networks.

Fig. 5. Visualization Research. (a) Each agent’s perception of all other three agent’s on LBF. The blue and green parts of the figure represent the different groups. (b) Heatmap of incidence matrices of MMM2 (top) and Hallway (below), where hyperedge of 0 means the agent is dead. (Color figure online)

Visualization Research. To illustrate how CIIMH components contribute to cooperation, we evaluate CIIMH in multiple tests. Figure 5(a) shows a test timestep with its corresponding global perception variations. We leverage dimensionality reduction to global perception variations by t-SNE [10]. The results show that agents in the same group of LBF are more closely represented, for example, agent 1 and 4, agent 2 and 3. For each agent, there is one in-group agent in the observation range and two out-group agents not in the observation range. Thus the representation of each agent in Fig. 5(a) to the other three agents is also correspondingly divided into 2:1. Moreover, we investigate the effect of cosine similarity on CIIMH hypergraphs using a visualization analysis. As illustrated in Fig. 5(b), each hyperedge is initialized randomly in the MMM2 and Hallway scenes, and we record the hypergraphs during training. As we expected, the hypergraph successfully extracted information about the interactions between the agents, e.g. hyperedges 2 and 4 of agent 0 in M M M 2 and hyperedge 1 of agent 0 in Hallway.

5 Conclusion

In this paper, we propose CIIMH, which leverages hypergraphs to learn interaction information in multi-agent systems. The key insight of this paper is that an effective collaboration model requires agents' interaction information. We have designed a series of comparative experiments on SMAC and Hallway,
demonstrating the effectiveness of CIIMH. In addition, the ablation experiment shows that the algorithm’s performance is significantly reduced without interaction information. This proves the necessity of interaction information. In future research, we will continue to pay attention to the interaction feature extraction problem under the Decentralized Training with Decentralized Execution (DTDE) architecture [21]. In addition, the experiments in this paper mainly consider the scenario where the number of agents is less than 20, the interaction feature extraction problem in large-scale multi-agent reinforcement learning tasks [8] is also worthy of study, and we will further explore this issue in future research.

References 1. Alemi, A.A., Fischer, I., Dillon, J.V., Murphy, K.: Deep variational information bottleneck. arXiv preprint arXiv:1612.00410 (2016) 2. Bai, Y., Gong, C., Zhang, B., Fan, G., Hou, X., Lu, Y.: Cooperative multi-agent reinforcement learning with hypergraph convolution. In: 2022 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2022) 3. Bhalla, S., Ganapathi Subramanian, S., Crowley, M.: Deep multi agent reinforcement learning for autonomous driving. In: Goutte, C., Zhu, X. (eds.) Canadian AI 2020. LNCS (LNAI), vol. 12109, pp. 67–78. Springer, Cham (2020). https://doi. org/10.1007/978-3-030-47358-7 7 4. Bretto, A.: Hypergraph Theory. An Introduction. Mathematical Engineering, Springer, Cham (2013). https://doi.org/10.1007/978-3-319-00080-0 5. Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014) 6. Feng, Y., You, H., Zhang, Z., Ji, R., Gao, Y.: Hypergraph neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 3558– 3565 (2019) 7. Kiran, B.R., et al.: Deep reinforcement learning for autonomous driving: a survey. IEEE Trans. Intell. Transp. Syst. (2021) 8. Li, J., Yu, T.: Large-scale multi-agent deep reinforcement learning-based coordination strategy for energy optimization and control of proton exchange membrane fuel cell. Sustain. Energy Technol. Assess. 48, 101568 (2021) 9. Luo, Y.C., Tsai, C.W.: Multi-agent reinforcement learning based on two-step neighborhood experience for traffic light control. In: Proceedings of the 2021 ACM International Conference on Intelligent Computing and its Emerging Applications, pp. 28–33 (2021) 10. Van der Maaten, L., Hinton, G.: Visualizing data using T-SNE. J. Mach. Learn. Res. 9(11), 2579–2605 (2008) 11. Mahajan, A., Rashid, T., Samvelyan, M., Whiteson, S.: Maven: multi-agent variational exploration. In: Advances in Neural Information Processing Systems, vol. 32 (2019) 12. Oliehoek, F.A., Amato, C.: A concise introduction to decentralized pomdps (2015) 13. Papoudakis, G., Christianos, F., Sch¨ afer, L., Albrecht, S.V.: Benchmarking multiagent deep reinforcement learning algorithms in cooperative tasks. arXiv preprint arXiv:2006.07869 (2020)


14. Rashid, T., Samvelyan, M., Witt, C.S., Farquhar, G., Foerster, J., Whiteson, S.: Qmix: monotonic value function factorisation for deep multi-agent reinforcement learning. In: International Conference on Machine Learning, pp. 4292–4301 (2018) 15. Samvelyan, M., et al.: The starcraft multi-agent challenge. arXiv preprint arXiv:1902.04043 (2019) 16. Sunehag, P., et al.: Value-decomposition networks for cooperative multi-agent learning. arXiv preprint arXiv:1706.05296 (2017) 17. Tufano, R., Scalabrino, S., Pascarella, L., Aghajani, E., Oliveto, R., Bavota, G.: Using reinforcement learning for load testing of video games. In: Proceedings of the 44th International Conference on Software Engineering, pp. 2303–2314 (2022) 18. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 19. Wang, J., Ren, Z., Liu, T., Yu, Y., Zhang, C.: Qplex: duplex dueling multi-agent qlearning. In: International Conference on Learning Representations (ICLR) (2021) 20. Wang, T., Wang, J., Zheng, C., Zhang, C.: Learning nearly decomposable value functions via communication minimization. arXiv preprint arXiv:1910.05366 (2019) 21. Wen, G., Fu, J., Dai, P., Zhou, J.: DTDE: a new cooperative multi-agent reinforcement learning framework. Innovation 2(4) (2021) 22. Wu, T., et al.: Multi-agent deep reinforcement learning for urban traffic light control in vehicular networks. IEEE Trans. Veh. Technol. 69(8), 8243–8256 (2020) 23. Zhang, B., Bai, Y., Xu, Z., Li, D., Fan, G.: Efficient policy generation in multiagent systems via hypergraph neural network. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) ICONIP 2022. LNCS, vol. 13624, pp. 219–230. Springer, Cham (2022) 24. Zhang, R., Zong, Q., Zhang, X., Dou, L., Tian, B.: Game of drones: multi-UAV pursuit-evasion game with online motion planning by deep reinforcement learning. IEEE Trans. Neural Networks Learn. Syst. 10, 7900–7909 (2022)

A Deep Graph Matching-Based Method for Trajectory Association in Vessel Traffic Surveillance

Yuchen Lu1,2, Xiangkai Zhang1,2, Xu Yang1,2,3(B), Pin Lv1(B), Liguo Sun1, Ryan Wen Liu4, and Yisheng Lv1

1 Institute of Automation, Chinese Academy of Sciences, Beijing, China
{luyuchen2021,zhangxiangkai2023,xu.yang,pin.lv}@ia.ac.cn
2 University of Chinese Academy of Sciences, Beijing, China
3 Centre for Artificial Intelligence and Robotics, Hong Kong Institute of Science and Innovation, Chinese Academy of Sciences, Beijing, China
4 Hubei Key Laboratory of Inland Shipping Technology, School of Navigation, Wuhan University of Technology, Wuhan, China

Abstract. Vessel traffic surveillance in inland waterways extensively relies on the Automatic Identification System (AIS) and video cameras. While video data only captures the visual appearance of vessels, AIS data serves as a valuable source of vessel identity and motion information, such as position, speed, and heading. To gain a comprehensive understanding of the behavior and motion of known-identity vessels, it is necessary to fuse the AIS-based and video-based trajectories. An important step in this fusion is to obtain the correspondence between moving targets by trajectory association. Thus, we focus solely on trajectory association in this work and propose a trajectory association method based on deep graph matching. We formulate trajectory association as a graph matching problem and introduce an attention-based flexible context aggregation mechanism to exploit the semantic features of trajectories. Compared to traditional methods that rely on manually designed features, our approach captures complex patterns and correlations within trajectories through end-to-end training. The introduced dustbin mechanism can effectively handle outliers during matching. Experimental results on synthetic and real-world datasets demonstrate the exceptional performance of our method in terms of trajectory association accuracy and robustness.

Keywords: Vessel traffic surveillance · Automatic Identification System · Trajectory association · Deep graph matching

1 Introduction

As an economic and efficient mode of transportation, inland waterway has gradually gained recognition. However, the presence of obstacles such as bridges,
tunnels, and locks in the narrow and high-density inland waterways poses significant challenges to maritime management and monitoring. Thus, it is necessary to further improve the intelligence of the waterway monitoring system. To achieve this objective, the Automatic Identification System (AIS) and video cameras have been widely used in inland waterways. AIS data can provide vessel identity, type, and motion information, such as position, speed, and heading. In contrast, video cameras provide the visual appearance of vessels but lack vital information regarding vessel identity and motion. Consequently, numerous efforts are focused on fusing the AIS data and video data to further improve vessel traffic surveillance [1,5,10,12]. In fact, AIS is unable to provide real-time data due to its low temporal resolution, whereas video cameras have the advantages of intuitiveness and real-time capabilities. This results in problems such as time delay, information lag, and random outliers. Therefore, most current methods that only consider the absolute position of trajectories and rely on manually designed features encounter difficulty in ensuring effective results. However, compared with absolute position, relative position of trajectories is likely to be more important. Specifically, the implied semantic information of trajectories, such as the target’s motion pattern, and the interaction behavior between moving targets (relative position, collision, evasion, etc.) plays a critical role in the context understanding of trajectory association. Based on the above circumstances, we propose a trajectory association method based on deep graph matching. Specifically, we suggest using a unique graph structure, called target trajectory graph, to represent the trajectories set to be matched. Next, we redefine trajectory association as a graph matching problem. Finally, we determine the correspondence of the trajectories using the deep graph matching method. Our main contributions are summarized as follows: • The proposed target trajectory graph implies trajectory semantic information, signifying its potential for enhancing context understanding and the accuracy of trajectory association. • We have developed a comprehensive trajectory association framework based on deep graph matching, consisting of an attention [14] mechanism and a dustbin mechanism proposed by [13], which can effectively extract the deep trajectory semantic information from target trajectory graphs while eliminating outliers. • Experiments on real and synthetic datasets demonstrate the accuracy and robustness of the proposed method in handling outliers.

2 Related Works

2.1 AIS and Video Data Fusion

The fusion of AIS data and video data is crucial for promoting the traffic situational awareness of maritime transportation systems. For instance, work [1] proposed a method for tracking a single vessel by combining AIS and video data, which enabled the camera to focus on the vessel based on its AIS-derived position and applied the Kalman filter to ensure smooth tracking. However, this method can hardly provide accurate identification of each vessel when multiple vessels are present in the field of view. Consequently, subsequent researchers have shifted their attention to the fusion of information from multiple vessels. For example, work [10] proposed a fusion method based on estimating the vessels' length, which is effective for shore-based visual sensors. Nevertheless, significant errors may arise in the fusion results when the relative azimuth angle between the vessel's course over ground (COG) and the camera bearing is small.

Recently, some researchers have considered using bipartite graph matching algorithms to solve fusion problems. The work [12] proposed using a manually designed feature similarity metric to calculate the similarity between AIS-based and video-based trajectory points, and using the Hungarian algorithm [8] to obtain the optimal fusion result. However, in this work, the difference in temporal resolution between AIS and video data leads to unreliable positions of trajectory points, which affects the fusion result. Moreover, the manually designed similarity metric shows poor generalization performance. Another work [5] proposed applying dynamic time warping (DTW) [11] to calculate the similarity between trajectory sequences and also utilizing the Hungarian algorithm to obtain fusion results. To the best of our knowledge, this is the first and only work that treats a trajectory as a whole for fusion. In general, most existing AIS and video data fusion methods are not robust in complex scenes and are unable to handle outliers well.

2.2 Deep Graph Matching

Graph matching can be expressed as a Quadratic Assignment Problem (QAP) [9], which is an NP-complete problem [6] and difficult to solve. Classical methods mainly adopt different heuristics, such as random walks [2] and the graduated assignment algorithm [4]. However, these optimization-based methods face the challenge of performance saturation. Therefore, learning-based methods have received increasing attention in recent years. Since the seminal work [18], deep neural networks (DNNs) have emerged as a promising paradigm for solving a wide range of combinatorial problems, including graph matching [16]. In deep graph matching methods, a general line of research [15,17] embeds structural information into node features and learns a similarity measurement through graph neural networks (GNNs). Graph matching is thereby reduced to a linear assignment problem, which can be optimally solved through the Sinkhorn network [3].

SuperGlue [13] is a representative method in this line, which has successfully applied graph matching to the field of image matching. SuperGlue consists of two main components, the graph embedding layer and the optimal matching layer. In particular, the graph embedding layer transforms the features of each node into a matching descriptor. The transformation process employs an attentional graph neural network to transfer information between nodes within and across graphs via self-attention and cross-attention mechanisms, respectively. The optimal matching layer outputs the assignment matrix by matching the matching descriptors, which is formulated as an optimal transport problem and solved by the Sinkhorn algorithm. Specifically, a dustbin mechanism is introduced as an effective means of handling outliers and partial assignments, with a learnable parameter linked to the number of outliers instead of the empirical settings used in previous works.

3 Preliminaries

3.1 Target Trajectory Graph

In order to consider contextual information, we define a target trajectory graph to represent the set of trajectories to be matched. This definition is based on a weighted undirected graph. Specifically, given a set of trajectories of size $N$, the corresponding target trajectory graph with $N$ vertices is denoted as $G_t$. Each node $n_i$ of $G_t$ represents a target. The attribute of $n_i$ corresponds to the trajectory information of the target, denoted as $t_i = \{(x_1, y_1), (x_2, y_2), \ldots, (x_p, y_p), \ldots, (x_P, y_P)\}$, where $1 \le i \le N$ and $1 \le p \le P$. $P$ is the total number of trajectory points, which are arranged in chronological order, and $(x_p, y_p)$ represents the position of the $p$-th trajectory point, where $x_p$ and $y_p$ denote the horizontal and vertical coordinates, respectively.
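To make the definition concrete, the following short Python sketch (ours, not code from the paper; `TrajectoryNode` and `build_target_trajectory_graph` are illustrative names) stores a target trajectory graph as a collection of nodes whose attributes are the chronologically ordered trajectory points; the weighted edges are left implicit because the attention mechanism introduced later connects every pair of nodes.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

@dataclass
class TrajectoryNode:
    """One target n_i: its attribute is the ordered trajectory t_i = [(x_1, y_1), ..., (x_P, y_P)]."""
    target_id: str
    points: List[Tuple[float, float]]

def build_target_trajectory_graph(trajectories: Dict[str, List[Tuple[float, float]]]) -> List[TrajectoryNode]:
    # The graph G_t is simply the set of N nodes; edges are realised later by attention.
    return [TrajectoryNode(tid, pts) for tid, pts in trajectories.items()]

graph = build_target_trajectory_graph({
    "vessel-A": [(114.30, 30.55), (114.31, 30.56), (114.32, 30.57)],
    "vessel-B": [(114.28, 30.54), (114.29, 30.55)],
})
print(len(graph), "nodes in the target trajectory graph")
```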

3.2 Target Trajectory Graph Matching

Graph matching is a fundamental problem in pattern recognition that aims to find correspondences among nodes in two or more graphs while maintaining consistency in their edges. Leveraging the proposed target trajectory graph, we formulate trajectory association between AIS data and video data as a pairwise graph matching problem, called target trajectory graph matching. Because outliers are present in AIS and video data in practice, the formulation here is based on target trajectory graphs with outliers.

Given an AIS-based target trajectory graph $G_{ta}$ and a video-based target trajectory graph $G_{tv}$, we use $V_a = \{n_{a1}, n_{a2}, \ldots, n_{aN_a}\}$ and $V_v = \{n_{v1}, n_{v2}, \ldots, n_{vN_v}\}$ to denote the node sets of $G_{ta}$ and $G_{tv}$, respectively, where $N_a$ and $N_v$ are the set sizes. Outliers may exist in either $G_{ta}$ or $G_{tv}$. Target trajectory graph matching is defined as identifying a set of node assignments between $G_{ta}$ and $G_{tv}$, which can be represented by an assignment matrix $\mathbf{P} \in \{0,1\}^{N_a \times N_v}$. If $P_{ij} = 1$, node $n_{ai}$ in $G_{ta}$ is assigned to node $n_{vj}$ in $G_{tv}$. $\mathbf{P}$ is constrained to the set

$$\mathcal{D} := \Big\{ \mathbf{P} \;\Big|\; P_{ij} \in \{0,1\},\; \sum_{i=1}^{N_a} P_{ij} \le 1,\; \sum_{j=1}^{N_v} P_{ij} \le 1 \Big\}, \qquad (1)$$

where the constraints $\sum_{i=1}^{N_a} P_{ij} \le 1$ and $\sum_{j=1}^{N_v} P_{ij} \le 1$ encode the one-to-one correspondence assumption, which means that one node in $G_{ta}$ can be assigned to at most one node in $G_{tv}$. In this study, target trajectory graph matching is designed to maximize the affinity score by identifying the optimal assignment matrix $\mathbf{P}^*$. Based on a given affinity matrix $\mathbf{K} \in \mathbb{R}^{N_a \times N_v}$, this is essentially a linear programming problem of the form

$$\mathbf{P}^* = \arg\max_{\mathbf{P} \in \mathcal{D}} \operatorname{tr}\big(\mathbf{K}^\top \mathbf{P}\big), \qquad (2)$$

where $\operatorname{tr}(\cdot)$ is the matrix trace operation.
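For intuition only (this is not the paper's learned pipeline, and the toy affinity matrix below is made up), the linear program of Eq. (2) under the one-to-one constraints of Eq. (1) is a linear assignment problem that SciPy can solve exactly once an affinity matrix is given:

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

# Toy affinity matrix K: rows are AIS-based nodes, columns are video-based nodes.
K = np.array([
    [0.9, 0.1, 0.2],
    [0.2, 0.8, 0.3],
    [0.1, 0.3, 0.7],
])

# Maximise tr(K^T P) subject to the one-to-one constraints.
rows, cols = linear_sum_assignment(K, maximize=True)

P = np.zeros_like(K, dtype=int)
P[rows, cols] = 1
print("P* =\n", P)
print("affinity score:", (K * P).sum())
```

The deep graph matching pipeline described next replaces this hand-made affinity with one computed from learned matching descriptors and adds a dustbin for outliers.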

4 The Proposed Method

The proposed method for obtaining the assignment matrix from two target trajectory graphs mainly derives from [13]. It consists of two parts, which are illustrated in Fig. 1 and explained in the following subsections. The first part obtains a matching descriptor for each node in the target trajectory graphs. The second part computes the affinity matrix from the matching descriptors and then obtains the assignment matrix.

4.1 Attentional Graph Neural Network

The temporal resolution of AIS data is low, and thus AIS cannot provide accurate and real-time ship position information. In contrast, video data provides a higher temporal resolution and more reliable and detailed ship position information. This stark difference makes it challenging to solve the trajectory association problem by solely relying on position information of trajectory points. Instead, successful solutions demand the consideration of both the relative position of target trajectories and the semantic information derived from them, including the behavior and movement patterns of the targets. This aspect holds paramount importance. In the target trajectory graph, each node represents one target. For one target, integrating the features of surrounding targets can intuitively enhance its distinctiveness and uniqueness in a single target trajectory graph. Analyzing the spatial relationship between the target and other visible targets, especially those with unique motion or position features, is helpful. For instance, a stationary vessel is very likely to be a reliable reference for position. On the other hand, knowledge of targets in the second target trajectory graph can help to eliminate ambiguities through the comparison of potential matches. In practice, when attempting to match a given target, humans may engage in a back-and-forth process of evaluating potential matching targets, analyzing each target, and seeking contextual cues that aid in distinguishing the true match from other similar targets. This implies an iterative process that can effectively direct attention to specific locations. Therefore, we design the graph embedding layer as an attentional graph neural network.


Fig. 1. Framework of the proposed method. It is made up of two major parts: the attentional graph neural network (Sect. 4.1), and the optimal matching layer (Sect. 4.2).


Node Encoder. The initial node feature $t_i$ of each node $i$ is encoded as

$$l_i = \mathrm{Enc}(t_i), \qquad (3)$$

where $\mathrm{Enc}$ is the encoding network layer, which transforms the initial node feature $t_i$ into a lower-dimensional vector $l_i$. To address the variable length of the trajectory sequence data, we use a Long Short-Term Memory (LSTM) network [7] for $\mathrm{Enc}$. The LSTM network is well suited to handling sequence data of differing lengths and can capture long-term temporal dependencies in the trajectories.

Attentional Aggregation. Each vector $l_i$ communicates with the others and can be further embedded through the attention mechanism, which aggregates information from other vectors. Specifically, each vector $l_i$ is first linearly transformed into three vectors termed query, key, and value:

$$\begin{bmatrix} q_i \\ k_i \\ v_i \end{bmatrix} = \begin{bmatrix} \mathbf{W}^q \\ \mathbf{W}^k \\ \mathbf{W}^v \end{bmatrix} l_i + \begin{bmatrix} \mathbf{b}^q \\ \mathbf{b}^k \\ \mathbf{b}^v \end{bmatrix}, \qquad (4)$$

where $\mathbf{W}^q, \mathbf{W}^k, \mathbf{W}^v, \mathbf{b}^q, \mathbf{b}^k, \mathbf{b}^v$ are the projection parameters to be learned. After aggregation through the attention mechanism, the output vector $z_i$ is obtained as

$$z_i = \sum_{j \in \mathcal{N}(i)} p_{ij} v_j, \qquad (5)$$

where $\mathcal{N}(i)$ denotes the set of neighbors that exchange information with node $i$. For intra-graph message propagation, $\mathcal{N}(i)$ denotes all nodes of the same graph except $i$; for inter-graph message propagation, $\mathcal{N}(i)$ denotes all nodes of the other graph. The probability $p_{ij}$ measures the strength of the connection between node $i$ and its neighbor $j$ and is defined as

$$p_{ij} = \mathrm{Softmax}_j\big(q_i^\top k_j\big). \qquad (6)$$

By running $T$ attention operations in parallel, a multi-head attention mechanism is obtained to enhance the feature expression capability. A further linear transformation then merges the individual outputs into the final matching descriptor:

$$d_i = \mathbf{W}\big[\, z_i^1 \,\|\, z_i^2 \,\|\, \cdots \,\|\, z_i^T \,\big], \qquad (7)$$

where $z_i^t$ is the single-head attention output given by Eq. (5), $[\cdot \,\|\, \cdot]$ denotes concatenation, and $\mathbf{W}$ is the linear transformation matrix to be learned. The matching descriptor $d_i$ is obtained by alternately applying attention modules within and between graphs, which facilitates mutual communication between node features.
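The following PyTorch sketch (ours; layer sizes, head count, and class names are illustrative assumptions rather than the paper's configuration) mirrors Eqs. (3)–(7): an LSTM encodes each variable-length trajectory into $l_i$, and per-head query/key/value projections implement one round of self- or cross-attentional aggregation before the heads are merged into a matching descriptor.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionalAggregation(nn.Module):
    """Sketch of Eqs. (3)-(7): LSTM node encoder + one multi-head attention round."""

    def __init__(self, d: int = 64, num_heads: int = 4):
        super().__init__()
        self.encoder = nn.LSTM(input_size=2, hidden_size=d, batch_first=True)   # Enc in Eq. (3)
        self.proj_q = nn.ModuleList(nn.Linear(d, d) for _ in range(num_heads))  # W^q l_i + b^q
        self.proj_k = nn.ModuleList(nn.Linear(d, d) for _ in range(num_heads))  # W^k l_i + b^k
        self.proj_v = nn.ModuleList(nn.Linear(d, d) for _ in range(num_heads))  # W^v l_i + b^v
        self.merge = nn.Linear(d * num_heads, d)                                # W in Eq. (7)
        self.num_heads = num_heads

    def encode(self, traj: torch.Tensor) -> torch.Tensor:
        # traj: (N, P, 2) trajectory points -> l: (N, d), the last LSTM hidden state per node
        _, (h_n, _) = self.encoder(traj)
        return h_n.squeeze(0)

    def aggregate(self, l_query: torch.Tensor, l_neigh: torch.Tensor) -> torch.Tensor:
        # Self-attention: l_neigh comes from the same graph; cross-attention: from the other graph.
        heads = []
        for t in range(self.num_heads):                 # T parallel attention heads
            q = self.proj_q[t](l_query)                 # (N, d)
            k = self.proj_k[t](l_neigh)                 # (M, d)
            v = self.proj_v[t](l_neigh)                 # (M, d)
            p = F.softmax(q @ k.t(), dim=-1)            # p_ij, Eq. (6)
            heads.append(p @ v)                         # z_i, Eq. (5)
        return self.merge(torch.cat(heads, dim=-1))     # d_i, Eq. (7)

model = AttentionalAggregation()
l_a = model.encode(torch.randn(4, 30, 2))   # 4 AIS trajectories with 30 points each
l_v = model.encode(torch.randn(5, 40, 2))   # 5 video trajectories with 40 points each
d_a = model.aggregate(l_a, l_v)             # cross-attention descriptors for the AIS nodes
print(d_a.shape)                            # torch.Size([4, 64])
```

In the full model these aggregation blocks alternate between intra-graph and inter-graph attention over several rounds; a single cross-attention round is shown here for brevity.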

4.2 Optimal Matching Layer

The second major block of our method is the optimal matching layer, which computes a partial assignment matrix from the affinity matrix $\mathbf{K}$. The affinity matrix $\mathbf{K}$ is obtained from the similarity of the matching descriptors:

$$K_{i,j} = \langle d_i^a, d_j^v \rangle, \quad \forall (i,j) \in \{1, \ldots, N_a\} \times \{1, \ldots, N_v\}, \qquad (8)$$

where $\langle \cdot, \cdot \rangle$ is the inner product. Then, a dustbin mechanism is introduced to expand each node set so that unmatched outliers are explicitly assigned to their corresponding dustbins. The expanded affinity matrix is obtained by adding elements corresponding to these outliers:

$$K_{i, N_v+1} = K_{N_a+1, j} = K_{N_a+1, N_v+1} = \alpha \in \mathbb{R}, \qquad (9)$$

where $\alpha$ is a learnable parameter called the dustbin score. The Sinkhorn algorithm then iteratively normalizes $\exp(\mathbf{K})$ along rows and columns, similar to a softmax operation. After $k$ iterations, the assignment matrix $\mathbf{P}^*$ is obtained by discarding the dustbin. Both the graph neural network and the optimal matching layer are differentiable, which enables backpropagation. The loss function employed in network training is

$$\mathrm{Loss} = -\sum_{(i,j) \in M_I} \log P_{i,j} \;-\; \sum_{i \in M_{OA}} \log P_{i, N_v+1} \;-\; \sum_{j \in M_{OV}} \log P_{N_a+1, j}, \qquad (10)$$

where $M_I$ denotes the inlier set in the ground truth, and $M_{OA}$ and $M_{OV}$ respectively denote the outlier sets of the two modalities in the ground truth.
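As an illustration of Eqs. (8)–(9) and the row/column normalisation, here is a simplified log-domain sketch (ours, not the authors' implementation; the published formulation treats the normalisation as an optimal transport problem with proper marginals, which this toy version omits):

```python
import torch

def sinkhorn_with_dustbin(K: torch.Tensor, alpha: torch.Tensor, iters: int = 20) -> torch.Tensor:
    """Augment the affinity matrix with a dustbin row/column (Eq. (9)) and alternately
    normalise exp(K) along rows and columns in the log domain."""
    n_a, n_v = K.shape
    bins_col = alpha.reshape(1, 1).expand(n_a, 1)
    bins_row = alpha.reshape(1, 1).expand(1, n_v + 1)
    log_p = torch.cat([torch.cat([K, bins_col], dim=1), bins_row], dim=0)  # (n_a+1, n_v+1)
    for _ in range(iters):
        log_p = log_p - torch.logsumexp(log_p, dim=1, keepdim=True)  # row normalisation
        log_p = log_p - torch.logsumexp(log_p, dim=0, keepdim=True)  # column normalisation
    return log_p.exp()

# Toy usage: the affinity comes from inner products of matching descriptors (Eq. (8)).
d_a, d_v = torch.randn(4, 64), torch.randn(5, 64)
K = d_a @ d_v.t()
P_full = sinkhorn_with_dustbin(K, alpha=torch.tensor(1.0))
P_star = P_full[:-1, :-1]   # discard the dustbin row and column
print(P_star.shape)         # torch.Size([4, 5])
```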

5 Experiments

The FVessel dataset [5], which was constructed based on the Wuhan segment of the Yangtze River scene, is the benchmark dataset used in our experiments. It consists of 26 videos and the corresponding AIS data. The videos were captured by the HIKVISION DS-2DC4423IW-D dome camera in different locations, such as bridges and riversides, and under varying weather conditions, including sunny, cloudy, and low-light. The AIS data was captured by a Saiyang AIS9000-08 Class-B AIS receiver. We conduct extensive experiments on the FVessel dataset in order to quantitatively evaluate the effectiveness of our proposed method. In addition, a running time analysis is performed to confirm the practicality of our approach.

5.1 Comparison Experiments on the FVessel Dataset

The comparison methods consist of the DeepSORVF method [5], denoted by DeepSORVF, and the proposed method, denoted by OUR. OUR limits the maximum length of a trajectory to 600 and sets the number of Sinkhorn iterations to 20.

Table 1. Matching accuracy, recall and runtime on the FVessel dataset.

Video Index | Acc (%) DeepSORVF | Acc (%) OUR | Rec (%) DeepSORVF | Rec (%) OUR | Runtime (s) DeepSORVF | Runtime (s) OUR
video-01 | 40.00 | 50.00 | 50.00 | 25.00 | 1.16 | 0.04
video-02 | 57.14 | 75.00 | 66.67 | 50.00 | 2.29 | 0.04
video-03 | 100.00 | 100.00 | 100.00 | 80.00 | 1.79 | 0.03
video-04 | 100.00 | 100.00 | 100.00 | 100.00 | 0.12 | 0.03
video-05 | 80.00 | 80.00 | 80.00 | 100.00 | 0.61 | 0.03
video-06 | 100.00 | 100.00 | 100.00 | 100.00 | 0.59 | 0.02
video-07 | 100.00 | 100.00 | 100.00 | 100.00 | 0.06 | 0.02
video-08 | 60.00 | 100.00 | 60.00 | 80.00 | 0.47 | 0.03
video-09 | 100.00 | 50.00 | 100.00 | 100.00 | 0.04 | 0.02
video-10 | 50.00 | 50.00 | 100.00 | 100.00 | 0.37 | 0.02
video-11 | 66.67 | 100.00 | 66.67 | 100.00 | 0.16 | 0.02
video-12 | 100.00 | 100.00 | 100.00 | 100.00 | 0.33 | 0.03
video-13 | 66.67 | 100.00 | 66.67 | 100.00 | 0.91 | 0.03
video-14 | 100.00 | 100.00 | 75.00 | 66.67 | 0.17 | 0.02
video-15 | 100.00 | 100.00 | 100.00 | 100.00 | 0.49 | 0.02
video-16 | 100.00 | 50.00 | 100.00 | 100.00 | 0.04 | 0.02
video-17 | 100.00 | 100.00 | 100.00 | 100.00 | 0.19 | 0.03
video-18 | 0.00 | 83.33 | 0.00 | 83.33 | 2.32 | 0.04
video-19 | 100.00 | 100.00 | 100.00 | 100.00 | 0.12 | 0.02
video-20 | 100.00 | 100.00 | 100.00 | 100.00 | 0.26 | 0.02
video-21 | 100.00 | 100.00 | 100.00 | 100.00 | 1.18 | 0.03
video-22 | 50.00 | 100.00 | 100.00 | 100.00 | 0.05 | 0.03
video-23 | 100.00 | 50.00 | 100.00 | 50.00 | 0.65 | 0.04
video-24 | 50.00 | 100.00 | 33.33 | 100.00 | 0.20 | 0.04
video-25 | 57.14 | 85.71 | 50.00 | 85.71 | 1.51 | 0.05
video-26 | 100.00 | 75.00 | 80.00 | 75.00 | 0.52 | 0.10
Average | 79.91 | 86.50 | 79.30 | 88.30 | 0.64 | 0.03

Table 2. Matching accuracy and recall on the synthetic dataset.

Outlier# | Acc (%) DeepSORVF | Acc (%) OUR | Rec (%) DeepSORVF | Rec (%) OUR
0 | 73.02 | 89.58 | 73.70 | 85.42
1 | 50.41 | 88.52 | 66.76 | 80.42
2 | 39.54 | 83.33 | 64.54 | 81.08
3 | 29.58 | 73.33 | 59.07 | 66.14


Specifically, we extract video-based trajectories from the vessel tracking ground truth labeled in FVessel and obtain AIS-based trajectories by preprocessing the AIS data with the method proposed in [5]. We use these two types of trajectories as input and evaluate the association results as well as the runtime of the two methods separately. For DeepSORVF, the similarity matrix between trajectories is calculated by the DTW algorithm, followed by the Hungarian algorithm to identify the corresponding matching results. For OUR, due to the limited size of the dataset, leave-one-out cross-validation is employed. As outliers are present, two criteria, namely matching accuracy and matching recall, are used. These criteria are defined as follows:

$$\mathrm{Acc} = \frac{|M_{CI}|}{|M_I^*|}, \qquad (11)$$

$$\mathrm{Rec} = \frac{|M_{CI}|}{|M_I|}, \qquad (12)$$

where $M_{CI}$ denotes the set of correct inliers in the model output, $M_I^*$ denotes the inlier set in the model output, and $M_I$ denotes the inlier set in the ground truth.

The comparison results for the two criteria above, together with the runtime, are shown in Table 1. It can be observed that the performance of the proposed method surpasses that of DeepSORVF. Remarkably, OUR achieves high accuracy and recall at a much faster rate than DeepSORVF, meeting the real-time requirements of maritime monitoring.
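A direct reading of Eqs. (11)–(12), shown here as a tiny Python helper (ours, with made-up toy correspondences):

```python
def matching_accuracy_recall(pred_pairs: set, gt_inlier_pairs: set) -> tuple:
    """Acc = |M_CI| / |M_I*| and Rec = |M_CI| / |M_I| from Eqs. (11)-(12)."""
    correct = pred_pairs & gt_inlier_pairs                       # M_CI
    acc = len(correct) / len(pred_pairs) if pred_pairs else 0.0  # M_I* = predicted inliers
    rec = len(correct) / len(gt_inlier_pairs) if gt_inlier_pairs else 0.0
    return acc, rec

# Toy example: three predicted matches, two of them correct, four ground-truth inliers.
acc, rec = matching_accuracy_recall({(0, 0), (1, 2), (2, 1)},
                                    {(0, 0), (1, 2), (3, 3), (4, 4)})
print(f"Acc = {acc:.2f}, Rec = {rec:.2f}")   # Acc = 0.67, Rec = 0.50
```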

5.2 Experiments on the Synthetic Dataset

In order to further evaluate the ability of the proposed method to handle outliers, we construct synthetic datasets with different numbers of outliers based on the FVessel dataset. As the number of trajectories within each sample is limited to a maximum of 10, we set the number of outliers to 1, 2, and 3, respectively. An outlier is generated by randomly selecting one trajectory and applying a random translation to it. For OUR, we divide the FVessel dataset into two parts, with 17 videos for training and the remaining 9 for testing. Due to the limited number of training samples, we use data augmentation to expand the training set. For DeepSORVF, we only record its performance on the test set. The experimental results with different numbers of outliers are shown in Table 2. As the number of outliers increases, the performance of both methods generally decreases. Nevertheless, even as the interference from outliers grows, our method remains robust in handling them, as the experimental results indicate.
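The outlier-generation recipe described above can be sketched in a few lines of Python (ours; the translation range is an arbitrary placeholder, not a value from the paper):

```python
import random

def make_synthetic_outlier(trajectories, max_shift=0.01):
    """Randomly select one trajectory and apply a random translation to every point."""
    idx = random.randrange(len(trajectories))
    dx = random.uniform(-max_shift, max_shift)
    dy = random.uniform(-max_shift, max_shift)
    shifted = [(x + dx, y + dy) for (x, y) in trajectories[idx]]
    return idx, shifted
```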

6 Conclusion

This paper introduces a deep graph matching-based method for trajectory association in vessel traffic surveillance, which handles outliers and enhances data fusion performance. Our proposed learnable trajectory association method replaces previous handcrafted heuristics with a powerful neural model. Experiments on real and synthetic datasets demonstrate the effectiveness and robustness of the proposed method.

Acknowledgements. This work is supported partly by the National Key R&D Program of China (grant 2020AAA0108902), partly by the National Natural Science Foundation of China (NSFC) (grants 61973301, 61972020), and partly by the Youth Innovation Promotion Association CAS.

References
1. Chen, J., Hu, Q., Zhao, R., Guojun, P., Yang, C.: Tracking a vessel by combining video and AIS reports. In: 2008 Second International Conference on Future Generation Communication and Networking, vol. 2, pp. 374–378. IEEE (2008)
2. Cho, M., Lee, J., Lee, K.M.: Reweighted random walks for graph matching. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6315, pp. 492–505. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15555-0_36
3. Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: Advances in Neural Information Processing Systems, vol. 26 (2013)
4. Gold, S., Rangarajan, A.: A graduated assignment algorithm for graph matching. IEEE Trans. Pattern Anal. Mach. Intell. 18(4), 377–388 (1996)
5. Guo, Y., Liu, R.W., Qu, J., Lu, Y., Zhu, F., Lv, Y.: Asynchronous trajectory matching-based multimodal maritime data fusion for vessel traffic surveillance in inland waterways. IEEE Trans. Intell. Transp. Syst. (2023)
6. Hartmanis, J.: Computers and intractability: a guide to the theory of NP-completeness (Michael R. Garey and David S. Johnson). SIAM Rev. 24(1), 90 (1982)
7. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
8. Kuhn, H.W.: The Hungarian method for the assignment problem. Naval Res. Logist. Quart. 2(1–2), 83–97 (1955)
9. Loiola, E.M., De Abreu, N.M.M., Boaventura-Netto, P.O., Hahn, P., Querido, T.: A survey for the quadratic assignment problem. Eur. J. Oper. Res. 176(2), 657–690 (2007)
10. Lu, Y., et al.: Fusion of camera-based vessel detection and AIS for maritime surveillance. In: 2021 26th International Conference on Automation and Computing (ICAC), pp. 1–6. IEEE (2021)
11. Müller, M.: Dynamic time warping. In: Information Retrieval for Music and Motion, pp. 69–84. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-74048-3_4
12. Qu, J., Liu, R.W., Guo, Y., Lu, Y., Su, J., Li, P.: Improving maritime traffic surveillance in inland waterways using the robust fusion of AIS and visual data. Ocean Eng. 275, 114198 (2023)
13. Sarlin, P.E., DeTone, D., Malisiewicz, T., Rabinovich, A.: SuperGlue: learning feature matching with graph neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4938–4947 (2020)
14. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
15. Wang, R., Yan, J., Yang, X.: Learning combinatorial embedding networks for deep graph matching. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3056–3065 (2019)
16. Yan, J., Yang, S., Hancock, E.R.: Learning for graph matching and related combinatorial optimization problems. In: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20), pp. 4988–4996 (2020)
17. Yu, T., Wang, R., Yan, J., Li, B.: Learning deep graph matching with channel-independent embedding and Hungarian attention. In: International Conference on Learning Representations (2020)
18. Zanfir, A., Sminchisescu, C.: Deep learning of graph matching. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2684–2693 (2018)

Few-Shot Anomaly Detection in Text with Deviation Learning

Anindya Sundar Das1(B), Aravind Ajay2, Sriparna Saha2, and Monowar Bhuyan1

1 Department of Computing Science, Umeå University, Umeå 90781, Sweden
{aninsdas,monowar}@cs.umu.se
2 Department of Computer Science Engineering, Indian Institute of Technology Patna, Patna, India

Abstract. Most current methods for detecting anomalies in text concentrate on constructing models solely relying on unlabeled data. These models operate on the presumption that no labeled anomalous examples are available, which prevents them from utilizing prior knowledge of anomalies that are typically present in small numbers in many real-world applications. Furthermore, these models prioritize learning feature embeddings rather than optimizing anomaly scores directly, which could lead to suboptimal anomaly scoring and inefficient use of data during the learning process. In this paper, we introduce FATE, a deep few-shot learning-based framework that leverages limited anomaly examples and learns anomaly scores explicitly in an end-to-end manner using deviation learning. In this approach, the anomaly scores of normal examples are adjusted to closely resemble reference scores obtained from a prior distribution. Conversely, anomaly samples are forced to have anomalous scores that considerably deviate from the reference score in the upper tail of the prior. Additionally, our model is optimized to learn the distinct behavior of anomalies by utilizing a multi-head self-attention layer and multiple instance learning approaches. Comprehensive experiments on several benchmark datasets demonstrate that our proposed approach attains a new level of state-of-the-art performance (our code is available at https://github.com/arav1ndajay/fate/).

Keywords: Anomaly detection · Natural language processing · Few-shot learning · Text anomaly · Deviation learning

1 Introduction

Anomaly detection (AD) is defined as the process of identifying unusual data points or events that deviate notably from the majority and do not adhere to expected normal behavior. Although research on anomaly detection in the text domain is not particularly extensive, it has numerous relevant applications in Natural Language Processing (NLP) [3,9,14,25,28]. In general, anomaly detection faces two formidable obstacles. First, anomalies are dissimilar to each other and frequently exhibit distinct behavior. Second, anomalies are rare data events, which means they make up only a tiny fraction of the normal dataset. Acquiring large-scale anomalous data with accurate labels is highly expensive and difficult; hence, it is exceedingly difficult to train supervised models for anomaly detection. Existing deep anomaly detection methods for text deal with these challenges predominantly by focusing on unsupervised learning, popularly known as one-class classification [22,28], or by employing self-supervised learning [20], in which detection models are trained solely on normal data. These models often identify noise and unimportant data points as anomalies [1], as they do not have any prior understanding of what constitutes anomalous data. This results in higher rates of false positives and false negatives [8,27]. Moreover, these deep methods learn the intermediate representation of normal features independently of the anomaly detection objective, using Generative Adversarial Networks (GANs) [4,30] or autoencoders [29]. As a result, the generated representations may not be adequate for anomaly detection.

To counter these challenges, we propose a transformer-based Few-shot Anomaly detection in TExt with deviation learning (FATE) framework, in which we leverage a few labeled anomalous instances to train an anomaly-aware detection model, infusing a prior understanding of anomalousness into the learning objective. This setup is practical since only a small number of anomalous data points are required, which can be obtained either from a detection system that has already been established and validated by human experts or directly identified and reported by human users. In this framework, we use a multi-head self-attention layer [17] to jointly learn a vector representation of the input text and an anomaly score function. We optimize our model using a deviation-loss-based objective [24], where the anomaly scores of normal instances are pushed to closely match a reference score, which is calculated as the mean of a set of samples drawn from a known probability distribution (prior). Meanwhile, the scores of anomalous instances are forced to deviate from the normal reference score in a statistically significant way. FATE needs a tiny number of labeled anomalies for training, only 0.01%–0.07% of all anomalies in each dataset and just 0.008%–0.2% of the total available training data.

The key contributions of our work are listed below:
i) We propose a new transformer-based framework for text anomaly detection based on few-shot learning that can directly learn anomaly scores end-to-end. Unlike existing approaches that focus on latent space representation learning, our model explicitly optimizes the objective of learning anomaly scores. To the best of our knowledge, this is the first effort to utilize a limited number of labeled anomalous instances to learn anomaly scores for text data.
ii) Our proposed model employs various techniques, including multi-head self-attention, multiple-instance learning, a Gaussian prior distribution, and a Z-score-based deviation loss, to learn anomaly scores that are well optimized and generalizable. We illustrate that our approach surpasses competing approaches by a considerable margin, especially in terms of sample efficiency and its capability to handle data contamination.
iii) We conduct extensive evaluations to determine the performance benchmarks of FATE on three publicly accessible datasets.
iv) Our study shows that FATE attains a new state-of-the-art in detecting anomalies in text data, as supported by extensive empirical results from three benchmark text datasets.

2 Related Work

Although there is a limited amount of literature on anomaly detection in text data, some important studies exist. Our method is related to work on anomaly detection in text and on few-shot anomaly detection.

Over the past few years, deep anomaly detection for images has attracted much attention, with recent works [4,11,35,37] showcasing promising, cutting-edge results. A few methods for detecting text anomalies involve pre-trained word embeddings, representation learning, or deep neural networks. In one of the earlier studies [19], autoencoders are used for representation learning for document classification. Most recent works treat the task of anomaly detection in text data as a problem of detecting topical intrusions, which entails considering samples from one domain as inliers or normal instances, while a few samples from other disciplines are considered outliers or anomalies. In one such study, CVDD [28] leverages pre-trained word embeddings and a self-attention mechanism to learn sentence representations by capturing various semantic contexts and context vectors representing different themes. The network then detects anomalies in sentences and phrases with respect to the themes in an unlabeled text corpus. Recent works [2,6,18,36] use self-supervised learning to differentiate between types of transformations applied to normal data and to learn features of normality. Token-wise perplexity is frequently used as the anomaly score for AD tasks. The DATE method [20] utilizes transformer self-supervision to learn a probability score indicating token-wise anomalousness, which is then averaged over the sequence length to obtain the sentence-level anomaly score. The approaches mentioned above depend entirely on normal data to learn features of normality, and they fail to utilize the small number of labeled anomalous data points that are often readily available. The resulting lack of prior knowledge of anomalies often leads to ineffective discrimination by the models [1,8,27].

Few-shot anomaly detection addresses this problem by utilizing a small number of labeled anomalous samples, thereby incorporating prior knowledge of anomalies into the model. Injecting a few labeled anomalous samples into a belief propagation algorithm enhances the performance of anomalous node detection in graphs, as demonstrated in works such as [21] and [33]. These techniques emphasize learning the feature space as a first step, which is then used to calculate the anomaly score. In contrast, Deviation Networks are used in [24] and [23] to efficiently leverage a small amount of labeled data for end-to-end learning of anomaly scores for multivariate and image data, respectively. Multiple Instance Learning (MIL) has also been investigated for anomaly detection in images and videos in weakly supervised settings [23,32,34]. In this work, we introduce a transformer-based framework that leverages deviation learning and MIL methods in a novel and efficient way to learn scores reflective of text anomalousness.

3 FATE: The Proposed Approach

In this section, we state the problem, describe the proposed FATE framework, and outline its various components.

3.1 Problem Statement

The objective of our FATE model is to employ a limited number of labeled anomalous samples along with a vast amount of normal data to train an anomaly-aware detection model that can explicitly learn anomaly scores. Given a training dataset $X = \{x_1, x_2, \ldots, x_I, x_{I+1}, x_{I+2}, \ldots, x_{I+O}\}$ consisting of $I$ normal data samples (inliers) $X_I = \{x_1, x_2, \ldots, x_I\}$ and a very small set of $O$ labeled anomalies (outliers) $X_O = \{x_{I+1}, x_{I+2}, \ldots, x_{I+O}\}$ with $I \gg O$, which yields some insights into actual anomalies, we aim to learn an anomaly scorer $\psi_K : X \to \mathbb{R}$ that assigns scores to data instances according to whether they are anomalous (outliers) or normal (inliers). The objective is to ensure that $\psi_K(x_i)$ lies close to the mean anomaly score of inlier data samples, defined by the sample mean from a prior distribution $\mathcal{N}(\mu, \sigma)$, while $\psi_K(x_j)$ deviates significantly from the mean anomaly score and lies in the upper tail of the distribution, i.e., $\psi_K(x_j) > \psi_K(x_i)$, where $x_i \in X_I$ and $x_j \in X_O$.

3.2 The Proposed Framework

The general architecture of our proposed FATE model is shown in Fig. 1. It comprises two major components: an Anomaly Score Generator Network and a Reference Score Generator. The Anomaly Score Generator accepts an input text, a sequence of words of finite length, and processes it to produce a scalar output, i.e., an anomaly score. The Reference Score Generator provides a reference for the anomaly scores of normal samples.

Anomaly Score Generator Network. The main components of the Anomaly Score Generator Network are the Sentence Encoder, the Multi-Head Self-Attention (MHSA) layer [17], and the MIL-driven top-K scorer.

– Sentence Encoder. Let $x_i = (x_{i1}, x_{i2}, \ldots, x_{iN})$ be an input text or token sequence comprising $N$ words. The Sentence Encoder is a BERT-based [5,26] encoder that converts the input sequence $x_i$ into a sequence of contextualized representations $h(x_i) = (h_{i1}, h_{i2}, \ldots, h_{iN}) \in \mathbb{R}^{d \times N}$, i.e., $d$-dimensional vector embeddings of the $N$ tokens in $x_i$.


Fig. 1. The proposed FATE framework. The anomaly scoring network $\psi(x; \Theta)$ is parameterized by $\Theta$ and comprises the Sentence Encoder and the Multi-Head Self-Attention (MHSA) layer. The attention matrix is denoted by $A(\Theta)$. $\mu_R$ and $\sigma_R$ are the mean and standard deviation of the scores of $n$ normal samples defined by a prior probability distribution $\mathcal{N}(\mu, \sigma)$. $\psi_K$ denotes the top-K MIL-based anomaly scoring function. The loss $l_1(\cdot)$ is the deviation loss, which ensures that normal samples have anomaly scores close to $\mu_R$ while the scores of anomalous objects deviate significantly from $\mu_R$. The loss $l_2(\cdot)$ is the regularization term that enforces near-orthogonality of the attention heads.

– Multi-Head Self-Attention. The MHSA layer accepts $h(x_i)$ as input and transforms it into fixed-length vector embeddings $\psi(x_i; \Theta)$ of length $m$; each of the $m$ vectors represents a set of distinct anomaly scores, capturing multiple aspects and contexts of anomalousness in the text sequence $x_i$. Formally, the MHSA layer computes an attention matrix $A = (a_1, a_2, \ldots, a_m) \in (0,1)^{N \times m}$ from the embedding representation $h(x_i) \in \mathbb{R}^{d \times N}$ as follows:

$$A = \mathrm{softmax}\big(\tanh\big(h(x_i)^\top \Theta_1\big)\Theta_2\big), \qquad (1)$$

where $\Theta_1 \in \mathbb{R}^{d \times r_a}$ and $\Theta_2 \in \mathbb{R}^{r_a \times m}$ are learnable weight matrices. The complexity of the MHSA module is determined by the intermediate dimensionality $r_a$. The softmax activation is applied to each column to ensure that the resulting column vectors $a_j$ of $A$ are normalized. The $m$ vectors $a_1, a_2, \ldots, a_m$ are known as attention heads, and each head assigns a relative importance to each token in the sequence. The self-attention matrix $A$ is applied to the embedding representation $h(x_i)$, which yields a fixed-length score matrix $S = (s_1, s_2, \ldots, s_m) \in \mathbb{R}^{d \times m}$ comprising $m$ anomaly score vectors. Every column vector $s_k$ of $S$, which represents $d$ anomaly scores, is a convex combination of the embedding vectors $h_{i1}, h_{i2}, \ldots, h_{iN}$:

$$S = h(x_i) A. \qquad (2)$$

To ensure independence between score vectors, eliminate redundant information, and ensure that each score vector emphasizes distinct aspects of semantic anomalousness, an orthogonality constraint called the MHSA loss is included in the model objective:

$$l_2(x_i; \Theta) = \|A^\top A - I\|_F^2, \qquad (3)$$

where $\|\cdot\|_F$ denotes the Frobenius norm [7] of a matrix.

– MIL-driven top-K scorer. In Multiple-Instance Learning (MIL) [23,32,34], each data point is represented as a bag of multiple instances in different feature subspaces, resulting in more generalized representations. The score matrix $S$ represents a set of orthogonal score vectors. The anomaly scoring vector $\psi(x_i; \Theta) \in \mathbb{R}^{\hat{N}}$, in which each input text is represented as a set of $\hat{N}$ distinct anomaly scores (multiple instances), is obtained by flattening $S$ (Fig. 1). We then select $L(x_i)$, the set of $K$ instances in $\psi(x_i; \Theta)$ with the highest anomaly scores. The overall MIL-driven top-K anomaly score of the input text is defined as

$$\psi_K(x_i; \Theta) = \frac{1}{K} \sum_{x_{ij} \in L(x_i)} \psi(x_{ij}; \Theta), \qquad (4)$$

where $|L(x_i)| = K$, $\psi(x_i; \Theta) = (\psi(x_{i1}; \Theta), \psi(x_{i2}; \Theta), \ldots, \psi(x_{i\hat{N}}; \Theta))$, and $\hat{N} = d \cdot m$.

Reference Score Generator. Once the anomaly scores have been obtained using $\psi_K(x_i; \Theta)$, the network output is augmented with a reference score $\mu_R \in \mathbb{R}$ that provides prior knowledge to aid the optimization process. This reference score is derived from the average anomaly score of a set of $n$ randomly chosen normal instances, denoted as $R$. To define the prior, we opt for a Gaussian distribution, since [12] has provided extensive evidence that the Gaussian distribution is an excellent fit for anomaly scores across various datasets. We define the reference score as

$$\mu_R = \frac{1}{n} \sum_{i=1}^{n} r_i, \qquad (5)$$

where $r_i \sim \mathcal{N}(\mu, \sigma)$ denotes the anomaly score of the $i$-th randomly selected normal sample drawn from the Normal distribution, and $R = \{r_1, r_2, \ldots, r_n\}$ is the prior-driven anomaly score set.
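The following PyTorch sketch (ours, not the authors' code; dimensions such as d = 768 are assumptions) condenses Eqs. (1)–(5): the attention and score matrices, the orthogonality penalty, the flattened instance scores with top-K averaging, and the Gaussian reference statistics.

```python
import torch
import torch.nn as nn

class AnomalyScorer(nn.Module):
    """Sketch of Eqs. (1)-(4): MHSA scoring over token embeddings plus top-K MIL pooling."""

    def __init__(self, d: int = 768, r_a: int = 150, m: int = 5, top_k_ratio: float = 0.1):
        super().__init__()
        self.theta1 = nn.Linear(d, r_a, bias=False)    # Θ1 ∈ R^{d×r_a}
        self.theta2 = nn.Linear(r_a, m, bias=False)    # Θ2 ∈ R^{r_a×m}
        self.top_k_ratio = top_k_ratio

    def forward(self, h: torch.Tensor):
        # h: (N, d) contextualised token embeddings of one input text, i.e. h(x_i)^T
        A = torch.softmax(self.theta2(torch.tanh(self.theta1(h))), dim=0)   # (N, m), Eq. (1)
        S = h.t() @ A                                                       # (d, m), Eq. (2)
        ortho = ((A.t() @ A - torch.eye(A.shape[1])) ** 2).sum()            # ||A^T A - I||_F^2, Eq. (3)
        scores = S.flatten()                                                # N_hat = d*m instance scores
        k = max(1, int(self.top_k_ratio * scores.numel()))
        psi_k = scores.topk(k).values.mean()                                # Eq. (4)
        return psi_k, ortho

# Reference statistics from the Gaussian prior (Eq. (5)); n = 5000 and N(0, 1) follow the paper's setup.
r = torch.randn(5000)
mu_R, sigma_R = r.mean(), r.std()

scorer = AnomalyScorer()
psi, mhsa_loss = scorer(torch.randn(128, 768))    # one text with 128 tokens
print(float(psi), float(mhsa_loss), float(mu_R), float(sigma_R))
```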


Deviation Learning. The deviation is determined as a Z-score with respect to the selected prior distribution:

$$Z_{dev}(x_i; \Theta) = \frac{\psi_K(x_i; \Theta) - \mu_R}{\sigma_R}, \qquad (6)$$

where $\sigma_R$ is the sample standard deviation of the set of reference scores $R = \{r_1, r_2, \ldots, r_n\}$. The model learns the deviation between normality and anomalousness by optimizing a deviation-based contrastive loss [10], given by

$$l_1(\psi_K(x_i; \Theta), \mu_R, \sigma_R) = (1 - y_i)\,|Z_{dev}(x_i; \Theta)| + y_i \max\big(0, \alpha - Z_{dev}(x_i; \Theta)\big), \qquad (7)$$

where $y_i = 0$ if $x_i \in X_I$ (the normal sample set) and $y_i = 1$ if $x_i \in X_O$ (the anomalous sample set). The hyperparameter $\alpha$ in FATE is similar to the confidence interval parameter of a Z-score. By using this deviation loss, FATE minimizes the difference between the anomaly scores of normal examples and $\mu_R$ while ensuring a deviation of at least $\alpha$ between the anomaly scores of anomalous samples and $\mu_R$. FATE optimizes both the deviation loss (Eq. 7) and the orthogonality constraint (Eq. 3) during training. The overall loss function is

$$l_{FATE}(x_i; \Theta) = l_1(\psi_K(x_i; \Theta), \mu_R, \sigma_R) + l_2(x_i; \Theta). \qquad (8)$$

The procedure for training FATE is described in Algorithm 1. During the testing phase, FATE uses the optimized network $\psi_K$ to generate anomaly scores for each test instance. The samples are ranked according to their anomaly scores, and those with the highest anomaly scores are identified as outliers. The Gaussian prior employed in generating the reference score makes our anomaly scores inherently interpretable.
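A minimal PyTorch rendering of Eqs. (6)–(7) (ours; the toy scores below are made up, and the MHSA term of Eq. (8) would be added on top):

```python
import torch

def deviation_loss(psi_k: torch.Tensor, y: torch.Tensor,
                   mu_R: torch.Tensor, sigma_R: torch.Tensor, alpha: float = 5.0) -> torch.Tensor:
    """Z-score deviation w.r.t. the Gaussian prior (Eq. (6)) and the contrastive loss (Eq. (7)).
    psi_k: anomaly scores, y: labels (0 = normal, 1 = anomaly)."""
    z = (psi_k - mu_R) / sigma_R
    loss = (1 - y) * z.abs() + y * torch.clamp(alpha - z, min=0.0)
    return loss.mean()

# Toy usage: two normal and two anomalous scores against a standard-normal prior.
r = torch.randn(5000)
psi = torch.tensor([0.1, -0.2, 4.0, 6.5])
y = torch.tensor([0.0, 0.0, 1.0, 1.0])
print(float(deviation_loss(psi, y, r.mean(), r.std())))
```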

4 Implementation Details

We first describe the datasets and then provide details of the experimental methodology.

4.1 Datasets

To assess how well FATE performs in relation to the standard baseline methods, we evaluate it on three benchmark datasets: 20Newsgroups [13], AG News [38], and Reuters-21578 [15]. We preprocess them following the methods suggested in [20,28]: converting all text to lowercase, removing punctuation marks and numbers, and eliminating stopwords. To prepare the training and testing splits for our few-shot setup, we use the following procedure for each dataset. First, we create the inlier data (normal samples) by selecting examples from only one label of the train split of the corresponding dataset. Then we construct the outlier training set, comprising a very small number of samples (5 to 40), by randomly sampling an equal number of instances from each outlier class (the other labels in the train split). For testing, we create the inlier data by selecting examples solely from one label of the test split of the corresponding dataset, while using data with the other labels in the test split as outlier test examples.

– The 20Newsgroups [13] dataset consists of about 20,000 newsgroup documents grouped into 20 distinct topical subcategories. To conduct our experiments, we adhere to the approach outlined in [20] and [28] by only selecting articles from six primary classes: computer, recreation, science, miscellaneous, politics, and religion.
– The AG News [38] dataset consists of articles gathered from diverse news sources, assembled for topic classification.
– Reuters-21578 [15] is a well-known dataset in natural language processing that contains a collection of news articles published on the Reuters newswire in 1987. Following [28], we conduct experiments on 7 selected categories: earn, acq, crude, trade, money-fx, interest, and ship.

Algorithm 1. FATE: Training with few labeled anomalies
Input: X: set of all training samples; X_I: set of normal samples; X_O: set of anomalous samples; X = X_I ∪ X_O and X_I ∩ X_O = Ø
Output: ψ_K : X → R: anomaly score generator
1: Initialize the parameters in Θ randomly.
2: for e = 1 → number_of_epochs do
3:   for b = 1 → number_of_batches do
4:     X_v ← select a random set of v data instances consisting of an equal number of examples from X_I and X_O.
5:     Generate a set of n anomaly scores R by sampling randomly from a normal distribution N(μ, σ).
6:     Calculate the mean μ_R and standard deviation σ_R from the sample score set R.
7:     Compute deviation_loss_batch = (1/|X_v|) Σ_{x∈X_v} l_1(x, μ_R, σ_R; Θ).
8:     Compute MHSA_loss_batch = (1/|X_v|) Σ_{x∈X_v} l_2(x; Θ).
9:     Calculate the total loss loss_batch = deviation_loss_batch + MHSA_loss_batch.
10:    Perform parameter optimization on loss_batch using the Adam optimizer to update Θ.
11:  end for
12: end for
13: return ψ_K
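Read alongside Algorithm 1, a compact training loop could look as follows (ours; it reuses the `AnomalyScorer` and `deviation_loss` sketches given earlier and assumes a balanced loader that yields token embeddings and labels):

```python
import torch

def train_fate(model, loader, epochs: int = 5, n_ref: int = 5000, lr: float = 1e-6):
    """Sketch of Algorithm 1: per batch, draw reference scores from N(0, 1), then optimise
    the deviation loss plus the MHSA orthogonality loss with Adam."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        for batch_h, batch_y in loader:             # equal numbers of inliers and outliers
            r = torch.randn(n_ref)                  # step 5: sample reference scores
            mu_R, sigma_R = r.mean(), r.std()       # step 6: reference statistics
            scores, orthos = [], []
            for h in batch_h:                       # per-text anomaly score and MHSA penalty
                psi_k, ortho = model(h)
                scores.append(psi_k)
                orthos.append(ortho)
            loss = deviation_loss(torch.stack(scores), batch_y, mu_R, sigma_R) \
                   + torch.stack(orthos).mean()     # steps 7-9: total batch loss
            opt.zero_grad()
            loss.backward()
            opt.step()                              # step 10: Adam update of Θ
    return model
```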

4.2 Model Training and Hyperparameters

We adhere to the process illustrated in Fig. 1 to train the FATE network. We used the PyTorch 1.12.1 framework (https://pytorch.org/) for all our experiments. For the sentence encoder, we fine-tune an SBERT model (https://www.sbert.net/) and obtain the embeddings from the encoder layer (before the pooling takes place). We employ a maximum sequence length of 128 and a batch size of 16. In the self-attention layer, we fix $r_a = 150$ and $m = 5$ for all experiments. Using an Adam optimizer with the learning rate set to 1e−6, we train our model for 5 epochs on AG News. To account for the smaller dataset sizes of 20Newsgroups and Reuters-21578, we train our model for 50 epochs and 40 epochs, respectively. In cases where the number of inlier sentences in the training data is fewer than 500 (e.g., some subsets of Reuters-21578), we set the number of epochs to 80. $K$ in the top-K scorer is set to 10%, and the number of few-shot anomalies in each training run is set to 10. We opt for the standard normal distribution $\mathcal{N}(0, 1)$ as the prior for the reference score. The number of normal reference samples $n$ (Eq. 5) is set to 5000. We select a confidence interval $\alpha = 5$ (Eq. 7).

4.3 Baselines and Evaluation Metrics

We compare our proposed method FATE with existing state-of-the-art text AD models, namely OCSVM [31] and CVDD [28] (implementations taken from the official CVDD repository: https://github.com/lukasruff/CVDD-PyTorch) and DATE [20] (experiments conducted using the official published code and setup: https://github.com/bit-ml/date). In our experiments, we use the commonly adopted detection performance metrics: the Area Under the Receiver Operating Characteristic Curve (AUROC) and the Area Under the Precision-Recall Curve (AUPRC). AUROC summarizes the true-positive versus false-positive ROC curve, whereas AUPRC is a compact summary of the precision-recall curve. Higher values of both AUROC and AUPRC correspond to better performance.
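Both metrics are available off the shelf; for example, with scikit-learn (average precision being a common estimator of AUPRC, and the toy scores below being made up):

```python
from sklearn.metrics import roc_auc_score, average_precision_score

y_true = [0, 0, 0, 1, 0, 1, 0, 1]                       # 1 = anomaly
y_score = [0.1, 0.3, 0.2, 2.9, 0.4, 3.5, 0.2, 1.8]      # anomaly scores from a detector

print("AUROC:", roc_auc_score(y_true, y_score))
print("AUPRC:", average_precision_score(y_true, y_score))
```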

5 Results and Analysis

In this section, we compare our model's performance against other state-of-the-art methods and then comprehensively analyse our proposed methodology from various perspectives to gain a deeper understanding of its capabilities.

Main Results. The outcomes of our proposed technique and those of previous state-of-the-art approaches on all datasets are shown in Table 1. The AUROC values show that our method outperforms the top-performing DATE model by 3.85 and 3.2 percentage points on 20Newsgroups and AG News, respectively. In addition, our proposed technique surpasses CVDD by a significant margin of 17.4 points and 10.1 points on 20Newsgroups and AG News, respectively. Furthermore, on the Reuters-21578 dataset splits, our approach outperforms the best-performing CVDD method by 2.9 points and the DATE model by 7.8 points. This clearly illustrates the efficacy of our proposed method.

Table 1. AUROC (in %) and AUPRC (in %) results of all the baselines and proposed models on 20Newsgroups, AG News, and Reuters-21578. The reported values are the mean results obtained from multiple runs.

Datasets | Inlier | OCSVM AUROC | OCSVM AUPRC | CVDD AUROC | CVDD AUPRC | DATE AUROC | DATE AUPRC | FATE AUROC | FATE AUPRC
20 News | comp | 78.0 | 88.5 | 74.0 | 85.3 | 92.1 | 96.8 | 92.7 | 97.1
20 News | rec | 70.0 | 87.3 | 60.6 | 81.8 | 83.4 | 93.5 | 89.7 | 96.6
20 News | sci | 64.2 | 86.0 | 58.2 | 82.4 | 69.7 | 89.9 | 73.7 | 89.5
20 News | misc | 62.1 | 96.2 | 75.7 | 97.3 | 86.0 | 97.3 | 89.2 | 99.3
20 News | pol | 76.1 | 93.5 | 71.5 | 91.8 | 81.9 | 95.8 | 89.5 | 97.0
20 News | rel | 78.9 | 95.0 | 78.1 | 94.1 | 86.1 | 96.8 | 87.5 | 97.4
20 News | average | 71.6 | 91.1 | 69.7 | 88.8 | 83.2 | 95.2 | 87.1 | 96.2
AG News | business | 79.9 | 93.0 | 84.0 | 93.2 | 90.0 | 94.7 | 90.8 | 96.1
AG News | sci | 80.7 | 91.2 | 79.0 | 88.0 | 84.0 | 92.9 | 89.5 | 96.7
AG News | sports | 92.4 | 97.3 | 89.9 | 95.4 | 95.9 | 97.8 | 99.2 | 99.7
AG News | world | 83.2 | 91.9 | 79.6 | 88.5 | 90.1 | 95.0 | 93.3 | 96.5
AG News | average | 84.0 | 93.4 | 83.1 | 91.3 | 90.0 | 95.1 | 93.2 | 97.3
Reuters-21578 | earn | 91.1 | 87.2 | 93.9 | 88.7 | 97.4 | 97.5 | 97.6 | 98.0
Reuters-21578 | acq | 93.1 | 95.7 | 92.7 | 95.2 | 93.5 | 95.0 | 98.5 | 99.0
Reuters-21578 | crude | 92.4 | 99.1 | 97.3 | 99.3 | 78.0 | 97.7 | 95.8 | 99.6
Reuters-21578 | trade | 99.0 | 99.9 | 99.3 | 99.9 | 95.0 | 99.7 | 97.8 | 99.9
Reuters-21578 | money-fx | 88.6 | 99.3 | 82.5 | 99.0 | 88.3 | 98.7 | 94.2 | 99.7
Reuters-21578 | interest | 97.1 | 99.9 | 95.9 | 99.8 | 93.0 | 98.8 | 97.4 | 99.9
Reuters-21578 | ship | 91.2 | 99.7 | 96.1 | 99.8 | 77.7 | 98.9 | 96.5 | 99.9
Reuters-21578 | average | 93.2 | 97.3 | 94.0 | 97.4 | 89.0 | 98.0 | 96.8 | 99.4

Ablation Study. To validate the contributions of the different components in the proposed approach, we introduce four variants for the ablation study: i) FATE without Top-K: we remove the Top-K MIL layer and obtain the anomaly score by averaging the outputs of the MHSA layer. ii) FATE without MHSA: the MHSA layer is removed, and the output of the sentence encoder is fed directly to the Top-K layer for anomaly score computation. iii) FATE with BCELoss: we derive this variant by replacing the deviation loss with the BCE loss. iv) FATE with FocalLoss: focal loss [16] substitutes the deviation loss.

Table 2 presents the results of the ablation study. We observe that removing the top-K layer significantly affects the model's performance on some datasets. Removing the MHSA layer impacts the model's performance to some extent and, moreover, adversely affects the overall stability of the model. We also find that the FATE variant with deviation loss performs much better on average than its BCE-loss and focal-loss counterparts. We further investigate the top-K MIL module by varying the value of K. As shown in Fig. 3, the FATE model generally achieves improved performance as K increases from 1% to 10%. However, the performance starts to decline on some datasets beyond this value of K. We find that K = 10% works well on most of the datasets.

Sample Efficiency. To determine the optimal number of labeled anomalies needed to train FATE, we vary the number of labeled outliers from 5 to 40 in the training dataset, while the test data remains unaltered.


Table 2. AUROC (in %) performance of the ablation study.

Datasets | Inlier | FATE without Top-K | FATE without MHSA | FATE with BCELoss | FATE with FocalLoss | FATE
20 News | comp | 81.1 | 90.0 | 92.7 | 91.6 | 92.7
20 News | rec | 84.6 | 86.3 | 83.2 | 82.9 | 89.7
20 News | sci | 67.2 | 70.2 | 71.4 | 72.5 | 73.7
20 News | misc | 84.2 | 87.3 | 89.0 | 88.4 | 89.2
20 News | pol | 84.6 | 84.0 | 88.4 | 84.9 | 89.5
20 News | rel | 85.1 | 86.1 | 87.4 | 84.5 | 87.5
AG News | business | 84.7 | 88.7 | 87.6 | 84.7 | 90.8
AG News | sci | 78.0 | 88.6 | 84.7 | 87.1 | 89.5
AG News | sports | 94.5 | 97.5 | 99.0 | 98.8 | 99.2
AG News | world | 71.0 | 92.9 | 93.5 | 92.1 | 93.3
Reuters-21578 | earn | 89.3 | 96.8 | 98.1 | 96.9 | 97.6
Reuters-21578 | acq | 97.7 | 97.8 | 98.3 | 98.1 | 98.5
Reuters-21578 | crude | 85.5 | 94.1 | 92.4 | 90.9 | 95.8
Reuters-21578 | trade | 98.3 | 96.4 | 98.2 | 97.8 | 97.8
Reuters-21578 | money-fx | 97.2 | 92.9 | 94.1 | 92.3 | 94.2
Reuters-21578 | interest | 96.2 | 96.1 | 99.5 | 98.8 | 97.4
Reuters-21578 | ship | 94.2 | 95.6 | 98.2 | 91.1 | 96.5

Fig. 2. Sample efficiency: AUROC vs. the number of labeled outlier samples for few-shot learning on AG News

Fig. 3. Sensitivity: AUROC performance with respect to Top-K (%)

Fig. 4. AUROC vs. different anomaly contamination (%) in inlier training data. The experiments are conducted on 20 News and AG News datasets.


As shown in Fig. 2, the FATE model achieves remarkable performance with only 10–20 labeled outlier samples, which is a very small fraction (0.03–0.06% on AG News) of the available inlier training samples. This suggests that the FATE model is capable of efficiently utilizing labeled samples. Nonetheless, we also observe that the model's performance is affected when there are no representative samples in the outlier set from one of the outlier classes, highlighting the limitation of FATE in handling previously unseen classes.

Robustness. We further investigate the robustness of FATE in scenarios where the normal samples are contaminated with outliers. In this setup, anomalies are injected into the normal training data as contamination. We compare our model's performance with the other baselines. As shown in Fig. 4, all methods experience a decline in performance as the contamination rate increases from 0% to 15%. However, our proposed method is impressively resilient: it outperforms the best-performing DATE model by 5.6% on AG News and by 2.8% on 20 News even when the contamination rate is as high as 15%.

6 Concluding Remarks

In this paper, we proposed FATE, a new few-shot transformer-based framework for detecting anomalies in text that uses a sentence encoder and multi-head self-attention to learn anomaly scores end-to-end. This is done by matching the anomaly scores of normal samples to a reference derived from a prior distribution and then using a Z-score-based deviation loss to push the anomaly scores of outliers further away from the prior mean. Evaluation on three benchmark datasets demonstrated that FATE significantly outperforms state-of-the-art methods, even in the presence of significant data contamination, and uses labeled anomalous samples highly effectively, delivering exceptional results even with minimal examples. In future work, we aim to investigate the effectiveness of FATE in detecting anomalies in text data when dealing with unseen classes, and also to investigate the interpretability of FATE to help explain its decision-making process. This will help make it more transparent and understandable for end users.

Acknowledgements. This work was partially supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP), funded by the Knut and Alice Wallenberg Foundation.

References
1. Aggarwal, C.C., Aggarwal, C.C.: Supervised outlier detection. Outlier Anal. 219–248 (2017)
2. Arora, U., Huang, W., He, H.: Types of out-of-distribution texts and how to detect them. arXiv preprint arXiv:2109.06827 (2021)
3. Crawford, M., Khoshgoftaar, T.M., Prusa, J.D., Richter, A.N., Al Najada, H.: Survey of review spam detection using machine learning techniques. J. Big Data 2(1), 1–24 (2015)
4. Deecke, L., Vandermeulen, R., Ruff, L., Mandt, S., Kloft, M.: Image anomaly detection with generative adversarial networks. In: Berlingerio, M., Bonchi, F., Gärtner, T., Hurley, N., Ifrim, G. (eds.) ECML PKDD 2018, Part I. LNCS (LNAI), vol. 11051, pp. 3–17. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-10925-7_1
5. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
6. Gangal, V., Arora, A., Einolghozati, A., Gupta, S.: Likelihood ratios and generative classifiers for unsupervised out-of-domain detection in task oriented dialog. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 7764–7771 (2020)
7. Golub, G.H., Van Loan, C.F.: Matrix Computations. JHU Press, Baltimore (2013)
8. Görnitz, N., Kloft, M., Rieck, K., Brefeld, U.: Toward supervised anomaly detection. J. Artif. Intell. Res. 46, 235–262 (2013)
9. Guthrie, D., Guthrie, L., Allison, B., Wilks, Y.: Unsupervised anomaly detection. In: IJCAI, pp. 1624–1628 (2007)
10. Hadsell, R., Chopra, S., LeCun, Y.: Dimensionality reduction by learning an invariant mapping. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), vol. 2, pp. 1735–1742. IEEE (2006)
11. Hendrycks, D., Mazeika, M., Dietterich, T.: Deep anomaly detection with outlier exposure. arXiv preprint arXiv:1812.04606 (2018)
12. Kriegel, H.P., Kroger, P., Schubert, E., Zimek, A.: Interpreting and unifying outlier scores. In: Proceedings of the 2011 SIAM International Conference on Data Mining, pp. 13–24. SIAM (2011)
13. Lang, K.: Newsweeder: learning to filter netnews. In: Machine Learning Proceedings 1995, pp. 331–339. Elsevier (1995)
14. Lee, N., Bang, Y., Madotto, A., Khabsa, M., Fung, P.: Towards few-shot fact-checking via perplexity. arXiv preprint arXiv:2103.09535 (2021)
15. Lewis, D.D.: Reuters-21578 text categorization test collection, distribution 1.0 (1997)
16. Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
17. Lin, Z., et al.: A structured self-attentive sentence embedding. arXiv preprint arXiv:1703.03130 (2017)
18. Mai, K.T., Davies, T., Griffin, L.D.: Self-supervised losses for one-class textual anomaly detection. arXiv preprint arXiv:2204.05695 (2022)
19. Manevitz, L., Yousef, M.: One-class document classification via neural networks. Neurocomputing 70(7–9), 1466–1481 (2007)
20. Manolache, A., Brad, F., Burceanu, E.: DATE: detecting anomalies in text via self-supervision of transformers. arXiv preprint arXiv:2104.05591 (2021)
21. McGlohon, M., Bay, S., Anderle, M.G., Steier, D.M., Faloutsos, C.: SNARE: a link analytic system for graph labeling and risk detection. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1265–1274 (2009)
22. Moya, M.M., Koch, M.W., Hostetler, L.D.: One-class classifier networks for target recognition applications. NASA STI/Recon Technical Report N 93, 24043 (1993)
23. Pang, G., Ding, C., Shen, C., Hengel, A.V.D.: Explainable deep few-shot anomaly detection with deviation networks. arXiv preprint arXiv:2108.00462 (2021)
24. Pang, G., Shen, C., van den Hengel, A.: Deep anomaly detection with deviation networks. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 353–362 (2019)
25. Peng, J., Feldman, A., Vylomova, E.: Classifying idiomatic and literal expressions using topic models and intensity of emotions. arXiv preprint arXiv:1802.09961 (2018)
26. Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. arXiv preprint arXiv:1908.10084 (2019)
27. Ruff, L., et al.: Deep semi-supervised anomaly detection. arXiv preprint arXiv:1906.02694 (2019)
28. Ruff, L., Zemlyanskiy, Y., Vandermeulen, R., Schnake, T., Kloft, M.: Self-attentive, multi-context one-class classification for unsupervised anomaly detection on text. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 4061–4071 (2019)
29. Sakurada, M., Yairi, T.: Anomaly detection using autoencoders with nonlinear dimensionality reduction. In: Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis, pp. 4–11 (2014)
30. Schlegl, T., Seeböck, P., Waldstein, S.M., Schmidt-Erfurth, U., Langs, G.: Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. In: Niethammer, M., et al. (eds.) IPMI 2017. LNCS, vol. 10265, pp. 146–157. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59050-9_12
31. Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural Comput. 13(7), 1443–1471 (2001)
32. Sultani, W., Chen, C., Shah, M.: Real-world anomaly detection in surveillance videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6479–6488 (2018)
33. Tamersoy, A., Roundy, K., Chau, D.H.: Guilt by association: large scale malware detection by mining file-relation graphs. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1524–1533 (2014)
34. Tian, Y., Pang, G., Chen, Y., Singh, R., Verjans, J.W., Carneiro, G.: Weakly-supervised video anomaly detection with robust temporal feature magnitude learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 4975–4986 (2021)
35. Wang, M., Shao, Y., Lin, H., Hu, W., Liu, B.: CMG: a class-mixed generation approach to out-of-distribution detection. In: Proceedings of ECML/PKDD-2022 (2022)
36. Wu, Q., Jiang, H., Yin, H., Karlsson, B.F., Lin, C.Y.: Multi-level knowledge distillation for out-of-distribution detection in text. arXiv preprint arXiv:2211.11300 (2022)
37. Zhang, S., et al.: Label-assisted memory autoencoder for unsupervised out-of-distribution detection. In: Oliver, N., Pérez-Cruz, F., Kramer, S., Read, J., Lozano, J.A. (eds.) ECML PKDD 2021. LNCS (LNAI), vol. 12977, pp. 795–810. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-86523-8_48
38. Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: Advances in Neural Information Processing Systems, vol. 28 (2015)

MOC: Multi-modal Sentiment Analysis via Optimal Transport and Contrastive Interactions

Yi Li1,2, Qingmeng Zhu1,2, Hao He2(B), Ziyin Gu1,2, and Changwen Zheng2 (Y. Li and Q. Zhu contributed equally.)

1 University of Chinese Academy of Sciences, Beijing, China
[email protected]
2 Institute of Software Chinese Academy of Sciences, Beijing, China
{qingmeng,hehao21,ziyin2020,changwen}@iscas.ac.cn

Abstract. Multi-modal sentiment analysis (MSA) aims to utilize information from various modalities to improve the classification of emotions. Most existing studies employ attention mechanisms for modality fusion, overlooking the heterogeneity of different modalities. To address this issue, we propose an approach that leverages optimal transport for modality alignment and fusion, specifically focusing on distributional alignment. However, solely relying on the optimal transport module may result in a deficiency of intra-modal and inter-sample interactions. To tackle this deficiency, we introduce a double-modal contrastive learning module. Specifically, we propose a model MOC (Multi-modal sentiment analysis via Optimal transport and Contrastive interactions), which integrates optimal transport and contrastive learning. Through empirical comparisons on three established multi-modal sentiment analysis datasets, we demonstrate that our approach achieves state-of-the-art performance. Additionally, we conduct extended ablation studies to validate the effectiveness of each proposed module.

Keywords: Multi-modal sentiment analysis · Optimal transport · Contrastive learning

1 Introduction

With the rapid development of social media platforms such as Twitter and WeChat, it has become increasingly popular to share personal life updates in the form of images and text on these apps. As a result, there has been a significant increase in the amount of multimedia data, leading to a growing interest in the analysis of sentiment in multi-modal data. This has given rise to the field of multi-modal sentiment analysis. The objective of this task is to determine the emotional polarity by leveraging features from various modalities. In multi-modal sentiment analysis, information from different modalities is often


complementary [15,25]. For instance, as depicted in Fig. 1(a), BERT classifies the emotion of the text as “Neutral”. However, when the text is combined with the accompanying image, our MOC is able to capture the true sentiment as “Positive”.

Fig. 1. Subfigure (a) demonstrates the inherent complementarity of information derived from diverse modalities. Subfigure (b) accentuates the disparity between fusion techniques employing normal fusion and optimal transport.

A fundamental model in Multi-modal Sentiment Analysis (MSA) is tasked with mapping embeddings from various modalities into a shared latent space to capture informative feature information. In existing research, self-supervised learning [32] is commonly employed to acquire comprehensive representations. For instance, Self-MM [25] achieves informative representations through self-supervised multitask learning. Han et al. [8] introduce information entropy theory, aiming to maximize the mutual information between different modalities. In addition to self-supervised methods, attention mechanisms have gained widespread usage in MSA models. CLMLF [36] concatenates the feature vectors of images and texts to form a unified representation, which is then passed through the Transformer encoder to leverage its inherent attention mechanism. PMR [14] employs cross-attention to facilitate downstream tasks on multi-modal sequences, effectively reinforcing the interplay between the target modality and the original modality via attention mechanisms. In summary, the key to successful MSA lies in finding effective approaches to fuse feature vectors from different modalities. However, the aforementioned existing methods fail to consider the inherent heterogeneity among different modalities. To address this issue, we propose the utilization of the optimal transport method to explicitly align and fuse multiple modalities from the perspective of distributional alignment, which effectively eliminates the heterogeneity. Specifically, the fusion process of multi-modal features is formulated as an optimal transport (OT) problem. By minimizing the Wasserstein distance between different distributions, the embeddings of different modalities are transported to a common aligned space using the optimal transport plan, thereby overcoming spatial heterogeneity. The


distinction between fusion using optimal transport and conventional methods is illustrated in Fig. 1. Nevertheless, relying solely on the optimal transport module may result in a lack of intra-modal and inter-sample interactions. Therefore, we incorporate double-modal contrastive learning. Within this module, two data augmentation methods are introduced to construct positive pairs. Additionally, the contrastive learning paradigm facilitates the acquisition of modality-specific discriminative knowledge, further enhancing the representation learning process [12]. In this paper, we build a novel MSA model, and the main contributions are as follows: (1) We address the issue of heterogeneity among different modalities by proposing an optimal transport module for modality alignment and fusion. Notably, this is the first attempt to leverage the optimal transport technique to align and fuse image-text modalities for sentiment analysis, thereby mitigating the heterogeneity problem. (2) In order to facilitate intra-modality and inter-sample interactions, we introduce a novel contrastive learning module, which bypasses the need for attention mechanisms. (3) We conduct experiments on three public datasets, and the experimental results show that the proposed MOC achieves superior results to the existing competitive approaches (including the state-of-the-art).

2 Related Work

We divide this section into three subsections: (1) Multi-modal sentiment analysis; (2) Optimal transport; (3) Contrastive learning.

Multi-modal Sentiment Analysis: In recent years, there has been significant progress in multi-modal sentiment analysis. CLMLF [36] conducts sentiment analysis by using contrastive learning between the image modality and the text modality. MultiSentiNet [38] and HSAN [37] simply concatenate feature vectors of different modalities to obtain joint multi-modal features. Yang [27] proposes MVAN, which focuses on interaction between modalities by multi-view learning. Co-MN-Hop6 [21] is a co-memory network for iteratively modeling the interactions between multiple modalities. MGNNS [23] utilizes multi-channel graph neural networks with sentiment-awareness for image-text sentiment detection. It is worth noting that the existing models mentioned above commonly overlook the inherent heterogeneity present in different modalities, whereas our proposed MOC addresses this limitation from the perspective of distributional alignment.

Optimal Transport: In recent years, optimal transport has been gaining more and more attention for its potential in machine learning. Arjovsky et al. [1] transform the GAN (Generative Adversarial Network) problem into an OT theory problem. In [17], Solomon et al. propose an approximation algorithm for optimal transport


distances between different geometric domains. In [11], the authors develop a fast framework, referred to as WEGL (Wasserstein Embedding for Graph Learning), by which graphs can be embedded in a vector space. Optimal transport also plays an important role in domain adaptation. Flamary et al. [4] propose a model to align the representations of source and target domains by a regularized unsupervised optimal transportation. [20] proposes a novel framework for machine translation using optimal transport. Despite the significant progress that optimal transport has made in various fields, its application in sentiment analysis remains largely unexplored. In this work, we make a pioneering effort by leveraging optimal transport to tackle the task of MSA.

Contrastive Learning: Contrastive learning has become a popular research field in representation learning [13,28,33,36]. In both natural language processing (NLP) and computer vision (CV), plenty of models based on contrastive learning have been proposed. ConSERT [22], SimCSE [7], and CLEAR [19] apply contrastive learning in NLP; as for CV, MoCo [9], SimCLR [2], and SimSiam [3] have been proposed. Contrastive learning has also emerged as a prominent technique in the multi-modal field, and its potential for enhancing the quality of learned representations has attracted increasing attention: notable works such as [10,16,26] have demonstrated the efficacy of contrastive learning in improving representation quality in multi-modal tasks. Nevertheless, in the realm of contrastive learning, the problem of false positive samples has been a persistent challenge. In our work, we tackle this issue by leveraging the label associated with each pair of samples.

3 Preliminary of Optimal Transport

Suppose that we have two sets of points $X = (x^1, \cdots, x^n)$ and $Y = (y^1, \cdots, y^m)$. We first calculate the probability measures $\mu = \sum_{i=1}^{n} \alpha_i \delta(x^i)$ and $\nu = \sum_{j=1}^{m} \beta_j \delta(y^j)$ for the two sets, where the weights $\alpha_i = \frac{1}{n}$, $\beta_j = \frac{1}{m}$ can be considered as probability simplexes, and $\delta(\cdot)$ represents the Dirac (unit mass) function. Subsequently, let $C$ represent the cost matrix, whose element $C_{ij}$ is the cost of moving point $x^i$ to $y^j$. Hence the optimal transport between $\mu$ and $\nu$ can be formulated as follows:

$$OT(\mu, \nu; C) := \min_{T \in \mathbb{R}_{+}^{n \times m},\; T \mathbf{1}_m = \alpha,\; T^{\top} \mathbf{1}_n = \beta} \langle T, C \rangle, \qquad (1)$$

where $\langle T, C \rangle := \mathrm{tr}(T^{\top} C) = \sum_{ij} T_{ij} C_{ij}$.

Wasserstein Distance: We can consider the cost matrix $C$ with the $p$-norm, where the element $C_{ij}$ is defined as $\| x^i - y^j \|_p^p$. In this sense, the $p$-norm Wasserstein distance can be formulated as $W_p(\mu, \nu) := OT(\mu, \nu; \| \cdot \|_p^p)^{\frac{1}{p}}$. In this paper, we set $p = 2$.
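To make the formulation above concrete, the following is a minimal NumPy sketch of the Sinkhorn iterations for an entropically regularized transport plan and the corresponding 2-Wasserstein cost between two point sets. The regularization strength `eps` and the iteration count are illustrative assumptions, not values from the paper.

```python
import numpy as np

def sinkhorn_ot(X, Y, eps=0.05, n_iters=200):
    """Entropic OT between uniform measures on point sets X (n, d) and Y (m, d)."""
    n, m = X.shape[0], Y.shape[0]
    alpha = np.full(n, 1.0 / n)                      # weights alpha_i = 1/n
    beta = np.full(m, 1.0 / m)                       # weights beta_j = 1/m
    # Cost matrix with p = 2: C_ij = ||x_i - y_j||_2^2
    C = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    K = np.exp(-C / eps)                             # Gibbs kernel
    u, v = np.ones(n), np.ones(m)
    for _ in range(n_iters):                         # Sinkhorn-Knopp scaling iterations
        u = alpha / (K @ v)
        v = beta / (K.T @ u)
    T = u[:, None] * K * v[None, :]                  # plan: rows sum to alpha, columns to beta
    cost = (T * C).sum()                             # <T, C>
    return T, np.sqrt(cost)                          # plan and approximate W_2 distance

# toy usage
T, w2 = sinkhorn_ot(np.random.randn(5, 3), np.random.randn(7, 3))
```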


Fig. 2. Overview of our MOC model with two modalities.

4 Method

In this section, we present a comprehensive exposition of our MOC; the overall architecture is depicted in Fig. 2.

4.1 Optimal Transport

Given an image I and its corresponding sentence T, we initially obtain their embeddings h_I and h_T using ResNet and BERT models respectively. Prior research [18] emphasizes that text encompasses more task-specific features compared to the image modality. Consequently, we consider the text space as the target space and the image space as the original space. The process of transferring the image-space embedding into the text space is as follows:

(1) Calculate the measures μ, ν for the features of the two modalities h_T, h_I;
(2) Seek an optimal transport plan T from μ to ν by the Sinkhorn algorithm [5], where the element T_ij is the probability of transporting from the i-th feature dimension of h_I to the j-th dimension of h_T;
(3) Utilize T to transfer h_I into the new embedding ĥ_I [34]:

$$\hat{h}_I = \mathrm{diag}(1/\nu)\,(T^{\top} + \Delta T)\,h_I, \qquad (2)$$

where ΔT is a learnable parameter.
(4) After obtaining ĥ_I, the fused vector h_fusion can be acquired by:

$$h_{fusion} = \min_{h}\,[\alpha W_2(\hat{h}_I, h) + (1 - \alpha) W_2(h_T, h)], \qquad (3)$$


where α is a hyper-parameter. h_fusion is the final representation of I and T, and we employ a two-layer forward network to get the predicted label:

$$\hat{y} = \mathrm{GELU}(h_{fusion} W_1 + b_1) W_2 + b_2, \qquad (4)$$

where GELU is an activation function. The loss function in this module is:

$$L_{op} = \text{Cross-Entropy}(\hat{y}, y_{true}), \qquad (5)$$

It is noteworthy that our optimal transport module can easily be extended to cases where more than two modalities are considered.
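As a rough PyTorch sketch of the fusion procedure in Eqs. (2)–(5): the image embedding is moved into the text space with the transport plan plus the learnable correction ΔT, and a two-layer GELU network produces the prediction. The α-weighted interpolation below merely stands in for the Wasserstein-barycenter minimization of Eq. (3), and all dimensions and names are illustrative assumptions rather than the authors' exact implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class OTFusionHead(nn.Module):
    """Sketch of Eqs. (2)-(5): transport the image embedding into the text
    space and classify the fused representation (assumed dims/names)."""
    def __init__(self, dim=768, num_classes=3, alpha=0.2):
        super().__init__()
        self.delta_T = nn.Parameter(torch.zeros(dim, dim))  # learnable correction ΔT
        self.alpha = alpha
        self.fc1 = nn.Linear(dim, dim)
        self.fc2 = nn.Linear(dim, num_classes)

    def forward(self, h_img, h_txt, T, nu):
        # Eq. (2): ĥ_I = diag(1/ν)(Tᵀ + ΔT) h_I  (T: transport plan, ν: target marginal)
        h_img_hat = (T.t() + self.delta_T) @ h_img / nu
        # Eq. (3), approximated: α-weighted interpolation in place of the
        # explicit Wasserstein-barycenter minimization over h (assumption).
        h_fusion = self.alpha * h_img_hat + (1.0 - self.alpha) * h_txt
        # Eq. (4): two-layer forward network with GELU
        return self.fc2(F.gelu(self.fc1(h_fusion)))

# Eq. (5): cross-entropy over the predicted logits, e.g.
# loss = F.cross_entropy(head(h_img, h_txt, T, nu).unsqueeze(0), y_true)
```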

4.2 Double-Modal Contrastive Learning

In this module, two data augmentation methods are introduced to construct positive samples. We employ RandAugment [35] for image data and back-translation [39] for text data. RandAugment is a straightforward augmentation technique that uniformly samples from a predefined set of augmentation transformations, ensuring that the augmentation process covers a diverse range of transformations for the image data. For back-translation, the original text expressed in language E undergoes translation into language C; the translated text in language C is then translated back into language E, yielding the augmented text. This iterative translation process facilitates the generation of augmented text samples with increased variability and complexity.

For the embeddings h_m^i of a minibatch {h_m^1, h_m^2, ..., h_m^N} (m ∈ {I, T}), the augmented views ĥ_I^i and ĥ_T^i are obtained by the aforementioned augmentation methods, and {h_I^i, ĥ_I^i}, {h_T^i, ĥ_T^i}, {h_I^i, ĥ_T^i}, {h_T^i, ĥ_I^i} are constructed as positive pairs. In contrast to the self-supervised setting, negative pairs in our approach consist of pairs {h_m^i, h_m^j}, where j ∈ Γ_i and Γ_i represents the set of samples whose labels differ from the anchor label. This distinction allows us to formulate our contrastive loss as follows:

$$L_c = -\sum_{i}^{N} \log \frac{\sum_{m_1, m_2 \in \{I,T\}} e^{sim(h^{i}_{m_1}, \hat{h}^{i}_{m_2})/\tau}}{\sum_{m_1, m_2 \in \{I,T\}} e^{sim(h^{i}_{m_1}, \hat{h}^{i}_{m_2})/\tau} + \sum_{j \in \Gamma_i} \sum_{m \in \{I,T\}} e^{sim(h^{i}_{m}, h^{j}_{m})/\tau}}, \qquad (6)$$

where sim(·) denotes cosine similarity and τ is the temperature coefficient, which we set to 0.07 in this paper. In the practical training process, the overall loss function is:

$$L = L_{op} + \beta L_c, \qquad (7)$$

where β is the hyper-parameter balancing the influence of L_op and L_c.
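A minimal PyTorch sketch of the double-modal contrastive loss in Eq. (6), assuming batched image/text embeddings, their augmented views, and integer sentiment labels; Γ_i is realized as a label-mismatch mask and τ = 0.07 as stated above. Shapes and names are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def double_modal_contrastive_loss(h_img, h_txt, h_img_aug, h_txt_aug, labels, tau=0.07):
    """Sketch of Eq. (6). All embedding inputs are (N, d); labels is (N,)."""
    views = {"I": F.normalize(h_img, dim=-1), "T": F.normalize(h_txt, dim=-1)}
    views_aug = {"I": F.normalize(h_img_aug, dim=-1), "T": F.normalize(h_txt_aug, dim=-1)}
    N = labels.size(0)

    # Numerator: positive pairs {h_m1^i, ĥ_m2^i} over m1, m2 ∈ {I, T}
    pos = torch.zeros(N, device=labels.device)
    for m1 in ("I", "T"):
        for m2 in ("I", "T"):
            pos = pos + torch.exp((views[m1] * views_aug[m2]).sum(-1) / tau)

    # Negatives: {h_m^i, h_m^j} with j ∈ Γ_i (samples whose label differs from i's)
    mask = (labels[:, None] != labels[None, :]).float()          # (N, N) mask for Γ_i
    neg = torch.zeros(N, device=labels.device)
    for m in ("I", "T"):
        sims = torch.exp(views[m] @ views[m].t() / tau)           # pairwise exp-similarities
        neg = neg + (sims * mask).sum(-1)

    return -(torch.log(pos / (pos + neg))).sum()
```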

5 Experiments

5.1 Datasets and Implementation Details

We perform our experiments on three extensively utilized public datasets: MVSA-Single, MVSA-Multiple, and HFM. Each dataset is split into three portions, with a ratio of 8:1:1 for train, validation, and test respectively. The MVSA-Single and MVSA-Multiple datasets consist of three sentiment classes, while the HFM dataset is a binary classification dataset. The statistical details of these datasets are presented in Table 1. To evaluate the performance of our model, we adopt accuracy and F1 scores as the evaluation metrics, which provide comprehensive insights into the accuracy and the balance between precision and recall of our model's predictions. We utilize PyTorch to conduct our experiments on a single GPU. We use BERT-base to encode the text data and ResNet-50 for images. The batch size is set to 32 for all three datasets, and we use the AdamW optimizer with a learning rate of 2e-5.

Table 1. Details of three datasets.

Datasets      | Train | Test | Val  | Total
--------------|-------|------|------|------
MVSA-single   | 3611  | 450  | 450  | 4511
MVSA-multiple | 13624 | 1700 | 1700 | 17024
HFM           | 19816 | 2410 | 2409 | 24635
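A brief sketch of the encoding setup described above (BERT-base for text, ResNet-50 for images, AdamW at 2e-5). The checkpoint names, the mean pooling of token states, and the projection of the 2048-d ResNet feature to the 768-d text dimension are assumptions for illustration.

```python
import torch
from torch import nn
from transformers import BertModel, BertTokenizer
from torchvision import models

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
text_encoder = BertModel.from_pretrained("bert-base-uncased")          # h_T source, 768-d
resnet = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
image_encoder = nn.Sequential(*list(resnet.children())[:-1])           # drop the classification head
img_proj = nn.Linear(2048, 768)                                        # map to the text dimension (assumption)

def encode(text, image):
    tokens = tokenizer(text, return_tensors="pt", truncation=True)
    h_t = text_encoder(**tokens).last_hidden_state.mean(dim=1)         # mean-pooled text embedding
    h_i = img_proj(image_encoder(image).flatten(1))                    # image tensor: (1, 3, 224, 224)
    return h_t, h_i

params = list(text_encoder.parameters()) + list(image_encoder.parameters()) + list(img_proj.parameters())
optimizer = torch.optim.AdamW(params, lr=2e-5)
```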

5.2 Baselines

In order to provide a comprehensive comparison, we have chosen both well-known multi-modal and uni-modal models to be compared with our proposed model. Specifically, our multi-modal baselines consist of CLMLF [36], MultiSentiNet [38], HSAN [37], Co-MN-Hop6 [21], and MGNNS [23]; these models are widely recognized in the field of multi-modal sentiment analysis and serve as benchmarks for our evaluation. Additionally, we have selected a set of uni-modal models for comparison, including BERT [6], CNN [29], Bi-LSTM [31], ResNet [30], and OSDA [24] for uni-modal sentiment analysis.

5.3 Overall Results

The experimental results on the MVSA-single, MVSA-multiple, and HFM datasets are presented in Table 2 and Table 3. In our analysis, we categorize the models into three groups: Text, Image, and Multi-modal. Based on the results, we can make the following observations: (1) Our proposed method surpasses


Table 2. The results on MVSA-single and MVSA-multiple datasets. The best results are in bold.

Modality    | Model         | MVSA-single Acc | MVSA-single F1 | MVSA-multiple Acc | MVSA-multiple F1
------------|---------------|-----------------|----------------|-------------------|-----------------
Text        | CNN           | 0.6819 | 0.5590 | 0.6564 | 0.5766
Text        | BiLSTM        | 0.7012 | 0.6506 | 0.6790 | 0.6790
Text        | BERT          | 0.7111 | 0.6970 | 0.6759 | 0.6624
Text        | TGNN          | 0.7034 | 0.6594 | 0.6967 | 0.6180
Image       | Resnet-50     | 0.6467 | 0.6155 | 0.6188 | 0.6098
Image       | OSDA          | 0.6675 | 0.6651 | 0.6662 | 0.6623
Multi-modal | MultiSentiNet | 0.6984 | 0.6984 | 0.6886 | 0.6811
Multi-modal | HSAN          | 0.6988 | 0.6690 | 0.6796 | 0.6776
Multi-modal | Co-MN-Hop6    | 0.7051 | 0.7001 | 0.6892 | 0.6883
Multi-modal | MGNNS         | 0.7377 | 0.7270 | 0.7249 | 0.6934
Multi-modal | CLMLF         | 0.7533 | 0.7346 | 0.7200 | 0.6983
Multi-modal | MOC           | 0.7610 | 0.7426 | 0.7279 | 0.7073

Table 3. The results on HFM dataset.

Modality    | Model      | HFM Acc | HFM F1
------------|------------|---------|-------
Text        | CNN        | 0.8003  | 0.7532
Text        | BiLSTM     | 0.8190  | 0.7753
Text        | BERT       | 0.8389  | 0.8326
Image       | ResNet-50  | 0.7277  | 0.7138
Image       | ResNet-101 | 0.7248  | 0.7122
Multi-modal | D&R Net    | 0.8402  | 0.8060
Multi-modal | CLMLF      | 0.8543  | 0.8487
Multi-modal | MOC        | 0.8607  | 0.8547

the performance of existing competitive methods on all three datasets. This indicates the effectiveness and superiority of our approach in addressing the multi-modal sentiment analysis task. (2) The multi-modal models achieve superior performance compared to the uni-modal models. This finding highlights the advantage of leveraging multiple modalities to extract richer and complementary information for conducting downstream sentiment analysis tasks. By achieving better performance than existing methods and demonstrating the advantage of leveraging multiple modalities, our approach showcases its potential in capturing and leveraging diverse sources of information to improve the accuracy and effectiveness of sentiment analysis tasks.


Table 4. The ablation study on MVSA-single, MVSA-multiple and HFM datasets.

Model    | HFM Acc | HFM F1 | MVSA-single Acc | MVSA-single F1 | MVSA-multiple Acc | MVSA-multiple F1
---------|---------|--------|-----------------|----------------|-------------------|-----------------
MOC      | 0.8607  | 0.8547 | 0.7610          | 0.7426         | 0.7279            | 0.7073
W/O OT&C | 0.8456  | 0.8317 | 0.7289          | 0.7159         | 0.7047            | 0.6798
W/O OT   | 0.8517  | 0.8421 | 0.7394          | 0.7267         | 0.7167            | 0.6878
W/O C    | 0.8543  | 0.8474 | 0.7488          | 0.7377         | 0.7201            | 0.6987

5.4 Ablation Study

In order to investigate the contributions of different components in our MOC model, we conduct an ablation study as presented in Table 4. Specifically, we train and evaluate several variants of the model: W/O OT&C: This variant simply concatenates the feature vectors from the image and text modalities without utilizing the optimal transport and contrastive learning modules. W/O OT: This variant includes the contrastive learning module but excludes the optimal transport module. W/O C: This variant includes the optimal transport module but excludes the contrastive learning module. The results show that the performance of the model drops when any of the modules is removed, compared to the complete MOC model. This suggests that both optimal transport and contrastive learning modules contribute to the overall performance of multi-modal sentiment analysis. Furthermore, the removal of the optimal transport module has a greater impact on performance, indicating that reducing the heterogeneity between modalities is crucial for effective multi-modal fusion. Overall, the ablation study confirms the importance of both the optimal transport and contrastive learning modules in our MOC model, highlighting their roles in aligning modalities and facilitating interaction between them for improved sentiment analysis performance.

6 Analysis

In this section, further experiments are conducted to explore the deep properties of MOC.

6.1 Hyper-parameter Heatmap

Our model incorporates two hyperparameters, namely α and β. The value of α determines the Wasserstein center of the text and image embeddings, while β controls


the balance between the impact of L_op and L_c. We search for suitable values of α and β within the ranges {0.1, ..., 0.9} and {0.1, 0.5, 0.9} respectively. To examine the influence of α and β, we conduct experiments on the MVSA-single dataset, and the results are presented in Fig. 3. As depicted, the optimal performance is achieved when α = 0.2 and β = 0.1. Notably, this combination also yields the best results on the other two datasets.

6.2 Visualization

In this section, we present the results of the visualization experiment conducted on the HFM dataset. The purpose of this experiment is to visualize the feature vectors before the classification layer of the model. To achieve this, we employ the t-SNE dimensionality reduction algorithm, which projects high-dimensional vectors into a 2-dimensional space while preserving the distribution characteristics.
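A short sketch of the visualization step described above, assuming the pre-classifier feature vectors and binary sentiment labels have been dumped to files (the filenames are placeholders); scikit-learn's t-SNE is used as the dimensionality-reduction tool.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

# features: (num_samples, hidden_dim) vectors taken before the classification layer
# labels:   (num_samples,) binary sentiment labels (illustrative placeholders)
features = np.load("hfm_features.npy")
labels = np.load("hfm_labels.npy")

points = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(features)
for cls, name in [(0, "negative"), (1, "positive")]:
    sel = labels == cls
    plt.scatter(points[sel, 0], points[sel, 1], s=5, label=name)
plt.legend()
plt.savefig("tsne_hfm.png")
```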

Fig. 3. The impact of α, β on MVSA-single dataset.

In Fig. 4, we provide a scatter plot visualization of the feature vectors. It can be observed that our proposed MOC model effectively separates the two clusters of scatters, indicating a clear distinction between the sentiment categories. On the other hand, the feature vectors produced by the BERT model show a higher degree of overlap and intermingling between the two clusters. This suggests that the BERT model struggles to capture the distinct characteristics of different sentiment categories. Overall, the visualization results demonstrate the superiority of our MOC model in effectively separating and distinguishing sentiment clusters compared to the BERT model. This further validates the effectiveness of our proposed approach in multi-modal sentiment analysis.


Fig. 4. The visualization of positive and negative instances on HFM dataset.

7 Conclusion

In this paper, we propose a novel MOC for the multi-modal sentiment analysis task, which aims to eliminate the heterogeneity of different modalities and simultaneously address the scarcity of modality interactions. Specifically, MOC leverages optimal transport to move image embeddings to the text space and utilizes the wasserstein barycenter to fuse feature vectors from different modalities. Additionally, contrastive learning is introduced to enhance the encoder’s ability to learn modality-specific information and facilitate inter-sample interactions. To the best of our knowledge, this is the first work that utilizes optimal transport to improve the performance of multi-modal sentiment analysis (MSA). Extensive experimental results on three prevalent multi-modal sentiment analysis datasets demonstrate that MOC achieves superior results compared to existing competitive approaches. Acknowledgements. This work is supported by the National Key R&D Program of China (2022YFC3103800) and National Natural Science Foundation of China (62101552).

References

1. Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein GAN (2017)
2. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.E.: A simple framework for contrastive learning of visual representations. In: Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13–18 July 2020, Virtual Event (2020)
3. Chen, X., He, K.: Exploring simple Siamese representation learning. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, 19–25 June 2021, pp. 15750–15758. Computer Vision Foundation/IEEE (2021)
4. Courty, N., Flamary, R., Tuia, D., Rakotomamonjy, A.: Optimal transport for domain adaptation. IEEE Trans. Knowl. Data Eng. (2021)


5. Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: Burges, C.J.C., Bottou, L., Ghahramani, Z., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems (2013)
6. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL-HLT (2019)
7. Gao, T., Yao, X., Chen, D.: SimCSE: simple contrastive learning of sentence embeddings. In: Moens, M., Huang, X., Specia, L., Yih, S.W. (eds.) Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021 (2021)
8. Han, W., Chen, H., Poria, S.: Improving multimodal fusion with hierarchical mutual information maximization for multimodal sentiment analysis. In: EMNLP (2021)
9. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.B.: Momentum contrast for unsupervised visual representation learning. In: CVPR (2020)
10. Huang, P., Patrick, M., Hu, J., Neubig, G., Metze, F., Hauptmann, A.: Multilingual multimodal pre-training for zero-shot cross-lingual transfer of vision-language models. In: NAACL-HLT (2021)
11. Kolouri, S., Naderializadeh, N., Rohde, G.K., Hoffmann, H.: Wasserstein embedding for graph learning. In: ICLR (2021)
12. Li, J., et al.: Metamask: revisiting dimensional confounder for self-supervised learning. In: ICML (2022)
13. Liu, X., et al.: Self-supervised learning: generative or contrastive. CoRR (2021)
14. Lv, F., Chen, X., Huang, Y., Duan, L., Lin, G.: Progressive modality reinforcement for human multimodal emotion recognition from unaligned multimodal sequences. In: CVPR (2021)
15. Ngiam, J., Khosla, A., Kim, M., Nam, J., Lee, H., Ng, A.Y.: Multimodal deep learning. In: Getoor, L., Scheffer, T. (eds.) Proceedings of the 28th International Conference on Machine Learning, ICML (2011)
16. Radford, A., et al.: Learning transferable visual models from natural language supervision. In: ICML (2021)
17. Solomon, J., et al.: Convolutional Wasserstein distances: efficient optimal transportation on geometric domains. ACM Trans. Graph. 34(4), 66:1–66:11 (2015)
18. Tsai, Y.H., Bai, S., Liang, P.P., Kolter, J.Z., Morency, L., Salakhutdinov, R.: Multimodal transformer for unaligned multimodal language sequences. In: Korhonen, A., Traum, D.R., Màrquez, L. (eds.) Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL (2019)
19. Wu, Z., Wang, S., Gu, J., Khabsa, M., Sun, F., Ma, H.: CLEAR: contrastive learning for sentence representation. CoRR abs/2012.15466 (2020)
20. Xu, J., Zhou, H., Gan, C., Zheng, Z., Li, L.: Vocabulary learning via optimal transport for neural machine translation. In: ACL/IJCNLP (2021)
21. Xu, N., Mao, W., Chen, G.: A co-memory network for multimodal sentiment analysis. In: SIGIR (2018)
22. Yan, Y., Li, R., Wang, S., Zhang, F., Wu, W., Xu, W.: ConSERT: a contrastive framework for self-supervised sentence representation transfer. In: Zong, C., Xia, F., Li, W., Navigli, R. (eds.) ACL/IJCNLP (2021)
23. Yang, X.: Multimodal sentiment detection based on multi-channel graph neural networks. In: ACL (2021)
24. Yang, X., Feng, S., Wang, D., Zhang, Y.: Image-text multimodal emotion classification via multi-view attentional network. IEEE Trans. Multim. 23, 4014–4026 (2021)


25. Yu, W., Xu, H., Yuan, Z., Wu, J.: Learning modality-specific representations with self-supervised multi-task learning for multimodal sentiment analysis. In: AAAI (2021)
26. Yuan, X., et al.: Multimodal contrastive training for visual representation learning. In: CVPR (2021)
27. Zadeh, A., Liang, P.P., Poria, S., Vij, P., Cambria, E., Morency, L.: Multi-attention recurrent network for human communication comprehension. In: AAAI (2018)
28. Li, J., Qiang, W., Zheng, C., Su, B., Xiong, H.: Metaug: contrastive learning via meta feature augmentation. In: International Conference on Machine Learning, pp. 12964–12978. PMLR (2022)
29. Chen, Y.: Convolutional neural network for sentence classification. Master's thesis, University of Waterloo (2015)
30. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
31. Zhou, P., et al.: Attention-based bidirectional long short-term memory networks for relation classification. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 207–212 (2016)
32. Li, J., et al.: Modeling multiple views via implicitly preserving global consistency and local complementarity. IEEE Trans. Knowl. Data Eng. (2022)
33. Qiang, W., Li, J., Zheng, C., Su, B., Xiong, H.: Interventional contrastive learning with meta semantic regularizer. In: International Conference on Machine Learning, pp. 18018–18030. PMLR (2022)
34. Cao, Z., Xu, Q., Yang, Z., He, Y., Cao, X., Huang, Q.: OTKGE: multi-modal knowledge graph embeddings via optimal transport. Adv. Neural Inf. Process. Syst. 35, 39090–39102 (2022)
35. Cubuk, E.D., Zoph, B., Shlens, J., Le, Q.V.: RandAugment: practical automated data augmentation with a reduced search space. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 702–703 (2020)
36. Li, Z., Xu, B., Zhu, C., Zhao, T.: CLMLF: a contrastive learning and multi-layer fusion method for multimodal sentiment detection. arXiv preprint arXiv:2204.05515 (2022)
37. Xu, N.: Analyzing multimodal public sentiment based on hierarchical semantic attentional network. In: 2017 IEEE International Conference on Intelligence and Security Informatics (ISI), pp. 152–154. IEEE (2017)
38. Xu, N., Mao, W.: MultiSentiNet: a deep semantic network for multimodal sentiment analysis. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 2399–2402 (2017)
39. Sennrich, R., Haddow, B., Birch, A.: Improving neural machine translation models with monolingual data. arXiv preprint arXiv:1511.06709 (2015)

Two-Phase Semantic Retrieval for Explainable Multi-Hop Question Answering

Qin Wang, Jianzhou Feng(B), Ganlin Xu, and Lei Huang

School of Information Science and Engineering, Yanshan University, Qinhuangdao, China
[email protected]

Abstract. Explainable Multi-Hop Question Answering (MHQA) requires an ability to reason explicitly across facts to arrive at the answer. The majority of multi-hop reasoning methods concentrate on semantic similarity to obtain the next hops or perform entity-centric inference. However, approaches that ignore the rationales required for problems can easily lead to blindness in reasoning. In this paper, we propose a two-Phase text Retrieval method with an entity Mask mechanism (PRM), which focuses on the rationale from global semantics along with entity consideration. Specifically, it consists of two components: 1) The rationale-aware retriever is pre-trained via a dual encoder framework with an entity mask mechanism. The learned representations of hypotheses and facts are utilized to obtain the top K candidate core facts by a sentence-level dense retrieval. 2) The entity-aware validator determines the reachability of hypotheses and core facts with an entity-granularity sparse matrix. Our experiments on three public datasets in the scientific domain (i.e., OpenBookQA, Worldtree, and ARC-Challenge) demonstrate that the proposed model achieves remarkable performance over the existing methods.

Keywords: Question answering · Multi-hop · Explainable · Entity mask mechanism

1 Introduction

MHQA is the task of integrating two or more pieces of facts from external resources to address a particular problem [29]. Compared to single-hop QA, a primary challenge of multi-hop reasoning tasks is that the answers often do not exist in an isolated fact but instead need to be inferred from several related facts in an external corpus. Explainable MHQA requires the model to output the correct reasoning chain (or some equivalent explanation representations) along with the correct answer. Recent work in MHQA includes generating reasoning chains using large-scale sequence-to-sequence (Seq2Seq) models [11,23] and retrieval-based reasoning methods. Among the former approaches, the results are dominated by the


q: What is an example of a force producing heat? a: two sticks getting warm when rubbed together.

h: Two sticks getting warm when rubbed together is an example of a force producing heat.

wrong inference pull is force

a stick is a kind of object

correct inference

MASK Entity

h: Two sticks getting warm when rubbed together is an example of a [MASK] producing heat. a stick is a kind of object

Retriever friction is a kind of force

temperature rise means getting warm magnetic attraction pulls two objects together

friction causes the temperature of an object to increase

Fig. 1. Inference results using entity tracking (left) and our solution (right)

internal probability distribution which is still invisible and uncontrollable. In contrast, retrieval-based models can guarantee faithful facts and perform better at readability and scalability. In this mode, the retriever-reader approaches [3,9] cannot reflect the reasoning process explicitly, which reduces the comprehensibility of answers. A large quantity of studies emphasizes the significant role of entities in multi-hop processes. The methods based on graph structures [1,15], employing entities to identify inference paths [12,14] for Knowledge Base Question Answering (KBQA) or utilizing knowledge bases with unstructured texts [6,16] mostly interest in entity information to obtain the next hop. Besides, some methods generate reasoning paths by enjoying the benefit of semantic similarities [24,26]. However, in these approaches, the rationales of the problem are inevitably ignored, which limits the capabilities of reasoning and makes the process unreliable. In the multi-hop iterative retrieval process, they will even bring a continuous accumulation of errors, culminating in a catastrophic decline in model performance. Most retrieval-augmented methods emphasize the significance of entities in textual knowledge bases [1,7,21]. Existing work is also passionate about evaluating the evidence that holds the answer entity [8], but practically, few studies show a direct relationship between the retriever performance and the fact holding the answer entity. Therefore, we believe that the rationales can alleviate the entity-centric inference to some extent. As shown in Fig. 1, a hypothesis h is generated by a question q and a candidate answer a. In the left part, the entity-centric strategy exhibits the process of generating incorrect reasoning chains that lead to a wrong conclusion. The two explanatory sentences of “pull ” and “object” are erroneously introduced from the hypothesis by entity tracking, which attributes to the final consequence. Therefore, the lack of rationale comes with obvious limitations. We address the above issues by presenting the guidance of rationales, which can largely restrict the search space to a small range. To this end, we propose


a two-stage retrieval method merging both the rationale- and entity-aware approaches. Firstly, we form hypotheses by concatenating the question and each option [26]. Secondly, a rationale-aware retriever gets the top K facts with the highest score under the dual-encoder framework with the entity mask mechanism. Thirdly, we use an entity-aware validator to determine the reachability of hypotheses and facts via an entity-granularity sparse matrix. For example, as illustrated in the right part of Fig. 1, our solution acquires the core fact through the rationale-aware retriever first (solid line), then uses the entity-aware validator to verify the core fact (dotted line). Overall, our work contains the following contributions: (1) This paper proposes a novel and simple model framework, which combines rationale- and entity-aware retrieval methods in the inference process, so that adequate attention can be paid to the global semantics between hypotheses and facts. (2) The proposed model explains the reasoning process explicitly with faithful facts, which facilitates model analysis and debugging. (3) The experimental results on three benchmark datasets in the scientific domain reveal that the proposed model achieves remarkable improvements.

2 Related Work

Large-scale pre-training models have achieved great success in various NLP tasks, but they are still incompetent in MHQA systems that involve both language understanding and knowledge retrieval. Equipped with the semantic understanding abilities of large language models, some researchers have begun to perform MHQA tasks on unstructured text data directly. Some recent datasets with complete annotated explanations [10,27] and partial inference chains [17] have been provided to break through the bottleneck of multi-hop reasoning. Open-domain MHQA mainly includes two kinds of methods: augmenting retrieval skills and generating reasoning paths. The way to augment retrieval is to strengthen the ability of coarse-grained filtering of the open-domain corpus based on an information extraction model, e.g. AutoROCC [28] and FID [9]. To alleviate multi-hop problems, researchers have successively proposed iterative retrieval methods, such as MUPPET [7], PullNet [21] and GoldEn [19]. Methods that generate reasoning paths obtain the answers according to the hops of the reasoning chains. Recent approaches include incorporating knowledge into the model first and proceeding with answer generation [18,31], and the method that frames question answering as an abductive reasoning problem, constructing plausible explanations for each choice and then selecting the candidate with the best explanation as the final answer [24,26]. Large-scale Language Models (LLMs) learn context from a few examples without updating model parameters, and [2,11,23] use prompts to generate reasoning chains. However, the reasoning chains are not reliable due to the inherent limitations of generative models.

3 Methodology

3.1 Problem Definition

Given a question q, we need to obtain the correct answer a from the options o. The hypotheses h = {h1, h2, ..., hM} are formed by combining q with each option. The approach proposed by [5] can handle questions in the form of interrogative sentences; for other questions, the answer options are filled directly into the underlined part of the question sentence. In addition, an external fact corpus K containing an abundance of core facts and grounding facts is also required, and we need to retrieve all of the valid facts within K that can support the hypothesis.
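As a small sketch of hypothesis construction: for questions with a blank (underlined part), each option is substituted in place; otherwise a rule-based statement conversion (as in [5]) would be applied, which is only stubbed out here. The blank marker, the fallback concatenation, and the helper names are assumptions.

```python
def build_hypotheses(question: str, options: list[str], blank_marker: str = "___") -> list[str]:
    """Form one hypothesis per option, as described in Sect. 3.1 (sketch)."""
    hypotheses = []
    for option in options:
        if blank_marker in question:
            # fill-in-the-blank style question: substitute the option directly
            hypotheses.append(question.replace(blank_marker, option))
        else:
            # interrogative question: would be converted with a rule-based
            # question-to-statement rewriter (e.g. the approach of [5]); stubbed here
            hypotheses.append(f"{question.rstrip('?')} {option}")
    return hypotheses

hyps = build_hypotheses("What is an example of a force producing heat?",
                        ["two sticks getting warm when rubbed together", "a magnet"])
```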

Fig. 2. Overall framework of our proposed model. Note that we have omitted the subscripts of some symbols for better understanding.

3.2 Overview

We assume that each problem requires a core fact as the rationale [17], while other facts needed are called grounding facts. As shown in Fig. 2, there are two stages in PRM: rationale-aware dense retrieval and entity-aware sparse retrieval. Term-based sparse retrieval (BM25 [20]) and dense retrieval in representation space (DPR [13]) are two main retrieval methods. In rationale-aware dense retrieval, we use encoders to convert hypotheses and facts into dense representation in vector space, and the hypotheses will retrieve relevant facts using cosine similarity. In entity-aware sparse retrieval, sentences will be transformed into a sparse representation of entity granularity, and then entity matching is employed to complete the retrieval process.

3.3 Rationale-Aware Dense Retrieval

This paper assumes that the core fact for solving a certain problem comes from abstract domain facts, which have low similarity with the hypothesis. In previous entity-centric reasoning processes, overlooking the global semantics between sentences makes the inference process aimless. Therefore, we propose a rationale-aware global dense retriever. As shown in Fig. 3, we pre-train the retriever. Formally, we first mask a certain number of entities in the hypotheses. The masking mechanism is to prevent entities from dominating the mapping process. To maintain the relative integrity of the hypothesis semantics, the entities are hidden with a probability of ϕ.

Fig. 3. The training process of rationale-aware retrieval.

Secondly, the hypothesis and the corresponding core fact are input into the dual-encoder framework respectively, and the mean token vector of a sentence is used as its dense representation. Thirdly, we adopt the idea of [13] to train on the golden hypothesis h_i^+ of each question and its corresponding core fact e_i^+. The training data is D = <h_i^+, e_i^+>, i ∈ [1, N], where N is the number of sample pairs, h_i^+ indicates the correct hypothesis of the question, and e_i^+ indicates the supporting core fact corresponding to h_i^+, which is also viewed as the positive relevant fact. The core facts corresponding to other hypotheses in the same training batch are regarded as negative relevant facts e_{i,j}^-. We define sim(·) as the cosine similarity function, E_H(·) as the encoder that generates the hypothesis dense representation, and E_F(·) as the encoder that generates the fact dense representation; m is the batch size. We then optimize the loss function as the negative log-likelihood of the positive facts:

$$Loss(h_i^+, e_i^+) = -\log \frac{\exp\left(sim(h_i^+, e_i^+)\right)}{\exp\left(sim(h_i^+, e_i^+)\right) + \sum_{j=1}^{m} \exp\left(sim(h_i^+, e_{i,j}^-)\right)} \qquad (1)$$

$$sim(h, e) = E_H(h)^{\top} \cdot E_F(e) \qquad (2)$$
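A condensed PyTorch sketch of the pre-training objective in Eqs. (1)–(2): the dot product of hypothesis and fact encodings gives an in-batch similarity matrix, and the loss is the negative log-likelihood of each hypothesis's own core fact against the other core facts in the batch. Shapes and names are assumptions.

```python
import torch
import torch.nn.functional as F

def retrieval_nll_loss(h_emb, e_emb):
    """h_emb, e_emb: (m, d) encodings of masked hypotheses and their core facts.
    Row i of the similarity matrix treats e_i as the positive and the other
    in-batch core facts as negatives, matching Eq. (1); the loss is averaged
    over the batch."""
    sims = h_emb @ e_emb.t()                               # Eq. (2)-style similarities
    targets = torch.arange(h_emb.size(0), device=h_emb.device)
    return F.cross_entropy(sims, targets)                  # = -log softmax over in-batch facts
```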

Each hypothesis from the same question masks the same entity locations. After pre-training, we use cosine similarity to obtain the top K candidate core facts F_{i,j} for each hypothesis h_{i,j} from question q_i, i.e., F_{i,j} = top_k{sim(h_{i,j}, e_l), l ∈ [1, L]}, where L is the size of the fact corpus. Then, we form the core fact pool for each question: the top K core facts are chosen to compose the pool F_i, i.e., F_i = top_k{e_j, e_j ∈ [F_{i,1} ∪ F_{i,2} ∪ ... ∪ F_{i,M}]}, where M is the option size of the question.
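The following sketch illustrates forming the candidate sets F_{i,j} and the per-question core fact pool F_i with cosine similarity, as described above; the tensor shapes and the de-duplication policy (keeping each fact's best score across hypotheses) are assumptions.

```python
import torch
import torch.nn.functional as F

def core_fact_pool(hyp_embs, fact_embs, k=20):
    """hyp_embs: (M, d) embeddings of the M hypotheses of one question;
    fact_embs: (L, d) embeddings of the fact corpus.
    Returns the pooled top-K fact indices for the question."""
    sims = F.normalize(hyp_embs, dim=-1) @ F.normalize(fact_embs, dim=-1).t()  # (M, L) cosine sims
    per_hyp = sims.topk(k, dim=-1)                      # F_{i,j}: top-K facts per hypothesis
    # Union the per-hypothesis candidates, keep each fact's best score,
    # then take the overall top-K as the pool F_i.
    best = {}
    for scores, idxs in zip(per_hyp.values, per_hyp.indices):
        for s, idx in zip(scores.tolist(), idxs.tolist()):
            best[idx] = max(best.get(idx, float("-inf")), s)
    pooled = sorted(best.items(), key=lambda kv: kv[1], reverse=True)[:k]
    return [idx for idx, _ in pooled]
```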

3.4 Entity-Aware Sparse Retrieval

This module aims to obtain the entity matching score between hypotheses and core facts in the core fact pool F_i without any parameters.

Concepts of Entity. Entities are nouns or noun phrases mentioned frequently in the grounding facts. Specifically, we use the spaCy toolkit to preprocess all sentences in the grounding fact base and then extract frequent noun chunks within them as entities to build the vocabulary C.

Entity Linking. Entities occupy an important position in the reasoning process. The link between entities is defined as whether the entities are reachable, and we only consider entities in particular relationships (such as "kind of", "part of", "example of", and "similar to") as reachable to one another. We connect the same entity nodes in the grounding fact base, and determine whether there is a reachable path between any two entities through the Breadth-First-Search algorithm. After that, the entity linking relationship graph P can be constructed.

Sparse Retrieval. Entity-aware retrieval is added to filter the correct supporting core fact. We believe that even when there is no common entity between the hypothesis and the core fact because many jump steps are involved, an accessible path between them exists most of the time (see the dotted-line flow on the right side of Fig. 1). In addition, we regard the entities that appear in all the hypotheses of a question as ordinary entities and give them a comparatively low weight; the other entities in the hypothesis, which do not appear in all options at the same time, are treated as special entities and given a higher weight. The following formulas calculate the reachability scores between entities of the core fact and the hypothesis:

$$se(e_{c_i}, h_{c_j}) = I(c_i, c_j) \cdot \frac{S(c_j, T_h)}{|path(i, j)| + \epsilon} \qquad (3)$$

$$S(c_j, T_h) = \begin{cases} 1 & \text{if } c_j \in T_h \\ \beta & \text{if } c_j \notin T_h \end{cases} \qquad (4)$$

where I(u, v) is an indicator function indicating whether there is a reachable path from u to v in P, |path(i, j)| represents the path length from entity_i to entity_j, and a hyperparameter ε with a small value is added to prevent division by zero. T_h is the special entity set of hypothesis h, e_{c_i} is the i-th entity in the core fact, and h_{c_j} is the j-th entity in the hypothesis.
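A sketch of building the entity vocabulary and the reachability relation described above, using spaCy noun chunks and a breadth-first search over facts that share entities. The frequency threshold and the string-based way relation cues are detected here are simplifying assumptions, not the paper's exact procedure.

```python
import spacy
from collections import Counter, defaultdict, deque

nlp = spacy.load("en_core_web_sm")
RELATION_CUES = ("kind of", "part of", "example of", "similar to")  # relations treated as reachable

def build_entity_graph(grounding_facts, min_freq=3):
    # Entity vocabulary C: frequent noun chunks in the grounding facts
    counts = Counter(ch.text.lower() for fact in grounding_facts
                     for ch in nlp(fact).noun_chunks)
    vocab = {e for e, c in counts.items() if c >= min_freq}
    # Link entities that co-occur in a fact containing one of the cue relations
    graph = defaultdict(set)
    for fact in grounding_facts:
        if any(cue in fact for cue in RELATION_CUES):
            ents = [ch.text.lower() for ch in nlp(fact).noun_chunks if ch.text.lower() in vocab]
            for a in ents:
                for b in ents:
                    if a != b:
                        graph[a].add(b)
    return vocab, graph

def shortest_path_len(graph, src, dst):
    """BFS over the linking graph P; returns the hop count of the shortest
    reachable path, or None if dst is unreachable from src."""
    seen, queue = {src}, deque([(src, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == dst:
            return dist
        for nxt in graph[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None
```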


After the preceding procedures, we use the Kuhn-Munkres algorithm to calculate the entity-aware retrieval scores between hypotheses and core facts. Finally, taking the joint impacts of both rationale-aware and entity-aware retrieval into account, we set γ as their scaling hyperparameter and rate the confidence between core facts and hypotheses:

$$st(h, e) = \gamma \cdot KM(h, e) + (1 - \gamma) \cdot sim(h, e) \qquad (5)$$

where KM(h, e) represents the entity matching score computed by the Kuhn-Munkres algorithm with the entity reachability scores from Eq. (3).
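A sketch of Eqs. (3)–(5): each (core-fact entity, hypothesis entity) pair gets a reachability score, the Kuhn-Munkres matching is done with scipy's linear_sum_assignment in maximization mode, and the result is blended with the dense similarity. It reuses the shortest_path_len helper sketched earlier; the default constants (β = 0.2, γ = 0.4) follow the settings reported later, but the variable names are assumptions.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def entity_pair_score(path_len, c_j_is_special, beta=0.2, eps=1e-6):
    """Eqs. (3)-(4): reachability score for one fact entity / hypothesis entity pair."""
    if path_len is None:                       # I(c_i, c_j) = 0: not reachable
        return 0.0
    s = 1.0 if c_j_is_special else beta        # S(c_j, T_h)
    return s / (path_len + eps)

def km_score(fact_entities, hyp_entities, special_set, graph):
    """Kuhn-Munkres matching over the pairwise score matrix (maximization)."""
    scores = np.array([[entity_pair_score(shortest_path_len(graph, f, h), h in special_set)
                        for h in hyp_entities] for f in fact_entities])
    rows, cols = linear_sum_assignment(scores, maximize=True)
    return scores[rows, cols].sum()

def combined_score(km, sim, gamma=0.4):
    """Eq. (5): st(h, e) = γ·KM(h, e) + (1 − γ)·sim(h, e)."""
    return gamma * km + (1.0 - gamma) * sim
```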

3.5 Training Objective

We fine-tune our rationale-aware retriever on downstream tasks. The core fact e_i with the highest confidence score is picked to support the hypothesis, i.e., e_i = argmax{st(h_i, e), e ∈ F}, and then the hypothesis with the highest score among st(h_i, e_i) is selected as the final answer â = argmax{st(h_i, e_i), i ∈ [1, M]}. Finally, we use the cross-entropy loss function to fine-tune the model. During fine-tuning, only the parameters of the linear layer and the hypothesis-side encoder are updated in each iteration. Notably, in each iteration only the core fact with the highest score participates in the update process, due to the limitation of argmax. To make full use of the core fact pool, a cross-entropy term for the overall retrieval is added so that the overall relevance between spurious hypotheses and the core fact pool is farther than that of the correct hypothesis. The final training goal is:

$$L = \frac{1}{N}\sum_{i=1}^{M}\left[\,y_i \log(p_i) + y_i \log(avg(F_i))\,\right] \qquad (6)$$

where y = {y1 , y2 , ..., yM } is the one-hot representation of answer label. pi represents the probability that the i-th choice is the correct answer, and avg(Fi ) reflects the average score of all core facts corresponding to the i-th hypothesis in the core fact pool.
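To tie the pieces together, a small sketch of answer selection and the training objective: for each hypothesis the best-scoring core fact is kept, the highest-scoring hypothesis becomes the predicted answer, and the loss combines the per-choice term with the average pool score term from Eq. (6). The softmax normalization over choices for p_i and the sign convention used for minimization are assumptions about the implementation.

```python
import torch
import torch.nn.functional as F

def predict_and_loss(st_scores, label):
    """st_scores: (M, K) confidence st(h_i, e) for the M hypotheses over the
    K pooled core facts; label: tensor index of the correct choice."""
    best_fact_score, _ = st_scores.max(dim=-1)                   # e_i = argmax_e st(h_i, e)
    pred = best_fact_score.argmax()                              # â: highest-scoring hypothesis
    log_p = F.log_softmax(best_fact_score, dim=-1)               # log p_i over the M choices (assumed)
    avg_pool = torch.log(torch.clamp(st_scores.mean(dim=-1), min=1e-8))
    y = F.one_hot(label, num_classes=st_scores.size(0)).float()
    # negative of the Eq. (6) objective so it can be minimized (sketch)
    loss = -(y * (log_p + avg_pool)).sum()
    return pred, loss
```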

4 Experiment

4.1 Datasets

We experiment on three challenging multiple-choice datasets to demonstrate our proposed method: OpenBookQA [17], ARC-Challenge [4], and Worldtree [10]. They are all scientific QA tasks. Specifically, Worldtree requires 6 facts on average per answer (up to 20) [32]. Compared to HotpotQA (almost 2 hops) [29], they involve larger reasoning networks. The training/dev/test splits are detailed in Table 1.


Table 1. Number of questions in each QA dataset.

Dataset       | Train | Dev | Test  | Corpus
--------------|-------|-----|-------|----------
Worldtree     | 1,545 | 484 | 1,199 | 9,725
ARC-Challenge | 1,119 | 299 | 1,172 | 14,621,856
OpenBookQA    | 4,957 | 500 | 500   | 6,492

4.2 Baselines

The compared baselines for our approach are listed as follows: (1) PathNet [14] structures an inference path between the question and candidate answers, and then encodes the path to get answers. (2) MRC-Strategies [22] adds three strategies to handle MRC tasks. (3) AutoROCC [28] adopts an unsupervised way to freely combine the retrieved passages to form a reason set. (4) ExplanationLP [24] divides facts into grounding facts and abstract facts, and uses graph-based Bayesian inference and linear programming to improve performance. (5) GPT-3 [2] is one of the most advanced language models, with excellent language generation and context understanding abilities. (6) N-XKT [31] converts scientific texts into neural network representations, which are then implicitly used in downstream question answering tasks. (7) CB-ANLI [26] adopts the retrieve-reuse-revise framework and derives the answer in combination with similar cases, which has certain interpretability. (8) KiC [18] is a semi-parametric language model with six types of knowledge in the external memory, which adaptively retrieves the most helpful pieces of knowledge. (9) OPT [30] is a large language model that predicts answers using zero-shot and few-shot prompting.

4.3 Settings

We use BERT (bert-base, 768 dimensions, 12 layers) as our encoder. In the rationale-aware pre-training stage, the learning rate is 1e-5 and the batch size is 8. According to the statistics provided by OpenBookQA, questions contain 11.46 tokens on average and science facts 9.28 tokens on average, so it is reasonable to set the maximum length of the hypothesis and fact to 200. For the downstream question answering tasks, the initial learning rate is 5e-6 and the batch size is 4. The model is trained for 10 epochs on OpenBookQA and for 15 epochs on both Worldtree and ARC-Challenge. We also analyzed the scale hyperparameter γ: when γ is 0.4, OpenBookQA achieves the best results, and when it is 0.6, Worldtree and ARC-Challenge obtain the best benefit. In our pilot experiments, the best value of β is 0.2 and that of ϕ is 0.3.

4.4 Result Analysis

ARC-Challenge & OpenBookQA. Table 2 shows the performance of our model and several comparative models. Experiments are implemented with the


core fact pool size of 10 and 20 respectively. Three fact corpora are used on ARC-Challenge. When using ARC Corpus, we use BM25 [20] to get the top 50 facts of each hypothesis to form a small-scale fact base. As can be seen from Table 2, firstly, on OpenBookQA, when K =10, our method improves by 1.40% compared with KiC, which uses T5 as the backbone and incorporates about 290GB of triplet knowledge (much more than the external knowledge we use). Secondly, when using both OpenBookQA and Worldtree as the external fact corpora, ARC-Challenge achieves the best result among all compared models, which demonstrates the effectiveness of our model. Finally, PRM on ARC-Challenge and OpenbookQA is still valid compared to OPT-30B1 and GPT-32 which have an extremely large number of parameters. Note that although the large language models improve the performance of QA tasks with the increasing parameters, the hallucinations in the explanation cast doubt on the generated facts. Worldtree. BM25 and Sentence-BERT are deemed as the representatives for sparse and dense retrieval, respectively. We also fine-tune on slightly larger models like BERT-large and RoBERTa-large. As shown in Table 3, our model delivers a considerable improvement in inferring accuracy, and the best results are obtained when the core fact pool capacity is 20 (increased by 4.68%, 4.61% and 3.30% respectively). Table 2. The accuracy of ARC-Challenge and OpenBookQA test sets. External KB indicates the external fact corpus referenced in ARC-Challenge: 1. ARC Corpus; 2. OpenBook; 3. Worldtree [27]. All baseline results come from published papers.

Model                  | OpenBookQA | ARC-Challenge | External KB
-----------------------|------------|---------------|------------
IR BM25 (K = 10)       | 27.20      | 24.49         | 3
AutoROCC               | -          | 41.24         | 1
Reading Strategies+GPT | 55.20      | 40.70         | 1
ExplanationLP          | -          | 32.16         | 1
CB-ANLI                | -          | 36.77         | 1
N-XKT                  | 56.56      | 38.09         | 3
KiC                    | 58.20      | -             | -
OPT-30B                | -          | 31.10         | -
GPT-3                  | 57.60      | -             | -
PRM (K = 10)           | 59.60      | -             | -
PRM (K = 20)           | 57.80      | 42.27         | 2, 3
PRM (K = 20)           | -          | 40.19         | 3
PRM (K = 20)           | -          | 36.86         | 1

OPT-30B has 30B parameters, and the accuracy on ARC-Challenge is under the zero-shot setting. GPT-3 has 175B parameters, and the accuracy on OpenBookQA is under the zero-shot setting from [25].


Table 3. The accuracy on the Worldtree test set.

Model                  | Overall | Challenge | Easy
-----------------------|---------|-----------|------
BM25 (K = 20)          | 39.45   | 30.41     | 43.54
Sentence-BERT (K = 20) | 43.14   | 31.96     | 48.20
PathNet                | 41.50   | 36.42     | 43.32
BERT-large             | 46.19   | 31.96     | 52.96
RoBERTa-large          | 50.20   | 35.05     | 57.04
PRM (K = 20)           | 54.88   | 41.03     | 60.34
PRM (K = 30)           | 53.54   | 39.21     | 59.43

4.5 Ablation

Rationale-Aware Pre-training. In the process of pre-training, we expect the model to obtain a convergent mapping from hypothesis to core fact and thereby the ability to retrieve core facts. Therefore, some ablation experiments have been conducted to verify this assumption. Explanation indicates whether the core fact pool contains the golden core fact. We perform the analysis on the test-set of OpenBookQA, and Fig. 4 shows the change in the accuracy of answers and explanations at different stages of pre-training. As can be observed, as the number of pre-training steps rises, the explanation accuracy increases rapidly, along with a slower increase in answer accuracy. The result reveals that the rationale-aware pre-training module is effective: it enables the model to retrieve accurate core facts even when certain entities are hidden, and can further improve the accuracy of the task.

Fig. 4. The accuracy of answers and explanations on OpenBookQA.

Fig. 5. The effect of MASK ratios on OpenBookQA.

Entity Masking Ratios. To further investigate the impact of entity masking strategy on rationale-aware retrieval module, we conduct a comparison experiment on the entity masking ratios on OpenBookQA. As shown in Fig. 5, the ratio of masked entities has little influence on the accuracy of final answers. Firstly,


with the ratio increasing, the explanation accuracy descends, since at small scales the mapping from hypothesis to core fact relies on entities; as the importance of entities gradually decreases, the explanation accuracy declines. After the masking ratio reaches a threshold (close to 0.3), contextual semantics play a critical role in the mapping process. The explanation accuracy peaks at 53%/65% (0.4%/2.0% higher than without any entity masking), which also indicates the efficiency of our entity masking strategy. Finally, the impact of serious semantic loss causes the explanation accuracy to decline continually.

4.6 Explainability

We investigate the specific explainability of our proposed method, as well as its impact on downstream tasks. We evaluate on the dev-set of Worldtree (K = 20). To evaluate explainability, the predicted facts are considered accurate if they are part of the golden explanations. Table 4 presents a series of qualitative examples that illustrate the relationship between explainability and reasoning ability. In the first example, the correct answer is derived from an accurate explanation, and the number of explanations is relatively limited. When there is only one fact in the golden explanation set, this situation occurs for a total of 83.33% of correct answers; when the number of golden explanations is under 6, the accurate explanation rate is 68.14%. The second row depicts the situation in which incorrect results are produced from accurate explanations.

Table 4. Example of predicted explanations and answers. The underlined choices represent the correct answers. The bold choices represent the predicted answers. Accurate indicates whether the predicted explanations are part of the golden explanations. Correct indicates whether predicted answers are the golden answers.

Question | Explanation | Accurate | Correct
Which of these is most flexible? (A) Broom handle (B) Wooden ruler (C) Drinking straw (D) Sewing needle | (1) a drinking straw is flexible | Y | Y
A student left a bar of chocolate in the sun on a hot day. As the chocolate melted, which property changed? (A) its mass (B) its shape (C) its weight (D) its composition | (1) melting means matter; a substance changes from a solid into a liquid by increasing heat energy; (2) chocolate is a kind of solid usually | Y | N
Which description is an example of camouflage? (A) plant leaves closing up when touched (B) butterflies migrating south for the winter (C) green insects sitting on green leaves (D) earthworms moving through soil | (1) an example of camouflage is an organism looking like leaves | N | Y
What do all vertebrate animals have in common? (A) They all have fur (B) They all have backbones (C) They all are warm-blooded (D) They all have young that look like the adults | (1) a mammal usually has fur; (2) fur is often part of an animal | N | N


We think this happens when the explanations obtained are not sufficient. The examples in rows 3 and 4 show two scenarios in which inaccurate explanations occur. The issue may be that too many facts in the golden explanation sets make the accurate explanations hard to find.

Table 5. Examples of the difference between predicted explanations and golden explanations.

Question | Predicted Explanations | Golden Explanations | Accurate
If you bounce a rubber ball on the floor, it goes up and then comes down. What causes the ball to come down? (A) magnetism (B) gravity (C) electricity (D) friction | (1) gravity accelerates an object while that object falls by pulling on it; (2) a ball is a kind of object | (1) gravity; gravitational force causes objects that have mass; substances to be pulled down; to fall on a planet; (2) a ball is a kind of object; (3) come down is similar to falling | Y
When notebook paper is folded to make an airplane, what physical property of the paper changes? (A) Mass (B) Weight (C) Shape (D) Smell | (1) as the shape of an object changes, the mass of that object will stay the same; (2) mass is a kind of physical property | (1) folding an object causes that object to change shape; (2) a paper is a kind of object; (3) shape is a kind of physical property | N

In addition, existing research reveals that automatic evaluation algorithms always underestimate the explainability of models. As shown in Table 5, although the predicted explanations are different from the golden explanations, we still believe they are reasonable explanations for the answers. Apparently, under literal matching, explanations like those in the second row would be regarded as spurious.

5 Conclusion

This paper presents a novel method for abstract scientific QA. Firstly, domain facts are divided into core facts and grounding facts, and the mapping relationship between the entities covered by correct hypotheses and the core facts is learned by the pre-trained model. Secondly, an entity-aware retrieval module is added, which is a non-parametric sparse retriever over the grounding-fact corpus in the field. Finally, the inference model is obtained through fine-tuning. Our experiments demonstrate the effectiveness of diluting the impact of entities and attending to semantics in reasoning, and the method also contributes to interpretability to some degree. In future work, we will further investigate how to integrate contextual semantics and entities more naturally in the reasoning process, so that the multi-hop reasoning process can be more accurate and clearer.

Acknowledgments. This work is supported by the National Natural Science Foundation of China (62172352), the Central Leading Local Science and Technology Development Fund Project (No. 226Z0305G), the Project of Hebei Key Laboratory of Software Engineering (22567637H), the Natural Science Foundation of Hebei Province (F2022203028) and the Program for Top 100 Innovative Talents in Colleges and Universities of Hebei Province (CXZZSS2023038).


Efficient Spiking Neural Architecture Search with Mixed Neuron Models and Variable Thresholds

Zaipeng Xie1,2(B), Ziang Liu2, Peng Chen2, and Jianan Zhang2

1 Key Laboratory of Water Big Data Technology of Ministry of Water Resources, Hohai University, Nanjing, China
2 College of Computer and Information, Hohai University, Nanjing, China
{zaipengxie,ziangliu,pengchen,jianan zhang}@hhu.edu.cn

Abstract. Spiking Neural Networks (SNNs) are emerging as energy-efficient alternatives to artificial neural networks (ANNs) due to their event-driven computation and effective processing of temporal information. While Neural Architecture Search (NAS) has been extensively used to optimize neural network structures, its application to SNNs remains limited. Existing studies often overlook the temporal differences in information propagation between ANNs and SNNs. Instead, they focus on shared structures such as convolutional, recurrent, or pooling modules. This work introduces a novel neural architecture search framework, MixedSNN, explicitly designed for SNNs. Inspired by the human brain, MixedSNN incorporates a novel search space called SSP, which explores the impact of utilizing Mixed spiking neurons and Variable thresholds on SNN performance. Additionally, we propose a training-free evaluation strategy called Period-Based Spike Evaluation (PBSE), which leverages spike activation patterns to incorporate temporal features in SNNs. The performance of SNN architectures obtained through MixedSNN is evaluated on three datasets, including CIFAR-10, CIFAR-100, and CIFAR10-DVS. Results demonstrate that MixedSNN can achieve state-of-the-art performance with significantly lower timesteps.

Keywords: Spiking Neural Network · Neural Architecture Search · Brain-inspired Computing · Mixed Neuron Models

1 Introduction

Spiking Neural Networks (SNNs) [19] have rapidly ascended as a potent field of research, simulating the intricate behavior of biological neurons through a highly congruent model. Setting themselves apart from traditional neural networks, SNNs exploit discrete spikes or pulses to represent neuron firing, thereby promoting superior efficiency and biological realism. This unique approach has attracted substantial interest across a diverse range of applications. SNNs have demonstrated a capacity to serve as energy-efficient alternatives to conventional Artificial Neural Networks (ANNs).

Supported by The Belt and Road Special Foundation of the State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering under Grant 2021490811.

The recent trend in SNN research involves adapting architectures originally designed for ANNs, such as VGG-Net [25] and ResNet [9]. Although many studies [4,14,16] have utilized Neural Architecture Search (NAS) to optimize network structures in ANNs, there remains limited exploration of such techniques for SNNs [10,22]. Notably, these studies frequently employ NAS search spaces and algorithms developed for ANNs, while neglecting the distinctive temporal differences in information propagation between ANNs and SNNs. Spiking neurons asynchronously transmit information via sparse and binary spikes over multiple time steps.

Given the fundamental differences between SNNs and ANNs, substantial attention has been dedicated to the optimization of SNNs by exploring efficient spiking neuron models [1,6,31]. Similar to those in the human brain, spiking neurons often utilize varied activation patterns and thresholds to enhance neural circuit functionality. However, the performance implications of integrating mixed activation patterns and thresholds necessitate deeper investigation. Additionally, the impact of temporal information propagation on network architecture design frequently remains overlooked.

This work introduces MixedSNN, a novel framework for searching Spiking Neural Network (SNN) architectures. MixedSNN investigates mixed neuron types and variable thresholds within spiking layers to design high-performance SNNs. We utilize a dedicated search space, SSP, to explore diverse spiking neuron types and thresholds to enhance efficiency. Effective navigation within SSP requires rapid network performance evaluation. To address this, MixedSNN employs a training-free approach named Period-Based Spike Evaluation (PBSE) to capture temporal information propagation in SNNs. Furthermore, we incorporate PBSE scores as fitness metrics in a niche genetic algorithm [15] for SNN architecture discovery. Experimental results demonstrate that the SNN architecture discovered by MixedSNN outperforms state-of-the-art algorithms on the CIFAR10-DVS dataset. Moreover, for architectures with similar network depths, our model exhibits considerable improvements on the CIFAR-10 and CIFAR-100 datasets.

Our contributions can be summarized as follows:
– We propose a novel Neural Architecture Search paradigm, MixedSNN, that utilizes mixed spiking neurons and flexible thresholds to enhance the performance of Spiking Neural Networks in classification tasks.
– MixedSNN implements a unique training-free evaluation approach, known as Period-Based Spike Evaluation, which takes into account the temporal information of spiking neuron outputs. This approach provides potential avenues for assessing the performance of a spiking neural network without the need for comprehensive training.
– We evaluate the performance of MixedSNN by comparing it with several state-of-the-art algorithms. Experimental results demonstrate that the SNN architecture realized through MixedSNN excels in classification tasks on the CIFAR10-DVS dataset. Furthermore, in comparison with network architectures of comparable depths, the SNN architecture exhibits significant performance improvements across a wide range of datasets, as evidenced by enhancements on the CIFAR-10 and CIFAR-100 datasets.

2 Related Work

Spiking Neural Networks (SNNs) [27], owing to their bio-inspired approach, capabilities for temporal processing, energy efficiency, and advancements in supporting hardware, have garnered significant attention. However, the integration of spiking neurons, characterized by their discrete spiking nature and complex temporal dynamics [1,6,31], with backpropagation presents a notable challenge, distinctly separating them from traditional artificial neural networks (ANNs). Achieving high SNN performance is challenging due to non-differentiable backpropagation. Spiking neurons' binary signaling contrasts with continuous activations in ANNs, hindering traditional gradient-based training.

The conversion from ANNs to SNNs offers a significant strategy [8,12,24,28] to address the non-differentiability challenge in SNNs. Guo et al. [8] propose a combined ANN-to-SNN framework utilizing weight factorization and self-distillation. Wang et al. [28] introduce a technique combining soft-reset and parameter normalization to preserve membrane potential. Although the converted SNNs can achieve accuracies comparable to ANNs, they often rely on pre-trained ANNs and may require substantial timesteps, resulting in energy-inefficient spike generation.

In contrast to the ANN-to-SNN conversion, recent years have witnessed the emergence of direct training with surrogate gradient approaches [3,18,21,23,29–31,34]. These methods have gained prominence due to their superior performance and reduced timestep requirements. For example, Deng et al. [3] propose a temporal efficient training approach that optimizes each timestep's output rather than just the final integrated output. Zheng et al. [34] introduce a threshold-dependent batch normalization (tdBN) method for training deep SNNs with spatio-temporal backpropagation, mitigating gradient issues and balancing neuron firing rates. Thus, adopting a direct training approach offers a path to obtaining energy-efficient SNNs.

The weight-sharing [16,22] and cell-based [4,10,14,33] approaches are popular techniques for automatic neural network discovery. The weight-sharing strategy employs a supernet encompassing subnetworks, efficiently searching architectures through parameter and weight sharing. For example, Liu et al. [16] propose a differentiable method for architecture search, departing from traditional evolution or reinforcement learning methods. Na et al. [22] investigate a spike-aware architecture search to optimize SNNs regarding accuracy and spike count. The cell-based approach represents neural networks using repeated cell structures. Dong et al. [4] introduce NAS-Bench-201, an extension of NAS-Bench-101 [33], serving as a unified benchmark for NAS algorithms. Li et al. [14] propose T2IGAN, a text-to-image synthesis network architecture discovered through NAS. Kim [10] presents a NAS approach incorporating temporal feedback connections for SNN design. However, when applying NAS to SNNs, the focus often lies on adapting an equivalent ANN architecture, resulting in limited research addressing the distinct structures of SNNs.

As datasets and neural network architectures become more intricate, mitigating the computational burden of NAS methods is increasingly vital. Recent investigations [2,10,17,20,26,32] have explored training-free evaluation techniques predicting neural network performance without actual training. For instance, Chen et al. [2] merge linear regions theory and the neural tangent kernel (NTK) for architecture assessment. Sun et al. [26] apply the maximum entropy principle for evaluation. Xu et al. [32] propose a gradient-kernel-based method, while Mellor et al. [20] introduce a network kernel matrix approach based on linear regions theory. Lopes et al. [17] present an evaluation using the kernel matrix. Furthermore, Kim et al. [10] extend the kernel matrix approach to capture the temporal characteristics of SNNs. While training-free evaluation approaches reduce computational costs for architecture search, limited attention has been given to the temporal characteristics of SNNs. To address this gap, we introduce a novel training-free method that employs the kernel matrix, specifically focusing on capturing the temporal characteristics of SNNs.

3 Preliminaries

Spiking neurons in SNNs accumulate synaptic inputs for membrane potential updates. In contrast, neurons in Artificial Neural Networks (ANNs) produce continuous and differentiable output. When the membrane potential surpasses a threshold, spiking neurons generate discrete spikes, followed by a reset. Various neuron models differ in their biological realism and computational efficiency. The Integrate-and-Fire (IF) model [1] is simple, with spikes occurring after potential accumulation reaches a threshold. The charge equation for the IF model is

$H_t = V_{t-1} + X_t,$  (1)

where $H_t$ is the membrane potential value before a spike is fired, $X_t$ symbolizes the presynaptic inputs at time t, and $V_t$ is the membrane potential value. The Leaky Integrate-and-Fire (LIF) model's [7] charge equation is as follows:

$H_t = V_{t-1} + \frac{1}{\tau}\left[X_t - (V_{t-1} - V_{reset})\right],$  (2)

where $\tau$ is a membrane time constant controlling the decay of the membrane potential and presynaptic inputs, and $V_{reset}$ signifies the reset value for the membrane potential post-activation. The Parametric Leaky Integrate-and-Fire (PLIF) spiking neuron model [6] learns both the synaptic weights and $\tau$:

$H_t = V_{t-1} + \frac{1}{\tau}\left[X_t - (V_{t-1} - V_{reset})\right],$  (3)

where $1/\tau = \mathrm{Sigmoid}(\omega)$ and $\omega$ is a trainable parameter. The Quadratic Integrate-and-Fire (QIF) model [7] introduces a quadratic term:

$H_t = V_{t-1} + \frac{1}{\tau}\left[X_t + a_0 \cdot (V_{t-1} - V_{rest}) \cdot (V_{t-1} - V_c)\right],$  (4)

where $V_c$ represents the critical voltage. The Exponential Integrate-and-Fire (EIF) model [7] has an exponential term for rapid potential escalation:

$H_t = V_{t-1} + \frac{1}{\tau}\left[X_t + V_{rest} - V_{t-1} + \Delta_T \cdot e^{\frac{V_{t-1} - \theta_{rh}}{\Delta_T}}\right],$  (5)

where $\Delta_T$ is the sharpness parameter and $\theta_{rh}$ is the rheobase threshold. The selection of an appropriate neuron model for a spiking neural network depends on the specific requirements and the balance between computational efficiency and biological fidelity.
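To make the charge-fire-reset cycle of Eqs. (1)-(5) concrete, the following is a minimal NumPy sketch rather than the implementation used in MixedSNN; the time constant, threshold, reset value and QIF/EIF constants are illustrative assumptions, and the resting potential is taken equal to the reset value for brevity.

```python
import numpy as np

def charge(neuron_type, v, x, tau=2.0, v_reset=0.0, a0=1.0, v_c=0.8,
           delta_t=0.5, theta_rh=0.5):
    """Membrane charge step H_t for one timestep, following Eqs. (1)-(5)."""
    if neuron_type == "IF":      # Eq. (1)
        return v + x
    if neuron_type == "LIF":     # Eq. (2); PLIF (Eq. (3)) is identical but learns 1/tau
        return v + (x - (v - v_reset)) / tau
    if neuron_type == "QIF":     # Eq. (4), with V_rest approximated by v_reset
        return v + (x + a0 * (v - v_reset) * (v - v_c)) / tau
    if neuron_type == "EIF":     # Eq. (5), with V_rest approximated by v_reset
        return v + (x + v_reset - v + delta_t * np.exp((v - theta_rh) / delta_t)) / tau
    raise ValueError(neuron_type)

def run(neuron_type, inputs, v_threshold=1.0, v_reset=0.0):
    """Simulate one neuron over T timesteps; returns the binary spike train."""
    v, spikes = v_reset, []
    for x in inputs:
        h = charge(neuron_type, v, x)
        s = 1.0 if h >= v_threshold else 0.0   # fire when the threshold is crossed
        v = v_reset if s else h                # hard reset after a spike
        spikes.append(s)
    return np.array(spikes)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.uniform(0.0, 0.6, size=20)         # presynaptic input X_t over 20 timesteps
    for kind in ("IF", "LIF", "QIF", "EIF"):
        print(kind, run(kind, x))
```

The same input current produces different spike trains under the different charge rules, which is precisely the behavioural diversity that the mixed-neuron search space exploits.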

4 Methodology

Several neuron models can be used to construct mixed neuron models that allow for a more comprehensive representation of neural behavior and functionality within the network. By incorporating diverse neuron types, the SNN can benefit from their respective strengths, such as computational efficiency, biological realism, and capturing various dynamics. This approach enables the network to achieve a balance between computational efficiency and biological fidelity while leveraging the unique properties of different neuron models, ultimately enhancing the SNN’s capabilities, flexibility, and ability to handle complex tasks. Neural architecture search methods often prioritize shared structures between ANNs and SNNs. They frequently transfer evaluation strategies from ANNs to SNNs, neglecting the unique neuron model traits. We propose MixedSNN, a NAS approach that tackles these issues. MixedSNN establishes a search space for diverse spiking neuron models and membrane potential thresholds. It employs a training-free evaluation strategy, PBSE, considering spiking neurons’ temporal characteristics. This strategy integrates within MixedSNN’s framework, including a dedicated search algorithm for optimal PBSE-based designs. Figure 1 illustrates our proposed MixedSNN strategy.

Fig. 1. Diagram of our MixedSNN strategy. It creates a search space SSP, explores with the search algorithm, evaluates using PBSE, and outputs the best architecture.

4.1 Search Space

This study introduces a designated search space, SSP, encompassing distinct spiking neuron layers. Each layer can choose a spiking neuron model and its corresponding membrane potential threshold. The set of spiking neuron models, SN, in SSP includes the IF, LIF, PLIF, QIF, and EIF models as defined:

$SN \triangleq \{IF, LIF, PLIF, QIF, EIF\}.$  (6)

Using a step size of 0.1 for the membrane potential threshold, we define the threshold set $SV \triangleq \{0.1, 0.2, \cdots, 1.9, 2.0\}$. Let $NSP_i$ be the Cartesian product of SN and SV for each spiking neuron layer; thus, $NSP_i = SN \times SV$. The search space SSP is computed by forming the Cartesian product of each $NSP_i$, corresponding to a specific SNN layer. Hence, SSP can be expressed as:

$SSP = NSP_1 \times NSP_2 \times \cdots \times NSP_N.$  (7)
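As a concrete illustration of Eqs. (6)-(7), the sketch below enumerates the per-layer options NSP_i = SN x SV and samples one candidate architecture; the five-layer depth mirrors the VGG5-like backbone used later, but is only an assumption here.

```python
import itertools
import random

SN = ["IF", "LIF", "PLIF", "QIF", "EIF"]              # Eq. (6): neuron models
SV = [round(0.1 * i, 1) for i in range(1, 21)]        # thresholds 0.1, 0.2, ..., 2.0
NSP = list(itertools.product(SN, SV))                 # options for one spiking layer

def sample_architecture(num_layers=5, rng=random):
    """Draw one point of SSP = NSP_1 x ... x NSP_N (Eq. (7))."""
    return [rng.choice(NSP) for _ in range(num_layers)]

if __name__ == "__main__":
    random.seed(0)
    print("options per layer:", len(NSP))             # 5 * 20 = 100
    print("total SSP size for 5 layers:", len(NSP) ** 5)
    print("sampled candidate:", sample_architecture())
```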

4.2 Evaluation Strategy

We propose a novel evaluation strategy, Period-Based Spike Evaluation (PBSE), built on the Homogeneous Poisson Point Process, to capture spiking neuron temporal characteristics more effectively. While Mellor et al. [20] and Lopes et al. [17] focus on assessing network potential through neural layer activations, Kim et al. [10] examine spiking neuron output per timestep. However, these approaches overlook the crucial temporal dynamics in spiking neural networks.

We leverage the Homogeneous Poisson Process (HPP) to effectively model consistent random event occurrence within defined time intervals. Our utilization of the HPP in this study focuses on characterizing the activation of spiking neurons across timesteps, providing a more adept capture of temporal intricacies in Spiking Neural Networks. We hypothesize that spiking neuron outputs within each layer conform to an HPP distribution, i.e., $n_{ik}^l(t) \sim \mathrm{Poisson}(\lambda_{ik}^l(t))$, where $n_{ik}^l(t)$ denotes the activation state of the i-th neuron in the l-th spiking neural layer upon introducing the k-th data point from the batch data. These distributions are presumed mutually independent, and we represent t as a finite set, $t \in \{1, 2, \ldots, TimeStep\}$, with TimeStep as the maximum number of time steps. To accurately gauge the variations in the outputs of data points within the layer, we estimate the expectation of the HPP in $[0, TimeStep]$ using the number of neuron activations. The expectation of $n_{ik}^l(t)$ is then given by

$E[n_{ik}^l(T)] = \lambda_{ik}^l T.$  (8)

Subsequently, we use maximum likelihood estimation to ascertain the parameter $\hat{\lambda}$, an estimate of $\lambda$ for the HPP. This is done under the condition that $T = TimeStep$. The process can be represented as follows:

$\hat{\lambda}_{ik}^l = \frac{E[n_{ik}^l(T)]}{T}.$  (9)

Therefore, the activation of a spiking neuron layer within the network can be characterized by the estimated parameter $\hat{\lambda}$ for each spiking neuron. We denote the activation vector at the l-th layer for the k-th data point in the input batch as $c_k^l$, while $N_l$ represents the number of spiking neurons in the l-th layer. The definition of $c_k^l$ is as follows:

$c_k^l = \{\hat{\lambda}_{1k}^l, \hat{\lambda}_{2k}^l, \cdots, \hat{\lambda}_{N_l k}^l\}.$  (10)

Upon computing the activation vector of the l-th layer for the k-th data point in the batch, we employ the Manhattan distance as a metric to quantify the disparity in the activations of different data within the spiking neuron layer:

$d_M(c_k^l, c_j^l) = \sum_{i=1}^{N_l} \left|\hat{\lambda}_{ik}^l - \hat{\lambda}_{ij}^l\right|.$  (11)

Finally, we compute $K_M^l$ across the complete timestep. This computation can be expressed as follows:

$K_M^l = \begin{pmatrix} N_l - d_M(c_1^l, c_1^l) & \cdots & N_l - d_M(c_1^l, c_{N_b}^l) \\ \vdots & \ddots & \vdots \\ N_l - d_M(c_{N_b}^l, c_1^l) & \cdots & N_l - d_M(c_{N_b}^l, c_{N_b}^l) \end{pmatrix},$  (12)

where $N_b$ denotes the batch size used in the search process. The evaluation metric s for the architecture candidate is then computed as follows:

$s = \log \left| \det\left( \sum_{l=1}^{N_c} K_M^l \right) \right|,$  (13)

where $N_c$ represents the number of spiking neuron layers.
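A minimal NumPy sketch of this computation, under the assumption that binary spike tensors per layer are already available: spike counts are turned into the HPP rate estimates of Eq. (9), the per-layer kernel matrices of Eq. (12) are built from Manhattan distances, and the score of Eq. (13) is the log-determinant of their sum (slogdet is used here only for numerical stability; the tensor shapes are illustrative).

```python
import numpy as np

def pbse_score(layer_spikes):
    """layer_spikes: list of arrays, one per spiking layer,
    each of shape (batch N_b, neurons N_l, timesteps T) with binary entries."""
    kernels = []
    for spikes in layer_spikes:
        nb, nl, T = spikes.shape
        lam = spikes.sum(axis=2) / T                  # Eq. (9): lambda_hat per neuron and sample
        # Eq. (11): pairwise Manhattan distances between activation vectors c_k^l
        d = np.abs(lam[:, None, :] - lam[None, :, :]).sum(axis=2)
        kernels.append(nl - d)                        # Eq. (12): K_M^l, shape (N_b, N_b)
    k_sum = np.sum(kernels, axis=0)                   # sum over the N_c spiking layers
    sign, logdet = np.linalg.slogdet(k_sum)           # Eq. (13): s = log|det(.)|
    return logdet

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # two spiking layers, batch of 8, binary spike trains over 4 timesteps
    layers = [rng.integers(0, 2, size=(8, 64, 4)).astype(float),
              rng.integers(0, 2, size=(8, 128, 4)).astype(float)]
    print("PBSE score:", pbse_score(layers))
```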

4.3 Search Algorithm

MixedSNN employs the niche genetic algorithm [15] to navigate the search space SSP. This algorithm adeptly balances exploration and exploitation, upholding diversity within the population through fitness sharing and crowding distance mechanisms. These mechanisms preserve diversity and prevent premature convergence. By doing so, the algorithm facilitates comprehensive exploration of the solution space, leading to improved convergence towards high-quality solutions. This approach suits optimization of intricate problems requiring diversity preservation. In MixedSNN's framework, we select the top 10 spiking neural network architectures for training and to identify the optimal neural architecture. Algorithm 1 delineates the search algorithm employed in MixedSNN, with individuals represented within the population denoted as $P_i$ and defined as:

$P_i = \{Neuron, V_{threshold}\}.$  (14)


Algorithm 1: MixedSNN Algorithm

Input: Population number N, elite population number N_ne, evolutionary generations T, crossover probability P_c, mutation probability P_m, ratio factor n_e, fitness function PBSE(P(t)), Manhattan distance Manhadun(·), penalty value P_V, architecture population P(t)
Output: Spiking neural architecture
for t = 1 to T do
    for i = 1 to N do
        p_i, p_j ∈ P(t);
        F ← PBSE(P(t));
        NP(t) ← top N_ne individuals;
        p_c ← random(0, 1), p_m ← random(0, 1);
        if p_c ≤ P_c then Cross(p_i, p_j);
        if p_m ≤ P_m then Mutation(p_i);
    end for
    Calculate σ_n, σ_th;
    (D_neuron, D_threshold) ← (σ_n / n_e, σ_th / n_e);
    for j = 1 to N + N_ne do
        p_j, p_k ∈ P(t) + NP(t);
        if |Manhadun(p_j) − Manhadun(p_k)| ≤ (D_neuron, D_threshold) then
            if F_i > F_j then F_j ← F_j − P_V else F_i ← F_i − P_V;
        end if
    end for
end for
Find a Spiking Neural Architecture;

The process of updating the niche radius can be described by

$(D_{neuron}, D_{threshold}) = (\sigma_n / n_e, \sigma_{th} / n_e).$  (15)

Here, D_neuron and D_threshold designate the niche radii for the neuron-type coding and the threshold coding. σ_n and σ_th represent the standard deviations of the Manhattan distances of the neuron coding and the threshold coding, respectively. Additionally, n_e corresponds to the ratio factor. When the Manhattan distances between the neuron genes and the threshold genes of two individuals fall below the neuron and threshold niche radii, it indicates that the individuals are in an overcrowded state. In such scenarios, a fitness penalty is imposed on the individual with the lower fitness score to influence crowding selection. The incorporation of the crowding mechanism enhances global search capabilities, fostering increased diversity within the population.
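A minimal sketch of this crowding penalty, under one reading of Eq. (15): individuals are encoded as per-layer neuron indices and thresholds, the niche radii are the standard deviations of the pairwise Manhattan distances divided by n_e, and the weaker of two overcrowded individuals is penalized. The encoding and penalty value are illustrative assumptions, not the exact settings of Algorithm 1.

```python
import numpy as np

def manhattan(a, b):
    return float(np.abs(np.asarray(a) - np.asarray(b)).sum())

def apply_crowding_penalty(neuron_codes, thr_codes, fitness, ne=2.0, penalty=1.0):
    """neuron_codes / thr_codes: per-individual per-layer encodings;
    fitness: PBSE scores. Returns the penalized fitness array."""
    fitness = np.array(fitness, dtype=float)
    n = len(fitness)
    # Eq. (15): niche radii from the std of pairwise Manhattan distances
    dn = [manhattan(neuron_codes[i], neuron_codes[j]) for i in range(n) for j in range(i + 1, n)]
    dt = [manhattan(thr_codes[i], thr_codes[j]) for i in range(n) for j in range(i + 1, n)]
    d_neuron, d_thr = np.std(dn) / ne, np.std(dt) / ne
    for i in range(n):
        for j in range(i + 1, n):
            crowded = (manhattan(neuron_codes[i], neuron_codes[j]) <= d_neuron
                       and manhattan(thr_codes[i], thr_codes[j]) <= d_thr)
            if crowded:                               # penalize the less fit of the two
                k = j if fitness[i] > fitness[j] else i
                fitness[k] -= penalty
    return fitness
```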

5 Experiments

5.1 Experimental Setups

Experiments are performed using the SNN implementation developed with the SpikingJelly [5] Python library. Our SNNs' performance is assessed on three datasets: CIFAR10 [11], CIFAR100 [11], and CIFAR10-DVS [13]. CIFAR10 contains 60,000 color images across 10 classes, while CIFAR100 provides 100 fine-grained classes. These datasets are established benchmarks, evaluating SNNs on intricate image classification and object recognition tasks. CIFAR10-DVS converts CIFAR10 static images into a spatio-temporal event-stream representation, enabling dynamic visual information benchmarking for real-time processing. These datasets promote fair SNN architecture and learning method comparisons, advancing effective SNN model development.

Six hyperparameters are utilized in our experiments. The first hyperparameter, denoted as τ, determines the membrane time constant, controlling the decay rate of the membrane potential and presynaptic inputs. We set τ to a value of 3/4 for moderate decay. Another hyperparameter, N_b, is assigned a value of 100, representing the number of training batches. Table 1 provides an overview of these hyperparameters, which significantly influence the training dynamics and can impact the models' performance and convergence. Careful selection and tuning of these hyperparameters are crucial for achieving optimal results when training SNNs on the given datasets.

Table 1. Hyperparameters for training our SNNs on three datasets.

Datasets | Learning Rate | Schedule | Epoch | Batch Size
CIFAR10 | 0.1 | CosLR | 1000 | 32
CIFAR100 | 0.1 | CosLR | 2000 | 128
CIFAR10-DVS | 0.1 | CosLR | 1000 | 32

SNN Architectures. In our study, the experimental SNN adopts a VGG5-like architecture. The architecture is improved by incorporating a TDBN structure [34] instead of the conventional batch normalization module. Figure 2(a) shows the adapted VGG5 architecture, utilizing spiking neurons for image classification. It includes five convolutional layers comprising three normal cells and two linear cells. The normal cell (Fig. 2(b)) produces spikes based on input and internal state, while the linear cell (Fig. 2(c)) provides weighted sum outputs. The network's last layer is a softmax layer for estimating class probabilities. The MixNeuron layer integrates specific spiking neuron types in each cell, forming the MixedVGG5 structure (Fig. 3). This integration of mixed neuron models within the MixNeuron layer is a pivotal element of MixedSNN, enhancing the neural network's performance and capabilities. The combinations of mixed neurons and thresholds are denoted as the mixed neuron list MN = [EIF, EIF, IF, LIF, QIF] and the variable threshold list VT = [0.5, 0.5, 0.3, 1.5, 1.1].
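The sketch below illustrates, in PyTorch, how such per-layer neuron types and thresholds can be wired into a VGG5-like backbone. It is only a rough stand-in for MixedVGG5: ordinary BatchNorm2d replaces TDBN, only IF/LIF-style dynamics are emulated for all neuron labels, and the channel plan is an assumption.

```python
import torch
import torch.nn as nn

class SpikingNeuron(nn.Module):
    """Simplified spiking activation: charge, fire, hard reset.
    'kind' selects the charge rule (IF or a LIF-style leak); thresholds vary per layer."""
    def __init__(self, kind="IF", v_threshold=1.0, tau=2.0):
        super().__init__()
        self.kind, self.v_threshold, self.tau = kind, v_threshold, tau
        self.register_buffer("v", torch.zeros(1))

    def forward(self, x):                      # x: input current at one timestep
        v = self.v if self.v.shape == x.shape else torch.zeros_like(x)
        h = v + x if self.kind == "IF" else v + (x - v) / self.tau
        spike = (h >= self.v_threshold).float()
        self.v = h * (1.0 - spike)             # hard reset where a spike was fired
        return spike

def normal_cell(c_in, c_out, kind, v_threshold):
    """Conv + BatchNorm (stand-in for TDBN) + spiking neuron, loosely following Fig. 2(b)."""
    return nn.Sequential(nn.Conv2d(c_in, c_out, 3, padding=1),
                         nn.BatchNorm2d(c_out),
                         SpikingNeuron(kind, v_threshold),
                         nn.AvgPool2d(2))

MN = ["EIF", "EIF", "IF", "LIF", "QIF"]        # searched neuron list reported in the text
VT = [0.5, 0.5, 0.3, 1.5, 1.1]                 # searched threshold list reported in the text
channels = [3, 64, 128, 256]                   # assumed channel plan

cells = [normal_cell(channels[i], channels[i + 1], MN[i], VT[i]) for i in range(3)]
backbone = nn.Sequential(*cells, nn.Flatten(),
                         nn.Linear(256 * 4 * 4, 256), SpikingNeuron(MN[3], VT[3]),
                         nn.Linear(256, 10))

if __name__ == "__main__":
    x = torch.rand(2, 3, 32, 32)               # one timestep of a CIFAR-like input
    print(backbone(x).shape)                   # torch.Size([2, 10])
```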

5.2 Performance Evaluation

Table 2 presents a performance comparison between the obtained spiking neural network architecture, MixedVGG5, and baseline SNN models on three benchmark datasets. Remarkably, MixedVGG5 achieves performance comparable to previous works despite its significantly shallower network and the use of small timesteps. Specifically, on the CIFAR10-DVS dataset, MixedVGG5 achieves a classification accuracy of 78.55% without using data augmentation methods, surpassing the state-of-the-art results. When compared to models such as ResNet19 [34], AutoSNN [22], VGGSNN (VGG11) [3], and VGG11 [21], our MixedVGG5 architecture exhibits significantly improved accuracy with a simple structure.

Fig. 2. The experimental SNN architecture. (a) VGG5-like SNN for image classification. (b) Normal cell structure. (c) Linear cell structure.

Fig. 3. The obtained spiking neural network architecture (MixedVGG5) resulting from the neural architecture search using MixedSNN


Furthermore, the experiments demonstrate that when considering networks with similar structures, MixedVGG5 achieves competitive classification accuracy on the CIFAR-10 and CIFAR-100 datasets. For the CIFAR10 dataset, compared to Rathi [24], Wu [29], Rathi [23], and Wu [30], MixedVGG5 demonstrates improvements of 3.94%, 1.81%, 0.80%, and 0.32%, respectively, with a timestep value of 4. Similarly, for the CIFAR100 dataset, compared to Lu (VGG15) [18] and Rathi (ResNet20) [23], MixedVGG5 with its shallow architecture improves the performance by 1.25% and 0.38%, respectively.

Table 2. Comparison of classification accuracy (%) achieved by MixedVGG5 and baseline SNNs on the CIFAR10, CIFAR100, and CIFAR10-DVS datasets.

Paper | Datasets | Architecture | Timesteps | Accuracy (%)
Wu [30] | CIFAR10-DVS | CIFARNet | 12 | 60.5
Wu [29] | CIFAR10-DVS | SpikingCNN | 8 | 65.59
Zheng [34] | CIFAR10-DVS | ResNet19 | 10 | 67.80
Wu [31] | CIFAR10-DVS | LIAF-Net | 10 | 70.40
Na [22] | CIFAR10-DVS | AutoSNN | 8 | 72.50
Meng [21] | CIFAR10-DVS | VGG-11 | 20 | 77.27
Deng [3] | CIFAR10-DVS | VGGSNN | 10 | 77.33
Ours | CIFAR10-DVS | MixedVGG5 | 10 | 78.55
Rathi [24] | CIFAR10 | VGG5 | 75 | 86.91
Wu [29] | CIFAR10 | CIFARNet | 8 | 89.04
Rathi [23] | CIFAR10 | VGG6 | 10 | 90.05
Wu [30] | CIFAR10 | CIFARNet | 12 | 90.53
Rathi [24] | CIFAR10 | VGG16 | 100 | 91.13
Rathi [23] | CIFAR10 | ResNet20 | 5 | 91.78
Kundu [12] | CIFAR10 | VGG16 | 100 | 91.29
Zheng [34] | CIFAR10 | ResNet19 | 2 | 92.34
Na [22] | CIFAR10 | AutoSNN | 8 | 93.15
Kim [10] | CIFAR10 | SNASNet-Bw | 8 | 93.64
Ours | CIFAR10 | MixedVGG5 | 4 | 90.85
Lu [18] | CIFAR100 | VGG15 | 62 | 63.20
Rathi [23] | CIFAR100 | ResNet20 | 5 | 64.07
Kundu [12] | CIFAR100 | VGG11 | 120 | 64.98
Rathi [24] | CIFAR100 | VGG11 | 125 | 65.52
Na [22] | CIFAR100 | AutoSNN | 8 | 69.16
Kim [10] | CIFAR100 | SNASNet-Bw | 5 | 73.04
Ours | CIFAR100 | MixedVGG5 | 6 | 64.45

The MixedSNN framework applied to VGG5 has achieved state-of-the-art results on the CIFAR10-DVS dataset for event-based vision tasks. On the standard CIFAR10 and CIFAR100 image datasets, MixedVGG5 demonstrates competitive performance compared to other shallow spiking networks. However, its accuracy lags slightly behind a few deeper spiking models on these image datasets. Applying the MixedSNN methodology to deeper spiking architectures could help close this performance gap while maintaining efficient event-driven processing.

5.3 Ablation Experiments

Ablation on Validity of the Search Space in MixedSNN. We assessed 24 spiking neural network architectures to confirm the efficacy of the SSP search space. These architectures encompass mixed neurons (MN) and individual neurons (SN = {IF, LIF, PLIF, QIF, EIF}), coupled with the variable thresholds (VT) and three common unified thresholds (UV = {0.3, 0.5, 0.7}). Experiments were performed on the CIFAR10 and CIFAR100 datasets using a timestep of 4, and on the CIFAR10-DVS dataset with a timestep of 10. The results are depicted in Fig. 4, highlighting the three highest accuracy bars.

Fig. 4. Comparison of classification accuracy on the CIFAR10, CIFAR100, and CIFAR10-DVS datasets using mixed neurons (MN) versus individual neuron types and variable thresholds (VT) versus fixed unified thresholds.

In CIFAR10 and CIFAR100 image classification experiments, we observe minimal impact on performance when varying threshold settings but keeping neuron combinations fixed. However, altering neuron combinations significantly affects accuracy with a consistent threshold. Using the mixed neurons (MN) architecture yields superior performance across different thresholds. In contrast, for the CIFAR10-DVS event-based task, changing thresholds greatly impacts accuracy despite fixed neuron combinations. The variable threshold (VT) setting is not consistently optimal across neuron combinations. The quadratic integrate-and-fire (QIF) neuron performs well on CIFAR10-DVS across thresholds. The MN and VT architecture achieves top accuracy across all three datasets. These results validate the MixedSNN framework, showing that the neuron types and thresholds of the network architecture significantly influence performance. This affirms the value of exploring diverse neuron behaviors and thresholds, spotlighting the importance of the proposed search space.

Ablation on Validity of the Evaluation Strategy in MixedSNN. To validate the effectiveness of the proposed PBSE evaluation strategy, we performed experiments comparing PBSE with the SAHA (Sparsity-Aware Hamming Distance) method proposed in [10]. The experimental results are described in Fig. 5. In the experiments on the static datasets (CIFAR10, CIFAR100), as shown in Figs. 5(a) and (b), we evaluate architectures from both search strategies across timesteps of 2, 4, and 6. On CIFAR10, both strategies achieve similar accuracy, with our architecture reaching 90.93% at timestep 6. Accuracy increases overall with larger timesteps.

Fig. 5. Comparison of classification accuracy on the CIFAR10, CIFAR100, and CIFAR10-DVS datasets using the proposed PBSE evaluation versus the SAHA method.

For the CIFAR100 dataset, the PBSE evaluation strategy performs better than SAHA. Our architecture's accuracy rises steadily with the timestep, achieving 64.45% at timestep 6. The accuracy of our architecture at timestep = 2 and timestep = 6 is 58.37% and 64.45%, respectively, both higher than the best accuracy of SAHA. The gap between the strategies is more pronounced on the CIFAR100 task. For the CIFAR10-DVS dataset, as shown in Fig. 5(c), our architecture surpasses several state-of-the-art deep spiking networks. At timestep 6, our accuracy is 1.55% higher than the comparison method, and our model achieves a state-of-the-art 78.55% accuracy at timestep 10, exceeding the comparison method by 3.03%. Our architecture attains better accuracy with fewer timesteps, validating the proposed PBSE evaluation approach.

6 Conclusion

This study presents MixedSNN, a dedicated neural architecture search framework specifically designed for spiking neural networks. While SNNs offer energy-efficient computation and effective temporal information processing, the optimization of SNNs through NAS has been relatively unexplored. MixedSNN addresses this gap by leveraging the unique characteristics of SNNs and introducing a novel search space called SSP, which investigates the impact of mixed spiking neurons and variable thresholds on network performance. Furthermore, a training-free evaluation strategy, known as Period-Based Spike Evaluation (PBSE), is proposed to capture the temporal features of SNNs by considering spike activation patterns. Experimental evaluations on the CIFAR10, CIFAR100, and CIFAR10-DVS datasets show that MixedSNN achieves state-of-the-art performance with significantly reduced timesteps, demonstrating the effectiveness of the framework in optimizing SNN architectures. This study paves the way for further research and advancements in the design and optimization of SNNs using NAS techniques.

References 1. Abbott, L.F.: Lapicque’s introduction of the integrate-and-fire model neuron (1907). Brain Res. Bull. 50(5–6), 303–304 (1999) 2. Chen, W., Gong, X., Wang, Z.: Neural architecture search on ImageNet in four GPU hours: a theoretically inspired perspective. In: International Conference on Learning Representations (2021) 3. Deng, S., Li, Y., Zhang, S., Gu, S.: Temporal efficient training of spiking neural network via gradient re-weighting. In: International Conference on Learning Representations (2022) 4. Dong, X., Yang, Y.: NAS-BENCH-201: extending the scope of reproducible neural architecture search. In: 8th International Conference on Learning Representations, (ICLR), Addis Ababa, Ethiopia, 26–30 April 2020 (2020) 5. Fang, W., Chen, Y., Ding, J., Chen, D., Yu, Z., et al.: SpikingJelly (2020). https:// github.com/fangwei123456/spikingjelly. Accessed 20 June 2023 6. Fang, W., Yu, Z., et al.: Incorporating learnable membrane time constant to enhance learning of spiking neural networks. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2661–2671 (2021) 7. Gerstner, W., Kistler, et al.: Neuronal Dynamics: From Single Neurons to Networks and Models of Cognition. Cambridge University Press (2014) 8. Guo, Y., Peng, W., Chen, Y., et al.: Joint A-SNN: joint training of artificial and spiking neural networks via self-distillation and weight factorization. Pattern Recogn. 142, 109639 (2023) 9. He, K., Zhang, X., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 10. Kim, Y., Li, Y., Park, H., et al.: Neural architecture search for spiking neural networks. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision-ECCV 2022: 17th European Conference, Tel Aviv, Israel, 23–27 October 2022, Proceedings, Part XXIV, pp. 36–56. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20053-3 3


11. Krizhevsky, A., Hinton, G.: Learning multiple layers of features from tiny images. Technical report 0, University of Toronto, Toronto, Ontario (2009) 12. Kundu, S., Datta, G., Pedram, M., Beerel, P.A.: Spike-thrift: towards energyefficient deep spiking neural networks by limiting spiking activity via attentionguided compression. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 3953–3962 (2021) 13. Li, H., Liu, H., Ji, X., et al.: CIFAR10-DVS: an event-stream dataset for object classification. Front. Neurosci. 11, 309 (2017) 14. Li, W., Wen, S., Shi, K., Yang, Y., Huang, T.: Neural architecture search with a lightweight transformer for text-to-image synthesis. IEEE Trans. Netw. Sci. Eng. 9(3), 1567–1576 (2022) 15. Li, X., Epitropakis, M.G., Deb, K., et al.: Seeking multiple solutions: an updated survey on niching methods and their applications. IEEE Trans. Evol. Comput. 21(4), 518–538 (2016) 16. Liu, H., Simonyan, K., Yang, Y.: DARTS: differentiable architecture search. In: International Conference on Learning Representations (2019) 17. Lopes, V., Alirezazadeh, S., Alexandre, L.A.: EPE-NAS: efficient performance estimation without training for neural architecture search. In: Farkaˇs, I., Masulli, P., Otte, S., Wermter, S. (eds.) ICANN 2021. LNCS, vol. 12895, pp. 552–563. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-86383-8 44 18. Lu, S., Sengupta, A.: Exploring the connection between binary and spiking neural networks. Front. Neurosci. 14, 535 (2020) 19. Maass, W.: Networks of spiking neurons: the third generation of neural network models. Neural Netw. 10(9), 1659–1671 (1997) 20. Mellor, J., Turner, J., et al.: Neural architecture search without training. In: International Conference on Machine Learning, pp. 7588–7598. PMLR (2021) 21. Meng, Q., Xiao, M., Yan, S., et al.: Training high-performance low-latency spiking neural networks by differentiation on spike representation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12444– 12453 (2022) 22. Na, B., Mok, J., Park, S., et al.: AutoSNN: towards energy-efficient spiking neural networks. In: International Conference on Machine Learning, pp. 16253–16269. PMLR (2022) 23. Rathi, N., Roy, K.: DIET-SNN: a low-latency spiking neural network with direct input encoding and leakage and threshold optimization. IEEE Trans. Neural Netw. Learn. Syst. (2021) 24. Rathi, N., Srinivasan, G., Panda, P., Roy, K.: Enabling deep spiking neural networks with hybrid conversion and spike timing dependent backpropagation. In: International Conference on Learning Representations (2020) 25. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: 3rd International Conference on Learning Representations, (ICLR), San Diego, CA, USA, 7–9 May 2015 (2015) 26. Sun, Z., Lin, M., Sun, et al.: MAE-DET: revisiting maximum entropy principle in zero-shot NAS for efficient object detection. In: International Conference on Machine Learning, pp. 20810–20826. PMLR (2022) 27. Taherkhani, A., Belatreche, A., Li, Y., et al.: A review of learning in biologically plausible spiking neural networks. Neural Netw. 122, 253–272 (2020) 28. Wang, Y., Zhang, M., Chen, Y., Qu, H.: Signed neuron with memory: towards simple, accurate and high-efficient ANN-SNN conversion. In: International Joint Conference on Artificial Intelligence (2022)


29. Wu, J., Chua, Y., Zhang, M., et al.: A tandem learning rule for effective training and rapid inference of deep spiking neural networks. IEEE Trans. Neural Netw. Learn. Syst. (2021) 30. Wu, Y., Deng, L., Li, G., Zhu, J., Xie, Y., Shi, L.: Direct training for spiking neural networks: faster, larger, better. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 1311–1318 (2019) 31. Wu, Z., Zhang, H., et al.: LIAF-Net: leaky integrate and analog fire network for lightweight and efficient spatiotemporal information processing. IEEE Trans. Neural Netw. Learn. Syst. 33(11), 6249–6262 (2021) 32. Xu, J., Zhao, L., Lin, J., et al.: KNAS: green neural architecture search. In: International Conference on Machine Learning, pp. 11613–11625. PMLR (2021) 33. Ying, C., Klein, A., Christiansen, E., Real, E., Murphy, K., Hutter, F.: NAS-Bench101: towards reproducible neural architecture search. In: International Conference on Machine Learning, pp. 7105–7114. PMLR (2019) 34. Zheng, H., Wu, Y., Deng, L., et al.: Going deeper with directly-trained larger spiking neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 11062–11070 (2021)

Towards Scalable Feature Selection: An Evolutionary Multitask Algorithm Assisted by Transfer Learning Based Co-surrogate

Liangjiang Lin1, Zefeng Chen1(B), and Yuren Zhou2

1 School of Artificial Intelligence, Sun Yat-sen University, Zhuhai, China
[email protected], [email protected]
2 School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
[email protected]

Abstract. When faced with large-instance datasets, existing feature selection methods based on evolutionary algorithms still face the challenge of high computational cost. To address this issue, this paper proposes a scalable evolutionary algorithm for feature selection on large-instance datasets, namely, the transfer learning based co-surrogate assisted evolutionary multitask algorithm (cosEMT). Firstly, we tackle the feature selection on large-instance datasets via an evolutionary multitasking framework. The co-surrogate models are constructed to measure the similarity between each auxiliary task and the main task, and the knowledge transfer between tasks is realized through instance-based transfer learning. Through the numerical relationship between the relative and absolute number of transferable instances, we propose a novel dynamic resource allocation strategy to make more efficient use of limited computational resources and accelerate evolutionary convergence. Meanwhile, an adaptive surrogate model update mechanism is proposed to balance the exploration and exploitation of the base optimizer embedded in the cosEMT framework. Finally, the proposed algorithm is compared with several state-of-the-art feature selection algorithms on twelve large-instance datasets. The experimental results show that the cosEMT framework can obtain significant acceleration in the convergence speed and high-quality solutions. All verify that cosEMT is a highly competitive method for feature selection on large-instance datasets.

Keywords: Feature Selection · Evolutionary Multitasking (EMT) · Surrogate · Transfer Learning · Resource Allocation

1 Introduction

Feature selection (FS) [1], as an important aspect of machine learning, is widely used in many real-world applications. In practice, feature selection methods can be classified into three categories: 1) filter-based; 2) wrapper-based; 3) embedded methods [2]. Considering that the wrapper-based methods can always achieve better classification accuracy while being applicable to any classifier, our study puts emphasis on the design of efficient wrapper-based methods for FS when the dataset is large (i.e., containing a large number of training instances).

Among existing wrapper-based methods, evolutionary algorithms (EAs) are a type of powerful gradient-free optimization methods for FS. However, when faced with large-instance datasets, the repeated evaluations of candidate solutions in an evolved population require masses of computational cost. At present, several effective algorithms to handle the scalability issue brought by large-instance datasets have been proposed, such as hardware acceleration techniques [4], distributed computing (such as using the MapReduce paradigm) [5] and the multitask evolutionary ML (MEML) algorithm with minions [3]. The former two approaches require advanced hardware or plenty of hardware resources to support them so that they can achieve high efficiency. In contrast, the MEML does not depend on plenty of hardware, and it is the first work introducing the framework of evolutionary multitasking (EMT) [14] to deal with the FS on large-instance datasets. It referred to the auxiliary tasks that are assigned with small portions of the original dataset as minions, and the effect of minions has been demonstrated in the application of single-objective FS [3] and multiobjective hyper-parameter tuning of deep neural network models [6]. In this work, we continue to employ the idea of minions and concentrate on utilizing the surrogate model with instance-based transfer learning to further improve the efficiency of EAs applied to the FS on large-instance datasets.

The main contributions of this work are summarized as follows.

1. For the FS on large-instance datasets, we propose the transfer learning based co-surrogate assisted evolutionary multitask algorithm (named cosEMT for short). In this algorithm, multiple auxiliary tasks with small training datasets created by subsampling are incorporated to effectively reduce the computational cost of the embedded EA and assist the evolutionary process of the main task through instance-based transfer learning.
2. A Gaussian process (GP) surrogate is built for the main task, and we respectively build a co-surrogate GP model between each auxiliary task and the main task. For making full use of computational resources, we use the numerical relationship of the surrogate models, namely, the absolute and relative number of the transferred instances, to measure the similarities between each auxiliary task and the main task, and then propose a novel dynamic computational resource allocation strategy.
3. We adopt an adaptive selection function to manage and update the surrogate models to improve the accuracy of the surrogate models.

The rest of this paper is organized as follows. In Sect. 2, we present the related background in the literature. In Sect. 3, we describe the details of our proposed algorithm. The results of our experiments are presented in Sect. 4. In Sect. 5, we conclude this work and present the potential directions for future research.

2 Preliminaries: Basics of Multitask Optimization

Consider that there are currently K optimization tasks ready to be executed at the same time. The j-th task, denoted as $T_j$, has a search space $\chi_j$ and objective function $f_j: \chi_j \to \mathbb{R}$. Each task may also be subjected to several equality and/or inequality constraints. In this context, the purpose of multitask optimization is to find a set of optimal solutions $\{x_1^*, x_2^*, \cdots, x_K^*\} = \arg\max\{f_1(x), f_2(x), \cdots, f_K(x)\}$, such that $x_j^* \in \chi_j$ and all constraints of $T_j$ are satisfied. For an FS problem at hand, we can regard it as the main task, and generate several auxiliary tasks through subsampling. The main task and auxiliary tasks share the same search space for multitask optimization so that knowledge can be exchanged between tasks. As illustrated in Fig. 1, we are primarily concerned with solving K tasks, whereas $T_1, T_2, \cdots, T_{K-1}$ serve as auxiliary tasks to assist the main task $T_K$.

Fig. 1. Note that the computational cost of fitness evaluation on each auxiliary task is smaller than that of the main task, and the evolution of the main task can be assisted by instance-based transfer learning. For the symbols that appear in this figure, their meanings will be explained in Sect. 3.
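As a concrete example of this setup, the sketch below builds one main FS task over the full training set and K−1 auxiliary "minion" tasks over disjoint stratified subsamples, with a wrapper-style fitness based on a Naive Bayes classifier; the toy dataset, K, and the 3-fold evaluation are placeholder assumptions, not the experimental protocol of the paper.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.naive_bayes import GaussianNB

def make_tasks(X, y, K=4, seed=0):
    """Main task uses (X, y); the K-1 auxiliary tasks get disjoint stratified folds."""
    skf = StratifiedKFold(n_splits=K - 1, shuffle=True, random_state=seed)
    aux = [(X[idx], y[idx]) for _, idx in skf.split(X, y)]
    return (X, y), aux

def fitness(mask, task):
    """Wrapper-style fitness of a 0/1 feature mask: CV accuracy of a Naive Bayes classifier."""
    X, y = task
    cols = np.flatnonzero(mask)
    if cols.size == 0:
        return 0.0
    return cross_val_score(GaussianNB(), X[:, cols], y, cv=3).mean()

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(600, 20))
    y = (X[:, 0] + X[:, 1] > 0).astype(int)            # toy labels
    main, aux = make_tasks(X, y, K=4)
    mask = rng.integers(0, 2, size=20)
    print("main-task fitness:", round(fitness(mask, main), 3))
    print("aux-task fitness:", [round(fitness(mask, a), 3) for a in aux])
```

Because each auxiliary task trains the classifier on only a fraction of the instances, its fitness evaluation is correspondingly cheaper, which is the source of the evaluation latency exploited in Sect. 3.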

3 Proposed CosEMT Framework

3.1 Co-surrogate Model for Instance-Based Transfer Learning

In this paper, we use Gaussian processes (GP) [7] as the surrogate model, because they provide not only a prediction of the objective function but also a confidence level for the prediction. Due to the difference in the size of the training dataset, there is a time latency in the fitness evaluation of the population between the auxiliary tasks and the main task. To utilize this evaluation latency, we build a GP model for the main task, called GP_M, and respectively build a regression model for each auxiliary task, called the co-surrogate model GP_C^j, ∀j ≤ K−1, to estimate the underlying relationship between each auxiliary task and the main task. We define $D_A^j = \{(X_A^j, Y_A^j)\}$ (∀j ≤ K−1) (see Footnote 1) as the database of the j-th auxiliary task, $D_M = \{(X_M, Y_M)\}$ as the database of the GP_M model, and $D_C^j = \{(X_C^j, Y_C^j)\}$ (∀j ≤ K−1) as the database of the corresponding GP_C^j model, where the difference of population fitness between each auxiliary task and the main task, denoted $Y_C^j$ (∀j ≤ K−1), is calculated as follows:

$Y_C^j = Y_M - Y_A^j, \quad \forall j \leq K-1.$  (1)

For the same classification task, the training time complexity of the Naive Bayes classifier is O(n × f), where n is the size of the training dataset and f is the number of features. The difference between the auxiliary tasks and the main task lies in the size n of the training dataset used, which depends on the number of auxiliary tasks (according to the generation method of the training dataset for each auxiliary task introduced in Sect. 3.4). Therefore, the evaluation latency between each auxiliary task and the main task is defined as

$\tau = K - 1,$  (2)

where K is the number of all tasks. Therefore, when all tasks are processed in parallel, the main task trains GP_M by evaluating μ new solutions through the real objective function (analyzed in Sect. 3.3), while each auxiliary task can utilize the latency to evaluate an additional μ × (τ − 1) solutions, which we define as $\{(X_a^j, Y_a^j)\}$, ∀j ≤ K−1. Therefore, the co-surrogate model GP_C^j can predict the solutions' fitness difference between each auxiliary task and the main task on $X_a^j$, denoted $Y_C^{a,j}$ (∀j ≤ K−1), and the synthetic values of predicted fitness for the main task on $X_a^j$ are then determined as follows:

$Y_A^{a,j} = Y_C^{a,j} + Y_a^j, \quad \forall j \leq K-1.$  (3)

Therefore, the auxiliary database for the main task is defined as $D_a^j = \{(X_a^j, Y_A^{a,j})\}$, ∀j ≤ K−1. However, not all instances belonging to $D_a^j$ are suitable for transfer to the database of GP_M. The hypothesis we make here is that the prediction given by GP_C^j is unreliable if the predicted value is out of the boundary predicted by GP_M [8]. For the solutions $X_a^j$, we need to get the corresponding mean fitness values $Y_M^{a,j}$ (∀j ≤ K−1) and the standard deviations $\sigma_M^{a,j}$ (∀j ≤ K−1) through GP_M. Therefore, the lower and upper bounds of the confidence interval are respectively defined as

$LCI^j = Y_M^{a,j} - \sigma_M^{a,j},$  (4)
$UCI^j = Y_M^{a,j} + \sigma_M^{a,j},$  (5)

where ∀j ≤ K−1. For each $(X_a^j, Y_A^{a,j}) \in D_a^j$, only when $Y_A^{a,j}$ lies within this confidence interval is the corresponding $(X_a^j, Y_A^{a,j})$ considered reliable and stored in the transferable database $D_T^j = \{(X_T^j, Y_T^j)\}$, ∀j ≤ K−1.

Footnote 1: The database {(X, Y)} is defined as a set, in which each element is represented in the form of (X, Y), where X denotes the decision vector of a solution and Y is the corresponding fitness.
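A minimal scikit-learn sketch of this construction for one auxiliary task, covering Eqs. (1)-(5): GP_M models the main-task fitness, a co-surrogate GP models the fitness difference, synthetic main-task values are formed as in Eq. (3), and only instances inside the GP_M confidence interval are kept as transferable. The data shapes, toy fitness landscapes and default kernel are assumptions for illustration only.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def transferable_instances(X_shared, y_main, y_aux, X_a, y_a):
    """X_shared: solutions evaluated on both the main and the auxiliary task;
    X_a / y_a: extra solutions evaluated on the auxiliary task only (using the latency)."""
    gp_m = GaussianProcessRegressor().fit(X_shared, y_main)          # GP_M
    gp_c = GaussianProcessRegressor().fit(X_shared, y_main - y_aux)  # Eq. (1): Y_C = Y_M - Y_A
    y_synth = gp_c.predict(X_a) + y_a                                # Eq. (3): synthetic fitness
    mu, sigma = gp_m.predict(X_a, return_std=True)
    keep = (y_synth >= mu - sigma) & (y_synth <= mu + sigma)         # Eqs. (4)-(5): CI filter
    return X_a[keep], y_synth[keep]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.random((30, 8))
    f_main = X.sum(axis=1) + 0.05 * rng.normal(size=30)              # toy fitness landscapes
    f_aux = X.sum(axis=1) + 0.3 * rng.normal(size=30)
    X_a = rng.random((20, 8))
    y_a = X_a.sum(axis=1) + 0.3 * rng.normal(size=20)
    X_t, y_t = transferable_instances(X, f_main, f_aux, X_a, y_a)
    print("transferred", len(X_t), "of 20 latency-evaluated solutions")
```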

Dynamic Resource Allocation Strategy

A dynamic resource allocation strategy is necessary to better exploit a potentially good fitness function’s landscape of the auxiliary tasks to reduce the evaluation cost on the original dataset. Therefore, when the fitness function of an auxiliary task has a larger similarity to the main task, it should be allocated more computational resources. Intuitively, when the number of transferred instances j , we can consider that the of an auxiliary task occupies a larger proportion in DT fitness function landscape of the corresponding auxiliary task is more similar to the main task. With this insight, we define the comparison coefficient as |Dj | , ∀j ≤ K − 1 (6) aj = K−1T i i=1 |DT | K−1 where |·| represents the size of a database and j=1 aj = 1. However, it is incomplete to only consider the relative proportion of transferred instances’ number, because when the number of transferable instances belonging to each auxiliary task is low, more resources should be allocated to the main task. Therefore, the measurement coefficient representing each auxiliary task’s own meaningful instance-based transfer to the main task is defined as bj =

j | |DT

|Daj |

, ∀j ≤ K − 1

(7)

where 0 ≤ bj ≤ 1. To further clarify, as the value of bj approaches one, the number of transferable instances for the corresponding auxiliary task increases, which indicates that the auxiliary task performs more meaningful instance-based transfer, so it can be inferred that the auxiliary task has a greater similarity with the main task. Considering the influence of the above two coefficients, we propose the following formulas to execute the dynamic resource allocation strategy: Nj =Ntotal × aj × bj , ∀j ≤ K − 1 NK = Ntotal −

K−1  j=1

Nj

(8) (9)


where N_j and N_K are the population sizes assigned to each auxiliary task and the main task, respectively, after the dynamic resource allocation strategy is executed, and N_total is the total population size across all tasks.
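A minimal NumPy sketch of Eqs. (6)–(9) follows; the flooring of N_j to integers and the guard against an empty set of transferred instances are implementation choices of this sketch, not details taken from the paper.

```python
import numpy as np

def allocate_resources(n_transfer, n_aux, n_total):
    """Dynamic resource allocation of Eqs. (6)-(9).

    n_transfer : |D_T^j| for each auxiliary task j
    n_aux      : |D_a^j| for each auxiliary task j
    n_total    : total population size across all tasks
    Returns the auxiliary-task population sizes and the main-task size N_K.
    """
    n_transfer = np.asarray(n_transfer, dtype=float)
    n_aux = np.asarray(n_aux, dtype=float)
    a = n_transfer / max(n_transfer.sum(), 1.0)   # Eq. (6), guarded for empty D_T
    b = n_transfer / n_aux                        # Eq. (7)
    n_j = np.floor(n_total * a * b).astype(int)   # Eq. (8), floored to integers
    n_main = n_total - int(n_j.sum())             # Eq. (9): remainder to main task
    return n_j, n_main
```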

3.3 Mechanism of Surrogate Model Update

It is of paramount importance to select the right candidate solutions to be evaluated using the real fitness function so that the quality of the surrogate models can be improved. Here, new solutions are generated through the population update strategy of the base optimizer embedded in the cosEMT framework. At the same time, μ new solutions are selected to be evaluated by the real fitness function of the main task through the following adaptive selection function [9]:

ASF(x) = (1 − γ) × (φ_x / φ_max) + γ × (σ_x / σ_max)   (10)

The value of γ is set as follows:

γ = −0.5 × cos(FE / FE_max × π) + 0.5   (11)

where φ_x and σ_x denote the mean and variance of the predicted fitness values of each new solution x, respectively, and φ_max and σ_max represent the maximum values of φ_x and σ_x across all new solutions. FE denotes the current number of real fitness function evaluations, and FE_max represents the predefined maximum number of real fitness function evaluations. The main motivation behind the adaptive selection function (ASF) is to achieve relatively fast convergence in the early stage by assigning a large weight to the predicted fitness value, while a more explorative search is achieved in the later search stage by assigning a large weight to the uncertainty. Specifically, the ASF score of each new solution is calculated and the results are sorted in descending order. The top μ solutions are evaluated by the true fitness function of the main task and added to the database of GP_M.
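As an illustration, the ASF-based selection of Eqs. (10)–(11) can be written as follows (NumPy assumed; the helper name and the convention that larger predicted fitness is better are ours).

```python
import numpy as np

def asf_select(phi, sigma, fe, fe_max, mu):
    """Adaptive selection function of Eqs. (10)-(11).

    phi    : (n,) predicted mean fitness of the new candidate solutions
    sigma  : (n,) predicted variance (uncertainty) of the candidates
    fe     : current number of real fitness evaluations
    fe_max : maximum number of real fitness evaluations
    mu     : number of candidates to evaluate with the real fitness function
    Returns the indices of the top-mu candidates by ASF score.
    """
    gamma = -0.5 * np.cos(fe / fe_max * np.pi) + 0.5                          # Eq. (11)
    scores = (1 - gamma) * (phi / phi.max()) + gamma * (sigma / sigma.max())  # Eq. (10)
    return np.argsort(-scores)[:mu]   # descending order, keep the top mu
```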

3.4 Summary of cosEMT Framework

The pseudocode of the cosEMT framework is presented in Algorithm 1. We assign non-overlapping training data to each auxiliary task by stratified sampling from the original dataset, denoted as D_train^j, ∀j ≤ K − 1, with D_train = ∪_{j=1}^{K−1} D_train^j. At the start, i.e., t = 0, we obtain the databases of GP_M and GP_C^j (∀j ≤ K − 1), and allocate equal computational resources to each task T_j, ∀j ≤ K. At the initial stage of each iteration, we first update the GP surrogate models with the corresponding databases. In every generation, we run the base optimizer embedded in the cosEMT framework to generate new candidate solutions for each task. Then, the selection fitness of the candidate solutions obtained by the main task is calculated by the ASF to select μ solutions, which are evaluated by the real fitness function of the main task and added to the database of GP_M. While the main task performs expensive fitness


evaluations, each auxiliary task can perform additional fitness evaluations to generate the auxiliary database. Through the calculation of the synthetic values and the execution of the instance selection strategy, the transferable instances are generated and added to the databases of GP_M and GP_C^j (∀j ≤ K − 1). Note that the operations conducted on each task (Lines 3–16) are executed in parallel.

Algorithm 1. Pseudocode of cosEMT
Input: T = {T_j}_{j=1}^K, D = {D^j = D_train^j ∪ D_val^j}_{j=1}^K, FE_max, μ, τ, Δt, N_total.
Output: Optimized solution(s) from the final population of T_K.
1: Initialization: Initialize the population for each task, N_j = N_total/K, ∀j ≤ K. Set D_M = {(X_M, Y_M)}, D_C^j = {(X_C^j, Y_C^j)}, D_T^j = ∅, D_Cnew^j = ∅ (∀j ≤ K − 1), FE = N_total and t = 0.
2: while FE ≤ FE_max do
3:   Train GP_M with the database D_M for the main task;
4:   Train GP_C^j with the database D_C^j for each auxiliary task;
5:   Run the base optimizer to generate new candidate solutions (to be evaluated by ASF) for the main task;
6:   Use ASF to determine μ solutions to be evaluated by the real fitness function of the main task T_K. Add the μ new solutions to the database of GP_M;
7:   for j = 1 : K − 1 do
8:     Run the base optimizer to generate and evaluate μ × (τ − 1) new candidate solutions {(X_a^j, Y_a^j)} for the j-th auxiliary task;
9:     Set D_a^j = {(X_a^j, Y_A^{a,j})}. Use GP_M to predict the mean fitness values Y_M^{a,j} and the uncertainty σ_M^{a,j} on X_a^j; calculate the confidence interval using Eqs. (4) & (5);
10:    for each (X_a^j, Y_A^{a,j}) ∈ D_a^j do
11:      if Y_A^{a,j} ∈ [LCI^j, UCI^j] then
12:        The corresponding solution (X_a^j, Y_A^{a,j}) is transferable and saved in D_T^j;
13:      end if
14:    end for
15:    for each (X_T^j, Y_T^j) ∈ D_T^j do
16:      Calculate Y_C^j on X_T^j using Eq. (1). Save (X_T^j, Y_C^j) into D_Cnew^j;
17:    end for
18:  end for
19:  if mod(t, Δt) = 0 then
20:    D_M ← D_M, D_C^j ← D_C^j (∀j ≤ K − 1);
21:  else
22:    D_M ← D_M ∪ D_T^j (∀j ≤ K − 1), D_C^j ← D_C^j ∪ D_Cnew^j (∀j ≤ K − 1);
23:    Execute the dynamic computational resource allocation strategy;
24:    D_T^j ← ∅, D_Cnew^j ← ∅;
25:  end if
26:  Update FE = FE + μ, t = t + 1;
27: end while
28: return the optimized solutions in D_M

The transfer interval Δt indicates the frequency at which instance-based transfer is conducted across tasks, and t denotes the current generation count. When mod(t, Δt) = 0, the training databases of the GP surrogate models are kept consistent with the previous generation. When mod(t, Δt) ≠ 0, the similarity between each auxiliary task and the main task is measured, and the instance-based transfer learning and the dynamic computational resource allocation strategy are executed. The process iterates until the number of real fitness evaluations reaches FE_max.


Remark 1. Main novelty of our proposed cosEMT over MEML: MEML only realizes soft knowledge exchange by calculating a mixed probability model, which does not make full use of the difference in evaluation cost between the auxiliary tasks and the main task, resulting in a waste of computational resources. In contrast, cosEMT makes full use of the latency in evaluation cost and of the numerical relationship captured by the surrogate models to further measure the similarity between each auxiliary task and the main task, so as to utilize the source-domain instances.

4 Experimental Studies

4.1 Datasets for Experiments

12 datasets (each containing at least 10000 instances) are selected from the UCI data repository [10] and Kaggle [11] to conduct the experiments to verify the superiority of the proposed algorithm. The details of the datasets are listed in Table 1.

Table 1. Details of benchmark datasets.

No.  Dataset                  No. of instances  No. of features
1    Electrical Grid Data     10000             14
2    Online Shoppers          12330             18
3    Dry Bean                 13611             17
4    MAGIC Telescope          19020             11
5    Letter Recognition       20000             16
6    Avila                    20867             10
7    Default of Credit Card   30000             24
8    Bank Marketing           45211             17
9    Secondary Mushroom       61069             21
10   Clickstream Data         165474            14
11   Diabetes Health          253680            22
12   Linkage Comparison       574913            12

4.2 Experimental Configuration

The Naive Bayes method is utilized as the classifier in this paper, and each dataset is divided into a training set and a validation set with a splitting ratio of 80% and 20%. The rest of the algorithmic settings are specified as follows.

1) Solution Representation: binary coded.
2) Total population size N_total: 100.


3) Parameters for cosEMT: a) number of auxiliary tasks: 3; b) number of new samples μ for updating the GP: 10; c) transfer interval Δt: 2.
4) Evaluation Criterion: mean classification score (Mean).

In this paper, several well-known algorithms are compared with cosEMT, namely the binary genetic algorithm (BGA), sticky binary PSO (SBPSO) [12], and two variants of a recently proposed effective feature selection framework, MEML [3], namely MEML-BGA and MEML-SBPSO. For a clear demonstration of the superiority of our cosEMT framework, we choose BGA as the base optimizer used within cosEMT and denote the resulting algorithm as cosEMT-BGA. In our experiments, all comparative algorithms use the same classifier and evaluation criterion as cosEMT in the evaluation step. The maximum number of real fitness evaluations FE_max is set to 4000 and acts as the termination condition of each algorithm.
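For reference, a hedged sketch of how one real fitness evaluation of a binary-coded feature mask might look with scikit-learn's Gaussian Naive Bayes is shown below; validation accuracy is used as a stand-in for the paper's mean classification score, and the function name is illustrative.

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

def fitness(mask, X_train, y_train, X_val, y_val):
    """Real fitness of a binary-coded feature subset: train a Naive Bayes
    classifier on the selected columns and return its validation accuracy."""
    mask = np.asarray(mask, dtype=bool)
    if not mask.any():                      # empty feature subsets are invalid
        return 0.0
    clf = GaussianNB().fit(X_train[:, mask], y_train)
    return accuracy_score(y_val, clf.predict(X_val[:, mask]))
```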

4.3 Results of Comparative Experiments

The results shown in Table 2 are the mean classification scores achieved by cosEMT and the comparative algorithms on each dataset over 30 runs. It can be seen from Table 2 that cosEMT-BGA achieves the best performance on most datasets, except Letter Recognition, and the MEML framework is ranked second on most datasets under the Wilcoxon signed rank test with a confidence level of 95%, which shows the advancement of the multitasking framework for knowledge transfer. Table 2. Comparison of the mean performance and standard deviation of different comparative algorithms on different datasets. Results in dark and light grey backgrounds indicate the best and second-best results, respectively. The Wilcoxon signed rank test with a confidence level of 95% is used. "" indicates the result is significantly outperformed by cosEMT-BGA. "†" indicates the result is not significantly different from cosEMT-BGA. "‡" indicates the result is significantly better than cosEMT-BGA. Dataset

Electrical Grid Data Online Shoppers Dry Bean MAGIC Telescope Letter Recognition Avila Default of Credit Card Bank Marketing Secondary Mushroom Clickstream Data Diabetes Health Linkage Comparison

BGA

SBPSO

MEML-BGA

MEML-SBPSO

Mean ± Std

Mean ± Std

Mean ± Std

Mean ± Std

± ± ± ± ± ± ± ± ± ± ± ±

74.60 ± 0.15 † 89.15 ± 0.35  91.30 ± 0.16  77.17 ± 0.09  65.68 ± 1.55 † 91.74 ± 0.30 † 81.44 ± 0.14  91.25 ± 0.73  66.48± 0.49 † 59.26 ± 0.87 † 86.02 ± 0.60 † 99.42 ± 0.35 †

74.07 89.12 91.05 76.85 64.77 87.04 81.27 91.05 65.63 59.19 85.86 99.29

± ± ± ± ± ± ± ± ± ± ± ±

0.02 0.98 0.11 0.16 0.22 0.17 0.75 0.19 0.78 1.24 0.43 0.65

           

74.21 89.10 91.25 77.17 65.56 89.68 81.29 91.30 66.36 58.68 85.94 99.36

± ± ± ± ± ± ± ± ± ± ± ±

0.05 1.17 0.74 0.13 0.10 0.14 1.02 0.17 0.19 0.33 0.31 0.36

    †    †  † 

74.47 89.30 91.27 77.15 65.55 89.62 82.02 91.27 66.23 59.21 85.94 99.44

0.13 0.89 0.56 0.39 0.21 0.62 0.18 0.52 0.45 0.49 0.17 0.81

† †   †  †  †  † †

cosEMT-BGA Mean ± Std 74.62 89.77 91.97 77.84 65.58 91.92 82.34 91.77 66.49 59.89 86.04 99.67

± ± ± ± ± ± ± ± ± ± ± ±

0.08 0.65 0.19 0.14 0.59 0.38 0.33 0.14 0.62 0.51 0.38 0.52


The convergence trends of the five comparison algorithms on the Default of Credit Card dataset and the Bank Marketing dataset are depicted in Fig. 2(a) and Fig. 2(b), respectively. We can see that cosEMT, MEML, and SBPSO perform better than the basic EC algorithm BGA on both datasets, and cosEMT obtains the fastest convergence and the optimal solution. We can observe that multitasking frameworks such as cosEMT and MEML have obtained different degrees of improvement compared to the corresponding single-task algorithms, which shows that the auxiliary tasks formed by stratified sampling can effectively reduce the computational cost and help the optimization process of the original large dataset through knowledge transfer. Since auxiliary tasks only produce a fraction of the original computational cost of evaluating the whole dataset, they provide an effective channel to explore the model configurations of the main task. At the same time, cosEMT is superior to MEML both in convergence speed and solution quality on most datasets, which suggests that the multitasking framework with the use of the surrogate model can effectively reduce the cost of fitness evaluation.

Fig. 2. Comparison of the convergence trends of the five algorithms on two representative datasets.

In Fig. 3, we can observe that cosEMT-BGA can achieve the same performance level at less computational cost than BGA on each dataset (each dataset is represented by an alphabetical abbreviation). Up to a 54.9% reduction in function evaluation calls can be achieved with cosEMT. This further demonstrates the potential of the proposed cosEMT framework to reduce the evaluation cost on large-instance datasets. In practice, using the cosEMT framework can reduce the time required by baseline EAs by several hours or even days, which can greatly improve the efficiency of solving feature selection on large-instance datasets, and even of other complex optimization problems involving large-instance data in the field of machine learning [13,15,16].


Fig. 3. Computational cost savings by using cosEMT framework to achieve the same performance level as BGA (given 4000 function evaluations) on each dataset.

5 Conclusion

In this paper, we investigate how to efficiently apply evolutionary algorithms to solve the feature selection problem on large-instance datasets. First, we tackle feature selection on large-instance datasets via an evolutionary multitasking framework, so that auxiliary tasks with small datasets can assist the optimization on the original large dataset through instance-based transfer learning when all tasks are processed in parallel, and an instance selection mechanism is proposed to ensure the quality of the transferable solutions. Then, by utilizing the numerical relationship between the relative and absolute numbers of transferred instances, a novel dynamic resource allocation strategy is proposed to better utilize limited computational resources, and an adaptive surrogate model update mechanism is adopted to better balance the exploration and exploitation of the base optimizer. A series of experiments has been carried out to showcase the influence of each parameter and component, and the efficacy of cosEMT. Compared with several state-of-the-art evolutionary FS algorithms, our proposed cosEMT framework consumes less computational cost to obtain better solutions on most large-instance datasets. In particular, up to a 54.9% reduction in function evaluation calls can be achieved while still obtaining high-quality solutions. Although our work has achieved good results, there are still several topics for further improvement, for example, the embedding of more powerful optimizers. In addition, extending the Gaussian process model to high-dimensional search spaces will be another focus of our future research.

Acknowledgements. This research is supported by the National Natural Science Foundation of China under Grants 62206313 and 62232008.

References

1. Xue, B., Zhang, M., Browne, W.N.: Particle swarm optimization for feature selection in classification: a multi-objective approach. IEEE Trans. Cybern. 43(6), 1656–1671 (2013)


2. Xue, Y., Xue, B., Zhang, M.: Self-adaptive particle swarm optimization for large-scale feature selection in classification. ACM Trans. Knowl. Discovery Data (TKDD) 13(5) (2019)
3. Zhang, N., Gupta, A., Chen, Z., Ong, Y.S.: Evolutionary machine learning with minions: a case study in feature selection. IEEE Trans. Evol. Comput. 26(1), 130–144 (2022)
4. Franco, M.A., Krasnogor, N., Bacardit, J.: Speeding up the evaluation of evolutionary learning systems using GPGPUs. In: Genetic and Evolutionary Computation Conference, GECCO 2010, Proceedings, Portland, Oregon, USA, 7–11 July 2010 (2010)
5. Verma, A., Llorà, X., Goldberg, D.E., Campbell, R.H.: Scaling genetic algorithms using MapReduce. In: International Conference on Intelligent Systems Design and Applications, ISDA 2009 (2009)
6. Chen, Z., Gupta, A., Zhou, L., Ong, Y.S.: Scaling multiobjective evolution to large data with minions: a Bayes-informed multitask approach. IEEE Trans. Cybern., 1–14 (2022)
7. Matheron, G.: Principles of geostatistics. Econ. Geol. 58(8), 1246–1266 (1963)
8. Wang, X., Jin, Y., Schmitt, S., Olhofer, M.: Transfer learning based co-surrogate assisted evolutionary bi-objective optimization for objectives with non-uniform evaluation times. Evol. Comput. 30(2), 221–251 (2022)
9. Wang, X., Jin, Y., Schmitt, S., Olhofer, M.: An adaptive Bayesian approach to surrogate-assisted evolutionary multi-objective optimization. Inf. Sci. 519, 317–331 (2020)
10. Bache, K., Lichman, M.: UCI machine learning repository (2013)
11. Kaggle dataset website. https://www.kaggle.com/datasets/alexteboul/diabetes-health-indicators-dataset
12. Nguyen, B.H., Xue, B., Andreae, P.: A novel binary particle swarm optimization algorithm and its applications on knapsack and feature selection problems. Intell. Evol. Syst. (2017)
13. Telikani, A., Tahmassebi, A., Banzhaf, W., Gandomi, A.H.: Evolutionary machine learning: a survey. ACM Comput. Surv. (CSUR) 54(8), 1–35 (2021)
14. Gupta, A., Ong, Y.S., Feng, L.: Multifactorial evolution: toward evolutionary multitasking. IEEE Trans. Evol. Comput. 20(3), 343–357 (2016)
15. Li, P., Tang, H., Hao, J., Zheng, Y., Fu, X., Meng, Z.: ERL-Re2: efficient evolutionary reinforcement learning with shared state representation and individual policy representation. arXiv preprint arXiv:2210.17375 (2022)
16. Li, P., Hao, J., Tang, H., Zheng, Y., Fu, X.: RACE: improve multi-agent reinforcement learning with representation asymmetry and collaborative evolution. In: International Conference on Machine Learning, PMLR 2023, pp. 19490–19503 (2023)

CAS-NN: A Robust Cascade Neural Network Without Compromising Clean Accuracy

Zhuohuang Chen1, Zhimin He1(B), Yan Zhou1, Patrick P. K. Chan2(B), Fei Zhang3, and Haozhen Situ4

1 School of Electronic and Information Engineering, Foshan University, Foshan 528000, China
[email protected]
2 Shien-Ming Wu School of Intelligent Engineering, South China University of Technology, Guangzhou 510006, China
[email protected]
3 College of Computer and Information Engineering, Henan Normal University, Xinxiang 453007, China
4 College of Mathematics and Informatics, South China Agricultural University, Guangzhou 510642, China

Abstract. Adversarial training has emerged as a prominent approach for training robust classifiers. However, recent research indicates that adversarial training inevitably results in a decline in a classifier's accuracy on clean (natural) data. Robustness is at odds with clean accuracy due to the inherent tension between the objectives of adversarial robustness and standard generalization. Training a single classifier that combines high adversarial robustness and high clean accuracy appears to be an insurmountable challenge. This paper proposes a straightforward strategy to bridge the gap between robustness and clean accuracy. Inspired by the idea underlying dynamic neural networks, i.e., adaptive inference, we propose a robust cascade framework that integrates a standard classifier and a robust classifier. The cascade neural network dynamically classifies clean and adversarial samples using distinct classifiers based on the confidence score of each input sample. As deep neural networks suffer from serious overconfidence problems on adversarial samples, we propose an effective confidence calibration algorithm for the standard classifier, enabling accurate confidence scores for adversarial samples. The experiments demonstrate that the proposed cascade neural network increases the clean accuracies by 10.1%, 14.67%, and 9.11% compared to the advanced adversarial training (HAT) on CIFAR10, CIFAR100, and Tiny ImageNet while keeping similar robust accuracies.

Keywords: Adversarial training · Adversarial learning

1 Introduction

Deep neural networks (DNNs) are notoriously vulnerable to small and imperceptible perturbations in the input data known as adversarial examples [4,12,16].


Adversarial training has been demonstrated to be the most effective approach to improving the accuracy on adversarial examples (robust accuracy) [4]. However, adversarial training leads to an undesirable reduction in the accuracy on natural, unperturbed inputs (clean accuracy) [13,15]. There is an inherent tension between the goals of adversarial robustness and standard generalization due to the fundamental disparities in the features learned through standard training and robust training [13]. Subsequently, several variants of the vanilla adversarial training strategy have been proposed to achieve a better trade-off between clean accuracy and robust accuracy [8,9,14,15]. However, the goals of adversarial robustness are fundamentally at odds with standard generalization [13,15]. Consequently, it is challenging, and perhaps even impossible, to train a single classifier that attains high adversarial robustness without compromising clean accuracy. Despite these recent advances, closing the gap between clean accuracy and robust accuracy remains an open challenge.

The dynamic neural network (NN) is an emerging research topic due to its high efficiency and adaptability [6]. It adapts model structures to allocate appropriate computation according to different inputs during inference, thereby reducing redundant computation on simple samples. Inspired by the concept of adaptive inference in dynamic NNs, we propose a robust cascade neural network (CAS-NN) to alleviate the gap between clean accuracy and adversarial robustness. Rather than employing a single classifier, CAS-NN cascades two classifiers, i.e., a standard classifier for clean samples and a robust classifier for adversarial samples. After obtaining the confidence score of the standard classifier on the input sample, early exiting occurs if the confidence score surpasses a predefined threshold; otherwise, the robust classifier is activated and makes the final decision. CAS-NN employs a confidence-based criterion to determine the adopted classifier. The overconfidence problem of deep neural networks on adversarial samples [4,7] is an obstacle to such a decision paradigm, as adversarial samples obtain high confidences, leading to early exiting. In this paper, we address this problem by designing a confidence calibration on the standard classifier. The main contributions of this paper are as follows.

• We propose a cascade framework to alleviate the gap between clean accuracy and adversarial robustness. It can dynamically handle clean samples and adversarial samples using a standard model and a robust model.
• We propose an effective calibration method that provides low confidence scores for adversarial samples. The calibrated standard neural network outputs predictive scores that accurately reflect the actual likelihood of correctness.
• Experiments on CIFAR10, CIFAR100, and Tiny ImageNet demonstrate that CAS-NN achieves comparable robust accuracy to state-of-the-art adversarial training algorithms while causing minimal or no decline in clean accuracy.

2 Related Work

A variety of works have attempted to alleviate the gap between clean accuracy and adversarial robustness. An effective strategy is dual-domain training whose


loss function integrates both clean and adversarial objectives. Zhang et al. [15] characterized the trade-off by decomposing the robust error into the sum of the natural error and the boundary error. They proposed TRADES, which minimized the cross-entropy loss of clean samples and a KL divergence term that encourages smoothness in the neighborhood of samples. Rade et al. [8] observed that adversarial training resulted in a large margin along certain adversarial directions, leading to a significant drop in clean accuracy. They extended the objective function of TRADES by incorporating an additional helper loss, calculated using helper examples with proper labels in order to reduce the excessive margin. However, combining the optimization tasks of both clean and adversarial objectives into a single model and enforcing complete matching of the feature distributions between adversarial and clean examples may result in sub-optimal solutions. More recent attempts to enhance the trade-off between clean accuracy and adversarial robustness have primarily focused on refining adversarial training by early-stopping [9], additional data [10] and weight initialization for training with large perturbations [11]. Despite these advancements, the problem remains far from being solved, with a substantial gap between clean accuracy and robustness observed in practical image datasets.

3 Preliminaries

Given a classification task with c classes, let f_θ : X → Y represent a deep neural network parameterized by θ, where X ⊆ R^d and Y ⊆ R^c are the input and output spaces. The predicted label of an input sample x is given by arg max_k f_θ(x)_k, where f_θ(x)_k is the k-th component of f_θ(x). The confidence score of an input sample x is defined as the maximum softmax prediction probability, i.e., s(x) = max_k f_θ(x)_k. Adversarial training is formulated as

min_θ E_{x∈D} max_{x′∈B_ε(x)} ℓ(f_θ(x′), y),   (1)

where D = {(x_i, y_i)}_{i=1}^n is the training set and B_ε(x) denotes the l_p-norm ball centered at x with radius ε, which has been extensively employed in recent studies [7,14,15]. The inner maximization problem aims to generate an adversarial sample x′ with respect to the training sample x. Existing adversarial training algorithms approximately optimize the inner maximization problem using a gradient-based iterative solver, e.g., multi-step PGD [7] and CW attacks [2]. Following the terminology in Refs. [8,15], the clean (or natural) and robust (or adversarial) accuracies refer to the test accuracies on clean samples and adversarial samples, respectively.
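For illustration, a minimal PyTorch sketch of the multi-step PGD solver for the inner maximization of Eq. (1) is shown below; the l∞ setting, the step size, and the clamping to a [0, 1] image range are common conventions rather than details taken from this paper.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """l_inf PGD: approximately solves the inner maximization of Eq. (1)."""
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()            # ascent step
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps)   # project to B_eps(x)
        x_adv = x_adv.clamp(0, 1)
    return x_adv.detach()
```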

4 Robust Cascade Neural Network

We propose a robust cascade neural network (CAS-NN) that combines two neural networks (NNs) to achieve both high standard accuracy and robust accuracy.


Fig. 1. Schematic of the robust cascade system.

Specifically, the standard NN is used to classify clean samples, while the robust NN is employed to classify adversarial samples. The route is determined by the confidence score of the input sample, which is measured by the standard NN. The confidence score is defined as the maximum softmax prediction probability of the standard NN. The diagram of the robust cascade system is illustrated in Fig. 1. Given a sample x, the standard NN first makes a prediction and outputs the confidence score s(x) = max_k f_θ(x)_k. If the confidence score surpasses the threshold τ, the input sample is likely to be untainted; the inference then terminates and the system outputs the prediction. If s ≤ τ, the standard NN is not confident in its prediction and the input sample is likely to be contaminated, prompting the activation of the robust NN to deliver the final prediction.
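A minimal sketch of this confidence-based routing is shown below (PyTorch assumed; the function and variable names are illustrative).

```python
import torch

@torch.no_grad()
def cascade_predict(std_nn, rob_nn, x, tau):
    """Confidence-based routing of the cascade: samples whose maximum softmax
    probability under the (calibrated) standard NN exceeds tau exit early;
    the rest are re-classified by the robust NN."""
    probs = torch.softmax(std_nn(x), dim=1)
    conf, pred = probs.max(dim=1)
    route_robust = conf <= tau
    if route_robust.any():
        pred[route_robust] = rob_nn(x[route_robust]).argmax(dim=1)
    return pred, route_robust
```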

4.1 Confidence Calibration

DNNs suffer from a serious overconfidence problem on adversarial samples [4,7]. CAS-NN cannot solely depend on the confidence score of the standard NN to determine the activation of the robust NN; it is necessary to calibrate the confidence score of the standard NN on adversarial samples. The confidence score should be indicative of the actual likelihood of correctness. Since the standard NN's predictions on adversarial samples are incorrect, it should output low confidence scores on these samples. Based on the low confidence scores, adversarial samples will be passed to the robust NN for the final decision. Since the confidence score is defined as the maximum softmax prediction probability, a uniform distribution of outputs results in the lowest confidence score. Therefore, we employ the Kullback-Leibler (KL) divergence to encourage the softmax prediction probability of the standard NN on adversarial samples to approximate a uniform distribution. This approach ensures that the standard NN produces the lowest confidence score for adversarial samples. Given a training set D = {(x_i, y_i)}_{i=1}^n with n samples, the loss function used to calibrate the confidence score of the standard NN is defined as

L_c = E_{x∈D} ( KL(P_u || f_θ(x^(s))) + KL(P_u || f_θ(x^(r))) ),   (2)

where KL denotes the KL divergence, which quantifies the difference between two distributions, and x^(s) and x^(r) refer to the adversarial samples generated by an adversarial attack against the standard NN and the robust NN, respectively.


In this paper, we use the PGD attack [7]. P_u represents a discrete uniform distribution, where P_u(X = k) = 1/c, k ∈ {1, 2, ..., c}.
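The calibration term of Eq. (2) can be sketched in PyTorch as follows; the adversarial batches x^(s) and x^(r) are assumed to have been generated beforehand, e.g., by the PGD sketch above.

```python
import torch
import torch.nn.functional as F

def calibration_loss(std_nn, x_adv_s, x_adv_r, num_classes):
    """Confidence-calibration term of Eq. (2): pushes the standard NN's softmax
    output on adversarial samples (crafted against the standard and the robust
    NN, respectively) towards the uniform distribution P_u."""
    uniform = torch.full((x_adv_s.size(0), num_classes), 1.0 / num_classes,
                         device=x_adv_s.device)
    log_q_s = F.log_softmax(std_nn(x_adv_s), dim=1)
    log_q_r = F.log_softmax(std_nn(x_adv_r), dim=1)
    # F.kl_div(log_q, p) computes KL(p || q), i.e. KL(P_u || f_theta(x_adv))
    return (F.kl_div(log_q_s, uniform, reduction='batchmean')
            + F.kl_div(log_q_r, uniform, reduction='batchmean'))
```

The total training objective of Eq. (3) would then be the clean cross-entropy plus λ times this term.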

4.2 Training of CAS-NN

CAS-NN comprises two neural networks: a standard NN and a robust NN. The robust NN can be trained using any adversarial training algorithm, making our extension compatible with state-of-the-art (SOTA) adversarial training techniques. During the training of the standard NN, confidence regularization should be taken into account, resulting in the loss defined as

L = L_std + λ L_c,   (3)

where L_std is the original loss term of a standard NN. In this paper, we use the softmax cross entropy, which ensures a high confidence score for clean samples. λ serves as a parameter that controls the trade-off between the original loss and the confidence regularization term. Although the training of the standard NN involves adversarial samples, we call it a standard NN because it does not incorporate the labels of adversarial samples during training.

Algorithm 1 outlines the pseudo-code for the training process of CAS-NN. We first train a robust NN f_θr on the training dataset with an adversarial training algorithm. It is important to emphasize that various adversarial training algorithms can be employed to train the robust NN; in this paper, we use standard adversarial training [7]. The calibrated standard NN f_θs is trained by stochastic gradient descent. In each epoch, a mini-batch of samples is randomly selected, and their corresponding adversarial samples are generated through PGD attacks against f_θs and f_θr, as shown in Lines 8 and 9. The parameters of the calibrated standard NN f_θs are optimized based on Eq. (3) (Line 10). B_ε(x) is the ball with radius ε centered at x, where B_ε(x) = {x′ | ‖x′ − x‖_∞ ≤ ε}, and Π_{B_ε(x)} denotes the projection function that maps the adversarial variant back to the ε-ball centered at x.

5 Adaptive Attack Against CAS-NN

Since CAS-NN is composed of two neural networks, an intuitive attack is to generate adversarial samples targeted at either the standard NN or the robust NN. However, neither of these strategies exploits the structure and mechanism of CAS-NN. In this paper, we propose an adaptive attack against CAS-NN in order to better evaluate its robustness. The detailed process is shown in Algorithm 2. The adaptive attack is a gradient-based method that utilizes the model's gradient to search for the perturbation direction. In each iteration, the gradient information is calculated with respect to different models based on the confidence score of the adversarial sample s(x′). Specifically, if the confidence score exceeds the threshold τ, the gradient is calculated based on the robust NN f_θr; otherwise, the gradient is obtained based on the standard NN f_θs. Attack(f_θ, x′) represents a one-step update of the adversarial sample x′ using the gradient calculated by the model f_θ. Any gradient-based attack, such as PGD, CW, and MM, can be utilized to update the adversarial sample.


Algorithm 1. Training of a robust cascade neural network.
Require: D = {(x_i, y_i)}_{i=1}^n: training data; m: batch size; η: learning rate; ε: attack radius; α: attack step size; λ: the trade-off parameter between the original loss and the confidence regularization term; K: number of attack iterations.
Ensure: the cascade NN with a calibrated standard NN (f_θs) and a robust NN (f_θr)
1: train a robust NN f_θr via an adversarial training algorithm
2: randomly initialize the parameters θ_s of the standard NN f_θs
3: repeat
4:   Sample a mini-batch of samples D_b = {x_j, y_j}_{j=1}^m from D
5:   for j = 1, ..., m do
6:     x_j^(s) and x_j^(r) are initialized by a uniformly random point in the l_p-norm ball centered at x_j with radius ε
7:     for k = 1, ..., K do
8:       x_j^(s) = Π_{B_ε(x_j)}(x_j^(s) + α sign(∇_{x_j^(s)} ℓ(f_θs(x_j^(s)), y_j)))
9:       x_j^(r) = Π_{B_ε(x_j)}(x_j^(r) + α sign(∇_{x_j^(r)} ℓ(f_θr(x_j^(r)), y_j)))
10:  θ_s = θ_s − η ∇_{θ_s} E_{(x_j,y_j)∼D_b} ( CE(f_θs(x_j), y_j) + λ KL(P_u || f_θs(x_j^(s))) + λ KL(P_u || f_θs(x_j^(r))) )
11: until training completed

Algorithm 2. Adaptive attack against the robust cascade neural network.
Require: (x, y): target sample; f_θs: the calibrated standard NN; f_θr: the robust NN; τ: the threshold of the confidence score; K: number of attack iterations.
Ensure: adversarial sample x′
1: x′ is initialized by a uniformly random point in the l_p-norm ball centered at x
2: for i = 0, ..., K − 1 do
3:   s(x′) = max_k f_θs(x′)_k
4:   if s(x′) > τ then x′ ← Attack(f_θs, x′)
5:   else x′ ← Attack(f_θr, x′)
return x′

6 Experiments

Datasets. We extensively assess the proposed method on three datasets, i.e., CIFAR10, CIFAR100, and Tiny ImageNet. Both CIFAR-10 and CIFAR-100 contain 50,000 training images and 10,000 test images of size 32 × 32; CIFAR-10 covers 10 classes while CIFAR-100 consists of 100 classes. Tiny ImageNet is a more complex dataset, which is a subset of ImageNet. It comprises 200 classes, with each class containing 500 training images and 50 validation images of size 64 × 64.


Evaluation of Adversarial Robustness. Robust accuracy is evaluated using three adversarial attacks: PGD [7], CW [2], and Minimum-Margin (MM) attacks [3]. PGD is a reliable and computationally efficient method for assessing the robustness of a model. CW exhibits strong attack performance by introducing an alternative loss. Both PGD and CW employ 20 steps. The MM attack searches for the optimal adversarial sample with the minimum margin; it achieves comparable performance to AutoAttack while requiring only 3% of the computational time. The target selection numbers of the MM attack are set to 3 and 9. As CAS-NN comprises two neural networks, we employ three strategies to attack CAS-NN: adversarial samples are generated according to the standard (Std) NN, the robust (Rob) NN, and the adaptive (Ada) path in Sect. 5. The attack radius ε is 8/255 for all attacks.

Implementation Details. The calibrated standard NN is trained according to Algorithm 1. The robust NN in CAS-NN is trained with adversarial training [7], where the adversarial samples are generated by PGD20 under the l∞ norm. Both the standard NN and the robust NN share the same architecture and are optimized by an SGD optimizer with 0.9 momentum. The learning rate and weight decay are 0.02 and 5 × 10^−4, respectively. We use NF-ResNet18 [1] as the backbone of the neural network. The thresholds τ of CAS-NN are set to 0.2, 0.1, and 0.1 for CIFAR10, CIFAR100, and Tiny ImageNet, respectively. The trade-off parameter λ between the original loss and the confidence regularization term is set to 1. The batch size m, attack step size α, and number of attack iterations K are 256, 2/255, and 10, respectively. Each experiment is independently run five times, and the average result is reported.

6.1 Comparisons with Prominent Adversarial Training Algorithms

Table 1 reports the clean accuracy and robust accuracy of CAS-NN and some prominent adversarial training algorithms. A high clean accuracy and robust accuracy indicate the model can well classify clean samples and adversarial samples, respectively. It can be observed that current adversarial training algorithms exhibit a significant decrease in clean accuracy compared to the standard training method (ST). Specifically, AT, TRADES, and HAT exhibit an average degradation of 14.21%, 13.44%, and 12.54% across three datasets, respectively. However, CAS-NN achieves comparable clean accuracy to ST on CIFAR10 and CIFAR100, with only a decrease of 3.23% on Tiny ImageNet. We present the robust accuracies of CAS-NN against adversarial attacks based on the standard (Std) NN, robust (Rob) NN, and adaptive (Ada) path. CAS-NN achieves high robust accuracies on adversarial samples generated based on the standard NN. The result is intuitive as the adversarial samples are classified by the robust NN which can well classify adversarial samples generated based on the standard NN. In the case of PGD and CW attacks, CAS-NN has similar robust accuracies on adversarial samples generated based on the robust NN and the adaptive path. However, CAS-NN exhibits 11.71 to 19.96 % lower robust accuracies against the adaptive MM attack compared to the one targeted


on the robust NN. The adversarial attack with an adaptive path proves to be the most effective among all the datasets. In the subsequent experiments, we only consider the adaptive attack, as it is the most effective attack against CAS-NN. CAS-NN exhibits comparable or even higher robust accuracies than AT, TRADES, and HAT across the four adversarial attacks (PGD20, CW20, MM3, and MM9). This indicates that CAS-NN can notably increase clean accuracy without compromising robust accuracy.

Table 1. Clean accuracy (%) and robust accuracy (%) of different robust classifiers on CIFAR10, CIFAR100 and Tiny ImageNet. Robust accuracy is measured by different attacks, including PGD20 [7], CW20 [2], MM3 [3] and MM9 [3]. ST denotes standard training. For CAS-NN, robust accuracy is reported for attacks based on the standard NN (Std), the robust NN (Rob), and the adaptive path (Ada). ↑ suggests that higher values correspond to better performance.

CIFAR10
  ST           clean 92.47 | PGD20 0.02  | CW20 0.01  | MM3 0.00  | MM9 0.00
  AT [7]       clean 81.58 | PGD20 46.73 | CW20 45.78 | MM3 41.82 | MM9 40.85
  TRADES [15]  clean 80.90 | PGD20 48.10 | CW20 44.61 | MM3 42.43 | MM9 42.33
  HAT [8]      clean 82.61 | PGD20 46.38 | CW20 43.22 | MM3 41.19 | MM9 41.13
  CAS-NN       clean 92.71 | PGD20 Std 81.50 / Rob 46.73 / Ada 46.73 | CW20 Std 81.49 / Rob 45.80 / Ada 45.78 | MM3 Std 78.59 / Rob 55.38 / Ada 43.67 | MM9 Std 78.43 / Rob 54.47 / Ada 40.54

CIFAR100
  ST           clean 73.33 | PGD20 7.44  | CW20 0.00  | MM3 0.00  | MM9 0.00
  AT [7]       clean 56.30 | PGD20 24.86 | CW20 23.79 | MM3 21.32 | MM9 20.58
  TRADES [15]  clean 56.83 | PGD20 25.14 | CW20 21.67 | MM3 19.29 | MM9 18.84
  HAT [8]      clean 57.91 | PGD20 24.57 | CW20 21.14 | MM3 19.42 | MM9 19.38
  CAS-NN       clean 72.58 | PGD20 Std 56.12 / Rob 24.86 / Ada 24.86 | CW20 Std 56.06 / Rob 23.81 / Ada 23.80 | MM3 Std 50.23 / Rob 42.52 / Ada 24.73 | MM9 Std 49.90 / Rob 41.83 / Ada 21.87

Tiny ImageNet
  ST           clean 60.56 | PGD20 8.04  | CW20 0.00  | MM3 0.00  | MM9 0.00
  AT [7]       clean 45.84 | PGD20 18.95 | CW20 16.72 | MM3 14.38 | MM9 13.99
  TRADES [15]  clean 48.30 | PGD20 18.73 | CW20 14.42 | MM3 12.33 | MM9 11.91
  HAT [8]      clean 48.22 | PGD20 17.01 | CW20 13.25 | MM3 11.54 | MM9 11.27
  CAS-NN       clean 57.33 | PGD20 Std 45.75 / Rob 18.95 / Ada 18.86 | CW20 Std 45.40 / Rob 16.72 / Ada 16.72 | MM3 Std 39.10 / Rob 31.70 / Ada 16.94 | MM9 Std 38.75 / Rob 31.32 / Ada 15.03

6.2 Performance of Confidence Calibration

A well-calibrated classifier ensures that the average confidence of a model aligns closely with its actual accuracy. The effectiveness of confidence calibration is evaluated using the Expected Calibration Error (ECE) [5], which measures how closely the average confidence of a model aligns with its actual accuracy. A better-calibrated classifier exhibits a lower ECE value. Table 2 shows the ECE values of the standard NN and the calibrated NN on adversarial samples. The standard NN has very high ECE values (greater than 89%) on adversarial samples. However, after confidence calibration, the average values of ECE decrease to 2.26%, 0.62%, and 1.92% in CIFAR10, CIFAR100, and Tiny ImageNet, respectively. The calibrated NNs achieve ECE values lower than 3.22% across all datasets. After confidence calibration, the standard NN outputs low confidence scores for adversarial samples, accurately reflecting their actual accuracy. The majority of adversarial samples have confidence scores below the threshold, triggering the


activation of the robust classifier to make a final decision for these samples. Table 3 shows the percentage of samples whose confidence scores given by the calibrated standard NN fall below the threshold. The majority of adversarial samples generated by PGD, CW, MM3, and MM9 fall below the threshold and are consequently transferred to the robust NN. Almost all clean samples have confidence scores exceeding the threshold, resulting in their classification by the standard NN.

Table 2. Expected Calibration Error (ECE) of the proposed confidence calibration on adversarial samples generated by PGD, CW, MM3, and MM9.

Attack  CIFAR10 (Standard NN / Calibrated NN)  CIFAR100 (Standard NN / Calibrated NN)  Tiny ImageNet (Standard NN / Calibrated NN)
PGD     98.15 / 2.71                           89.33 / 0.75                            91.85 / 1.50
CW      97.22 / 3.22                           97.80 / 0.75                            92.82 / 1.76
MM3     98.07 / 1.47                           99.04 / 0.48                            92.82 / 2.41
MM9     98.49 / 1.63                           99.05 / 0.49                            92.20 / 2.01

Table 3. Percentage of samples (%) that are classified by the robust NN.

Datasets        Clean   PGD     CW      MM3     MM9
CIFAR10         0.00    100.00  99.98   99.93   99.88
CIFAR100        0.00    100.00  99.93   99.83   99.74
Tiny ImageNet   0.87    100.00  99.90   99.90   99.88
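For completeness, a standard binned ECE computation is sketched below; the bin count is an implementation choice (the paper follows [5]) and the function name is ours.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=15):
    """Binned Expected Calibration Error: the weighted average gap between
    mean confidence and accuracy over equally spaced confidence bins."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        in_bin = (confidences > lo) & (confidences <= hi)
        if in_bin.any():
            gap = abs(correct[in_bin].mean() - confidences[in_bin].mean())
            ece += in_bin.mean() * gap
    return ece
```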

6.3 Margin Along the Initial Adversarial Direction

Previous research [8] demonstrates that adversarial training leads to an excessive increment in the margin along initial adversarial directions to attain robustness, which prevents the neural network from using high discriminative features in those directions, leading to a substantial drop in clean accuracy. As described in Ref. [8], there exists a connection between the augmentation of the margin along initial adversarial directions and the reduction in clean accuracy. In contrast to adversarial training, the calibrated standard NN in CAS-NN does not require precise classification of adversarial samples, and its objective function does not involve the one-hot label vectors of the adversarial sample. Consequently, the confidence calibration has minimal impact on the margins between the training samples and the decision boundary. Figure 2 shows the margin between clean samples and the decision boundary along initial adversarial directions. The horizontal axes denote the margins between clean samples and the decision boundary of the standard NN while the vertical axes are the margins between these samples and the decision boundaries of the adversarially trained NN (the upper row) and the calibrated standard NN (the bottom row). The standard NN typically exhibits margins ranging from 0 to


2, with an average margin of 1.36. However, after adversarial training, the margins have a substantial increment, resulting in an average margin of 11.56, which is 8.5 times greater than that of the standard NN. For the calibrated standard NN, there is either no or only a slight increment in the margin across different datasets. The average margin across the three datasets is 1.55, suggesting that confidence calibration minimally alters the geometry of the decision boundary, thereby exerting little effect on clean accuracy.

Fig. 2. Margins between clean samples and the decision boundaries of standard NN, adversarially trained NN and calibrated standard NN along initial adversarial directions. Each dot denotes a clean sample.

6.4 Analysis of the Hyper-parameter

Figure 3 illustrates the accuracies of CAS-NN on clean samples and adversarial samples as well as the pass rates for different values of the hyper-parameter τ . We are only interested in the pass rate of adversarial samples that are misclassified by the standard NN. With an increasing threshold, a greater number of samples exhibit confidence scores below the threshold, resulting in an increased pass rate. When the threshold is 1.0, all samples are classified by the robust NN, resulting in CAS-NN functioning as the traditional adversarial training algorithm. We can observe a significant degradation of clean accuracy in this setting. If the threshold is set to 0, all samples are classified by the standard NN, rendering CAS-NN comparable to the classifier with standard training. In this setting, CAS-NN has the highest clean accuracy but the lowest robust accuracy.


For CIFAR10, the confidence scores of the majority of clean samples surpass 0.6, whereas the confidence scores of all adversarial samples fall below 0.2. Despite a small fraction of clean samples being classified by the robust NN with thresholds of 0.6, 0.7, 0.8, or 0.9, the clean accuracy has no degradation as the robust NN accurately classifies these samples. CAS-NN attains identical clean accuracy and robust accuracy when the threshold is within the range of [0.2, 0.9]. However in the more challenging tasks such as CIFAR100 and Tiny ImageNet, a higher threshold substantially amplifies the pass rate of clean samples, consequently resulting in a more pronounced decline in clean accuracy. When the thresholds are 0.2, 0.1, and 0.1 for CIFAR10, CIFAR100, and Tiny ImageNet, respectively, the pass rates of adversarial samples reach 1.0, suggesting that the confidence scores of adversarial samples are significantly reduced through confidence calibration. Based on the aforementioned observations, the pass rate can guide the setting of the threshold. Under the premise of ensuring a high pass rate of adversarial samples, the threshold should be set as small as possible to achieve a high clean accuracy.

Fig. 3. Accuracies (%) of CAS-NN on clean samples and adversarial samples generated by PGD, CW, MM3, and MM9 as well as the pass rates (%) for different thresholds τ .

7 Conclusion

Considering the inherent contradiction between the adversarial robustness and standard generalization, we propose a novel strategy called CAS-NN, to bridge the gap between robustness and standard accuracy. CAS-NN dynamically classifies clean samples and adversarial samples with different neural networks according to the confidence scores of these samples. In order to mitigate the overconfidence problem in adversarial samples, we designed a confidence calibration


algorithm. Experimental results on CIFAR10, CIFAR100, and Tiny ImageNet show that CAS-NN achieves comparable or slightly lower clean accuracy compared to the standard neural network while attaining robust accuracy similar to that of existing adversarial training algorithms. In future work, we will develop more strategies based on adaptive inference to construct a safety-critical system.

Acknowledgements. This work is supported by Guangdong Basic and Applied Basic Research Foundation (Nos. 2022A1515140116, 2022A1515010101), National Natural Science Foundation of China (Nos. 61972091, 61802061).

References

1. Brock, A., De, S., Smith, S.L.: Characterizing signal propagation to close the performance gap in unnormalized ResNets. In: ICLR (2021)
2. Carlini, N., Wagner, D.: Towards evaluating the robustness of neural networks. In: S&P, pp. 39–57 (2017)
3. Gao, R., et al.: Fast and reliable evaluation of adversarial robustness with minimum-margin attack. In: ICML, pp. 7144–7163 (2022)
4. Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: ICLR (2015)
5. Guo, C., Pleiss, G., Sun, Y., Weinberger, K.Q.: On calibration of modern neural networks. In: ICML, pp. 1321–1330 (2017)
6. Han, Y., Huang, G., Song, S., Yang, L., Wang, H., Wang, Y.: Dynamic neural networks: a survey. TPAMI 44(11), 7436–7456 (2021)
7. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., Vladu, A.: Towards deep learning models resistant to adversarial attacks. In: ICLR (2018)
8. Rade, R., Moosavi-Dezfooli, S.M.: Reducing excessive margin to achieve a better accuracy vs. robustness trade-off. In: ICLR (2022)
9. Rice, L., Wong, E., Kolter, Z.: Overfitting in adversarially robust deep learning. In: ICML, pp. 8093–8104 (2020)
10. Sehwag, V., et al.: Robust learning meets generative models: can proxy distributions improve adversarial robustness? In: ICLR (2022)
11. Shaeiri, A., Nobahari, R., Rohban, M.H.: Towards deep learning models resistant to large perturbations. arXiv:2003.13370 (2020)
12. Szegedy, C., et al.: Intriguing properties of neural networks. In: ICLR (2014)
13. Tsipras, D., Santurkar, S., Engstrom, L., Turner, A., Madry, A.: Robustness may be at odds with accuracy. In: ICLR (2018)
14. Wang, Y., Zou, D., Yi, J., Bailey, J., Ma, X., Gu, Q.: Improving adversarial robustness requires revisiting misclassified examples. In: ICLR (2020)
15. Zhang, H., Yu, Y., Jiao, J., Xing, E.P., Ghaoui, L.E., Jordan, M.I.: Theoretically principled trade-off between robustness and accuracy. In: ICML, pp. 7472–7482 (2019)
16. Zheng, J., Chan, P.P., Chi, H., He, Z.: A concealed poisoning attack to reduce deep neural networks' robustness against adversarial samples. Inf. Sci. 615, 758–773 (2022)

Multi-scale Information Fusion Combined with Residual Attention for Text Detection

Wenxiu Zhao and Changlei Dongye(B)

Shandong University of Science and Technology, Qingdao, China
[email protected]

Abstract. Driven by deep learning and neural networks, text detection technology has made further developments. Due to the complexity and diversity of scene text, detecting text of arbitrary shapes has become a challenging task. Previous segmentation-based text detection methods can hardly solve the problem of missed detection in complex scene text detection. In this paper, we propose a text detection model that combines residual attention with a multi-scale information fusion structure to effectively capture text information in natural scenes and avoid text omission. Specifically, the multi-scale information fusion structure extracts text features from different levels to achieve better text localisation and facilitate the fusion of text information. At the same time, residual attention is combined with features from high-resolution images to enhance the contextual information of the text and avoid text omission. Finally, text instances are obtained by a binarisation method. The proposed model is very helpful for text detection in complex scenes. Experiments conducted on three public benchmark datasets show that the model achieves state-of-the-art performance.

Keywords: text detection · deep learning · residual attention · multi-scale information fusion

1 Introduction

Text detection is one of the most challenging computer vision tasks, with potential applications in areas such as autonomous driving, scene understanding, and machine translation. With the continuous development of object detection and instance segmentation algorithms, significant progress has also been made in the field of text detection. Recent research has shifted its focus from traditional horizontal or multi-directional text detection to more challenging scene text detection with arbitrary shapes. Compared to multi-directional text detection, scene text detection with arbitrary shapes has a broader application scope. However, significant challenges are still posed by arbitrary shape text detection due to the diversity in text shape, size, angle, and complex background. Compared to text detection algorithms based on manually labeled features, deep learning-based text detection algorithms have the ability to automatically


extract deep semantic features from images, making them more capable of generalization and the mainstream approach for text detection. Currently, text detection methods can be classified into two major categories: segmentation-based text detection algorithms and regression-based text detection algorithms.

Segmentation-based text detection models typically combine pixel-level predictions with post-processing algorithms to obtain bounding boxes. Zhang et al. detected multi-directional text using a semantic segmentation algorithm [1], while Liao et al. proposed a novel architecture capable of detecting and recognizing text instances of arbitrary shapes [2]. PixelLink [3] has been used to detect small text by connecting pixels to form text boxes using pixel-level connectivity. PAN [4] uses pixel clustering algorithms to predict pixel clusters in the center of text cores and uses clustering algorithms in post-processing to reconstruct text instances. PSENet [5] is based on the breadth-first search (BFS) algorithm and uses a gradually increasing scale expansion algorithm to successfully predict adjacent text instances. CRAFT [6] allows for the localization of individual character regions, which are then connected to form a single text instance. DBNet [7] conducts adaptive binarization on each pixel and utilises network learning to acquire the binarization threshold; the text box is obtained directly from the network output, which streamlines post-processing. PRPN [8] employs a directional pooling progressive region prediction network to predict the probability distribution of text regions. TextPMs [9] uses the sigmoid alpha function to describe the probability distribution of text region pixels and aggregates text instances using region growth.

To filter potential boxes, deep convolutional networks were employed as strong predictors in some early vertex regression-based methods [10]. CTPN [11] identified tiny text fragments and connected them with an RNN to form a larger text box. TextBoxes [12], an end-to-end rapid scene text detector, only used standard non-maximum suppression as post-processing; in 2018, the authors improved the algorithm to identify text in any direction [13]. DMPNet [14] was a convolutional neural network-based method for identifying text with smaller quadrilaterals. EAST [15] predicted the values of each pixel and regressed the distance of each pixel to the four sides using a U-shaped structure. DeRPN [16] was a candidate region extraction technique based on dimension decomposition; it could decouple the target's width and height to split the detection dimension, minimizing the effect of target shape changes on detection. TextBPN [17] suggested an adaptive boundary model that iteratively deformed text regions using graph neural networks and recurrent neural networks. DRRG [18] initially divided text instances into text components, and then utilized graph convolutional networks to deduce the relationships between text components and adjacent components.

However, these methods do not take into account the complexity and large text aspect ratios of text images, resulting in text omissions (see Fig. 1). To address this issue, we propose a novel text detection network that combines residual attention with a multi-scale information fusion structure. By leveraging these components, our model effectively captures comprehensive textual information in natural scenes and mitigates text omissions.


The main contributions of this paper are summarized as follows:

1. We introduce a text detection model that integrates residual attention with a multi-scale information fusion structure. Experiments demonstrate that this new method significantly enhances text detection in complex scenes on three public benchmark datasets.
2. The multi-scale information fusion structure plays a crucial role in our proposed model by effectively extracting text features from various levels within an image. This fusion of information leads to improved text localization and facilitates the integration of textual details.
3. To overcome text omissions in challenging scenarios, we designed a residual attention module. By leveraging high-resolution image features, this module captures regions of interest and enhances the contextual information of the text, effectively reducing text omissions.

Fig. 1. Some examples of complex scenes and large text aspect ratios.

2 Method

2.1 Network Structure

The proposed network model is shown in Fig. 2. The model consists of four parts: the CNN backbone, the multi-scale feature fusion structure, the residual attention module, and the binary output head. The extraction and fusion network is composed of ResNet and a multi-scale feature fusion structure. The multi-scale feature fusion structure combines feature maps from various levels through channel concatenation and pixel-wise addition to acquire the spatial information of text. Before pixel-wise addition, the residual attention module is embedded in the output branch with the highest resolution to enhance text features. Finally, the final binary map is obtained by predicting the text region using a binarization formula.


Fig. 2. Overview of the proposed network structure.

2.2 Multi-scale Information Fusion Structure

The semantic information contained in different levels of semantic feature maps varies. The low-level large-scale feature map is used to locate the position information of the text, while the high-level small-scale feature map focuses on the boundary information of the text. Therefore, how to fuse different levels of semantic information to obtain the complete text area becomes a key issue. A multi-scale information fusion structure is proposed to combine different feature maps and generate feature maps containing rich text information. In the multiscale information fusion structure, as shown in Fig. 3, there are two operations: feature expansion and feature fusion. Feature expansion concatenates two samesized feature maps to generate a deeper feature map. For text detection tasks, this approach ensures that different text feature information is fully retained in the channels, avoiding missed text features. The up-sampling and element-wise addition operations are used to achieve feature fusion of the expanded feature map, which is beneficial for better classifying text information in the feature map. C2 to C5 represent the feature maps extracted by the network, and P 2 to P 5 represent the information after feature expansion. 2.3
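A rough PyTorch sketch of these two operations is given below. The exact wiring of Fig. 3 is not reproduced here: the lateral 1 × 1 convolutions, the common channel width, and the use of the resized top-down pathway as the second input of each concatenation are assumptions of this sketch.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiScaleFusion(nn.Module):
    """Sketch of 'feature expansion' (channel concatenation of same-sized maps)
    followed by 'feature fusion' (up-sampling + element-wise addition)."""

    def __init__(self, in_chs=(256, 512, 1024, 2048), out_ch=256):
        super().__init__()
        # project C2..C5 to a common width so same-sized maps can be combined
        self.lateral = nn.ModuleList(nn.Conv2d(c, out_ch, 1) for c in in_chs)
        self.expand = nn.ModuleList(nn.Conv2d(2 * out_ch, out_ch, 3, padding=1)
                                    for _ in in_chs)

    def forward(self, feats):               # feats = (C2, C3, C4, C5)
        lat = [l(c) for l, c in zip(self.lateral, feats)]
        p = [None] * len(lat)
        p[-1] = lat[-1]
        for i in range(len(lat) - 2, -1, -1):
            up = F.interpolate(p[i + 1], size=lat[i].shape[-2:], mode='nearest')
            expanded = self.expand[i](torch.cat([lat[i], up], dim=1))  # expansion
            p[i] = expanded + lat[i]        # feature fusion: element-wise add
        return p                            # P2..P5
```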

2.3 Residual Attention Module

Fig. 3. Multi-scale information structure.

In complex scenes, it is difficult to distinguish foreground text from background information, and most of the text has a large aspect ratio, which makes it difficult to locate text information in images. Therefore, we predicted a weight for the highest-resolution output layer to obtain useful details in the high-resolution feature map. Specifically, we introduced a residual attention module and integrated it into the output branch of the highest resolution after feature expansion. The structure of the residual attention module is shown in Fig. 4. C^AVG ∈ R^{C×1×1} is obtained by applying global average pooling (AvgPool) and a shared fully connected layer to the feature map F ∈ R^{C×H×W}, while C^MAX ∈ R^{C×1×1} is obtained by applying global maximum pooling (MaxPool) and the same shared fully connected layer to the feature map F ∈ R^{C×H×W}. Then, the weight vectors of these two channels are merged and weighted to obtain the channel attention vector C ∈ R^{C×1×1}. The channel attention vector C ∈ R^{C×1×1} is multiplied pixel-wise with F ∈ R^{C×H×W}, and the resulting product is added to F ∈ R^{C×H×W} to obtain the final output F′ ∈ R^{C×H×W}. The formula for the residual attention module is shown below as Eq. (1):

F′ = σ( MLP(C^AVG) + MLP(C^MAX) ) × F + F    (1)

where σ represents the sigmoid function and MLP represents the shared multilayer perceptron, containing 1 × 1 convolutions and a ReLU function.
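A minimal PyTorch sketch of Eq. (1) is given below. The channel-reduction ratio inside the shared MLP is an assumption, since the text only specifies 1 × 1 convolutions and a ReLU.

```python
import torch
import torch.nn as nn

class ResidualAttention(nn.Module):
    """Channel attention with a residual connection, following Eq. (1).
    The reduction ratio of the shared MLP is an assumed hyper-parameter."""
    def __init__(self, channels, reduction=4):
        super().__init__()
        # Shared MLP: two 1x1 convolutions with a ReLU in between, applied to
        # both the average-pooled and the max-pooled channel descriptors.
        self.mlp = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
        )
        self.avg_pool = nn.AdaptiveAvgPool2d(1)  # C_AVG in R^{C x 1 x 1}
        self.max_pool = nn.AdaptiveMaxPool2d(1)  # C_MAX in R^{C x 1 x 1}

    def forward(self, feat):
        c_avg = self.mlp(self.avg_pool(feat))
        c_max = self.mlp(self.max_pool(feat))
        attention = torch.sigmoid(c_avg + c_max)  # channel attention vector C
        return attention * feat + feat            # F' = sigma(...) * F + F
```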

Fig. 4. Residual attention module.

2.4 Loss Function

The loss function L of the model is mainly divided into two parts, L_c and L_d. L_c uses the binary cross-entropy (BCE) loss function, and L_d uses the Dice loss. The weight coefficient α was set to 5. In order to balance positive and negative samples and prevent negative samples from dominating the gradient, we used hard negative mining for both the BCE loss and the Dice loss. The formulas are as follows:

L = L_d + α × L_c    (2)

L_d = 1 − 2 Σ_{i∈S_l} (y_i × x_i) / ( Σ_{i∈S_l} |y_i| + Σ_{i∈S_l} |x_i| + ε )    (3)

L_c = Σ_{i∈S_l} ( y_i log x_i + (1 − y_i) log(1 − x_i) )    (4)

where x_i is the predicted value, y_i indicates the true value, and S_l is a sampled set with a ratio of positive to negative samples of 1:3. A short sketch of how the sampled losses can be combined is given below.
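The following sketch illustrates how the hard-negative-mined BCE term and the Dice term could be combined. How exactly the sampled set S_l enters the Dice loss is not spelled out in the text, so restricting the Dice term with a validity mask here is an assumption, and `pred` is assumed to be a probability map in [0, 1].

```python
import torch
import torch.nn.functional as F

def balanced_bce_loss(pred, gt, mask, neg_ratio=3.0, eps=1e-6):
    """BCE over text pixels plus the hardest negatives (1:3 ratio), cf. Eq. (4)."""
    pos = (gt > 0.5) & (mask > 0.5)
    neg = (gt <= 0.5) & (mask > 0.5)
    n_pos = int(pos.sum())
    n_neg = min(int(neg.sum()), int(n_pos * neg_ratio))
    loss = F.binary_cross_entropy(pred, gt, reduction="none")
    pos_loss = loss[pos].sum()
    neg_loss, _ = loss[neg].topk(n_neg)        # hard negative mining
    return (pos_loss + neg_loss.sum()) / (n_pos + n_neg + eps)

def dice_loss(pred, gt, mask, eps=1e-6):
    """Dice loss of Eq. (3), computed on the valid (masked) region."""
    pred, gt = pred * mask, gt * mask
    inter = (pred * gt).sum()
    union = pred.abs().sum() + gt.abs().sum() + eps
    return 1.0 - 2.0 * inter / union

def total_loss(pred, gt, mask, alpha=5.0):
    """L = L_d + alpha * L_c, as in Eq. (2)."""
    return dice_loss(pred, gt, mask) + alpha * balanced_bce_loss(pred, gt, mask)
```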

3 Analysis of Experimental Results

3.1 Dataset

MSRA-TD500 [19] contains 500 natural images; the training set contains 300 images randomly selected from the original dataset, and the remaining 200 images constitute the test set. ICDAR 2015 [20] was publicly collected and released as a benchmark for evaluating text detection algorithms and tracking the latest advances in natural scene text detection, especially the detection of arbitrary text. ICDAR 2015 contains 1,000 training images and 500 test images with a total of 4,500 words, and provides the corresponding ground-truth labels annotated at the word level. MLT-2017 [21] is a dataset containing multiple language types, which is used to verify the robustness of the model, especially for different types of text detection tasks. It contains 7,200 training images and 1,800 test images.

3.2 Experimental Details

The model used ResNet50 as the feature extraction network. To verify the practicality of multiscale information fusion in lightweight networks, ResNet18 was also used for experimental analysis. Due to the distinct characteristics of different datasets, deformable convolution [22] was employed on ResNet18 and ResNet50 networks on the MSRA-TD500 dataset. The learning rate was set to 0.00175, and the batch size was set to 8. During training, the images were cropped to 640 × 640 and data augmentation methods such as random cropping and random rotation were applied. We primarily employed inference speed testing as the key metric to assess the computational efficiency of our model. During the inference phase, we resized input images, adjusting the height to suit each dataset. The model was deployed on an RTX3060 GPU and an Intel(R) Core(TM) i5-10400F CPU. For the inference speed test, we used a batch size of 1.
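As a concrete illustration of this inference-speed protocol (batch size 1, pre-resized inputs, GPU timing), a rough measurement sketch is shown below. The `model` and the pre-resized `images` list are placeholders for the actual setup, and the warm-up count is an assumption.

```python
import time
import torch

@torch.no_grad()
def measure_fps(model, images, device="cuda", warmup=5):
    """Rough frames-per-second measurement with batch size 1."""
    model.eval().to(device)
    for img in images[:warmup]:                       # warm-up iterations
        model(img.unsqueeze(0).to(device))
    torch.cuda.synchronize()
    start = time.time()
    for img in images:                                # timed single-image inference
        model(img.unsqueeze(0).to(device))
    torch.cuda.synchronize()
    return len(images) / (time.time() - start)
```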

3.3 Ablation Experiments and Analysis

We conducted an ablation study to assess the effectiveness of the proposed components. Table 1 demonstrates the ablation results of the model using ResNet18 as the feature extraction network on the MSRA-TD500 dataset. The baseline model data was obtained by re-implementing the DB algorithm. As shown by the quantitative results in Table 1, even the baseline model achieves good performance.

Effectiveness of the multi-scale information fusion structure. Compared to the baseline model, the Baseline MF model uses the multi-scale information fusion structure in the feature fusion stage, which can fully utilize multi-scale semantic information. This module better predicts text features and improves model performance.

Effectiveness of the residual attention module. The Baseline RA model adds the residual attention module on top of the baseline model. The data in Table 1 show that a simple residual attention module can improve the overall performance metrics.

Table 1. Ablation results of the model on the MSRA-TD500 dataset. (Unit: %)

Method        Precision  Recall  F-measure
Baseline      90.4       76.3    82.8
Baseline MF   86.5       80.4    83.3
Baseline RA   88.0       78.7    83.6
Ours          90.7       79.2    84.5

3.4 Comparison with State-of-the-Arts

Table 2 shows the results of the quantitative evaluation on the three datasets and a comparison with other SOTA models. We first report the evaluation results on the MLT-2017 dataset, for which we set the test image height to 736. Using the ResNet18-based model, precision, recall, and F-measure improved by 1.5%, 0.9%, and 1.2%, respectively, compared to DBNet. Compared to PAN [5], precision improved by 3.4%, while recall decreased by 5.1% and F-measure decreased by 0.5%. Using the ResNet50-based model, precision improved by 4.9%, while recall decreased by 0.9% and F-measure improved by 1.3% compared to DBNet. Based on the quantitative results, the detection performance of the proposed model is significantly better than that of previous models. It can be observed from the table that even with the shallow ResNet18, our detection results outperform models that use the deeper ResNet50 as well as PSENet with its complex post-processing. In terms of frames per second (FPS), our model is faster than previous methods on the MLT-2017 dataset while ensuring better accuracy.

Table 2. Comparison of this model with other SOTA models. (Unit: %) (The best results are highlighted in red, with the second-best in blue.)

                    MLT-2017                              ICDAR 2015                            MSRA-TD500
Method              Precision Recall F-measure FPS        Precision Recall F-measure FPS        Precision Recall F-measure FPS
PAN [5]             80        69.8   73.4      -          -         -      -         -          -         -      -         -
PSE [6]             73.8      68.2   70.9      30         86.9      84.5   85.6      1.6        -         -      -         -
TextSnake [8]       -         -      -         -          84.9      80.4   82.6      1.1        83.2      73.9   78.3      1.1
PRPN [10]           -         -      -         -          88.5      83.7   86.0      -          85.2      84.9   84.7      -
TextPMs [11]        81.7      61.5   72.5      -          88.5      84.3   86.1      -          90.9      85.3   86.7      8.3
TextBoxes++ [14]    -         -      -         -          87.2      76.7   81.7      11.6       84.8      79.6   80.6      -
EAST [16]           -         -      -         -          83.6      73.5   78.2      13.2       -         -      -         -
TextBPN [18]        -         -      -         -          90.4      84.0   87.7      -          86.0      84.5   85.2      13
DBNet-resnet18      81.9      63.8   71.7      41         87.7      77.5   82.3      47         90.4      76.3   82.8      62
DBNet-resnet50      83.1      67.9   74.7      19         91.3      80.3   85.4      12         91.5      79.2   84.9      32
DBNet++-resnet18    -         -      -         -          90.1      77.2   83.1      44         90.1      77.2   83.1      55
DBNet++-resnet50    -         -      -         -          90.9      83.9   87.3      10         91.5      83.3   87.3      29
ours-resnet18       83.4      64.7   72.9      44         87.6      79.6   83.4      46         90.7      79.2   84.5      65
ours-resnet50       88.0      67.0   76.0      22         90.2      82.2   86.2      14         91.7      82.4   86.9      30

We next report the evaluation results on the ICDAR2015 dataset. We set the image height to 736 when testing with the ResNet18-based model and adjusted the image height to 1152 when testing with the ResNet50-based model. DBNet++ is a model built on top of DBNet by adding an adaptive feature fusion module that uses a spatial attention mechanism to learn text information. The experimental results demonstrate the performance of the model with different backbone networks (ResNet18, ResNet50). Compared to DBNet, the model's performance is enhanced with both backbone networks; while this results in a slight reduction in frames per second (FPS), it leads to a notable increase in overall accuracy. Compared with the SOTA model DBNet++, our model performs well with ResNet18 as the feature extraction network. The model based on ResNet50 as the feature extraction network performs better than PRPN, with an increase in precision of 1.7%, a decrease in recall of 1.5%, and an increase in F-measure of 0.2%. According to the quantitative experimental results, the ICDAR2015 dataset is difficult because it includes dense text and partially blurred small targets, so the detection difficulty is high and the model's advantages are less pronounced.

Finally, we report the evaluation results on the MSRA-TD500 dataset, again setting the test image height to 736. Compared with DBNet, the ResNet18-based model improved precision by 0.3%, recall by 2.9%, and F-measure by 1.7%. Using the ResNet50-based model, precision decreased by 0.8%, recall increased by 2.0%, and F-measure increased by 0.6%. Compared with the recent method TextBPN, the ResNet50-based model improved precision by 4.3%, decreased recall by 3.3%, and increased F-measure by 0.5%. Importantly, our ResNet50-based model achieved this improved performance while maintaining a superior computing speed compared to TextBPN, highlighting its efficiency. It is worth noting that the MSRA-TD500 dataset contains text of multiple orientations and arbitrary sizes, and our model demonstrated good performance in handling such diverse text scenes.

Fig. 5. Visualization results of the model on MLT-2017.

3.5 Analysis of Visual Results

This section presents the visual results of our model on three different datasets, demonstrating its excellent performance in detecting various types of texts. Figure 5 shows the visualization results of our model on the MLT-2017 dataset. Although this dataset contains complex backgrounds that may affect the detection results, our proposed model exhibits lower miss rates and avoids text fusion compared to DBNet (as shown in the third row of Fig. 5), resulting in visual results that are closer to the ground truth. Figure 6 shows the visualization results of our model on the ICDAR2015 dataset, which contains many blurry and highlighted texts, making the detection task more challenging. Compared with the latest algorithm TextPMs, our proposed model exhibits lower miss rates (as shown in the first and second rows of Fig. 6), and benefits from the residual attention module that helps to better distinguish between fuzzy background and text information, resulting in more accurate detection results.


Fig. 6. Visualization results of the model on ICDAR2015.

Figure 7 shows the visualization results of our model on the MSRA-TD500 dataset, which contains texts of various orientations and sizes that may be easily overlooked (as shown in the first row of Fig. 7). Despite this difficulty, our model still exhibits relatively accurate detection results, demonstrating its effectiveness in detecting different types of text objects.

3.6 Limitations

We have also undertaken an exploration of the limitations inherent in our current model. One such limitation is the model’s susceptibility to missed detections when dealing with densely-packed text instances. Additionally, smaller text instances, as well as those rich in characters, have proven to be particularly challenging and can lead to instances of missed detection. Another limitation arises when handling text instances with irregular angles or skewed orientations, potentially affecting the model’s ability to accurately detect text in such cases. These limitations are in line with the challenges commonly encountered by segmentation-based scene text detectors.


Fig. 7. Visualization results of the model on MSRA-TD500.

4 Conclusion

This paper presents a text detection model that combines residual attention and multi-scale information fusion to address the issue of missed detections in natural scenes. Specifically, the multi-scale information fusion structure integrates semantic information from different levels to enhance text information. The residual attention mechanism is embedded in the highest-resolution output branch to assist in locating text information and to focus the model more on text features. To verify the effectiveness of the model, we evaluate it on three commonly used text detection datasets and compare it with other existing text detection models. Experimental results show that the proposed model achieves excellent detection performance on multiple datasets, especially in some challenging scenarios. Therefore, the text detection model proposed in this paper can provide an effective solution for practical text detection problems. In the future, we will continue to focus on the detection of dense text instances as well as text instances with irregular angles.


References 1. Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., Bai, X.: Multi-oriented text detection with fully convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4159–4167 (2016) 2. Lyu, P., Liao, M., Yao, C., Wu, W., Bai, X.: Mask textspotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 67–83 (2018) 3. Deng, D., Liu, H., Li, X., Cai, D.: Pixellink: detecting scene text via instance segmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018) 4. Huang, Z., Zhong, Z., Sun, L., Huo, Q.: Mask R-CNN with pyramid attention network for scene text detection. In: 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 764–772. IEEE (2019) 5. Wang, W., et al.: Shape robust text detection with progressive scale expansion network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9336–9345 (2019) 6. Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9365–9374 (2019) 7. Liao, M., Wan, Z., Yao, C., Chen, K., Bai, X.: Real-time scene text detection with differentiable binarization. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 11474–11481 (2020) 8. Zhong, Y., Cheng, X., Chen, T., Zhang, J., Zhou, Z., Huang, G.: PRPN: progressive region prediction network for natural scene text detection. Knowl.-Based Syst. 236, 107767 (2022) 9. Zhang, S.-X., Zhu, X., Chen, L., Hou, J.-B., Yin, X.-C.: Arbitrary shape text detection via segmentation with probability maps. IEEE Trans. Pattern Anal. Mach. Intell. 45(3), 2736–2750 (2022) 10. Huang, W., Qiao, Yu., Tang, X.: Robust scene text detection with convolution neural network induced MSER trees. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 497–511. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10593-2 33 11. Tian, Z., Huang, W., He, T., He, P., Qiao, Yu.: Detecting text in natural image with connectionist text proposal network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 56–72. Springer, Cham (2016). https:// doi.org/10.1007/978-3-319-46484-8 4 12. Liao, M., Shi, B., Bai, X., Wang, X., Liu, W.: Textboxes: a fast text detector with a single deep neural network. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31 (2017) 13. Liao, M., Shi, B., Bai, X.: Textboxes++: a single-shot oriented scene text detector. IEEE Trans. Image Process. 27(8), 3676–3690 (2018) 14. Liu, Y., Jin, L.: Deep matching prior network: Toward tighter multi-oriented text detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1962–1969 (2017) 15. Zhou, X., et al.: East: an efficient and accurate scene text detector. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5551– 5560 (2017) 16. Xie, L., Liu, Y., Jin, L., Xie, Z.: Derpn: taking a further step toward more general object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 9046–9053 (2019)


17. Zhang, S.-X., Zhu, X., Yang, C., Wang, H., Yin, X.-C.: Adaptive boundary proposal network for arbitrary shape text detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1305–1314 (2021) 18. Zhang, S.-X., et al.: Deep relational reasoning graph network for arbitrary shape text detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9699–9708 (2020) 19. Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1083–1090. IEEE (2012) 20. Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1156–1160. IEEE (2015) 21. Nayef, N., et al.: ICDAR 2017 robust reading challenge on multi-lingual scene text detection and script identification-RRC-MLT. In: 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 1454– 1459. IEEE (2017) 22. Zhu, X., Hu, H., Lin, S., Dai, J.: Deformable convnets V2: more deformable, better results. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9308–9316 (2019)

Encrypted-SNN: A Privacy-Preserving Method for Converting Artificial Neural Networks to Spiking Neural Networks

Xiwen Luo1, Qiang Fu1(B), Sheng Qin1, and Kaiyang Wang2

1 Guangxi Key Lab of Brain-inspired Computing and Intelligent Chips, School of Electronic and Information Engineering, Guangxi Normal University, Guilin, China
[email protected]
2 International Joint Audit Institute, Nanjing Audit University, Nanjing, China

Abstract. The transformation from Artificial Neural Networks (ANNs) to Spiking Neural Networks (SNNs) presents a formidable challenge, particularly in terms of preserving privacy to safeguard sensitive data during the conversion process. In response to these privacy concerns, a novel Encrypted-SNN approach is proposed for the ANN-SNN conversion. By incorporating noise into the gradients of both ANNs and SNNs, privacy protection can be enhanced without compromising network performance. The proposed method is tested on popular datasets including CIFAR10, MNIST, and Fashion MNIST, achieving accuracies of 88.1%, 99.3%, and 93.0%, respectively. The influence of three distinct privacy budgets (ε = 0.5, 1.0, and 1.6) on the accuracy of the model is also discussed. Experimental results demonstrate that the Encrypted-SNN approach effectively optimizes the balance between privacy and performance. This has practical implications for data privacy protection and contributes to the enhancement of security and privacy within SNNs.

Keywords: Artificial Neural Networks · Spiking Neural Networks · ANN-SNN conversion · Privacy Protection

1 Introduction

Artificial Neural Networks (ANNs) [1] and Spiking Neural Networks (SNNs) [2] are two prominent paradigms in the field of neural networks. ANNs have been widely used in various applications [3–5] due to their ability to approximate complex nonlinear functions and perform tasks such as classification and regression. In contrast to the continuous activation of neurons in ANNs, biological neurons utilize discrete spikes to process and transmit information, where spike time and spike rate are important. SNNs [2,6–8] are biologically more realistic than ANNs, with advantages such as low power consumption, fast reasoning, and event-driven information processing. Given the inherent benefits presented by both ANNs and


SNNs [9,10], the conversion of ANNs to SNNs has attracted considerable attention in research and development efforts [11–15]. Recent studies have shown that SNN can be effectively constructed by approximating the activation value of ANN neurons with the firing rate of spike neurons [11,12,16,17] and [18,19] under the guidance of rate coding. This work not only simplifies the training process of the spike-based learning method described above but also enables SNN to achieve the best-reported results in many challenging tasks. However, privacy issues in ANNs and SNNs have also attracted significant attention. Previous works have explored data privacy in the conversion from ANN to SNN [20–24]. The problem addressed by research [23] is how to balance privacy protection and energy efficiency in neural systems. The primary objective of the proposed method is to create energy-efficient SNNs from pre-trained ANN models while preserving the confidentiality of sensitive information present on the dataset. The approach of [24] combines the principles of differential privacy protection and SNNs, which allows the differentially private SNNs to achieve a powerful balance between preserving privacy and ensuring high accuracy. These existing methods exhibit certain limitations as they only focus on addressing privacy issues during the conversion of ANN [21,24–26], SNN [24], and from ANN to SNN [23], and do not show that SNNs can be converted in the case of encrypted ANN models, which has the potential to enhance privacy protection significantly. This work aims to investigate the privacy issue associated with the conversion from ANN to SNN and proposes an approach to address this problem. The proposed method takes into account both privacy protection and model accuracy in both the encrypted ANN and the converted SNN. It achieves a balance between privacy and utility in the converted Encrypted-SNN. There are three significant contributions to this work. Firstly, by incorporating gradient perturbation twice during the conversion from ANNs to SNNs, we effectively address the privacy concern in the ANN to SNN conversion, thereby safeguarding sensitive information throughout the conversion process. Secondly, we convert the encrypted ANN into an SNN, and experiments demonstrate the feasibility of this conversion method. On the Fashion MNIST dataset, the test accuracy of the SNN surpasses that of the encrypted VGG16 network by three percent. Experiments are also carried out to evaluate the proposed Encrypted-SNN model on the CIFAR10 [27], MNIST [28], and Fashion MNIST [29] public datasets, and the accuracy effects under three different privacy budgets ( = 0.5, 1.0, and 1.6) are given. The experimental results show that Encrypted-SNN provides strong privacy protection while ensuring high performance.

2 Background

This section offers an introduction to SNNs, the concept of privacy protection, and the significance of privacy preservation in SNNs.

2.1 Spiking Neural Networks

To build efficient neuromorphic systems, various SNN training techniques have been proposed. Surrogate gradient learning techniques bypass the non-differentiable problem of Leaky Integrate-and-Fire (LIF) neurons by defining an approximate backward gradient function [30–32]. The ANN-SNN conversion technique uses weight or threshold balancing to convert a pre-trained ANN to an SNN, replacing ReLU with LIF activation [11–15]. Compared with other methods, the SNN generated by conversion has a precision comparable to that of a similar ANN on various tasks. The conversion process of this work involves replacing BatchNorm2d layers with Identity layers, ReLU layers with hardtanh layers, and Linear layers with bias-free Linear layers.

2.2 Privacy Protection

If two datasets differ by only one data instance, they are considered adjacent. In the context of (ε, δ)-differential privacy, we adopt the definition proposed by Dwork et al. [33]. According to this definition, a mechanism M satisfies (ε, δ)-differential privacy if, for any pair of adjacent datasets D and D′ and any event E, the probability of observing event E in the output of mechanism M on dataset D is at most e^ε times the probability of observing event E in the output of mechanism M on dataset D′, plus an additional term δ, i.e.

P[M(D) ∈ E] ≤ e^ε · P[M(D′) ∈ E] + δ.    (1)

This notion of privacy analysis is based on the framework of the Gaussian Differential Privacy (GDP) introduced by Dong et al [34]. In GDP, a mechanism M is said to provide μ-GDP if the distribution of the mechanism’s output on dataset D is statistically indistinguishable from the distribution of a Gaussian noise N (μ,1). Privacy protection plays a crucial role in the realm of SNNs, and studying methods for safeguarding privacy within SNNs holds immense significance. The DPSNN [24] approach has been proposed to combine differential privacy and SNNs, aiming to achieve both accuracy and privacy in the model. By considering factors such as gradient accuracy, neuron type, and time window, DPSNN greatly contributes to privacy preservation. Another noteworthy contribution is the private SNN [23] model, which addresses the challenge of data and class leakage. It ensures robust privacy protection while maintaining high accuracy and low power consumption. These advancements underscore the importance of privacy-preserving SNN training and emphasize the necessity for privacypreserving techniques in SNNs. However, both approaches overlook the encryption of ANNs during the conversion of ANNs to SNNs, potentially compromising privacy in SNNs. This work demonstrates that by introducing gradient perturbations twice during the conversion process, privacy preservation in SNNs can be effectively enhanced.

3 Encrypted-SNN

In this section, we provide an overview of the primary methods employed in this study. Firstly, we introduce the technique of gradient perturbation during the conversion process from ANN to SNN. Secondly, we describe the specific approach utilized for the conversion from ANN to SNN.

Fig. 1. The implementation flow chart of the Encrypted-SNN.

3.1 Privacy-Preserving Methods for ANN and SNN

Introducing gradient perturbation plays a crucial role in enhancing privacy and reducing information leakage during the propagation of ANN and SNN models. A proven method is used to clip the gradient and inject Gaussian noise into it, namely gradient perturbation. Figure 1 shows the two gradient perturbations applied during the conversion from ANN to SNN. The first perturbation is carried out when the VGG16 network becomes the Gra-VGG16 network, which encrypts the network. The second perturbation is applied to the converted SNN to form the Encrypted-SNN. By introducing gradient perturbation in the conversion and training process, sensitive information is effectively protected, so that the network has strong privacy protection.

First, the global sensitivity of the model parameters is computed. The sensitivity quantifies the maximum change in the model's output resulting from a small change in the input data. To calculate sensitivity, the gradients of the loss function with respect to the model parameters are computed by

G = ∇_θ L(f(x; θ), y),    (2)

where G denotes the gradients and ∇_θ denotes the gradient with respect to θ. L denotes the loss function and f(x; θ) denotes the model. Moreover, x and y denote the inputs and outputs, respectively. Then, the L2 norm of the gradient can be calculated by

N = sqrt( Σ_{i=1}^{n} G_i² ),    (3)

where N denotes the norm and G_i denotes the i-th element of the gradient vector. The global sensitivity is then obtained by summing up these norms over the model parameters. The global sensitivity refers to the measure of the model's sensitivity to changes in the input data, indicating how much the model's predictions can vary. Second, the noise standard deviation is calculated using the global sensitivity, i.e.

σ = ϕ / (2 · χ · γ),    (4)

where ϕ represents the computed global sensitivity, χ denotes the size of the training set, and γ acts as a scaling factor. Thirdly, random noise tensors are generated, preserving the same shape as the model parameters. The noise tensors follow a Gaussian distribution with zero mean and standard deviation σ. Subsequently, the generated noise tensors are added to the gradients of the model parameters. To ensure stability, the gradients are clipped to prevent them from exploding. Finally, the backpropagation and optimization steps proceed using the perturbed gradients.
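A compact PyTorch sketch of these steps is shown below. Reading the summed per-parameter gradient norms as ϕ follows the text, while the clipping threshold and the exact point at which clipping is applied are assumptions.

```python
import torch

def perturb_gradients(model, loss, train_set_size, gamma=1.0, clip_norm=1.0):
    """Sketch of the gradient perturbation step: sensitivity, noise, clipping."""
    params = [p for p in model.parameters() if p.requires_grad]
    grads = torch.autograd.grad(loss, params)                 # Eq. (2)
    # Global sensitivity: sum of the L2 norms of per-parameter gradients, Eq. (3).
    sensitivity = sum(g.norm(p=2).item() for g in grads)
    sigma = sensitivity / (2.0 * train_set_size * gamma)      # Eq. (4)
    for p, g in zip(params, grads):
        p.grad = g + torch.randn_like(g) * sigma              # add Gaussian noise
    # Clip the (noisy) gradients to keep them from exploding.
    torch.nn.utils.clip_grad_norm_(params, max_norm=clip_norm)
    # The optimizer step then proceeds with the perturbed gradients, e.g.
    # optimizer.step(); optimizer.zero_grad()
```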

3.2 ANN-SNN Conversion

This work presents a method to convert the VGG16 network into an SNN. The conversion process involves replacing BatchNorm2d layers with Identity layers, ReLU layers with hardtanh layers, and Linear layers with bias-free Linear layers. These operations are performed to make the neural network more suitable for use in SNNs. While BatchNorm2d and ReLU layers are not necessary for SNNs, Linear layers require bias removal.
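The replacement rules can be expressed as a small recursive routine over the modules of the trained VGG16, sketched below. The Hardtanh range (0, 1) and the weight-copying details are assumptions, since the text only names the layer substitutions.

```python
import torch.nn as nn

def convert_for_snn(ann: nn.Module) -> nn.Module:
    """Sketch: BatchNorm2d -> Identity, ReLU -> Hardtanh, Linear -> bias-free Linear."""
    for name, module in ann.named_children():
        if isinstance(module, nn.BatchNorm2d):
            setattr(ann, name, nn.Identity())
        elif isinstance(module, nn.ReLU):
            # A (0, 1) range is assumed here so activations map to firing rates.
            setattr(ann, name, nn.Hardtanh(min_val=0.0, max_val=1.0))
        elif isinstance(module, nn.Linear):
            new_fc = nn.Linear(module.in_features, module.out_features, bias=False)
            new_fc.weight.data.copy_(module.weight.data)  # keep trained weights, drop bias
            setattr(ann, name, new_fc)
        else:
            convert_for_snn(module)  # recurse into containers such as nn.Sequential
    return ann
```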

4 Experiments and Results

This section presents the proposed Encrypted-SNN, along with experiments on the CIFAR10, MNIST, and Fashion-MNIST datasets for the encrypted VGG16 and the SNNs before conversion, and compares the test accuracy with advanced methods in existing works. The training and testing experiments of encrypted SNNs under three privacy budgets (ε = 0.5, 1.0, and 1.6) on public datasets are also discussed.

4.1 Experiments Setup

In this paper, the PyTorch [35] library is used to realize the conversion of ANNs to SNNs. An Nvidia 3090 GPU is used to accelerate the calculation process. The batch size is set to 64 and the learning rate is set to 0.01 and 0.001. The SGD optimizer is used to optimize the model. The proposed method is validated through experiments conducted on three widely used public datasets, i.e. CIFAR10 [27], MNIST [28], and Fashion MNIST [29]. For the conversion method, gradient noise addition and clipping are used to encrypt the ANN, the encrypted ANN is then converted into an SNN, and gradient noise addition and clipping are applied again to encrypt the SNN, which strengthens privacy protection in the process of ANN-to-SNN conversion. For the amount of added noise, the impact on accuracy under three different privacy budgets of ε = 0.5, 1.0, and 1.6 is discussed, and the global sensitivity is calculated to determine the optimal amount of added noise.

4.2 Experiments Results

Table 1. Performance comparisons of different models on different datasets.

Models          CIFAR10   CIFAR100   MNIST   Fashion-MNIST
Gra-VGG16       86.1      60.1       99.1    90.1
SNN             87.3      63.8       99.1    93.1
Encrypted-SNN   88.1      63.0       99.3    93.0

Table 1 indicates that the test accuracy of both the SNN derived from the privacy-preserving VGG16 network with the addition of gradient noise and clipping, and the encrypted SNN, slightly surpasses that of the VGG16 network with privacy protection. On the CIFAR10 and CIFAR100 datasets, the test accuracy has been enhanced from 86.1% and 60.1% in the Gra-VGG16 network, to 88.1% and 63.0% respectively in the Encrypted-SNN. On the MNIST dataset, the test accuracy of the three neural networks shows minimal variation. On the FashionMNIST dataset, the test accuracy of the Gra-VGG16 network is 90.1% and the test accuracy of the Encrypted-SNN is 93.0%. Nevertheless, it should be noted that the test accuracy of the SNN model reaches 93.1%, slightly surpassing the accuracy of the Encrypted-SNN model. This difference can be attributed to the complexity of the Fashion-MNIST dataset. Compared to the relatively MNIST dataset, the Fashion-MNIST dataset is more susceptible to noise and exhibits greater sensitivity to privacy-enhancing measures. Table 1 illustrates that the privacy protection function of the SNNs is enhanced when the ANNs with privacy protection function are converted to privacy-preserving SNNs. The test accuracy of the SNNs is slightly improved.

Fig. 2. Loss and accuracy curves of training and test on the CIFAR10 dataset.

The training and test loss of the three differential networks on the CIFAR10 dataset is shown in Fig. 2(a) and (c). The training and test accuracy of the three differential networks on CIFAR10 is shown in Fig. 2(b) and (d) respectively. Both the Gra-VGG16 and the Encrypted-SNN can ensure high accuracy and shorter time steps. The Encrypted-SNN network demonstrates a 2% improvement in test accuracy on the CIFAR10 dataset compared to the Gra-VGG16 network. When the epoch is twenty, the Encrypted-SNN reaches 85.59%, while the neural


Fig. 3. Loss and accuracy curves of training and test using different epsilon on the CIFAR10 dataset.

network Gra-VGG16 has a test accuracy of 80.30%. The results show that the proposed Encrypted-SNN achieves a good balance between performance and privacy. Figure 3(a) and (c) show the training and test loss under different privacy budgets on CIFAR10, and Fig. 3(b) and (d) show the corresponding training and test accuracy. Moreover, when the privacy budget is increased from 0.5 to 1.6, there is little difference in the performance of the neural network as the training period increases. The Encrypted-SNN exhibits relatively stable accuracy as privacy protection strengthens. Encrypted-SNN provides strong privacy protection while ensuring high accuracy, and achieves 88.1% test accuracy on the CIFAR10 dataset, which is very close to the method of [24]. The test accuracy of the proposed method is not much different from that of the VGG16.

Table 2. Classification accuracy on CIFAR10, CIFAR100.

Approaches       CIFAR10  CIFAR100
VGG16 [36]       89.7     63.4
VGG16 [16]       91.5     62.7
VGG16 [14]       91.4     -
VGG16 [11]       90.9     -
PrivateSNN [23]  89.2     62.3
This work        88.1     63.2

Surprisingly, we find no significant performance loss for Encrypted-SNN. Table 2 shows the performance of the reference ANN, namely VGG16, and of previous conversion methods using training data; representative state-of-the-art conversion methods [11,14,16,23,36] are selected for comparison. The results show that the proposed Encrypted-SNN does not suffer much performance degradation on any dataset. This implies that the ANN model trained with gradient noise and clipping still contains sufficient information to successfully facilitate the conversion to SNNs. As shown in Fig. 4, (a) and (c) are the training and test loss of the three different networks on MNIST, and (b) and (d) are the training and test accuracy of the


Fig. 4. Loss and accuracy curves of training and test dataset on the MNIST dataset.

three differential networks on MNIST. The Encrypted-SNN demonstrates high performance with fewer time steps on both the training and test datasets. This observation highlights the efficiency and effectiveness of the proposed method in achieving accurate results. When the epoch is three, the test accuracy reaches 96.09%. The test accuracy of the Encrypted-SNN on the MNIST dataset is final 99.47%. The Encrypted-SNN achieves a combination of privacy preservation and performance.

Fig. 5. Loss and accuracy curves of training and test dataset using different epsilon on the MNIST dataset.

Table 3. Performance comparisons of different studies for the Encrypted-SNN on different datasets with different epsilons (ε = 0.5, 1.0, and 1.6).

Methods                      ε     MNIST   Fashion-MNIST
DPSNN [24]                   1.6   97.96   87.31
DPSNN [24]                   1.0   98.36   87.93
DPSNN [24]                   0.5   98.63   88.37
Encrypted-SNN (This work)    1.6   99.27   92.83
Encrypted-SNN (This work)    1.0   99.43   92.89
Encrypted-SNN (This work)    0.5   99.47   93.01

As shown in Fig. 5(a) and (c) is the training and test loss of the different epsilon on MNIST. (b) and (d) is the training and test accuracy of the different


Fig. 6. Loss and accuracy curves of training and test dataset on the Fashion-MNIST dataset. (a) and (c) is the training and test loss curves, moreover (b) and (d) is the training and test accuracy curves of the three different models on the Fashion-MNIST dataset.

Fig. 7. Loss and accuracy curves of training and test dataset using different epsilon on the Fashion-MNIST dataset.

epsilon on MNIST. Figure 6 shows that the Encrypted-SNN has little difference in training accuracy and testing accuracy under the three different models, which reflects that the Encrypted-SNN has high privacy protection and good performance. Table 3 presents a comparison of the test accuracy between the method proposed in this study and the method described in [24], under varying privacy budgets. It can be seen that the proposed method performs better than the approach of [24] for different privacy budgets on MNIST and Fashion-MNIST datasets. It is 0.84% higher than the research of [24], and 4.64% higher on the more complex Fashion-MNIST dataset. Figure 6 shows that Encrypted-SNN performs better than the Gra-VGG16 network. Figure 7 shows the experiments of the proposed method on the Fashion-MNIST dataset under three privacy budgets. When the privacy protection is enhanced ( = 1.6), the test accuracy of the proposed method requires slightly more iterations than that of the Gra-VGG16 and SNN to achieve the same performance. However, the final test accuracy is comparable to Gra-VGG16 and SNNs.

5 Conclusion

A novel and comprehensive encryption method, i.e. Encrypted-SNN, is proposed to protect privacy during the process of ANN to SNN conversion. It can safeguard private information effectively. To investigate the influence of the privacy budget on performance, three distinct privacy budgets are considered. This approach


facilitates superior privacy protection without sacrificing accuracy. Our work involves calculating the global sensitivity, gradient clipping, and adding noise to encrypt the ANNs. The same process is also applied to encrypt the subsequently converted SNN. Experimental results demonstrate that these techniques significantly enhance the privacy protection of SNNs. The results indicate that the Encrypted-SNN method successfully strikes a delicate balance between privacy preservation and performance. Future work will focus on exploring aspects such as the impact of incorporating Laplacian and other types of noise on the security and performance of SNNs. Acknowledgement. This research is supported by the National Natural Science Foundation of China under Grant 61976063, the Guangxi Natural Science Foundation under Grant 2022GXNSFFA035028, the research fund of Guangxi Normal University under Grant 2021JC006, the AI+Education research project of Guangxi Humanities Society Science Development Research Center under Grant ZXZJ202205.

References 1. Tavanaei, A., Ghodrati, M., Kheradpisheh, S.R., Masquelier, T., Maida, A.: Deep learning in spiking neural networks. Neural Netw. 111, 47–63 (2019) 2. Pfeiffer, M., Pfeil, T.: Deep learning with spiking neurons: opportunities and challenges. Front. Neurosci. 12, 774 (2018) 3. Liu, J., Qin, S., Luo, Y., Wang, Y., Yang, S.: Intelligent traffic light control by exploring strategies in an optimised space of deep Q-learning. IEEE Trans. Veh. Technol. 71(6), 5960–5970 (2022) 4. Liu, J., Sun, T., Luo, Y., Yang, S., Cao, Y., Zhai, J.: Echo state network optimization using binary grey wolf algorithm. Neurocomputing 385, 310–318 (2020) 5. Deng, M., Ma, H., Liu, L., Qiu, T., Lu, Y., Suen, C.Y.: Scriptnet: a two stream CNN for script identification in camera-based document images. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) ICONIP 2022. CCIS, vol. 1793, pp. 14–25. Springer, Cham (2022). https://doi.org/10.1007/978-981-99-1645-0_2 6. Wu, J., et al.: Progressive tandem learning for pattern recognition with deep spiking neural networks. IEEE Trans. Pattern Anal. Mach. Intell. 44(11), 7824–7840 (2021) 7. Liu, J., et al.: Exploring self-repair in a coupled spiking astrocyte neural network. IEEE Trans. Neural Netw. Learn. Syst. 30(3), 865–875 (2018) 8. Fu, Q., Dong, H.: An ensemble unsupervised spiking neural network for objective recognition. Neurocomputing 419, 47–58 (2021) 9. Xiao, X., Chen, X., Kang, Z., Guo, S., Wang, L.: A spatio-temporal event data augmentation method for dynamic vision sensor. In: Tanveer, M., Agarwal, S., Ozawa, S., Ekbal, A., Jatowt, A. (eds.) ICONIP 2022. CCIS, vol. 1793, pp. 422– 433. Springer, Cham (2022). https://doi.org/10.1007/978-981-99-1645-0_35 10. Liu, J., Huang, X., Huang, Y., Luo, Y., Yang, S.: Multi-objective spiking neural network hardware mapping based on immune genetic algorithm. In: Tetko, I.V., Krková, V., Karpov, P., Theis, F. (eds.) ICANN 2019. LNCS, vol. 11727, pp. 745– 757. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30487-4_58 11. Rueckauer, B., Lungu, I.-A., Hu, Y., Pfeiffer, M., Liu, S.-C.: Conversion of continuous-valued deep networks to efficient event-driven networks for image classification. Front. Neurosci. 11, 682 (2017)


12. Diehl, P.U., Neil, D., Binas, J., Cook, M., Liu, S.C., Pfeiffer, M.: Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing. In: 2015 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2015) 13. Roy, K., Jaiswal, A., Panda, P.: Towards spike-based machine intelligence with neuromorphic computing. Nature 575(7784), 607–617 (2019) 14. Han, B., Srinivasan, G., Roy, K.: RMP-SNN: residual membrane potential neuron for enabling deeper high-accuracy and low-latency spiking neural network. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13 558–13 567 (2020) 15. Li, Y., Deng, S., Dong, X., Gong, R., Gu, S.: A free lunch from ANN: towards efficient, accurate spiking neural networks calibration. In: International Conference on Machine Learning, pp. 6316–6325. PMLR (2021) 16. Sengupta, A., Ye, Y., Wang, R., Liu, C., Roy, K.: Going deeper in spiking neural networks: VGG and residual architectures. Front. Neurosci. 13, 95 (2019) 17. Kim, S., Park, S., Na, B., Yoon, S.: Spiking-yolo: spiking neural network for energyefficient object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 07, pp. 11270–11277 (2020) 18. Hu, Y., Tang, H., Pan, G.: Spiking deep residual networks. IEEE Trans. Neural Netw. Learn. Syst. (2021) 19. Li, Y., Deng, S., Dong, X., Gu, S.: Converting artificial neural networks to spiking neural networks via parameter calibration, arXiv preprint arXiv:2205.10121 (2022) 20. Liu, J., Zhang, S., Luo, Y., Cao, L.: Machine learning-based similarity attacks for chaos-based cryptosystems. IEEE Trans. Emerg. Top. Comput. 10(2), 824–837 (2020) 21. Abadi, M., et al.: Deep learning with differential privacy. In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, pp. 308–318 (2016) 22. Ren, M., Kornblith, S., Liao, R., Hinton, G.: Scaling forward gradient with local losses, arXiv preprint arXiv:2210.03310 (2022) 23. Kim, Y., Venkatesha, Y., Panda, P.: Privatesnn: privacy-preserving spiking neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 1, pp. 1192–1200 (2022) 24. Wang, J., Zhao, D., Shen, G., Zhang, Q., Zeng, Y.: DPSNN: a differentially private spiking neural network, arXiv preprint arXiv:2205.12718 (2022) 25. Cui, Y., Xu, J., Lian, M.: Differential privacy machine learning based on attention residual networks. In: 2023 15th International Conference on Computer Research and Development (ICCRD), pp. 21–26. IEEE (2023) 26. Pasdar, A., Hassanzadeh, T., Lee, Y.C., Mans, B.: ANN-assisted multi-cloud scheduling recommender. In: Yang, H., Pasupa, K., Leung, A.C.-S., Kwok, J.T., Chan, J.H., King, I. (eds.) ICONIP 2020. CCIS, vol. 1332, pp. 737–745. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-63820-7_84 27. Krizhevsky, A., Hinton, G., et al.: Learning multiple layers of features from tiny images (2009) 28. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998) 29. Xiao, H., Rasul, K., Vollgraf, R.: Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms, arXiv preprint arXiv:1708.07747 (2017) 30. Lee, J.H., Delbruck, T., Pfeiffer, M.: Training deep spiking neural networks using backpropagation. Front. Neurosci. 10, 508 (2016)


31. Neftci, E.O., Mostafa, H., Zenke, F.: Surrogate gradient learning in spiking neural networks: bringing the power of gradient-based optimization to spiking neural networks. IEEE Signal Process. Mag. 36(6), 51–63 (2019) 32. Lee, C., Sarwar, S.S., Panda, P., Srinivasan, G., Roy, K.: Enabling spike-based backpropagation for training deep neural network architectures. Front. Neurosci. 119 (2020) 33. Dwork, C., McSherry, F., Nissim, K., Smith, A.: Calibrating noise to sensitivity in private data analysis. In: Halevi, S., Rabin, T. (eds.) TCC 2006. LNCS, vol. 3876, pp. 265–284. Springer, Heidelberg (2006). https://doi.org/10.1007/11681878_14 34. Dong, J., Roth, A., Su, W.J.: Gaussian differential privacy. J. R. Stat. Soc. Ser. B Stat. Methodol. 84(1), 3–37 (2022) 35. Paszke, A., et al.: Automatic differentiation in pytorch (2017) 36. Zambrano, D., Nusselder, R., Scholte, H.S., Bohté, S.M.: Sparse computation in adaptive spiking neural networks. Front. Neurosci. 12, 987 (2019)

PoShapley-BCFL: A Fair and Robust Decentralized Federated Learning Based on Blockchain and the Proof of Shapley-Value

Ziwen Cheng1, Yi Liu1, Chao Wu2(B), Yongqi Pan1, Liushun Zhao3, and Cheng Zhu1(B)

1 National University of Defense Technology, Changsha, China
[email protected]
2 Zhejiang University, Hangzhou, China
[email protected]
3 Xidian University, Xian, China

Abstract. Recently, blockchain-based Federated learning (BCFL) has emerged as a promising technology for promoting data sharing in the Internet of Things (IoT) without relying on a central authority, while ensuring data privacy, security, and traceability. However, it remains challenging to design a decentralized and appropriate incentive scheme that promises a fair and efficient contribution evaluation for participants while defending against low-quality data attacks. Although Shapley-Value (SV) methods have been widely adopted in FL due to their ability to quantify individuals' contributions, they rely on a central server for calculation and incur high computational costs, making them impractical for decentralized and large-scale BCFL scenarios. In this paper, we designed and evaluated PoShapley-BCFL, a new blockchain-based FL approach to accommodate both contribution evaluation and defense against inferior data attacks. Specifically, we proposed PoShapley, a Shapley-value-enabled blockchain consensus protocol tailored to support a fair and efficient contribution assessment in PoShapley-BCFL. It mimics the Proof-of-Work mechanism and allows all participants to compute contributions in parallel based on an improved lightweight SV approach. Following the PoShapley protocol, we further designed a fair-robust aggregation rule to improve the robustness of PoShapley-BCFL when facing inferior data attacks. Extensive experimental results validate the accuracy and efficiency of PoShapley in terms of distance and time cost, and also demonstrate the robustness of our designed PoShapley-BCFL.

Keywords: Blockchain · Proof of Shapley · Federated learning · Contribution Evaluation

Supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA19090105), National Key Research and Development Project of China (2021ZD0110505), National Natural Science Foundation of China (U19B2042), the Zhejiang Provincial Key Research and Development Project (2023C01043 and 2022C03106), the University Synergy Innovation Program of Anhui Province (GXXT2021-004), Academy Of Social Governance Zhejiang University, Fundamental Research Funds for the Central Universities (226-2022-00064), Hunan Province Graduate Innovation Project Fund (Grant No. CX20200042).

1 Introduction

Nowadays, the proliferation of the Internet of Things (IoT) has led to massive data being generated from various sources. As an emerging distributed learning paradigm, federated learning (FL) [30] has been regarded as a promising solution to promote these data sharing and collaboratively training. However, the conflicts between its centralized framework and the increasing scalability of IoT seriously impede its applications in data sharing. Under this situation, the advent of blockchain-based federated learning (BCFL) [4] has ameliorated the shortcomings. Blockchain emerges as a decentralized ledger technology with the potential to revolutionize the distributed learning paradigm from a centralized design to a decentralized point-to-point collaboration paradigm [8,29]. Specifically, BCFL works on a P2P communication network, removing the need for centralized servers [10]. In such settings, FL participants are customarily treated as equal blockchain nodes with extensive functions, such as performing local training, recording related transactions, and then making the leader who wins the consensus competition complete the aggregation steps [17]. In this way, BCFL mitigates the concerns about a single point of failure and scalability [9] while also enhancing the protection for data privacy, ownership, and security [14]. To maintain these advantages of BCFL in the long term and better serve IoT data sharing, an efficient, fair and robust incentive mechanism which could always attract participants with high-quality data is critical. Since BCFL-enabled IoT data-sharing tasks are performed on specific tasks in complex networks without inspecting the original data, one of the most direct and effective incentive approaches is to evaluate participants’ performance in the global model without third parties’ assistance and reward them accordingly. Unfortunately, it is still absent. The Shapley Value (SV) method [1,28] has received much attention due to abilities to quantify the contribution of individuals within a group under the Cooperative Game Theory. It calculates the marginal contributions of each participant in all possible subset consortiums to which it belongs and assigns a weighted sum of marginal contributions as total contribution value to each participant [21], thus, ensuring fairness. This method is also commonly used in FL to evaluate model utility [16,18]. However, the original calculation procedures of SV often incur exponential time concerning the number of participants n, which is not always suited to practical scenarios involving tremendous FL participants, let alone performing a contribution evaluation for participants under the decentralized settings of BCFL. Fairness cannot be guaranteed if the participants upload their self-contribution evaluation results because of self-interest assumptions.


Given the aforementioned dilemmas, a feasible solution for individual performance-based BCFL contribution evaluation is that participants jointly and simultaneously run the calculation of a lightweight Shapley value and mutually oversee without central servers. Fortunately, this idea naturally coincides with the role of blockchain in enabling trusted collaboration with a trusted central authority omitting. More importantly, the consensus mechanism, as one of the critical components of blockchain, defines how different participants collaboratively work to maintain the blockchain networks [2]. With this motivation in mind, we have comprehensively considered combining the blockchain consensus mechanism and the Shapley value, and proposed PoShapley-BCFL, a novel decentralized FL framework with fair and robust contribution evaluation-based incentive designs. Our contributions can be summarized as follows: – We first developed PoShapley-BCFL, a new combination of federated learning and blockchain technology that guarantees a fair and robust learning process. Our approach achieves this goal by extending a blockchain-consensus-enabled SV calculation procedure and an SV-enabled aggregation procedure into the typical FL framework. – We proposed a Shapley-value-enabled blockchain protocol, PoShapley, tailored to support the assessment of contributions in decentralized federated learning processes. Our proposed protocol mimics the Proof-of-Work mechanism and allows all participants to compute a monte-carlo-sampling enabled lightweight Shapley value algorithm in parallel until an agreement is achieved, resulting in a more efficient, trustworthy and fair evaluation process. – Following using the PoShapley protocol, we also developed the fair-robust aggregation method. This method includes a smart-contract-driven client selection process and a Shapley-value-based aggregation process. Specifically, we assigned weights to selected clients based on the ratio of their shapley values, which automatically differentiate low-quality participants and improve model performance when facing inferior data attacks. The remainder of this paper is organized as follows. Section 2 provided related work and preliminaries. We proposed our PoShapley-BCFL in Sect. 3 with the detailed descriptions of the PoShapley consensus protocol and the SV-based aggregation method. After that, we move to experiments in Sect. 4 to demonstrate the performance of our work. Finally, conclusions are summarized in Sect. 5.

2 Related Work and Preliminaries

2.1 Related Work

Incentive Mechanism in BCFL. Existing incentive approaches in BCFL can be broadly categorized into three types: game-based methods [14,20,26,27], auction-based methods [5,11,13], and reputation-based methods [3,12]. Game-based incentives focus on maximizing FL participants' utilities based on Stackelberg games [14], contract-based games [26] or Bayesian games [20,27]. Auction-based incentives usually reward FL participants with the aims of keeping individual


rationality and incentive compatibility [11,13], which are usually adopted in FLenabled data trading systems [5]. Reputation mechanism was introduced in [12] and [3] to promote honest participation in BCFL to earn higher reputation value in blockchain networks. Generally speaking, these value-driven schemes rely on sophisticated utility functions, pricing strategies, or reputation models, which are important in motivating honest participants, ensuring fair compensation, and preventing malicious attacks. However, they often overlook the evaluation of the model itself and typically have high complexity, making them difficult to apply to large-scale and dynamic IoT environments. Shapley-Value-Based Incentive Mechanism in FL. Recently, Shapleyvalue-based contribution evaluation stems from cooperative game theory and has been the focus of research due to its remarkable features of fairness [16]. However, the original SV calculation incurs high computational costs, making it challenging to implement in practice. Various approaches were proposed to reduce the time complexity in FL, including methods that aim to decrease the number of permutations sampling for SV calculation, such as Monte-Carlo (MC) samplingenabled SV methods [6,24]. Other techniques involve using coalition models to minimize individual redundant re-executions, as demonstrated by the Group-SV protocol [18], or training FL sub-models instead of starting from scratch, as in Truncation Gradient Shapley [16,25]. In some works investigating SV methods in BCFL, the paper [15] designs a PoSap protocol to properly reward coins to data owners. The work [22] introduced three Shapley-value-based revenue distribution models for blockchain-enabled data sharing. However, these works did not provide implementations, and thus the feasibility of the proposed scheme is not clear. To this end, our research expanded on the findings of existing works and proposed a consensus mechanism that uses proof of Shapley value to optimize fair and robust decentralized FL. Besides, we provide implementations and experiments to illustrate the feasibility and performances of our work. 2.2

2.2 Preliminaries

This paper considers the common Horizontal Federated Learning framework, in which FL members with different samples share the same feature space. For convenience of presentation, we consider a collaborative learning task with N data owners (i.e., FL participants), each with a private local dataset D_i. During each round t, each participant i downloads the global model w^t and trains it on the local dataset D_i for multiple local epochs to get a local model w_i^{t+1}. Then, the local updates and global aggregation are performed as follows:

\Delta_i^{t+1} = w^t - w_i^{t+1}, \qquad w^{t+1} = w^t + \sum_{i=1}^{N} \frac{|D_i|}{\sum_{j=1}^{N} |D_j|}\, \Delta_i^{t+1}.   (1)
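For concreteness, the size-weighted aggregation in (1) can be sketched in a few lines of NumPy (an illustrative sketch only; the function and variable names are ours, not the paper's, and the aggregated update is returned so that either sign convention for applying it to w^t can be used):

    import numpy as np

    def aggregate_local_updates(w_global, local_models, data_sizes):
        # Delta_i^{t+1} = w^t - w_i^{t+1} for every participant
        deltas = [w_global - w_i for w_i in local_models]
        weights = np.asarray(data_sizes, dtype=float)
        weights /= weights.sum()                      # |D_i| / sum_j |D_j|
        # size-weighted combination of the local updates, as in Eq. (1)
        return sum(a * d for a, d in zip(weights, deltas))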


The original Shapley value is a solution concept from cooperative game theory, which can be defined as:

\varphi_i = \frac{1}{N} \sum_{S \subseteq I \setminus \{i\}} \binom{N-1}{|S|}^{-1} \left[ U(S \cup \{i\}) - U(S) \right],   (2)

where S denotes a subset of the participant set I excluding i, and U(·) is the utility function, which can take any form in FL settings, such as accuracy, loss, or F1 score.
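As a point of reference, the exact computation of (2) can be written directly in Python (a minimal sketch; the utility function is left abstract, and in the FL setting of this paper it would evaluate an aggregated model, e.g., by its F1 score). Its cost grows exponentially with the number of participants, which is precisely the burden that the lightweight sampling in PoShapley avoids:

    from itertools import combinations
    from math import comb

    def exact_shapley(players, utility):
        # utility maps a frozenset of players to a real-valued score U(S)
        n = len(players)
        phi = {}
        for i in players:
            others = [p for p in players if p != i]
            total = 0.0
            for size in range(n):
                for subset in combinations(others, size):
                    s = frozenset(subset)
                    total += (utility(s | {i}) - utility(s)) / comb(n - 1, size)
            phi[i] = total / n
        return phi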

3 The Algorithm Design for PoShapley-BCFL

In this section, we propose a novel blockchain-based serverless federated learning framework named PoShapley-BCFL. It is designed to complete the contribution evaluation of all participants during each iteration of decentralized collaborative training while also preventing attacks from inferior data sources. We reorganized the entire process of a typical FL system and divided PoShapley-BCFL into six procedures. Figure 1 explains the interactions among these procedures, and Algorithm 1 gives the pseudo-code of each procedure. Initially, the data requester releases some parameters as inputs for PoShapley-BCFL, including an initialized global model w^0, an evaluation function U(w), the number of FL participants N, the total number of training rounds T, and the mining-success criterion ρ (i.e., the error threshold for the consensus judgement). After successfully recruiting N participants, PoShapley-BCFL begins to operate.

Fig. 1. The module redesign of the proposed PoShapley-BCFL.


Algorithm 1: PoShapley-BCFL Algorithm
Input: initial FL model w^0, evaluation function U(w), N FL participants, total number of training rounds T, mining-success criterion ρ
Output: final FL model w^T, SVs of all rounds for the N participants

1   for each round t = 0, 1, 2, ..., T−1 do
2       for each participant i in parallel do
3           Procedure Local model training(D_i)
4               Δ_i^{t+1} ← LocalTraining(D_i; w^t)
5           Procedure Upload local updates(i, Δ_i^{t+1})
6               Tx_{t+1,i} = (Δ_i^{t+1}, hash(Δ_i^{t+1}), ID, timestamp)
7               Sig_i(Tx_{t+1,i}) → upload to BC
8       end
9       SV^t = {0, 0, ..., 0} ← initialize the SV value list at round t
10      while mining-success criterion ρ not met do
11          Procedure SV-based consensus and Block mining(w^t, SV^t)
12              for each participant k in parallel do
13                  (B_t, SV^t, M_t) = PoShapley(w^t, U(w), {Δ_i^{t+1}}, SV^t)
14              end
15      end
16      Procedure SV-based Selection and Aggregation(SV^t)
17          for M_t do
18              S_a^t ← SV-based-selection-SC(SV^t, key_{M_t})
19              Δ^{t+1} = SV-based Aggregation({Δ_i^{t+1}}, SV^t)
20              broadcast to BC and to all participants
21          end
22      for each participant i in parallel do
23          Procedure Model Updates(SV^t)
24              w^{t+1} = w^t − ηΔ^{t+1}
25      end
26  end

In Algorithm 1, Lines 2–8 show the local model training process in Procedure 1 and the uploading of local model updates in Procedure 2. Specifically, in each round t, every FL participant i independently obtains the global model w^t and trains it on its local dataset D_i. After some local training iterations, the local model updates Δ_i^{t+1} are generated and transmitted to the blockchain as transactions Tx_{t+1,i}, in which the pair of Δ_i^{t+1} and hash(Δ_i^{t+1}) (Line 6) ensures no tampering during transmission. Then, after all participants finish Procedure 2, a smart contract deployed on the blockchain triggers the release of an SV list initialized to zeros (Line 9), which drives the running of Procedure 3. In Lines 10–15, the SV-based consensus procedure is executed: every FL participant keeps performing the PoShapley algorithm (see details in Sect. 3.1) to compute each participant's contribution in this training round until the mining-success criterion ρ is satisfied. After that, the SV list, one of the outputs of the PoShapley algorithm, is further used in


Procedure 4 (Lines 16–20). The winner of the PoShapley competition at round t adopts a fair and robust SV-based aggregation approach (see details in Sect. 3.2) to obtain the new global model update, which is then broadcast to all participants for the next training round. Lines 22–25 indicate that every participant performs Procedure 5 to update the global model and then restarts Procedure 1.

3.1 The Design of the PoShapley Algorithm

Given the significance of efficiency and attack resistance in contribution evaluation for BCFL systems, we propose a novel blockchain consensus mechanism named PoShapley. This tailored mechanism facilitates an efficient, fair, and robust SV calculation in BCFL, where no trusted central authority exists to evaluate SV utility. The basic concept underlying PoShapley mimics that of PoW [19], replacing the meaningless mathematical puzzle with an improved lightweight SV calculation problem. We present the pseudo-code in Algorithm 2. The initial model w^t represents the initialized global model at training round t+1, which also serves as a benchmark for evaluating the training performance. The utility function U(w) can take multiple forms, including accuracy, loss, recall rate, and F1 score. A complete PoShapley loop proceeds as follows. Before entering the iterative calculation, a participant k first performs the preparations in Lines 2–4: it initializes the SV calculation counter as m_k = 1, computes the utility value of the global model w^t as v_0^{m_k} (i.e., v_0^{m_k} = U(w^t)), constructs an initial ordering L_t of the received models, and initializes an SV list φ with all values of 0. Next, at each iteration m_k, participant k performs Monte Carlo sampling [16] on L_t to build a permutation π_{m_k}^k. By scanning through π_{m_k}^k from the first entry to the last, the j-th FL participant's marginal model contribution can be estimated by participant k following the principle of (3), and then accumulated into the average Shapley value V_{m_k}^{t,k}. We show the calculation steps after disassembling Eq. (3) in Lines 6–12.

\Delta v_j = \mathbb{E}\big[ U\big(s \cup \{\pi_{m_k}^k[j]\}\big) - U(s) \big] = \mathbb{E}\Big[ U\Big( w^t + \sum_{p \in s \cup \{\pi_{m_k}^k[j]\}} \frac{|D_p|}{\sum_{p' \in s \cup \{\pi_{m_k}^k[j]\}} |D_{p'}|}\, \Delta_p^{t+1} \Big) - U\Big( w^t + \sum_{p \in s} \frac{|D_p|}{\sum_{p' \in s} |D_{p'}|}\, \Delta_p^{t+1} \Big) \Big].   (3)

Here, s = π_{m_k}^k[1:(j−1)]. According to the index order within SV^t, V_{m_k}^{t,k} is then reordered and denoted as sv_{m_k}^{t,k} (Lines 13–14). The sv_{m_k}^{t,k} is then used as input for invoking a Judgement Smart-Contract (Line 15), which provides a global signal J_k that indicates whether the loop should be terminated.


Algorithm 2: PoShapley Algorithm
Input: initial FL model w^t, evaluation function U(w), participants' model updates (Δ_1^{t+1}, ..., Δ_n^{t+1}), initial SV^t for all participants, mining-success criterion ρ
Output: SV^t = (φ_1^{t+1}, ..., φ_n^{t+1}) for all participants, new block B_t, the winner M_t

1   for each participant k in parallel do
2       initialize
3           m_k = 1;  v_0^{m_k} = U(w^t)
4           L_t = (Δ_1^{t+1}, ..., Δ_n^{t+1});  φ = {0, 0, ..., 0}  (|φ| = |L_t|)
5       while mining-success criterion not met do
6           π_{m_k}^k ← Monte Carlo sampling of a permutation of L_t
7           for q = 1, 2, ..., |π_{m_k}^k| do
8               S = {π_{m_k}^k[1], π_{m_k}^k[2], ..., π_{m_k}^k[q]}
9               w_S^{t+1} = w^t + Σ_{p∈S} (|D_p| / Σ_{p'∈S} |D_{p'}|) Δ_p^{t+1}
10              v_q^{m_k} = U(w_S^{t+1})
11              φ_{π_{m_k}^k[q]} = ((m_k − 1) φ_{π_{m_k}^k[q]} + v_q^{m_k} − v_{q−1}^{m_k}) / m_k
12          end
13          V_{m_k}^{t,k} = (φ_{π_{m_k}^k[1]}, φ_{π_{m_k}^k[2]}, ..., φ_{π_{m_k}^k[|π_{m_k}^k|]})
14          sv_{m_k}^{t,k} = sort V_{m_k}^{t,k} by index in SV^t
15          J_k ← Judgement-SC(k, sv_{m_k}^{t,k}, ρ)
16          if J_k == True then
17              B_{k,t} ← generate block(Txs, sv_{m_k}^{t,k})
18              broadcast B_{k,t} to all participants
19              if verify(B_{k,t}) == True then
20                  for all participants: stop PoShapley at this round t
21                  blockchain adds B_{k,t}
22                  return SV^t and break
23              end
24          else
25              SV^t ← SV-Update-SC(k, sv_{m_k}^{t,k})
26              m_k = m_k + 1
27          end
28      end
29  end

The automatic judgement operations of this contract are illustrated in Algorithm 3, where the maximum distance between sv_{m_k}^{t,k} and SV^t is calculated. Notably, SV^t refers to the latest average value of all participants' estimated SVs, which is updated through smart contracts deployed in the blockchain system in advance (see Line 25).


Algorithm 3: Judgement Smart-Contract
Input: sv_{m_k}^{t,i}, ρ
Output: J_i

1   if invoke successful then
2       for i automatically do
3           ρ_i = max | sv_{m_k}^{t,i} − SV^t |
4           if ρ_i ≤ ρ then
5               J_i = True
6           else
7               J_i = False
8           end
9       end
10      return J_i
11  end

Algorithm 4: SV-Update Smart-Contract
Input: sv_{m_k}^{t,i}
Output: SV^t

1   if invoke successful then
2       for k automatically do
3           for j = 1, 2, ..., n do
4               SV^t[j] = (SV^t[j] + sv_{m_k}^{t}[j]) / 2
5           end
6       end
7       return SV^t
8   end
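Stripped of the blockchain machinery, the two contracts amount to a simple distance test and a running average. The following plain-Python sketch (our own illustration; the on-chain versions in the paper are written in Go) shows the intended logic:

    import numpy as np

    def judgement(sv_estimate, sv_consensus, rho):
        # Algorithm 3: accept the estimate if its largest deviation from the
        # current consensus average SV^t is within the threshold rho
        gap = np.max(np.abs(np.asarray(sv_estimate) - np.asarray(sv_consensus)))
        return gap <= rho

    def sv_update(sv_estimate, sv_consensus):
        # Algorithm 4: fold the new estimate into the consensus average
        return 0.5 * (np.asarray(sv_consensus) + np.asarray(sv_estimate))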

When the Judgement Smart-Contract returns a True value, i.e., the maximal distance between the average estimated SV and the estimated SV from participant k is no greater than the threshold ρ, participant k becomes a candidate responsible for generating a new block B_{k,t} and broadcasting it (Lines 16–18). B_{k,t} contains all transactions of this training round and its calculation result sv_{m_k}^{t,k}. Lines 19–24 show that if the verification of B_{k,t} passes, all participants stop the PoShapley procedure at this round and append the newest block to the blockchain. Meanwhile, the consensus winner and the final approximated SV results are acknowledged by all participants. Whereas, if the Judgement Smart-Contract returns a False value, participant k invokes the SV-Update Smart-Contract to update SV^t (Line 25). The automatic updating operation, which computes the average of sv_{m_k}^{t,k} and SV^t, is introduced in Algorithm 4. After that, the iteration counter m_k is incremented by one, driving participant k to continue the loop at round t.


Algorithm 5: SV-based-selection Smart-Contract
Input: SV^t, Privatekey_{M_t}
Output: S_a^t

1   if invoke successful then
2       verify identity of M_t
3       if Publickey_{M_t} = f(Privatekey_{M_t}) then
4           for M_t automatically do
5               for i = 1, 2, ..., m do
6                   v = max(SV^t); add the index k of the corresponding participant to form the pair (v, k)
7                   S_a^t = append(S_a^t, (v, k))
8                   SV^t = remove v from SV^t
9               end
10          end
11          return S_a^t
12      else
13          return Error
14      end
15  end

3.2 Fair and Robust Aggregation

By leveraging the results of the PoShapley protocol, we designed the SV-based aggregation (i.e., Line 19 in Algorithm 1) to perform global updates with fairness and tolerance to inferior-data attacks. Specifically, in each round t, after the election of a consensus winner M_t, an SV-based-selection Smart-Contract is triggered by M_t to generate the client set S_a whose corresponding model updates are selected for global model aggregation. The workflow of this contract is illustrated in Algorithm 5. To ensure security and fairness during the smart contract invocation, we first use the RSA encryption algorithm to verify the identity of M_t (Lines 2–3). Each participant's public key is submitted and held in the PoShapley-BCFL system when the collaborative group is formed. The public key of the winner at each round t is encapsulated into the SV-based-selection Smart-Contract as soon as the consensus process finishes. Only the private key of M_t can pass the verification and trigger the automatic selection of S_a, shown in Lines 4–11. That is, the top m values from the list SV^t are selected, and their corresponding local updates are accepted for aggregation, so that modified or maliciously inferior updates are excluded in each global iteration. Here m ≤ N.


After that, the winner M_t performs the fair and robust aggregation using formula (4). Unlike the simple average aggregation in (1), we assign aggregation weights based on the Shapley values of the selected participants. Finally, M_t delivers the newest global model update to the blockchain for the next round of training.

\Delta^{t+1} = \sum_{k \in S_a^t} \frac{SV_k^t}{\sum_{k' \in S_a^t} SV_{k'}^t}\, \Delta_k^{t+1}.   (4)
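The selection and weighting steps can be condensed into a short sketch (illustrative only; it assumes the Shapley values of the selected clients are non-negative, and the names are ours rather than the paper's):

    import numpy as np

    def sv_based_aggregate(sv, deltas, m):
        sv = np.asarray(sv, dtype=float)
        selected = np.argsort(sv)[::-1][:m]           # top-m clients by Shapley value (Algorithm 5)
        weights = sv[selected] / sv[selected].sum()   # SV_k^t / sum_{k in S_a^t} SV_k^t
        # weighted combination of the selected local updates, as in Eq. (4)
        return sum(w * deltas[k] for w, k in zip(weights, selected))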

4 Experimental Results and Evaluations

4.1 Experimental Settings

This section introduces the experimental settings, including the PoShapley-BCFL components in our experimental setup, the dataset settings, the evaluation metrics, and other learning parameters.

PoShapley-BCFL Components in the Experimental Setup. Figure 2 shows the arrangement of components in the experimental setup of our PoShapley-BCFL. We used the Go language (version 1.15.7) and Hyperledger-Fabric-enabled channels, gossip, and gRPC protocols to simulate the p2p communications and public ledgers among FL participants. For ease of implementation of PoShapley, the block structure and its chain-based generation process were reprogrammed in Go. Go was also used to implement the PoShapley-BCFL smart contracts, which were deployed to the PoShapley-BCFL blockchain network using Docker. On the FL participant side, we used Go as the primary programming language, and multithreading settings and goroutine channels were utilized for networking, connections, and coordination among the simulated participants. PyTorch 1.10 with Python 3.6 was used as the local training architecture to implement the learning behaviors in FL, which then communicated with public and trusted storage systems (such as IPFS) for model parameter exchange and storage. For the local experiments on the blockchain, participants invoked the smart contracts through the Go SDK interface and then interacted with the PoShapley-BCFL blockchain network. The PoShapley consensus protocol was implemented jointly in Go and Python and embedded into the PoShapley-BCFL system.

Datasets and Evaluation Metrics. (1) Datasets. The dataset used in the experiments is based on the MNIST dataset. To evaluate the proposed algorithms under different FL settings, we designed IID and NIID FL scenarios with 10 participants as follows:

– IID datasets – Same Size and Same Distribution: The MNIST dataset, which contains 60000 training samples of ten digits and 10000 testing samples, was evenly divided into ten parts as every participant's local training dataset (i.e., each participant has 6000 training samples and 10000 testing samples).


Fig. 2. PoShapley-BCFL components in the experimental setup.

– NIID datasets:
  • NIID-1 – Same Size and Different Distributions: We allocate the same number of MNIST samples to every participant but with different distributions: participants 1 and 2's datasets contain 40% of digits '0' and '1', respectively, while the other 8 participants evenly divide the remaining 20% of digits '0' and '1'. Participants 3 and 4's datasets contain 40% of digits '2' and '3', respectively, while the other 8 participants evenly divide the remaining 20% of digits '2' and '3'. Similar procedures are applied to the rest of the digits.
  • NIID-2 – Different Sizes and Same Distribution: We randomly sample from the entire MNIST dataset following pre-defined ratios: 5% each for participants 1 and 2; 7.5% each for participants 3 and 4; 10% each for participants 5 and 6; 12.5% each for participants 7 and 8; and 15% each for participants 9 and 10.

(2) Evaluation metrics. To comprehensively test the performance of PoShapley, we chose the original Shapley algorithm (Original-SV), following the principle of Eq. (2), as a benchmark. We also used Adjust-SV from [23] and TMC-SV from [7] as comparison approaches. In addition, inspired by [16], we introduced the following evaluation metrics:

– Distance metrics: We used the results of the Original-SV algorithm as a baseline, with the distance metrics measuring the deviation from the results produced by Original-SV. For any participant i, we denote its model


contributions calculated by Original-SV in all training rounds as a vector φ*_i = (φ*_{i,1}, ..., φ*_{i,T}), and the estimated results calculated by any other approach are denoted as φ_i = (φ_{i,1}, ..., φ_{i,T}). Three distances are introduced as follows.
  • Euclidean Distance: The Euclidean Distance for any participant i is defined as

    ED_i = \sqrt{ \sum_{t=1}^{T} (\varphi^*_{i,t} - \varphi_{i,t})^2 }.   (5)

  • Cosine Distance: The Cosine Distance for any participant i is defined as

    CD_i = 1 - \cos(\varphi^*_i, \varphi_i).   (6)

  • Maximum Distance: The Maximum Distance for any participant i is defined as

    MD_i = \max_{t=1,\ldots,T} |\varphi^*_{i,t} - \varphi_{i,t}|.   (7)

– Time analysis: The total time cost of calculating SVs and the time complexity are used to evaluate the efficiency of each approach.
– Accuracy analysis: The accuracy metrics are used to evaluate the effectiveness of our PoShapley-BCFL with the SV-based aggregation rule, particularly in scenarios where adversarial nodes upload inferior model updates.
A short computational sketch of the three distance metrics is given at the end of this subsection.

Other Learning Parameters. We implemented an MLP neural network architecture as the training model and set the learning rate η = 0.01, the total number of training rounds T = 10, and the mining-success criterion ρ = 0.01. As for the evaluation function U(w), since the F1 score can better measure the performance of the models in most scenarios [22], we use it as the measure of contribution, i.e., U(w) = F1(w).
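The three distance metrics in (5)–(7) reduce to a few NumPy lines (an illustrative sketch; names are ours):

    import numpy as np

    def sv_distances(phi_star, phi):
        phi_star, phi = np.asarray(phi_star, float), np.asarray(phi, float)
        ed = np.linalg.norm(phi_star - phi)                    # Eq. (5)
        cd = 1.0 - np.dot(phi_star, phi) / (np.linalg.norm(phi_star) * np.linalg.norm(phi))  # Eq. (6)
        md = np.max(np.abs(phi_star - phi))                    # Eq. (7)
        return ed, cd, md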

4.2 Experimental Results Analysis

Firstly, we analyze the accuracy and time performance of the PoShapley protocol and compare it to state-of-the-art baselines under various FL settings, including both IID and Non-IID (NIID) data silos. Next, we investigate the performance of the PoShapley-BCFL algorithm against inferior-data attacks when using the SV-based aggregation.

1) Accuracy analysis of PoShapley: We analyzed the experimental results under the three aforementioned dataset settings. In each case, 10 participants were involved in BCFL with 10 training rounds, and the average distances of all participants' evaluations under the different dataset settings were calculated to represent the accuracy of PoShapley, as shown in Table 1. Under the IID setting, PoShapley achieves the lowest average distance under all three distance metrics, demonstrating the best contribution-evaluation accuracy. Under the NIID-1 setting, the results show that PoShapley still performs best according to the average distances. Notably, the accuracy gap among the three algorithms is smaller than that under the IID setting.


Table 1. The Average SV Distance.

Dataset  Distance  AdjustSV  TMC_SV  PoShapley
IID      ED        0.0709    0.0921  0.0558
         CD        0.4584    0.4169  0.3229
         MD        0.0464    0.0546  0.333
NIID-1   ED        0.0595    0.0842  0.0520
         CD        0.2877    0.3077  0.2054
         MD        0.0477    0.0639  0.0401
NIID-2   ED        0.0707    0.0617  0.0456
         CD        0.1296    0.1588  0.1020
         MD        0.0560    0.0521  0.0368

The results under the NIID-2 setting show a similar pattern to NIID-1, where PoShapley continues to outperform the Adjust-SV and TMC-SV approaches in terms of average distance. We attribute this advantage to the fact that PoShapley generates more permutations and SV calculation passes because all participants work in parallel. Moreover, Table 2 compares the standard deviation of the SV distances to indicate the stability of the different algorithms. PoShapley achieves the smallest standard deviation of SV distance under all metrics and all datasets, making the SV approximation more stable and fair for all participants. This advantage is more pronounced under the NIID-1 setting, illustrating that PoShapley is well-suited for NIID settings.

Table 2. Standard deviation of SV Distance.

Dataset  Distance   AdjustSV  TMC_SV  PoShapley
IID      Euclidean  0.0244    0.0375  0.0172
         Cosine     0.2391    0.2198  0.2103
         Max        0.0156    0.0392  0.011
NIID-1   Euclidean  0.022     0.0332  0.0134
         Cosine     0.1887    0.2029  0.0855
         Max        0.0165    0.0217  0.0087
NIID-2   Euclidean  0.0314    0.0328  0.0146
         Cosine     0.1569    0.1331  0.11
         Max        0.0304    0.0229  0.014

2) Time cost and complexity analysis: To investigate the time cost of our PoShapley algorithm concerning the number of participants n, we varied n from


Fig. 3. Time costs with respect to the number of participants.

2 to 10 when performing PoShapley and the other compared algorithms under all three dataset settings. The time cost of each method was determined by averaging its time over the three data settings; these values are shown in Fig. 3. The original SV method involves training and evaluating an additional 2^n − 1 models, resulting in exponential growth with the number of participants. In contrast, the other three algorithms significantly reduce the computational time. It is notable that the runtime of TMC-SV is not significantly affected by an increase in the number of participants. This is because TMC-SV uses the Truncated Monte Carlo policy to drop models with a small marginal utility gain and keeps a low number of models in each round; however, TMC-SV performs poorly in terms of accuracy in our experiments. Adjust-SV uses an approximating algorithm during model reconstruction and outperforms PoShapley in terms of time when n ≤ 7. However, Adjust-SV still relies on the principle of Eq. (2) when calculating SV, resulting in exponential time consumption as n increases (e.g., n ≥ 8 in Fig. 3). As n increases, PoShapley saves even more time than Adjust-SV. In each calculation round of PoShapley, every participant trains and evaluates an additional n models at each iteration m_k (Lines 7–12 in Algorithm 2). Assuming the maximum m_k among all participants is m and the total number of training rounds is T, the number of evaluations of PoShapley is O(Σ_{t=1}^{T} mn), which indicates that the time complexity of PoShapley increases linearly with the number of participants.

3) Accuracy Performance of PoShapley-BCFL: In Fig. 4, we investigate the accuracy of PoShapley-BCFL with the SV-based aggregation procedure by comparing it with a typical weighted aggregation based on data size (referred to as size-based aggregation). We also set up two


Fig. 4. Accuracy convergence performance under various settings.

groups for experimental comparison: group 1, with 10 regular participants and no adversarial nodes, and group 2, with 8 regular participants and 2 adversarial participants (which return randomized parameters). From Fig. 4, we can observe that the proposed SV-based aggregation method achieves almost the same model accuracy under all data settings when there are no adversarial participants. Notably, PoShapley-BCFL exhibits faster convergence than size-based aggregation under the NIID-1 scenario, which can be attributed to the SV-based approach giving models with greater contributions more weight in the aggregated model. Moreover, under all three data settings, PoShapley-BCFL significantly improves accuracy compared to the baseline (size-based aggregation) when adversarial participants attack the collaborative learning process. Its performance is nearly as good as in the settings without adversarial nodes under the IID scenario, and slightly worse than that without adversarial nodes under the NIID settings, but it remarkably outperforms the size-based method. This advantage is attributed to the fact that the SV computing process naturally and automatically detects adversarial workers with lower or no contributions, allocating them lower or no


weights and no longer allowing them to participate in the current aggregation round. The above results confirm that the SV-based aggregation procedure is more robust than size-based methods and that PoShapley-BCFL is more effective when encountering malicious attacks.

5 Conclusion

This paper addresses the challenge of providing fair and robust incentives for blockchain-based decentralized federated learning services that support edge data sharing. We presented insights into designing a new blockchain-based serverless federated learning named PoShapley-BCFL, which has a modular design capable of evaluating model contributions while facilitating robust learning. To meet the lightweight calculation requirements and offer self-assessment in a decentralized setting, we proposed PoShapley. This Shapley-value-enabled blockchain consensus protocol provides fair and efficient contribution evaluation. Based on the results from PoShapley, we further design a fair-robust model aggregation algorithm that can tolerate inferior data attacks. Extensive experiments demonstrated that our proposed methods could promote fair and efficient contribution evaluation during decentralized collaborative learning and improve the final model performance through robust aggregation. For future work, since we observe that the marginal model contribution becomes smaller and smaller during the late stage of model convergence, we plan to study adaptive PoShapley, which can adjust the mining-success threshold during the learning process to prevent degradation of Shapley-value results.

References 1. An, Q., Wen, Y., Ding, T., Li, Y.: Resource sharing and payoff allocation in a threestage system: integrating network DEA with the Shapley value method. Omega 85, 16–25 (2019). https://doi.org/10.1016/j.omega.2018.05.008 2. Chen, S., et al.: A blockchain consensus mechanism that uses proof of solution to optimize energy dispatch and trading. Nat. Energy 7(6), 495–502 (2022). https:// doi.org/10.1038/s41560-022-01027-4 3. Chen, Y., et al.: DIM-DS: dynamic incentive model for data sharing in federated learning based on smart contracts and evolutionary game theory. IEEE Internet Things J. (2022). https://doi.org/10.1109/JIOT.2022.3191671 4. Cheng, Z., Pan, Y., Liu, Y., Wang, B., Deng, X., Zhu, C.: Vflchain: blockchainenabled vertical federated learning for edge network data sharing. In: 2022 IEEE International Conference on Unmanned Systems (ICUS), Guangzhou, China, pp. 606–611. IEEE (2022). https://doi.org/10.1109/ICUS55513.2022.9987097. https:// ieeexplore.ieee.org/document/9987097/ 5. Fan, S., Zhang, H., Zeng, Y., Cai, W.: Hybrid blockchain-based resource trading system for federated learning in edge computing. IEEE Internet Things J. 8(4), 2252–2264 (2021). https://doi.org/10.1109/JIOT.2020.3028101 6. Ghorbani, A., Zou, J.: Data Shapley: Equitable Valuation of Data for Machine Learning. No. arXiv:1904.02868 (2019). https://doi.org/10.48550/arXiv.1904. 02868. https://arxiv.org/abs/1904.02868. arXiv:1904.02868


7. Ghorbani, A., Zou, J.: Data shapley: equitable valuation of data for machine learning. In: Chaudhuri, K., Salakhutdinov, R. (eds.) Proceedings of the 36th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 97, pp. 2242–2251. PMLR (2019). https://proceedings.mlr.press/ v97/ghorbani19c.html 8. Hu, D., Chen, J., Zhou, H., Yu, K., Qian, B., Xu, W.: Leveraging blockchain for multi-operator access sharing management in internet of vehicles. IEEE Trans. Veh. Technol. 71(3), 2774–2787 (2022). https://doi.org/10.1109/TVT.2021.3136364 9. Imteaj, A., Thakker, U., Wang, S., Li, J., Amini, M.H.: A survey on federated learning for resource-constrained IoT devices. IEEE Internet Things J. 9(1), 1–24 (2022). https://doi.org/10.1109/JIOT.2021.3095077 10. Issa, W., Moustafa, N., Turnbull, B., Sohrabi, N., Tari, Z.: Blockchain-based federated learning for securing internet of things: a comprehensive survey. ACM Comput. Surv. 55(9), 1–43 (2023). https://doi.org/10.1145/3560816 11. Jiang, L., Zheng, H., Tian, H., Xie, S., Zhang, Y.: Cooperative federated learning and model update verification in blockchain-empowered digital twin edge networks. IEEE Internet Things J. 9(13), 11154–11167 (2022). https://doi.org/10. 1109/JIOT.2021.3126207 12. Kang, J., Xiong, Z., Niyato, D., Zou, Y., Zhang, Y., Guizani, M.: Reliable federated learning for mobile networks. IEEE Wirel. Commun. 27(2), 72–80 (2020). https:// doi.org/10.1109/MWC.001.1900119 13. Li, D., Guo, Q., Yang, C., Yan, H.: Trusted data sharing mechanism based on blockchain and federated learning in space-air-ground integrated networks. Wirel. Commun. Mob. Comput. 2022, 1–9 (2022). https://doi.org/10.1155/2022/5338876 14. Lin, X., Wu, J., Bashir, A.K., Li, J., Yang, W., Piran, M.J.: Blockchain-based incentive energy-knowledge trading in IoT: joint power transfer and Ai design. IEEE Internet Things J. 9(16), 14685–14698 (2022). https://doi.org/10.1109/JIOT.2020. 3024246 15. Liu, Y., Ai, Z., Sun, S., Zhang, S., Liu, Z., Yu, H.: FedCoin: a peer-to-peer payment system for federated learning. In: Yang, Q., Fan, L., Yu, H. (eds.) Federated Learning. LNCS (LNAI), vol. 12500, pp. 125–138. Springer, Cham (2020). https:// doi.org/10.1007/978-3-030-63076-8_9 16. Liu, Z., Chen, Y., Yu, H., Liu, Y., Cui, L.: GTG-shapley: efficient and accurate participant contribution evaluation in federated learning. ACM Trans. Intell. Syst. Technol. 13(4) (2022). https://doi.org/10.1145/3501811 17. Lo, S.K., et al.: Toward trustworthy AI: blockchain-based architecture design for accountability and fairness of federated learning systems. IEEE Internet Things J. 10(4), 3276–3284 (2023). https://doi.org/10.1109/JIOT.2022.3144450 18. Ma, S., Cao, Y., Xiong, L.: Transparent contribution evaluation for secure federated learning on blockchain. In: 2021 IEEE 37th International Conference on Data Engineering Workshops (ICDEW), Chania, Greece, pp. 88–91. IEEE (2021). https:// doi.org/10.1109/ICDEW53142.2021.00023. https://ieeexplore.ieee.org/document/ 9438754/ 19. Nakamoto, S.: Bitcoin: a peer-to-peer electronic cash system (2009). Cryptography Mailing list at https://metzdowd.com 20. Qinnan, Z., Jianming, Z., Sheng, G., Zehui, X., Qingyang, D., Guirong, P.: Incentive mechanism for federated learning based on blockchain and bayesian game. SCIENTIA SINICA Informationis 52(6), 971- (2022). https://doi.org/10. 1360/SSI-2022-0020. https://www.sciengine.com/publisher/ScienceChinaPress/ journal/SCIENTIASINICAInformationis/52/6/10.1360/SSI-2022-0020


21. Shapley, L.S.: A value for n-person games. Contributions to the Theory of Games (1953) 22. Shen, M., Duan, J., Zhu, L., Zhang, J., Du, X., Guizani, M.: Blockchain-based incentives for secure and collaborative data sharing in multiple clouds. IEEE J. Sel. Areas Commun. 38(6), 1229–1241 (2020). https://doi.org/10.1109/JSAC. 2020.2986619 23. Song, T., Tong, Y., Wei, S.: Profit allocation for federated learning. In: 2019 IEEE International Conference on Big Data (Big Data), pp. 2577–2586 (2019). https:// doi.org/10.1109/BigData47090.2019.9006327 24. Touati, S., Radjef, M.S., Sais, L.: A Bayesian Monte Carlo method for computing the shapley value: application to weighted voting and bin packing games. Comput. Oper. Res. 125, 105094 (2021). https://doi-org-s.libyc.nudt.edu.cn:443/10.1016/j. cor.2020.105094. https://www-sciencedirect-com-s.libyc.nudt.edu.cn:443/science/ article/pii/S0305054820302112 25. Wang, T., Rausch, J., Zhang, C., Jia, R., Song, D.: A Principled Approach to Data Valuation for Federated Learning. No. arXiv:2009.06192 (2020). https://doi.org/ 10.48550/arXiv.2009.06192. https://arxiv.org/abs/2009.06192. arXiv:2009.06192 26. Wang, X., Zhao, Y., Qiu, C., Liu, Z., Nie, J., Leung, V.C.M.: Infedge: a blockchainbased incentive mechanism in hierarchical federated learning for end-edge-cloud communications. IEEE J. Sel. Areas Commun. (2022). https://doi.org/10.1109/ JSAC.2022.3213323 27. Weng, J., Weng, J., Huang, H., Cai, C., Wang, C.: Fedserving: a federated prediction serving framework based on incentive mechanism. In: IEEE INFOCOM 2021 - IEEE Conference on Computer Communications, pp. 1–10 (2021). https://doi. org/10.1109/INFOCOM42981.2021.9488807 28. Wu, W., et al.: Consortium blockchain-enabled smart ESG reporting platform with token-based incentives for corporate crowdsensing. Comput. Ind. Eng. 172, 108456 (2022). https://doi.org/10.1016/j.cie.2022.108456 29. Xu, L., Bao, T., Zhu, L.: Blockchain empowered differentially private and auditable data publishing in industrial iot. IEEE Trans. Industr. Inf. 17(11), 7659–7668 (2021). https://doi.org/10.1109/TII.2020.3045038 30. Yang, Q., Liu, Y., Chen, T., Tong, Y.: Federated machine learning: concept and applications. ACM Trans. Intell. Syst. Technol. 10(2), 1–19 (2019). https://doi. org/10.1145/3298981

Small-World Echo State Networks for Nonlinear Time-Series Prediction

Shu Mo2, Kai Hu3, Weibing Li2, and Yongping Pan1(B)

1 School of Advanced Manufacturing, Sun Yat-sen University, Shenzhen 518100, China
  [email protected]
2 School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou 510006, China
  [email protected], [email protected]
3 School of Artificial Intelligence, Sun Yat-sen University, Zhuhai 519000, China
  [email protected]

Abstract. Echo state network (ESN) is a reservoir computing approach for efficiently training recurrent neural networks. However, it sometimes suffers from poor performance and robustness due to the non-trainable reservoir. This paper proposes a novel computational framework for ESNs to improve prediction performance and robustness. A small-world network is applied as the reservoir topology, a biologically plausible unsupervised learning method named dual-threshold Bienenstock-Cooper-Munro learning rule is applied to adjust reservoir weights adaptively, and a recursive least-squares-based composite learning algorithm is introduced to update readout weights. The proposed method is compared with several kinds of ESNs on the Mackey-Glass system, a benchmark problem of nonlinear time-series prediction. Simulation results have shown that the proposed method not only achieves the best prediction performance but also exhibits remarkable robustness against noise.

Keywords: Recurrent neural network · Reservoir adaptation · Small-world network · Time-series prediction · Composite learning

1 Introduction

Reservoir computing (RC) is an effective learning framework based on a specific type of recurrent neural networks (RNNs) and has gained considerable attention in recent years [1]. Echo state network (ESN) is one of the pioneering RC approaches [2] and has been widely studied in the field of neural computation due to its high learning efficiency and impressive performance in various applications, e.g., see [3–10]. ESN is derived from RNNs with a pool of large sparsely interconnected neurons (called the reservoir), an input layer feeding external data to the reservoir, and an output layer weighting reservoir states. The reservoir, consisting of randomly and recurrently connected nonlinear neurons, offers an effective implementation for RNNs. Moreover, the complex dynamics generated by


the reservoir nonlinearly maps input data to spatiotemporal state patterns in a high-dimensional feature space, enabling the linear separation of state vectors of different classes. In this way, ESN effectively avoids gradient explosion/vanishing and reduces computational costs during the training of RNNs. Although ESNs have demonstrated high performance and flexible implementation in many applications, they usually suffer from poor robustness due to the randomly generated and untrained reservoir. Naturally, there is an open question of choosing the reservoir topology to improve the performance and robustness of ESNs. Deng and Zhang [11] proposed a scale-free highly clustered ESN featured by short characteristic path length, high clustering coefficient, scale-free distribution, and hierarchical and distributed architecture, which enhances the echo state property. Xue et al. [12] developed a decoupled ESN comprising multiple sub-reservoirs interconnected by lateral inhibitory mechanisms, which mitigates the coupling effects among reservoir neurons. Rodan et al. [13] proposed a simple cycle reservoir that minimizes the complexity of the reservoir topology and parameter, which achieves performance comparable to the classical ESN. Cui et al. [14] proposed a mixture ESN with a dynamic reservoir topology based on a mixture of small-world and scale-free topology, exhibiting a broader spectral radius while maintaining a comparable short-term memory capacity to the original ESN. Qiao et al. [15] proposed a growing ESN with multiple sub-reservoirs that adaptively adjust the size, topology, and sparsity of the reservoir and showed better prediction performance and stronger robustness compared to several ESNs with the fixed reservoir size and topology. However, the growing ESN is inherently complex in both the network topology and training process compared to the classical ESN. Kawai et al. [16] constructed a reservoir with a small-world topology that enables ESNs to keep a stable echo state property when the spectral radius is more than a unit. Suarez et al. [17] constructed a neuromorphic reservoir endowed with biologically realistic connection patterns and exhibited good performance and robustness even at the edge of chaos. While the aforementioned methods primarily concentrate on the topological optimization of ESNs, neural plasticity, a bio-inspired unsupervised adaptation mechanism, represents another approach to enhance the performance and robustness of ESNs. Yusoff et al. [18] employed two synaptic plasticity rules, including anti-Oja [19] and Bienenstock-Cooper-Munro (BCM) [20] learning rules, to tune reservoir weights adaptively, and showed largely improved prediction and classification performance of ESNs. Wang et al. [21] applied a structural plasticity rule proposed in [22] to simultaneously regulate the connection weights and topology within the reservoir, which expanded the memory capacity and overall stability of ESNs. Patane and Xibilia [23] introduced intrinsic plasticity to adapt reservoir parameters in ESNs to estimate some key variables in a sulfur recovery unit of a refinery plant. Morales et al. [24] applied the synaptic and intrinsic plasticity rules to ESNs and analyzed how unsupervised learning through plasticity rules affects the ESN architecture in a way that boosts its performance.


This paper proposes a novel computational framework for ESNs to improve prediction performance and robustness. The proposed approach involves three key steps. First, a small-world network is applied as the reservoir topology, which is less complex and more biologically plausible than the growing ESN; second, a biologically plausible unsupervised learning method named dual-threshold BCM (DTBCM) learning is introduced to tune reservoir weights adaptively; third, a recursive least-squares (RLS)-based composite learning algorithm is introduced to train readout weights, which can guarantee parameter convergence under a relaxed excitation condition named interval excitation.

Throughout this paper, N+ denotes the set of positive integers, and R, R+, R^k, and R^{n×m} denote the spaces of real numbers, positive real numbers, real k-vectors, and real n × m matrices, respectively, where k, n, m ∈ N+.

2 Small-World Echo State Networks

A block diagram of the proposed small-world ESN is shown in Fig. 1, which consists of three modules: 1) the input layer (K external input neurons), 2) the small-world-topology reservoir layer (N reservoir neurons) with internal weights tuned by the DTBCM learning rule in [25], and 3) the output layer (L readout neurons) with readout weights trained by the RLS-based composite learning algorithm in [10]. In general, the elements of the matrices W_in, W, and W_fb are randomly generated from certain uniform distributions and then kept fixed, while the only trainable weight matrix W_out is initialized to zero. In this study, W is updated by the DTBCM rule. Note that the input neurons to the reservoir neurons, the reservoir to the output neurons, and the output to the reservoir neurons are fully connected, while the reservoir neurons are sparsely connected, which indicates that W is a sparse matrix. For the topology shown in Fig. 1, the state and activation of the reservoir are iteratively updated as follows [2]:

x(k) = \phi(W_{in} u(k) + W x(k-1) + W_{fb} z(k-1)),   (1)
r(k) = (1 - \alpha) r(k-1) + \alpha x(k),   (2)

where φ(·) denotes a nonlinear activation function (typically a hyperbolic tangent function) of the reservoir neurons working in an element-wise manner, and α ∈ (0, 1] is a constant leaky rate used to control the update rate of the reservoir activation. In fact, α is data-sensitive, and its choice significantly affects the performance of ESNs. The network output z(k) is a weighted sum of the reservoir activation r(k), which is calculated as follows:

z(k) = r(k)^T W_{out}(k-1).   (3)
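One update step of (1)–(3) can be sketched as follows (an illustrative NumPy translation; the paper's experiments were run in MATLAB, and the function name is ours):

    import numpy as np

    def esn_step(u, x_prev, r_prev, z_prev, W_in, W, W_fb, W_out, alpha):
        x = np.tanh(W_in @ u + W @ x_prev + W_fb @ z_prev)   # Eq. (1), tanh as phi
        r = (1.0 - alpha) * r_prev + alpha * x               # Eq. (2), leaky integration
        z = W_out.T @ r                                      # Eq. (3), linear readout
        return x, r, z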

2.1 Topology of the Reservoir Layer

The Watts-Strogatz (WS) small-world network is a network structure between a regular and random network [26]. This network has a large clustering coefficient


Fig. 1. The computational framework of the proposed small-world ESN, where W_in ∈ R^{N×K}, W ∈ R^{N×N}, W_out ∈ R^{N×L} and W_fb ∈ R^{N×L} denote input, internal, output, and feedback weight matrices, respectively, and u(k) ∈ R^K, x(k) ∈ R^N, r(k) ∈ R^N and z(k) ∈ R^L denote the network input, reservoir state, reservoir activation, and network output at the time epoch k, respectively. Note that x(k) is driven by u(k), z(k − 1) and its past value, and the DTBCM learning rule adaptively tunes W according to the state of presynaptic and postsynaptic neurons.

Fig. 2. An illustration of generating an NW small-world network. (a) A regular network consists of 12 nodes, where each node connects to the K = 2 nearest neighbors on each side. (b) The network is obtained by adding edges with a certain probability. (c) As the probability of adding edges increases, the network tends to be disordered.


and a small average path length, and the connections may experience disruptions, which are then followed by reconnection with a certain probability p. If p = 0, reconnection does not occur, resulting in a regular network; if p = 1, all connections are subject to reconnection, ultimately creating a completely random network. The random reconnection process in the WS small-world network has the potential to disrupt the overall connectivity of the network, thereby hindering the transmission of information to a certain extent. The Newman-Watts (NW) small-world network provides a slightly modified network structure to address the above limitation [27]. This structure also begins with a regular network but adds edges at random with a specific probability p, transforming the regular network into a small-world network, as illustrated in Fig. 2. In this way, the network structure effectively prevents the occurrence of isolated nodes within the network. This network allows for faster information transmission, where even minor connection changes can dramatically impact network performance. Therefore, the NW small-world network is chosen as the topology of ESNs in this study.
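A reservoir weight matrix with this topology can be generated, for example, with NetworkX (a sketch under our own assumptions: the NW graph is undirected, so the resulting W is symmetric, and nonzero weights are drawn uniformly from [−1, 1] as in the experimental setup of Sect. 3):

    import networkx as nx
    import numpy as np

    def nw_reservoir(n_neurons, k_neighbors, p_add, seed=None):
        rng = np.random.default_rng(seed)
        g = nx.newman_watts_strogatz_graph(n_neurons, k_neighbors, p_add, seed=seed)
        W = np.zeros((n_neurons, n_neurons))
        for i, j in g.edges():                     # sparse small-world connectivity
            W[i, j] = W[j, i] = rng.uniform(-1.0, 1.0)
        return W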

2.2 Reservoir Weights Tuning

Neuroscientific studies have demonstrated that synaptic plasticity can be conducive to stabilizing synaptic connections and adapting to complex internal and external environments [28]. From the perspective of ESNs, synaptic plasticity rules aim to modify the reservoir neuron connection weights based on reservoir states stimulated by network inputs. In this manner, information embedded in input signals can be learned by synaptic plasticity rules. We apply the DTBCM rule to tune the internal weight W of the reservoir, since it is more biologically plausible and robust compared to other synaptic plasticity rules, such as the anti-Oja learning rule. The DTBCM rule is formulated as follows:

W_{ji}(k+1) = \begin{cases} W_{ji}(k) + \Delta W_{ji}(k), & x_j(k) > \theta_{LTD}, \\ W_{ji}(k), & x_j(k) \le \theta_{LTD}, \end{cases}   (4)

\Delta W_{ji}(k) = \eta \left[ x_j(k)\,(x_j(k) - \theta_{LTP})\, x_i(k) - \varepsilon W_{ji}(k) \right],   (5)

\theta_{LTP} = \frac{\sum_{k'=k-h}^{k} x_j^2(k')\, e^{(k-k')}}{\sum_{k'=k-h}^{k} e^{(k-k')}},   (6)

\theta_{LTD} = \rho \sum_{k'=k-h}^{k} x_j(k'),   (7)

where Wji (k) ∈ R is the weight between the presynaptic neuron i and the postsynaptic neuron j; xi ∈ R and xj ∈ R denote the states of the presynaptic neuron i and the postsynaptic neuron j, respectively; ΔWji (k) ∈ R is the weight change; ε ∈ R+ is a time-decay constant that is the same for all synapses; θLTP ∈ R is a sliding threshold for long-term potentiation (LTP) to determine the direction of synaptic weight changes, which is a time-weighted average of the squared


postsynaptic response x_j over a time interval h ∈ R+; the exponential decay term in DTBCM serves as a forgetting factor; ρ ∈ R+ is a scaling parameter of the postsynaptic neuronal average responses x_j within h; θ_LTD ∈ R acts as a sliding threshold for long-term depression (LTD) to determine the weight change, which is self-adjusted based on neuronal responses and is determined by the average postsynaptic neuronal responses within a specific time interval.
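The gating and threshold computations of (4)–(7) are summarized in the sketch below (our own illustration; x_hist is assumed to hold the last h+1 reservoir states with the newest last, and the exponent of the θ_LTP weighting is kept exactly as printed in (6), so the sign may need flipping if it is meant to discount older samples):

    import numpy as np

    def dtbcm_update(W, x, x_hist, i, j, eta, eps, rho, h):
        xj_hist = np.array([s[j] for s in x_hist[-(h + 1):]])
        lags = np.arange(len(xj_hist) - 1, -1, -1)                 # k - k' for each stored sample
        decay = np.exp(lags)                                       # exponential weighting of Eq. (6), as printed
        theta_ltp = np.sum(xj_hist ** 2 * decay) / np.sum(decay)   # Eq. (6), sliding LTP threshold
        theta_ltd = rho * np.sum(xj_hist)                          # Eq. (7), sliding LTD threshold
        if x[j] > theta_ltd:                                       # Eq. (4): only update above theta_LTD
            W[j, i] += eta * (x[j] * (x[j] - theta_ltp) * x[i] - eps * W[j, i])  # Eq. (5)
        return W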

2.3 Readout Weights Training

First-order reduced and controlled error (FORCE) learning is an online training framework for ESNs with a feedback loop from output units to the reservoir [29]. It aims to stabilize complex and potentially chaotic dynamics of the reservoir by keeping a prediction error sufficiently small even from the initial training phase, such that the network output is almost equal to the target output. Nevertheless, FORCE learning does not fully utilize the potential of recurrent connectivity because the degree and form of modifications are limited, and RNNs trained by FORCE learning for complex tasks require more neurons to achieve the same performance as those networks trained by gradient-based methods [30]. FORCE learning that resorts to memory regression extension (MRE) (also termed composite FORCE learning) can enhance the learning speed, stability, and transient performance of the original FORCE learning [10]. The underlying concept involves using a stable filter with memory to create a novel extended regression equation [31]. To present this method, a prediction error between the network output z(k) and the target output f(k) is defined by

e(k) := r(k)^T \hat{W}_{out}(k-1) - f(k),   (8)

where \hat{W}_{out} ∈ R^{N×L} is the estimate of W_out. Multiplying (8) by r(k), one gets an extended regression equation

r(k) e(k) = r(k) r(k)^T \hat{W}_{out}(k-1) - r(k) f(k).   (9)

Applying a stable filter L(z) = \frac{\lambda}{1-(1-\lambda)z^{-1}} to (9), one gets

E(k) = \Omega(k) \hat{W}_{out}(k-1) - Y(k)   (10)

with E(k) := L{r(k)e(k)}, Y(k) := L{r(k)f(k)}, and Ω(k) := L{r(k)r^T(k)}, where λ ∈ R+ is a filtering constant, and z is a Z-transform operator. Then, the MRE-RLS-based FORCE learning is summarized as follows:

\hat{W}_{out}(k) = \hat{W}_{out}(k-1) - P(k)\big(e(k) r(k) + \beta E(k)\big),   (11)

P(k) = P(k-1) - \frac{P(k-1) r(k) r(k)^T P(k-1)}{1 + r(k)^T P(k-1) r(k)},   (12)

where P (k) ∈ RN ×N is a symmetric positive definite learning rate matrix with P (0) = αI , in which I ∈ RN ×N denotes the identity matrix and α ∈ R+ is a constant parameter. The constant gain β ∈ R+ controls the ratio of the prediction error e(k) and the generalized error E(k).
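For reference, the update loop of (8)–(12) can be written as the following NumPy sketch (our own illustration, not the paper's MATLAB code; Omega and Y are the filtered quantities entering (10), and the class and argument names are assumptions):

    import numpy as np

    class CompositeForce:
        def __init__(self, n, l, alpha, beta, lam):
            self.W = np.zeros((n, l))       # estimate of W_out
            self.P = alpha * np.eye(n)      # P(0) = alpha * I
            self.Omega = np.zeros((n, n))   # filtered r r^T
            self.Y = np.zeros((n, l))       # filtered r f
            self.beta, self.lam = beta, lam

        def step(self, r, f):
            r = r.reshape(-1, 1)
            f = np.atleast_1d(f).reshape(-1, 1)
            e = self.W.T @ r - f                                          # Eq. (8)
            self.Omega = (1 - self.lam) * self.Omega + self.lam * (r @ r.T)
            self.Y = (1 - self.lam) * self.Y + self.lam * (r @ f.T)
            E = self.Omega @ self.W - self.Y                              # Eq. (10)
            Pr = self.P @ r
            self.P = self.P - (Pr @ Pr.T) / (1.0 + float(r.T @ Pr))       # Eq. (12)
            self.W = self.W - self.P @ (r @ e.T + self.beta * E)          # Eq. (11)
            return e.ravel()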


Table 1. Prediction performance of several methods on MGS-17 without noise.

Method                      Testing NRMSE84   Reservoir size   Sparsity
Growing ESN [15]            1.32e–04          1000             0.0050
Original ESN [32]           1.58e–04          1000             0.0100
Decoupled ESN [12]          1.50e–04          1000             0.0080
Simple cycle reservoir [13] 5.39e–04          1000             0.0010
RLS-FORCE [29]              5.21e–04          500              0.0200
NW-RLS-FORCE                4.24e–04          500              0.0171
DTBCM-RLS-FORCE             5.01e–04          500              0.0200
Composite-RLS-FORCE         1.75e–04          500              0.0200
The proposed method         8.51e–05          500              0.0171

3 Numerical Verification

This section introduces a benchmark for nonlinear time-series prediction and presents the corresponding simulation results, where the simulations are performed on the MATLAB 2022a platform and the results of each trial are averaged over 100 runs. The benchmark is the Mackey-Glass system (MGS), which has been extensively employed to assess the ability of neural networks to identify complex dynamical systems, and is formulated as follows [32]:

f(k+1) = f(k) + 0.1\left( \frac{0.2 f(k-\tau)}{1 + f(k-\tau)^{10}} - 0.1 f(k) \right).   (13)

The above MGS has a chaotic attractor when τ > 16.8. Therefore, we set τ = 17 (denoted by MGS-17) and f(0) = 1.2. To evaluate the prediction performance for comparison purposes, we utilize a normalized root mean square error (NRMSE) defined by

NRMSE = \sqrt{ \frac{\sum_{i=1}^{n} (f(i) - z(i))^2}{\sum_{i=1}^{n} (f(i) - \bar{f})^2} },   (14)

where z ∈ R and f ∈ R represent the network output and target output, and f̄ ∈ R refers to the mean of the whole target output. For comparison, we set up the simulation environment according to [15] and generated 10000 data points according to (13) with 3000, 3000, and 4000 training, validation, and testing data points, respectively. For the numerical configuration, we set α = 0.8, β = 20, h = 10, ρ = 0.1, K = 4, and p = 0.0005. The initial values of the input, reservoir, output, and feedback weight matrices (i.e., W_in, W, W_out, and W_fb) are randomly drawn from a uniform distribution within [−1, 1].
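The benchmark series and error measure of (13)–(14) are easy to reproduce (an illustrative sketch; the constant initial history used before k = τ is our own assumption, since the paper only specifies f(0) = 1.2):

    import numpy as np

    def mackey_glass(n_steps, tau=17, f0=1.2):
        f = np.full(n_steps + tau + 1, f0)     # constant history as a simple initialization
        for k in range(tau, n_steps + tau):
            f[k + 1] = f[k] + 0.1 * (0.2 * f[k - tau] / (1.0 + f[k - tau] ** 10) - 0.1 * f[k])
        return f[tau + 1:]

    def nrmse(target, output):
        target, output = np.asarray(target, float), np.asarray(output, float)
        return np.sqrt(np.sum((target - output) ** 2) / np.sum((target - target.mean()) ** 2))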


Table 2. Prediction performance of several methods on MGS-17 with noise.

Method                      Testing NRMSE84   Reservoir size   Sparsity
Growing ESN [15]            3.14e–02          280              0.018
Original ESN [32]           3.12e–02          400              0.035
Decoupled ESN [12]          3.10e–02          400              0.035
Simple cycle reservoir [13] 3.22e–02          300              0.0033
RLS-FORCE [29]              1.64e–03          200              0.0400
NW-RLS-FORCE                9.73e–04          200              0.0340
DTBCM-RLS-FORCE             1.36e–03          200              0.0400
Composite-RLS-FORCE         6.02e–04          200              0.0400
The proposed method         3.47e–04          200              0.0340

Table 1 shows the average prediction results of the proposed ESN compared with several methods, including the original ESN [32], the growing ESN [15], the decoupled ESN [12], and the simple cycle reservoir [13], over 100 independent simulation trials on the MGS-17. We also independently investigate the impact of the NW small-world network, the DTBCM rule, and composite FORCE learning on the performance of the proposed model. For the MGS-17 task, we compare the network prediction outputs with the target sequences of the MGS-17 after 84 steps to obtain a testing NRMSE, in which NRMSE84 = (Σ_{i=1}^{100} (z_i(2084) − f_i(2084))^2 / (100σ^2))^{1/2}, where z_i(2084), f_i(2084), and σ^2 are the network prediction output, the target output, and the variance of the whole MGS-17 sequence, respectively. It is shown that our approach achieves the lowest average testing NRMSE84, implying the best prediction performance, even while utilizing a smaller network size. To demonstrate that the proposed ESN is robust to noise, we insert uniform noise into the MGS-17, randomly generated from [−0.01, 0.01]. Table 2 lists the prediction results of all models in the noisy case and demonstrates that the proposed ESN achieves the best prediction performance. The reasons why the proposed method is robust to noise can be summarized as follows. The small-world network exhibits a high clustering coefficient and short path length, enabling faster information propagation within the network; this facilitates the rapid transmission of useful information to the readout layer, enhancing the performance and robustness of the network [16]. The DTBCM rule uses a dual-threshold mechanism, where the weights are adjusted only when the correlation between pre- and post-synaptic neurons exceeds a certain threshold, which helps filter out noise in the input and thus improves robustness. The composite FORCE learning utilizes a transfer function to effectively capture the historical values of the activation matrix r(k)r(k)^T, which not only plays the role of filtering but also accelerates the convergence of the readout layer. The proposed model utilizes the above three techniques and achieves the best results, albeit at the expense of some computational burden.


4 Conclusion

In this paper, we have proposed a novel computational framework of ESNs to improve prediction performance and robustness. The proposed approach achieves a significantly improved performance and demonstrates robustness to noise in nonlinear time-series prediction compared to previous ESN approaches. This can be attributed to using a small-world network as the reservoir topology, leveraging the DTBCM learning rule to adjust reservoir weights, and training readout weights via the MRE-RLS-based FORCE learning. We would apply the proposed method to tackle more complicated modeling problems in the future. Acknowledgment. This work was supported in part by the Guangdong Provincial Pearl River Talents Program, China, under Grant No. 2019QN01X154, and in part by the Fundamental Research Funds for the Central Universities, Sun Yat-sen University, China, under Grant No. 23lgzy004.

References 1. Nakajima, K., Fischer, I. (eds.): Reservoir Computing. NCS, Springer, Singapore (2021). https://doi.org/10.1007/978-981-13-1687-6 2. Jaeger, H.: The “echo state” approach to analysing and training recurrent neural networks-with an erratum note. In: German National Research Center for Information Technology GMD Technical Report, Bonn, Germany, vol. 148, no. 34, p. 13 (2001) 3. Park, J., Lee, B., Kang, S., Kim, P.Y., Kim, H.J.: Online learning control of hydraulic excavators based on echo-state networks. IEEE Trans. Autom. Sci. Eng. 14(1), 249–259 (2016) 4. Schwedersky, B.B., Flesch, R.C.C., Dangui, H.A.S., Iervolino, L.A.: Practical nonlinear model predictive control using an echo state network model. In: International Joint Conference on Neural Networks, Rio de Janeiro, Brazil, 08–13 July 2018, pp. 1–8. IEEE (2018) 5. Jordanou, J.P., Antonelo, E.A., Camponogara, E.: Online learning control with echo state networks of an oil production platform. Eng. Appl. Artif. Intell. 85(Oct.), 214–228 (2019) 6. Chen, Q., Shi, H., Sun, M.: Echo state network-based backstepping adaptive iterative learning control for strict-feedback systems: an error-tracking approach. IEEE Trans. Cybern. 50(7), 3009–3022 (2019) 7. Wu, R., Li, Z., Pan, Y.: Adaptive echo state network robot control with guaranteed parameter convergence. In: Liu, X.-J., Nie, Z., Yu, J., Xie, F., Song, R. (eds.) ICIRA 2021. LNCS (LNAI), vol. 13016, pp. 587–595. Springer, Cham (2021). https://doi. org/10.1007/978-3-030-89092-6_53 8. Wu, R., Nakajima, K., Pan, Y.: Performance improvement of FORCE learning for chaotic echo state networks. In: Mantoro, T., Lee, M., Ayu, M.A., Wong, K.W., Hidayanto, A.N. (eds.) ICONIP 2021. LNCS, vol. 13109, pp. 262–272. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-92270-2_23 9. Tanaka, K., Minami, Y., Tokudome, Y., Inoue, K., Kuniyoshi, Y., Nakajima, K.: Continuum-body-pose estimation from partial sensor information using recurrent neural networks. IEEE Robot. Autom. Lett. 7(4), 11244–11251 (2022)


10. Li, Y., Hu, K., Nakajima, K., Pan, Y.: Composite FORCE learning of chaotic echo state networks for time-series prediction. In: Chinese Control Conference, Heifei, China, 25–27 July 2022, pp. 7355–7360 (2022) 11. Deng, Z., Zhang, Y.: Collective behavior of a small-world recurrent neural system with scale-free distribution. IEEE Trans. Neural Netw. 18(5), 1364–1375 (2007) 12. Xue, Y., Yang, L., Haykin, S.: Decoupled echo state networks with lateral inhibition. Neural Netw. 20(3), 365–376 (2007) 13. Rodan, A., Tino, P.: Minimum complexity echo state network. IEEE Trans. Neural Netw. 22(1), 131–144 (2010) 14. Cui, H., Liu, X., Li, L.: The architecture of dynamic reservoir in the echo state network. Chaos 22(3), 455 (2012) 15. Qiao, J., Li, F., Han, H., Li, W.: Growing echo-state network with multiple subreservoirs. IEEE Trans. Neural Netw. Learn. Syst. 28(2), 391–404 (2016) 16. Kawai, Y., Park, J., Asada, M.: A small-world topology enhances the echo state property and signal propagation in reservoir computing. Neural Netw. 112, 15–23 (2019) 17. Suárez, L.E., Richards, B.A., Lajoie, G., Misic, B.: Learning function from structure in neuromorphic networks. Nat. Mach. Intell. 3(9), 771–786 (2021) 18. Yusoff, M.H., Chrol-Cannon, J., Jin, Y.: Modeling neural plasticity in echo state networks for classification and regression. Inf. Sci. 364–365, 184–196 (2016) 19. Babinec, Š, Pospíchal, J.: Improving the prediction accuracy of echo state neural networks by anti-oja’s learning. In: de Sá, J.M., Alexandre, L.A., Duch, W., Mandic, D. (eds.) ICANN 2007. LNCS, vol. 4668, pp. 19–28. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-74690-4_3 20. Castellani, G., Intrator, N., Shouval, H., Cooper, L.: Solutions of the BCM learning rule in a network of lateral interacting nonlinear neurons. Netw. Comput. Neural Syst. 10(2), 111 (1999) 21. Wang, X., Jin, Y., Hao, K.: Computational modeling of structural synaptic plasticity in echo state networks. IEEE Trans. Cybern. 52(10), 11254–11266 (2021) 22. Fauth, M., Wörgötter, F., Tetzlaff, C.: The formation of multi-synaptic connections by the interaction of synaptic and structural plasticity and their functional consequences. PLoS Comput. Biol. 11(1), e1004031 (2015) 23. Patanè, L., Xibilia, M.G.: Echo-state networks for soft sensor design in an SRU process. Inf. Sci. 566, 195–214 (2021) 24. Morales, G.B., Mirasso, C.R., Soriano, M.C.: Unveiling the role of plasticity rules in reservoir computing. Neurocomputing 461, 705–715 (2021) 25. Wang, X., Jin, Y., Du, W., Wang, J.: Evolving dual-threshold Bienenstock-CooperMunro learning rules in echo state networks. IEEE Trans. Neural Netw. Learn. Syst., 1–12 (2022). https://doi.org/10.1109/TNNLS.2022.3184004 26. Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998) 27. Newman, M.E., Watts, D.J.: Renormalization group analysis of the small-world network model. Phys. Lett. A 263(4–6), 341–346 (1999) 28. Benito, E., Barco, A.: Creb’s control of intrinsic and synaptic plasticity: implications for creb-dependent memory models. Trends Neuralsci. 33(5), 230–240 (2010) 29. Sussillo, D., Abbott, L.F.: Generating coherent patterns of activity from chaotic neural networks. Neuron 63(4), 544–557 (2009)

560

S. Mo et al.

30. DePasquale, B., Cueva, C.J., Rajan, K., Escola, G.S., Abbott, L.: full-FORCE: a target-based method for training recurrent networks. PLoS ONE 13(2), e0191527 (2018) 31. Pan, Y., Yu, H.: Composite learning from adaptive dynamic surface control. IEEE Trans. Autom. Control. 61(9), 2603–2609 (2016) 32. Jaeger, H., Haas, H.: Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication. Science 304(5667), 78–80 (2004)

Preserving Potential Neighbors for Low-Degree Nodes via Reweighting in Link Prediction

Ziwei Li1,2, Yucan Zhou1(B), Haihui Fan1, Xiaoyan Gu1,2, Bo Li1, and Dan Meng1

1 Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
{liziwei,zhouyucan,fanhaihui,guxiaoyan,libo,mengdan}@iie.ac.cn
2 School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China

Abstract. Link prediction is an important task for graph data. Methods based on graph neural networks achieve high accuracy by simultaneously modeling the node attributes and structure of the observed graph. However, these methods often get worse performance for low-degree nodes. After theoretical analysis, we find that current link prediction methods focus more on negative samples for low-degree nodes, which makes it hard to find potential neighbors for these nodes during inference. In order to improve the performance on low-degree nodes, we first design a node-wise score to quantify how seriously the training is biased to negative samples. Based on the score, we develop a reweighting method called harmonic weighting (HAW) to help the model preserve potential neighbors for low-degree nodes. Experimental results show that the model combined with HAW can achieve better performance on most datasets. By analyzing in detail the performance on nodes with different degrees, we find that HAW can preserve more potential neighbors for low-degree nodes without reducing the performance of other nodes.

Keywords: Link prediction · Sample reweighting · Harmonic weighting

1 Introduction

Graphs are widely used to model relations among data in the real world, such as social networks [19], protein interaction networks [24], etc. Link prediction is a fundamental task on graph data and is widely used in recommendation systems [17], completing knowledge graphs, and predicting drug interactions [18]. It focuses on predicting whether there exists an edge between two nodes, where structure information and attribute information are both significant [21]. However, as these two kinds of information are heterogeneous, it is difficult to utilize them simultaneously.

Intuitively, Graph Neural Networks (GNNs) can model both structure information and attribute information by enhancing attribute information via graph structure [7]. Thus, GNN-based methods such as GAE [11] and SEAL [22] have achieved better performance on link prediction. Usually, they convert link prediction into a binary classification task, where node pairs with links are regarded as positive samples, and node pairs without links are negative samples. According to this definition, low-degree nodes are related to fewer positive samples and more negative samples. During training, models will pull nodes in each positive sample together and push nodes in each negative sample away. Thus, for low-degree nodes, models are more likely to push their potential neighbors away more strongly than they pull them together, which makes it hard to preserve potential neighbors for low-degree nodes.

Fig. 1. Potential neighbor (b, c) is easier to predict than potential neighbor (a, c).

Fig. 2. Link prediction performance on nodes with different degrees on Citeseer.

As shown in Fig. 1, the node pairs (a, c) and (b, c) are both potential neighbors. Node "a" is a low-degree node and is only connected to node "x". The link (a, x) is a positive sample, and the nonexistent links between node "a" and the other nodes are negative samples. Similarly, the links between (b, y/z/w) and (c, x/y/z/w) are positive samples, and the nonexistent links between (b, a/c/x) and (c, a/b) are negative samples. Thus, during training, node "c" is pulled close to nodes "x", "y", "z", and "w"; node "a" is pulled close to "x" but pushed away from "c", "y", "z", and "w"; and node "b" is pulled close to "y", "z", and "w" but pushed away from "c" and "x". As a consequence, the low-degree node "a" is pulled close to "c" only once because of the link (a, x) but pushed away from "c" four times because of the nonexistent links (a, c/y/z/w). In contrast, node "b" is pulled close to "c" three times because of the links (b, y/z/w) but pushed away from "c" only twice because of the nonexistent links (b, x/c). Thus, it is harder to preserve the potential neighbors of node "a". Figure 2 shows the performance of GAE on nodes of different degrees in Citeseer [6] measured by AUC. Nodes with lower degrees get worse performance, which verifies again that it is harder to predict potential neighbors of low-degree nodes.

In order to preserve potential neighbors for low-degree nodes, an intuitive solution is providing more positive samples or reducing the effect of negative samples [4]. As the number of positive samples is determined by the observed graph, all we can do is reduce the effect of negative samples. Therefore, we design a general loss function weighting method called harmonic weighting (HAW), which assigns lower weights to the loss of negative samples. Specifically, we first theoretically analyze why models perform badly on low-degree nodes and quantify how seriously the training is biased to negative samples. Then, we derive a weighting formula to balance the quantified score for each node. Furthermore, we smooth the derived weights with the harmonic average to achieve better performance. To show the effectiveness of HAW, we conduct experiments on several datasets by combining HAW with some commonly used link prediction methods.

Contributions. Our main contributions can be summarized as follows. (1) We find that current link prediction methods have difficulty in preserving potential neighbors for low-degree nodes and theoretically analyze the underlying reasons. (2) Based on the theoretical analysis, we derive a non-parametric generalized weighting function called HAW to help models preserve potential neighbors. (3) Experimental results show the effectiveness of HAW on several datasets when combined with different link prediction methods. The code of HAW is publicly available at https://github.com/lzwqbh/HAW.

2 Related Work

Early link prediction methods use heuristics to analyze the connectivity of two nodes and assume that a node pair with higher connectivity is more likely to be linked [1,9,10,23]. Common-neighbor analyzes the connectivity of two nodes by counting the number of shared neighbors between them. It is a first-order heuristic since it only involves the one-hop neighbors of the two nodes. Katz [10] scores two nodes with a weighted sum of the number of all the paths between them. It is a high-order heuristic since it requires knowing the entire graph. However, these methods ignore the attribute information of nodes.

By enhancing attribute information via graph structure with graph neural networks, GNN-based link prediction methods have achieved promising performance recently; they can be roughly divided into auto-encoder based methods [8,11,16,20] and subgraph based methods [2,3,15,22]. Auto-encoder based methods learn a representation for each node and treat the similarity of two embeddings as the link probability. As the first auto-encoder based methods, GAE/VGAE [11] extend the auto-encoder/variational auto-encoder to the graph scenario by using a GNN as the encoder. ARVGA [16] assumes it is beneficial for models to regularize embeddings; it adds an adversarial training scheme to VGAE to regularize representations toward a Gaussian distribution. DGAE [20] deepens GAE by stacking multi-layer GNNs to enable GAE to use more structure to enhance attribute information. To avoid the model being over-smoothed, it combines representations at different layers to get the final embedding and uses each embedding to reconstruct node features for regularization. Methods based on subgraphs treat link prediction as a graph classification task. They extract nodes and links around the two target nodes to construct a subgraph and use subgraph classification methods to evaluate the connectivity of the two nodes, which is further used to predict their link probability. SEAL [22] proves that traditional heuristic methods can be well approximated from the subgraph and uses a GNN-based graph classification method to evaluate connectivity in a way personalized for each dataset. The graph classification method is composed of a GNN, which learns an embedding for each node, and a graph pooling layer, which induces the subgraph embedding from the embeddings of nodes in the subgraph. As current pooling layers only consider node embeddings and ignore the structural relations between nodes, the current state-of-the-art link prediction algorithm, WALKPOOL [15], designs walk-based pooling to make use of both at the same time. It significantly improves prediction accuracy compared with GAEs and SEAL.

All the GNN-based methods mentioned above share the same procedure to select positive samples and negative samples. They treat linked node pairs as positive samples and node pairs without links as negative samples. Thus, low-degree nodes get more negative samples than positive samples, which makes it hard for these models to preserve the potential neighbors of low-degree nodes.

3 Preliminaries

Given a graph G = (V, E, X), V = {V_1, ..., V_N} is the set of nodes, N is the number of nodes, E is the set of edges, e(i, j) ∈ E indicates that there is a link between node i and node j, and X = {X_1, ..., X_N} ∈ R^{N×F} is the feature matrix. The adjacency matrix A is defined as

$$A(i,j) = \begin{cases} 1 & e(i,j) \in E \\ 0 & \text{otherwise.} \end{cases} \qquad (1)$$

Given a partially observed graph G^o = (V, E^o, X) with E^o ⊂ E, a link prediction method can be expressed as a function f(E^o, X) = Ā, which attempts to predict an Ā that is close to A. GNN-based methods seek to use the observed graph to train GNN models, whose training objective can be expressed as:

$$\arg\min_{\theta} L(f(E^o, X, \theta), A^o), \qquad (2)$$

where θ is the parameter of the GNN model, L is the loss function that measures the quality of a prediction, and A^o is the adjacency matrix corresponding to E^o. As links are sparse in graphs, to prevent the information of observed links from vanishing, the loss function is separated into two parts according to the link state in A^o, and the average of each part is calculated. Still, calculating the prediction score for all node pairs without links is impractical. As a consequence, link prediction methods sample an affordable number of such node pairs to train the GNN model. Considering this sampling procedure, we get the loss function that link prediction methods commonly use:

$$L = \frac{1}{|E^o|} \sum_{e(i,j)\in E^o} L(\bar{A}(i,j), 1) + \frac{1}{|E^-|} \sum_{e^-(i,j)\in E^-} L(\bar{A}(i,j), 0), \qquad (3)$$

where E^- denotes the negative samples of links with E^- ∩ E^o = ∅, and |·| counts the number of items in a set.
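As a concrete illustration of Eq. (3), the sketch below (our illustration, not the authors' code; PyTorch-style, with hypothetical tensor names) scores sampled positive and negative pairs and averages the two loss terms separately.

import torch
import torch.nn.functional as F

def link_prediction_loss(scores, pos_edges, neg_edges):
    # scores: (N, N) tensor of predicted link logits (the model's A-bar)
    # pos_edges: (P, 2) long tensor of observed links E^o
    # neg_edges: (Q, 2) long tensor of sampled non-links E^-
    pos = scores[pos_edges[:, 0], pos_edges[:, 1]]
    neg = scores[neg_edges[:, 0], neg_edges[:, 1]]
    # Each term is averaged over its own set, as in Eq. (3).
    pos_loss = F.binary_cross_entropy_with_logits(pos, torch.ones_like(pos))
    neg_loss = F.binary_cross_entropy_with_logits(neg, torch.zeros_like(neg))
    return pos_loss + neg_loss

def sample_negatives(num_nodes, observed_edges, num_samples):
    # Uniformly draw node pairs that are not observed links (E^- ∩ E^o = ∅).
    observed = {(int(i), int(j)) for i, j in observed_edges}
    observed |= {(j, i) for i, j in observed}
    neg = []
    while len(neg) < num_samples:
        i, j = torch.randint(num_nodes, (2,)).tolist()
        if i != j and (i, j) not in observed:
            neg.append((i, j))
    return torch.tensor(neg, dtype=torch.long)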

4 Proposed Method

In this section, we first analyze the node-wise unbalance problem theoretically from a training perspective and obtain a node-wise score that indicates the unbalance between positive and negative samples for every node. Then, based on this score, we put forward the objective of balancing the bias and give a feasible solution. In order to weaken the influence on nodes with higher degrees, we combine the harmonic average with the idea of Laplace smoothing and put forward HAW.

4.1 Theoretical Analysis

In order to analyze why models perform differently for nodes with different degrees after training, we analyze the difference between nodes in the link prediction loss function. We rewrite the loss function as follows:

$$L = \frac{1}{2} \sum_{k=1}^{N} \left( \frac{1}{|E^o|} \sum_{e(i,j)\in E^o_k} L(\bar{A}(i,j), 1) + \frac{1}{|E^-|} \sum_{e^-(i,j)\in E^-_k} L(\bar{A}(i,j), 0) \right), \qquad (4)$$

where E^o_k = {e | e(k,j) or e(j,k) ∈ E^o} and E^-_k = {e^- | e^-(k,j) or e^-(j,k) ∈ E^-}. We call each summand in this rewritten form of L as L_k:

$$L_k \triangleq \frac{1}{|E^o|} \sum_{e(i,j)\in E^o_k} L(\bar{A}(i,j), 1) + \frac{1}{|E^-|} \sum_{e^-(i,j)\in E^-_k} L(\bar{A}(i,j), 0). \qquad (5)$$

L_k is highly related to the training of node k. As discussed before, if a node has more negative samples than positive samples, models will find it harder to preserve its potential neighbors. Thus, we define a node-wise unbalanced score:

$$\mathrm{score}_k \triangleq \frac{|E^-_k| / |E^-|}{|E^o_k| / |E^o|} = \frac{|E^o| \cdot |E^-_k|}{2 \cdot \deg(k) \cdot |E^-|}. \qquad (6)$$

The higher the score is, the harder it is for models to predict the node's potential neighbors. As negative links are uniformly sampled from the negative link space, we have:

$$\mathbb{E}(\mathrm{score}_k) = \mathbb{E}\left( \frac{|E^o| \cdot |E^-_k|}{2 \cdot \deg(k) \cdot |E^-|} \right) \approx \frac{|E^o| \cdot 2}{2 \cdot \deg(k) \cdot N} = \frac{\deg(G^o)}{\deg(k)}, \qquad (7)$$

where deg(G^o) is the average degree of all nodes in the observed graph. Thus, the node-wise unbalance is inversely proportional to the node degree, which means that the lower a node's degree is, the harder it is for models to predict its potential neighbors.
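The node-wise unbalanced score of Eq. (6) and its degree-based expectation in Eq. (7) can be checked numerically with a short sketch like the one below (our illustration; NumPy, with hypothetical array names).

import numpy as np

def unbalanced_scores(num_nodes, pos_edges, neg_edges):
    # pos_edges: (|E^o|, 2) observed links; neg_edges: (|E^-|, 2) sampled negatives.
    deg = np.zeros(num_nodes)
    np.add.at(deg, pos_edges.ravel(), 1)           # deg(k) in the observed graph
    neg_per_node = np.zeros(num_nodes)
    np.add.at(neg_per_node, neg_edges.ravel(), 1)  # |E^-_k|

    safe_deg = np.maximum(deg, 1)                  # guard isolated nodes
    # Eq. (6): score_k = |E^o| * |E^-_k| / (2 * deg(k) * |E^-|)
    score = len(pos_edges) * neg_per_node / (2.0 * safe_deg * len(neg_edges))
    # Eq. (7): the expected score is roughly the average degree divided by deg(k)
    expected = deg.mean() / safe_deg
    return score, expected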

4.2 Harmonic Weighting

To improve the performance on low-degree nodes, we need to design a negative sample weighting method to lower the unbalanced score of low-degree nodes. However, lowering the unbalanced score also makes negative samples have a lower weight than positive samples. Thus, we first design a weighting formula that balances the scores while keeping positive and negative samples balanced. Then we seek an approximation that reduces the effect on high-degree nodes. Given a weighting strategy W_{i,j} for negative samples, the unbalanced score of node k becomes:

$$\mathrm{score}_k = \frac{|E^o| \cdot \sum_{e^-(i,j)\in E^-_k} W_{i,j}}{2 \cdot \deg(k) \cdot |E^-|}. \qquad (8)$$

In order to keep positive and negative samples balanced in the loss function, W_{i,j} needs to satisfy the following condition:

$$\sum_{e^-(i,j)\in E^-} W_{i,j} = |E^-|. \qquad (9)$$

Note that when the unbalanced scores of all nodes are equal to one, we have the following equation set:

$$\begin{cases} |E^o| \cdot \sum_{e^-(i,j)\in E^-_1} W_{i,j} = 2 \cdot \deg(1) \cdot |E^-| \\ \quad\vdots \\ |E^o| \cdot \sum_{e^-(i,j)\in E^-_N} W_{i,j} = 2 \cdot \deg(N) \cdot |E^-| \end{cases} \qquad (10)$$

By adding all the equations, we get:

$$|E^o| \cdot 2 \cdot \sum_{e^-(i,j)\in E^-} W_{i,j} = 2 \cdot \sum_{k=1}^{N} \deg(k) \cdot |E^-|, \qquad (11)$$

$$\sum_{e^-(i,j)\in E^-} W_{i,j} = |E^-|. \qquad (12)$$

This means that when the unbalanced scores all equal one, positive and negative samples always balance. Thus, we only need to design a weighting formula that satisfies the N equations in (10). To simplify the design of the weighting strategy, we decompose W_{i,j} as W_{i,j} = W_i W_j, which reduces the number of variables to the number of equations. As E^- is sampled from the negative links, we calculate the expected value of the unbalanced score and get the following result:

$$\mathbb{E}(\mathrm{score}_k) = \frac{|E^o| \cdot \mathbb{E}\!\left(\sum_{e^-(i,j)\in E^-_k} W_{i,j}\right)}{2 \cdot \deg(k) \cdot |E^-|} \approx \frac{|E^o| \cdot \frac{\mathbb{E}(|E^-_k|)}{N-1} \left(\sum_{i=1\ldots N,\, i\neq k} W_i W_k\right)}{2 \cdot \deg(k) \cdot |E^-|} \approx \frac{\deg(G^o) \cdot W_k \sum_{i=1\ldots N,\, i\neq k} W_i}{(N-1)\deg(k)}. \qquad (13)$$

Considering that linked node pairs are a very small portion of all the node pairs, we have the following approximation based on (12):

$$\sum_{i=1}^{N} W_i = \sqrt{\Big(\sum_{i=1}^{N} W_i\Big)^{2}} \approx \sqrt{\sum_{e^-(i,j)\in E^-_{all}} W_i W_j} = \sqrt{\sum_{e^-(i,j)\in E^-_{all}} W_{i,j}} \approx \sqrt{|E^-_{all}|} \approx N, \qquad (14)$$

where E^-_{all} is the set of all negative samples. When W_k is far smaller than $\sum_{i=1}^{N} W_i$, we have:

$$\mathbb{E}(\mathrm{score}_k) \approx \frac{\deg(G^o) \cdot W_k \sum_{i=1}^{N} W_i}{(N-1)\deg(k)} \approx \frac{\deg(G^o) \cdot W_k}{\deg(k)}. \qquad (15)$$

We expect the score to equal one, thus we can set $W_k = \frac{\deg(k)}{\deg(G^o)}$. However, directly using this weighting strategy will influence nodes with higher degrees significantly, so we need to smooth the weight scores. Laplace smoothing adds one when counting the number of each member to smooth their frequencies. Inspired by that, we smooth the weighting score by averaging it with one. We choose the harmonic average for smoothing so as to make the weighting score closer to the theoretical result for low-degree nodes and keep the score closer to one for the others. Thus, we call our weighting method harmonic weighting. The mathematical expression of HAW is:

$$\mathrm{HAW}_k = \frac{2}{\frac{\deg(G^o)}{\deg(k)} + 1} = \frac{2 \cdot \deg(k)}{\deg(G^o) + \deg(k)}. \qquad (16)$$

By using HAW to weight every negative sample term in the loss function, we get the following HAW-based loss function:

$$L_{\mathrm{HAW}} = \frac{1}{|E^o|} \sum_{e(i,j)\in E^o} L(\bar{A}(i,j), 1) + \frac{1}{|E^-|} \sum_{e^-(i,j)\in E^-} \frac{4\deg(i)\deg(j)}{(\deg(G^o)+\deg(i))(\deg(G^o)+\deg(j))} L(\bar{A}(i,j), 0). \qquad (17)$$

We use the HAW-based loss function for link prediction tasks to enhance existing link prediction methods.
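A minimal sketch of Eqs. (16)-(17) is given below (our PyTorch-style illustration, not the released HAW implementation): the negative term of the loss is reweighted by the product of the two endpoint HAW scores.

import torch
import torch.nn.functional as F

def haw_weight(deg, avg_deg):
    # Eq. (16): HAW_k = 2 * deg(k) / (deg(G^o) + deg(k))
    return 2.0 * deg / (avg_deg + deg)

def haw_loss(scores, pos_edges, neg_edges, deg, avg_deg):
    pos = scores[pos_edges[:, 0], pos_edges[:, 1]]
    pos_loss = F.binary_cross_entropy_with_logits(pos, torch.ones_like(pos))

    haw = haw_weight(deg.float(), avg_deg)
    w = haw[neg_edges[:, 0]] * haw[neg_edges[:, 1]]        # weight in Eq. (17)
    neg = scores[neg_edges[:, 0], neg_edges[:, 1]]
    neg_loss = F.binary_cross_entropy_with_logits(
        neg, torch.zeros_like(neg), weight=w, reduction="sum") / len(neg_edges)
    return pos_loss + neg_loss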

Table 1. Details of datasets. Std is short for standard deviation.

Dataset          Cora   Citeseer   Pubmed   Texas   Wisconsin   Cornell
# nodes          2708   3327       19717    183     251         183
# edges          5278   4552       44324    279     450         277
average degree   3.90   2.74       4.50     3.05    3.59        3.03
std of degrees   5.23   3.38       7.43     7.83    7.94        7.04

5 Experiments

5.1 Experimental Setup

We conduct experiments on six datasets: Cora [13], Citeseer [6], Pubmed [14], Texas [5], Wisconsin [5], and Cornell [5]. Their details are shown in Table 1. Following the common settings [11,15,16], we randomly sample 85% of the links as a training set and 5% as a validation set, and consider the remaining 10% as a testing set. We call this kind of sampling link-wise uniform sampling. However, this dataset partition makes the evaluation dominated by high-degree nodes. As the links are sampled uniformly for testing, there are fewer links to be predicted for low-degree nodes. In order to better reflect the performance on low-degree nodes, the number of links for low-degree nodes should be enlarged in the testing set. Therefore, we design a node-wise uniform sampling method that samples a comparable number of links for nodes with different degrees. Specifically, the sampling probability of a link (m, n) is defined as follows:

$$p(m, n) = \frac{\frac{1}{\mathrm{degree}_m \cdot \mathrm{degree}_n}}{\sum_{e(i,j)\in E} \frac{1}{\mathrm{degree}_i \cdot \mathrm{degree}_j}}. \qquad (18)$$
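Node-wise uniform sampling as defined in Eq. (18) amounts to drawing test links with probability inversely proportional to the product of the endpoint degrees; a possible implementation (our sketch, NumPy) is given below.

import numpy as np

def node_wise_split(edges, num_test, seed=0):
    # edges: (|E|, 2) integer array; returns (train_edges, test_edges).
    rng = np.random.default_rng(seed)
    deg = np.zeros(edges.max() + 1)
    np.add.at(deg, edges.ravel(), 1)

    inv = 1.0 / (deg[edges[:, 0]] * deg[edges[:, 1]])
    prob = inv / inv.sum()                                   # Eq. (18)
    test_idx = rng.choice(len(edges), size=num_test, replace=False, p=prob)
    mask = np.zeros(len(edges), dtype=bool)
    mask[test_idx] = True
    return edges[~mask], edges[mask]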

By using node-wise uniform sampling, we can get a new testing set and a new validation set. By considering the rest of the links as the training set, we get new datasets for evaluation. For a comprehensive comparison, we evaluate performance using AUC and AP under both kinds of sampling settings. To show the effectiveness of HAW, we combine it with three auto-encoder based link prediction methods (GAE [11], VGAE [11], and ARVGA [16]) and the state-of-the-art subgraph classification based method WALKPOOL [15] (abbreviated as WP in the tables). To be more reliable, we run our experiments with 5 different random seeds and report the average performance and the corresponding standard deviation.

5.2 Experimental Results

Tables 2, 3, 4 and 5 show the link prediction performance in terms of AUC and AP on datasets constructed with link-wise uniform sampling and node-wise uniform sampling. The best and suboptimal performances are marked in bold and underlined, respectively. We can draw the following conclusions:

Table 2. Results measured by AUC under link-wise uniform sampling.

Method         Cora         Citeseer     Pubmed       Texas         Wisconsin     Cornell
GAE            91.78±0.97   91.09±1.45   97.04±0.15   67.24±17.13   66.37±10.13   77.15±4.50
VGAE           91.29±0.54   90.60±1.84   96.47±0.12   71.36±7.92    70.71±8.41    72.21±13.68
ARVGA          92.50±0.33   91.84±0.89   96.73±0.12   71.33±8.64    75.34±6.05    77.15±3.61
WP             94.62±0.55   92.51±0.84   98.62±0.10   74.84±4.63    82.95±6.54    85.19±5.44
HAW w/ GAE     92.83±0.52   92.24±0.68   96.77±0.09   73.72±6.88    72.12±7.99    78.03±2.81
HAW w/ VGAE    92.83±0.53   91.80±2.07   96.14±0.16   77.56±4.18    72.93±8.57    77.92±3.78
HAW w/ ARVGA   93.39±0.90   93.46±0.53   96.29±0.18   75.94±2.14    76.13±6.22    77.78±4.34
HAW w/ WP      94.81±0.49   93.51±1.19   98.52±0.12   76.33±3.97    83.21±6.63    86.30±6.07

Table 3. Results measured by AUC under node-wise uniform sampling.

Method         Cora         Citeseer     Pubmed       Texas         Wisconsin    Cornell
GAE            86.47±0.85   82.64±1.87   93.91±0.36   57.23±7.70    52.33±7.88   60.06±10.32
VGAE           86.32±0.86   82.10±1.42   92.41±0.59   54.49±10.37   57.93±5.32   57.48±11.57
ARVGA          88.03±0.66   84.00±1.32   92.67±0.38   62.50±5.20    53.89±2.77   66.23±7.94
WP             88.20±1.66   85.15±3.05   94.81±0.17   59.94±8.75    67.82±5.39   66.58±6.99
HAW w/ GAE     89.31±0.50   85.81±1.22   94.01±0.24   64.69±3.92    61.58±3.15   63.29±7.09
HAW w/ VGAE    89.46±0.86   85.59±0.63   92.66±0.60   58.52±7.13    67.96±2.65   60.12±10.64
HAW w/ ARVGA   90.13±0.47   88.76±1.61   92.93±0.42   65.32±3.65    59.91±7.82   66.75±7.30
HAW w/ WP      89.00±1.74   86.15±2.62   94.46±0.14   60.93±9.25    68.05±5.41   67.74±5.57

Table 4. Results measured by AP under link-wise uniform sampling.

Method         Cora         Citeseer     Pubmed       Texas         Wisconsin    Cornell
GAE            92.19±0.90   92.07±1.12   97.23±0.16   71.48±14.22   72.33±7.82   81.72±3.72
VGAE           92.59±0.40   91.92±1.31   96.58±0.07   78.15±6.29    75.18±8.80   78.23±11.59
ARVGA          93.11±0.39   92.77±0.55   96.82±0.15   78.31±7.27    79.11±5.21   82.61±2.86
WP             95.01±0.54   93.60±0.83   98.58±0.10   80.43±4.04    86.08±5.09   88.20±4.92
HAW w/ GAE     93.57±0.52   92.88±0.64   96.96±0.02   80.47±4.65    76.32±8.12   82.13±2.56
HAW w/ VGAE    93.91±0.44   93.19±1.54   96.28±0.13   82.66±3.43    77.04±8.11   82.81±3.31
HAW w/ ARVGA   93.50±1.26   94.02±0.19   96.29±0.20   81.56±2.74    79.64±5.01   82.35±3.24
HAW w/ WP      95.12±0.62   94.23±1.03   98.52±0.12   80.84±4.75    86.39±5.27   88.71±5.79

Table 5. Results measured by AP under node-wise uniform sampling.

Method         Cora         Citeseer     Pubmed       Texas        Wisconsin    Cornell
GAE            85.75±1.11   80.89±1.82   93.55±0.44   62.91±5.54   56.97±4.48   58.74±8.43
VGAE           86.05±0.95   81.49±1.84   91.74±0.66   61.48±7.41   60.36±4.55   56.39±7.93
ARVGA          87.07±1.10   82.82±1.65   91.87±0.43   66.92±3.18   56.65±5.28   67.82±5.39
WP             88.68±1.26   84.61±2.92   94.75±0.11   63.45±7.42   72.71±2.74   70.90±7.11
HAW w/ GAE     88.93±0.79   85.00±1.79   93.68±0.31   65.23±5.92   63.28±4.77   60.63±6.31
HAW w/ VGAE    89.07±1.25   85.87±0.89   92.09±0.65   64.60±6.04   70.53±3.63   58.46±8.67
HAW w/ ARVGA   89.27±0.72   88.52±2.07   92.73±0.56   68.36±2.56   61.10±7.78   68.05±5.41
HAW w/ WP      89.13±1.55   85.77±2.74   94.47±0.21   63.35±8.69   72.90±2.93   72.20±5.50

Fig. 3. Performance of GAE and HAW+GAE on nodes of different degrees in Citeseer. Dots indicate the average score and lines indicate the scale of the standard deviation. For clarity, the red lines and red dots are shifted slightly to the right. (Color figure online)

Firstly, methods with HAW achieve better performance on most datasets. For example, HAW improves the performance of all methods on Citeseer. Besides, the improvements of HAW are more obvious on difficult datasets (e.g., Texas and Wisconsin), where link prediction methods get worse results. Secondly, HAW gets a larger performance gain in the node-wise uniform sampling setting, which indicates that HAW can improve the performance on low-degree nodes significantly. Specifically, on Cora and Citeseer, models with HAW get consistently better performance than the state-of-the-art WALKPOOL method. On the other hand, compared with link-wise uniform sampling, the performance of all basic models with node-wise uniform sampling drops severely, which means it is harder for models to predict potential neighbors for low-degree nodes. Thirdly, our HAW has not achieved better results on Pubmed. We think this is because HAW is designed to improve the performance on low-degree nodes, while Pubmed has a higher average degree than the other datasets, which indicates that there exist more high-degree nodes.

In order to further show the effect of HAW on nodes of different degrees, we compare the performance of GAE and HAW+GAE on Citeseer in Fig. 3. It can be concluded that the performance on low-degree nodes is significantly improved, without damage to the performance of high-degree nodes.

5.3 Visualization

To understand the difference between basic methods and methods with HAW, we visualize the embeddings learned by GAE and HAW+GAE using 2D t-SNE [12]. As shown in Fig. 4, the embeddings of low-degree nodes learned by GAE are almost randomly distributed in the embedding space. In contrast, when combined with HAW, the embeddings of low-degree nodes gather around high-degree nodes. This means that HAW enables models to preserve the similarity relations of low-degree nodes with other nodes and thus helps models predict potential neighbors.
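A plot of this kind can be reproduced with a few lines of scikit-learn and matplotlib (our sketch; emb and deg are hypothetical arrays of node embeddings and node degrees, and the coloring follows the scheme of Fig. 4).

import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

def plot_embeddings(emb, deg):
    # Project (N, D) node embeddings to 2D and color nodes by degree.
    xy = TSNE(n_components=2, init="pca", random_state=0).fit_transform(emb)
    colors = ["red" if d == 0 else "orange" if d == 1 else "blue" for d in deg]
    plt.scatter(xy[:, 0], xy[:, 1], c=colors, s=5)
    plt.axis("off")
    plt.show()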


Fig. 4. Visualization of the embeddings of Citeseer. Zero-degree nodes are colored red, one-degree nodes are colored orange, and the other nodes are colored blue. (Color figure online)

6 Conclusion and Future Work

In this work, we first identify and analyze the performance unbalance of link prediction methods for nodes with different degrees, and then we propose a reweighting strategy called HAW to preserve potential neighbors for low-degree nodes in link prediction. By assigning lower weights to negative samples that are related to low-degree nodes, HAW can help the model focus on the information of positive samples and keep the loss of positive and negative samples balanced. Experimental results show that link prediction methods can achieve better performance when combined with HAW. This work raises attention to the performance on low-degree nodes in link prediction. In the future, we will explore data augmentation methods to further improve the performance on low-degree nodes.

Acknowledgement. This work was supported in part by programs No. XDC02050200 and No. E110101114.

References 1. Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1–7), 107–117 (1998) 2. Cai, L., Ji, S.: A multi-scale approach for graph link prediction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 3308–3315 (2020) 3. Cai, L., Li, J., Wang, J., Ji, S.: Line graph neural networks for link prediction. IEEE Trans. Pattern Anal. Mach. Intell. 44(9), 5103–5113 (2021) 4. Chen, X., et al.: Imagine by reasoning: a reasoning-based implicit semantic data augmentation for long-tailed classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 356–364 (2022) 5. Craven, M., et al.: Learning to extract symbolic knowledge from the world wide web. In: AAAI/IAAI, pp. 509–516 (1998)


6. Giles, C.L., Bollacker, K.D., Lawrence, S.: Citeseer: an automatic citation indexing system. In: Proceedings of the third ACM Conference on Digital Libraries, pp. 89– 98 (1998) 7. Guo, Y., Gu, X., Wang, Z., Fan, H., Li, B., Wang, W.: Rcs: an attributed community search approach based on representation learning. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2021) 8. Guo, Z., Wang, F., Yao, K., Liang, J., Wang, Z.: Multi-scale variational graph autoencoder for link prediction. In: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, pp. 334–342 (2022) 9. Jeh, G., Widom, J.: Simrank: a measure of structural-context similarity. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 538–543 (2002) 10. Katz, L.: A new status index derived from sociometric analysis. Psychometrika 18(1), 39–43 (1953) 11. Kipf, T.N., Welling, M.: Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016) 12. Van der Maaten, L., Hinton, G.: Visualizing data using t-sne. J. Mach. Learn. Res. 9(11), 2579–2605 (2008) 13. McCallum, A.K., Nigam, K., Rennie, J., Seymore, K.: Automating the construction of internet portals with machine learning. Inf. Retr. 3, 127–163 (2000) 14. Namata, G., London, B., Getoor, L., Huang, B., Edu, U.: Query-driven active surveying for collective classification. In: 10th International Workshop on Mining and Learning with Graphs, vol. 8, pp. 1–8 (2012) 15. Pan, L., Shi, C., Dokmanić, I.: Neural link prediction with walk pooling. In: International Conference on Learning Representations (2022) 16. Pan, S., Hu, R., Long, G., Jiang, J., Yao, L., Zhang, C.: Adversarially regularized graph autoencoder for graph embedding. In: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, pp. 2609–2615 (2018) 17. Qian, M., Gu, X., Chu, L., Dai, F., Fan, H., Li, B.: Flexible order aware sequential recommendation. In: Proceedings of the 2022 International Conference on Multimedia Retrieval, pp. 109–117 (2022) 18. Stanfield, Z., Coşkun, M., Koyutürk, M.: Drug response prediction as a link prediction problem. Sci. Rep. 7(1), 1–13 (2017) 19. Wang, Z., Tan, Y., Zhang, M.: Graph-based recommendation on social networks. In: Proceedings of the 12th Asia-Pacific Web Conference, pp. 116–122. IEEE (2010) 20. Wu, X., Cheng, Q.: Stabilizing and enhancing link prediction through deepened graph auto-encoders. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence. vol. 2022, pp. 3587–3593. NIH Public Access (2022) 21. Yang, Y., Gu, X., Fan, H., Li, B., Wang, W.: Multi-granularity evolution network for dynamic link prediction. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 393–405. Springer, Heidelberg (2022). https://doi.org/10. 1007/978-3-031-05933-9_31 22. Zhang, M., Chen, Y.: Link prediction based on graph neural networks. Adv. Neural Inf. Process. Syst. 31, 5171–5181 (2018) 23. Zhou, T., Lü, L., Zhang, Y.C.: Predicting missing links via local information. Eur. Phys. J. B 71, 623–630 (2009) 24. Zitnik, M., Leskovec, J.: Predicting multicellular function through multi-layer tissue networks. Bioinformatics 33(14), i190–i198 (2017)

6D Object Pose Estimation with Attention Aware Bi-gated Fusion

Laichao Wang1,2, Weiding Lu1,2, Yuan Tian3, Yong Guan1,2, Zhenzhou Shao1,2(B), and Zhiping Shi1

1 College of Information Engineering, Capital Normal University, Beijing 100048, China
{2211002004,2211002065,guanyong,shizp}@cnu.edu.cn
2 Beijing Key Laboratory of Light Industrial Robot and Safety Verification, Capital Normal University, Beijing 100048, China
[email protected]
3 Industrial and Commercial Bank of China Limited Beijing Branch, Beijing 100032, China
[email protected]

Abstract. Accurate object pose estimation is a prerequisite for successful robotic grasping tasks. Currently, keypoint-based pose estimation methods using RGB-D data have shown promising results in simple environments. However, how to fuse the complementary features from RGB-D data is still a challenging task. To this end, this paper proposes a two-branch network with an attention aware bi-gated fusion (A2BF) module for keypoint-based 6D object pose estimation, named A2BNet for short. The A2BF module consists of two key components, a bidirectional gated fusion module and an attention mechanism module, to effectively extract information from both RGB and point cloud data, prioritizing crucial details while disregarding irrelevant information. Several A2BF modules can be embedded in the network to generate complementary texture and geometric information. Extensive experiments are conducted on the public LineMOD and Occlusion LineMOD datasets. The experimental results demonstrate that the average accuracy of the proposed method on the two datasets reaches 99.8% and 67.6%, respectively, outperforming the state-of-the-art methods.

Keywords: Object pose estimation · Gated fusion · Attention mechanism

1 Introduction

Robotic grasping in 3D scenes is a fundamental task that has a wide range of applications, including object sorting, palletizing, material handling, and object manipulation [1]. Accurate 6D pose estimation of objects is one of the prerequisites for successful robotic grasping. Although numerous methods have been developed to address the problem of robotic grasping, the complex environment in the real world often poses a great challenge to achieving high accuracy in 6D pose estimation. For example, changes in lighting conditions, background interference, lack of textures on objects, and occlusion usually reduce the performance of 6D pose estimation significantly.

Supported by the Natural Science Foundation of China (62272322, 62002246, 62272323), the Project of Beijing Municipal Education Commission (KM202010028010), and the Applied Basic Research Project of Liaoning Province (2022JH2/101300279).

The research on pose estimation started with methods based on RGB images. In recent years, with the significant development of deep learning techniques, researchers have used convolutional neural networks (CNNs) to address the challenges caused by the limitations of RGB images, such as sensor noise, changes in lighting conditions, and background interference [2–4]. However, compared to RGB-D-based methods, these methods have inferior pose estimation accuracy due to the absence of geometric information. RGB-D-based pose estimation methods have therefore attracted attention recently [5,6,16,17]. Generally, RGB-D based 6D pose estimation methods are mainly divided into two categories: end-to-end methods and keypoint detection methods. The former have a higher computational cost, since they need to handle the decoupling of rotation and translation and directly regress both of them [2,5,6]. The latter, based on regressing keypoints, have a lower computational cost compared to the end-to-end approach and also perform well in challenging conditions such as varying lighting and object occlusions [3,6,8,10]. Therefore, in this paper, we develop a keypoint-based pose estimation method.

How to integrate the characteristics and advantages of the features extracted from the two modalities is still a noteworthy question for RGB-D based methods. Current fusion strategies used in RGB-D based pose estimation methods rely heavily on direct concatenation or nearest neighbor-based fusion, which still leaves room for improvement [3,5]. Since multi-modality features may contribute differently to the performance gain, focusing on the crucial information of RGB and depth features during fusion may have the potential to further improve the accuracy of 6D pose estimation.

To address the above-mentioned issues, we propose an attention-aware bi-gated fusion network (A2BNet) to extract representative features from RGB and point cloud images. The flowchart of the proposed method is shown in Fig. 1. The proposed method uses a bidirectional feature fusion module to selectively fuse texture and geometric information using gate fusion. In addition, a dual-path attention feature extraction module is designed to further enhance the effectiveness of feature extraction. In summary, the main contributions of the proposed method are three-fold:

• We propose a distinctive gate mechanism. This method selectively integrates texture and geometric features from both RGB and point cloud data, resulting in a unique feature fusion. This novel fusion approach not only enhances the accuracy of pose estimation but also underscores our innovative contribution in information integration.


Fig. 1. The flowchart of the proposed 6D pose estimation method based on attention-aware bi-gated fusion (A2BNet).

• In the feature extraction stage, a simple and efficient feature selection method is proposed based on an attention mechanism. The network powered by the attention mechanism tends to extract useful and informative features while disregarding redundant ones, improving both feature extraction efficiency and subsequent pose estimation accuracy.
• To validate the feasibility of our approach, we conduct experiments on the LineMOD dataset and the Occlusion LineMOD dataset. The experimental results demonstrate the advantages of our method over existing state-of-the-art approaches.

2 Related Work

This section introduces two groups of methods for pose estimation, i.e., the end-to-end one-stage methods based on the holistic approach and the two-stage methods based on keypoint detection. The holistic approach usually uses an end-to-end structure where the network directly outputs predictions for the rotation and translation of the target object. The keypoint detection approach usually first predicts the keypoints of the target object with the network, and then estimates the rotation and translation with algorithms such as least squares.

2.1 End-to-End One-Stage Approach

The one-stage methods exhibit faster processing speed; however, their accuracy is relatively lower, especially when detecting small objects and occluded objects, where the detection performance is unsatisfactory. CDPN [14] disentangles the pose by separately predicting rotation and translation, addressing the issue of inconsistent translation accuracy when facing different objects. PoseCNN [2] introduces a new loss function for symmetric objects and performs well in handling occlusion and symmetric objects. DeepIM [21] utilizes a separate 3D position and 3D orientation representation to predict the relative pose transformation. In recent years, methods such as G2L-Net [9] extract coarse point clouds from RGB-D images, incorporate the point clouds into the network for 3D instance segmentation and object translation prediction, and then convert the refined point clouds into local canonical coordinates for object rotation prediction.

2.2 Keypoint-Based Two-Stage Approach

The two-stage methods, although relatively slower in processing speed, demonstrate higher accuracy. Additionally, they exhibit fewer false positives and false negatives, resulting in a reduced number of both false detections and missed detections. Hu et al. [8] proposed a method which performs better in scenarios where objects are occluded. PVNet [3] predicts the direction from each pixel to each keypoint. Even if a keypoint is not visible, it can be located using information from another visible part, which helps in predicting the pose of occluded objects. DenseFusion [5] fuses 2D images with 3D point cloud information to obtain more accurate and robust object pose estimation results. HybridPose [19] employs a network to predict different intermediate representations and utilizes a regression module to filter out outliers in the predicted intermediate representations. CRT-6D [7] employs a lightweight deformable Transformer that is connected iteratively to optimize the proposed pose estimates on sampled OSKFs (Object Spatial Keypoint Features). PVN3D [6] is built upon the DenseFusion network; predicting directions from pixels to keypoints enhances local features and significantly improves the accuracy of 6D pose estimation. PR-GCN [16] combines point refinement networks and multimodal fusion graph convolutional networks to address ineffective representations of depth data and insufficient integration of different modalities. PAV-Net [17] can effectively mitigate external interference and enhance the efficiency of utilizing influential point features through point-wise attention voting. FFB6D [10] proposes a bidirectional fusion strategy based on neighboring points and pixels. It performs bidirectional fusion in each convolutional layer during the encoding and decoding stages.
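The final step shared by these two-stage pipelines, recovering R and t from predicted 3D keypoints by least squares, can be sketched as follows (our illustration using the standard SVD-based rigid alignment, not code from any cited method).

import numpy as np

def pose_from_keypoints(model_kps, pred_kps):
    # Least-squares rigid alignment: find R, t such that pred ≈ R @ model + t.
    # model_kps, pred_kps: (K, 3) arrays of corresponding 3D keypoints.
    mu_m = model_kps.mean(axis=0)
    mu_p = pred_kps.mean(axis=0)
    H = (model_kps - mu_m).T @ (pred_kps - mu_p)   # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    R = Vt.T @ U.T
    if np.linalg.det(R) < 0:                        # guard against reflections
        Vt[-1] *= -1
        R = Vt.T @ U.T
    t = mu_p - R @ mu_m
    return R, t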

3 Method

3.1 Overview

The proposed method, outlined in Fig. 2a, involves two branches. The first branch utilizes ResNet-34 [11] and PSPNet [12] to build the encoder and decoder, respectively, extracting features from RGB images. The second branch, employing RandLA-Net [13], constructs both the encoder and decoder for the depth image representation. A2BF modules are incorporated within the convolutional layers (as depicted in Fig. 2a) to enrich the fusion process. The extracted texture and geometric features are then combined and passed to the pose estimation module for generating 3D object keypoints. Finally, the object's rotation and translation transformations are computed through the least squares method for pose estimation.

Fig. 2. Overview of A2BNet and A2BF module.

The A2BF module integrates a self-attention module and a gate fusion module, as shown in Fig. 2b. The self-attention module computes attention weights for both RGB and point cloud branches, highlighting critical information across modalities for performance enhancement. The feature fusion module retains complementary details while excluding irrelevant data. Subsequently, features from the gate fusion module proceed to the next feature layer. The following sections detail the two A2BF sub-modules.

3.2 Self-attention Mechanism Module

This module utilizes a self-attention mechanism to explore the correlations between feature vectors in both RGB and point cloud data. By examining the relevance among these vectors, it identifies the texture and geometry features that are worth focusing on during the feature extraction stage. This enables the module to perform feature selection from both RGB and point cloud data, and the selected features then serve as the inputs to the subsequent feature fusion module.

Fig. 3. Self-attention module in RGB and point cloud branches. Input features of dimensions 640 × 480 × C in the RGB branch (1 × 12288 × C in the point cloud branch) are transformed into 307,200 tokens with C channels. Self-attention computes attention scores for each feature vector. Subsequently, features are resized back to 640 × 480 × C (1 × 12288 × C).

As shown in Fig. 3, for RGB data, the module processes the input tensor into a sequence consisting of discrete tokens, each represented by a feature vector. This module represents the input sequence of RGB data as F^in_rgb ∈ R^{N×D_frgb}, where F^in_rgb denotes the input sequence, N is the number of tokens in the sequence, and each token is represented by a feature vector of dimensionality D_frgb. The module uses linear projections to compute a set of queries (Q), keys (K), and values (V):

$$Q_r = F^{in}_{rgb} M^q, \quad K_r = F^{in}_{rgb} M^k, \quad V_r = F^{in}_{rgb} M^v, \qquad (1)$$

where M^q ∈ R^{D_frgb×D_q}, M^k ∈ R^{D_frgb×D_q}, and M^v ∈ R^{D_frgb×D_q} are the weight matrices. Next, the attention weights for the texture feature vectors are computed using the scaled dot product between Q_r and K_r. The similarity between Q_r and K_r is obtained by multiplying Q_r by the transpose of K_r; the resulting score vector is divided by the scalar √D_k and mapped through a softmax function to obtain the similarity scores of Q_r with respect to K_r. Finally, the scores are multiplied by V_r, resulting in the attention score vector A_r:

$$A_r = \mathrm{softmax}\!\left(\frac{Q_r K_r^{T}}{\sqrt{D_k}}\right) V_r. \qquad (2)$$

Finally, the network utilizes a multi-layer perceptron (MLP) with non-linear transformations to compute the output features F^out_rgb. The size of the output features is the same as that of the input features:

$$F^{out}_{rgb} = \mathrm{MLP}(A_r) + F^{in}_{rgb}. \qquad (3)$$
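A compact sketch of Eqs. (1)-(3) is given below (our PyTorch-style illustration; the layer sizes and the MLP shape are assumptions rather than the authors' exact implementation).

import torch
import torch.nn as nn

class TokenSelfAttention(nn.Module):
    # Single-head self-attention over a token sequence, following Eqs. (1)-(3).
    def __init__(self, dim_in, dim_qk):
        super().__init__()
        self.q = nn.Linear(dim_in, dim_qk, bias=False)   # M^q
        self.k = nn.Linear(dim_in, dim_qk, bias=False)   # M^k
        self.v = nn.Linear(dim_in, dim_qk, bias=False)   # M^v
        self.mlp = nn.Sequential(nn.Linear(dim_qk, dim_in), nn.ReLU(),
                                 nn.Linear(dim_in, dim_in))

    def forward(self, f_in):                             # f_in: (N, dim_in) tokens
        q, k, v = self.q(f_in), self.k(f_in), self.v(f_in)
        scores = torch.softmax(q @ k.t() / k.shape[-1] ** 0.5, dim=-1)  # Eq. (2)
        a = scores @ v                                   # apply the scores to V
        return self.mlp(a) + f_in                        # Eq. (3), residual output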

Similarly, this module utilizes the attention mechanism to calculate the correlations within the point cloud data that contain geometric information, resulting in F^out_point.

3.3 Gate Fusion Module

After using the self-attention module to select the texture and geometric features that are worth attending to from both modalities, this section employs a gate fusion module to fuse the extracted features from the two modalities. The proposed gate fusion module is applied simultaneously to the RGB and point cloud branches, named the Point-to-RGB (P2R) fusion module and the RGB-to-Point (R2P) fusion module, as shown in Fig. 4. They have identical structures with the same number of channels, denoted as C, but different feature sizes.


These two modules utilize reset and update gates to control the information filtering process. To avoid neglecting the importance of texture and geometric information in the gating information, a gating state weight calculation step is proposed in this paper, which introduces weight learning and error calculation and enables more accurate filtering and fusion of information by the reset gate and update gate:

$$r = \sigma\big(W_r(U_{rgb} \odot f^{rgb} + U_{point} \odot f^{p2r} + b)\big), \qquad (4)$$

$$z = \sigma\big(W_z(U_{rgb} \odot f^{rgb} + U_{point} \odot f^{p2r} + b)\big), \qquad (5)$$

where U_rgb and U_point refer to the weight generation parts for f^rgb and f^point, and b represents the error calculation. Afterwards, each element in the feature tensor is transformed into a value ranging from 0 to 1 using convolutional layers and an activation function, acting as gate signals. To retain the memory information f^rgb from RGB and concatenate it with the other input feature f^point, a convolution operation is performed, followed by an activation function, to obtain the current candidate feature. The element values of the candidate feature range from -1 to 1:

$$f^{f} = \tanh\big(W_h(f^{p2r}, f^{rgb})\big). \qquad (6)$$

Subsequently, the proposed method uses the update gate to filter and update the relevant information: z ⊙ f^rgb represents how much information extracted from the RGB data is retained, while (1 − z) ⊙ f^f represents how much of the current candidate information f^f should be added. (1 − z) acts as a selection of information from f^f, or equivalently as the forgetting of unimportant information in f^f, resulting in the feature selection outcome:

$$F^{rgb} = (1-z) \odot f^{f} + z \odot f^{rgb}. \qquad (7)$$

Similarly, for the depth modality, the feature fusion module for geometric features is used to obtain F^point for subsequent feature extraction.
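The P2R direction of the bi-gated fusion (Eqs. (4)-(7)) might be implemented along the following lines (our sketch; the 1x1 convolutions over concatenated features and the use of the reset gate on f_rgb inside the candidate, as in a convolutional GRU, are assumptions, since the exact layer layout is not spelled out above).

import torch
import torch.nn as nn

class GatedFusionP2R(nn.Module):
    # Point-to-RGB gated fusion with reset/update gates, Eqs. (4)-(7).
    def __init__(self, channels):
        super().__init__()
        self.w_r = nn.Conv2d(2 * channels, channels, 1)   # reset gate, Eq. (4)
        self.w_z = nn.Conv2d(2 * channels, channels, 1)   # update gate, Eq. (5)
        self.w_h = nn.Conv2d(2 * channels, channels, 1)   # candidate, Eq. (6)

    def forward(self, f_rgb, f_p2r):                      # both: (B, C, H, W)
        x = torch.cat([f_rgb, f_p2r], dim=1)
        r = torch.sigmoid(self.w_r(x))                    # Eq. (4)
        z = torch.sigmoid(self.w_z(x))                    # Eq. (5)
        cand = torch.tanh(self.w_h(torch.cat([f_p2r, r * f_rgb], dim=1)))  # Eq. (6)
        return (1 - z) * cand + z * f_rgb                 # Eq. (7)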

4 Experimental Results and Discussion

Fig. 4. Bi-gated fusion modules for RGB and point cloud features.

In this section, two sets of experiments are conducted to verify the proposed method A2BNet. In the first set of experiments, A2BNet is compared with the state-of-the-art methods, and the second experiment is an ablation study to evaluate the effects of the attention mechanism and bi-gated fusion for 6D object pose estimation. Two public benchmark datasets, LineMOD and Occlusion LineMOD, are used in the experiments. To achieve good accuracy on the LineMOD dataset with 13 classes of objects, a number of difficulties have to be overcome, i.e., pose estimation for textureless objects, cluttered backgrounds, and varying lighting conditions. The Occlusion LineMOD dataset is derived from the LineMOD dataset, including 8 classes of objects with severe occlusion in the scenes. Since the training data do not include instances with occlusion, testing on the Occlusion LineMOD dataset poses a great challenge. We split the training and testing sets following previous works [2,3] and generate synthetic images for network training [3,6]. ADD [11] is used as the evaluation metric; it is calculated by taking the average distance between corresponding points on the object's vertices under the predicted and ground-truth poses, considering the pose transformation from the camera coordinate system to the object coordinate system. In particular, the metric ADD-0.1d is adopted, which regards a prediction as accurate if the average vertex error is within 10% of the object's diameter. During network training, both synthetic data and fusion data are used as input for auxiliary training. The batch size is set to 1 during training and testing, and the number of iterations is 25. The RGB images and depth images in the dataset have a size of 640 × 480. Prior to training, the depth images are processed into point cloud data of size 1 × 12,288 using the camera intrinsic parameters, which are then combined with RGB data of size 640 × 480 as inputs to the model. The training strategy for optimization is stochastic gradient descent.
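The ADD metric and the ADD-0.1d accuracy described above can be computed as in this sketch (ours; symmetric objects would use the closest-point variant ADD-S, which is omitted here).

import numpy as np

def add_metric(model_pts, R_gt, t_gt, R_pred, t_pred):
    # Average distance between model points under the GT and predicted poses.
    gt = model_pts @ R_gt.T + t_gt
    pred = model_pts @ R_pred.T + t_pred
    return np.linalg.norm(gt - pred, axis=1).mean()

def add_01d_accuracy(add_values, diameters):
    # ADD-0.1d: fraction of test frames whose ADD is below 10% of the diameter.
    add_values = np.asarray(add_values)
    diameters = np.asarray(diameters)
    return float(np.mean(add_values < 0.1 * diameters))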

4.1 Comparison with the State-of-the-art Methods on Two Benchmark Datasets

Comparison on LineMOD Dataset. In the experiments, A2BNet is trained using the LineMOD dataset only and tested on both the LineMOD and Occlusion LineMOD datasets. Table 1 shows the testing results on the LineMOD dataset, compared with PoseCNN+DeepIM [21], PVNet [3], CDPN [14], DPOD [4], PointFusion [15], DenseFusion [5], G2L-Net [9], PVN3D [6], FFB6D [10], PR-GCN [16] and PAV-Net [17]. The average accuracy of A2BNet reaches 99.8%, outperforming the above methods, and ADD-0.1d for all 13 objects is greater than 99%. The proposed method performs the best on 7 classes of objects. For the remaining 6 classes, the results are acceptable, with a tiny difference of less than 0.3% compared to the best-performing network.

Comparison on Occlusion LineMOD Dataset. The experimental results on the Occlusion LineMOD dataset are shown in Table 2, compared with state-of-the-art methods. The accuracy of pose estimation is significantly lower compared to the results on the LineMOD dataset, indicating that the Occlusion LineMOD dataset is much more challenging. Specifically, A2BNet performs the best on 4 out of 8 classes in the dataset. The pose estimation of the object ape demonstrates a significant improvement of 25.2%. For the remaining 4 classes, the results are not significantly inferior. The overall average accuracy reaches 67.6%, showing a promising improvement of 1.3% compared to the current state-of-the-art pose estimation method.

In summary, our network performs well in pose estimation accuracy on the LineMOD and Occlusion LineMOD datasets. Specifically, our network demonstrates competitive performance in estimating the poses of objects with low texture and irregular shapes. It also performs well for large-sized and irregular-shaped objects, even in occluded scenarios. However, it shows limitations in estimating the poses of textureless and regular-shaped objects in occluded scenarios, where occlusion can hinder the selection of reliable keypoints.

4.2 Ablation Study

To validate the effectiveness of the two sub-modules in the proposed A2BF module, the object ape is chosen from the LineMOD dataset, and the following four pose estimation networks are compared:

• Pose estimation utilizing only the backbone network.
• Pose estimation employing the dual self-attention mechanism.
• Pose estimation incorporating the bidirectional gated fusion module into the network.
• Pose estimation using the complete A2BF module (Attention + Bi-Gated).

From Table 3, it can be observed that the inclusion of the attention mechanism module improved the performance of the pose estimation network, resulting in a 2.1% increase in accuracy for object pose detection. Furthermore, compared to recent 6D pose estimation networks, it still achieved satisfactory results.

Table 1. Comparison results with state-of-the-art methods on the LineMOD dataset. The first four methods (PoseCNN+DeepIM [21], PVNet [3], CDPN [14], DPOD [4]) take RGB input; the remaining methods (PointFusion [15], DenseFusion [5], G2L-Net [9], PVN3D [6], FFB6D [10], PR-GCN [16], PAV-Net [17], Ours) take RGB-D input. Columns follow this order.

Object        PoseCNN+DeepIM  PVNet  CDPN  DPOD  PointFusion  DenseFusion  G2L-Net  PVN3D  FFB6D  PR-GCN  PAV-Net  Ours
ape           77.0   43.6   64.4   87.7   70.4   92.3   96.8   97.3   98.4   99.2   -     100.0
benchvise     97.5   99.9   97.8   98.5   80.7   93.2   96.1   99.7   100.0  99.8   -     100.0
camera        93.5   86.9   91.7   96.1   60.8   94.4   98.2   99.6   99.9   100.0  -     99.9
can           96.5   95.5   95.9   99.7   61.1   93.1   98.0   99.5   99.8   99.4   -     100.0
cat           82.1   79.3   83.8   94.7   79.1   96.5   99.2   99.8   99.9   99.8   -     100.0
driller       95.0   96.4   96.2   98.8   47.3   87.0   99.8   99.3   100.0  99.8   -     99.7
duck          77.7   52.6   66.8   86.3   63.0   92.3   97.7   98.2   98.4   98.7   -     98.4
eggbox        97.1   99.2   99.7   99.9   99.9   99.8   100.0  99.8   100.0  99.6   -     100.0
glue          99.4   95.7   99.6   96.8   99.3   100.0  100.0  100.0  100.0  100.0  -     99.8
holepuncher   52.8   82.0   85.8   86.9   71.8   92.1   99.0   99.9   99.8   99.8   -     99.9
iron          98.3   98.9   97.9   100    83.2   97.0   99.3   99.7   99.9   99.5   -     100.0
lamp          97.5   99.3   97.9   96.8   62.3   95.3   99.5   99.8   99.9   100.0  -     99.9
phone         87.7   92.4   90.8   94.7   78.8   92.8   98.9   99.5   99.7   99.7   -     99.6
Avg           88.6   86.3   89.9   95.2   73.7   94.3   98.7   99.4   99.7   99.6   99.3  99.8


Table 2. Comparison results with state-of-the-art methods on the Occlusion LineMOD dataset. Columns, left to right: PoseCNN [2], Hu et al. [8], Pix2Pose [18], PVNet [3], DPOD [4], Hu et al. [20], HybridPose [19], PVN3D [6], FFB6D [10], PR-GCN [16], CRT-6D [7], Ours.

Object        PoseCNN  Hu [8]  Pix2Pose  PVNet  DPOD  Hu [20]  HybridPose  PVN3D  FFB6D  PR-GCN  CRT-6D  Ours
ape           9.6    12.1   22.0   15.8   -      19.2   20.9   33.9   47.2   40.2   53.4   78.6
can           45.2   39.9   44.7   63.3   -      65.1   75.3   88.6   85.2   76.2   92.0   93.3
cat           0.9    8.2    22.7   16.7   -      18.9   24.9   39.1   45.7   57.0   42.0   43.6
driller       41.1   45.5   44.7   65.7   -      69.0   70.2   78.4   81.4   82.3   81.4   82.5
duck          19.6   17.2   15.0   25.2   -      25.3   27.9   41.9   53.9   30.0   44.9   55.0
eggbox        22.0   22.1   25.2   50.2   -      52.0   52.4   80.9   70.2   68.2   62.7   49.6
glue          38.5   35.8   32.4   49.6   -      51.4   53.8   68.1   60.1   67.0   80.2   54.9
holepuncher   22.1   36.0   49.5   39.7   -      45.6   54.2   74.7   85.9   97.2   74.3   83.7
Avg           24.9   27.0   32.0   40.8   47.3   43.3   47.5   63.2   66.2   65.0   66.3   67.6

This demonstrates that the method of modifying the weights of feature maps to focus on important information while ignoring non-essential information can extract data features more efficiently and thereby improve pose estimation accuracy. The pose estimation network incorporating both the dual self-attention mechanism and the bidirectional gate fusion module achieved the optimal pose estimation results. It exhibited a 6.5% improvement in pose estimation accuracy for the ape object compared to the backbone network without the modules.

The effectiveness of pose estimation heavily depends on the ability of the extracted feature maps to facilitate the selection of reliable keypoints. By incorporating A2BF modules, our proposed network captures relevant information and enhances the quality of keypoints, thus improving the accuracy of pose estimation, particularly in the presence of occlusions. As shown in Fig. 5, it is obvious that the network with the A2BF modules can accurately estimate the poses of occluded objects, while the network without these modules exhibits significant rotational prediction errors when dealing with occlusions.

Table 3. The efficiency of each component in the A2BF module (ADD-0.1d on the ape object).

Attention   Bi-Gated Fusion   ADD-0.1d
                              93.5
✓                             95.6
            ✓                 97.2
✓           ✓                 100


Fig. 5. Visual comparison of A2BF module.

5 Conclusion

This paper introduces an attention-aware bi-gated fusion approach that achieves a balance between accuracy and computational complexity through a two-stage keypoint-based pose estimation backbone. It features two branches for texture features from RGB images and geometric features from point cloud data. A gated feature fusion module enhances object pose estimation accuracy by selectively filtering and fusing features, exploiting the complementarity of texture and geometry. Furthermore, an attention module strengthens the network representation by emphasizing the important data in the scene. Experiments confirm the performance improvement of the method, albeit with limitations. Future directions include reducing label dependency via unsupervised 6D pose networks and integrating few-shot learning into 6D pose estimation while minimizing the use of training data.

References 1. Zhang, H., Tang, J., Sun, S., et al.: Robotic grasping from classical to modern: a survey. arXiv preprint arXiv:2202.03631 (2022) 2. Xiang, Y., Schmidt, T., Narayanan, V., Fox, D.: Posecnn: a convolutional neural network for 6d object pose estimation in cluttered scenes. arXiv preprint arXiv:1711.00199 (2017) 3. Peng, S., Liu, Y., Huang, Q., Zhou, X., Bao, H.: Pvnet: pixel-wise voting network for 6dof pose estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4561–4570 (2019) 4. Zakharov, S., Shugurov, I., Ilic, S.: Dpod: dense 6d pose object detector in RGB images. arXiv preprint arXiv:1902.11020 (2019)


5. Wang, C., et al.: DenseFusion: 6D object pose estimation by iterative dense fusion. In: Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019, pp. 3338–3347 (2019) 6. He, Y., Sun, W., Huang, H., Liu, J., Fan, H., Sun, J.: PVN3D: a deep point-wise 3D keypoints voting network for 6DoF pose estimation. In: Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020, pp. 11629–11638 (2020) 7. Castro, P., Kim, T.K.: CRT-6D: fast 6D object pose estimation with cascaded refinement transformers. In: 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 5735–5744 (2022) 8. Hu, Y., Hugonot, J., Fua, P., Salzmann, M.: Segmentation-driven 6D object pose estimation. In: Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019, pp. 3380–3389 (2019) 9. Chen, W., Jia, X., Chang, H.J., Duan, J., Leonardis, A.: G2L-net: global to local network for real time 6D pose estimation with embedding vector features. In: Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020, pp. 4232–4241 (2020) 10. He, Y., Huang, H., Fan, H., Chen, Q., Sun, J.: FFB6D: a full flow bidirectional fusion network for 6D pose estimation. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Virtual, 19–25 June 2021, pp. 3002–3012 (2021) 11. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) 12. Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2881–2890 (2017) 13. Hu, Q., et al. Randla-net: efficient semantic segmentation of large-scale point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11108–11117 (2020) 14. Li, Z., Wang, G., Ji, X.: CDPN: coordinates-based disentangled pose network for real-time rgb-based 6-dof object pose estimation. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 7678–7687 (2019) 15. Xu, D., Anguelov, D., Jain, A.: Pointfusion: deep sensor fusion for 3d bounding box estimation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 244–253 (2018) 16. Zhou, G., Wang, H., Chen, J., Huang, D.: PR-GCN: a deep graph convolutional network with point refinement for 6D pose estimation. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, pp. 2773– 2782 (2021). https://doi.org/10.1109/ICCV48922.2021.00279 17. Huang, J., Xia, C., Liu, H., Liang, B.: PAV-Net: point-wise attention keypoints voting network for real-time 6D object pose estimation. In: 2022 International Joint Conference on Neural Networks (IJCNN), Padua, Italy, pp. 1–8 (2022). https:// doi.org/10.1109/IJCNN55064.2022.9892089 18. Park, K., Patten, T., Vincze, M.: Pix2Pose: pixel-wise coordinate regression of objects for 6D pose estimation. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), pp. 7667–7676 (2019). https:// doi.org/10.1109/ICCV.2019.00776

6D Object Pose Estimation with Attention Aware Bi-gated Fusion

585

19. Song, C., Song, J., Huang, Q.: Hybridpose: 6d object pose estimation under hybrid representations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 431–440 (2020) 20. Hu, Y., Fua, P., Wang, W., Salzmann, M.: Single-stage 6d object pose estimation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2930–2939 (2020) 21. Li, Y., Wang, G., Ji, X., et al.: Deepim: deep iterative matching for 6d pose estimation. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 683–698 (2018)

Author Index

A
Ajay, Aravind 425

B
Bai, Hexiang 332
Bai, Liang 244
Bhuyan, Monowar 425
Bi, Yang 166

C
Cai, Ruichu 219
Chan, Patrick P. K. 494
Chen, C. L. Philip 108
Chen, Haoran 204
Chen, Hua 166
Chen, Jianxia 388
Chen, Kesheng 27
Chen, Pak Hen 305
Chen, Peng 466
Chen, Wei 219
Chen, Zefeng 482
Chen, Zhuohuang 494
Cheng, Ziwen 531

D
Dai, Wei 95
Das, Anindya Sundar 425
Ding, Bo 358
Ding, Yanru 95
Ding, Zhe 14
Dong, Shi 388
Dong, Zhiyan 290
Dongye, Changlei 506
Du, Hangyuan 244

F
Fan, Haihui 561
Fang, Xiao 3
Feng, Dawei 358
Feng, Jianzhou 452
Fu, Qiang 519

G
Gao, Xizhan 275
Ge, Yao 81
Gong, Zhengwen 204
Gu, Xiaoyan 561
Gu, Ziyin 439
Guan, Yong 573
Guo, Zhongyi 81

H
Han, Keji 81
Han, Peiyi 27
Hao, Jialin 317
Hao, Zhifeng 219
He, Hao 439
He, Qi 119
He, Zhimin 494
Hu, Kai 550
Huang, Lei 452
Huang, Tingwen 3
Huang, Wenqi 346
Huang, Zhigang 401
Huo, Jun 332

J
Ji, Wei 81
Jiang, Linling 147
Jiang, Shujuan 95
Jiang, Yan 191
Jin, Fusheng 317
Jin, Hu 257
Jin, Jiong 14

K
Kang, Xiatao 231
Ke, Jianpeng 68

L
Lai, Kunhao 68
Lai, Lingyue 166
Lai, Weng Kin 305
Lee, Sheng Siang 305
Li, Bo 561
Li, Chuanzhaung 178
Li, Hengji 55
Li, Shu 388
Li, Tieshan 108
Li, Weibing 550
Li, Wenjuan 257
Li, Xinyu 108
Li, Yi 439
Li, Ying 346
Li, Yingjie 95
Li, Yun 81
Li, Zehong 178
Li, Ziwei 561
Liang, Xiubo 257
Lin, Liangjiang 482
Liu, Changda 373
Liu, Quan 401
Liu, Ryan Wen 413
Liu, Shuhong 204
Liu, Yao 166
Liu, Yi 531
Liu, Yuan 244
Liu, Ziang 466
Lu, Weiding 573
Lu, Yuchen 413
Luo, Shixian 191
Luo, Wenjian 27
Luo, Xiwen 519
Luo, Yongkang 27
Lv, Pin 413
Lv, Yining 55
Lv, Yisheng 413

M
Mao, Lei 388
Meng, Dan 561
Mo, Shu 550

N
Ni, Wenlong 166, 178
Ning, Nianwen 55
Niu, Sijie 275
Niu, Yuhang 131

P
Pan, Yongping 550
Pan, Yongqi 531

Q
Qin, Sheng 519

R
Ren, Jianxiang 3

S
Saha, Sriparna 425
Shao, Peng 42
Shao, Zhenzhou 573
Shi, Guangming 131
Shi, Meilong 401
Shi, Zhiping 573
Si, Haoying 388
Situ, Haozhen 494
Song, Jingge 119
Song, Linqi 27
Sun, Liguo 413

T
Tan, Wei Hang 305
Tang, Maolin 14
Tay, Lee Choo 305
Tian, Yuan 573
Tian, Yu-Chu 14

W
Wan, Yuanyu 346
Wang, Feng 119
Wang, Hui 332
Wang, Kaiyang 519
Wang, Laichao 573
Wang, Li 55
Wang, Lina 68
Wang, Peng 178
Wang, Qin 452
Wang, Qingjie 275
Wang, Wenjian 244
Wang, Wentao 42
Wang, You-Gan 14
Wu, Chao 531
Wu, Jiaying 231
Wu, Xinyun 388
Wu, Yan 191

X
Xiao, Jingying 231
Xiao, Liang 388
Xie, Xuemei 131
Xie, Zaipeng 466
Xu, Ganlin 452
Xu, Yunfeng 373
Xue, Shaocong 27

Y
Yan, Zhe 290
Yang, Dingkang 290
Yang, Jianpeng 131
Yang, Minhu 332
Yang, Xu 413
Yao, Jiayi 231
Ye, Chen 42
Yu, Fangchao 68
Yu, Sheng 204
Yu, Zu-Guo 14
Yuan, Guan 95

Z
Zeng, Bo 68
Zhang, Caiming 147
Zhang, Chi 317
Zhang, Fan 147
Zhang, Fei 494
Zhang, Jianan 466
Zhang, Lihua 290
Zhang, Mingli 147
Zhang, Qifei 257
Zhang, Shaoping 42
Zhang, Shuoshuo 3
Zhang, Tianli 346
Zhang, Weizhe 14
Zhang, Xiangkai 413
Zhang, Yanmei 95
Zhang, Yanqiang 358
Zhang, Yanyu 55
Zhao, Chen 290
Zhao, Hui 275
Zhao, Liushun 531
Zhao, Wenxiu 506
Zheng, Changwen 439
Zheng, Tongya 346
Zhong, Jiakui 373
Zhou, Lei 275
Zhou, Rui 317
Zhou, Xinyu 166
Zhou, Yan 494
Zhou, Yi 55
Zhou, Yucan 561
Zhou, Yulin 257
Zhou, Yuren 482
Zhu, Cheng 531
Zhu, Haipeng 219
Zhu, Qingmeng 439
Zhu, Wei 204
Zhu, Yushi 317
Zong, Xiju 275
Zou, Ying 317
Zuo, Yi 108