Neural Information Processing: 30th International Conference, ICONIP 2023, Changsha, China, November 20–23, 2023, Proceedings, Part XV (Communications in Computer and Information Science)
ISBN: 9819981832, 9789819981830

The nine-volume set constitutes the refereed proceedings of the 30th International Conference on Neural Information Processing, ICONIP 2023, held in Changsha, China, during November 20–23, 2023.


Table of contents :
Preface
Organization
Contents – Part XV
Applications
Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation
1 Introduction
2 Related Work
2.1 2D Semantic Segmentation
2.2 3D Semantic Segmentation
2.3 3D Semantic Segmentation Based on Joint 2D-3D Data
3 Methodology
3.1 Overview
3.2 Dynamic View Selection
3.3 Unidirectional Projection Module
3.4 Feature Integration
4 Experiments
4.1 Datasets and Implementation Details
4.2 Comparison with SoTAs on ScanNetv2 Benchmark
4.3 Ablation Study and Analysis
4.4 DMF-Net on NYUv2
5 Conclusions and Future Work
References
RF-Based Drone Detection with Deep Neural Network: Review and Case Study
1 Introduction
2 Background
3 Case Study
3.1 System Overview and Experimental Setup
3.2 Results and Discussions
4 Conclusion
References
Effective Skill Learning on Vascular Robotic Systems: Combining Offline and Online Reinforcement Learning
1 Introduction
2 Problem Definition
3 Method
3.1 Rule-Based Interactions
3.2 Network Structure
3.3 Offline Pretraining
3.4 Online Fine-Tuning
4 Results
5 Conclusions
References
Exploring Efficient-Tuned Learning Audio Representation Method from BriVL
1 Introduction
2 Related Works
3 Methodology and Experiments
3.1 Dataset for WavBriVL Performance Test
3.2 Dataset for VQGAN
3.3 Feature Extraction Processing
4 Task 1: WavBriVL Performance Test
4.1 Training, Development, and Evaluation
4.2 Comparisons with Previous Work
4.3 Quantitative Image Analysis
5 Task 2: Speech Generation Picture
5.1 Sound Volume
5.2 Comparison with Previous Work
5.3 Correlation Between Sounds and Images
6 Summary and Conclusions
References
Can You Really Reason: A Novel Framework for Assessing Natural Language Reasoning Datasets and Models
1 Introduction
2 Approach
2.1 Task Formulation
2.2 Cue Metric
2.3 Aggregation Methods
2.4 Transformation of MCQs with Dynamic Choices
3 Experiment
3.1 Datasets
3.2 Quantifying Information Leakage
3.3 Cue Evaluation Methods
3.4 Comparison with Hypothesis-Only Models
3.5 Identifying Problematic Datasets
4 Related Work
5 Conclusion and Future Work
References
End-to-End Urban Autonomous Navigation with Decision Hindsight
1 Introduction
2 Related Work
2.1 State Representation in Urban Scenarios
2.2 Transformer in RL
3 Methodology
3.1 Problem Formulation
3.2 History-Feedforward State Encoding
3.3 End-to-End Learning with Decision Hindsight
4 Experiments
4.1 Simulation Setup
4.2 Result Analysis
5 Conclusion
References
Identifying Self-admitted Technical Debt with Context-Based Ladder Network
1 Introduction
2 Related Work
2.1 Ladder Network
2.2 Pre-training Model
2.3 Self-admitted Technical Debt
3 SATD-LN Model
4 Experimental Setup
4.1 Data and Evaluation Metrics
4.2 Implementation Details
4.3 Baselines
5 Results and Discussions
5.1 The Results of SATD-LN
5.2 The Performance Against the Number of Labeled Data
5.3 The Performance Against the Number of Unlabeled Data
5.4 The Effectiveness of SATD-LN Against Other Semi-supervised Learning Methods
5.5 The Effectiveness of SATD-LN Against Other Supervised Learning Methods
6 Conclusions
References
NDGR: A Noise Divide and Guided Re-labeling Framework for Distantly Supervised Relation Extraction
1 Introduction
2 Related Work
3 Methodology
3.1 Noise Division by Loss Modeling
3.2 Guided Label Generation and Exploitation
3.3 Iterative Training Algorithm
4 Experiments and Analysis
4.1 Datasets
4.2 Baseline Models
4.3 Implementation Details
4.4 Sentence-Level Evaluation
4.5 Analysing on Noisy-TACRED
4.6 Ablation Study
5 Conclusion
References
Customized Anchors Can Better Fit the Target in Siamese Tracking
1 Introduction
2 Proposed Method
2.1 Overall Framework
2.2 Target-Aware Feature Correlation Module
2.3 Customization Anchor Proposal Subnet
2.4 Training
3 Experiments
3.1 Implementation Details
3.2 Datasets and Evaluation Metrics
3.3 Ablation Experiments
3.4 Comparison with State-of-the-art Methods
4 Conclusion
References
Can We Transfer Noise Patterns? A Multi-environment Spectrum Analysis Model Using Generated Cases
1 Introduction
2 Background and Motivation
2.1 Problems in Traditional Denoising Models
2.2 Studies Related to Noise Pattern Transferring
3 Noise Patterns Transferring Model
3.1 D2D Case-Base and S2S Case-Base
3.2 S2S Cases Generating Model
3.3 Noise Patterns Transferring Model
4 Environmental Setup
4.1 Model Configurations and Runtime Environments
4.2 Datasets
4.3 Baselines
5 Experimental Results
5.1 Effects on S2S Cases Generating
5.2 Performance on Analysis Model
5.3 Analysis and Discussions
6 Conclusions
References
Progressive Supervision for Tampering Localization in Document Images
1 Introduction
2 Related Works
2.1 Natural Image Tampering Localization
2.2 Document Image Tampering Localization
3 Methodology
3.1 Multi-view Enhancement
3.2 Progressive Supervision Module
3.3 Detection Module
4 Experiments
4.1 Experimental Setting
4.2 Ablation Study
4.3 Comparison with Other Methods
5 Conclusion
References
Multi-granularity Deep Vulnerability Detection Using Graph Neural Networks
1 Introduction
2 Motivation
2.1 Multi-granularity Program Representation Covering Multiple Vulnerability Types
2.2 Overcoming the Constraint Posed by Hyperparameter Layers
3 The MulGraVD Framework
3.1 Multi-granularity Program Representation
3.2 Multi-granularity Information Propagation
3.3 Representation Learning Model
4 Evaluation
4.1 Research Questions
4.2 Datasets
4.3 Experimental Design of Research Questions
5 Related Work
6 Conclusion
References
Rumor Detection with Supervised Graph Contrastive Regularization
1 Introduction
2 Preliminary Knowledge
3 Method
3.1 Graph Representation
3.2 Rumor Classification
3.3 Supervised Contrastive Learning with Regulation
3.4 Joint Training with Gradient Projection
4 Experiments
4.1 Datasets
4.2 Baselines
4.3 Experimental Results and Analysis
4.4 Ablation Study
5 Conclusion
References
A Meta Learning-Based Training Algorithm for Robust Dialogue Generation
1 Introduction
2 Related Work
3 Method
3.1 FDAML
3.2 Fisher Information
3.3 DAML
4 Experiments
4.1 Datasets
4.2 Metrics
4.3 Implementation Details
5 Results
5.1 Main Results
5.2 The Timing of Using Fisher
6 Conclusion and Future Work
References
Effects of Brightness and Class-Unbalanced Dataset on CNN Model Selection and Image Classification Considering Autonomous Driving
1 Introduction
2 Motivation
3 Related Work
4 Methodology
5 Experimental Results
5.1 Experimental Setup
5.2 Results and Analysis
6 Conclusion and Future Work
References
HANCaps: A Two-Channel Deep Learning Framework for Fake News Detection in Thai
1 Introduction
2 Related Works
3 Methodology
3.1 Proposed HANCaps Model
3.2 Loss Function
4 Experimental Setup
4.1 LimeSoda Dataset
4.2 Experimental Settings
5 Results and Discussion
6 Conclusion and Future Work
References
Pre-trained Financial Model for Price Movement Forecasting
1 Introduction
2 Related Work
3 Task Description and Data Features
4 Model Design
4.1 History Encoder
4.2 Future Decoder
5 Model Pre-training
6 Fine-Tuning on Downstream Tasks
6.1 Lagged Regression (LR)
6.2 Directional Classification (DC)
6.3 Top-K Selection (TKS) of Stocks
7 Experiment
7.1 Our Methods and Baselines
7.2 Results and Analysis
8 Conclusion
References
Impulsion of Movie's Content-Based Factors in Multi-modal Movie Recommendation System
1 Introduction
2 Related Works
3 Problem Statement
4 Dataset
4.1 Multi-modal MovieLens 100K (ML-100K)
4.2 MFVCD-7K Dataset
5 Methodology
5.1 Embedding Generation
6 Results and Discussion
7 Error Analysis
8 Conclusion
References
Improving Transferability of Adversarial Attack on Face Recognition with Feature Attention
1 Introduction
2 Related Works
2.1 Adversarial Attack
2.2 Adversarial Attack on FR
3 Methodology
3.1 Attacks Against Deep Face Model
3.2 Feature Space Attack with Feature Attention Loss
3.3 Ensemble Feature Space Attack with Feature Attention Loss
4 Experiments
4.1 Experiment Setup
4.2 Experimental Results
4.3 Experiments for Feature Space Attack
4.4 Ablation Study
5 Conclusion
References
Dendritic Neural Regression Model Trained by Chicken Swarm Optimization Algorithm for Bank Customer Churn Prediction
1 Introduction
2 Proposed Model
3 Learning Algorithms
4 Application to Bank Customer Churn Prediction
5 Conclusion
References
BERT-LBIA: A BERT-Based Late Bidirectional Interaction Attention Model for Legal Case Retrieval
1 Introduction
2 Model
2.1 Task Definition
2.2 Model Architecture
2.3 Train and Test
3 Experiments
3.1 Datasets and Evaluation Metrics
3.2 Baseline Models
3.3 Experiment Setting and Implementation
3.4 Benchmark Experiment
3.5 Ablation Experiment
4 Related Work
4.1 Long Document Retrieval
4.2 Neural Information Retrieval
4.3 Legal Case Retrieval
5 Conclusions
References
Learning Discriminative Semantic and Multi-view Context for Domain Adaptive Few-Shot Relation Extraction
1 Introduction
2 Related Work
3 Task Definition
4 The Proposed Method
4.1 Discriminative Semantic Learning Module
4.2 Multi-view Context Representation Module
4.3 Training Objective
5 Experiments
5.1 Data Set and Experiment Details
5.2 Baselines
5.3 Overall Results
5.4 Ablation Study
6 Conclusion
References
ML2FNet: A Simple but Effective Multi-level Feature Fusion Network for Document-Level Relation Extraction
1 Introduction
2 Related Work
3 Method
3.1 Problem Definition
3.2 Encoder
3.3 Enhanced U-Shape Module
3.4 Relation Classification Module
4 Experiments
4.1 Datasets
4.2 Implementation Details
4.3 Main Result
4.4 Results on Biomedical Datasets
4.5 Ablation Study and Analysis
4.6 Case Study
5 Conclusion
References
Implicit Clothed Human Reconstruction Based on Self-attention and SDF
1 Introduction
2 Related Work
2.1 Single-View Clothed Human Reconstruction
2.2 Self-attention for Reconstruction
3 Methodology
3.1 Overview
3.2 Implicit Function
3.3 Self-attention Block
3.4 Loss Function
4 Experiments
4.1 Datasets and Evaluation Metrics
4.2 Implementation Details
4.3 Ablation Study
4.4 Qualitative Evaluation
4.5 Quantitative Evaluation
5 Conclusion
References
Privacy-Preserving Federated Compressed Learning Against Data Reconstruction Attacks Based on Secure Data
1 Introduction
2 Related Work
2.1 Data Reconstruction Attacks
2.2 Defenses Against Data Reconstruction Attacks
2.3 Compressed Sensing
2.4 Compressed Learning
3 Methodology
3.1 Training with Proxy Image
3.2 Training with Encrypted Data in the Spatial Domain
3.3 Training with Encrypted Data in the Frequency Domain
4 Experiment
4.1 Experimental Settings
4.2 Defense Effect Against Data Reconstruction Attacks
4.3 Secure Performance Evaluation
4.4 Model Performance
5 Conclusion
References
An Attack Entity Deducing Model for Attack Forensics
1 Introduction
2 Related Work
3 Proposed Method
3.1 Definitions
3.2 Sequence Extraction
3.3 Sequence Sampling Based on Auxiliary Strategy
3.4 Sequential Representation Based on Dynamic Word Embedding
3.5 Model Training and Application
4 Experiments
4.1 Datasets
4.2 Environment
4.3 Evaluation Method
4.4 Efficiency Experiment of Sequence Sampling
4.5 Comparison Experiment of Sequence Sampling
4.6 Ablation Experiment of Word Embedding
4.7 Model Evaluation Experiment
5 Conclusion
References
Semi-supervised Classification on Data Streams with Recurring Concept Drift Based on Conformal Prediction
1 Introduction
2 Related Work
3 Proposed Algorithm
3.1 CPSSDS-R
3.2 Inductive Conformal Prediction
3.3 Concept Drift Detection
3.4 Information Sample Selection Strategy
4 Experiments
4.1 Datasets
4.2 Experimental Setup
4.3 Algorithm Accumulative Accuracy and Result Analysis
5 Conclusions
References
Zero-Shot Relation Triplet Extraction via Retrieval-Augmented Synthetic Data Generation
1 Introduction
2 Related Work
3 Methodology
4 Experiment
5 Conclusion
References
Parallelizable Simple Recurrent Units with Hierarchical Memory
1 Introduction
2 PSRU-HM: Parallelizable Simple Recurrent Unit with Hierarchical Memory
2.1 Simple Recurrent Unit with Hierarchical Memory
2.2 CUDA Optimized Network Parallelization
3 Experimental Results and Discussion
3.1 Performance in Text Classification
3.2 Performance in Language Modeling
3.3 Performance in Question Answering
4 Conclusion
References
Enhancing Legal Judgment Prediction with Attentional Networks Utilizing Legal Event Types
1 Introduction
2 Related Work
3 Method
3.1 Legal Text Encoder
3.2 Legal Event Types Attention
3.3 Legal Representation Fusion
3.4 The Output Layer
3.5 Training
4 Experiments
4.1 Datasets
4.2 Baselines
4.3 Experiment Settings and Evaluation Metrics
4.4 Experimental Results
4.5 Ablation Test
5 Case Study
6 Conclusion
References
MOOCs Dropout Prediction via Classmates Augmented Time-Flow Hybrid Network
1 Introduction
2 Proposed Method
2.1 Problem Description
2.2 Time-Flow Hybrid Network
2.3 User Activity Matrix Generation
2.4 User Contextual Features Generation
2.5 Prediction
3 Experiments
3.1 Dataset
3.2 Implementation and Evaluations Metrics
3.3 Parameter Analysis
3.4 Comparison of Correlation Calculations
3.5 Comparison with Other Dropout Prediction Methods
4 Conclusions
References
Multiclass Classification and Defect Detection of Steel Tube Using Modified YOLO
1 Introduction
1.1 Need for Defect Classification and Detection
1.2 Why YOLO for Object Detection
1.3 Contribution of the Paper
2 Literature Survey
3 Methodology
3.1 Classification Techniques
3.2 Background of YOLOv7
3.3 Modified-YOLOv7
4 Dataset Description, Evaluation Indicators and Metrics
5 Results
5.1 Classification
5.2 Detection
6 Conclusion
References
GACE: Learning Graph-Based Cross-Page Ads Embedding for Click-Through Rate Prediction
1 Introduction
2 Related Works
2.1 General CTR Recommendation Systems
2.2 Item Embedding Generation
3 Preliminaries
3.1 Item Knowledge Base
3.2 Problem Formulation
4 Methodology
4.1 Overview
4.2 Graph Creation
4.3 Pre-training Embedding Learning
5 Experiment
5.1 Dataset
5.2 Experimental Settings
5.3 Result from Model Comparison on Public AliEC Dataset
5.4 Result from Model Comparison on Alipay Offline Experimental Dataset
5.5 Result from Online A/B Testing
6 Conclusion
References
TEZARNet: TEmporal Zero-Shot Activity Recognition Network
1 Introduction
2 Related Work
3 Proposed TEZARNet
4 Experimental Study
4.1 Datasets
4.2 Evaluation Metric
4.3 Implementation
4.4 Comparative Study
4.5 Ablation Study
4.6 Reproducibility of TEZARNet
5 Conclusions and Future Work
References
Tigrinya OCR: Applying CRNN for Text Recognition
1 Introduction
1.1 OCR
1.2 The Tigrinya Language
2 Related Works
3 Methodology
3.1 Data
3.2 Proposed Architecture
4 Experiments
4.1 Evaluation Metrics
4.2 Hyperparameter Tuning
5 Results
5.1 Comparison of Error Rate
5.2 Comparison of Model Size
6 Conclusion
References
Generating Pseudo-labels for Car Damage Segmentation Using Deep Spectral Method
1 Introduction
2 Related Work
2.1 Car AI Damage Assessment
2.2 Pseudo-Labeling
2.3 Segmentation Tasks
3 Methodology
3.1 Deep Spectral Methods
3.2 Flip-Bit Algorithm
3.3 Merge Eigenvectors
3.4 Utilizing Pseudo-Labels for Training
4 Experimental Framework
4.1 CarDD Dataset
4.2 Experiment Setting
5 Results and Discussion
5.1 Evaluate Generated Pseudo-Labels from Deep Spectral
5.2 Evaluate by Training the Model
6 Conclusion
References
Two-Stage Graph Convolutional Networks for Relation Extraction
1 Introduction
2 Related Work
3 Our Proposed Approach
3.1 Task Definition
3.2 BERT Encoder
3.3 Two-Stage GCN
3.4 Relation Classification
3.5 Loss Function
4 Experiments
4.1 Datasets
4.2 Evaluation Metrics
4.3 Implementation Details
4.4 Main Results
4.5 Comparison with Other SOTA
4.6 Ablation Study
4.7 Extraction Speed
5 Conclusion
References
Multi-vehicle Platoon Overtaking Using NoisyNet Multi-agent Deep Q-Learning Network
1 Introduction
1.1 Multi-vehicle Collaboration
1.2 Reinforcement Learning for Overtaking
2 Problem Formulation
2.1 Preliminary of Reinforcement Learning
2.2 Multi-agent Reinforcement Learning (MARL) and NoisyNets
3 NoisyNet-MADQN for Platoon Overtaking
3.1 Reward Function Design
3.2 Noisy Network Multi-Agent Deep Q-Learning Network
4 Experiments and Discussion
4.1 Experimental Settings
5 Conclusions
References
Correction to: Multiclass Classification and Defect Detection of Steel Tube Using Modified YOLO
Correction to: Chapter 32 in: B. Luo et al. (Eds.): Neural Information Processing, CCIS 1969, https://doi.org/10.1007/978-981-99-8184-7_32
Author Index

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li (Eds.)

Communications in Computer and Information Science

1969

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part XV


Communications in Computer and Information Science

Editorial Board Members
Joaquim Filipe, Polytechnic Institute of Setúbal, Setúbal, Portugal
Ashish Ghosh, Indian Statistical Institute, Kolkata, India
Raquel Oliveira Prates, Federal University of Minas Gerais (UFMG), Belo Horizonte, Brazil
Lizhu Zhou, Tsinghua University, Beijing, China


Rationale
The CCIS series is devoted to the publication of proceedings of computer science conferences. Its aim is to efficiently disseminate original research results in informatics in printed and electronic form. While the focus is on publication of peer-reviewed full papers presenting mature work, inclusion of reviewed short papers reporting on work in progress is welcome, too. Besides globally relevant meetings with internationally representative program committees guaranteeing a strict peer-reviewing and paper selection process, conferences run by societies or of high regional or national relevance are also considered for publication.

Topics
The topical scope of CCIS spans the entire spectrum of informatics ranging from foundational topics in the theory of computing to information and communications science and technology and a broad variety of interdisciplinary application fields.

Information for Volume Editors and Authors
Publication in CCIS is free of charge. No royalties are paid, however, we offer registered conference participants temporary free access to the online version of the conference proceedings on SpringerLink (http://link.springer.com) by means of an http referrer from the conference website and/or a number of complimentary printed copies, as specified in the official acceptance email of the event. CCIS proceedings can be published in time for distribution at conferences or as postproceedings, and delivered in the form of printed books and/or electronically as USBs and/or e-content licenses for accessing proceedings at SpringerLink. Furthermore, CCIS proceedings are included in the CCIS electronic book series hosted in the SpringerLink digital library at http://link.springer.com/bookseries/7899. Conferences publishing in CCIS are allowed to use Online Conference Service (OCS) for managing the whole proceedings lifecycle (from submission and reviewing to preparing for publication) free of charge.

Publication process
The language of publication is exclusively English. Authors publishing in CCIS have to sign the Springer CCIS copyright transfer form, however, they are free to use their material published in CCIS for substantially changed, more elaborate subsequent publications elsewhere. For the preparation of the camera-ready papers/files, authors have to strictly adhere to the Springer CCIS Authors' Instructions and are strongly encouraged to use the CCIS LaTeX style files or templates.

Abstracting/Indexing
CCIS is abstracted/indexed in DBLP, Google Scholar, EI-Compendex, Mathematical Reviews, SCImago, Scopus. CCIS volumes are also submitted for the inclusion in ISI Proceedings.

How to start
To start the evaluation of your proposal for inclusion in the CCIS series, please send an e-mail to [email protected].

Biao Luo · Long Cheng · Zheng-Guang Wu · Hongyi Li · Chaojie Li Editors

Neural Information Processing 30th International Conference, ICONIP 2023 Changsha, China, November 20–23, 2023 Proceedings, Part XV

Editors Biao Luo School of Automation Central South University Changsha, China

Long Cheng Institute of Automation Chinese Academy of Sciences Beijing, China

Zheng-Guang Wu Institute of Cyber-Systems and Control Zhejiang University Hangzhou, China

Hongyi Li School of Automation Guangdong University of Technology Guangzhou, China

Chaojie Li School of Electrical Engineering and Telecommunications UNSW Sydney Sydney, NSW, Australia

ISSN 1865-0929   ISSN 1865-0937 (electronic)
Communications in Computer and Information Science
ISBN 978-981-99-8183-0   ISBN 978-981-99-8184-7 (eBook)
https://doi.org/10.1007/978-981-99-8184-7

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024, corrected publication 2024

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.
The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721, Singapore

Paper in this product is recyclable.

Preface

Welcome to the 30th International Conference on Neural Information Processing (ICONIP 2023) of the Asia-Pacific Neural Network Society (APNNS), held in Changsha, China, during November 20–23, 2023.

The mission of the Asia-Pacific Neural Network Society is to promote active interactions among researchers, scientists, and industry professionals who are working in neural networks and related fields in the Asia-Pacific region. APNNS has Governing Board Members from 13 countries/regions – Australia, China, Hong Kong, India, Japan, Malaysia, New Zealand, Singapore, South Korea, Qatar, Taiwan, Thailand, and Turkey. The society's flagship annual conference is the International Conference of Neural Information Processing (ICONIP).

The ICONIP conference aims to provide a leading international forum for researchers, scientists, and industry professionals who are working in neuroscience, neural networks, deep learning, and related fields to share their new ideas, progress, and achievements.

ICONIP 2023 received 1274 papers, of which 394 papers were accepted for publication in Communications in Computer and Information Science (CCIS), representing an acceptance rate of 30.93% and reflecting the increasingly high quality of research in neural networks and related areas. The conference focused on four main areas, i.e., "Theory and Algorithms", "Cognitive Neurosciences", "Human-Centered Computing", and "Applications". All the submissions were rigorously reviewed by the conference Program Committee (PC), comprising 258 PC members, who ensured that every paper had at least two high-quality single-blind reviews. In fact, 5270 reviews were provided by 2145 reviewers. On average, each paper received 4.14 reviews.

We would like to take this opportunity to thank all the authors for submitting their papers to our conference, and our great appreciation goes to the Program Committee members and the reviewers who devoted their time and effort to our rigorous peer-review process; their insightful reviews and timely feedback ensured the high quality of the papers accepted for publication. We hope you enjoyed the research program at the conference.

October 2023

Biao Luo Long Cheng Zheng-Guang Wu Hongyi Li Chaojie Li

Organization

Honorary Chair
Weihua Gui, Central South University, China

Advisory Chairs
Jonathan Chan, King Mongkut's University of Technology Thonburi, Thailand
Zeng-Guang Hou, Chinese Academy of Sciences, China
Nikola Kasabov, Auckland University of Technology, New Zealand
Derong Liu, Southern University of Science and Technology, China
Seiichi Ozawa, Kobe University, Japan
Kevin Wong, Murdoch University, Australia

General Chairs
Tingwen Huang, Texas A&M University at Qatar, Qatar
Chunhua Yang, Central South University, China

Program Chairs
Biao Luo, Central South University, China
Long Cheng, Chinese Academy of Sciences, China
Zheng-Guang Wu, Zhejiang University, China
Hongyi Li, Guangdong University of Technology, China
Chaojie Li, University of New South Wales, Australia

Technical Chairs
Xing He, Southwest University, China
Keke Huang, Central South University, China
Huaqing Li, Southwest University, China
Qi Zhou, Guangdong University of Technology, China

Local Arrangement Chairs
Wenfeng Hu, Central South University, China
Bei Sun, Central South University, China

Finance Chairs
Fanbiao Li, Central South University, China
Hayaru Shouno, University of Electro-Communications, Japan
Xiaojun Zhou, Central South University, China

Special Session Chairs
Hongjing Liang, University of Electronic Science and Technology, China
Paul S. Pang, Federation University, Australia
Qiankun Song, Chongqing Jiaotong University, China
Lin Xiao, Hunan Normal University, China

Tutorial Chairs
Min Liu, Hunan University, China
M. Tanveer, Indian Institute of Technology Indore, India
Guanghui Wen, Southeast University, China

Publicity Chairs
Sabri Arik, Istanbul University-Cerrahpaşa, Turkey
Sung-Bae Cho, Yonsei University, South Korea
Maryam Doborjeh, Auckland University of Technology, New Zealand
El-Sayed M. El-Alfy, King Fahd University of Petroleum and Minerals, Saudi Arabia
Ashish Ghosh, Indian Statistical Institute, India
Chuandong Li, Southwest University, China
Weng Kin Lai, Tunku Abdul Rahman University of Management & Technology, Malaysia
Chu Kiong Loo, University of Malaya, Malaysia
Qinmin Yang, Zhejiang University, China
Zhigang Zeng, Huazhong University of Science and Technology, China

Publication Chairs
Zhiwen Chen, Central South University, China
Andrew Chi-Sing Leung, City University of Hong Kong, China
Xin Wang, Southwest University, China
Xiaofeng Yuan, Central South University, China

Secretaries
Yun Feng, Hunan University, China
Bingchuan Wang, Central South University, China

Webmasters
Tianmeng Hu, Central South University, China
Xianzhe Liu, Xiangtan University, China

Program Committee

Rohit Agarwal Hasin Ahmed Harith Al-Sahaf Brad Alexander Mashaan Alshammari Sabri Arik Ravneet Singh Arora Zeyar Aung Monowar Bhuyan Jingguo Bi Xu Bin Marcin Blachnik Paul Black Anoop C. S. Ning Cai Siripinyo Chantamunee Hangjun Che

UiT The Arctic University of Norway, Norway Gauhati University, India Victoria University of Wellington, New Zealand University of Adelaide, Australia Independent Researcher, Saudi Arabia Istanbul University, Turkey Block Inc., USA Khalifa University of Science and Technology, UAE Umeå University, Sweden Beijing University of Posts and Telecommunications, China Northwestern Polytechnical University, China Silesian University of Technology, Poland Federation University, Australia Govt. Engineering College, India Beijing University of Posts and Telecommunications, China Walailak University, Thailand City University of Hong Kong, China


Wei-Wei Che Huabin Chen Jinpeng Chen Ke-Jia Chen Lv Chen Qiuyuan Chen Wei-Neng Chen Yufei Chen Long Cheng Yongli Cheng Sung-Bae Cho Ruikai Cui Jianhua Dai Tao Dai Yuxin Ding Bo Dong Shanling Dong Sidong Feng Yuming Feng Yun Feng Junjie Fu Yanggeng Fu Ninnart Fuengfusin Thippa Reddy Gadekallu Ruobin Gao Tom Gedeon Kam Meng Goh Zbigniew Gomolka Shengrong Gong Xiaodong Gu Zhihao Gu Changlu Guo Weixin Han Xing He Akira Hirose Yin Hongwei Md Zakir Hossain Zengguang Hou

Qingdao University, China Nanchang University, China Beijing University of Posts & Telecommunications, China Nanjing University of Posts and Telecommunications, China Shandong Normal University, China Tencent Technology, China South China University of Technology, China Tongji University, China Institute of Automation, China Fuzhou University, China Yonsei University, South Korea Australian National University, Australia Hunan Normal University, China Tsinghua University, China Harbin Institute of Technology, China Xi’an Jiaotong University, China Zhejiang University, China Monash University, Australia Chongqing Three Gorges University, China Hunan University, China Southeast University, China Fuzhou University, China Kyushu Institute of Technology, Japan VIT University, India Nanyang Technological University, Singapore Curtin University, Australia Tunku Abdul Rahman University of Management and Technology, Malaysia University of Rzeszow, Poland Changshu Institute of Technology, China Fudan University, China Shanghai Jiao Tong University, China Budapest University of Technology and Economics, Hungary Northwestern Polytechnical University, China Southwest University, China University of Tokyo, Japan Huzhou Normal University, China Curtin University, Australia Chinese Academy of Sciences, China


Lu Hu Zeke Zexi Hu He Huang Junjian Huang Kaizhu Huang David Iclanzan Radu Tudor Ionescu Asim Iqbal Syed Islam Kazunori Iwata Junkai Ji Yi Ji Canghong Jin Xiaoyang Kang Mutsumi Kimura Masahiro Kohjima Damian Kordos Marek Kraft Lov Kumar Weng Kin Lai Xinyi Le Bin Li Hongfei Li Houcheng Li Huaqing Li Jianfeng Li Jun Li Kan Li Peifeng Li Wenye Li Xiangyu Li Yantao Li Yaoman Li Yinlin Li Yuan Li Yun Li Zhidong Li Zhixin Li Zhongyi Li

Jiangsu University, China University of Sydney, Australia Soochow University, China Chongqing University of Education, China Duke Kunshan University, China Sapientia University, Romania University of Bucharest, Romania Cornell University, USA Edith Cowan University, Australia Hiroshima City University, Japan Shenzhen University, China Soochow University, China Zhejiang University, China Fudan University, China Ryukoku University, Japan NTT, Japan Rzeszow University of Technology, Poland Pozna´n University of Technology, Poland NIT Kurukshetra, India Tunku Abdul Rahman University of Management & Technology, Malaysia Shanghai Jiao Tong University, China University of Science and Technology of China, China Xinjiang University, China Chinese Academy of Sciences, China Southwest University, China Southwest University, China Nanjing Normal University, China Beijing Institute of Technology, China Soochow University, China Chinese University of Hong Kong, China Beijing Jiaotong University, China Chongqing University, China Chinese University of Hong Kong, China Chinese Academy of Sciences, China Academy of Military Science, China Nanjing University of Posts and Telecommunications, China University of Technology Sydney, Australia Guangxi Normal University, China Beihang University, China


Ziqiang Li Xianghong Lin Yang Lin Huawen Liu Jian-Wei Liu Jun Liu Junxiu Liu Tommy Liu Wen Liu Yan Liu Yang Liu Yaozhong Liu Yong Liu Yubao Liu Yunlong Liu Zhe Liu Zhen Liu Zhi-Yong Liu Ma Lizhuang Chu-Kiong Loo Vasco Lopes Hongtao Lu Wenpeng Lu Biao Luo Ye Luo Jiancheng Lv Yuezu Lv Huifang Ma Jinwen Ma Jyoti Maggu Adnan Mahmood Mufti Mahmud Krishanu Maity Srimanta Mandal Wang Manning Piotr Milczarski Malek Mouhoub Nankun Mu Wenlong Ni Anupiya Nugaliyadde

University of Tokyo, Japan Northwest Normal University, China University of Sydney, Australia Zhejiang Normal University, China China University of Petroleum, China Chengdu University of Information Technology, China Guangxi Normal University, China Australian National University, Australia Chinese University of Hong Kong, China Taikang Insurance Group, China Guangdong University of Technology, China Australian National University, Australia Heilongjiang University, China Sun Yat-sen University, China Xiamen University, China Jiangsu University, China Chinese Academy of Sciences, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China University of Malaya, Malaysia Universidade da Beira Interior, Portugal Shanghai Jiao Tong University, China Qilu University of Technology, China Central South University, China Tongji University, China Sichuan University, China Beijing Institute of Technology, China Northwest Normal University, China Peking University, China Thapar Institute of Engineering and Technology Patiala, India Macquarie University, Australia University of Padova, Italy Indian Institute of Technology Patna, India DA-IICT, India Fudan University, China Lodz University of Technology, Poland University of Regina, Canada Chongqing University, China Jiangxi Normal University, China Murdoch University, Australia


Toshiaki Omori Babatunde Onasanya Manisha Padala Sarbani Palit Paul Pang Rasmita Panigrahi Kitsuchart Pasupa Dipanjyoti Paul Hu Peng Kebin Peng Dawid Połap Zhong Qian Sitian Qin Toshimichi Saito Fumiaki Saitoh Naoyuki Sato Chandni Saxena Jiaxing Shang Lin Shang Jie Shao Yin Sheng Liu Sheng-Lan Hayaru Shouno Gautam Srivastava Jianbo Su Jianhua Su Xiangdong Su Daiki Suehiro Basem Suleiman Ning Sun Shiliang Sun Chunyu Tan Gouhei Tanaka Maolin Tang Shu Tian Shikui Tu Nancy Victor Petra Vidnerová


Kobe University, Japan University of Ibadan, Nigeria Indian Institute of Science, India Indian Statistical Institute, India Federation University, Australia Giet University, India King Mongkut’s Institute of Technology Ladkrabang, Thailand Ohio State University, USA Jiujiang University, China University of Texas at San Antonio, USA Silesian University of Technology, Poland Soochow University, China Harbin Institute of Technology at Weihai, China Hosei University, Japan Chiba Institute of Technology, Japan Future University Hakodate, Japan Chinese University of Hong Kong, China Chongqing University, China Nanjing University, China University of Science and Technology of China, China Huazhong University of Science and Technology, China Dalian University of Technology, China University of Electro-Communications, Japan Brandon University, Canada Shanghai Jiao Tong University, China Institute of Automation, China Inner Mongolia University, China Kyushu University, Japan University of New South Wales, Australia Shandong Normal University, China East China Normal University, China Anhui University, China University of Tokyo, Japan Queensland University of Technology, Australia University of Science and Technology Beijing, China Shanghai Jiao Tong University, China Vellore Institute of Technology, India Institute of Computer Science, Czech Republic


Shanchuan Wan Tao Wan Ying Wan Bangjun Wang Hao Wang Huamin Wang Hui Wang Huiwei Wang Jianzong Wang Lei Wang Lin Wang Shi Lin Wang Wei Wang Weiqun Wang Xiaoyu Wang Xin Wang Xin Wang Yan Wang Yan Wang Yonghua Wang Yongyu Wang Zhenhua Wang Zi-Peng Wang Hongxi Wei Guanghui Wen Guoguang Wen Ka-Chun Wong Anna Wróblewska Fengge Wu Ji Wu Wei Wu Yue Wu Likun Xia Lin Xiao Qiang Xiao Hao Xiong Dongpo Xu Hua Xu Jianhua Xu

University of Tokyo, Japan Beihang University, China Southeast University, China Soochow University, China Shanghai University, China Southwest University, China Nanchang Institute of Technology, China Southwest University, China Ping An Technology, China National University of Defense Technology, China University of Jinan, China Shanghai Jiao Tong University, China Shenzhen MSU-BIT University, China Chinese Academy of Sciences, China Tokyo Institute of Technology, Japan Southwest University, China Southwest University, China Chinese Academy of Sciences, China Sichuan University, China Guangdong University of Technology, China JD Logistics, China Northwest A&F University, China Beijing University of Technology, China Inner Mongolia University, China Southeast University, China Beijing Jiaotong University, China City University of Hong Kong, China Warsaw University of Technology, Poland Institute of Software, Chinese Academy of Sciences, China Tsinghua University, China Inner Mongolia University, China Shanghai Jiao Tong University, China Capital Normal University, China Hunan Normal University, China Huazhong University of Science and Technology, China Macquarie University, Australia Northeast Normal University, China Tsinghua University, China Nanjing Normal University, China


Xinyue Xu Yong Xu Ngo Xuan Bach Hao Xue Yang Xujun Haitian Yang Jie Yang Minghao Yang Peipei Yang Zhiyuan Yang Wangshu Yao Ming Yin Qiang Yu Wenxin Yu Yun-Hao Yuan Xiaodong Yue Paweł Zawistowski Hui Zeng Wang Zengyunwang Daren Zha Zhi-Hui Zhan Baojie Zhang Canlong Zhang Guixuan Zhang Jianming Zhang Li Zhang Wei Zhang Wenbing Zhang Xiang Zhang Xiaofang Zhang Xiaowang Zhang Xinglong Zhang Dongdong Zhao Xiang Zhao Xu Zhao


Hong Kong University of Science and Technology, China Beijing Institute of Technology, China Posts and Telecommunications Institute of Technology, Vietnam University of New South Wales, Australia Chongqing Jiaotong University, China Chinese Academy of Sciences, China Shanghai Jiao Tong University, China Chinese Academy of Sciences, China Chinese Academy of Science, China City University of Hong Kong, China Soochow University, China Guangdong University of Technology, China Tianjin University, China Southwest University of Science and Technology, China Yangzhou University, China Shanghai University, China Warsaw University of Technology, Poland Southwest University of Science and Technology, China Hunan First Normal University, China Institute of Information Engineering, China South China University of Technology, China Chongqing Three Gorges University, China Guangxi Normal University, China Chinese Academy of Science, China Changsha University of Science and Technology, China Soochow University, China Southwest University, China Yangzhou University, China National University of Defense Technology, China Soochow University, China Tianjin University, China National University of Defense Technology, China Wuhan University of Technology, China National University of Defense Technology, China Shanghai Jiao Tong University, China


Liping Zheng Yan Zheng Baojiang Zhong Guoqiang Zhong Jialing Zhou Wenan Zhou Xiao-Hu Zhou Xinyu Zhou Quanxin Zhu Yuanheng Zhu Xiaotian Zhuang Dongsheng Zou

Hefei University of Technology, China Kyushu University, Japan Soochow University, China Ocean University of China, China Nanjing University of Science and Technology, China PCN&CAD Center, China Institute of Automation, China Jiangxi Normal University, China Nanjing Normal University, China Chinese Academy of Sciences, China JD Logistics, China Chongqing University, China

Contents – Part XV

Applications

Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation . . . 3
Chaolong Yang, Yuyao Yan, Weiguang Zhao, Jianan Ye, Xi Yang, Amir Hussain, Bin Dong, and Kaizhu Huang

RF-Based Drone Detection with Deep Neural Network: Review and Case Study . . . 16
Norah A. Almubairik and El-Sayed M. El-Alfy

Effective Skill Learning on Vascular Robotic Systems: Combining Offline and Online Reinforcement Learning . . . 28
Hao Li, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Mei-Jiang Gui, Tian-Yu Xiang, De-Xing Huang, and Zeng-Guang Hou

Exploring Efficient-Tuned Learning Audio Representation Method from BriVL . . . 41
Sen Fang, Yangjian Wu, Bowen Gao, Jingwen Cai, and Teik Toe Teoh

Can You Really Reason: A Novel Framework for Assessing Natural Language Reasoning Datasets and Models . . . 54
Shanshan Huang

End-to-End Urban Autonomous Navigation with Decision Hindsight . . . 67
Qi Deng, Guangqing Liu, Ruyang Li, Qifu Hu, Yaqian Zhao, and Rengang Li

Identifying Self-admitted Technical Debt with Context-Based Ladder Network . . . 84
Aiyue Gong, Fumiyo Fukumoto, Panitan Muangkammuen, Jiyi Li, and Dongjin Yu

NDGR: A Noise Divide and Guided Re-labeling Framework for Distantly Supervised Relation Extraction . . . 98
Zheyu Shi, Ying Mao, Lishun Wang, Hangcheng Li, Yong Zhong, and Xiaolin Qin

Customized Anchors Can Better Fit the Target in Siamese Tracking . . . 112
Shuqi Pan, Canlong Zhang, Zhixin Li, Liaojie Hu, and Yanfang Deng

Can We Transfer Noise Patterns? A Multi-environment Spectrum Analysis Model Using Generated Cases . . . 125
Haiwen Du, Zheng Ju, Yu An, Honghui Du, Dongjie Zhu, Zhaoshuo Tian, Aonghus Lawlor, and Ruihai Dong

Progressive Supervision for Tampering Localization in Document Images . . . 140
Huiru Shao, Kaizhu Huang, Wei Wang, Xiaowei Huang, and Qiufeng Wang

Multi-granularity Deep Vulnerability Detection Using Graph Neural Networks . . . 152
Tengxiao Yang, Song Lian, Qiong Jia, Chengyu Hu, and Shanqing Guo

Rumor Detection with Supervised Graph Contrastive Regularization . . . 165
Shaohua Li, Weimin Li, Alex Munyole Luvembe, and Weiqin Tong

A Meta Learning-Based Training Algorithm for Robust Dialogue Generation . . . 177
Ziyi Hu and Yiping Song

Effects of Brightness and Class-Unbalanced Dataset on CNN Model Selection and Image Classification Considering Autonomous Driving . . . 191
Zhumakhan Nazir, Vladislav Yarovenko, and Jurn-Gyu Park

HANCaps: A Two-Channel Deep Learning Framework for Fake News Detection in Thai . . . 204
Krishanu Maity, Shaubhik Bhattacharya, Salisa Phosit, Sawarod Kongsamlit, Sriparna Saha, and Kitsuchart Pasupa

Pre-trained Financial Model for Price Movement Forecasting . . . 216
Chenyou Fan, Tianqi Pang, and Aimin Huang

Impulsion of Movie's Content-Based Factors in Multi-modal Movie Recommendation System . . . 230
Prabir Mondal, Pulkit Kapoor, Siddharth Singh, Sriparna Saha, Naoyuki Onoe, and Brijraj Singh

Improving Transferability of Adversarial Attack on Face Recognition with Feature Attention . . . 243
Chongyi Lv and Zhichao Lian

Dendritic Neural Regression Model Trained by Chicken Swarm Optimization Algorithm for Bank Customer Churn Prediction . . . 254
Qi Wang, Haiyan Zhang, Junkai Ji, Cheng Tang, and Yajiao Tang

BERT-LBIA: A BERT-Based Late Bidirectional Interaction Attention Model for Legal Case Retrieval . . . 266
Binxia Yang, Junlin Zhu, Xudong Luo, and Xinrui Zhang

Learning Discriminative Semantic and Multi-view Context for Domain Adaptive Few-Shot Relation Extraction . . . 283
Minghui Zhai, Feifei Dai, Xiaoyan Gu, Haihui Fan, Dong Liu, and Bo Li

ML2FNet: A Simple but Effective Multi-level Feature Fusion Network for Document-Level Relation Extraction . . . 297
Zhi Zhang, Junan Yang, and Hui Liu

Implicit Clothed Human Reconstruction Based on Self-attention and SDF . . . 313
Li Yao, Ao Gao, and Yan Wan

Privacy-Preserving Federated Compressed Learning Against Data Reconstruction Attacks Based on Secure Data . . . 325
Di Xiao, Jinkun Li, and Min Li

An Attack Entity Deducing Model for Attack Forensics . . . 340
Tao Jiang, Junjiang He, Tao Li, Wenbo Fang, Wenshan Li, and Cong Tang

Semi-supervised Classification on Data Streams with Recurring Concept Drift Based on Conformal Prediction . . . 355
ShiLun Ma, Wei Kang, Yun Xue, and YiMin Wen

Zero-Shot Relation Triplet Extraction via Retrieval-Augmented Synthetic Data Generation . . . 367
Qing Zhang, Yuechen Yang, Hayilang Zhang, Zhengxin Gao, Hao Wang, Jianyong Duan, Li He, and Jie Liu

Parallelizable Simple Recurrent Units with Hierarchical Memory . . . 380
Yu Qiao, Hengyi Zhang, Pengfei Sun, Yuan Tian, Yong Guan, Zhenzhou Shao, and Zhiping Shi

Enhancing Legal Judgment Prediction with Attentional Networks Utilizing Legal Event Types . . . 393
Yaoyao Yu and Yihui Qiu

MOOCs Dropout Prediction via Classmates Augmented Time-Flow Hybrid Network . . . 405
Guanbao Liang, Zhaojie Qian, Shuang Wang, and Pengyi Hao

Multiclass Classification and Defect Detection of Steel Tube Using Modified YOLO . . . 417
Deepti Raj Gurrammagari, Prabadevi Boopathy, Thippa Reddy Gadekallu, Surbhi Bhatia Khan, and Mohammed Saraee

GACE: Learning Graph-Based Cross-Page Ads Embedding for Click-Through Rate Prediction . . . 429
Haowen Wang, Yuliang Du, Congyun Jin, Yujiao Li, Yingbo Wang, Tao Sun, Piqi Qin, and Cong Fan

TEZARNet: TEmporal Zero-Shot Activity Recognition Network . . . 444
Pathirage N. Deelaka, Devin Y. De Silva, Sandareka Wickramanayake, Dulani Meedeniya, and Sanka Rasnayaka

Tigrinya OCR: Applying CRNN for Text Recognition . . . 456
Aaron Afewerki Hailu, Abiel Tesfamichael Hayleslassie, Danait Weldu Gebresilasie, Robel Estifanos Haile, Tesfana Tekeste Ghebremedhin, and Yemane Keleta Tedla

Generating Pseudo-labels for Car Damage Segmentation Using Deep Spectral Method . . . 468
Nonthapaht Taspan, Bukorree Madthing, Panumate Chetprayoon, Thanatwit Angsarawanee, Kitsuchart Pasupa, and Theerat Sakdejayont

Two-Stage Graph Convolutional Networks for Relation Extraction . . . 483
Zhiqiang Wang, Yiping Yang, and Junjie Ma

Multi-vehicle Platoon Overtaking Using NoisyNet Multi-agent Deep Q-Learning Network . . . 495
Lv He, Dongbo Zhang, Tianmeng Hu, and Biao Luo

Correction to: Multiclass Classification and Defect Detection of Steel Tube Using Modified YOLO . . . C1
Deepti Raj Gurrammagari, Prabadevi Boopathy, Thippa Reddy Gadekallu, Surbhi Bhatia Khan, and Mohammed Saraee

Author Index . . . 511

Applications

Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation

Chaolong Yang¹, Yuyao Yan², Weiguang Zhao¹, Jianan Ye², Xi Yang², Amir Hussain³, Bin Dong⁴, and Kaizhu Huang¹(B)

¹ Duke Kunshan University, Suzhou 215000, China
  {chaolong.yang,weiguang.zhao,kaizhu.huang}@dukekunshan.edu.cn
² Xi'an Jiaotong-Liverpool University, Suzhou 215000, China
  {yuyao.yan,xi.yang01}@xjtlu.edu.cn, [email protected]
³ Edinburgh Napier University, Edinburgh EH11 4BN, UK
  [email protected]
⁴ Ricoh Software Research Center (Beijing) Co., Ltd., Beijing 100000, China
  [email protected]

Abstract. 3D point clouds are rich in geometric structure information, while 2D images contain important and continuous texture information. Combining 2D information to achieve better 3D semantic segmentation has become mainstream in 3D scene understanding. Despite this success, it still remains elusive how to fuse and process the cross-dimensional features from these two distinct spaces. Existing state-of-the-art methods usually exploit bidirectional projection to align the cross-dimensional features and realize both 2D and 3D semantic segmentation tasks. However, to enable bidirectional mapping, this framework often requires a symmetrical 2D-3D network structure, thus limiting the network's flexibility. Meanwhile, such a dual-task setting may distract the network easily and lead to overfitting in the 3D segmentation task. As limited by the network's inflexibility, fused features can only pass through a decoder network, which affects model performance due to insufficient depth. To alleviate these drawbacks, in this paper, we argue that despite its simplicity, projecting multi-view 2D deep semantic features unidirectionally into the 3D space, aligned with 3D deep semantic features, could lead to better feature fusion. On the one hand, the unidirectional projection enforces our model to focus more on the core task, i.e., 3D segmentation; on the other hand, relaxing the bidirectional projection to a unidirectional one enables deeper cross-domain semantic alignment and offers the flexibility to fuse richer and more complicated features from very different spaces. Among joint 2D-3D approaches, our proposed method achieves superior performance on the ScanNetv2 benchmark for 3D semantic segmentation.

Keywords: Point cloud · Semantic segmentation · Multi-view fusion

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
B. Luo et al. (Eds.): ICONIP 2023, CCIS 1969, pp. 3–15, 2024. https://doi.org/10.1007/978-981-99-8184-7_1

1 Introduction

Semantic understanding of scenes is essential in numerous fields, including robot navigation, automatic driving systems, and medical diagnosis. While early researchers focused on 2D images to achieve scene understanding, fixed-view 2D images lack spatial structure information and suffer from object occlusion, limiting their use in spatially location-sensitive downstream tasks. In contrast, 3D point clouds offer a complete spatial structure without object occlusion. However, traditional 2D neural network-based methodologies cannot be used directly to deal with 3D data. To address this issue, point-based [21,22,29] and voxel-based [4,7,33] neural networks have been explored for 3D point cloud recognition and understanding. Nonetheless, 3D point clouds have low resolution and lack rich texture information. Thus, a promising solution to jointly understand complex scenes is to combine 2D images with detailed texture information and 3D point clouds with rich knowledge of geometric structures.

2D-3D fusion schemes for 3D semantic segmentation tasks can be categorized as bidirectional and unidirectional projection. A comparison of the network frameworks of these two schemes is depicted in Fig. 1.

Fig. 1. Comparison of bidirectional & unidirectional projection

The pioneering BPNet [11] utilizes bidirectional projection to allow 2D and 3D features to flow between networks. Nevertheless, in order to mutually fuse information from 2D to 3D and from 3D to 2D, it usually has to exploit a symmetrical decoder network. This makes its framework less flexible and unable to take advantage of network depth, thus limiting its performance. Additionally, in complicated scenes, the 2D semantic component may distract from the core 3D task. To illustrate, we implement the idea of bidirectional projection on our proposed unidirectional projection framework. Namely, 3D features are also projected into 2D space, combined with 2D features, and then input to a complete 2D encoder-decoder network. On the large-scale complex indoor scene dataset ScanNetv2 [5], we compare in Fig. 2 the 3D semantic loss on the validation set for the unidirectional and bidirectional projection ideas during model training. Clearly, on the complicated scenes of ScanNetv2, the bidirectional projection scheme causes distraction in the 3D task, where the loss goes up as training continues. In comparison, the uni-projection implementation leads to more stable performance, with the focus mainly on the 3D task. Motivated by these findings, we argue that unidirectionally projecting multi-view 2D deep semantic features into the 3D space, aligned with 3D deep semantic features, can result in better feature fusion and more potential for downstream tasks like scene understanding.

Fig. 2. Validation loss for unidirectional & bidirectional projection on validation set of the benchmark ScanNetv2 data

Previous unidirectional projection methods [3,13] have been proposed in the literature to fuse 2D deep semantics and 3D shallow information (XYZ & RGB) for 3D semantic segmentation tasks (see the middle graph in Fig. 1). Direct connection of deep and shallow semantics from different data domains could, however, result in misalignment of the semantic space. To this end, we design a novel unidirectional projection framework, called the Deep Multi-view Fusion Network (DMF-Net), to more effectively fuse 2D & 3D features (see the right graph in Fig. 1). On the implementation front, we evaluate our model on the 3D semantic segmentation datasets ScanNetv2 [5] and NYUv2 [25]. DMF-Net not only achieves top performance among joint 2D-3D methods on the ScanNetv2 benchmark, but also achieves state-of-the-art performance on the NYUv2 dataset. Our contributions can be summarized as follows.

– We argue that the unidirectional projection mechanism is not only more focused on 3D semantic understanding tasks than bidirectional projection but also facilitates deeper feature fusion. To this end, we design a method for unidirectional cross-domain semantic feature fusion that extracts 2D & 3D deep features for alignment simultaneously.
– We propose a novel framework named Deep Multi-view Fusion Network (DMF-Net) for 3D scene semantic understanding. Among joint 2D-3D approaches, DMF-Net obtains top mIoU performance on the 3D Semantic Label Benchmark of ScanNetv2 [5], while it reaches the state of the art on the NYUv2 [25] dataset.
– We demonstrate the flexibility of DMF-Net, where all backbone modules in the network framework can be replaced. Specifically, 3D semantic segmentation performance becomes stronger with a more powerful backbone. Compared with the common U-Net34 [23], the advanced Swin-UNet [2] shows a relative 4.4% improvement in mIoU in our framework.


2 Related Work

2.1 2D Semantic Segmentation

Image semantic segmentation has been significantly improved by the development of deep-learning [12] models. In the field of 2D semantic segmentation, the Fully Convolutional Network (FCN) [18] is a landmark work, although it has some limitations. Several encoder-decoder-based [23,26,30] structures combine multi-level information to refine segmentation. Besides, attention-based [2] models capable of extracting long-range contextual information have been introduced into the image segmentation task. However, the lack of 3D spatial geometry information in 2D images hinders semantic comprehension of scenes.

2.2 3D Semantic Segmentation

To cope with the unstructured nature of point cloud data, one popular approach is to apply projection-based techniques [1,16,27]. However, multi-viewpoint projections are quite sensitive to the viewpoints chosen. Voxelized point clouds can be processed by 3D convolution in the same way as pixels in 2D neural networks. But high-resolution voxels result in high memory and computational costs, while lower resolutions cause loss of detail. Consequently, 3D sparse convolutional networks [4,7] are designed to overcome these computational inefficiencies. Direct processing of point clouds [21,22] to achieve semantic segmentation has become a popular research topic in recent years. However, sparse 3D point clouds lack continuous texture information, resulting in limited recognition performance of 3D scenes.

2.3 3D Semantic Segmentation Based on Joint 2D-3D Data

There has been some research in recent years on 2D and 3D data fusion, which can be broken down into unidirectional projection networks [3,6,13,15] and bidirectional projection networks [11]. The bidirectional projection network, typified by BPNet [11], focuses on both 2D and 3D semantic segmentation tasks. Due to the mutual flow of its 2D and 3D information, its network framework has to rely on a symmetrical decoding network. In addition to the inflexibility of its framework, the 2D task introduced by the bi-projection idea will distract the network from the 3D segmentation task. This is our motivation for choosing a framework based on the uni-projection idea. In terms of view selection, 3DMV [6] and MVPNet [13] adopted a scheme with a fixed number of views, which means the views may not cover the entire 3D scene. In contrast, VMFusion [15] solves narrow viewing angle and occlusion issues by creating virtual viewpoints, but this approach has high computational costs that increase with the number of views. With an excellent balance, our work employs a dynamic view scheme that selects views based on the greedy algorithm of MVPNet until the view covers more than 90% of the 3D scene while keeping the scene uncut.

3 Methodology

3.1 Overview

An overview of our DMF-Net pipeline is illustrated in Fig. 3. Each scene consists of a sequence of video frames and one point cloud. The input point cloud is called the original point cloud, while the point cloud formed by back-projecting all 2D feature maps into 3D space is called the back-projected point cloud with 2D features. DMF-Net consists of three U-shaped sub-networks, where the 2D feature extractor is a 2D U-Net [23] and the 3D feature extractor is a 3D MinkowskiUNet [4] (M.UNet). Moreover, the encoder-decoder for joint features is also a 3D M.UNet. Although we set a specific network backbone in our implementation, different network backbones can also be utilised in our DMF-Net. The view selection and back-projection modules will be elaborated in Sect. 3.2 and Sect. 3.3, respectively. As for the feature integration module, similar to MVPNet [13], it finds the k nearest-neighbour 2D features for each point of the original point cloud. Subsequently, these are directly concatenated with the deep 3D semantic features of the original point cloud and input into the 3D M.UNet for further learning to predict the semantic results of the entire scene.

Fig. 3. Overview of the proposed DMF-Net

3.2 Dynamic View Selection

Previous work fixed the number of views, which would however cause insufficient overlap between all the back-projected RGB-D frames and the scene point cloud. To make the back-projected images cover as much of the scene point cloud as possible, many methods, e.g. MVPNet [13], generally choose to cut the point cloud scene, which affects the recognition accuracy for objects at the cut boundaries. To this end, we propose a method for dynamic view selection, which sets a threshold


for overlaps to ensure that the scene coverage is greater than 90%, so that the number of views selected for each scene is different. The entire dynamic view selection algorithm is divided into two stages. The first stage is to construct the overlapping matrix. In the second stage, the views with the highest degree of overlap are dynamically selected in a loop according to the overlapping matrix. First, we define an overlapping matrix between the point cloud and video frames, as shown in Equation (1). This overlapping matrix indicates the relationship between each point and video frame. The first column lists the indices of the points, while the first row provides the indices of the frames. The entries in the matrix are either 0 or 1, representing non-overlapping and overlapping between points and frames respectively. Specifically, if a point of the original point cloud can find a back-projected point of the video frame within a certain range (1 cm), it is determined that the point is an overlapping point. Considering that each point cloud contains hundreds of thousands of points, we randomly sample the original point cloud to reduce the computation.

$$
\begin{array}{c|cccc}
 & Frame\,1 & Frame\,2 & \cdots & Frame\,V \\
\hline
Point\,1 & 1 & 0 & \cdots & 1 \\
Point\,2 & 0 & 1 & \cdots & 1 \\
Point\,3 & 1 & 0 & \cdots & 0 \\
\vdots & \vdots & \vdots & \cdots & \vdots \\
Point\,N & 1 & 0 & \cdots & 1
\end{array}
\qquad (1)
$$

Second, we introduce the concept of scene overlap rate, which is the ratio between the number of overlap points corresponding to all selected video frames and the number of points in the down-sampled original point cloud. For each scene, the scene overlap rate is dynamically calculated after each video frame is selected. The overlapping matrix is updated once a view is selected. If the scene overlap rate exceeds 90%, the selection of video frames is stopped, which ensures excellent coverage of the scene while keeping the number of views moderate. A minimal sketch of this procedure is given below.
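The following sketch illustrates the two-stage procedure described above; it is not the authors' released code, and the helper names, the use of a KD-tree for the 1 cm matching, and the array layouts are assumptions made for illustration.

```python
import numpy as np
from scipy.spatial import cKDTree

def build_overlap_matrix(points, frame_points, radius=0.01):
    """Stage 1: overlap[i, f] = 1 if sampled point i lies within `radius`
    (1 cm) of any back-projected point of frame f, else 0 (Eq. (1))."""
    overlap = np.zeros((len(points), len(frame_points)), dtype=np.uint8)
    for f, pts in enumerate(frame_points):
        dist, _ = cKDTree(pts).query(points, k=1)
        overlap[:, f] = dist < radius
    return overlap

def select_views(overlap, coverage_thresh=0.9):
    """Stage 2: greedily pick the frame that covers the most still-uncovered
    points until the scene overlap rate exceeds 90%."""
    covered = np.zeros(overlap.shape[0], dtype=bool)
    selected = []
    while covered.mean() < coverage_thresh and len(selected) < overlap.shape[1]:
        gains = overlap[~covered].sum(axis=0)   # newly covered points per frame
        best = int(np.argmax(gains))
        if gains[best] == 0:                    # no remaining frame adds coverage
            break
        selected.append(best)
        covered |= overlap[:, best].astype(bool)
    return selected
```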

3.3 Unidirectional Projection Module

The video frames in the benchmark ScanNetv2 [5] dataset used in our experiments are captured by a fixed camera and reconstructed into a 3D scene point cloud. Therefore, we establish a mapping relationship between multi-view images and 3D point clouds based on depth maps, camera intrinsics, and poses. The world coordinate system is located where the point cloud scene is located, while the multi-view pictures belong to the pixel coordinate system. $(x_w, y_w, z_w)^T$ denotes a point in the world coordinate system and $(u, v)^T$ denotes a pixel point in the pixel coordinate system. Thus, the formula for converting the pixel coordinate system to the world coordinate system is shown as follows [32].

$$
\begin{bmatrix} x_w \\ y_w \\ z_w \\ 1 \end{bmatrix}
=
\begin{bmatrix} R & t \\ 0 & 1 \end{bmatrix}^{-1}
\begin{bmatrix} Z_c K^{-1} \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} \\ 1 \end{bmatrix},
\qquad (2)
$$


where Zc is the depth value of the image, K is the camera internal parameter matrix, R is the orthogonal rotation matrix, and t is the translation vector. To verify our unidirectional projection module, we back-project all the dynamically selected multi-view images into 3D space and put them together with the original point cloud. The visualization results are shown in Fig. 4. Each color in the back-projected point cloud represents the projected point set for each view. It is clear to see that all views dynamically selected basically cover the indoor objects in the whole point cloud scene.
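As a concrete reading of Eq. (2), the sketch below back-projects one depth frame into world coordinates; the variable names and the world-to-camera pose convention are assumptions for illustration rather than the authors' implementation.

```python
import numpy as np

def backproject_depth(depth, K, pose_w2c):
    """depth: (H, W) depth map in metres; K: 3x3 intrinsics;
    pose_w2c: 4x4 [R t; 0 1] world-to-camera extrinsics. Returns (H*W, 3)."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).T  # 3 x HW
    cam = np.linalg.inv(K) @ pix * depth.reshape(-1)   # Z_c * K^{-1} [u, v, 1]^T
    cam_h = np.vstack([cam, np.ones((1, cam.shape[1]))])
    world = np.linalg.inv(pose_w2c) @ cam_h            # invert [R t; 0 1] as in Eq. (2)
    return world[:3].T
```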

Fig. 4. Back-projection result

Fig. 5. Feature integration method

3.4 Feature Integration

The multi-view images are mapped to the space of the original point cloud using the unidirectional projection module in Sect. 3.3 to obtain the back-projected point cloud, each point of which contains a 64-dimensional 2D deep semantic feature, denoted as $f_j$. Here, we define each point of the original point cloud as $p_i(x, y, z)$, as shown in Fig. 5. Specifically, each $p_i$ utilizes the K-Nearest Neighbors (KNN) algorithm to find $k$ back-projected points $p_j$ ($j \in \mathcal{N}_k(i)$) in 3D Euclidean space. The $f_j$ are summed to obtain $f_{2d}$, representing the 2D features of $p_i$, while $p_i$ obtains a 64-dimensional 3D deep semantic feature $f_{3d}$ through the 3D feature extractor. Finally, $f_{2d}$ and $f_{3d}$ are directly concatenated to become a 128-dimensional fusion feature $F_i$, which is calculated as follows.

$$
F_i = \mathrm{Concat}\left[f_{2d}, f_{3d}\right], \qquad f_{2d} = \sum_{j \in \mathcal{N}_k(i)} f_j
\qquad (3)
$$
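A minimal PyTorch sketch of the feature integration in Eq. (3) is given below; it uses k = 3 as in our implementation details, but the function itself and its tensor layout are only illustrative assumptions, not the released code.

```python
import torch

def integrate_features(orig_xyz, f3d, bp_xyz, bp_f2d, k=3):
    """orig_xyz: (N, 3) original points; f3d: (N, 64) deep 3D features;
    bp_xyz: (M, 3) back-projected points; bp_f2d: (M, 64) 2D features.
    Returns fused features F of shape (N, 128)."""
    dists = torch.cdist(orig_xyz, bp_xyz)                 # (N, M) Euclidean distances
    knn_idx = dists.topk(k, dim=1, largest=False).indices # indices of the k nearest points
    f2d = bp_f2d[knn_idx].sum(dim=1)                      # sum of the k neighbours' 2D features
    return torch.cat([f2d, f3d], dim=1)                   # Concat[f_2d, f_3d]
```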

4 Experiments

4.1 Datasets and Implementation Details

ScanNetv2 [5] is an indoor dataset including 706 different scenes, officially divided into 1201 training and 312 validation scans. Besides, the test set of 100 scans with hidden ground truth is used for benchmark. NYUv2 [25] contains 1449 densely labeled pairs of aligned RGB and depth images. We follow the official split of the dataset, using 795 for training and 654 for testing. Since this dataset has no 3D data, we need to use depth and camera intrinsics to generate 3D point clouds with 2D labels.


The training process can be divided into two stages. In the first stage, 2D images of ScanNetv2 were utilized to train a 2D feature extractor with 2D semantic labels. Note that the original image resolution was downsampled to 320×240 for model acceleration and memory savings. In the second stage, a 3D network was trained with the frozen 2D feature extractor. The loss function used in the experiments is the cross-entropy loss. As for the hyperparameter k in the feature integration module, we followed previous practice, e.g. MVPNet, and set it to 3. In the ablation study, the network structure of the proposed 3D feature extractor was set to M.UNet18A and the voxel size was set to 5 cm, which is consistent with the BPNet setup for a fair comparison. DMF-Net was trained for 500 epochs using the Adam [14] optimizer. The initial learning rate was set to 0.001, which decays with the cosine annealing schedule [19] from the 150th epoch. Besides, we conduct training on two RTX8000 cards with a mini-batch size of 16.
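The stated optimizer and schedule correspond roughly to the PyTorch setup below; `model_3d`, `train_loader`, and `train_one_epoch` are placeholders, and applying cosine annealing only from the 150th epoch is our reading of the description, so treat this as a sketch rather than the released training script.

```python
import torch

optimizer = torch.optim.Adam(model_3d.parameters(), lr=1e-3)      # initial LR 0.001
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=500 - 150)
criterion = torch.nn.CrossEntropyLoss()

for epoch in range(500):
    train_one_epoch(model_3d, train_loader, optimizer, criterion)  # placeholder training loop
    if epoch >= 150:                                               # decay starts at epoch 150
        scheduler.step()
```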

4.2 Comparison with SoTAs on ScanNetv2 Benchmark

Quantitative Results. We compare our method with mainstream methods on the test set of ScanNetv2 to evaluate the 3D semantic segmentation performance of DMF-Net. The majority of these methods can be divided into point-based methods [10,22,28], convolution-based methods [4,17,29,31], and 2D-3D fusion-based methods [6,11,13]. The results are reported in Table 1.

Table 1. Comparison with typical approaches on ScanNetv2 benchmark, including point-based, convolution-based and 2D-3D fusion-based (marked with *) methods

Methods       | mIOU | bath | bed  | bkshf | cab  | chair | cntr | curt | desk | door | floor | other | pic  | fridge | shower | sink | sofa | table | toilet | wall | window
P.Net++ [22]  | 33.9 | 58.4 | 47.8 | 45.8  | 25.6 | 36.0  | 25.0 | 24.7 | 27.8 | 26.1 | 67.7  | 18.3  | 11.7 | 21.2   | 14.5   | 36.4 | 34.6 | 23.2  | 54.8   | 52.3 | 25.2
3DMV* [6]     | 48.4 | 48.4 | 53.8 | 64.3  | 42.4 | 60.6  | 31.0 | 57.4 | 43.3 | 37.8 | 79.6  | 30.1  | 21.4 | 53.7   | 20.8   | 47.2 | 50.7 | 41.3  | 69.3   | 60.2 | 53.9
FAConv [31]   | 63.0 | 60.4 | 74.1 | 76.6  | 59.0 | 74.7  | 50.1 | 73.4 | 50.3 | 52.7 | 91.9  | 45.4  | 32.3 | 55.0   | 42.0   | 67.8 | 68.8 | 54.4  | 89.6   | 79.5 | 62.7
MCCNN [10]    | 63.3 | 86.6 | 73.1 | 77.1  | 57.6 | 80.9  | 41.0 | 68.4 | 49.7 | 49.1 | 94.9  | 46.6  | 10.5 | 58.1   | 64.6   | 62.0 | 68.0 | 54.2  | 81.7   | 79.5 | 61.8
FPConv [17]   | 63.9 | 78.5 | 76.0 | 71.3  | 60.3 | 79.8  | 39.2 | 53.4 | 60.3 | 52.4 | 94.8  | 45.7  | 25.0 | 53.8   | 72.3   | 59.8 | 69.6 | 61.4  | 87.2   | 79.9 | 56.7
MVPNet* [13]  | 64.1 | 83.1 | 71.5 | 67.1  | 59.0 | 78.1  | 39.4 | 67.9 | 64.2 | 55.3 | 93.7  | 46.2  | 25.6 | 64.9   | 40.6   | 62.6 | 69.1 | 66.6  | 87.7   | 79.2 | 60.8
DCM-Net [24]  | 65.8 | 77.8 | 70.2 | 80.6  | 61.9 | 81.3  | 46.8 | 69.3 | 49.4 | 52.4 | 94.1  | 44.9  | 29.8 | 51.0   | 82.1   | 67.5 | 72.7 | 56.8  | 82.6   | 80.3 | 63.7
KP-FCNN [28]  | 68.4 | 84.7 | 75.8 | 78.4  | 64.7 | 81.4  | 47.3 | 77.2 | 60.5 | 59.4 | 93.5  | 45.0  | 18.1 | 58.7   | 80.5   | 69.0 | 78.5 | 61.4  | 88.2   | 81.9 | 63.2
M.UNet [4]    | 73.6 | 85.9 | 81.8 | 83.2  | 70.9 | 84.0  | 52.1 | 85.3 | 66.0 | 64.3 | 95.1  | 54.4  | 28.6 | 73.1   | 89.3   | 67.5 | 77.2 | 68.3  | 87.4   | 85.2 | 72.7
BPNet* [11]   | 74.9 | 90.9 | 81.8 | 81.1  | 75.2 | 83.9  | 48.5 | 84.2 | 67.3 | 64.4 | 95.7  | 52.8  | 30.5 | 77.3   | 85.9   | 78.8 | 81.8 | 69.3  | 91.6   | 85.6 | 72.3
Ours*         | 75.2 | 90.6 | 79.3 | 80.2  | 68.9 | 82.5  | 55.6 | 86.7 | 68.1 | 60.2 | 96.0  | 55.5  | 36.5 | 77.9   | 85.9   | 74.7 | 79.5 | 71.7  | 91.7   | 85.6 | 76.4

DMF-Net achieves a significant mIOU performance improvement compared with point-based methods which are limited by their receptive field range and inefficient local information extraction. For convolution-based methods, such as stronger sparse convolution, M.UNet can expand the range of receptive fields. Our method outperforms M.UNet by a relative 2.2% on mIOU because 2D texture information was utilized. DMF-Net shows a relative improvement of 17.3% on mIOU compared to MVPNet, a baseline unidirectional projection scheme. Such improvement can be attributed to the fact that the feature alignment problem of MVPNet is alleviated. Especially, our unidirectional projection scheme


DMF-Net is significantly better than the bidirectional projection method BPNet, a state-of-the-art approach in 2D-3D information fusion. The inflexibility of the BPNet framework limits its performance, while the high flexibility of our network framework enables further improvements.

Qualitative Results. We compare the pure 3D sparse convolution M.UNet, the joint 2D-3D approach BPNet, and our method DMF-Net by conducting inference on the validation set of ScanNetv2. The visualization results are shown in Fig. 6.

Fig. 6. Qualitative results of 3D semantic segmentation

As indicated by the red boxes, the 3D-only method M.UNet does not discriminate well between smooth planes or objects with insignificant shape differences, such as windows, doors, pictures, and refrigerators. This may be due to the low resolution of the 3D point cloud and the lack of texture information for smooth planes. Despite the joint 2D-3D approach used by BPNet, the segmented objects are usually incomplete, owing to the fact that the bidirectional projection network is distracted from the core task, i.e. 3D semantic segmentation.

4.3 Ablation Study and Analysis

Ablation for 2D-3D Fusion Effectiveness. We first fuse 2D deep semantic features with 3D shallow semantic features (i.e. each point contains XYZ and


RGB), followed by 3D sparse convolution. As shown in Table 2, the 3D semantic segmentation performance in mIOU is improved from 66.4 to 70.8, indicating that 2D semantic features can benefit the 3D semantic segmentation task. However, directly fusing 2D deep semantic features with 3D shallow geometric features causes misalignment in the semantic depth space, affecting the network's performance. For this reason, our DMF-Net adds a 3D feature extractor to the above framework so that 2D & 3D features are fused and aligned in semantic depth. As shown in Table 2, the feature-aligned model (V2) has a relative improvement of 1.3% in mIOU compared to the unaligned model (V1). In addition, we obtain a relative 4.4% mIOU improvement with the stronger attention-based 2D backbone Swin-UNet [2] (V3) compared with the common U-Net34 model (V2). It is worth mentioning that the performance of 3D sparse convolution is sensitive to the voxel size. We adopt a deeper 3D sparse network, M.UNet34C, and set the voxel size to 2 cm to obtain better results, shown as V4 in Table 2.

Table 2. 2D & 3D semantic segmentation results on the validation set of ScanNetv2

Methods                           | Voxel Size | 2D mIOU | 3D mIOU
U-Net34 [23]                      | –          | 60.7    | –
Swin-UNet [2]                     | –          | 68.8    | –
M.UNet18A [4]                     | 5 cm       | –       | 66.4
Ours V1 (U-Net34 + XYZ & RGB)     | 5 cm       | –       | 69.9
Ours V2 (U-Net34 + M.UNet18A)     | 5 cm       | –       | 70.8
Ours V3 (Swin-UNet + M.UNet18A)   | 5 cm       | –       | 73.9
Ours V4 (Swin-UNet + M.UNet34C)   | 2 cm       | –       | 75.6

Table 3. 3D semantic segmentation results on NYUv2 [25]

Methods       | Mean Acc
SceneNet [8]  | 52.5
D3SM [9]      | 54.3
S.Fusion [20] | 59.2
Scannet [5]   | 60.7
3DMV [6]      | 71.2
BPNet [11]    | 73.5
DMF-Net       | 78.4

Ablation for Projection Methods. We conduct further ablative experiments to verify that the unidirectional projection scheme is more focused on the 3D semantic segmentation task than the bidirectional projection. Using the same framework as in Fig. 3, we project 3D features into the 2D deep semantic feature space. Essentially, we apply a projection method similar to Sect. 3.3, but in the opposite direction. Meanwhile, the 2D-3D feature fusion is the same as in Sect. 3.4. After 3D features are fused with multi-view features, semantic labels are output through a U-Net34 network, and hence a 2D cross-entropy loss is introduced into the total loss of the model. To avoid focusing too much on the optimization of the 2D task, we multiply the 2D loss by a weight of 0.1, the same as BPNet. In this variant, the 2D model parameters are no longer frozen during training, and the learning rate of the 2D model is 10 times lower than that of the 3D model. Our experiments show that the bidirectional projection model overfits when it is trained to the 200th epoch, as seen from the 3D validation loss in Fig. 2. Meanwhile, the 3D mIOU of bidirectional projection only reaches 70.6, which is lower than the performance of the simple unidirectional projection (70.8). As the 2D task is also introduced, the increased number of learning parameters and the difficulty


in adjusting the hyperparameters made it difficult for the model to focus on the 3D task. In this sense, unidirectional projection can focus more on 3D semantic segmentation tasks than bidirectional projection, leading to better flexibility. A sketch of the loss weighting used in this bidirectional variant is given below.
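For reference, the loss weighting and per-branch learning rates of this bidirectional variant can be written as follows; `net_2d`, `net_3d`, and the prediction tensors are placeholders, so this is a sketch of the described configuration rather than the exact code.

```python
import torch
import torch.nn.functional as F

# net_2d / net_3d are placeholder modules for the 2D and 3D branches.
optimizer = torch.optim.Adam([
    {"params": net_3d.parameters(), "lr": 1e-3},
    {"params": net_2d.parameters(), "lr": 1e-4},   # 2D branch trained with a 10x lower LR
])

loss_3d = F.cross_entropy(pred_3d, labels_3d)
loss_2d = F.cross_entropy(pred_2d, labels_2d)
loss = loss_3d + 0.1 * loss_2d                     # down-weight the auxiliary 2D task
loss.backward()
optimizer.step()
```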

4.4 DMF-Net on NYUv2

To verify the generalization ability, we conduct experiments on another popular RGB-D dataset, NYUv2 [25]. We report a dense pixel classification mean accuracy for DMF-Net, obtaining a significant performance improvement compared to other typical methods, especially joint 2D-3D methods, e.g. 3DMV [6] and BPNet [11]. As seen in Table 3, our DMF-Net gains a relative 6.6% performance improvement compared to the state-of-the-art BPNet [11]. This result demonstrates the strong generalization capability of the DMF-Net.

5 Conclusions and Future Work

In our work, we propose a Deep Multi-view Fusion Network (DMF-Net) based on a unidirectional projection method to perform 3D semantic segmentation utilizing 2D continuous texture information and 3D geometry information. Compared with previous 2D-3D fusion methods, DMF-Net enjoys a deeper and more flexible network. Thus DMF-Net improves segmentation accuracy for objects with little variation in shape, effectively compensating for the limitations of pure 3D methods. In addition, DMF-Net achieves superior performance among joint 2D-3D methods on the ScanNetv2 benchmark. Moreover, we obtain significant performance gains over previous approaches on the NYUv2 dataset. Currently, the number of dynamically selected multi-view images in DMF-Net is relatively large in order to cover the full 3D scene. In the future, we will explore efficient view selection algorithms so that even a few image inputs could achieve full coverage of the 3D scene.

Acknowledgement. The work was partially supported by the following: National Natural Science Foundation of China under no. 62376113; Jiangsu Science and Technology Programme (Natural Science Foundation of Jiangsu Province) under no. BE2020006-4; UK Engineering and Physical Sciences Research Council (EPSRC) Grants Ref. EP/M026981/1, EP/T021063/1, EP/T024917/.

References 1. Boulch, A., Le Saux, B., Audebert, N.: Unstructured point cloud semantic labeling using deep segmentation networks. In: 3DOR (2017) 2. Cao, H., et al.: Swin-Unet: Unet-like pure transformer for medical image segmentation. arXiv preprint arXiv:2105.05537 (2021) 3. Chiang, H.Y., Lin, Y.L., Liu, Y.C., Hsu, W.H.: A unified point-based framework for 3D segmentation. In: 3DV, pp. 155–163 (2019)


4. Choy, C., Gwak, J., Savarese, S.: 4D spatio-temporal ConvNets: minkowski convolutional neural networks. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3075–3084 (2019) 5. Dai, A., Chang, A.X., Savva, M., Halber, M., Funkhouser, T., Nießner, M.: ScanNet: richly-annotated 3D reconstructions of indoor scenes. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5828–5839 (2017) 6. Dai, A., Nießner, M.: 3DMV: joint 3D-multi-view prediction for 3D semantic scene segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11214, pp. 458–474. Springer, Cham (2018). https://doi.org/10. 1007/978-3-030-01249-6_28 7. Graham, B., Engelcke, M., Van Der Maaten, L.: 3D semantic segmentation with submanifold sparse convolutional networks. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9224–9232 (2018) 8. Handa, A., Patraucean, V., Badrinarayanan, V., Stent, S., Cipolla, R.: SceneNet: Understanding real world indoor scenes with synthetic data. arXiv preprint arXiv:1511.07041 (2015) 9. Hermans, A., Floros, G., Leibe, B.: Dense 3D semantic mapping of indoor scenes from RGB-D images. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 2631–2638 (2014) 10. Hermosilla, P., Ritschel, T., Vázquez, P.P., Vinacua, À., Ropinski, T.: Monte Carlo convolution for learning on non-uniformly sampled point clouds. ACM Trans. Graph. (TOG) 37(6), 1–12 (2018) 11. Hu, W., Zhao, H., Jiang, L., Jia, J., Wong, T.T.: Bidirectional projection network for cross dimension scene understanding. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14373–14382 (2021) 12. Huang, K., Hussain, A., Wang, Q., Zhang, R.: Deep learning: fundamentals, theory and applications, vol. 2. Springer (2019). https://doi.org/10.1007/978-3-03006073-2 13. Jaritz, M., Gu, J., Su, H.: Multi-view pointNet for 3D scene understanding. In: 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW) (2019) 14. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) (2015) 15. Kundu, A., et al.: Virtual multi-view fusion for 3D semantic segmentation. In: ECCV, pp. 518–535 (2020) 16. Lawin, F.J., Danelljan, M., Tosteberg, P., Bhat, G., Khan, F.S., Felsberg, M.: Deep projective 3D semantic segmentation. In: Felsberg, M., Heyden, A., Krüger, N. (eds.) CAIP 2017. LNCS, vol. 10424, pp. 95–107. Springer, Cham (2017). https:// doi.org/10.1007/978-3-319-64689-3_8 17. Lin, Y., et al.: FPConv: learning local flattening for point convolution. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4293–4302 (2020) 18. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3431–3440 (2015) 19. Loshchilov, I., Hutter, F.: SGDR: stochastic gradient descent with warm restarts. In: International Conference on Learning Representations (ICLR) (2017) 20. McCormac, J., Handa, A., Davison, A., Leutenegger, S.: SemanticFusion: dense 3D semantic mapping with convolutional neural networks. In: 2017 IEEE International Conference on Robotics and Automation (ICRA), pp. 4628–4635 (2017)


21. Qi, C.R., Su, H., Mo, K., Guibas, L.J.: PointNet: deep learning on point sets for 3D classification and segmentation. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 652–660 (2017) 22. Qi, C.R., Yi, L., Su, H., Guibas, L.J.: PointNet++ deep hierarchical feature learning on point sets in a metric space. In: NeurIPS, pp. 5105–5114 (2017) 23. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28 24. Schult, J., Engelmann, F., Kontogianni, T., Leibe, B.: DualConvMesh-Net: joint geodesic and Euclidean convolutions on 3D meshes. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8612–8622 (2020) 25. Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from RGBD images. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7576, pp. 746–760. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33715-4_54 26. Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5693–5703 (2019) 27. Tatarchenko, M., Park, J., Koltun, V., Zhou, Q.Y.: Tangent convolutions for dense prediction in 3D. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3887–3896 (2018) 28. Thomas, H., Qi, C.R., Deschaud, J.E., Marcotegui, B., Goulette, F., Guibas, L.J.: KPConv: flexible and deformable convolution for point clouds. In: International Conference on Computer Vision (ICCV), pp. 6411–6420 (2019) 29. Wu, W., Qi, Z., Fuxin, L.: PointConv: deep convolutional networks on 3D point clouds. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9621–9630 (2019) 30. Xiao, X., Lian, S., Luo, Z., Li, S.: Weighted res-UNet for high-quality retina vessel segmentation. In: International Conference on Information Technology in Medicine and Education (ITME), pp. 327–331 (2018) 31. Zhang, J., Zhu, C., Zheng, L., Xu, K.: Fusion-aware point convolution for online semantic 3D scene segmentation. In: conference on Computer Vision and Pattern Recognition (CVPR), pp. 4534–4543 (2020) 32. Zhang, Z.: A flexible new technique for camera calibration. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 22(11), 1330–1334 (2000) 33. Zhao, W., Yan, Y., Yang, C., Ye, J., Yang, X., Huang, K.: Divide and conquer: 3D point cloud instance segmentation with point-wise binarization. In: International Conference on Computer Vision (ICCV) (2023)

RF-Based Drone Detection with Deep Neural Network: Review and Case Study

Norah A. Almubairik1,3 and El-Sayed M. El-Alfy2,3(B)

1 Networks and Communications Department, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia
2 SDAIA-KFUPM Joint Research Center for Artificial Intelligence, IRC for Intelligent Secure Systems (IRC-ISS), KFUPM, Dhahran, Saudi Arabia
3 College of Computing and Mathematics, King Fahd University of Petroleum and Minerals (KFUPM), Dhahran, Saudi Arabia
[email protected]

Abstract. Drones have been widely used in many application scenarios, such as logistics and on-demand instant delivery, surveillance, traffic monitoring, firefighting, photography, and recreation. On the other hand, there is a growing level of misemployment and malicious utilization of drones being reported on a local and global scale. Thus, it is essential to employ security measures to reduce these risks. Drone detection is a crucial initial step in several tasks such as identifying, locating, tracking, and intercepting malicious drones. This paper reviews related work for drone detection and classification based on deep neural networks. Moreover, it presents a case study to compare the impact of utilizing magnitude and phase spectra as input to the classifier. The results indicate that prediction performance is better when the magnitude spectrum is used. However, the phase spectrum can be more resilient to errors due to signal attenuation and changes in the surrounding conditions.

Keywords: Border security · Drone detection · Radio-frequency signals · FFT spectrum · Deep learning

1 Introduction

Unmanned Aerial Vehicles (UAVs), pilotless or uncrewed aircraft commonly known as drones, were once considered a restricted technology that only official authorities, such as the military and government, could use. Currently, UAVs are also being used at a growing scale in commercial and personal services, e.g. to distribute goods and services in different industries. There are various potential use cases for drones, including, but not limited to, logistics, monitoring traffic, monitoring and fertilizing crop fields, building safety inspection, surveillance and border control, photography, and recreational services [12]. Despite the variety of beneficial applications of drones, their misuse threatens national and international security and public safety. There is a growing number of reported incidents on local and global scales. For instance, Saudi Arabia had to cut oil and gas production due to drone attacks on two major oil facilities run by the state-owned


company, Aramco, in 2019 [BBC News1]. In the UK, a serious incident occurred in London between the 19th and 21st of December 2018, when Gatwick Airport was forced to shut down due to a drone strike. Around 1000 flights were either diverted or canceled, affecting an estimated 140,000 passengers [The Telegraph2].

Drone detection is crucial for border security and public safety. It is an important step for identifying, locating, tracking, alerting, and intercepting unauthorized drones. Different approaches have been proposed for drone detection based on the various types of sensors used: (1) radar-based, (2) optical or video-based, (3) radio frequency (RF) based, and (4) acoustic-based sensors. Radar-based detection systems have a fast-tracking mechanism and 360-degree coverage; however, they fail to detect small UAV objects [18,19]. In the same manner, video-based detection systems are unable to detect drones in long-range scenarios and foggy conditions [6]. Moreover, acoustic-based techniques are affected by noisy environments and have a short detection range. Detecting drones using radio frequency, on the other hand, is not affected by the size of the UAV, its distance, foggy conditions, or noisy environments. In addition, it is considered a relatively reliable and low-cost solution. RF fingerprinting techniques depend on the particular characteristics of the radio frequency waveform emitted from the drone and/or its controllers. Experiments have shown that the majority of commercial UAVs have distinct RF signatures as a result of their electronics design, modulation techniques, and body vibration. Consequently, RF fingerprints obtained from the UAV or its remote controller signals can be used to identify and classify UAVs and their activities [8].

Over the past few years, deep learning (DL), a sub-field of machine learning, has gained popularity and has been a driving force behind several recent innovations. In comparison to other paradigms, deep learning techniques are widely recognized as being among the most efficient and effective end-to-end modeling techniques that embody feature analysis and extraction from raw data, relieving human experts from the tedious process of feature engineering. Over time, deep learning has been able to solve increasingly complex applications in natural language processing and computer vision with high accuracy [5].

The aim of this paper is to first present a review of work related to deep learning for RF-based drone detection and classification. Additionally, it provides a case study by extending the work done in [2] to compare the performance of a deep neural network model with the magnitude and phase spectra of RF signals. The RF signals are transformed using the Fast Fourier Transform (FFT), then the magnitude and phase spectra are computed and normalized. After that, two sets of experiments are conducted using neural networks of multiple layers, and the results are analyzed using various types of features of segmented signals in order to: (i) detect drone presence, and (ii) detect the drone and recognize its type.

The remainder of this paper is arranged as follows. Section 2 reviews work related to drone detection systems using radio frequency and deep learning techniques. Section 3 describes a case study, including the experimental setup and the methodology followed, as well as a description of the conducted experiments and an analysis of the results. Finally, Sect. 4 concludes the paper and highlights some recommendations and potential issues for future work.

1 https://cutt.ly/0hAlsjZ
2 https://cutt.ly/2bxdUaO

2 Background

A drone is a form of aircraft that does not have a human pilot on board. Its main parts include the drone body, a remote control device, and an energy device. It can be remotely or autonomously controlled. It has become a widely-used technology due to the significant reduction in costs and sizes. It has several potential applications, such as express shipping and package delivery, aerial photography for journalism and film, weather forecasting, crop spraying, entertainment, etc. However, the misuse of drone technology can have major impacts on public safety and national security. Drones can threaten flight safety, engage in criminal acts, and invade personal privacy, as they are often equipped with high-quality cameras [10,17]. This includes, but is not limited to, offensive reconnaissance and monitoring of individuals. Drones can be classified into the following main categories:

– Fixed-Wing Systems: A term used specifically in the aviation industry to describe aircraft that use fixed rigid wings to produce lift in conjunction with forward airspeed. Yaacoub et al. describe fixed-wing systems as follows: "They are based on the Vertical Take-Off and Landing (VTOL) principle" [23]. Traditional airplanes, surface-attached kites, and various kinds of gliders, such as hang gliders or para-gliders, are examples of this type of aircraft [7,21].

– Multi-Rotor Systems: Aircraft that produce lift using rotary wings. A traditional helicopter is a common example of a rotorcraft, which can have one or many rotors. Drones using rotary systems are often fitted with multiple small rotors, which are required for their stability [7,21]. The DJI Phantom and Parrot Bebop are considered commercial multi-rotor drones.

– Hybrid-Wing Drones: These drones have been developed with both fixed and rotary wings to reach the intended location faster and hover in the air using their rotor wings [23]. They can cover a maximum of 150 km in a single flight and are easy to control.

– Ornithopter Drones: This category includes unmanned aircraft that fly by imitating insect or bird wing motions (i.e. flapping their wings). The majority of these ornithopters are scaled in relation to the birds or insects they represent [7,21]. The flapping wing mechanism converts the motor's rotary motion into the ornithopter's reciprocating motion, allowing it to provide the necessary lift and thrust to travel steadily [11]. The DelFly Explorer and the micro-mechanical flying insect are two examples of ornithopter drones.

The existence of various types of drones with a variety of characteristics makes their detection and the design of anti-drone systems a daunting task.

RF-Based Drone Detection with DNN: Review and Case Study

19

Al-Sa'd et al. [2] developed a drone detection mechanism. They captured a large number of drone RF signals and created a dataset called DroneRF. Then, the RF signals were transformed using the FFT magnitude for different frequency segments and fed into three separate Deep Neural Networks (DNNs) to detect drones and recognize their types and operational states. The overall accuracy of the designed system decreased when the number of classes was increased. The classification accuracy reached 99.7%, 84.5%, and 46.8% for the binary, four-class, and ten-class classification problems, respectively. Al-Emadi and Al-Senaid [1] also utilized the same DroneRF dataset and proposed a drone detection system using a Convolutional Neural Network (CNN) instead of a DNN. The results confirmed that drone detection and mode identification using CNN outperformed the DNN-based solutions. Similarly, Allahham et al. [4] applied a CNN to the DroneRF dataset to develop a drone detection, identification, and classification approach. Their analysis shows that drone and background activity spectra are significantly distinguishable. However, the RF spectra of different types of drones as well as different operation modes are either identical or overlapping. For that reason, they channelized the spectrum into multiple channels and considered each one as a separate input to the classifier. The results showed perfect drone detection, but the accuracy is reduced to 94.6% and 87.4% for identifying drone types and states, respectively.

Nguyen et al. [15] developed the MATTHAN algorithm that detects a drone's body movements, namely body shifting and body vibration. The algorithm analyzes the radio frequencies emitted from the communication between the drone and its remote controller. The algorithm gathers evidence from multiple sources (e.g. moving object detection, body shifting patterns, body vibration). Then, the algorithm combines these sources of evidence to form a binary classifier. The MATTHAN algorithm was evaluated across seven different drones and three different environments. The results showed that the more time the drone stays in the coverage area, the more accurate the results are. Furthermore, the detection accuracy increased when relying on several sources of evidence. In addition, the findings revealed that when the distance between the drone and the anti-drone system increased, the detection percentage decreased. To illustrate, when the drone was 10 m away from the detection system, the accuracy reached 96.5%, but when the distance increased, the detection accuracy was reduced to 89.4%.

Nguyen et al. [14] examined a drone detection system that was developed to autonomously detect and characterize drones via RF signals. They combined two techniques, passive (i.e. radio frequency) and active (i.e. radar), to identify the presence of potentially invading drones. The proposed system depends mainly on three characteristics: the drone's rotating propellers, the drone's communication, and the drone's body vibration. As for the rotating propellers, the detection system uses a WiFi receiver (e.g. an Alfa WiFi network adapter) and analyzes the significance of the signal reflected from the propellers. Regarding the drone's communication, it is noticed that the communication link between wireless mobile devices and their connected access points is refreshed around 10 times per second, whereas drones and their controllers reach 30 cycles per second. This is because


of the importance of automatically updating the drone status. The third feature is the drone's body vibration, in which the receiver monitors any modifications in the reflected signal intensity generated by the vibration of the drone body. The distance between the drone and the receiver can be measured using either phase variations or the Received Signal Strength (RSS).

Xiao and Zhang [22] worked on drone detection and classification based on RF signals buried in ambient noise (e.g., WiFi signals). Their classification technique focused on the RF signature extracted from the down-link communication between the drone and its controller rather than the up-link communication. The RF signature consists of cyclostationarity features (i.e. a signal having statistical characteristics that differ cyclically over time), kurtosis (i.e. measuring the tailedness of a Gaussian-distributed signal and the impact of central-limit-like conditions), and spectrum factors. The created RF signature is fed into two machine-learning classifiers: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Different Signal-to-Noise Ratios (SNRs) and feature selection methods were considered while testing the drone signals. The testing results showed that the KNN classifier outperformed SVM.

The AirID dataset was developed by Mohanti at the Genesys Lab at Northeastern University. It includes raw interleaved quadrature (IQ) samples taken from over-the-air transmissions of four USRP B200mini radios, each of which was mounted as a transmitter on a DJI M100 UAV and transmitted with a different IQ imbalance. Every recording in this dataset is made up of two files: a metadata file and a dataset file. The metadata file holds information about the dataset while the dataset file is a binary file containing digital samples. The metadata and data are in compliance with the Signal Metadata Format (SigMF) [13].

DroneRC is another RF-based dataset generated and utilized in some other works for drone detection and classification. For instance, Ezuma et al. [9] utilized the DroneRC dataset to develop a micro-UAV detection and classification system using RF fingerprints of signals emitted from the UAV remote controllers (RCs) and captured through an antenna and a high-frequency oscilloscope. The utilized dataset contained 100 RF signals collected from each of 14 different types of micro-UAV controllers. The duration of each signal was 0.25 ms and was represented by a vector of 5 million samples. The dataset was fed into wavelet analysis to reduce possible bias. Then, several machine learning classifiers were applied, including SVM, KNN, and neural networks (NN). Using KNN classification, all of the micro-UAVs were correctly identified, with an overall precision of 96.3%. Ozturk et al. [16] used the DroneRC dataset to study the issue of classifying UAVs based on RF fingerprints at low SNRs. They used CNNs trained on RF time-series images and spectrograms of 15 separate off-the-shelf drone controller RF signals. They reached a classification accuracy ranging from 92% to 100% for an SNR range of −10 dB to 30 dB, which substantially outperformed current methods. Similarly, Ezuma et al. [8] investigated drone detection and classification of UAVs in the presence of Bluetooth and WiFi interference signals. At 25 dB SNR, the KNN classifier achieved a classification accuracy of


98.13%. The classification efficiency was also studied for a group of 17 UAV controllers at various SNR levels. In [20], the authors focused on drone detection in the presence of interference. They utilized RSS feature-based detectors to detect the presence of a drone signal buried in RF interference and thermal noise. Using the RSS feature requires a high SNR to ensure that the RF signal is not susceptible to interference from other background signals. The findings showed that the detection probability changes in a non-monotonic pattern.

3 Case Study

This study implements a drone detection and classification system for border security intelligence using radio frequency signals and deep learning. The aim is to compare the impact of using the magnitude and phase spectra of the FFT on the performance. This section describes how the research was conducted in detail.

3.1 System Overview and Experimental Setup

Figure 1 illustrates the overall drone detection and recognition methodology. The system is composed of five modules: the drone under analysis, drone remote controller, RF sensing module, RF analyzer, and drone detection and classification module. The first three modules were conducted by Al-Sa’d et al. [2,3] and the DroneRF dataset was created. The last two modules are the focus of our research. Different drones can emit different RF signals, which can then be used by intelligent systems to detect and identify them. During data collection, three types of drones were used, namely, DJI Phantom 3, Parrot Bebop, and Parrot AR Drone. The drone remote controller can also be a cellphone that sends and receives RF commands to and from the drones under investigation in order to change the drone’s flight mode. Controlling the drones with a mobile phone necessitates mobile applications customized to each drone, e.g. “FreeFlight Pro,” “AR.FreeFlight,” and “DJI Go”. Other applications can be used as well; however, for this dataset collection, the drone’s official application was utilized. The drone’s communications with the flight control module were intercepted by an RF receiver connected to a laptop via a cable, which runs a program that retrieves, processes, and stores the RF data in a database, named “DroneRF”. The aim of this module is to capture all unlicensed RF bands used by drones without making any assumptions about their flight mode. The dataset contains recordings of RF activities of the three types of drones (AR, Bepop and Phantom) as well as background signals (no drones). There are four operating modes: on and connected, hovering, flying but no video recording, and flying while recording video. The total number of segments in the dataset is 227 segments distributed as follows: 84 segments for Bepop (4 modes), 81 segments for AR (4 modes), 21 segments for Phantom (one mode) and 41 segments for Background. Each segment duration is 250 ms and captured using two simultaneous receivers with a sampling rate 40M sample/s per channel: one for the lower half


Fig. 1. Illustration of the main modules of the RF-based drone detection and classification system

of the frequency band (10M samples) and the other for the upper half of the frequency band (10M samples). For binary classification, there are two classes: No drone (82 segments) and Drone (372 segments). For 4-class classification, the drone class is divided into three types: Phantom (42 segments), AR (162 segments), and Bebop (168 segments).

Signal analysis is used to discover hidden information in the recorded RF signals that can be used to improve detection. In this study, we adopted the Fast Fourier Transform (FFT) to analyze the frequency-domain spectrum of the recorded RF signals. It is an invertible function, i.e. an approximate form of the signal can be reconstructed using the inverse FFT (IFFT). FFT provides a fast method for computing the Discrete Fourier Transform (DFT), which is widely used in several other signal-processing applications such as remote sensing, communication, speech, and financial time series. It decomposes a signal into a series of sinusoidal (harmonic) components or vibrations to show how the signal energy is distributed over a particular range of frequencies (signal bandwidth). Mathematically, a signal uniformly sampled in time, i.e. a sequence of N values $\{x_n : x_0, x_1, x_2, \ldots, x_{N-1}\}$, is transformed into another sequence of N complex numbers in the frequency domain $\{X_k : X_0, X_1, X_2, \ldots, X_{N-1}\}$ as follows:

$$
X_k = \frac{1}{N}\sum_{n=0}^{N-1} x_n\, e^{\frac{-j2\pi kn}{N}}
\qquad (1)
$$


where $j = \sqrt{-1}$. Alternatively, using the Euler formula $e^{j\theta} = \cos\theta + j\sin\theta$, the FFT can be rewritten as

$$
X_k = \frac{1}{N}\sum_{n=0}^{N-1} x_n \cdot \left[\cos\!\left(\frac{2\pi}{N}kn\right) - j\cdot\sin\!\left(\frac{2\pi}{N}kn\right)\right]
\qquad (2)
$$

The FFT of a real-valued time signal is a sequence of complex numbers $z$; each can be represented by a real part $\mathrm{real}(z)$ and an imaginary part $\mathrm{img}(z)$, or by a magnitude part $|z|$ and a phase part $\angle z$. These quantities are mathematically related as follows:

$$
|z| = \sqrt{\mathrm{real}(z)^2 + \mathrm{img}(z)^2}
\qquad (3)
$$

$$
\angle z = \tan^{-1}\frac{\mathrm{img}(z)}{\mathrm{real}(z)}
\qquad (4)
$$
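In practice, Eqs. (1)–(4) amount to computing the FFT of each RF segment and splitting it into magnitude and phase. The NumPy sketch below is a minimal illustration; the per-segment normalization and the unwrapped phase are assumptions based on the description and Fig. 2, not the exact preprocessing pipeline.

```python
import numpy as np

def fft_features(segment):
    """Return normalized magnitude and unwrapped phase spectra of one RF segment."""
    spectrum = np.fft.fft(segment) / len(segment)       # Eq. (1), including the 1/N factor
    magnitude = np.abs(spectrum)                        # Eq. (3)
    phase = np.unwrap(np.angle(spectrum))               # Eq. (4), unwrapped as in Fig. 2
    magnitude = magnitude / (magnitude.max() + 1e-12)   # simple per-segment normalization
    phase = phase / (np.abs(phase).max() + 1e-12)
    return magnitude, phase
```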

Figure 2 presents a sample of RF signals (segment #1 of the background and drone activities) as well as their magnitude and phase spectra. The FFT of each observed segment is calculated twice, since the DroneRF dataset captures the entire 2.4 GHz bandwidth using two receivers (i.e. the first receiver captures the lower half of the frequency band and the second receiver captures the upper half). After extracting both the magnitude and phase spectra, they are used as inputs to a deep neural network in three sets of experiments. For comparison purposes, each network is composed of three dense layers, with the Adam optimizer, ReLU activation for the inner layers, and sigmoid activation for the output layer; a minimal sketch of this setup is given below. The results are reported for stratified 10-fold cross validation, a batch size of 10, and 200 epochs.
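The classifier and evaluation protocol described above correspond roughly to the Keras sketch below; the hidden-layer widths and input dimension are not specified in the text and are therefore assumptions, as is the exact cross-validation loop.

```python
import numpy as np
from tensorflow import keras
from sklearn.model_selection import StratifiedKFold

def build_dnn(input_dim, n_outputs=1, hidden=(256, 128)):
    """Three dense layers: ReLU for the inner layers, sigmoid for the output."""
    model = keras.Sequential([
        keras.layers.Input(shape=(input_dim,)),
        keras.layers.Dense(hidden[0], activation="relu"),
        keras.layers.Dense(hidden[1], activation="relu"),
        keras.layers.Dense(n_outputs, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return model

def cross_validate(X, y):
    """Stratified 10-fold CV with batch size 10 and 200 epochs, as stated above."""
    accs = []
    for tr, te in StratifiedKFold(n_splits=10, shuffle=True).split(X, y):
        model = build_dnn(X.shape[1])
        model.fit(X[tr], y[tr], batch_size=10, epochs=200, verbose=0)
        accs.append(model.evaluate(X[te], y[te], verbose=0)[1])
    return float(np.mean(accs))
```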

3.2 Results and Discussions

For binary classification experiments, the performance results of three DNN models are presented in the confusion matrices shown in Fig. 3 as well as accuracy, recall, precision, and F1 score. In the first two models, the magnitude spectrum and phase spectrum are used separately as input to the DNN model whereas in the third model, they are combined and used as input. The results show that the magnitude alone has the highest performance. The binary classification problem is further divided into sub-problems: Background vs AR, Background vs Bebop, and Background vs Phantom. This decomposition may help to understand why the signal phase feature performed lower than the signal magnitude feature. The experimental results also show that the performance is consistent among all drones when the magnitude spectrum feature is utilized alone. On the other hand, when phase feature is considered, the detection system effectively classifies Phantom activities from background activities with an accuracy of 99.34%. The predictive performances for Bebop and AR drones are reduced to 79.94% and 66.12% (approx. to 2 decimal places), respectively.


Fig. 2. Sample RF signals (segment#10) and its magnitude and phase spectra (1st and 2nd rows for Background signal low and high band channels, 3rd and 4th rows are for Bepop low and high band channels, 1st column is the time-domain of the original signals, 2nd column is the magnitude spectrum in dB, 3rd column is the unwrapped phase spectrum, 4th column is the reconstructed signal in time-domain from the magnitude only (i.e. zero phase))

Fig. 3. Overall performance comparison of binary classification for drone detection (Drone presence vs Background)

The same techniques were also applied to the four-class classification problem and similar results were obtained, which confirms that the magnitude-spectrum-based features are more accurate than the phase-spectrum-based features, as shown in Fig. 4. As a final conclusion, the magnitude spectrum is sufficient to detect and classify drones. However, the magnitude spectrum is susceptible to noise and other environmental conditions that can degrade the signal quality


and hence the models’ performance. Yet, more future work is recommended to study the impact of various conditions on the system performance.

Fig. 4. Performance evaluation of the four-class classification problem (Background - Bebop - AR - Phantom)

4 Conclusion

UAVs are becoming more common, posing a threat to public safety and personal privacy. In order to reduce these threats, it is critical to efficiently identify invading UAVs. In this research, we used the FFT to analyze radio frequency signals emitted by civilian drones, implemented deep learning-based models for drone detection and classification, compared the signal magnitudes and phases of drone and background activities, and evaluated the effectiveness of the detection system using several evaluation metrics. The experimental results show that using the magnitude of the segmented signal has a different predictive performance than using the phase feature. When using the signal phase based features to solve the binary classification problem, the classification accuracy is 81.894%, which is about 16% lower than when using the signal magnitude based features. For future research, it is suggested to add new types of drones, such as fixed-wing and hybrid-wing drones, to the dataset. This work can also be extended by investigating phase and magnitude features for operation-mode classification with deteriorated RF signals.

Acknowledgments. The authors would like to thank King Fahd University of Petroleum and Minerals for providing support during this work. The second author would also like to acknowledge the fellowship support at SDAIA-KFUPM Joint Research Center for Artificial Intelligence under grant no. JRC-AI-RFP-04.


References 1. Al-Emadi, S., Al-Senaid, F.: Drone detection approach based on radio-frequency using convolutional neural network. In: IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT), pp. 29–34 (2020) 2. Al-Sa’d, M.F., Al-Ali, A., Mohamed, A., Khattab, T., Erbad, A.: RF-based drone detection and identification using deep learning approaches: an initiative towards a large open source drone database. Futur. Gener. Comput. Syst. 100, 86–97 (2019) 3. Allahham, M.S., Al-Sa’d, M.F., Al-Ali, A., Mohamed, A., Khattab, T., Erbad, A.: DroneRF dataset: a dataset of drones for RF-based detection, classification and identification. Data Brief 26, 104313 (2019) 4. Allahham, M.S., Khattab, T., Mohamed, A.: Deep learning for RF-based drone detection and identification: a multi-channel 1-D convolutional neural networks approach. In: IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT), pp. 112–117 (2020) 5. Bengio, Y., Goodfellow, I., Courville, A.: Deep Learning, vol. 1. MIT Press, Massachusetts (2017) 6. Bisio, I., Garibotto, C., Lavagetto, F., Sciarrone, A., Zappatore, S.: Unauthorized amateur UAV detection based on WiFi statistical fingerprint analysis. IEEE Commun. Mag. 56(4), 106–111 (2018) 7. Custers, B. (ed.): The Future of Drone Use. ITLS, vol. 27. T.M.C. Asser Press, The Hague (2016). https://doi.org/10.1007/978-94-6265-132-6 8. Ezuma, M., Erden, F., Anjinappa, C.K., Ozdemir, O., Guvenc, I.: Detection and classification of UAVs using RF fingerprints in the presence of Wi-Fi and Bluetooth interference. IEEE Open J. Commun. Soc. 1, 60–76 (2019) 9. Ezuma, M., Erden, F., Anjinappa, C.K., Ozdemir, O., Guvenc, I.: Micro-UAV detection and classification from RF fingerprints using machine learning techniques. In: IEEE Aerospace Conference, pp. 1–13 (2019) 10. Liu, Z., Li, Z., Liu, B., Fu, X., Raptis, I., Ren, K.: Rise of mini-drones: applications and issues. In: Workshop on Privacy-Aware Mobile Computing, pp. 7–12 (2015) 11. Mahendran, S., Asokan, R., Kumar, A., Ria, V., Jayadeep, S.: Development of the flapping wing for ornithopters: a numerical modelling. Int. J. Ambient Energy 43(1), 795–802 (2022). https://doi.org/10.1080/01430750.2019.1662841 12. Merkert, R., Bushell, J.: Managing the drone revolution: a systematic literature review into the current use of airborne drones and future strategic directions for their effective control. J. Air Transp. Manag. 89, 101929 (2020) 13. Mohanti, S., Soltani, N., Sankhe, K., Jaisinghani, D., Di Felice, M., Chowdhury, K.: AirID: injecting a custom RF fingerprint for enhanced UAV identification using deep learning. In: IEEE GLOBECOM 2020-IEEE Global Communications Conference, pp. 370–378 (2020) 14. Nguyen, P., Ravindranatha, M., Nguyen, A., Han, R., Vu, T.: Investigating costeffective RF-based detection of drones. In: 2nd Workshop on Micro Aerial Vehicle Networks, Systems, and Applications for Civilian Use, pp. 17–22 (2016) 15. Nguyen, P., Truong, H., Ravindranathan, M., Nguyen, A., Han, R., Vu, T.: Matthan: drone presence detection by identifying physical signatures in the drone’s RF communication. In: 15th Annual International Conference on Mobile Systems, Applications, and Services, pp. 211–224 (2017) 16. Ozturk, E., Erden, F., Guvenc, I.: RF-based low-SNR classification of UAVs using convolutional neural networks. arXiv preprint arXiv:2009.05519 (2020)


17. Samland, F., Fruth, J., Hildebrandt, M., Hoppe, T., Dittmann, J.: AR.Drone: security threat analysis and exemplary attack to track persons. In: Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, vol. 8301, p. 83010G. International Society for Optics and Photonics (2012) 18. Schmidt, M.S., Shear, M.D.: A drone, too small for radar to detect, rattles the white house. New York Times 26 (2015) 19. Shi, X., Yang, C., Xie, W., Liang, C., Shi, Z., Chen, J.: Anti-drone system with multiple surveillance technologies: architecture, implementation, and challenges. IEEE Commun. Mag. 56(4), 68–74 (2018) ˙ Turgut, E., Gursoy, M.C.: RSS-based detection 20. Sinha, P., Yapici, Y., G¨ uven¸c, I., of drones in the presence of RF interferers. In: 2020 IEEE 17th Annual Consumer Communications & Networking Conference (CCNC), pp. 1–6. IEEE (2020) 21. Vergouw, B., Nagel, H., Bondt, G., Custers, B.: Drone technology: types, payloads, applications, frequency spectrum issues and future developments. In: Custers, B. (ed.) The Future of Drone Use. ITLS, vol. 27, pp. 21–45. T.M.C. Asser Press, The Hague (2016). https://doi.org/10.1007/978-94-6265-132-6 2 22. Xiao, Y., Zhang, X.: Micro-UAV detection and identification based on radio frequency signature. In: IEEE 6th International Conference on Systems and Informatics (ICSAI), pp. 1056–1062 (2019) 23. Yaacoub, J.P., Salman, O.: Security analysis of drones systems: attacks, limitations, and recommendations. Internet Things, 100218 (2020)

Effective Skill Learning on Vascular Robotic Systems: Combining Offline and Online Reinforcement Learning

Hao Li1,2, Xiao-Hu Zhou1,2(B), Xiao-Liang Xie1,2, Shi-Qi Liu1,2, Mei-Jiang Gui1,2, Tian-Yu Xiang1,2, De-Xing Huang1,2, and Zeng-Guang Hou1,2(B)

1 State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
2 School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
{xiaohu.zhou,zengguang.hou}@ia.ac.cn

Abstract. Vascular robotic systems, which have gained popularity in the clinic, provide a platform for potentially semi-automated surgery. Reinforcement learning (RL) is an appealing skill-learning method to facilitate automatic instrument delivery. However, the notorious sample inefficiency of RL has limited its application in this domain. To address this issue, this paper proposes a novel RL framework, Distributed Reinforcement learning with Adaptive Conservatism (DRAC), that learns manipulation skills with a modest number of interactions. DRAC pretrains skills from rule-based interactions before online fine-tuning to utilize prior knowledge and improve sample efficiency. Moreover, DRAC uses adaptive conservatism to explore safely during online fine-tuning and a distributed structure to shorten training time. Experiments in a pre-clinical environment demonstrate that DRAC can deliver a guidewire to the target with less dangerous exploration and better performance than prior methods (success rate of 96.00% and mean backward steps of 9.54) within 20k interactions. These results indicate that the proposed algorithm is a promising way to learn skills for vascular robotic systems.

Keywords: Vascular robotic system · Reinforcement learning · Neural network

This work was supported in part by the National Natural Science Foundation of China under Grant 62003343, Grant 62222316, Grant U1913601, Grant 62073325, Grant U20A20224, and Grant U1913210; in part by the Beijing Natural Science Foundation under Grant M22008; in part by the Youth Innovation Promotion Association of Chinese Academy of Sciences (CAS) under Grant 2020140; and in part by the CIE-Tencent Robotics X Rhino-Bird Focused Research Program.

1 Introduction

Percutaneous coronary intervention (PCI) is a widely used treatment for coronary artery diseases, which affect millions of people worldwide and remain a leading cause of mortality [1]. Compared with traditional PCI, robot-assisted intervention has shown less X-ray exposure and higher control precision in experimental and clinical studies, and is gaining popularity [2–4]. In robot-assisted intervention, physicians complete the intervention using vascular robotic systems with a master-slave structure. The master console interfaces with physicians and sends commands to the slave console, while the slave console executes the commands and delivers instruments to target locations in vessels for subsequent treatment such as stenting and drug delivery. To advance in tortuous vessels, catheters, guidewires, and other instruments are designed as bendable thin wires. The relationship between commands and instrument motion is non-linear and complex, making delivery challenging. Thus, acquiring proficient skills requires a long period of training, and physicians may bear a heavy cognitive load during intervention. To alleviate these problems, vascular robotic systems are expected to have greater autonomy to help physicians with instrument delivery.

Reinforcement learning (RL) is a learning-based control method that maximizes reward functions from interactions [5], and has attracted great interest for skill learning on vascular robotic systems [6–10]. With the position and orientation of instruments obtained by an electromagnetic tracking sensor, RL can achieve performance comparable to experts [6]. However, it is difficult to integrate instruments and sensors in clinical practice, where physicians usually only use X-ray images for navigation. Using images as input is a more clinically realistic approach, but it greatly reduces the sample efficiency of RL and increases the interaction requirement [11]. Input reconstruction with an auto-encoder structure has been used to assist the training of convolution layers and decrease the interaction requirement [8], but image reconstruction also requires diverse data, and it still takes over ten hours to train on a vascular model.

Similar to the popular pretraining-fine-tuning paradigm in supervised learning, RL can learn skills in a much shorter time by pretraining skills with offline interactions [12–14]. Offline interactions can be obtained from human demonstrations, a sub-optimal policy, or even just random exploration, while the pretraining method can be imitation learning or offline RL. For RL on vascular robotic systems, existing research has demonstrated that human demonstrations can accelerate training [6,9]. However, using human demonstrations runs counter to the goal of reducing the burden on physicians, since physicians have to provide demonstrations before RL. To minimize the burden on physicians, a simple manually-designed rule is better suited for collecting offline interactions. Due to the difficulty of the task, the delivery rule is not optimal in itself. As offline RL can achieve impressive performance with sub-optimal offline interactions [15], it has great potential to pretrain delivery skills with a manually-designed rule.


The main contributions of this paper are as follows:
1) A novel RL framework, Distributed Reinforcement learning with Adaptive Conservatism (DRAC), is proposed for fast skill learning in robot-assisted intervention. To the best of our knowledge, this is the first framework that combines offline and online RL in this field.
2) In online RL fine-tuning, the conservatism in Q-function estimation is dynamically adjusted to achieve safe and efficient exploration.
3) Online interaction collection and parameter updates are decoupled in a distributed paradigm, further shortening fine-tuning time.
4) Experiments show that the proposed framework can automate guidewire deliveries with a success rate of 96.00% and mean backward steps of 9.54 after 20k interactions, with fewer dangerous interactions in online fine-tuning, showing advantages in both performance and safety.

The paper is structured as follows: Section 2 introduces the problem definition. Section 3 elaborates the proposed method. Section 4 demonstrates and analyzes the performance. Section 5 summarizes this paper.

2 Problem Definition

A pre-clinical environment is used to validate our framework, as shown in Fig. 1(a). In this environment, the slave console of a vascular robotic system [4] (hereinafter referred to as the robot) needs to deliver the guidewire from the outset to the target in a vascular model. The instrument delivery task is defined as a Partially Observable Markov Decision Process (POMDP) ⟨S, O, A, P, R, p_0, γ⟩, where S, O, and A denote the state space, observation space, and action space, respectively; P : S × A × S → R+ is the transition probability; R : S × A → R is the reward function; p_0 is the initial state distribution; and γ is the discount factor. The instrument delivery starts with an initial state s_0 ∼ p_0, where the distal tip of the guidewire is 10–15 pixels from the outset and the guidewire is rotated randomly. At time step t with state s_t ∈ S, the robot receives an observation o_t ∈ O and chooses an action a_t ∈ A. After the robot executes a_t, the state changes to s_{t+1} according to the transition probability and the robot receives a reward r_t = R(s_t, a_t). Learning instrument delivery is formalized as maximizing the cumulative reward Σ_t E_{o_t, a_t}[γ^t r_t]. Details of the problem definition are as follows.

Fig. 1. (a) The preclinical environment. (b) The outset, target and route in delivery. (c) The binary image of the vessel.

State and Observation: In the POMDP, the state refers to a representation that contains all information but is not directly available. In percutaneous coronary intervention, physicians use images from X-ray fluoroscopy for navigation. Referring to this clinical situation, images obtained from a fixed camera are taken as observations; they are vertical views of size 140 × 140. Each observation is preprocessed into binary images of the vessel and the guidewire, as shown in Fig. 1(c).

Action: The robot manipulates instruments through two degrees of freedom, translation and rotation. In the translation degree of freedom, the robot chooses to translate forward or backward by 2 mm. In the rotation degree of freedom, the robot chooses from stationary, clockwise, and counterclockwise rotation at two speeds; the rotation speeds are 10° and 20° per action, respectively. Taking translation and rotation together, there are ten actions in the action space A.

Reward: The reward is designed for safety and efficiency in RL and is shown in Fig. 2(a). Safety is the primary consideration during training. If motor torques exceed a safe threshold, the manipulation is considered unsafe and the delivery is terminated prematurely with reward −100. As long as the safety constraint is met, the reward encourages the robot to deliver the guidewire along the route, i.e., the shortest path from the outset to the target. If the guidewire goes off the route or returns to it, the reward is −20 or 20, respectively. If the guidewire stays on the route, the reward equals the decrease in distance to the target, Δd. If none of the above conditions is met, i.e., the guidewire remains in the wrong branch, the reward is 0.

The route and the distances of all points to the target are calculated before training. First, the vessel binary image is skeletonized, and each point on the skeleton is considered connected to its eight-neighbors with a distance of 1. Then, the Dijkstra algorithm is used to obtain the route and the distance of each skeleton point to the target. For other vessel points, whether they are on the correct path and their distance to the target are determined by the closest point on the skeleton.
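To make this pre-computation concrete, the following is a minimal sketch, assuming the vessel mask is a binary NumPy array and using scikit-image for skeletonization and NetworkX for Dijkstra. The library choices and function names are ours; the paper does not specify an implementation.

```python
import numpy as np
import networkx as nx
from skimage.morphology import skeletonize

def precompute_route(vessel_mask: np.ndarray, outset: tuple, target: tuple):
    """Skeletonize the vessel and run Dijkstra from the target.

    vessel_mask: binary vessel image of shape (H, W).
    outset, target: (row, col) pixels assumed to lie on the skeleton.
    Returns the route (list of skeleton pixels) and a dict mapping each
    skeleton pixel to its distance from the target.
    """
    skeleton = skeletonize(vessel_mask > 0)

    # Build a graph whose nodes are skeleton pixels; each pixel connects
    # to its 8-neighbors with edge weight 1, as described in the text.
    graph = nx.Graph()
    rows, cols = np.nonzero(skeleton)
    pixels = set(zip(rows.tolist(), cols.tolist()))
    for r, c in pixels:
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                if (dr, dc) != (0, 0) and (r + dr, c + dc) in pixels:
                    graph.add_edge((r, c), (r + dr, c + dc), weight=1)

    # Distance of every skeleton point to the target, and the route itself.
    dist_to_target = nx.single_source_dijkstra_path_length(graph, target)
    route = nx.dijkstra_path(graph, outset, target)
    return route, dist_to_target

# Off-skeleton vessel pixels inherit the distance and on-route label of the
# closest skeleton point (e.g. via a nearest-neighbor lookup on the skeleton).
```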

Fig. 2. (a) The reward in the POMDP. (b) The delivery rule for choosing the translation direction.

3 Method

Our framework starts by collecting rule-based interactions, and then learns skills from these interactions by imitation and offline RL. After offline RL, the skill is fine-tuned by online RL.

3.1 Rule-Based Interactions

A simple delivery rule is used to collect offline data to speed up RL. The delivery rule is designed according to the following prior knowledge to ensure its effectiveness for RL: 1) When the guidewire is far from bifurcations, the robot directly translates the guidewire forward without considering backward motion. 2) When the guidewire is near bifurcations, the robot needs to adjust the guidewire direction toward the correct branch. 3) Corresponding to the necessary exploration in RL, the delivery rule requires randomness to collect diverse data. Based on this prior knowledge, the delivery rule randomly selects subactions on the rotation degree of freedom, while choosing subactions on the translation degree of freedom as shown in Fig. 2(b). The rule has a probability δ of randomly choosing forward or backward. Otherwise, the subaction is chosen based on a variable b and on whether the guidewire is on the right path; b represents the number of backward subactions that should be taken before reattempting to cross a branch after entering the wrong branch. If the guidewire is on the right path and b is 0, the delivery rule selects the forward subaction; otherwise the backward subaction is chosen. b is initialized to 0 and set to the hyperparameter K when the guidewire enters the wrong branch and the reward is −20. In the following, E_D{·} denotes the expectation over the distribution of the rule-based interactions D.
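A minimal sketch of the translation part of this rule is given below. The default values of δ and K are placeholders, and the decrement of b after each backward step is our assumption; the paper only states how b is initialized and reset.

```python
import random

def choose_translation(on_route: bool, reward: float, state: dict,
                       delta: float = 0.1, K: int = 5) -> str:
    """Return 'forward' or 'backward' following the delivery rule.

    state holds the counter b between calls; delta is the exploration
    probability and K the number of backward steps after a wrong branch.
    """
    if reward == -20:                      # guidewire just entered a wrong branch
        state["b"] = K

    if random.random() < delta:            # random exploration
        return random.choice(["forward", "backward"])

    if on_route and state["b"] == 0:
        return "forward"
    state["b"] = max(0, state["b"] - 1)    # assumption: b counts down per backward step
    return "backward"

# The rotation subaction is sampled uniformly from the five rotation options.
```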

3.2 Network Structure

The neural network follows the structure in our previous work on offline RL [10]. As shown in Fig. 3, the network takes a_{t−1}, o_{t−1}, and o_t as inputs. Image inputs o_{t−1} and o_t are compressed by the encoder. Multilayer perceptrons (MLPs) use the output of the encoder and the one-hot encoded a_{t−1} to estimate the Q-function Q(o_t, o_{t−1}, a_{t−1}) and the policy π(o_t, o_{t−1}, a_{t−1}). In the following, for simplicity, o_{t−1} and a_{t−1} are omitted, and the elements corresponding to action a_t in Q(o_t) and π(o_t) are denoted Q(o_t, a_t) and π(o_t, a_t). To reduce the error in Q-function estimation, the Q-function is estimated as the minimum of two estimates Q_1 and Q_2 [16]:

Q(o_t, a_t) = min{ Q_1(o_t, a_t), Q_2(o_t, a_t) }.    (1)

To deal with over-fitting caused by high-dimensional image inputs, the encoder uses an Adaptive Local Signal Mixing layer (A-LIX) [17] to smooth the gradient adaptively. A-LIX performs random shifts with an adaptive range on feature maps. During training, the A-LIX forward pass is as follows: for the input feature maps, A-LIX first samples two uniform continuous random variables δ_h, δ_w ∼ U[−S, S], representing shifts in the height and width coordinates, respectively, where S denotes the shift range; the output is then computed through bilinear interpolation. At test time, A-LIX is disabled and simply returns its input.
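As an illustration, a minimal PyTorch sketch of this random-shift-and-interpolate operation might look as follows. This is our reading of the description above, not the official A-LIX implementation; in the full method S is adjusted automatically as described in Sect. 3.3.

```python
import torch
import torch.nn.functional as F

def alix_shift(x: torch.Tensor, S: float, training: bool = True) -> torch.Tensor:
    """Randomly shift feature maps by (dh, dw) ~ U[-S, S] pixels with
    bilinear interpolation; identity at test time. x: (N, C, H, W)."""
    if not training or S <= 0:
        return x
    n, c, h, w = x.shape
    dh = (torch.rand(n, device=x.device) * 2 - 1) * S   # shift in pixels
    dw = (torch.rand(n, device=x.device) * 2 - 1) * S

    # Build a sampling grid in normalized [-1, 1] coordinates.
    ys = torch.linspace(-1, 1, h, device=x.device)
    xs = torch.linspace(-1, 1, w, device=x.device)
    gy, gx = torch.meshgrid(ys, xs, indexing="ij")
    grid = torch.stack((gx, gy), dim=-1).unsqueeze(0).repeat(n, 1, 1, 1)
    grid[..., 0] += (2 * dw / max(w - 1, 1)).view(n, 1, 1)
    grid[..., 1] += (2 * dh / max(h - 1, 1)).view(n, 1, 1)

    return F.grid_sample(x, grid, mode="bilinear",
                         padding_mode="border", align_corners=True)
```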

Fig. 3. The network structure.

3.3 Offline Pretraining

In offline pretraining, DRAC first imitates the offline interactions and then uses offline RL to update the policy. The policy imitates the delivery rule by minimizing the cross-entropy:

J_im(π) = −E_D{ log[π(o_t, a_t)] }.    (2)

The Q-function is optimized by the loss function J_offline(Q), which is a combination of the Bellman error J_B(Q) and the advantage of the offline data J_A(Q), as in [18]:

J_B(Q) = E_D{ Σ_{i=1,2} [r_t + γ V^π(o_{t+1}) − Q_i(o_t, a_t)]² },
J_A(Q) = E_D{ Q(o_t, a_t) − V^π(o_t) },
J_offline(Q) = J_B(Q) + β J_A(Q).    (3)

The hyperparameter β balances J_B and J_A. The value function V^π is computed as follows:

V^π(o_t) = w π(o_t)^T Q̄(o_t) + (1 − w) π(o_t)^T Q(o_t).    (4)

Q̄ denotes the target Q-function, whose parameters are the exponentially moving average of the Q-function parameters, and w is a hyperparameter that balances Q̄ and Q. After imitation for a fixed number of steps, the policy π is trained by maximizing the Q-function while keeping a reasonable entropy, with the policy loss function J(π):

J(π) = E_D{ αH[π(o_t)] − V^π(o_t) },    (5)

where H(·) stands for entropy. α is a non-negative parameter updated by minimizing the following loss function:

J(α) = E_D{ αH[π(o_t)] − αH̄ },    (6)

where the hyperparameter H̄ is the target entropy. Since the gradient of J(π) is unstable, the gradient of J(π) is prevented from updating the encoder [19].

In offline RL, the shift range S and the sampling probabilities of transitions are also updated. S is automatically adjusted by a Lagrangian relaxation to keep the gradient smoothness in a reasonable range. For the A-LIX output gradient ∇ẑ, its smoothness can be measured by the Modified Normalized Discontinuity Score ND [17]:

ND(∇ẑ) = Σ_{c=1}^{C} Σ_{h=1}^{H} Σ_{w=1}^{W} log( 1 + D_{∇ẑ}(c, h, w) / ∇ẑ(c, h, w)² ),    (7)

where C, H, and W represent the channel number, height, and width, respectively. D_{∇ẑ} is the expected squared local discontinuity of ∇ẑ in any spatial direction v:

D_{∇ẑ} = E_v[ (∂∇ẑ / ∂v)² ].    (8)

A smaller ND(∇ẑ) means that ∇ẑ is smoother, and S is updated by minimizing the following loss function:

J(S) = −S × E_D{ ND(∇ẑ) − ND_target },    (9)

where the hyperparameter ND_target is the target value of ND(∇ẑ).

To pay more attention to difficult manipulations, the sampling probability of transition i is proportional to its weight f_i:

p(i) = f_i / Σ_{j=1}^{N} f_j,    (10)

where N denotes the number of transitions, and f_i is updated as follows after transition i is sampled:

f_i = (1 − τ) f_i + τ min( |δ_TD^i| + ε_1, ε_2 ),
δ_TD^i = r_i + γ V^π(o_{i+1}) − Q(o_i, a_i).    (11)

δ_TD^i is the TD error of transition i, and the hyperparameter τ controls the update speed. ε_1 and ε_2 are positive constants that prevent too small or too large weights. Transition weights are initialized to ε_2.
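For the discrete action space used here, a minimal PyTorch sketch of the Q-function update of Eqs. (1), (3), and (4) could look as follows. The tensor shapes and names are our own; the entropy-regularized policy loss and the A-LIX and priority updates are omitted.

```python
import torch

def value(pi, q_online, q_target, w=0.5):
    """Eq. (4): V^pi(o) = w * pi^T Qbar(o) + (1 - w) * pi^T Q(o).
    pi, q_online, q_target: (batch, num_actions); returns (batch,)."""
    return w * (pi * q_target).sum(-1) + (1 - w) * (pi * q_online).sum(-1)

def offline_q_loss(q1, q2, actions, rewards, v_next, v_curr,
                   gamma=0.99, beta=1.0):
    """Eq. (3): Bellman error J_B over both Q-heads plus the offline
    advantage term J_A, combined as J_offline = J_B + beta * J_A."""
    q1_a = q1.gather(1, actions.unsqueeze(1)).squeeze(1)
    q2_a = q2.gather(1, actions.unsqueeze(1)).squeeze(1)
    target = (rewards + gamma * v_next).detach()
    j_b = ((target - q1_a) ** 2 + (target - q2_a) ** 2).mean()
    q_a = torch.minimum(q1_a, q2_a)        # Eq. (1): take the smaller estimate
    j_a = (q_a - v_curr).mean()            # advantage of the offline data
    return j_b + beta * j_a
```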

3.4 Online Fine-Tuning

After offline pretraining with rule-based interactions, online RL is used to fine-tune the parameters. Compared with the offline RL described above, there are two differences in online RL, namely the interactions used and the Q-function update.


In addition to the offline rule-based interactions, online RL also uses the interactions collected with the learned policy. To shorten the training time, online RL decouples interaction collection and parameter update. Online RL is deployed in a distributed structure containing an actor and a learner, as shown in Fig. 4. The actor chooses actions using the encoder and the policy and interacts with the environment. After each interaction, the actor packs an interaction in the form of (o_{t−1}, o_t, a_{t−1}, a_t, r_t, o_{t+1}) and sends it to the learner. The learner stores all received interactions in a replay buffer and samples interactions from the replay buffer to update the parameters. The replay buffer is initialized with rule-based interactions, and the oldest interactions are replaced with new ones when the capacity is reached. After every 50 interactions, the actor copies the parameters from the learner. The actor and the learner run asynchronously and use Ray for communication [20].

An intuitive idea is to directly use J_B for the Q-function update in online fine-tuning. However, due to the incompleteness of the offline interactions, the Q-function obtained by J_B often overestimates some actions and leads to dangerous exploration, which may cause harm to the robot and instruments. To enforce safety, DRAC estimates the Q-function with adaptive conservatism in online fine-tuning. As in offline RL, DRAC uses J_A(Q) for conservatism. As the number of online interactions increases, the robot performs better and can explore with less conservatism. Thus, DRAC updates the Q-function in online fine-tuning as follows:

J_online(Q) = J_B + k^n β J_A,    (12)

where k denotes the ratio of offline interactions in the replay buffer and controls the conservatism, and n is a hyperparameter. The other parts are updated in the same way as in offline RL.
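A minimal sketch of this decoupled actor-learner loop with Ray is shown below. It is a dummy illustration of the decoupling only (the "weights" are a single number and transitions are tuples), not the authors' implementation; class and method names are hypothetical.

```python
import random
import ray

ray.init()

@ray.remote
class Learner:
    """Holds the replay buffer (initialized with rule-based interactions)
    and the latest parameters; updates run independently of the actor."""
    def __init__(self, offline_transitions, capacity=20000):
        self.buffer = list(offline_transitions)
        self.capacity = capacity
        self.weights = 0.0

    def add(self, transition):              # (o_{t-1}, o_t, a_{t-1}, a_t, r_t, o_{t+1})
        if len(self.buffer) >= self.capacity:
            self.buffer.pop(0)              # replace the oldest interaction
        self.buffer.append(transition)

    def get_weights(self):
        return self.weights

    def train_step(self):
        batch = random.sample(self.buffer, min(32, len(self.buffer)))
        self.weights += 1e-3 * len(batch)   # stand-in for one J_online update (Eq. 12)
        return self.weights

# Actor side: collect interactions and sync parameters every 50 steps.
learner = Learner.remote([("o0", "o1", 0, 1, 0.5, "o2")] * 100)
weights = ray.get(learner.get_weights.remote())
for step in range(200):
    transition = ("o_prev", "o_t", 0, random.randrange(10), 1.0, "o_next")
    learner.add.remote(transition)
    learner.train_step.remote()             # queued on the learner, not blocking the actor
    if step % 50 == 0:
        weights = ray.get(learner.get_weights.remote())
```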

Fig. 4. The distributed structure in online fine-tuning.

4 Results

In the experiments, the hyperparameter n is set to 1, and we use 10k rule-based interactions and 10k online interactions. Other hyperparameters of DRAC are set as in our previous work that uses fully offline RL [10]. All experiments are trained on an NVIDIA 3090 GPU with 3 random seeds and tested in 50 deliveries unless otherwise stated. DRAC is compared with the following baselines, which use the same network structure where applicable:
1) Offline RL: Using imitation and offline RL as in Sect. 3.3 to train parameters, without online RL fine-tuning. For comparison, offline RL is trained with 20k rule-based interactions, which equals the sum of offline and online interactions for our method.
2) Online RL: Directly using online RL as described in Sect. 3.4, without pretraining or rule-based interactions. Online RL utilizes 20k interactions during training.
3) DRAC w/o adaptive conservatism (DRAC w/o A-C): Same as DRAC, except that the Q-function is updated using J_B without J_A in online RL fine-tuning.
4) Rule-based: Using the rule designed in Sect. 3.1, tested in 400 deliveries.
All methods are measured in success rate, backward steps, and episode reward. Backward steps refer to the mean number of backward translations in successful deliveries. Episode reward is the mean of the reward sum over deliveries.

Table 1. Performance of DRAC and baselines

Method        | Success Rate    | Backward Steps | Episode Reward
DRAC (ours)   | 96.00% ± 1.63%  | 9.54 ± 3.62    | 126.97 ± 1.32
Online RL     | 49.33% ± 11.12% | 34.70 ± 5.90   | 104.47 ± 4.02
Offline RL    | 69.33% ± 3.40%  | 41.33 ± 4.00   | 113.86 ± 2.39
DRAC w/o A-C  | 84.67% ± 2.49%  | 23.40 ± 5.35   | 122.72 ± 2.95
Rule-based    | 19.00%          | 45.00          | 37.49

Fig. 5. Guidewire motions of different methods in ten deliveries. Each color represents a delivery. Red rectangles highlight triple bifurcations that are more difficult than other parts. (a) DRAC. (b) Online RL. (c) Offline RL. (d) DRAC w/o A-C. (Color figure online)


The results are shown in Table 1. DRAC has the highest success rate (96.00%), the fewest backward steps (9.54), and the highest reward (126.97), outperforming all baselines. This result indicates the efficiency of our method. Besides, DRAC has the smallest variance in all metrics, which shows the stability of the proposed method. Rule-based delivery performs poorly, with a success rate of about 20%, which is significantly worse than the other methods. Even with this poorly-performing rule, offline RL and DRAC both perform better than online RL. This shows that the designed delivery rule can significantly speed up RL despite its low quality. Offline RL with 20k interactions performs worse than DRAC and DRAC w/o A-C, which use both offline and online interactions, showing that online fine-tuning is beneficial for improving performance. As for time efficiency, DRAC completes training within 8.79 h, where offline sample collection, offline pre-training, and online fine-tuning take 2.52, 2.92, and 3.35 h, respectively. In addition, due to the shallow neural network, DRAC is able to choose an action within 1 ms without GPU acceleration.

To visualize the delivery process, Fig. 5 shows motions of the distal tip in ten deliveries. As shown in Fig. 5(a), DRAC goes off the route less than the other methods, especially near triple bifurcations (red rectangles in Fig. 5) that require finer manipulations. This result indicates that DRAC has higher accuracy in delivery and requires fewer backward steps to correct the orientation. Moreover, motions with DRAC are more concentrated, which shows that our method can maintain better stability under the random initialization described in Sect. 2.

Fig. 6. The metric curves in online fine-tuning.


The performance during online fine-tuning is visualized in Fig. 6. All metrics are calculated from the last ten deliveries of every thousand interactions. Due to the utilization of 10k offline interactions, the curves of DRAC and DRAC w/o A-C start from 11k interactions. The episode reward of DRAC rises rapidly and remains stable, demonstrating a clear advantage over the other two methods. Online RL has a higher episode reward than DRAC w/o A-C in the early stage of training but is surpassed by DRAC w/o A-C later. Moreover, after 10k interactions, the success rate of online RL does not increase significantly. However, there is a clear upward trend in the success rate of DRAC during online interactions, while DRAC w/o A-C has a slower upward trend. As for backward steps, both online RL and DRAC w/o A-C show no obvious decline but only fluctuations, while the backward steps of DRAC decrease steadily. Furthermore, DRAC has a consistently low danger rate, whereas the other two methods have non-negligible danger rates, especially in the initial stage of online interactions. In summary, the proposed method DRAC learns faster and more stably across the various metrics, and is safer in online interactions.

In online fine-tuning, although the success rate of online RL is significantly lower than that of DRAC and DRAC w/o A-C, the gap in episode reward is not as obvious. Since all methods have a danger rate of less than 0.1 after 20k interactions, the episode reward largely depends on the guidewire position at the end of delivery. Most failed deliveries get stuck at the last triple bifurcation, having completed five-sixths of the route. Therefore, the reward difference between a failed delivery and a successful delivery is only about 20, and the difference in episode reward is not as obvious as in success rate.

5 Conclusions

A novel RL framework, DRAC, is proposed for fast RL on vascular robotic systems. By pretraining neural networks with rule-based interactions, DRAC utilizes prior knowledge and efficiently learns manipulation skills. By exploring with adaptive conservatism in online fine-tuning, DRAC achieves better performance and safety during online interactions. Moreover, with a distributed deployment structure, DRAC can collect data and update parameters in parallel during online fine-tuning. Experiments show that DRAC considerably outperforms pure online RL or offline RL in success rate, backward steps, and episode reward. Future work will continue in more realistic and more challenging settings and will try to generalize learned skills among various instruments and diverse vascular models.


References 1. Wang, H., et al.: Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: a systematic analysis for the global burden of disease study 2015. Lancet (London, England) 388, 1459–1544 (2016) 2. Granada, J.F., et al.: First-in-human evaluation of a novel robotic-assisted coronary angioplasty system. J. Am. Coll. Cardiol. Intv. 4(4), 460–465 (2011) 3. Guo, S., et al.: A novel robot-assisted endovascular catheterization system with haptic force feedback. IEEE Trans. Rob. 35(3), 685–696 (2019) 4. Zhao, H.-L., et al.: Design and performance evaluation of a novel vascular robotic system for complex percutaneous coronary interventions. In: Proceedings of 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 4679–4682 (2021) 5. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press (2018) 6. Chi, W., et al.: Collaborative robot-assisted endovascular catheterization with generative adversarial imitation learning. In: Proceedings of 2020 IEEE International Conference on Robotics and Automation, pp. 2414–2420 (2020) 7. Karstensen, L., et al.: Autonomous guidewire navigation in a two dimensional vascular phantom. Current Dir. Biomed. Eng. 6, 20200007 (2020) 8. Li, H., et al.: Discrete soft actor-critic with auto-encoder on vascular robotic system. Robotica 41, 1115–1126 (2022) 9. Kweon, J., et al.: Deep reinforcement learning for guidewire navigation in coronary artery phantom. IEEE Access 9, 166409–166422 (2021) 10. Li, H., Zhou, X.-H., Xie, X.-L., Liu, S.-Q., Feng, Z.-Q., Hou, Z.-G.: CASOG: conservative actor-critic with SmOoth gradient for skill learning in robot-assisted intervention. Arxiv (2020) 11. Yarats, D., Fergus, R., Lazaric, A., Pinto, L.: Mastering visual continuous control: improved data-augmented reinforcement learning. ArXiv, abs/2107.09645 (2021) 12. Nair, A., Dalal, M., Gupta, A., Levine, S.: Accelerating online reinforcement learning with offline datasets. ArXiv, abs/2006.09359 (2020) 13. Lu, Y.: AW-Opt: learning robotic skills with imitation and reinforcement at scale. In: Conference on Robot Learning (2021) 14. Kalashnikov, D.: QT-Opt: scalable deep reinforcement learning for vision-based robotic manipulation. ArXiv, abs/1806.10293 (2018) 15. Fu, J., Kumar, A., Nachum, O., Tucker, G., Levine, S.: D4RL: datasets for deep data-driven reinforcement learning. ArXiv, abs/2004.07219 (2020) 16. Fujimoto, S., van Hoof, H., Meger, D.: Addressing function approximation error in actor-critic methods. In: Proceedings of the 35th International Conference on Machine Learning, pp. 1582–1591 (2018) 17. Cetin, E., Ball, P.J., Roberts, S.J., C ¸ eliktutan, O.: Stabilizing off-policy deep reinforcement learning from pixels. In: Proceedings of the 39th International Conference on Machine Learning, pp. 2784–2810 (2022) 18. Cheng, C.-A., Xie, T., Jiang, N., Agarwal, A.: Adversarially trained actor critic for offline reinforcement learning. In: Proceedings of the 39th International Conference on Machine Learning, pp. 3852–3878 (2022)


19. Yarats, D., et al.: Improving sample efficiency in model-free reinforcement learning from images. In: Proceedings of 35th AAAI Conference on Artificial Intelligence, pp. 10674–10681 (2021) 20. Moritz, P.: Ray: a distributed framework for emerging AI applications. Arxiv, abs/1712.05889 (2017)

Exploring Efficient-Tuned Learning Audio Representation Method from BriVL

Sen Fang1, Yangjian Wu2, Bowen Gao1, Jingwen Cai1, and Teik Toe Teoh3(B)

1 Victoria University, Footscray, Australia
{sen.fang,bowen.gao,jingwen.cai}@live.vu.edu.au
2 Hainan University, Haikou, China
[email protected]
3 Nanyang Technological University, Singapore, Singapore
[email protected]

Abstract. Recently, there has been an increase in the popularity of multimodal approaches in audio-related tasks, which involve using not only the audible modality but also textual or visual modalities in combination with sound. In this paper, we propose a robust audio representation learning method, WavBriVL, based on Bridging-Vision-and-Language (BriVL). It projects audio, image, and text into a shared embedding space so that multi-modal applications can be realized. We tested it on several downstream tasks, presented the images rearranged by our method, and evaluated them qualitatively and quantitatively. The main purposes of this article are to: (1) explore new correlation representations between audio and images; (2) explore a new way to generate images from audio. The experimental results show that this method can effectively match audio and images.

Keywords: Audio-Visual · Multimodal Learning · Generative Adversarial Network · Speech Representation Learning

1 Introduction

Data volumes are crucial for training large-scale language models, and the use of vast corpora has been the standard practice [15]. However, the limited availability of high-quality labeled data in both unimodal and multimodal directions has hindered the development of the field [6]. To address this constraint, researchers have turned to zero-shot and few-shot learning approaches that rely on contrastive learning methods using textual descriptions. While the use of additional modalities together with text is common, the combination of more than two modalities in the audio domain is still uncommon. Given that manual data annotation in supervised learning is prohibitively expensive, self-supervised learning holds significant value for training large models. To overcome resource constraints and expand the research field, we present a novel multimodal self-supervised model based on the cutting-edge work of Bridging-Vision-and-Language [10].


On the basis of this work, we propose an audio-visual correspondence model whose training is extracted from the BriVL model. BriVL is a text-image correspondence model, similar to OpenAI CLIP [21] and Google ALIGN [14], published in Nature Communications. Like CLIP, BriVL can rearrange images based on how well they match a given text to find the best match. Our straightforward procedure is to freeze the BriVL visual encoder and part of the weights, run video through the visual stream of the model, and train a new model to predict the BriVL embeddings independently from the audio stream. The method described above is very simple; it supports further training, image output, and extension to more tasks. In addition, WavBriVL embeddings originate from BriVL, which means they align with text. This makes audio-guided image repair [30], audio captioning, and cross-modal text/audio to audio/image retrieval possible. We found that WavBriVL is easy to extend to large video datasets. We systematically evaluated WavBriVL on a variety of audio tasks, including classification and retrieval, compared it with other audio representation learning methods as well as the SOTA results of each task, and showed how to apply WavBriVL to solve multiple multi-modal and zero-shot tasks.

2 Related Works

Our work is motivated by recent advancements in the field of multimodal learning, particularly in the first half of 2022. BriVL has been shown to outperform CLIP [21] in various benchmarks, and Microsoft's new WavLM [3] has surpassed their previous Wav2Vec [1] in multiple aspects. We hypothesize that combining these two models could achieve better results than Wav2CLIP [28]. Research on giving AI multi-modal perception and reasoning has been ongoing for many years; however, image generation from audio input is still an emerging field. With the emergence of different generative models, such as the GAN introduced by Goodfellow in 2014, there has been a great deal of excellent work in image generation [5,16]. From unimodal to multi-modal generation, and from text guidance (around 2015 and later) to audio guidance (around 2020 and later), there are many impressive works [20,32]. Naturally, there exist numerous antecedent endeavors and unacknowledged contributions. Since then, there have been significant advancements in the field of image generation. Some of the most popular techniques include Stable Diffusion models [22] and ControlNet [29], and a diverse range of applications has utilized these techniques, showcasing their potential for generating high-quality images.

Among works extending CLIP to new modalities, AudioCLIP and related works such as Wav2CLIP (https://github.com/descriptinc/lyrebird-wav2clip) are similar to, but different from, our work. Compared to Wav2CLIP, BriVL has less dependence on specific semantic relationships, resulting in more creativity. Compared with them, we adopt a simple and efficient dual-tower architecture that does not rely on time-consuming object detectors. Finally, BriVL incorporates a cross-modal contrastive learning algorithm based on MoCo [12], which differs from the approach in CLIP. Overall, while our work builds upon existing multi-modal learning research and, for now, verifies our idea in the image generation area, we introduce novel techniques and a unique approach that contributes to the field of audio-guided image generation. Furthermore, we conducted a comprehensive series of multimodal task tests that successfully demonstrated the efficacy of our proposed method.

Fig. 1. Bridge Vision-Language Pre-training (BriVL), and our two-stage approaches including pre-training and evaluation.

3 Methodology and Experiments

Our method uses the WavBriVL model to guide VQGAN [8] in generating output images through adversarial neural networks. This process utilizes meaningful embeddings in the shared embedding space: matching scores between audio and images are computed to rearrange the images, and this rearrangement idea is similar to CLIP. Our code is adapted from the official model code and similarity calculation tools (https://github.com/BAAI-WuDao/BriVL). We found that this method can generate images that are appropriate for a given audio input, as confirmed by feedback from the related experiments. Furthermore, our approach has the advantage of requiring less data than fully supervised models to achieve competitive performance in downstream tasks. Specifically, WavBriVL's pre-training is more efficient than competitive methods because it does not need to completely re-learn the visual model; instead, it only needs to train the audio model. This makes it a promising model for various applications.

Bridging-Vision-and-Language is a model trained on 650 million weakly semantically correlated image-text pairs. To achieve this, the researchers created a cross-modal contrastive learning algorithm based on MoCo [12], a monomodal contrastive learning method. A memory-bank mechanism is employed to maintain negative sample queues across different training batches, enabling a large number of negative samples to be used in contrastive learning. Additionally, BriVL has demonstrated state-of-the-art performance across various multi-modal tasks, including image annotation and zero-shot classification. As shown in Fig. 1, we replace the text encoder with an audio encoder by freezing the visual model of BriVL, running images through it, and training the new model so that the matching image embedding is predicted from the audio alone. After the audio encoder is trained, we freeze it and use it in the WavBriVL image generation experiments and evaluate the results. The assessment tasks include sound-volume assessment and qualitative and quantitative image quality assessment. The rearranged images are all provided by selecting from the 100th epoch of the same 20 text inputs.
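The following PyTorch-style sketch summarizes this idea: a trainable audio encoder is aligned with the frozen BriVL image embeddings, and candidate images are then rearranged by their audio-image matching score. It is a simplified illustration, not the authors' code; the symmetric InfoNCE objective is one plausible alignment loss, as the paper does not specify the exact objective.

```python
import torch
import torch.nn.functional as F

def alignment_loss(audio_emb, image_emb, temperature=0.07):
    """Symmetric InfoNCE between audio embeddings and frozen image embeddings.
    audio_emb, image_emb: (batch, d), already projected to the shared space."""
    a = F.normalize(audio_emb, dim=-1)
    v = F.normalize(image_emb, dim=-1)
    logits = a @ v.t() / temperature            # audio-image similarity matrix
    targets = torch.arange(a.size(0), device=a.device)
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))

def rank_images_by_audio(audio_emb, image_embs):
    """Rearrange candidate images by their matching score with one audio clip.
    audio_emb: (d,); image_embs: (N, d); returns indices sorted by relevance."""
    scores = F.normalize(image_embs, dim=-1) @ F.normalize(audio_emb, dim=-1)
    return torch.argsort(scores, descending=True)
```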

Table 1. Downstream tasks, including 1. classification: multi-class (MC) and zero-shot (ZS); 2. retrieval: audio retrieval (AR) and cross-modal retrieval (CMR); and 3. audio captioning (AC); with various numbers of clips and classes and the common metrics.

Dataset           | Task  | Clip (Split)    | Class | Metric
ESC-50 [19]       | MC/ZS | 2k (5 folds)    | 50    | ACC
UrbanSound8K [23] | MC/ZS | 8k (10 folds)   | 10    | ACC
VGGSound [2]      | MC/ZS | 185k            | 309   | mAP
DESED [26]        | AR    | 2.5k (valid)    | 10    | F1
VGGSound [2]      | CMR   | 15k (test)      | 309   | MRR
Clotho [7]        | AC    | 5k (evaluation) | -     | COCO

3.1 Dataset for WavBriVL Performance Test

Our experiments encompassed a broad range of datasets, incorporating varying numbers of clips and categories. Additionally, we performed diverse tasks such as classification, retrieval, and generation to ensure a comprehensive performance evaluation. For evaluation, we use the relevant metrics detailed in Table 1 for each task. BriVL needs more than 100 A100 graphics cards to train for 10 days, so we do not consider retraining it; our training and performance testing are based on the pre-trained model.

3.2 Dataset for VQGAN

In our study, we utilized one video dataset, namely VGG-Sound [2], to conduct the main experiment. The VGG-Sound dataset comprises audio clips spanning 310 video classes and 200,000 audio samples, which present challenging acoustic environments and realistic noise characteristics. These audio samples are associated with non-man-made videos; our model randomly selected one image from each sample video, with an audio sampling rate of 16 kHz. We used 20,000 video clips for training and selected another part for evaluation. It is noteworthy that the original BriVL model used Chinese datasets, while VGG-Sound's datasets are in English. Since we only attempted evaluation tasks based on image generation, the language of the dataset has no significant impact at the moment and is sufficient for our analysis, so we only consider fine-tuning. Moreover, we incidentally cleared the original text-related weights to avoid potential impacts.

3.3 Feature Extraction Processing

Our image and audio encoders utilize EfficientNet-B7 [25] as the CNN in the former and WavLM [3] as the basic transformer in the latter. The self-attention block combines four Transformer encoder layers and an MLP block incorporating two fully connected layers and one ReLU activation layer. Our sorting code conducts forward operations on both the audio and image data before generating an image-audio matrix for sorting and output. Additionally, the code provides similarity scores for the processed vector matrix and the indices sorted by score, so the images can be sorted by relevance.

Image Encoder. We adopted BriVL's approach of applying random grayscale and color jitter to the input image for data augmentation. We standardized all videos at 1080P resolution and extracted frames (or 720P if not available), cropping them down to 480 × 480 pixels. To capture patch features and extract them via average pooling, we use a Transformer. To better capture patch-feature relationships, we employed a self-attention (SA) block containing multiple Transformer encoder layers, each comprising a multi-head attention (MHA) layer and a feed-forward network (FFN) layer [10]:

S′ = LayerNorm(S + MHA(S)),    (1)
S″ = LayerNorm(S′ + FFN(S′)).    (2)

Then, an average pooling layer fuses the extracted patch features:

r^(i) = (1/N_p) Σ_{j=1}^{N_p} S″_j ∈ R^c,    (3)

where S″_j is the j-th column of S″. A two-layer MLP block with a ReLU activation layer is adopted to project r^(i) to the joint cross-modal embedding space, resulting in the final d-dimensional image embedding z^(i) ∈ R^d.

Audio Encoder. In the case of audio input, our method initially converts the original 1D audio waveform into a 2D spectrum that serves as the input to WavLM. To output an embedding, we pool over the entire 512-dimensional audio sequence and compute the WavLM embedding as a weighted average of the outputs of all transformer layers. The WavLM model (https://github.com/microsoft/unilm/tree/master/wavlm), which was inspired by HuBERT, comprises a CNN encoder and a Transformer with L blocks. During training, some frames of the CNN encoder output x are randomly masked and fed to the Transformer as input. The Transformer is trained to predict the discrete target sequence z, in which each z_t ∈ [C] represents a C-class categorical variable, with the distribution over classes parameterized as

p(c | h_t) = exp(sim(W^P h_t^L, e_c)/τ) / Σ_{c′=1}^{C} exp(sim(W^P h_t^L, e_{c′})/τ).    (4)

In Equation (4), W^P represents a projection matrix that is optimized during training. The output hidden state for step t, denoted h_t^L, is computed by the Transformer with L blocks. The embedding for class c is represented by e_c, the cosine similarity between a and b is calculated by sim(a, b), and the scaling factor τ = 0.1 adjusts the logit score [3]. During fine-tuning, the WavLM embedding is computed as the weighted average of all transformer layer outputs, with the weights learned during this stage; fine-tuning updates the parameters of WavLM to optimize the model for the specific downstream task.
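To make Eqs. (1)–(4) more concrete, a simplified PyTorch sketch of the patch-feature fusion (image side) and the weighted layer averaging (audio side) might look like this. Layer counts, dimensions, and the softmax over the layer weights are our own assumptions, and the class-prediction head of Eq. (4) is omitted.

```python
import torch
import torch.nn as nn

class PatchFusion(nn.Module):
    """Eqs. (1)-(3): self-attention block over patch features, then mean
    pooling and a two-layer MLP projection into the joint embedding space."""
    def __init__(self, c=2560, d=512, n_layers=4, n_heads=8):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=c, nhead=n_heads,
                                           batch_first=True)
        self.sa_block = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.mlp = nn.Sequential(nn.Linear(c, c), nn.ReLU(), nn.Linear(c, d))

    def forward(self, patches):            # patches: (batch, N_p, c)
        s = self.sa_block(patches)         # Eqs. (1)-(2)
        r = s.mean(dim=1)                  # Eq. (3): average over patches
        return self.mlp(r)                 # image embedding z^(i) in R^d

def weighted_layer_average(hidden_states, weights):
    """Audio side: pool WavLM as a learned weighted average of all
    transformer layer outputs; hidden_states is a list of (batch, T, c)."""
    w = torch.softmax(weights, dim=0)                 # (num_layers,)
    stacked = torch.stack(hidden_states, dim=0)       # (L, batch, T, c)
    mixed = (w.view(-1, 1, 1, 1) * stacked).sum(dim=0)
    return mixed.mean(dim=1)                          # pool over time
```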

4 Task 1: WavBriVL Performance Test

In this chapter, we begin by discussing the training, development, and evaluation process of the WavBriVL model. We use publicly available datasets of varying sizes and tasks, including classification, retrieval, and audio captioning tasks. We compare WavBriVL with several widely used methods that serve as strong benchmarks in this field and evaluate its performance on these tasks. Additionally, we investigate the effect of sound volume on the generated images: we hypothesize that the volume of a sound can influence the generated images, so we explore the influence of sound volume on the image features extracted from the sound with the sound correlation model. We also perform quantitative image analysis to evaluate the performance of WavBriVL compared with previous work, such as S2I and Pedersoli et al. We test the model with five categories from VEGAS [31] and compare its performance with other methods in terms of generating visually plausible images. In the experiments, the rearranged images are all provided by selecting from the 100th epoch of the same 20 text inputs.

4.1 Training, Development, and Evaluation

We selected publicly available audio classification data of different sizes, which are commonly used for evaluation [4], and also included some additional audio tasks and data, as shown in Table 1, covering classification, retrieval, and audio captioning. ESC-50 [19] is a simple dataset with only 2 thousand samples, while UrbanSound8K [23] is a large environmental dataset with 10 categories. VGGSound [2] is a huge collection of audio and video material, as described before. DESED is used for the audio retrieval (AR) task because DESED can perform sound extraction at the fragment level. Finally, Clotho [7] is a unique audio captioning dataset. In addition, in Subsect. 4.3, we also carry out further image analysis (Table 2).

Table 2. Results on the downstream classification and retrieval tasks, comparing supervised training, other audio representation models such as OpenL3, and the latest SOTA [11,17]. ZS denotes WavBriVL used as a zero-shot model; some numbers are taken from the original literature. The first three columns are classification, the last three retrieval.

Model         | ESC-50 (ACC) | UrbanSound8K (ACC) | VGGSound (mAP) | DESED AR (F1) | CMR A→I (MRR) | CMR I→A (MRR)
Supervise     | 0.5200       | 0.6179             | 0.4331         | 0.1170        | 0.0169        | 0.0162
OpenL3        | 0.733        | 0.7588             | 0.3487         | -             | -             | -
Wav2CLIP      | 0.8595       | 0.8101             | 0.4663         | 0.3955        | 0.0566        | 0.0678
WavBriVL      | 0.9117       | 0.8832             | 0.4741         | 0.3720        | 0.0611        | 0.0608
SOTA          | 0.959        | 0.8949             | 0.544          | -             | -             | -
WavBriVL (ZS) | 0.412        | 0.4024             | 0.1001         | -             | -             | -

For multi-class (MC) classification problems, an MLP-based classifier is employed, with the corresponding number of classes as output. For DESED, we follow the WavBriVL setup and use sed_eval (https://github.com/TUT-ARG/sed_eval) to realize audio retrieval (AR). At the same time, we also explore the performance of our model on multimodal tasks and how to transfer zero-shot to other modalities.
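As an illustration, such a downstream probe on the frozen 512-dimensional WavBriVL embeddings can be as simple as the following (a hypothetical minimal setup, not the exact classifier configuration used in the paper):

```python
import torch.nn as nn

def make_probe(embed_dim=512, num_classes=50, hidden=512):
    """MLP classifier trained on top of frozen audio embeddings,
    with one output unit per class (e.g. 50 for ESC-50)."""
    return nn.Sequential(
        nn.Linear(embed_dim, hidden),
        nn.ReLU(),
        nn.Linear(hidden, num_classes),
    )
```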

4.2 Comparisons with Previous Work

First, we establish a supervised benchmark by training from scratch on each downstream task (with random initialization of the encoder weights). Next, we compare WavBriVL with the publicly available OpenL3 [4], which is pre-trained on a different pretext task: multimodal self-supervised training with AudioSet. It serves as a strong benchmark for different audio tasks, such as audio classification and retrieval. We extract features from OpenL3 (512 dim) and WavBriVL (512 dim) and apply the same training scheme to all downstream classification and retrieval tasks. From the table, we can see that in classification we are slightly better than our previous work, with an average increase of about 0.04, and show only a small deficiency in AR, of about 0.02. We approach or slightly outperform our previous work in retrieval tasks. In summary, our model performs well on both the classification and retrieval datasets. As for the sources of these strengths: in the classification tasks, we achieved good results close to or exceeding SOTA on three of the four datasets; one reason may be related to our data, and the other may be the effect of BriVL. As for the lack of excellent performance in AR tasks, it may be due to the excessive divergence of the BriVL dataset; if we retrained the basic model on a large scale, we might achieve better results. In the retrieval tasks, such as the MRR tasks from A to I and from I to A, we have also achieved excellent results, which mainly come from the strong training effect of the two-tower architecture and the pre-trained model; this simple structure is useful for general tasks.

4.3 Quantitative Image Analysis

We have carried out a comparative analysis of our proposed model against relevant prior works (S2I, https://github.com/leofanzeres/s2i; [9,18,24]). It should be noted that although S2I was not initially designed for sound-to-image conversion, it leverages a VQ-VAE-based model for generating sound-to-depth or segmentation outputs. For a fair comparison, we ensured that our model and the one proposed by Pedersoli et al. were trained using the same training setup as S2I, including the five categories in VEGAS. As reported in Table 3, our proposed model outperforms most of the other models while generating visually enthralling and recognizable images, thanks to the combination of visually enriched audio embeddings and a potent image generator. Our method outperforms the other GAN-based approaches both qualitatively and quantitatively on the VEGAS dataset.

Table 3. Comparison with three previous works [9,18,24] on VEGAS (5 classes).

Method       | R@1   | FID (↓) | IS (↑)
(A) Baseline | 23.10 | 118.68  | 1.19
(B) S2I      | 39.19 | 114.84  | 1.45
(C) Ours     | 53.55 | 96.12   | 3.52
(D) SOTA     | 77.58 | 34.68   | 4.01

Fig. 2. The effect of different volume levels on three inputs: water, fireworks, and rain.

5 Task 2: Speech Generation Picture

In this chapter, our objective was to examine the shared embedding space of WavBriVL through visual analysis. To accomplish this, we employed VQGAN [8] (https://github.com/CompVis/taming-transformers) to generate images with audio guidance. To generate these images, we matched them against the input audio and assessed whether BriVL "approved" of the output. In the case of a mismatch, feedback was provided to VQGAN to refine subsequent image generation attempts. Both models were kept frozen during the performance testing process, as is standard practice.

5.1 Sound Volume

As shown in Fig. 2, in order to ascertain the robustness of our approach in learning the correlation between auditory stimuli and visual representations, we conducted a study investigating the impact of varying sound volumes on the quality of the generated images. To accomplish this, we manipulated the sound volume levels during the test phase and extracted features from the corresponding sound files. These modified sound features were then fed into our model and the improved similarity calculation tool, which had been trained on a standardized volume scale. The final three sets of images support our hypothesis that the magnitude of the volume level is usually positively correlated with the effects and meanings displayed in the images.

Fig. 3. Images generated from five audio clips in VGG-Sound [2]. Top: Wav2CLIP; bottom: WavBriVL. The x-axis shows the audio on which the rearrangement is based, and the rearranged images are generated using the same relevant text.

5.2 Comparison with Previous Work

In previous work, Wav2CLIP also tried to generate images from audio. Figure 3 shows two groups of pictures generated by Wav2CLIP and WavBriVL using English audio. We can see that the overall style of the generated images is similar, because both use the same GAN generation model. However, in detail, they have their own characteristics. For example, in their understanding of "blast": explosions in Chinese videos often appear alongside an overwhelming number of Chinese New Year videos. Therefore, when our model finally selects the images with the highest similarity in the audio-image matrix, it prefers festive orange tones, red firecracker husks, and other such elements. This difference is not caused by translation, because the input is the same audio file; it shows that BriVL carries more of its own characteristics, while the other pictures share similar traits. For the underwater sound input, we can see that the images generated by the two models are similar; after all, natural underwater sound is common and fairly uniform. This reflects the argument made in BriVL's original paper that BriVL-generated images are more integrated, whereas CLIP tends to simply stack elements.

The audio generation process exhibits two distinct characteristics in the two models. The first is convergence, as evident from the similarity of the images produced by both models; this can be attributed to the training set being composed of English images, which dampens the stylistic differences between the two models. The second is divergence: during audio guidance and selection, the final results also diverge. As depicted in Fig. 3, the images are more imaginative than those in Fig. 4. This divergence can be explained by two factors: firstly, BriVL's weakly semantically correlated text-image dataset supports a higher imaginative capacity, and secondly, the audio itself diverges strongly, thereby enhancing the model's associative capabilities.

Table 4. Human scores on the correlation between sounds and images; both AudioCLIP and WavBriVL use GAN-generated images.

Options   | Positive | Negative | Neither
AudioCLIP | 78%      | 14%      | 8%
WavBriVL  | 81%      | 15%      | 4%

Fig. 4. Examples of text-to-image generation from CLIP (top) and BriVL (bottom); BriVL's labels on the x-axis are translated.

5.3 Correlation Between Sounds and Images

In Fig. 3, we showed that we can generate better images and interpretable correlations. However, this only demonstrates their plausibility; we also wanted to assess the relationship between sounds and images. To accomplish this, we conducted a test similar to previous work [13,27], in which participants were asked to select the image most closely linked to a given sound from a set of two images. Each of these images was conditioned on a different sound class, so if our model could generate images relevant to a particular sound class, then participants would select the corresponding image generated from the input sound rather than an image generated from a different class of sampled sounds. Table 4 displays the findings of the study, with the options representing the participants' choices. A positive option indicates that participants selected the image generated from the sound they heard, while a negative option indicates that they chose the image generated from a different class. The "neither" option indicates that participants believed neither image could represent the sound they heard. The results (the test was repeated three times and the rounded average is reported) reveal that the majority of participants believed that the images generated by our model were correlated with the input sounds, demonstrating our model's capability to produce images related to the given sounds.

6 Summary and Conclusions

In this manuscript, we present a method that employs BriVL to extract audio representations for the purpose of audio-guided GAN image generation. Our proposed method has been shown to produce suitable audio representations, as demonstrated by its reliability in the sound-volume assessment and in the quantitative and qualitative image quality assessments. In the future, we plan to study a model suitable for more modalities, develop the model in greater depth, explore its potential in more tasks, and compare it with other advanced models. In addition, we will consider trying other generative models and making further use of the audio representation (for example, directly generating images instead of the current indirect generation) in the next version of this work.

References 1. Baevski, A., Zhou, Y., Mohamed, A., Auli, M.: wav2vec 2.0: a framework for selfsupervised learning of speech representations. Adv. Neural Inf. Process. Syst. 33, 12449–12460 (2020) 2. Chen, H., Xie, W., Vedaldi, A., Zisserman, A.: Vggsound: a large-scale audio-visual dataset. In: ICASSP, pp. 721–725. IEEE (2020) 3. Chen, S., et al.: WavLM: large-scale self-supervised pre-training for full stack speech processing. IEEE J. Sel. Top. Sig. Process. 16(6), 1505–1518 (2022) 4. Cramer, J., Wu, H.H., Salamon, J., Bello, J.P.: Look, listen, and learn more: Design choices for deep audio embeddings. In: ICASSP, pp. 3852–3856. IEEE (2019) 5. Cudeiro, D., Bolkart, T., Laidlaw, C., Ranjan, A., Black, M.J.: Capture, learning, and synthesis of 3d speaking styles. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10101–10111 (2019) 6. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018) 7. Drossos, K., Lipping, S., Virtanen, T.: Clotho: an audio captioning dataset. In: ICASSP, May 2020. https://arxiv.org/abs/1910.09387


8. Esser, P., Rombach, R., Ommer, B.: Taming transformers for high-resolution image synthesis. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12873–12883 (2021) 9. Fanzeres, L.A., Nadeu, C.: Sound-to-imagination: unsupervised crossmodal translation using deep dense network architecture. arXiv preprint arXiv:2106.01266 (2021) 10. Fei, N., et al.: Towards artificial general intelligence via a multimodal foundation model. Nat. Commun. 13(1), 1–13 (2022) 11. Guzhov, A., Raue, F., Hees, J., Dengel, A.: Audioclip: extending clip to image, text and audio. arXiv preprint arXiv:2106.13043 (2021) 12. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9726–9735 (2020). https://doi.org/ 10.1109/CVPR42600.2020.00975 13. Ilharco, G., Zhang, Y., Baldridge, J.: Large-scale representation learning from visually grounded untranscribed speech. In: Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pp. 55–65. Association for Computational Linguistics, Hong Kong, China, November 2019. https://doi.org/ 10.18653/v1/K19-1006, https://aclanthology.org/K19-1006 14. Jia, C., et al.: Scaling up visual and vision-language representation learning with noisy text supervision. In: International Conference on Machine Learning, pp. 4904–4916. PMLR (2021) 15. Kaplan, J., et al.: Scaling laws for neural language models. arXiv preprint arXiv:2001.08361 (2020) 16. Karras, T., Aila, T., Laine, S., Herva, A., Lehtinen, J.: Audio-driven facial animation by joint end-to-end learning of pose and emotion. ACM Trans. Graph. (TOG) 36(4), 1–12 (2017) 17. Kazakos, E., Nagrani, A., Zisserman, A., Damen, D.: Slow-fast auditory streams for audio recognition. In: ICASSP, pp. 855–859 (2021). https://doi.org/10.1109/ ICASSP39728.2021.9413376 18. Pedersoli, F., Wiebe, D., Banitalebi, A., Zhang, Y., Yi, K.M.: Estimating visual information from audio through manifold learning. arXiv preprint arXiv:2208.02337 (2022) 19. Piczak, K.J.: ESC: dataset for environmental sound classification. In: ACM Multimedia, p. 1015. ACM Press (2015). https://doi.org/10.1145/2733373.2806390, http://dl.acm.org/citation.cfm?doid=2733373.2806390 20. Qiu, Y., Kataoka, H.: Image generation associated with music data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 2510–2513 (2018) 21. Radford, A., et al.: Learning transferable visual models from natural language supervision. In: ICML (2021) 22. Rombach, R., Blattmann, A., Lorenz, D., Esser, P., Ommer, B.: High-resolution image synthesis with latent diffusion models. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10684– 10695, June 2022 23. Salamon, J., Jacoby, C., Bello, J.P.: A dataset and taxonomy for urban sound research. In: ACM Multimedia, pp. 1041–1044. Orlando, FL, USA, Nov 2014 24. Sung-Bin, K., Senocak, A., Ha, H., Owens, A., Oh, T.H.: Sound to visual scene generation by audio-to-visual latent alignment (2023)

Audio Representation Method from BriVL

53

25. Tan, M., Le, Q.: EfficientNet: rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, pp. 6105–6114. PMLR (2019) 26. Turpault, N., Serizel, R., Parag Shah, A., Salamon, J.: Sound event detection in domestic environments with weakly labeled data and soundscape synthesis. In: DCASE. New York City, United States, October 2019. https://hal.inria.fr/hal02160855 27. Wan, C.H., Chuang, S.P., Lee, H.Y.: Towards audio to scene image synthesis using generative adversarial network. In: ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 496–500 (2019). https://doi.org/10.1109/ICASSP.2019.8682383 28. Wu, H.H., Seetharaman, P., Kumar, K., Bello, J.P.: Wav2clip: learning robust audio representations from clip. In: ICASSP 2022–2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4563–4567. IEEE (2022) 29. Zhang, L., Agrawala, M.: Adding conditional control to text-to-image diffusion models (2023) 30. Zhao, P., Chen, Y., Zhao, L., Wu, G., Zhou, X.: Generating images from audio under semantic consistency. Neurocomputing 490, 93–103 (2022) 31. Zhou, Y., Wang, Z., Fang, C., Bui, T., Berg, T.L.: Visual to sound: generating natural sound for videos in the wild. In: CVPR (2018) 32. Zhu, H., Luo, M.D., Wang, R., Zheng, A.H., He, R.: Deep audio-visual learning: A survey. Int. J. Autom. Comput. 18(3), 351–376 (2021)

Can You Really Reason: A Novel Framework for Assessing Natural Language Reasoning Datasets and Models

Shanshan Huang(B)

Shanghai Jiao Tong University, Shanghai, China

Abstract. Recent studies have illuminated a pressing issue in the domain of natural language understanding (NLU) and reasoning: many of these datasets are imbued with subtle statistical cues. These cues, often unnoticed, provide sophisticated models an unintended edge, allowing them to exploit these patterns, leading to a potentially misleading overestimation of their genuine capabilities. While the existence of these cues has been noted, a precise and systematic identification has remained elusive in existing literature. Addressing this gap, our paper presents a novel lightweight framework. This framework is meticulously designed to not only detect these hidden biases in multiple-choice NLU datasets but also rigorously evaluate the robustness of models that are developed based on these datasets. By unveiling these biases and assessing model integrity, we aim to pave the way for more genuine and transparent advancements in NLU research.

Keywords: Dataset bias · Model bias · Model robustness

1 Introduction

The advancements in neural network models have yielded significant enhancements in a plethora of tasks, including natural language inference [1,20], argumentation [11], commonsense reasoning [9,14,23], reading comprehension [6], question answering [19], and dialogue analysis [7]. However, recent studies [4,12,15] have unveiled that superficial statistical patterns in benchmark datasets, including sentiment, word repetition, and shallow n-gram tokens, can forecast the correct answer. These patterns or features are termed spurious cues when they appear in both the training and test datasets with similar distributions. When these cues are neutralized, leading to a "stress test" [8,10,13], models exhibit reduced performance, suggesting an overestimation of their capabilities when evaluated on these datasets.

Several natural language reasoning tasks, exemplified by those in the Stanford Natural Language Inference (SNLI) dataset, can be cast as multiple-choice questions. A typical question can be structured as follows:


Example 1. An instance from SNLI.
Premise: A swimmer playing in the surf watches a low flying airplane headed inland.
Hypothesis: Someone is swimming in the sea.
Label: a) Entail. b) Contradict. c) Neutral.

Humans approach these questions by examining the logical relations between the premise and the hypothesis. Yet, previous work [10,16] has unveiled that several NLP models can correctly answer these questions by only considering the hypothesis. This observation often traces back to the presence of artifacts in the manually crafted hypotheses within many datasets. Although identifying problematic questions with a "hypothesis-only" test is theoretically sound, this approach often i) relies on specific models like BERT [3], which require costly retraining, and ii) fails to explain why a question is problematic.

This paper puts forth a lightweight framework aimed at identifying simple yet impactful cues in multiple-choice natural language reasoning datasets, enabling the detection of problematic questions. While not all multiple-choice questions in these datasets include a premise, a hypothesis, and a label, we detail a method to standardize them in Sect. 2. We leverage words as the fundamental features in crafting spurious cues, since they serve as the foundational units in modeling natural language across most contemporary machine learning methods. Even complex linguistic features, such as sentiment, style, and opinions, are anchored on word features. The experimental sections will demonstrate that word-based cues can detect statistical bias in datasets as effectively as the more resource-demanding hypothesis-only method.

2 Approach

We evaluate the information leakage in the datasets using only statistical features. First, we formulate a number of natural language reasoning (NLR) tasks in a general form. Then, based on the frequency of words associated with each label, we design a number of metrics to measure the correlation between words and labels. Such correlation scores are called "cue scores" because they are indicative of potential cue patterns. Afterward, we aggregate the scores using a number of simple statistical models to make predictions.

2.1 Task Formulation

Given a question instance x of an NLR task dataset X, we formulate it as x = (p, h, l) ∈ X,

(1)

where p is the context against which to do the reasoning, corresponding to the "premise" in Example 1; h is the hypothesis given the context p; and l ∈ L is the label that depicts the type of relation between p and h. The size of the relation set L varies between tasks. We argue that most discriminative NLR tasks can be formulated into this general form. For example, an NLI question consists of a premise, a hypothesis, and a label on the relation between the premise and hypothesis, with |L| = 3 for the three relations: entailment, contradiction, and neutral. We discuss how to transform other question formats into this form in Sect. 2.4.

2.2 Cue Metric

For a dataset X, we collect the set N of all words that appear in X. The cue metric for a word measures the disparity of the word's appearance under a specific label. Let w be a word in N; we compute a scalar statistic called the cue score, f_F^{(w,l)}, in one of the following eight ways. We categorize the metrics into two genres: the first four use only counting statistics, and the last four use a notion of angles in the Euclidean space. Let L' = L \ {l}, and define

#(w, L') = \sum_{l' \in L'} #(w, l').    (2)

Frequency (Freq). The simplest measurement is the co-occurrence count of a word and a label, where #(·) denotes naive counting. This metric captures the raw frequency of a word appearing under a particular label.

f_{Freq}^{(w,l)} = #(w, l)    (3)

Relative Frequency (RF). Relative Frequency extends the Frequency metric by accounting for the total frequency of the word across all labels. It is defined as follows:

f_{RF}^{(w,l)} = \frac{#(w, l)}{#(w)}    (4)

Conditional Probability (CP). The conditional probability of label l given word w is another way to capture the association between a word and a label. Numerically, this metric coincides with the Relative Frequency defined above.

f_{CP}^{(w,l)} = p(l|w) = \frac{#(w, l)}{#(w)}    (5)

Point-wise Mutual Information (PMI). PMI is a popular metric in information theory and statistics. It measures the strength of association between a word and a label, and is higher when the word and label co-occur more often than would be expected if they were independent. We define the PMI of word w and label l as follows, where p(w) and p(l) are the probabilities of w and l respectively, and p(w, l) is their joint probability:

f_{PMI}^{(w,l)} = \log \frac{p(w, l)}{p(w)\,p(l)}    (6)


Local Mutual Information (LMI). LMI is a variant of PMI that weights the PMI by the joint probability of the word and label. This has the effect of giving more importance to word-label pairs that occur frequently. The LMI of word w with respect to label l is defined as follows:

f_{LMI}^{(w,l)} = p(w, l) \log \frac{p(w, l)}{p(w)\,p(l)}    (7)

Ratio Difference (RD). The Ratio Difference metric measures the absolute difference between the word-label ratio and the overall label ratio. This metric helps identify words that are disproportionately associated with a specific label.

f_{RD}^{(w,l)} = \left| \frac{#(w, l)}{#(w, L')} - \frac{#(l)}{#(L')} \right|    (8)

Angle Difference (AD). Angle Difference is similar to Ratio Difference but accounts for the non-linear relationship between the ratios by taking the arc-tangent. This metric can be more robust to outliers.

f_{AD}^{(w,l)} = \left| \arctan \frac{#(w, l)}{#(w, L')} - \arctan \frac{#(l)}{#(L')} \right|    (9)

Cosine (Cos). The Cosine metric considers v_w = [#(w, l), #(w, L')] and v_l = [#(l), #(L')] as two vectors on a 2D plane. Intuitively, if v_w and v_l are co-linear, w leaks no spurious information. Otherwise, w is suspected to be a spurious cue, as it tends to appear more with a specific label l. This metric quantifies the word-label relationship in a geometric manner.

f_{Cos}^{(w,l)} = \cos(v_w, v_l)    (10)

Weighted Power (WP). The Weighted Power metric combines the Cosine metric with a frequency-based weighting, emphasizing the importance of words with higher frequencies. This metric can help prioritize cues that are more likely to impact the model.

f_{WP}^{(w,l)} = (1 - f_{Cos}^{l})\, #(w)^{f_{Cos}^{l}}    (11)

In general, we denote the cue score of a word w with respect to label l as f^{(w,l)}, dropping the method subscript. These metrics provide different perspectives on the association between words and labels, which can help identify potential spurious correlations.
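To make these definitions concrete, the following is a minimal sketch (not from the paper; the toy instances and function names are illustrative) that counts word-label co-occurrences over hypotheses and evaluates the Freq, CP, and PMI scores defined above.

```python
import math
from collections import Counter

# Toy instances in the unified (premise, hypothesis, label) form; purely illustrative.
instances = [
    ("A swimmer watches a plane.", "someone is swimming in the sea", "entail"),
    ("A man cooks dinner.", "nobody is cooking", "contradict"),
    ("A dog runs in a park.", "an animal is outside", "entail"),
    ("A child reads a book.", "the child is sleeping", "contradict"),
]

word_label, word, label = Counter(), Counter(), Counter()
total = 0  # total number of word occurrences
for _, hypothesis, l in instances:
    for w in hypothesis.lower().split():
        word_label[(w, l)] += 1   # #(w, l)
        word[w] += 1              # #(w)
        label[l] += 1             # #(l), counted at the token level
        total += 1

def freq(w, l):
    """Eq. (3): raw co-occurrence count #(w, l)."""
    return word_label[(w, l)]

def cp(w, l):
    """Eq. (5): conditional probability p(l|w) = #(w, l) / #(w)."""
    return word_label[(w, l)] / word[w] if word[w] else 0.0

def pmi(w, l):
    """Eq. (6): log p(w, l) / (p(w) p(l)), with -inf for unseen pairs."""
    p_wl = word_label[(w, l)] / total
    if p_wl == 0.0:
        return float("-inf")
    return math.log(p_wl / ((word[w] / total) * (label[l] / total)))

for w in ("is", "nobody"):
    for l in ("entail", "contradict"):
        print(w, l, freq(w, l), round(cp(w, l), 2), pmi(w, l))
```

A word such as "nobody", which appears only under one label in this toy sample, receives an extreme CP and PMI score for that label, which is exactly the kind of spurious association the cue scores are designed to surface.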

2.3 Aggregation Methods

We can use simple methods G to aggregate the cue scores of words within a question instance x to make a prediction. These methods are designed to be easily implemented and computationally efficient, given the low-dimensional cue features.

Average and Max. The most straightforward way to predict a label is to select the label with the highest average or maximum cue score in an instance:

G_{average} = \arg\max_{l} \frac{\sum_{w} f^{(w,l)}}{|x|}, \quad l \in L, \; w \in N    (12)

G_{max} = \arg\max_{l} \max_{w} f^{(w,l)}, \quad l \in L, \; w \in N    (13)

Linear Models. To better utilize the cue scores in making predictions, we employ two simple linear models: an SGD classifier and logistic regression. The input for the models is a concatenated vector of cue scores for each label in instance x:

input(x) = [f^{(w_1,l_1)}, ..., f^{(w_d,l_1)}, f^{(w_1,l_2)}, ..., f^{(w_d,l_2)}, ..., f^{(w_1,l_t)}, ..., f^{(w_d,l_t)}].    (14)

Here, d denotes the length of x; in practice, input vectors are padded to the same length. The training loss for the linear model is

\hat{\phi}_n = \arg\min_{\phi_n} loss(G_{linear}(input(x); \phi_n)),    (15)

where the loss is calculated between the gold label l_g and the predicted label G_{linear}(input(x); \phi_n), and \hat{\phi}_n represents the optimal parameters in G_{linear} that minimize the loss for label l_g.
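As a minimal sketch of the linear aggregation in (14)-(15), assuming scikit-learn; the toy data, padding length, and CP-based cue function are illustrative rather than the paper's exact setup:

```python
import numpy as np
from collections import Counter
from sklearn.linear_model import LogisticRegression

# Toy training split in the unified (premise, hypothesis, label) form; illustrative only.
train = [
    ("p1", "someone is swimming in the sea", "entail"),
    ("p2", "nobody is cooking", "contradict"),
    ("p3", "an animal is outside", "entail"),
    ("p4", "the child is not reading", "contradict"),
]
LABELS = ["entail", "contradict"]
MAX_LEN = 10  # pad/truncate hypotheses to a fixed number of word positions

# Word-level cue scores: here the conditional probability p(l|w) of Eq. (5).
wl, w_cnt = Counter(), Counter()
for _, h, l in train:
    for w in h.split():
        wl[(w, l)] += 1
        w_cnt[w] += 1

def cue(w, l):
    return wl[(w, l)] / w_cnt[w] if w_cnt[w] else 0.0

def features(hypothesis):
    """Concatenated per-label cue-score vector of Eq. (14), padded to MAX_LEN."""
    words = hypothesis.split()[:MAX_LEN]
    feats = []
    for l in LABELS:
        feats += [cue(w, l) for w in words] + [0.0] * (MAX_LEN - len(words))
    return feats

X = np.array([features(h) for _, h, _ in train])
y = np.array([l for _, _, l in train])
clf = LogisticRegression(max_iter=1000).fit(X, y)  # G_linear of Eq. (15)
print(clf.predict(X))  # in the paper, evaluation is done on a held-out test split
```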

2.4 Transformation of MCQs with Dynamic Choices

Until now, we have focused on multiple-choice questions (MCQs) that are classification problems with a fixed set of choices. However, some language reasoning tasks involve MCQs with non-fixed choices, such as the ROCStory dataset. In these cases, we can separate the original story into two unified instances, u_1 = (context, ending1, false) and u_2 = (context, ending2, true). We predict the label probability for each instance, G(input(u_1); φ) and G(input(u_2); φ), and choose the ending with the higher probability as the prediction.
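A small sketch of this transformation (the field layout and function names are illustrative, not the paper's code):

```python
def to_unified(context, ending1, ending2, correct_ending=2):
    """Training-time transformation: one two-ending MCQ becomes two unified
    (context, ending, label) instances, with the correct ending labelled True."""
    return [
        (context, ending1, correct_ending == 1),
        (context, ending2, correct_ending == 2),
    ]

def choose_ending(context, ending1, ending2, prob_true):
    """Test-time prediction: score each candidate with the aggregator G
    (`prob_true` returns G's probability of the True label) and keep the higher one."""
    p1 = prob_true(context, ending1)
    p2 = prob_true(context, ending2)
    return 1 if p1 > p2 else 2
```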

3 Experiment

We proceed to demonstrate the effectiveness of our framework in this section. We apply our method to detect cues and measure the amount of information leakage in 12 datasets from 6 different tasks, as shown in Table 1. Our experimental findings are segmented into five sub-sections: Datasets, Quantifying Information Leakage, Cue Evaluation Methods, Comparison with Hypothesis-Only Models, and Identifying Problematic Datasets.


Table 1. Dataset examples and normalized version.

3.1 Datasets

In this section, we present the results of our experiments conducted on 12 diverse datasets, as outlined in Table 1. The datasets can be broadly classified into two categories based on the tasks they present: NLI classification tasks and multiple-choice problems. The NLI classification tasks constitute the first type; they are, in essence, a specialized variant of multiple-choice datasets. The second type includes datasets like ARCT, ARCT adv [16], RACE [6], and RECLOR [22]. In these, one of the alternatives is the "hypothesis", and the "premise" contains more than a single context role. As an example, in ARCT, the Reason and the Claim act as the "premise", and the correct warrant must be chosen. Other datasets like Ubuntu [7], COPA [14], ROCStory, SWAG [23], and CQA [19] belong to the second type as well but have only a single context role in the "premise". Table 2 outlines how hypotheses are gathered in these datasets. Most datasets utilize human-written hypotheses, barring CQA and SWAG.

3.2 Quantifying Information Leakage

In our effort to effectively measure the severity of information leakage or bias in these datasets, we formulate a measurement expressed as D = Acc − Majority. Here, Majority is the accuracy achieved through majority voting, and Acc represents the accuracy of a model that bases its prediction solely on spurious cues. A high absolute value of D indicates the existence of more cues in a dataset. However, a smaller D does not necessarily mean less bias in the training data, but rather less "leakage" between the training and test data. If D is positive, it implies the model is utilizing the cues for its prediction. This evaluation method can be universally applied to any multiple-choice dataset.
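As a worked example, using the figures reported later in Table 4: on SNLI, majority voting yields 33.33% while the word-cue classifier reaches 62.57%, giving D = 62.57% − 33.33% = 29.24%; on the adversarially revised ARCT adv, the cue classifier stays at the 50% majority baseline, so D = 0.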

Table 2. The methods of hypothesis collection for the datasets. AE = Adversarial Experiment, LM = language model, CD = crowdsourcing, Human represents human performance on the datasets.

Datasets    Data Size   Data source         AE    Human (%)
ROCStory    3.9k        CD                  No    100.0
COPA        1k          CD                  No    100.0
SWAG        113k        LM                  Yes   88.0
SNLI        570K        CD                  No    80.0
QNLI        11k         CD                  No    80.0
MNLI        413k        CD                  No    80.0
RACE        100k        CD                  No    94.5
RECLOR      6k          CD                  No    63.0
CQA         12k         CD                  No    88.9
ARCT        2k          CD                  No    79.8
ARCT adv    4k          CD                  Yes   -
Ubuntu      100k        Random Selection    No    -

3.3 Cue Evaluation Methods

The primary technique in our analysis is the hypothesis-only method, which we use as a gold standard for examining the existence of spurious cues. This method assumes that the model can only access the hypothesis and has to make its prediction without considering the premise. To simplify this process and to find a measure that is as close as possible to the hypothesis-only method, we employ four simpler methods that make decisions based solely on spurious statistical cues: the average value classifier (Ave), the maximum value classifier (Max), the SGD classifier (SGDC), and logistic regression (LR). These are outlined in detail in Sect. 2. The main difference between our methods and the hypothesis-only method lies in the type of cues used: while our methods use word-level cues that are interpretable, the hypothesis-only method uses more complex cues, which are not easily interpretable.

3.4 Comparison with Hypothesis-Only Models

Our research aims to assess and validate our proposed bias detection methods, chiefly by comparing their performance with hypothesis-only models. The goal is to demonstrate the effectiveness of our method in identifying spurious statistical cues in multiple-choice datasets, underpinning the contribution we introduced.

Table 3. The Pearson score of D on 12 datasets between our methods and the hypothesis-only models fastText (FT) and BERT. P is the average Pearson score of BERT and FT.

        Ave                      Max                      SGDC                     LR
        FT     BERT   P          FT     BERT   P          FT     BERT   P          FT     BERT   P
PMI     90.87  96.23  93.55      95.37  79.82  87.59      97.81  91.01  94.41      97.14  96.05  96.60
LMI     65.13  49.18  57.16      34.52  30.71  32.62      69.88  79.06  74.47      77.46  81.21  79.33
AD      84.62  72.49  78.56      90.75  73.02  81.89      93.73  76.24  84.98      97.56  86.91  92.24
WP      86.87  73.09  79.98      92.47  79.87  86.17      94.00  22.59  56.53      61.28  75.55  65.86
RD      96.59  93.82  95.21      98.23  91.04  94.63      94.30  93.98  94.14      94.21  95.59  94.90
Cos     94.84  82.94  88.89      92.73  75.40  84.07      98.08  87.86  92.97      87.38  78.44  82.91
Freq    68.00  50.02  59.01      34.45  30.67  32.56      64.08  67.11  65.60      74.58  88.64  81.61
CP      93.09  96.61  94.85      95.29  79.80  87.54      97.19  96.16  96.67      97.17  97.34  97.26

In this experimental comparison, we utilize the Pearson Correlation Coefficient (PCC) to measure the similarity between our method and the established hypothesis-only models, specifically fastText and BERT. The analysis encompasses twelve datasets, making use of eight distinct cue score metrics and four aggregation algorithms. The outcomes of this analysis, as depicted in Table 3, highlight that the CP cue score coupled with the logistic regression model achieves high correlations across all twelve datasets when compared to the gold-standard hypothesis-only models. The PCC scores obtained are 97.17% with fastText and 97.34% with BERT. These results lead us to conclude that the combination of CP and logistic regression forms a robust method for evaluating all datasets in subsequent experiments. The detailed data behind this study is presented in the Appendix.

Given these findings, we are confident in asserting that our CP-based approach is a powerful tool for identifying problematic word features within datasets, through the calculation of the "cueness score" described in Sect. 2. Furthermore, the coupling of CP and logistic regression offers a compelling measure of the extent to which multiple-choice datasets are affected by information leakage, a significant contribution to this field of research.

Further, we visualize our findings by plotting D for our CP+LR method and the two hypothesis-only models (fastText and BERT) on the 12 datasets in Fig. 1. The closely tracking lines in the plot indicate the strong correlation between our method and the hypothesis-only models. Overall, our method effectively identifies and quantifies biases in the datasets, and the strong correlation with hypothesis-only models demonstrates the validity and effectiveness of our approach.
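A minimal sketch of this correlation check, assuming SciPy; the D values below are placeholders rather than the paper's exact numbers:

```python
from scipy.stats import pearsonr

# Deviation scores D (in %) of the CP+LR method and a hypothesis-only model
# over the 12 datasets, listed in the same order; placeholder values.
d_cp_lr = [18.7, 5.6, 2.2, 29.2, 12.5, 18.0, 10.4, 9.8, 3.4, 5.0, 0.0, 4.0]
d_hyp_only = [20.1, 6.3, 3.0, 31.0, 13.2, 17.1, 11.0, 10.5, 2.9, 5.8, 0.5, 4.4]

pcc, p_value = pearsonr(d_cp_lr, d_hyp_only)
print(f"Pearson correlation: {pcc:.4f} (p = {p_value:.3g})")
```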


Fig. 1. Deviation scores for three prediction models on all 12 datasets.

3.5 Identifying Problematic Datasets

To better discern problematic datasets, we developed a criterion based on our experimental findings: if a model's D exceeds 10% on any cue feature, the dataset is deemed problematic. This straightforward criterion allows for a quick identification of datasets with severe statistical cue issues.

Table 4. Highest accuracy of our 4 simple classification models on 12 datasets and the deviations from majority selection.

Datasets    Majority (%)   Word Cues Acc. (%)   D (%)
ROCStory    50.0           68.68                18.68
COPA        50.0           55.60                5.60
SWAG        25.0           27.23                2.23
SNLI        33.33          62.57                29.24
QNLI        50.0           62.49                12.49
MNLI        33.33          51.37                18.04
RACE        25.0           35.42                10.42
RECLOR      25.0           34.80                9.80
CQA         20.0           23.42                3.42
ARCT        50.0           54.95                4.95
ARCT adv    50.0           50.00                0.0
Ubuntu      1.0            4.96                 3.96

As per this criterion, we identified ROCStories, SNLI, MNLI, QNLI, RACE, and RECLOR as datasets with considerable statistical cue problems. These findings are detailed in Table 4, which highlights the selection results using word cue features on several datasets. For some of these datasets, our methods significantly outperform the random selection probability, showcasing the extent of the statistical cues present. For instance, in the case of the ROCStories dataset, the highest accuracy achieved with our methods exceeds the random selection probability by 20.92%, and even more so for the SNLI dataset, by 33.59%. This indicates that the datasets contain substantial spurious statistical cues that the models can exploit.

In the case of manually intervened datasets without adversarial filtering, such as ARCT, we found that they contained more spurious statistical cues. For instance, human adjustments to the ARCT dataset (ARCT adv) have a notable impact on accuracy (from 54.95% to 50%).

Finally, in Table 4, we report the highest accuracy of our four simple classification models on the 12 datasets, along with the deviations from majority selection. Our findings reveal that the deviation D can effectively identify problematic datasets; we can thus use D to assess the extent to which a dataset contains word cues.

In conclusion, our analysis and criteria for problematic datasets can help researchers identify datasets with substantial statistical cue issues. This critical insight can improve the development of more robust models that do not rely on superficial cues.

4 Related Work

Our work is related to, and to some extent comprises elements of, two research directions: spurious feature analysis and bias calculation.

Spurious Features Analysis has been increasingly studied recently. Much work [17,18,23] has observed that some NLP models can, surprisingly, get good results on natural language understanding questions in MCQ form without even looking at the stems of the questions; such tests are called "hypothesis-only" tests in some works. Further, some research [15] discovered that these models are insensitive to certain small but semantically significant alterations of the hypotheses, leading to speculation that the hypothesis-only performance is due to simple statistical correlations between words in the hypothesis and the labels. Spurious features can be classified into lexicalized and unlexicalized [1]: lexicalized features mainly contain indicators of n-gram and cross-n-gram tokens, while unlexicalized features involve word overlap, sentence length, and the BLEU score between the premise and the hypothesis. [10] refined the lexicalized classification into Negation, Numerical Reasoning, and Spelling Error. [8] refined the word overlap features into Lexical Overlap, Subsequence, and Constituent, which also consider syntactic structure overlap. [15] added unseen tokens as an extra lexicalized feature.

Bias Calculation is concerned with methods to quantify the severity of the cues. Some work [2,5,21] attempted to encode the cue feature implicitly by hypothesis-only training or by extracting features associated with a certain label from the embeddings. Other methods compute the bias with statistical metrics: for example, [22] used the probability of seeing a word conditioned on a specific label to rank words by their biasedness, and LMI [16] was also used to evaluate cues and re-weight examples in some models. However, these works did not justify the choice of these metrics one way or the other. Separately, [13] gave a test data augmentation method, without assessing the degree of bias in those datasets.

5 Conclusion and Future Work

We have addressed the critical issue of statistical biases present in natural language understanding and reasoning datasets. We proposed a lightweight framework that automatically identifies potential biases in multiple-choice NLU-related datasets and assesses the robustness of models designed for these datasets. Our experimental results have demonstrated the effectiveness of this framework in detecting dataset biases and evaluating model performance.

As future work, we plan to further investigate the nature of biases in NLU datasets and explore more sophisticated techniques to detect and mitigate these biases. Additionally, we aim to extend our framework to other types of NLU tasks beyond multiple-choice settings. By continuing to refine our understanding of dataset biases and their impact on model performance, we hope to contribute to the development of more robust, accurate, and reliable NLU models that can better generalize to real-world applications.

References

1. Bowman, S., Angeli, G., Potts, C., Manning, C.D.: A large annotated corpus for learning natural language inference. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 632–642 (2015)
2. Clark, C., Yatskar, M., Zettlemoyer, L.: Don't take the easy way out: ensemble based methods for avoiding known dataset biases. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 4060–4073 (2019)
3. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
4. Gururangan, S., Swayamdipta, S., Levy, O., Schwartz, R., Bowman, S., Smith, N.A.: Annotation artifacts in natural language inference data. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 2 (Short Papers), pp. 107–112 (2018)
5. He, H., Zha, S., Wang, H.: Unlearn dataset bias in natural language inference by fitting the residual. EMNLP-IJCNLP 2019, 132 (2019)
6. Lai, G., Xie, Q., Liu, H., Yang, Y., Hovy, E.: RACE: large-scale reading comprehension dataset from examinations. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 785–794 (2017)
7. Lowe, R., Pow, N., Serban, I.V., Pineau, J.: The Ubuntu dialogue corpus: a large dataset for research in unstructured multi-turn dialogue systems. In: Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 285–294 (2015)
8. McCoy, T., Pavlick, E., Linzen, T.: Right for the wrong reasons: diagnosing syntactic heuristics in natural language inference. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 3428–3448 (2019)
9. Mostafazadeh, N., et al.: A corpus and cloze evaluation for deeper understanding of commonsense stories. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 839–849 (2016)
10. Naik, A., Ravichander, A., Sadeh, N., Rose, C., Neubig, G.: Stress test evaluation for natural language inference. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 2340–2353 (2018)
11. Niven, T., Kao, H.Y.: Probing neural network comprehension of natural language arguments. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 4658–4664 (2019)
12. Poliak, A., Naradowsky, J., Haldar, A., Rudinger, R., Van Durme, B.: Hypothesis only baselines in natural language inference. In: Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics, pp. 180–191 (2018)
13. Ribeiro, M.T., Wu, T., Guestrin, C., Singh, S.: Beyond accuracy: behavioral testing of NLP models with CheckList. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5–10, 2020, pp. 4902–4912 (2020)
14. Roemmele, M., Bejan, C.A., Gordon, A.S.: Choice of plausible alternatives: an evaluation of commonsense causal reasoning. In: 2011 AAAI Spring Symposium Series (2011)
15. Sanchez, I., Mitchell, J., Riedel, S.: Behavior analysis of NLI models: uncovering the influence of three factors on robustness. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1 (Long Papers), pp. 1975–1985 (2018)
16. Schuster, T., Shah, D.J., Yeo, Y.J.S., Filizzola, D., Santus, E., Barzilay, R.: Towards debiasing fact verification models. arXiv preprint arXiv:1908.05267 (2019)
17. Sharma, R., Allen, J., Bakhshandeh, O., Mostafazadeh, N.: Tackling the story ending biases in the story cloze test. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 752–757 (2018)
18. Srinivasan, S., Arora, R., Riedl, M.: A simple and effective approach to the story cloze test. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 2 (Short Papers), pp. 92–96 (2018)
19. Talmor, A., Herzig, J., Lourie, N., Berant, J.: CommonsenseQA: a question answering challenge targeting commonsense knowledge. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1 (Long and Short Papers), pp. 4149–4158 (2019)
20. Wang, A., Singh, A., Michael, J., Hill, F., Levy, O., Bowman, S.R.: GLUE: a multi-task benchmark and analysis platform for natural language understanding. EMNLP 2018, 353 (2018)
21. Yaghoobzadeh, Y., Tachet, R., Hazen, T., Sordoni, A.: Robust natural language inference models with example forgetting. arXiv preprint arXiv:1911.03861 (2019)
22. Yu, W., Jiang, Z., Dong, Y., Feng, J.: ReClor: a reading comprehension dataset requiring logical reasoning. arXiv preprint arXiv:2002.04326 (2020)
23. Zellers, R., Bisk, Y., Schwartz, R., Choi, Y.: SWAG: a large-scale adversarial dataset for grounded commonsense inference. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 93–104 (2018)

End-to-End Urban Autonomous Navigation with Decision Hindsight

Qi Deng1,2, Guangqing Liu1, Ruyang Li1,2(B), Qifu Hu1, Yaqian Zhao1, and Rengang Li1

1 Inspur (Beijing) Electronic Information Industry Co., Ltd., Beijing 100085, China
{dengqi01,liugq,liruyang,huqifu}@ieisystem.com
2 Shandong Massive Information Technology Research Institute, Jinan 250101, China

Abstract. Urban autonomous navigation has broad application prospects. Reinforcement Learning (RL) based navigation models can be continuously optimized through self-exploration, eliminating the need for human heuristics. However, training effective navigation models faces challenges due to the dynamic nature of urban traffic conditions and the exploration-exploitation dilemma in RL. Moreover, the limited vehicle perception and traffic uncertainty introduce potential safety hazards, hampering the real-world application of RL-based navigation models. In this paper, we propose a novel end-to-end urban navigation framework with decision hindsight. Formulating the problem as a Partially Observable Markov Decision Process (POMDP), we employ a causal Transformer-based autoregressive modeling approach to process the historical navigation information as supplementary observations. We then combine these historical observations with current perceptions to construct a history-feedforward state representation that enhances global awareness, improving data availability and decision predictability. Furthermore, by integrating the history-feedforward state encoding upstream, we develop an end-to-end learning framework based on RL to obtain a navigation model with decision hindsight, enabling more reliable navigation. To validate the effectiveness of our proposed method, we conduct experiments on challenging urban navigation tasks using the CARLA simulator. The results demonstrate that our method achieves higher learning efficiency and improved driving performance, taking priority over prior methods on urban navigation benchmarks.

Keywords: Urban Autonomous Navigation · Reinforcement Learning · Partially Observable Markov Decision Process (POMDP) · Decision Hindsight

This work was supported by Shandong Provincial Natural Science Foundation, China under Grant ZR2022LZH002.

1 Introduction

Urban navigation is a crucial task in autonomous driving. It faces numerous uncertainties such as variable environments, dynamic interactions, and complex traffic conditions, which can lead to potential dangers [9,12]. Traditional modular pipelines, although offering good interpretability, suffer from over-reliance on human heuristics and strong entanglement between modules, resulting in inefficiency and laboriousness in complex urban areas [23]. Recently, Reinforcement Learning (RL) has shown great promise in the field of autonomous driving [2,17,30]. Through self-exploration and reinforcement, an RL-based autonomous navigation model can be continuously optimized without much hand-engineered involvement. Nevertheless, urban scenarios pose challenges that require extensive interactions for sufficient exploration. The dynamic traffic conditions, coupled with the exploration-exploitation dilemma inherent in RL, further exacerbate the data inefficiency [6]. Moreover, the limited perception of on-board sensors and the unpredictable intentions of surrounding objects hinder the vehicle's ability to respond promptly and reasonably, potentially compromising safety and impeding real-world applications.

To enhance learning efficiency and decision reliability, it is crucial to design an effective state representation that balances dimensionality reduction with rich information content. One common approach is to extract latent features from raw perceptual data using encoders [1], but these often require pre-training with large amounts of data to ensure accuracy, making them sensitive to environmental variations. To address differences in environment perception, the Bird's-Eye View (BEV) can serve as an intermediate state representation constructed on top of the raw perceptual data [4,17,28], retaining the information essential to driving decisions. Nonetheless, due to the limited sensing range, distant environmental conditions may remain undetected. Some studies have explored the use of high-definition maps or roadside perception as supplements to the driving environment information, but they rely heavily on Vehicle-to-Everything (V2X) technology, which is still immature.

As most RL-based urban navigation research strictly follows the Markov property, it is inaccurate to describe the driving state solely by the current perception from the vehicle's limited perspective. Recently, some studies have incorporated historical behavior into current decision-making based on the Partially Observable Markov Decision Process (POMDP) to address the uncertainty in dynamic environments, and achieved more efficient performance [26,27]. Motivated by this, we also formulate the urban navigation problem with the idea of POMDP, and harness the information from historical trajectories to provide a stronger description of the driving context. The extraction of historical behavior information can be achieved through sequence modeling. Transformer, a network architecture based on the attention mechanism, has gained popularity in processing sequential data due to its ability to capture higher-order relationships and long-range dependencies. Several studies have extended Transformer to RL frameworks for tasks such as representation learning, capturing multi-step temporal dependencies, environment modeling, and sequential decision-making.


Fig. 1. End-to-end urban navigation framework with decision hindsight.

However, Transformer-based architectures often entail high computing and memory costs, resulting in resource-intensive training and inference.

In this paper, we propose a novel end-to-end urban navigation framework with decision hindsight, as depicted in Fig. 1. Based on the POMDP, we integrate the historical information into the current decision by constructing a history-feedforward state representation, so as to improve data utilization and decision foreseeability. Considering the impressive modeling capabilities of Transformer but also its associated training complexity, we utilize it specifically for processing historical behavioral sequences of predefined lengths, in which the driving perception of each timestep has been encoded into a latent feature via an upstream Convolutional Neural Network (CNN). Given the small input scale and straightforward mapping, we seamlessly integrate the Transformer into the state encoding module that is then connected to the RL policy, resulting in our comprehensive end-to-end learning framework with decision hindsight. The main contributions of this paper are summarized as follows:

– We formulate the urban navigation task based on the POMDP, and incorporate the historical information to enhance the context of driving perception for better data utilization.
– We design a history-feedforward state encoding module that integrates a causal Transformer to model the historical trajectories, and construct the end-to-end navigation framework with decision hindsight, enabling more reliable action inference.
– Our method improves learning efficiency and convergence, and achieves better driving performance than several state-of-the-art methods in challenging urban navigation scenarios.

2 Related Work

2.1 State Representation in Urban Scenarios

To address the environmental diversity and uncertainty in urban scenarios, many studies generate a semantic BEV mask based on the raw perception of LiDAR and camera to clearly show the vehicle's surrounding environment [3,15]. In [3], the semantic BEV mask was enforced to connect with certain intermediate properties in the modularized framework to better explain the learned policy. In [15], the predictions from a real-time multi-modal fusion network were incorporated with BEV images as a hybrid representation to learn an end-to-end driving policy ensuring safety, efficiency, and comfort. Both studies aim at vision-centric driving applications with sensor inputs, requiring the careful design of a perception module that converts the raw data into BEV masks. In studies that mainly focus on the decision-making and control of the autonomous vehicle, the BEV image is directly used as the state representation to describe the environment synthetically [17,28,30]. To further reduce the computational complexity, CNN models were applied to learn latent features from the BEV image in [17,30]. In [4,14], the latent state representation was instead encoded by a Variational Auto-Encoder (VAE).

Although the BEV-form representation offers a highly descriptive view of driving scenarios, its practicality is hindered by the limited sensing range of onboard devices. Without mature V2X technology or well-equipped roadside facilities, vehicles struggle to acquire comprehensive global environmental information, and incomplete perceptions may lead to unreliable driving policies with vague state-action mapping. To address this challenge, RL based on POMDP modeling proves to be effective, where the agent's confidence regarding its own state can be enhanced by integrating historical information, allowing for improved handling of uncertain conditions. In [27], a Long Short-Term Memory (LSTM) based architecture was proposed to capture the historical impact of the interactive environment on the long-term reward during highway on-ramp merging. In [26], the fast recurrent deterministic policy gradient (Fast-RDPG) framework was developed for making decisions based on historical trajectories in UAV navigation. In [11], the incorporation of a trajectory-based context variable brought significant improvement, with performance comparable to state-of-the-art meta-RL methods. Inspired by this, we formulate the urban navigation task based on the POMDP, and feed historical information forward to the current state, providing hindsight for action inference.

2.2 Transformer in RL

The historical information should be refined to achieve better utilization, which involves the processing of sequential data. Recently, Transformer has shown superior performance over CNNs and Recurrent Neural Networks (RNNs), as it can model long dependencies and has excellent scalability. In the context of RL, Transformer can be applied for representation learning with the self-attention mechanism to better support policy learning. In [29], as a preliminary attempt, multi-head dot-product attention was introduced for representing and reasoning about states via relational inductive biases, and was subsequently used to process challenging multi-entity observations [25]. Meanwhile, Transformer can also be used to address the issue of partial observability, by working as a memory architecture to capture multi-step temporal dependencies [20]. In [18], a Transformer-based shortcut recurrence with memory vectors was designed for complex reasoning over long-term dependencies. In [19], a meta-RL agent that mimics the memory reinstatement mechanism was proposed, where the episodic memory is built recursively through the transformer layers.

In addition to being embedded into components of traditional RL, Transformer can directly serve as the sequential decision-making model that generates action sequences yielding high returns, motivated by the idea of offline RL. Decision Transformer (DT) [5] is the first framework to cast RL as a sequence modeling problem. Without explicit Temporal Difference (TD) learning or dynamic programming, DT produces the desired trajectory by conditioning an autoregressive model on the desired return (reward), past states, and actions. A similar method named Trajectory Transformer (TT) [13] also uses a Transformer architecture to model distributions over trajectories, but uses beam search for planning during execution. Thereafter, a growing number of works has focused on Transformer-based sequential decision-making, with different choices of conditioning or structure [16]. Given the difficulty of defining the return-to-go conditioning in long-term urban navigation, we still follow TD-learning while using Transformer as a behavior encoder for trajectory modeling, aiming to incorporate the historical information most relevant to action inference via the self-attention mechanism.

3 Methodology

3.1 Problem Formulation

Considering the delayed impact of historical behavior on the current decision, we model the autonomous driving task as a POMDP, described by the tuple ⟨S, A, T, R, O, Z, γ⟩, where S is the state space, A is the action space, T is the state transition function, R is the reward function, O is the observation space, Z is the observation function, and γ ∈ [0, 1] is the discount factor. We use s_t ∈ S, a_t ∈ A, and r_t = R(s_t, a_t) to denote the state, action, and reward at timestep t, respectively. Based on the historical trajectory h_t = (s_{t-1}, a_{t-1}, r_{t-1}, \cdots, s_0, a_0, r_0), with the predefined observation function Z : S × A × R → O, we can obtain the historical observation o_t = Z(h_t) ∈ O. Then, o_t is combined with s_t to get the joint state ŝ_t = [s_t, o_t] as the basis for action inference. In this work, we focus on end-to-end urban navigation with continuous control signals; considering the POMDP with continuous states and actions, RL aims to learn an optimal stochastic policy projecting states into actions.


Given the policy a ∼ π_θ(·|ŝ) with parameters θ, at each timestep t the vehicle chooses the desired action a_t based on ŝ_t according to π_θ(·|ŝ_t), then receives a reward r_t and transits to the next state s_{t+1} according to the state transition T. The goal of RL is to maximize the expected return R_t = E[\sum_{i=t}^{\infty} γ^{i-t} r_i]. To obtain the optimal policy π*, the value function of states and the action-value function of state-action pairs should be estimated. The value function V^π(ŝ) is defined as the expected return starting from ŝ and successively following the policy,

V^π(ŝ_t) = E[R_t | ŝ_t],    (1)

and the action-value function Q^π(ŝ, a) is defined as the expected return when executing action a at ŝ,

Q^π(ŝ_t, a_t) = E[R_t | ŝ_t, a_t].    (2)

To address the continuous action space, we use an RL method based on the policy gradient to learn π_θ, with the following objective,

L(θ) = E[A^π(ŝ, a) \log π_θ(a|ŝ)],    (3)

where A^π(ŝ, a) := Q^π(ŝ, a) − V^π(ŝ) is the advantage function. By repeatedly estimating the gradient of (3), the optimal π* can be gradually approached.
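As a minimal sketch (not the authors' code), the surrogate objective in (3) reduces to an advantage-weighted log-likelihood term, negated here for use with gradient-descent optimizers:

```python
import torch

def policy_gradient_loss(log_probs, advantages):
    """Surrogate for Eq. (3): E[A^pi(s, a) * log pi_theta(a|s)].
    `log_probs` are log pi_theta(a|s) of the sampled actions; `advantages`
    are treated as constants so gradients flow only through the policy."""
    return -(advantages.detach() * log_probs).mean()
```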

3.2 History-Feedforward State Encoding

To improve information utilization and decision reliability, we combine current perceptions and historical behaviors to create the driving states used as policy inputs. The current perceptions contain environmental perceptions and vehicle measurements. The real-time perception of environmental dynamics is described by the BEV image to reduce complexity, which consists of C grayscale images with dimension W × H. The vehicle measurements, including steering, throttle, brake, gear, and lateral and horizontal speed, are represented as a 6-dimensional vector. On the other hand, the historical behavioral sequence refers to the historical trajectory of the most recent K timesteps instead of the entire episode, to reduce storage and computational overhead.

Accordingly, we design two encoders to process the current perceptions and the historical trajectories respectively, forming a state encoding module with decision hindsight. The current perception encoder uses 6 convolutional layers and 2 fully-connected (FC) layers to encode the BEV image i ∈ [0, 1]^{W×H×C} and the measurement vector m ∈ R^6 separately; after stitching the outputs from both parts, the current perception features are obtained by passing through 2 additional FC layers. The historical behavior encoder is built on a causal Transformer, aiming to leverage its autoregressive modeling capabilities to learn meaningful patterns from historical trajectories. At each timestep t, as shown in Fig. 2, we consider the historical trajectory consisting of the states, actions and rewards of the last K timesteps,

h_t = (s_{t-K}, a_{t-K}, r_{t-K}, \cdots, s_{t-1}, a_{t-1}, r_{t-1}),    (4)


where s is the state feature processed by the perception encoder with output dimension D_s, a ∈ [−1, 1]^2 is the low-level action for steering and acceleration with dimension D_a = 2, and r is the scalar reward with dimension D_r = 1. Following [5], we project the raw inputs of each modality into embedded tokens through separate embedding layers,

e^s_i = f_{stat}(s_i), \quad e^a_i = f_{act}(a_i), \quad e^r_i = f_{rew}(r_i),    (5)

where e^s_i, e^a_i, e^r_i are the tokens of the state, action, and reward at timestep i, each with the same dimension D_e. In this work, f_{stat}(·), f_{act}(·), f_{rew}(·) are realized by 3 FC layers with input sizes D_s, D_a, D_r respectively, and the same output size D_e. Meanwhile, a token for each timestep is also learned as the position embedding,

e^p_i = f_{pos}(N_e, i),    (6)

where e^p_i also has dimension D_e, f_{pos}(·) is a simple lookup table, and N_e is defined as the maximum length of a rollout episode. Then, we add e^p_i to the tokens of the state, action and reward, respectively,

\hat{e}^s_i = e^s_i + e^p_i, \quad \hat{e}^a_i = e^a_i + e^p_i, \quad \hat{e}^r_i = e^r_i + e^p_i.    (7)

After passing through a layer-normalization layer, the token sequence h^{in}_t = (\hat{e}^s_{t-K}, \hat{e}^a_{t-K}, \hat{e}^r_{t-K}, \cdots, \hat{e}^s_{t-1}, \hat{e}^a_{t-1}, \hat{e}^r_{t-1}) is fed into a Generative Pre-trained Transformer (GPT), a modified Transformer architecture whose causal self-attention mask enables autoregressive trajectory modeling,

h^{out}_t = GPT(h^{in}_t),    (8)

where h^{out}_t = (h^s_{t-K}, h^a_{t-K}, h^r_{t-K}, \cdots, h^s_{t-1}, h^a_{t-1}, h^r_{t-1}) is the hidden state sequence, and h^s_i, h^a_i, h^r_i are the encoded features corresponding to s_i, a_i and r_i in h^{in}_t, respectively. Unlike [5], which further projects h^{out}_t to future action tokens, GPT works as the observation function here, so we directly select the historical observation o_t = h^a_{t-1}, i.e., the last action feature in h^{out}_t, which is combined with the current perception state s_t to create the policy input ŝ_t.
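To make the encoder concrete, the following is a minimal PyTorch sketch of the behavior encoder described above. It is a simplified stand-in rather than the authors' implementation: a standard TransformerEncoder with a causal mask plays the role of the GPT blocks, and the dimensions are illustrative (K = 10 matches the setting reported in Sect. 4.1).

```python
import torch
import torch.nn as nn

class HistoryEncoder(nn.Module):
    """Sketch of the causal-Transformer behavior encoder: embeds (state, action, reward)
    tokens of the last K steps, adds timestep embeddings, and returns the hidden feature
    of the last action token as the historical observation o_t."""
    def __init__(self, d_state=256, d_action=2, d_reward=1, d_embed=128,
                 max_episode_len=1000, n_layers=3, n_heads=1):
        super().__init__()
        self.f_stat = nn.Linear(d_state, d_embed)    # Eq. (5)
        self.f_act = nn.Linear(d_action, d_embed)
        self.f_rew = nn.Linear(d_reward, d_embed)
        self.f_pos = nn.Embedding(max_episode_len, d_embed)  # Eq. (6): lookup table
        self.norm = nn.LayerNorm(d_embed)
        layer = nn.TransformerEncoderLayer(d_model=d_embed, nhead=n_heads,
                                           batch_first=True)
        self.gpt = nn.TransformerEncoder(layer, num_layers=n_layers)

    def forward(self, states, actions, rewards, timesteps):
        # states: (B, K, d_state), actions: (B, K, d_action),
        # rewards: (B, K, 1), timesteps: (B, K) integer indices
        pos = self.f_pos(timesteps)                       # Eq. (6)
        e_s = self.f_stat(states) + pos                   # Eq. (7)
        e_a = self.f_act(actions) + pos
        e_r = self.f_rew(rewards) + pos
        # Interleave into (s, a, r, s, a, r, ...) token order, shape (B, 3K, d_embed).
        B, K, D = e_s.shape
        tokens = torch.stack([e_s, e_a, e_r], dim=2).reshape(B, 3 * K, D)
        tokens = self.norm(tokens)
        # Causal self-attention mask so each token only attends to earlier tokens.
        mask = nn.Transformer.generate_square_subsequent_mask(3 * K)
        hidden = self.gpt(tokens, mask=mask)              # Eq. (8)
        # Historical observation o_t: hidden feature of the last action token.
        return hidden[:, -2, :]

# Usage sketch: K = 10 past steps, batch of 4.
enc = HistoryEncoder()
o_t = enc(torch.randn(4, 10, 256), torch.randn(4, 10, 2),
          torch.randn(4, 10, 1), torch.arange(10).repeat(4, 1))
print(o_t.shape)  # torch.Size([4, 128])
```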

3.3 End-to-End Learning with Decision Hindsight

Fig. 2. Structure of the historical behavior encoder based on the causal Transformer, where a GPT architecture with 3 layers and 1 attention head is employed.

Fig. 3. Architecture of the end-to-end urban navigation framework with decision hindsight. For each convolutional (Conv) layer, we specify the output channel (C), kernel size (K) and stride (s). For each fully-connected (FC) layer, we provide the number of units, which is equal to the size of the output feature. Notably, the FC 3 layer in the value network has no activation function, while the FC 3 layer in the policy network utilizes Softplus for activation. Furthermore, all other Conv and FC layers are activated by ReLU.

We integrate the history-feedforward state encoding module as upstream support for the RL-based decision module, aiming to build an end-to-end urban navigation framework with decision hindsight, as shown in Fig. 3. Specifically, we follow the model-free manner and use Proximal Policy Optimization (PPO) with the exploration-enhanced objective in [30] for efficient learning. The PPO architecture contains a value network V_φ parameterized by φ and a policy network π_θ parameterized by θ; both consist of 2 FC hidden layers and take the same input ŝ. The V_φ outputs a scalar state-value for generalized advantage estimation, which is learned to regress the expected returns via the TD approach,

\min_{\phi} \sum_{n=0}^{N} \left\| V_\phi(\hat{s}_n) - \hat{V}_n \right\|^2,    (9)


where n is the index over all timesteps in a batch of trajectories, N is the batch size, and \hat{V}_n = \sum_{i=0}^{\infty} \gamma^{i} r_{n+i} is the discounted sum of rewards. The policy π_θ projects the current input ŝ to a distribution over actions, and is updated via the exploration-enhanced objective

\max_{\theta} \; E[L_{ppo} + \lambda_{ent} L_{ent} + \lambda_{exp} L_{exp}].    (10)

The first term, L_{ppo}, is the clipped PPO loss with generalized advantage estimation [22]. The second, L_{ent}, is the maximum-entropy loss,

L_{ent} = KL(\pi_\theta(\cdot|\hat{s}) \parallel \mathcal{U}(-1, 1)),    (11)

where KL(·) refers to the KL-divergence and \mathcal{U}(-1, 1) is a uniform distribution. The last term, L_{exp}, is the exploration loss,

L_{exp} = \mathbb{I}_{\{T-N_z+1, \cdots, T\}}(k)\; KL(\pi_\theta(\cdot|\hat{s}) \parallel p_z),    (12)

where z is the terminal event (including collision, running a traffic light/sign, route deviation, and being blocked), T is the length of an episode, \mathbb{I}(·) is the indicator function over the last N_z steps of an episode, and p_z is a predefined exploration prior. Both L_{ent} and L_{exp} are added to encourage exploration, with the coefficients \lambda_{ent} and \lambda_{exp} for scale balancing. As we optimize our model in an end-to-end manner, (9) and (10) are combined into one loss function,

L = -L_{policy} + \lambda_{value} L_{value},    (13)

where L_{policy} and L_{value} refer to the objectives in (10) and (9), respectively, and \lambda_{value} is likewise a coefficient for scale balancing. By minimizing (13), the networks V_φ and π_θ can be jointly updated, and the state encoding module, including the perception encoder and the history encoder, is updated simultaneously via gradient backpropagation.

We train our end-to-end framework online via environment interactions, and design a trajectory-level replay buffer R_{traj} with fixed size for rollout collection. To ensure smooth propagation of the computational graph, we store the raw perception data (BEV image i and measurement vector m) rather than the encoded state features. Moreover, unlike [5], which feeds the model with returns-to-go, we still provide the past rewards, and additionally add the timestep as well as the terminal event to our trajectory representation, expressed as

\tau_t = (i_t, m_t, a_t, r_t, h_t = \{i_i, m_i, a_i, r_i, i\}_{t-K \le i \le t-1}, z_t),    (14)

where the terminal event z_t is null if no termination is triggered at timestep t. The pseudo-code of our end-to-end learning framework with decision hindsight is given in Algorithm 1. We follow the reward definition and the implementation approach in [30], adding extra penalties for large steering changes and high speed, and setting the episodes for environment interaction to be endless (once the target is reached, a new random target is chosen).
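For illustration, a condensed sketch of the combined objective in (9)-(13) is given below. It is not the authors' implementation: the clipped-PPO term is written in its standard form, the entropy term stands in for the KL-to-uniform loss of (11) (the two agree up to a constant for actions bounded in [-1, 1]), and all coefficient values are illustrative.

```python
import torch
import torch.nn.functional as F

def combined_loss(dist, actions, old_log_prob, advantages, values, returns,
                  exploration_prior=None, in_last_steps=None,
                  clip_eps=0.2, lam_ent=0.01, lam_exp=0.05, lam_value=0.5):
    """Sketch of Eq. (13): L = -(L_ppo + lam_ent*L_ent + lam_exp*L_exp) + lam_value*L_value.
    `dist` is the current policy's action distribution over a minibatch."""
    # Clipped PPO surrogate (L_ppo) with precomputed advantages.
    log_prob = dist.log_prob(actions).sum(-1)
    ratio = torch.exp(log_prob - old_log_prob)
    l_ppo = torch.min(ratio * advantages,
                      torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps) * advantages).mean()

    # Entropy bonus standing in for the maximum-entropy term (11): maximizing the
    # policy entropy is equivalent (up to a constant) to minimizing KL(pi || U(-1, 1)).
    l_ent = dist.entropy().sum(-1).mean()

    # Exploration term (12): KL towards a predefined prior p_z, applied only to the
    # last N_z steps of episodes that ended with a terminal event z.
    l_exp = torch.tensor(0.0)
    if exploration_prior is not None and in_last_steps is not None and in_last_steps.any():
        kl = torch.distributions.kl_divergence(dist, exploration_prior).sum(-1)
        l_exp = (kl * in_last_steps.float()).mean()

    # Value regression (9).
    l_value = F.mse_loss(values, returns)

    policy_obj = l_ppo + lam_ent * l_ent + lam_exp * l_exp   # Eq. (10)
    return -policy_obj + lam_value * l_value                  # Eq. (13)
```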


Algorithm 1. End-to-end learning framework with decision hindsight
1: Initialization: end-to-end navigation model with policy π_θ, replay buffer R_traj.
2: for each iteration do
3:   while replay buffer R_traj is not full do
4:     Get BEV image i_t, measurement vector m_t and historical trajectory h_t.
5:     Encode the history-feedforward state ŝ_t from i_t, m_t and h_t.
6:     Select action based on the policy: a_t ∼ π_θ(a_t|ŝ_t).
7:     Execute a_t, receive reward r_t and terminal event z_t.
8:     Store data in replay buffer: R_traj ← R_traj ∪ {(i_t, m_t, a_t, r_t, h_t, z_t)}.
9:     if z_t is not null then
10:      Reset environment.
11:    end if
12:   end while
13:   for every gradient step do
14:     Sample minibatch data from replay buffer R_traj.
15:     Update the end-to-end navigation model using (13).
16:   end for
17:   Reset replay buffer R_traj.
18: end for

In each episode, the vehicle begins with a randomly assigned starting point and destination for navigation, and the desired waypoints are calculated by the A* algorithm. At each timestep, the current perception of the vehicle and the historical trajectory are composed into one state feature via the history-feedforward state-encoding module, which is then fed into the RL policy for action reasoning. Once the action is executed, the state transitions to the next timestep and the environment provides a reward. Concurrently, the perception, reward, action and the trajectory of the last K timesteps are combined to form a sample τ, which is then stored into the replay buffer R_{traj} for training, and the navigation process continues. If any of the terminal conditions are met, the vehicle is immediately reset to a new location, and a new episode for rollout begins. Once R_{traj} is filled, the episode is paused and the training phase commences. With the importance sampling of PPO, we randomly sample small batches of data from R_{traj}, and update the model using (13). When the update epoch reaches the upper limit, the buffer is reset, and the episode for rollout continues.

4 Experiments

4.1 Simulation Setup

We constructed the urban navigation environments based on the open-source CARLA simulator [10]. Both training and testing experiments consider six towns (Town01–Town06) with different road topologies and navigation routes.

Training Experiments. In the end-to-end training experiments, rollouts are collected in parallel from six CARLA servers at 10 FPS; each server corresponds to one of the six towns, reducing the time cost of environment interaction. To simulate the uncertainty and randomness of real urban scenarios, the traffic flow is set to change dynamically during training, with randomly spawned vehicles and pedestrians. Furthermore, the starting point and destination for each episode are randomly generated, adding to the diversity of the navigation task.

Testing Experiments. The testing experiments are conducted on the CARLA benchmarks, where the starting point and destination of each route are predefined, aiming to verify the advantages of our proposed method through comparison. On the CARLA 42 Routes [7] (Route-42) benchmark, we compare our method with state-of-the-art methods in terms of driving score (DS), route completion (RC), and infraction score (IS). Meanwhile, we conduct additional internal evaluations on the NoCrash [8] and offline Leaderboard [30] (Offline-LB) benchmarks, taking into account higher traffic density or more diverse routes, with success rate (SR) and DS as metrics.

Implementation Details. Following [30], the input size of the BEV image is set to 192 × 192 px with 15 channels, where cyclists and pedestrians are rendered larger than their actual sizes. The main hyperparameters for PPO training are set according to the specifications of [27]. The length of the historical trajectory for behavior encoding is set to K = 10, a tradeoff between performance and computational efficiency found after repeated attempts. Each training run is implemented on a Tesla A100 GPU with a batch size of 256, and our end-to-end framework is updated using the Adam optimizer with an initial learning rate of 1e-5, which is then scheduled based on the empirical KL-divergence between the policy before and after the update.
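The settings above can be collected into a small configuration sketch; the dictionary keys are illustrative, while the values are those stated in the text:

```python
train_config = {
    "bev_input": (192, 192, 15),    # BEV image: 192 x 192 px, 15 channels
    "history_length_K": 10,         # length of the encoded historical trajectory
    "batch_size": 256,
    "optimizer": "Adam",
    "initial_learning_rate": 1e-5,  # scheduled by the empirical KL between old and new policy
    "carla_servers": 6,             # Town01-Town06, rollouts collected in parallel
    "simulation_fps": 10,
}
```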

4.2 Result Analysis

Comparison to the State of the Art. Table 1 compares our end-to-end RL framework with Transformer-based decision hindsight (TDH-RL) to prior state-of-the-art methods on the Route-42 benchmark, where the results show the mean and standard deviation over 3 evaluations for each model. TransFuser [21] and NEAT [7] are both imitation learning methods that enhance the driving state representation using an attention mechanism, without incorporating historical information. Since NEAT associates the images with the BEV representation using intermediate attention maps, it performs better than TransFuser, highlighting the value of the BEV-form description. Roach performs on par with NEAT by improving the quality of demonstrations via a well-trained RL expert, albeit lacking an attention mechanism. WOR [2] is a model-based RL method learning from pre-recorded driving logs, with performance comparable to NEAT and Roach. InterFuser [24] emerges as the most powerful autonomous driving model in CARLA, outperforming the previous four methods by enhancing safety and interpretability with intermediate features and constrained actions. Our proposed TDH-RL method, with BEV-based environment description and history-feedforward state representation, achieves slightly superior performance compared to InterFuser, and pushes the performance ceiling of the Route-42 benchmark without the need for complex feature-encoding structures or safety constraints. This showcases the efficacy of our method in advancing the state of the art in autonomous driving research. Moreover, TDH-RL can also be considered an advanced expert for providing high-quality demonstrations in imitation learning (IL)-based methods.

Table 1. Comparison of our TDH-RL with other methods on Route-42 benchmark

Method            DS % ↑          RC % ↑          IS ↑
TransFuser [21]   53.40 ± 4.54    72.18 ± 4.17    0.74 ± 0.04
NEAT [7]          65.10 ± 1.75    79.17 ± 3.25    0.82 ± 0.01
Roach [30]        65.08 ± 0.99    85.16 ± 4.20    0.77 ± 0.02
WOR [2]           67.64 ± 1.26    90.16 ± 3.81    0.75 ± 0.02
InterFuser [24]   91.84 ± 2.17    97.12 ± 1.95    0.95 ± 0.02
TDH-RL (Ours)     93.07 ± 1.58    98.31 ± 1.78    0.95 ± 0.02

Ablation Study. To investigate the influence of historical feedforward information, we compared our method to a baseline model that does not include the Transformer-based historical behavior encoder. Additionally, we analyzed the influence of historical observations represented by various output features from the Transformer-based behavior encoder.

Addition of History-Feedforward Encoding. Table 2 and Table 3 present the evaluation results of the baseline model and our proposed TDH-RL on the NoCrash and Offline-LB benchmarks, respectively. Both models were trained in an end-to-end manner with exploration-enhanced objectives for optimization. Our TDH-RL consistently outperforms the baseline model across all evaluation settings, indicating that the inclusion of a history-feedforward mechanism can significantly improve driving performance, even in challenging urban scenarios characterized by higher traffic density and more diverse routes. To further compare the learning performance of the two models, Fig. 4 displays the learning curves within 1.4e7 timesteps. The blue lines represent our TDH-RL model, while the red lines represent the baseline model. As depicted in Fig. 4, both models initially perform poorly, with low rewards and frequent collisions. Through continuous exploration and optimization, they gradually learn to perform more efficiently and approach convergence after 1e7 timesteps. Regarding the overall change in the learning curves, TDH-RL demonstrates a more stable and efficient learning process compared to the baseline model that only considers current perceptions. Notably, TDH-RL achieves an asymptotic performance improvement of approximately 20%, showcasing enhanced exploration and exploitation capabilities.


Fig. 4. Learning curves of models on 6 towns, where solid lines and shaded areas show the mean and standard deviation across 3 seeds, respectively, and the lines of each trial are smoothed via exponential weighted moving averaging. There are four models with different types of state encoding: (1) "None": the baseline model without the Transformer-based historical behavior encoder; (2) "hs": the model using the state-related feature as the historical observation; (3) "hs + ha + hr": the model using the fused feature of the three modalities to represent the historical observation; (4) "ha": our proposed TDH-RL model, which incorporates the action-related feature for state encoding. Each model is learned end-to-end using the same exploration-enhanced objective for optimization, and is evaluated every 1e4 steps.
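The exponential weighted moving-average smoothing used for these curves can be reproduced in a few lines; the smoothing factor below is an arbitrary illustrative choice, not the one used for Fig. 4.

def ewma(values, alpha=0.1):
    # Exponentially weighted moving average for smoothing noisy learning curves.
    smoothed, prev = [], values[0]
    for v in values:
        prev = alpha * v + (1 - alpha) * prev
        smoothed.append(prev)
    return smoothed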


Feature Selection for Historical Observation. In Fig. 4, the green lines correspond to the model that selects the last feature related to the state variable in hout as the historical observation, ot = h^s_{t−1}. On the other hand, the orange lines represent the model that concatenates the last features of the three modalities into a single vector, which is then fused using additional FC layers, ot = FC(h^s_{t−1} + h^a_{t−1} + h^r_{t−1}). It is observed that the model using encoded state features alone (green lines in Fig. 4) performs on par with the baseline model, showing no significant improvement. This suggests that the inclusion of state-related features encoded by the causal Transformer does not provide useful information for inferring actions. Meanwhile, when the fused features are used as historical observations (orange lines in Fig. 4), the model exhibits better learning efficiency and slightly improved asymptotic performance compared to the baseline model. However, it fails to reach the optimal upper bound. This indicates that action-related features are the most crucial among the three modalities. We speculate that, through the processing of the causal self-attention mechanism, the primary historical information with decision hindsight is predominantly captured within the action-related features. However, the presence of redundant state- and reward-related components in the fused features may obscure some key action information. As a result, this model struggles to perform as well as the model that relies solely on action-related features.

Table 2. Comparison of our TDH-RL with the baseline model on the NoCrash benchmark

Model           DS % ↑          SR % ↑
Baseline        92.75 ± 2.63    86.75 ± 4.35
TDH-RL (Ours)   96.23 ± 1.54    92.67 ± 2.47
Advantage       3.48            5.92
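As a rough illustration of the feature-selection variants discussed above, the following sketch builds the historical observation from the causal Transformer's last state-, action- and reward-related outputs. All names (h_s, h_a, h_r, ObservationSelector) are assumptions for illustration, not the paper's implementation.

import torch
import torch.nn as nn

class ObservationSelector(nn.Module):
    def __init__(self, dim, mode="action"):
        super().__init__()
        self.mode = mode
        self.fuse = nn.Linear(3 * dim, dim)   # used only for the fused variant

    def forward(self, h_s, h_a, h_r):
        if self.mode == "action":             # o_t = h^a_{t-1} (the TDH-RL choice)
            return h_a
        if self.mode == "state":              # o_t = h^s_{t-1} (green curves)
            return h_s
        # fused variant (orange curves): concatenate the three modalities and fuse with an FC layer
        return self.fuse(torch.cat([h_s, h_a, h_r], dim=-1))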

Table 3. Comparison of our TDH-RL with baseline model on offline-Leaderboard benchmark

Model           DS % ↑          SR % ↑
Baseline        84.83 ± 3.17    72.26 ± 5.73
TDH-RL (Ours)   89.02 ± 2.65    81.11 ± 2.20
Advantage       4.19            8.85

5 Conclusion

We present an end-to-end urban navigation framework with Transformer-based decision hindsight. Following the POMDP-based problem formulation, the historical behavior information is encoded as supplementary observations using


autoregressive modeling, implemented through a causal Transformer. By combining the historical observations with the current perception, a state representation with enhanced global awareness is constructed to serve as the state input for our RL-based navigation policy. Correspondingly, using the history-feedforward state encoding as upstream support, we establish an end-to-end learning framework based on the exploration-enhanced PPO, aiming to obtain a reliable navigation model with decision hindsight. Through extensive validation on challenging urban navigation tasks, our proposed method outperforms previous methods on the benchmarks, demonstrating significant improvements in learning efficiency and inference reliability. Moving forward, our research will focus on introducing parameterized motion skills for more informative exploration and accelerated reward signaling. Additionally, we aim to explore efficient human-in-the-loop learning methods to ensure high safety and generalizability when transferring our approach to real-world scenarios.

References

1. Ahmed, M., Abobakr, A., Lim, C.P., Nahavandi, S.: Policy-based reinforcement learning for training autonomous driving agents in urban areas with affordance learning. IEEE Trans. Intell. Transp. Syst. 23(8), 12562–12571 (2022). https://doi.org/10.1109/TITS.2021.3115235
2. Chen, D., Koltun, V., Krähenbühl, P.: Learning to drive from a world on rails. In: 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, 10–17 October 2021, pp. 15570–15579. IEEE (2021). https://doi.org/10.1109/ICCV48922.2021.01530
3. Chen, J., Li, S.E., Tomizuka, M.: Interpretable end-to-end urban autonomous driving with latent deep reinforcement learning. IEEE Trans. Intell. Transp. Syst. 23(6), 5068–5078 (2022). https://doi.org/10.1109/TITS.2020.3046646
4. Chen, J., Yuan, B., Tomizuka, M.: Model-free deep reinforcement learning for urban autonomous driving. In: 2019 IEEE Intelligent Transportation Systems Conference (ITSC), pp. 2765–2771 (2019). https://doi.org/10.1109/ITSC.2019.8917306
5. Chen, L., et al.: Decision transformer: reinforcement learning via sequence modeling. In: Ranzato, M., Beygelzimer, A., Dauphin, Y.N., Liang, P., Vaughan, J.W. (eds.) Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6–14, 2021, virtual, pp. 15084–15097 (2021). https://proceedings.neurips.cc/paper/2021/hash/7f489f642a0ddb10272b5c31057f0663-Abstract.html
6. Chen, M., Xiao, X., Zhang, W., Gao, X.: Efficient and stable information directed exploration for continuous reinforcement learning. In: ICASSP 2022–2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4023–4027 (2022). https://doi.org/10.1109/ICASSP43922.2022.9746211
7. Chitta, K., Prakash, A., Geiger, A.: NEAT: neural attention fields for end-to-end autonomous driving. In: 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, 10–17 October 2021, pp. 15773–15783. IEEE (2021). https://doi.org/10.1109/ICCV48922.2021.01550
8. Codevilla, F., Santana, E., López, A.M., Gaidon, A.: Exploring the limitations of behavior cloning for autonomous driving. In: 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 – November 2, 2019, pp. 9328–9337. IEEE (2019). https://doi.org/10.1109/ICCV.2019.00942
9. Deshpande, N., Vaufreydaz, D., Spalanzani, A.: Navigation in urban environments amongst pedestrians using multi-objective deep reinforcement learning. In: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), pp. 923–928 (2021). https://doi.org/10.1109/ITSC48978.2021.9564601
10. Dosovitskiy, A., Ros, G., Codevilla, F., López, A.M., Koltun, V.: CARLA: an open urban driving simulator. In: 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, California, USA, 13–15 November 2017, Proceedings. Proceedings of Machine Learning Research, vol. 78, pp. 1–16. PMLR (2017). http://proceedings.mlr.press/v78/dosovitskiy17a.html
11. Fakoor, R., Chaudhari, P., Soatto, S., Smola, A.J.: Meta-Q-learning. In: 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, 26–30 April 2020. OpenReview.net (2020). https://openreview.net/forum?id=SJeD3CEFPH
12. Huang, C., et al.: Deductive reinforcement learning for visual autonomous urban driving navigation. IEEE Trans. Neural Netw. Learn. Syst. 32(12), 5379–5391 (2021). https://doi.org/10.1109/TNNLS.2021.3109284
13. Janner, M., Li, Q., Levine, S.: Offline reinforcement learning as one big sequence modeling problem. In: Ranzato, M., Beygelzimer, A., Dauphin, Y.N., Liang, P., Vaughan, J.W. (eds.) Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6–14, 2021, virtual, pp. 1273–1286 (2021). https://proceedings.neurips.cc/paper/2021/hash/099fe6b0b444c23836c4a5d07346082b-Abstract.html
14. Kargar, E., Kyrki, V.: Increasing the efficiency of policy learning for autonomous vehicles by multi-task representation learning. IEEE Trans. Intell. Veh. 7(3), 701–710 (2022). https://doi.org/10.1109/TIV.2022.3149891
15. Khalil, Y.H., Mouftah, H.T.: Exploiting multi-modal fusion for urban autonomous driving using latent deep reinforcement learning. IEEE Trans. Veh. Technol. 72(3), 2921–2935 (2023). https://doi.org/10.1109/TVT.2022.3217299
16. Li, W., Luo, H., Lin, Z., Zhang, C., Lu, Z., Ye, D.: A survey on transformers in reinforcement learning. CoRR abs/2301.03044 (2023). https://doi.org/10.48550/arXiv.2301.03044
17. Liu, H., Huang, Z., Wu, J., Lv, C.: Improved deep reinforcement learning with expert demonstrations for urban autonomous driving. In: 2022 IEEE Intelligent Vehicles Symposium (IV), pp. 921–928 (2022). https://doi.org/10.1109/IV51971.2022.9827073
18. Loynd, R., Fernandez, R., Celikyilmaz, A., Swaminathan, A., Hausknecht, M.J.: Working memory graphs. In: Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13–18 July 2020, Virtual Event. Proceedings of Machine Learning Research, vol. 119, pp. 6404–6414. PMLR (2020). http://proceedings.mlr.press/v119/loynd20a.html
19. Melo, L.C.: Transformers are meta-reinforcement learners. In: Chaudhuri, K., Jegelka, S., Song, L., Szepesvári, C., Niu, G., Sabato, S. (eds.) International Conference on Machine Learning, ICML 2022, 17–23 July 2022, Baltimore, Maryland, USA. Proceedings of Machine Learning Research, vol. 162, pp. 15340–15359. PMLR (2022). https://proceedings.mlr.press/v162/melo22a.html
20. Parisotto, E., et al.: Stabilizing transformers for reinforcement learning. In: Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13–18 July 2020, Virtual Event. Proceedings of Machine Learning Research, vol. 119, pp. 7487–7498. PMLR (2020). http://proceedings.mlr.press/v119/parisotto20a.html
21. Prakash, A., Chitta, K., Geiger, A.: Multi-modal fusion transformer for end-to-end autonomous driving. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, 19–25 June 2021, pp. 7077–7087. Computer Vision Foundation/IEEE (2021). https://doi.org/10.1109/CVPR46437.2021.00700, https://openaccess.thecvf.com/content/CVPR2021/html/Prakash_Multi-Modal_Fusion_Transformer_for_End-to-End_Autonomous_Driving_CVPR_2021_paper.html
22. Schulman, J., Moritz, P., Levine, S., Jordan, M.I., Abbeel, P.: High-dimensional continuous control using generalized advantage estimation. In: Bengio, Y., LeCun, Y. (eds.) 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, 2–4 May 2016, Conference Track Proceedings (2016). http://arxiv.org/abs/1506.02438
23. Schwarting, W., Alonso-Mora, J., Rus, D.: Planning and decision-making for autonomous vehicles. Ann. Rev. Control Robot. Autonom. Syst. 1(1), 187–210 (2018). https://doi.org/10.1146/annurev-control-060117-105157
24. Shao, H., Wang, L., Chen, R., Li, H., Liu, Y.: Safety-enhanced autonomous driving using interpretable sensor fusion transformer. In: Liu, K., Kulic, D., Ichnowski, J. (eds.) Conference on Robot Learning, CoRL 2022, 14–18 December 2022, Auckland, New Zealand. Proceedings of Machine Learning Research, vol. 205, pp. 726–737. PMLR (2022). https://proceedings.mlr.press/v205/shao23a.html
25. Vinyals, O., et al.: Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 575(7782), 350–354 (2019). https://doi.org/10.1038/s41586-019-1724-z
26. Wang, C., Wang, J., Shen, Y., Zhang, X.: Autonomous navigation of UAVs in large-scale complex environments: a deep reinforcement learning approach. IEEE Trans. Veh. Technol. 68(3), 2124–2136 (2019). https://doi.org/10.1109/TVT.2018.2890773
27. Wang, P., Chan, C.Y.: Formulation of deep reinforcement learning architecture toward autonomous driving for on-ramp merge. In: 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), pp. 1–6 (2017). https://doi.org/10.1109/ITSC.2017.8317735
28. Wu, J., Huang, W., de Boer, N., Mo, Y., He, X., Lv, C.: Safe decision-making for lane-change of autonomous vehicles via human demonstration-aided reinforcement learning. In: 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), pp. 1228–1233 (2022). https://doi.org/10.1109/ITSC55140.2022.9921872
29. Zambaldi, V.F., et al.: Deep reinforcement learning with relational inductive biases. In: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, 6–9 May 2019. OpenReview.net (2019). https://openreview.net/forum?id=HkxaFoC9KQ
30. Zhang, Z., Liniger, A., Dai, D., Yu, F., Van Gool, L.: End-to-end urban driving by imitating a reinforcement learning coach. In: 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 15202–15212 (2021). https://doi.org/10.1109/ICCV48922.2021.01494

Identifying Self-admitted Technical Debt with Context-Based Ladder Network

Aiyue Gong1,2, Fumiyo Fukumoto3(B), Panitan Muangkammuen2, Jiyi Li3, and Dongjin Yu1

1 School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China
{airaa,yudj}@hdu.edu.cn
2 Integrated Graduate School of Medicine, Engineering, Agricultural Sciences, Faculty of Engineering, University of Yamanashi, Kofu 400-8511, Japan
[email protected]
3 Faculty of Engineering, Graduate Faculty of Interdisciplinary Research, University of Yamanashi, Kofu 400-8511, Japan
{fukumoto,jyli}@yamanashi.ac.jp

Abstract. Technical debt occurs when development teams take actions to expedite the delivery of a project at the cost of poor code quality and additional later refactoring work. The accumulation of technical debt will make future software fixes prohibitively expensive. As a typical type of technical debt, Self-Admitted Technical Debt (SATD) is acknowledged by developers in code comments. Identifying SATD in code comments can improve code quality. However, manually discerning whether code comments contain SATD would be expensive and time-consuming. To solve this problem, we propose a method that applies the Ladder Network together with a pre-training model to identify SATD, based on labeled data from ten open-source projects and unlabeled data from another ten projects. Compared with the original Ladder Network model and other semi-supervised learning models, the results show that the proposed method performs better in technical debt identification. In addition, the proposed method also achieves better results than supervised learning methods. This shows that our approach can make better use of unlabeled data to improve classification performance.

Keywords: Self-admitted Technical Debt · Semi-supervised Learning · Ladder Network

1 Introduction

The concept of technical debt was introduced by Cunningham [1] in 1993. It describes the immature decisions intentionally or unintentionally made by developers to achieve short-term goals. In the short term, these decisions can solve the current issues in software, while they undoubtedly pose hidden risks to the project. Zazworka et al. [2] have pointed out that accumulated technical debt in


software development will have a significant impact in the future. Balancing the chosen solution against the technical debt it introduces thus becomes a crucial consideration for developers. Many studies have focused on self-admitted technical debt (SATD), a term introduced by Potdar and Shihab [3]. SATD is intentionally introduced technical debt, a branch of technical debt. Developers record technical debt to facilitate repayment by themselves or other developers in the future. To facilitate mutual understanding of the code within the development team, code comments are written close to the code. Therefore, code comments have also become a suitable place to record SATD. As software is continuously developed and updated, technical debt is inevitable, and it is necessary to devise methods to identify it. Since most of the content in code comments is natural language text, identifying technical debt through code comments can be treated as a text classification task. In previous work, many deep learning methods have been applied to SATD identification [4–8]. In 2019, Ren et al. [4] applied a convolutional neural network (CNN) to extract text features of SATD. Santos et al. [5,6] used a word embedding model to preprocess code comments and then built a Long Short-Term Memory (LSTM) classifier to identify SATD. Yu et al. [7] employed Bi-directional Long Short-Term Memory (BiLSTM) to identify SATD. Li et al. [8] identified SATD through graph neural networks (GNN) and used focal loss to deal with the problem of data imbalance between classes. Most SATD identification approaches have relied on supervised learning methods. However, supervised learning requires a large amount of labeled training data, and obtaining labeled data is costly. Different annotators may also disagree about the same comment, which requires discussion among multiple people to reach a conclusion. In previous studies, some unsupervised models were also proposed to identify SATD [3,9,10]. Potdar and Shihab [3] proposed using pattern matching to identify SATD; they summarized 62 patterns by manually reading code comments. However, the limited set of 62 patterns makes it possible to miss SATD when faced with complex technical debt that needs to be judged from context. Farias et al. [9] extended the pattern content by adding part-of-speech and code tags to the original matching patterns. Guo et al. [10] proposed the MAT method to identify SATD by investigating the distribution of task tags; it even outperforms some supervised learning methods. Obtaining labeled SATD data requires substantial manual effort, while unlabeled data is easy to obtain. We thus use a pre-training model to obtain contextual representations, especially because it is capable of learning rich contextual features from both labeled and unlabeled data, helping the model better identify technical debt. In this paper, we propose self-admitted technical debt identification with a Ladder Network (SATD-LN), which combines supervised and unsupervised learning. The model is a combination of the Ladder Network [11,12] and the pre-training model. Ladder Networks play an important role in unsupervised learning: they introduce noise and reconstruction loss functions at multiple encoding layers and gradually remove noise from the input data during


training, which enables the model to learn more robust and accurate feature representations. This helps the pre-trained language model better capture noise and variation, thereby improving its generalization ability. For the existing model [12], we add a tanh activation function in the last layer. This modification aims to further enhance the expressiveness and nonlinearity of the model, and we expect it to capture more complex patterns, resulting in good performance on the self-admitted technical debt identification task. The contributions of our work can be summarized as follows:
(1) We are the first to introduce a semi-supervised learning method into the field of SATD identification, achieving promising results with a small amount of labeled data.
(2) We propose the SATD-LN model for SATD identification, which uses both labeled and unlabeled data to obtain rich contextual features.
(3) We conduct extensive experiments on the gold-standard dataset. Experimental results show that our method is effective in identifying SATD.

2 Related Work

2.1 Ladder Network

Semi-supervised learning lies between supervised and unsupervised learning. It has the advantage of reducing the cost of manually annotating data by combining unlabeled data with a small amount of labeled data to train the model. The Ladder Network combines a supervised learning objective with previous unsupervised learning and applies the improved model to semi-supervised learning [11]. This model utilizes a feed-forward model as an encoder to which noise is added and uses a decoder to map each layer of the encoder. Finally, the whole model is trained by minimizing the loss function. This method enables the model to learn a more robust feature representation of the sample under noise, making the model more generalizable. Rasmus et al. [11] also proposed a simplified structure called the Γ-Model, in which most of the decoder structure is omitted and only the top layer is denoised, simplifying the model.

2.2 Pre-training Model

The pre-training model is a framework for training on large general corpora and then fine-tuning on specific target tasks. The general semantic representation learned by the model in the pre-training task enables the model to perform well on target tasks even with limited data. Bidirectional Encoder Representations from Transformers (BERT) [13] is a pre-trained language model based on Transformers, which can capture semantic information from context by pre-training on large-scale unsupervised data. The pre-training process of BERT consists of two unsupervised tasks: Masked Language Modeling and Next Sentence Prediction. The pre-trained


BERT model can be trained on various target tasks through fine-tuning. The rich language representation in the pre-training task can be used to improve the performance and effectiveness of various target tasks.

Table 1. Examples of code comments and whether the technical debt is included.

Code Comment                                                    Type
// TODO: delete the file if it is not a valid file              Debt
// FIXME What's the clean way to add a LogTarget afterward?     Debt
// This is really irritating; we need a way to set stuff        Debt
// The current character is always emitted                      No-debt

2.3 Self-admitted Technical Debt

Developers may indicate non-optimal code implementations with code comments and wait to pay off the debt later. We refer to such comments as self-admitted technical debt (SATD) [3]. Table 1 gives some examples. In the first and second examples, it is evident that they contain technical debt. Words like "TODO" and "FIXME" are exactly the distinguishing features that developers put in code comments to indicate that "there is technical debt waiting to be dealt with". In contrast, in the third example, there is no such distinctive word or phrase. We need to use natural language processing techniques to comprehend the semantic context of the sentences and identify the presence of technical debt. For self-admitted technical debt identification, many supervised methods have been applied to this task. Ren et al. [4] applied a convolutional neural network (CNN) to extract text features. Santos et al. [5,6] used the Long Short-Term Memory (LSTM) model to identify SATD. Li et al. [8] identified SATD through graph neural networks (GNN). These methods all rely on a large amount of labeled data. However, manual annotation of data is expensive and time-consuming. In this paper, we use a combination of unlabeled data and a small amount of labeled data and use semi-supervised learning to obtain better SATD identification results than supervised learning.

3 SATD-LN Model

The SATD-LN structure used in the experiments is illustrated in Fig. 1. It combines the Γ-Model of the Ladder Network with the pre-training model. Let N be the number of labeled instances, and let {(x1, y1), . . . , (xN, yN)} be the set consisting of data-label pairs (x, y). Likewise, let {xN+1, . . . , xN+M} be the set of M unlabeled instances. As illustrated in Fig. 1, the network consists of two forward passes, where the upper path in Fig. 1 indicates the noise-adding path and the lower path refers to the noise-free path. The lower path produces the clean z and y, which are given by:


Fig. 1. The SATD-LN framework used in the experiments.

z = f(h^(L)) = NB(W h^(L)),
y = φ(γ(z + β)),
h^(0) = e,
h^(l) = Tb^(l)(h^(l−1)),    (1)

where W is the weight matrix of the linear transformation f, and NB indicates batch normalization. h^(l) refers to the state representation of layer l. φ refers to an activation function, where γ and β are trainable scaling and bias parameters, respectively. Likewise, e denotes the input embedding of x with positional encoding, and Tb^(l) refers to the transformer block at layer l in the L-layer pre-trained language model. The clean path shares the mappings Tb^(l) and f with the noise-adding path. In the upper path of Fig. 1, z̃ and ỹ are generated by adding Gaussian noise n, and are defined as:

z̃ = f(h̃^(L)) + n,
ỹ = φ(γ(z̃ + β)),
h̃^(0) = e + n,
h̃^(l) = Tb^(l)(h̃^(l−1)) + n.    (2)

Given the input xn, the supervised cost Cs is calculated as the average negative log-probability of the noisy output ỹ matching the target yn:


Cs = − (1/N) Σ_{n=1}^{N} log P(ỹ = yn | xn)    (3)

Table 2. Statistics on datasets with labeled code comments.

Project      Version  Contributors  SATD   Non-SATD  Sum
Apache Ant   1.7.0    74            131    3,967     4,098
ArgoUML      0.34     87            1,413  8,039     9,452
Columba      1.4      9             204    6,264     6,468
Eclipse      2.4.1    30            2,104  4,286     6,390
Hibernate    3.3.2    8             472    2,496     2,968
JEdit        4.2      57            256    10,066    10,322
JFreeChart   1.0.19   19            209    4,199     4,408
JMeter       2.10     33            374    7,683     8,057
JRuby        1.4.0    328           622    4,275     4,897
SQuirreL     3.0.3    46            284    6,929     7,215
Total        -        691           6,069  58,204    62,275

The unsupervised denoising cost Cu is given by:

Cu = (λ / (N + M)) Σ_{n=1}^{N+M} (1/d) ||zn − NB(ẑn)||²,
ẑ = g(z̃, u),
u = NB(ỹ),    (4)

where λ is a coefficient for the unsupervised cost, and d refers to the width of the output layer. g is a linear function with respect to z̃, has its own learnable parameters [11], and reconstructs the denoised ẑ from the given z̃ and ỹ. The final cost C is given by:

C = Cs + Cu    (5)
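To make the training objective concrete, the following is a minimal PyTorch-style sketch of a Γ-model-style loss in the spirit of SATD-LN. It is a simplification under stated assumptions: noise is injected only at the top-layer representation rather than at every transformer block, the denoising function g is a plain linear layer over the corrupted output and its batch-normalized activation, and all class and variable names are placeholders rather than the authors' implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import AutoModel

class SATDLadderSketch(nn.Module):
    def __init__(self, hidden=768, num_classes=2, noise_std=0.2):
        super().__init__()
        self.encoder = AutoModel.from_pretrained("bert-base-uncased")
        self.f = nn.Linear(hidden, num_classes)            # linear transformation W
        self.nb_z = nn.BatchNorm1d(num_classes)            # NB(.) on the pre-activation
        self.nb_y = nn.BatchNorm1d(num_classes)            # NB(.) on the noisy output
        self.gamma = nn.Parameter(torch.ones(num_classes))
        self.beta = nn.Parameter(torch.zeros(num_classes))
        self.g = nn.Linear(2 * num_classes, num_classes)   # denoising function g(z~, u)
        self.noise_std = noise_std

    def forward(self, input_ids, attention_mask):
        h = self.encoder(input_ids, attention_mask=attention_mask).last_hidden_state[:, 0]
        z = self.nb_z(self.f(h))                             # clean top-layer pre-activation
        z_tilde = z + torch.randn_like(z) * self.noise_std   # corrupted path
        y_tilde = torch.tanh(self.gamma * (z_tilde + self.beta))  # tanh added in the last layer
        u = self.nb_y(y_tilde)
        z_hat = self.g(torch.cat([z_tilde, u], dim=-1))      # denoised reconstruction
        # Clean z is treated as the reconstruction target (no gradient) in this sketch.
        return z.detach(), z_hat, y_tilde

def satd_ln_loss(model, labeled_batch, unlabeled_batch, lam=1.0):
    ids_l, mask_l, labels = labeled_batch
    z_l, z_hat_l, y_tilde_l = model(ids_l, mask_l)
    c_s = F.cross_entropy(y_tilde_l, labels)                 # supervised cost C_s (Eq. 3)
    ids_u, mask_u = unlabeled_batch
    z_u, z_hat_u, _ = model(ids_u, mask_u)
    c_u = F.mse_loss(z_hat_l, z_l) + F.mse_loss(z_hat_u, z_u)  # denoising cost C_u (Eq. 4)
    return c_s + lam * c_u                                   # total cost C (Eq. 5)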

4 Experimental Setup

4.1 Data and Evaluation Metrics

The labeled data for our experiments are derived from the code comments in the dataset created by Maldonado et al. [14]. This dataset collects code comments from 10 open-source projects. These projects come from different domains, making them representative for self-admitted technical debt research on code comments. We present the statistics of the labeled data in Table 2. The unlabeled data come from the code comment dataset contributed by Guo et al. [15]. After our processing, this dataset includes 80,702 unlabeled code comments.


We utilize the F1-score as the evaluation metric. The F1-score is the harmonic mean of precision and recall. Precision refers to the proportion of true positives among all samples predicted by the model as positive. Recall refers to the proportion of all truly positive samples that the model successfully predicts as positive. The F1-score provides a balanced measure that combines both performance metrics. It is calculated using the following equation:

F1-score = (2 × precision × recall) / (precision + recall)    (6)
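For reference, the same metric can be computed directly with scikit-learn; the arrays below are made-up toy values, not results from the paper.

from sklearn.metrics import f1_score

y_true = [1, 0, 1, 1, 0, 1]   # 1 = SATD, 0 = non-SATD (toy labels)
y_pred = [1, 0, 0, 1, 0, 1]
print(f1_score(y_true, y_pred))  # precision = 1.0, recall = 0.75, F1 ≈ 0.857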

Table 3. Some hyperparameter settings in the model.

Hyperparameter                Values
Dimension of word embedding   768
Dropout                       0.5
Learning rate                 2e−5
Dropout                       0.5
Noise                         0.2

4.2 Implementation Details

We filter out some potentially useless information to improve the detection performance of our model for SATD. Following the work of Yu et al. [7], we perform the following data pre-processing operations on the dataset. We remove duplicate comments in the dataset. Then we convert all characters to lowercase and remove invalid information such as punctuation and URLs. Lastly, we remove code comments that represent revision history with dates. During the training process, we combine nine projects into one dataset and use the remaining project as the test set. The combined dataset is split into training and validation sets at a ratio of 8:2. We randomly select data from the training and validation sets according to different data volumes. Because the data is randomly selected, we conduct multiple random selections and repeat the experiments under different parameter conditions in order to reduce random error. The Adam optimizer is used during model training. We train our model on a Titan RTX graphics card. Some hyperparameter settings are listed in Table 3.
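A rough illustration of these pre-processing steps is sketched below; the regular expressions are assumptions for illustration and not the exact patterns used by the authors.

import re

def preprocess(comments):
    """Lowercase, strip URLs/punctuation, drop dated revision-history comments, de-duplicate."""
    date_pat = re.compile(r"\b\d{1,4}[-/]\d{1,2}[-/]\d{1,4}\b")
    seen, cleaned = set(), []
    for c in comments:
        if date_pat.search(c):                    # comments recording revision history with dates
            continue
        c = c.lower()
        c = re.sub(r"https?://\S+", " ", c)       # remove URLs
        c = re.sub(r"[^a-z0-9\s]", " ", c)        # remove punctuation
        c = re.sub(r"\s+", " ", c).strip()
        if c and c not in seen:                   # remove duplicate comments
            seen.add(c)
            cleaned.append(c)
    return cleaned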

4.3 Baselines

We combine the pre-trained model with the original Ladder Network for training and compare the results of the modified model with those of the original Ladder Network. Moreover, we compare our results with three other semi-supervised learning methods.


Virtual Adversarial Training (VAT) [16] – A regularization method that adds perturbations to the model to maintain smoothness.
MixText [17] – A method that first guesses low-entropy labels for the unlabeled data and then interpolates the labeled and unlabeled data.
Pseudo Labeling [18] – The model is first trained on labeled data and then predicts pseudo-labels for the unlabeled data; the pseudo-labeled data are fed back into the model as labeled data to continue training.

To verify the effectiveness of the model proposed in this paper, we also choose five mainstream supervised learning methods: Transformer [19], convolutional neural network (CNN) [20], recurrent neural network (RNN) [21], graph convolutional neural network (GCN) [22], and the GNNSI model proposed by Li et al. [8]. The reasons for choosing these methods are as follows. Transformer can handle the global and contextual relationships of text. CNN is suitable for processing the local and contextual relationships of text and has high computational efficiency. RNN is suitable for processing sequence data and capturing temporal dependencies in sequences. GCN can capture the relationships between tokens for information dissemination and aggregation. The GNNSI model is currently the best-performing supervised model for SATD identification.

Table 4. F1-score obtained by SATD-LN with and without the tanh activation function.

Project      SATD-LN  Without Tanh  With ReLU
Apache Ant   0.730    0.630         0.632
ArgoUML      0.911    0.753         0.809
Columba      0.859    0.871         0.843
Eclipse      0.728    0.728         0.645
Hibernate    0.877    0.621         0.811
JEdit        0.778    0.818         0.622
JFreeChart   0.829    0.703         0.758
JMeter       0.829    0.731         0.817
JRuby        0.849    0.789         0.708
SQuirreL     0.755    0.638         0.769
Average      0.815    0.728         0.741
Improved     -        11.95%        9.98%

5 Results and Discussions

5.1 The Results of SATD-LN

Recall that we add a tanh activation function to the existing model [13] to further enhance the expressiveness and nonlinearity of the model. We compared SATD-LN with and without the activation function using 400 randomly chosen labeled samples and 1,000 unlabeled samples. The results are shown in Table 4.

Result: As can be seen from Table 4, adding the tanh activation function improves the effectiveness of our model on the SATD identification task: the average F1-score increases by 11.95%. Among common activation functions, tanh and sigmoid are suitable for classification problems, while activation functions such as ReLU and Leaky ReLU are more suitable for regression problems. In classification problems, sigmoid is suitable for the output layer, while tanh is suitable for hidden layers, so we choose the tanh activation function here. The tanh activation function introduces a new nonlinear transformation in the last layer of the model, and this synergy may lead to improved performance on the self-admitted technical debt identification task.

5.2 The Performance Against the Number of Labeled Data

We examined the performance of SATD-LN and the Ladder Network against the number of labeled data. We want the training of the entire model to be simple and not take too much time, so that fewer resources are needed to train a model with good results. We randomly selected 1,000 unlabeled samples and studied the classification performance of SATD-LN and the Ladder Network when the amount of labeled data is 100, 200, 400, 600, and 800, respectively. We train for 50 epochs each time and repeat the training five times under each data condition to obtain the average value. We take the average F1-score over the ten projects as the evaluation metric. The experimental results are shown in Fig. 2.

Fig. 2. The left-hand side shows the F1-score obtained by the two models against the number of labeled data. The right-hand side shows the F1-score obtained by the two models against the number of unlabeled data.


Result: As can be seen from Fig. 2, when the number of labeled data reaches 400, the F1-score of the SATD-LN model exceeds 0.8. After that, as the amount of labeled data increases, the F1-score stabilizes above 80%. Our SATD-LN model, modified from the original model, shows an average increase of 20.84% in F1-score for SATD identification. The reason the improvement slows after the amount of labeled data reaches 400 may be that, when labeled data is first added, the model gains a lot of information in the initial stage; as the amount of labeled data increases, diminishing returns are observed because the model has already captured most of the data features.

5.3 The Performance Against the Number of Unlabeled Data

We conducted an experiment to examine the performance of SATD-LN and the Ladder Network against the number of unlabeled data. In the previous subsection, we found that once the labeled data reached 400, adding more labeled data did not significantly improve the classification performance, so we fixed the number of labeled samples at 400 and studied the classification performance of SATD-LN and the Ladder Network when the unlabeled data volume is 500, 1,000, 1,500, 2,000, 2,500, and 3,000, respectively. The results are shown in Fig. 2.

Result: We can see from Fig. 2 that, when the amount of labeled data is 400, the performance of the models decreases after the number of unlabeled data exceeds 1,500, and the Ladder Network drops more than SATD-LN. The F1-score of the SATD-LN model increases by an average of 16.13% for SATD identification. This shows that using SATD-LN for SATD identification requires selecting an appropriate ratio of labeled to unlabeled data. Increasing the amount of unlabeled data may introduce more noise due to different feature distributions. When the labeled data is 400, unlabeled data exceeding 1,500 degrades model performance.

Table 5. F1-score of our method and three semi-supervised learning methods for identifying SATD on ten projects.

Project      SATD-LN  VAT     MixText  Pseudo Labeling
Apache Ant   0.730    0.665   0.596    0.671
ArgoUML      0.911    0.882   0.844    0.784
Columba      0.859    0.801   0.718    0.775
Eclipse      0.728    0.693   0.602    0.706
Hibernate    0.877    0.864   0.810    0.858
JEdit        0.778    0.694   0.662    0.732
JFreeChart   0.829    0.749   0.719    0.779
JMeter       0.829    0.779   0.745    0.773
JRuby        0.849    0.834   0.760    0.821
SQuirreL     0.755    0.708   0.668    0.593
Average      0.815    0.767   0.712    0.749
Improved     -        6.26%   14.47%   8.81%

5.4 The Effectiveness of SATD-LN Against Other Semi-supervised Learning Methods

In this section, we choose three additional semi-supervised learning methods (VAT, MixText, and Pseudo Labeling) and compare their results with our model. The four models were trained using 400 labeled and 1,000 unlabeled samples randomly drawn from the dataset. The data are randomly selected five times, and the model training results are averaged. The evaluation metric is the F1-score on the ten projects. The experimental results are shown in Table 5.

Result: It can be seen from Table 5 that SATD-LN is superior to VAT, MixText, and Pseudo Labeling in identifying SATD, with F1-score improvements of 6.26%, 14.47%, and 8.81%, respectively. One possible reason is that the Ladder Network focuses on hierarchical feature learning. Combined with the pre-training model BERT, it can perform multi-layer feature extraction, which helps the model capture features of different layers.

5.5 The Effectiveness of SATD-LN Against Other Supervised Learning Methods

Since the amount of data used by the model in this paper is far less than that used in the related supervised learning models, we refer to the work of Li et al. [8] for the training results of the supervised learning models in this section. We use the same experimental method and settings as in the previous subsection to compare the F1-scores of the different models. The experimental results are shown in Table 6.

Result: It can be seen from Table 6 that SATD-LN is superior to GNNSI, Transformer, CNN, RNN, and GCN in identifying SATD, with F1-score improvements of 2.39%, 16.95%, 21.86%, 22.59%, and 97.57%, respectively. Our model exceeds the average F1-score of GNNSI, previously the best-performing method, while using only 400 labeled samples. We introduce the Ladder Network model structure to reduce the need for labeled data, and we add the pre-training model, which we believe contributes significantly to the performance by helping extract effective information even from a small number of samples.


Table 6. F1-score of our method and five supervised learning methods in identifying SATD on ten projects.

Project      SATD-LN  GNNSI   Transformer  CNN     RNN     GCN
Apache Ant   0.730    0.658   0.528        0.528   0.500   0.240
ArgoUML      0.911    0.894   0.853        0.845   0.817   0.558
Columba      0.859    0.930   0.785        0.797   0.715   0.438
Eclipse      0.728    0.685   0.530        0.444   0.382   0.282
Hibernate    0.877    0.866   0.810        0.756   0.792   0.562
JEdit        0.778    0.616   0.418        0.464   0.413   0.368
JFreeChart   0.829    0.778   0.707        0.597   0.615   0.341
JMeter       0.829    0.864   0.811        0.780   0.781   0.481
JRuby        0.849    0.901   0.864        0.810   0.789   0.520
SQuirreL     0.755    0.758   0.658        0.656   0.635   0.327
Average      0.815    0.795   0.696        0.668   0.644   0.412
Improved     -        2.39%   16.95%       21.86%  22.59%  97.57%

6 Conclusions

In this paper, we are the first to introduce semi-supervised learning to SATD identification. We combine the Ladder Network with pre-training models to propose a semi-supervised learning method for SATD identification, and the results show that our method is effective. Our model achieves better results with far less data than supervised learning. In the future, we plan to explore adding more unlabeled data to further enhance SATD identification performance.

Acknowledgements. We would like to thank the anonymous reviewers for their helpful comments and suggestions. This work is supported by the Support Center for Advanced Telecommunications Technology Research (SCAT), JKA, Kajima Foundation's Support Program, JSPS KAKENHI (No. 21K12026, 22K12146 and 23H03402), and the National Natural Science Foundation of China (No. 62372145).

References

1. Cunningham, W.: The WyCash portfolio management system. ACM SIGPLAN OOPS Messenger 4(2), 29–30 (1992)
2. Zazworka, N., Shaw, M.A., Shull, F., Seaman, C.: Investigating the impact of design debt on software quality. In: Proceedings of the 2nd Workshop on Managing Technical Debt, pp. 17–23. Association for Computing Machinery, USA (2011)
3. Potdar, A., Shihab, E.: An exploratory study on self-admitted technical debt. In: Proceedings of the 2014 IEEE International Conference on Software Maintenance and Evolution, pp. 91–100. IEEE Computer Society (2014)
4. Ren, X., Xing, Z., Xia, X., Lo, D., Wang, X., Grundy, J.: Neural network-based detection of self-admitted technical debt: from performance to explainability. ACM Trans. Softw. Eng. Methodol. (TOSEM) 4(2), 1–45 (2019)
5. Santos, R.M., Junior, M.C.R., de Mendonça Neto, M.G.: Self-admitted technical debt classification using LSTM neural network. In: Latifi, S. (ed.) 17th International Conference on Information Technology–New Generations (ITNG 2020). AISC, vol. 1134, pp. 679–685. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-43020-7_93
6. Santos, R., Santos, I., Júnior, M.C., et al.: Long term-short memory neural networks and Word2Vec for self-admitted technical debt detection. In: ICEIS, vol. 2, pp. 157–165 (2020)
7. Yu, D., Wang, L., Chen, X., Chen, J.: Using BiLSTM with attention mechanism to automatically detect self-admitted technical debt. Front. Comp. Sci. 15(4), 1–12 (2021). https://doi.org/10.1007/s11704-020-9281-z
8. Li, H., Qu, Y., Liu, Y., Chen, R., Ai, J., Guo, S.: Self-admitted technical debt detection by learning its comprehensive semantics via graph neural networks. Softw. Pract. Exp. 52(10), 2152–2176 (2022)
9. de Freitas Farias, M.A., de Mendonça Neto, M.G., da Silva, A.B., Spínola, R.O.: A contextualized vocabulary model for identifying technical debt on code comments. In: 2015 IEEE 7th International Workshop on Managing Technical Debt, pp. 25–32 (2015)
10. Guo, Z., Liu, S., Liu, J., et al.: MAT: a simple yet strong baseline for identifying self-admitted technical debt. arXiv preprint arXiv:1910.13238 (2019)
11. Rasmus, A., Berglund, M., Honkala, M., et al.: Semi-supervised learning with ladder networks. In: Proceedings of the 28th International Conference on Neural Information Processing Systems, pp. 3546–3554. MIT Press, Montreal, Canada (2015)
12. Muangkammuen, P., Fukumoto, F., Li, J., et al.: Exploiting labeled and unlabeled data via transformer fine-tuning for peer-review score prediction. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 2233–2240. Association for Computational Linguistics (2022)
13. Devlin, J., Chang, M.W., Lee, K., et al.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2018)
14. da Silva, M., Shihab, E., Tsantalis, N.: Using natural language processing to automatically detect self-admitted technical debt. IEEE Trans. Softw. Eng., 1044–1062 (2017)
15. Guo, Z., Liu, S., Liu, J., et al.: How far have we progressed in identifying self-admitted technical debts? A comprehensive empirical study. ACM Trans. Softw. Eng. Methodol. (TOSEM) 30(4), 1–56 (2021)
16. Miyato, T., Dai, A.M., Goodfellow, I.J.: Adversarial training methods for semi-supervised text classification. In: Proceedings of the 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, pp. 24–26 (2017)
17. Chen, J., Yang, Z., Yang, D.: MixText: linguistically-informed interpolation of hidden space for semi-supervised text classification. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 2147–2157. Association for Computational Linguistics (2020)
18. Rizve, M.N., Duarte, K., Rawat, Y.S., Shah, M.: In defense of pseudo-labeling: an uncertainty-aware pseudo-label selection framework for semi-supervised learning. arXiv preprint arXiv:2101.06329 (2021)
19. Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30, pp. 5998–6008 (2017)
20. Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 1746–1751. Association for Computational Linguistics (2014)
21. Liu, P., Qiu, X., Huang, X.: Recurrent neural network for text classification with multi-task learning. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp. 2873–2879 (2016)
22. Yao, L., Mao, C., Luo, Y.: Graph convolutional networks for text classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 1, pp. 7370–7377 (2019)

NDGR: A Noise Divide and Guided Re-labeling Framework for Distantly Supervised Relation Extraction

Zheyu Shi1,2, Ying Mao1,2, Lishun Wang1,2, Hangcheng Li1,2, Yong Zhong1,2, and Xiaolin Qin1,2(B)

1 Chengdu Institute of Computer Applications, Chinese Academy of Sciences, Chengdu 610041, China
{shizheyu21,maoying19}@mails.ucas.edu.cn, [email protected]
2 University of Chinese Academy of Sciences, Beijing 100049, China

Abstract. Distant supervision (DS) is widely used in relation extraction to reduce the cost of annotation but suffers from noisy instances. Current approaches typically involve selecting reliable instances from the DS-built dataset for model training. However, these approaches often lead to the inclusion of numerous noisy instances or the disregard of a substantial number of valuable instances. In this paper, we propose NDGR, a novel training framework for sentence-level distantly supervised relation extraction. Initially, NDGR partitions the noisy data from the DS-built dataset by employing a Gaussian Mixture Model (GMM) to model the loss distribution. Afterwards, we utilize a guided label generation strategy to generate high-quality pseudo-labels for noisy data. By iteratively executing the processes of noise division and guided label generation, NDGR helps refine the noisy DS-built dataset and enhance the overall performance. Our method has been extensively evaluated on commonly used benchmarks, and the results demonstrate its substantial improvements in both sentence-level evaluation and noise reduction.

Keywords: Distantly Supervised · Relation Extraction · Label Noise

Supported by Sichuan Science and Technology Program (2019ZDZX0006, 2020YFQ0056), Science and Technology Service Network Initiative (KFJ-STSQYZD-2021-21-001), and the Talents by Sichuan provincial Party Committee Organization Department.

1 Introduction

Relation extraction (RE) is a fundamental task in the field of Information Extraction (IE), which aims at extracting structured relations between named entity pairs from unstructured text. Most existing methods approach this task by employing supervised training of neural networks, requiring a significant amount of manually labeled data. It is widely acknowledged that data annotation is a


laborious and time-consuming task. In order to address this challenge, Mintz et al. [19] introduced Distant Supervision (DS), an approach that automatically annotates textual data by aligning relation facts extracted from knowledge graphs with the unlabeled corpus. Regrettably, this annotation paradigm inevitably introduces noise. Hence, there is a need to explore de-noising DSRE methods to minimize the impact of noisy instances. Currently, there exist two primary approaches for reducing noise in DSRE: the bag-level method and the sentence-level method. The bag-level methods [12,16,25,28,29] are based on Multi-Instance Learning (MIL), and both the training and testing processes are performed at the bag level. While bag-level approaches are effective in mitigating the influence of noisy data, they do not assign specific labels to each sentence within the bag. Additionally, these approaches overlook cases where all sentences in the bag are false positive samples [23]. These limitations hinder the application of RE in downstream tasks that necessitate sentence-level relation types. Therefore, over the past few years, there has been a growing interest in sentence-level DSRE methods. Most existing sentence-level DSRE methods [4,8,11,22,31] employ adversarial learning, reinforcement learning, or frequent patterns to filter out noisy data. Although these methods are effective at handling noisy data, they have certain limitations, including reliance on prior knowledge, subjective sample construction, and access to external data. In this paper, we propose NDGR, a training framework for sentence-level DSRE. In contrast to previous methods, our method is independent of prior knowledge and external data. It can automatically identify noisy instances and re-label them during training, thereby refining the dataset and enhancing performance. Specifically, NDGR first divides the noisy instances from the DS-built data by modeling the loss distribution with a GMM. Since noisy instances contain valuable information, we consider them as unlabeled data and employ a guided label generation strategy to produce high-quality pseudo-labels, transforming them into training data. Although the above-mentioned method can improve performance, it fails to fully exploit the potential of each component. Hence, we further design an iterative training algorithm to fully refine the DS-built dataset through iterative execution of noise division and guided label generation. The main contributions of this paper are as follows:
– We propose the use of the Gaussian Mixture Model to model the data loss distribution for sentence-level distantly supervised RE, which effectively separates noisy data from DS-built data.
– We design a guided label generation strategy, with the aim of generating high-quality pseudo labels to avoid the gradual drift problem in the distribution of sentence features.
– We develop NDGR, a sentence-level DSRE training framework, which combines noise division, guided label generation, and iterative training to refine DS-built data.


– Our proposed method achieves a substantial improvement over previous sentence-level DSRE methods on widely used datasets, in terms of both relation extraction ability and noise filtering ability.

2 Related Work

To address the issue of insufficient annotated data in relation extraction, Mintz et al. [19] first aligned an unlabeled text corpus with structured data to automatically annotate data. While this method has the capability to autonomously label data, it inevitably introduces wrongly labeled instances. To minimize the impact of mislabeled data, Riedel et al. [25] relaxed the basic assumption of DS to the At-Least-One assumption and applied Multi-Instance Learning [10,25] to the task of DSRE. In MIL, all sentences with the same entity pair are put into a bag, and it is assumed that at least one sentence in the bag expresses the relation. Existing bag-level approaches have mainly focused on mitigating the impact of potentially noisy sentences within the bag. Some methods [9,14,16,28] utilize the attention mechanism to assign different weights to the sentences in the bag. These approaches aim to enhance the impact of accurate sentences while mitigating the influence of erroneous ones. Other approaches use reinforcement learning or adversarial learning [8,24,26,29] to select clean sentences from the bag to train the model. Nevertheless, recent research [4] indicates that bag-level DSRE methods have a limited effect on sentence-level prediction. Besides, bag-level methods are unable to assign a specific label to each sentence in the bag and disregard cases where all sentences in the bag are noisy samples. Thus, sentence-level distantly supervised relation extraction has received increasing attention in recent years. Sentence-level DSRE methods typically employ sampling strategies to filter noisy data. Jia et al. [11] refined the DS dataset by identifying frequently occurring relation patterns. Ma et al. [18] utilized complementary labels to create negative samples and employed negative training to filter out noisy data. Li et al. [15] incorporated a small amount of external reliable data in their training process through meta-learning. Adjusting the loss function [5,7] is also a commonly used method to reduce the impact of noisy samples. Despite the effectiveness of these methods in handling noisy data, they possess certain potential limitations, such as reliance on prior knowledge, subjectivity in sample construction, and access to external data. Different from previous work, our method iteratively executes noise division and guided label generation to refine the DS-built data, independent of external data and prior knowledge.

3 Methodology

In this section, we introduce NDGR, a novel training framework for sentence-level DSRE. As shown in Fig. 1, our method comprises three main steps: (1) Divide noisy data from the DS-built dataset by modeling the loss distribution with a GMM (Sect. 3.1); (2) Generate high-quality pseudo-labels for unlabeled data using a guided label generation strategy (Sect. 3.2); (3) Iterative training based on (1) and (2) to further refine the DS-built dataset (Sect. 3.3).

Fig. 1. An overview of the NDGR. There are three main steps: (1) Divide noisy data by employing a GMM to model the distribution of losses; (2) Generate high-quality pseudo labels for unlabeled data by guided label generation strategy; (3) Iterative training to further strengthen performance.

3.1 Noise Division by Loss Modeling

Previous research suggests that deep neural networks adapt to clean data faster than to noisy data [1], so the loss values of clean samples tend to be lower than those of noisy samples [2]. Hence, we attempt to separate noisy instances from the DS-built dataset based on the loss value of each sample. Inspired by Li et al. [13], we obtain the clean probability of each sample by fitting a Gaussian Mixture Model (GMM) [21] to the sample loss distribution.

Formally, we denote the input DS-built dataset as $D = \{(S, Y)\} = \{(s_i, y_i)\}_{i=1}^{N}$, where $y_i \in \{1, \dots, C\}$ is the class label for the $i$-th input sentence $s_i$. In the initial stage of our method, we warm up the Relation Extraction Network (REN) on all DS-built data for a few epochs to get the initial loss distribution. Specifically, for a model with parameters $\theta$, we have:

$$L(\theta) = \sum_{i=1}^{N} l_i = \sum_{i=1}^{N} \mathrm{loss}(p_i, y_i) \quad (1)$$

$$p_i = M_{\theta}(s_i) \quad (2)$$

where $p_i$ is the probability distribution over relations. $M_{\theta}$ consists of an encoder module, which converts the input sentence $s_i$ into a sentence representation $h$, and a fully connected layer that uses $h$ for classification. However, we found that the network quickly overfits the noisy data during the warm-up phase, which results in most samples having similar normalized loss values close to zero (as shown in Fig. 2(a)). In this case, it is difficult for the

GMM to distinguish the clean and noisy samples based on the loss distribution. To address this issue, we penalize confident output distributions by adding the negative entropy $-H$ to the cross-entropy loss during training [20]:

$$H = -p_i \log(p_i) \quad (3)$$

The whole loss function is shown in Eq. 4:

$$L(\theta) = \sum_{i=1}^{N} \left\{ \mathrm{loss}(p_i, y_i) + p_i \log(p_i) \right\} \quad (4)$$

After maximizing the entropy, the normalized loss is distributed more evenly (as shown in Fig. 2(b)) and is easier to model with a GMM. We then apply the Expectation-Maximization (EM) algorithm to fit a two-component Gaussian Mixture Model [21]. For each sample, we calculate the posterior probability $p(g \mid l_i)$ as its clean probability $c_i$, where $g$ is the Gaussian component with the smaller mean. A threshold $Th$ is established to partition the training data into two separate sets: samples with $c_i$ greater than $Th$ are assigned to a clean labeled set $X$, and the rest are assigned to a noisy unlabeled set $U$.

Fig. 2. Loss distribution when training on Noisy-TACRED. (a) Training with the standard cross-entropy loss function may lead to overfitting and overly confident predictions. (b) Adding a negative entropy term to the cross-entropy loss spreads the normalized loss more evenly. (c) After NDGR training, the clean and noisy data are further separated.
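To make this division step concrete, the following is a minimal sketch (assuming PyTorch and scikit-learn; the function and variable names are illustrative, not taken from the authors' released code) of computing per-sample confidence-penalized losses (Eq. 4) and splitting the data with a two-component GMM:

```python
import torch
import torch.nn.functional as F
import numpy as np
from sklearn.mixture import GaussianMixture

def penalized_losses(logits: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Per-sample cross-entropy plus the confidence penalty of Eq. 4 (i.e., minus the entropy)."""
    log_probs = F.log_softmax(logits, dim=-1)
    probs = log_probs.exp()
    ce = F.nll_loss(log_probs, labels, reduction="none")   # loss(p_i, y_i)
    neg_entropy = (probs * log_probs).sum(dim=-1)           # sum_c p_i log(p_i)
    return ce + neg_entropy

def divide_by_gmm(losses: np.ndarray, threshold: float = 0.7):
    """Fit a two-component GMM to normalized losses and return boolean masks (clean, noisy)."""
    norm = (losses - losses.min()) / (losses.max() - losses.min() + 1e-8)
    gmm = GaussianMixture(n_components=2, max_iter=100)
    gmm.fit(norm.reshape(-1, 1))
    # Posterior probability of the component with the smaller mean = clean probability c_i.
    clean_component = int(np.argmin(gmm.means_.ravel()))
    clean_prob = gmm.predict_proba(norm.reshape(-1, 1))[:, clean_component]
    return clean_prob > threshold, clean_prob <= threshold
```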

3.2 Guided Label Generation and Exploitation

Following the isolation of noisy data from the dataset, the majority of preceding research employed exclusively clean data for training purposes, thereby overlooking the valuable information embedded in the noisy data. However, proper handling of noisy data can effectively improve the performance of the model. In this section, we introduce the guided label generation strategy, aimed at generating high-quality pseudo-labels for noisy data.

To avoid the gradual drift in the distribution of sentence features caused by the noise present in the generated pseudo labels [17], we construct two networks: the Relation Extraction Network (REN), which is trained to extract relations from unstructured text, and the Label Generation Network (LGN), which has the same architecture as REN but is trained separately to generate pseudo labels for the unlabeled data. To distinguish them, we denote the parameters of REN as $\mu$ and the parameters of LGN as $\nu$. Using the updated REN as a reference, we let the LGN learn to evaluate the quality of the generated labels. To optimize $\nu$, we adopt the following loss function:

$$L(\nu) = \sum_{i=1}^{N} \left( \mathrm{loss}(M_{\mu^{+}}(s_i), y_i) + W \, \mathrm{loss}(M_{\nu}(s_i), y_i) \right) \quad (5)$$

where $W \in [0, 1]$ is a manually set hyperparameter and $\mu^{+}$ denotes the parameters of the REN after a gradient update based on the loss function defined in Eq. 4. In this way, we calculate the loss value and update $\nu$ using the updated parameters $\mu^{+}$, which helps the LGN acquire a deeper understanding of the training procedure of the REN. To prevent the accumulation of errors caused by noise in the generated labels during training, the LGN is trained on the labeled set $X$. After optimizing the LGN for a few epochs, we generate pseudo labels for the unlabeled set $U$. For each sample $u_i$ in $U$, the generated pseudo label is defined as:

$$\mathrm{label} = \arg\max(M_{\nu^{+}}(u_i)) \quad (6)$$

where $\nu^{+}$ denotes the updated parameters of the LGN. The relation with the highest probability after softmax is taken as the pseudo-label. We utilize these generated pseudo-labels by merging the re-labeled set $R$ with the labeled set $X$ to form an enhanced dataset. Subsequently, this dataset is employed to retrain the REN, leading to performance improvement.
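A minimal sketch of this guided generation step is given below (PyTorch-style, with illustrative names; the two classifiers stand in for the REN and LGN described above, under a simplified reading of Eq. 5, and do not reproduce the authors' exact implementation):

```python
import torch
import torch.nn.functional as F

def lgn_step(ren, lgn, optimizer_lgn, sentences, labels, W: float = 0.5):
    """One optimization step of the Label Generation Network (Eq. 5).

    `ren` is assumed to have just been updated on the labeled set (parameters mu+);
    its loss is computed without gradients and acts as a reference signal.
    """
    with torch.no_grad():
        ren_logits = ren(sentences)                      # M_{mu+}(s_i), reference only
    ren_loss = F.cross_entropy(ren_logits, labels)
    lgn_loss = F.cross_entropy(lgn(sentences), labels)   # M_{nu}(s_i)
    loss = ren_loss + W * lgn_loss                       # Eq. 5 (ren_loss is constant w.r.t. nu)
    optimizer_lgn.zero_grad()
    loss.backward()
    optimizer_lgn.step()

@torch.no_grad()
def generate_pseudo_labels(lgn, unlabeled_sentences):
    """Eq. 6: the class with the highest probability becomes the pseudo-label."""
    probs = F.softmax(lgn(unlabeled_sentences), dim=-1)
    return probs.argmax(dim=-1)
```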

3.3 Iterative Training Algorithm

While dividing noisy data by modeling the loss distribution with a GMM and re-labeling the noisy data with the guided label generation strategy can refine the dataset and enhance performance, this alone fails to fully exploit the potential of each component. Therefore, we employ iterative training to further enhance performance. As shown in Fig. 1, in each iteration we first divide the DS-built data into labeled and unlabeled data by modeling the data loss distribution with a GMM (before the first iteration, we warm up the model to get the initial normalized loss distribution). Following M epochs of REN training using labeled data, we leverage the updated REN as a reference to optimize the LGN. Subsequently, the updated LGN is employed to generate pseudo-labels for the unlabeled data. By merging the labeled and re-labeled data, a new

refined dataset is formulated. Prior to retraining the REN with the refined dataset, a model re-initialization step is executed to avoid overfitting. This process ensures that the models are optimized on a high-quality dataset and incorporates randomness, thereby improving the robustness of our method. Finally, we feed the original DS-built data into the updated REN to obtain the new normalized loss distribution for fitting the GMM, and re-initialize both the REN and the LGN to enter the next iteration. Figure 2(c) shows the loss distribution after NDGR training. As seen, there is a substantial margin in loss values between most of the noisy data and the clean data. Most of the noisy data has been successfully separated, with only an acceptable amount of clean data being misclassified, demonstrating the robust de-noising ability of NDGR.
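The overall iteration can be summarized by the following schematic loop. It is a sketch only: a logistic-regression classifier on pre-extracted features stands in for the BiLSTM-based REN and LGN, and all names are ours rather than the authors'.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.mixture import GaussianMixture

def ndgr_iterations(features, labels, n_iters=8, threshold=0.7, seed=0):
    """Schematic NDGR loop: divide by GMM, retrain on clean data, pseudo-label the rest, repeat."""
    current_labels = labels.copy()
    model = None
    for _ in range(n_iters):
        # (1) Noise division: per-sample losses -> two-component GMM -> clean/noisy split.
        model = LogisticRegression(max_iter=200).fit(features, current_labels)
        probs = model.predict_proba(features)
        cols = np.searchsorted(model.classes_, current_labels)
        losses = -np.log(probs[np.arange(len(current_labels)), cols] + 1e-12)
        norm = (losses - losses.min()) / (losses.ptp() + 1e-12)
        gmm = GaussianMixture(n_components=2, random_state=seed).fit(norm[:, None])
        clean_comp = int(np.argmin(gmm.means_.ravel()))
        clean = gmm.predict_proba(norm[:, None])[:, clean_comp] > threshold

        # (2) Re-initialize, retrain on the clean (labeled) set, and pseudo-label the rest.
        model = LogisticRegression(max_iter=200).fit(features[clean], current_labels[clean])
        pseudo = model.predict(features[~clean])

        # (3) Merge labeled and re-labeled data into the refined dataset for the next round.
        current_labels = current_labels.copy()
        current_labels[~clean] = pseudo
    return model, current_labels
```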

4 Experiments and Analysis

To evaluate the efficacy of the proposed method, we divide the experiments into two parts and conduct tests on two datasets. (1) The first part validates the effectiveness of our proposed method under sentence-level evaluation. Numerous previous DSRE approaches employ a held-out evaluation, where both the training and test sets are constructed using the DS method. However, Gao et al. [6] point out that a held-out test set cannot accurately reflect the model's performance. Hence, we use a manually labeled test set to evaluate the model. (2) In the second part, a series of experiments is designed to evaluate the efficacy of each component of NDGR. Since a DS-built dataset does not indicate whether an instance is mislabeled, we construct a noisy dataset called Noisy-TACRED from a manually labeled dataset.

4.1 Datasets

We evaluate our method on two widely-used datasets: the NYT dataset and the Noisy-TACRED dataset.

NYT: Riedel et al. [25] constructed this dataset by aligning the New York Times corpus with entity-relation triples from Freebase. The original training and test sets are both built with the DS method and therefore contain noisy data. For a more precise evaluation, we use the original training set together with a manually annotated sentence-level test set [11].

Noisy-TACRED: The original TACRED dataset, constructed by Zhang et al. [33], comprises 80% of instances labeled as "NA". This "NA" rate is similar to that of the NYT dataset built by the DS method, so analysis on this dataset is more reliable. To create the Noisy-TACRED dataset, we randomly select noisy instances with a noise ratio of 30%. For each noisy instance, a noisy label is assigned by randomly selecting a label from a complementary class. The selection probability of a label is determined by its class frequency, which helps preserve the original label distribution.
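A small sketch of this noise-injection procedure (with illustrative names; the authors' exact sampling details may differ) could look as follows:

```python
import numpy as np

def build_noisy_tacred(labels: np.ndarray, noise_ratio: float = 0.3, seed: int = 0) -> np.ndarray:
    """Flip a fraction of labels to a complementary class, sampled by class frequency."""
    rng = np.random.default_rng(seed)
    classes, counts = np.unique(labels, return_counts=True)
    freq = counts / counts.sum()
    noisy_labels = labels.copy()
    noisy_idx = rng.choice(len(labels), size=int(noise_ratio * len(labels)), replace=False)
    for i in noisy_idx:
        # Complementary classes: every class except the true one, weighted by class frequency.
        mask = classes != labels[i]
        p = freq[mask] / freq[mask].sum()
        noisy_labels[i] = rng.choice(classes[mask], p=p)
    return noisy_labels
```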

4.2 Baseline Models

We compare our method with multiple strong baselines:

PCNN [29]: a bag-level method with multi-instance learning to address the wrong-label problem.
PCNN+SelATT [16]: a bag-level de-noising method that uses attention mechanisms to reduce the impact of noisy data.
PCNN+RA_BAG_ATT [27]: a bag-level method that utilizes inter-bag and intra-bag attention to alleviate noisy instances.
CNN+RL_1 [24]: a bag-level method that applies reinforcement learning to recognize false positive samples and reallocates the filtered data as negative samples.
CNN+RL_2 [4]: a sentence-level method that uses reinforcement learning to jointly train an RE model for relation extraction and a selector that filters potentially noisy samples.
ARNOR [11]: a sentence-level method that selects reliable instances by rewarding high attention scores on specific patterns.
SENT (BiLSTM) [18]: a sentence-level DSRE method that filters noisy data by negative training and performs a re-labeling process to turn noisy data into useful data.
CNN [30], BiLSTM [32], and BERT [3]: widely-used models for sentence-level relation extraction without a de-noising method.

4.3 Implementation Details

Our proposed method employs a BiLSTM as the sentence encoder. 50-dimensional GloVe vectors [16] are used as word embeddings during training. Furthermore, we incorporate 50-dimensional randomly initialized position and entity-type embeddings throughout all training phases. The hidden size of the BiLSTM is set to 256. The model is optimized using the Adam optimizer with a learning rate of 1e−5. The weight parameter W in Eq. 5 is set to 5e−1. The hyperparameters are tuned by a grid search on the validation set. When training on the NYT dataset, we first warm up the REN on all DS-built data for 4 epochs. We then perform a total of 8 iterations, with each iteration involving 15 epochs of training. The data-division threshold is set to Th = 0.7. When training on Noisy-TACRED, the REN is initially warmed up on all DS-built data for 45 epochs. After this warm-up phase, the model goes through 8 iterations of 50 epochs each, using a data-division threshold Th of 0.6 and a learning rate of 5e−4.

4.4 Sentence-Level Evaluation

Table 1 shows the results of our method and other baseline models under sentence-level evaluation. Consistent with previous methods [11,15], we calculate Micro-Precision (Prec.), Micro-Recall (Rec.), and Micro-F1 (F1) to evaluate the effectiveness of our approach. According to the results, we can observe the following.

Table 1. Results of our method and other baseline models on sentence-level evaluation. The first part of the table contains ordinary RE methods without de-noising and the second part contains distant RE methods. Results marked with "∗" are bag-level methods and results marked with "†" are sentence-level methods.

| Method | Dev Prec. | Dev Rec. | Dev F1 | Test Prec. | Test Rec. | Test F1 |
|---|---|---|---|---|---|---|
| CNN† | 38.32 | 65.22 | 48.28 | 37.75 | 64.54 | 46.01 |
| PCNN∗ | 36.09 | 63.66 | 46.07 | 36.06 | 64.86 | 46.35 |
| BiLSTM† | 36.71 | 66.46 | 47.29 | 35.52 | 67.41 | 46.53 |
| BERT† | 34.78 | 65.17 | 45.35 | 36.19 | 70.44 | 47.81 |
| PCNN+SelATT∗ | 46.01 | 30.43 | 36.64 | 45.51 | 30.03 | 36.15 |
| PCNN+RA_BAG_ATT∗ | 49.84 | 46.90 | 48.33 | 56.76 | 50.60 | 53.50 |
| CNN+RL_1∗ | 37.71 | 52.66 | 43.95 | 39.41 | 61.61 | 48.07 |
| CNN+RL_2† | 40.00 | 59.17 | 47.73 | 40.23 | 63.78 | 49.34 |
| ARNOR† | 62.45 | 58.51 | 60.36 | 65.23 | 56.79 | 60.90 |
| SENT (BiLSTM)† | 66.71 | 57.27 | 61.63 | 71.22 | 59.75 | 64.99 |
| NDGR (BiLSTM) | 74.34 | 58.46 | 65.45 | 78.30 | 56.97 | 65.95 |

(1) When trained on DS-built data without de-noising, all baseline models perform poorly. Even the highly acclaimed pre-trained language model BERT, renowned for its superior performance on sentence-level relation extraction over clean datasets, yields subpar results. This underscores the substantial impact of noisy samples on model training, particularly for pre-trained language models, which are prone to overfitting such data. (2) The bag-level methods perform poorly under sentence-level evaluation, indicating their unsuitability for downstream tasks that require precise sentence labels; it is therefore imperative to explore sentence-level DSRE methods. (3) The proposed NDGR achieves a significant improvement over previous sentence-level de-noising methods. Our implementation with a BiLSTM sentence encoder improves the F1 score by 0.96% over SENT, and exhibits significantly higher precision while maintaining comparable recall. These outcomes highlight the effectiveness of our data division and guided label generation strategies.

4.5 Analysis on Noisy-TACRED

In this section, we analyze the efficacy of the noise division and label generation processes on the Noisy-TACRED dataset.

Table 2. Model performance on Clean-TACRED and Noisy-TACRED.

| Setting | Method | Prec. | Rec. | F1 |
|---|---|---|---|---|
| Clean TACRED | BiLSTM+ATT | 67.7 | 63.2 | 65.4 |
| Clean TACRED | BiLSTM | 61.4 | 61.7 | 61.5 |
| Noisy TACRED | BiLSTM+ATT | 32.8 | 43.8 | 37.5 |
| Noisy TACRED | BiLSTM | 37.8 | 45.5 | 41.3 |
| Noisy TACRED | NDGR (BiLSTM) | 86.4 | 43.3 | 57.7 |

Evaluation on Noisy-TACRED: We trained models on Clean-TACRED and Noisy-TACRED, respectively; the results are shown in Table 2. Comparing the results on the two datasets, we find that the performance of the baseline models degrades significantly: the F1 score of BiLSTM+ATT drops by 27.9 points and that of BiLSTM by 20.2 points. By applying our proposed NDGR to the noisy data, the BiLSTM model achieves performance comparable to the model trained on clean data. This finding indicates that our method effectively mitigates the impact of incorrectly labeled data in the DS-built dataset.

Fig. 3. Experimental details of dividing data by loss modeling.

Fig. 4. The quality of the pseudo labels generated by REN and LGN.

Effects of Dividing Data by Loss Modeling: As described in Sect. 3.1, we employ a GMM to model the loss distribution and use the posterior probability to partition the noisy data within the DS-built dataset. To demonstrate the efficacy of GMM-based loss modeling in distinguishing clean from noisy data, we first evaluate the quality of the labeled data: we train the model only on the labeled data and ignore the unlabeled data, on both NYT and Noisy-TACRED. Additionally, we compute the ratio of clean data within the labeled set and of noisy data within the unlabeled set on Noisy-TACRED.

The results are shown in Fig. 3. We can observe that: 1) The F1 score increases gradually with each iteration on both NYT and Noisy-TACRED, suggesting an improvement in the quality of the labeled data. 2) As the iterations progress, the ratio of clean data in the labeled set and the ratio of noisy data in the unlabeled set both increase. The proportion of clean data reaches approximately 85%, while the proportion of noisy data amounts to around 51%. These observations confirm the effective noise-filtering capability of GMM-based loss modeling on the DS-built dataset.

Effects of Guided Pseudo-Label Generation: As described in Sect. 3.2, once the noisy data has been separated from the training dataset, we employ the guided label generation strategy to convert the unlabeled data into valuable training data. To verify the effectiveness of this strategy, we separately use the REN and the LGN to generate pseudo labels and calculate the F1 score between the generated labels and the original TACRED labels; the results are shown in Fig. 4. As seen, the pseudo-labels generated directly by the REN are inferior to those generated by the LGN. This demonstrates that our label generation strategy reduces noise in the generated labels and improves performance.

Table 3. Ablation study on the NYT dataset.

| Components | Prec. | Rec. | F1 |
|---|---|---|---|
| NDGR | 78.30 | 56.97 | 65.95 |
| w/o Guided Label Generation | 58.31 | 64.09 | 61.06 |
| w/o Re-initialization | 55.11 | 63.47 | 58.99 |
| w/o Noise Division | 45.11 | 69.97 | 54.85 |
| w/o Confidence Penalty | 47.64 | 62.54 | 54.08 |

4.6 Ablation Study

We conduct an ablation study on the NYT dataset to demonstrate the contribution of each component of our proposed method. We specifically assess performance after removing individual components, including guided label generation, re-initialization, noise division, and the confidence penalty. The results are shown in Table 3, from which we observe: 1) Without the guided label generation strategy, i.e., using the REN to generate labels directly instead of the LGN, the noise in the generated labels results in error accumulation over iterations, causing a progressive drift in the distribution of sentence features and thereby harming overall performance. 2) Re-initialization contributes greatly to performance. In the label generation phase, despite training the REN and LGN on labeled data to mitigate the impact of noise, it is inevitable that some noisy data will

become incorporated and the models will adapt to them. With re-initialization, the REN resets the overfitted parameters and retrains on the refined dataset, thus contributing to better performance. 3) Noise division significantly impacts performance. Without noise division, we use the original DS-built data to optimize the REN and LGN and then generate pseudo-labels for all DS-built data. However, due to the presence of noise during training, the quality of these labels diminishes, resulting in inferior performance; moreover, performance further deteriorates as the iterations progress. 4) The confidence penalty also contributes substantially to performance. In its absence, most normalized loss values are similar and tend toward zero, making it challenging for the GMM to filter out noisy samples based on the loss distribution. The resulting mislabeled data significantly affects the subsequent label generation process, degrading the quality of the newly constructed training dataset.

5 Conclusion

In this paper, we propose NDGR, a novel sentence-level DSRE training framework that combines noise division, guided label generation, and iterative training. Specifically, NDGR first separates noisy data from the training dataset by modeling the data loss distribution with a GMM. Next, it assigns pseudo-labels to the unlabeled data using a guided label generation strategy that reduces the noise in the generated pseudo-labels. Through iterative execution of noise division and guided label generation, NDGR refines the noisy DS-built data and enhances performance. Extensive experiments on widely-used benchmarks demonstrate that our method brings significant improvements in both sentence-level relation extraction and de-noising.

Acknowledgements. This work was supported by Sichuan Science and Technology Program (2019ZDZX0006, 2020YFQ0056), Science and Technology Service Network Initiative (KFJ-STS-QYZD-2021-21-001), and the Talents by Sichuan provincial Party Committee Organization Department.

References 1. Arpit, D., et al.: A closer look at memorization in deep networks. In: International Conference on Machine Learning, pp. 233–242. PMLR (2017) 2. Chen, P., Liao, B.B., Chen, G., Zhang, S.: Understanding and utilizing deep neural networks trained with noisy labels. In: International Conference on Machine Learning, pp. 1062–1070. PMLR (2019) 3. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota, June 2019, pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/N19-1423. https://aclanthology. org/N19-1423

4. Feng, J., Huang, M., Li, Z., Yang, Y., Zhu, X.: Reinforcement learning for relation classification from noisy data (2018) 5. Fu, B., Peng, Y., Qin, X.: Learning with noisy labels via logit adjustment based on gradient prior method. Appl. Intell. 53, 24393–24406 (2023). https://doi.org/ 10.1007/s10489-023-04609-1 6. Gao, T., et al.: Manual evaluation matters: reviewing test protocols of distantly supervised relation extraction. arXiv preprint arXiv:2105.09543 (2021) 7. Ghosh, A., Kumar, H., Sastry, P.S.: Robust loss functions under label noise for deep neural networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31 (2017) 8. Han, X., Liu, Z., Sun, M.: Denoising distant supervision for relation extraction via instance-level adversarial training. arXiv preprint arXiv:1805.10959 (2018) 9. Han, X., Liu, Z., Sun, M.: Neural knowledge acquisition via mutual attention between knowledge graph and text. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018) 10. Hoffmann, R., Zhang, C., Ling, X., Zettlemoyer, L., Weld, D.S.: Knowledge-based weak supervision for information extraction of overlapping relations. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, June 2011, pp. 541–550. Association for Computational Linguistics (2011). https://aclanthology.org/P111055 11. Jia, W., Dai, D., Xiao, X., Wu, H.: ARNOR: attention regularization based noise reduction for distant supervision relation classification. In: Meeting of the Association for Computational Linguistics (2019) 12. Jiang, X., Wang, Q., Li, P., Wang, B.: Relation extraction with multi-instance multi-label convolutional neural networks. In: Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers, COLING 2016, pp. 1471–1480 (2016) 13. Li, J., Socher, R., Hoi, S.C.: DivideMix: learning with noisy labels as semisupervised learning. arXiv preprint arXiv:2002.07394 (2020) 14. Li, Y., et al.: Self-attention enhanced selective gate with entity-aware embedding for distantly supervised relation extraction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 8269–8276 (2020) 15. Li, Z., et al.: Meta-learning for neural relation classification with distant supervision. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 815–824 (2020) 16. Lin, Y., Shen, S., Liu, Z., Luan, H., Sun, M.: Neural relation extraction with selective attention over instances. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 2124–2133 (2016) 17. Liu, S., Davison, A., Johns, E.: Self-supervised generalisation with meta auxiliary learning. In: Advances in Neural Information Processing Systems, vol. 32 (2019) 18. Ma, R., Gui, T., Li, L., Zhang, Q., Zhou, Y., Huang, X.: SENT: sentence-level distant relation extraction via negative training. arXiv preprint arXiv:2106.11566 (2021) 19. Mintz, M., Bills, S., Snow, R., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 1003–1011 (2009)

20. Pereyra, G., Tucker, G., Chorowski, J., Kaiser, Ł., Hinton, G.: Regularizing neural networks by penalizing confident output distributions. arXiv preprint arXiv:1701.06548 (2017) 21. Permuter, H., Francos, J., Jermyn, I.: A study of Gaussian mixture models of color and texture features for image classification and segmentation. Pattern Recogn. 39(4), 695–706 (2006) 22. Qin, P., Xu, W., Wang, W.Y.: Robust distant supervision relation extraction via deep reinforcement learning. In: Meeting of the Association for Computational Linguistics (2018) 23. Qin, P., Xu, W., Wang, W.Y.: DSGAN: generative adversarial training for distant supervision relation extraction. arXiv preprint arXiv:1805.09929 (2018) 24. Qin, P., Xu, W., Wang, W.Y.: Robust distant supervision relation extraction via deep reinforcement learning. arXiv preprint arXiv:1805.09927 (2018) 25. Riedel, S., Yao, L., McCallum, A.: Modeling relations and their mentions without labeled text. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010. LNCS (LNAI), vol. 6323, pp. 148–163. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15939-8_10 26. Shang, Y., Huang, H.Y., Mao, X.L., Sun, X., Wei, W.: Are noisy sentences useless for distant supervised relation extraction? In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 8799–8806 (2020) 27. Ye, Z.X., Ling, Z.H.: Distant supervision relation extraction with intra-bag and inter-bag attentions. arXiv preprint arXiv:1904.00143 (2019) 28. Yuan, Y., et al.: Cross-relation cross-bag attention for distantly-supervised relation extraction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 419–426 (2019) 29. Zeng, D., Liu, K., Chen, Y., Zhao, J.: Distant supervision for relation extraction via piecewise convolutional neural networks. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1753–1762 (2015) 30. Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J.: Relation classification via convolutional deep neural network. In: Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers, COLING 2014, pp. 2335–2344 (2014) 31. Zeng, X., He, S., Liu, K., Zhao, J.: Large scaled relation extraction with reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32 (2018) 32. Zhang, S., Zheng, D., Hu, X., Yang, M.: Bidirectional long short-term memory networks for relation classification. In: Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, pp. 73–78 (2015) 33. Zhang, Y., Zhong, V., Chen, D., Angeli, G., Manning, C.D.: Position-aware attention and supervised data improve slot filling. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, pp. 35–45 (2017). https://nlp.stanford.edu/pubs/zhang2017tacred.pdf

Customized Anchors Can Better Fit the Target in Siamese Tracking

Shuqi Pan1, Canlong Zhang1,2(B), Zhixin Li1,2, Liaojie Hu3, and Yanfang Deng1

1 Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, China [email protected], [email protected]
2 Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, China
3 The Experimental High School Attached to Beijing Normal University, Beijing, China

Abstract. Most existing Siamese trackers rely on a few fixed anchors to estimate the scale and aspect ratio of all targets. However, in real tracking, different targets have different sizes and shapes, and these predefined anchors cannot cover all possible scales and aspect ratios caused by various movements and deformations, so an adaptive scale and aspect ratio estimation method is needed for robust online tracking. In this paper, a customized anchor generation module is first proposed to estimate the shape of the target and generate customized anchors adapted to it. Then, through an anchor adaptation module, each anchor's information is embedded into the corresponding feature to learn more discriminative features. Finally, we design a target-aware feature correlation module to reduce the interference of background information. It takes the region of interest of the template as a variable template and its central subregion as a central template, and then performs global and local correlation operations, respectively. Experiments on benchmarks including OTB100, VOT2019, LaSOT, UAV123, and VOT2018 show that our tracker achieves promising performance.

Keywords: Visual tracking · Feature adaptation · Feature correlation

1 Introduction

As a popular computer vision technology, target tracking has been widely applied in security monitoring, intelligent transportation, autonomous driving, human-computer interaction, and other fields. Designing robust target tracking algorithms is still challenging due to occlusion, image blurring, deformation, and background interference. Siamese tracking is currently the most popular tracking framework. Bertinetto et al. [1] were the first to apply a fully convolutional Siamese network to perform correlation matching between the search image and the template image. After that, many improved Siamese trackers have been proposed.

SiamRPN [13] introduced a region proposal network (RPN) into Siamese networks. The RPN maps anchors with fixed aspect ratios onto image features for classification and regression via convolutional networks. To handle image data with different scales and aspect ratios, such anchors must be designed from heuristic prior knowledge. However, in real videos, the scale and aspect ratio of the target vary with the target and the camera, so fixed anchors reduce the robustness of the tracker. SiamRPN++ [12] employs a fixed-size template to calculate the similarity between the template and the search frame. However, this fixed feature region may retain a lot of background information or lose a large amount of foreground information when the tracked target has a different scale, so the tracker cannot effectively resist background interference.

To address the above issues, and inspired by [19], we propose a customized anchor proposal subnet. The customized anchor generation module generates customized anchors based on the location and shape of the target, adapting to scale variations and making the tracker more flexible and versatile. The anchor adaptation module is introduced to embed the semantic information of the customized anchors into the correlation features: it transforms the features at each position in the feature map according to the shape of the anchor, so that the feature map adapts to the anchor's shape.

Fig. 1. Ground-truth bounding boxes of the targets are marked by red boxes. The background and foreground features in the template correspond to the blue and gray origin points. The variable template based on the target-aware mechanism is marked by a green box. It acquires the body area of the target under the supervision of the bounding box. The fixed template features cropped out by fixed regions are marked by yellow boxes, which may contain a lot of background information or missing foreground information. (Color figure online)

To reduce the background information in the template, we generate variable templates based on a target-aware mechanism, as shown in Fig. 1. We design a target-aware feature correlation module for computing feature correlations. The variable template, which adapts to the target, and the search features undergo self-attention and cross-attention computations to obtain global features, and depth-wise correlation is fused in to further capture local spatial features. This effectively suppresses the interference of background information and reduces semantic ambiguity in the attention mechanism, achieving robust tracking.

In summary, the contributions of this work are as follows:
– We propose a Siamese tracking network based on customized anchors. Instead of designing fixed-scale anchors, customized anchors are

generated from the correlation features between the template and the search frame, which makes the tracker more adaptable to shape changes of the target during tracking.
– We propose a structure that aggregates local and non-local features for the cross-correlation operation, which helps the tracker resist background interference during tracking.
– Evaluation results on five datasets show that our tracker significantly improves the performance of the baseline tracker.

Fig. 2. The network architecture of our framework consists of two main subnets: a siamese feature extraction subnet for feature extraction and a customized anchor proposal subnet. The customized anchor proposal subnet is divided into a customized anchor generation module (CAGM) and an anchor adaptation module (AAM), a classification branch and a regression branch.

2 Proposed Method

2.1 Overall Framework

Figure 2 shows the general framework of our proposed SiamCAFT. We take the template z in the initial frame and the search region x in the current frame as the inputs to the network. The features of z and x are extracted by a Siamese feature extraction subnet consisting of two ResNet-50 networks with shared weights. The customized anchor proposal subnet consists of a classification branch and a regression branch for refining the corresponding customized anchors.

2.2 Target-Aware Feature Correlation Module

We use the target-aware feature correlation module (TACM) to compute the correlation between the search features and the template features. Given the template features Ft and the search features Fs, to reduce the interference of background information in the template, we obtain the variable template Fv through

a target-aware mechanism under the supervision of the template bounding box Bt. As shown in Fig. 3, we treat each 1 × 1 × c position in a feature map as a token; Ts and Tv denote the sets of all feature tokens in Fs and Fv, respectively. $h_s^i$ denotes the feature vector of token $i \in T_s$ and $h_v^j$ denotes the feature vector of token $j \in T_v$. We simply use the inner product between feature vectors as the similarity measure. The attention weights are calculated as follows:

$$\left[w_s^i, w_v^i\right] = \mathrm{Softmax}\!\left(\frac{[K_s, K_v] \cdot q_s^i}{\sqrt{d}}\right) \quad (1)$$

where $q_s^i$ denotes the query vector of token $h_s^i$, $K_s$ denotes the key matrix corresponding to the search features Fs, and $K_v$ denotes the key matrix corresponding to the variable template features Fv. The cross-attention weight $w_v^i$ measures the similarity between the search token $h_s^i$ and the variable template tokens, while the self-attention weight $w_s^i$ measures the similarity between $h_s^i$ and all search-region tokens; the j-th entry of $w_s^i$ (1 ≤ j ≤ n, where n is the number of search-area tokens) gives the similarity between $h_s^i$ and the j-th token of Ts. The cross-attention between Fs and Fv allows us to better discriminate the foreground from the background, and the self-attention over Fs helps to distinguish distractors in the search image. We represent the self-attention and cross-attention aggregated features as:

$$h_{self}^i = w_s^i \cdot V_s, \qquad h_{cross}^i = w_v^i \cdot V_v \quad (2)$$

where $V_s$ and $V_v$ denote the value matrices corresponding to the search region and the variable template region, respectively. We fuse the attention-aggregated features with the search token $h_s^i$ to obtain a more powerful feature representation empowered by the target information:

$$\hat{h}^i = \mathrm{ReLU}\!\left(h_s^i \,\|\, h_{self}^i \,\|\, h_{cross}^i\right) \quad (3)$$

where $\|$ denotes vector concatenation. The feature vectors at all locations in the feature map are computed in parallel to generate a global attention feature map for subsequent tasks. To provide more detailed target information and increase the diversity of the template features, we obtain the central template Fc by center-cropping Ft; performing DW-Corr [12] between Fc and the search features Fs further captures the local spatial context and reduces the semantic ambiguity of the attention mechanism:

$$P_{local} = \varphi\left[F_s\right] \star \varphi\left[F_c\right] \quad (4)$$

where $\star$ denotes the DW-Corr [12] operation with $\varphi[F_c]$ as the 3 × 3 convolution kernel, and $P_{local}$ denotes the local correlation features. Finally, the global attention features are fused with the local convolutional features to obtain a more powerful feature representation.
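The following is a minimal PyTorch-style sketch of the global (attention) part of this module, Eqs. 1-3; the tensor shapes, projection layers, and names are illustrative assumptions rather than the authors' implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TargetAwareAttention(nn.Module):
    """Aggregate self-attention (over search tokens) and cross-attention (over variable-template tokens)."""

    def __init__(self, channels: int):
        super().__init__()
        self.q = nn.Linear(channels, channels)
        self.k = nn.Linear(channels, channels)
        self.v = nn.Linear(channels, channels)

    def forward(self, search_tokens, template_tokens):
        # search_tokens: (B, Ns, C); template_tokens: (B, Nv, C)
        q_s = self.q(search_tokens)
        k_all = torch.cat([self.k(search_tokens), self.k(template_tokens)], dim=1)  # [K_s, K_v]
        v_all = torch.cat([self.v(search_tokens), self.v(template_tokens)], dim=1)  # [V_s, V_v]
        d = q_s.size(-1)
        attn = torch.softmax(q_s @ k_all.transpose(1, 2) / d ** 0.5, dim=-1)        # Eq. 1 (joint softmax)
        n_s = search_tokens.size(1)
        h_self = attn[:, :, :n_s] @ v_all[:, :n_s]                                  # Eq. 2, self-attention part
        h_cross = attn[:, :, n_s:] @ v_all[:, n_s:]                                 # Eq. 2, cross-attention part
        return F.relu(torch.cat([search_tokens, h_self, h_cross], dim=-1))          # Eq. 3, ReLU(concat)
```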

2.3 Customized Anchor Proposal Subnet

We obtain $F_t = \{F_t^3, F_t^4, F_t^5\}$ and $F_s = \{F_s^3, F_s^4, F_s^5\}$ as template and search features from the conv3, conv4, and conv5 layers of ResNet-50, respectively.

Fig. 3. Illustration of the proposed Target-aware feature correlation module. The global correlation operation is performed with the variable template Fv . Also local correlation operations are performed with central template Fc . Finally the global features and local features are aggregated for implementing downstream tasks.

Customized Anchor Generation Module. We squeeze the features Ft and Fs, which have different depths and receptive fields, to obtain $F_{at} = \mathrm{Squeeze}(F_t^3, F_t^4, F_t^5)$ and $F_{as} = \mathrm{Squeeze}(F_s^3, F_s^4, F_s^5)$. They are fed into the TACM to integrate feature information from the template and search images, and then passed through an anchor head to generate customized anchors adapted to the target:

$$P_{anch}^{w \times h \times 2} = \mathrm{conv}_{anch}\left(\mathrm{TACM}(F_{at}, F_{as})\right) \quad (5)$$

where the two channels of $P_{anch}$ represent $a_w$ and $a_h$, and the anchor head $\mathrm{conv}_{anch}$ consists of two 1 × 1 convolutional layers. Because the range of anchor heights and widths is excessively large, directly generating the customized anchor width $C_w$ and height $C_h$ is unstable; we therefore output $a_w$ and $a_h$ and map them through the following transformation:

$$C_w = \sigma \cdot s \cdot e^{a_w}, \qquad C_h = \sigma \cdot s \cdot e^{a_h} \quad (6)$$

where s is the convolution stride, which is 8, and σ is 8.

117

Fig. 4. The sketch of the AAM.

Classification and Regression. In the classification and regression task, we stratify Ft , Fs into TACM stratification to calculate the similarity, and then take its average to obtain F¯ = Avg(T ACM (Fs3 , Ft3 ), T ACM (Fs4 , Ft4 ), T ACM (Fs5 , Ft5 )). The F¯ is then split into classification and regression branches. Features with different receptive fields have different sensitivities to the shape of the target, enabling the tracker to adapt to targets of different sizes. F¯ through a cls head convcls consisting of tow 1 × 1 convolution to generate a Cls map: w×h×2 = convcls (F¯ ) (7) Pcls where the two channels of Pcls represent the probability that an anchor belongs to foreground or background. In order to obtain more accurate regression features, we will not directly output the regression results, but use the Anchor adaptation module(AAM) to guide it. AAM can encode and refine the feature of the regression branch based on the shape of the anchor, as show in Fig. 4, each anchor in the anchor map is first mapped into a 3 × 3 offset kernel after one 1 × 1 convolution layer, then the offset is embedded into the regular 3 × 3 convolution kernel (see left image) to adjust kernel (see right image) that can more accurately cover the target region. In other words, through position-shift learning guided by customized anchor, the sample points of the convolutional kernel on feature map can adaptively move to the target region, thereby adapting the regression features to the shape of the anchor, which is beneficial for bounding box regression. Finally, the feature map adjusted by AAM is then fed into the regression head to form the final regression map. The above calculation can be formulated as:   w×h×4 = convregs AAM (F¯ , aw , ah ) (8) Pregs where the four channels of Pregs represents the distance between the center point and the height and width of the customized anchor and bounding box, and regression head convregs consists of two 1 × 1 convolution layers.

118

2.4

S. Pan et al.

Training

For the classification branch, refer to SiamBAN [3]. We construct two ellipses to divide the positive and negative samples with the help of ground-truth bounding box.(gx , gy ) denotes the center of the ground truth-bounding box, and (gw , gh ) denotes the width and height of the ground-truth bounding box. With (gx , gy ), (gx , gy ) as the center and ( g4w , g4h ), ( g2w , g2h ) as the axis lengths, respectively. We can obtain the ellipses E1, E2: (pi − gx )

2

(pi − gx )

2

gw 4 gw 2

+ +

2

(pj − gy ) gh 4

2

(pj − gy ) gh 2

=1 (9) =1

If the location (pi , pj ) of the feature points falls inside the ellipse E1, the customized anchors with this location as the center point will be used as positive samples. If the location is outside the ellipse E2, these anchors far from the target center will be used as negative samples. The rest are ignored. For regression branches and predictions of customized anchors, We only train  the anchors on the positive sample positions poi , poj . Cw ,Ch denote the shape of the customized anchor. The regression objective can be formulated as: gx − poi gw , dw = ln gw Cw gy − poj gh dy = , dh = ln gw Ch dx =

(10)

We use a smooth L1 loss function to optimize the predictions of customized anchors, as: (11) Lshape = L1 (Cw − Tw ) + L1 (Ch − Th ) We define the loss function for multitasking as follows: L = λc Lcls + λr Lregs + λs Lshape

(12)

Lcls is cross-entropy loss and Lregs is GIOU loss function. For the hyperparameters (λc , λr , λs ), we set them all to 1.

3 3.1

Experiments Implementation Details

ILSVRC-VID/DET [17], COCO [15], LaSOT [6], GOT-10k [9] datasets are utilized as the training sets. We initialize our backbone networks with the weights pre-trained on ImageNet and the parameters of the last three layers will be used for training. Our network is trained with stochastic gradient descent (SGD) with a minibatch of 32 pairs. We train a total of 20 epochs, using a warmup learning

Customized Anchors Can Better Fit the Target in Siamese Tracking

119

rate of 0.001 to 0.005 in the first 5 epochs and a learning rate exponentially decayed from 0.005 to 0.00005 in the last 15 epochs. In the first 10 epochs, we only train the customized anchor guidance subnet, and fine-tune the backbone network in the last 10 epochs at one-tenth of the current learning rate. Weight decay and momentum are set as 0.0001 and 0.9. 3.2

Datasets and Evaluation Metrics

We evaluate our algorithm on the OTB100, LaSOT, VOT2018, UAV123, and VOT2019 datasets. Specifically. OTB100 [21] is a tracking dataset containing 100 fully annotated sequences, and each sequence in the dataset includes challenging attributes for tracker performance. LaSOT [6] consists of 1400 sequences with a total frame size of over 3.5M. Unlike existing datasets, LaSOT provides both visual bounding box annotations and a rich natural language specification. They measure tracker performance in terms of success rate (Succ). Both VOT2018 [10] and VOT2019 [11] contain 60 video sequences with various challenging factors. Compared to VOT2018, VOT2019 sequences were replaced by 20%. They use accuracy (A), robustness (R), and expected average overlap (EAO) to evaluate and measure the different trackers. UAV123 [16] is a new aerial video benchmark that contains 123 video sequences taken from low altitude aerial angles. They measure tracking performance in terms of accuracy(A) and precision rate(Pr), and rank precision plots based on the area under the curve. Table 1. Ablation study for SiamCAFT on VOT2019 CAGM AAM TACM DW-Corr PW-Corr EAO↑ A↑ 

3.3















0.285

0.599 0.482



0.287

0.597 0.446

0.303

0.598 0.406

0.274

0.596 0.532

0.327

0.604 0.371

  

R↓

Ablation Experiments

In this section, we perform an ablation study on the VOT2019 [11] dataset to analyze the effectiveness of the CAGM, AAM and TACM. The evaluation results are presented in Table 1. Baseline tracker uses DW-Corr for cross-correlation operation. As shown in Table 1, by introducing CAGM to generate customized anchors, the robustness(R) score improves by 7% compared to the baseline tracker, which indicates that anchors generated according to correlational feature can better describe the target in the tracking task.

120

S. Pan et al.

The introduction of the AAM to adapt the anchor information improves the robustness score by 15% and the EAO score by 6% compared to the baseline tracker, demonstrating that AAM effectively facilitates anchor localization and regression. To verify the validity of TACM, we compare it with variants which utilize DW-Corr [12] and PW-Corr [14] as correlation operations. The tracker with TACM improves the robustness score by 8.6% and the eao score by 8% compared to the tracker with DW-Corr. Compared to the tracker with PW-Corr, the robustness score improves by 30% and the EAO score improves by 19%. As show in Fig. 5(a), it is demonstrated that TACM effectively suppresses background information and learns the relationship of feature local spatial context. Overall, SiamCAFT improves the baseline tracker by 14.7%/23% in terms of EAO/R metrics. 3.4

Comparison with State-of-the-art Methods

We compared SiamCAFT with several state-of-the-art trackers on five challenging datasets, including OTB100, VOT2019, UAV123, LaSOT, and VOT2018. Table 2. Comprehensive comparisons on UAV123 experiments. The best two results are shown in bold and italic, respectively. Tracker SiamFC [1] SiamRPN [13] DaSiamRPN [24] SiamCAR [8] SiamRPN++ [12] SiamFC++ [22] ours A↑

0.485

0.557

0.569

0.614

0.613

0.623

0.631

Pr↑

0.693

0.768

0.781

0.760

0.807

0.781

0.829

UAV123. As shown in Table 2, our method obtains the highest accuracy (A) score and precision (Pr) score among all the methods. Compared with baseline tracker SiamRPN++, accuracy score improves from 0.613 to 0.631, an improvement of 3%, and precision score improves from 0.807 to 0.829, improving by 2.7%. Table 3. Comprehensive comparisons on OTB100 experiments. The best two results are shown in bold and italic, respectively. Tracker TransT [2] SiamRPN++ [12] SiamBAN [3] AiATrack [7] DropTrack [20] SiamCAR [8] ours Succ↑

69.4

69.6

69.6

69.6

69.6

69.8

70.6

OTB100. As shown in Table 3, our tracker achieves a state-of-the-art success (Succ) score of 70.6%, which is the best among the competitors. Both SiamCAR and SiamRPN++ use DW-corr for feature information correlation, which does not suppress background noise well. Therefore the tracking results of SiamCAR and SiamRPN++ are not as good as ours (Table 4).

Customized Anchors Can Better Fit the Target in Siamese Tracking

121

Table 4. Comprehensive comparisons on VOT2019 experiments. The best two results are shown in bold and italic, respectively. Tracker PACNet [23] SiamFC++ [22] ATOM [4] SiamBAN [3] SiamRPN++ [12] SiamFC [1] ours A↑

0.573

0.575

0.603

0.602

0.599

0.470

0.604

R↓

0.401

0.406

0.411

0.398

0.482

0.958

0.371

EAO↑

0.300

0.284

0.292

0.327

0.285

0.163

0.327

VOT2019. SiamCAFT achieved an EAO score of 0.327 and a robustness (R) score of 0.371, which is the best among the competitors. Specifically, SiamCAFT outperformed the baseline tracker SiamRPN++ by 4.2% in the EAO. And the robustness is better than SiamBAN with the same EAO score by 12.8%. Table 5. Comprehensive comparisons on LaSOT experiments. The best two results are shown in bold and italic, respectively. Tracker SiamFC [1] SiamRPN++ [12] SiamBAN [3] ATOM [4] SiamCAR [8] Succ↑

33.6

49.6

51.4

51.5

51.6

CGACD [5] ours 51.8

52.7

LaSOT. As show in Table 5, SiamCAFT achieved a competitive success (Succ) score of 52.7%. SiamCAFT outperformed the baseline tracker SiamRPN++ by a significant 6% in terms of success score due to its powerful target classification capability and accurate bounding box prediction (Table 6). Table 6. Comprehensive comparisons on VOT2018 experiments. The best two results are shown in bold and italic, respectively. Tracker SiamFC++ [22] SiamRPN++ [12] ATOM [4] ULAST [18] SiamCAR [8] SiamBAN [3] ours A↑

0.587

0.604

0.590

0.571

0.574

0.597

R↓

0.183

0.234

0.204

0.286

0.197

0.178

0.145

EAO↑

0.426

0.417

0.401

0.355

0.423

0.452

0.435

0.604

VOT2018. Our tracker achieves the best result in terms of a robustness (R) score of 0.145. SiamBAN performs better in terms of EAO score due to its unique regression module. It is worth pointing out that SiamCAFT outperforms SiamBan by 18% in terms of robustness metrics. CAGM effectively generates customized anchors for target localization and regression in Fig. 5(a).

122

S. Pan et al.

Fig. 5. (a) Comparison of our SiamCAFT with the state-of-the-art tracker on three challenging sequences of V0T2018. (b) Attention visualization results of the tracker on three sequences of OTB100, where the ground truth is highlighted in red. The first row shows the results derived from SiamCAFT with TACM, while the second row displays the attention results for SiamCAFT with DW-Corr. (Color figure online)

4

Conclusion

In this paper, a robust and effective online tracking method SiamCAFT is proposed, which mainly consists of a customized anchor generation module (CAGM) and an anchor adaptation module (AAM), as well as a target-aware feature correlation module (TACM). Specifically, CAGM estimates the shape of each target to obtain a more representative anchor. AAM embeds the customized anchors information into deeper features to learn more discriminative and valuable features. TACM embeds variable templates and central templates into the search features for feature interaction, fusing locally correlated features and globally correlated features. It effectively suppresses the interference of background information to have robust tracking. Experimental results on five challenging datasets show that SiamCAFT performs well against several state-of-the-art trackers. Acknowledgements. This work is supported by National Natural Science Foundation of China (Nos. 62266009, 61866004, 62276073, 61966004, 61962007), Guangxi Natural Science Foundation (Nos. 2018GXNSFDA281009, 2019GXNSFDA245018, 2018GXNSFDA294001), Guangxi Collaborative Innovation Center of Multi-source Information Integration and Intelligent Processing, Innovation Project of Guangxi Graduate Education(YCSW2023187), and Guangxi “Bagui Scholar” Teams for Innovation and Research Project.

References 1. Bertinetto, L., Valmadre, J., Henriques, J.F., Vedaldi, A., Torr, P.H.S.: Fullyconvolutional Siamese networks for object tracking. In: Hua, G., Jégou, H. (eds.) ECCV 2016. LNCS, vol. 9914, pp. 850–865. Springer, Cham (2016). https://doi. org/10.1007/978-3-319-48881-3_56

Customized Anchors Can Better Fit the Target in Siamese Tracking

123

2. Chen, X., Yan, B., Zhu, J., Wang, D., Yang, X., Lu, H.: Transformer tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8126–8135 (2021) 3. Chen, Z., Zhong, B., Li, G., Zhang, S., Ji, R.: Siamese box adaptive network for visual tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6668–6677 (2020) 4. Danelljan, M., Bhat, G., Khan, F.S., Felsberg, M.: ATOM: accurate tracking by overlap maximization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4660–4669 (2019) 5. Du, F., Liu, P., Zhao, W., Tang, X.: Correlation-guided attention for corner detection based visual tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6836–6845 (2020) 6. Fan, H., et al.: LaSOT: a high-quality benchmark for large-scale single object tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5374–5383 (2019) 7. Gao, S., Zhou, C., Ma, C., Wang, X., Yuan, J.: AiATrack: attention in attention for transformer visual tracking. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision, ECCV 2022, Part XXII. LNCS, vol. 13682. pp. 146–164. Springer, Cham (2022). https://doi.org/10.1007/978-3-03120047-2_9 8. Guo, D., Wang, J., Cui, Y., Wang, Z., Chen, S.: SiamCAR: Siamese fully convolutional classification and regression for visual tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6269– 6277 (2020) 9. Huang, L., Zhao, X., Huang, K.: GOT-10k: a large high-diversity benchmark for generic object tracking in the wild. IEEE Trans. Pattern Anal. Mach. Intell. 43(5), 1562–1577 (2019) 10. Kristan, M., et al.: The sixth visual object tracking VOT2018 challenge results. In: Leal-Taixé, L., Roth, S. (eds.) ECCV 2018. LNCS, vol. 11129, pp. 3–53. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11009-3_1 11. Kristan, M., et al.: The seventh visual object tracking VOT2019 challenge results. In: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops (2019) 12. Li, B., Wu, W., Wang, Q., Zhang, F., Xing, J., Yan, J.: SiamRPN++: evolution of Siamese visual tracking with very deep networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4282–4291 (2019) 13. Li, B., Yan, J., Wu, W., Zhu, Z., Hu, X.: High performance visual tracking with Siamese region proposal network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8971–8980 (2018) 14. Liao, B., Wang, C., Wang, Y., Wang, Y., Yin, J.: PG-Net: pixel to global matching network for visual tracking. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12367, pp. 429–444. Springer, Cham (2020). https:// doi.org/10.1007/978-3-030-58542-6_26 15. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48 16. Mueller, M., Smith, N., Ghanem, B.: A benchmark and simulator for UAV tracking. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 445–461. Springer, Cham (2016). https://doi.org/10.1007/978-3-31946448-0_27

124

S. Pan et al.

17. Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115(3), 211–252 (2015) 18. Shen, Q., et al.: Unsupervised learning of accurate Siamese tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8101–8110 (2022) 19. Wang, J., Chen, K., Yang, S., Loy, C.C., Lin, D.: Region proposal by guided anchoring. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2965–2974 (2019) 20. Wu, Q., Yang, T., Liu, Z., Wu, B., Shan, Y., Chan, A.B.: DropMAE: masked autoencoders with spatial-attention dropout for tracking tasks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14561–14571 (2023) 21. Wu, Y., Lim, J., Yang, M.H.: Online object tracking: a benchmark. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2411– 2418 (2013) 22. Xu, Y., Wang, Z., Li, Z., Yuan, Y., Yu, G.: SiamFC++: towards robust and accurate visual tracking with target estimation guidelines. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12549–12556 (2020) 23. Zhang, D., Zheng, Z., Jia, R., Li, M.: Visual tracking via hierarchical deep reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 3315–3323 (2021) 24. Zhu, Z., Wang, Q., Li, B., Wu, W., Yan, J., Hu, W.: Distractor-aware Siamese networks for visual object tracking. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11213, pp. 103–119. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01240-3_7

Can We Transfer Noise Patterns? A Multi-environment Spectrum Analysis Model Using Generated Cases

Haiwen Du1,2, Zheng Ju2, Yu An2, Honghui Du2, Dongjie Zhu3(B), Zhaoshuo Tian1, Aonghus Lawlor2, and Ruihai Dong2(B)

1 School of Astronautics, Harbin Institute of Technology, Harbin, China
2 Insight Centre for Data Analytics, Dublin, Ireland
[email protected]
3 School of Computer Science and Technology, Harbin Institute of Technology, Weihai, China
[email protected]

Abstract. Spectrum analysis systems in online water quality testing are designed to detect the types and concentrations of pollutants and enable regulatory agencies to respond promptly to pollution incidents. However, spectral data-based testing devices suffer from complex noise patterns when deployed in non-laboratory environments. To make the analysis model applicable to more environments, we propose a noise-pattern-transferring model, which takes the spectra of standard water samples in different environments as cases and learns the differences in their noise patterns, thus enabling noise patterns to be transferred to unknown samples. Unfortunately, the inevitable sample-level baseline noise makes the model unable to obtain paired data that differ only in dataset-level environmental noise. To address this problem, we generate a sample-to-sample case-base to exclude the interference of sample-level noise on dataset-level noise learning, enhancing the system's learning performance. Experiments on spectral data with different background noises demonstrate the good noise-transferring ability of the proposed method against baseline systems ranging from wavelet denoising and deep neural networks to generative models. From this research, we posit that our method can enhance the performance of DL models by generating high-quality cases. The source code is made publicly available online at https://github.com/Magnomic/CNST.

Keywords: 1D data denoising · Noise patterns transferring · Spectrum analysis · Signal processing

1 Introduction

Laser-induced fluorescence spectrum (LIFS) analysis is used to interpret spectrum data obtained via laser-induced fluorescence (LIF) spectroscopy in a specific range of wavelengths, frequencies, and energy levels [10]. Its advantages


include the high sampling frequency, high accuracy, and easy deployment, making it a popular water quality monitoring approach [3,27]. However, avoiding the impact of environmental noise on the analysis results is challenging because it changes with weather, temperature, and water impurities [16,26]. Denoising is essential in analysing spectral and other one-dimensional (1D) data, as it can improve the accuracy and reliability of subsequent analyses. Traditional methods that rely on mathematical models or filters (e.g., moving average or wavelet transformation) can reduce high-frequency or specific noise patterns while consuming few computing resources [21]. However, they cannot learn complex nonlinear relationships between the noisy signal and the corresponding clean signal. Deep learning (DL)-based denoising methods use a data-driven approach: they are trained on a large dataset to learn the underlying patterns and relationships in the data [5,13]. Nevertheless, we found limitations in this approach due to the complex noise sources and noise patterns in LIFS.

Specifically, we cannot obtain paired clean-noisy data to analyse the noise pattern between two datasets because the noise in spectral data comes from both baseline noise ξ and environmental noise N [1,20]. Environmental noise arises from backgrounds, light sources, and other factors that are stable over time, exhibiting differences at the dataset level. Baseline noise originates from random disturbances during the sampling process, such as baseline drift, which is present even in laboratory environments. Different from environmental noise, baseline noise shows differences at the sample level, i.e., every sample has different baseline noise. However, as both noise sources are low-frequency and superimposed on the signal, extracting only the environmental noise from the LIFS to train a noise-pattern-transferring model is complex. We can only feed samples with both environmental noise and baseline noise differences to the noise pattern learning model, which is inefficient.

In this paper, we propose a method to enhance the performance of denoising models based on generated cases, i.e., we generate paired samples that differ only in environmental noise. In turn, we learn the differences to achieve noise pattern transformation. Firstly, the model extracts noise patterns from a dataset-to-dataset (D2D) case-base that consists of two groups of samples from dataset DT and dataset DS. Then, we generate a sample-to-sample (S2S) case-base in which all sample pairs differ only in going from environmental noise NS to NT (NS−T). Finally, we train a noise-patterns-transferring model using the S2S case-base. The highlights are:

1. we develop a noise transfer model to transfer the environmental noise pattern NS of dataset DS into the environmental noise pattern NT of dataset DT (also denoising when DT is environmental-noise-free);
2. we propose a network structure for extracting environmental noise NS, signal X, and baseline noise ξT from a D2D case S and T, thereby generating a new sample G that has only the environmental noise pattern difference NS−T with sample T. It is the first work to enhance the learning performance of denoising models by generating new cases; and


3. we verify the contribution of our model on 1D LIFS noise-pattern-transferring tasks. It significantly improves the performance of the chemical oxygen demand (COD) parameter analysis model under different noise patterns.

In the following section, we discuss related work and present the motivations for this work. Section 3 presents the proposed model that captures noise pattern differences between signals in different datasets. In Sect. 4, we describe the experimental setup. Then, Sect. 5 presents and analyses the evaluation results. Section 6 concludes the paper and discusses future work.

2 Background and Motivation

2.1 Problems in Traditional Denoising Models

In LIF spectroscopy, noise can arise from various sources, including electronic noise in the sensors, light source fluctuations, and sample matrix variations. It can affect the LIFS analysis process by introducing artefacts, such as spurious peaks or baseline drift, that can obscure or distort spectral features associated with water quality parameters [25]. For example, baseline drift is a noise that can make it challenging to identify and quantify the fluorescence peaks accurately. Similarly, high-frequency noise can introduce spurious peaks in the spectra that can be misinterpreted as actual spectral features [18]. Therefore, avoiding the noise's influence on the analysis result is vital for researchers in analysing LIFS. Various denoising techniques can mitigate noise in LIFS, such as smoothing algorithms or wavelet transform-based methods [21]. These methods can remove part of the noise while preserving the spectral features of interest. Another popular approach is machine learning algorithms, such as DNNs [9]. Compared with traditional methods, their adaptability to various noise types and their robustness to complex scenes make DNNs an attractive option for denoising tasks.

Nevertheless, we face a dilemma in applying DNNs, which is caused by the internal noise: even in the laboratory environment, the spectral data still carries low-frequency noise because of fuzzy factors such as baseline drift. To make matters worse, two measurements of the same water sample show different internal noise patterns. For datasets DS and DT collected in different environments, when we try to transfer the environmental noise pattern of a sample S in dataset DS into the environmental noise pattern of dataset DT, the quantities are related as in Eq. 1, where S and T are the LIFSs of the same water sample X in DS and DT, NS−T is the environmental noise difference (dataset-level) between DS and DT, and ξS and ξT are the internal noise (sample-level) in samples S and T.

$$S = X + N_S + \xi_S, \qquad T = X + N_T + \xi_T, \qquad N_{S-T} = N_T - N_S \tag{1}$$


We find that it does not satisfy the relationship in Eq. 2 but rather that in Eq. 3. Then, we give examples in Fig. 1 to better illustrate the terms in the noise patterns.

$$T = S - N_{S-T} \tag{2}$$

$$T = S - N_{S-T} - \xi_S + \xi_T \tag{3}$$
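As a quick numerical illustration of Eqs. 1–3, the sketch below (with made-up signal and noise shapes, not the paper's data) builds S and T for the same underlying spectrum and shows that removing only the dataset-level environmental difference still leaves the sample-level baseline mismatch ξS − ξT:

```python
import numpy as np

rng = np.random.default_rng(0)
wavelengths = np.linspace(400, 800, 512)                 # hypothetical 1D spectrum axis

X = np.exp(-((wavelengths - 520) / 30) ** 2)             # clean fluorescence peak (signal)
N_S = 0.30 * np.sin(wavelengths / 120)                   # environmental noise of dataset D_S
N_T = 0.10 * np.sin(wavelengths / 90)                    # environmental noise of dataset D_T
xi_S = 0.05 * rng.standard_normal(512).cumsum() / 50     # baseline noise of sample S
xi_T = 0.05 * rng.standard_normal(512).cumsum() / 50     # baseline noise of sample T

S = X + N_S + xi_S                                       # measurement of the solution in D_S
T = X + N_T + xi_T                                       # measurement of the solution in D_T

env_diff = N_T - N_S                                     # dataset-level environmental difference
residual = (S + env_diff) - T                            # what is left after removing it
print(np.allclose(residual, xi_S - xi_T))                # True: only the baseline mismatch remains
print(np.abs(residual).max())                            # non-negligible -> no clean paired data
```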

[Figure 1 shows the LIFS of a humic acid 4 mg/L standard solution measured by LIF spectroscopy with small disturbances in Environment #0 (dataset DT) and Environment #1 (dataset DS); the y-axis is the normalized intensity, and the annotations mark the environmental noise and the internal noise.]

Fig. 1. We provide LIFS examples of two standard solutions in different environments. The blue boxes show that each sample has a different internal noise ξ, even within the same dataset. The red boxes show that the environmental noise N, at the dataset level, is shared by all the samples in the same dataset. For ease of presentation, all S and T that occur together in the remainder of the paper represent samples corresponding to the same water sample.

Unfortunately, DNN models show poor performance when learning the dataset-level noise pattern transformation from NS to NT because of the existence of the sample-level noise ξS and ξT.

2.2 Studies Related to Noise Pattern Transferring

Since internal noise is inevitable in LIFS, we want all samples to have the same environmental noise pattern by removing the differences in their environmental noise patterns. With this, we only need to train the analysis model on one environmental noise pattern. Then we can process data from various environments by transferring their environmental noise to the target pattern [2,7]. We refer to generative methods, which use the LIFS of the same standard solution in different environments as cases. Therefore, noise pattern differences can be learned by analysing these cases and applied to transfer the noise patterns


of unknown samples. However, directly training a DNN-based noise-transferring model like DnCNN [28] using these cases is less efficient because the internal noise is sample-level, low-frequency, and superimposed on the signal and environmental noise. We have two feasible approaches to achieve it. The first is to design a model that does not strictly require pairwise data for training, as represented by CycleGAN [15]. CycleGAN uses an adversarial and a cycle-consistency loss, encouraging the network to learn a consistent mapping between the two domains. Recently, CycleGAN has been used to suppress unknown noise patterns and has obtained satisfactory results in 1D signal processing, such as audio and seismic traces [12].

Alternatively, we create new cases that have an environmental noise (dataset-level) difference but the same internal noise (sample-level). Compared with CycleGAN, this is more explainable and provides a much more guided process. It is challenging to extract only environmental noise because internal noise is superimposed on it. Although both internal noise and environmental noise are low-frequency, the frequency of the internal noise is slightly higher than that of the environmental noise, which means that the curve fluctuations caused by internal noise occur over smaller wavelength ranges. As the feature maps in a deep convolutional neural network (DCNN) can help to generate cases [14] and feature information [23], we try to use a pre-trained DCNN to extract these noise patterns separately. Based on the theory that the deeper a convolutional layer is, the more pooling layers it passes through and the larger the range of features it extracts [24], we propose the hypothesis that deeper convolutional layers can better extract the environmental noise NS and signal X of the samples in dataset DS, and shallower convolutional layers can better extract the internal noise ξT of the samples in dataset DT. We can use them to generate a sample G = X + NS + ξT such that its difference from T is NS−T, as shown in Fig. 2.

Fig. 2. We provide an example of our hypothesis. Sample S and sample T are fed into the DCNN to get their feature maps on deep and shallow convolutional layers, which correspond to extracting internal noise and environmental noise (also the signal). Then, we use this information to synthesise G.

3 Noise Patterns Transferring Model

Since the samples in different datasets have both sample-level (noise difference in internal noise) and dataset-level (noise difference in environmental noise) noise pattern differences, traditional solutions meet difficulties when trying to extract and learn environmental noise without the interference of internal noise. This section details our generated cases (GC)-based method to achieve noise patterns transferring. The overall workflow of our method is shown in Fig. 3.

Fig. 3. The workflow of our method. DS and DT are collected in different environments. Firstly, we randomly select a case from the D2D case-base, i.e., S and T, and feed them with an initialised sample G into the pre-trained 1D DCNN model to extract their feature maps. Secondly, we obtain the generated G by minimising the loss defined on the feature maps of G, S, and T. Then we get an S2S case consisting of G and T, which share the same ξT and differ only in NS and NT. Finally, we use all of the generated S2S cases, i.e., the S2S case-base, to train the curve noise patterns transferring (CNPT) model, which can transfer the noise patterns of samples from NS to NT.

3.1 D2D Case-Base and S2S Case-Base

Our system has two case-bases, the D2D case-base and the S2S case-base. If we want to transfer the environmental noise pattern NS in DS to the environmental noise pattern NT in DT, a D2D case is a sample S in DS and a random sample T in DT that are measured using the same standard solution but under different


environments. Although all of the S2S and D2D cases have the same NS−T, D2D cases have different internal noise while S2S cases have the same internal noise. In other words, an S2S case is a pair of a sample G in DG and a sample T that have the same internal noise ξT but differ in NS and NT. Since S2S cases cannot be directly obtained by LIF spectroscopy because of the existence of internal noise, to learn the pattern of NS−T, the first step is to generate S2S cases using the D2D case-base such that Eq. 4 is satisfied.

$$G = T + N_{S-T} \tag{4}$$

According to Eq. 1, G can be represented as Eq. 5.

$$G = S - \xi_S + \xi_T = X + N_S + \xi_T \tag{5}$$

At this point, we obtain an S2S case, i.e., G and T, because they differ only in NS and NT. This enables us to train a model that can learn the pattern of this transformation. The key point, therefore, is how to extract X, NS, and ξT from the D2D case-base.

3.2 S2S Cases Generating Model

According to the hypothesis in Sect. 2.2, we propose a 'feature extraction-matching' method to generate G in the S2S case. Firstly, we use a pre-trained DCNN-based 1D LIFS analysis model that contains multiple convolutional and pooling layers to extract features. Then, we initialise G and design a loss function that makes the features of G approximate X, ξT, and NS from the D2D case. Lastly, we obtain an S2S case consisting of G and T by minimising the loss. As X, NT, and ξT are wavelength (x-axis) related but show different frequency characteristics, we propose a loss function Ltotal that consists of a source loss Ls and a target loss Lt, as in Eq. 6, where α and β are the weights, and lt and ls are the layers used to compute Lt and Ls.

$$L_{total}(G, S, T, l_s, l_t) = \alpha L_s(G, S, l_s) + \beta L_t(G, T, l_s, l_t) \tag{6}$$

Ls is used to construct the NS and X in the generated sample G. As NS and X are x-axis related and show lower-frequency features, we use deeper feature maps and position-related loss to reconstruct X and NS from S (see Eq. 7).

$$L_s(G, S, l_s) = \sum_i \left( F_i^{l_s}(G) - F_i^{l_s}(S) \right)^2 \tag{7}$$

Lt is used to construct the internal noise ξT of T in G. As we discussed, its frequency is higher than environmental noise and signal. We use feature maps on shallow layer lt and gram matrix (see Eq. 8 and Fig. 4) to control its feature shape and an MSE loss on deep layer feature ls to control its x-axis position. The reason we introduced the gram matrix is that we cannot synthesise ξT with the same loss function structure for T as for S. It would cause the overall loss


to converge towards Ls or Lt, and thus we would not be able to obtain features from both S and T.

$$\mathrm{Gram}^{l_t}_{ij}(Y) = \sum_{k \in l_t} F^{l_t}_{ik}(Y) \cdot F^{l_t}_{jk}(Y) \tag{8}$$

Fig. 4. How the Gram matrix is calculated. Suppose that we input the sample Y to the DCNN and take its convolution layer l; we get a feature matrix of size Pl × Ml, where Pl is the channel size of l, Ml is the size of the feature map in l, and the convolution of Y on the i-th channel is F_i^l(Y). Then, by multiplying the obtained matrix with its transpose, we obtain the Pl × Pl Gram matrix, where Gram^l_{i,j}(Y) represents the correlation of Y on the i-th and j-th features in convolution layer l.
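The Gram computation of Eq. 8 and the per-layer term El of Eq. 9 can be written compactly for 1D feature maps. The sketch below is an illustration only; the function names and tensor shapes are ours, not taken from the released code:

```python
import torch

def gram_matrix_1d(feat: torch.Tensor) -> torch.Tensor:
    """Gram matrix of a 1D feature map, as in Eq. (8).
    feat: (P_l, M_l) tensor -- P_l channels, M_l positions of layer l.
    Returns a (P_l, P_l) matrix whose (i, j) entry is sum_k F_ik * F_jk."""
    return feat @ feat.t()

def gram_loss_1d(feat_g: torch.Tensor, feat_t: torch.Tensor) -> torch.Tensor:
    """Per-layer term E_l of Eq. (9), normalised by 4 * P_l^2 * M_l^2."""
    p, m = feat_g.shape
    diff = gram_matrix_1d(feat_g) - gram_matrix_1d(feat_t)
    return (diff ** 2).sum() / (4.0 * p ** 2 * m ** 2)

# toy usage: two random feature maps from a shallow layer
f_g, f_t = torch.randn(16, 128), torch.randn(16, 128)
print(gram_loss_1d(f_g, f_t).item())
```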

The overall Lt is calculated as Eq. 9, where ωl is the loss weight of layer l.

$$L_t(G, T, l_s, l_t) = \sum_i \left( F_i^{l_s}(G) - F_i^{l_s}(T) \right)^2 + \sum_{l \in l_t} \omega_l E_l \tag{9}$$

$$E_l = \frac{1}{4 P_l^2 M_l^2} \sum_{i,j} \left( \mathrm{Gram}^{l}_{ij}(G) - \mathrm{Gram}^{l}_{ij}(T) \right)^2$$
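Putting Eqs. 6–9 together, the S2S case generation can be sketched as a feature-matching optimisation over G with L-BFGS (as later reported in Sect. 4.1). The network below is only a stand-in for the pre-trained four-layer 1D DCNN, and the layer weights ωl and normalisations are simplified; names such as SpectrumCNN1D and generate_s2s_case are ours:

```python
import torch
import torch.nn as nn

class SpectrumCNN1D(nn.Module):
    """Stand-in for the pre-trained 4-layer 1D DCNN (kernel size 1x7):
    conv1/conv2 feed the Gram terms of L_t, conv3 feeds the deep MSE terms."""
    def __init__(self, channels=(1, 16, 32, 64, 64)):
        super().__init__()
        self.convs = nn.ModuleList(
            nn.Conv1d(channels[i], channels[i + 1], kernel_size=7, padding=3)
            for i in range(4)
        )

    def features(self, x):
        feats = []
        for conv in self.convs:
            x = torch.relu(conv(x))
            feats.append(x)
        return feats                                  # [conv1, conv2, conv3, conv4]

def generate_s2s_case(model, S, T, alpha=1.0, beta=2e5, steps=150):
    """Optimise G so that deep features follow S (signal + N_S) and shallow
    Gram statistics follow T (baseline noise xi_T), cf. Eqs. (6)-(9)."""
    G = S.clone().requires_grad_(True)                # initialise G from S
    opt = torch.optim.LBFGS([G], max_iter=steps)

    def gram(f):                                      # f: (1, P, M)
        f = f.squeeze(0)
        return (f @ f.t()) / (f.shape[0] * f.shape[1])

    def closure():
        opt.zero_grad()
        fg, fs, ft = model.features(G), model.features(S), model.features(T)
        L_s = ((fg[2] - fs[2]) ** 2).sum()            # deep layer matched to S
        L_t = ((fg[2] - ft[2]) ** 2).sum() \
              + sum(((gram(fg[l]) - gram(ft[l])) ** 2).sum() for l in (0, 1))
        loss = alpha * L_s + beta * L_t
        loss.backward()
        return loss

    opt.step(closure)
    return G.detach()

# toy call on random spectra of length 512: (batch, channel, length)
model = SpectrumCNN1D()
S, T = torch.randn(1, 1, 512), torch.randn(1, 1, 512)
G = generate_s2s_case(model, S, T, steps=20)
```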

3.3 Noise Patterns Transferring Model

The S2S case-base provides the noise pattern difference NS−T in sample-to-sample pairs, i.e., G and T. It enables us to train the CNPT model with CNPT(G) = T. Similar to denoising tasks, we use the residual learning method, which involves learning the residual mapping between the pairwise samples to obtain the CNPT model that transfers NS to NT. The CNPT model's network structure consists of 17 convolutional layers and rectified linear unit (ReLU) layers, allowing it to capture complex patterns and features. The inputs to CNPT are the generated sample G = X + NS + ξT and the corresponding sample T = X + NT + ξT. CNPT is trained to learn the residual mapping of the S2S case, R(G, φ) = NS−T = NT − NS. We use the MSE loss function to train the CNPT to minimise the gap between the learned noise pattern residual and the expected residual. The loss function is shown in Eq. 10, where Q represents the number of S2S cases fed to the model.

$$L_{CNPT} = \frac{1}{2Q} \sum_{i=1}^{Q} \left\| R(G; \phi) - (G - T) \right\|^2 \tag{10}$$
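A minimal sketch of the CNPT residual network and one training step on Eq. 10 is given below. The layer widths and the toy batch are assumptions of ours, while the depth (17 Conv1d+ReLU layers), the 1 × 7 kernel, the batch size of 64, and the learning rate of 1e−3 follow the settings reported in Sect. 4.1:

```python
import torch
import torch.nn as nn

class CNPT(nn.Module):
    """Sketch of the 17-layer Conv1d+ReLU residual network described above:
    it predicts the residual R(G) ~ N_{S-T} so that G - R(G) approximates T."""
    def __init__(self, channels=64, depth=17, kernel_size=7):
        super().__init__()
        pad = kernel_size // 2
        layers = [nn.Conv1d(1, channels, kernel_size, padding=pad), nn.ReLU(inplace=True)]
        for _ in range(depth - 2):
            layers += [nn.Conv1d(channels, channels, kernel_size, padding=pad),
                       nn.ReLU(inplace=True)]
        layers.append(nn.Conv1d(channels, 1, kernel_size, padding=pad))
        self.body = nn.Sequential(*layers)

    def forward(self, g):
        return self.body(g)                    # learned residual, cf. R(G; phi)

model = CNPT()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# one training step on a toy S2S batch (G, T): minimise Eq. (10)
G = torch.rand(64, 1, 512)                     # generated samples, min-max normalised
T = G - 0.05 * torch.sin(torch.linspace(0, 6.28, 512))   # pretend target pattern
residual_target = G - T
loss = ((model(G) - residual_target) ** 2).mean() / 2
optimizer.zero_grad(); loss.backward(); optimizer.step()
```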


Ideally, the trained CNPT model should learn the noise pattern differences between NT and NS by feeding S2S cases into it.

4 Environmental Setup

4.1 Model Configurations and Runtime Environments

In this section, we introduce all the hyper-parameters and configurations used in the experiments to guide researchers in refining the model for different applications and environments. For the S2S case generating model, the feature extraction model is a four-layer 1D convolutional neural network with a kernel size of 1 × 7. This pre-trained model comes from a COD parameter analysis model, which is trained on 6,000 samples and achieves 92.53% accuracy. The feature maps for calculating Lt are from the conv1 and conv2 layers, while conv3 is used for the x-axis-related MSE loss. The ratio of α to β is set to 1:2e5, and ωl = 0.2 ∗ ωl−1. We iteratively execute the L-BFGS function 150 times to minimise the loss. For the CNPT model, the kernel size is 1 × 7. The training batch size is 64, and the number of training epochs is 100. The learning rate is 1e−3, and the input data are normalised by the min-max normalisation method before being fed into the neural network. The training-to-validation ratio of the data is 80% to 20%. The COD parameter analysis model, used for verifying the performance of noise pattern transferring, has the same network structure as the feature extraction model. Its training batch size is 128, the number of epochs is 200, and the learning rate is 3e−4. All the data fed to this model for validation are not used for training or testing the CNPT model. All the experiments were performed on a Linux server with CUDA 11.8 and a GeForce RTX 4090 graphics card. We use the PyTorch framework to implement the code.
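The data preparation mentioned above can be expressed in a few lines; this is a sketch under the stated 80%/20% split and min-max normalisation, with hypothetical array shapes:

```python
import numpy as np

def min_max_normalise(spectra: np.ndarray) -> np.ndarray:
    """Per-sample min-max normalisation applied before feeding the network."""
    lo = spectra.min(axis=-1, keepdims=True)
    hi = spectra.max(axis=-1, keepdims=True)
    return (spectra - lo) / (hi - lo + 1e-12)

def train_val_split(x: np.ndarray, y: np.ndarray, val_ratio=0.2, seed=0):
    """80%/20% training-to-validation split used for the CNPT model."""
    idx = np.random.default_rng(seed).permutation(len(x))
    cut = int(len(x) * (1 - val_ratio))
    return (x[idx[:cut]], y[idx[:cut]]), (x[idx[cut:]], y[idx[cut:]])

x = min_max_normalise(np.random.rand(1000, 512))
y = np.random.rand(1000, 512)
(train_x, train_y), (val_x, val_y) = train_val_split(x, y)
```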

4.2 Datasets

The datasets are LIF spectral data of humic acid solutions with different noise patterns, open-sourced at [6]. The COD of the solutions is measured using laser-induced fluorescence spectroscopy [22], in which a 405 nm semiconductor laser was used as the excitation source. The excited light signals from the water sample pass through a spectroscopic filter system and are focused into the CCD by a lens. The dataset contains three groups of LIFS, which are collected from 10 solutions whose COD parameter ranges from 0 mg/L to 30 mg/L in three different noise patterns DA, DB, and DC. The noise strength of them is ordered as DA < DB

$$I_{NR}(i, j) = \begin{cases} I(i, j) - \mathrm{BLayer}(I(i, j)) & I(i, j) > \mathrm{BLayer}(I(i, j)) \\ 255 + I(i, j) - \mathrm{BLayer}(I(i, j)) & I(i, j) < \mathrm{BLayer}(I(i, j)) \end{cases} \tag{1}$$

$$I^{p}_{lbp} = \mathrm{LBP}(I_p) = \sum_{n=0}^{7} 2^n\, u[I_q^n - I_p] \tag{2}$$

where BLayer in Eq. (1) is an abbreviation of BayarLayer, which is a convolutional operation [2] with a kernel size of 5 × 5, and Eq. (2) computes the texture of images, where Ip is the value of the central pixel and Iqn denotes the n-th neighboring pixel value in a 3 × 3 neighborhood. Finally, we obtain the multi-view enhancement forensic map by

$$I_M = \mathrm{Cat}(I, I_b, I_{lbp}) \tag{3}$$

where Cat(·) means concatenation of the RGB, noise residual (Eq. (1)), and texture (Eq. (2)) maps. Figure 2 shows one sample of the different signals, including the original RGB image (Fig. 2a), texture (Fig. 2b), noise residual (Fig. 2c), and ground-truth (GT) mask (Fig. 2d) of this original image. Through the suppression layer, the anomalous texts are enhanced and then extracted while the other pixels are overlooked due to their very small values, since there are significant differences in the color and size of the texts in the tampered region compared to those in the authentic region. In this case, it is very easy for the network to distinguish tampered pixels from other pixels. Although inconsistent font properties are important visual traces, when such inconsistencies are small, authentic texts also remain, leading to misjudgement, with fewer forensic traces to be utilized in the noise
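A hedged sketch of the three views follows: a Bayar-style constrained convolution as a stand-in for the BayarLayer of Eq. (1) (the wrap-around to [0, 255] is omitted), a plain 3 × 3 LBP for Eq. (2), and the concatenation of Eq. (3). Class and function names (BayarConv2d, lbp_map) are ours, not from the released code:

```python
import numpy as np
import torch
import torch.nn as nn
import torch.nn.functional as F

class BayarConv2d(nn.Module):
    """Constrained 5x5 convolution in the spirit of [2]: the centre weight is
    fixed to -1 and the remaining weights sum to 1, so the layer predicts a
    pixel from its neighbourhood and exposes the prediction error (noise residual)."""
    def __init__(self, in_ch=1, out_ch=1, k=5):
        super().__init__()
        self.k = k
        self.weight = nn.Parameter(torch.rand(out_ch, in_ch, k, k) * 0.01)

    def forward(self, x):
        w = self.weight.clone()
        w[:, :, self.k // 2, self.k // 2] = 0
        w = w / w.sum(dim=(2, 3), keepdim=True)       # off-centre weights sum to 1
        w[:, :, self.k // 2, self.k // 2] = -1        # centre weight fixed to -1
        return F.conv2d(x, w, padding=self.k // 2)

def lbp_map(img: np.ndarray) -> np.ndarray:
    """Plain 3x3 LBP of Eq. (2): 8 neighbours compared with the centre pixel
    (edges handled by wrap-around for simplicity)."""
    out = np.zeros(img.shape, dtype=np.int32)
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]
    for n, (dy, dx) in enumerate(offsets):
        shifted = np.roll(np.roll(img, dy, axis=0), dx, axis=1)
        out += (shifted >= img).astype(np.int32) << n
    return out.astype(np.uint8)

# toy grey-level document patch
img = (np.random.rand(64, 64) * 255).astype(np.uint8)
x = torch.from_numpy(img).float()[None, None] / 255.0
noise_residual = BayarConv2d()(x)                              # cf. Eq. (1)
texture = torch.from_numpy(lbp_map(img)).float()[None, None] / 255.0
multi_view = torch.cat([x, noise_residual, texture], dim=1)    # Eq. (3) concatenation
```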


Fig. 2. The Multi-view enhancement maps.

map. At the same time, if this noise map is used to train a tampering system, it can also be misled on authentic documents where texts are required to have different font weights, colors, and sizes. Therefore, we use the noise residual to obtain the traces left by inconsistent font properties as well as the forensic traces in the background. The above considerations can be observed by comparing Fig. 2b and Fig. 2c. Owing to the influence of the suppression layer in the calculation of the noise residual, the texts and background in the tampered regions are strengthened for forensics compared with the original RGB image shown in Fig. 2a. The texture is shown to differ between tampered and authentic regions in Fig. 2b after obtaining the texture map by LBP, which gives another kind of forensic clue for tampering localization.

3.2 Progressive Supervision Module

The progressive mechanism has been widely used in the field of computer vision [22], where it aims to solve the multi-scale problem. Inspired by these works, we explore the applicability of a Progressive Supervision Module (PSM) for tampering localization in document images to supervise features of divergent sizes processed by the backbone and attention modules. The predicted mask at each scale acts as a coarse advisor for the next scale. The progressive mechanism is reflected in the supervision of the predicted tampering masks shown in Fig. 1. The Spatio-Channel Attention Module used at each scale has the same structure. Considering that our task is localization, which needs positional information, we utilize an attention module constructed from the perspectives of both channel and spatial position to learn effective feature representations. In Fig. 3, we visualize the features before and after the finest-scale Spatio-Channel Attention Module (SCAM) to indicate its feasibility. Each input is twice the size of that of its coarser Spatio-Channel Attention Module; therefore, there is an up-sampling operation before the next scale. We take the predicted mask at the finest scale as the final result. During training, the predicted mask approaches the GT mask, which means that the advisor gives more accurate advice to the next scale, and we finally obtain trustworthy results in this way. In summary, Eq. (4) and Eq. (5) describe the process of the PSM:

$$M^{i}_{pred} = \mathrm{SCAM}\left[ U[M^{i+1}_{pred}] \odot F^{i}_{M} \right], \quad i = 3, 2, 1 \tag{4}$$


Fig. 3. Visualization of features before and after SCAM: (a) RGB image, (b) before SCAM, (c) after SCAM, (d) GT mask.

$$L^{i}_{loc} = \mathrm{BCE}(M^{i}_{pred}, GT^{i}_{M}), \quad i = 4, 3, 2, 1 \tag{5}$$

where $M^{i+1}_{pred}$ means the predicted mask from the previous coarse scale; U[.] denotes the up-sampling operation; SCAM is the process of the Spatio-Channel Attention Module; $GT^{i}_{M}$ is the ground-truth mask at the i-th scale; and BCE(.) represents the binary cross-entropy loss supervising the task of tampering localization.
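The progressive refinement of Eqs. (4)–(5) can be sketched as below. Since the fusion operator between the upsampled coarse mask and the current features is not legible in the extracted equation, the sketch assumes a simple sigmoid gating, and SCAM is reduced to a placeholder attention block; all names and channel widths are ours:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SCAM(nn.Module):
    """Placeholder spatio-channel attention block; the real module combines
    channel and positional attention, here reduced to a 1x1 conv + sigmoid gate."""
    def __init__(self, ch):
        super().__init__()
        self.gate = nn.Conv2d(ch, ch, 1)
        self.head = nn.Conv2d(ch, 1, 1)

    def forward(self, x):
        x = x * torch.sigmoid(self.gate(x))
        return self.head(x)                          # 1-channel mask logits at this scale

def progressive_masks(feats, scams, coarse_head):
    """Eq. (4): start from the coarsest prediction and refine scale by scale,
    feeding the upsampled previous mask together with the current features."""
    masks = [coarse_head(feats[-1])]                 # i = 4, coarsest scale
    for i in range(len(feats) - 2, -1, -1):          # i = 3, 2, 1
        prev = F.interpolate(masks[-1], size=feats[i].shape[-2:],
                             mode="bilinear", align_corners=False)
        masks.append(scams[i](feats[i] * torch.sigmoid(prev)))
    return masks                                     # coarse -> fine mask logits

def progressive_loss(masks, gt):
    """Eq. (5): binary cross-entropy against the ground truth resized per scale."""
    loss = 0.0
    for m in masks:
        g = F.interpolate(gt, size=m.shape[-2:], mode="nearest")
        loss = loss + F.binary_cross_entropy_with_logits(m, g)
    return loss

# toy multi-scale features from a backbone (finest first), channel width 32
feats = [torch.randn(2, 32, s, s) for s in (64, 32, 16, 8)]
scams = nn.ModuleList(SCAM(32) for _ in range(3))
coarse_head = nn.Conv2d(32, 1, 1)
masks = progressive_masks(feats, scams, coarse_head)
loss = progressive_loss(masks, torch.randint(0, 2, (2, 1, 64, 64)).float())
```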

3.3 Detection Module

The objective of tampering localization is to locate the fine-grained spatial position of tampered regions, which is commonly treated as pixel-level classification. In general, this is more difficult than tampering detection, which focuses on image-level classification. Obviously, if some pixels are predicted to be tampered, the image can be classified into the tampered category. Based on this, the results of tampering detection can be derived from those of tampering localization. This way of obtaining tampering detection results is simple and straightforward but fully depends on the tampering localization results, which are more likely to be inaccurate in practice. As observed in our experiments, adding an additional detection module can improve the performance of tampering localization. To make full use of the global and local information from the backbone module, we utilize the multi-scale features from the backbone one by one, upsampling coarse-scale features to fuse with fine-scale features. We use a cross-entropy loss (Ldec) to supervise this process:

$$L_{dec} = \mathrm{CE}(s_d, l_d) \tag{6}$$

where sd and ld denote the score and the ground truth of the detection module, respectively, and CE(·) represents the cross-entropy loss. Tampering detection concentrates on image-level forensic clues, so we regard it as a coarse-grained task for which promising performance is easier to obtain than for the fine-grained task of tampering localization, which tends to be confused on some pixels due to the very low discrimination between tampered and authentic pixels. From another perspective, the aim of tampering localization is consistent with that of tampering detection, and we can obtain tampering detection results by analyzing the maximum of the pixel-level tampering scores in


tampering localization maps. Therefore, we use Lkl in Eq. (7) to align the distribution of the image-level tampering score with that from the detection module, which is beneficial for distinguishing tampered and authentic pixels.

$$L_{kl} = \mathrm{KL}(s_l, s_d) \tag{7}$$

$$s_l = \log \sum_{i,j} e^{M^{1,i,j}_{pred}} \tag{8}$$

where KL(·) represents the Kullback-Leibler loss and sl is the image-level score computed by a differentiable approximate maximum function, Eq. (8), from the pixel-level probabilistic score maps in the localization branch [6,9], where $M^{1,i,j}_{pred}$ represents the probability of pixel (i, j) being a tampered pixel in the predicted mask at the finest scale. In summary, the loss of the DA module can be represented by

$$L_{DA} = L_{dec} + L_{kl} \tag{9}$$

Finally, the total loss function of our framework is calculated from both the progressive supervision loss (Eq. (5)) and the detection assistance loss (Eq. (9)) by

$$L = \sum_{i=1}^{4} L^{i}_{loc} + L_{DA} \tag{10}$$
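A compact sketch of the detection assistance terms above (Eqs. (6)–(9)) follows; the two-class probability construction around the KL term is a simplification of ours, while the log-sum-exp soft maximum mirrors Eq. (8):

```python
import torch
import torch.nn.functional as F

def image_level_score(mask_probs: torch.Tensor) -> torch.Tensor:
    """Eq. (8): differentiable approximate maximum of the pixel-level tampering
    probabilities, computed with log-sum-exp over the finest-scale mask."""
    flat = mask_probs.flatten(start_dim=1)            # (B, H*W)
    return torch.logsumexp(flat, dim=1)               # s_l, one score per image

def detection_assistance_loss(det_logits, det_labels, mask_probs):
    """Eq. (9): L_DA = L_dec + L_kl, aligning the localisation-derived score
    distribution with the detection branch (a simplified two-class sketch)."""
    l_dec = F.cross_entropy(det_logits, det_labels)   # Eq. (6)

    s_l = image_level_score(mask_probs)               # from the localisation branch
    s_d = det_logits[:, 1]                            # "tampered" score of the detection branch
    p_l = torch.stack([1 - torch.sigmoid(s_l), torch.sigmoid(s_l)], dim=1)
    p_d = torch.stack([1 - torch.sigmoid(s_d), torch.sigmoid(s_d)], dim=1)
    l_kl = F.kl_div(p_l.log(), p_d, reduction="batchmean")   # Eq. (7)

    return l_dec + l_kl

# toy batch: 4 images, 2 detection classes, 32x32 finest-scale mask probabilities
det_logits = torch.randn(4, 2)
det_labels = torch.randint(0, 2, (4,))
mask_probs = torch.rand(4, 1, 32, 32)
loss_da = detection_assistance_loss(det_logits, det_labels, mask_probs)
```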

4 Experiments

4.1 Experimental Setting

Datasets: To the best of our knowledge, there are not many large public datasets for document tampering localization. In our experiments, we adopt the largest dataset, DocTamper, released in 2023 [12], which contains various types of documents, such as contracts, invoices, and receipts, in both Chinese and English. There are three tampering types in DocTamper: copy-move, splicing, and inpainting. DocTamper has four sub-sets: one training set with 12,000 images and three test sets, TestingSet/FCD (First Cross Domain)/SCD (Second Cross Domain), with 30,000/4,000/36,000 images. The TestingSet contains images from the same domain as the training set, while the FCD and SCD images are from two other domains.

Metrics: In the evaluation, we treat tampering localization as binary pixel-level classification and adopt Precision (P), Recall (R), and F-score (F) as the evaluation metrics.

Implementation Details: Our model is implemented in PyTorch on a GeForce RTX 3090. The input size is set to 256 × 256 in the training stage, and arbitrary-size images can be used in the testing stage. We train our model on the DocTamper training set and optimize it with Adam [13] with a batch size of 10. The initial learning rate is 2e-4 and is halved every 5 epochs.
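The optimizer schedule described above (Adam, learning rate 2e-4 halved every 5 epochs) maps directly onto a step scheduler; the single-layer model and toy data below are placeholders, not the actual framework:

```python
import torch
import torch.nn as nn

model = nn.Conv2d(3, 1, 3, padding=1)          # stand-in for the full localisation network
optimizer = torch.optim.Adam(model.parameters(), lr=2e-4)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=5, gamma=0.5)  # halve every 5 epochs

for epoch in range(10):
    for _ in range(4):                          # toy inner loop; batch size 10 as in the paper
        x = torch.rand(10, 3, 256, 256)         # 256 x 256 training crops
        y = (torch.rand(10, 1, 256, 256) > 0.95).float()
        loss = nn.functional.binary_cross_entropy_with_logits(model(x), y)
        optimizer.zero_grad(); loss.backward(); optimizer.step()
    scheduler.step()
```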

4.2 Ablation Study

To evaluate the usefulness of each component in our proposed multi-view enhancement block, containing the original RGB image (RGB), texture information (LBP), and noise residual (NR), we conduct an ablation study on each component with the same network (baseline + PS (progressive supervision) + Dec (detection assistance module) + KL loss); the results are shown in Table 1. Since RGB images are commonly used in different tasks, we first use only RGB images. As shown in Table 1, as more information is added, the performance gradually improves, which is consistent across the three test sets, and the values of each row demonstrate that each kind of information is useful for tampering localization. Since the TestingSet is in the same domain as the training set, the gap in results caused by different inputs is relatively smaller than in the other two cross-domain test sets. This indicates that the input with all three kinds of multi-view information generalizes best. From the results on FCD, we observe that without the noise residual the performance drops significantly, and on SCD the results with RGB and noise residual are relatively better than those with RGB and LBP, which reflects the superiority of the noise residual in generalization compared to LBP.

Table 1. Ablation study on multi-view enhancement. Bold text means the best.

Method          | TestingSet P / R / F   | FCD P / R / F          | SCD P / R / F
RGB             | 63.93 / 87.63 / 72.05  | 43.30 / 97.87 / 58.49  | 57.64 / 81.27 / 64.24
RGB + LBP       | 64.92 / 93.95 / 75.54  | 43.75 / 99.72 / 59.15  | 57.87 / 88.51 / 68.05
RGB + NR        | 64.08 / 93.96 / 73.67  | 61.91 / 99.17 / 75.75  | 59.01 / 89.35 / 69.06
RGB + LBP + NR  | 69.84 / 97.22 / 79.79  | 71.59 / 98.72 / 82.72  | 65.93 / 90.94 / 74.07

At the output side of the network, we propose a Progressive Supervision (PS) loss and a detection assistance (DA) loss to optimize the model more effectively, where the DA loss (Eq. (9)) contains the detection (Dec) based cross-entropy loss (Eq. (6)) and the KL loss (Eq. (7)). To evaluate the effectiveness of each supervision loss, we conduct an ablation study on the different supervision information; the results are shown in Table 2, where the baseline method has neither the PS nor the DA loss (i.e., only the loss $L^1_{loc}$ is used). In summary, adding each loss is useful, as observed from the F-scores on the three test sets. Apart from the baseline, the generalization of these frameworks on FCD is quite good, while the performance on SCD is relatively poor because images in SCD are obtained in a different environment and the format of each image is not as standard as in FCD, which influences generalization. Among the six frameworks in Table 2, recall is higher than precision for the frameworks with progressive supervision, whereas precision is higher than recall for the frameworks without progressive supervision. It is


more likely to be caused by the fact that progressive supervision makes use of multi-scale information and tends to predict as many positive pixels as possible; the information obtained in frameworks without progressive supervision is not as rich as that in frameworks with progressive supervision, which leads to fewer predicted positive pixels. Fortunately, in our final solution the recall is almost perfect (very close to 1) and the precision is not low. This indicates that almost all pixels labeled as 1 are judged correctly, and although some pixels labeled as 0 are predicted wrongly by our method, the number of these wrongly predicted pixels is acceptable.

Table 2. Ablation study on different supervision information.

Method                    | TestingSet P / R / F   | FCD P / R / F          | SCD P / R / F
Baseline                  | 80.39 / 61.15 / 64.89  | 85.18 / 56.85 / 63.20  | 79.76 / 51.43 / 56.24
Baseline + PS             | 65.99 / 92.47 / 75.23  | 64.68 / 98.69 / 77.29  | 58.72 / 87.46 / 67.62
Baseline + Dec            | 84.30 / 64.29 / 69.43  | 94.51 / 68.44 / 77.23  | 89.09 / 57.27 / 63.17
Baseline + PS + Dec       | 66.64 / 96.08 / 76.02  | 67.23 / 98.61 / 79.49  | 60.20 / 88.83 / 69.49
Baseline + Dec + KL       | 83.72 / 71.91 / 74.14  | 88.20 / 79.46 / 81.24  | 88.38 / 63.24 / 66.71
Baseline + PS + Dec + KL  | 69.84 / 97.22 / 79.79  | 71.59 / 98.72 / 82.72  | 65.93 / 90.94 / 74.07

4.3 Comparison with Other Methods

We evaluate our method against several state-of-the-art natural image manipulation localization methods [5,8,19], one document tampering localization method [12], and two semantic segmentation methods [1,10], using their officially released code, as shown in Table 3. We can see that our proposed method is more promising than most of them, demonstrating its effectiveness and applicability in document tampering localization, and it achieves the state-of-the-art F-score on the TestingSet and FCD and the second-best on SCD. However, our precision is lower than that of [12], which makes our F-score less outstanding than our recall. Although [12] has balanced precision and recall, it misjudges relatively more tampered pixels as authentic ones. In this work, we misjudge more authentic pixels as tampered ones, while almost all pixels with positive labels are predicted correctly. Hence, one advantage of our method is that it saves time and narrows down the scope for a subsequent re-judgment process by only considering the predicted positive pixels. From another perspective, the above phenomenon is more likely caused by the uneven amount of forensic clues in different images. This also indirectly reflects the difficulty and uncertainty of extracting appropriate information and the necessity of converting images into the forensic domain for tampering localization. Meanwhile, the performance of the semantic segmentation methods is poor, confirming this argument once again: their extracted features are more likely to concentrate on visual patterns rather than the forensic traces expected in the task of tampering localization.


Table 3. Comparison with other methods. We use bold and underline to indicate optimal performance and suboptimal performance.

Method           | Testingset P / R / F   | FCD P / R / F          | SCD P / R / F
Mantra-Net [19]  | 12.30 / 20.40 / 15.30  | 17.50 / 26.10 / 20.90  | 12.40 / 21.80 / 15.70
MVSS-Net [5]     | 49.40 / 38.30 / 43.10  | 48.00 / 38.10 / 42.40  | 47.80 / 36.60 / 41.40
BEiT-Uper [1]    | 56.40 / 45.10 / 50.10  | 55.00 / 43.60 / 48.70  | 40.80 / 39.50 / 40.20
Swin-Uper [10]   | 67.10 / 60.80 / 63.80  | 64.20 / 47.50 / 54.60  | 54.10 / 61.20 / 57.40
CAT-Net [8]      | 73.70 / 66.60 / 70.00  | 64.40 / 48.40 / 55.30  | 64.50 / 61.80 / 63.10
DocTamper [12]   | 81.40 / 77.10 / 79.20  | 84.90 / 78.60 / 81.60  | 74.50 / 76.20 / 75.40
ours             | 69.84 / 97.22 / 79.79  | 71.59 / 98.72 / 82.72  | 65.93 / 90.94 / 74.07

5 Conclusion

In this paper, we aim to improve tampering localization in document images by enhancing both the input and output sides. At the input side, we propose a multi-view enhancement module to combine three types of signals, including RGB channels, noise residual, and texture information, which helps obtain stronger forensic traces. At the output side, we propose a progressive supervision loss and a detection assistance loss to guide the model optimization more effectively. Our progressive supervision calculates the loss progressively at each scale with the corresponding ground-truth masks, which helps to capture the tampered regions. With the help of the detection module, we can align the tampering localization and detection scores with a KL loss, benefiting the estimation of the global tampered probability from the localization branch. Through extensive experiments on the benchmark dataset DocTamper, we can see that each refinement strategy improves the localization performance, and the combination of all strategies boosts the performance further, achieving a new state-of-the-art F-score on the testing set of DocTamper.

Acknowledgements. This research was funded by National Natural Science Foundation of China (NSFC) no. 62276258, Jiangsu Science and Technology Programme no. BE2020006-4, European Union's Horizon 2020 research and innovation programme no. 956123, and UK EPSRC under projects [EP/T026995/1].

References 1. Bao, H., Dong, L., Piao, S., Wei, F.: BEiT: BERT pre-training of image transformers. In: ICLR (2022) 2. Bayar, B., Stamm, M.C.: Constrained convolutional neural networks: a new approach towards general purpose image manipulation detection. IEEE Trans. Inf. Forensics Secur. 13, 2691–2706 (2018)


3. Bertrand, R., Terrades, O.R., Gomez-Krämer, P., Franco, P., Ogier, J.M.: A conditional random field model for font forgery detection. In: ICDAR, pp. 576–580 (2015) 4. Cruz, F., Sidere, N., Coustaty, M., d’Andecy, V.P., Ogier, J.M.: Local binary patterns for document forgery detection. In: ICDAR, pp. 1223–1228 (2017) 5. Dong, C., Chen, X., Hu, R., Cao, J., Li, X.: Mvss-net: multi-view multi-scale supervised networks for image manipulation detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(3), 3539–3553 (2022) 6. Guillaro, F., Cozzolino, D., Sud, A., Dufour, N., Verdoliva, L.: TruFor: leveraging all-round clues for trustworthy image forgery detection and localization. In: CVPR, pp. 20606–20615 (2023) 7. Jain, H., Joshi, S., Gupta, G., Khanna, N.: Passive classification of source printer using text-line-level geometric distortion signatures from scanned images of printed documents. Multimedia Tools Appl. 79(11–12), 7377–7400 (2020) 8. Kwon, M.J., Nam, S.H., Yu, I.J., Lee, H.K., Kim, C.: Learning jpeg compression artifacts for image manipulation detection and localization. Int. J. Comput. Vision 130(8), 1875–1895 (2022) 9. Liu, J., Zheng, L.: A smoothing iterative method for the finite minimax problem. J. Comput. Appl. Math. 374, 112741 (2020) 10. Liu, Z., Hu, H., Lin, Y.E.A.: Swin transformer v2: Scaling up capacity and resolution. In: CVPR, pp. 12009–12019 (2022) 11. Mayer, O., Stamm, M.C.: Forensic similarity for digital images. IEEE Trans. Inf. Forensics Secur. 15, 1331–1346 (2020) 12. Qu, C., et al.: Towards robust tampered text detection in document image: new dataset and new solution. In: CVPR, pp. 5937–5946 (2023) 13. Sharma, M., Pachori, R., Rajendra, A.: Adam: a method for stochastic optimization. Pattern Recogn. Lett. 94, 172–179 (2017) 14. Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation. In: CVPR, pp. 5686–5696 (2019) 15. Sun, Y., Ni, R., Zhao, Y.: MFAN: multi-level features attention network for fake certificate image detection. Entropy 24(1), 118–133 (2022) 16. Verdoliva, L.: Media forensics and deepfakes: an overview. IEEE J. Sel. Top. Sig. Process. 14(5), 910–932 (2020) 17. Wang, Y., Xie, H., Xing, M., Wang, J., Zhu, S., Zhang, Y.: Detecting tampered scene text in the wild. In: ECCV, pp. 215–232 (2022) 18. Wu, L., et al.: Editing text in the wild. In: ACM MM, pp. 1500–1508 (2019) 19. Wu, Y., AbdAlmageed, W., Natarajan, P.: Mantra-net: manipulation tracing network for detection and localization of image forgeries with anomalous features. In: CVPR, pp. 9543–9552 (2019) 20. Xu, W., et al.: Document images forgery localization using a two-stream network. Int. J. Intell. Syst. 37(8), 5272–5289 (2022) 21. Yang, Q., Huang, J., Lin, W.: SwapText: image based texts transfer in scenes. In: CVPR, pp. 14700–14709 (2020) 22. Yi, P., Wang, Z., Jiang, K., Jiang, J., Ma, J.: Progressive fusion video superresolution network via exploiting non-local spatio-temporal correlations. In: ICCV, pp. 3106–3115 (2019) 23. Zhou, P., Han, X., Morariu, V.I., Davis, L.S.: Learning rich features for image manipulation detection. In: CVPR, pp. 1053–1061 (2018)

Multi-granularity Deep Vulnerability Detection Using Graph Neural Networks

Tengxiao Yang1, Song Lian1, Qiong Jia2, Chengyu Hu1, and Shanqing Guo1(B)

1 School of Cyber Science and Technology, Shandong University, Jinan, China
{202117071,202137086}@mail.sdu.edu.cn, [email protected]
2 Beijing Institute of Computer Technology and Applications, Beijing, China

Abstract. The significance of vulnerability detection has grown increasingly crucial due to the escalating cybersecurity threats. Investigating automated vulnerability detection techniques to avoid high false positives and false negatives is an important issue in the current software security field. In recent years, there has been a substantial focus on deep learning-based vulnerability detectors, which have achieved remarkable success. To fill the gap in multi-granularity program representation, we propose MulGraVD, a deep learning-based vulnerability detector at the function level. MulGraVD captures the continuity and structure of the programming language by considering information at word, statement, basic block, and function granularity respectively. To overcome the constraint posed by hyperparameter layers in the information aggregation process of graph neural networks, MulGraVD serially passes information from coarse to fine granularity, which facilitates the mining of vulnerability patterns. Our experimental evaluation on FFMPeg+Qemu and ReVeal datasets shows that MulGraVD significantly outperforms existing state-of-the-art methods in terms of precision, recall, and F1 score, with an average improvement of 11.62% in precision, 27.69% in recall, and 19.71% in F1 score.

Keywords: Vulnerability detection · Deep learning · Graph neural network

1 Introduction

In recent years, the rapid growth in software systems coupled with their increasing complexity has resulted in a surge of exposed software vulnerabilities (e.g. Buffer Overflow [2], Divide By Zero [5]). To cope with complex software security issues and improve the efficiency and accuracy of vulnerability detection analysis, traditional vulnerability detection methods include static analysis, dynamic analysis, and symbolic execution. Johnson et al. [14] proposed that the traditional approach suffers from high false positive or false negative rates. As the volume of open-source vulnerability data continues to increase, machine learning and deep learning based vulnerability detectors have attracted a lot of attention.


Current machine learning based vulnerability detection models require determining software metrics to characterize vulnerabilities. Software metrics include occurrence frequencies [23], imports and function calls [21], etc. The papers on deep learning based vulnerability detection are mainly focused on the source code domain [29], partly in the intermediate representation [16] and binary [15]. Deep learning vulnerability detection that utilizes source code as input encompasses different levels such as file [12], slice [18,19], function [22,29], path [10], and statement [13]. To better understand the structural nature of programs, work in this area has gradually moved from treating programs as sequences of languages [22] to using abstract syntax trees, data flows, control flows and so on to represent vulnerabilities [7,29]. Many approaches such as [13,17] use program dependency graphs in program representation. Zhou et al. [29] introduced the ComputedFrom, LastRead, and LastWrite types of edges to identify multiple types of vulnerabilities. We perceive that when using graph neural networks, the problem of being unable to absorb the far-hop rows needed to identify vulnerabilities arises due to the number of hyperparameter layers. Therefore, in order to address the above issues and capture the continuity and structure of the programming language, we propose a new multi-granularity vulnerability detection model MulGraVD, which is based on Gated Graph Neural Network (GGNN) and Transformer. To prove the effectiveness of our model, we conducted experiments on the FFMPeg+Qemu [29] and ReVeal [7] datasets. In summary, this paper makes the following contributions:

• We propose an effective multi-granularity vulnerability detection method MulGraVD, which explicitly characterizes vulnerabilities at the granularity of word, statement, basic block, and function.
• In the design of MulGraVD, the limitation of vulnerability-related rows being inaccessible in information aggregation due to the graph neural network's hyperparameter layers is overcome by progressively incorporating coarser granularity features.
• The results on the FFMPeg+Qemu and ReVeal datasets show that MulGraVD improves accuracy by 6.26%, precision by 11.62%, recall by 27.69%, and F1 score by 19.71% on average compared to the baseline approaches. Each granularity shows its additivity to the model. The word granularity, statement granularity, basic block granularity, and function granularity have 5.90%, 3.49%, 4.46%, and 4.66% improvement on the F1 score respectively.

The remainder of this paper is organized as follows. Section 2 describes our motivation. Section 3 discusses the design of MulGraVD. Section 4 shows the experimental evaluation and results. Section 5 lists the previous related work. Conclusions are provided in Sect. 6.

2 Motivation

2.1 Multi-granularity Program Representation Covering Multiple Vulnerability Types

The function granularity is used to capture program continuity. The remaining three granularities are used to depict the structure of functions. Based on Yamaguchi's [27] analysis, the combined representation of these three granularities can cover NULL Pointer Dereference, Integer Overflows, Resource Leaks, and so on. Table 1 displays the granularity representation coverage for partial vulnerability types detected by Flawfinder and Cppcheck in the FFmpeg+QEMU and ReVeal datasets.

Table 1. Coverage of different granularity representation for modeling partial vulnerability types in FFMPeg+Qemu and ReVeal datasets.

CWE ID       | Coverage granularity     | % in FFMPeg+Qemu | % in ReVeal
CWE-120 [2]  | Word + Sentence          | 33.58            | 26.07
CWE-362 [4]  | –                        | 2.89             | 1.93
CWE-190 [3]  | Word + Sentence + Block  | 2.31             | 2.15
CWE-369 [5]  | Word + Sentence          | 0.02             | –
CWE-476 [6]  | Word + Sentence + Block  | 0.81             |

2.2 Overcoming the Constraint Posed by Hyperparameter Layers

In graph neural networks, the hyperparameter layers are generally determined based on model performance such as accuracy and so on, independent of the furthest hop of vulnerability-related rows in the entire datasets. If the number of layers is set according to the farthest hop, the problem of oversmoothing [9] will occur, resulting in similar features for all nodes. Figure 1 shows a code snippet of CVE-2012-2793 [1], in which we plot a six-hop data dependency path. When the layer value is greater than or equal to six, s8 can absorb the information in s11 and thus determine the vulnerability.

3

The MulGraVD Framework

This section describes our vulnerability detection model MulGraVD based on GGNN and Transformer. Its framework is shown in Fig. 2, which includes the three sequential components: 1) Multi-granularity Program Representation, which learns program representation from word, statement, basic block, and function granularity. 2) Multi-granularity Information Propagation, where the constraint posed by hyperparameter layers is overcome in the aggregation process. 3) Representation Learning Model, which is trained after extracted features have been resampled.

Multi-granularity Deep Vulnerability Detection

155

1. static int lag_decode_zero_run_line(LagarithContext *l, uint8_t *dst, …){… 2. output_zeros: 3. if (l->zeros_rem) { 4. count = FFMIN(l->zeros_rem, width - i); 5. + if(end - dst < count) { 6. + av_log(l->avctx, AV_LOG_ERROR, "too many zeros remaining\n"); 7. + return AVERROR_INVALIDDATA;} 8. memset(dst, 0, count); 9. …} 10. while (dst < end) { 11. i = 0; 12. while (!zero_run && dst + i < end) {… 13. i++;} 14. if (zero_run) { 15. zero_run = 0; 16. i += esc_count; 17. memcpy(dst, src, i); 18. dst += i; 19. l->zeros_rem = lag_calc_zero_run(src[i]); 20. src += i + 1; 21. …} } 22. return src - src_start; }

Fig. 1. CVE-2012-2793 in FFmpeg. The green boxes are the basic blocks and the black lines display a six hop data dependent path. (Color figure online)

3.1

Multi-granularity Program Representation

Word Granularity. The Abstract Syntax Tree (AST) is a tree-like representation of the abstract syntax structure of the source code. AST nodes attributes include location, key, code, location, and so on. GRU is used to capture the sequence of depth-first nodes sequence of the AST subtree of statement s and its initial word sequence to obtain the embedding EMB(s). Statement Granularity. The statement granularity graph is constructed by data dependencies and control dependencies in the program. If the definition of a variable in one statement can reach its usage in another statement, there is a data dependency between these two statements (indicated by black arrows in Fig. 2 and the relevant variables are marked). Control dependency is a constraint resulting from the control flow of the program (indicated by yellow arrows in Fig. 2). The node features are aggregated and updated using GGNN. It is given by: ⎞ ⎛    ⎠ (1) g xt−1 xtj = GRU ⎝xt−1 j , i (i,j)∈E

where xtj is the embedding of vertex vj at the tth layer in the statement granularity graph. E is the set of edges and g(.) is a transformation function absorbing the embedding of all neighbors of vertex vj .

156

T. Yang et al.

Fig. 2. The Architecture of MulGraVD (Color figure online)

Basic Block Granularity. The basic block granularity is derived from the single-in, single-out principle on the program control flow graph. Each node in this granularity contains at least one program statement. The path options are determined by conditional statements such as if, for, etc. The generation of node feature representation mimics the function granularity and aggregates information through GGNN. Function Granularity. The function granularity representation allows the programming logic of the source code to be preserved. The sequence of statements is represented as a function embedding EMB(S) using a transformer encoder. In this case, we use ViT’s [11] approach to add additional learnable [class] embeddings at the zero position and then use the embedding obtained by the transformer encoder at zero position as the function granularity embedding. 3.2

Multi-granularity Information Propagation

MulGraVD uses the statement granularity graph structure as the skeleton of the final generated graph embedding. It serially passes information from coarse to fine granularity, ultimately generating rich node feature representations for the nodes in the statement granularity graph. Our model maintains the mapping between statements and basic blocks during information propagation, allowing statements to absorb information from their corresponding basic block. The basic block granularity graph also absorbs information from function granularity. We illustrate this process by the generation of initial representation at statement granularity, which is given by: ⎛ ⎞      ⎠ (2) g xl−1 x0j = EM B svj ⊕ GRU ⎝xl−1 bj , bi (bi ,bj )∈E

x0bj = EM B (bj ) ⊕ EM B(S)

(3)

Multi-granularity Deep Vulnerability Detection

157

where bj denotes the basic block corresponding to the statement granularity node vj . svj denotes the statement corresponding to node vj . l is the layers of the graph neural network at the basic block granularity. Analysis of Overcoming Constraints Posed by Layers. Taking Fig. 1 as an example, When the layers of basic block granularity are three, the basic block corresponding to s15 − s20 can absorb the information of s11 . It is subsequently concat with the node features of s20 in the statement granularity graph. Even if the layers of statement granularity are three, it is still able to aggregate information from s11 (node with 6-hop distances in statement granularity graph). If the information of a particular row cannot be aggregated by these two stages, then a coarse representation can be provided by the function granularity. It is important to note that MulGraVD does not make the node features in the statement granularity graph similar, leading to oversmoothing problem. Because we use the concat function to absorb coarser granularity information while preserving the original features. 3.3

Representation Learning Model

We use the representation learning model proposed in the Reveal [7]. The imbalance between the number of vulnerable and non-vulnerable categories is handled by SMOTE [8], and then a multilayer perceptron (MLP) is used for classification. We use triple loss function Ltrp , including (a) cross-entropy loss LCE , (b) projection loss Lp , and (c) regularization loss Lreg .

LCE

Ltrp = LCE + α ∗ Lp + β ∗ Lreg  =− yˆ · log(y) + (1 − yˆ) · log(1 − y)

(4) (5)

Lp = |D (h (xg ) , h (xsame )) − D (h (xg ) , h (xdif f )) + γ|

(6)

Lreg = h (xg ) + h (xsame ) + h (xdif f )

(7)

Where xg is the embedding representation of the entire graph. h (.) is a function to obtain the intermediate layer projection. α and β are two hyperparameters indicating the contribution of projection loss and regularization loss respectively. D(.) denotes the cosine distance between the two vectors.

4

Evaluation

To explore the performance of MulGraVD, we answered the following research questions:

158

4.1

T. Yang et al.

Research Questions

Q1 How does our vulnerability detection method perform in terms of various metrics compared to other deep learning based vulnerability detection methods. Q2 Whether the feature representation extracted by MulGraVD is sufficiently discriminative. Q3 How do the four granularities involved in MulGraVD contribute to the performance of the model. Q4 Whether the model judgment basis (by interpreting the model) is correct. Q5 How effective is the GGNN compared to other approaches in the field of graph neural networks. 4.2

Datasets

We used two public vulnerability datasets Reveal [7] and FFMPeg+Qemu [29]. Reveal was collected from Linux Debian Kernel and Chromium, containing 18169 methods, of which 1658 are vulnerable. The FFMPeg+Qemu dataset collected by [29] contains 22316 methods, of which 45.02% are vulnerable. 4.3

Experimental Design of Research Questions

Q1 How does our vulnerability detection method perform, in terms of various metrics, compared with other deep learning-based vulnerability detection methods?

Table 2. Accuracy, Precision, Recall and F1 score of vulnerability detection methods on the FFMPeg+Qemu and ReVeal datasets.

| Approach | FFMPeg+Qemu Accuracy | Precision | Recall | F1 score | ReVeal Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|---|---|---|---|
| Russell et al. | 57.92 | 53.81 | 39.46 | 45.51 | 91.23 | 26.22 | 12.21 | 16.62 |
| VulDeePecker | 54.10 | 50.00 | 21.08 | 29.66 | 89.05 | 18.55 | 6.13 | 9.21 |
| SySeVR | 51.76 | 48.07 | 68.87 | 56.41 | 80.66 | 24.72 | 40.23 | 30.13 |
| Devign | 61.94 | 56.48 | 69.03 | 62.12 | 91.81 | 32.20 | 29.23 | 29.23 |
| Reveal | 62.04 | 56.34 | 75.15 | 64.32 | 84.13 | 33.13 | 70.16 | 44.88 |
| IVDETECT | 56.98 | 52.14 | 70.89 | 60.08 | 77.40 | 24.33 | 57.23 | 34.15 |
| MulGraVD | 68.79 | 63.46 | 75.89 | 69.01 | 86.89 | 39.12 | 72.77 | 50.80 |

In Q1, we compare MulGraVD with state-of-the-art deep learning-based vulnerability detection methods: 1) Russell et al. [22]: features are extracted from the lexed representation of code by CNN or RNN and classified using ensemble classifier. 2) VulDeePecker [19]: extract library/API function calls and generate code gadgets, which are later learned by the BLSTM network. 3) SySeVR [18]: absorb semantic information introduced by data dependencies and control

Fig. 3. t-SNE plots, centroid distance, and intersection ratio on the FFMPeg+Qemu dataset (centroid distance / intersection ratio per model — Russell: 0.035 / 0.863; VulDeePecker: 0.063 / 0.774; SySeVR: 0.233 / 0.338; Devign: 0.023 / 0.857; Reveal: 0.268 / 0.297; IVDETECT: 0.071 / 0.577; MulGraVD: 0.316 / 0.209).

dependencies, and support multiple neural network types. 4) Devign [29]: represent the program as a graph structure and perform graph-level prediction. 5) Reveal [7]: use a GGNN for feature extraction from the code property graph and use a triplet loss to optimize the model. 6) IVDETECT [17]: consider statements and their surrounding contexts through data and control dependencies, and provide fine-grained interpretation through GNNExplainer [28]. Table 2 summarizes the performance comparison of MulGraVD and the other deep learning-based vulnerability detection models. The relative accuracy gain of MulGraVD on the FFMPeg+Qemu dataset is 11.33% on average and at least 6.75%; the relative gains in precision, recall, and F1 score are 10.65%, 18.48%, and 15.99% on average, respectively. On the ReVeal dataset, the average gains on each metric are: precision 12.60%, recall 36.91%, and F1 score 23.43%; the smallest gains are: precision 5.99%, recall 2.61%, and F1 score 5.92%.

Q2 Is the feature representation extracted by MulGraVD sufficiently discriminative?
We use t-SNE [20] to visualize the learned features and analyze how well vulnerable and non-vulnerable functions can be distinguished. In addition to the t-SNE visualization, we calculate the Euclidean distance between the centroids of the two clusters, model the two categories as circles, and calculate the percentage of the intersection area. Figure 3 shows the low-dimensional representation of the features extracted from the FFMPeg+Qemu dataset. On the FFMPeg+Qemu dataset, the Euclidean distance between the centroids of the two categories is largest for our model (0.316), and the intersection ratio is smallest (0.209), the best among all models. It is followed by Reveal and then SySeVR, which is consistent with the pattern observed in Fig. 3. On the ReVeal dataset, our method ranks second only to the Reveal model.
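The two separability statistics can be computed from the 2-D t-SNE coordinates roughly as follows. This is a minimal sketch under an explicit assumption: the paper does not spell out how the two circles are constructed, so here each class is modelled as a circle centred at its centroid, with radius equal to the mean distance of its points from that centroid, and the intersection ratio is taken as the overlap area divided by the area of the smaller circle.

```python
import numpy as np

def centroid_distance_and_intersection(points_vuln, points_safe):
    """Centroid distance and circle-overlap ratio for two 2-D point clouds."""
    c1, c2 = points_vuln.mean(axis=0), points_safe.mean(axis=0)
    r1 = np.linalg.norm(points_vuln - c1, axis=1).mean()
    r2 = np.linalg.norm(points_safe - c2, axis=1).mean()
    d = np.linalg.norm(c1 - c2)

    if d >= r1 + r2:              # disjoint circles
        overlap = 0.0
    elif d <= abs(r1 - r2):       # one circle contained in the other
        overlap = np.pi * min(r1, r2) ** 2
    else:                         # standard circle-intersection (lens) area
        a1 = r1**2 * np.arccos((d**2 + r1**2 - r2**2) / (2 * d * r1))
        a2 = r2**2 * np.arccos((d**2 + r2**2 - r1**2) / (2 * d * r2))
        a3 = 0.5 * np.sqrt((-d + r1 + r2) * (d + r1 - r2)
                           * (d - r1 + r2) * (d + r1 + r2))
        overlap = a1 + a2 - a3

    ratio = overlap / (np.pi * min(r1, r2) ** 2)
    return d, ratio
```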

Fig. 4. Performance of models (missing granularity) and full models on the FFMPeg+Qemu dataset (grouped bars for Accuracy, Precision, Recall and F1-score; legend: All granularity, w/o Function granularity, w/o BasicBlock granularity, w/o Statement granularity, w/o Word granularity).

Fig. 5. Performance of models (missing granularity) and full models on the ReVeal dataset (grouped bars per metric; legend: All granularity, w/o Function granularity, w/o BasicBlock granularity, w/o Statement granularity, w/o Word granularity).

Q3 How do the four granularities involved in MulGraVD contribute to the performance of the model?
For Q3, we conducted a series of experiments that investigate the role of a particular granularity by removing it. The results are shown in Fig. 4 and Fig. 5. Combining the experimental results on both datasets, the model that considers all four granularities simultaneously accounts for four of the eight optimal values, and the model that does not consider the statement granularity accounts for three. We use the F1 score to measure the gain of each granularity: the average F1-score gains for the word, statement, basic-block, and function granularities are 5.90%, 3.49%, 4.46%, and 4.66%, respectively. In summary, every granularity contributes to our model.


Fig. 6. I@k, I_r@k, NDCG@k and MAP@k (k = 1, 3, 5) calculated by GNNExplainer and its variants (legend: GNNExplainer, GNNExplainer w/o feat, Gumbel-softmax, Gumbel-softmax w/o feat, Average).

Q4 Is the basis of the model's judgments (obtained by interpreting the model) correct?
We use GNNExplainer [28] and a variant of it based on Gumbel-Softmax to interpret the model. The explainer is iterated 50 times for each sample, and the statements are then sorted in descending order of importance score. In Fig. 6, on the MAP@k metric, the MAP value decreases as k increases; MAP@5 is 0.25, about 0.04 lower than MAP@1, indicating that the higher a statement is ranked, the more accurate the interpretation is. I@5 indicates that the probability that the first five rows given by the explanatory model intersect with the actual vulnerability rows is 64.5%, which shows that the model largely makes its judgments based on vulnerability-related rows.

Table 3. The effect of GGNN on MulGraVD (standard deviations in parentheses).

| Model | FFMPeg+Qemu Accuracy | Precision | Recall | F1 score | ReVeal Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|---|---|---|---|
| GGNN | 68.79 (2.13) | 63.46 (2.74) | 75.89 (3.31) | 69.01 (1.15) | 86.89 (1.07) | 39.12 (2.55) | 72.77 (3.10) | 50.80 (2.13) |
| GAT | 59.85 (1.31) | 54.59 (1.64) | 75.44 (7.46) | 63.10 (1.87) | 80.29 (1.64) | 26.87 (2.21) | 65.94 (3.91) | 38.08 (2.13) |
| GCN | 64.16 (1.22) | 59.88 (2.47) | 67.29 (6.39) | 63.09 (1.57) | 86.05 (0.75) | 36.28 (2.07) | 68.69 (2.92) | 47.43 (1.90) |
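For reference, hit-rate-style I@k and MAP@k over ranked statement explanations can be computed roughly as in the sketch below. The exact definitions used in the paper (including I_r@k and NDCG@k) may differ; `ranked_lines` and `vulnerable_lines` are hypothetical inputs holding, per sample, the explainer's ranking and the set of truly vulnerable lines.

```python
def interpretation_metrics(ranked_lines, vulnerable_lines, ks=(1, 3, 5)):
    """I@k: fraction of samples whose top-k explanation hits a vulnerable line.
    MAP@k: mean average precision of the top-k ranked lines."""
    def hit_at_k(ranking, truth, k):
        return float(any(line in truth for line in ranking[:k]))

    def ap_at_k(ranking, truth, k):
        hits, score = 0, 0.0
        for i, line in enumerate(ranking[:k], start=1):
            if line in truth:
                hits += 1
                score += hits / i
        return score / min(len(truth), k) if truth else 0.0

    n = len(ranked_lines)
    results = {}
    for k in ks:
        results[f"I@{k}"] = sum(hit_at_k(r, t, k)
                                for r, t in zip(ranked_lines, vulnerable_lines)) / n
        results[f"MAP@{k}"] = sum(ap_at_k(r, t, k)
                                  for r, t in zip(ranked_lines, vulnerable_lines)) / n
    return results
```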

Q5 How effective is the GGNN compared with other graph neural network approaches?
We evaluated the choice of graph neural network for MulGraVD. As shown in Table 3, the GGNN significantly outperforms the two commonly used graph neural networks, GAT and GCN; the number of layers of GAT and GCN was kept consistent with that of GGNN. Our model increases the F1 score by an average of 4.64% compared with GCN, and by 5.91% and 12.72% compared with GAT on the FFMPeg+Qemu and ReVeal datasets, respectively. The GGNN performs best on all eight metrics across both datasets, which supports the effectiveness of our chosen graph neural network.

5 Related Work

Deep learning-based static analysis, dynamic analysis, and symbolic execution have been developed to detect vulnerabilities. In terms of static analysis, [18,19] perform vulnerability detection by extracting program slices, and Chakraborty et al. [7] generate program representations from the code property graph and optimize the model with a triplet loss function. In terms of dynamic analysis, She et al. [25] use surrogate neural network models to learn smooth approximations of a program's branching behaviors, and She et al. [24] use a multi-task neural network to guide the mutation process. The deep learning-based symbolic execution work [26] learns an approximation of the relationship between program values of interest through neural networks.

6 Conclusion

We introduce a multi-granularity deep learning vulnerability detection model, MulGraVD, which can overcome the constraint posed by the number-of-layers hyperparameter and achieves improvements on several evaluation metrics. Future directions include applying the model to detect cross-function vulnerabilities, to better match the nature of software source code and vulnerabilities.

References

1. CVE-2012-2793. https://www.cve.org/CVERecord?id=CVE-2012-2793
2. CWE-120: Buffer overflow. https://cwe.mitre.org/data/definitions/120.html
3. CWE-190: Integer overflow. https://cwe.mitre.org/data/definitions/190.html
4. CWE-362: Race condition. https://cwe.mitre.org/data/definitions/362.html
5. CWE-369: Divide by zero. https://cwe.mitre.org/data/definitions/369.html
6. CWE-476: Null pointer dereference. https://cwe.mitre.org/data/definitions/476.html
7. Chakraborty, S., Krishna, R., Ding, Y., Ray, B.: Deep learning based vulnerability detection: are we there yet. IEEE Trans. Software Eng. 48, 3280–3296 (2021)
8. Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
9. Chen, D., Lin, Y., Li, W., Li, P., Zhou, J., Sun, X.: Measuring and relieving the over-smoothing problem for graph neural networks from the topological view. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 3438–3445 (2020)
10. Cheng, X., Zhang, G., Wang, H., Sui, Y.: Path-sensitive code embedding via contrastive learning for software vulnerability detection. In: Proceedings of the 31st ACM SIGSOFT International Symposium on Software Testing and Analysis, pp. 519–531 (2022)
11. Dosovitskiy, A., Beyer, L., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
12. Du, X., et al.: LEOPARD: identifying vulnerable code for vulnerability assessment through program metrics. In: 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE), pp. 60–71. IEEE (2019)
13. Hin, D., Kan, A., Chen, H., Babar, M.A.: LineVD: statement-level vulnerability detection using graph neural networks. In: Proceedings of the 19th International Conference on Mining Software Repositories, pp. 596–607 (2022)
14. Johnson, B., Song, Y., Murphy-Hill, E., Bowdidge, R.: Why don't software developers use static analysis tools to find bugs? In: 2013 35th International Conference on Software Engineering (ICSE), pp. 672–681. IEEE (2013)
15. Le, T., et al.: Maximal divergence sequential autoencoder for binary software vulnerability detection. In: International Conference on Learning Representations (2019)
16. Li, X., Wang, L., Xin, Y., Yang, Y., Tang, Q., Chen, Y.: Automated software vulnerability detection based on hybrid neural network. Appl. Sci. 11(7), 3201 (2021)
17. Li, Y., Wang, S., Nguyen, T.N.: Vulnerability detection with fine-grained interpretations. In: Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 292–303 (2021)
18. Li, Z., Zou, D., Xu, S., Jin, H., Zhu, Y., Chen, Z.: SySeVR: a framework for using deep learning to detect software vulnerabilities. IEEE Trans. Dependable Secure Comput. 19(4), 2244–2258 (2021)
19. Li, Z., et al.: VulDeePecker: a deep learning-based system for vulnerability detection. arXiv preprint arXiv:1801.01681 (2018)
20. Van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(11) (2008)
21. Neuhaus, S., Zimmermann, T., Holler, C., Zeller, A.: Predicting vulnerable software components. In: Proceedings of the 14th ACM Conference on Computer and Communications Security, pp. 529–540 (2007)
22. Russell, R., et al.: Automated vulnerability detection in source code using deep representation learning. In: 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), pp. 757–762. IEEE (2018)
23. Scandariato, R., Walden, J., Hovsepyan, A., Joosen, W.: Predicting vulnerable software components via text mining. IEEE Trans. Software Eng. 40(10), 993–1006 (2014)
24. She, D., Krishna, R., Yan, L., Jana, S., Ray, B.: MTFuzz: fuzzing with a multitask neural network. In: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 737–749 (2020)
25. She, D., Pei, K., Epstein, D., Yang, J., Ray, B., Jana, S.: NEUZZ: efficient fuzzing with neural program smoothing. In: 2019 IEEE Symposium on Security and Privacy (SP), pp. 803–817. IEEE (2019)
26. Shen, S., Shinde, S., Ramesh, S., Roychoudhury, A., Saxena, P.: Neuro-symbolic execution: augmenting symbolic execution with neural constraints. In: NDSS (2019)
27. Yamaguchi, F., Golde, N., Arp, D., Rieck, K.: Modeling and discovering vulnerabilities with code property graphs. In: 2014 IEEE Symposium on Security and Privacy, pp. 590–604. IEEE (2014)
28. Ying, Z., Bourgeois, D., You, J., Zitnik, M., Leskovec, J.: GNNExplainer: generating explanations for graph neural networks. Adv. Neural Inf. Process. Syst. 32 (2019)
29. Zhou, Y., Liu, S., Siow, J., Du, X., Liu, Y.: Devign: effective vulnerability identification by learning comprehensive program semantics via graph neural networks. In: Advances in Neural Information Processing Systems, vol. 32 (2019)

Rumor Detection with Supervised Graph Contrastive Regularization

Shaohua Li, Weimin Li(B), Alex Munyole Luvembe, and Weiqin Tong

School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
{flowingfog,wmli,luvembe,wqtong}@shu.edu.cn

Abstract. The rapid spread of rumors on social networks can significantly impact social stability and people’s daily lives. Recently, there has been increasing interest in rumor detection methods based on feedback information generated during user interactions and the propagation structure. However, these methods often face the challenge of limited labeled data. While addressing data dependency issues, graph-based contrastive learning methods struggle to effectively represent different samples of the same class in supervised classification tasks. This paper proposes a novel Supervised Graph Contrastive Regularization (SGCR) approach to tackle these complex scenarios. SGCR leverages label information for supervised contrastive learning and applies simple regularization to the embeddings by considering the variance of each dimension separately. To prevent the collapse problem, sessions belonging to the same class are pulled together in the embedding space, while those from different categories are pushed apart. Experimental results on two real-world datasets demonstrate that our SGCR outperforms baseline methods.

Keywords: Rumor Detection · Contrastive Learning · Graph Neural Network · Gradient Conflict

1 Introduction

The spread of rumors may have the following negative social impacts: first, people may accept deliberately fabricated lies as facts; second, rumors can change the way people respond to legitimate information; finally, the prevalence of rumors has the potential to undermine the credibility of the entire social ecosystem. Given the possible adverse effects of rumors, efficient methods for automatically identifying social media rumors are required. The dissemination of information in social networks has received much attention from researchers [9,12,22]. The spread of true and false rumors on social media elicited different responses, forming rumor propagation cascades that are helpful for automatic rumor identification. Methods based on user participants utilize the wisdom of the crowd. The opinion of a single user is likely to be wrong,


but combined with the stances and emotions in the feedback of multiple users, we can approach the truth. Many previous studies have explored the feasibility of using user feedback during propagation to identify rumors in conjunction with community and network research [8,10,19]. With the development of graph neural networks and their excellent performance in structured data processing, graph neural networks [6] have achieved widespread applications in a number of fields, including rumor detection [2,3,7,17,21]. However, most current rumor detection methods using graph neural networks rely on limited labels when updating node representations. Some researchers [20] have attempted to reduce their reliance on label information by employing contrastive learning to discover invariant representations through data augmentation. In these methods, each sample is paired with its augmentation to form positive pairs, while the sample is paired with augmentations of other samples to create negative pairs for contrastive learning. However, in supervised rumor classification tasks, such methods may struggle to effectively handle situations where different samples belong to the same class. This paper proposes a novel rumor identification method named Supervised Graph Contrastive Regulation (SGCR). SGCR benefits from crowd wisdom and identifies rumors by encoding the rumor propagation tree. SGCR uses contrastive learning to train the encoder and then get graph-level representations with the fusion of multiple pooling techniques. Given the features of input graphs, SGCR first generates contrastive views G = X  , A and G = X  , A through graph enhancement ApX ,pA , then the graph encoder applies attention mechanism to aggregate neighborhood information and update node representations in a weighted manner, and then get graph-level representations Z  and Z  after pooling. Then, Z  is sent to the classified for prediction results. What’s more, we make supervised contrastive regularization between Z  and Z  to mine the invariance characteristics of different rumors, where the similarity between two representations from the same class is maximized, the variance regularization term along the batch dimension and the invariance criterion between each pair of vectors is minimized. Therefore, the main contributions of this paper can be summarized as follows: – We propose a novel graph neural network architecture to represent rumor information with user feedback and obtain graph-level representations through fusion pooling; – We offer a supervised contrast regularization method as an auxiliary task for rumor detection, which reduces dependence on labeled data and enables samples of the same class to learn similar representations. – We conduct experiments on two real-world datasets, and the proposed model outperforms the baseline in most metrics;

2 Preliminary Knowledge

We define rumors as a set of statements C = {C_1, C_2, ..., C_n}, where each statement C_i contains the source post s_i and the corresponding responses r_i*, i.e., C_i = {s_i, r_{i1}, r_{i2}, ..., r_{i,n_i}}. We represent the graph structure of C_i as G_i = {V_i, E_i}, where V_i is the set of nodes and E_i represents the connections between nodes in the graph. The representations of the nodes in the graph form a feature matrix X_i. We consider rumor detection with user feedback as a supervised graph classification task, which learns a classifier f from the labeled statements:

f : G_i → Y_i   (1)

where Yi takes one of two categories: false rumor and true rumor.

3 Method

Fig. 1. Architecture of SGCR. Two weight-sharing branches (Graph Encoder → Readout → Projector) encode the augmented views G′ and G′′ into graph-level representations and projections Z′ and Z′′; a classifier operates on one branch, while the projections are trained to minimize distance, maintain variance, and satisfy the supervised contrastive loss.

This section proposes a novel rumor detection method that combines supervised contrastive learning and graph regularization, called Supervised Graph Contrastive Regulation (SGCR). As shown in Fig. 1, SGCR uses contrastive learning to train the encoder and pooling layer to mine the essential characteristics of different rumor propagation structures. Given the features of the input graphs, SGCR first generates contrastive views G′ = {X′, A′} and G′′ = {X′′, A′′} through the graph augmentation A_{pX,pA}; the graph encoder then applies an attention mechanism to aggregate neighborhood information and update node representations in a weighted manner, and graph-level representations G′ and G′′ are obtained with a readout. G′ is then sent to the fully connected classifier to get the prediction results. For supervised contrastive regularization, we adopt a projector fφ that maps the graph-level representations into projections in an embedding space where the supervised contrastive loss is computed.

3.1 Graph Representation

In SGCR, we consider the propagation structure of a rumor as a homogeneous graph and employ a graph neural network to fuse features of the original claim and user preferences. With the attention mechanism, the graph neural network pays more attention to the neighbors that are important to a node in the graph. We aggregate information from neighboring nodes to update the node representations with an attention mechanism [15,18]:

h′_i = W₁ h_i + Σ_{j∈N(i)} α_{i,j} W₂ h_j   (2)

where j ranges over the neighboring nodes of i, α_{i,j} denotes the attention coefficient between node i and node j, and W₂ h_j projects h_j to the Value vector. We incorporate a self-loop using W₁ h_i. The attention coefficient α_{i,j} is computed via scaled dot-product attention:

α_{i,j} = softmax( (W₃ h_i)ᵀ (W₄ h_j) / √d )   (3)

where W₃ and W₄ are trainable parameters that map h_i and h_j into the Query and Key vectors respectively, and d is the dimensionality of the input vectors. We scale the dot product by 1/√d and then apply softmax normalization to obtain the attention coefficients.
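A minimal PyTorch sketch of the aggregation in Eqs. (2)–(3) is shown below, written for a dense adjacency matrix for clarity; the actual implementation (for example, via a sparse message-passing library) may differ.

```python
import math
import torch
import torch.nn as nn

class DotProductGraphAttention(nn.Module):
    """Sketch of Eqs. (2)-(3): scaled dot-product attention over neighbours,
    plus a self-loop term W1*h_i, computed on a dense adjacency matrix."""
    def __init__(self, dim):
        super().__init__()
        self.w1 = nn.Linear(dim, dim, bias=False)
        self.w2 = nn.Linear(dim, dim, bias=False)
        self.w3 = nn.Linear(dim, dim, bias=False)
        self.w4 = nn.Linear(dim, dim, bias=False)
        self.dim = dim

    def forward(self, h, adj):
        # h: (n, d) node features; adj: (n, n) binary adjacency (1 if j in N(i))
        scores = self.w3(h) @ self.w4(h).T / math.sqrt(self.dim)   # (n, n)
        scores = scores.masked_fill(adj == 0, float("-inf"))       # neighbours only
        alpha = torch.softmax(scores, dim=-1)                      # attention coefficients
        alpha = torch.nan_to_num(alpha)                            # rows without neighbours
        return self.w1(h) + alpha @ self.w2(h)                     # Eq. (2)
```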

3.2 Rumor Classification

Combining user interactions on social networks can significantly improve performance in rumor identification tasks [4,11]. We treat rumor classification with user engagement information as a graph classification task and obtain a graph-level representation of the rumor propagation structure through fusion pooling:

G = FC(Max(H, G_i) ‖ Mean(H, G_i))   (4)

where the max-pooling and mean-pooling operations are performed over the nodes in graph G_i, and ‖ concatenates the graph-level representations from the two kinds of pooling. We then take the graph-level representation as the input to the classifier and generate the prediction results Ŷ:

Ŷ = Softmax(FC(G))   (5)

We take the cross-entropy between the classification outputs of the model Ŷ and the ground-truth labels Y as the loss function for the classification task:

L_CE(Y, Ŷ) = −(Y · log(Ŷ) + (1 − Y) · log(1 − Ŷ))   (6)

It should be noted that during the testing phase, we cease augmentation for the classification branch to obtain graph-level representations of the original structures and node features through the encoder.
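The fusion pooling and classification of Eqs. (4)–(5) can be sketched as follows; the layer sizes and the single-graph (unbatched) input format are simplifying assumptions for illustration.

```python
import torch
import torch.nn as nn

class FusionPoolClassifier(nn.Module):
    """Sketch of Eqs. (4)-(5): concatenate max- and mean-pooled node features
    to form a graph-level representation, then classify it."""
    def __init__(self, dim, num_classes=2):
        super().__init__()
        self.fc_pool = nn.Linear(2 * dim, dim)
        self.fc_out = nn.Linear(dim, num_classes)

    def forward(self, h):
        # h: (n_nodes, d) node representations of one propagation graph
        g = torch.cat([h.max(dim=0).values, h.mean(dim=0)], dim=-1)  # max || mean
        g = self.fc_pool(g)                                          # Eq. (4)
        return torch.softmax(self.fc_out(g), dim=-1)                 # Eq. (5)
```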

3.3 Supervised Contrastive Learning with Regulation

Some research has attempted to use contrastive learning as an auxiliary task to mine intrinsic data features and reduce dependence on limited labeled data. However, in supervised rumor classification tasks, contrastive learning-based methods [7,17] often struggle to handle cases where different samples from the same class are treated as negative pairs. In this paper, we combine supervised contrastive learning [5] and regularization [1], using supervised contrastive regularization as an auxiliary task to reduce reliance on limited labeled data. The supervised graph contrastive regularization first transforms the graph with an augmentation operation A_{pX,pA}(G) to obtain two augmented views G′ and G′′: we randomly remove edges in G_i with probability pA and mask node features with probability pX to generate the two augmented views G′_i = (X′_i, A′_i) and G′′_i = (X′′_i, A′′_i). The graph encoder fθ encodes the augmented views into representations H′ = f(X′, A′) and H′′ = f(X′′, A′′), and the fusion pooling operation is adopted as the readout function to obtain graph-level representations G′ and G′′. The MLP projector fφ further processes the representations into projections Z′ = fφ(G′) and Z′′ = fφ(G′′), and the supervised contrastive loss is computed between the projected graph-level representations Z′ and Z′′:

Z′ = σ(BN(FC(G′)))   (7)

v(Z) =



V ar(Z:,j ) + )

(8)

where γ is the target value hyperparameter representing the standard deviation,  is a small scalar against numerical instability, and V ar(x) is the unbiased variance estimator given by: 1  (xi − x ¯)2 , n − 1 i=1 n

V ar(x) =

(9)

where n is the number of x and x ¯ is the average of x. This criterion will enforce that the variance within the current batch is γ along each dimension, preventing crash solutions where all inputs map to the same vector.

170

S. Li et al.

We define the invariance criterion s between Z  and Z  as the mean squared Euclidean distance between each pair of vectors: s(Z, Z  ) =

1   Zi − Zi 22 m i

(10)

For the supervised contrastive criterion sup between Z and Z, we consider intra- and inter-view supervised contrastive losses. Only considering the mean square Euclidean distance between pairs of vectors cannot handle the case where multiple samples belong to the same class. To handle any number of positive pairs belonging to the same class, we consider the InfoNCE loss between anchors and samples of the same class within and between views: 2N sup(Z  , Z  ) = i=1 Lsup 2N 2N i (11) exp(zi ·zj /τ ) = i=1 2N−1 yj · log 2N 1 j=1 1i=j · 1y˜i =˜ ·exp(z ·z τ ) y ˜ −1 i

k=1

i=k

i

k

where Ny˜i is the number of mini-batch rumors with the same label yi as anchor i. Note that for the convenience of calculation, we splice Z  and Z  to get the feature matrix [z1 , · · · , z2n ] ∈ R2n×d . During training, for any i, the encoder is tuned to maximize the numerator of the logarithmic parameter in the equation while minimizing its denominator. The overall loss function is a weighted average of the invariance, variance, and covariance terms: LREG (Z  , Z  ) = λ{sup(Z  , Z  )} + μ{v(Z  ) + v(Z  )} + νs(Z  , Z  ),

(12)

where λ, μ, and ν are hyperparameters that control the importance of each term in the loss. In our experiments, we set ν = 1 , λ = μ = 25, =0.0001. 3.4

Joing Training with Gradient Projection

We update the parameters θ based on the cross-entropy loss CE of the rumor classification task together with supervised contrast regularization loss LREG : L = LCE + αLREG

(13)

where α is the balanced cross-entropy loss LCE and supervised regularization loss LREG , in our experiments, we empirically set it to 0.1.

4

Experiments

To verify the effectiveness of SGCR, we compare it with several recently proposed methods, especially those based on rumor propagation paths on two real datasets. In addition, we also compare the performance of different models on the early detection task to understand the real application value of the model. Finally, to clarify the contribution of different components and features to the rumor identification task, we conduct ablation experiments on SGCR.

Rumor Detection with Supervised Graph Contrastive Regularization

4.1

171

Datasets

We conduct experiments on data obtained through FakeNewsNet [16], which consists of true and false news data checked on two fact-checking sites, Politifact and Gossipcop, as well as the social feeds of engaged users on Twitter. The statistics of the dataset are shown in Table 1: Table 1. Dataset statistics. Politifact Gossipcop #graphs

314

5464

#true

157

2732

#fake

157

2732

#total nodes

41,050

314,262

#total edges

40,740

308,798

#Avg. nodes per graph 131

4.2

58

Baselines

We compare the proposed model with algorithms proposed in recent years, especially those based on rumor propagation paths. The baseline models compared in our experiments are as follows: – CSI [14] combines text and user response features, including a capture module that captures the time-varying features of a user’s response to a given document, a scoring model that scores user engagement, and an ensemble module. – GCNFN [13] generalizes convolutional neural networks to graphs, allowing the fusion of various features such as content, user profiles, interactions, and rumor propagation trees. – GNN-CL [4] applies continuous learning to progressively train GNNs to achieve balanced performance on existing and new datasets, avoiding retraining the model from scratch on the entire data. – BiGCN [2] proposes a bidirectional graph convolutional network that aggregates the features of adjacent nodes in the rumor propagation tree in both top-down and bottom-up directions. – GACL [17] uses adversarial training to generate challenging features and takes these features as difficult negative samples in contrastive learning to help the model strengthen feature learning from these difficult samples. We add two additional baselines that directly classify word2vec and BERT source content embeddings. We employ a 3-layer fully connected network and apply DropOut to prevent overfitting.

172

S. Li et al.

Many baseline methods utilize additional information, and to ensure fair comparisons, we implement the baseline algorithms only with features such as news content, user profiles, user preference embeddings, and rumor propagation trees. We apply the same batch size (64) and hidden layer dimension (64) for different methods. In the division of the data set, the training set, validation set, and test set account for 70%, 10%, and 20%, respectively. 4.3

Experimental Results and Analysis

The performance of SGCR and baseline models on two real datasets is shown in Table 2. Table 2. Prediction Performance on Real World Datasets. Model

Politifact Acc. Prec.

Rec.

F1

Gossipcop Acc. Prec.

Rec.

F1

Word2Vec BERT CSI GCNFN GNN-CL BiGCN GACL

0.8438 0.8594 0.8281 0.8594 0.6562 0.8906 0.8594

0.8530 0.8387 0.8293 0.9024 0.5000 0.8537 0.8065

0.8359 0.8551 0.8181 0.8458 0.6532 0.8843 0.8585

0.7706 0.7550 0.9159 0.9406 0.9296 0.9598 0.9027

0.7174 0.7482 0.9003 0.9498 0.8908 0.9596 0.792

0.7696 0.7504 0.9153 0.9400 0.9280 0.9596 0.9010

SGCR

0.9688 1

0.9224 0.9309 0.8947 0.8810 0.8182 0.9381 0.8929

0.8065 0.7567 0.9290 0.9339 0.9611 0.9437 0.9194

0.9412 0.9687 0.9799 0.9738 0.9859 0.9797

Fig. 2. Confusion matrix on Politifact and Gossipcop.

Both Word2Vec+MLP and Bert+MLP abandon the rumor propagation structure and only utilize news content features. The performance of these two models is close to several other methods on the smaller Politifact dataset but worse on the larger Gossipcop dataset. We believe this is because rumors in the Gossipcop dataset are more deceptive and cannot be effectively identified by simple text features, requiring the help of crowd wisdom. The CSI model

Rumor Detection with Supervised Graph Contrastive Regularization

173

Table 3. Prediction performance on Politifact and Gossipcop. Model

Politifact ACC F1

Gossipcop ACC F1

SGCR−graph SGCR−sup SGCR−reg SGCR

0.8594 0.9688 0.9531 0.9688

0.7706 0.9781 0.9744 0.9799

0.8551 0.9687 0.9531 0.9687

0.7690 0.9777 0.9741 0.9796

considers user participation, but the fixed user representations are spliced with the content embedding to get the score, so the effect is not as good as the model based on message passing. The poor performance of GNN-CL on the Politifact dataset is because the user profile contains insufficient information. BiGCN extracts more structural information from the Politifact dataset, with fewer but deeper rumor propagation trees, and achieves better results. The GACL view improves the generalization ability of the model through supervised contrastive learning with adversarial training, but it has not achieved good results on the Politifact and Gossipcop datasets. The confusion matrix for SGCR is shown in the Fig. 2. SGCR has achieved high accuracy on both categories and outperformed the baseline model across all metrics., especially the smaller Politifact. Under insufficient training, the SGCR makes full use of the class consistency information within and between views to mine the essential characteristics of different classes of rumor propagation structures, alleviates the problem of insufficient labeled data, and achieves a significant improvement compared to the baseline models. 4.4

Ablation Study

We conducted ablation experiments to clarify the contribution of different features and components to the model. The experimental results are shown in Table 3. – In SGCR−graph , we remove user involvement and classify rumor embeddings directly. – In SGCR−sup , we remove the supervised learning in supervised contrastive regularization. – In SGCR−reg , we removed the regularization term and solely employed supervised contrastive learning as an auxiliary task to learn consistent representations for samples of the same class. – In SGCR, we use the full SGCR model. After removing the user interactions, SGCR−graph is equivalent to the BERT model in the baselines and performs poorly on the GossipCop dataset, which makes it more difficult to distinguish authenticity. After adding user interaction information (SGCR−reg , SGCR−sup , SGCR), the performance of the model is

174

S. Li et al.

significantly improved, especially on the Gossipcop dataset, which is large in scale and difficult to identify solely by rumor content, illustrating the effectiveness of user stances reflected during user interactions on the task of rumor identification. The comparison between SGCR−reg and SGCR−sup shows that contrastive regularization can mine the essential characteristics of the rumor propagation structure and improve the model’s effectiveness on data sets that are difficult to identify. SGCR contains all the features and functions and achieves the best scores in each indicator. The comparison of SGCR and SGCR−sup shows that supervised contrastive learning can effectively alleviate the problem of insufficient labels in small-scale datasets.

Fig. 3. Visualization of learned latent feature representations on Gossipcop. Red and green dots represent false and real rumors, respectively. (Color figure online)

To understand the impact of various features and functionalities on the final embeddings, we qualitatively visualize the learned latent representations using t-SNE. As shown in Fig. 3, in SGCR−graph , which uses only the root node features, the representations of false rumors cannot be distinguished from those of true rumors. In SGCR−sup , some false rumors are misclassified as true rumors in the lower cluster due to the lack of supervised information from same-class samples. In SGCR−reg , the absence of regularization leads to larger distances between some false rumors, resulting in more misclassifications. SGCR, on the

Rumor Detection with Supervised Graph Contrastive Regularization

175

other hand, extracts essential features of different types of rumors, enhancing its ability to identify rumors, and the learned representations exhibit clear separation boundaries.

5

Conclusion

This paper proposes a novel GNN-based rumor detection method that incorporates supervised contrastive learning and regularization, called Supervised Graph Contrastive Regulation (SGCR). SGCR takes the embeddings of rumor statements and user interactions involved in rumor propagation as the initial features of nodes, then applies weighted attention to aggregate neighbor features and then adopts fusion pooling to obtain graph-level representations for rumor classification. In addition, we introduce supervised graph contrastive learning as an auxiliary task to constrain the consistency of node representations in the same class. Experimental results on two real-world datasets show that our proposed SGCR model achieves significant improvements over baseline methods. Acknowledgements. This work was supported in part by the National Key Research and Development Program of China (grant number 2022YFC3302601), the HighPerformance Computing Center of Shanghai University, and the Shanghai Engineering Research Center of Intelligent Computing System (2252600) for providing the computing resources.

References 1. Bardes, A., Ponce, J., LeCun, Y.: VICReg: variance-invariance-covariance regularization for self-supervised learning (2021). https://doi.org/10.48550/ARXIV.2105. 04906, https://arxiv.org/abs/2105.04906 2. Bian, T., et al.: Rumor detection on social media with bi-directional graph convolutional networks (2020) 3. Dou, Y., Shu, K., Xia, C., Yu, P.S., Sun, L.: User preference-aware fake news detection. In: Proceedings of the 44nd International ACM SIGIR Conference on Research and Development in Information Retrieval (2021) 4. Han, Y., Karunasekera, S., Leckie, C.: Graph neural networks with continual learning for fake news detection from social media. CoRR abs/2007.03316 (2020) 5. Khosla, P., et al.: Supervised contrastive learning. Adv. Neural. Inf. Process. Syst. 33, 18661–18673 (2020) 6. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016) 7. Li, S., Li, W., Luvembe, A.M., Tong, W.: Graph Contrastive Learning With Feature Augmentation for Rumor Detection. IEEE Transactions on Computational Social Systems, pp. 1–10 (2023). https://doi.org/10.1109/TCSS.2023.3269303, conference Name: IEEE Transactions on Computational Social Systems 8. Li, W., Guo, C., Liu, Y., Zhou, X., Jin, Q., Xin, M.: Rumor source localization in social networks based on infection potential energy. Inf. Sci. 634, 172–188 (2023) 9. Li, W., Zhong, K., Wang, J., Chen, D.: A dynamic algorithm based on cohesive entropy for influence maximization in social networks. Expert Syst. Appl. 169, 114207 (2021)

176

S. Li et al.

10. Li, W., et al.: Evolutionary community discovery in dynamic social networks via resistance distance. Expert Syst. Appl. 171, 114536 (2021) 11. Lu, Y.J., Li, C.T.: GCAN: graph-aware co-attention networks for explainable fake news detection on social media. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 505–514. Association for Computational Linguistics, Online (2020) 12. Ma, J., Gao, W., Wong, K.F.: Rumor detection on twitter with tree-structured recursive neural networks. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1980– 1989. Association for Computational Linguistics, Melbourne, Australia (2018) 13. Monti, F., Frasca, F., Eynard, D., Mannion, D., Bronstein, M.M.: Fake news detection on social media using geometric deep learning (2019) 14. Ruchansky, N., Seo, S., Liu, Y.: CSI: a hybrid deep model for fake news detection. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 797–806. CIKM ’17, Association for Computing Machinery, New York, NY, USA (2017) 15. Shi, Y., Huang, Z., Feng, S., Zhong, H., Wang, W., Sun, Y.: Masked label prediction: Unified message passing model for semi-supervised classification. arXiv preprint arXiv:2009.03509 (2020) 16. Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H.: Fakenewsnet: a data repository with news content, social context and dynamic information for studying fake news on social media. arXiv preprint arXiv:1809.01286 (2018) 17. Sun, T., Qian, Z., Dong, S., Li, P., Zhu, Q.: Rumor detection on social media with graph adversarial contrastive learning. In: Proceedings of the ACM Web Conference 2022, pp. 2789–2797. WWW ’22, Association for Computing Machinery, New York, NY, USA (2022) 18. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems 30 (2017) 19. Xu, Y., Zhuang, Z., Li, W., Zhou, X.: Effective community division based on improved spectral clustering. Neurocomputing 279, 54–62 (2018) 20. Yang, X., Lyu, Y., Tian, T., Liu, Y., Liu, Y., Zhang, X.: Rumor detection on social media with graph structured adversarial learning. In: Proceedings of the Twentyninth International Conference On International Joint Conferences on Artificial Intelligence, pp. 1417–1423 (2021) 21. Zhang, C., Li, W., Wei, D., Liu, Y., Li, Z.: Network dynamic GCN influence maximization algorithm with leader fake labeling mechanism. IEEE Transactions on Computational Social Systems, pp. 1–9 (2022). https://doi.org/10.1109/TCSS. 2022.3193583 22. Zhou, X., Li, S., Li, Z., Li, W.: Information diffusion across cyber-physical-social systems in smart city: a survey. Neurocomputing 444, 203–213 (2021)

A Meta Learning-Based Training Algorithm for Robust Dialogue Generation Ziyi Hu and Yiping Song(B) National University of Defense Technology, Changsha, China [email protected]

Abstract. There are many low-resource scenarios in the field of dialogue generation, such as medical diagnosis. Many dialogue generation models in these scenarios are usually unstable due to the lack of training corpus. As one of the most popular training algorithms in recent years, meta learning has achieved remarkable results. MAML in meta learning can find a fast adaptive initialization parameter for a series of low resource tasks, which is often used to solve the problem of low resource, and has achieved excellent performance in image classification tasks. However, in the field of text generation, such as dialogue generation, because of the large vocabulary, long sequence and large number of parameters involved in text generation, the effect of MAML is unstable. Therefore, this paper proposes a high robust text generation training framework based on meta learning for the dialogue generation task. By identifying the significant information in the model parameters, the optimizer can train the important parameters more concentrated in the limited data in the bi-level optimization. Experiments show that our method has a good performance on the BLEU scores on the six single-domain datasets. Keywords: Low resource

1

· Dialogue generation · Robustness

Introduction

Nowadays, more and more text generation tasks are based on low-resource scenarios. In the case of insufficient training corpus, many text generation models are prone to overfitting during training, and the model has poor robustness and cannot adapt well to specific tasks. Recently, a rapidly developing training algorithm in the low-resource field is meta learning. As a kind of low-resource training model, meta learning has achieved remarkable results in solving the shortage of training corpus. Meta learning, literally translated as learning to learn, specifically means that with the training of existing data, the model learns significant features of the data, to achieve the purpose of quickly adapting to new tasks. At present, the research on meta learning can be roughly divided into five independent directions: the method based on metric learning, initialization methods with strong generalization, the method based on optimizer, the method based on additional external storage and the method based on data augmentation. Among these, more c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1969, pp. 177–190, 2024. https://doi.org/10.1007/978-981-99-8184-7_14

178

Z. Hu and Y. Song

researches are based on initialization methods with strong generalization, and the best performance is Model-Agnostic Meta-Learning algorithm (MAML) [1], which has also become the backbone of the meta-learning research field. At present, there have been many improvements based on MAML [2–5] . However, the performance of MAML in some tasks is still not stable enough. Once some links are wrong or the input data is destroyed during the training process, the trained model will have problems such as poor generalization ability and poor robustness. In order to solve this problem, this paper works under the training framework of MAML and focuses on the dialogue generation task. By using Fisher information, parameters that contribute more to model training in optimization are identified, so that the optimizer can use more limited data to train these parameters to improve the robustness of the whole model. The main contributions of this paper are as follows: – In this paper, a low-resource neural network training algorithm with high robustness via meta learning is proposed for dialogue generation tasks; – Fisher information is introduced to identify the important parameters of the model, and the optimal mask is formulated; – Experiments on six task domains show that the trained model has strong robustness.

2

Related Work

From the existing research work, how to improve the robustness or stability of text models can be studied from three levels - sample level, semantic level and model level. From the perspective of samples, the methods to improve the robustness of the model can be explored from the selection of sampling methods, the construction of datasets, the construction of adversarial samples, the modification of input data and so on. These methods can be used to make full use of limited data during the training of the model and enhance the stability and robustness of the model. [6] generated pseudo-OOD samples by using in-distribution (IND) data, introduced a new regularization loss and fine-tuned the model, which improved the robustness of the model towards OOD data. Based on “Machine reading comprehension models are vulnerable and not robust to adversarial examples”, [7] proposed Stable and Contrastive Question Answering (SCQA) to improve invariance of model representation to alleviate the robustness issue. Increasing search steps in adversarial training can significantly improve robustness, but a too large number of search steps can hurt accuracy. Therefore, [8] proposed friendly adversarial data augmentation (FADA) to generate friendly adversarial data and performed adversarial training on FADA to obtain stronger robustness via fewer steps. Specifically, they recovered all the adversarial counterpart to the sentences word by word and named these data with only one word recovered as friendly adversarial data, which in their experiments can improve model robustness without hurting accuracy. To address the issue of low-resource

A Meta Learning-Based Training Algorithm for Robust Dialogue Generation

179

target data, [9] proposed a unique task transferability measure named Normalized Negative Conditional Entropy (NNCE), through which the source datasets can be fully used. It is relatively difficult to solve the problem of model robustness from the semantic level, but actually, the model is more prone to become unstable because of semantic problems. [10] argued for semantic robustness, which can improve performance compared to models robustness in the classical sense, such as the adversarial robustness. [11] measured the proximity, connectedness and stability of the faithfulness of explanations by counterfactuals, analyzed the role that latent input representations play in robustness and reliability – the importance of latent representation in the counterfactual search strategy. [12] design a novel prompt-based history modeling approach in Conversational Question Answering, which is simple, easy-to-plug into practically any model, and shows strong robustness in all kinds of settings. When training, the model can obtain strong robustness by performing certain operations on the model or training process, such as adding disturbance constraints, changing the loss function or activation function, adding external modules, and performing regularization processing. [13] used the Fisher information to calculate the Fisher mask and further obtained the sparse perturbation, which can not only reduce the computation cost of optimization potentially, but also make the optimizer focus on the optimization of the important parameters. [14] proposed a memory imitation meta-learning (MemIML) method that enhanced the model’s reliance on support sets for task adaptation and improved the robustness of the model. [15] outlines potential directions in enhancing distributional robustness of LLMs to mitigate a performance drop under distribution shift, proposed an accurate model generalization measure and methodologies for robustness estimation of both generative and discriminative LLMs. [16] proposed a weakly supervised neural model in low resources, which did not require relevance annotations, instead it was trained on samples extracted from translation corpora as weak supervision. Based on Multilingual neural machine translation (Multi-NMT) with one encoder-decoder model, [17] proposed a compact and language-sensitive method for multilingual translation and found that this method is more effective in low-resource and zero-shot translation scenarios.

3

Method

The work in this paper is based on text generation tasks, so the basic model is a seq2seq text generation model –the domain adaptive dialogue generation model (DAML). Since MAML is model-agnostic, it can be adapted to various scenarios. The following content will introduce how Fisher information is skillfully integrated with the DAML model. 3.1

FDAML

The method proposed in this paper is to improve the training of the DAML model by introducing Fisher information in the outer optimization stage without

180

Z. Hu and Y. Song

Fig. 1. The Training Process of FDAML. M represents the original model for each  cycle and Mk represents the temporary model. Step (1) is to input the data extracted S from Dtrain into the dialogue system; Step (2) is to calculate the loss LossSk through each input and the generated system response; Step (3) is to update the temporary model through local gradient descent; Step (4) is to input (ck , rk ) into the temporary model; Step (5) is to calculate the new loss lossSk ; Step (6) is to sum the losses corresponding to each data and use the sum as the loss of the outer layer optimization; Step (7) is to calculate the Fischer information of parameters and identify important parameters; Step (8) is to update the original model through global gradient descent. In the next cycle, the original model is replaced with the updated model, and these steps are repeated to train the model.

changing the main framework of the training of the DAML model. The improved model is named FDAML, of which the training process is shown in Fig. 1. This paper regards the objective function Lmeta of the outer optimization phase as the objective function of the mask variable m : Lmeta = Lmeta (m)

(1)

then uses the second-order Taylor expansion at m = 1 to approximate the objective function : 1 Lmeta (m) ≈ Lmeta (1) − gT (1 − m) + (1 − m)T H(1 − m) 2

(2)

2

∂ ∂ Lmeta (1)), H = E( ∂m where g = E( ∂m 2 Lmeta (1)). It is easy to know that when the model has converged to a local minimum, the first derivative g of the objective function Lmeta with respect to the mask variable is close to 0 [18], that is, the second term in Formula (2) is approximately 0, so Formula (3) is obtained :

1 Lmeta (m) ≈ Lmeta (1) + (1 − m)T H(1 − m) 2

(3)

A Meta Learning-Based Training Algorithm for Robust Dialogue Generation

181

Since Lmeta (1) is a constant relative to the mask variable, the objective of the outer optimization phase can be written as : argmin Lmeta (m) ≈ argmin(1 − m)T H(1 − m). m

m

(4)

It can be seen from Formula (4) that the optimal mask is determined by the Hessian matrix. However, since the accurate calculation of the Hessian matrix is almost impossible here, this paper uses the empirical Fisher information matrix of the mask variable to approximate the Hessian matrix :    N       T ∂ ∂ ˆ = 1 I(θ) Lmeta rjk , 1 · Lmeta rjk , 1 N j=1 ∂m ∂m

(5)

In this paper, the optimal mask variable is calculated by Formula (5), so as to select the important parameters of the model. However, in the actual calculation process, it is difficult to use the complete Fisher information matrix to solve the optimization problem. Therefore, in order to simplify the calculation, this paper assumes that the Fisher information matrix is a diagonal matrix. Algorithm 1. FDAML S Input: dataset in Source Domain Dtrain ; α,β; 1: Randomly initialize model M 2: while not done do 3: for Sk in Source Domain do S 4: Sample data ck from Dtrain 5: Evaluate LSk 6: Mk ← Mk − α ∗ ∇LSk 7: Evaluate lSk 8: end for  lS k 9: Evaluate Lmeta = ˆ 10: Evaluate Empirical Fisher I(θ) by Formula 5 ˆ (1 − s) · |θ|) 11: m1 ← ArgT opk(I(θ), ˆ s · |θ|) 12: m0 ← ArgT opk(−I(θ), 13: Update mask m by merging m = m1 ∪ m0 14: M ← M − β ∗ (∇Lmeta  m) 15: end while

The algorithm of model training is shown in Algorithm 1, steps 1–8 normally perform inner layer optimization training and calculate the optimization loss. Then, by calculating the Fisher information by Formula (5) and comparing its size, the parameters with larger importance scores are found and the mask is formulated : ˆ (1 − s) · |θ|) (6) m1 ← ArgT opk(I(θ), ˆ m0 ← ArgT opk(−I(θ), s · |θ|)

(7)

182

Z. Hu and Y. Song

m = m1 ∪ m 0

(8)

where the ArgT opk(a, k) function returns the first k maximum elements in a, and returns the minimum element when a is negative; s represents the sparse ratio, and its value is obtained by directly calculating the network parameters of each layer. |θ| represents the total number of parameters ; the m1 and m0 represent the set of elements 1 and 0 in the mask m, respectively. The m is a sparse binary code, where the element 1 corresponds to an important parameter. After formulating the mask, the mask is applied to the corresponding gradient, so that the optimizer can focus on the important parameters, and then updates the model : (9) M ← M − β ∗ (∇Lmeta  m) In this way, the model finishes a complete training, and training in this way repeatedly, the model will be constantly optimized, so that it can quickly adapt to different tasks. 3.2

Fisher Information

Fisher information is a method for measuring the amount of information about the unknown parameters of an observable random variable with respect to its distribution. It is defined as the variance of the Score function, as shown in Formula (10), and the Score function is the first-order derivative of the loglikelihood function, as shown in Formula (11) : I(θ) = V ar(S(θ))

(10)

∂ ln L(θ) (11) ∂θ where L(θ) represents the likelihood function. The expectation of the Score function is 0, as shown in Formula (12) : S(θ) =

E(S(θ)) = 0

(12)

Then it can be obtained from Formula (10) and (12) : I(θ) = V ar(S(θ)) = E(S 2 (θ)) − E 2 (S(θ))

(13)

I(θ) = E(S 2 (θ))

(14)

Formula (14) is a commonly used calculation formula for Fisher information. In general, assuming that θ represents the parameter vector, and the distribution of the observation X is P (x|θ), then the Fisher information matrix of the parameter is : I(θ) = E[∇ ln P (x|θ)(∇ ln P (x|θ))T ]

(15)

However, calculating the expectation of the likelihood function composed of the observation distribution directly is difficult. The empirical distribution

A Meta Learning-Based Training Algorithm for Robust Dialogue Generation

183

can be used to approximate the expected value in the overall distribution. The empirical Fisher formula is : N  ˆ = 1 I(θ) P (xi | θ) · ∇ ln P (x | θ) · ∇ ln P (x | θ)T N i=1

(16)

where X = {x1 , · · · , xN } is the sample data . Generally, the empirical Fisher information formula is used more frequently. In the second-order optimization method, the Fisher information matrix is often used to replace the Hessian matrix. The relationship between the two is : the Fisher information matrix is equal to the negative expectation of the Hessian Metrics of the logarithmic likelihood function : I(θ) = −E(Hln(P (x|θ)) ) 3.3

(17)

DAML

The basic idea of the Model-Agnostic Meta-Learning algorithm is to fix the model structure and find a suitable initialization for the model parameters through the existing data, so that the model can quickly adapt to different tasks through a small amount of data and achieve good performance on the data set. MAML is essentially a two-layer optimization algorithm. It first trains the model based on all tasks and performs inner optimization. Then go back to the initial position, summarize the loss of all tasks, find a direction that can take into account multiple tasks to use gradient descent and perform outer optimization. The DAML model based on meta learning [19] is an end-to-end trainable dialogue system model. It applies MAML to the dialogue domain, so that it can find an initialization method that can accurately and quickly adapt to the unknown new domain in the case of a small amount of data. Therefore, the model can effectively adapt to the dialogue task in the new domain with only a small number of training samples. The DAML model can be seen as a combination of MAML and Sequicity [20]. It applies MAML to the Sequicity, so as to find a suitable initialization for the model parameters during training. Therefore, the model structure of DAML is shown in Fig. 2. There are two optimization processes in the training of the DAML model. First, for each task domain, a certain size of data sampling is performed. Then the extracted training data (ck , rk ) are input into the sequence model to obtain the corresponding generated system response. For all task domains, cross entropy is used as the loss function : N      rjk log PM rjk LSk M, ck , rk =

(18)

j=1

In the training, a one-step gradient descent is used for each task domain,  and a temporary model Mk is updated. Then, based on the updated temporary


Fig. 2. Model Structure of DAML [19]. In the structure, Bt represents the hidden state of the dialogue at time t. Rt and Ut respectively represent the machine's response at time t and the user's answer at time t. mt is a simple label that helps generate the response, representing the state of the next step in the conversation. The model takes the dialogue context c = {Bt−1, Rt−1, Ut} as input, and the generated sentence Rt as output.

model, the same training data (c^k, r^k) is used to calculate the new loss l_{S_k} of each M'_k. Then, the updated loss values l_{S_k} on all task domains are summed to obtain L_meta, and L_meta is set as the objective function of the outer optimization stage:

L_meta = Σ_{S_k} l_{S_k}(M'_k, c^k, r^k)    (19)

Finally, the model is updated by minimizing the objective function Lmeta .
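The two-level optimization described above can be sketched as follows. This first-order, PyTorch-style sketch only illustrates the inner one-step adaptation (Formula 18) and the outer update on the summed post-adaptation losses (Formula 19); it is not the authors' code, and it omits the exact second-order MAML gradient as well as the Fisher-based masking.

import copy
import torch

def daml_outer_step(model, tasks, inner_lr, outer_lr, loss_fn):
    # tasks: iterable of (c_k, r_k) batches, one per task domain
    meta_grads = [torch.zeros_like(p) for p in model.parameters()]
    for c_k, r_k in tasks:
        temp = copy.deepcopy(model)                      # temporary model M'_k
        inner_loss = loss_fn(temp(c_k), r_k)             # Formula (18)
        grads = torch.autograd.grad(inner_loss, list(temp.parameters()))
        with torch.no_grad():
            for p, g in zip(temp.parameters(), grads):
                p -= inner_lr * g                        # one-step inner update
        outer_loss = loss_fn(temp(c_k), r_k)             # l_{S_k}(M'_k, c^k, r^k)
        grads = torch.autograd.grad(outer_loss, list(temp.parameters()))
        for mg, g in zip(meta_grads, grads):
            mg += g                                      # accumulate gradient of L_meta
    with torch.no_grad():
        for p, mg in zip(model.parameters(), meta_grads):
            p -= outer_lr * mg                           # outer (meta) update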

4 Experiments

4.1 Datasets

To compare with the original DAML model, this paper uses the SimDial dataset [19], which has six dialogue domains: restaurant, weather, bus, movie, restaurant-slot and restaurant-style. The restaurant-slot data has the same slot types and sentence generation templates as the restaurant domain but a different slot vocabulary, while restaurant-style has the same slots but different natural language generation (NLG) templates. In this paper, restaurant, weather and bus are chosen as source domains. For each source domain, there are 900, 100 and 500 conversations for training, validation and testing, respectively. The remaining three domains serve as target domains for evaluation. For each target domain, 9 dialogues (1% of a source domain) are generated for adaptation training, and 500 dialogues are used for testing each target model. Movie is chosen as the new target domain for evaluation, and the two domains restaurant-slot and restaurant-style are used to evaluate the performance of the model on task domains that share fewer common features with the source domains.

4.2 Metrics

In this paper, two metrics, the BLEU score and the entity F1 score, are selected to evaluate the performance of the model. They are the most important and persuasive metrics used in prior work, and [1] has exhaustively demonstrated the fast adaptation speed of MAML to new tasks. BLEU. BLEU (bilingual evaluation understudy) was originally a tool for assessing bilingual translation quality. It judges the quality of machine-translated sentences by comparing their similarity to professional human translations: the more similar, the better the quality. In this experiment, the BLEU score is used to evaluate the similarity between the response sentences generated by the model and the real responses; the higher the similarity, the better the performance of the model. Entity F1 Score. The entity F1 score measures the accuracy of a binary classification model by taking both precision and recall into account; it can be regarded as the harmonic mean of precision and recall. Its maximum value is 1 and its minimum value is 0, and the closer the value is to 1, the better the model performance. For each dialogue, this paper uses the F1 score to compare the key slot information of the response generated by the model with that of the real response.
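Both metrics can be computed with standard tools, as in the sketch below. It is only illustrative: the exact tokenization, smoothing, and slot-extraction details of this paper are not specified here, so the helper names and choices (NLTK's sentence_bleu with smoothing, set-based slot matching) are assumptions.

from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

def response_bleu(reference, hypothesis):
    # BLEU between one generated response and one reference response
    smooth = SmoothingFunction().method1
    return sentence_bleu([reference.split()], hypothesis.split(),
                         smoothing_function=smooth)

def entity_f1(true_slots, pred_slots):
    # F1 over the key slot values in the reference vs. the generated response
    true_set, pred_set = set(true_slots), set(pred_slots)
    if not true_set or not pred_set:
        return 0.0
    overlap = len(true_set & pred_set)
    precision = overlap / len(pred_set)
    recall = overlap / len(true_set)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)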

4.3 Implementation Details

For all experiments, this paper uses pre-trained GloVe word embeddings with a dimension of 50. The optimizers used are Adam and SGD. The learning rate of the Adam optimizer is set to 0.003, while the learning rate of SGD differs between experiments. The SGD optimizer is only used in the outer optimization stage of the initial training of the model, and the Adam optimizer is used in all other stages. If the validation loss increases during training, the learning rate is reduced by half. The number of data samples drawn per training step is set to 32, and the dropout probability is set to 0.5. The total number of training rounds in the initial training stage is set to 200, and the number of training rounds in the fine-tuning stage is set to 30. For fine-tuning, the model selected in this paper is the one obtained after 100 rounds of initial training. We test the 30 models obtained during fine-tuning and report the best result. To prevent the possible loss of important parameters, the following strategy is used in the experiment: when a layer contains fewer than 1000 valid parameters, all of its parameters are considered important by default, and Fisher information is not used for that layer.
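The learning-rate rule described above can be implemented in a few lines; the sketch below is a plausible realization rather than the authors' exact schedule.

def halve_lr_on_increase(optimizer, val_losses, factor=0.5):
    # Halve the learning rate whenever the validation loss increases
    # relative to the previous evaluation.
    if len(val_losses) >= 2 and val_losses[-1] > val_losses[-2]:
        for group in optimizer.param_groups:
            group["lr"] *= factor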

5 Results

5.1 Main Results

Table 1 shows the BLEU scores and F1 scores of the original DAML model and the FDAML model on six task domains.

Table 1. Performance comparison of the two models.

        Bus                Restaurant         Weather
        DAML     FDAML1    DAML     FDAML1    DAML     FDAML1
F1      0.9269   0.9257    0.9094   0.9088    0.7967   0.8058
BLEU    0.5277   0.5435    0.5515   0.5414    0.3670   0.3927

        Movie              Restaurant-style   Restaurant-slot
        DAML     FDAML1    DAML     FDAML1    DAML     FDAML1
F1      0.6702   0.7304    0.8503   0.7880    0.8161   0.8466
BLEU    0.3979   0.3749    0.4129   0.4380    0.4000   0.4096

It can be seen from Table 1 that the scores of FDAML are mostly higher than those of the original DAML model, which shows that the FDAML model performs better than DAML. Specifically, the two scores of the FDAML model on the Bus task domain are higher than those of the DAML model. At the same time, in some other task domains such as Restaurant, the scores of the FDAML model are not as high as those of DAML. This may be because training data homologous to the test data is used during training, so the model has already 'captured' the main data features of the source domain; the benefit of using Fisher information to identify salient parameters is therefore less pronounced. Nevertheless, the two scores of the FDAML model on the Restaurant and Weather task domains differ little from those of the DAML model, which also shows that the FDAML model reaches the level of the DAML model. Therefore, a first conclusion can be drawn: on test data from the same domains as the training data, the FDAML model can reach the level of the DAML model and in some cases exceed it. By comparing the data in Table 1, it can also be seen that the BLEU and F1 scores of the FDAML model in the three target domains are mostly higher than those of the DAML model, which shows that the overall performance of the model in the three target domains is better. Therefore, a second conclusion can be drawn: on new task domains, or on datasets that share few features with the training task domains, using Fisher information can effectively improve model performance and enhance the robustness of the model. In summary, the method of using Fisher information to improve the robustness of the model is effective.


Table 2. Model performance comparison of using two timings when the learning rate is 0.03.

        Bus                 Restaurant           Weather
        FDAML1   FDAML2     FDAML1   FDAML2      FDAML1   FDAML2
F1      0.6301   0.6275     0.9040   0.9260      0.8161   0.7876
BLEU    0.3301   0.3500     0.5552   0.5329      0.4042   0.2773

        Movie               Restaurant-style     Restaurant-slot
        FDAML1   FDAML2     FDAML1   FDAML2      FDAML1   FDAML2
F1      0.6769   0.6568     0.7527   0.8448      0.8581   0.8422
BLEU    0.3971   0.3949     0.4510   0.4410      0.4214   0.4320

Table 3. Model performance comparison of using two timings when the learning rate is 0.005.

        Bus                 Restaurant           Weather
        FDAML1   FDAML2     FDAML1   FDAML2      FDAML1   FDAML2
F1      0.5890   0.5903     0.9221   0.9274      0.8373   0.8119
BLEU    0.2679   0.2892     0.5537   0.5576      0.3703   0.3206

        Movie               Restaurant-style     Restaurant-slot
        FDAML1   FDAML2     FDAML1   FDAML2      FDAML1   FDAML2
F1      0.6701   0.6509     0.7679   0.7466      0.8703   0.8469
BLEU    0.3917   0.3831     0.4069   0.4561      0.4291   0.4195

5.2 The Timing of Using Fisher

In addition to the main comparative experiments, this paper also makes a preliminary study of the timing of using Fisher information through several sets of comparative experiments. The comparison was conducted with SGD learning rates of 0.03 and 0.005, and two timings for introducing Fisher information were set: when the number of training rounds reaches 0 and when it reaches 50. Before that point, the model does not use Fisher information during training; after it, the model uses Fisher information until the end of training. The model is fine-tuned on the six task domains and then tested to obtain BLEU and F1 scores. The results of the two groups of comparative experiments are shown in Table 2 and Table 3, where FDAML1 and FDAML2 denote the models that start using Fisher information from training round 50 and round 0, respectively. By comparing the data in Table 2 and Table 3, it can be seen that the two scores of the FDAML model that starts at round 50 are mostly higher than those of the model that starts at round 0. Therefore, a third conclusion can be drawn: introducing Fisher information after 50 rounds of training is more effective at improving the performance of the model than using Fisher information from the beginning of training. A conjecture is made: the calculation of Fisher information depends on the gradients of the previous round of parameters, so those parameters directly affect the reliability of the Fisher information. When Fisher information is used to screen important parameters from round 0, it causes deviations in the training of the model, mainly because the parameters in the first round are randomly initialized and contain no information specific to the training task domains; the Fisher information computed from them is essentially random and cannot accurately identify important parameters. After 50 rounds of training, the Fisher information can be computed more reliably and can extract task-related information from the already trained parameters to guide the training of the model. This result is also consistent with the theoretical part of this paper, where the Fisher information matrix is used to approximate the Hessian matrix. Specifically, when the objective function of the mask variable is simplified, the following fact is used: when the model converges to a local minimum, the first derivative of the objective function with respect to the mask variable is approximately 0. The timing of round 50 is selected because, by observing the validation loss during training, it was judged that the training of the model has begun to converge at around 50 rounds and the objective function is close to a local minimum.

6 Conclusion and Future Work

To improve the robustness of text generation models in low-resource scenarios, this paper proposes a highly robust training framework based on meta learning for the dialogue generation task. In the outer optimization stage of model training, the important parameters are identified by calculating and comparing Fisher information, so that the optimizer can focus on them. Evaluation is carried out on six task domains, and the experimental results show that using Fisher information during training can reduce the amount of computation and improve the robustness of the model while reaching or exceeding the original performance. Due to limitations of capacity and time, there are still many shortcomings in this research, which also provide goals and directions for future work:

– Consider the sparse ratio and stratification;
– Integrate multiple rounds of Fisher information;
– Introduce Fisher information in the inner optimization phase;


– Consider the non-diagonal case of the Fisher information matrix.

Acknowledgements. This paper is supported by the National Natural Science Foundation of China (NSFC Grant No. 62106275 and No. 62306330) and the Natural Science Foundation of Hunan Province (Grant No. 2022JJ40558).

References 1. Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: 34th International Conference on Machine Learning, ICML 2017, vol. 5, pp. 3933–3943. International Machine Learning Society (IMLS) (2017). https://arxiv.org/abs/1703.03400 2. Lux, F., Vu N, T.: Language-agnostic meta-learning for low-resource text-to-speech with articulatory features. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics 2022, vol. 1, pp. 6858–6868. Association for Computational Linguistics (ACL) (2022). https://arxiv.org/abs/2203.03191 3. Gu, J., Wang, Y. Chen, Y.: Meta-learning for low-resource neural machine translation. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, pp. 3622–3631. Association for Computational Linguistics (2018). https://arxiv.org/abs/1808.08437 4. Abbas, M., Xiao, Q., Chen, L.: Sharp-MAML: sharpness-aware model-agnostic meta learning (2022) 5. Song, X., Gao W, Yang, Y.: ES-MAML: simple hessian-free meta learning (2019) 6. Sundararaman, D., Mehta, N., Carin, L.: Pseudo-OOD training for robust language models (2022) 7. Yu, H., Wen, L., Meng, H.: Learning invariant representation improves robustness for MRC models. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 3306–3314. Association for Computational Linguistics (2022). https://arxiv.org/abs/1906.04547 8. Zhu, B., Gu, Z., Wang, L.: Improving robustness of language models from a geometry-aware perspective, pp. 3115–3125 (2022) 9. Chen, Z., Kim, J., Bhakta, R.: Leveraging task transferability to meta-learning for clinical section classification with limited data. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 6690–6702. Association for Computational Linguistics (2022). https://aclanthology.org/2022. acl-long.461/ 10. Malfa E, La., Kwiatkowska, M.: The king is naked: on the notion of robustness for natural language processing. In: Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 vol. 3(6), 11047–11057 (2021) 11. El Zini, J., Awad, M.: Beyond model interpretability: on the faithfulness and adversarial robustness of contrastive textual explanations. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 1391–1402. Association for Computational Linguistics (2022). https://arxiv.org/abs/2210.08902 12. Gekhman, Z., Oved, N., Keller, O.: On the robustness of dialogue history representation in conversational question answering: a comprehensive study and a new prompt-based method (2022) 13. Zhong, Q., Ding, L., Shen, L.: Improving sharpness-aware minimization with fisher mask for better generalization on language models. In: Findings of the Association for Computational Linguistics: EMNLP 2022, pp. 4093–4114. Association for Computational Linguistics (2022). https://arxiv.org/abs/2210.05497


14. Zhao, Y., Tian, Z., Yao, H.: Improving meta-learning for low-resource text classification and generation via memory imitation. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 583–595. Association for Computational Linguistics (2022). https://arxiv.org/abs/2203.11670 15. Štefánik, M.: Methods for Estimating and improving robustness of language models. In: NAACL 2022–2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Student Research Workshop, pp. 44–51. Association for Computational Linguistics (2022). https://arxiv.org/abs/2206.08446 16. Zhao, L., Zbib, R., Jiang, Z.: Weakly supervised attentional model for low resource ad-hoc cross-lingual information retrieval. In: Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing, pp. 259–264. Association for Computational Linguistics (2021). https://aclanthology. org/D19-6129/ 17. Wang, Y., Zhou, L., Zhang, J.: A compact and language-sensitive multilingual translation method. In: 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, pp. 1213–1223. Association for Computational Linguistics (2020). https://aclanthology.org/P19-1117/ 18. Cun Y, Le., Denker J, S., Solla S, A.: Optimal Brain Damage. In: Advances in Neural Information Processing Systems 2, pp. 598–605, 1990. https:// www.semanticscholar.org/paper/Optimal-Brain-Damage-LeCun-Denker/ e7297db245c3feb1897720b173a59fe7e36babb7 19. Qian, K., Yu, Z.: Domain adaptive dialog generation via meta learning. In: 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, pp. 2639–2649. Association for Computational Linguistics (2019). https://arxiv.org/abs/1906.03520 20. Lei, W., Jin, X., Ren, Z.: Sequicity: simplifying task-oriented dialogue systems with single sequence-to-sequence architectures. In: 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), vol. 1, pp. 1437–1447. Association for Computational Linguistics (2018). https://aclanthology.org/P18-1133/

Effects of Brightness and Class-Unbalanced Dataset on CNN Model Selection and Image Classification Considering Autonomous Driving Zhumakhan Nazir, Vladislav Yarovenko, and Jurn-Gyu Park(B) School of Engineering and Digital Sciences, Nazarbayev University, Astana, Kazakhstan {zhumakhan.nazir,vladislav.yarovenko,jurn.park}@nu.edu.kz

Abstract. In addition to an approach of combining machine learning (ML) enhanced models and convolutional neural networks (CNNs) for adaptive CNN model selection, a thorough investigation study of the effects of 1) image brightness and 2) class-balanced/-unbalanced datasets is needed, considering image classification (and object detection) for autonomous driving in significantly different daytime and nighttime settings. In this empirical study, we comprehensively investigate the effects of these two main issues on CNN performance by using the ImageNet dataset, predictive models (premodel), and CNN models. Based on the experimental results and analysis, we reveal non-trivial pitfalls (up to 58% difference in top-1 accuracy in different class-balance datasets) and opportunities in classification accuracy by changing brightness levels and class-balance ratios in datasets.

Keywords: Image Feature Extraction · Balanced Dataset · Interpretable Models · CNNs

1 Introduction

Object detection is a process of predicting locations and classes (image classification) of objects in an input image. Nowadays, utilizing CNNs is the most common approach for solving this problem [21]. However, even current state-of-the-art CNNs cannot perfectly classify all images; and deeper CNNs result in more inference time and energy consumption [16], although they can improve the accuracy. Moreover, as one of the key real-time applications, CNNs have been used in autonomous driving [1,2] to efficiently detect and classify objects such as cars, pedestrians, traffic lights, etc.; but this requires both high classification accuracy and low inference time. One solution is to build supervised machine learning (ML) based predictive models (premodel) using features of an input image to predict the fastest CNN model that can classify the given image correctly (e.g., adaptive CNN model


selection [16]). In our preliminary work [23], we also observe that a logistic regression (LR) based premodel is an interpretable and accurate model that achieves high overall test accuracy compared to other supervised ML models. (Note that some models are not interpretable, although they could be more accurate.) The premodel results clearly present the effects of class-balanced/-unbalanced datasets. Another observation is that brightness is one of the most important features in the premodel [23]; we therefore start from the intuition that images with higher brightness values have a higher chance of being successfully classified by CNNs, while darker images may be classified less successfully. In practice, the brightness problem is encountered in autonomous driving very often [20], as the accuracy of CNNs may worsen significantly at nighttime or in extreme weather conditions such as snow and rain. In this paper, with regard to brightness, two different pre-processing methods are studied: 1) categorizing images by their original brightness values, and 2) manually reducing the brightness of images. Both simulate how a camera perceives objects in such severe conditions. Therefore, by studying different brightness groups and analyzing their effects on CNN accuracy, the brightness effects can be tested. Moreover, the premodel's performance is affected by class imbalance in the training datasets. According to our previous observations [23], more than 80% of the images in the ImageNet 2012 Validation Set [5] can be successfully classified by MobileNet v1 [9], while 17% cannot be classified by any of MobileNet v1 [9], Inception [10] or ResNet [8]. Therefore, during the premodel training phase, which identifies the CNN that can classify each image correctly, the premodel may develop a bias towards images being classifiable (i.e., when using an unbalanced dataset). This may affect the performance of the premodel and needs to be investigated more thoroughly with respect to the effects of dataset size and balance. Our paper makes the following contributions:
1. Investigate how the brightness of images can affect the premodel and CNN accuracy, and compare different brightness types.
2. Compare the accuracy of CNN models, and test adaptive CNN model selection using the premodel.
3. Analyze the effects of size, balance, and type of datasets on the classification accuracy comprehensively.

2 Motivation

As a motivating example (Fig. 1a), to investigate how the different brightness affects the premodel and CNNs, we generate the copies of ImageNet dataset, with the reduced brightness levels: 75%, 50%, 25%, and 10%. Figure 1b illustrates the initial test using an unbalanced dataset with randomly chosen 1000 (1k) images. (Full description of an Unbalanced dataset will be given in Methodology section.) The results show that the accuracy drop can be noticed with the brightness decrease. However, the degree of decrease is specific for every CNN: with a 90% brightness decrease (i.e., the top-right 10% in Fig. 1b), Inception v4 experiences


Fig. 1. Motivating Example: Correlation between Image Brightness Level and CNN Classification Accuracy. (a) Images with 100%, 75%, 50%, 25%, and 10% brightness levels. (b) Top-1 CNN Accuracy on Unbalanced Dataset with 1k Images.

only a 15% accuracy drop, while ResNet v1 152 decreases almost by 35% in the same brightness level. Moreover, we are motivated to investigate comprehensively the effects of 1) dataset size, 2) class-balanced/-unbalanced datasets, 3) a specific type of dataset (e.g., car-related dataset) and 4) different types of CNN networks.

3 Related Work

For adaptive CNN model selection, Marco et al. [16] collected features of images using the feature detectors and descriptors from the ImageNet, and built a premodel which predicts a best CNN model in terms of Top-1 accuracy and inference time among MobileNet [9], Inception [10] and ResNet [8]. They also conducted an extensive investigation to select the most important seven features for premodel. From the perspective of the effect of the brightness feature on CNN performance, Xu et al. [22] proposed an adaptive brightness adjustment model, considering the importance of brightness in object detection; and Rodriguez et al. [19] observed that decreasing global illumination levels can degrade classification accuracy. Additionally, Castillo et al. [3] improved the classification performance of cold steel weapons by proposing a preprocessing algorithm based on the brightness; by brightening or darkening input images, their model “strengthens”


CNN’s robustness to light conditions. Lastly, Kandel et al. [11] also discussed brightness as a possible augmentation technique by testing brightness values in the range 0%-200%. Their results have shown that current state-of-the-art CNNs perform the best when the brightness is the closest to the original 100% value. In terms of class-unbalanced datasets, there are several works on the effects of class unbalance distribution in datasets [6,12–15,17]. The main results show that data preprocessing approaches like under-sampling [13–15,17] are effective with unbalanced dataset. Our work comprehensively investigates the effects of both the brightness feature and class-imbalance of datasets using the popular state-of-the-art ImageNet dataset for adaptive CNN model selection in the context autonomous driving.

Fig. 2. An Overview of our Methodology.

4 Methodology

Our methodology can be split into three different phases (Fig. 2): 1) Preparation of premodel and Data collection of brightness datasets, 2) Dataset size and Classbalance, and 3) Dataset property and Effects of brightness types. Phase 1: The first phase focuses on creating a predictive model (premodel) that can select an optimal CNN for image classification based on the features of an input image. For this, we select the ImageNet 2012 Validation Dataset [5] and


calculate the optimal CNN for each image in it. The CNN is considered to be optimal if it can correctly predict the image's label in the shortest amount of time. Next, the features (brightness and hue) of each image are extracted and recorded. By doing this, we collected a 50k dataset, which has image features as independent variables and an optimal CNN as the dependent/target variable. This dataset was split into training and test datasets, and the Logistic Regression (LR) based predictive model was selected as an accurate and interpretable model among the four ML estimators of Decision Tree, Logistic Regression, SVC Classifier, and MLP Classifier. Brightness and hue values of all images from the ImageNet dataset are extracted and fed to the premodel, which predicts the CNN that is most efficient for a given image. For instance, if the class is predicted to be "MobileNet", the MobileNet v1 CNN is used for the classification. If the class is "Failed", either Inception v4 or ResNet v1 152 will be used; one of these two CNNs is selected beforehand by comparing their performance. The selected network for the "Failed" case is used for all the following datasets, so it is important to conduct this test first. After that, copies of the ImageNet dataset with reduced brightness are generated. We arbitrarily chose 75%, 50%, 25%, and 10% as brightness levels, creating five different datasets. The brightness is changed using the Pillow [4] library in Python. The algorithm uses Formula (1), where out is the resulting image, α is the selected coefficient (1 for 100%, 0.75 for 75%, etc.), in1 is the original image, and in0 is a pure black image [7].

out = (1 − α) ∗ in0 + α ∗ in1    (1)
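Formula (1) corresponds to Pillow's brightness enhancer, which interpolates between a black image and the original. The sketch below shows one plausible way to generate a darkened copy; the file names are hypothetical.

from PIL import Image, ImageEnhance

def reduce_brightness(in_path, alpha, out_path):
    # alpha = 1.0 keeps the original image; 0.75, 0.5, 0.25, 0.1 darken it,
    # matching Formula (1): out = (1 - alpha) * black + alpha * in
    img = Image.open(in_path).convert("RGB")
    ImageEnhance.Brightness(img).enhance(alpha).save(out_path)

# e.g. reduce_brightness("image.JPEG", 0.5, "image_b50.JPEG")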

In parallel, the original images of the ImageNet dataset were split into 20 categories (2.5k images per category) based on their brightness values. This creates datasets with different brightness levels (referred to as category datasets) whose brightness was not changed artificially, unlike the method described above (referred to as percentage datasets in this paper). Finally, we choose the five categories whose average brightness values are most similar to those of the five percentage datasets. The effects of the different brightness types, which will be covered in Phase 3, are studied using the dataset collected in this phase. Phase 2: The second phase focuses on the dataset properties, such as class balance and the number of samples (i.e., dataset size). We select only images that belong to the "MobileNet" and "Failed" classes. An image has the "MobileNet" class if the optimal CNN for it is MobileNet v1, and the "Failed" class if all used CNNs fail to classify it. The "Inception" and "ResNet" classes were discarded to make the difference between classes more substantial and simplified, with "MobileNet" and "Failed" representing the easiest and the hardest images to classify, respectively. In addition, the "Inception" and "ResNet" classes had a very small proportion in the dataset, which would have resulted in even smaller balanced datasets. Since only 6.5k images from ImageNet belong to the "Failed" class, the largest balanced dataset with 50% "MobileNet" and 50% "Failed" classes can have only 13k images, obtained by adopting the under-sampling technique. To make a


Fig. 3. Comparison of Inception and ResNet accuracy on ImageNet (a “Failed” Unbalanced-1k dataset) with different percentage brightness, without premodel.

fair comparison between balanced and unbalanced datasets, the unbalanced dataset was built from the same 13k images, but with an 80%/20% "MobileNet"/"Failed" ratio. Moreover, smaller versions of these datasets with only 1k images were created as well to evaluate the effect of dataset size. Phase 3: Lastly, the third phase focuses on image properties: we created datasets with only car-related labels from ImageNet, such as taxi, minivan, street sign, traffic light, etc. Using these images, Balanced and Unbalanced versions with 0.3k and 0.8k images, respectively, were created as well. As a result, four datasets, each with five different brightness levels, were created. Finally, the difference between images with reduced brightness (the percentage perspective) and images with originally low brightness (the category perspective) is studied as well. To properly compare and evaluate the results, the Top-1 and Top-5 accuracy are recorded for all the datasets with different brightness levels. Based on these results, we evaluate the effects of brightness, dataset size and image properties on classification accuracy.
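The balanced and unbalanced splits described above can be produced by simple under-sampling, as in the following sketch. It only illustrates the idea; the label names and ratios follow the description in the text, while the sampling function and seed are assumptions.

import numpy as np

def undersample(labels, majority="MobileNet", minority="Failed",
                majority_share=0.5, seed=0):
    # Keep all minority-class images and draw just enough majority-class
    # images to reach the requested share (0.5 for balanced, 0.8 for 80/20).
    rng = np.random.default_rng(seed)
    maj_idx = np.where(labels == majority)[0]
    min_idx = np.where(labels == minority)[0]
    n_maj = int(len(min_idx) * majority_share / (1.0 - majority_share))
    keep_maj = rng.choice(maj_idx, size=min(n_maj, len(maj_idx)), replace=False)
    return np.concatenate([keep_maj, min_idx])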

5 Experimental Results

5.1 Experimental Setup

The initial CNN inference was performed using three different networks pretrained on ImageNet with the Tensorflow library: MobileNet v1 [9], which is the smallest and least accurate network, Inception v4 [10] as the most accurate one, and ResNet v1 152 [8]. The Scikit-learn [18] library is a machine learning Python library, which was used to build the Logistic Regression-based premodel. The brightness manipulation techniques were performed with the help of the Pillow library [4].
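A minimal version of the Logistic Regression premodel can be built with Scikit-learn as sketched below. The feature and label arrays are placeholders for the per-image brightness/hue features and the optimal-CNN labels described in the Methodology; the file names and split settings are assumptions.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X = np.load("image_features.npy")                          # per-image features, shape (N, d)
y = np.load("optimal_cnn_labels.npy", allow_pickle=True)   # e.g. "MobileNet", "Failed"

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
premodel = LogisticRegression(max_iter=1000)
premodel.fit(X_tr, y_tr)
print("premodel test accuracy:", premodel.score(X_te, y_te))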


Fig. 4. Classification Accuracy of 1k and 13k Balanced/Unbalanced datasets with different percentage brightness and Premodel.

5.2 Results and Analysis

Preliminary Results in Phase I: If the majority of images in all datasets belong to the “MobileNet” class, Mobilenet v1 should definitely be used as a simple network. However, if the predicted class is “Failed”, we can still attempt to classify it using either Inception v4 or Resnet v1 152. In order to make a choice between these two, we compare the performance of each network individually on a “Failed” Unbalanced-1k dataset in Fig. 3. (Note that the Top1 and Top-5 accuracies are lower than general ImageNet results because the ”Failed” Unbalanced-1k dataset are used.) In this case, premodel is not used yet, as we want to compare the performance of these two CNNs individually to use it for ”Failed” cases. On average, Inception v4 shows a 4% improvement in Top-1 accuracy for higher brightness values (100%, 75%, and 50%), and almost 10% improvements in one-tenth of the initial brightness (i.e., 10%). Similarly, Top-5 accuracy is also better, ranging from a 4% difference with higher brightness values, and more than 10% with lower ones. Considering these results, the Inception v4 can be selected as the complex CNN. It is used when the image’s predicted class is ”Failed”, and the CNN combination of Mobilenet v1 and Inception v4 will be used in all the following tests. Dataset Size and Class-Balance Results in Phase II: In order to comprehensively investigate any possible effects that are not related to brightness, we check how dataset size and class-balance can affect CNN classification results. Figure 4 shows the Top-1 classification accuracy of Balanced and Unbalanced datasets with 1k and 13k images. The accuracy is obtained with MobileNet and Inception while selecting CNN for each image with a predictive model. The 13k dataset shows up to 3% better Top-1 accuracy across different brightness levels. The difference between accuracies of 1k and 13k datasets decrease as we decrease

Fig. 5. Classification Accuracy of Balanced and Unbalanced 1k Datasets with Premodel + CNNs. (a) 1k Balanced Dataset. (b) 1k Unbalanced Dataset.

the brightness level, especially for the Balanced datasets. The Balanced dataset with 13k images at 10% brightness even has a slightly lower Top-1 accuracy than the balanced 1k dataset. However, this difference is only 0.4%, and considering that 10% brightness is an almost pitch-black image, this is not a critical deviation, and the overall trend remains the same. These results show that a small training dataset can negatively affect the predictive model and its accuracy, which results in lower classification accuracy for the 1k datasets. Having larger datasets for training and testing the predictive model improves the accuracy of proper CNN selection, thus resulting in higher image classification accuracy and more reliable results. The comparison of class-balance effects can be seen in Fig. 4 (with 1k and 13k datasets) and Fig. 5, which shows the classification accuracy of Balanced and Unbalanced datasets with 1k images. On average, compared to the Balanced datasets, all the Unbalanced ones show 25–28% higher Top-1 accuracy. A possible reason is related to the characteristics of the datasets: every Balanced dataset has 50% of the "Failed" class, which means that none of the used CNNs could predict the correct class for those very complicated images. Therefore, the maximum possible accuracy would be 50% and 80% for Balanced and Unbalanced datasets, respectively. In short, in the Balanced datasets, easily classifiable images were removed through under-sampling, which decreased the accuracy. It is also important to mention how dataset balance affects the premodel itself. If the premodel is trained on an Unbalanced dataset, it is more likely to ignore the class with fewer images. For instance, a premodel trained on an Unbalanced dataset will predict mainly "MobileNet" for test images, while ignoring the "Failed" class.

Fig. 6. Classification Accuracy of 1k and Car-related Balanced and Unbalanced Datasets with Premodel. (a) Top-1 Accuracy. (b) Top-5 Accuracy.

Table 1. Features and Values that Result in Higher Classification Accuracy.

Feature                  Higher Acc   Lower Acc
Avg. Perc. Brightness    ≤109.2       >109.2
Contrast                 ≥65.1
Edge Length              ≥154.6

q_{i,i} > q_{i,j}  and  q_{i,i} > q_{k,i} ,  ∀ j, k ≠ i .    (8)


Fig. 3. Three downstream tasks for fine-tuning. The inputs are e1 to eTf and the decoders implement the classification, prediction, and sorting tasks.

To impose the above constraints, we formulate the unsupervised contrastive learning objective as an auxiliary classification task as:

L_pair(Q) = − (1 / 2N) Σ_{i=1}^{N} [ log( e^{q_ii} / Σ_{j=1}^{N} e^{q_ij} ) + log( e^{q_ii} / Σ_{k=1}^{N} e^{q_ki} ) ]    (9)

in which q_ii is maximized as a logit. As Q depends on feature embeddings from the model outputs, we can optimize the model by minimizing L_pair using standard SGD.

Multi-task Pre-training Objective. We provide the multi-task pre-training objective by jointly minimizing the MSE regression loss in Eq. (7) and the contrastive pairing loss in Eq. (9) as L_pre = L_reg + α·L_pair, in which α is a scaling factor.
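The pairing loss in Eq. (9) has the same form as a symmetric InfoNCE/cross-entropy objective over the similarity matrix Q, which the following PyTorch-style sketch illustrates; it is an interpretation of the formula, not the authors' released code.

import torch
import torch.nn.functional as F

def pairing_loss(q):
    # q: (N, N) similarity matrix between N paired embeddings, q[i, j] = q_ij.
    # Eq. (9): push q_ii above its row competitors q_ij and its column
    # competitors q_ki, averaging the two cross-entropy terms.
    n = q.size(0)
    targets = torch.arange(n, device=q.device)
    row_term = F.cross_entropy(q, targets)       # -1/N * sum_i log softmax_row(q)[i, i]
    col_term = F.cross_entropy(q.t(), targets)   # -1/N * sum_i log softmax_col(q)[i, i]
    return 0.5 * (row_term + col_term)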

6 Fine-Tuning on Downstream Tasks

In this section, we consider fine-tuning our pre-trained backbone model for three downstream tasks of future forecasting. In fine-tuning, we follow a realistic practice by making predictions on a subset of future timestamps with a fixed interval that is relatively longer. We consider a set of K future time-steps S = {Δt, 2Δt, ..., KΔt}, where the interval between each step is Δt = 0.5-minute. We compute the cumulative price movement over Δt. Note that the Δt of 0.5-minute is equivalent to 10-steps of time during pre-training, as our data collection rate is 3-seconds per-step.

6.1 Lagged Regression (LR)

We consider a critical task of predicting the lagged price movement over the K future steps S, with a lag of Δt steps, as follows:

Y = { Δp(t) = p(t) − p(t − Δt) , ∀ t ∈ S }    (10)

in which p(t) is the mid-price at step t. Predicting lagged movement is a common task that guides realistic decision-making such as when to buy, hold, or sell. Figure 3-B illustrates the fine-tuning task with a designed decoder which decodes the positional encoding et at each step t for sequential predictions.
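The lagged targets of Formula (10) can be assembled directly from the mid-price series, as in the sketch below; the index convention (10 raw steps per Δt at the 3-second sampling rate) follows the text, while the function itself is illustrative.

import numpy as np

def lagged_movement_targets(mid_price, t0, k_steps=4, dt=10):
    # Formula (10): Delta p(t) = p(t) - p(t - Delta t) for the K future steps
    # t = t0 + Delta t, ..., t0 + K * Delta t, with Delta t = dt raw steps.
    return np.array([mid_price[t0 + i * dt] - mid_price[t0 + (i - 1) * dt]
                     for i in range(1, k_steps + 1)])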


To fine-tune the LR decoder robustly, we perform Quantile Regression (QR) [16] to estimate the distribution of the target price movement. Specifically, we choose a set of quantiles Q = [0.2, ..., 0.5, ..., 0.8] and estimate the predicted movement ŷ_q for each quantile q using the asymmetric quantile loss function:

ℓ_qr = (1 / |Q|) Σ_{q∈Q} [ q·|y − ŷ_q|_+ + (1 − q)·|ŷ_q − y|_+ ]    (11)

where |·|_+ truncates values to non-negative. During inference, we estimate the median values of the lagged price movements ŷ_0.5.

6.2 Directional Classification (DC)

We next consider predicting the movement directions for future steps by formulating a 3-way classification task that includes neutral, rising, or falling directions. This is a standard forecasting task in recent works [11, 22, 24]. Specifically, we collect the cumulative movements considered in Lagged Regression Eq. (10), and divide the range into three intervals as classes: class -1 denotes movement within (−∞, −0.1%) as falling, class 0 denotes [−0.1%, 0.1%] for neutrality, while class 1 denotes (0.1%, ∞) for rising. As shown in Fig. 3-A, we train the DC decoder to produce a 3-way probability distribution, and we utilize the focal loss [18] as the training objective to robustly estimate the movement distribution and tackle the class-imbalance issue. We evaluate the 3-way directional classification accuracy over future steps S.

6.3 Top-K Selection (TKS) of Stocks

We now explore a rarely studied task which predicts the top-K performing (and underperforming) stocks for buying (and short-selling). This Top-K selection task can help trading programs to identify quickly rising (and falling) stocks and generate profits. To evaluate our model's predictions, we use percentages of 5%, 10%, 20% and 30% for the top-rising and falling stocks, and we measure the hit rate as the accuracy metric. Directly optimizing the above TKS goals with classical machine learning methods is challenging. Therefore, we have designed a differentiable sorting technique (Diff-Sort) to establish the correct ordering of all stocks and train our model to be rank-aware. We will demonstrate that Diff-Sort can effectively improve the hit rates for the TKS task. Diff-Sort performs a proxy distribution assignment task [1, 7]. It estimates the sorting permutation of a non-sorted list by iteratively minimizing the difference between the intermediate permuted list and a pre-defined anchor list of increasing order. Let y^t = [y_1^t, ..., y_M^t] ∈ R^M be a list of movements for the M unique stocks in the market at the same step t (omitted below for simplicity). We aim to search for an M × M permutation matrix P* = {p_ij}_{i,j=1}^{M} which sorts y into y′ in increasing order as:

y′ = P*·y = [y′_1, ..., y′_M] ,  s.t.  y′_u < y′_v , ∀ u < v    (12)

To obtain P*, we introduce the following optimization steps. Let an M-element increasing list w = [1, 2, ..., M] serve as anchor points for pairing with the sorted

price movements. We build a cost matrix C to measure the difference between movement y_j and anchor point w_i as follows:

C = [ c_ij ] = [ (w_i − y_j)^2 ]_{i,j=1}^{M} ∈ R^{M×M}    (13)

We formulate the movement ranking task as a differentiable learning objective of finding the optimal permutation P* as follows:

P̂* = argmin_P ⟨P, C⟩ − λH(P) ,
s.t. P ≥ 0 ,  P·1_M = 1_M ,  P^T·1_M = 1_M    (14)

in which H(P) = − Σ_{i,j} P_ij log P_ij is an entropy term, and ⟨P, C⟩ is the Frobenius product that produces the total cost of permutation P. Ignoring the entropy term, it is easy to check that Eq. (14) is minimized when y_j and w_i have the same rank in y and w. This is equivalent to sorting y_j to its correct i-th rank by P*. Thus, the ranking over the all-stock movement y is:

R̂(y) = { r̂_i }_{i=1}^{M} = P̂*·w    (15)

Problem (14) is convex [6], so we can solve it in an iterative and differentiable way. In the training stage, we compare the estimated ranks R̂(y) with the actual ranks R(y) = {r_i}_{i=1}^{M}, which can be obtained by sorting the actual stock price movements. We design the movement ranking loss L_rank to penalize inconsistent pairwise orders in a supervised manner such that:

L_rank = (1 / M^2) Σ_{i=1}^{M} Σ_{j=1}^{M} max( 0 , −(r_i − r_j)(r̂_i − r̂_j) )    (16)
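The pairwise penalty of Eq. (16) can be written in a vectorized, differentiable form as below. This sketch covers only the supervised ranking loss; the Sinkhorn-style solution of problem (14) that produces the estimated ranks is not shown, and the tensor shapes are assumptions.

import torch

def ranking_loss(pred_ranks, true_ranks):
    # pred_ranks, true_ranks: (M,) tensors of estimated and actual ranks.
    # Eq. (16): penalize every pair (i, j) whose predicted order disagrees
    # with the actual order.
    diff_true = true_ranks.unsqueeze(1) - true_ranks.unsqueeze(0)   # r_i - r_j
    diff_pred = pred_ranks.unsqueeze(1) - pred_ranks.unsqueeze(0)   # r_hat_i - r_hat_j
    m = true_ranks.numel()
    return torch.clamp(-diff_true * diff_pred, min=0).sum() / (m * m)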

In testing, we evaluate the standard top-K hit rate (accuracy) by comparing the predictions with the actual top rising and falling stocks, respectively. We evaluate each K = round(r · M ) with r = 5%, 10%, 20% and 30% over M stocks at future steps.

7 Experiment

We evaluate our Pre-trained Financial Model (PFM) framework using self-collected benchmark datasets and compare it with state-of-the-art methods. We collected a large-scale dataset called STAR-22 from the Shanghai Stock Exchange Sci-Tech innovAtion boaRd (SSE STAR MARKET). This dataset includes 4800 stocks from 2021 to 2022, with high-frequency LOB features collected every 3 s, which amounts to 20 steps per minute and 1200 steps per hour. To improve the evaluation of our methods, we employed three rolling-based dataset splits for STAR-22. These splits consist of the test sets for (1) Mar. and April (a downtrending market), (2) Jun. and July (an uptrending market), and (3) Aug. and Sept. of 2022 (a fluctuating market), respectively, as shown in Fig. 4. We excluded May from the splits as the market is closed for Labor Day for the first five days of the month. For each split, all data preceding the test set is further divided for training and validation.


Fig. 4. The top figure shows the market indices (IF-300 and IC-500) on three rolling splits. The bottom figures show corresponding price movement averaged over all stocks and all steps in a 4-hour trading hour.

7.1 Our Methods and Baselines

In our proposed pretraining-and-finetuning scheme, we utilize all the training data in the split during the pre-training stage. For fine-tuning in the downstream task, we adopt two strategies for utilizing the data and compare their performance, as follows. PFM (Pre-trained Financial Model) is our proposed architecture, which employs a pre-trained encoder and fine-tuned decoder, as explained in Sects. 5 and 6. PFM utilizes the full training data for down-stream task fine-tuning, which covers more than one year of training data. PFM-FS (PFM-FewShot) uses the same model architecture as PFM but fine-tunes with only two months of data preceding the testing split. PFM-FS has the advantage of converging much faster due to the utilization of much less data (2 months versus one year). We compare our methods with existing baselines as follows. LightGBM [14] is a gradient boosting tree with a multi-class learning objective used to predict the directional class of future steps. ALSTM [11] uses an LSTM model with temporal attention and adversarial training to predict price sequences. TFT [17] combines LSTM and Transformer for input encoding and modifies the attention as interpretable self-attention layers. DTML [23] aggregates all stocks by combining their features as the market context; instead, we use market indexes as context, which is robust to single-stock variation and much faster for single-stock inference. We evaluate the down-stream 3-way Directional Classification (DC) task with standard accuracy, and we evaluate the Lagged Regression (LR) task using the MAD (mean absolute deviation) and R-Squared (R2) metrics, where Δp̂ is the predicted movement, Δp is the true movement, and Δp̄ is the average movement within a day. For the Top-K Selection (TKS) task, we define the metric hit rate (HR) as the ratio of correctly predicted top stocks to K.

7.2 Results and Analysis

In the following down-stream tasks, we predict the next four future steps with an interval of Δt = 0.5 minutes, i.e., S_f = {Δt, 2Δt, 3Δt, 4Δt}, which correspond to the next 0.5, 1.0, 1.5, and 2.0 min, respectively.

Table 1. Lagged Regression (LR) results.

Method          MSE ↓   MAD ↓   R2 ↑ (×10−2)
                                Avg    1Δt    2Δt    3Δt     4Δt
Linear          1.72    0.68    0.46   1.82   0.15   −0.04   −0.11
LSTM [11]       1.72    0.68    0.51   1.98   0.16   −0.03   −0.07
TFT [17]        1.64    0.66    0.72   2.56   0.26   0.03    0.00
DTML [23]       1.63    0.64    0.78   2.72   0.30   0.07    0.01
PFM-FS (ours)   1.53    0.66    0.83   2.91   0.32   0.08    0.02
PFM (ours)      1.51    0.65    0.90   3.17   0.34   0.09    0.01

Lagged Regression Results. In Table 1, we present the Lagged Regression results discussed in Sect. 6.1. We use (↑) to indicate that larger is better, and (↓) to indicate that smaller is better. We observe the following trends. Our fine-tuned model PFM outperforms the best non-finetuned baseline method (DTML) by 7.4% in MSE (1.51 vs. 1.63) and 15.4% in R2 (0.90 vs. 0.78). Our fine-tuned model PFM-FS, despite having few-shot fine-tuning data, still outperforms the best non-finetuned baseline DTML by 6.0% in MSE and 6.4% in R2. PFM-FS under-performs PFM by 8.4% in R2 (0.83 vs. 0.90), while using less than 1/6 of the full training data. R2 drops quickly over the future horizons of 0.5, 1.0, 1.5, and 2.0 min; e.g., R2 drops from 3.17 (0.5-min) to 0.34 (1.0-min), indicating the volatility and difficulty of predicting the long-term future.

Table 2. Directional Classification results in percentage.

Method          mAP ↑   mAcc ↑   Step-Acc ↑
                                 1Δt    2Δt    3Δt    4Δt
Linear          40.1    39.9     43.9   39.1   38.7   37.9
LightGBM [14]   43.1    42.5     45.8   43.2   41.0   39.7
LSTM [11]       41.5    41.3     43.1   42.2   40.7   39.2
TFT [17]        42.0    42.2     44.5   42.8   41.5   40.1
DTML [23]       44.1    44.0     46.0   45.6   43.4   41.9
PFM-FS (ours)   45.3    45.4     47.2   46.9   44.3   43.2
PFM (ours)      47.2    47.1     48.9   48.1   46.2   45.3


Directional Classification Results. In Table 2, we show the Directional Classification results, with the mean average precision (mAP) and mean step-wise accuracy (mAcc) over four future steps. We observe the following trends: Our PFM outperforms the best baseline DTML by around 7.0% in both mAP and mAcc, indicating the effectiveness of the pretraining-and-finetuning scheme. PFM-FS also outperforms the best baseline DTML by around 3.0% in both mAP and mAcc. PFM-FS under-performs PFM by only 4%, using less than 1/6 of the full training data. The DC accuracy slightly decreases as the time interval increases from 0.5 to 2 min, by about 8% (from 48.9% to 45.3%), so it is less sensitive to the length of the future compared with the regression task. The reason for this is that DC predicts the trend of movement, which can be more consistent than directly regressing the exact movement rate.

Table 3. Top-K Selection Hit Rate (%). Larger is better.

Method                      Top-K Rising Hit Rate        Top-K Falling Hit Rate       mHR ↑
                            5%     10%    20%    30%     5%     10%    20%    30%
Linear                      0.17   3.00   21.8   48.5    0.15   2.17   14.9   37.3   6.88
LSTM [11]                   0.19   4.22   28.4   39.1    0.11   5.30   16.7   39.1   7.66
TFT [17]                    0.32   4.15   30.8   41.1    0.22   6.13   24.6   42.8   8.74
DTML [23]                   0.25   5.37   32.0   49.1    0.19   5.72   29.6   50.8   9.92
PFM-Reg (no top-K tuning)   0.33   10.7   33.4   54.1    0.28   10.3   26.8   53.1   11.5
PFM-FS (ours)               0.56   11.2   39.9   67.2    0.69   11.9   38.7   57.4   13.9
PFM (ours)                  0.73   13.8   41.2   71.0    0.92   15.6   44.3   73.8   16.1

Top-K Selection Results. In Table 3, we show Top-K Selection (TKS) hit rate of top R = {5%, 10%, 20%, 30%} of the total number of stocks in the market. Our PFM and few-shot version PFM-FS are fine-tuned with the Diff-Sort task to improve their TKS capacities, as discussed in Sect. 6.3. We also compare with PFMReg, which is the exact regression model fine-tuned with the Lagged Regression (LR) task without being rank-aware. During inference, all models produce movement values, and we sort them to select the top K stocks. To measure the overall performance, we develop a weighted mean Hit Rate (mHR) metric, which aggregates all top ratios for both rising and falling cases, as follows:



mHR = ( Σ_{r_i∈R} 0.5·(a_{r_i} + b_{r_i}) / r_i ) / ( Σ_{r_i∈R} 1 / r_i )    (17)

in which ari and bri are rising and falling hit rate for top ratio ri , respectively. The importance of each top ratio is weighted by the inverse of ri , i.e., the accuracy of a small ri (e.g., 5%) has a greater impact on mHR than a larger one. Based on our analysis, we observe the following trends.


Our PFM outperforms the best baseline method DTML by around 62.3% in mHR (16.1 vs. 9.92), while our few-shot version PFM-FS also outperforms DTML by 40.1% (13.9 vs. 9.92), as shown in the last column of Table 3. PFM-FS under-performs PFM by 17.5% in mHR (13.9 vs. 16.1), likely due to the limited amount of fine-tuning data. PFM-Reg significantly under-performs PFM-FS and PFM, by 20.9% (11.5 vs. 13.9) and 40% in mHR (11.5 vs. 16.1), respectively. The difference between PFM-Reg and PFM is that PFM utilizes our proposed rank-aware Diff-Sort to effectively establish the correct order of predicted movements over the market. Zero-Shot Comparison. We further conducted an experiment to verify the zero-shot capacity of our fine-tuned models. To this end, we collected 800 new stocks from the China board of growth enterprises (ChiNext) with the same testing periods as STAR-22. Surprisingly, PFM-FS outperforms PFM in both tasks, by about 6.5% in MAD (0.43 vs. 0.46) and 4.3% in mAcc (0.47 vs. 0.45). This is likely due to the fact that the zero-shot prediction is more closely related to the most recent data trend, which is better captured by PFM-FS as it uses only the fresh history of the past two months. In contrast, PFM explores movement patterns over a longer history of training stocks, which may not be applicable to other stocks under the zero-shot setting.

8 Conclusion

In this study, we introduce a novel financial model that uses a pre-training-and-fine-tuning pipeline for accurately predicting stock movements. Our methodology involves integrating supervised regression and unsupervised contrastive learning objectives to pre-train a Transformer-based encoder-decoder model. This model captures both the unique stock movement patterns and whole market contexts. We demonstrate that our proposed approach significantly outperforms existing methods, as evidenced by a range of evaluation metrics for realistic downstream tasks.

Acknowledgments. This work is supported by the National Natural Science Foundation of China, Project 62106156, and Starting Fund of South China Normal University.

References 1. Blondel, M., Teboul, O., Berthet, Q., Djolonga, J.: Fast differentiable sorting and ranking. In: ICML (2020) 2. Box, G.E.P., Jenkins, G.M.: Some recent advances in forecasting and control. J. Roy. Statist. Soc. (1968) 3. Brown, T.B., et al.: Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020) 4. Cartea, A., Donnelly, R., Jaimungal, S.: Enhancing trading strategies with order book signals. Appl. Math. Financ. 25(1), 1–35 (2018) 5. Castoe, M.: Predicting stock market price direction with uncertainty using quantile regression forest (2020) 6. Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transport. In: NeurIPS, pp. 2292–2300 (2013)


7. Cuturi, M., Teboul, O., Vert, J.P.: Differentiable ranking and sorting using optimal transport. In: NeurIPS (2019) 8. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018) 9. Fan, C., Lu, H., Huang, A.: A novel differentiable rank learning method towards stock movement quantile forecasting. In: European Conference on Artificial Intelligence (2023) 10. Fan, C., et al.: Multi-horizon time series forecasting with temporal attention learning. In: SIGKDD (2019) 11. Feng, F., Chen, H., He, X., Ding, J., Sun, M., Chua, T.S.: Enhancing stock movement prediction with adversarial training. In: IJCAI (2019) 12. Gould, M.D., Porter, M.A., Williams, S., McDonald, M., Fenn, D.J., Howison, S.D.: Limit order books. Quant. Financ. 13(11), 1709–1742 (2013) 13. Holt, C.C.: Forecasting seasonals and trends by exponentially weighted moving averages. Int. J. Forecast. (2004) 14. Ke, G., et al.: Lightgbm: a highly efficient gradient boosting decision tree. In: NeurIPS (2017) 15. Kearns, M., Kulesza, A., Nevmyvaka, Y.: Empirical limitations on high-frequency trading profitability. J. Trading 5(4), 50–62 (2010) 16. Koenker, R., Bassett, Jr., G.: Regression quantiles. Econometrica: J. Economet. Soc. 33–50 (1978) 17. Lim, B., Arık, S.Ö., Loeff, N., Pfister, T.: Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int. J. Forecast. (2021) 18. Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: ICCV (2017) 19. Vaswani, A., et al.: Attention is all you need. In: NIPS (2017) 20. Wen, R., Torkkola, K., Narayanaswamy, B., Madeka, D.: A multi-horizon quantile recurrent forecaster. arXiv preprint arXiv:1711.11053 (2017) 21. Winters, P.R.: Forecasting sales by exponentially weighted moving averages. Manag. Sci. (1960) 22. Xu, Y., Cohen, S.B.: Stock movement prediction from tweets and historical prices. In: ACL (2018) 23. Yoo, J., Soun, Y., Park, Y., Kang, U.: Accurate multivariate stock movement prediction via data-axis transformer with multi-level contexts. In: SIGKDD (2021) 24. Zhang, L., Aggarwal, C., Qi, G.J.: Stock price prediction via discovering multi-frequency trading patterns. In: SIGKDD (2017) 25. Zhang, Z., Zohren, S.: Multi-horizon forecasting for limit order books: novel deep learning approaches and hardware acceleration using intelligent processing units. arXiv preprint arXiv:2105.10430 (2021) 26. Zhang, Z., Zohren, S., Roberts, S.: Deeplob: deep convolutional neural networks for limit order books. In: IEEE Transactions on Signal Processing (2018)

Impulsion of Movie’s Content-Based Factors in Multi-modal Movie Recommendation System Prabir Mondal1,2(B) , Pulkit Kapoor2 , Siddharth Singh3 , Sriparna Saha2 , Naoyuki Onoe4 , and Brijraj Singh4 1

National Institute of Technology Patna, Patna, India 2 Indian Institute of Technology Patna, Patna, India [email protected], [email protected] 3 Indian Institute of Technology Jodhpur, Jodhpur, India 4 Sony Research India, Bengaluru, India

Abstract. Nowadays the Recommendation System, a subclass of information filtering system does not require any introduction, and the movie recommendation system plays an important role in the streaming platform where a huge number of movies are required to be analyzed before showcasing a perfectly matched subset of them to its users. The existing works in this domain focus only on the output and consider the model’s input similar for all users. But actually, the movie embedding input vector varies on a user basis. A user’s perception of a movie depends on the movie’s genre as well as its meta information (story, director, and cast). To formulate the fact, we have introduced two scores, (i) TextLike score (TL score) and (ii) GenreLike score (GL score). Our proposed Cross-Attention-based Model outperforms the SOTA (state-ofthe-art) by leveraging the effect of the scores and satisfying our factual notion. In this paper, we have evaluated our model’s performance over two different datasets, (i) MovieLens-100K(ML-100K) and (ii) MFVCD-7K. Regarding multi-modality, the audio-video information of movies’ are used and textual information has been employed for score calculation. Finally, it is experimentally proved that the Cross-Attention-based multimodal movie recommendation system with the proposed Meta score successfully covers all the analytical queries supporting the purpose of the experiment. Keywords: TL score · GL score · Multihead Cross Attention · Movie Recommendation System · MFVCD-7K dataset · MovieLens-100K dataset

1 Introduction

Understanding users' preferences is a highly demanding and popular research topic in the AI/ML field. The recommendation system is one


of the best solutions to this real-world problem of the digital world. Personal viewing devices such as Fire TV and Apple TV, together with streaming services on OTT platforms such as SonyLiv, Amazon Prime, and Netflix, have necessitated more focused movie recommendation that matches user preferences. In recent works, user-movie embeddings generated from text, audio, or video are taken as the model's input and the corresponding rating value for the pair is predicted. In reality, however, a user prefers movies that match his or her preferred genres, directors, cast, and storylines. These preferences vary from user to user, and it is difficult for a traditional neural network feature extractor to capture them implicitly in a single parameterized function. The preferences over genres and over textual information (storyline, director, and cast) are equally important for analytically orienting the vector space of user-movie embeddings, and neither of them alone can capture the complete choice of a user. Considering this, we introduce two scores, (i) the TextLike score (TL score) and (ii) the GenreLike score (GL score). The TL score quantifies how much the user will like the textual information of the movie, and the GL score is the movie's genre-preference score for the user. Unlike the traditional approach, we do not pass the user and movie embeddings directly to the model. A weight based on the proposed Meta score (the combination of TL score and GL score) is applied to the movie and user embeddings, transforming the vectors to a proper magnitude before they are passed to the rating prediction model. We propose a cross-attention-based rating prediction model that takes the audio and video embeddings of movies together with the corresponding user embedding as inputs. A self-attention-based fusion technique preceded by multihead cross-attention is employed to fuse the two modalities. To evaluate the model, we use two datasets, (i) MovieLens-100K (ML-100K) and (ii) MFVCD-7K. A thorough ablation study shows experimentally that the proposed work outperforms the state-of-the-art and supports our concerns about real-world scenarios.

2 Related Works

Works on Recommendation are basically of three different types, (i) Collaborative Filtering, (ii) Content-Based, and (iii) Hybrid. The collaborative approach in recommendation is common in finding user-user, item-item, or user-item similarity. Authors in [12] used the collaborative approach by considering the implicit feedback of other users while recommending movies to a user. A movie recommendation system by content-based approach had been developed in the work of [19] and it used the genre content-wise correlations among the MovieLens movies for its model. The content-based movie recommendation system proposed in [17] uses movies’ textual information. Most of the works in the literature used textual information while trying to predict preferences. In [10], authors analyzed the tweets to understand the


users’ sentiments and the current trends while recommending movies to users. The power of deep learning has boosted the analyzing scope of multi-modal data rigorously and authors in [4] introduced deep neural network-based trusted filter (DNN-filter) in predicting the rating values for cold-start users. A deep learningbased model for movie recommendation has also been introduced by authors in [15] where textual information is used in the analysis. Multimedia contentbased information is effective in the movie recommendation system proposed by authors in [5]. A method in [13,16,18] tried to predict the ratings of the extended multimodal MovieLens dataset. Work proposed by [9] determined the preference of users by tracking their effects or emotions by its ARTIST model. It uses different sensors for capturing human reaction signals and tries to predict the preferences towards the items. By introducing the Graph Attention Network [21] in the proposed model, authors in [3] tried to handle the cold-start problem [11] over the multi-modal MovieLens dataset. Similarly in [7,14], the graph attention is used in uniforming the user’s as well as the item’s auxiliary information for its RS.

3 Problem Statement

The proposed model takes the multi-modal information of a movie along with the user embedding as input and, as output, predicts the rating value as a preference score for the user-movie pair.

\hat{r}_{mu} = \sigma([\,[\bar{a}_m \cdot \bar{u}],\ [\bar{v}_m \cdot \bar{u}]\,])   (1)

Unlike the conventional approach, we do not generate the user embedding by simply averaging the embeddings of all the movies watched by the user. Based on the user's preferences over movie content, a weight is calculated, and the weighted average of the movie embeddings is used to generate the user embedding. The input modality information (audio or video) is also combined with this weight before being passed to the model, rather than feeding the raw embedding directly. Equation 1 represents the generic function of the proposed model, where \hat{r}_{mu} is the predicted rating value for the user (u) and movie (m) pair, \bar{a}_m and \bar{v}_m are the movie's weighted audio and video embeddings, and \bar{u} is the user embedding generated from the weighted average of the embeddings of the user's watched movies. [\cdot] represents the cross-attention-based fusion between two embeddings with activation function \sigma, and [\,,\,] denotes concatenation in the equation.

4 Dataset

For the evaluation of the proposed model, we have used two different datasets, (i) multi-modal MovieLens 100K dataset [16] and (ii) MFVCD-7K [6].

4.1 Multi-modal MovieLens 100K (ML-100K)

The authors in [16] developed this multi-modal dataset by adding audio-video information to the text-based original MovieLens 100K dataset1. It contains the audio-video information of 1,494 Hollywood movie trailers, 943 users, and their 100K ratings. In addition, textual information such as each movie's summary, director, and cast is used in the proposed methodology to analyze the content of the movie.

4.2 MFVCD-7K Dataset

The Multifaceted Video Clip Dataset (MFVCD) contains the embeddings of video clips of movies linked to the MovieLens 20M2 dataset. MFVCD-7K provides the audio-video embeddings of 6,877 video clips from 796 unique movies, along with the movies' genre information. In our experiment, we use the i-vector (with GMM-128 and tvDim = 200) as the audio embedding, and the embedding from the fc7 layer of AlexNet as the video embedding.

5 Methodology

5.1 Embedding Generation

Embedding generation for audio and video is required only for the ML-100K dataset, because MFVCD-7K already provides the embeddings. For generating the Meta score (a combination of the Text score and the GL score), we also embed textual information such as the movie's summary, director's name, and cast. Only the multi-modal ML-100K dataset [16] has this textual information, which is why we calculate only the GL score for the MFVCD-7K dataset.

Video Embedding. Here are the dataset-wise video embedding techniques:
(a) For the MFVCD-7K Dataset: The MFVCD-7K dataset already contains the embeddings of the video clips of every movie. In our experiment, we take the fc7-AlexNet embeddings of every movie clip and average them to form a single 4096-dimensional vector representing the movie's video embedding.
(b) For the ML-100K Dataset: Video embedding generation for ML-100K is a two-step approach: (i) keyframe extraction and (ii) the Video Embedder.
(i) Keyframe Extraction: In the proposed approach, keyframe extraction is very straightforward. 16 frames are extracted from all the frames of every movie trailer by systematic sampling and treated as the keyframes of the trailer. The systematic sampling is done in such a way that informative frames (not blank or text-only frames) would be collected

1 https://grouplens.org/datasets/movielens/100k/
2 https://grouplens.org/datasets/movielens/20m/


as keyframes. In this way, 16 informative frames covering the entire trailer span are taken and passed to the next module of genre prediction. (ii)Video Embedder: The Video Embedder is highly valuable for classifying movie genres and generating genre-specific video embeddings. We utilize the advanced TimeSformer model [2] to extract video embeddings from ML100K movie trailers, incorporating keyframes extraction and seamlessly obtaining genre-tailored video embeddings. We utilized transfer learning in our method by using a pre-trained TimeSformer model, originally trained on the K-600 dataset for video genre classification. To create our video embedder, we made adjustments: removing TimeSformer’s classification layers, introducing a 512-neuron embedding layer, and adding an 18-neuron classification layer for multi-label genre classification. We fine-tuned the TimeSformer model by training its final layers with our movie trailers’ keyframes. The resulting 512-dimensional video embedding represents the trailer after training, specifically for ML-100K. Audio Embedding. Similar to generating video embeddings, we obtained 200-dimensional audio embeddings by averaging i-vectors with GMM-128 and tvDim=200 from the MFVCD-7K dataset. To process the ML-100K dataset, the movie trailer’s audio (.wav format) is converted into a 512-dimensional vector using the feature extractor wav2vec2 [1]. To generate task-specific audio embeddings for genres, we utilized a dense layer embedder comprising three dense layers with 512 neurons each, employing Tanh activation. The classification layer, with 18 neurons activated by Sigmoid, produces an 18-dimensional output for genre prediction. The Embedding Layer of the embedder extracts a 512-dimensional audio embedding from the input obtained through wav2vec2. Text Embedding. The ML-100K dataset is the only one with text data, including movie summaries, director names, and cast names. By applying word2vec [8], we obtained a 600-dimensional summary vector, a 600-dimensional cast name vector, and a 300-dimensional director name vector. These were concatenated to form a text embedding of size 1500 for each movie in the dataset. Due to the lack of mentioned textual information in MFVCD-7K, no textual embedding has been generated for its movies. Content-Based Score Calculation. A user always likes to watch those movies that belong to his preferred genres, are directed by specific directors, have a favorable storyline, and are cast by his favorite actors. Following this notion, we proposed a technique to calculate the Meta score of a user-movie pair that maps the concerned factors of a movie for a user. The Meta score is the average of the Text score and GenreLike score. Text score represents the user’s liking for textual movie information, while GenreLike score measures genre preference for user-movie pairs.
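To make the embedding-generation pipeline above more concrete, the following is a minimal sketch of the systematic keyframe sampling step; the variance-based filter used to skip blank or text-only frames, and all function and variable names, are illustrative assumptions rather than the authors' exact procedure.

```python
import numpy as np

def sample_keyframes(frames, k=16, min_std=5.0):
    """Pick k roughly evenly spaced frames over the whole trailer span,
    stepping past near-uniform frames (blank or text-only cards).
    `frames` has shape (num_frames, H, W, 3)."""
    n = len(frames)
    candidates = np.linspace(0, n - 1, num=k, dtype=int)   # systematic sampling
    keyframes = []
    for idx in candidates:
        j = int(idx)
        while j < n - 1 and frames[j].std() < min_std:      # skip uninformative frames
            j += 1
        keyframes.append(frames[j])
    return np.stack(keyframes)                               # (k, H, W, 3)

# Random array standing in for a decoded trailer.
dummy_trailer = np.random.randint(0, 255, (400, 112, 112, 3), dtype=np.uint8)
print(sample_keyframes(dummy_trailer).shape)                 # (16, 112, 112, 3)
```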


To assess movie scores, the user's rated movies are divided into five groups based on their rating values. The two datasets, ML-100K and MFVCD-7K, include ratings from 0.5 to 5 and encompass 18 genres. Group 1 (Gr. 1) comprises movies rated between 0.5 and 1.5, while the other groups are Gr. 2: {2 to 2.5}, Gr. 3: {3}, Gr. 4: {3.5 to 4}, and Gr. 5: {4.5 to 5}. These groups are used to calculate the Text score, the GenreLike score, and the Meta score.

(i) TextLike score (TL score): To capture a user's preference for movie text, the 1500-dimensional text embeddings of the user's movies are grouped and aggregated (Eq. 2). Here t_u^{(Gr)_i} is the textual embedding of a movie from group i rated by user u, with i \in \{1, 2, 3, 4, 5\} in Eqs. 2-7.

T_u^{(Gr)_i} = \sum t_u^{(Gr)_i}   (2)

When a new movie is considered for recommendation to a user, the group-wise text-preference score for the user-movie pair is measured by Eq. 3, where dot(,) denotes the dot product of two vectors and t_m is the text embedding of the new candidate movie.

t\_score_{um}^{(Gr)_i} = dot(T_u^{(Gr)_i}, t_m)   (3)

Finally, in Eq. 4, the group-wise textual preference scores for the user-movie pair are aggregated to evaluate the TextLike score (TL score). Here mul(,) is scalar multiplication and R^{(Gr)_i} is the highest rating value of the i-th group, so R^{(Gr)_1} = 1.5, R^{(Gr)_2} = 2.5, R^{(Gr)_3} = 3, R^{(Gr)_4} = 4, and R^{(Gr)_5} = 5.

TL\_score_{um} = \sum_i mul(\log(R^{(Gr)_i}), t\_score_{um}^{(Gr)_i})   (4)

(ii) GenreLike score (GL score): Like the TL score, the GenreLike score (GL score) is evaluated for a user-movie pair (um) as follows:

G_u^{(Gr)_i} = \sum g_u^{(Gr)_i}   (5)

g\_score_{um}^{(Gr)_i} = dot(G_u^{(Gr)_i}, g_m)   (6)

GL\_score_{um} = \sum_i mul(\log(R^{(Gr)_i}), g\_score_{um}^{(Gr)_i})   (7)

Equations 5-7 mirror Eqs. 2-4; the only difference is the embeddings. Here the movies' multi-hot 18-dimensional genre embeddings (g_m) are grouped into the five groups (G_u^{(Gr)_i}) based on the rating value range.

(iii) Meta score: As shown in Eq. 8, the Meta score is the average of the TL score and the GL score for the user-movie pair (um). Note that only the ML-100K dataset contains the textual information, so the Meta score is available for this dataset only; for the MFVCD-7K dataset we only have the GL score.

Meta\_score_{um} = 0.5 * (TL\_score_{um} + GL\_score_{um})   (8)
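A minimal sketch of Eqs. (2)-(8) follows, assuming the user's rated movies have already been bucketed into the five rating groups; the dictionary layout and function names are assumptions made for illustration.

```python
import numpy as np

# Highest rating value of each group, as used in Eq. (4)/(7).
R_GR = {1: 1.5, 2: 2.5, 3: 3.0, 4: 4.0, 5: 5.0}

def like_score(grouped_embeddings, candidate_embedding):
    """Generic form of Eqs. (2)-(4) and (5)-(7): aggregate the user's
    embeddings group-wise, score the candidate against each group with a
    dot product, and combine the group scores with log(R) weights.

    grouped_embeddings: dict {group_id: array of shape (n_i, d)}
    candidate_embedding: array of shape (d,)
    """
    score = 0.0
    for i, embs in grouped_embeddings.items():
        if len(embs) == 0:
            continue
        T_i = np.sum(embs, axis=0)                     # Eq. (2) / (5)
        s_i = float(np.dot(T_i, candidate_embedding))  # Eq. (3) / (6)
        score += np.log(R_GR[i]) * s_i                 # Eq. (4) / (7)
    return score

def meta_score(text_groups, genre_groups, t_m, g_m):
    tl = like_score(text_groups, t_m)    # TL score on 1500-d text embeddings
    gl = like_score(genre_groups, g_m)   # GL score on 18-d multi-hot genres
    return 0.5 * (tl + gl)               # Eq. (8)
```

The same helper serves both the TL score (1500-dimensional text embeddings) and the GL score (18-dimensional multi-hot genre vectors), since Eqs. (5)-(7) mirror Eqs. (2)-(4). The paper does not describe any normalization of these raw scores before they are used as weights, so none is applied in this sketch.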


Movie and User Embedding. In conventional approaches, the candidate movie embedding (e_m) is passed directly to the recommendation model along with the user embedding to predict the preference score for the user-movie pair.

\bar{e}_{um} = Meta\_score_{um} * e_m   (9)

\bar{e}_u = \frac{1}{N} \sum_j \bar{e}_{um}^{(Gr)_j}   (10)

In the proposed model, the movie embedding is adjusted with the Meta score of the corresponding user-movie pair (um). Equation 9 generates the adjusted movie embedding (\bar{e}_{um}) for user u. In the proposed methodology, the user embedding is not simply the average of the watched-movie embeddings whose rating values are above a certain threshold. As shown in Eq. 10, the user embedding is the average of the movie embeddings adjusted by Eq. 9. Note that for generating the user embedding, only the movies belonging to groups Gr_j with j \in \{3, 4, 5\} are considered, and N is the total number of movies in these groups.

Main Model Architecture: To understand user-movie preferences and predict ratings, cross-attention is essential for establishing a connection between the user and the movie. The main model integrates the attention mechanism [20] to consider user-movie preferences. Equations (11) and (12) are used to generate contextual vectors from (Q, K, V) triplets. The model incorporates two attention mechanisms, cross-attention and self-attention, in the Fusion block, as shown in Fig. 1. The architecture first performs cross-attention between the user-movie pair (um), generating attention-based vectors \hat{U} and \hat{M}. These vectors are then concatenated and passed to the self-attention layer in the fusion block. The resulting output undergoes feed-forward and dense layers to predict preferences. The attention blocks employ a multi-head mechanism with four heads.

Attention\_weight = Softmax\left(\frac{Q K^T}{\sqrt{d_k}}\right)   (11)

Context\_vector = Attention\_weight * V   (12)

where d_k is the dimension of the key vectors (K). The input consists of two vectors: the user embedding (\bar{e}_u) and the modality embedding (\bar{e}_{um}). The modality embedding, derived from the audio or video of the movies, undergoes the Meta score (Eq. 8) transformation before being fed into the model.
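The following PyTorch sketch ties Eqs. (9)-(12) together: movie embeddings are scaled by their Meta scores, the user embedding is their average, and the user and movie representations attend to each other with scaled dot-product attention. The single-head, projection-free formulation and all dimensions are simplifying assumptions; the paper's fusion block uses four attention heads followed by self-attention and dense layers.

```python
import torch
import torch.nn.functional as F

def adjust_embeddings(movie_emb, meta_scores):
    # Eq. (9): weight each movie embedding by its user-specific Meta score.
    return meta_scores.unsqueeze(-1) * movie_emb            # (n_movies, d)

def user_embedding(adjusted_watched):
    # Eq. (10): average of the adjusted embeddings of the watched movies (groups 3-5).
    return adjusted_watched.mean(dim=0)                      # (d,)

def cross_attention(query, key, value):
    # Eqs. (11)-(12): scaled dot-product attention.
    d_k = key.size(-1)
    weights = F.softmax(query @ key.transpose(-2, -1) / d_k ** 0.5, dim=-1)
    return weights @ value                                   # context vector

d = 512
watched = torch.randn(20, d)                 # embeddings of 20 watched movies
meta = torch.rand(20)                        # their Meta scores for this user
u = user_embedding(adjust_embeddings(watched, meta)).unsqueeze(0)   # (1, d)
m = torch.rand(1) * torch.randn(1, d)        # Meta-scaled candidate movie embedding
u_hat = cross_attention(u, m, m)             # user attends to the movie
m_hat = cross_attention(m, u, u)             # movie attends to the user
fused = torch.cat([u_hat, m_hat], dim=-1)    # goes on to self-attention / dense layers
print(fused.shape)                           # torch.Size([1, 1024])
```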

6 Results and Discussion

In the whole experiment, we try to cover four analytical questions to evaluate our model’s effectiveness. The research questions can be outlined as follows,


RQ1 : Is multi-modality or bi-modality effective in a movie recommendation system? RQ2 : Which content-based weight is better, TL score or GL score? RQ3 : Does the content-based weight play a positive role in generating movie and user embeddings? RQ4 : How does the model perform if Genre embedding or Text embedding is used instead of audio, video, or multi-modal embedding? The following is the analysis of the findings,

Fig. 1. Cross-Attention-Based Proposed Main Model.

Experimental Setup: In this experiment, three different models have been used (i) the Audio embedding generator, (ii) the Video embedding generator, and (iii) the user-movie preference score predictor (the main model). We evaluated our main model five times and reported the average score in the result tables. Audio-Video Embedding Generation : We have our audio-video generator for ML-100K dataset as the MFVCD-7K already provides the embeddings. The audio-video embedding generator actually tries to predict the 18dimensional genre distribution of the input movie. For this dataset, we have considered 1,285 movies for training and 209 for testing.


For the ML-100K audio embedding, the extracted audio file (.wav) of each movie trailer is converted into a 512-dimensional vector using wav2vec2 [1] and then passed to the audio embedder. The Tanh and Sigmoid activation functions are used in the hidden and output layers, respectively, with categorical cross-entropy as the loss function. The model is trained for 100 epochs, reaching 94.13% training and 91.56% testing accuracy. After training, the embedding from the second-last layer is taken as the corresponding audio embedding of the movie for the downstream task.

Table 1. Modality- and content-based-score-wise RMSE results on the two datasets. TL: TextLike score, GL: GenreLike score, Meta: Meta score, A: Audio modality, V: Video modality, All: Audio+Video modality.

Dataset     Modality   With Score (RMSE)               Without Score (RMSE)
                       TL        GL        Meta
ML-100K     A          1.0043    1.0022    1.0006       0.9493
            V          1.022     1.021     1.0189       0.9936
            All        0.9515    0.985     0.9285       0.9451
MFVCD-7K    A          -         0.753     -            0.7341
            V          -         1.0275    -            1.0116
            All        -         0.7486    -            0.7447

While generating the video embedding for ML-100K, the classification layer of TimeSformer is taken out, and a 512-dimensional hidden layer (treated as the video-embedding layer) is attached along with an 18-dimensional genre-prediction output layer, as in the audio embedder. TimeSformer [2] takes 96 or 16 frames for the action recognition task, but to keep our proposed model light and fast, we use 16 frames from each movie trailer. The video embedder is trained for 100 epochs with the cross-entropy loss. Finally, we obtain 90.62% test accuracy and extract the 512-dimensional embedding from the second-last layer (the video-embedding layer) for the downstream task.

Main Model Setting: To train our main model (the cross-attention-based model), we use 150 epochs, the MSE loss function, and the Adam optimizer. For this experiment, we split the dataset into an 80:20 training-testing split, and the model performance is reported in terms of RMSE and mAP@K (with K = 3, 10, 50).
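For reference, a small sketch of the two reported metrics, RMSE and mAP@K; the rule used here for deciding which items count as relevant for a user is an assumption, since the paper does not state its cut-off.

```python
import numpy as np

def rmse(y_true, y_pred):
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def ap_at_k(relevant, ranked, k):
    """Average precision at k for one user; `relevant` is a set of item ids."""
    hits, score = 0, 0.0
    for rank, item in enumerate(ranked[:k], start=1):
        if item in relevant:
            hits += 1
            score += hits / rank
    return score / min(len(relevant), k) if relevant else 0.0

def map_at_k(relevant_per_user, ranked_per_user, k):
    """Mean average precision at k over all users."""
    return float(np.mean([ap_at_k(relevant_per_user[u], ranked_per_user[u], k)
                          for u in ranked_per_user]))

print(rmse([4.0, 3.5, 2.0], [3.6, 3.9, 2.5]))                     # ~0.44
print(map_at_k({"u1": {"m2"}}, {"u1": ["m1", "m2", "m3"]}, k=3))  # 0.5
```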


Results: Table 1 covers the first three queries (RQ1, RQ2, and RQ3). The table reports the model's performance on the RMSE scale for the two datasets. It is clear from the table that the RMSE is lowest (which is technically good) when the bi-modality (audio+video) of the movie is considered. Hence multi-modality outperforms uni-modality (audio or video alone), which answers RQ1. Regarding RQ2, we can compare the TL score and the GL score for the ML-100K dataset only, because the other dataset lacks the textual information needed to form the TL score.

Table 2. Model's performance (RMSE) with Only Text, Only Genre, Only Meta, and Meta+All modality-based embeddings.

                          ML-100K   MFVCD-7K
Only Text Embedding       1.049     -
Only Genre Embedding      1.1152    0.9581
Only Meta Embedding       0.9942    -
Meta+All modality         0.9285    0.74911

In the same table, it is clear that none of the scores are superior to another but they affect the model’s performance positively when they are combined and form the Meta score. So it is proved that the content-based score is really effective for movie-user embedding and hence query RQ3 is justified well. Table 2 shows the ablation studies to deal with RQ4. It presents how the model would perform if the user’s preference is predicted by analyzing the movie’s genre or textual embedding only. For this experiment, we prepared movie and user embeddings in three different ways: (i) Only Text Embedding, (ii) Only Genre Embedding, and (iii) Only Meta Embedding . For (i) Only Text Embedding, we have considered the movie’s 1500-dimensional textual embedding as the movie’s embedding and generated the corresponding user’s embedding. For (ii) Only Genre Embedding the 18-dimensional multi-hot genre embedding is considered as movie embedding and (iii) Only Meta Embedding is formed by concatenating the Only Text Embedding and Only Genere Embedding. With their different user embeddings, the model is tested on these different movie embeddings and it is found that none of them outperforms the result of Meta+All modality. Meta+All modality is the combination of Audio-Video embedding with Meta score. Hence our consideration of Meta+All modality is justified and strong enough to defend RQ4. In Table 3(b), the comparison with SOTA has been tabulated and it is experimentally checked that the proposed method has the best RMSE value.

7 Error Analysis

From Table 1: (i) Individual score effect: For the ML-100K dataset, neither the TL score nor the GL score helps the model individually when multi-modality is concerned, but the Meta score has a noticeable impact. This supports our notion that a user's preference for a movie relies on both the movie's genre and its meta information (story, director, cast) simultaneously. (ii) Without-score performance: For uni-modality (audio or video), the model performs better without any content-based score (neither TL nor GL). For the Meta+All combination, however, the model clearly shows the effect of both multi-modality and the Meta score. Hence, to formulate the user's preference, combining the movie's joint genre-text information with its audio-video information is crucial.

Table 3. (a) mAP over the two datasets when the model uses the Meta score (GL score for the MFVCD-7K dataset) with Audio+Video modality. (b) SOTA vs. the proposed model.

(a)
            mAP@3    mAP@10   mAP@50
ML-100K     0.1461   0.1463   0.1468
MFVCD-7K    0.5278   0.5286   0.5287

(b)
Sl. No.   Technique         RMSE
1         Siamese [16]      1.028
2         GCN [13]          0.973
3         Proposed Model    0.9285

(iii) No TL score for MFVCD-7K: For MFVCD-7K dataset, the result for GL score+All combination is as same as Without-score result. Only the GL score is not sufficient to help the model in preference prediction. Adding the movies’ textual information to this dataset and leveraging the Meta score’s effect might improve the model’s performance. From Table 3 (a): (i) Low mAP@K for ML-100K : We have evaluated the mAP@K in three different cut-off values, K = 3, 10, and 100. For the ML100K dataset, it is seen that the values are quite small. This is because of the high sparsity of the user-movie interaction matrix. But there are 13,800 users in MFVCD-7K, and we have chosen 3,000 users randomly for our experiment. For 3,000 users the interaction matrix is quite dense and hence the mAP@K value is comparatively high.

8 Conclusion

The proposed multi-modal movie recommendation system takes the movie’s audio-video information with user embedding as input and predicts the preference score for the user-movie pair. The movie’s genre, as well as its meta


information (the story, director, and cast), are decisive factors for a user when choosing or rejecting a movie. The effect of the Meta score on the cross-attention-based model supports this notion. To evaluate the model's performance, two datasets, ML-100K and MFVCD-7K, have been employed, and the results show that our model beats the SOTA in both cases. In the future, we would like to extend the proposed model to a multitask setting by solving auxiliary tasks such as movie genre prediction and user age/gender prediction alongside rating prediction for the user-movie pair as the primary task.

Acknowledgement. The authors gratefully acknowledge the support from Sony Research India for conducting this research.

References 1. Baevski, A., Zhou, Y., Mohamed, A., Auli, M.: wav2vec 2.0: A framework for selfsupervised learning of speech representations. In: Advances in Neural Information Processing Systems 33, pp. 12449–12460 (2020) 2. Bertasius, G., Wang, H., Torresani, L.: Is space-time attention all you need for video understanding? In: ICML, vol. 2, p. 4 (2021) 3. Chakder, D., Mondal, P., Raj, S., Saha, S., Ghosh, A., Onoe, N.: Graph network based approaches for multi-modal movie recommendation system. In: 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 409–414. IEEE (2022) 4. Choudhury, S.S., Mohanty, S.N., Jagadev, A.K.: Multimodal trust based recommender system with machine learning approaches for movie recommendation. Int. J. Inf. Technol. 13(2), 475–482 (2021). https://doi.org/10.1007/s41870-02000553-2 5. Deldjoo, Y.: Enhancing video recommendation using multimedia content. Spec. Top. Inf. Technol. pp. 77–89 (2020). https://doi.org/10.1007/978-3-030-32094-2 6 6. Deldjoo, Y., Schedl, M.: Retrieving relevant and diverse movie clips using the mfvcd-7k multifaceted video clip dataset. In: 2019 International Conference on Content-Based Multimedia Indexing (CBMI), pp. 1–4. IEEE (2019) 7. Feng, C., Liu, Z., Lin, S., Quek, T.Q.: Attention-based graph convolutional network for recommendation system. In: ICASSP 2019–2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7560–7564. IEEE (2019) 8. Goldberg, Y., Levy, O.: word2vec explained: deriving Mikolov et al’.s negativesampling word-embedding method. arXiv preprint arXiv:1402.3722 (2014) 9. Kaklauskas, A., et al.: An affect-based multimodal video recommendation system. Stud. Inf. Control 25(1), 6 (2016) 10. Kumar, S., De, K., Roy, P.P.: Movie recommendation system using sentiment analysis from microblogging data. IEEE Trans. Comput. Soc. Syst. 7(4), 915–923 (2020) 11. Lam, X.N., Vu, T., Le, T.D., Duong, A.D.: Addressing cold-start problem in recommendation systems. In: Proceedings of the 2nd International Conference on Ubiquitous Information Management and Communication, pp. 208–211 (2008) 12. Lavanya, R., Bharathi, B.: Movie recommendation system to solve data sparsity using collaborative filtering approach. Trans. Asian Low-Resour. Lang. Inf. Process. 20(5), 1–14 (2021)


13. Mondal, P., Chakder, D., Raj, S., Saha, S., Onoe, N.: Graph convolutional neural network for multimodal movie recommendation. In: Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, pp. 1633–1640 (2023) 14. Mondal, P., Kapoor, P., Singh, S., Saha, S., Singh, J.P., Onoe, N.: Task-specific and graph convolutional network based multi-modal movie recommendation system in Indian setting. Procedia Comput. Sci. 222, 591–600 (2023) 15. Mu, Y., Wu, Y.: Multimodal movie recommendation system using deep learning. Mathematics 11(4), 895 (2023) 16. Pingali, S., Mondal, P., Chakder, D., Saha, S., Ghosh, A.: Towards developing a multi-modal video recommendation system. In: 2022 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2022) 17. Pradeep, N., Rao Mangalore, K., Rajpal, B., Prasad, N., Shastri, R.: Content based movie recommendation system. Int. J. Res. Ind. Eng. 9(4), 337–348 (2020) 18. Raj, S., Mondal, P., Chakder, D., Saha, S., Onoe, N.: A multi-modal multi-task based approach for movie recommendation. In: 2023 International Joint Conference on Neural Networks (IJCNN), pp. 1–8. IEEE (2023) 19. Reddy, S.R.S., Nalluri, S., Kunisetti, S., Ashok, S., Venkatesh, B.: Contentbased movie recommendation system using genre correlation. In: Satapathy, S.C., Bhateja, V., Das, S. (eds.) Smart Intelligent Computing and Applications. SIST, vol. 105, pp. 391–397. Springer, Singapore (2019). https://doi.org/10.1007/978981-13-1927-3 42 20. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems 30 (2017) 21. Velickovic, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y., et al.: Graph attention networks. Stat 1050(20), 10–48550 (2017)

Improving Transferability of Adversarial Attack on Face Recognition with Feature Attention

Chongyi Lv1 and Zhichao Lian2(B)

1 School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
[email protected]
2 School of Cyberspace Security, Nanjing University of Science and Technology, Wuxi, China
[email protected]

Abstract. The development of deep convolutional neural networks (DCNNs) has greatly improved face recognition (FR), which is now used in many applications; this raises concerns about the fact that DCNNs are vulnerable to adversarial examples. In this work, we explore the robustness of FR models through the transferability of adversarial examples. Because they overfit the specific architecture of a surrogate model, adversarial examples tend to exhibit poor transferability. We therefore propose to use diverse feature representations of the surrogate model to enhance transferability. First, we argue that, compared with adversarial examples generated by modelling the output features of deep face models, examples generated by modelling internal features transfer better. We then leverage the attention of the surrogate model on a pre-determined intermediate layer to seek key features that different deep face models may share, which avoids overfitting on the surrogate model and narrows the gap between the surrogate and target models. In addition, to further raise the black-box attack success rate, a multi-layer attack strategy is proposed, enabling the algorithm to generate perturbations guided by features of general interest to the model. Extensive experiments on four deep face models show the effectiveness of our method.

Keywords: Adversarial Attack · Transferability · Face Recognition · Feature Attention

1 Introduction

In recent years, with the rapid development of DCNNs, the accuracy of FR technology has improved substantially, and FR systems have penetrated many aspects of daily life, such as phone unlocking, payment, and security checkpoints. Nevertheless, researchers have found that DCNNs are vulnerable: at test time, after carefully designed perturbations are added to the original inputs, these images, called adversarial examples, can easily confuse DCNNs into producing wrong recognition results. As extensions of DCNNs, FR models also suffer from such attacks. For instance,


Komkov et al. [1] found that only putting a piece of paper on the hat can deceive stateof-the-art FR models. Such phenomenon has raised serious concerns about the security of FR applications. Lots of previous works have demonstrated the vulnerability of deep face models [2, 3], but large parts of them assumed that the attacker has full access to the target model, which is classified as white-box attacks in adversarial attack. White-box attacks use the corresponding examples to adjust perturbations added on the clean examples when the parameters and internal structure of the target model are known. In contrast, black-box attacks generate examples only through the relationship between the input and output of the network. Query-based methods in black-box settings need to interact with target model through large amounts of queries. However, in real-world attack situations, the attacker typically has no rights to get corresponding outputs many times. Goodfellow et al. [4] argued that the reason for the existence of adversarial perturbation is the linear behavior of high-dimensional space of DCNNs, which also provided a reasonable explanation for the transferbility of adversarial examples, meaning that there is still some effect on attacking another unknown model using the examples trained against a white-box model. A lot of works have been proposed to enhance the transferbility of adversarial examples against object classification task, such as using loss-preserving transformation [5–7] and increasing the diversity of surrogate models [8, 9]. Through extensive experiments, the authors in [10] confirmed that above techniques focusing on generating Lp norm constrained adversarial examples against object classification task can be beneficial for improving the transferbility of adversarial examples againtst face recognition models. Considering the distinctiveness of deep face model, Zhong and Deng [9] proposed feature iterative attack method(FIM) based on features extracted by the deep face model, and dropout face attacking network(DFANet) to increase the variability of surrogate models. However, the transferbility gained from the above methods are still unsatisfactory because it is hard for face adversarial examples to escape from local optima. To improve the transferbility of adversarial examples on FR, this paper explores the effectiveness of deep face model’s internal features. In summary, the contributions of our work are three-fold as follows: • We propose an internal feature iterative attack method(IFIM), which models internal features instead of the output features of target model to generate more transferable adversarial examples. Further, to standardize the optimization direction, we leverage the attention of the surrogate model to a pre-determined intermediate layer to seek features that different deep face models may share, avoiding overfitting on the surrogate model. • We propose a novel multi-layer attack strategy. Layers at different intermediate depths tend to grasp different features. So fusing multi-layer features together further increase black-box attack success rate. • Extensive experiments on four mainstream deep face models demonstrate the effectiveness of our method.


2 Related Works 2.1 Adversarial Attack Adversarial attack [10] is first found in object classification task. After adding perturbations to the original input, DCNNs will output wrong answer with high confidence. Large numbers of attack methods emerged, which provided a research basis for the security of DCNNs. According to whether or not the attacker has access to the internal structure and parameters of target model, adversarial attack is classified as white-box attacks and black-box attacks. The most classic one in white-box attacks is fast gradient sign method(FGSM) proposed by Goodfellow et al. [4], which constructs an adversarial example by performing gradient update along the direction of the gradient sign at each pixel in one step. Goodfellow also proposed a linear hypothesis that the reason for the existence of adversarial perturbation is the linear behavior of high-dimensional space of deep neural network, which also provided a reasonable explanation for the transferbility of adversarial examples. Many works have improved the FGSM attack performances. Kurakin et al. [11] extended single-step FGSM to iterative fast gradient sign method(IFGSM) that was strengthened by Dong et al. [12] and Lin et al. [5] in combination with Momentum and Nesterov Momentum, which is known as momentum iterative fast gradient sign method(MI-FGSM) and nesterov iterative fast gradient sign method(NIFGSM), respectively. Diverse inputs iterative fast gradient sign method (DI2 -FGSM) [6] is another transfer-based attack based on FGSM. DI2 -FGSM performs loss-preserving transformation for input images, including random resizing and padding. This diversified input improves the transferbility of adversarial examples. In recent years, new transfer-based attack methods emerge in endlessly. Not in the spatial domain, Long et al. [22] focused on the difference between surrogate model and target model and proposed spectrum saliency map, which depicts model characteristics in the frequency domain, improving the transferbility by reducing the difference between spectrum saliency maps. Li et al. [23]found that the noise with property of regional homogeneity is easier to transfer to other models, and the region division scheme designed in the paper also brings good transferbility. Sharma et al. [24] analyzed the composition of perturbation from the perspective of frequency domain, and pointed out the effectiveness of low-frequency part of perturbation in enhancing the transferbility. Several papers [20, 21] have proposed feature space attack methods. Instead of misleading the final classification layer, these methods attacks internal features of surrogate model. These methods will disrupt the feature space to create more transferable examples. However, most methods only work in the non-target attack task that only needs to mislead the model without specifying the specific class of the attack and ignore the effectiveness of feature attention. 2.2 Adversarial Attack on FR FR [13, 14] has revealed overwhelming verification performance, which is applied in many safety-critical applications ranging from phone unlocking to airport certification. With the development of adversarial attacks, however, not only do we need to understand


the operation process of face recognition terminal, but also understand the adversarial robustness of face recognition technology. Unlike object classification task, current mainstream face recognition models do not attach a specific label to the input face, but extract the feature and then judges the face label according to the feature similarity. Therefore, on the basis of I-FGSM, Zhong et al. [9] modified the label loss to feature loss and FIM is proposed. Further, in order to increase the diversity of surrogate models, Zhong et al. modified the intermediate feature output of surrogate model based on the dropout technique to achieve ensemble-like effects. Komkov and Petiushko [1] let people aware of the vulnerability of face recognition models in the physical domain. The method, dubbed AdvHat, only used a piece of paper to confuse the state-of-the-art public face recognition model ArcFace@ms1m-refine-v2 [13]. AdvHat draws on the idea of expectation over transformation(EOT) [25], takes the possible transformation of the patch placed on the face into account and embeds the possible transformation into the training process to enhance the adversarial patch’s robustness in the physical domain. EOT is widely used in physical attack, such as Pautov et al. [26]. Based on EOT, they studied the possible impact of different adversarial patch shapes and positions under the same attack scenario as AdvHat, including nose, forehead, wearable glasses, etc. Yin et al. [15] proposed an imperceptible and transferable attack method on face recognition, called adv-makeup. Specifically, it combines the idea of ensemble attack and meta-learning to narrow the gap between surrogate model and target model. Besides, adv-makeup leverage face makeup to enhance the imperceptibility of attacks, which in physical domain is realized by tattoo stickers. There are a lot of research based on the above two, including improving the adversarial patch robustness in the physical domain [16], reducing chromatic aberration of the sticker caused by the printers [17], and improving physical attack performance [18]. Nevertheless, there are few studies on the transferbility of face adversarial examples and much of them ignore the effectiveness of modelling internal features and feature attention of surrogate model in enhancing transferbility.

3 Methodology

In this section, after a brief description of adversarial attacks on deep face models, we propose the internal feature iterative attack method (IFIM) in Sect. 3.2. Afterwards, motivated by ensemble attack, we propose to combine the feature maps and feature attention of multiple intermediate layers of the deep face model to further enhance transferability in Sect. 3.3.

3.1 Attacks Against Deep Face Model

Assume a FR model F(x) : X \rightarrow R^d, where x denotes an input face image and R^d denotes the normalized feature representation vector. Let x^s and x^t be two face images. The FR model first calculates the similarity D_F between the two feature vectors, which can be expressed as:

D_F(x^s, x^t) = \| F(x^s) - F(x^t) \|_2^2   (1)


Fig. 1. Overall network architecture

We hope to generate an adversarial example x^{adv} by adding a perturbation to x, where \epsilon is the maximum perturbation magnitude. An adversary solves the following optimization problem to generate the adversarial example on the target face model:

\arg\min_{x^{adv}} L(x^{adv}, x^t), \quad \text{s.t.} \ \| x - x^{adv} \|_2 \le \epsilon   (2)

where L is the adversarial loss objective. We use L = -D_F for dodging attacks and L = D_F for impersonation attacks, the latter also being named FIM [9]. In this paper, we mainly focus on the impersonation attack on FR because impersonation attacks often have a smaller search scope [8]. Among existing works on improving the transferability of adversarial examples, some focus on the feature space [19, 20]. It is easy to extend feature space techniques to attacks on deep face models. Let l be an intermediate layer of the target model and F_l denote the intermediate feature map of the target model at the l-th layer. Then, the optimization objective is

L(x^{adv}, x^t) = \| F_l(x^{adv}) - F_l(x^t) \|_2^2   (3)

We denote this algorithm as the internal feature iterative attack method (IFIM).
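A minimal PyTorch sketch of the two impersonation objectives, the output-feature distance of Eqs. (1)-(2) and the internal-feature distance of Eq. (3), is given below; the toy surrogate network and the forward-hook mechanism for grabbing the intermediate feature map are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

def grab_feature(model, layer, x):
    """Run a forward pass and return the feature map produced by `layer`."""
    feats = []
    handle = layer.register_forward_hook(lambda m, i, o: feats.append(o))
    model(x)
    handle.remove()
    return feats[0]

def fim_loss(model, x_adv, x_t):
    """Output-feature impersonation loss, Eqs. (1)-(2): distance between embeddings."""
    return (model(x_adv) - model(x_t)).pow(2).sum()

def ifim_loss(model, layer, x_adv, x_t):
    """IFIM loss, Eq. (3): the same distance on an intermediate feature map."""
    return (grab_feature(model, layer, x_adv)
            - grab_feature(model, layer, x_t)).pow(2).sum()

# Toy surrogate standing in for a deep face model.
surrogate = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
                          nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 128))
x_adv = torch.rand(1, 3, 112, 112, requires_grad=True)
x_t = torch.rand(1, 3, 112, 112)
print(fim_loss(surrogate, x_adv, x_t).item(),
      ifim_loss(surrogate, surrogate[0], x_adv, x_t).item())
```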


3.2 Feature Space Attack with Feature Attention Loss

We observed that even when advanced techniques are combined, transfer-based attack performance is still unsatisfactory. Motivated by the fact that the feature maps extracted by models of different architectures for the same face input often have high similarity, we plan to introduce the feature attention of the surrogate model over its internal features to standardize the optimization direction, which helps to seek the key features that different deep face models may share. Therefore, it is vital to obtain the attention of the model on the intermediate-layer features by computing the derivative of the output with respect to the feature map:

\alpha_l(x) = \frac{\partial F(x)}{\partial F_l(x)}   (4)

Therefore, given a pair of face images (x^s, x^t), we can measure the difference in feature attention:

\| \alpha_l(x^s) - \alpha_l(x^t) \|_2^2   (5)

Combining the two objectives, Eq. (3) and Eq. (5), ensures that adversarial examples preserve the original white-box attack ability while undermining the key features that are likely to be adopted by diverse face models. The objective is given below:

L(x^{adv}, x^t) = \| F_l(x^{adv}) - F_l(x^t) \|_2^2 + \| \alpha_l(x^{adv}) - \alpha_l(x^t) \|_2^2   (6)
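One way to realize the feature attention of Eq. (4) in practice is through automatic differentiation. The sketch below takes the gradient of the summed output embedding with respect to an intermediate feature map; reducing the Jacobian in Eq. (4) to a single tensor via this summation is an assumption, and the toy surrogate is again only for illustration.

```python
import torch
import torch.nn as nn

def feature_attention(model, layer, x):
    """Eq. (4), approximately: gradient of the summed output embedding with
    respect to the feature map of `layer`; same shape as the feature map."""
    feats = []
    handle = layer.register_forward_hook(lambda m, i, o: feats.append(o))
    out = model(x)
    handle.remove()
    grad, = torch.autograd.grad(out.sum(), feats[0])
    return grad

def attention_difference(model, layer, x_a, x_b):
    """Eq. (5): squared L2 distance between the two attention maps."""
    return (feature_attention(model, layer, x_a)
            - feature_attention(model, layer, x_b)).pow(2).sum()

surrogate = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
                          nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 128))
x_s, x_t = torch.rand(1, 3, 112, 112), torch.rand(1, 3, 112, 112)
print(attention_difference(surrogate, surrogate[0], x_s, x_t).item())
```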

3.3 Ensemble Feature Space Attack with Feature Attention Loss

Motivated by ensemble attack, we propose to generate perturbations based on the feature maps and feature attention of multiple intermediate layers of the surrogate model. Considering the different sizes and channel numbers of the feature maps and feature attention at different layers, we first resize the feature maps and feature attention of the other layers to those of the l-th layer and then concatenate them; the fused results are substituted into Eq. (6) to calculate the final loss. Taking three layers as an example, the feature fusion can be expressed as:

F_l^{ens}(x^t) = concat(DS(F_{l-m}(x^t)), F_l(x^t), UP(F_{l+n}(x^t)))   (7)

\alpha_l^{ens}(x^t) = concat\left( DS\left(\frac{\partial F(x^t)}{\partial F_{l-m}(x^t)}\right), \frac{\partial F(x^t)}{\partial F_l(x^t)}, UP\left(\frac{\partial F(x^t)}{\partial F_{l+n}(x^t)}\right) \right)   (8)

where l-m and l+n denote the layers of the other two selected feature maps, and DS, UP denote resize operations by downsampling and upsampling, respectively. Specifically, we use nearest interpolation to adjust the outputs of the different feature layers. The final ensemble-on-features loss is

L(x^{adv}, x^t) = \| F_l^{ens}(x^{adv}) - F_l^{ens}(x^t) \|_2^2 + \| \alpha_l^{ens}(x^{adv}) - \alpha_l^{ens}(x^t) \|_2^2   (9)

In this paper, we use I-FGSM to solve the above optimization problem. Figure 1 provides a visual overview of the proposed method. Advanced techniques for improving transferability can be combined, such as MI-FGSM, DI2-FGSM, etc. Algorithm 1 summarizes our ensembled algorithm with MI-FGSM.
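A rough sketch of the layer fusion in Eqs. (7)-(8) and of a momentum (MI-FGSM-style) update step that could drive the loss in Eq. (9) follows; the chosen shapes, the gradient normalization, and the sign-descent step are generic transfer-attack conventions rather than the exact Algorithm 1.

```python
import torch
import torch.nn.functional as F

def fuse_feature_maps(f_shallow, f_ref, f_deep):
    """Eqs. (7)-(8): bring the neighbouring layers' maps to the reference
    layer's spatial size with nearest interpolation (DS for the larger
    shallow map, UP for the smaller deep map), then concatenate channels."""
    size = f_ref.shape[-2:]
    f_shallow = F.interpolate(f_shallow, size=size, mode="nearest")
    f_deep = F.interpolate(f_deep, size=size, mode="nearest")
    return torch.cat([f_shallow, f_ref, f_deep], dim=1)

def momentum_step(x_adv, grad, g, step=2 / 255, decay=1.0):
    """One MI-FGSM-style update that decreases the loss, since the
    impersonation objective in Eq. (9) is a distance to be minimized."""
    g = decay * g + grad / grad.abs().sum().clamp(min=1e-12)   # L1-normalized gradient
    x_adv = (x_adv - step * g.sign()).clamp(0, 1)
    return x_adv.detach(), g

# Shape check for the fusion: channels 4 + 8 + 16 are concatenated at 28x28.
print(fuse_feature_maps(torch.rand(1, 4, 56, 56),
                        torch.rand(1, 8, 28, 28),
                        torch.rand(1, 16, 14, 14)).shape)   # torch.Size([1, 28, 28, 28])
```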


4 Experiments

In this section, we present the experimental results to demonstrate the effectiveness of the proposed method.

4.1 Experiment Setup

Dataset. The evaluation datasets are LFW and CelebA, which are commonly used to verify the performance of FR models. For each dataset, we randomly select 1000 negative face-image pairs, meaning that the two images in a pair come from different identities; this setting follows [9, 10].

FR Models. To better validate the improvements brought by our method, we select four face recognition models, namely Facenet, IR152, IRSE50 and MobileFace (MF), the same as in [15, 18].

Baseline Attacks. We choose FIM [9], MI-FGSM [12], DI2-FGSM [6] and DFANet [9] as the baseline attack methods. To make the abbreviations unambiguous, we use the first character to denote the corresponding combination of methods, except that FMDA denotes the combination of FIM, MI-FGSM, DI2-FGSM and DFANet. Adding an 'I' before an attack method denotes the internal feature space attack, and adding an 'E' denotes the ensemble attack of Sect. 3.3.

Table 1. The success rates of black-box attacks on four FR models, focusing on the effectiveness of the feature attention loss term and the feature ensemble method. The first column denotes the surrogate model; the 3rd to the 10th columns denote the target models.

Model     Method        LFW                                 CelebA
                        Facenet  IR152   IRSE50  MF         Facenet  IR152   IRSE50  MF
Facenet   IFMD [6]      100.0    13.1    38.8    26.5       99.5     12.3    27.9    20.4
          IFMDA [9]     99.7     16.6    39.6    28.8       99.3     13.0    29.6    22.7
          IFMDA-FA      100.0    18.2    41.2    29.5       100.0    14.6    31.3    24.0
          EIFMDA-FA     100.0    19.7    43.5    31.6       100.0    16.1    32.6    26.5
IR152     IFMD [6]      22.3     100.0   90.2    46.1       19.2     100.0   80.1    40.1
          IFMDA [9]     23.7     100.0   91.1    47.5       21.5     100.0   81.7    42.6
          IFMDA-FA      24.2     100.0   93.1    49.6       22.8     100.0   83.8    44.6
          EIFMDA-FA     27.8     100.0   93.3    49.7       24.1     100.0   84.1    46.1
IRSE50    IFMD [6]      23.0     59.8    100.0   96.1       21.4     54.2    100.0   93.2
          IFMDA [9]     25.5     64.2    100.0   97.2       23.0     56.5    100.0   93.4
          IFMDA-FA      28.0     65.1    100.0   98.0       25.3     58.3    100.0   93.9
          EIFMDA-FA     29.1     73.1    100.0   98.7       25.8     63.0    100.0   94.6
MF        IFMD [6]      19.1     19.3    97.7    100.0      18.3     19.3    92.1    100.0
          IFMDA [9]     21.9     22.1    98.2    100.0      20.2     21.9    93.0    100.0
          IFMDA-FA      26.1     27.0    98.5    100.0      23.0     23.3    93.6    100.0
          EIFMDA-FA     27.1     27.7    98.7    100.0      23.2     25.0    93.8    100.0

Parameter Settings. The maximum perturbation \epsilon is set to 10. Following [9], the number of iterations N is set to 1000. The learning rate (step size) lr is set to 2/255. For MI-FGSM, we set the decay factor \mu = 1.0; for DI2-FGSM, the transformation probability is set to 1.0 to obtain the best performance; for DFANet, the drop rate is set to 0.1. For the feature space attack, we choose repeat_3 for Facenet, the last layer of the 47th block for IR152, the last layer of the 22nd block for IRSE50, and conv_6_dw for MF.

Metrics. The attack success rate (ASR) [21] is calculated to compare the performance of the different attack methods:

ASR = \frac{\sum_{i}^{N} \mathbb{1}\left[ sim(F(x_i^s), F(x_i^t)) \ge \delta \right]}{N} \times 100\%   (10)
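A small sketch of Eq. (10), assuming sim(·,·) is cosine similarity between the target model's embeddings; the threshold δ would be set at the 0.001 FAR operating point described next.

```python
import numpy as np

def attack_success_rate(adv_embeddings, target_embeddings, delta):
    """Eq. (10): percentage of pairs whose embedding similarity reaches delta."""
    a = adv_embeddings / np.linalg.norm(adv_embeddings, axis=1, keepdims=True)
    t = target_embeddings / np.linalg.norm(target_embeddings, axis=1, keepdims=True)
    sims = np.sum(a * t, axis=1)                 # cosine similarity per pair
    return 100.0 * float(np.mean(sims >= delta))

rng = np.random.default_rng(0)
print(attack_success_rate(rng.normal(size=(100, 512)),
                          rng.normal(size=(100, 512)), delta=0.24))
```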

Only when the similarity of two face images exceeds a certain threshold, the attack is considered successful, which will be set as the value at 0.001 FAR [15] for each FR model. 4.2 Experimental Results In this section, we compare our method with the existing state-of-the-art methods. Our method increases black-box attack success rate by a large margin while preserving the white-box attack ability. The results of different attack methods on the two datasets LFW and CelebA are shown in Table 1. It can be seen from the table that our method still maintains a 100% success rate for white-box attacks while the white-box attack performance of IFMD and IFMDA reduced slightly when taking Facenet as the surrogate model. Besides, although IFMDA outperforms IFMD in the black-box settings, our proposed IFMDA-FA in this paper beats the two benchmark methods in all cases by a significant margin. Specially, when MF was set as the surrogate model, IFMDA-FA increased IFMD’s attack success rate by an average of 21.21%. In addition, compared with single-layer attack as IFMDA-FA, the multi-layer attack strategy in this paper as EIFMDA-FA also improves the transferbility of face adversarial examples. Taking IRSE50 as a surrogate model, the attack performance has improved by an average of two percentage points, indicating that fusing multi-layer features can further boost the transferability of adversarial examples. 4.3 Experiments for Feature Space Attack In this section, we show the experimental results of feature space and output space on FR of black-box adversarial attack. We use the output features and internal features of the deep face model to generate adversarial examples to compare the attack performance. For the attack method of the output features, we choose FIM and FMD, then the feature space attack method is an improvement of the two methods, i.e. IFIM and IFMD. The experimental results on the LFW dataset and two FR models are shown in Table 2. The attack performance of feature space is generally better than that of output features under


the same constraints, especially when IR152 is selected as the surrogate model, which shows the effectiveness of the feature space attack in avoiding overfitting on the deep face models. In fact, there is still a certain similarity between the feature representations of different FR models, which also explains the improvement in the transferability of face adversarial examples.

Table 2. The success rates of black-box attacks, focusing on evaluating the effectiveness of the feature space attack.

Model     Method     Facenet  IR152   IRSE50  MF
Facenet   FIM [9]    100.0    3.6     9.2     3.5
          FMD [6]    100.0    10.9    25.3    11.8
          IFIM       100.0    6.7     19.3    10.1
          IFMD       100.0    13.1    38.8    26.5
IR152     FIM [9]    3.0      100.0   30.0    5.7
          FMD [6]    10.3     100.0   45.3    17.0
          IFIM       9.5      100.0   74.2    29.5
          IFMD       22.3     100.0   90.2    46.1

4.4 Ablation Study

For the method proposed in this paper, one factor that greatly affects the final result is the layer l chosen to obtain the feature map and feature attention. Therefore, we test different intermediate layers of the FR models to compare the attack performance. We use IFMDA-FA to generate adversarial examples on the 31st-49th layers of IR152's body block and on the 6th-14th layers of Facenet. The final results are shown in Fig. 2.

Fig. 2. The success rates under different layer choice. Layers from Facenet(a) and IR152(b) are selected to generate adversarial examples against four FR models. The experiment was conducted on the LFW dataset.


As can be seen from the figure, the choice of feature layer plays an important role in the attack success rate, and different model architectures tend to differ in which layer yields the best transferability; this layer is hard to determine and often requires extensive experiments. It is nevertheless easy to conclude that examples generated from intermediate layers are more transferable than those from shallow or deep layers. The results in Fig. 2(a) also show that features extracted by deep layers, which contain more abstract information (i.e. semantic concepts) due to their proximity to the output, bring stronger white-box attack ability. Moreover, shallow layers, which contain more raw pixel information and have smaller receptive fields, are not suitable for transferable attacks.

5 Conclusion

In this paper, we study the transferability of adversarial examples generated against deep face models. We shift the focus from the output features of the model to its internal features, which greatly improves transferability. In addition, motivated by the similarity between the feature maps of different models, we propose to use feature attention to standardize the optimization direction, avoiding overfitting on the surrogate model. Finally, the idea of ensemble attack prompts us to carry out a multi-layer feature attack, using feature fusion to further enhance transferability. Extensive experiments show the effectiveness of our method.

References 1. Komkov, S., Petiushko,A.: Advhat: real-world adversarial attack on arcface face id system. In: IEEE Conference on Pattern Recognition (ICPR), pp. 819–826 (2021) 2. Sharif, M., Bhagavatula, S., Bauer, L., Reiter, M.K.: Accessorize to a crime: Real and stealthy attacks on state-of-the-art face recognition. In: Proceedings of the 2016 ACM Sigsac Conference on Computer and Communications Security, pp. 1528–1540 (2016) 3. Yang, L., Song, Q., Wu, Y.: Attacks on state-of-the-art face recognition using attentional adversarial attack generative network. Multimedia Tools Appl. 80(1), 855–875 (2021) 4. Goodfellow, I.J., Shlens, J., Szegedy, C.: Explaining and harnessing adversarial examples. In: ICLR (2014) 5. Lin, J., Song, C., He, K., Wang, L., Hopcroft, J.E.: Nesterov accelerated gradient and scale invariance for adversarial attacks. In: ICLR (2020) 6. Xie, C., et al.: Improving transferability of adversarial examples with input diversity. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2730–2739 (2019) 7. Dong, Y., Pang, T., Su, H., Zhu, J.: Evading defenses to transferable adversarial examples by translation-invariant attacks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4312–4321 (2019) 8. Liu, Y., Chen, X., Liu, C., Song, D.: Delving into transferable adversarial examples and black-box attacks. In: ICLR (2017) 9. Zhong, Y., Deng, W.: Towards transferable adversarial attack against deep face recognition. IEEE Trans. Inf. Forensics Secur. 16, 1452–1466 (2020)


10. Xiao, Z., et al.: Improving transferability of adversarial patches on face recognition with generative models. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11845–11854 (2021) 11. Kurakin, A., Goodfellow, I.J., Bengio, S.: Adversarial examples in the physical world. In: ICLR Workshop, pp. 99–112 (2018) 12. Dong, Y., et al.: Boosting adversarial attacks with momentum. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9185–9193 (2018) 13. Deng, J., Guo, J., Yang, J., Xue, N., Kotsia, I., Zafeiriou, S.: Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4690–4699 (2019) 14. Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015) 15. Yin, B., et al.: Adv-makeup: a new imperceptible and transferable attack on face recognition. In: International Joint Conferences on Artificial Intelligence (2021) 16. Zheng, X., Fan, Y., Wu, B., Zhang, Y., Wang, J., Pan, S.: Robust physical-world attacks on face recognition. Pattern Recogn. 133, 109009 (2023) 17. Xu, K., et al.: Adversarial t-shirt! evading person detectors in a physical world. In: Proceeding of the European Conference on Computer Vision, vol. 5, pp. 665–681 (2020) 18. Hu, S., et al.: Protecting facial privacy: generating adversarial identity masks via style-robust makeup transfer. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15014–15023 (2022) 19. Inkawhich, N., et al.: Transferable perturbations of deep feature distributions. In: ICLR (2020) 20. Inkawhich, N., et al.: Feature space perturbations yield more transferable adversarial examples. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7066–7074 (2019) 21. Deb, D., Zhang, J., Jain, A.K.: Advfaces: adversarial face synthesis. In: IJCB, pp. 1–10 (2020) 22. Long, Y., Zhang, Q., Zeng, B., et al.: Frequency domain model augmentation for adversarial attack. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, 23–27 October 2022, Proceedings, Part IV, vol. 13664, pp. 549–566. Springer, Cham (2022). https://doi.org/10.1007/ 978-3-031-19772-7_32 23. Li, Y., Bai, S., Xie, C., Liao, Z., Shen, X., Yuille, A.: Regional homogeneity: towards learning transferable universal adversarial perturbations against defenses. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12356, pp. 795–813. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58621-8_46 24. Sharma, Y., Ding, G.W., Brubaker, M.: On the effectiveness of low frequency perturbations. arXiv preprint arXiv:1903.00073 (2019) 25. Athalye, A., Engstrom, L., Ilyas, A., et al.: Synthesizing robust adversarial examples. In: International Conference on Machine Learning, pp. 284–293. PMLR (2018) 26. Pautov, M., Melnikov, G., Kaziakhmedov, E., et al.: On adversarial patches: real-world attack on arcface-100 face recognition system. In: 2019 International Multi-Conference on Engineering, Computer and Information Sciences (SIBIRCON), pp. 0391–0396. IEEE (2019)

Dendritic Neural Regression Model Trained by Chicken Swarm Optimization Algorithm for Bank Customer Churn Prediction

Qi Wang1, Haiyan Zhang1, Junkai Ji2, Cheng Tang3, and Yajiao Tang1(B)

1 College of Economics, Central South University of Forestry and Technology, Changsha 410004, China
[email protected]
2 National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen 518060, China
3 Faculty of Information Science and Electrical Engineering, Kyushu University, Fukuoka 819-0395, Japan

Abstract. Banks are constantly facing the problem of customer churn. Customer churn not only leads to a decline in bank funds and profits but also reduces a bank's credit capacity and affects its operational management. As an important component of Customer Relationship Management, predicting customer churn has become increasingly urgent. Inspired by biological neurons, we build a dendritic neural regression model (DNRM) with four layers, namely the synaptic layer, the dendritic layer, the membrane layer, and the soma layer, for bank customer churn prediction. To pursue better prediction performance, the Chicken Swarm Optimization (CSO) algorithm, which balances exploration and exploitation, is adopted to train the DNRM and improve its accuracy. In this paper, we propose this novel dendritic neural regression model, called DNRM-CSO, for churn prediction and evaluate it on a benchmark dataset from Kaggle. Compared with other algorithms and models, our proposed model obtains the highest accuracy (92.27%) and the fastest convergence speed in customer churn prediction. Owing to the bionic training algorithm and the pruning function of the model, the proposed model has clear advantages in accuracy and computational speed in customer churn prediction and can be widely applied in commercial bank customer relationship management.

Keywords: Dendritic Neural Regression Model · Chicken Swarm Optimization · Customer churn prediction

1 Introduction

Currently, banks are constantly facing the challenge of retaining customers because customers can freely choose to purchase or abandon the goods and services that companies provide. Modern banks rely on selling products and services to customers,
which makes customer retention crucial for the healthy development of banks [1, 2]. As an important component of China's economic system, the banking industry plays a crucial role as a financial intermediary in economic development. Recently, as the domestic economy has emerged from the pandemic and gradually recovered, the market environment in the banking industry has improved compared with the pandemic period. However, customer churn in banks remains an important issue affecting their credit business and profitability. Due to the continuous loss of customers, financial institutions not only find it difficult to conduct in-depth analysis of rapidly changing customer groups but also have almost no time or budget to interact with a large number of specific customers. Moreover, the non-sharing of customer information between banks makes it difficult for small and medium-sized banks to reduce the cost of retaining existing users. In recent years, attracting a new customer has come to cost more than five times as much as retaining an existing one. Severe customer churn not only leads to a decline in bank funds and profits but also reduces a bank's credit lending capacity and affects its operational management. With increasingly fierce market competition, more and more financial institutions regard customer management as their most important core capability. To some extent, the competition for customer resources also represents the competition for market share. CRM (Customer Relationship Management) is an integrated approach to customer acquisition and retention that uses business intelligence to increase customer value [3]. As an important component of CRM, predicting customer churn has become increasingly urgent.

Customer churn is a term that means customers stop purchasing the goods or services provided by companies for some reason, which has become a significant problem for modern banks [4, 5]. Accurate customer churn prediction helps banks determine whether customers will continue their relationship with the bank and convince valuable customers to stay [6]. Recently, more and more scholars have begun to explore churn prediction models with different algorithms, and artificial intelligence is gradually being adopted for customer churn prediction. De Bock et al. proposed an ensemble classifier based on the Bagging and Random Subspace Method (RSM) and implemented generalized additive models (GAMs) as base classifiers. They compared GAMensPlus with three ensemble classifiers (Bagging, RSM, and Random Forests) and two individual classifiers (logistic regression and GAM) and found that GAMensPlus obtains strong classification performance in accuracy and AUC [7]. To identify churn customers and select features, Ullah, Irfan, et al. applied clustering techniques in a churn prediction model and showed that a model trained by random forest performs well, with an accuracy of 88.63% [8]. Inspired by RNNs and CNNs, the bidirectional long short-term memory convolutional neural network (BiLSTM-CNN) model was proposed to improve the prediction accuracy of the customer churn rate; the results show that the BiLSTM-CNN model can further improve the prediction performance [9]. Besides, SVM has been applied to customer churn prediction, with researchers using balancing techniques such as under-sampling, over-sampling, and SMOTE to balance the data [1, 10]. Tsai et al. used two hybrid models, ANN combined with ANN and SOM combined with ANN, for churn prediction.
The experimental results show that the two hybrid models outperform the single neural network model in prediction accuracy [11]. Lalwani, Praveen, et al. proposed a methodology consisting of six phases, namely data pre-processing, feature analysis, feature selection, the prediction process, K-fold cross-validation, and the evaluation process. The results show that the Adaboost and XGBoost classifiers give the highest accuracies of 81.71% and 80.8% respectively [12]. Other algorithms like random forests [2, 4], decision trees [13], MLP, and the Evolutionary Data Mining Algorithm [14] have also been used to train churn prediction models. These viewpoints and research findings are enlightening for our research, helping us improve and build a better model to address customer churn issues. However, most of the above models are complex and difficult to implement on hardware. Therefore, we propose a relatively simple yet high-precision model to help banks retain customers while reducing costs and improving their profitability.

In financial prediction and classification problems, Wang, Hong, et al. [15] point out that the accuracy obtained by a single best neural network model is sometimes better than that of multiple neural network models. Luo, Xudong, et al. [16] propose a decision tree (DT)-based method for initializing a dendritic neuron model (DNM) and find that the DT-initialized DNM is significantly superior to traditional classification methods as well as the original DNM. Tang, Yajiao, et al. [17] propose an evolutionary pruning neural network (EPNN) model trained by the adaptive differential evolution algorithm (JADE) for financial bankruptcy prediction. The results show that the EPNN outperforms the MLP and the pruning neural network model in terms of accuracy and convergence speed. Therefore, we apply a single neural network model to the problem of customer churn prediction in this paper. To improve the accuracy, robustness, and running speed of the model, we propose a dendritic neural regression model trained by the chicken swarm optimization algorithm (DNRM-CSO) for bank customer churn prediction. We then compare the following algorithms with our model: Back Propagation (BP), Multi-Layer Perceptron (MLP), Brain Storming Optimization (BSO), Support Vector Machine (SVM), and Differential Evolution (DE). Ultimately, accuracy and convergence rate are used as performance metrics to compare the outcomes of the different algorithms. After analyzing the results, we find that the proposed model (DNRM-CSO) obtains the best performance in accuracy and convergence speed on the benchmark dataset. Our main contributions are clarified as follows:

(1) DNRM-CSO can achieve over 90% accuracy in predicting bank customer churn, which outperforms many current related studies.
(2) Compared to many complex hybrid models, DNRM-CSO has the advantage of being simple, easy to implement, and computationally fast.
(3) DNRM-CSO can be widely applied to financial prediction and classification problems in different financial institutions. In addition to large banks, small and medium-sized banks can adopt our model because it can be easily implemented on hardware without reducing its accuracy, which better saves customer management costs for small and medium-sized banks.

The rest of the paper is organized as follows. In Sect. 2, we describe the dendritic neural regression model in detail. In Sect. 3, we explain the methodological underpinnings of CSO and other algorithms. The experimental results are presented in Sect. 4. Finally, some concluding remarks are given in Sect. 5.


2 Proposed Model

In artificial neural network models, the dendrites of neurons are usually modeled as simple linear structures: the dendrites weigh the input information and transmit it to the output of the neuron. However, many studies have shown that the function of the dendritic structure of biological neurons is far more complex than a simple linear adder. The dendritic structures in biological neurons contain strong local nonlinearity, and synaptic inputs can have nonlinear effects on their neighboring synapses. This feature greatly enhances the role of local nonlinear components in the output of neurons, giving neural networks stronger information processing capabilities. Inspired by biological neurons, a dendritic neural regression model is built in this paper with four layers, namely the synaptic layer, the dendritic layer, the membrane layer, and the soma layer. The morphological structure is illustrated in Fig. 1. In the forward process of the model, the synaptic layer first passes the original signals to every branch; on each branch the synaptic outputs are multiplied and the result is compared with the branch threshold. Thanks to the multiplication operation, many useless dendrites can be removed, which improves the calculation speed of the model. The outputs of all dendritic branches are then aggregated into the membrane layer by a sum operation. The soma layer is regarded as the output layer, so the result from the membrane layer is passed to the soma layer through a sigmoid function.

Fig. 1. The architecture of our proposed model.

The synaptic layer is the bridge between other neurons and dendrites, which plays a role in receiving and conducting signals. The function of the synaptic layer is as follows:

\[
Y_{ij} = \frac{1}{1 + e^{-k (w_{ij} x_i - q_{ij})}} , \qquad (1)
\]

where j and xi denote the jth synaptic layer (j = 1, 2, 3, ..., J) and the ith input signal (i = 1, 2, 3, ..., I) respectively, k is a positive constant, and wij and qij are parameters that need to be trained. θij is used to describe the threshold of a synaptic layer, which is calculated by the following formula:

\[
\theta_{ij} = \frac{q_{ij}}{w_{ij}} . \qquad (2)
\]


Neural pulses are realised through the transport of specific ions. When ions are transported to a receptor, the potential of the receptor changes and determines the excitatory or inhibitory characteristics of the synapse. Accordingly, the threshold θij determines four types of synaptic connections, which are presented in Fig. 2. For a direct connection, 0 < qij < wij: for example, qij = 0.5 and wij = 1.0. In this case, regardless of the input value xi, the output Yij always approximates the input, namely Yij ∈ (0, 1). For an inverse connection, wij < qij < 0: for example, qij = −0.5 and wij = −1.0. Briefly, no matter the value of xi in [0, 1], Yij is an inverted version of the signal triggered by the input.

Fig. 2. Four connections of the synaptic layer.

There are two states in the 1 connection: qij < 0 < wij, for example, qij = −0.5 and wij = 1.0; and qij < wij < 0, e.g., qij = −1.5 and wij = −1.0. In this case, regardless of the input value of xi, the output value Yij always approaches 1. There also exist two states in the 0 connection: 0 < wij < qij, e.g., qij = 1.5 and wij = 1.0; and wij < 0 < qij, e.g., qij = 0.5 and wij = −1.0. In this case, regardless of the input value of xi, the output Yij always approaches 0. The four types of connections can improve computational speed and accuracy by removing unnecessary synapses and dendrites. For instance, due to the multiplication operations, when the calculation result is 0, the entire dendrite will be removed; when the result is 1, the single synapse can also be ignored.

The dendritic layer uses multiplication operations to process the results received from the synaptic layer. A dendrite layer stands for the typical nonlinear interaction of synaptic signals on each branch of dendrites. The whole process can be described as:

\[
Z_j = \prod_{i=1}^{I} Y_{ij} . \qquad (3)
\]


where Yij represents the output value generated by the jth synaptic layer after receiving the ith input signal, and Zj denotes the jth dendrite layer.

The membrane layer connects the dendritic layer and the soma layer. We use a sum operation to collect the signals processed by the dendritic layer. The corresponding equation is as follows:

\[
V = \sum_{j=1}^{J} Z_j . \qquad (4)
\]

Last, the output from the membrane layer will be transferred to the soma layer. Then the obtained signals will be displayed through a sigmoid function:

\[
O = \frac{1}{1 + e^{-k_{soma} (V - \theta_{soma})}} . \qquad (5)
\]
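To make the forward computation of Eqs. (1)–(5) concrete, the following is a minimal NumPy sketch of a DNRM forward pass. The function name, the parameter initialisation, and the constants are illustrative assumptions, not the authors' released implementation.

```python
import numpy as np

def dnrm_forward(x, w, q, k=5.0, k_soma=5.0, theta_soma=0.5):
    """x: (I,) inputs in [0, 1]; w, q: (I, J) synaptic parameters; returns the soma output."""
    # Synaptic layer, Eq. (1)
    Y = 1.0 / (1.0 + np.exp(-k * (w * x[:, None] - q)))
    # Dendritic layer, Eq. (3): multiplicative interaction on each branch
    Z = np.prod(Y, axis=0)
    # Membrane layer, Eq. (4): sum over the J branches
    V = np.sum(Z)
    # Soma layer, Eq. (5): sigmoid output
    return 1.0 / (1.0 + np.exp(-k_soma * (V - theta_soma)))

# Example: 3 input attributes, 4 dendritic branches
rng = np.random.default_rng(0)
x = rng.random(3)
w, q = rng.normal(size=(3, 4)), rng.normal(size=(3, 4))
print(dnrm_forward(x, w, q))
```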

3 Learning Algorithms

To improve the accuracy of the model, the chicken swarm optimization (CSO) algorithm is used in our DNRM to train the parameters, namely the weights and thresholds of the synaptic layer. CSO is an optimization algorithm with high operating efficiency and fast convergence speed [18]. Chicken swarm optimization is a novel bionic swarm intelligence algorithm, which achieves a global optimization effect by simulating the foraging behavior of a chicken swarm with a hierarchical order [19]. In the CSO algorithm, the whole chicken swarm is divided into several groups based on the fitness values. The hens follow the roosters for food, while the chicks forage around the hens. The foraging ability, from strong to weak, is in the order of roosters, hens, and chicks. In this hierarchical order, different groups cooperate and continuously update their positions based on their respective movement patterns until the best foraging position is found. This foraging strategy enhances both the local and the global optimization ability of the algorithm.

In a problem of dimension D, we assume N is the population size of the chicken swarm. RN, HN, CN, and MN indicate the numbers of roosters, hens, chicks, and mother hens, respectively. On the jth (j = 1, 2, ..., D) dimension of the search space, the position of the ith (i = 1, 2, ..., N) chicken at the tth iteration is denoted x_{i,j}^t. The roosters with better fitness values have priority for food access, so their positions are updated by the following formulas:

\[
x_{i,j}^{t+1} = x_{i,j}^{t} \times \left(1 + Randn(0, \sigma^2)\right), \qquad (6)
\]

\[
\sigma^2 =
\begin{cases}
1, & \text{if } f_i \le f_k, \\
\exp\!\left(\dfrac{f_k - f_i}{|f_i| + \varepsilon}\right), & \text{otherwise},
\end{cases}
\qquad k \in [1, RN],\; k \ne i, \qquad (7)
\]

where Randn(0, σ²) is a Gaussian distribution with mean 0 and variance σ², ε is a positive constant, and k (k = 1, 2, ..., RN, k ≠ i) is randomly selected from the rooster group. f stands for the fitness value.


The hens can follow their group-mate roosters to search for food. Therefore, their position update can be calculated as:

\[
x_{i,j}^{t+1} = x_{i,j}^{t} + S_1 \times Rand \times \left(x_{r1,j}^{t} - x_{i,j}^{t}\right) + S_2 \times Rand \times \left(x_{r2,j}^{t} - x_{i,j}^{t}\right), \qquad (8)
\]

\[
S_1 = \exp\!\left(\frac{f_i - f_{r1}}{|f_i| + \varepsilon}\right), \qquad (9)
\]

\[
S_2 = \exp\!\left(f_{r2} - f_i\right), \qquad (10)
\]

where Rand is a uniform random number in [0, 1], and r1 and r2 (r1, r2 = 1, 2, ..., N, r1 ≠ r2) denote the random rooster and the randomly selected chicken (rooster or hen), respectively.

The chicks move around their mother to forage for food, and their corresponding position update equation is as follows:

\[
x_{i,j}^{t+1} = x_{i,j}^{t} + FL \times \left(x_{m,j}^{t} - x_{i,j}^{t}\right), \qquad (11)
\]

where x_{m,j}^t (m ∈ [1, N]) is the position of the ith chick's mother, and FL (FL ∈ [0, 2]) is a parameter that means the chick follows its mother to find food. The CSO algorithm is constructed mainly by simulating the social behavior of chickens. It can achieve faster convergence and higher accuracy in our proposed model with fewer parameters. The initial parameter settings of the DNRM-CSO model are shown in Table 1. The procedure of the CSO algorithm is presented in Fig. 3, and an illustrative sketch of the update rules is given below.
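The three position-update rules of Eqs. (6)–(11) can be sketched as one CSO iteration as follows. The grouping of the swarm into roosters, hens, and chicks, the mate and mother assignments, and the default FL value are assumptions made for illustration only.

```python
import numpy as np

def cso_step(pos, fit, roosters, hens, chicks, mates, mothers, FL=0.5, eps=1e-10, rng=None):
    """One CSO iteration. pos: (N, D) positions; fit: (N,) fitness values (lower is better);
    roosters/hens/chicks: index arrays; mates[h]: rooster followed by hen h;
    mothers[c]: mother hen of chick c."""
    rng = rng or np.random.default_rng()
    new = pos.copy()
    for i in roosters:                                           # Eqs. (6)-(7)
        k = rng.choice([r for r in roosters if r != i])
        sigma2 = 1.0 if fit[i] <= fit[k] else np.exp((fit[k] - fit[i]) / (abs(fit[i]) + eps))
        new[i] = pos[i] * (1.0 + rng.normal(0.0, np.sqrt(sigma2), size=pos.shape[1]))
    for i in hens:                                               # Eqs. (8)-(10)
        r1 = mates[int(i)]
        r2 = rng.choice([j for j in np.concatenate([roosters, hens]) if j not in (i, r1)])
        s1 = np.exp((fit[i] - fit[r1]) / (abs(fit[i]) + eps))
        s2 = np.exp(fit[r2] - fit[i])
        new[i] = pos[i] + s1 * rng.random() * (pos[r1] - pos[i]) + s2 * rng.random() * (pos[r2] - pos[i])
    for i in chicks:                                             # Eq. (11)
        new[i] = pos[i] + FL * (pos[mothers[int(i)]] - pos[i])
    return new
```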

Table 1. The initial parameter settings of our model

Parameter Names      Values
Constant k           5
Data Divide Rate     0.75
Running Times        30
Learning Rate        0.1
Population Size      50
Maximum Iteration    1000

In this paper, we compare other algorithms and models with our proposed model DNRM-CSO. In addition to the CSO algorithm, we also apply algorithms such as BSO, DE, and BP to the DNRM to demonstrate that our model can achieve higher accuracy and faster convergence and computational speed than other evolutionary algorithms and traditional models. The BSO algorithm updates solutions based on the original solutions of the population and then finds the optimal solution in the target space. Its main operational processes include initialization, clustering, disrupting class centers, and generating new solutions. Moreover, we also adopt the differential evolution (DE) algorithm to train the parameters. The DE algorithm is a population-based evolutionary algorithm that simulates the process of individual cooperation and competition within a population. It has the advantages of simple principles and few parameters that need to be controlled. Its basic process includes initialization, mutation, crossover, and selection; a minimal sketch of one DE generation is given below. In addition to the algorithms mentioned above, we also compare the accuracy of MLP and SVM with that of our proposed model in the field of churn prediction. MLP is a popular artificial neural network model for prediction, which uses a gradient descent method to adjust the related parameters. SVM uses a nonlinear mapping to transform the original data into a higher-dimensional space. Because the optimization problem of the SVM is convex, the global minimum can be reached. It can adopt various kernel functions, such as linear, polynomial, and RBF [20]. We use "SVMlinear", "SVMrbf", "SVMpolynomial", and "SVMsigmoid" to denote the four kernel functions used in SVM.
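As a point of reference for the DE baseline just described, a single DE/rand/1/bin generation can be sketched as follows; the control parameters F and CR and the fitness convention (lower is better) are illustrative assumptions.

```python
import numpy as np

def de_step(pop, fitness_fn, F=0.5, CR=0.9, rng=None):
    """One DE/rand/1/bin generation over a population pop of shape (N, D)."""
    rng = rng or np.random.default_rng()
    N, D = pop.shape
    fit = np.array([fitness_fn(ind) for ind in pop])
    new_pop = pop.copy()
    for i in range(N):
        a, b, c = rng.choice([j for j in range(N) if j != i], size=3, replace=False)
        mutant = pop[a] + F * (pop[b] - pop[c])                  # mutation
        cross = rng.random(D) < CR
        cross[rng.integers(D)] = True                            # keep at least one mutated gene
        trial = np.where(cross, mutant, pop[i])                  # binomial crossover
        if fitness_fn(trial) <= fit[i]:                          # greedy selection
            new_pop[i] = trial
    return new_pop
```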

Algorithm 1. Pseudo code of the DNRM-CSO
1: Initialize the DNRM and the related parameters,
2: Initialize the population of the chicken swarm, set optimal parameters,
3: Calculate and rank the chickens' fitness values, re-establish the hierarchy order of the chicken swarm,
4: repeat
5:   Update the positions of roosters by Eq. (6),
6:   Update the positions of hens by Eq. (8),
7:   Update the positions of chicks by Eq. (11),
8:   Recalculate the fitness value, update the best for each chicken,
9: until The terminal criterion is satisfied, output the optimal results,
10: The optimal results are sent to the DNRM for training and verification of test data.
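Putting the pieces together, the following sketch mirrors Algorithm 1: each chicken encodes a flattened (w, q) parameter vector of the DNRM, its fitness is the training MSE, and the swarm hierarchy is re-established at every iteration. It reuses the dnrm_forward and cso_step sketches above; the group split, the parameter encoding, and the reduced iteration count are assumptions rather than the paper's exact settings.

```python
import numpy as np

def dnrm_mse(theta, X, y, J):
    """Fitness of one chicken: training MSE of a DNRM whose (w, q) are packed in theta."""
    I = X.shape[1]
    w, q = theta[:I * J].reshape(I, J), theta[I * J:].reshape(I, J)
    preds = np.array([dnrm_forward(x, w, q) for x in X])
    return np.mean((y - preds) ** 2)

def train_dnrm_cso(X, y, J=4, pop_size=50, iters=200, seed=0):
    rng = np.random.default_rng(seed)
    I = X.shape[1]
    pop = rng.normal(size=(pop_size, 2 * I * J))                  # each chicken encodes (w, q)
    for _ in range(iters):
        fit = np.array([dnrm_mse(p, X, y, J) for p in pop])
        order = np.argsort(fit)                                   # re-establish the hierarchy
        roosters, hens, chicks = order[:10], order[10:35], order[35:]
        mates = {int(h): int(rng.choice(roosters)) for h in hens}
        mothers = {int(c): int(rng.choice(hens)) for c in chicks}
        pop = cso_step(pop, fit, roosters, hens, chicks, mates, mothers, rng=rng)
    return pop[np.argmin([dnrm_mse(p, X, y, J) for p in pop])]
```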

4 Application to Bank Customer Churn Prediction

For the data used in this paper, we adopt a benchmark dataset from Kaggle (https://www.kaggle.com/code/psycon/bank-customer-eda-churn/input) to evaluate the performance of our model. The original dataset is composed of more than 10,000 instances described by 23 attributes. These characteristic values include the customer's age, gender, education level, income, marital status, credit limit, etc. We transform the categorical attributes into numerical ones because some learning algorithms require that each sample be expressed as a real-number vector. For example, the "Gender" attribute has two labels, "M" and "F", which we change into "1" and "0" respectively. The first column of the data is the client number, which is irrelevant to this study, so we remove this attribute; the same applies to the last two columns of the data. In this paper, the state of customer churn is expressed through a binary variable, where "1" represents an "Existing Customer" and "0" denotes an "Attrited Customer". For the missing values in the dataset, we replace them with the average value of the corresponding attribute. Data imbalance is a common phenomenon in bank customer churn prediction, and the large number of instances and attributes also places a heavy burden on the computer. There are 8500 existing customers and 1627 lost customers in this dataset, which constitutes a clear imbalance.


To address this issue and simplify the running process, we randomly select an equal amount of existing-customer data and lost-customer data before each program run, which accounts for approximately 20% of the total data. We normalize the attributes by the following formula:

\[
x_{normalized} = \frac{x - x_{min}}{x_{max} - x_{min}} . \qquad (12)
\]
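A minimal pandas sketch of the preprocessing, balancing, and min-max normalisation steps described above is given next. The file name and column names (e.g., "CLIENTNUM", "Attrition_Flag") are assumptions about the Kaggle dataset schema rather than details taken from the paper.

```python
import pandas as pd

# Load and clean the churn data (file and column names are assumed)
df = pd.read_csv("BankChurners.csv")
df = df.drop(columns=["CLIENTNUM"]).iloc[:, :-2]                 # drop client number and last two columns
df["Gender"] = df["Gender"].map({"M": 1, "F": 0})                # categorical -> numerical
df["label"] = (df["Attrition_Flag"] == "Existing Customer").astype(int)   # 1 = existing, 0 = attrited
df = df.fillna(df.mean(numeric_only=True))                       # replace missing values with attribute means

# Balance the classes by sampling an equal number of existing and attrited customers
n = int(df["label"].value_counts().min())
balanced = pd.concat([g.sample(n, random_state=0) for _, g in df.groupby("label")])

# Min-max normalisation of the numeric attributes, Eq. (12)
num = balanced.drop(columns=["label"]).select_dtypes("number")
normalised = (num - num.min()) / (num.max() - num.min())
```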

We adopt the classification accuracy rate, sensitivity, and specificity as our performance metrics. Their equations are given below:

\[
AccuracyRate = \frac{TN + TP}{TN + TP + FN + FP}\,\% , \qquad (13)
\]

\[
Sensitivity = \frac{TP}{TP + FN}\,\% , \qquad (14)
\]

\[
Specificity = \frac{TN}{TN + FP}\,\% , \qquad (15)
\]

where TN, TP, FN, and FP represent true negatives, true positives, false negatives, and false positives, respectively. Sensitivity and specificity measure the proportion of true positives among all positives and the proportion of true negatives among all negatives, respectively. Moreover, the mean squared error is used to compare the convergence of different algorithms and models, and is measured by the following equation:

\[
MSE = \frac{1}{R} \sum_{a=1}^{R} \frac{1}{N} \sum_{b=1}^{N} \left(E_{ab} - O_{ab}\right)^{2} , \qquad (16)
\]

where E_ab and O_ab stand for the predicted output and the actual output respectively, and R and N are the number of runs and the number of instances used for training.

In this section, we compare DNRM-CSO with other algorithms and models, including BSO, DE, four types of SVM, and BP, after processing the original data. For a fair comparison, all the models are equipped with the same parameters and learning rates. As a nonparametric test, the Wilcoxon rank-sum test is adopted in this paper to detect significant differences between these algorithms and models. This test allows us to claim that the observed result differences are statistically significant and not simply caused by random splitting effects. Table 2 shows the churn prediction performance of DNRM-CSO and the other algorithms.

As shown in Table 2, our proposed DNRM-CSO model obtains a testing accuracy of 92.27%, which is higher than that of the other algorithms and models. Ranked second, SVMrbf obtains an accuracy of 90.34%, which is higher than the 87.77% obtained by MLP and the 83.99% obtained by the BSO algorithm. Compared with BSO, DE, MLP, SVM, and BP, our proposed model DNRM-CSO also achieves better convergence speed, and the corresponding results are presented in Fig. 4. As observed, DNRM-CSO converges very quickly and nearly achieves its best convergence performance by the 70th iteration; it is also evident that CSO obtains the highest convergence rate.
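For completeness, the metrics of Eqs. (13)–(16) that are reported in Table 2 can be computed from predictions as in the following sketch; the label convention (1 = existing customer, 0 = attrited customer) follows the data description above.

```python
import numpy as np

def churn_metrics(y_true, y_pred):
    """Accuracy rate, sensitivity, and specificity as in Eqs. (13)-(15)."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    tp = np.sum((y_true == 1) & (y_pred == 1))
    tn = np.sum((y_true == 0) & (y_pred == 0))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    accuracy_rate = 100.0 * (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return accuracy_rate, sensitivity, specificity

def mse_over_runs(expected, observed):
    """Eq. (16): squared error averaged over N training instances and R runs."""
    expected, observed = np.asarray(expected), np.asarray(observed)   # shape (R, N)
    return float(np.mean((expected - observed) ** 2))
```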


Table 2. Experimental results of different algorithms and models

Algorithms/Models   Prediction Accuracy Rate (%)   Sensitivity   Specificity   Wilcoxon rank-sum test (p-value)
DNRM-CSO            92.27                          0.8088        0.9718        N/A
BSO                 83.99                          0.7647        0.8310        6.64e−06
DE                  75.54                          0             0.9965        1.73e−06
MLP                 87.77                          0.8676        0.8838        0.5846
SVMlinear           85.54                          0.8787        0.8768        0.0012
SVMrbf              90.34                          0.9007        0.8768        5.28e−05
SVMpolynomial       85.54                          0.6324        0.8944        1.73e−06
SVMsigmoid          85.71                          0.8824        0.8486        0.0519
BP                  51.08                          0             1             1.72e−06

Fig. 3. The procedure of the CSO algorithm.


Fig. 4. The convergence curve of the benchmark data using different algorithms.

5 Conclusion

Recently, banks have faced serious customer churn issues. Severe customer churn not only leads to a decline in bank funds and profits but also reduces a bank's credit lending capacity and affects its business model and operational management. To solve the above problems, improve efficiency, and reduce the cost of bank customer relationship management, we introduce in this paper a dendritic neural regression model trained by the chicken swarm optimization algorithm, namely DNRM-CSO. Chicken swarm optimization is an algorithm with high operating efficiency and fast convergence speed. Compared with MLP, BP, and other algorithms such as BSO, DE, and SVM, our proposed model DNRM-CSO performs best in prediction accuracy and convergence speed, which provides an effective prediction approach for bank customer churn. Besides, owing to the bionic training algorithm and the pruning function of the model, DNRM-CSO has clear advantages in accuracy and computational speed and can be widely applied in commercial bank customer relationship management. Our model also has the following advantages. First, DNRM-CSO can achieve over 90% accuracy in predicting bank customer churn, which outperforms many current related studies. Second, our model is simple, easy to implement, and computationally fast. Last, DNRM-CSO settles the churn prediction issue efficiently and is applicable to different financial institutions. More algorithms and data clustering methods will be analyzed in our future research.

Acknowledgements. This research was supported by the Social Science Fund Project of Hunan Province, China (Grant No. 20YBA260), the Natural Science Fund Project of Changsha, China (Grant No. kq2202297), and the Key Project of Hunan Provincial Department of Education: Research on Credit Risk Management of Supply Chain Finance Based on Adaptive Dendritic Neural Network Model (Project No. 22A0178).

References 1. Zhao, X., et al.: Customer churn prediction based on feature clustering and nonparallel support vector machine. Int. J. Inf. Technol. Decis. Mak. 13(05): 1013–1027 (2014) 2. de Lima Lemos, R.A., Silva, T.C., Tabak, B.M.: Propension to customer churn in a financial institution: a machine learning approach. Neural Comput. Appl. 34(14): 11751–11768 (2022) 3. Alizadeh, M., et al.: Development of a customer churn model for banking industry based on hard and soft data fusion. IEEE Access 11, 29759–29768 (2023) 4. Xie, Y., et al.: Customer churn prediction using improved balanced random forests. Exp. Syst. Appl. 36(3), 5445–5449 (2009) 5. Co¸ser, A., et al.: Propensity to churn in banking: what makes customers close the relationship with a bank? Econ. Comput. Econ. Cybernet. Stud. Res. 54(2) (2020) 6. Ali, Ö.G., Arıtürk, U.: Dynamic churn prediction framework with more effective use of rare event data: the case of private banking. Exp. Syst. Appl. 41(17), 7889–7903 (2014) 7. De Bock, K.W., Van den Poel, Dirk.: Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models. Exp. Syst. Appl. 39(8), 6816–6826 (2012) 8. Ullah, I., et al.: A churn prediction model using random forest: analysis of machine learning techniques for churn prediction and factor identification in telecom sector. IEEE Access 7, 60134–60149 (2019) 9. Liu, Y., et al.: Intelligent prediction of customer churn with a fused attentional deep learning model. Mathematics 10(24), 4733 (2022) 10. Farquad, M.A.H., Ravi, V., Bapi Raju, S.: Churn prediction using comprehensible support vector machine: an analytical CRM application. Appl. Soft Comput. 19, 31–40 (2014) 11. Tsai, C.-F., Lu, Y.-H.: Customer churn prediction by hybrid neural networks. Exp. Syst. Appl. 36(10), 12547–12553 (2009) 12. Lalwani, P., et al.: Customer churn prediction system: a machine learning approach. Computing 1–24 (2022) 13. Höppner, S., et al.: Profit driven decision trees for churn prediction. Eur. J. Oper. Res. 284(3), 920–933 (2020) 14. Huang, B., Kechadi, M.T., Buckley, B.: Customer churn prediction in telecommunications. Exp. Syst. Appl. 39(1), 1414–1425 (2012) 15. Wang, H., Xu, Q., Zhou, L.: Large unbalanced credit scoring using lasso-logistic regression ensemble. PLoS ONE 10(2), e0117844 (2015) 16. Luo, X., et al.: Decision-tree-initialized dendritic neuron model for fast and accurate data classification. IEEE Trans. Neural Netw. Learn. Syst. 33(9), 4173–4183 (2021) 17. Tang, Y., et al.: A differential evolution-oriented pruning neural network model for bankruptcy prediction. Complexity 2019, 1–21 (2019) 18. Zhang, Y., et al.: An improved OIF Elman neural network based on CSO algorithm and its applications. Comput. Commun. 171, 148–156 (2021) 19. Meng, X., et al.: A new bio-inspired algorithm: chicken swarm optimization. In: Advances in Swarm Intelligence: 5th International Conference, ICSI 2014, Hefei, 17–20 October 2014, Proceedings, Part I 5. Springer (2014) 20. Tang, Y., et al.: A survey on machine learning models for financial time series forecasting. Neurocomputing 512, 363–380 (2022)

BERT-LBIA: A BERT-Based Late Bidirectional Interaction Attention Model for Legal Case Retrieval

Binxia Yang, Junlin Zhu, Xudong Luo(B), and Xinrui Zhang

Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering, Guangxi Normal University, Guilin 541004, China
[email protected]

Abstract. Most legal case retrieval methods rely on pre-trained language models like BERT, which can be slow and inaccurate. Alternatively, representation-based models provide quick responses but may not be the most accurate. To address these issues, our paper proposes a BERT-based late bidirectional interaction attention model for similar legal case retrieval. We use a dual BERT model as our backbone network to obtain feature representations of a query and its case candidates. Then, we develop a bidirectional interaction attention network to generate deep interactive attention signals between the query and its corresponding case candidates. Our experiments show that our model is faster and more accurate than existing retrieval models.

Keywords: Legal case retrieval · BERT · Representation-based · Attention mechanism

1 Introduction

In recent years, the rapid digitisation of legal cases and advancements in artificial intelligence (AI) have led to significant progress in the field of Legal Case Retrieval (LCR). The ability to efficiently retrieve relevant legal cases has become a crucial task in judicial intelligence [9], enabling legal professionals to access precedents [40], analyse legal arguments [15,37], and make informed decisions [30,31,41]. Traditionally, LCR has relied on manual processes and keyword-based searches. However, these methods often suffer from limitations such as incomplete or imprecise search results, making it challenging to find the most relevant cases. To address these limitations, researchers have turned to machine learning and natural language processing techniques to develop automated approaches for LCR.

In particular, similar LCR can be perceived as a type of document retrieval, and numerous document retrieval methodologies based on Pre-trained Language Models (PLMs) have been developed utilizing interaction-based fine-tuning architectures. For instance, Ding et al. [7] and Li et al. [14] select the most relevant key blocks from extensive document candidates, form shorter document candidates, and then use BERT [5] to learn the joint representation of the query and critical blocks. The degree of similarity between a query and a document is calculated based on these fine-tuned BERT representations. Similarly, Shao et al. [26] divide a query and its document candidates into multiple paragraphs, input these into BERT for fine-tuning, and then calculate the similarity between two legal cases based on paragraph-level semantic relations. Lastly, Xiao et al. [33] developed a PLM known as Lawformer, which is fine-tuned using hundreds of tokens from a query and thousands of tokens from document candidates and is subsequently used to calculate the similarity degree between a query and each document candidate. However, these interaction-based approaches require concatenating the query with the candidate documents and then feeding them into the whole model for fine-tuning. Therefore, although these methods may offer high accuracy, they operate slowly, thereby limiting their application in scenarios with many case candidates.

Representation-based fine-tuning architectures have been explored as alternatives to interaction-based ones, leveraging dual BERT models as encoders for both a query and its case candidate documents. Reimers et al. [23] introduced Sentence-BERT, which utilises two BERT models sharing parameters to model queries and candidate sentences separately, learning meaningful sentence embeddings and significantly reducing retrieval time through pre-caching of candidate sentence embeddings. However, their method's prediction accuracy may need to be improved due to a discrepancy between the classification objective function used for training and the cosine similarity computation performed during prediction. On the other hand, Samuel et al. [10] proposed the Poly-encoder, a transformer architecture that learns global self-attention features rather than token-level ones. While efficient in information retrieval, the absence of deep bidirectional interaction, due to the one-sided consideration of information in its dual-encoder system, could impact the method's accuracy.

To harness the benefits of both interaction-based and representation-based information retrieval methods and address their disadvantages, in this paper, we propose a hybrid LCR model based on BERT, attention, and a feedforward neural network. BERT is a state-of-the-art PLM that has demonstrated remarkable performance in various natural language processing tasks [28]. By leveraging BERT's contextualised word embeddings and powerful representation learning capabilities, we aim to enhance the accuracy and efficiency of LCR. Specifically, our model processes a query and its associated case candidates through different encoders to produce feature representations. Acknowledging that independent encoding does not yield optimal interaction information between the query and case candidates, we have developed a bidirectional interaction network that fosters profound attention between them. This network uses a Q2C Attention module and a Self-Attention module to incorporate information from the query case into its case candidate and vice versa. Subsequently, the system calculates the similarity degree between the query and each case candidate, ranking them to pinpoint the most analogous one.

Our experimental results underscore the remarkable potential of representation-based fine-tuning architectures. By crafting a suitable signalling interaction network based on the dual BERT model for a query and its case candidates, we achieved performance nearly akin to the interactive model while preserving the swift retrieval benefits of the representation model. Our proposed model outperforms existing models in similar case retrieval.

Fig. 1. The overall architecture of our model

The key contribution of our proposed model lies in its ability to integrate the strengths of interaction-based and representation-based approaches for LCR. Interaction-based models typically offer high accuracy by capturing the semantic interactions between a query and candidate documents. On the other hand, representation-based models excel in fast retrieval by generating efficient document representations. By incorporating a late bidirectional interaction attention mechanism into the BERT framework, our model enables deep interactive attention signals between a query and its case candidates while maintaining the advantages of fast retrieval.

In summary, our model offers a novel approach to LCR by combining the power of BERT's contextualised representations, bidirectional interaction attention, and fast retrieval capabilities.

The rest of the paper is organized as follows. Section 2 details our model. Section 3 experimentally analyses our model compared with several baseline methods. Section 4 compares our work with the related work. Finally, Sect. 5 concludes this paper and suggests future work.

2 Model

In this section, we will present the definition of our LCR task and explain the details of our model for the task.

2.1 Task Definition

In the legal case retrieval task, the objective of a given query case is to retrieve case candidates that are relevant to the query from a set of case candidates. Formally, given a case query q and a set of case candidates C = {c1, ..., cm}, the task is to retrieve all the cases relevant to q, i.e.,

\[
R = \{ r_k \mid k = 1, \cdots, m;\; r_k \in C \wedge Relevance(r_k, q) \}, \qquad (1)
\]

where Relevance(rk, q) denotes that rk is a case relevant to q (in at least one respect [17]).

2.2 Model Architecture

Figure 1 shows the architecture of our model. The core of our model is to add a late bidirectional interaction attention network to the dual BERT architecture, which addresses the inherent defects of the dual BERT. The intuition for this idea is that the dual BERT architecture encodes the query and candidate descriptions separately and thus cannot generate interactions to produce high-quality representations. Our model contains the following three main components.

1) Query and Candidate Encoder. We use a dual-encoder structure similar to most previous work [10,12,23], which can obtain excellent retrieval efficiency. Specifically, we use two BERT models as encoders for the query and the case candidates to encode them separately and generate text representations. This structure can cache the case candidate representations during inference, improving the inference speed. Formally, we construct a query case as a string Qtext = {CLS, q1, ..., qn, SEP}, and a case candidate as another string Ctext = {CLS, c1, ..., cm, SEP}, where n and m are the lengths of the query and the case candidate, respectively, and CLS and SEP are two special tokens in BERT. We then input Qtext and Ctext into the BERT encoders to obtain the text vector matrices E_Qtext ∈ R^{n×d} and E_Ctext ∈ R^{m×d} for the query and the case candidate, respectively, where d = 768 is the dimension of the BERT output.

2) Q2C Attention Module. For the sequence matrices E_Qtext and E_Ctext, we learn the interactions between them by the Query-to-Candidate (Q2C) Attention network. This module computes the mutual attention between each pair of words in the two sequences so that the two sequences interact globally. Similar to [42], the Q2C Attention module uses the standard attention structure, which can fuse word features from the query sequence into the candidate sequence. Formally, let

\[
E_{Q_{text}} = \{ e_q^i \mid i = 1, \cdots, n \}, \qquad (2)
\]

\[
E_{C_{text}} = \{ e_c^j \mid j = 1, \cdots, m \}; \qquad (3)
\]

then the similarity between each word in the query sequence and each word in the candidate sequence is calculated by matrix multiplication (the · operation), i.e.,

\[
R = E_{C_{text}} \cdot E_{Q_{text}}^{T} , \qquad (4)
\]


which is a real-number matrix of size m × n. Then we normalize each row and each column of R by the softmax function to obtain two attention weight matrices R̄_q ∈ R^{m×n} and R̄_c ∈ R^{m×n}, respectively:

\[
\bar{R}_q = softmax_{row}(R), \qquad (5)
\]

\[
\bar{R}_c = softmax_{col}(R). \qquad (6)
\]

Next, we calculate two attention similarity matrices Ā_q and Ā_c through R̄_q and R̄_c as follows:

\[
\bar{A}_q = \bar{R}_q \cdot E_{Q_{text}} , \qquad (7)
\]

\[
\bar{A}_c = \bar{R}_q \cdot \bar{R}_c^{T} \cdot E_{C_{text}} . \qquad (8)
\]

Finally, we obtain the final candidate sequence representation, fused with query sequence features, through the concatenation operation:

\[
S_{C_{text}} = \mathrm{CAT}\left(E_{C_{text}},\; \bar{A}_q,\; E_{C_{text}} \odot \bar{A}_q,\; E_{C_{text}} \odot \bar{A}_c\right), \qquad (9)
\]

where ⊙ denotes element-wise multiplication and S_Ctext ∈ R^{m×4d}.

3) Self-Attention Module. For the sequence matrix E_Qtext output by the query encoder, we use a self-attention mechanism [36] to map E_Qtext into a fixed-size embedding.¹ Before computing the self-attention of a query sequence, we need to integrate the candidate sequence features into E_Qtext to obtain the initial query representation:

\[
\bar{E}_{Q_{text}} = \{ \bar{e}_q^i \mid i = 1, \cdots, n \}, \qquad (10)
\]

where

\[
\bar{e}_q^i = e_q^i \odot C_{avg} , \qquad (11)
\]

\[
C_{avg} = \frac{1}{m} \sum_{j=1}^{m} e_c^j , \qquad (12)
\]

where m is the number of words in the candidate text. Thus, from the initial query representation matrix Ē_Qtext, we have:

\[
H_{Q_{text}} = \sum_{i=1}^{n} \beta_i \odot e_q^i , \qquad (13)
\]

where

\[
\beta = softmax\left(W_2 \left(\tanh\left(W_1 \bar{E}_{Q_{text}}^{T} + b_1\right)\right) + b_2\right), \qquad (14)
\]

where W1, W2 ∈ R^{d×d} are two trainable weight matrices, and b1 and b2 are biases. Finally, we obtain the pooled representation of the query case, H_Qtext ∈ R^d.

¹ The self-attention mechanism computes linear combinations to assign higher weights to the more important words in a sequence. Therefore, it can extract the important word features in a sequence to represent the sequence properly.


Similarly, for the sequence matrix S_Ctext output from the Q2C Attention module, we use the same mechanism to obtain the pooled representation of the case candidate H_Ctext ∈ R^{4d}, and then input H_Ctext into a Feedforward Neural Network (FFN) to map the high-dimensional vector (4d) into a low-dimensional vector (d):

\[
\hat{H}_{C_{text}} = \mathrm{FFN}\left(H_{C_{text}}\right) \in \mathbb{R}^{d} . \qquad (15)
\]

Finally, we obtain the final representations of the query and the case candidate through the concatenation operation:

\[
\bar{H}_{Q_{text}} = \mathrm{CAT}\left(Q_{avg}, H_{Q_{text}}\right), \qquad (16)
\]

\[
\bar{H}_{C_{text}} = \mathrm{CAT}\left(C_{avg}, \hat{H}_{C_{text}}\right), \qquad (17)
\]

where

\[
Q_{avg} = \frac{1}{n} \sum_{i=1}^{n} e_q^i . \qquad (18)
\]
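The late bidirectional interaction described by Eqs. (2)–(18) can be sketched in PyTorch as follows. The sketch assumes the two BERT encoders have already produced the token matrices E_Qtext and E_Ctext; the sizes of the scoring layers and other implementation details are illustrative assumptions, not the released code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LateBidirectionalInteraction(nn.Module):
    """Sketch of the interaction network of Eqs. (2)-(18)."""

    def __init__(self, d=768):
        super().__init__()
        # Token-level scorers used for self-attention pooling (Eq. (14) and the
        # candidate-side analogue); their exact shapes are assumptions.
        self.q_score = nn.Sequential(nn.Linear(d, d), nn.Tanh(), nn.Linear(d, d))
        self.c_score = nn.Sequential(nn.Linear(4 * d, 4 * d), nn.Tanh(), nn.Linear(4 * d, 4 * d))
        self.ffn = nn.Linear(4 * d, d)                                   # Eq. (15)

    @staticmethod
    def q2c_attention(E_q, E_c):
        """Eqs. (4)-(9): fuse query token features into the candidate sequence."""
        R = E_c @ E_q.T                                                  # (m, n), Eq. (4)
        R_q = F.softmax(R, dim=1)                                        # Eq. (5)
        R_c = F.softmax(R, dim=0)                                        # Eq. (6)
        A_q = R_q @ E_q                                                  # (m, d), Eq. (7)
        A_c = R_q @ R_c.T @ E_c                                          # (m, d), Eq. (8)
        return torch.cat([E_c, A_q, E_c * A_q, E_c * A_c], dim=-1)       # (m, 4d), Eq. (9)

    def forward(self, E_q, E_c):                                         # E_q: (n, d), E_c: (m, d)
        Q_avg, C_avg = E_q.mean(dim=0), E_c.mean(dim=0)                  # Eqs. (18), (12)
        S_c = self.q2c_attention(E_q, E_c)
        beta_q = F.softmax(self.q_score(E_q * C_avg), dim=0)             # Eqs. (10)-(14)
        H_q = (beta_q * E_q).sum(dim=0)                                  # Eq. (13)
        beta_c = F.softmax(self.c_score(S_c), dim=0)                     # same pooling on the fused candidate
        H_c = self.ffn((beta_c * S_c).sum(dim=0))                        # Eq. (15)
        H_q_final = torch.cat([Q_avg, H_q], dim=-1)                      # Eq. (16)
        H_c_final = torch.cat([C_avg, H_c], dim=-1)                      # Eq. (17)
        return F.cosine_similarity(H_q_final, H_c_final, dim=0)

# Example with random "BERT" token matrices for a 100-token query and a 400-token candidate
model = LateBidirectionalInteraction()
score = model(torch.randn(100, 768), torch.randn(400, 768))
```

Because the candidate token matrices can be pre-computed and cached offline, only the query encoder and this lightweight interaction need to run at query time, which is what preserves the fast-retrieval property of the representation-based design.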

2.3 Train and Test

Let Query = {q_i | i = 1, ..., N} be the set of all query cases, where N denotes the number of query cases, and let Candidate = {C_i^j | i = 1, ..., N; j = 1, ..., M} be the set of all case candidates, where M denotes the number of case candidates corresponding to a query case. Then the training set contains N × M samples, where each sample is a query-candidate pair ⟨q, c⟩. We divide the N × M sample pairs into two parts: the set of all positive sample pairs Ω_pos and the set of all negative sample pairs Ω_neg. For any positive sample pair ⟨q_i, C_i^j⟩ ∈ Ω_pos and negative sample pair ⟨q_k, C_k^l⟩ ∈ Ω_neg, we want to obtain

\[
\cos(q_i, C_i^j) > \cos(q_k, C_k^l), \qquad (19)
\]

where cos(·) denotes the cosine similarity between a query and a case candidate. In other words, we want the similarity scores of positive sample pairs to exceed those of negative sample pairs. Therefore, we can use the following objective function:

\[
L = \ln\!\left(1 + \sum_{sim(q_i, C_i^j) > sim(q_k, C_k^l)} e^{\gamma\left(\cos(q_k, C_k^l) - \cos(q_i, C_i^j)\right)}\right), \qquad (20)
\]

where γ > 0 is an adjustable hyperparameter and sim(·) denotes the golden similarity between a query and a case candidate. At test time, we first calculate the cosine similarity score Y_cos between a query and each candidate sequence, and then rank the candidates according to Y_cos from highest to lowest.
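A hedged sketch of the objective in Eq. (20) and the test-time cosine ranking is shown below. Splitting a batch of scores into positive and negative pairs by their golden similarity is a simplification of the pairwise condition sim(q_i, C_i^j) > sim(q_k, C_k^l), and the value of γ is an assumption.

```python
import torch
import torch.nn.functional as F

def ranking_loss(pos_cos, neg_cos, gamma=20.0):
    """Eq. (20): log(1 + sum over pairs of exp(gamma * (cos_neg - cos_pos)))."""
    diff = neg_cos.unsqueeze(1) - pos_cos.unsqueeze(0)   # every (negative, positive) pair
    return torch.log1p(torch.exp(gamma * diff).sum())

def rank_candidates(query_vec, candidate_vecs):
    """Test time: score cached candidate vectors by cosine similarity, highest first."""
    scores = F.cosine_similarity(query_vec.unsqueeze(0), candidate_vecs, dim=1)
    return torch.argsort(scores, descending=True), scores
```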

3 Experiments

This section will evaluate our model experimentally.

3.1 Datasets and Evaluation Metrics

In the evaluation experiments, we use the LeCaRD dataset, which is developed specifically for legal case retrieval tasks in [17]. The dataset contains 107 query cases and 10,718 case candidates, with each query case corresponding to 100 case candidates. It also includes an additional 18 case candidates. Each sample in the LeCaRD dataset is in the form of a triple ⟨q, c, r⟩, where q signifies the query case, c is the case candidate, and r ∈ {0, 1, 2, 3} denotes the similarity score between the query q and case candidate c (the higher the value of r, the greater the similarity between q and c). We split the LeCaRD dataset into training, validation, and test sets at 6:2:2. Legal case retrieval is to retrieve the top-k most relevant cases from the pool of case candidates. So, we choose Precision (P@k), Mean Average Precision (MAP), and Normalized Discounted Cumulative Gain (NDCG@k) as evaluation metrics to evaluate the performance of a model.
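These metrics can be computed from the golden relevance scores (0–3) of the ranked candidates as in the following sketch; the binarisation threshold used for P@k and MAP and the linear-gain DCG variant are assumptions, since the paper does not spell them out.

```python
import numpy as np

def precision_at_k(rels, k, threshold=1):
    """rels: golden relevance scores (0-3) in ranked order; relevant if >= threshold."""
    return np.sum(np.asarray(rels[:k]) >= threshold) / k

def ndcg_at_k(rels, k):
    rels = np.asarray(rels, dtype=float)
    k = min(k, len(rels))
    discounts = np.log2(np.arange(2, k + 2))
    dcg = np.sum(rels[:k] / discounts)
    idcg = np.sum(np.sort(rels)[::-1][:k] / discounts)
    return dcg / idcg if idcg > 0 else 0.0

def mean_average_precision(rels_per_query, threshold=1):
    aps = []
    for rels in rels_per_query:
        hits = np.asarray(rels) >= threshold
        if hits.sum() == 0:
            aps.append(0.0)
            continue
        cum = np.cumsum(hits)
        aps.append(np.mean([cum[i] / (i + 1) for i in range(len(hits)) if hits[i]]))
    return float(np.mean(aps))
```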

3.2 Baseline Models

We compare our model with some well-known baseline models for Information Retrieval (IR), including:

– TF-IDF [18]: It is a statistical measure dedicated to IR used to assess a word's relevance to a document in a document collection.
– BM25 [25]: It is a standard baseline commonly used for IR to estimate the relevance of a document to a given query.
– Sentence-BERT (SBERT) [23]: It is a representation-based fine-tuned architecture model that uses siamese or triplet network structures to generate semantically meaningful sentence embedding vectors. Moreover, it pulls the embeddings of semantically similar sentences closer together so that it can compute the similarity of two sentences.
– OpenCLaP [38]: It is a PLM customized to the Chinese legal domain. It performs well in legal case retrieval because it is pre-trained, based on BERT-base [5], on 6.63 million Chinese criminal law documents.
– Lawformer [33]: It is a PLM based on the Longformer [2] structure, which combines local sliding-window attention and task-driven global attention to capture long-range dependencies. Thus, it can handle long Chinese legal documents with thousands of tokens.
– BERT-PLI [26]: It captures paragraph-level semantic relations via BERT and then aggregates these paragraphs to generate interactions and compute correlations between two cases. It has proven to have outstanding performance in legal case retrieval tasks.


Fig. 2. Comparison of our model and other baseline models on LeCaRD dataset. q100 and q512 indicate that the maximum input length of the query encoder is 100 and 512, respectively. All results are the average of 10 runs on different random number seeds, except for TF-IDF and BM25.

Among the above baselines, TF-IDF and BM25 are classical IR models; Sentence-BERT is a representation-based fine-tuning architecture model; and OpenCLaP, Lawformer, and BERT-PLI are interaction-based fine-tuning architecture models dedicated to the legal domain.

3.3 Experiment Setting and Implementation

We choose the BERT model proposed in [4] as our backbone and use the deep learning framework PyTorch [21] to implement our model. Specifically, we use the BERTAdam optimiser to train our model with an initial learning rate of 1e-5 and a batch size of 16. In addition, we set the maximum input length to 512 for both the query and candidate encoders, and input sequences shorter than 512 are padded with the [PAD] token. For the classical IR models TF-IDF and BM25, we use the Gensim package to implement them with default parameters. For all neural network baseline models, we use the default parameters of the original paper or implementation and train them on our training set. For example, for Lawformer, due to the limitation of GPU resources, we set the maximum input lengths of the query and case candidates to 300 and 721, respectively, and select all tokens of the query case to perform the global attention mechanism. Besides, we train them in the

same experimental environment (Tesla A100) and use the NDCG@30 score of the validation set to achieve early stops for all experiments. Finally, we perform a fair evaluation of all models on the same test set.

Table 1. Average time in seconds to retrieve k case candidates on the LeCaRD dataset (10 retrievals).

                 Average scoring time (s)
                 CPU                   GPU
Candidates       1k     5k     10k     1k    5k    10k
SBERT            14     19     24      3     4     5
OpenCLaP         172    814    1701    11    43    84
Lawformer        966    4669   9472    38    116   213
BERT-PLI         -      -      -       109   537   1075
our model_q512   41     149    276     4     6     8

3.4 Benchmark Experiment

We show the performance of our model compared with the other baseline models on the LeCaRD test set in Fig. 2, regarding metrics such as Precision (i.e., P@5, P@10), Mean Average Precision (i.e., MAP), and ranking quality (i.e., NDCG@10, NDCG@20, NDCG@30). From Fig. 2, we can see that when we set the maximum input length of the query encoder to 100, our model's performance is inferior to that of the interaction-based models. In other words, if our model and an interaction-based architecture model receive the same input information, our model is weaker than the interaction-based model in terms of the interaction between the query and the case candidates. However, when we set the maximum input length of the query encoder in our model to 512, its performance improves significantly, achieving the best scores on P@5, P@10, MAP, NDCG@10, NDCG@20, and NDCG@30, which are 10%, 5%, 11%, 6.79%, 8.49%, and 6.06% higher, respectively, than the representation-based model, and 10%, 3.5%, 5.63%, 3.5%, 3.39%, and 2.54% higher, respectively, than the best interaction-based baseline model.

Table 1 shows the average time it takes for the deep learning models to retrieve k case candidates on the LeCaRD dataset. From the table, we find that the retrieval time of our model is much shorter than that of the interaction-based models, and the time difference becomes more evident as the number of case candidates increases.

3.5 Ablation Experiment

To investigate the effectiveness of the different components used in our model on the model performance, we conducted a series of ablation experiments. Figure 3 shows the results of the ablation experiment. From the figure, we can see:


– Without the late bidirectional interaction attention network, there is a significant degradation in performance compared to the whole model.
– Without the self-attention module, the model cannot generate self-attention and cannot fuse features from case candidates. Consequently, its performance is also degraded relative to the whole model.
– Without the Q2C attention module, the model cannot generate enough interaction information between the query and case candidates, and its performance also suffers.

Therefore, the late bidirectional interaction attention network is valuable and practical for the interaction between queries and case candidates.

Fig. 3. Ablation tests on the test set. The w/o indicates the removal of a component from our model. For example, w/o LBIA denotes removing the late bidirectional interaction attention network from our model. Other cases are similar.

4 Related Work

This section compares our work with the related work to show how our work advances the state of the art in the field.

4.1 Long Document Retrieval

In LCR tasks, the adjudicative documents involved usually contain long textual content (on average more than 6,000 words in length). Therefore, LCR is actually a long document retrieval task. Long document retrieval has always been a difficult and challenging task in the field of information retrieval, and so attracts many researchers. For example, in 2019, Zhu et al. [42] proposed a neural network model for long document retrieval in the healthcare domain, which uses a deep attention mechanism at the word, sentence, and document levels to allow efficient retrieval over documents of different lengths. In 2020, Arnold et al. [1] proposed a distributed document representation method based on contextual discourse vectors for efficient retrieval of answers from long medical documents. However, the document lengths that their method can handle are around 1,000 words, and the efficiency and effectiveness of these methods decrease as the document length increases. Long documents in the legal field are more than six times as long as that, making it difficult to apply these retrieval methods directly to long legal document retrieval. Moreover, none of them take into account information such as the specific structure of, and legal expertise in, adjudication documents. In contrast, in this paper we propose a model for retrieving long legal documents that considers the features of such documents.

Some researchers have also proposed long document retrieval methods in the legal domain. For example, in 2020, Shao et al. [26] proposed a model of this kind, called BERT-PLI. The model uses BERT to capture semantic relationships within paragraphs and then determines the relevance between two cases by combining paragraph-level interactions. To make it relevant to the legal domain, the model has been fine-tuned using a small-scale case law entailment dataset. However, our model is trained on a much larger dataset, and our experiments show that our model outperforms it significantly. In 2022, Nguyen et al. [19] proposed a general model using deep neural networks with attention mechanisms for statute law document retrieval. The model consists of two hierarchical architectures (Attentive CNN and Paraformer) with sparse attention for representing long sentences and articles. However, their model is not specific to LCR, whereas ours is.

4.2 Neural Information Retrieval

Neural IR (NIR) is the process of using deep neural networks to learn how to match queries and candidates. Currently, researchers have proposed many NIR methods for applications in various fields, such as legal IR, cross-language retrieval, question answering, and adversarial IR. These methods can usually be divided into two types: interaction-based networks and representation-based networks.

Interaction-based networks use the dynamic interactions between queries and documents to capture relevance signals and improve information retrieval by explicitly modelling the dependencies and relationships between them. Therefore, many researchers have developed advanced interactive IR methods on the basis of this kind of network structure. For example, in 2020, Wu et al. [32] proposed a context-aware Passage-level Cumulative Gain (PCG) approach, aggregating passages' relevance scores; a query and its candidate paragraph are then merged into the BERT model to learn their interaction information. In 2020, Jiang et al. [11], and in 2021, Yu et al. [34], used BERT in cross-lingual tasks to learn the representation of interactive information. However, the weakness of the interaction-based network is that it cannot learn the embedding representations of queries and candidate texts individually. Therefore, for an input query, it needs to recompute each document representation in the candidate set, which is an unbearable computational effort in large-scale data retrieval.

Representation-based networks leverage deep learning techniques to learn meaningful and semantically rich representations of textual data, enabling automated feature extraction, enhanced semantic understanding, and adaptability for improved information retrieval performance. Many researchers also study methods of this kind. For example, in 2013, Huang et al. [8] proposed a deep structured semantic model, which uses a multilayer feedforward neural network to learn the semantic representation and calculates the similarity by cosine similarity. In 2014, Shen et al. [27] improved the model structure using convolutional neural networks based on DSSM. In 2016, Palangi et al. [20], on the other hand, used LSTM encoders to generate embedded representations of queries and documents. In 2019, Reimers et al. [24] proposed the Sentence-BERT model, which uses two BERT models with shared parameters to model queries and candidate sentences, but which may suffer from inconsistency between training and prediction. In 2021, Wan and Ye [29] proposed a twin CNN model based on TinyBERT to compute the similarity of judgments, but this representation-based network may improve retrieval efficiency at the expense of retrieval accuracy. In 2022, Costa and Pedrosa [3] proposed a model of legal document representation for legal document retrieval. The model uses weighted histograms of concepts generated from word distances and similar terms in a thesaurus. The process involves creating concepts by grouping word vectors from a neural network model; the frequency of these concept clusters is then used to represent document vectors. These approaches all attempt to capture the semantic relationships between queries and texts and compute the similarity between them through deep neural networks. However, representation-based networks sacrifice retrieval accuracy while improving retrieval efficiency.

In general, both network models have their advantages. The interaction-based network allows for better aggregation of the relationships between queries and candidate documents, resulting in a richer document representation, but at the price of retrieval efficiency. In contrast, the representation-based network has higher retrieval efficiency, and its potential would be further unlocked if it could be improved to obtain higher-quality document embedding representations. In this regard, our model combines the advantages of both representational and interactive encoders: on the one hand, by incorporating an attention mechanism, it allows for better interaction between queries and candidate sentences compared to representational encoders; on the other hand, the separate encoding of queries and candidate sentences allows for offline caching of the encoding vectors of candidate sentences, which provides a significant improvement in matching speed compared to interactive encoders. Moreover, unlike one-way shallow interaction, our approach performs a deep interaction with two-way information fusion on the query side and the candidate document side. This helps to improve the model's ability to understand and model the semantic relationships between queries and documents, which in turn improves the retrieval performance.

4.3 Legal Case Retrieval

Early work on LCR mainly used word-to-word matching methods (e.g., TF-IDF [18], BM25 [25], and LMIR [35]), which estimate similarity by counting the overlapping words between two cases. The work of Zhong et al. [39] in 2019 is a variant of word-to-word matching for LCR: it selects the past cases that share the largest number of important words with a query case and differ from it in the smallest number of unimportant words. The strength of their model is that it can explain why one case is retrieved rather than another. However, word-matching-based LCR methods cannot correctly handle case documents that overlap heavily at the word level but are semantically dissimilar. Thus, some researchers have tried to bring the semantics of the two cases into word-matching-based LCR methods. For example, in 2019, Liu, Luo, and Yang [16] used a semantic and structure-based approach to calculate the similarity between the criminal factual texts of two legal cases, thus enabling the recommendation of similar legal cases. In 2020, Shao et al. [26] proposed a paragraph-level semantic interaction model that accomplishes document matching by aggregating the interaction information of paragraphs. In 2021, Xiao et al. [33] proposed a new model dedicated to the legal domain that captures long-distance dependencies between words to accomplish semantic matching of long documents. To address the issue that word-matching LCR methods cannot consider the semantics of cases, other researchers took a different route: using neural networks for LCR. For example, in 2019, Ranera, Solano, and Oco [22] used the document embedding technique Doc2Vec to calculate the semantic similarity between two cases. In 2021, Dhanani, Mehta, and Rana [6] proposed an LCR model based on graph clustering that finds semantically similar cases within clusters. However, these methods lack long-range semantic modelling capabilities. In contrast, our model uses BERT as the backbone to capture the semantic information of the cases and obtain long-range modelling capability. In 2022, Zhu, Luo, and Wu [40] devised a BERT-based Chinese legal case retrieval system that optimises candidate selection, BERT fine-tuning, and case ranking through the BM25 ranking function, auxiliary learning with a pointwise method, and a pairwise method, respectively, thereby surpassing other methods and securing a second-place finish in a relevant 2021 AI law competition. In 2023, Li and Wang [13] proposed an intelligent legal text analysis and information retrieval model that uses BERT and the attention mechanism to extract semantic information and align element examples with document sentences. These methods have achieved excellent performance in LCR tasks. However, they use an interaction-based model architecture, which suffers from slow inference and excessive resource consumption, so they are difficult to apply in practical LCR scenarios. In contrast, our model uses a representation-based architecture, which has a much faster inference speed than the interaction-based architecture. In addition, we use a late bidirectional interaction network to accomplish the semantic interaction between a query and its case candidates, which compensates for the lack of precision of the representation-based architecture.

5 Conclusions

This paper presented a late bidirectional interaction attention model based on the pretrained language model BERT for similar legal case retrieval. The critical point of our work is to add a bidirectional interaction network to the dual BERT model to fuse the feature information of a query and its case candidates. We have conducted extensive experiments on the LeCaRD dataset, showing that our model balances retrieval speed and text representation quality: it is faster than existing interaction-based legal case retrieval models and more accurate than representation-based retrieval models. In the future, it is worth studying how to use deeper information about the interaction between a query and its case candidates to generate higher-quality document embedding representations for legal case retrieval.

Acknowledgements. This work was partially supported by a Research Fund of Guangxi Key Lab of Multi-source Information Mining & Security (22-A-01-02), a Graduate Student Innovation Project of the School of Computer Science and Engineering, Guangxi Normal University (JXXYYJSCXXM-2021-001), and the Middle-aged and Young Teachers' Basic Ability Promotion Project of Guangxi (No. 2021KY0067).

References 1. Arnold, S., van Aken, B., Grundmann, P., Gers, F.A., L¨ oser, A.: Learning contextualized document representations for healthcare answer retrieval. In: WWW ’20: Proceedings of the Web Conference 2020, pp. 1332–1343 (2020) 2. Beltagy, I., Peters, M.E., Cohan, A.: Longformer: the long-document transformer. arXiv preprint arXiv:2004.05150 (2020) 3. Costa, W.M., Pedrosa, G.V.: A textual representation based on bag-of-concepts and thesaurus for legal information retrieval. In: Anais do X Symposium on Knowledge Discovery, Mining and Learning, pp. 114–121. SBC (2022) 4. Cui, Y., Che, W., Liu, T., Qin, B., Wang, S., Hu, G.: Revisiting pre-trained models for Chinese natural language processing. In: Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 657–668 (2020) 5. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 4171–4186 (2019) 6. Dhanani, J., Mehta, R., Rana, D.: Legal document recommendation system: a cluster based pairwise similarity computation. J. Intell. Fuzzy Syst. 41(5), 5497– 5509 (2021) 7. Ding, M., Zhou, C., Yang, H., Tang, J.: CogLTX: applying BERT to long texts. In: Proceedings of the 34th International Conference on Neural Information Processing Systems, vol. 33, pp. 12792–12804 (2020) 8. Huang, P.S., He, X., Gao, J., Deng, L., Acero, A., Heck, L.: Learning deep structured semantic models for web search using clickthrough data. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 2333–2338 (2013)


9. Huang, Q., Luo, X.: State-of-the-art and development trend of artificial intelligence combined with law. Comput. Sci. 45(12), 1–11 (2018) 10. Humeau, S., Shuster, K., Lachaux, M.A., Weston, J.: Poly-encoders: architectures and pre-training strategies for fast and accurate multi-sentence scoring. In: International Conference on Learning Representations (2020) 11. Jiang, Z., El-Jaroudi, A., Hartmann, W., Karakos, D., Zhao, L.: Cross-lingual information retrieval with BERT. In: Proceedings of the Workshop on Cross-Language Search and Summarization of Text and Speech, pp. 26–31 (2020) 12. Khattab, O., Zaharia, M.: ColBERT: efficient and effective passage search via contextualized late interaction over BERT. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 39–48 (2020) 13. Li, B., Wang, M.: Design of intelligent legal text analysis and information retrieval system based on BERT model. https://doi.org/10.21203/rs.3.rs-2994403/v1 (2023) 14. Li, M., Gaussier, E.: KeyBLD: selecting key blocks with local pre-ranking for long document information retrieval. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2207–2211 (2021) 15. Liu, J., Wu, J., Luo, X.: Chinese judicial summarising based on short sentence extraction and GPT-2. In: Qiu, H., Zhang, C., Fei, Z., Qiu, M., Kung, S.-Y. (eds.) KSEM 2021. LNCS (LNAI), vol. 12816, pp. 376–393. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-82147-0 31 16. Liu, Y., Luo, X., Yang, X.: Semantics and structure based recommendation of similar legal cases. In: 2019 IEEE 14th International Conference on Intelligent Systems and Knowledge Engineering (ISKE), pp. 388–395. IEEE (2019) 17. Ma, Y., Shao, Y., Wu, Y., Liu, Y., Zhang, R., Zhang, M., Ma, S.: LeCaRD: a legal case retrieval dataset for Chinese law system. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2342–2348 (2021) 18. Mao, W., Chu, W.W.: Free-text medical document retrieval via phrase-based vector space model. In: Proceedings of the AMIA Symposium, pp. 489–493 (2002) 19. Nguyen, H.T., Phi, M.K., Ngo, X.B., Tran, V., Nguyen, L.M., Tu, M.P.: Attentive deep neural networks for legal document retrieval. Artificial Intelligence and Law, pp. 1–30 (2022) 20. Palangi, H., et al.: Deep sentence embedding using long short-term memory networks: analysis and application to information retrieval. IEEE/ACM Trans. Audio, Speech, Lang. Process. 24(4), 694–707 (2016) 21. Paszke, A., et al.: PyTorch: an imperative style, high-performance deep learning library. Adv. Neural. Inf. Process. Syst. 32, 8026–8037 (2019) 22. Ranera, L.T.B., Solano, G.A., Oco, N.: Retrieval of semantically similar Philippine supreme court case decisions using doc2vec. In: 2019 International Symposium on Multimedia and Communication Technology, pp. 1–6 (2019) 23. Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 3982–3992 (2019) 24. Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pp. 3982–3992 (2019)


25. Robertson, S.E., Walker, S.: Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 232–241 (1994) 26. Shao, Y., Mao, J., Liu, Y., Ma, W., Satoh, K., Zhang, M., Ma, S.: BERT-PLI: modeling paragraph-level interactions for legal case retrieval. In: Proceedings of the 29th International Conference on International Joint Conferences on Artificial Intelligence, pp. 3501–3507 (2020) 27. Shen, Y., He, X., Gao, J., Deng, L., Mesnil, G.: Learning semantic representations using convolutional neural networks for web search. In: Proceedings of the 23rd International Conference on World Wide Web, pp. 373–374 (2014) 28. Sun, K., Luo, X., Luo, M.Y.: A survey of pretrained language models. In: Knowledge Science, Engineering and Management: KSEM 2022, Lecture Notes in Computer Science, vol. 13369, pp. 442–456 (2022) 29. Wan, Z., Ye, N.: Similarity calculation method of siamese-CNN judgment document based on TinyBERT. In: 2021 International Conference on Intelligent Computing, Automation and Applications, pp. 27–32 (2021) 30. Wu, J., Liu, J., Luo, X.: Few-shot legal knowledge question answering system for Covid-19 epidemic. In: 2020 3rd International Conference on Algorithms, Computing and Artificial Intelligence, pp. 1–6 (2020) 31. Wu, J., Luo, X.: Alignment-based graph network for judicial examination task. In: Qiu, H., Zhang, C., Fei, Z., Qiu, M., Kung, S.-Y. (eds.) KSEM 2021. LNCS (LNAI), vol. 12817, pp. 386–400. Springer, Cham (2021). https://doi.org/10.1007/ 978-3-030-82153-1 32 32. Wu, Z., Mao, J., Liu, Y., Zhan, J., Zheng, Y., Zhang, M., Ma, S.: Leveraging passage-level cumulative gain for document ranking. In: WWW’20: Proceedings of The Web Conference 2020, pp. 2421–2431 (2020) 33. Xiao, C., Hu, X., Liu, Z., Tu, C., Sun, M.: Lawformer: a pre-trained language model for Chinese legal long documents. AI Open 2, 79–84 (2021) 34. Yu, P., Fei, H., Li, P.: Cross-lingual language model pretraining for retrieval. In: WWW’21: Proceedings of the Web Conference 2021, pp. 1029–1039 (2021) 35. Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: ACM Special Interest Group on Information Retrieval, vol. 51, issue 2, pp. 268–276 (2017) 36. Zhang, T., et al.: Feature-level deeper self-attention network for sequential recommendation. In: Proceedings of the 28th International Conference on International Joint Conferences on Artificial Intelligence, pp. 4320–4326 (2019) 37. Zhang, X., Luo, X.: A machine-reading-comprehension method for named entity recognition in legal documents. In: Neural Information Processing: ICONIP 2022, Communications in Computer and Information Science, vol. 1793, pp. 224–236. Springer, Singapore (2022). https://doi.org/10.1007/978-981-99-1645-0 19 38. Zhong, H., Zhang, Z., Liu, Z., Sun, M.: Open Chinese language pre-trained model zoo. Tech. rep., Tsinghua University (2019). https://github.com/thunlp/openclap 39. Zhong, Q., Fan, X., Luo, X., Toni, F.: An explainable multi-attribute decision model based on argumentation. Expert Syst. Appl. 117, 42–61 (2019) 40. Zhu, J., Luo, X., Wu, J.: A BERT-based two-stage ranking method for legal case retrieval. In: Knowledge Science, Engineering and Management: KSEM 2022, Lecture Notes in Computer Science, vol. 13369, pp. 534–546. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-10986-7 43


41. Zhu, J., Wu, J., Luo, X., Liu, J.: Semantic matching based legal information retrieval system for COVID-19 pandemic. Artif. Intell. Law, 1–30 (2023). https:// doi.org/10.1007/s10506-023-09354-x 42. Zhu, M., Ahuja, A., Wei, W., Reddy, C.K.: A hierarchical attention retrieval model for healthcare question answering. In: WWW’19: The World Wide Web Conference, pp. 2472–2482 (2019)

Learning Discriminative Semantic and Multi-view Context for Domain Adaptive Few-Shot Relation Extraction

Minghui Zhai1,2, Feifei Dai1(B), Xiaoyan Gu1,2, Haihui Fan1, Dong Liu1, and Bo Li1

1 Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
{zhaiminghui,daifeifei,guxiaoyan,fanhaihui,liudong,libo}@iie.ac.cn
2 School of Cyberspace Security, University of Chinese Academy of Sciences, Beijing, China

Abstract. Few-shot relation extraction enables a model to extract new relations and has achieved impressive success. However, when new relations come from new domains, semantic and syntactic differences cause a dramatic drop in model performance, so the domain adaptive few-shot relation extraction task becomes important. Existing works identify relations more by entities than by context, which makes it difficult to effectively distinguish different relations with similar entity semantic backgrounds in professional domains. In this paper, we propose a method called multi-view context representation with discriminative semantic learning (MCDS). This method learns discriminative entity representations and enhances the use of relational information in context, thus effectively distinguishing different relations with similar entity semantics. Meanwhile, it filters partial entity information out of the global information through an information filtering mechanism to obtain more comprehensive global information. We perform extensive experiments on the FewRel 2.0 dataset, and the results show an average gain of 2.43% in accuracy over all strong baselines.

Keywords: Discriminative entity semantic · Multi-view context · Domain adaptive few-shot relation extraction

1 Introduction

Relation Extraction (RE) aims to identify the relation between two specified entities in a sentence and has been widely used in question answering, knowledge graph construction, and many other applications [1,3,11]. Recently, many works have achieved significant success by training RE models on large amounts of labeled data [14,17,31]. However, it is difficult to obtain sufficient labeled data in real-world scenarios, which limits the performance of RE models. Inspired by few-shot learning, many studies have introduced the few-shot relation extraction (FSRE) task to extract new relations with sparsely labeled instances [6,12,20,29].


Fig. 1. An example of different relations in the biomedical domain with similar entity semantics, where the entities are mostly related to diseases or biomedicine. Blue and red represent the head and tail entities respectively. Green represents key information in the context that identifies the relation. [E1], [\E1], [E2] and [\E2] are virtual tokens used to highlight entities in existing works. (Color figure online)

This can alleviate the performance degradation caused by data scarcity. However, when new relations come from another domain with different professional knowledge, semantic and syntactic differences severely reduce the ability of the model to extract new relations. Therefore, it becomes crucial to make the model capable of adapting to new domains, and domain adaptive few-shot relation extraction (DAFSRE) becomes an important research direction. The DAFSRE task is mostly handled with the meta-learning framework [5,24,27], which enables the model to learn meta-knowledge from multiple RE tasks sampled from a large-scale dataset (the source domain) and apply it to the target domain. The prototype network [25] is an effective meta-learning-based algorithm: it takes the class centers of same-class samples as relation prototypes and classifies unknown samples into their closest prototypes. For example, [7] uses the prototype network to handle DAFSRE tasks and achieves some success. The performance of the prototype network is closely related to the quality of the relation prototypes. To improve it, some works learn a large amount of prior knowledge by pre-training, which can help to understand the meaning of relations [10,22,23,26]. However, prior knowledge does not relate directly to relations and can only provide limited assistance in relation extraction. Therefore, to learn knowledge directly related to relations, some works use the descriptive information of relations to learn high-quality relation prototypes [4,9,19,21,30]. Further, [18] introduces prototype discriminative information to effectively distinguish different relation prototypes. Albeit much progress, these approaches to learning relational knowledge still limit the adaptation of the model to new domains. Most existing works train models to distinguish different relations using an entity-centric matching paradigm, which focuses on learning entities rather than the context. However, entities in professional domains often have similar semantic backgrounds. In this scenario, models that weaken relational knowledge in context have difficulty distinguishing different relations with similar entity semantics. As shown in Fig. 1, the entities of the two relations, such as "human influenza" and "nerve pain", are all related to disease or biomedicine. The method of using virtual tokens to mark entities weakens the contextual cues "related to" and "especially" that help to understand the relations. This makes it more difficult for the matching paradigm to distinguish between the relations "result in" and "classified as".


To solve the above problems, we propose a new model: multi-view context representation with discriminative semantic learning (MCDS). The training process is shown in Fig. 2. First, we propose a discriminative semantic learning module to reduce the interference caused by entities with similar semantics. We use prompt templates [13] to extract the semantics of entities from the pre-trained language model and amplify the differences between the semantics of different entities through contrastive learning. This allows the model to obtain discriminative entity representations and thus effectively distinguish different relations with similar entity semantics in professional domains. Second, to enhance the ability of the model to learn relational information in context, we propose a multi-view context representation module. It extracts local and global information from the context to construct a multi-view context representation and uses it as relation information. This enables the model to make better use of the relational knowledge in context to distinguish different relations. In addition, it avoids over-focusing on entities by filtering partial entity information out of the global information, and thus extracts more comprehensive global information to effectively learn the relation expressed in context. The main contributions of our work are as follows:
– We design a discriminative semantic learning module, which makes the model capable of learning discriminative entity representations to distinguish different relations with similar entity semantics in professional domains.
– We design a multi-view context representation module, which extracts local and global information to enhance the use of relational information in context. Thus, the model can effectively use context to distinguish different relations.
– We propose an information filtering mechanism, which filters partial entity information out of the global information to avoid over-focusing on entities and thus extracts more comprehensive global information.
– We conduct extensive experiments on the FewRel 2.0 dataset and demonstrate that our model achieves significant performance gains on the DAFSRE task.

2 Related Work

Traditional RE models cannot be effectively adapted to scenarios that lack labeled data. Especially in many professional domains, differences in knowledge further degrade the performance of the model. Therefore, DAFSRE has become an important research direction. The prototype network is a widely used algorithm in existing works. For example, Proto-BERT [7] uses the prototype network to classify query instances into their most similar relation prototypes. The performance of the prototype network depends heavily on the quality of the relation prototypes. Therefore, to improve prototype quality, some works learn prior knowledge that helps to understand relations through pre-training methods.


For example, CP [22] and MTB [26] use entity-masked contrastive learning [8,28] to improve the quality of prototypes. ALDAP [23] learns prior knowledge by predicting masked words on the target-domain data. LDUR [10] obtains unbiased prototypes by supervised contrastive pre-training and a sentence entity-level prototype network. Prior knowledge is not directly related to relations, so many works introduce descriptive information of relations to assist in learning prototypes. HCRP [9] uses descriptive information of relations to learn fine-grained prototypes. FAEA [4] uses descriptive information to extract function words, thus improving prototype quality. PRM [19] proposes a gating mechanism that corrects the relation prototype distribution. Further, GTPN [18] introduces prototype discriminative information to distinguish different relation prototypes. Besides these, BERT-PAIR [7] proposes an instance matcher for computing the similarity score between a query instance and a relation prototype. GMGEN [16] uses topology and graph convolution methods to capture cross-task general knowledge. DaFeC [2] generates pseudo-labels by clustering unlabeled target-domain data to learn knowledge in the target domain directly. However, existing works have difficulty distinguishing different relations with similar entity semantics. Therefore, we propose a multi-view context representation with discriminative semantic learning method. It learns discriminative entity representations and enhances the use of relational information in context to effectively distinguish different relations with similar entity semantics.

3 Task Definition

Given a source domain dataset $D_S$ and a target domain dataset $D_T$, $D_S$ contains rich base classes, each with a large number of labeled instances, while $D_T$ contains new relation classes, each with only a small number of instances. The new classes and the base classes are disjoint. DAFSRE aims to train the model on the source domain and apply it to the target domain. We follow the typical few-shot task setup named N-way-K-shot, which consists of a support set $S$ and a query set $Q$. Specifically, in each training iteration we randomly select $N$ classes, each with $K$ instances, from the base classes to form the support set $S = \{s_k^i;\ i = 1, \ldots, N,\ k = 1, \ldots, K\}$. Then we randomly select $G$ instances from the remaining instances of the $N$ classes to form the query set $Q = \{q_j;\ j = 1, \ldots, G\}$. The model predicts the relation classes of the instances in $Q$ so that it can extract different classes of relations. Each instance in $S$ and $Q$ is a triple $(x, e, y)$, where $x$ denotes the sentence, $e = (e_1, e_2)$ denotes the specified head and tail entities, and $y$ denotes the label of the relation between them.
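As a concrete illustration of the N-way-K-shot setup described above, the sketch below samples one training episode from a labelled source-domain corpus. The data layout (a dict from relation labels to instance lists) and the per-class query count are assumptions made for the example, not part of the original setup.

```python
import random

def sample_episode(data_by_relation, n_way=5, k_shot=1, q_per_class=1, seed=None):
    """Sample one N-way-K-shot episode: a support set S and a query set Q.

    data_by_relation maps a relation label to a list of instances, where each
    instance is (sentence, (head_entity, tail_entity), label).
    """
    rng = random.Random(seed)
    relations = rng.sample(sorted(data_by_relation), n_way)        # N relation classes
    support, query = [], []
    for rel in relations:
        chosen = rng.sample(data_by_relation[rel], k_shot + q_per_class)
        support.extend(chosen[:k_shot])                            # K labelled instances per class
        query.extend(chosen[k_shot:])                              # remaining instances go to Q
    rng.shuffle(query)
    return support, query
```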

4 The Proposed Method

In this section, we describe MCDS in detail. Figure 2 shows the overall framework of MCDS. The inputs are a support set with N × K sentences and a query set with G sentences. MCDS consists of two parts.


Fig. 2. The overall structure of MCDS. The total loss of the model consists of two parts ($L_E$ and $L_{FS}$). The "⊕" indicates concatenation, "Head" and "Tail" indicate the head and tail entities in a sentence, and "CLS" indicates the information of the whole sentence.

The discriminative semantic learning module learns discriminative entity representations by amplifying the differences between the semantics of different entities. The multi-view context representation module extracts local and global information from the context to represent relational information. Meanwhile, the information filtering mechanism filters out part of the entity information to avoid over-focusing on entities and thus obtains more comprehensive global information. Together, these components allow the model to effectively distinguish different relations with similar entity semantics.

4.1 Discriminative Semantic Learning Module

To distinguish different relations that have similar entity semantic backgrounds more effectively, we use contrastive learning to learn discriminative entity representations by amplifying the differences between the semantics of different entities. To extract the semantics of the head and tail entities, we preprocess the sentences before inputting them into the pre-trained language model BERT [15]. First, inspired by the work on MTB [26], we use virtual tokens ([E1], [\E1], [E2], and [\E2]) to mark the head and tail entities in the sentence, which effectively captures the entities from the sentence. Then, to obtain the semantic information of the entities, we design a prompt template and splice it onto the end of the sentence. The form of the template is "Entity means [M]". A complete sentence after preprocessing looks like: "[E1] Newton [\E1] served as the president of [E2] the Royal Society [\E2]. Newton means [M1], the Royal Society means [M2]". We then input the sentences into the BERT encoder to get the embedding information. We use the tokens [E1], [\E1], [E2], and [\E2] to represent the entities and the tokens [M1] and [M2] to represent their semantics.


We combine the hidden states corresponding to these tokens to represent the head and tail entities and their semantics in the following form:

$H_k^i = [E1] + [\backslash E1] \in \mathbb{R}^d, \quad T_k^i = [E2] + [\backslash E2] \in \mathbb{R}^d$,  (1)

$HS_k^i = [M1] \in \mathbb{R}^d, \quad TS_k^i = [M2] \in \mathbb{R}^d$,  (2)

where $H_k^i$ and $T_k^i$ represent the head and tail entities, respectively, $HS_k^i$ and $TS_k^i$ represent their semantics, and $i$ and $k$ denote the $i$-th category and the $k$-th instance in that category, respectively. Regarding the representation of semantics, the direct use of hidden states preserves semantic information more intuitively and completely than mapping them to a few specific semantic words. In the following, we amplify the differences between the semantics of different entities through contrastive learning. This allows the model to focus on the differences between semantics and thus learn discriminative entity representations. The loss function for contrastive learning is as follows:

$L_E = -\frac{1}{N \times K}\sum_{i=1}^{N}\sum_{k=1}^{K}\log\left[\frac{\exp\left(d\left(H_k^i, HS_k^i\right)\right)}{\sum_{j=1}^{N}\exp\left(d\left(H_k^i, HS_k^j\right)\right)} \times \frac{\exp\left(d\left(T_k^i, TS_k^i\right)\right)}{\sum_{j=1}^{N}\exp\left(d\left(T_k^i, TS_k^j\right)\right)}\right]$,  (3)

where $d$ is the distance function.
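A minimal PyTorch sketch of the loss in Eq. (3) is given below. It assumes the entity and semantic vectors have already been gathered from the BERT hidden states at the marker and [M] positions, and it takes d to be cosine similarity, consistent with the experimental setup reported later; the tensor shapes are assumptions for illustration rather than the authors' released code.

```python
import torch
import torch.nn.functional as F

def entity_semantic_contrastive_loss(H, T, HS, TS):
    """L_E from Eq. (3).

    H, T   : (N, K, d) head / tail entity representations H_k^i, T_k^i
    HS, TS : (N, K, d) prompt-derived semantic representations HS_k^i, TS_k^i
    """
    N, K, _ = H.shape
    Hn, Tn = F.normalize(H, dim=-1), F.normalize(T, dim=-1)
    HSn, TSn = F.normalize(HS, dim=-1), F.normalize(TS, dim=-1)

    # sim_head[i, k, j] = d(H_k^i, HS_k^j): each entity compared against the
    # semantics of every class j for the same instance index k.
    sim_head = torch.einsum('ikd,jkd->ikj', Hn, HSn)
    sim_tail = torch.einsum('ikd,jkd->ikj', Tn, TSn)

    # log-softmax over classes j; the positive pair is j == i.
    log_p_head = F.log_softmax(sim_head, dim=-1)
    log_p_tail = F.log_softmax(sim_tail, dim=-1)
    idx = torch.arange(N)
    pos = log_p_head[idx, :, idx] + log_p_tail[idx, :, idx]   # (N, K); log of the product in Eq. (3)
    return -pos.mean()                                        # average over the N*K instances
```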

4.2 Multi-view Context Representation Module

To enhance the use of relational information in context for distinguishing different relations with similar entity semantic backgrounds, we design a multi-view context representation module. It extracts local and global information from the context and combines them into a multi-view context representation to serve as relational information. Local information captures the essential semantic connection between the head and tail entities, and global information captures the information related to the entities in the whole sentence. To extract the local information, we input the head and tail entities together into a convolutional neural network (CNN) to mine the essential connections between the entity semantics. First, we obtain the representation of the entities:

$Head = [E1] \oplus [\backslash E1] \in \mathbb{R}^{2d}, \quad Tail = [E2] \oplus [\backslash E2] \in \mathbb{R}^{2d}$,  (4)

where "$\oplus$" indicates concatenation, and "$Head$" and "$Tail$" indicate the head and tail entities, respectively. Then the local information $LI$ is extracted by the CNN:

$LI = CNN(Head \oplus Tail) \in \mathbb{R}^{2d}$.  (5)

In the following, we use the built-in token "CLS" in BERT to represent the information of the whole sentence. It is fed into the CNN together with the entities to extract the global information related to the entities in the sentence:

$HGI = CNN(Head \oplus CLS) \in \mathbb{R}^{d}$,  (6)

$TGI = CNN(Tail \oplus CLS) \in \mathbb{R}^{d}$,  (7)

where $HGI$ and $TGI$ are the global information corresponding to the head entity and the tail entity, respectively. To avoid over-focusing on the head and tail entities and thus extract more comprehensive global information, we propose an information filtering mechanism. This mechanism filters out partial information of the head and tail entities from the global information according to the similarity between $HGI$ and $TGI$, as follows:

$\overline{HGI} = HGI - w \times TGI \in \mathbb{R}^{d}$,  (8)

$\overline{TGI} = TGI - w \times HGI \in \mathbb{R}^{d}$,  (9)

$w = \frac{HGI^{\top}\, TGI}{\|HGI\|_{2}\, \|TGI\|_{2}}$,  (10)

where $w$ is the similarity and $\overline{HGI}$ and $\overline{TGI}$ are the information after filtering. When $w$ is large, it indicates that both $HGI$ and $TGI$ focus excessively on the head and tail entities, which obscures the contextual information that can represent the relation; in this case we need to filter out partial information about the entities to highlight the contextual information. After that, we obtain the global information:

$GI = \overline{HGI} \oplus \overline{TGI} \in \mathbb{R}^{2d}$.  (11)

Finally, we combine the local and global information to obtain the multi-view context representation:

$MC = LI \oplus GI \in \mathbb{R}^{4d}$,  (12)

and combine $MC$ with the entity information to obtain the final relation representation of each sentence:

$R = Head \oplus Tail \oplus MC \in \mathbb{R}^{8d}$.  (13)
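The following sketch traces Eqs. (4)–(13): it fuses the entity pair into local information, fuses each entity with the CLS vector into global information, applies the information filtering mechanism, and assembles the final relation representation R. The small MLP fusion layers are stand-ins for the paper's 4-layer CNN, so the fusion architecture and layer sizes are assumptions made for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiViewContext(nn.Module):
    """Local/global context fusion with the information filtering mechanism."""

    def __init__(self, d: int):
        super().__init__()
        # Stand-ins for the paper's CNN fusion networks.
        self.local_fuse = nn.Sequential(nn.Linear(4 * d, 2 * d), nn.ReLU(), nn.Linear(2 * d, 2 * d))
        self.global_fuse = nn.Sequential(nn.Linear(3 * d, d), nn.ReLU(), nn.Linear(d, d))

    def forward(self, head, tail, cls):
        # head, tail: (batch, 2d) entity representations; cls: (batch, d) sentence vector.
        li = self.local_fuse(torch.cat([head, tail], dim=-1))       # Eq. (5): local information
        hgi = self.global_fuse(torch.cat([head, cls], dim=-1))      # Eq. (6)
        tgi = self.global_fuse(torch.cat([tail, cls], dim=-1))      # Eq. (7)

        # Information filtering mechanism, Eqs. (8)-(10).
        w = F.cosine_similarity(hgi, tgi, dim=-1, eps=1e-8).unsqueeze(-1)
        hgi_f = hgi - w * tgi
        tgi_f = tgi - w * hgi
        gi = torch.cat([hgi_f, tgi_f], dim=-1)                      # Eq. (11): global information

        mc = torch.cat([li, gi], dim=-1)                            # Eq. (12): multi-view context
        return torch.cat([head, tail, mc], dim=-1)                  # Eq. (13): relation representation R (8d)
```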

4.3 Training Objective

To make the model extract different classes of relations, we optimize the model by categorizing the relations in the query set into their nearest relation prototypes. First, we obtain the relation prototypes $P$ from the support set by prototype learning:

$P = \{p_i,\ i = 1, \ldots, N\}$,  (14)

$p_i = \frac{1}{K}\sum_{k=1}^{K} s_k^i \in \mathbb{R}^{8d}$,  (15)

where $p_i$ denotes the prototype of the $i$-th relation category. Then the loss function of the training objective is defined as follows:

$L_{FS} = -\sum_{j=1}^{G}\log\frac{\exp\left(d\left(q_j, p_q\right)/\tau\right)}{\sum_{i=1}^{N}\exp\left(d\left(q_j, p_i\right)/\tau\right)}$,  (16)


where $q_j$ is the $j$-th instance in the query set, $p_q$ represents the relation prototype that the instance belongs to, $\tau$ is the temperature hyperparameter, and $d$ is the distance function. The loss function of our entire model is as follows:

$L = \lambda_1 \times L_E + \lambda_2 \times L_{FS}$,  (17)

where $\lambda_1$ and $\lambda_2$ are the weight hyperparameters.
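For completeness, here is a short sketch of the episode-level objective in Eqs. (14)–(17), assuming normalized cosine similarity as the distance function and the default loss weights reported in the experiments; variable names and shapes are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def episode_loss(support, query, query_labels, l_e, tau=0.5, lam1=2.0, lam2=1.0):
    """Prototype loss L_FS (Eqs. 14-16) and total loss L (Eq. 17).

    support      : (N, K, 8d) relation representations of support instances
    query        : (G, 8d) relation representations of query instances
    query_labels : (G,) index of the true class of each query instance
    l_e          : contrastive loss L_E from Eq. (3)
    """
    protos = support.mean(dim=1)                                    # Eq. (15): class centres p_i
    q = F.normalize(query, dim=-1)
    p = F.normalize(protos, dim=-1)
    sim = q @ p.t() / tau                                           # d(q_j, p_i) / tau, cosine similarity
    l_fs = F.cross_entropy(sim, query_labels, reduction='sum')      # Eq. (16): summed over query instances
    return lam1 * l_e + lam2 * l_fs                                 # Eq. (17)
```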

5 Experiments

5.1 Data Set and Experiment Details

We conduct experiments on the widely used FewRel 2.0 dataset, which is the largest dataset available for the domain adaptive few-shot challenge. Its training set is from the general domain (Wiki data), and the validation and test sets are from the biomedical domain. We evaluate our model in four few-shot scenarios: 5-way-1-shot, 5-way-5-shot, 10-way-1-shot, and 10-way-5-shot. Since the labels of the test set are not publicly available, we submit the test-set predictions to the official evaluation to obtain the final results. The ablation study is conducted on the validation set. We use the BERT-base-uncased model as the encoder. The CNN has 4 layers, and the numbers of convolutional kernels in each layer are 256, 128, 64, and 1. The convolutional kernel size is set to 5. The temperature parameter $\tau$ is set to 0.5, and normalized cosine similarity is used as the distance function. The weight parameters $\lambda_1$ and $\lambda_2$ in the loss function $L$ are set to 2 and 1, respectively. All hyperparameters are adjusted on the validation set.
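For reference, the hyperparameters listed above can be collected in a single configuration object; the dictionary below is only an illustrative summary, not the authors' configuration file.

```python
# Hyperparameters reported above, gathered for convenience (layout is illustrative only).
MCDS_CONFIG = {
    "encoder": "bert-base-uncased",
    "cnn_layers": 4,
    "cnn_kernels": [256, 128, 64, 1],
    "cnn_kernel_size": 5,
    "temperature_tau": 0.5,
    "distance": "normalized cosine similarity",
    "loss_weights": {"lambda_1": 2, "lambda_2": 1},
    "eval_settings": ["5-way-1-shot", "5-way-5-shot", "10-way-1-shot", "10-way-5-shot"],
}
```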

5.2 Baselines

We compare MCDS with fifteen baseline methods. Based on the encoder, we divide the baselines into CNN-based and BERT-based models, and we further divide the BERT-based models into two groups according to whether descriptive information of relations is used. Specifically, the CNN-based models are Proto-CNN [7], Proto-CNN-ADV [7], and DaFeC-Proto-CNN [2]; the BERT-based models that do not use description information are Proto-BERT [7], BERT-PAIR [7], ALDAP [23], DaFeC [2], MTB [26], CP [22], GTPN [18], and LDUR [10]; and the BERT-based models that use relational description information are HCRP [9], PRM [19], FAEA [4], and GMGEN [16].

5.3 Overall Results

Table 1 reports the results of our model compared to baselines on the test set. From the results we can observe that:

Table 1. Accuracy (%) on the FewRel 2.0 test set. Bold indicates the highest accuracy, and underlining indicates the second-highest accuracy.

Encoder  Model                 5-way-1-shot  5-way-5-shot  10-way-1-shot  10-way-5-shot  AVG
CNN      Proto-CNN [7]         35.09         49.37         22.98          35.22          35.67
CNN      Proto-CNN-ADV [7]     42.21         58.71         28.91          44.35          43.55
CNN      DaFeC-Proto-CNN [2]   48.58         65.80         35.53          52.71          50.66
BERT     Proto-BERT [7]        40.12         51.50         26.45          36.93          38.75
BERT     BERT-PAIR [7]         67.41         78.57         54.59          66.85          66.85
BERT     ALDAP [23]            55.35         79.01         40.49          66.90          60.44
BERT     DaFeC [2]             61.20         76.99         47.63          64.79          62.65
BERT     MTB [26]              74.70         87.90         62.50          81.10          76.55
BERT     CP [22]               79.70         84.90         68.10          79.80          78.13
BERT     GTPN [18]             80.00         92.60         69.25          86.90          82.20
BERT     LDUR [10]             79.33         91.59         67.48          85.70          81.03
BERT     HCRP [9]              76.34         83.03         63.77          72.94          74.02
BERT     PRM [19]              73.98         88.38         62.72          79.43          76.13
BERT     FAEA [4]              73.58         90.10         62.98          80.51          76.79
BERT     GMGEN [16]            76.67         91.28         64.19          84.84          79.25
BERT     MCDS                  83.75         93.49         73.15          88.10          84.63

– Our model outperforms all strong baselines. In the four few-shot task scenarios, our model improves by 3.75, 0.89, 3.90, and 1.20 points, respectively, and the average improvement is 2.43 points. This demonstrates the effectiveness of our innovation in learning discriminative entity representations and enhancing the use of context by extracting a multi-view context representation. Notably, our model improves by an average of 3.83 points in the 1-shot scenario, which shows that it is particularly suitable for tasks with fewer samples.
– MCDS is more effective than GTPN. This is because GTPN uses additional markers to distinguish different relations rather than exploring differences in relations at the semantic level. In contrast, MCDS achieves better results by amplifying differences between the semantics of different entities and thus obtaining discriminative entity representations.
– While MTB and CP use Wikipedia data to pre-train the model, MCDS works better. This is because MCDS extracts both local and global information and thus mines more contextual knowledge. This reflects the high data utilization of MCDS, which means that more knowledge can be mined from limited data.
– Models that introduce relational descriptions in the training process, such as HCRP and PRM, do not achieve significant performance breakthroughs. This is because introducing relational descriptions can complicate the model, which leads to overfitting of the source-domain relations to some extent in cross-domain tasks.


Table 2. Ablation study of MCDS on the FewRel 2.0 validation set. We use accuracy (%) to evaluate the performance of the model in the four task scenarios as well as the average performance.

Model      5-way-1-shot  5-way-5-shot  10-way-1-shot  10-way-5-shot  AVG
MCDS       83.93         93.55         75.35          88.23          85.27
-semantic  82.98         93.20         74.28          87.85          84.56 (-0.71)
-local     82.70         93.11         73.93          88.07          84.45 (-0.82)
-global    82.26         93.08         73.87          87.78          84.23 (-1.04)
-multi     81.08         92.72         72.82          87.16          83.42 (-1.85)
-filter    82.43         93.30         73.96          87.95          84.41 (-0.86)

5.4 Ablation Study

To analyze the role of each component in MCDS, we perform an ablation study on the validation set of FewRel 2.0; the results are reported in Table 2.

Analysis of Discriminative Semantic Learning. As shown in Table 2, after removing this module (-semantic), the average performance decreases by 0.71 points. This is because it learns discriminative entity representations and thus effectively distinguishes different relations with similar entity semantics. To further demonstrate the effectiveness of learning discriminative entity representations, we split the validation set into two parts. Specifically, we put the samples whose head and tail entities both have significant biomedical semantics into $U_a$ and the remaining samples into $U_b$. Compared to $U_b$, the entities of different relations in $U_a$ have more similar semantic backgrounds. The samples in $U_a$ and $U_b$ account for 72% and 28% of the validation set, respectively. The datasets $U_a$ and $U_b$ are incomplete, so they only support validating the model in the 5-way-1-shot scenario. We then evaluate -semantic and MCDS on the full validation set, $U_a$, and $U_b$; the results are reported in Table 3. We can observe that this module improves the model by 1.10 points on $U_a$, which indicates that learning discriminative entity representations can effectively distinguish different relations with similar entity semantics. Moreover, we find that the improvement from this module is smaller on $U_b$. This is because the entities in $U_b$ already have some semantic differentiation, which is consistent with the purpose of this module and therefore weakens its effect to some extent. Finally, we randomly select 5 classes of relations from $U_a$ and visualize their distribution. As shown in Fig. 3, after removing this module, the similar entity semantics leads to unclear boundaries among some relations, whereas when discriminative entity representations are learned through this module, there are obvious boundaries among the relations.

Analysis of Multi-view Context Representation. As shown in Table 2, after removing this module (-multi), the average performance decreases by 1.85 points.


Table 3. Accuracy (%) of -semantic and MCDS on the validation set as well as on $U_a$ and $U_b$.

           Validation set   U_a             U_b
-semantic  82.98            81.98           84.61
MCDS       83.93 (+0.95)    83.08 (+1.10)   85.03 (+0.42)

Fig. 3. The PCA visualization of the sample distribution in -semantic and MCDS models.

This is because this module enhances the ability to learn relational information in context, thus effectively using the context to distinguish different relations. From Table 2, we can find that the performance decreases when removing only the local information (-local) or only the global information (-global), which indicates that both types of information are useful. Moreover, the performance of the model is further improved when the two are combined into a multi-view context representation, which suggests that local and global information are mutually reinforcing. To demonstrate the effectiveness of the module for extracting relation information from context, we perform a visualization experiment. We take the sentence "These tumors are the most common non-epithelial neoplasms of gastric wall" from the target domain as an example. The head and tail entities in the sentence are "tumors" and "non-epithelial neoplasms", and the relation is "classified as". We input the sentence into -multi and MCDS; the result is shown in Fig. 4. We can find that the multi-view context representation module makes the model focus on the words "are", "the", and "common", which are the most relevant information for "classified as" in the context. Moreover, the module does not focus on the remaining unimportant information such as "most" and "gastric".

Analysis of Information Filtering Mechanism. As shown in Table 2, after removing the information filtering mechanism (-filter), the average performance decreases by 0.86 points. This is because the mechanism obtains more comprehensive global information by avoiding excessive focus on the head and tail entities.

294

M. Zhai et al.

Fig. 4. The attention scores of both -multi and MCDS models to relational information in context. Table 4. The similarity between global information and entities in the three cases. Before IFM and After IFM denote before and after information filtering mechanisms, respectively. -filter

MCDS (Before IFM) MCDS (After IFM)

Similarity 0.0572 0.0208

0.007

This helps to extract more accurate relation information from the context. To further demonstrate the usefulness of the mechanism, we take the model without the information filtering mechanism as a comparison. We calculate the similarity between the global information and the entities in both the -filter and MCDS models, and the results are shown in Table 4. We find that MCDS effectively reduces the similarity between the global information and the entities through the information filtering mechanism. Moreover, the result of MCDS (Before IFM) is also significantly lower than that of -filter. This suggests that the process of filtering partial entity information can also constrain the model to focus less on entities when the global information is extracted.

6 Conclusion

In this paper, we propose MCDS, which can efficiently handle DAFSRE tasks. MCDS learns discriminative entity representations by amplifying the semantic differences between entities and enhances the use of relational information in context by extracting a multi-view context representation. It also obtains more comprehensive global information through an information filtering mechanism. Together, these allow different relations with similar entity semantics to be effectively distinguished in professional domains. Finally, we achieve a clear performance improvement on the FewRel 2.0 dataset and demonstrate the effectiveness of MCDS and its modules.

Acknowledgements. This work was supported by No. XDC02050200.

References 1. Bach, N., Badaskar, S.: A review of relation extraction. Lit. Rev. Lang. Stat. II(2), 1–15 (2007)


2. Cong, X., Yu, B., Liu, T., Cui, S., Tang, H., Wang, B.: Inductive unsupervised domain adaptation for few-shot classification via clustering. In: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2020, Ghent, Belgium, 14–18 September 2020, Proceedings, Part II, pp. 624–639 (2021) 3. Distiawan, B., Weikum, G., Qi, J., Zhang, R.: Neural relation extraction for knowledge base enrichment. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 229–240 (2019) 4. Dou, C., Wu, S., Zhang, X., Feng, Z., Wang, K.: Function-words adaptively enhanced attention networks for few-shot inverse relation classification. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, pp. 2937–2943 (2022) 5. Finn, C., Abbeel, P., Levine, S.: Model-agnostic meta-learning for fast adaptation of deep networks. In: International Conference on Machine Learning, pp. 1126–1135 (2017) 6. Gao, T., Han, X., Liu, Z., Sun, M.: Hybrid attention-based prototypical networks for noisy few-shot relation classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 6407–6414 (2019) 7. Gao, T., et al.: FewRel 2.0: towards more challenging few-shot relation classification. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 6250–6255 (2019) 8. Gao, T., Yao, X., Chen, D.: SimCSE: simple contrastive learning of sentence embeddings. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 6894–6910 (2021) 9. Han, J., Cheng, B., Lu, W.: Exploring task difficulty for few-shot relation extraction. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 2605–2616 (2021) 10. Han, J., Cheng, B., Nan, G.: Learning discriminative and unbiased representations for few-shot relation extraction. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management, pp. 638–648 (2021) 11. Han, J., Cheng, B., Wang, X.: Two-phase hypergraph based reasoning with dynamic relations for multi-hop KBQA. In: IJCAI, pp. 3615–3621 (2020) 12. Han, Y., et al.: Multi-view interaction learning for few-shot relation classification. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management, pp. 649–658 (2021) 13. Jiang, T., et al.: PromptBERT: improving BERT sentence embeddings with prompts. arXiv preprint arXiv:2201.04337 (2022) 14. Jin, L., et al.: Relation extraction exploiting full dependency forests. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 8034–8041 (2020) 15. Kenton, J.D.M.W.C., Toutanova, L.K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, pp. 4171–4186 (2019) 16. Li, W., Qian, T.: Graph-based model generation for few-shot relation extraction. In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 62–71 (2022) 17. Li, Y., Liu, Y., Gu, X., Yue, Y., Fan, H., Li, B.: Dual reasoning based pairwise representation network for document level relation extraction. In: 2022 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2022)


18. Liu, F., et al.: From learning-to-match to learning-to-discriminate: global prototype learning for few-shot relation classification. In: Li, S., et al. (eds.) CCL 2021. LNCS (LNAI), vol. 12869, pp. 193–208. Springer, Cham (2021). https://doi.org/10.1007/ 978-3-030-84186-7 13 19. Liu, Y., Hu, J., Wan, X., Chang, T.H.: Learn from relation information: towards prototype representation rectification for few-shot relation extraction. In: Findings of the Association for Computational Linguistics: NAACL 2022, pp. 1822–1831 (2022) 20. Liu, Y., Hu, J., Wan, X., Chang, T.H.: A simple yet effective relation information guided approach for few-shot relation extraction. In: Findings of the Association for Computational Linguistics: ACL 2022, pp. 757–763 (2022) 21. Liu, Y., et al.: Powering fine-tuning: learning compatible and class-sensitive representations for domain adaption few-shot relation extraction. In: International Conference on Database Systems for Advanced Applications, pp. 121–131 (2023) 22. Peng, H., et al.: Learning from context or names? an empirical study on neural relation extraction. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 3661–3672 (2020) 23. Qian, W., Zhu, Y.: Adversarial learning with domain-adaptive pretraining for fewshot relation classification across domains. In: 2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS), pp. 134–139 (2021) 24. Ravi, S., Larochelle, H.: Optimization as a model for few-shot learning. In: International Conference on Learning Representations (2017) 25. Snell, J., Swersky, K., Zemel, R.: Prototypical networks for few-shot learning. In: Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 4080–4090 (2017) 26. Soares, L.B., Fitzgerald, N., Ling, J., Kwiatkowski, T.: Matching the blanks: distributional similarity for relation learning. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2895–2905 (2019) 27. Vinyals, O., Blundell, C., Lillicrap, T., Wierstra, D.: Matching networks for one shot learning. In: Advances in Neural Information Processing Systems 29 (2016) 28. Yan, Y., et al.: ConSERT: a contrastive framework for self-supervised sentence representation transfer. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 5065–5075 (2021) 29. Yang, K., et al.: Enhance prototypical network with text descriptions for few-shot relation classification. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, pp. 2273–2276 (2020) 30. Zhang, P., Lu, W.: Better few-shot relation extraction with label prompt dropout. arXiv preprint arXiv:2210.13733 (2022) 31. Zhou, P., et al.: Attention-based bidirectional long short-term memory networks for relation classification. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (volume 2: Short papers), pp. 207–212 (2016)

ML2FNet: A Simple but Effective Multi-level Feature Fusion Network for Document-Level Relation Extraction

Zhi Zhang1,2, Junan Yang1,2(B), and Hui Liu1,2

1 College of Electronic Engineering, National University of Defense Technology, Hefei 230037, China
{zhangzhi,liuhui17c}@nudt.edu.cn, [email protected]
2 Anhui Province Key Laboratory of Electronic Restriction, Hefei 230037, China

Abstract. Document-level relation extraction, which aims to extract relations across multiple sentences, presents new challenges compared to its sentence-level counterpart. Current graph-based and transformer-based models have achieved a certain degree of success. However, most approaches focus only on local information around a certain entity, without considering the global interdependency among the relational triples. To solve this problem, this paper proposes a novel relation extraction model with a Multi-Level Feature Fusion Network (ML2FNet). Specifically, we first establish the interaction between entities by constructing an entity-level relation matrix. Then, we employ an enhanced U-shaped network to fuse the multi-level features of entity pairs from local to global. Finally, the relation classification of entity pairs is performed by a bilinear classifier. We conduct experiments on three public document-level relation extraction datasets, and the results show that ML2FNet outperforms the other baselines. Our code is available at https://github.com/zzhinlp/ML2FNet.

Keywords: Document-level Relation Extraction · Natural Language Processing · Transformer · Feature Fusion

1 Introduction

Relation extraction (RE) aims to identify the relation between two entities in long texts and is an essential topic in information extraction and natural language processing. Existing research has focused on sentence-level RE [1,2], i.e., relations between entities in a single sentence. In the real world, however, as with relation facts from Wikipedia articles and biomedical literature, more than 40.7% of relations can only be identified from multiple sentences [3]. This problem, often called document-level RE, requires models that capture complex interactions between entities throughout a document. The document-level setting is more realistic and is receiving increasing attention in the field of information extraction.


Fig. 1. An example from DocRED. Note that mentions of the same entity are marked with identical color.

In a sentence-level RE dataset such as WebNLG [4], the task is limited to a single sentence, which is usually short. For document-level RE, however, the main challenges are as follows. (1) The complexity of the document-level RE task grows quadratically with the number of entities: for a document containing N entities, N(N − 1) entity pairs must be classified for relations, while most entity pairs do not express any relation. (2) A text contains multiple relational triples that are closely related to each other. As shown in Fig. 1, intra-sentence triples are easily identified because their subject and object entities appear in one sentence, for example, triple 1 in the first sentence and triple 2 in the seventh sentence. However, inter-sentence triples, whose entity mentions appear in different sentences, are difficult to identify, such as triple 3, due to their long-distance dependencies. To identify relations between inter-sentence entity pairs, on the one hand, many approaches construct document graphs with entities and sentences as nodes and associations as edges based on graph neural networks. These approaches perform graph inference through heuristics, guided attention, or correlation dependency strategies to identify inter-sentence relations. However, the graphs constructed by such approaches may ignore some important information in the text, and the complex inference operations on graphs prevent the model from capturing the text structure. On the other hand, thanks to the ability of pre-trained language models (PLMs) to implicitly establish long-distance dependencies in text, many methods directly use PLMs instead of graph inference structures; these methods obtain entity representations through PLMs and then classify relations on the entity-pair representations. Although some success has been achieved, most of these methods only obtain entity representations that focus on the local information where the entities are located and ignore the global information of the whole document, and they cannot capture the global interdependency between relational triples. Capturing this interdependency is effective because multiple triples in a document are usually related in some way: after identifying one or more triples, it is possible to rule out certain relations for some entity pairs while facilitating the identification of other triples.


For example, in Fig. 1, the inter-sentence relation between entity pairs is usually difficult to identify, but once the other two triples have been identified, it is easy to infer the triple, which shows that the global interdependency between triples is extremely important for reasoning about inter-sentence relations in long texts. To capture the local-to-global multi-level interdependency among the multiple relational triples, this paper proposes a novel RE model with a Multi-Level Feature Fusion Network. Specifically, we first establish the interaction between entities by constructing an N × N entity-level relation matrix, in which each item represents an entity pair and can be classified to obtain multiple relational triples. Then, to capture the local-to-global interdependency of the triples, we employ an enhanced U-shaped network that captures different degrees of local-to-global interdependency among the triples. Finally, we use a bilinear classifier to predict the relation between entity pairs (a toy sketch of this overall pipeline is given after the contribution list below). We conducted experiments on three datasets, DocRED [3], CDR [5], and GDA [6]. Experimental results show that our model consistently outperforms competitive baselines. The main contributions of this paper are as follows:
• We propose a novel document-level RE model, ML2FNet, that captures different degrees of local-to-global interdependency of relational triples; such interdependency enhances the ability of the model to infer inter-sentence relations.
• As far as we know, the proposed ML2FNet is the first model that combines an enhanced U-shaped module from computer vision with the relation extraction problem.
• We conducted experiments on three public RE datasets. Experimental results demonstrate that our model outperforms many state-of-the-art methods. We also conducted ablation experiments and a case study to analyze the performance of our model and the reasons for it.
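As promised above, here is a toy sketch of the overall pipeline: an N × N entity-pair matrix, a small down/up-sampling block standing in for the enhanced U-shaped fusion network, and a bilinear classifier over each cell. All layer choices and dimensions are assumptions for illustration and do not reproduce the actual ML2FNet architecture described later in the paper.

```python
import torch
import torch.nn as nn

class PairMatrixClassifier(nn.Module):
    """Toy version of the pipeline sketched above (not the actual ML2FNet layers)."""

    def __init__(self, d: int, num_relations: int, channels: int = 64):
        super().__init__()
        self.pair_proj = nn.Linear(2 * d, channels)
        # Minimal down/up-sampling block standing in for the enhanced U-shaped fusion network.
        self.down = nn.Conv2d(channels, channels, kernel_size=3, stride=2, padding=1)
        self.up = nn.ConvTranspose2d(channels, channels, kernel_size=4, stride=2, padding=1)
        self.head = nn.Linear(d, channels)
        self.tail = nn.Linear(d, channels)
        self.bilinear = nn.Bilinear(channels, channels, num_relations)

    def forward(self, entities):
        # entities: (N, d) document-level entity embeddings.
        n, d = entities.shape
        pairs = torch.cat([entities.unsqueeze(1).expand(n, n, d),
                           entities.unsqueeze(0).expand(n, n, d)], dim=-1)    # (N, N, 2d) entity-level relation matrix
        feat = self.pair_proj(pairs).permute(2, 0, 1).unsqueeze(0)            # (1, C, N, N)
        fused = self.up(torch.relu(self.down(feat)))                          # local-to-global feature fusion (toy)
        fused = fused[:, :, :n, :n].squeeze(0).permute(1, 2, 0)               # back to (N, N, C)
        h = self.head(entities).unsqueeze(1).expand(n, n, -1)
        t = self.tail(entities).unsqueeze(0).expand(n, n, -1)
        return self.bilinear((h + fused).contiguous(), (t + fused).contiguous())  # (N, N, num_relations) logits
```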

2 Related Work

Relation extraction can be divided by granularity into sentence-level [1,2] and document-level [3,7] RE. Early research focused on sentence-level RE, which predicts relations within a single sentence. [8,9] proposed traditional rule-based and feature-based approaches, and [10,11] proposed neural networks to mine the features of entities and relations. With PLMs [12] achieving strong performance on numerous downstream natural language processing tasks, PLM-based methods have become popular for RE, especially the pre-train-then-fine-tune paradigm. Although effective, in practical situations many real-world relations can only be inferred across multiple sentences, and the mentions of an entity are scattered throughout the document, i.e., at the document level. In recent years, researchers have therefore gradually shifted from sentence-level
to document-level RE [3]. Document-level RE requires sophisticated inference and inter-sentence information to predict the relations of entity pairs. Depending on the line of research taken, document-level RE methods fall into three main categories. Sequence-Based Models. Sequence-based models encode the entire document as sequences of words and sentences, using neural networks such as CNN [10] and BiLSTM [13] to obtain entity embeddings, and predict the relations of entity pairs from the two entity embeddings with bilinear functions. Graph-Based Models. Graph-based models usually build a graph with mentions, entities, or sentences as nodes and infer relations by reasoning over the graph; this approach is widely used because of its effectiveness and its advantages in relational reasoning. [14] first attempted to use tokens and dependency information as nodes and edges to build a document graph, and much subsequent work [7,15–17] extended this idea with different graph neural network (GNN) [18] architectures. Specifically, [15] created a graph with multiple types of nodes and edges. [16] proposed a latent structure refinement model that uses latent structure induction to induce dependency trees in documents. [17] used a bipartite graph to process intra-sentence and inter-sentence entity pairs separately. [36] suggested a novel classifier model, HeterGSAN, which attempts to reconstruct path dependencies from the graph representation. Graph models can bridge sentences and entities across the full text, thus alleviating long-distance dependencies and achieving promising performance. However, the graphs constructed by these models may discard important information in the text, and the complex reasoning process over graphs can also prevent these methods from capturing the text structure. Transformer-Based Models. Transformer-based models can implicitly model long-distance dependencies in text and have become a popular and promising approach for document-level RE. [20] proposed a two-step training methodology that first determines whether two entities are related and then predicts the specific relation. [21] proposed a hierarchical inference network that integrates reasoning information at three granularities: entity, sentence, and document. [19] incorporated entity structure dependencies into the attention network and the encoding phase to generate attention biases that adaptively adjust the attention flow. [22] proposed adaptive thresholding and localized context pooling to handle the problems of multi-entity mentions and multi-relation classification, respectively. However, these methods only capture long-distance dependencies between tokens and cannot obtain the global interdependency between entity pairs and relational triples. Different from previous approaches, we propose a multi-level feature fusion network that captures the local-to-global interdependency of relational triples, improving the global interaction between triples and further strengthening the model's ability to infer inter-sentence relations.

Fig. 2. Architecture of our proposed model.

3 Method

In this section, we first give the definition of document-level RE and then detail our proposed approach, whose structure is shown in Fig. 2.

3.1 Problem Definition

Given a document $D$ that contains sentences $X_D = \{x_i\}_{i=1}^{|X_D|}$ and entities $E_D = \{e_i\}_{i=1}^{|E_D|}$, the task is to identify all possible relations from a pre-defined relation set $R$ between all entity pairs $(e_s, e_o)$, where $e_s$ and $e_o$ are the subject and object entities, respectively. Each entity $e \in E_D$ appears at least once in $D$ and may be mentioned multiple times at scattered positions, with all its mentions denoted as $M_e = \{m_i\}_{i=1}^{|M_e|}$. Entity pairs that exhibit no relation are labeled NA, and each entity pair $(e_s, e_o)$ can hold multiple relations. Note that entities and relations can occur simultaneously in different triples.

3.2 Encoder

Text Encoder. We leverage a Transformer-based PLM to obtain token embeddings and cross-token dependencies. While other methods use only the last layer, our method uses the average of the last three layers. Before encoding, we insert a special token "∗" at the beginning and end of every entity mention to mark the entity positions. The tokens $T_D = \{t_i\}_{i=1}^{|T_D|}$ are then fed into the PLM to obtain the $d$-dimensional token embeddings $H = [h_1, \ldots, h_{|T_D|}]$ and the cross-token attention matrix $A$:

$$H, A = \mathrm{PLM}(T_D) \tag{1}$$
where $h_i$ is the embedding of token $t_i$, $H \in \mathbb{R}^{d \times |T_D|}$ averages the hidden states of each token over the last three transformer layers of the PLM, and $A \in \mathbb{R}^{H \times |T_D| \times |T_D|}$ is the attention weight averaged over the last three transformer layers.

Entity Embedding. The entity embedding $h_e \in \mathbb{R}^d$ of each entity $e$ with mentions $M_e = \{m_i\}_{i=1}^{|M_e|}$ is computed by aggregating its mention information. Specifically, we apply logsumexp pooling to obtain an entity embedding that incorporates the different mention embeddings:

$$h_e = \log \sum_{i=1}^{|M_e|} \exp\left(H_{m_i}\right) \tag{2}$$

where $|M_e|$ is the number of mentions of entity $e$ and $H_{m_i}$ is the embedding of mention $m_i$. This pooling accumulates signals from the different mentions in the document and performs better than average pooling, yielding the aggregated entity embedding $h_e$.

Localized Context Embedding. Previous works [19,22,23] have shown that contextual information is essential for RE, so we also adopt Localized Context Pooling [22]. Let $A_{ijk}$ denote the attention weight from token $j$ to token $k$ in the $i$-th head ($H$ is the number of attention heads). For each entity $e$, we first obtain the mention-level attention $a_{m_i} \in \mathbb{R}^{H \times |T_D|}$, which represents the attention of a mention to the other tokens, and then aggregate the attention of all mentions to obtain the entity-level attention $A^e = \sum_{i=1}^{|M_e|} a_{m_i}$. Given an entity pair $(e_s, e_o)$, we locate the context that matters to both $e_s$ and $e_o$ by multiplying their entity-level attentions, which yields the localized context embedding $c^{(s,o)}$:

$$q^{(s,o)} = \sum_{i=1}^{H} \big(A_i^{e_s} \cdot A_i^{e_o}\big), \qquad a^{(s,o)} = q^{(s,o)} / \mathbf{1}^{\top} q^{(s,o)}, \qquad c^{(s,o)} = H\, a^{(s,o)} \tag{3}$$

where $A^{e_s}, A^{e_o} \in \mathbb{R}^{H \times |T_D|}$ are the aggregated attention weights, $q^{(s,o)}$ is the pooled attention for the entity pair, and $H \in \mathbb{R}^{d \times |T_D|}$ is the contextual representation. The context vector $c^{(s,o)} \in \mathbb{R}^d$ is then combined with the entity embeddings.
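The following PyTorch sketch illustrates the encoding and the two pooling steps above (Eqs. 1–3). It is only a minimal illustration under assumed tensor shapes and variable names (e.g., `mention_idx` as lists of marker positions); it is not the authors' released implementation.

```python
import torch
from transformers import AutoModel, AutoTokenizer

def encode(text, tokenizer, model):
    """Encode a document and average the last three transformer layers (Eq. 1)."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    out = model(**inputs, output_hidden_states=True, output_attentions=True)
    H = torch.stack(out.hidden_states[-3:]).mean(0).squeeze(0)   # [seq_len, d]
    A = torch.stack(out.attentions[-3:]).mean(0).squeeze(0)      # [heads, seq_len, seq_len]
    return H, A

def entity_embedding(H, mention_idx):
    """Logsumexp pooling over the mention (marker) embeddings of one entity (Eq. 2)."""
    return torch.logsumexp(H[mention_idx], dim=0)                # [d]

def localized_context(H, A, mentions_s, mentions_o):
    """Localized context pooling for an entity pair (Eq. 3)."""
    A_s = A[:, mentions_s, :].mean(dim=1)                        # [heads, seq_len] entity-level attention
    A_o = A[:, mentions_o, :].mean(dim=1)
    q = (A_s * A_o).sum(dim=0)                                   # combine heads -> [seq_len]
    a = q / (q.sum() + 1e-12)                                    # normalize to a distribution
    return (a.unsqueeze(1) * H).sum(dim=0)                       # context vector c_(s,o), [d]
```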

3.3 Enhanced U-Shape Module

We use the obtained entity-to-entity correlations $c^{(s,o)}$ to construct the entity-level relation matrix $M$, and then fuse the local-to-global interdependency of entity pairs and triples through a multi-level feature fusion module. Specifically, by counting all samples of the dataset and taking the maximum number of entities $N$ per document, we construct an entity-level relation matrix $M \in \mathbb{R}^{N \times N \times D}$. We then apply an enhanced U-shape module [24], an effective architecture for semantic segmentation in computer vision. As shown in Fig. 2, each node of the module is a convolution block; the nodes are connected by nested dense convolution blocks, with downward arrows indicating down-sampling, upward arrows indicating up-sampling, and dotted arrows indicating skip connections. Overall, the enhanced U-shape module combines U-Nets [25] of different depths into a unified architecture: the original long skip connections are removed, every two adjacent nodes are connected with a short skip connection so that the deeper decoder sends supervisory signals to the shallower decoder, and the skip connections are densely connected, letting features propagate densely along them and giving the decoder nodes more flexible feature fusion. Finally, we combine an encoder with the enhanced U-shape module to capture the local-to-global interdependency information $Y$:

$$x^{i,j} = \begin{cases} \sigma\big(D(x^{i-1,j})\big), & j = 0 \\ \sigma\big(\big[[x^{i,k}]_{k=0}^{j-1},\, U(x^{i+1,j-1})\big]\big), & j > 0 \end{cases} \qquad Y = \mathrm{UPlus}(W_u M) \tag{4}$$

where $x^{i,j}$ denotes the output of the convolution block (a circle in the figure), $D$ denotes a down-sampling layer, $U$ denotes an up-sampling layer, $\mathrm{UPlus}$ denotes the entire module, $W_u$ is a learnable weight matrix, and $Y \in \mathbb{R}^{N \times N \times D'}$ is the processed entity-level interaction matrix. Horizontally, the structure combines local and global information at the same scale; vertically, it attends to local-to-global information across different scales. This multi-level feature aggregation fuses the local information around the entities with the global information of the document, strengthening the entity-to-entity connections and thus the accuracy of relation classification.
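To make the data flow concrete, here is a heavily simplified stand-in for the module described above: it scatters the pairwise context embeddings into the N × N entity-pair matrix and passes it through a tiny two-level encoder–decoder with one skip connection. Shapes and names are assumptions for illustration; the actual module follows the nested UNet++-style design of [24].

```python
import torch
import torch.nn as nn

class TinyUShape(nn.Module):
    """Simplified U-shaped block over the entity-level relation matrix (assumes even N, e.g. N = 48)."""
    def __init__(self, channels):
        super().__init__()
        self.down = nn.Conv2d(channels, channels, kernel_size=3, stride=2, padding=1)
        self.up = nn.ConvTranspose2d(channels, channels, kernel_size=2, stride=2)
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=3, padding=1)

    def forward(self, m):                                  # m: [B, D, N, N]
        coarse = torch.relu(self.down(m))                  # coarser, more global view
        restored = torch.relu(self.up(coarse))             # back to N x N resolution
        return self.fuse(torch.cat([m, restored], dim=1))  # fuse local and global information

def build_relation_matrix(context, pairs, N, D):
    """Scatter pairwise context embeddings c_(s,o) into an N x N x D matrix M."""
    M = torch.zeros(N, N, D)
    M[pairs[:, 0], pairs[:, 1]] = context                  # context: [num_pairs, D]
    return M
```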

3.4 Relation Classification Module

Given the entity-level relation matrix $M$ and the initial document representation $H$, we map the entities to hidden states $z_s$ and $z_o$ with a linear layer followed by a non-linear activation, and then compute the probability of relation $r$ with a bilinear function and sigmoid activation:

$$z_s = \tanh\big(W_s h_{e_s} + Y^{(s,o)}\big), \quad z_o = \tanh\big(W_o h_{e_o} + Y^{(s,o)}\big), \quad P(r \mid e_s, e_o) = \sigma\big(z_s^{\top} W_r z_o + b_r\big) \tag{5}$$

where $Y^{(s,o)}$ is the embedding of the entity pair $(e_s, e_o)$ in $Y$, and $W_s, W_o, W_r \in \mathbb{R}^{d \times d}$ and $b_r \in \mathbb{R}$ are learned weights and biases. To minimize the number of bilinear
classifier parameters, we use group bilinear [26], which splits the embedding dimensions into $k$ equal-sized groups and applies the bilinear form within each group:

$$z_s = [z_s^1; \ldots; z_s^k], \quad z_o = [z_o^1; \ldots; z_o^k], \quad P(r \mid e_s, e_o) = \sigma\Big(\sum_{i=1}^{k} z_s^{i\top} W_r^i z_o^i + b_r\Big) \tag{6}$$

where $P(r \mid e_s, e_o)$ is the probability that relation $r$ holds for the entity pair $(e_s, e_o)$. In this way, the number of classifier parameters is reduced by a factor of $k$, from $d^2$ to $d^2/k$. Previous work [20] observed an imbalanced relation distribution in RE: some entity pairs have multiple relations while most entity pairs have none. We therefore employ a balanced softmax loss [27] during training. Specifically, we introduce a special balanced label $bl$ and require the probability $P_{pos}$ of every positive label to be higher than the probability $P_{bl}$ of the balanced label, and the probability $P_{neg}$ of every negative label to be lower than $P_{bl}$:

$$\mathcal{L}_1 = \log\Big(e^{P_{bl}} + \sum_{r \in \{pos\}} e^{P_r}\Big), \qquad \mathcal{L}_2 = \log\Big(e^{-P_{bl}} + \sum_{r \in \{neg\}} e^{P_r}\Big), \qquad \mathcal{L} = \mathcal{L}_1 + \mathcal{L}_2 \tag{7}$$

where $\mathcal{L}_1$ and $\mathcal{L}_2$ denote the losses on the positive-label and negative-label predictions, respectively, and $\{pos\}$, $\{neg\}$ denote the sets of positive and negative labels. $\mathcal{L}_1$ pushes the logits of all positive labels above the $bl$ label, $\mathcal{L}_2$ pulls the logits of all negative labels below the $bl$ label, and the two parts are simply summed into the total loss.
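A small PyTorch sketch of the group-bilinear classifier in Eq. (6); the hidden size, group count, and relation count are illustrative assumptions rather than the paper's exact settings.

```python
import torch
import torch.nn as nn

class GroupBilinearClassifier(nn.Module):
    """Group bilinear scoring: split d into k groups and sum per-group bilinear forms."""
    def __init__(self, d=768, k=64, num_relations=97):
        super().__init__()
        assert d % k == 0
        self.k, self.block = k, d // k
        self.bilinear = nn.Linear(d * self.block, num_relations)  # realizes the k blocks W_r^i jointly

    def forward(self, z_s, z_o):                 # z_s, z_o: [num_pairs, d]
        b = z_s.size(0)
        zs = z_s.view(b, self.k, self.block)     # [num_pairs, k, d/k]
        zo = z_o.view(b, self.k, self.block)
        # Outer product within each group, then flatten: equivalent to sum_i zs_i^T W_r^i zo_i.
        pair = torch.einsum("bki,bkj->bkij", zs, zo).reshape(b, -1)
        return self.bilinear(pair)               # relation logits, passed through sigmoid for P(r | e_s, e_o)
```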

4 Experiments

4.1 Datasets

Our model ML2FNet is evaluated on three public document-level RE datasets, whose statistics are shown in Table 1.
• DocRED [3] is a benchmark for document-level RE, built from Wikipedia through large-scale crowdsourcing. It contains 3053, 1000, and 1000 articles for training, validation, and testing, respectively, and 97 relation types. DocRED is complex in terms of entity types, relation types, inference types, and inter-sentence relations, which makes it a demanding test for RE models.

Table 1. Statistics of the experimental datasets.

Statistics/Datasets          DocRED   CDR   GDA
# Train                      3053     500   23353
# Dev                        1000     500   5839
# Test                       1000     500   1000
# Relation types             97       2     2
Avg. # mentions per Ent      1.4      2.7   3.3
Avg. # entities per Doc      19.5     7.6   5.4
Avg. # sentences per Doc     8.0      9.7   10.2

• CDR [5] is a chemical-disease RE dataset in biomedicine, constructed from PubMed abstracts. It contains 1,500 human-annotated documents, divided equally into training, validation, and test sets. CDR is a binary classification task used in biomedical research to identify induced relations between chemical and disease entities, which is important for the study of medical texts.
• GDA [6] is a large biomedical dataset of 29,192 articles whose task is to predict binary interactions between genes and disease concepts. Following [15], we split the training set into an 80/20 training and development split.

4.2 Implementation Details

We implemented ML2FNet with the PyTorch version of Huggingface Transformers [28]. We use cased BERT [29] as the encoder on DocRED and SciBERT [30] on CDR and GDA. Training uses the Apex library (https://github.com/NVIDIA/apex). The model is optimized with AdamW [31] with a learning rate of 2e-5, using a linear warmup [32] over the first 6% of steps followed by a linear decay to 0. We apply dropout [33] with rate 0.1 between layers and clip the gradients of model parameters to a maximum norm of 1.0. We perform early stopping based on the F1 score on the development set and set the entity-level relation matrix size to N = 48. All training and testing were completed on a machine with an Intel Xeon(R) Gold 6230R CPU @ 2.10 GHz, 128 GB of memory, an NVIDIA GeForce RTX 3090 GPU, and Windows 10 Professional. Training with a BERT-based encoder takes roughly 2 h 45 min on DocRED; training with the SciBERT encoder takes 20 min on CDR and 1 h 10 min on GDA.

4.3 Main Result

On the benchmark dataset DocRED, we compare ML2FNet with twelve baseline models; the results are shown in Table 2. Sequence-based methods include CNN [3] and BiLSTM [3]. Graph-based models include AGGCN [34],
LSR [16], GLRE [35], GAIN [17], and HeterGSAN [36]. Transformer-based models include Two-Step [20], HIN [21], Coref [37], SSAN [19], and ATLOP [22]. Following [3], we evaluate with the Ign F1 and F1 metrics, where Ign F1 denotes the F1 score excluding the relational facts shared by the training and dev/test sets.

Table 2. Experimental results (%) on the dev and test sets of the DocRED dataset. The best scores are in bold.

Model                        Dev Ign F1   Dev F1   Test Ign F1   Test F1
Sequence-based Models
CNN [3]                      41.58        43.45    40.33         42.26
BiLSTM [3]                   48.87        50.94    48.78         51.06
Graph-based Models
AGGCN [34]                   46.29        52.47    48.89         51.45
LSR-BERTbase [16]            52.43        59.00    56.97         59.05
GLRE-BERTbase [35]           52.43        59.00    56.97         59.05
GAIN-BERTbase [17]           52.43        59.00    56.97         59.05
HeterGSAN-BERTbase [36]      52.43        59.00    56.97         59.05
Transformer-based Models
Two-Step-BERTbase [20]       –            54.42    –             53.92
HIN-BERTbase [21]            54.29        56.31    53.70         55.60
Coref-BERTbase [37]          55.32        57.51    54.54         56.96
SSAN-BERTbase [19]           57.03        59.19    55.84         58.16
ATLOP-BERTbase [22]          59.22        61.09    59.31         61.30
Our Model
ML2FNet-BERTbase             59.58        61.58    59.72         61.64

As shown in Table 2, ML2FNet outperforms the baselines on all metrics, with clear improvements in both F1 and Ign F1 over all three categories of models. Among the baselines, SSAN [19] and ATLOP [22] are the two most closely related to our model. Compared with SSAN-BERTbase, ML2FNet-BERTbase improves Ign F1 by about 2.55% and F1 by 2.39% on the dev set; compared with ATLOP-BERTbase, it brings a further gain of about 0.36% Ign F1 and 0.49% F1, which demonstrates the efficacy of our approach.

4.4 Results on Biomedical Datasets

Table 3 reports the results of ML2FNet on the two biomedical datasets, together with a comparison against eight baseline models. These baseline models
include CNN [38] and BRAN [7], sequence-based models that use a CNN and an attention network as encoders; EoG [15], LSR [16], DHG [39], and GLRE [35], which are graph-based models; and SSAN [19] and ATLOP [22], which are transformer-based models. ML2FNet-SciBERTbase sets new state-of-the-art results on both datasets and is clearly better than the previous best method, ATLOP-SciBERTbase, improving the test F1 scores on CDR and GDA by 14.3% and 2.4%, to 83.5% and 86.3%, respectively.

Table 3. Test F1 score (%) on the CDR and GDA datasets.

Model                         CDR    GDA
Sequence-based Models
BRAN [7]                      62.1   –
CNN [38]                      62.3   –
Graph-based Models
EoG [15]                      63.6   81.5
LSR-SciBERTbase [16]          64.8   82.2
DHG-BERTbase [39]             65.9   83.1
GLRE-SciBERTbase [35]         68.5   –
Transformer-based Models
SSAN-SciBERTbase [19]         68.7   83.7
ATLOP-SciBERTbase [22]        69.2   83.9
Our Model
ML2FNet-SciBERTbase           83.5   86.3

4.5 Ablation Study and Analysis

Table 4. Ablation study of ML2FNet on DocRED.

Our Model                         Ign F1   F1
w/ Enhanced U-shape module        59.58    61.58
w/o Enhanced U-shape module       57.89    59.98

We conducted ablation experiments to verify the effectiveness of ML2FNet; the results are shown in Table 4. After removing the multi-level feature fusion module, performance drops by 1.69% Ign F1 and 1.60% F1, illustrating the benefit of the module.

Fig. 3. Dev F1 scores on DocRED for documents with different numbers of entities and triples. (a) and (c): documents with different numbers of entities; (b) and (d): documents with different numbers of triples.

ML2FNet is proposed to fuse local and global information and to strengthen the global dependencies among entity pairs and triples. To verify this, we analyze performance with respect to the number of entities per document and the number of triples per document; the results are shown in Fig. 3. In Sub-Fig. 3(a) and 3(c), as the number of entities increases, the model with the enhanced U-shaped module always outperforms the model without it, and its advantage keeps growing: the performance of the model without the module degrades markedly with more entities, while the model with the module learns the interdependency among the many entities and thus maintains stronger entity representations. In Sub-Fig. 3(b) and 3(d), the model with the enhanced U-shaped module is likewise consistently superior. The performance of the model without the module begins to decline once a document contains more than 30 triples, whereas the model with the module still performs well. Intuitively, the more triples a document contains, the harder reasoning becomes; however, the enhanced U-shaped module captures the global dependencies among relational triples, and the more triples there are, the richer and tighter these interdependencies become. This explains why the model with the module performs well even with many triples, and it fully demonstrates that capturing the global dependencies among triples plays a crucial role in reasoning about inter-sentence relations.

4.6 Case Study

Fig. 4. Two case studies from the DocRED dev set, showing the triples predicted by ATLOP and by our ML2FNet. Entities are colored accordingly.

We compared ML2FNet with the ATLOP model on several cases from the dev set; the results are shown in Fig. 4. In Case one, the first two regular triples are identified by both ATLOP and ML2FNet, while the third, inter-sentence triple has to be inferred by establishing dependencies through the other two. ML2FNet captures the global dependencies among the triples and reasons out the remaining triple from the identified ones, effectively addressing the problem of inferring inter-sentence triples. In Case two, inferring the triple requires background information and commonsense knowledge: only external commonsense knowledge tells us that South America is a continent and São Paulo is a city, which is useful for determining their relation. Such reasoning cases are common in practice; we believe this kind of problem can be addressed by incorporating external knowledge, which we leave as future work.

5 Conclusion

In this paper, we introduce an enhanced U-shape module and propose a multi-level feature fusion network (ML2FNet) for document-level relation extraction. Our approach first establishes the interactions between entities by constructing an entity-level relation matrix, and then improves the model's ability to extract inter-sentence relations by capturing the local-to-global dependencies of the relational triples through the enhanced U-shape module. Experiments on three public datasets demonstrate that ML2FNet significantly outperforms existing baselines and achieves new state-of-the-art results.

Acknowledgements. This work was supported by the Anhui Provincial Science Foundation (No. 1908085MF202) and the Independent Scientific Research Program of the National University of Defense Science and Technology (No. ZK18-03-14).

References 1. Wang, L., Cao, Z., de Melo, G., Liu, Z.: Relation classification via multi-level attention CNNs. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1298–1307 (2016) 2. Wei, Z., Su, J., Wang, Y., Tian, Y., Chang, Y.: A novel cascade binary tagging framework for relational triple extraction. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1476–1488 (2020) 3. Yao, Y., et al.: DocRED: a large-scale document-level relation extraction dataset. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 764–777 (2019) 4. Gardent, C., Shimorina, A., Narayan, S., Perez-Beltrachini, L.: Creating training corpora for NLG micro-planning. In: 55th Annual Meeting of the Association for Computational Linguistics, pp. 179–188 (2017) 5. Li, J., et al.: Biocreative v CDR task corpus: a resource for chemical disease relation extraction. Database: J. Biol. Datab. Curat. (2016) 6. Wu, Y., Luo, R., Leung, H.C.M., Ting, H.F., Lam, T.W.: Renet: a deep learning approach for extracting gene-disease associations from literature. In: Research in Computational Molecular Biology, pp. 272–284 (2019) 7. Verga, P., Strubell, E., McCallum, A.: Simultaneously self-attending to all mentions for full-abstract biological relation extraction. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, pp. 872–884 (2018) 8. Pantel, P., Pennacchiotti, M.: Espresso: leveraging generic patterns for automatically harvesting semantic relations. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pp. 113–120 (2006) 9. Mintz, M.D., Bills, S., Snow, R., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: Annual Meeting of the Association for Computational Linguistics, pp. 1003–1011 (2009) 10. Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J.: Relation classification via convolutional deep neural network. In: International Conference on Computational Linguistics, pp. 2335–2344 (2014)

11. Zhou, P., et al.: Attention-based bidirectional long short-term memory networks for relation classification. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 207–212 (2016) 12. Vaswani, A., et al.: Attention is all you need. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS 2017), pp. 6000–6010 (2017) 13. Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45, 2673–2681 (1997) 14. Quirk, C., Poon, H.: Distant supervision for relation extraction beyond the sentence boundary. In: Conference of the European Chapter of the Association for Computational Linguistics, pp. 1003–1011 (2016) 15. Christopoulou, F., Miwa, M., Ananiadou, S.: Connecting the dots: document-level neural relation extraction with edge-oriented graphs. In: Conference on Empirical Methods in Natural Language Processing, pp. 4925–4936 (2019) 16. Nan, G., Guo, Z., Sekulic, I., Lu, W.: Reasoning with latent structure refinement for document-level relation extraction. In: Annual Meeting of the Association for Computational Linguistics, pp. 1546–1557 (2020) 17. Zeng, S., Xu, R., Chang, B., Li, L.: Double graph based reasoning for documentlevel relation extraction. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pp. 1630–1640 (2020) 18. Scarselli, F., Gori, M., Tsoi, A.C., Hagenbuchner, M., Monfardini, G.: The graph neural network model. IEEE Trans. Neural Netw. 20, 61–80 (2009) 19. Xu, B., Wang, Q., Lyu, Y., Zhu, Y., Mao, Z.: Entity structure within and throughout: modeling mention dependencies for document-level relation extraction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 14149– 14157 (2021) 20. Wang, H., Focke, C., Sylvester, R., Mishra, N., Wang, W.: Fine-tune Bert for DocRED with Two-step Process. arXiv preprint arXiv:1909.11898 (2019) 21. Tang, H., et al.: HIN: hierarchical inference network for document-level relation extraction. Adv. Knowl. Discov. Data Mining 12084, 197–209 (2020) 22. Zhou, W., Huang, K., Ma, T., Huang, J.: Document-level relation extraction with adaptive thresholding and localized context pooling. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 14612–14620 (2021) 23. Peng, H., et al.: Learning from context or names? An empirical study on neural relation extraction. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pp. 3661–3672 (2020) 24. Zhou, Z., Siddiquee, M.M.R., Tajbakhsh, N., Liang, J.: Unet++: redesigning skip connections to exploit multiscale features in image segmentation. IEEE Trans. Med. Imaging 39(6), 1856–1867 (2020) 25. Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer-Assisted Intervention, pp. 234–241 (2015) 26. Zheng, H., Fu, J., Zha, Z.J., Luo, J.: Learning deep bilinear transformation for fine-grained image representation. In: Advances in Neural Information Processing Systems, vol. 32, pp. 4279–4288 (2019) 27. Zhang, N., et al.: Document-level relation extraction as semantic segmentation. In: IJCAI, pp. 3999–4006 (2021) 28. Wolf, T., et al.: Transformers: state-of-the-art natural language processing. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 38–45 (2020)

29. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, pp. 4171–4186 (2019) 30. Beltagy, I., Lo, K., Cohan, A.: Scibert: pretrained language model for scientific text. In: EMNLP, pp. 3615–3620 (2019) 31. Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. In: ICLR (2017) 32. Goyal, P., et al.: Accurate, large minibatch SGD: training imagenet in 1 hour. arXiv preprint arXiv:1706.02677 (2017) 33. Srivastava, N., Hinton, G.E., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014) 34. Guo, Z., Zhang, Y., Lu, W.: Attention guided graph convolutional networks for relation extraction. In: ACL, pp. 241–251 (2019) 35. Wang, D., Hu, W., Cao, E., Sun, W.: Global-to-local neural networks for documentlevel relation extraction. In: EMNLP, pp. 3711–3721 (2020) 36. Xu, W., Chen, K., Zhao, T.: Document-level relation extraction with reconstruction. In: The Thirty-Fifth AAAI Conference on Artificial Intelligence (2021) 37. Ye, D., Lin, Y., Du, J., Liu, Z., Sun, M., Liu, Z.: Coreferential reasoning learning for language representation. In: EMNLP, pp. 7170–7186 (2020) 38. Nguyen, D.Q., Verspoor, K.M.: Convolutional neural networks for chemical-disease relation extraction are improved with character-based word embeddings. In: Proceedings of the BioNLP 2018 Workshop, pp. 129–136 (2018) 39. Zhang, Z., et al.: Document-level relation extraction with dual-tier heterogeneous graph. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 1630–1641 (2020)

Implicit Clothed Human Reconstruction Based on Self-attention and SDF

Li Yao, Ao Gao, and Yan Wan(B)

School of Computer Science and Technology, Donghua University, Shanghai 201620, China
[email protected]

Abstract. Recently, implicit function-based approaches have advanced 3D human reconstruction from a single-view image. However, previous methods suffer from problems such as artifacts, broken limbs, and loss of surface details when dealing with challenging poses. To address these issues, this paper proposes a novel neural network model based on PaMIR. First, the Signed Distance Function (SDF) is introduced to define the implicit function, which improves the generalization ability of the model. Second, a feature volume encoding network with a self-attention mechanism is designed to extract voxel-aligned features and provide richer geometric information, further improving the accuracy of the shape topology. On the CAPE dataset, our method reduces the Chamfer and Point-to-Surface distances by 50.9% and 30.6% relative to PaMIR, and reduces normal estimation errors by 18.2%.

Keywords: Single-view human reconstruction · Implicit Function · Self-attention · SDF

1 Introduction

Recovering the 3D surface of a clothed human body from a single 2D image is a critical task in computer vision, with applications in 3D games, VR/AR, autonomous driving, motion capture, and virtual fitting. Owing to the rapid progress of deep learning, recent methods [1,2] focus on learning-based approaches for reconstructing a 3D human body model from a single-view image, among which implicit function-based approaches have attracted considerable attention. In 2019, Saito et al. [1] proposed PIFu, which defines an implicit function using pixel-aligned features and depth-Z features to predict the continuous occupancy field of query points and extracts 3D meshes with Marching Cubes [3]. However, due to the inherent depth ambiguity of monocular reconstruction, the mesh surfaces PIFu reconstructs exhibit substantial shape artifacts on natural in-the-wild images. The most notable recent implicit model is PaMIR, proposed by Zheng et al. [4], which introduces the parametric model SMPL [5] as a shape proxy and mitigates the depth ambiguity issue
with voxel-aligned features. The latest implicit function-based method, SuRS [6], attempts to predict high-resolution meshes from low-resolution outdoor images in a refinement manner with a two-level multi-layer perceptron (MLP). However, these methods still suffer from shape artifacts and broken limbs under challenging poses. To address these issues, we propose an improved deep-learning implicit model for 3D clothed human reconstruction based on PaMIR. First, we tackle depth ambiguity by introducing the Signed Distance Function (SDF) and combining it with pixel-aligned and voxel-aligned features to jointly encode the query points; the SDF describes the signed distance of points in 3D space, defines the geometric shape of objects, and improves the robustness of the reconstruction. Second, we introduce a self-attention mechanism and design a feature volume encoding network to extract rich geometric information from SMPL. These two improvements together address the problem of depth ambiguity. In addition, recent works [2,7] have shown that normal maps capture local details and produce more realistic clothing folds on the front and back of the body, so we also take normal maps as additional inputs to improve clothing-fold details. In summary, this paper contributes as follows: (1) We introduce the SDF for query-point encoding and define an implicit function based on it, which effectively addresses the depth ambiguity of monocular views. (2) Based on the self-attention mechanism, we design a feature volume encoding network to extract voxel-aligned features, further reducing shape topology errors.

2 Related Work

2.1 Single-View Clothed Human Reconstruction

Many methods have adopted different representations to reconstruct clothed human meshes. Voxels are an effective means of describing geometric objects in 3D space: DeepHuman [8] predicts human voxels from images with a multi-scale image and volume estimation network. Although it can model human poses and clothing, voxel-based methods yield low-resolution models because of their high memory requirements and cannot effectively express the detailed surface of the 3D human body. Implicit functions, in contrast, have received significant attention due to their small memory footprint and high expressive power. The implicit function-based PIFu [1] encodes query points with pixel-aligned features and depth z-values but performs well only for standing postures. PIFuHD [2] increases the input image resolution and introduces normal maps to enhance details. However, these methods exhibit noisy artifacts due to the lack of geometric regularization. To overcome these issues, subsequent methods started to incorporate geometric information. Geo-PIFu [9] estimates 3D voxels from 2D images and then
uses voxel-aligned and pixel-aligned features to estimate query-point occupancy. Unlike Geo-PIFu, PaMIR introduces the parametric model SMPL as a geometric prior, which provides more robust voxel-aligned features. These methods supply geometric information to the implicit function through additional voxel-aligned features, but because of hardware limitations the voxel resolution they use is low, so the geometric information is not fully exploited. Other methods avoid low-resolution voxels altogether: ARCH [10] and ARCH++ [11] transform query points from canonical space to posed space using SMPL and generate clothing details with skinning weights. Some methods instead enhance the expressive power of the implicit function with additional image features: IntegratedPIFu [7] feeds normal maps, depth maps, and human parsing maps into the implicit function, and StereoPIFu [12] extends the recovery of 3D human surfaces to multiple views, although multi-view capture requires camera registration and is therefore less practical outdoors. In summary, these methods either lack geometric information in the features they use or cannot capture features well enough, leading to poor generalization in the wild. To address these issues, this paper introduces the SDF to provide sufficient geometric information to the implicit function, and the volume feature encoding module captures voxel-aligned features from the parametric SMPL model with a self-attention mechanism, improving generalization in outdoor conditions.

2.2 Self-attention for Reconstruction

Recently, attention mechanisms have received growing interest in computer vision because of their ability to capture global relationships within the target. Bhattacharyya et al. [13] proposed SA-Det3D for 3D object detection by combining self-attention features with convolutional features. Li et al. [14] introduced a Transformer and used self-attention to design a feature extractor for point cloud classification. Li et al. [15] proposed 3D-VRVT, which uses self-attention to reconstruct 3D voxels from a single image. Liu et al. [16] developed a model combining convolutional neural networks with self-attention to reconstruct 3D faces, and a recent study [17] employed a Transformer to reconstruct the human SMPL model. These works reveal the important role of self-attention in 3D vision; however, its application to implicit clothed human reconstruction has not yet been explored.

3 Methodology

3.1 Overview

The network architecture of our method is shown in Fig. 1. Although PaMIR attempts to provide voxelized SMPL as a geometric prior for the implicit function,

Fig. 1. The overall network architecture of this article. Given an image, we first predict the front and back normal maps and the SMPL model respectively. Then, we concatenate the front and back normal maps with the initial image and input them into the 2D Stacked HourGlass [18] to obtain the 2D feature maps. Additionally, we voxelize the SMPL model and input it into the 3D volume encoding network to obtain the 3D feature volume. Subsequently, the query points are projected onto the 2D feature maps and 3D feature volume, to obtain pixel-aligned features and voxel-aligned features respectively. Furthermore, the query points also calculate the SDF from the SMPL model. Finally, the three features are concatenated along the channel dimension and fed into the MLP to predict the occupancy values.

the voxel feature resolution is low (32 × 32 × 32) due to hardware limitations. We argue that the geometric prior provided by PaMIR is therefore coarse, which leads to poor reconstruction in the wild, while aggressively increasing the voxel resolution would cause memory to explode. We also observe that a self-attention mechanism can capture correlations between features and extract richer information than ordinary convolutional layers. We therefore make four improvements over PaMIR: (1) we estimate the parametric SMPL model from the image with PyMAF [19], which is more accurate in outdoor conditions than the GCMR [20] used by PaMIR; (2) we predict the front and back normal maps of the body with the pix2pixHD network [21] as additional inputs; (3) we introduce the SDF and combine it with pixel-aligned and voxel-aligned features to define the implicit function; and (4) we introduce the self-attention mechanism to design a 3D volume encoding network that obtains richer geometric information. This network consists of a convolutional layer, a BN layer, a ReLU layer, a residual block, and a 3D self-attention block; the structure is repeated twice to output the 3D feature volume.

3.2 Implicit Function

Implicit functions take the features of query points as input and output an occupancy value indicating whether the point lies inside or outside the surface, thereby implicitly modeling a 3D object. In this paper, the features of each query point $P \in \mathbb{R}^3$ consist of three parts: a pixel-aligned feature, a voxel-aligned feature, and the SDF. The implicit function takes these three features as input and outputs
an occupancy value in the range [0, 1]. The implicit function in this paper is formulated as:

$$f(F_I, F_V, F_{SDF}) \rightarrow [0, 1] \tag{1}$$

where $f$ is the implicit function represented by a multi-layer perceptron (MLP), $F_I$ is the pixel-aligned feature, $F_V$ is the voxel-aligned feature, and $F_{SDF}$ is the signed distance from a query point $P$ to the nearest point of the parametric SMPL model, denoted $\mathcal{M}(\beta, \theta)$.

Pixel-Aligned Feature $F_I$. Front and back normal maps provide the model with rich front- and back-side folds [2]. We therefore predict the front and back normal maps and feed them to the network as additional inputs for the pixel-aligned feature:

$$F_I = B\big(\pi(P), g_I(I, N_f, N_b)\big) \tag{2}$$

where $I$, $N_f$, $N_b$ are the RGB image and the front- and back-side normal maps, each at a resolution of 512 × 512. The pixel-aligned feature $F_I$ is obtained from the 2D feature map $g_I(I, N_f, N_b)$ by bilinear interpolation $B(\cdot,\cdot)$ at the query point $P$, where $\pi$ denotes the orthographic projection.

Voxel-Aligned Feature $F_V$. To exploit the robustness of parametric models, we obtain voxel-aligned features from the parametric SMPL model $\mathcal{M}$:

$$F_V = T\big(P, g_V(V)\big) \tag{3}$$

where $V$ denotes the feature volume obtained from $\mathcal{M}$. The voxel-aligned feature $F_V$ is obtained from the 3D feature volume $g_V(V)$ by trilinear interpolation $T(\cdot,\cdot)$ at the query point $P$.

SDF $F_{SDF}$. We define $F_{SDF}$ as the signed distance between a query point and the closest point of the parametric SMPL model $\mathcal{M}$:

$$F_{SDF} = \mathrm{sign} \cdot \lVert P - Q \rVert_2 \tag{4}$$

where $\lVert \cdot \rVert_2$ is the distance from the query point $P$ to the closest body point $Q \in \mathcal{M}$, and the sign is positive if $P$ lies inside the SMPL mesh $\mathcal{M}$ and negative otherwise.
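The following PyTorch sketch shows one way to gather the three per-point features of Eqs. (1)–(4) before feeding them to the MLP. Tensor names, shapes, and the assumption that the SDF values are precomputed are illustrative, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def query_point_features(points, feat_2d, feat_3d, sdf):
    """Assemble the per-point input of the implicit MLP.
    points:  [B, N, 3] query points in normalized [-1, 1] coordinates.
    feat_2d: [B, C2, H, W] image feature map g_I(I, N_f, N_b).
    feat_3d: [B, C3, D, D, D] SMPL feature volume g_V(V).
    sdf:     [B, N, 1] signed distances to the SMPL surface (F_SDF)."""
    # Pixel-aligned feature: bilinear sampling at the orthographic projection (x, y).
    grid_2d = points[..., :2].unsqueeze(2)                       # [B, N, 1, 2]
    f_pix = F.grid_sample(feat_2d, grid_2d, align_corners=True)  # [B, C2, N, 1]
    f_pix = f_pix.squeeze(-1).permute(0, 2, 1)                   # [B, N, C2]

    # Voxel-aligned feature: trilinear sampling at (x, y, z).
    grid_3d = points.unsqueeze(2).unsqueeze(3)                   # [B, N, 1, 1, 3]
    f_vox = F.grid_sample(feat_3d, grid_3d, align_corners=True)  # [B, C3, N, 1, 1]
    f_vox = f_vox.squeeze(-1).squeeze(-1).permute(0, 2, 1)       # [B, N, C3]

    return torch.cat([f_pix, f_vox, sdf], dim=-1)                # input to f(F_I, F_V, F_SDF)
```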

3.3 Self-attention Block

As mentioned in Sect. 3.1, although PaMIR incorporates SMPL as a shape proxy in the implicit function and is therefore more robust than PIFu, the resolution of the SMPL feature volume it uses is low. This limits the fine geometric information available and prevents reasonable reconstructions in challenging poses. To reconstruct realistic human meshes, not
only an accurate shape proxy is needed, but also richer geometric features. We therefore propose an SMPL volume feature encoding network with a self-attention mechanism to extract voxel-aligned features. By leveraging the ability of attention to capture relationships between features, we enhance the geometric representation of the voxel-aligned features, enabling the implicit function to predict the occupancy of query points more accurately.

Fig. 2. The self-attention mechanism module structure.

The structure of the self-attention module is shown in Fig. 2. Given the volume feature $I$, we first feed it to two 3D convolutional layers (with a convolution kernel size of 3 × 3 and padding = 1), obtaining voxel features $V$ and $Q$; note that $I$, $V$, and $Q$ share the same resolution and channel dimensions. We then apply the sigmoid function to $V$ to generate the attention map $A$:

$$A = \frac{1}{1 + e^{-V}} \tag{5}$$

Given the attention map $A$, we multiply it with $Q$ to obtain the output voxel feature $O$, which has the same dimensions as $I$:

$$O = A \times Q \tag{6}$$
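A minimal PyTorch sketch of this sigmoid-gated 3D attention block (Eqs. 5–6); the class name and channel handling are assumptions for illustration.

```python
import torch
import torch.nn as nn

class VolumeSelfAttention(nn.Module):
    """Sigmoid-gated 3D attention: O = sigmoid(conv_V(I)) * conv_Q(I)."""
    def __init__(self, channels):
        super().__init__()
        self.conv_v = nn.Conv3d(channels, channels, kernel_size=3, padding=1)
        self.conv_q = nn.Conv3d(channels, channels, kernel_size=3, padding=1)

    def forward(self, feat):                 # feat: [B, C, D, H, W] SMPL feature volume I
        v = self.conv_v(feat)                # voxel feature V
        q = self.conv_q(feat)                # voxel feature Q
        attn = torch.sigmoid(v)              # attention map A (Eq. 5)
        return attn * q                      # gated output O, same shape as the input (Eq. 6)
```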

3.4 Loss Function

Our method can be divided into two parts: normal maps estimation and implicit function reconstruction. We set different loss functions to supervise the training of these two parts, which is the basis of the effectiveness of our method.

Normal Maps Estimation Loss $\mathcal{L}_N$. Following PIFuHD [2], the front and back normal map prediction networks are trained with an extended L1 loss:

$$\mathcal{L}_N = \mathcal{L}_{l1} + \lambda \mathcal{L}_{VGG} \tag{7}$$

where $\mathcal{L}_{l1}$ is an L1 loss between the predicted and ground-truth normals, $\mathcal{L}_{VGG}$ is the perceptual loss proposed by Johnson et al. [22], and $\lambda$ balances the two terms (set to 5.0 in our experiments).

Implicit Function Reconstruction Loss $\mathcal{L}_R$. The implicit function predicts a binary occupancy value for each 3D query point, so we supervise its reconstruction with an extended BCE loss:

$$\mathcal{L}_R = \sum_{P \in S} \Big[ \hat{f}(P)\log f(P) + \big(1 - \hat{f}(P)\big)\log\big(1 - f(P)\big) \Big] \tag{8}$$

where $S$ is the set of sampled points, $f(\cdot) \in [0,1]$ is the predicted occupancy, and $\hat{f}(\cdot)$ is the ground-truth occupancy value of the query point $P$.
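A short sketch of the two supervision signals above. The `vgg_features` extractor is an assumed frozen helper (e.g., a truncated VGG network), and `binary_cross_entropy` is the standard negated, averaged form of Eq. (8).

```python
import torch.nn.functional as F

def normal_map_loss(pred, gt, vgg_features, lam=5.0):
    """Extended L1 loss for normal maps (Eq. 7): L1 plus lambda * perceptual loss."""
    l1 = F.l1_loss(pred, gt)
    perceptual = F.l1_loss(vgg_features(pred), vgg_features(gt))
    return l1 + lam * perceptual

def occupancy_loss(pred_occ, gt_occ):
    """BCE over sampled query points (Eq. 8). pred_occ, gt_occ: [B, N] in [0, 1]."""
    return F.binary_cross_entropy(pred_occ, gt_occ)
```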

4 Experiments

4.1 Datasets and Evaluation Metrics

Datasets. Our method is trained mainly on the THUman2.0 dataset [8] and tested on the CAPE dataset [23]. THUman2.0 consists of 526 high-quality 3D clothed human scans in various poses, captured by a dense DSLR rig; we use the first 500 models for training and the last 26 as a supplementary test set. CAPE is a public dataset generated from the SMPL body model using vertex offsets and is widely used in 3D clothing capture and human body reconstruction; we use 150 of its models as the primary test set.

Metrics. We report Chamfer Distance (CD), Point-to-Surface distance (P2S), and normal estimation error (Normal). P2S measures the distance between the ground-truth scan points and the closest reconstructed surface points. CD and P2S assess the accuracy of the reconstructed geometry, while Normal evaluates high-frequency geometric detail.

4.2 Implementation Details

When extracting pixel-aligned features, the normal maps and initial images are maintained at a resolution of 512 × 512. The 2D image feature output has a total

Table 1. Ablation experiments on the four improvement points of this paper. (P = PyMAF, N = normal maps, S = SDF, A = self-attention)

Model                    CD↓     P2S↓    Normal↓
(1) PaMIR                2.122   1.495   0.088
(2) PaMIR+P              1.440   1.253   0.099
(3) PaMIR+P+N            1.581   1.228   0.104
(4) PaMIR+P+N+S          1.146   1.129   0.081
(5) PaMIR+P+N+S+A        1.041   1.037   0.072

Fig. 3. Ablation experiment on introducing normal maps. With normal maps, the facial expressions and clothing folds are significantly improved.

of two layers with a resolution of [6, 128, 128] and [6, 128, 128], respectively. The volume encoding network takes the SMPL feature volume as input at a resolution of 128 × 128 × 128, and its voxel feature output also has two layers, with dimensions [8, 32, 32, 32] and [8, 32, 32, 32]. During training, losses are computed on the outputs of both the stacked hourglass network and the volume encoding network; during inference, only the output of the final layer is used. Based on the above description, we obtain pixel-aligned features with channel numbers of 6 and 8, voxel-aligned features, and a channel number of 1 for the SDF, and we set the number of neurons in the MLP to (14, 1024, 512, 256, 128, 1). For hyperparameters, we set the batch size to 4 and the learning rate to 1.0 × 10^-4, and we employ the RMSProp optimizer with a weight decay of 0.0 and a momentum of 0.0 for pre-training. Note that we adopt the same 3D spatial sampling strategy as PIFu and PaMIR, combining non-uniform sampling and importance sampling.
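For concreteness, a sketch of an MLP with the neuron counts stated above; the choice of ReLU activations and a final sigmoid is an assumption, since only the layer widths are specified in the text.

```python
import torch.nn as nn

def make_implicit_mlp(sizes=(14, 1024, 512, 256, 128, 1)):
    """MLP for the implicit function with the stated layer widths."""
    layers = []
    for i in range(len(sizes) - 1):
        layers.append(nn.Linear(sizes[i], sizes[i + 1]))
        if i < len(sizes) - 2:
            layers.append(nn.ReLU(inplace=True))
    layers.append(nn.Sigmoid())   # occupancy value in [0, 1]
    return nn.Sequential(*layers)
```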

4.3 Ablation Study

We conducted ablation experiments on the four improvement points on top of the PaMIR model. As shown in the second row of Table 1, introducing PyMAF reduces the geometric error metrics CD and P2S by 32.1% and 16.2%, respectively, while Normal does not change significantly. As shown in Fig. 3, the introduction of normal maps has

Fig. 4. Comparison with state-of-the-art methods, including (a) PIFu [1], (b) PaMIR [4] and (c) SuRS [6]. As shown in the comparison regions highlighted in the red boxes, our method accurately reconstructs challenging poses in the wild, like sports, without creating artifacts or broken limbs in both frontal and side views.

Table 2. Comparison with state-of-the-art methods.

Methods       THuman 2.0                  CAPE
              CD↓     P2S↓    Normal↓     CD↓     P2S↓    Normal↓
PIFu [1]      1.518   1.647   0.122       3.627   3.729   0.116
PIFuHD [2]    1.032   1.046   0.112       3.237   3.123   0.112
PaMIR [4]     1.713   1.818   0.134       2.122   1.495   0.088
SuRS [6]      0.931   1.151   0.167       –       –       –

Ours          1.054   1.152   0.091       1.041   1.037   0.072

improved the facial expressions and clothing folds of the model. As shown in the fourth row of Table 1, after introducing SDF, compared to the original PaMIR, all indicators showed significant improvement. CD, P2S, and Normal decreased by 46.0%, 24.5%, and 8.0% respectively. Moreover, it has improved the CD deterioration caused by introducing normal maps. Finally, with the introduction of the self-attention mechanism (the fifth row of Table 1), the model achieves optimal performance. Compared to Model(4), CD, P2S, and Normal each show additional reduction of 9.1%, 8.1%, and 11.1% respectively. The results show that the four improvement points in this paper are effective.


4.4 Qualitative Evaluation

We conduct a qualitative comparison of our method with other existing methods in Fig. 4 and Fig. 5.

Fig. 5. Compared with state-of-the-art methods, including (a) PIFu [1], (b) PaMIR [4] and (c) SuRS [6]. As shown in the local part’s enlargements labeled with red boxes, our method can accurately reconstruct 3D clothed human bodies with more realistic surface details in various poses in the wild. (Color figure online)

In Fig. 4, we observe that the reconstructed meshes from the other three methods exhibit artifacts and broken limbs. While PaMIR reconstructs more reasonable models compared to PIFu and SuRS, noticeable incomplete body parts are still present. In contrast, our method demonstrates better robustness, preserving complete body parts such as arms, head and legs, without any artifacts. In Fig. 5, we can observe noisy artifacts in PIFu and PaMIR, while SuRS presents unrealistic facial features and clothing wrinkles. In contrast, our method exhibits clear and distinguishable facial expressions with realistic surface details. 4.5

Quantitative Evaluation

We report the experimental results of our approach in Table 2 and quantitatively evaluate it against the state-of-the-art methods.

Implicit Clothed Human Reconstruction Based on Self-attention and SDF

323

As shown in Table 2, our approach outperforms PIFu, PIFuHD, and PaMIR in all metrics, with PaMIR serving as our baseline method. Although the SuRS method exhibits slight superiority in terms of CD and P2S values, our method achieves the best results in the Normal metric. Overall, the performance of our method is comparable to these state-of-the-art techniques.

5

Conclusion

We propose an improved implicit function-based neural network model for 3D clothed human reconstruction. Firstly, we introduce SDF to define the implicit function and enhance robustness. Secondly, we introduce self-attention mechanism on the SMPL volume feature encoding network to extract voxel-aligned features with richer geometric information. Additionally, we enhance local details through the introduction of normal maps. Through qualitative and quantitative testing, our approach improves robustness in outdoor conditions. The reconstructed mesh exhibits complete limbs and realistic surface details.

References 1. Saito, S., Huang, Z., Natsume, R., Morishima, S., Kanazawa, A., Li, H.: Pifu: pixel-aligned implicit function for high-resolution clothed human digitization. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2304–2314 (2019) 2. Saito, S., Simon, T., Saragih, J., Joo, H.: Pifuhd: multi-level pixel-aligned implicit function for high-resolution 3d human digitization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 84–93 (2020) 3. Lorensen, W.E., Cline, H.E.: Marching cubes: a high resolution 3d surface construction algorithm. In: Seminal Graphics: Pioneering Efforts that Shaped the Field, pp. 347–353 (1998) 4. Zheng, Z., Yu, T., Liu, Y., Dai, Q.: Pamir: parametric model-conditioned implicit representation for image-based human reconstruction. IEEE Trans. Pattern Anal. Mach. Intell. 44(6), 3170–3184 (2021) 5. Loper, M., Mahmood, N., Romero, J., Pons-Moll, G., Black, M.J.: Smpl: a skinned multi-person linear model. In: Seminal Graphics Papers: Pushing the Boundaries, vol. 2, pp. 851–866 (2023) 6. Pesavento, M., Volino, M., Hilton, A.: Super-resolution 3D human shape from a single low-resolution image. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, 23–27 October 2022, Proceedings, Part II, pp. 447–464. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20086-1 26 7. Chan, K.Y., Lin, G., Zhao, H., Lin, W.: IntegratedPIFu: integrated pixel aligned implicit function for single-view human reconstruction. In: Avidan, S., Brostow, G., Ciss´e, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, 23–27 October 2022, Proceedings, Part II, pp. 328–344. Springer, Cham (2022). https://doi.org/10.1007/978-3-03120086-1 19

324

L. Yao et al.

8. Zheng, Z., Yu, T., Wei, Y., Dai, Q., Liu, Y.: Deephuman: 3d human reconstruction from a single image. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 7739–7749 (2019) 9. He, T., Collomosse, J., Jin, H., Soatto, S.: Geo-pifu: geometry and pixel aligned implicit functions for single-view human reconstruction. Adv. Neural. Inf. Process. Syst. 33, 9276–9287 (2020) 10. Huang, Z., Xu, Y., Lassner, C., Li, H., Tung, T.: Arch: animatable reconstruction of clothed humans. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3093–3102 (2020) 11. He, T., Xu, Y., Saito, S., Soatto, S., Tung, T.: Arch++: animation-ready clothed human reconstruction revisited. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 11046–11056 (2021) 12. Hong, Y., Zhang, J., Jiang, B., Guo, Y., Liu, L., Bao, H.: Stereopifu: depth aware clothed human digitization via stereo vision. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 535–545 (2021) 13. Bhattacharyya, P., Huang, C., Czarnecki, K.: Sa-det3d: self-attention based context-aware 3d object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3022–3031 (2021) 14. Li, Y., Cai, J.: Point cloud classification network based on self-attention mechanism. Comput. Electr. Eng. 104, 108451 (2022) 15. Li, X., Kuang, P.: 3d-vrvt: 3d voxel reconstruction from a single image with vision transformer. In: 2021 International Conference on Culture-Oriented Science & Technology (ICCST), pp. 343–348. IEEE (2021) 16. Liu, Q.M., Jia, R.S., Zhao, C.Y., Liu, X.Y., Sun, H.M., Zhang, X.L.: Face superresolution reconstruction based on self-attention residual network. IEEE Access 8, 4110–4121 (2019) 17. Lin, K., Wang, L., Liu, Z.: End-to-end human pose and mesh reconstruction with transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1954–1963 (2021) 18. Newell, A., Yang, K., Deng, J.: Stacked hourglass networks for human pose estimation. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 483–499. Springer, Cham (2016). https://doi.org/10.1007/978-3-31946484-8 29 19. Zhang, H., et al.: Pymaf: 3d human pose and shape regression with pyramidal mesh alignment feedback loop. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 11446–11456 (2021) 20. Kolotouros, N., Pavlakos, G., Daniilidis, K.: Convolutional mesh regression for single-image human shape reconstruction. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4501–4510 (2019) 21. Wang, T.C., Liu, M.Y., Zhu, J.Y., Tao, A., Kautz, J., Catanzaro, B.: Highresolution image synthesis and semantic manipulation with conditional gans. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8798–8807 (2018) 22. Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 694–711. Springer, Cham (2016). https://doi.org/10. 1007/978-3-319-46475-6 43 23. Ma, Q., et al.: Learning to dress 3d people in generative clothing. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6469–6478 (2020)

Privacy-Preserving Federated Compressed Learning Against Data Reconstruction Attacks Based on Secure Data

Di Xiao(B), Jinkun Li, and Min Li

College of Computer Science, Chongqing University, Chongqing 400044, China
{dixiao,jkli,minli}@cqu.edu.cn

Abstract. Federated learning is a new distributed learning framework with data privacy preservation, in which multiple users collaboratively train models without sharing their data. However, recent studies highlight potential privacy leakage through the shared gradient information. Several defense strategies have been suggested, including gradient encryption and gradient perturbation, but they either involve high computational complexity or remain susceptible to attacks. To counter these challenges, we propose to train on secure compressive measurements via compressed learning, thereby protecting local data privacy with only slight performance degradation. A feasible way to boost performance in compressed learning is to jointly optimize the sampling matrix and the inference network during training, but this again exposes the data to reconstruction attacks. We therefore further incorporate a traditional lightweight encryption scheme to protect data privacy. Experiments conducted on the MNIST and FMNIST datasets substantiate that our schemes achieve a satisfactory balance between privacy protection and model performance.

Keywords: Federated learning · Data reconstruction attack · Privacy-preserving · Compressed learning · Lightweight encryption

1 Introduction

Artificial Intelligence (AI) is a transformative technology that enables machines to mimic human-like thinking, learning, and reasoning capabilities. The performance of AI systems heavily relies on the availability of a substantial amount of high-quality data. However, due to growing public concern over privacy issues and increasingly stringent data protection regulations, collecting training data that may contain private information has become a significant challenge. To tackle these challenges, federated learning (FL) [9] has emerged as a promising solution. FL offers a powerful approach to addressing the issues of "data islands" and privacy, making it a prominent area of research in AI. Unlike traditional centralized machine learning, FL aggregates model parameter updates from local devices at a central server, so that models can be trained and updated without sharing raw data.

Unfortunately, this approach falls short of sufficiently guaranteeing data privacy. Numerous studies have demonstrated the ability to infer sensitive information from the shared gradient information. The membership inference attack [12] is the pioneering work revealing privacy leakage in FL, by analyzing whether a specific sample was present in the training data. Building upon this, subsequent research has shown that user-level information can be reconstructed from the gradient [16]. Later, data reconstruction attacks, which recover the original data from the gradient, emerged, fundamentally challenging the notion that FL adequately protects local data privacy. In this paper, we focus on data reconstruction attacks, which represent a type of inference attack.

Recently, several data reconstruction attack methods [4,6,18-20] have been proposed, demonstrating that an attacker can reconstruct the local training data by exploiting the shared gradient. These methods primarily achieve data reconstruction by minimizing the distance between the gradient generated by a pair of virtual data and label and the gradient generated by the real data and label.

To tackle the problem of data reconstruction attacks, various defense strategies have been proposed. They can be broadly classified into two categories. The first category includes cryptography-based methods, such as secure multi-party computation; these methods employ secure aggregation protocols, such as homomorphic encryption [1,8,13], to prevent the original gradient information from being exposed. The second category involves gradient perturbation methods, with differential privacy [15,17] being a prominent example. Furthermore, [20] proposes that gradient compression can effectively mitigate gradient privacy concerns. Another defensive approach, suggested by Sun et al. [14] and called Soteria, perturbs the data before gradient calculation. Unfortunately, the method proposed in [6] can reconstruct similar data even when gradient perturbation strategies such as gradient clipping, additive noise, gradient sparsification, and Soteria are employed. Li et al. [6] utilize the latent space of a generative adversarial network (GAN) trained on a public image dataset to compensate for the information loss caused by gradient degradation; however, when the private training data is unrelated to the public image dataset, the GAN cannot learn useful prior information. Hence, we are inspired to address the data reconstruction attack problem in FL by using secure data.

Compressed sensing (CS) [3] is a mathematical framework for efficient signal acquisition and robust recovery. In addition to being compressive, CS measurements are inherently encrypted. Calderbank et al. [2] introduce the concept of compressed learning (CL), where the inference system is built directly on CS measurements. Subsequent research [7,10,11,21] has further improved CL, with [10] demonstrating that CL can achieve performance almost comparable to the original image domain. To the best of our knowledge, CL has not yet been explored in the context of FL to address data reconstruction attacks, making it a promising direction for investigation.

In this paper, we design a privacy-preserving framework for FL based on CL, which exploits the secrecy property of CS measurements to achieve a defensive effect against data reconstruction attacks.

Previous studies [10,21] have shown that jointly optimizing the sampling matrix and the inference network can improve model performance. However, since the sampling matrix then becomes part of the model during training, it remains vulnerable to data reconstruction attacks. To address this issue, we introduce traditional cryptography to encrypt the training data, which protects it from the threat of data reconstruction attacks. Different from the two categories of defenses based on gradient encryption and gradient perturbation, our approach focuses on training with secure data and is orthogonal to the previous methods. Experimental results further validate the feasibility of the proposed method. Our main contributions can be summarized as follows:

• We propose to apply CL to address the privacy leakage problem caused by data reconstruction attacks in FL.
• To further enhance performance, we jointly optimize the sampling matrix and the inference network. Additionally, we introduce novel lightweight encryption methods in both the spatial and frequency domains to protect the training data.
• Experimental results demonstrate the effectiveness of our proposed schemes in achieving satisfactory privacy and utility.

2 Related Work

2.1 Data Reconstruction Attacks

Recently, Zhu et al. [20] propose to reconstruct the original data and label by driving the gradient of a pair of virtual data and label close, in terms of the ℓ2 norm, to the gradient generated by the real data and label. A follow-up work [19] proposes to use the gradient of the last fully connected layer to recover the true label. However, these approaches encounter difficulties when applied to large-scale and discontinuous network models. Geiping et al. [4] then introduce a recovery method that is independent of the gradient magnitude; they use cosine similarity to measure the similarity between gradients, demonstrating that more complex data can be recovered even in deeper models. Later, Yin et al. [18] introduce a group consistency regularization term, enabling the reconstruction of a large batch of images after gradient averaging.

2.2 Defenses Against Data Reconstruction Attacks

The existing defense schemes for FL can be classified into two categories: one is based on gradient encryption, and the other on gradient perturbation. Homomorphic encryption serves as an effective secure aggregation scheme for gradient protection. In [1], an asynchronous stochastic gradient descent scheme leveraging additive homomorphic encryption is proposed. To address the high computational complexity of homomorphic aggregation on the server side, [13] introduces a third-party calculation provider. Additionally, [8] proposes homomorphic encryption with multiple keys to resist collusion attacks.

However, these methods often impose considerable computational complexity, rendering them unsuitable for resource-constrained users. Another line of research perturbs the gradient to degrade it, thereby preventing privacy leaks. Differential privacy, which introduces random noise, can quantify and limit the disclosure of user privacy. The convergence of FL with differential privacy is proven in [17]. Moreover, [15] incorporates local differential privacy into FL and customizes it with improvements suited to FL. While these gradient perturbation methods offer lower computational complexity, they still cause a decrease in model accuracy.

2.3 Compressed Sensing

Compressed sensing (CS) [3] is a mathematical paradigm that exploits signal redundancy to accurately reconstruct signals from far fewer measurements than the Shannon sampling rate requires. Specifically, given a signal x ∈ R^N, the CS measurement y ∈ R^M is obtained by the sampling process y = Φx, where Φ ∈ R^(M×N) is the sampling matrix, M ≪ N, and the sampling rate is defined as γ = M/N. Since the number of unknowns is much larger than the number of equations, it is in general impossible to reconstruct x from the observations y alone. However, if x is sparse in some basis Ψ, then x can be reconstructed from y; this is the core of CS theory. Many studies are devoted to this reconstruction problem, and traditional optimization algorithms usually recover the original signal x by solving

    min_x ‖Ψx‖_1   s.t.   y = Φx,                                  (1)

where Ψ denotes a sparse basis on which x is sparse.

2.4 Compressed Learning

Compressed learning (CL) aims to perform inference tasks directly in the measurement domain without signal recovery. The concept is initially proposed by Calderbank et al. [2], who provide the first theoretical foundation for CL and demonstrate the feasibility of performing inference tasks in the compressed domain. Subsequently, Lohit et al. [7] utilize Convolutional Neural Network (CNN) for classification in the compressed domain. Building upon this, Adler et al. [21] devise an end-to-end deep learning solution for classification in the compressed domain, jointly optimizing the sampling matrix and inference operator. In [11], the authors demonstrate the robustness of the CL scheme by performing inference using partially observed image data. Recently, [10] employs an elaborate transformer network to conduct inference tasks in the measurement domain.

3 Methodology

In our approach, we utilize a CL scheme that projects the CS measurement back into the image space, generating a noise-like proxy image (Sect. 3.1).

This proxy image is directly used as the training data of the FL client to resist data reconstruction attacks. To further enhance model performance, we jointly optimize the sampling matrix and the subsequent inference network. However, the original training data is then fed into the model during training, which again exposes it to data reconstruction attacks. To address this, we propose encrypting the training data before training: Sect. 3.2 introduces a spatial-domain encryption scheme, and Sect. 3.3 introduces a frequency-domain encryption scheme that further enhances security.

3.1 Training with Proxy Image

Motivation. In the scenario where we possess CS measurements, obtained for example through an imaging device such as a single-pixel camera, the measurements themselves look like noise. We can adopt the CL framework and use these secure measurements as the client's training data in FL, so that an honest-but-curious server is unable to extract any information about the original image, even if it obtains the measurements through a data reconstruction attack. Furthermore, suppose the server has acquired our sampling matrix through some means, potentially enabling it to reconstruct the original image with CS reconstruction algorithms. The distribution of CS measurements, however, is closely tied to the image and differs from image to image, even under the same sampling matrix. Therefore, before training the network model, we normalize all measurements, a step that is in any case a standard part of neural network training. Consequently, even if the normalized measurements are inferred through a data reconstruction attack, the inability to restore their original distribution prevents recovery of the corresponding original images with CS reconstruction algorithms.

We propose a CL scheme to resist the data reconstruction attack and use Fig. 1 to illustrate the whole client process during one communication round. An image is flattened into a one-dimensional vector, and the measurement is obtained by CS sampling. The resulting measurement is a one-dimensional vector, which is usually projected back to the original image dimension and then fed to a CNN that extracts features for the image classification task. To restore the measurement to the original image dimension, we can employ a matrix related to the sampling matrix (such as its transpose) or a fully connected network layer. The data reconstruction attack method used in our subsequent experiments is more effective at recovering image-like data, because its loss function contains a total variation regularization term. We therefore project the measurement back into the image space using the transpose of the sampling matrix and reshape it to obtain the proxy image. The proxy image then undergoes normalization to the common gray-value interval [0, 255], which is necessary because its pixel distribution changes significantly. The normalized proxy image is employed as the training data of the network model. The server can only use the shared gradients to recover the proxy image through a data reconstruction attack, and cannot obtain information about the original image.

Fig. 1. The whole process of client local training, taking one picture as an example. Red arrows represent the data reconstruction attack. (Color figure online)

• Step 1: The client performs CS on the local dataset D. Each N × N image from D is first flattened into a one-dimensional vector and then sampled by y = Φx to obtain the measurement y, where Φ ∈ R^(M×N) and the sampling rate is M/N.
• Step 2: The measurement is projected back into the image space by X̃ = Φ^T y, and the proxy image is obtained by reshaping X̃ to N × N (see the sketch below).
• Step 3: The proxy image is normalized before being fed into the CNN model for image classification training.
• Step 4: After local training, the clients selected in this communication round upload their model updates to the server.
• Step 5: The server aggregates all received model updates and transmits the result back to all clients.
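The client-side preprocessing of Scheme I (Steps 1-3) can be summarized in a short sketch. This is only an illustration, not the authors' implementation; the Gaussian sampling matrix, its scaling, and the normalization details are assumptions made for demonstration.

```python
import numpy as np

def make_proxy_image(img, sampling_rate=0.1, seed=0):
    """Scheme I sketch: CS-sample a flattened image with a fixed random
    Gaussian matrix, back-project with the transpose, and normalize the
    result to the gray-value range [0, 255]."""
    n = img.size                                   # flattened length
    m = max(1, int(sampling_rate * n))             # number of measurements M
    rng = np.random.default_rng(seed)
    phi = rng.normal(0.0, 1.0 / np.sqrt(m), size=(m, n))   # sampling matrix Φ

    x = img.reshape(-1).astype(np.float64)
    y = phi @ x                                    # Step 1: measurement y = Φx
    proxy = (phi.T @ y).reshape(img.shape)         # Step 2: X̃ = Φᵀy, reshaped

    # Step 3: normalize the proxy image before CNN training
    proxy = (proxy - proxy.min()) / (proxy.max() - proxy.min() + 1e-12)
    return (proxy * 255.0).astype(np.uint8)

# usage: the FL client trains on proxy images, never on the raw images
# proxy = make_proxy_image(np.random.randint(0, 256, (28, 28)))
```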

3.2 Training with Encrypted Data in the Spatial Domain

The scheme in Sect. 3.1 uses a fixed sampling matrix for all the training data. However, optimizing the sampling matrix jointly with the subsequent inference network further improves inference performance. To achieve this, the original training data needs to be fed into the network during training, so an honest-but-curious server could potentially access it through a data reconstruction attack. Therefore, in this section we propose a novel way to counter the data reconstruction attack by encrypting the image data before it enters the neural network. It is crucial to strike a balance between usability and privacy, as excessive encryption can render the images untrainable. To address this concern, we adopt a two-dimensional random permutation scheme to encrypt the image data. For a two-dimensional image X ∈ R^(N×N), we multiply it on the left by a row random permutation matrix P and on the right by a column random permutation matrix Q, so the encryption process can be expressed as

    E(X) = PXQ.                                  (2)

The matrices P and Q are N × N square matrices with a single '1' in each row and each column and zeros elsewhere. We can use a chaotic system such as the Logistic map to generate these two matrices, with the keys k1 and k2 serving as the initial values of the chaotic system. Algorithm 1 describes the generation of P and Q with the Logistic chaotic map, and Fig. 2 illustrates the whole client process during one communication round.

Algorithm 1: Logistic chaotic system generates two random permutation matrices.
Input: initial values x0, y0; number of iterations N; control parameter µ = 3.56995.
1: initialize chaotic sequences S1 = {}, S2 = {}
2: for i = 1 to N do
3:   x_i = µ · x_(i-1) · (1 − x_(i-1))
4:   y_i = µ · y_(i-1) · (1 − y_(i-1))
5:   S1 = S1 ∪ {x_i}; S2 = S2 ∪ {y_i}
6: end for
7: sort S1 and S2 separately
8: denote the position indexes of the elements in the new ordered sequences as indexS1 and indexS2, both in the range [1, N]
9: set P_(i,j) = 1 if i = indexS1(j) and 0 otherwise; set Q_(i,j) = 1 if j = indexS2(i) and 0 otherwise
Output: row random permutation matrix P; column random permutation matrix Q.
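For concreteness, Algorithm 1 can be sketched as follows. The argsort-based extraction of indexS1 and indexS2 is one reasonable reading of the sorting step, and the sketch omits practical details (e.g., discarding transient iterations of the map), so it should be taken as an assumption-laden illustration rather than the authors' code.

```python
import numpy as np

def logistic_sequence(x0, n, mu=3.56995):
    """Iterate the Logistic map x_i = mu * x_(i-1) * (1 - x_(i-1))."""
    seq = np.empty(n)
    x = x0
    for i in range(n):
        x = mu * x * (1.0 - x)
        seq[i] = x
    return seq

def permutation_matrices(k1, k2, n):
    """Build the row/column permutation matrices P and Q of Algorithm 1.
    k1 and k2 play the role of the secret initial values x0 and y0."""
    s1, s2 = logistic_sequence(k1, n), logistic_sequence(k2, n)
    idx1, idx2 = np.argsort(s1), np.argsort(s2)   # indexes after sorting
    p = np.zeros((n, n))
    q = np.zeros((n, n))
    p[idx1, np.arange(n)] = 1.0                   # P[i, j] = 1 iff i = indexS1(j)
    q[np.arange(n), idx2] = 1.0                   # Q[i, j] = 1 iff j = indexS2(i)
    return p, q

# spatial-domain encryption of Eq. (2): E(X) = P X Q
# P, Q = permutation_matrices(0.345, 0.678, 28)
# encrypted = P @ img @ Q
```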

• Step 1: The client encrypts each N × N image from the local dataset D using the two-dimensional random permutation.
• Step 2: The encrypted images are normalized and then input into the sensing module.
• Step 3: The sensing module samples the flattened one-dimensional vector of each image with a learnable sampling matrix to obtain the measurements.
• Step 4: The measurements are activated by ReLU and then projected back to a one-dimensional vector of the original image dimension by a fully connected layer; this vector is reshaped and fed to the subsequent CNN module (see the sketch below).
• Step 5: After local training, the clients selected in this communication round upload their model updates to the server.
• Step 6: The server aggregates all received model updates and transmits the result back to all clients.
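A minimal sketch of the Scheme II client network is given below. The convolutional part loosely follows the small ConvNet described later in the experimental settings (three 5 × 5 convolutions with 16/32/64 channels, ReLU, and max pooling), but padding, layer ordering, and the back-projection layer are assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class SchemeIIClientNet(nn.Module):
    """Learnable sensing module on the flattened encrypted image, ReLU,
    fully connected back-projection to image shape, then a small CNN."""
    def __init__(self, img_size=28, sampling_rate=0.1, num_classes=10):
        super().__init__()
        n = img_size * img_size
        m = max(1, int(sampling_rate * n))
        self.img_size = img_size
        self.sense = nn.Linear(n, m, bias=False)   # learnable sampling matrix
        self.back_proj = nn.Linear(m, n)           # project measurements back
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 16, 5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            nn.Flatten(),
            nn.Linear(64 * (img_size // 8) ** 2, num_classes),
        )

    def forward(self, x):                          # x: normalized encrypted images
        y = torch.relu(self.sense(x.flatten(1)))   # Steps 3-4: sample and activate
        proxy = self.back_proj(y).view(-1, 1, self.img_size, self.img_size)
        return self.cnn(proxy)                     # classification head
```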

3.3 Training with Encrypted Data in the Frequency Domain

To enhance security and the resistance against statistical analysis and other attacks, it is beneficial to convert the image into a transform domain, such as the frequency domain, before encrypting it.

Fig. 2. The whole process of client local training, taking one picture as an example. Red arrows represent the data reconstruction attack. (Color figure online)

Therefore, in this section we propose a two-dimensional random permutation encryption scheme for the two-dimensional discrete cosine transform coefficients of the image. For a two-dimensional image X ∈ R^(N×N), we first apply the two-dimensional discrete cosine transform (2D-DCT) to obtain a transformed coefficient matrix D ∈ R^(N×N). This process can be expressed as

    D = ΨXΨ^T,                                   (3)

where Ψ represents the sparse basis matrix of the discrete cosine transform. Next, the coefficient matrix D is encrypted using the two-dimensional random permutation scheme described in Sect. 3.2, which yields the encrypted coefficient matrix E. Finally, E is transformed back to the image domain using the two-dimensional inverse discrete cosine transform. The process can be expressed as

    E = PDQ,                                     (4)
    X̃ = Ψ^T EΨ,                                  (5)

where P and Q represent the row and column random permutation matrices, respectively, and X̃ denotes the encrypted image after transforming back to the image domain. Due to the scrambling of the 2D-DCT coefficient matrix, the distribution range of pixel values changes significantly after the shuffled coefficient matrix is transformed back to the image domain. Therefore, after encryption, all encrypted images are normalized to the same gray-value interval [0, 255], which further improves the security of the frequency-domain encryption scheme. The overall client process of this scheme also follows Fig. 2.
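A minimal sketch of the frequency-domain encryption (Eqs. (3)-(5)) is shown below. It reuses the permutation matrices from the Algorithm 1 sketch; the use of SciPy's orthonormal DCT as the basis Ψ and the final normalization constants are assumptions for illustration.

```python
import numpy as np
from scipy.fftpack import dct, idct

def dct2(x):
    """Separable 2D-DCT (orthonormal), playing the role of D = ΨXΨᵀ."""
    return dct(dct(x, axis=0, norm="ortho"), axis=1, norm="ortho")

def idct2(x):
    """Inverse 2D-DCT, playing the role of X̃ = ΨᵀEΨ."""
    return idct(idct(x, axis=0, norm="ortho"), axis=1, norm="ortho")

def encrypt_frequency_domain(img, p, q):
    """Scheme III sketch: transform, shuffle coefficients, transform back,
    then normalize the encrypted image to the gray-value range [0, 255]."""
    d = dct2(img.astype(np.float64))      # Eq. (3)
    e = p @ d @ q                         # Eq. (4): permute DCT coefficients
    x_enc = idct2(e)                      # Eq. (5): back to the image domain
    x_enc = (x_enc - x_enc.min()) / (x_enc.max() - x_enc.min() + 1e-12)
    return (x_enc * 255.0).astype(np.uint8)
```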

4 Experiment

In this section, we begin by introducing our experimental settings in Sect. 4.1. The data reconstruction attack results on different datasets are presented in Sect. 4.2.

We conduct a security performance evaluation in Sect. 4.3, followed by the model performance results in Sect. 4.4. For ease of reference, we refer to the schemes discussed in Sects. 3.1, 3.2, and 3.3 as Scheme I, Scheme II, and Scheme III, respectively.

4.1 Experimental Settings

Datasets and Evaluation Metrics. We use the MNIST and FMNIST datasets for our experiments; both are commonly used image classification datasets and are widely used to evaluate deep learning algorithms. Our evaluation focuses on defense and performance. To assess the effectiveness of the defense, we employ the peak signal-to-noise ratio (PSNR); a lower PSNR value indicates a greater visual difference. For performance, we evaluate the test accuracy of the model on the different datasets.

Training Details. In our simulation experiments, the FL system consists of a central server and one hundred clients. The total number of training rounds is 100, and the number of local epochs in each round is 5. In each communication round, 50 clients are randomly selected to participate in FL training. For the MNIST dataset, we use the LeNet5 model [5] with a local batch size of 32. For the FMNIST dataset, we use a small ConvNet containing three convolutional layers and a fully connected layer; the numbers of channels in the convolutional layers are 16, 32, and 64, respectively, the convolution kernel size is 5, and ReLU activation and max pooling are applied after each convolution, with a local batch size of 64. We train our models in PyTorch on a 1080Ti card, and all models are optimized with the SGD optimizer with momentum 0.9. The learning rate is initially set to 0.01 and reduced to 0.001 after 30 rounds of communication.
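The federated training loop implied by these settings can be sketched as follows. This is a generic FedAvg round, not the authors' code; the client data-loading callback and the averaging of raw state dictionaries are simplifying assumptions.

```python
import copy
import random
import torch

def fedavg_round(global_model, clients, make_loader, lr, local_epochs=5, sample=50):
    """One communication round: sample 50 of the clients, train each locally
    with SGD (momentum 0.9) for 5 epochs on proxy/encrypted batches, then
    average the resulting weights on the server."""
    selected = random.sample(clients, sample)
    states = []
    loss_fn = torch.nn.CrossEntropyLoss()
    for client in selected:
        local = copy.deepcopy(global_model)
        opt = torch.optim.SGD(local.parameters(), lr=lr, momentum=0.9)
        for _ in range(local_epochs):
            for x, y in make_loader(client):
                opt.zero_grad()
                loss_fn(local(x), y).backward()
                opt.step()
        states.append(local.state_dict())
    averaged = {k: sum(s[k].float() for s in states) / len(states) for k in states[0]}
    global_model.load_state_dict(averaged)
    return global_model

# learning-rate schedule from the text: lr = 0.01 for the first 30 rounds, then 0.001
```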

4.2 Defense Effect Against Data Reconstruction Attacks

We apply the Inverting Gradients (IG) method proposed in [4] to recover a single input image. The recovery process consists of 24,000 iterations, with a total variation term weight of 0.0001. To ensure the independence of recovery results from the initial seed, we conduct 10 repetitions of the experiment for each attack and report the result with the lowest loss. Figure 3 displays the results of IG attack against the MNIST and FMNIST datasets. In the original FL scenario without privacy protection, the original data is fully exposed. However, in Scheme I, the recovered image is only a proxy image, which resembles the noise distribution, thereby making its recovery more challenging. In Scheme II, it can be observed that the recovered image is relatively clear. However, since the image itself is encrypted, the obtained image remains an unrecognizable encrypted form. Scheme III demonstrates that the recovered image has no correlation with the original image, indicating that the encryption effect is superior to that of Scheme II.

Fig. 3. Recovery results of IG attack on the MNIST and FMNIST datasets for different methods. For our schemes, the sampling rate is set to 0.1. The PSNR between the image and its ground truth is displayed on the left side of the image.

We also conduct IG attack experiments on the gradient compression method proposed in [20]. The defense effect under different sparsity ratios P is illustrated in Fig. 4, where the image remains recognizable until P = 0.01. It is worth noting that [20] claims resistance against data reconstruction attacks when the sparsity ratio approaches 0.8; however, [20] uses a reconstruction attack based on minimizing the ℓ2 distance between gradients, whereas our paper employs the IG attack, which matches gradients through cosine similarity.

4.3 Security Performance Evaluation

In Scheme I, we apply a linear transformation to the image. It is important to consider that, if the CS sampling matrix is stolen, an attacker may use the least squares method to reconstruct the original image from the proxy image recovered through the data reconstruction attack. This process can be described as

    X̂ = (Φ^T Φ)^(−1) X,                          (6)

where X ∈ R^(N×N) represents the proxy image recovered through the data reconstruction attack, and X̂ denotes the original image finally reconstructed by the least squares method. After projecting the CS measurement back into the original image space, the proxy image is normalized to the gray-value interval [0, 255]. Hence, if the pixel distribution of the proxy image is not restored, the attacker cannot obtain a relatively clear image with the least squares method.

Fig. 4. Recovery results of IG attack at different sparsity ratios. The PSNR between the image and its ground truth is displayed on the left side of the image.

Fig. 5. (a) Original image, (b) proxy image with a sampling rate of 0.25, (c) image inferred from the IG attack, (d) image reconstructed from the inferred image.

This claim is further validated by the example depicted in Fig. 5.

In Scheme II and Scheme III, we employ the two-dimensional random permutation encryption method: in Scheme II the encryption is applied to the original image, while in Scheme III it is applied to the corresponding 2D-DCT coefficients. The key space for an N × N image is (N!)^2, which provides sufficient resistance against brute-force search attacks. It is worth noting that even if the encryption key is compromised, Scheme III remains secure, because all encrypted images are normalized to the same gray-value interval [0, 255].

We employ information entropy to evaluate the level of chaos in the datasets associated with the three schemes. Given the strong correlation between adjacent pixels, image data tends to exhibit low entropy, indicating high predictability. As shown in Table 1, the datasets corresponding to Scheme I and Scheme III exhibit relatively high entropy, rendering the data less predictable. In Scheme II, only the pixel positions are shuffled while the distribution of pixel values remains unchanged, so its entropy equals that of the original dataset. Consequently, Scheme III significantly enhances security compared to Scheme II.


Table 1. The entropy of the dataset corresponding to different schemes.

Dataset | Original | Scheme I | Scheme II | Scheme III
MNIST   | 1.60     | 7.13     | 1.60      | 6.89
FMNIST  | 4.12     | 7.19     | 4.12      | 7.06

4.4 Model Performance

The accuracy of the model on the different test datasets, for various compression sampling rates and schemes, is presented in Table 2 and Table 3. For Scheme I, the model performs better at higher sampling rates; at a sampling rate of 0.25 it performs reasonably well, only slightly below the unprotected federated averaging (FedAvg) algorithm. For Scheme II, an interesting observation emerges: with encryption alone and no sensing module (sampling rate of 1), the final model achieves lower accuracy than when a sensing module is added at sampling rates of 0.25, 0.1, and 0.05. One potential explanation is that CS reduces the dimensionality of the data, making the data distribution easier for the model to capture and thus improving performance. Scheme II outperforms Scheme I at all sampling rates. Introducing Scheme III to further protect the training data, we observe that, similar to Scheme II, encryption followed by CS surpasses encryption alone (sampling rate of 1) at sampling rates of 0.25 and 0.1. Both Scheme II and Scheme III exhibit only minor performance degradation compared to the unprotected FedAvg algorithm.

Among other privacy-preserving approaches in FL, gradient encryption methods such as homomorphic encryption significantly increase computational complexity and communication overhead, while gradient perturbation methods such as differential privacy often lack concrete demonstrations of their protective effect and primarily study different privacy budgets. For instance, in [15] the best test accuracy on the FMNIST dataset is reported as 86.93%, whereas the FedAvg algorithm achieves approximately 90%; some of our schemes achieve comparable results at an appropriate sampling rate. We also evaluate the gradient compression (GC) method proposed in [20] with a prune ratio of 0.99: the accuracy achieved by GC on the MNIST and FMNIST test datasets is 96.78% and 85.74%, respectively, so all our schemes outperform GC at some sampling rates.

Table 2. Model accuracy (%) of different schemes on the MNIST test dataset.

Sampling Rate | Scheme I | Scheme II | Scheme III | FedAvg
1             | –        | 96.30     | 96.23      | 98.98
0.25          | 97.53    | 97.57     | 96.84      |
0.1           | 95.32    | 97.34     | 96.53      |
0.05          | 91.55    | 96.94     | 95.96      |
0.01          | 56.68    | 93.45     | 93.05      |

Table 3. Model accuracy (%) of different schemes on the FMNIST test dataset.

Sampling Rate | Scheme I | Scheme II | Scheme III | FedAvg
1             | –        | 87.26     | 87.54      | 89.89
0.25          | 86.69    | 87.98     | 87.98      |
0.1           | 85.32    | 87.88     | 87.72      |
0.05          | 83.68    | 87.55     | 87.43      |
0.01          | 58.69    | 85.27     | 84.57      |

5 Conclusion

In this paper, we propose a novel privacy-preserving FL framework based on CL. Our approach introduces CL as a mechanism to address gradient-leakage privacy concerns in FL, and we demonstrate its feasibility. Additionally, we propose the use of lightweight encrypted data as a protective scheme against data reconstruction attacks. Through simulation results, we validate the effectiveness of our schemes in resisting attacks, with only a slight impact on accuracy at suitable compression rates. In future research, we plan to explore additional protection schemes derived from CL for integration into FL. Furthermore, we aim to develop protection solutions tailored for resource-constrained device scenarios, ensuring their suitability and practicality.

Acknowledgements. The work was supported by the National Key R&D Program of China under Grant 2020YFB1805400, the National Natural Science Foundation of China under Grant 62072063 and the Project Supported by Graduate Student Research and Innovation Foundation of Chongqing, China under Grant CYB22063.

References

1. Aono, Y., Hayashi, T., Wang, L., Moriai, S., et al.: Privacy-preserving deep learning via additively homomorphic encryption. IEEE Trans. Inf. Forensics Secur. 13(5), 1333–1345 (2017)
2. Calderbank, R., Jafarpour, S., Schapire, R.: Compressed learning: universal sparse dimensionality reduction and learning in the measurement domain. Preprint (2009)

3. Donoho, D.L.: Compressed sensing. IEEE Trans. Inf. Theory 52(4), 1289–1306 (2006)
4. Geiping, J., Bauermeister, H., Dröge, H., Moeller, M.: Inverting gradients - how easy is it to break privacy in federated learning? Adv. Neural. Inf. Process. Syst. 33, 16937–16947 (2020)
5. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
6. Li, Z., Zhang, J., Liu, L., Liu, J.: Auditing privacy defenses in federated learning via generative gradient leakage. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10132–10142 (2022)
7. Lohit, S., Kulkarni, K., Turaga, P.: Direct inference on compressive measurements using convolutional neural networks. In: 2016 IEEE International Conference on Image Processing (ICIP), pp. 1913–1917. IEEE (2016)
8. Ma, J., Naas, S.A., Sigg, S., Lyu, X.: Privacy-preserving federated learning based on multi-key homomorphic encryption. Int. J. Intell. Syst. 37(9), 5880–5901 (2022)
9. McMahan, B., Moore, E., Ramage, D., Hampson, S., y Arcas, B.A.: Communication-efficient learning of deep networks from decentralized data. In: Artificial Intelligence and Statistics, pp. 1273–1282. PMLR (2017)
10. Mou, C., Zhang, J.: TransCL: transformer makes strong and flexible compressive learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(4), 5236–5251 (2023)
11. Nair, A., Liu, L., Rangamani, A., Chin, P., Bell, M.A.L., Tran, T.D.: Reconstruction-free deep convolutional neural networks for partially observed images. In: 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP), pp. 400–404. IEEE (2018)
12. Nasr, M., Shokri, R., Houmansadr, A.: Comprehensive privacy analysis of deep learning: passive and active white-box inference attacks against centralized and federated learning. In: 2019 IEEE Symposium on Security and Privacy (SP), pp. 739–753. IEEE (2019)
13. Park, J., Lim, H.: Privacy-preserving federated learning using homomorphic encryption. Appl. Sci. 12(2), 734 (2022)
14. Sun, J., Li, A., Wang, B., Yang, H., Li, H., Chen, Y.: Soteria: provable defense against privacy leakage in federated learning from representation perspective. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9311–9319 (2021)
15. Truex, S., Liu, L., Chow, K.H., Gursoy, M.E., Wei, W.: LDP-Fed: federated learning with local differential privacy. In: Proceedings of the Third ACM International Workshop on Edge Systems, Analytics and Networking, pp. 61–66 (2020)
16. Wang, Z., Song, M., Zhang, Z., Song, Y., Wang, Q., Qi, H.: Beyond inferring class representatives: user-level privacy leakage from federated learning. In: IEEE INFOCOM 2019 - IEEE Conference on Computer Communications, pp. 2512–2520. IEEE (2019)
17. Wei, K., et al.: Federated learning with differential privacy: algorithms and performance analysis. IEEE Trans. Inf. Forensics Secur. 15, 3454–3469 (2020)
18. Yin, H., Mallya, A., Vahdat, A., Alvarez, J.M., Kautz, J., Molchanov, P.: See through gradients: image batch recovery via GradInversion. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 16337–16346 (2021)
19. Zhao, B., Mopuri, K.R., Bilen, H.: iDLG: improved deep leakage from gradients. arXiv preprint arXiv:2001.02610 (2020)

20. Zhu, L., Liu, Z., Han, S.: Deep leakage from gradients. In: Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, Vancouver, BC, Canada, 8–14 December 2019, pp. 14747–14756 (2019)
21. Zisselman, E., Adler, A., Elad, M.: Compressed learning for image classification: a deep neural network approach. In: Handbook of Numerical Analysis, vol. 19, pp. 3–17. Elsevier (2018)

An Attack Entity Deducing Model for Attack Forensics

Tao Jiang, Junjiang He(B), Tao Li, Wenbo Fang, Wenshan Li, and Cong Tang

Sichuan University, Chengdu 610065, Sichuan, China
[email protected]

Abstract. The forensics of Advanced Persistent Threat (APT) attacks, known for their prolonged duration and use of multiple attack methods, requires extensive log analysis to discern the attack steps. Facing this massive amount of data, researchers have increasingly turned to machine learning methods to enhance attack forensics. However, the limited number of attack samples available for training, and the inability of the data to accurately represent real-world scenarios, pose significant challenges. To address these issues, we propose ASAI, an attack deduction model that leverages auxiliary strategies and dynamic word embeddings. Firstly, ASAI tackles the problem of data imbalance through a sequence sampling method enhanced by a custom auxiliary strategy. Subsequently, the sequences are transformed into dynamic vectors using dynamic word embedding. With these dynamic vectors, the model is trained to capture the spatiotemporal characteristics of entities under diverse contextual conditions. ASAI is evaluated using ten real-world APT attacks executed within an actual virtual environment. The results demonstrate ASAI's ability to successfully recover the key steps of the attacks and construct attack stories, achieving an F1 score of up to 99.70%, a 16.98% improvement over the baseline that uses one-hot embedding after resampling.

Keywords: Causal graphs · Attack forensics · Natural language processing · Entity sequences

1 Introduction

APTs are notorious for their stealth, precision, and targeted nature. They are sophisticated, persistent, and highly effective threats, often leveraging zero-day exploits. As a result, conducting forensics on the symptoms of these attacks becomes a vital and indispensable tool for preventing similar future assaults. In a broader context, attack forensics methods collect diverse audit logs from multiple hosts, applications, and network interfaces. These vast volumes of logs are typically analyzed offline or monitored in real time to diagnose system failures and to identify and address intricate threats and vulnerabilities.


In recent years, the accurate and efficient identification of attack entities within logs has garnered significant attention among scholars. Extensive research has been conducted on constructing causal dependency graphs from audit logs [10,12], enabling query systems to locate key attack phases such as compromised processes or malicious payloads [5,20]. Causal graphs depict the data-flow relationships between system entities [11], enriched with contextual information. Leveraging these causal dependency graphs, some systems employ advanced machine learning (ML) techniques to extract features or sequences from the graphs, facilitating automated intrusion and fault detection [18,26]. Other techniques focus on identifying associations between different log events through event correlation [28], while further studies concentrate on extracting dependency subgraphs [24,27] to represent and detect attack subgraphs, thus aiding in locating the attacking entities.

However, the above approaches face two main challenges in accurately pinpointing the key attack steps. First, research [13] shows that attacking entities account for only 0.29% of non-attacking entities, and this proportion continues to decrease over time, so model training is severely affected by the unbalanced dataset problem [19]. Second, there is an issue regarding the representational validity of sequences, specifically the impact of sequence length and representation on deducing attacking entities. We therefore need to design reasonable sequence sampling and representation techniques, so that sequence modeling can better describe the attacker's behavioral patterns and attack strategies, ultimately improving the model's ability to deduce attack entities.

To address the aforementioned challenges, we propose ASAI, an attack deduction model based on auxiliary strategies and dynamic word embeddings. The approach involves several key steps. Firstly, sequences are extracted from the provenance graph, and the data imbalance issue is tackled through the auxiliary-strategy sequence sampling method. Next, to ensure effective sequence characterization, the problem of variable sequence lengths is addressed using the adaptability of Doc2Vec. Subsequently, a bidirectional word embedding model is employed to capture the spatiotemporal characteristics of the sequences, generating dynamic vectors for characterization. These dynamic vectors serve as inputs to the model, thereby optimizing the deduction of attack entities in the downstream task. The main contributions of this paper are summarized as follows:

• We propose an auxiliary strategy sampling method that enriches attack details, increases the distinction between positive and negative samples, and improves the model's generalization ability.
• We introduce a novel entity sequence representation method based on dynamic word embeddings, which enriches the vector space with more contextual semantics.
• Combining the above methods, we propose ASAI, an attack entity deduction model. Evaluation results of the event-based model demonstrate an F1 score of 99.68%.

2 Related Work

ASAI’s work on attack forensics is related to three main directions, which are, causality analysis, anomaly-based analysis, and application of machine-learning techniques. Causality Analysis. Numerous studies have made significant contributions to the field of causality analysis [7,17,22]. These studies have focused on streamlining provenance graphs using techniques like software reverse engineering or runtime dynamic tracing. However, it is important to acknowledge that the flow of units can vary across different applications, and reversing a program can impose a heavy workload. Additionally, this process typically requires kernellevel daemon development, making it less portable and adaptable to different environments. In order to address challenges associated with causality analysis, researchers have proposed various approaches [5,8,25]. These strategies aim to mitigate issues such as coarse-grained analysis and dependency explosion. Heuristics such as log fusion, threat scoring methods, label decay, and other rules have been employed. While these techniques provide some relief, it is crucial to note that the rules themselves need to be regularly updated and maintained. As time passes and scenarios evolve, managing and updating these rules becomes prohibitively expensive. Anomaly-Based Analysis. Anomaly-based approaches have proven effective in detecting unknown attacks by learning the normal behavior of a system to identify deviations. However, they are susceptible to higher false positive rates, because system instability and bias in the training dataset caused by prolonged operations. Priotracker [16] proposes a forward-backward causal tracker for anomaly detection, analyzing the scarcity of events to detect anomalies. Nodoze [6] computes anomaly scores based on event frequency and reduces false alarms by aggregating these scores. Provdetector [23] identifies stealth malware by learning the sequence of normal execution paths from the provenance graph. Log2vec [15] customizes the representation of system logs to identify invisible anomaly sequences in a clustered manner to detect stealth malware. Unicorn [3] constructs graph sketches by learning models from normal source graphs to identify anomalies. In contrast to anomaly-based approaches that primarily focus on learning user behavior, ASAI goes beyond by learning both attack and nonattack sequences. It utilizes their temporal and causal relationships to effectively reduce both false positives and false negatives. By considering the context of attacks and leveraging the interplay between different sequences, ASAI achieves a more comprehensive understanding of attack entities, resulting in improved detection accuracy and lower false alarm rates. Learning-Based Analysis. Learning-based attack forensics [1,4,14,24] utilize machine learning techniques to train detection models using logs. Sigl [4] introduces an LSTM-based self-encoder that captures node context during malware installation. However, the node-level approach is susceptible to the dependency explosion problem, limiting its effectiveness. Atlas [1] combines causal analysis, natural language processing, and machine learning techniques to build sequencebased models. Nevertheless, it struggles to accurately represent actual attack


Nevertheless, it struggles to accurately represent actual attack scenarios due to data sampling limitations. ProGrapher [24] extracts temporal snapshots from logs to detect attacks, while LogKernel [14] employs graph kernels and clustering to distinguish attacks from benign activities. Additionally, ShadeWatcher [27] identifies network threats by predicting the preferences of system entities using graph neural networks. However, these graph-level approaches [2,9,24,27] often yield coarse-grained results that do not fully capture the entire attack process. In contrast, ASAI only requires extracting and training on sequences; with the obtained model, ASAI can pinpoint attack entities at the node level, providing more fine-grained insight into the attack process.

3 Proposed Method

Initially, we construct the provenance graph from the log data and sample sequences derived from the graph. Subsequently, we characterize the processed sequences and train the model. The ASAI (Auxiliary Strategy based Attack Investigation) architecture, shown in Fig. 1, is described in detail below.

Fig. 1. Overview of ASAI architecture.

3.1 Definitions

Neighborhood Graph. A neighborhood graph refers to a subset of the causal graph that encompasses all entities and the relationships (edges) between them. This type of graph is commonly employed to depict the localized processes associated with the activities of an entity. Event. An event is extracted from the neighborhood graph to delineate a particular activity taking place within the system’s operation. A fundamental event comprises four essential elements (Esrc , Operation, Edest , T ), where Esrc represents the source entity initiating the event, while Edest denotes the target entity receiving the operation. The operation signifies the specific action being performed between the entities. Lastly, T denotes the timestamp indicating the occurrence of the event. Sequence. We extract a sequence related to a specific entity from the causal graph. The sequence comprises all the events present in the neighborhood graph of the entity, arranged in chronological order. 3.2

Sequence Extraction

In the process of extracting training data for the model, we depend on attack entities to isolate attack and non-attack sequences. During the simulation of an

344

T. Jiang et al.

attack, entities involved in the malicious operation, including files, processes, etc., are labeled as malicious. On the other hand, entities not directly associated with the attack are marked as benign. Attack Sequence Extraction. The attack sequence extraction steps are as follows. We 1) obtains the set N of all attacking entities from the provenance graph, 2) traverse the subset of attacking entities in the set N1 {n1 , n1 }, .... , Ni {nj , ... , nk }. 3) For each attack entity subset Ni , ASAI extracts its neighborhood graph, followed by a sequence S from the established neighborhood graph, and the sequence S consists of attack events E arranged in chronological order. 4) The source or target nodes in the event belong to N , then the event E is defined as an attack event. If all the events in the sequence are attack events, then the sequence is defined as an attack sequence. Non-attack Sequence Extraction. The goal of sequence model learning is to accurately learn and identify the boundary between malicious and nonmalicious activities. Thus for each non-attack entity, the non-attack sequences are extracted by going through the following steps. We 1) add a non-attacking entity to each subset of attacking entities to form a new subset of entities, 2) extract sequences from the neighborhood graph constructed from the new subset. 3) If the sequences do not match any of the extracted attack sequences, they are marked as non-attack sequences, otherwise, the processed sequences are discarded. 3.3

Sequence Sampling Based on Auxiliary Strategy

Fig. 2. Sequence Sampling Based on Auxiliary Strategy. SD, DP, EM, and SR stand for Similarity Deduplication, Distance Preservation, Entity Mutation, Sequence Recombination, respectively.

An Attack Entity Deducing Model for Attack Forensics

345

We introduce an auxiliary strategy to address the issue of data imbalance. This strategy consists of two components: the attack sequence oversampling strategy and the non-attack sequence undersampling strategy. Illustrated in Fig. 2, the original attack sequence comprises a limited number of attack stages. To tackle this limitation, we use the available attack events to create additional attack sequences that represent different attack phases. This approach helps generate more diverse samples of attack sequences with unique meanings. Regarding nonattack sequences, it is crucial not only to reduce their overall count but also to maintain differentiation from the attack sequences. To achieve this, we propose two techniques. Firstly, we use a non-attack sequence deduplication method based on the edit distance algorithm, which can remove redundant non-attack sequences. Secondly, we introduce a non-attack sequence optimization approach based on the sequence comparison algorithm, this method refines the non-attack sequences and ensures their distinctiveness from the attack sequences. By combining these strategies, we aim to mitigate the data imbalance problem, enhance the representation of both attack and non-attack sequences, and foster a more effective training process for our model. 3.3.1 Attack Sequence Oversampling Strategy Entity Mutation Oversampling Method. Sequence contains four major types of process, file, network, and operation after sequence lemmatization [21], as shown in Table 1. For the extracted attack sequences, one word is randomly changed to another word of the same type based on lemmatization. This replacement does not change the mutation sequence, but it adds some similar sequences. Table 1. Vocabulary set for lemmatization. Vocabulary Types Vocabulary List Process

system process, lib process, user process, etc.

File

system file, lib file, user file, combined files, etc.

Network

ip address, domain, url, connection, session, etc.

Operation

read, write, delete, execute, invoke, fork, refer, bind, etc.

Sequence Recombination Oversampling Method. The sequence recombination oversampling method is founded on the observation, which shows an attacker’s attack process can be divided into four crucial phases: defensive breakthrough, channel establishment, lateral penetration, and information stealing. Each of these phases involves distinct attack strategies that result in different attack sequences. By combining attack sequences from various phases, new attack scenarios can be formed, leading to the generation of recombinant sequences. For example, {S1 , S2 , ..., Sn } represents the channel establishment phase, and by Sequence recombination we can get {Si , Sj , ..., Sm }, which is a sequence that has not appeared in other attack phases with the same sequence.

346

T. Jiang et al.

The recombinant sequences effectively contribute to the enrichment of the attack sample, significantly enhancing its diversity and representativeness. 3.3.2 Non-attack Sequence Undersampling Strategy Similarity Filtering Undersampling Method. The non-attack sequence filtering method, based on the edit distance algorithm (LD), aims to assess the similarity between non-attack sequences after word reduction. When the similarity between two non-attack sequences exceeds a defined threshold, these sequences are filtered out. The LD algorithm calculates the minimum number of transformations required to convert one sequence into another, considering possible operations such as character insertion, deletion, or replacement. To calculate the similarity between two sequences using LD, we determine the shortest path in the edit distance matrix from the top left corner to the bottom right corner. Each step in the path incurs a cost of 1, and the length of the path corresponds to the value of the LD. By utilizing LD, we can calculate the shortest distance by inserting, deleting, and replacing words in the vocabulary. If |S1 | and |S2 | are used to denote the lengths of S1 , S2 non-attack sequences respectively, then their LD are levS1 ,S2 (|S1 |, |S2 |), which is calculated as ⎧ ⎪ ⎪ ⎨

max(i, j) if min(i, j) = 0 levS1 ,S2 (i − 1, j) + 1 levS1 ,S2 (i, j) = levS1 ,S2 (i, j − 1) + 1 min otherwise ⎪ ⎪ ⎩ ⎩ levS1 ,S2 (i − 1, j − 1) + 1(ai =bj ) ⎧ ⎨

(1)

where levS1 ,S2 (i, j) denotes the LD between the first i characters of S1 and the first j characters of S2 . Distance-Preserving Undersampling Method. The non-attack sequence optimization method, based on the sequence comparison algorithm (SW), filters out non-attack sequences that exhibit excessive local similarity to the attack sequence. This is achieved by calculating the local sequence correlation between the attack and non-attack sequences. To illustrate this method, let’s consider the attack sequence A = (a1 , a2 , ..., an ) and the non-attack sequence B = (b1 , b2 , ..., bm ) with lengths n and m respectively. After establishing the permutation matrix and null penalty score, a score matrix H with dimensions (n + 1) rows and (m + 1) columns is created. The initial elements of the first row and column in the score matrix are set to 0, while the remaining positions are scored iteratively from left to right and top to bottom using Eq. 2. Subsequently, backtracking is performed from the element with the highest score towards the element with a score of 0, moving from right to left. During this process, the locally similar segments encountered form Similarclips(A, B). Each non-attack sequence is compared with all attack sequences to calculate Similarclips(others, B). The resulting local sequences are stored in the list list Similarclips. The local sequence with the longest length in the list represents the implied attack feature value of the non-attack sequence. This calculation is performed for each non-attack sequence, allowing us to obtain the implied attack feature value for each of them. Subsequently, the non-attack sequences are optimized in descending order based on their feature values. Finally, the

An Attack Entity Deducing Model for Attack Forensics

347

non-attack sequences are optimized in accordance with the number of attack sequences, prioritizing those ranked at the top. ⎧ Hi−1,j−1 + S(ai , bj ) ⎪ ⎪ ⎨ Hi−k,j − W1 (2) Hi,j = Hi,j−l − W1 ⎪ ⎪ ⎩ 0  +3, ai = bj where W1 is 2 (can be set as needed), S(ai , bj ) = . −3, ai = bj 3.4

Sequential Representation Based on Dynamic Word Embedding

We introduce the dynamic word embedding method to characterize the sequence, which includes two parts: static word embedding through Doc2Vec and dynamic word embedding through Bi-GRU, and the dynamic word embedding method is shown in Fig. 3.

Fig. 3. Dynamic word embedding model.

To begin, both attack and non-attack sequences are transformed into static word vector representations using the word embedding feature of Doc2Vec. This static word embedding process is adaptable to sequences of varying lengths. These converted static vectors are then inputted into a two-layer Bidirectional Gated Recurrent Unit (Bi-GRU) network. The purpose of this network is to capture the contextual information and spatio-temporal features of the sequences. By employing a dynamic word embedding process, the Bi-GRU network can provide a more comprehensive characterization of the sequences. Finally, the sequence features are utilized for the downstream task of entity deduction.

3.5 Model Training and Application

Model Training Phase. First, the attack and non-attack entities (labeled by manual tagging) are extracted from the audit logs through preprocessing, and the sets of attack and non-attack entities are obtained from the causal graph. Next, the corresponding attack and non-attack events are extracted from the neighborhood graph, which is constructed from these entity sets, and the events are converted into sequences according to the timestamped order of their occurrence. The attack and non-attack sequences are then made relatively balanced by the auxiliary sampling strategy. Finally, dynamic word embedding encodes the contextual information and temporal relationships into vectors, which serve as inputs to the classification model for learning and prediction.

Attack Entity Deduction Phase. Attack entities are discovered automatically by querying the trained model starting from known attack entities. The deduction proceeds as follows. First, based on the defined attack entities, the entities in the causal graph are divided into attack entities and unknown entities, and a subset is constructed for each unknown entity. The known attack entities are then added to every subset, so that each subset contains all known attack entities and exactly one unknown entity; all subsets are converted into sequences, yielding several sequences of different lengths. Finally, the sequences undergo word-form reduction and word embedding and are passed to the trained model, which scores each sequence as attack or non-attack. If a sequence is classified as an attack sequence, the unknown entity in the corresponding subset is inferred to be an attack entity.
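The deduction loop can be summarized by the sketch below; the helper to_sequence (timestamp ordering, lemmatization, and embedding) and the 0.5 decision threshold are assumptions used only for illustration.

```python
# Sketch of the attack entity deduction phase (helper names are illustrative).
def deduce_attack_entities(unknown_entities, known_attack_entities, to_sequence, model):
    deduced = set()
    for entity in unknown_entities:
        subset = set(known_attack_entities) | {entity}   # all known attack entities + one unknown
        sequence = to_sequence(subset)                    # events ordered by timestamp, then embedded
        score = model.predict(sequence)                   # prediction score of the trained classifier
        if score >= 0.5:                                  # classified as an attack sequence
            deduced.add(entity)
    return deduced
```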

4 Experiments

This section presents the following validation experiments: (i) evaluation of the efficiency of sequence sampling with the auxiliary strategies, (ii) assessment of the effectiveness and optimality of dynamic word embedding, and (iii) validation of the attack entity deduction capability of the proposed model.

4.1 Datasets

In this paper, we conducted experiments using ten attacks based on real-world attack reports and generated audit logs in a controlled test environment [1]. To simulate diverse normal user activities, we manually generated benign user activities that encompassed browsing different websites, executing various applications (such as reading emails and downloading attachments), and connecting to other hosts. We aimed to replicate the typical activities of a workday, and these operations were randomly performed within an 8-h time window during the day. Table 2 provides an overview of the specific attacks for which evaluation data were generated. Attack IDs S-1 to S-4 indicate attacks executed on a single host, while IDs M-1 to M-6 represent attacks executed across multiple hosts.

4.2 Environment

The experiments in this section were run on a local client and a remote server. The local host uses an AMD Ryzen 5 2400G, and the remote server uses an Intel Core i9-12900KF with 32 GB of RAM. Code is debugged in PyCharm on the local host and deployed over SSH to a Docker container on the server. The server-side environment is Python 3.7.7, TensorFlow 2.3.0, Keras 2.4.3, Matplotlib 2.2.5, NumPy 1.19.5, FuzzyWuzzy 0.18.0, and NetworkX 2.2. For word embedding, the maximum number of input features is 31, the embedding dimension is 128, and the maximum input sequence length is 400; the two-layer GRU is trained with a batch size of 100 for 10 epochs, and its output dimension is 256.

Table 2. Evaluation data generated by implementing attacks.

Attack ID | Entity (Attack / Benign / Total) | Event (Attack / Benign / Total) | Sequence (Attack / Benign / Total)
S-1 | 22 / 7,445 / 7,468 | 4,598 / 90,464 / 95,062 | 42 / 13,698 / 13,740
S-2 | 12 / 34,008 / 34,021 | 15,073 / 382,879 / 397,952 | 42 / 13,366 / 13,408
S-3 | 26 / 8,972 / 8,998 | 5,165 / 123,152 / 128,317 | 21 / 8,077 / 8,098
S-4 | 21 / 13,016 / 13,037 | 18,062 / 107,551 / 125,613 | 32 / 11,693 / 11,725
M-1 | 28 / 17,565 / 17,599 | 8,168 / 243,507 / 251,675 | 95 / 30,626 / 30,721
M-2 | 36 / 24,450 / 24,496 | 34,965 / 249,365 / 284,330 | 98 / 31,143 / 31,241
M-3 | 36 / 24,424 / 24,481 | 34,979 / 299,157 / 334,136 | 98 / 31,542 / 31,640
M-4 | 28 / 15,378 / 15,409 | 8,236 / 250,512 / 258,748 | 89 / 29,787 / 29,876
M-5 | 30 / 35,671 / 35,709 | 34,175 / 667,337 / 701,512 | 90 / 28,523 / 28,613
M-6 | 42 / 19,580 / 19,666 | 9,994 / 344,034 / 354,028 | 82 / 25,730 / 25,812

4.3 Evaluation Method

To assess the effectiveness of the attack entity deduction method, model training and attack evaluation are conducted separately for each of the ten attacks. For instance, when identifying an attack on S-1, the log data from S-2, S-3, and S-4 are used as training data for the deduction model; similarly, when identifying an attack on M-1, the log data from the other multi-host attacks are used for training. The model is evaluated from two perspectives: entity-based evaluation and event-based evaluation.

Entity-Based Evaluation. First, a number of sequences are generated by combining the input attack entities with unknown entities. The generated sequences are then classified by the model as attack or non-attack sequences, and the unknown entities associated with sequences classified as attacks are determined to be attack entities.


Event-Based Evaluation. Real-world attack forensics is mostly presented in the form of attack events. To further support forensic analysis, attack events must be identified and located: all events in the audit logs are traversed, and an event is marked as an attack event if its subject or object matches an attack entity identified by the entity deduction model. Finally, the entities and events classified as attack or non-attack are compared against their ground-truth labels, and the deduction effectiveness is evaluated using the Accuracy, Precision, Recall, and F1-score metrics.
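A compact sketch of the event-based scoring is shown below, assuming each event record exposes subject and object fields; the field names and the use of scikit-learn metrics are our assumptions.

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

def evaluate_events(events, deduced_attack_entities, true_labels):
    """Mark an event as an attack event if its subject or object is a deduced attack entity."""
    predictions = [
        int(e["subject"] in deduced_attack_entities or e["object"] in deduced_attack_entities)
        for e in events
    ]
    return {
        "accuracy": accuracy_score(true_labels, predictions),
        "precision": precision_score(true_labels, predictions),
        "recall": recall_score(true_labels, predictions),
        "f1": f1_score(true_labels, predictions),
    }
```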

Fig. 4. Effect of Threshold Selection on Efficiency.

4.4 Efficiency Experiment of Sequence Sampling

Attack sequences are oversampled by randomly recombining sequences and mutating individual entities, which is computationally lightweight. In contrast, non-attack sequences are undersampled by computing pairwise sequence similarity, which is computationally intensive: the experiments show that non-attack sequence undersampling accounts for more than 98% of the total sampling time. The undersampling time is also closely related to the sampling threshold. The results of the average sampling time under different thresholds are shown in Fig. 4. The sampling time is positively correlated with the threshold, and the time overhead grows faster once the threshold exceeds 0.8; therefore, the threshold is set to 0.8 in this experiment.

4.5 Comparison Experiment of Sequence Sampling

The experiments in this subsection evaluate the optimality of the auxiliary sampling strategy. Keeping all other parameters and configurations unchanged,


the effects of three different sampling methods are examined: undersampling only, oversampling only, and the auxiliary-strategy sequence sampling method. The results are compared in Table 3. The model achieves the best detection performance when the auxiliary sampling strategy is employed, consistently scoring over 99% on all evaluation metrics.

Table 3. Experimental results of different sampling methods.

Sampling method | Average Accuracy | Average Recall | F1-score
Only undersampling | 96.72% | 82.61% | 89.10%
Only oversampling | 98.33% | 85.54% | 91.49%
Auxiliary strategy sequence sampling | 99.66% | 99.70% | 99.68%

4.6 Ablation Experiment of Word Embedding

We train the same model with different representation methods and use the resulting models for attack entity deduction on the datasets; the best results of the experiments are shown in Fig. 5. One-hot embedding exhibits poor detection performance; in particular, its recall is only 82.72%. This gap can be attributed to the limitation of one-hot embedding: it is a discrete encoding that ignores the semantic relationships between words. Static word embedding alleviates this problem by mapping the words of a sequence into a fixed vector space, achieving 97.42% accuracy, 96.33% recall, and a 96.87% F1 score, but it is still inferior to dynamic word embedding. In contrast, the dynamic word embedding method

Fig. 5. Experimental results of different word embedding methods.


enhances the feature description and identification of the attack sequences. The experimental results demonstrate that our model achieves the best detection performance among the tested methods, with all scores exceeding 99%.

4.7 Model Evaluation Experiment

The experiments in this subsection validate the model's ability to deduce unknown attack entities from audit logs given the known attack entities, and to identify attack events based on the deduced entities. The results of the entity-based and event-based evaluations are presented in Table 4. The average scores for all metrics are above 94%, and the event-based evaluation reaches about 99%, indicating that the majority of attack entity samples can be detected. The method also produces few false positives while recovering the key entities. Event-based evaluation works better than entity-based evaluation because it introduces additional semantics at the event level, allowing the model to capture more information.

Table 4. Model evaluation results (Entity / Event).

Model | Accuracy | Precision | Recall | F1-score
SM-1 | 99.92% / 99.96% | 95.24% / 99.11% | 90.91% / 99.26% | 93.02% / 99.19%
SM-2 | 99.98% / 100.00% | 92.31% / 99.56% | 100.00% / 99.99% | 96.00% / 99.77%
SM-3 | 99.99% / 99.97% | 92.59% / 99.92% | 96.15% / 99.73% | 94.34% / 99.83%
SM-4 | 99.94% / 99.98% | 95.00% / 99.70% | 90.48% / 99.88% | 92.68% / 99.79%
MM-1 | 99.98% / 99.99% | 96.43% / 99.84% | 96.43% / 99.60% | 96.43% / 99.72%
MM-2 | 99.99% / 99.99% | 94.59% / 99.96% | 97.22% / 99.96% | 95.89% / 99.96%
MM-3 | 99.98% / 99.99% | 97.14% / 99.96% | 94.44% / 99.87% | 95.77% / 99.92%
MM-4 | 99.97% / 99.97% | 92.59% / 99.69% | 89.29% / 99.21% | 90.91% / 99.45%
MM-5 | 99.98% / 99.99% | 96.55% / 99.75% | 93.33% / 99.93% | 94.92% / 99.84%
MM-6 | 99.96% / 99.98% | 95.35% / 99.10% | 97.62% / 99.59% | 96.47% / 99.35%
Average | 99.97% / 99.98% | 94.78% / 99.66% | 94.59% / 99.70% | 94.68% / 99.68%

5 Conclusion

In this paper, we introduce a framework for threat entity deduction that leverages an auxiliary strategy and dynamic word embedding. The proposed framework encompasses several stages. First, the audit logs are preprocessed, and events are converted into sequences to facilitate model learning. Then, an auxiliary sampling strategy is employed to enhance the diversity of attack sequences while


reducing noise interference from non-attack sequences. Additionally, a dynamic word embedding method is applied to capture the spatio-temporal characteristics embedded within the sequences. Through model training, the framework enables the recognition of attack patterns formed by both known and unknown attack entities. Experimental results demonstrate that the deduction accuracy of our approach, referred to as ASAI, reaches an impressive 99.66%. While ASAI demonstrates promising results using Windows Event, DNS resolution logs, and Firefox browsing logs, it is important to note that experiments have not been conducted on other datasets. As a result, the deduction ability of ASAI for different application logs remains unverified. This presents an opportunity for future research and the next steps in our work. Acknowledgements. This work was supported in part by the National Key Research and Development Program of China (No. 2020YFB1805400); in part by the National Natural Science Foundation of China (No. U19A2068, No. 62032002, and No. 62101358); in part by Youth Foundation of Sichuan (Grant No. 2023NSFSC1395); in part by the China Postdoctoral Science Foundation (No. 2020M683345); Fundamental Research Funds for the Central Universities (Grant No. 2023SCU12127); in part by Joint Innovation Fund of Sichuan University and Nuclear Power Institute of China (Grant No. HG2022143).

References 1. Alsaheel, A., et al.: Atlas: a sequence-based learning approach for attack investigation. In: USENIX Security Symposium, pp. 3005–3022 (2021) 2. Gilmer, J., Schoenholz, S.S., Riley, P.F., Vinyals, O., Dahl, G.E.: Neural message passing for quantum chemistry. In: International Conference on Machine Learning, pp. 1263–1272. PMLR (2017) 3. Han, X., Pasquier, T., Bates, A., Mickens, J., Seltzer, M.: Unicorn: runtime provenance-based detector for advanced persistent threats. arXiv preprint arXiv:2001.01525 (2020) 4. Han, X., et al.: SIGL: securing software installations through deep graph learning. In: USENIX Security Symposium, pp. 2345–2362 (2021) 5. Hassan, W.U., Bates, A., Marino, D.: Tactical provenance analysis for endpoint detection and response systems. In: 2020 IEEE Symposium on Security and Privacy (SP), pp. 1172–1189. IEEE (2020) 6. Hassan, W.U., et al.: NODOZE: combatting threat alert fatigue with automated provenance triage. In: Network and Distributed Systems Security Symposium (2019) 7. Hassan, W.U., Noureddine, M.A., Datta, P., Bates, A.: OmegaLog: high-fidelity attack investigation via transparent multi-layer log analysis. In: Network and Distributed System Security Symposium (2020) 8. Hossain, M.N., Sheikhi, S., Sekar, R.: Combating dependence explosion in forensic analysis using alternative tag propagation semantics. In: 2020 IEEE Symposium on Security and Privacy (SP), pp. 1139–1155. IEEE (2020) 9. Kapoor, M., Melton, J., Ridenhour, M., Krishnan, S., Moyer, T.: PROV-GEM: automated provenance analysis framework using graph embeddings. In: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), pp. 1720–1727. IEEE (2021)


10. Khoury, J., Upthegrove, T., Caro, A., Benyo, B., Kong, D.: An event-based data model for granular information flow tracking. In: Proceedings of the 12th USENIX Conference on Theory and Practice of Provenance, p. 1 (2020) 11. Kwon, Y., et al.: MCI: modeling-based causality inference in audit logging for attack investigation. In: NDSS, vol. 2, p. 4 (2018) 12. Lagraa, S., Amrouche, K., Seba, H., et al.: A simple graph embedding for anomaly detection in a stream of heterogeneous labeled graphs. Pattern Recogn. 112, 107746 (2021) 13. Landauer, M., Skopik, F., Wurzenberger, M., Hotwagner, W., Rauber, A.: Have it your way: generating customized log datasets with a model-driven simulation testbed. IEEE Trans. Reliab. 70(1), 402–415 (2020) 14. Li, J., Zhang, R., Liu, J., Liu, G., et al.: LogKernel: a threat hunting approach based on behaviour provenance graph and graph kernel clustering. In: Security and Communication Networks 2022 (2022) 15. Liu, F., Wen, Y., Zhang, D., Jiang, X., Xing, X., Meng, D.: Log2vec: a heterogeneous graph embedding based approach for detecting cyber threats within enterprise. In: Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security, pp. 1777–1794 (2019) 16. Liu, Y., et al.: Towards a timely causality analysis for enterprise security. In: NDSS (2018) 17. Ma, S., Zhai, J., Wang, F., Lee, K.H., Zhang, X., Xu, D.: MPI: multiple perspective attack investigation with semantic aware execution partitioning. In: USENIX Security Symposium, pp. 1111–1128 (2017) 18. Michael, N., Mink, J., Liu, J., Gaur, S., Hassan, W.U., Bates, A.: On the forensic validity of approximated audit logs. In: Annual Computer Security Applications Conference, pp. 189–202 (2020) 19. Milajerdi, S.M., Gjomemo, R., Eshete, B., Sekar, R., Venkatakrishnan, V.: Holmes: real-time apt detection through correlation of suspicious information flows. In: 2019 IEEE Symposium on Security and Privacy (SP), pp. 1137–1152. IEEE (2019) 20. Nieto, A.: Becoming JUDAS: correlating users and devices during a digital investigation. IEEE Trans. Inf. Forensics Secur. 15, 3325–3334 (2020) 21. Plisson, J., Lavrac, N., Mladenic, D., et al.: A rule based approach to word lemmatization. In: Proceedings of IS, vol. 3, pp. 83–86 (2004) 22. Tabiban, A., Zhao, H., Jarraya, Y., Pourzandi, M., Zhang, M., Wang, L.: ProvTalk: towards interpretable multi-level provenance analysis in networking functions virtualization (NFV). In: The Network and Distributed System Security Symposium 2022 (NDSS 2022) (2022) 23. Wang, Q., et al.: You are what you do: hunting stealthy malware via data provenance analysis. In: NDSS (2020) 24. Yang, F., Xu, J., Xiong, C., Li, Z., Zhang, K.: PROGRAPHER: an anomaly detection system based on provenance graph embedding (2023) 25. Yu, L., et al.: ALchemist: fusing application and audit logs for precise attack provenance without instrumentation. In: NDSS (2021) 26. Zeng, J., Chua, Z.L., Chen, Y., Ji, K., Liang, Z., Mao, J.: WATSON: abstracting behaviors from audit logs via aggregation of contextual semantics. In: NDSS (2021) 27. Zengy, J., et al.: SHADEWATCHER: recommendation-guided cyber threat analysis using system audit records. In: 2022 IEEE Symposium on Security and Privacy (SP), pp. 489–506. IEEE (2022) 28. Zhu, T., et al.: General, efficient, and real-time data compaction strategy for APT forensic analysis. IEEE Trans. Inf. Forensics Secur. 16, 3312–3325 (2021)

Semi-supervised Classification on Data Streams with Recurring Concept Drift Based on Conformal Prediction

ShiLun Ma1, Wei Kang1, Yun Xue2, and YiMin Wen1(B)

1 Guangxi Key Laboratory of Image and Graphic Intelligent Processing, Guilin University of Electronic Technology, Guilin 541004, China
[email protected]
2 School of Municipal and Surveying Engineering, Hunan City University, Yiyang 413000, China

Abstract. In this article, we consider the problem of semi-supervised data stream classification. Its main difficulties include how to jointly use labeled and unlabeled samples for concept drift detection and how to use unlabeled samples to update the trained classifier. Existing algorithms such as CPSSDS retrain a new classifier whenever concept drift is detected, which is time-consuming and wasteful. In this paper, a semi-supervised data stream classification algorithm for recurring concept drift, named CPSSDS-R, is proposed. First, the labeled samples in the first data block are used to initialize a classifier, which is added to a pool and activated for classification. When a new data block arrives, concept drift is detected by comparing conformal prediction results. If no concept drift is detected, the pseudo-labeled samples of the previous data block are combined with the labeled samples of the current data block to incrementally train the active classifier. If a new concept is detected, a new classifier is trained on the labeled samples of the current data block, added to the pool, and activated for classification; if a recurring concept is detected, the pseudo-labeled and labeled samples of the current data block are used to incrementally update the classifier in the pool that corresponds to the recurring concept, which is then activated for classification. The proposed algorithm is tested on multiple synthetic and real datasets, and its cumulative accuracy and block accuracy at different labeling ratios demonstrate its effectiveness. The code for the proposed algorithm is available at https://gitee.com/ymw12345/cpssds-r.

Keywords: Semi-supervised classification · Ensemble learning · Model reuse · Concept drift

Supported by the National Natural Science Foundation of China (62366011), the Key R&D Program of Guangxi under Grant (AB21220023), and Guangxi Key Laboratory of Image and Graphic Intelligent Processing (GII P2306). c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1969, pp. 355–366, 2024. https://doi.org/10.1007/978-981-99-8184-7_27

1 Introduction

With the development of technology, a large amount of data is generated in the form of data streams by real computing devices [1], for example in applications such as social networks, network intrusion detection, and weather forecasting [2]. Moreover, the amount of labeled data is limited in many cases, so using machine learning methods to handle semi-supervised data stream classification effectively is very challenging. In CPSSDS [3], conformal prediction is used to calculate the confidence of unlabeled samples in adjacent data blocks and to detect concept drift. If concept drift occurs, the existing model is discarded and a new model is trained on the current data block. A main advantage of that work is that conformal prediction serves both for concept drift detection and for selecting self-labeled data to update the trained model. However, when a concept reappears, a new model must still be retrained and no historical model can be reused [4], which hurts the classification accuracy of CPSSDS. To address this problem, this paper proposes a semi-supervised classification algorithm called CPSSDS-R, built on CPSSDS [3]. The proposed algorithm maintains a classifier pool that stores classifiers trained on data from different concepts. It detects whether concept drift has occurred between the current and the previous data block; if so, the conformal prediction output of the current data block is compared with the conformal prediction outputs of the component classifiers in the pool to detect a recurring concept. If a recurring concept is detected, the corresponding component classifier is updated and activated for classification; otherwise, a new classifier is initialized on the labeled data of the current data block, used for classification, and added to the pool. The main innovations of this paper are as follows: 1) Following the idea of ensemble learning, a classifier pool is maintained that retains classifiers trained on data from different concepts, which addresses CPSSDS's inability to improve classification accuracy by reusing historical models. 2) A method for detecting recurring concepts based on conformal prediction is proposed: by comparing the conformal prediction outputs of the component classifiers in the pool with the conformal prediction output of the current data block, recurring concept drift can be detected effectively.

2 Related Work

In this section, we first introduce algorithms related to concept drift detection and then discuss related work on semi-supervised data stream classification. Concept drift detection algorithms identify change points or change time intervals by quantifying statistical measures of change [5]. Based on the test statistics they use, they can be roughly divided into two categories: methods


based on classifier accuracy and methods based on data distribution. In accuracybased drift detection methods [6], changes in performance are detected through statistical tests. The second category of detection algorithms quantifies the dissimilarity between previous and current data distributions [7]. The disadvantage of accuracy-based drift detection methods is that concept drift can only be detected when the classifier’s performance declines. This article proposes a distribution-based concept drift detection method that utilizes conformed prediction output of unlabeled data. Recurring concepts can be detected. Wen et al. first proposed a review on semi-supervised data stream classification algorithms [8]. Next, we will introduce some of the more classical semi-supervised data stream classification algorithms. In semi-supervised classification algorithms for data streams, SmSCluster [9] and ReaSC [10] adopt a semi-supervised approach to construct a collection of classifiers for classification, discarding the worst-performing models from both the old and newly trained models. Going further, SPASC [11] uses the EM algorithm to update classifiers based on labeled data in data blocks and performs sequential classification on instances in the block through a dynamic weighting adjustment mechanism. Addressing the limitations of SPASC, Liu et al. [12] proposed SCBELS, which uses BIRCH ensembles and local structure mapping. [13] maintains a set of clusters as learning models to capture the distribution of data streams and uses KNN for classification of data streams. Each cluster is assigned a weight that dynamically changes based on real-time classification accuracy. Another algorithm similar to SSE-PBS [14] is a high-confidence-based unlabeled sample selection strategy, which can effectively mine and utilize information from unlabeled samples and implicitly adapt to concept drift. Zheng et al. [15] proposed the ESCR algorithm, which calculates the Jensen-Shanon divergence between two distributions, detects significant changes in classifier confidence, and detects repeated concept drift. Khezri et al. [16] proposed the STDS algorithm based on a self-training algorithm, which uses the Kullback-Leibler divergence to measure the distribution difference between consecutive blocks and detect concept drift. Tanha et al. [3] proposed CPSSDS, which uses incremental classifiers as base learners and a self-training framework to augment labeled samples, thereby addressing the scarcity of labeled samples.

3 Proposed Algorithm

This section introduces the overall framework of CPSSDS-R and then describes the proposed method in more detail.

3.1 CPSSDS-R

Like CPSSDS, the CPSSDS-R algorithm is a semi-supervised self-training [17] data stream classification framework. In a semi-supervised data stream environment, only a small portion of the samples in each data chunk may be labeled, and a data stream is represented as D1, D2, D3, ..., D∞. Each data


Fig. 1. The framework of CPSSDS-R algorithm

block consists of labeled and unlabeled samples, and each data block is divided into a training set (80%) and a test set (20%); the labeled samples are further divided into a labeled sample set L (70%) and a calibration set C (30%). The CPSSDS-R framework is shown in Fig. 1, and its technical details are provided in the following sections. Table 1 lists the commonly used symbols. The pseudocode of CPSSDS-R is shown in Algorithm 1, which uses an incremental Hoeffding tree as the base classifier. The steps are as follows: lines 1-2 initialize the model and the classifier pool. Lines 6-7 perform inductive conformal prediction on the unlabeled samples of adjacent data blocks, using the method shown in Algorithm 2. Line 8 performs concept drift detection as shown in Algorithm 3. Lines 9-10 indicate that, if concept drift has occurred, Algorithm 4 is used for recurring concept drift detection. Lines 12-16 indicate that, if a recurring concept is detected, the component classifier with the highest similarity to the concept of the current data block is selected and incrementally updated. Lines 18-22 indicate that, if no recurring concept is detected, a new classifier is trained on the labeled samples of the current data block and added to the classifier pool. Lines 24-25 indicate that, if the test value is greater than or equal to 0.05, Algorithm 5 is used to select a pseudo-labeled sample set with high information gain and add it to the labeled sample set of the current data block. Line 26 updates the model incrementally.

3.2 Inductive Conformal Prediction

Inductive Conformal Prediction (ICP) proposed in CPSSDS [3] is often used as a framework to determine the set of prediction labels for the relevant confidence


Table 1. Symbols.

Variable | Description
D^t | the t-th data chunk
L^t | labeled set of the t-th chunk
C^t | calibration set of the t-th chunk
U^t | unlabeled set of the t-th chunk
A | non-conformity measure
H^t | trained model on the t-th chunk
M | classifier pool
Y | class set
ε | significance level
p_t | p-values of U^t
p^i | p-values of the i-th classifier in M
p^y_t | p-values of the samples in the t-th data block with predicted category y
p^y_{t,xi} | p-value of the pair (x_i, y) in the t-th chunk
α^y_{xi} | non-conformity value of the pair (x_i, y)
I | informative samples set
S_u | confidence level for the predicted label of sample u

values of test samples. The core components of ICP are the non-conformity function and the p-values. The non-conformity function measures how well each (x, y) pair conforms to the other samples; the p-value measures, for a new sample, the proportion of calibration samples that are at least as non-conforming for class y. The steps of ICP are as follows (Algorithm 2 gives the pseudocode):
1) Train an incremental Hoeffding tree classifier on the labeled sample set L of the data chunk.
2) Calculate the non-conformity value α^y_{x_c} of each sample in the calibration set C according to Eq. (1):

$$\alpha^{y}_{x_i} = 1 - \frac{1 + g(y)}{K + \sum_{j=1}^{K} g(y_j)} \tag{1}$$

where g(y) is the number of samples of class y that fall into the same leaf of the tree as x_i, and K is the number of classes.
3) Calculate the non-conformity value α^y_{x_i} of each sample in the unlabeled set, compare it with the non-conformity values of the calibration set, and compute the p-values using Eq. (2):

$$p^{y}_{x_i} = \frac{\left|\{\,c = 1, \ldots, n \mid \alpha^{y}_{x_c} \geq \alpha^{y}_{x_i}\,\}\right|}{n} \tag{2}$$

where n is the number of samples in the calibration set and the superscript y indicates the p-value of sample x_i for class y.
4) Select the unlabeled samples whose p-value is greater than or equal to the significance level ε and label them with pseudo labels.
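The following sketch illustrates Eqs. (1)-(2) and step 4; the representation of leaf statistics as a per-class count dictionary and the function names are our assumptions.

```python
# Sketch of the ICP step: non-conformity (Eq. 1), p-values (Eq. 2), pseudo-labelling (step 4).
def nonconformity(g, y, K):
    """g maps each class to the count of samples sharing the example's leaf; Eq. (1)."""
    return 1.0 - (1.0 + g[y]) / (K + sum(g.values()))

def p_values(calibration_alphas, alpha_by_class):
    """Eq. (2): per class, the fraction of calibration scores at least as non-conforming."""
    return {y: sum(a >= alpha for a in calibration_alphas[y]) / len(calibration_alphas[y])
            for y, alpha in alpha_by_class.items()}

def pseudo_label(unlabeled_pvalues, epsilon):
    """Step 4: keep samples whose best-class p-value reaches the significance level."""
    selected = []
    for x, pv in unlabeled_pvalues:
        y_best = max(pv, key=pv.get)
        if pv[y_best] >= epsilon:
            selected.append((x, y_best))
    return selected
```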


Algorithm 1: CPSSDS-R
Input: D = {D1, D2, ..., Dt, ...}, significance level ε, class set Y
1:  initialize the classifier H^1 based on L^1
2:  M ← ∅
3:  M ← M ∪ {H^1}
4:  t ← 2
5:  while data chunk D^t is available do
6:      p_{t-1} ← ICP(H^{t-1}, C^{t-1}, U^{t-1})
7:      p_t ← ICP(H^{t-1}, C^{t-1}, U^t)
8:      drift ← Drift_Detection(p_{t-1}, p_t, Y)
9:      if drift then
10:         recurring_concept ← Recurring_Detection(M, U^t, Y)
11:         if recurring_concept then
12:             I ← select_information_sample(U^t)
13:             L^t ← L^t ∪ I
14:             H' ← argmax_{H^i ∈ M}(p^i, p_t)
15:             H' ← incremental_train(H', L^t)
16:             H' is activated for classification
17:         else
18:             initialize H^t based on L^t
19:             if len(M) == max_pool_size then
20:                 remove the earliest classifier in M
21:             M ← M ∪ {H^t}
22:             H^t is activated for classification
23:     else
24:         I ← select_information_sample(U^{t-1})
25:         L^t ← L^t ∪ I
26:         H^t ← incremental_train(H^{t-1}, L^t)
27:     t ← t + 1

3.3 Concept Drift Detection

Concept drift detection consists of two steps: the first detects whether concept drift has occurred between the current data block and the previous one; the second detects whether the current data block corresponds to a recurring concept. Algorithm 3 presents the drift detection procedure. Line 4 applies the KS (Kolmogorov-Smirnov) test for each class y; lines 5-6 compare the p-value distributions of the unlabeled sample sets of two adjacent data blocks and accumulate the resulting test values; lines 7-8 report concept drift if R/|Y| is less than 0.05. The pseudocode for recurring concept drift detection is presented in Algorithm 4.


Algorithm 2: ICP
Input: H, C, U
Output: p_values
1:  p_values ← ∅
2:  C = {Z1, Z2, ..., Zn} = {(x1, y1), (x2, y2), ..., (xn, yn)}
3:  foreach xi ∈ U do
4:      foreach class y ∈ Y do
5:          Ni ← (xi, y)
6:          α^y_{xi} ← 1 − (1 + g(y)) / (K + Σ_{j=1..K} g(yj))
7:          foreach xc ∈ C do
8:              α^y_{xc} ← 1 − (1 + g(y)) / (K + Σ_{j=1..K} g(yj))
9:          p^y_{xi} ← |{c = 1, ..., n : α^y_{xc} ≥ α^y_{xi}}| / n
10: return p_values

Algorithm 3: Drift_Detection
Input: p_{t−1}, p_t, Y
Output: drift (boolean)
1:  drift ← false
2:  R ← 0
3:  foreach y ∈ Y do
4:      pVal ← KStest(p^y_{t−1}, p^y_t)
5:      R ← R + pVal
6:  if R / |Y| < 0.05 then
7:      drift ← true
8:  return drift
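As an illustration, the drift test of Algorithm 3 can be written with SciPy's two-sample KS test as below; the dictionary layout of the per-class conformal p-values is an assumption.

```python
from scipy.stats import ks_2samp

def drift_detection(p_prev, p_curr, classes, alpha=0.05):
    """p_prev / p_curr map each class to the list of conformal p-values of an unlabeled set."""
    R = sum(ks_2samp(p_prev[y], p_curr[y]).pvalue for y in classes)
    return (R / len(classes)) < alpha   # drift when the averaged KS p-value falls below 0.05
```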

Line 3 computes the p-values of each unlabeled sample in U^i for every class label using classifier H^i and calibration set C^i, while Line 4 computes the p-values of each unlabeled sample in U^t for every class label using the same classifier H^i and calibration set C^i. Lines 5-7 indicate that, after all component classifiers have been processed, the index and max_ks value of the component classifier most similar to the concept of the current data block are obtained. Lines 8-9 indicate that, if the max_ks value is greater than 0.1, the index is returned, signalling the detection of a recurring concept.

3.4 Information Sample Selection Strategy

The key step in the self-training framework is information sample selection, in which the most reliable unlabeled samples are chosen to improve the generalization ability of the classifier. The pseudocode for selecting samples is shown in Algorithm 5. For each sample in the unlabeled set, line 2 computes the class label with the largest p-value, line 3 computes the confidence of the sample belonging to that class, and lines 4-5 add the sample to the informative sample set if its confidence is greater than or equal to the significance level. Finally, after all unlabeled samples have been processed, the informative sample set is returned.


Algorithm 4: Recurring_Detection
Input: M, U^t, Y
Output: classifier index
1:  index ← 0, max_ks ← 0
2:  foreach H^i ∈ M do
3:      p^i ← ICP(H^i, C^i, U^i)
4:      p^t ← ICP(H^i, C^i, U^t)
5:      ks ← ks_test(p^i, p^t, Y)
6:      if max_ks < ks then
7:          index ← i
8:          max_ks ← ks
9:  if max_ks > 0.1 then
10:     return index

Algorithm 5: select_information_sample
Input: U
Output: I
1:  foreach xu ∈ U do
2:      yu ← argmax_y p^y_{t,xu}
3:      Su ← p^{yu}_{t,xu}
4:      if Su ≥ ε then
5:          I ← I ∪ {xu}
6:  return I

4 Experiments

4.1 Datasets

To verify the performance of the algorithm, we use 3 real and 6 synthetic datasets [18]. All synthetic datasets are class-balanced. Detailed information on these 9 datasets is given in Table 2.

4.2 Experimental Setup

We use a block-based model to evaluate the proposed framework, where a fixed portion of instances is selected as labeled at the beginning of each block’s training set, set to 10% in the experiments. The experiments are conducted under the following parameter settings: for CPSSDS-R, the significance level for adaptive predictions is tested and adjusted for each dataset. The parameters for comparing algorithms are fixed and use the default values from the original paper. All experiments in the paper are the average results of 10 runs.


Table 2. Properties of the datasets.

Dataset | Attributes | Instances | Classes | Chunk Size
Agrawal | 9 | 130000 | 2 | 5000
RandomRBF | 10 | 300000 | 4 | 5000
Stagger | 3 | 250000 | 2 | 5000
FG2C2D | 2 | 200000 | 2 | 5000
GEAR2C2D | 2 | 200000 | 2 | 5000
MG2C2D | 2 | 200000 | 2 | 5000
Electricity | 8 | 45312 | 2 | 5000
SLDD | 48 | 58509 | 11 | 5000
Shuttle | 9 | 845228 | 10 | 4000

4.3 Algorithm Accumulative Accuracy and Result Analysis

This section analyzes the cumulative accuracy of CPSSDS-R compared with CPSSDS on the 9 datasets under different labeling ratios. The average results over ten runs are shown in Table 3, and paired t-tests at a 95% confidence level are performed on the results. To observe the performance of the proposed algorithm more intuitively, the run chart of cumulative accuracy versus the number of data chunks on each dataset is shown in Fig. 2.

Table 3. Accuracy comparison for CPSSDS and CPSSDS-R on the experimental datasets with a 10% labeling ratio.

Dataset | CPSSDS | CPSSDS-R
Agrawal | 53.01 | 55.24
RandomRBF | 57.79 | 58.98
Stagger | 91.75 | 97.28
FG2C2D | 88.94 | 91.14
GEAR2C2D | 96.07 | 96.79
MG2C2D | 91.37 | 92.17
Electricity | 65.16 | 65.41
SLDD | 46.45 | 49.51
Shuttle | 90.51 | 92.49

The cumulative accuracy comparison of datasets with a labeling rate of 10% is shown in Table 3. It can be seen that, on all datasets, the cumulative accuracy of the CPSSDS-R algorithm is significantly better than that of the CPSSDS algorithm. Figure 2 shows the cumulative accuracy comparison of algorithms for


Fig. 2. Accuracy comparison on the experimental datasets with 10% labeled each

each dataset with a labeling rate of 10%. From the figure, it can be observed that, on the synthetic datasets, the cumulative accuracy of the proposed algorithm is consistently higher than that of CPSSDS. On the real-world datasets, because the location and type of concept drift are unknown, the proposed algorithm has slightly lower cumulative accuracy than CPSSDS in the early stages; however, as the number of data blocks increases, the cumulative accuracy of CPSSDS-R gradually surpasses that of CPSSDS, which is most pronounced on the Shuttle dataset. In Table 4, CPSSDS-R outperforms the other algorithms on most datasets, indicating that detecting recurring concept drift through conformal prediction effectively avoids repeatedly re-initializing the model, reduces the cold-start problem when concepts recur, and yields better classification performance across datasets. On the GEAR2C2D dataset, the accuracy of SCBELS is higher because its component classifier is itself an ensemble, which performs better on this dataset; CPSSDS-R, which detects recurring concepts in a timely manner, has slightly lower accuracy there.


Table 4. Accuracy comparison with other semi-supervised classification algorithms on the experimental datasets with 10% labeled data.

Dataset | SPASC | SCBELS | CPSSDS-R
Agrawal | 49.89 | 51.81 | 55.24
RandomRBF | 55.11 | 55.39 | 58.98
Stagger | 78.38 | 80.89 | 97.28
FG2C2D | 79.98 | 88.11 | 91.14
GEAR2C2D | 94.45 | 97.35 | 96.79
MG2C2D | 55.27 | 84.86 | 92.17
Electricity | 53.56 | 58.96 | 65.41
SLDD | 18.16 | 32.95 | 49.51
Shuttle | 91.16 | 88.56 | 92.49

5 Conclusions

In this paper, we propose the semi-supervised data stream classification algorithm CPSSDS-R. First, a Hoeffding tree model is trained on the labeled sample set of each data chunk and, within a self-training framework, conformal prediction is used to label unlabeled samples; the samples with the highest information content are selected to expand the labeled sample set. Second, by maintaining a classifier pool and computing the KS statistics between the current data chunk and the component classifiers in the pool, recurring concepts can be detected, and an appropriate component classifier can be selected in time to adapt to recurring concept drift. Finally, the effectiveness of the algorithm is validated on multiple synthetic and real datasets. Future work includes two directions. On the one hand, the incremental Hoeffding tree classifier is always updated incrementally, which may lead to overfitting; a reasonable pruning strategy tailored to the characteristics of semi-supervised data streams should be designed so that trained models can be reused more effectively. On the other hand, the algorithm selects high-confidence pseudo-labeled samples, based on conformal prediction, to expand the labeled sample set; such samples generally lie far from the decision boundary, and ignoring the global distribution may degrade the performance of the classifier.

References 1. Kuo, Y.-H., Kusiak, A.: From data to big data in production research: the past and future trends. Int. J. Prod. Res. 57(15–16), 4828–4853 (2019) 2. Tan, C.H., Lee, V.C., Salehi, M.: Information resources estimation for accurate distribution-based concept drift detection. Inf. Process. Manage. 59(3), 102911 (2022)


3. Tanha, J., Samadi, N., Abdi, Y., Razzaghi-Asl, N.: CPSSDS: conformal prediction for semi-supervised classification on data streams. Inf. Sci. 584, 212–234 (2022) 4. Zhao, P., Zhou, Z.: Learning from distribution-changing data streams via decision tree model reuse. Scientia Sinica Informationis 51(1), 1–12 (2021) 5. Lu, J., Liu, A., Dong, F., Gu, F., Gama, J., Zhang, G.: Learning under concept drift: a review. IEEE Trans. Knowl. Data Eng. 31(12), 2346–2363 (2018) 6. Lu, N., Lu, J., Zhang, G., De Mantaras, R.L.: A concept drift-tolerant case-base editing technique. Artif. Intell. 230, 108–133 (2016) 7. Gu, F., Zhang, G., Lu, J., Lin, C.-T.: Concept drift detection based on equal density estimation. In: 2016 International Joint Conference on Neural Networks (IJCNN), pp. 24–30. IEEE (2016) 8. Wen, Y., Liu, S., Miao, Y., Yi, X., Liu, C.: Survey on semi-supervised classification of data streams with concepts. J. Softw. 33(4), 1287–1314 (2022) 9. Masud, M.M., Gao, J., Khan, L., Han, J., Thuraisingham, B.: A practical approach to classify evolving data streams: training with limited amount of labeled data. In: 2008 Eighth IEEE International Conference on Data Mining, pp. 929–934. IEEE (2008) 10. Masud, M.M., et al.: Facing the reality of data stream classification: coping with scarcity of labeled data. Knowl. Inf. Syst. 33, 213–244 (2012) 11. Hosseini, M.J., Gholipour, A., Beigy, H.: An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams. Knowl. Inf. Syst. 46, 567–597 (2016) 12. Wen, Y.-M., Liu, S.: Semi-supervised classification of data streams by birch ensemble and local structure mapping. J. Comput. Sci. Technol. 35, 295–304 (2020) 13. Din, S.U., Shao, J., Kumar, J., Ali, W., Liu, J., Ye, Y.: Online reliable semisupervised learning on evolving data streams. Inf. Sci. 525, 153–171 (2020) 14. Khezri, S., Tanha, J., Ahmadi, A., Sharifi, A.: A novel semi-supervised ensemble algorithm using a performance-based selection metric to non-stationary data streams. Neurocomputing 442, 125–145 (2021) 15. Zheng, X., Li, P., Hu, X., Yu, K.: Semi-supervised classification on data streams with recurring concept drift and concept evolution. Knowl.-Based Syst. 215, 106749 (2021) 16. Khezri, S., Tanha, J., Ahmadi, A., Sharifi, A.: STDS: self-training data streams for mining limited labeled data in non-stationary environment. Appl. Intell. 50, 1448–1467 (2020) 17. Zhu, X., Goldberg, A.B., Brachman, R., Dietterich, T.: Synthesis lectures on artificial intelligence and machine learning. In: Introduction to Semi-Supervised Learning, vol. 3, no. 1, pp. 1–130. Morgan & Claypool (2009) 18. Montiel, J., Read, J., Bifet, A., Abdessalem, T.: Scikit-multiflow: a multi-output streaming framework. J. Mach. Learn. Res. 19(1), 2914–2915 (2018)

Zero-Shot Relation Triplet Extraction via Retrieval-Augmented Synthetic Data Generation

Qing Zhang1,2, Yuechen Yang1,2, Hayilang Zhang1,2, Zhengxin Gao1,2, Hao Wang1,2, Jianyong Duan1,2(B), Li He1,2, and Jie Liu1,3

1 School of Information Science and Technology, North China University of Technology, Beijing 100144, China
[email protected]
2 CNONIX National Standard Application and Promotion Lab, Beijing 100144, China
3 China Language Intelligence Research Center, Capital Normal University, Beijing, People's Republic of China

Abstract. In response to the challenge that existing relation triplet extraction models struggle to adapt to new relation categories in zero-shot scenarios, we propose a method that combines generated synthetic training data with the retrieval of relevant documents through a rank-based filtering approach for data augmentation. This approach alleviates the problem of low-quality synthetic training data and reduces the noise that may affect the accuracy of triplet extraction for certain relation categories. Experimental results on two public datasets demonstrate that our model exhibits stable and impressive performance compared with the baseline models in terms of precision, recall, and F1 score, improving the effectiveness of zero-shot relation triplet extraction.

Keywords: Relation triplet extraction · Retrieval-augmented · Zero-shot

1 Introduction

Relation triplet extraction is a fundamental aspect of knowledge graph construction, aiming to extract entities and their relationships from various data sources. Existing relation triplet extraction models mostly rely on pipeline-based extraction methods [1], which divide the task into two subtasks: entity extraction and relation extraction. As a result, many models face the risk of error propagation. Some joint extraction models have proposed parameter sharing methods, which increase the correlation between the two tasks through multi-task learning and This work is supported by the National Natural Science Foundation of China (61972003), R&D Program of Beijing Municipal Education Commission (KM202210009002), the Beijing Urban Governance Research Base of North China University of Technology (2023CSZL16), and the North China University of Technology Startup Fund. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1969, pp. 367–379, 2024. https://doi.org/10.1007/978-981-99-8184-7_28


joint optimization. However, they do not truly solve the problem of error propagation. Since 2018, generation-based relation triplet extraction models have been gradually proposed, which are based on RNN and utilize traditional sequence-tosequence deep learning generation frameworks. However, these generative models have not shown significant improvement compared to extraction-based models. It was not until recent years that generation-based pre-training models (such as GPT, T5, and BART were widely used, and effective generation-based triplet extraction models emerged. Generation-based relation triplet extraction models have stronger transferability and scalability compared to extraction-based models, making them more effective for extraction tasks in low-resource scenarios. Relation triplet extraction, as a challenging structured prediction task, has seen limited research in the context of few-shot or zero-shot scenarios [3]. Chen and Li [2] introduced attribute representation learning, projecting sentences and relation descriptions into an embedding space and minimizing the distance in the embedding space to achieve zero-shot relation extraction. This work represents the first attempt at relation extraction in the domain of unseen relations. Zeroshot relation triplet extraction tasks require models to extract triplet information of relation categories that were not seen during training, without providing explicit head and tail entities (Fig. 1). Subsequently, Chia et al. [3] were inspired by the use of large-scale language models as knowledge bases and redefined the zero-shot relation triplet extraction task. They achieved training by generating a large amount of synthetic data for unseen relations. However, ensuring the quality of synthetic data is challenging, especially when the knowledge of desired relations does not exist in the pre-training language model or when the model cannot accurately understand the semantics of relation labels. In such cases, the generated synthetic data can introduce significant noise to the model. Building upon previous research that utilized pre-trained language models as knowledge bases [6], we consider the method of data augmentation for zeroshot tasks using large-scale language models as innovative and effective. However, when augmenting data for each relation, it is crucial to consider whether the generation model has the capability to understand the semantics of the relations and generate sufficiently high-quality synthetic data. To address this issue, we not only use language models to generate synthetic data but also employ retrieval models to search for text passages relevant to relation labels. We ensure the quality of data augmentation by combining these two data sources using a zeroshot reordering approach. In our work, we focus on the task of relation triplet extraction and propose a method that combines retrieval-based augmentation and reordering to further enhance the knowledge correctness, consistency, and diversity of synthetic data. Specifically, we first train synthetic data generator and relation extractor using manually labeled data. During the synthetic data generation, we refer to previous retrieval-based works, where we use the relation labels from the validation and test sets as search sources and employ Dense Passage Retriever (DPR) [5] as a neural retriever to search for relevant documents in non-parametric memory (dense vector index of Wikipedia). 
The retrieved documents are combined with relation labels to create structured data. Addition-


ally, using relation labels as prompts, we generate a certain amount of synthetic data using a well-trained auto-regressive data generation model. Since the quality of synthetic data varies for each relation, we use a language model to re-rank and select the retrieved passages and synthetic data based on their relevance to the relation labels. This approach provides great cross-attention between relation labels and passages, effectively complementing the synthetic data with external knowledge and removing low-quality data noise, thus improving the quality of data augmentation. Finally, the final relation extractor is retrained on the synthetic data. Such an approach endows the model with the capability to transfer from the seen relation domain to the unseen relations, while also leveraging synthetic data for supervised training, thus enabling more accurate and efficient extraction of relation triplets. The main contributions of our work can be summarized as follows: • We introduce a retrieval-based data augmentation method for zero-shot relation triplet extraction tasks, utilizing both the internal knowledge of the generative language model and external retrieval passages to expand training data for zero-shot tasks. • We use a zero-shot relation generation method to calculate the relation entailment probability conditioned on the retrieved and generated passages, and reorder both parts of synthetic data to improve the quality of augmented training data for downstream extraction tasks. • We conduct experiments on two large-scale datasets, the results demonstrate that our model performs well in zero-shot relation triplet extraction tasks and exhibits excellent knowledge transfer ability.

Fig. 1. Samples in train dataset and test dataset. The relation labels “league”, “screenwriter” and “residence” decide which relation class the sample belongs to. Each sample consists of a sentence and a relation triplet contained within the sentence, but the samples in the two datasets belong to different relation classes.

370

2

Q. Zhang et al.

Related Work

Zero-Shot/Few-Shot Relation Triplet Extraction To handle different types of relation extraction flexibly, Cabot and Navigli [9] proposed a generative model that treats relation triplet extraction as a seq2seq task, reducing the number of epochs required for fine-tuning on downstream tasks. For zero-shot learning, RelationPrompt [3] also uses a seq2seq model that generates structured templates for triplets based on the given source context. However, the quality and consistency of synthetic data are not guaranteed since it cannot be ensured that the data generation model contains sufficient relevant knowledge. Kim et al. [10] abandoned the pipeline and data augmentation approaches and redefined the task as a template filling problem. However, it faces difficulties in distinguishing approximate relations in the context of unseen relations and can lead to confusion. Retrieval Augmentation. In recent years, retrieval-augmented methods have shown significant advantages in text generation tasks. RAG [4] uses a dualencoder retrieval module to retrieve relevant documents based on the input text. The retrieved documents are then concatenated with the input text and used as input to the generation module to generate corresponding results. We recognize the importance of a strong correlation between the retrieval documents and the original text. If the quality of these passages is poor, it can introduce noise and affect the overall performance of the model when converting passages into synthetic training samples. Controllable Text Generation. Controllable text generation refers to the generation of text with specific attributes given the model source sequence. Keskar and McCann [7] introduced control codes and prompts as control signals during model pre-training to influence the domain, topic, entities, and style of the generated text. However, it requires adjusting all model parameters, which can be computationally expensive. To make full use of common-sense knowledge to improve the novelty and coherence of generated passages, Yang and Li [8] integrated external knowledge from knowledge bases into the generator using a dynamic memory mechanism to assist the model in generating text. For us, we included a reordering module to selectively generate sentences that better align with the provided control information. This dynamic combination of internal and external knowledge allowed for a more intuitive approach to generate the text that we want.

3

Methodology

The proposed retrieval-augmented zero-shot relation triplet extraction model is illustrated in Fig. 2, which consists of four components: synthetic data generator, document retrieval module, passage ranking module, and triplet extractor. The extraction of relation triplets is ultimately achieved through the triplet extractor. However, after training the model with training data, we also use

ZTE via Retrieval-Augmented Synthetic Data Generation

371

the relation labels in the validation and test sets as text prompts, generating synthetic training data containing domain-specific knowledge corresponding to the relations and retrieving relevant passages from an external knowledge corpus. Once these two parts of data are processed into similar structured texts, they are fed into the passage ranking module for reordering and filtering, resulting in a certain amount of high-quality data, which is then used to retrain the triplet extractor, enabling it to learn relation-specific knowledge that it had not encountered during the training phase and extract triplets of these relations.

Fig. 2. Our model structure. The bottom half of the image illustrates the data augmentation process in our approach, while the topmost part showcases the input and output content of the relation triplet extraction task.

Task Definition. In the task of relation triplet extraction, given an input sentence S = x1 , x2 , x3 , x4 , ..., xn , we aim to extract relation triplets T ri = (h, t, r) from the sentence S, where n represents the length of the string sequence, and T ri, h and t in the triplet represent the head entity and tail entity of an entity pair, while r represents the relation between them. The objective of zero-shot relation triplet extraction task is to learn from the seen training dataset Dtr and generalize to the unseen test dataset Dte . Dtr and Dte are split from the complete dataset D = (S, T ri, R), where S represents input sentences, T ri represents relation triplets, and R represents the set of relation labels present in D. The 1 2 n , Rtr , ..., Rtr label sets in Dtr and Dte are pre-defined and denoted as Rtr = Rtr 1 2 m and Rte = Rte , Rte , ..., Rte , respectively, where n and m represent the sizes of the label sets in Dtr and Dte . Since these two datasets are disjoint, namely Rtr ∩ Rte = ∅, each sample contains a sentence s ∈ S, which corresponds to one or more relation triplets T ri.


Synthetic Data Generator. Many studies have shown that language models can be used as knowledge bases [6], and the knowledge stored in them can be exploited to generate data that is beneficial for downstream tasks. Moreover, pre-trained language models can implicitly perform zero-shot generalization. Additionally, designing control signals or prompts so that autoregressive language models perform controllable text generation has been shown to be feasible and effective [7]. Therefore, we use the relation labels of the unseen relation set as prompts for the language model to generate synthetic data samples. We use the causal language model GPT-2 as the data generator. It is first fine-tuned on the seen training dataset D_tr, and then the relation labels of the unseen relation set are used as prompts to generate synthetic samples D_gen. The data generator takes structured prompts of the form "Relation: r" as input and outputs structured results of the form "Context: s, Head Entity: e_h, Tail Entity: e_t", consistent with the dataset D. The data generator produces the structured sequences by autoregressive sampling and is trained with the standard language modeling objective of predicting the next word. Given a sequence x = [x_1, x_2, x_3, ..., x_n], the conditional generation probability to be learned is

p(x) = ∏_{i=1}^{n} p(x_i | x_{<i})
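A minimal sketch of this prompt-and-generate step with the HuggingFace Transformers library is shown below. The prompt and output templates follow the description above, while the checkpoint name, decoding settings and the omitted fine-tuning loop on D_tr are illustrative assumptions rather than the authors' exact configuration.

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Assumed to have been fine-tuned on the seen training set D_tr beforehand
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

def generate_synthetic_sample(relation: str, max_new_tokens: int = 64) -> str:
    # Structured prompt "Relation: r"; the model continues with
    # "Context: ..., Head Entity: ..., Tail Entity: ..."
    prompt = f"Relation: {relation}. Context:"
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(
        **inputs,
        do_sample=True,          # autoregressive sampling, as described above
        top_p=0.9,
        max_new_tokens=max_new_tokens,
        pad_token_id=tokenizer.eos_token_id,
    )
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

# Prompt with an unseen relation label to obtain a synthetic training sample
print(generate_synthetic_sample("headquarters location"))
```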

in the s-th period. The time-flow hybrid network (TFHN) is proposed to extract local continuous and global cross-period dependencies in these s matrices. As Fig. 1 shows, TFHN is composed of M blocks, and the output generated by the preceding block serves as the input to the succeeding block. Specifically, the data flow sequentially passes through the LSTM layer and the Self-Attention layer in each block. To provide a clear description of TFHN, the processing of the first block is used to explain the process.

In the LSTM [5] layer of the first block, there is a cell state C^1 and a hidden state H^1. C^1 runs like a conveyor belt carrying and transferring information across time steps. Additionally, a forget gate, an input gate and an output gate are designed for keeping or discarding information to capture local continuous dependencies in the s matrices. According to Fig. 2, at time s, T_s together with the two states H_{s-1}^1 and C_{s-1}^1 from the previous step s−1 is inputted to obtain the new values H_s^1 and C_s^1 for this time step. Besides, in Fig. 2, * means point-wise multiplication, and σ and tanh denote the sigmoid and hyperbolic tangent activation functions respectively.

The hidden states at the different time steps H_1^1, H_2^1, ..., H_s^1 are collected as the input of the Self-Attention layer, each H_s^1 ∈ R^{l×d_h}, where d_h is the feature dimension. For the Self-Attention layer, sinusoidal position encoding [10] is employed. The set of s matrices supplemented with extra position information, H_1^1 + PE_1^1, H_2^1 + PE_2^1, ..., H_s^1 + PE_s^1, is loaded into the Self-Attention layer. PE_s^1 represents the extra position information at the s-th time step of the hidden state H_s^1. Let query_i^1 = key_i^1 = value_i^1 = H_i^1 + PE_i^1 be the i-th (1 ≤ i ≤ s) period's input; they are mapped to a lower dimension d_a by fully connected networks to acquire Q_i^1, K_i^1, V_i^1:

Q_i^1 = σ(W_Q^1 · query_i^1 + b_Q^1)
K_i^1 = σ(W_K^1 · key_i^1 + b_K^1)                    (1)
V_i^1 = σ(W_V^1 · value_i^1 + b_V^1)

where W_Q^1, W_K^1, W_V^1 are the weights and b_Q^1, b_K^1, b_V^1 are the biases. Similarly, Q_j^1, K_j^1, V_j^1 are acquired for the j-th (1 ≤ j ≤ s) period's input. To better illustrate the Self-Attention mechanism, take the r-th (1 ≤ r ≤ l) record as an example. The r-th rows of Q_i^1, K_i^1, V_i^1 are denoted as q_i^1, k_i^1, v_i^1 respectively; q_j^1, k_j^1, v_j^1 are obtained in the same way. The feature L_i^1(r) ∈ R^{1×d_a} of the r-th record in the i-th period is calculated as

L_i^1(r) = Σ_{j=1}^{s} α_{i,j}^1 · v_j^1                    (2)

where the attention weight α_{i,j}^1 between the i-th period's input and the j-th period's input is calculated as

α_{i,j}^1 = ( q_i^1 · (k_j^1)^T ) / √d_a                    (3)

where (·)^T denotes the transpose operation and α_{i,j}^1 ∈ [0, 1]. Thus, the attention weights α_{i,1}^1, ..., α_{i,s}^1 between the i-th period's input and all periods' inputs can be computed in the same way, and they are then normalized by softmax. Besides, in Fig. 2, A_{i,1}^1 denotes the attention weights between the i-th period's input and the first period's input for the l records; A_{i,2}^1, ..., A_{i,s}^1 have similar meanings. The features of the l records in the i-th period in the first block can then be calculated, denoted as Z_i^1 = [L_i^1(1), L_i^1(2), ..., L_i^1(r), ..., L_i^1(l)] ∈ R^{l×d_a}. In this way, Z_1^1, Z_2^1, ..., Z_s^1 are formed and fed into the next block.
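A minimal PyTorch sketch of a single TFHN block is given below, using the dimensions reported later in Sect. 3.2 (d_h = 64, d_a = 32). The sinusoidal positional encoding and the attention normalization are simplified, so this illustrates the LSTM-plus-self-attention structure rather than reproducing the authors' exact implementation.

```python
import math
import torch
import torch.nn as nn

class TFHNBlock(nn.Module):
    """One time-flow hybrid block: LSTM over the periods, then self-attention."""

    def __init__(self, in_dim: int, d_h: int = 64, d_a: int = 32):
        super().__init__()
        self.lstm = nn.LSTM(in_dim, d_h, batch_first=True)
        self.q = nn.Linear(d_h, d_a)
        self.k = nn.Linear(d_h, d_a)
        self.v = nn.Linear(d_h, d_a)
        self.d_a = d_a

    @staticmethod
    def positional_encoding(s: int, d: int) -> torch.Tensor:
        pe = torch.zeros(s, d)
        pos = torch.arange(s, dtype=torch.float).unsqueeze(1)
        div = torch.exp(torch.arange(0, d, 2).float() * (-math.log(10000.0) / d))
        pe[:, 0::2] = torch.sin(pos * div)
        pe[:, 1::2] = torch.cos(pos * div)
        return pe

    def forward(self, t: torch.Tensor) -> torch.Tensor:
        # t: (l, s, in_dim) — l course-selection records, s periods
        h, _ = self.lstm(t)                                   # hidden states H_1..H_s
        h = h + self.positional_encoding(h.size(1), h.size(2))
        q = torch.sigmoid(self.q(h))
        k = torch.sigmoid(self.k(h))
        v = torch.sigmoid(self.v(h))
        attn = torch.softmax(q @ k.transpose(1, 2) / math.sqrt(self.d_a), dim=-1)
        return attn @ v                                       # features Z_1..Z_s

# Example: l = 4 records, s = 5 periods, flattened activity dimension d_t = 184
z = TFHNBlock(in_dim=184)(torch.randn(4, 5, 184))
print(z.shape)  # torch.Size([4, 5, 32])
```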

2.3 User Activity Matrix Generation

The user activity logs for the r-th course selection during the i-th period can be organized as a user activity matrix X_i(r). Each element of X_i(r) indicates the frequency of a specific activity on a particular day. Furthermore, we add some additional values to X_i(r), as shown below:

X_i(r) = [ χ_{1,1}    ...  χ_{1,f}    χ_{1,f+1}
           ...        ...  ...        ...
           χ_{e,1}    ...  χ_{e,f}    χ_{e,f+1}
           χ_{e+1,1}  ...  χ_{e+1,f}  χ_{e+1,f+1} ]  ∈ R^{(e+1)×(f+1)}

where the (f+1)-th column contains the total number of all activities on a particular day, the (e+1)-th row contains the total frequency of a specific activity during the i-th period, and χ_{e,f} is the frequency of activity type f on day e. This matrix is flattened to get X_i(r) ∈ R^{1×d_t}, where d_t = (e+1) × (f+1).


Fig. 2. The illustration of Time-Flow Hybrid Network.

The user activity logs of all of the l course selections in the i-th period are processed with the same operations to form a matrix T_i = [X_i(1), X_i(2), ..., X_i(r), ..., X_i(l)] ∈ R^{l×d_t}. In the same way, T_1, T_2, ..., T_s are generated and fed into the TFHN. The outputs of the final block in TFHN, called user activity features, are denoted as Z_1^M, Z_2^M, ..., Z_s^M, each Z_s^M ∈ R^{l×d_a}.
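A small NumPy sketch of this construction is shown below; with the paper's e = 7 days and f = 22 activity types it yields the flattened dimension d_t = 8 × 23 = 184.

```python
import numpy as np

def build_activity_matrix(daily_counts: np.ndarray) -> np.ndarray:
    """daily_counts: (e, f) frequencies of each activity type per day.
    Returns the augmented (e+1) x (f+1) matrix X_i(r) with row/column totals."""
    e, f = daily_counts.shape
    x = np.zeros((e + 1, f + 1))
    x[:e, :f] = daily_counts
    x[:e, f] = daily_counts.sum(axis=1)   # (f+1)-th column: all activities per day
    x[e, :f] = daily_counts.sum(axis=0)   # (e+1)-th row: each activity over the period
    x[e, f] = daily_counts.sum()          # grand total
    return x

logs = np.random.randint(0, 5, size=(7, 22))   # one week, 22 activity types
x = build_activity_matrix(logs)
flat = x.reshape(1, -1)                        # X_i(r) ∈ R^{1 x d_t}, d_t = 8 * 23 = 184
print(x.shape, flat.shape)                     # (8, 23) (1, 184)
```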

2.4 User Contextual Features Generation

Given the course selections, two users can be defined as classmates if they have enrolled in the same course. Hence, a simple classmate graph can be constructed as G1 = (N, E), where N is the set of all users and E ∈ R^{n×n} is the user-user adjacency matrix consisting of 0/1 entries: 1 indicates that the corresponding users are simple classmates and 0 indicates the opposite. Due to the presence of numerous weak and dense connections in G1, a user-user correlation calculation based on link prediction is designed to find strong classmates. For two users u_x and u_z, their correlation ρ(u_x, u_z) is computed as

ρ(u_x, u_z) = ( P_x · (P_z)^T ) / ( |P_x| × |P_z| )                    (4)

where ρ(u_x, u_z) ∈ [0, 1], P ∈ R^{n×m} is the users' course preference matrix, and P_x and P_z represent the course preferences of u_x and u_z, derived from the x-th and z-th rows of P respectively. P is acquired using the link prediction method inspired by [18].

Binarization is applied to P by setting a threshold η1; that is, a raw element of P, which indicates the probability that a user prefers a course based on interests, is changed to 1 if its value is greater than η1. Furthermore, the users u_x and u_z are considered strong classmates when their correlation ρ(u_x, u_z) is greater than a correlation threshold η2. Based on this, the graph G1 is reconstructed as a graph G2 = (N, E'), where E' is the new adjacency matrix that contains all the satisfied user pairs (u_x, u_z). Let C_u denote the set of courses enrolled by a user u. The initial feature B_u of node u in G2 is formulated as B_u = I_u ⊕ (μ_{c∈C_u} I_c), where ⊕ denotes the concatenation operation and μ denotes averaging. The influence of classmates is characterized as the aggregation of node features in G2. Here, the message passing mechanism with two GraphSAGE convolution layers [19] is adopted. The layers are formulated as

R_u^1 = W_1^1 · B_u + W_2^1 · μ_{u_τ ∈ N(u,1)} B_{u_τ}                    (5)

R_u^2 = W_1^2 · R_u^1 + W_2^2 · μ_{u_τ ∈ N(u,1)} R_{u_τ}^1                    (6)

where N(u, 1) denotes the first-order neighbors of the target node u, and W_1^1, W_2^1, W_1^2 and W_2^2 are trainable parameters. When all user nodes finish the message passing process, the user contextual features R_u^2 ∈ R^{n×d_g} are generated.
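A minimal sketch of this step with PyTorch Geometric (which the paper states is used for the implementation) follows. The correlation-and-threshold part is one reading of Sect. 2.4, and the SAGEConv layers with mean aggregation stand in for Eqs. (5)-(6); the dimensions and the toy preference matrix are illustrative.

```python
import torch
import torch.nn as nn
from torch_geometric.nn import SAGEConv

def strong_classmate_edges(P: torch.Tensor, eta1: float = 0.6, eta2: float = 0.95) -> torch.Tensor:
    """Builds the edge_index of strong classmates from the course-preference matrix P (n x m)."""
    B = (P > eta1).float()                            # binarized preferences
    norm = B / (B.norm(dim=1, keepdim=True) + 1e-8)
    rho = norm @ norm.t()                             # correlation in [0, 1]
    rho.fill_diagonal_(0)
    return (rho > eta2).nonzero().t()                 # shape (2, num_edges)

class ContextEncoder(nn.Module):
    """Two mean-aggregation GraphSAGE layers over the strong-classmate graph G2."""

    def __init__(self, in_dim: int, d_g: int = 32):
        super().__init__()
        self.conv1 = SAGEConv(in_dim, d_g, aggr="mean")
        self.conv2 = SAGEConv(d_g, d_g, aggr="mean")

    def forward(self, b: torch.Tensor, edge_index: torch.Tensor) -> torch.Tensor:
        return self.conv2(self.conv1(b, edge_index), edge_index)   # R_u^2 ∈ R^{n x d_g}

n, m, in_dim = 100, 20, 16
P = torch.rand(n, m)
P[1] = P[0]                                           # guarantee at least one strong pair
edge_index = strong_classmate_edges(P)
r2 = ContextEncoder(in_dim)(torch.randn(n, in_dim), edge_index)
print(r2.shape)  # torch.Size([100, 32])
```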

2.5 Prediction

Now the user contextual features R_u^2 are used to augment the user activity features Z_1^M, Z_2^M, ..., Z_s^M. Let D(r) denote the fusion features of the r-th record; it is formulated as

D(r) = Z̄^M(r) ⊕ R_{u_r}^2                    (7)

where Z̄^M(r) ∈ R^{1×d_a} is the average of the r-th rows of the user activity features, and R_{u_r}^2 ∈ R^{1×d_g} is the contextual feature vector of the user u_r corresponding to the r-th course selection record. The fusion features of all of the l records are denoted as F = [D(1), ..., D(r), ..., D(l)] ∈ R^{l×(d_a+d_g)}. In the prediction layer, we design O fully connected layers. For the λ-th (1 ≤ λ ≤ O) layer, H^λ = σ(W^λ · H^{λ−1} + b^λ), where W^λ and b^λ are trainable parameters of the λ-th layer and H^0 = F. The prediction Ŷ = [ŷ_1, ŷ_2, ..., ŷ_l] is acquired by applying the sigmoid function to H^O, which provides the dropout probabilities of the l course selection records. The true labels are denoted as Y = [y_1, y_2, ..., y_l]. Thus, binary cross entropy can be used as the loss function to backpropagate and update the parameters during training. The loss function is formulated as

Loss(θ) = − Σ_{r=1}^{l} [ y_r · log(ŷ_r) + (1 − y_r) · log(1 − ŷ_r) ]                    (8)

where θ denotes the trainable parameters of CA-TFHN.
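A compact PyTorch sketch of the fusion, prediction layers and loss of Eqs. (7)-(8) is given below; the hidden width and the two-layer head are illustrative choices.

```python
import torch
import torch.nn as nn

class DropoutPredictor(nn.Module):
    """Fuses averaged activity features with user contextual features, then an MLP + sigmoid."""

    def __init__(self, d_a: int = 32, d_g: int = 32, hidden: int = 32):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(d_a + d_g, hidden), nn.Sigmoid(),
            nn.Linear(hidden, 1),
        )

    def forward(self, z: torch.Tensor, r2: torch.Tensor) -> torch.Tensor:
        # z:  (l, s, d_a) user activity features from the last TFHN block
        # r2: (l, d_g)    contextual features of the corresponding users
        fused = torch.cat([z.mean(dim=1), r2], dim=-1)      # D(r) = mean(Z^M(r)) ⊕ R^2_{u_r}
        return torch.sigmoid(self.mlp(fused)).squeeze(-1)   # dropout probabilities ŷ

l, s = 8, 5
model = DropoutPredictor()
y_hat = model(torch.randn(l, s, 32), torch.randn(l, 32))
loss = nn.BCELoss()(y_hat, torch.randint(0, 2, (l,)).float())   # binary cross entropy, Eq. (8)
print(y_hat.shape, float(loss))
```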

3 Experiments

3.1 Dataset

To evaluate the performance of CA-TFHN, we employ the XuetangX dataset [3] in the experiments. It contains 225,642 course selections formed by 77,083 users enrolling in 247 courses from June 2015 to June 2017. Besides, the XuetangX dataset has 42,110,397 user activity logs, with four main types: Click, Video, Assignment, and Forum. The respective proportions of these four activity types are 59.2%, 25.8%, 14.7%, and 0.3%. Specifically, these four main activity types consist of 22 specific activities, such as seeking video or clicking information. The logs are recorded within 36 days from the start of each course. Each course selection is labeled as dropout or not according to whether the user gives up studying the course in the following 10 days. The dataset also gives user background information and course background information. The user background information includes user ID, age, gender, education level, and the number of registered courses. The course background information includes course ID, course category, and the number of registrations. In the experiments, the whole dataset is divided randomly into a training set and a test set. The size of the training set is 157,943, while the test set occupies the rest.

3.2 Implementation and Evaluation Metrics

The proposed CA-TFHN is implemented with PyTorch and PyTorch Geometric, a graph neural network framework. In the experiments, a week (e = 7) is considered as one period, 35 days (five weeks, s = 5) and 22 types (f = 22) of user activity logs are selected, and the important dimensions are set as d_t = 184, d_h = 64, d_a = 32 and d_g = 32. The two thresholds η1 and η2 take the values 0.6 and 0.95 respectively. Moreover, the Adam optimizer with an initial learning rate of 1 × 10−3 and L2 regularization with weight decay 1 × 10−5 are adopted for training CA-TFHN. The batch sizes for the training set and the test set are 256 and 128 respectively, and the number of epochs is 15 for both. Accuracy, precision, and recall are used to validate the method's effectiveness, together with more comprehensive metrics: the F1 score (F1), which takes both precision and recall into account, and the area under the ROC curve (AUC), which is better suited to sample imbalance.

3.3 Parameter Analysis

In the time-flow hybrid network, the number of blocks M is worth discussing. According to Table 1, TFHN reaches its peak performance on the XuetangX dataset when M is set to 2, with an AUC of 0.8706 and an F1 score of 0.9077. Additionally, it shows an increase of 1.23% and 0.33% in accuracy, as well as a 0.78% and 1.13% increase in recall, compared to M = 1 and M = 3 respectively. However, in terms of precision, it shows a decrease of 0.43% compared to the case of three blocks. This can be explained as follows: when TFHN has more than 2 blocks, it has a higher tendency to predict the result as positive on the XuetangX dataset. At the same time, the TFHN with two or three blocks outperforms the TFHN with only one block, which shows that the multi-block design is meaningful and truly increases performance. In summary, we finally choose 2 blocks for our method in the later experiments.

Table 1. The performance of applying different numbers of blocks in TFHN.

        Accuracy  Precision  Recall  AUC     F1
M = 1   0.8422    0.8664     0.9361  0.8508  0.8999
M = 2   0.8545    0.8742     0.9439  0.8706  0.9077
M = 3   0.8512    0.8785     0.9326  0.8667  0.9047

To assess the impact of the input data volume on CA-TFHN, we discuss the value of s with various combinations of the 5 periods in Table 2; for example, {1, 2, 3} means that the first 3 periods of data are adopted. From this table, three noteworthy findings are revealed. (1) With more periods of data, TFHN achieves better performance. When adopting five periods of data, it achieves the highest accuracy, precision, recall, AUC and F1 compared to the other four combinations: the accuracy is 1.86% to 5.35% higher, the precision is 1.83% to 5.08% greater, and the recall is 0.06% to 0.52% better. The AUC is improved by 3.08% to 11.91%, and the F1 by 1.07% to 3.04%. However, the costs of collecting more data and spending much more computational resources cannot be ignored at the same time. (2) In terms of accuracy, the performance of adopting one or two periods of data is equivalent to 93.74% and 95.23% of that of adopting all periods, respectively. This indicates that, at the beginning of a semester, or for a new user, when the data is limited, using fewer data for dropout prediction with CA-TFHN can still yield satisfactory results, which holds significant implications for MOOC administrators. (3) In terms of AUC, there is an 11.91% gap between adopting one period of data and adopting five periods of data. The reason for this phenomenon might be associated with the particularly unbalanced dataset and the way AUC is calculated.

Table 2. The impact of different combinations of five periods.

              Accuracy  Precision  Recall  AUC     F1
{1}           0.8010    0.8234     0.9387  0.7515  0.8773
{1,2}         0.8137    0.8309     0.9431  0.7817  0.8847
{1,2,3}       0.8235    0.8427     0.9433  0.8110  0.8902
{1,2,3,4}     0.8359    0.8559     0.9421  0.8398  0.8970
{1,2,3,4,5}   0.8545    0.8742     0.9439  0.8706  0.9077

3.4 Comparison of Correlation Calculations

The proposed link-prediction-based user-user correlation calculation is compared with two alternatives: a contextual-feature-based correlation and a course-selection-based correlation. The contextual-feature-based correlation refers to the cosine similarity between users' contextual features I_u; if the similarity score between two users exceeds 0.9, they are considered strong classmates due to their alike background information. The course-selection-based correlation refers to the cosine similarity between users' original course selections: each user is represented by a binary course choice vector, where 0 indicates that the user did not select the corresponding course and 1 indicates the opposite; if the similarity score between two users exceeds 0.5, they are considered strong classmates. The proposed link-prediction-based correlation calculation, given in Sect. 2.4, utilizes link prediction to reveal users' potential interest in courses. The three calculations generate their adjacency matrices under the same condition that only user pairs meeting the respective criteria are included. Thus, the main difference among them is the aspect used to assess users' correlation. The dropout prediction results obtained with these three correlation calculations are shown in Table 3. It can be concluded that the link-prediction-based correlation stands out the most. The number of user pairs generated by the link-prediction-based correlation is only about 20% and 50% of the pairs generated by the other two calculations, respectively. At the same time, it brings an increase of 3.19% and 2.52% in AUC, as well as an increase of 4.54% and 3.15% in F1, compared to the other two calculations. Hence, it not only reduces the computational resources required to store edges in the graph but also better identifies strong classmates for predicting MOOC dropout.

Table 3. Comparison with other correlation calculations.

                          User Pairs   AUC     F1
Contextual feature based  49,617,866   0.8387  0.8623
Course selection based    19,341,858   0.8454  0.8762
Link prediction based     10,211,795   0.8706  0.9077

3.5 Comparison with Other Dropout Prediction Methods

The proposed CA-TFHN is compared with current state-of-the-art methods from both machine learning [20] and deep learning [3,11,12]. We follow the parameter settings in their papers and conduct the experiments independently. The SVM-based prediction uses a Gaussian kernel, where the L2 regularization penalty is set to 1 × 10−1 and the kernel function parameter to 1 × 10−3. In the AdaBoost (AB) based prediction, a combination of 150 weak classifiers with random forest as the base estimator is utilized. The sum of the user activity matrices is fed as input to these methods. On the other hand, the Adam optimizer and an initial learning rate of 1 × 10−4 are adopted for Cross-TabNet [12], CFIN [3] and CEDN [11]; they are fed with the user activity matrices and the contextual features related to users and courses. AUC and F1 are employed to measure these methods' performance. Table 4 summarizes the results. We can see that the SVM-based prediction has the lowest AUC and F1 among these methods. The dropout prediction based on AB also does not perform well. This demonstrates that such methods cannot grasp the effective information in the complex, sparse and large amount of data. On the other hand, CA-TFHN achieves an F1 score of 0.9077 and an AUC of 0.8706, outperforming the other deep learning methods by 0.47% to 0.89% and 0.55% to 1.75%, respectively. CA-TFHN incorporates the benefits of both the LSTM and the Self-Attention mechanism to extract user activity features and enhances the influence between strong classmates, whereas the compared deep learning methods tend to treat all data as a single entity, disregarding the correlation between different periods of data and failing to uncover more meaningful and robust relationships between classmates.

Table 4. Comparison with other dropout prediction methods.

Method             AUC     F1
SVM [20]           0.7925  0.8217
AB [20]            0.8236  0.8495
CFIN [3]           0.8531  0.9030
CEDN [11]          0.8548  0.8988
Cross-TabNet [12]  0.8651  0.9021
CA-TFHN            0.8706  0.9077

4 Conclusions

In this paper, a novel dropout prediction approach for MOOCs has been proposed, in which a Time-Flow Hybrid Network is designed to generate users' activity features from their learning records in MOOCs. Besides, a correlation calculation is defined based on users' potential interests in courses obtained by link prediction, which yields strong correlations among users. The classmate graph is then reconstructed using these strong correlations, bringing the mutual influences among classmates into dropout prediction. The parameters of the proposed CA-TFHN have been analyzed in experiments on the XuetangX dataset. The overall performance of the proposed dropout prediction has been evaluated, and the results demonstrate the effectiveness of CA-TFHN.

Acknowledgements. This work is supported by the Natural Science Foundation of Zhejiang Province of China under grant No. LR21F020002, and the First-class Undergraduate Course Construction Project in Zhejiang Province of China.


References
1. Miladi, F., Lemire, D., Psyché, V.: Learning engagement and peer learning in MOOC: a selective systematic review. In: ITS, pp. 324–332 (2023)
2. Nawrot, I., Doucet, A.: Building engagement for MOOC students: introducing support for time management on online learning platforms. In: WWW, pp. 1077–1082 (2014)
3. Feng, W., Tang, J., Liu, T.X.: Understanding dropouts in MOOCs. In: AAAI (2019)
4. Fei, M., Yeung, D.Y.: Temporal models for predicting student dropout in massive open online courses. In: ICDMW, pp. 256–263 (2015)
5. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
6. Tang, C., Ouyang, Y., Rong, W., et al.: Time series model for predicting dropout in massive open online courses. In: AIED, pp. 353–357 (2018)
7. Mrhar, K., Benhiba, L., Bourekkache, S., et al.: A Bayesian CNN-LSTM model for sentiment analysis in massive open online courses MOOCs. Int. J. Emerg. Technol. Learn. 16(23), 216–232 (2021)
8. Qiu, L., Liu, Y., Hu, Q., et al.: Student dropout prediction in massive open online courses by convolutional neural networks. Soft Comput. 23, 10287–10301 (2019)
9. Yin, S., Lei, L., Wang, H., et al.: Power of attention in MOOC dropout prediction. IEEE Access 8, 202993–203002 (2020)
10. Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
11. Wu, D., Hao, P., Zheng, Y., et al.: Classmates enhanced diversity-self-attention network for dropout prediction in MOOCs. In: ICONIP, pp. 609–620 (2021)
12. Pan, T., Feng, G., Liu, X., et al.: Using feature interaction for mining learners' hidden information in MOOC dropout prediction. In: ITS, pp. 507–517 (2023)
13. Fu, Q., Gao, Z., Zhou, J., et al.: CLSA: a novel deep learning model for MOOC dropout prediction. Comput. Electr. Eng. 94, 107315 (2021)
14. Zhang, J., Gao, M., Zhang, J.: The learning behaviours of dropouts in MOOCs: a collective attention network perspective. Comput. Educ. 167, 104189 (2021)
15. Zheng, Y., Shao, Z., Deng, M., et al.: MOOC dropout prediction using a fusion deep model based on behaviour features. Comput. Electr. Eng. 104, 108409 (2022)
16. Clow, D.: MOOCs and the funnel of participation. In: The International Conference on Learning Analytics and Knowledge, pp. 185–189 (2013)
17. Goel, Y., Goyal, R.: On the effectiveness of self-training in MOOC dropout prediction. Open Comput. Sci. 10(1), 246–258 (2020)
18. Zhang, C., Song, D., Huang, C., et al.: Heterogeneous graph neural network. In: ACM SIGKDD, pp. 793–803 (2019)
19. Hamilton, W., Ying, Z., Leskovec, J.: Inductive representation learning on large graphs. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
20. Basnet, R.B., Johnson, C., Doleck, T.: Dropout prediction in MOOCs using deep learning and machine learning. Educ. Inf. Technol. 27(8), 11499–11513 (2022)

Multiclass Classification and Defect Detection of Steel Tube Using Modified YOLO

Deepti Raj Gurrammagari1, Prabadevi Boopathy1(B), Thippa Reddy Gadekallu2,3, Surbhi Bhatia Khan4, and Mohammed Saraee5

1 School of Computer Science Engineering and Information Systems, VIT University, Vellore, India
[email protected], [email protected]
2 Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon
[email protected]
3 College of Information Science and Engineering, Jiaxing University, Jiaxing 314001, China
4 Department of Data Science, School of Science Engineering and Environement, University of Salford, Manchester, UK
[email protected]
5 School of Science Engineering and Environment, University of Salford, Salford, UK
[email protected]

Abstract. Steel tubes are widely used in hazardous high-pressure environments such as petroleum, chemicals, natural gas and shale gas. Defects in steel tubes have serious negative consequences. Using deep learning object recognition to identify and detect defects can greatly improve inspection efficiency and drive industrial automation. In this work, we use the well-known YOLOv7 (You Only Look Once version 7) deep learning model and propose to improve it to achieve accurate defect detection on steel tube images. First, the classification of the dataset is checked using a sequential model and AlexNet. A Coordinate Attention (CA) mechanism is then integrated into the YOLOv7 backbone network to improve the expressive power of the feature map. Additionally, the SIoU (SCYLLA-Intersection over Union) loss function is used to speed up convergence given the class imbalance in the dataset. Experimental results show that the evaluation indices of the optimized and modified YOLOv7 algorithm outperform other models. This study demonstrates the effectiveness of this method in improving the model's detection performance and providing a more effective solution to steel tube defects.

Keywords: Classification · Defect Detection · YOLO · YOLOv7 · Coordinate Attention · SIoU loss


1 Introduction

1.1 Need for Defect Classification and Detection

Classification and identification of defects is an essential process in many industries such as manufacturing, software development and quality control [1]. They support the identification and categorization of errors or defects in systems, processes or products. By detecting and classifying errors, organizations can analyze and fix underlying problems to improve their service. They are able to ensure customer satisfaction, satisfy regulatory requirements, and improve company reputation. Early detection and classification of problems in production or development processes can significantly reduce costs. By classifying and identifying defects, it becomes possible to understand their root causes. It is also important to implement timely and effective defect management strategies [2]. In industries such as medical, aerospace, and automotive, material defects can lead to safety risks, product failures, recalls, legal liability, financial losses, quality control issues, loss of customer confidence, regulatory compliance issues and environmental impact. Manufacturers should prioritize quality control to prevent such consequences. Industries are facing many challenges, which forces manufacturing units to move toward upcoming transformations like predictive maintenance and defect detection [3].

1.2 Why YOLO for Object Detection

In the field of steel pipe defect detection, deep learning techniques offer many advantages over traditional computer vision techniques. Convolutional neural networks can achieve end-to-end recognition and classification without the need to manually extract image features. YOLO is a popular deep learning algorithm used in computer vision for object recognition. It is widely used in various applications such as defect and object detection. YOLO is known for its speed and efficiency, capable of processing images and videos in real time [4]. Unlike common object detection algorithms that use a two-step technique (region proposal and classification), YOLO uses a unified approach, performing object detection and classification in a single step. This reduces complexity and increases efficiency. When making predictions, YOLO considers the whole picture rather than just focusing on a specific region of interest [5]. This allows YOLO to gather more detailed information and context for accurate object detection. YOLO uses anchor boxes to handle objects of different sizes and aspect ratios. The input image is divided into a grid, and each grid cell is responsible for predicting the objects that fall within it. This grid-based strategy enables YOLO to properly detect multiple instances of the same object class in an image and effectively manage overlapping objects. The latest versions of YOLO are YOLOv7 and YOLOv8. YOLOv7 was proposed in July 2022 and is efficient in terms of defect detection [6]. YOLOv7 is very popular in industry due to its excellent balance of speed and accuracy. YOLOv7 outperforms other currently known target detectors in terms of accuracy and speed in the range of 5 FPS to 160 FPS. While YOLOv8 is being regarded as the new state of the art in object detection, an official paper has not been provided.


Fig. 1. Flow of the paper

1.3 Contribution of the Paper

Surface defects in steel tubes can be classified based on many properties, but it is difficult to obtain and exploit all these properties using conventional methods. This study first analyses classification techniques on the dataset. Then, to balance accuracy and detection time, the YOLOv7 baseline model is improved: a CA module and the SIoU loss function are added to the YOLOv7 model to provide a lightweight and optimized YOLOv7. Experimental results confirm that the proposed method outperforms the original YOLOv7 on the steel tube dataset. The contributions of this paper are enumerated as follows:
1. Two classification techniques, a sequential multiclass classification model and AlexNet, are used to check the classification of the classes in the dataset.
2. To improve detection accuracy, CA is integrated into YOLOv7. SIoU is used instead of the traditional CIoU (Complete IoU) to speed up network convergence and address the dataset imbalance issue.
3. The model is trained on data enhanced with the vertical-flip augmentation technique.
The rest of the paper is structured as follows. The second section presents the literature survey. The third section describes the classification techniques, the background of the YOLOv7 detection framework, the working of the CA mechanism, the SIoU loss function, and the Modified-YOLOv7 model with vertical-flip data augmentation. The fourth section presents the experimental dataset, evaluation indicators and metrics. The test results are described in the fifth section. Finally, conclusions are drawn and recommendations for further research are made.

2 Literature Survey

The authors of [7] used YOLOv5 to detect defects in steel pipes; the results were compared with those of Faster R-CNN (region-based CNN), showing the effectiveness of single-shot object detection. A method that combines an improved Faster R-CNN and an improved ResNet50 was presented in [8] for automatic steel pipe defect detection to reduce average run time and increase accuracy. Based on YoloX, the latest of the YOLO series at the time, [9] proposes SCED-Net (Steel Coil End Surface Detection Network). SCED-Net achieved significantly better results by improving the data augmentation method, combining the proposed multi-information fusion module, and adopting focal loss and soft-NMS. [10] trains a CNN (Convolutional Neural Network) to extract features from an augmented image set using transfer learning methods; a hierarchical model ensemble is then used to detect defects based on location. CondenseNetV2 was proposed by [11] as a lightweight CNN-based model that excels at inspecting small defects and can run on low-frequency edge devices. By reusing the current set of Sparse Feature Reactivation modules, the authors achieve sufficient feature extraction with minimal computational effort. The model was thoroughly tested on an edge device (NVIDIA Jetson Xavier NX SOM) using two real-world datasets.

3 Methodology

3.1 Classification Techniques

By enabling automatic identification and classification of objects, patterns, and anomalies in image and video data, classification algorithms play an important role in computer vision and defect detection. These methods use machine learning algorithms to learn from labeled training data and then classify new, unseen data based on the learned patterns and features. Computer vision tasks such as object detection, object recognition, image segmentation, and scene understanding all use classification techniques [12]. These techniques can be customized according to the problem. The feature selection phase considers interactions between variables and handles missing data; it helps speed up classification algorithms, even on high-dimensional datasets [13]. This study uses a Sequential model and AlexNet to check the classification of the different classes in the dataset.

Sequential Model. Keras is a popular deep learning library that provides an easy-to-use interface for building neural network models. Sequential multiclass models in Keras make it easy to build and train neural networks for multiclass classification problems, allowing input data to be classified into multiple categories [14]. The process involves defining the layers of the model using the Sequential class and specifying the number and type of layers to include. Dense (fully connected) and activation layers are commonly used for multiclass classification. Sequential multiclass models require labeled data for training, with each sample assigned a class label. The goal of tuning the model's internal parameters during training is to reduce the difference between the predicted and actual class labels. A softmax activation function is often used at the top of the model: to estimate the probability of each class for a given input, softmax converts the previous layer's outputs to class probabilities. To classify an object, the class with the highest probability is taken as the predicted class. Once the model is trained, it can be used to predict new, unseen data by feeding inputs to the model and retrieving the predicted class labels.

AlexNet. AlexNet is a deep CNN architecture created in 2012 and widely used for image classification [15]. Image classification, object recognition, transfer learning, feature extraction and visualization are some of its main uses. AlexNet's architecture and learning capabilities were critical to achieving excellence in image classification and have influenced the creation of subsequent deep learning models in computer vision. It consists of 8 layers, including 3 fully connected layers and 5 convolutional layers. To improve generalization, AlexNet uses dropout regularization, pooling layers for downsampling, and ReLU (Rectified Linear Unit) activation. It was trained on the ImageNet dataset using backpropagation and stochastic gradient descent. The success of AlexNet demonstrated the use of deep learning for image classification and prompted improvements in CNN designs.
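A minimal Keras sketch of such a sequential multiclass classifier for the eight defect classes is shown below; the convolutional layer sizes and the grayscale input assumption are illustrative and not the exact architecture used in the paper (the 180 × 180 input size and the Adam optimizer with learning rate 1e-3 follow Table 1).

```python
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_CLASSES = 8  # defect classes in the steel tube dataset

model = models.Sequential([
    layers.Rescaling(1.0 / 255, input_shape=(180, 180, 1)),  # grayscale input assumed
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(NUM_CLASSES, activation="softmax"),  # class probabilities
])

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3),
    loss="sparse_categorical_crossentropy",
    metrics=["accuracy"],
)
model.summary()
```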

3.2 Background of YOLOv7

YOLOv7 introduced many architectural changes, such as compound model scaling, E-ELAN (Extended Efficient Layer Aggregation Network), and a bag of freebies including planned re-parameterized convolution and coarse-to-fine auxiliary and lead losses [6,16]. This set of features makes the models better suited to mobile GPUs and GPUs. Therefore, we choose the backbone of YOLOv7 as the model's feature extraction network.

Backbone. The original backbone first stacks four CBS blocks (convolutional layer, batch normalization and SiLU (Sigmoid Linear Unit) activation), extracting the underlying features from the input image with four convolution operations. MP (max pooling) and E-ELAN modules then extract fine-grained features.

Feature Fusion Zone. The purpose of the network's feature fusion layer is to improve the network's ability to learn the features produced by the underlying network. To learn as many image features as possible, features at different resolutions are learned independently and then combined centrally.


Detection Head. The YOLOv7 algorithm builds on the strengths of previous algorithms and includes three detection heads that locate targets and output the predicted class probabilities, confidence and prediction frame coordinates of the target objects. The three feature scales produced by the detection heads are 20 × 20, 40 × 40, and 80 × 80 [17]. Each of the three scales detects a different target scale: large, medium and small targets.

3.3 Modified-YOLOv7

Transfer learning is used effectively in several fields such as computer vision, natural language processing, and speech processing [18]. In this work, the concept of transfer learning is used to take the weights of the original YOLOv7 model trained on the COCO dataset. Modified-YOLOv7 is formed by taking the YOLOv7 backbone and adding the CA attention mechanism and the SIoU loss function. The model is then optimized using a vertical-flip data augmentation technique. Compared with the original backbone network, the updated YOLOv7 backbone can extract the nonlinear features of the steel tubes in the image more completely and clearly, which shows the effectiveness of the modified approach.

Attention Mechanism. The attention mechanism used here is the CA mechanism, applied in neural networks, especially in computer vision applications. It was first proposed by [19] under the title "An intriguing failing of convolutional neural networks and the coordconv solution". Traditional CNNs are widely used and very successful in computer vision tasks; however, they are not very good at managing spatial information or capturing long-range dependencies. The coordinate attention mechanism was developed to overcome these limitations. Each convolutional layer of a CNN operates on small areas of the input image, and as the network deepens, the receptive field expands. CNNs, on the other hand, do not explicitly model spatial coordinates, so they treat all spatial locations the same, regardless of their geometric relationships [20]. The CA technique adds coordinate channels to the feature maps that the CNN receives as input. The x- and y-coordinates of each pixel, or its separation from the center of the image, are encoded as spatial information in these coordinate channels. The network thus obtains precise information about the spatial location of each pixel. Concatenation or element-wise fusion is used to join the coordinate channels with the original input channels, followed by processing in higher network layers. As a result, the network can be trained to focus on specific spatial locations depending on their coordinates. CA helps the network learn more accurate spatial representations and thereby improves its performance. This can be particularly useful for applications such as object detection, image segmentation, or pose estimation that require accurate spatial reasoning.
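A minimal PyTorch sketch of the coordinate-channel idea described above (in the spirit of the CoordConv solution of [19]) is given below; the CA module actually inserted into the YOLOv7 backbone may differ in detail, so this is an illustration of the mechanism rather than the paper's exact module.

```python
import torch
import torch.nn as nn

class AddCoords(nn.Module):
    """Appends normalized x- and y-coordinate channels to a feature map."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, _, h, w = x.shape
        ys = torch.linspace(-1, 1, h, device=x.device).view(1, 1, h, 1).expand(b, 1, h, w)
        xs = torch.linspace(-1, 1, w, device=x.device).view(1, 1, 1, w).expand(b, 1, h, w)
        return torch.cat([x, xs, ys], dim=1)       # (b, c + 2, h, w)

class CoordConvBlock(nn.Module):
    """Convolution that sees explicit spatial coordinates."""

    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.add_coords = AddCoords()
        self.conv = nn.Conv2d(in_ch + 2, out_ch, kernel_size=3, padding=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.conv(self.add_coords(x))

feat = torch.randn(2, 64, 40, 40)                  # e.g. a 40x40 backbone feature map
print(CoordConvBlock(64, 64)(feat).shape)          # torch.Size([2, 64, 40, 40])
```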


SIoU Loss Function. By including the SIoU loss function in the training process, the object detection model can handle different object sizes better and improve the accuracy of bounding box prediction. The SIoU loss function is used to evaluate the quality of predicted bounding boxes in object detection tasks. According to [21], the addition of SIoU greatly aids the training process, as it causes prediction boxes to drift to the nearest axis rather quickly, requiring regression of only one coordinate, either X or Y. The bounding box regression loss function in the YOLOv7 source code is CIoU, which takes into account the overlap area, centroid distance, and aspect ratio of the actual and predicted boxes. SIoU was chosen instead of CIoU because SIoU better reflects variations in width, height and confidence level. The SIoU loss is further improved by introducing a scale-sensitive adjustment factor. This factor accounts for size differences between objects and reduces the difficulty of optimizing the loss function for objects with large differences in size [22]. The SCYLLA-IoU loss is calculated as

SCYLLA_IoU_loss = 1 − IoU + α · (1 − IoU) + β · (1 − C)                    (1)

where α and β are constants that control the relative importance of the SIoU term and the scale-sensitive balancing term, respectively; typically they are set to 0.1 and 0.5. The scale-sensitive balancing term C is defined as

C = log((S + 1) / (R + 1))                    (2)

where S represents the maximum side length of the predicted bounding box and R the maximum side length of the ground-truth bounding box. By taking the logarithm of the ratio between the predicted size and the ground-truth size, the balancing term penalizes the scale gap between the two boxes.
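The following sketch implements the loss exactly as written in Eqs. (1)-(2) above, which is the scale-sensitive formulation stated here rather than the original SIoU definition of [21]; the box format and the IoU helper are assumptions.

```python
import math

def iou_xyxy(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-9)

def scylla_iou_loss(pred, gt, alpha=0.1, beta=0.5):
    iou = iou_xyxy(pred, gt)
    s = max(pred[2] - pred[0], pred[3] - pred[1])   # max side of the predicted box
    r = max(gt[2] - gt[0], gt[3] - gt[1])           # max side of the ground-truth box
    c = math.log((s + 1.0) / (r + 1.0))             # Eq. (2)
    return 1.0 - iou + alpha * (1.0 - iou) + beta * (1.0 - c)   # Eq. (1)

print(scylla_iou_loss((10, 10, 60, 50), (12, 8, 58, 52)))
```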

Optimization. Optimizing deep learning models is critical for improving performance, achieving faster convergence, using resources efficiently, scalability, deployment to edge devices, interpretability, and transfer learning. It not only improves accuracy and reduces errors, but also enables models to handle increasingly complex datasets. Various methods such as random scaling, rotation, flipping, and cropping are applied to the training data to optimize YOLO models through data augmentation. This extends the data range, supports the model's ability to generalize to new cases, and improves the efficiency of object detection. Additionally, data augmentation can be used to reduce class imbalance and overfitting, resulting in a more reliable and accurate YOLO model. In this work, we generalize the YOLOv7 model using the vertical-flip data augmentation technique.

4 Dataset Description, Evaluation Indicators and Metrics

The dataset used in this study is the steel tube dataset. It consists of a total of 3344 images classified into 8 defect classes: air hole, air hole hollow, bite edge, broken arc, crack, overlap, slag inclusion, and unfused. The processing platform is a desktop computer running the Windows 10 operating system (CPU: Intel(R) Core(TM) i5-1035G, memory: 12 GB).

Table 1. Evaluation Indices for Classification

Parameter                     Value
Input Image Size              180*180
Epochs                        100
Batch                         16
Optimizer                     Adam
Initial, Final learning rate  1*10-3, 1*10-2

Table 2. Evaluation Indices for Detection

Parameter                     Value
Input Image Size              640*640
Epochs                        100
Batch                         16
Weight Decay                  5*10-4
Momentum                      937*10-3
Weights                       yolov7.pt
Optimizer                     SGD
Initial, Final learning rate  1*10-2, 1*10-1

A Google Colaboratory notebook with an NVIDIA T4 Tensor Core GPU, Python version 3.7.13, and Torch framework version 1.11.0 cu113 was used as the implementation platform for this work. The metrics used in this study to demonstrate the improved performance of the modified YOLOv7 model include mAP, latency and F1-score. Latency is the time required for inference, and the F1-score is the weighted average of precision and recall. Tables 1 and 2 list the evaluation indices used for classification and detection respectively.

5 Results

5.1 Classification

AlexNet and the Sequential model are the two classification models used in this study. The classification accuracy on the steel tube dataset is 97.54% for the sequential model and 97% for AlexNet. Both models provide virtually the same classification accuracy, but the sequential model classifies the dataset 0.54% more accurately. This indicates that the dataset is classified correctly and can be used for defect detection.

5.2 Detection

Table 3 gives the comparison of R-CNN, Faster R-CNN and YOLOv7 on the steel tube dataset. Table 4 compares mAP, recall, F1 score and detection inference between models.

Table 3. Comparison of Object Detection Algorithms

Model Name     mAP@0.5  Precision  Recall
R-CNN          84       81         80.8
Faster R-CNN   86.7     85         85.7
YOLOv7         88.5     86.8       85.7

YOLOv7-CA integrates the CA mechanism into the YOLOv7 network structure. In YOLOv7-SIoU, the SIoU loss function is used instead of the traditional CIoU, and Modified-YOLOv7 is formed by combining CA, SIoU and vertical-flip data augmentation. As can be seen, compared with the original YOLOv7 model, the revised Modified-YOLOv7 improves the average precision by 6.21%, the precision by 6.56%, and the recall by 5.95%. Modified-YOLOv7 has the highest F1 score, 91.6%, among all models. YOLOv7-SIoU has a slightly higher recall of 91%, while the other measures of Modified-YOLOv7 are much better. When computed at a confidence threshold of 0.1, the detection inference of the original YOLOv7 model is 5.7 ms, which is less than all other models. Even though the inference time is slightly higher, Modified-YOLOv7 can still detect steel tube defects in real-time engineering. The visual detection results of Modified-YOLOv7 on the steel tube dataset are shown in Fig. 2.

Table 4. Metrics Comparison for different YOLO models

Model Name        mAP@0.5(%)  Precision(%)  Recall(%)  F1-Score(%)  Detection Inference(ms)
YOLOv7            88.5        86.8          85.7       86.2         5.7
YOLOv7-CA         91.4        87.7          86.3       86.9         7.8
YOLOv7-SIoU       93          90.6          91         90.7         11.3
YOLOv7-CA+SIoU    91.2        93.2          85.6       89.2         8.5
Modified-YOLOv7   94          92.5          90.8       91.6         8.2

Figure 3 shows the results graph for Modified-YOLOv7 on the steel tube dataset. According to the comparison and analysis of the above series of experiments, the improved YOLOv7 algorithm proposed in this work, Modified-YOLOv7, shows clear advantages in recognition accuracy. Even with a slight speed reduction, it is still capable of identifying steel tube defects in real time.


Fig. 2. Visual detection results on the steel tube dataset. In sequence, the pictures are: (a) 7 (unfused), (b) 4, 0 (crack, air hole), (c) 5 (overlap)

Fig. 3. Results Graph for Modified-YOLOv7

6 Conclusion

In this work, we propose an extended version of the YOLOv7 base model, Modified-YOLOv7, to identify defects in steel tube images with complex backgrounds. We also validated the classification of the dataset using two classification methods, the sequential model and AlexNet, and found that both models have similar classification accuracy, with the sequential model outperforming AlexNet by 0.54% on this steel tube dataset. We integrated the CA mechanism into YOLOv7 to improve the expressiveness of the feature map, and used the SIoU loss function instead of the original CIoU to measure the gap between the actual and predicted boxes and to speed up network convergence. Additionally, a vertical-flip data augmentation technique is used to fine-tune the model. Experimental results show that, by incorporating the above tactics, the updated Modified-YOLOv7 improves precision, mAP@0.5 and recall by 6.56%, 6.21% and 5.95%, respectively, compared to the original YOLOv7. This model strikes the right balance between detection accuracy and speed for industrial defect detection standards, and it can be applied in many future scenarios to identify small targets in industry and agriculture. We plan to extend our research in the future to better understand the network model and pay more attention to the backbone. This study uses grayscale images of the dataset; therefore, we plan to extend our investigation to the use of hyperspectral images and evaluate the model's performance.

References
1. Park, M., Jeong, J.: Design and implementation of machine vision-based quality inspection system in mask manufacturing process. Sustainability 14(10), 6009 (2022)
2. Yenduri, G., et al.: GPT (Generative Pre-trained Transformer): a comprehensive review on enabling technologies, potential applications, emerging challenges, and future directions
3. Maddikunta, P.K.R., et al.: Industry 5.0: a survey on enabling technologies and potential applications. J. Ind. Inf. Integr. 26, 100257 (2022)
4. Du, J.: Understanding of object detection based on CNN family and YOLO. J. Phys. Conf. Ser. 1004, 012029 (2018)
5. Diwan, T., Anirudh, G., Tembhurne, J.V.: Object detection using YOLO: challenges, architectural successors, datasets and applications. Multimedia Tools Appl. 82(6), 9243–9275 (2023)
6. Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7464–7475 (2023)
7. Yang, D., Cui, Y., Yu, Z., Yuan, H.: Deep learning based steel pipe weld defect detection. Appl. Artif. Intell. 35(15), 1237–1249 (2021)
8. Wang, S., Xia, X., Ye, L., Yang, B.: Automatic detection and classification of steel surface defect using deep convolutional neural networks. Metals 11(3), 388 (2021)
9. Li, Y., Lin, S., Liu, C., Kong, Q.: The defects detection in steel coil end face based on SCED-net. In: 2022 International Joint Conference on Neural Networks (IJCNN), pp. 1–6. IEEE (2022)
10. Zeng, W., You, Z., Huang, M., Kong, Z., Yu, Y., Le, X.: Steel sheet defect detection based on deep learning method. In: 2019 Tenth International Conference on Intelligent Control and Information Processing (ICICIP), pp. 152–157. IEEE (2019)
11. Rani, D.S., Burra, L.R., Kalyani, G., Rao, B., et al.: Edge intelligence with lightweight CNN model for surface defect detection in manufacturing industry. J. Sci. Ind. Res. 82(02), 178–184 (2023)
12. Brownlee, J.: A gentle introduction to object recognition with deep learning. Machine Learning Mastery, May 2019
13. Deepa, N., et al.: An AI-based intelligent system for healthcare analysis using ridge-adaline stochastic gradient descent classifier. J. Supercomput. 77, 1998–2017 (2021)
14. Bedeir, R.H., Mahmoud, R.O., Zayed, H.H.: Automated multi-class skin cancer classification through concatenated deep learning models. IAES Int. J. Artif. Intell. 11(2), 764 (2022)
15. Sharma, H., Zerbe, N., Klempert, I., Hellwich, O., Hufnagl, P.: Deep convolutional neural networks for automatic classification of gastric carcinoma using whole slide images in digital histopathology. Comput. Med. Imaging Graph. 61, 2–13 (2017)
16. Gallo, I., Rehman, A.U., Dehkordi, R.H., Landro, N., La Grassa, R., Boschetti, M.: Deep object detection of crop weeds: performance of YOLOv7 on a real case dataset from UAV images. Remote Sens. 15(2), 539 (2023)
17. Zhang, Y., Sun, Y., Wang, Z., Jiang, Y.: YOLOv7-RAR for urban vehicle detection. Sensors 23(4), 1801 (2023)
18. Weiss, K., Khoshgoftaar, T.M., Wang, D.: A survey of transfer learning. J. Big Data 3(1), 1–40 (2016)
19. Liu, R., et al.: An intriguing failing of convolutional neural networks and the CoordConv solution. In: Advances in Neural Information Processing Systems, vol. 31 (2018)
20. Chen, X., Girshick, R., He, K., Dollár, P.: TensorMask: a foundation for dense object segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2061–2069 (2019)
21. Gevorgyan, Z.: SIoU loss: more powerful learning for bounding box regression. arXiv preprint arXiv:2205.12740 (2022)
22. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_13

GACE: Learning Graph-Based Cross-Page Ads Embedding for Click-Through Rate Prediction

Haowen Wang(B), Yuliang Du, Congyun Jin, Yujiao Li, Yingbo Wang, Tao Sun, Piqi Qin, and Cong Fan

Intelligence Department of Merchants Operation, AntGroup, Shanghai, China
{wanghaowen.whw,duyuliang.dyl,jincongyun.jcy,liyujiao.lyj,wangyingbo.wyb,suntao.sun,piqi.qpq,fancong.fan}@antgroup.com

Abstract. Predicting click-through rate (CTR) is the core task of many online ads recommendation systems, which helps improve user experience and increase platform revenue. In this type of recommendation system, we often encounter two main problems: the joint usage of multi-page historical advertising data and the cold start of new ads. In this paper, we propose GACE, a graph-based cross-page ads embedding generation method. It can warm up and generate the representation embedding of cold-start and existing ads across various pages. Specifically, we carefully build linkages and a weighted undirected graph model considering semantic and page-type attributes to guide the direction of feature fusion and generation. We design a variational auto-encoding task as the pre-training module and generate embedding representations for new and old ads based on this task. The results evaluated on the public dataset AliEC from RecBole and a real-world industry dataset from Alipay show that our GACE method is significantly superior to the SOTA method. In online A/B tests, the click-through rate on three real-world pages from Alipay increased by 3.6%, 2.13%, and 3.02%, respectively. Especially in the cold-start task, the CTR increased by 9.96%, 7.51%, and 8.97%, respectively.

Keywords: embedding learning · cross-page prediction · click-through rate · graph neural network

1 Introduction

With the increase in APP pages and advertising frequency, there are challenges in improving the efficiency of the essential advertising CTR task for e-commerce platform applications [2,16]. For e-commerce platforms such as Taobao, Alipay, etc., we have recently seen an increase in recommendation channels on different pages of the APPs. This means that the recommendation behavior and user interaction history on multiple pages can be considered jointly. The recommendation performance on multiple pages can be improved by using the interaction information of multiple pages, including improving the distribution efficiency of cold-start advertising.


Over the years, deep-learning-based models [4,11,24,27] have been proven to improve the efficiency of ads distribution thanks to their powerful ability to intersect and fuse features. Despite the remarkable success of these models, their performance in practice still depends primarily on the quality of the input embedding vectors. A high-quality ad embedding vector has been proven to effectively improve the accuracy of CTR prediction [17,18,30]. In this paper, we argue that the differences between pages should be described and considered in a cross-page universal recommendation system, and that this concept should be applied to the item embedding generation process, which existing item embedding generation methods have not yet paid attention to. We propose a graph-based cross-page ad embedding learning framework (GACE), an improved variational graph auto-encoder [15]. An ad's features are mainly composed of semantic knowledge (advertising text), page knowledge, and user interaction knowledge. We first design a weighted undirected graph network [20] whose links are obtained from the semantic similarity of advertising texts and the similarity of page representations. Based on the graph attention network [26], we carefully design an auto-encoding task [14] on the undirected weighted graph as the pre-training module, in which the variational graph attention encoder can adaptively extract information. Through such a pre-training task, both old and new ads can obtain embeddings that take cross-page neighbor information into account. The main contributions of this work can be summarized as follows:
– We build linkages based on the text information of the advertising content and page features to generate a weighted undirected graph that guides the process of ad information transfer.
– We design a pre-training task based on an improved variational graph auto-encoder for the weighted undirected ad graph, which can generate embeddings that consider neighbor information for either new or old ads.
– We conduct extensive experiments on the public large-scale real-world dataset AliEC and offline experimental datasets from Alipay, and evaluate through online A/B testing on the Alipay platform, which proves that GACE achieves significant CTR improvement on each page, especially in the advertising cold-start scenario.

2 Related Works

Our work is committed to improving the click-through rate of recommendation systems by generating ad item embeddings that take cross-page information into account. It mainly involves several research fields: general CTR prediction in recommendation systems and the learning of advertisement embeddings.

2.1 General CTR Recommendation Systems

The CTR prediction task in online advertising recommendation systems is to predict the click probability of a given user for a given advertisement, which


plays an increasingly important role in various personalized online advertising recommendation services. In recent years, deep learning methods based on deep neural networks have been able to interact and represent features in complex ways, and a series of deep neural network models have been proposed. The Wide&Deep [4] model attempts to capture high-level feature interactions through a simple multilayer perceptron (MLP) [9] network. DeepFM [11] and DCN [27] were proposed to handle complex feature interactions based on product operations. AutoInt [22] uses the multi-head self-attention mechanism to produce high-level composite features. In addition, for scenarios with sequential characteristics, a series of sequence-based neural networks have been proposed: Deep Interest Network (DIN) [30], Behavior Sequence Transformer (BST) [3], Deep Session Interest Network (DSIN) [8], Bert4Rec [24], etc. However, the performance of these models depends mainly on the quality of the input item and user embeddings, and they cannot effectively solve the cold-start problem of new advertisements. They often fail to perform satisfactorily in sparse page scenarios or when distributing ads not seen in the training set.

2.2 Item Embedding Generation

For item embedding, the first concept was to establish a one-hot vector for the item ID representation. Embedding learning methods based on collaborative filtering and matrix decomposition were subsequently proposed, but these methods have weak generalization ability and perform poorly for items with sparse historical behavior. Researchers then extended the concept of word embedding from natural language processing (NLP) [5] to the recommendation field and proposed Item2Vec [1] based on the concept of Word2Vec [6]. With the development of graph neural networks, embedding learning methods based on graph networks have also been proposed, such as DeepWalk [19] and GraphSage [12]. However, the embeddings of new advertisements without historical interaction cannot be learned via these methods. For the embedding learning of cold-start new items in particular, a common solution is a series of neighbor-based embedding methods built on manifold learning. Some researchers have also proposed generative methods from the perspective of meta-learning, such as the MetaEmbedding model [21] and AutoDis [10]. However, these methods only help learn the embedded representation of new advertisements, and it is difficult for them to improve the recommendation efficiency of existing old items.

3 Preliminaries

The information stored in the item knowledge base mainly includes three parts, as shown in Fig. 1: semantic knowledge, user interaction knowledge, and page knowledge.


Fig. 1. Item Knowledge Base: semantic knowledge, user interaction knowledge and page knowledge

3.1 Item Knowledge Base

Semantic Knowledge. Semantic knowledge is the content information of ads. Advertising recommendations usually include the advertising text and its corresponding attributes (such as font size, style, color, language, etc.).

User Interaction Knowledge. User interaction knowledge is a vital part of the dynamic attributes of advertising. Its indicators (UV, PV, UVCTR, PVCTR) reflect the historical conversion effect of an ad and user satisfaction. UV: unique visitors, the number of distinct individuals visiting an ad within the specified period (usually one day). PV: page views, the number of times a specific ad is accessed in a specific period (usually one day). UVCTR: click-through rate over UV, the percentage of unique visitors who see the ad and click it; it is computed as clicks by unique visitors divided by impressions to unique visitors. PVCTR: click-through rate over PV, the percentage of ad page views that result in a click; it is computed as ad clicks divided by ad impressions.

Page Knowledge. Page knowledge includes the identification of the page where an ad is distributed. We count the number of ads distributed on specific page channels and aggregate the average value of the user interaction knowledge (UV, PV, UVCTR, PVCTR) of historical ads distributed on a specific page channel to serve as the embedded representation of the page.
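As a small illustration of the two ratio indicators above, the following is a minimal sketch (not from the paper) of how UVCTR and PVCTR can be computed from raw counts; the toy numbers are hypothetical:

```python
def uvctr(unique_visitor_clicks, unique_visitor_impressions):
    """UVCTR: clicks by unique visitors / impressions to unique visitors."""
    return unique_visitor_clicks / unique_visitor_impressions

def pvctr(ad_clicks, ad_impressions):
    """PVCTR: ad clicks / ad impressions (page-view level)."""
    return ad_clicks / ad_impressions

# toy usage with hypothetical daily counts
print(uvctr(120, 3_000), pvctr(150, 4_500))  # 0.04 and ~0.0333
```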

3.2 Problem Formulation

The task of an online advertising system is to establish a prediction model to estimate the click probability of a specific user for a specific ad. Each instance includes multi-field information: user information ('User ID', 'City', 'Age', etc.) and the item knowledge base, combined with the label from historical user interaction feedback.

4 Methodology

4.1 Overview

We propose a graph-based cross-page ads embedding learning method (GACE) that fully exploits the useful information in the item knowledge base. Our model contains two steps: graph creation and pre-training embedding learning. In the graph creation step, three entity embedding vectors for each item are extracted or encoded from the item knowledge base, representing latent knowledge in each domain. We then establish a weighted undirected graph structure based on the semantic knowledge and page knowledge, and set the concatenation of the entity embeddings in the item knowledge base as the initial embedding of each graph node. In the pre-training embedding learning step, we propose a pre-training task based on an improved graph auto-encoder for the weighted undirected graph established in the first step, in order to learn the latent embedding representation of each advertising node in the graph. Accordingly, we obtain the optimized parameters of the ad embedding encoder and the latent vector representation of ad items. The graph creation and pre-training embedding learning steps are further discussed in Sects. 4.2 and 4.3.

4.2 Graph Creation

Typically, for a graph G = (V, E), where V = {v_1, ..., v_n} is a set of ad nodes and E is a set of weighted edges, we set the concatenation of the three entity embeddings in the item knowledge base as the ad node's initial vector v. We propose an adjacency weighting matrix A ∈ R^{n×n}, where A_{i,j} ≥ 0, to represent the graph structure. If A_{i,j} > 0, there is an edge between ads item i and ads item j, and A_{i,j} represents the edge weighting. If A_{i,j} = 0, there is no connection between ads item i and ads item j. The adjacency weighting matrix is calculated based on semantic knowledge and page knowledge. Since the semantic knowledge of advertising mainly consists of sentences, we use the output of a sentence transformer [7] as the semantic knowledge vector s ∈ R^k. As mentioned in Sect. 3.1, we count the total number of ads placed on specific page channels and aggregate the average value of the user interaction knowledge of historical ads placed on a specific page channel as the page knowledge vector p ∈ R^d. The graph creation process is shown in Fig. 2.


Fig. 2. The graph creation part for GACE using item knowledge base

We use the dot product as the semantic knowledge similarity α and the page similarity β, and calculate the advertising weight matrix A based on α and β as follows:

α_{i,j} = s_i · s_j    (1)

β_{i,j} = p_i · p_j    (2)

ReLU(x) = max(0, x)    (3)

A_{i,j} = ReLU(α_{i,j}) · ReLU(β_{i,j})    (4)

where α_{i,j} refers to the semantic knowledge similarity between ads item i and ads item j, and s_i and s_j are the semantic knowledge vectors of ads item i and ads item j. Similarly, β_{i,j} is the page similarity between ads item i and ads item j, and p_i and p_j are the page knowledge vectors of ads item i and ads item j. ReLU is a non-linear activation function.
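The following is a minimal NumPy sketch of Eqs. (1)-(4), not the authors' implementation; zeroing the diagonal (no self-loop weights) is an added assumption:

```python
import numpy as np

def build_adjacency(S, P):
    """Weighted adjacency matrix A of Eq. (4).

    S: (n, k) semantic knowledge vectors, one row per ad (e.g. sentence-transformer outputs).
    P: (n, d) page knowledge vectors, one row per ad.
    """
    alpha = S @ S.T                                    # Eq. (1): semantic similarity (dot product)
    beta = P @ P.T                                     # Eq. (2): page similarity (dot product)
    A = np.maximum(alpha, 0.0) * np.maximum(beta, 0.0) # Eqs. (3)-(4): ReLU both, then multiply
    np.fill_diagonal(A, 0.0)                           # assumption: ignore self-edges
    return A
```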

4.3 Pre-training Embedding Learning

In this section, in order to integrate the knowledge of the different entities of the ad nodes and their neighbors, we design a variational graph auto-encoder and an elaborate self-encoding task as a pre-training module that recovers the weighted undirected graph structure, as shown in Fig. 3. After the pre-training task, the encoder can generate embedding representations Z for new and old ads.


Fig. 3. The workflow for self-encode task based on variational graph auto-encoder.

Encoder: We design an encoder based on a variational encoder and a graph attention neural network, which can adaptively adjust the weighting between ad nodes. The latent node vector z_i for ads item i is introduced as:

q(z_i | X, A) = N(z_i | μ_i, diag(σ_i²))    (5)

GAT(A, X) = γ_{i,i} W x_i + Σ_{j∈N(i)} γ_{i,j} W x_j    (6)

μ = GAT_μ(A, X)    (7)

log σ = GAT_σ(A, X)    (8)

where x_i ∈ R^F is the node vector of ads item i, and F is the dimension of each node feature. W ∈ R^{F′×F} is a trainable weighting matrix used to distill useful information, and F′ is the transformation size. N(i) is the set of neighbors of node i, and the cross-page attention weighting γ_{i,j} is calculated as:

γ_{i,j} = exp(LeakyReLU(aᵀ[W x_i || W x_j]) + Θ_{i,j}) / Σ_{k∈N(i)} exp(LeakyReLU(aᵀ[W x_i || W x_k]) + Θ_{i,k})    (9)

Θ_{i,j} = softmax_j(A_{i,j}) = exp(A_{i,j}) / Σ_{k∈N(i)} exp(A_{i,k})    (10)

LeakyReLU(x) = max(0, x) + negative_slope · min(0, x)    (11)

where LeakyReLU and softmax are non-linear activation functions. The attention mechanism is a feed-forward neural network parameterized by a weight vector a ∈ R^{2F′}. Θ_{i,j} is the enhanced and normalized edge weighting between ads item i and ads item j.
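A hedged NumPy sketch of the attention weights in Eqs. (9)-(11) for a single node follows; the negative slope value and the per-node normalization details are our assumptions, not taken from the paper:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def cross_page_attention(h_i, H_nbrs, a, A_row, negative_slope=0.2):
    """Attention coefficients gamma_ij over the neighbours of one ad node.

    h_i:    (F',) transformed feature W @ x_i of the centre node.
    H_nbrs: (m, F') transformed features W @ x_j of its m neighbours.
    a:      (2F',) attention weight vector.
    A_row:  (m,) adjacency weights A_ij towards the same neighbours.
    """
    leaky = lambda x: np.maximum(x, 0) + negative_slope * np.minimum(x, 0)  # Eq. (11)
    theta = softmax(A_row)                                                  # Eq. (10): edge-weight bias
    logits = np.array([leaky(a @ np.concatenate([h_i, h_j])) + t
                       for h_j, t in zip(H_nbrs, theta)])
    return softmax(logits)                                                  # Eq. (9): attention coefficients
```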


Decoder: For the non-probabilistic variant of the GAE model, we reconstruct the adjacency weighting matrix Â as:

Â = σ(Z Zᵀ)    (12)

where σ(·) is the ReLU non-linear activation function.

Learning: For GACE pre-training, we design L_r to optimize the reconstruction of the adjacency weighting structure, and L_n to ensure that the generated vectors follow a normal distribution:

L = L_r + L_n = Σ_{i=0}^{n} KL[q(Â_i) || p(A_i)] + (−KL[q(Z | X, A) || p(Z)])    (13)

where KL[q(·) || p(·)] is the Kullback-Leibler divergence [13] between distributions q(·) and p(·). We take p(Z) = Π_i p(z_i) as a Gaussian prior, and use the Kullback-Leibler divergence in L_r as the reconstruction loss to retain the weighting distribution information. We perform full-batch gradient descent [23] and use the reparameterization trick [28] for training.
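The sketch below shows, under stated assumptions, how the reparameterization trick and the two KL terms of Eq. (13) could be wired together in PyTorch; the row-wise softmax normalisation of edge weights and the sign convention (adding the Gaussian-prior KL as in a standard VAE) are our assumptions rather than details given in the paper:

```python
import torch
import torch.nn.functional as F

def gace_pretrain_loss(mu, log_sigma, A):
    """mu, log_sigma: (n, d) outputs of the two GAT heads (Eqs. 7-8). A: (n, n) target weights."""
    # Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I)
    z = mu + torch.exp(log_sigma) * torch.randn_like(mu)
    A_hat = torch.relu(z @ z.T)                      # Eq. (12): reconstructed edge weights

    # L_r: KL divergence between the row-normalised target and reconstructed edge weights
    p = F.softmax(A, dim=-1)
    log_q = F.log_softmax(A_hat, dim=-1)
    l_r = F.kl_div(log_q, p, reduction="batchmean")

    # L_n: KL(q(Z|X,A) || N(0, I)) keeps the latent codes close to the Gaussian prior
    l_n = -0.5 * torch.mean(
        torch.sum(1 + 2 * log_sigma - mu.pow(2) - torch.exp(2 * log_sigma), dim=1))
    return l_r + l_n, z
```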

5 Experiment

5.1 Dataset

Considering that cross-page information is significant, we carefully selected public datasets that have page-level information. Experiments are conducted on the public dataset AliEC [25] from RecBole [29], which has two pages (regarded as different resource PageID (PID) locations), and a real-world CTR prediction dataset collected from three pages of Alipay. We designed offline comparison experiments, evaluated performance on the different pages respectively, and then conducted an online A/B test. The statistics of the experimental datasets are shown in Table 1.

Table 1. Statistics of the RecBole public dataset AliEC and the Alipay real-world offline experimental dataset.

                                             AliEC                   Alipay Real-World Offline Experimental Dataset
                                             Page1      Page2        Page1       Page2       Page3
Old Ads Num                                  579323     752454       6238        4575        4306
User Num                                     403042     726392       1681441     2179180     5971524
Samples for Warming Embedding and Training   8760940    14235512     16082049    22217967    7776079
Samples for Old Ads Testing                  1324123    2237386      4018443     5554002     1943008
New Ads Num                                  –          –            610         415         407
Samples for New Ads Testing                  –          –            6885571     4316894     7835890

5.2 Experimental Settings

Main CTR Prediction Models. GACE is a pre-training model for item ID embedding generation, so it can be applied to various CTR prediction models that require item embeddings. We conduct experiments on the real-world datasets with the following base CTR prediction models:

Wide&Deep: The Wide&Deep model has been widely adopted in industrial applications. It combines joint training of a wide DNN (for memorization) and a deep DNN (for generalization). We follow the practice in [11] and take the cross-product of user behaviors and candidates as the wide inputs.

DCN: The Deep&Cross model. It combines cross layers with the DNN in Wide&Deep.

DeepFM: The DeepFM model. It applies factorization machines as the "wide" module in Wide&Deep.

Bert4Rec: The Bert4Rec model. It employs deep bidirectional self-attention to model user behavior sequences and is combined with a DNN to process the features of users and item candidates.

Item ID Embedding Models. For each CTR prediction model, we evaluate the following embedding models, which generate initial embeddings for new and old ads.

RndEmb: It uses randomly generated embeddings for new ads and looks up the item knowledge base for old ads' representations.

NgbEmb: It aggregates the initial embeddings of neighbor items and generates the ad embedding z_ngb as follows:

z_ngb_i = x_i + Σ_{j∈N(i)} x_j    (14)

NgbEmb serves as a baseline that only considers neighbor information.

N2VEmb: Node2vec is a graph embedding method that comprehensively considers DFS and BFS neighborhoods. It can be regarded as an extension of DeepWalk that combines DFS and BFS random walks.

GACEEmb: Graph-based cross-page ad embedding. It generates ad item embeddings for both new and old ads considering cross-page and semantic knowledge.

Parameter Setting. We set the dimension of the ad embedding to 15. The MLP part of Wide&Deep, DeepFM, DCN, and Bert4Rec is implemented using the same architecture: fully connected layers with [256, 128, 64] hidden units. For all attention layers in the above models, the number of hidden units is set to 128. All models are implemented in TensorFlow and optimized using the AdamW optimizer. A grid-search strategy is applied to find the optimal hyper-parameters, such as the batch size (among 256, 512, 1024) and the learning rate (1e-2, 1e-3, 5e-4, 1e-4). The experiment for each model with the optimal hyper-parameters is conducted three times and the average result is reported.
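As a point of reference for the baselines above, here is a minimal sketch (our own illustration, not the paper's code) of the NgbEmb aggregation in Eq. (14):

```python
import numpy as np

def ngb_emb(X, A):
    """NgbEmb baseline (Eq. 14): each ad keeps its own embedding plus the sum of its neighbours'.

    X: (n, d) initial ad embeddings.
    A: (n, n) weighted adjacency matrix; A[i, j] > 0 marks an edge.
    """
    nbr = (A > 0).astype(X.dtype)   # neighbourhood indicator for N(i)
    return X + nbr @ X              # z_ngb_i = x_i + sum over j in N(i) of x_j
```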


Evaluation Scheme. Model performance is evaluated on different pages, and the performance on new and old ads is also compared. We use the following evaluation metrics in our recommendation tasks: AUC and Loss are adopted in the offline comparison experiments to evaluate the models' performance.

AUC: AUC measures the entire two-dimensional area underneath the ROC curve. It is widely used for evaluating classification models and reflects the probability that the model ranks a randomly chosen positive example higher than a randomly chosen negative example. A higher AUC indicates better model performance.

Loss: We use the cross-entropy loss of the click-through rate task on the test set to evaluate the learning process. A smaller loss indicates better model performance.

loss = (1 / |Y|) Σ_{Ŷ∈Y} [−Y log Ŷ − (1 − Y) log(1 − Ŷ)]    (15)
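Both metrics are standard; a short, self-contained way to compute them with scikit-learn, using hypothetical toy labels and predictions, is:

```python
import numpy as np
from sklearn.metrics import roc_auc_score, log_loss

y_true = np.array([1, 0, 0, 1, 1])            # hypothetical click labels
y_pred = np.array([0.8, 0.2, 0.4, 0.7, 0.9])  # hypothetical predicted CTRs

auc = roc_auc_score(y_true, y_pred)   # higher is better
loss = log_loss(y_true, y_pred)       # mean cross-entropy of Eq. (15), lower is better
print(auc, loss)
```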

5.3 Result from Model Comparison on Public AliEC Dataset

In this section, we conduct model comparison experiments on the public AliEC dataset and evaluate model performance in two different page scenarios. Table 2 shows the results on the public AliEC dataset. Models with deep feature interactions perform better than the original Wide&Deep model. GACEEmb stands out significantly among all the other item-embedding competitors on both evaluated pages. We attribute this to the neighbor-information aggregation mechanism that considers similar content and similar distribution pages. Through improved variational graph auto-encoding, GACE obtains an enhanced representation of ad items.

5.4 Result from Model Comparison on Alipay Offline Experimental Dataset

In this section, we conduct offline comparison experiments and evaluate model performance on the different Alipay pages respectively. Specifically, we evaluate the impact of the GACE embedding generation framework on the recommendation effect of new and old ads under different CTR prediction models.

Effectiveness of Cold-Start Ads. Table 3 exhibits the performance of various embedding models under different CTR prediction models for cold-start ads. It can be observed that NgbEmb performs better than RndEmb, which means that neighbors' related attributes can contribute useful information and alleviate the cold-start problem. N2VEmb improves further over NgbEmb, indicating that simply averaging pre-trained neighbor ID embeddings is insufficient and that considering more neighbor information proves effective. GACE leads to a more stable and effective performance improvement than NgbEmb and N2VEmb, showing that the marginal effect of aggregating neighbor information can be further improved by considering content and page information.


Table 2. Test AUC and loss for the public dataset AliEC on existing old ads. Pred Model: CTR prediction model. Embed Model: embedding generation model.

Pred Model    Embed Model   Page1 AUC   Page1 Loss   Page2 AUC   Page2 Loss
Wide & Deep   RndEmb        0.520       0.511        0.505       0.640
              NgbEmb        0.537       0.489        0.537       0.562
              N2VEmb        0.578       0.453        0.590       0.428
              GACEEmb       0.588       0.346        0.595       0.337
DCN           RndEmb        0.573       0.352        0.511       0.290
              NgbEmb        0.578       0.339        0.570       0.272
              N2VEmb        0.586       0.328        0.597       0.255
              GACEEmb       0.595       0.315        0.602       0.252
DeepFM        RndEmb        0.557       0.345        0.572       0.295
              NgbEmb        0.582       0.314        0.600       0.260
              N2VEmb        0.593       0.292        0.605       0.257
              GACEEmb       0.599       0.278        0.616       0.252
Bert4Rec      RndEmb        0.579       0.318        0.559       0.282
              NgbEmb        0.591       0.309        0.590       0.261
              N2VEmb        0.602       0.299        0.608       0.254
              GACEEmb       0.610       0.258        0.627       0.245

Effectiveness of Old Ads. The performance of the various embedding models for warming up old ads' embeddings is compared in Table 4. The item warm-up graph contains both old and new ads, so the embeddings of old ads can also be provided by GACE learning. It is observed that GACE performs best in both the cold-start phase and the warm-up phase, indicating that aggregating information from physical and semantic neighbors can effectively improve the knowledge representation of old ads. The pre-training loss design with double KL divergence helps the embedding generation retain the information between ads to the maximum extent and map it to a richer expression space.

5.5 Result from Online A/B Testing

In order to address the limitations of offline evaluation and demonstrate the practical value of GACE, we further deployed the GACE model in an actual online environment, which tens of millions of users access daily. An online A/B test was designed to further evaluate the performance of GACE. We conducted a week-long rigorous A/B test with three objectives: to cover more users, to cover more cold-start items, and to collect more reliable results. Bert4Rec with N2VEmb is the main model already serving in the recommendation system. We allocated 10% of the Bert4Rec traffic with N2VEmb as the baseline


Table 3. Test AUC and loss for the Alipay real-world dataset on cold-start ads. Pred Model: CTR prediction model. Embed Model: embedding generation model.

Pred Model    Embed Model   Page1 AUC   Page1 Loss   Page2 AUC   Page2 Loss   Page3 AUC   Page3 Loss
Wide & Deep   RndEmb        0.700       1.074        0.531       0.339        0.344       0.997
              NgbEmb        0.733       0.817        0.551       0.331        0.463       0.449
              N2VEmb        0.741       0.759        0.566       0.326        0.541       0.386
              GACEEmb       0.790       0.645        0.608       0.264        0.892       0.247
DCN           RndEmb        0.610       1.185        0.569       0.343        0.345       0.873
              NgbEmb        0.641       1.133        0.597       0.314        0.446       0.467
              N2VEmb        0.702       0.868        0.609       0.295        0.511       0.429
              GACEEmb       0.789       0.594        0.635       0.279        0.706       0.380
DeepFM        RndEmb        0.557       0.727        0.764       0.564        0.416       0.331
              NgbEmb        0.582       0.759        0.854       0.611        0.351       0.445
              N2VEmb        0.593       0.772        0.691       0.649        0.294       0.513
              GACEEmb       0.796       0.536        0.663       0.273        0.874       0.251
Bert4Rec      RndEmb        0.744       0.791        0.565       0.417        0.380       1.125
              NgbEmb        0.758       0.787        0.593       0.335        0.428       0.397
              N2VEmb        0.764       0.727        0.613       0.292        0.557       0.371
              GACEEmb       0.783       0.548        0.676       0.244        0.911       0.225

and compared it with 10% of the Bert4Rec traffic with GACEEmb. We evaluated the new and old items that appeared during this week separately. Table 5 illustrates the cumulative relative improvement of the experimental model compared to the baseline model. The average CTR when using GACE increased by 3.61%, 2.13%, and 3.02% on the three pages, respectively. For cold-start ads distribution, the CTR increased by 9.96%, 7.51%, and 8.97%, respectively. It is worth noting that both improvements are statistically significant (p-value less than 0.05). This practice-oriented experiment demonstrates the effectiveness of our model in real-world recommendation scenarios.


Table 4. Test AUC and loss for the Alipay real-world dataset on existing old ads. Pred Model: CTR prediction model. Embed Model: embedding generation model.

Pred Model    Embed Model   Page1 AUC   Page1 Loss   Page2 AUC   Page2 Loss   Page3 AUC   Page3 Loss
Wide & Deep   RndEmb        0.705       0.811        0.562       0.651        0.863       0.573
              NgbEmb        0.727       0.776        0.598       0.572        0.881       0.508
              N2VEmb        0.783       0.719        0.656       0.435        0.921       0.253
              GACEEmb       0.803       0.549        0.714       0.343        0.939       0.207
DCN           RndEmb        0.796       0.558        0.621       0.295        0.903       0.215
              NgbEmb        0.802       0.537        0.693       0.277        0.912       0.156
              N2VEmb        0.813       0.520        0.726       0.259        0.938       0.137
              GACEEmb       0.826       0.499        0.731       0.256        0.952       0.112
DeepFM        RndEmb        0.793       0.548        0.664       0.300        0.902       0.165
              NgbEmb        0.829       0.498        0.697       0.265        0.911       0.156
              N2VEmb        0.844       0.463        0.702       0.261        0.945       0.133
              GACEEmb       0.867       0.441        0.727       0.256        0.961       0.124
Bert4Rec      RndEmb        0.838       0.505        0.671       0.287        0.922       0.150
              NgbEmb        0.856       0.490        0.708       0.266        0.939       0.144
              N2VEmb        0.872       0.475        0.730       0.258        0.946       0.125
              GACEEmb       0.883       0.409        0.753       0.249        0.963       0.112

Table 5. Cumulative relative improvement of the experimental model compared to the baseline model over a week. Baseline: Bert4Rec + N2VEmb.

Model                 Eval scope       Page1     Page2     Page3
Bert4Rec + GACEEmb    All ads          +3.61%    +2.13%    +3.02%
                      Cold-start ads   +9.96%    +7.51%    +8.97%

6 Conclusion

This paper addresses the CTR prediction problem for cross-page ads whose ID embeddings are not yet well learned. Graph-based cross-page ad embedding (GACE) can effectively learn how to generate desirable ad item embeddings using cross-page data based on graph neural networks. It takes the page-level similarity relationship and the semantic content relationship into account simultaneously, establishes a graph to connect all ads across pages, and adaptively extracts information from cross-page adjacent ads. Because of the characteristics of the generative model, it can generate efficient embedding representations of new and old ads simultaneously. The experiment results show that GACE can effectively improve CTR task performance for both new and old ads on four major deep-learning-based models. In the future, we will consider enhancing the neighbor representation, trying other methods to retrieve more information from neighbors, and extending the approach to CVR and GMV recall application scenarios.

References 1. Barkan, O., Koenigstein, N.: Item2vec: neural item embedding for collaborative filtering. arXiv (2016) 2. Chen, J., Sun, B., Li, H., Lu, H., Hua, X.S.: Deep ctr prediction in display advertising. In: Proceedings of the 24th ACM international conference on Multimedia, pp. 811–820 (2016) 3. Chen, Q., Zhao, H., Li, W., Huang, P., Ou, W.: Behavior sequence transformer for e-commerce recommendation in alibaba (2019) 4. Cheng, H.T., et al.: Wide & deep learning for recommender systems. In: Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, pp. 7–10 (2016) 5. Chowdhary, P.: Fundamentals of Artificial Intelligence. Fundam. Artifi. Intell. (2020) 6. Church, W.K.: Word2vec. Nat. Lang. Eng. 23(01), 155–162 (2017) 7. Devika, R., Vairavasundaram, S., Mahenthar, C.S.J., Varadarajan, V., Kotecha, K.: A deep learning model based on bert and sentence transformer for semantic keyphrase extraction on big social data. IEEE Access 9, 165252–165261 (2021) 8. Feng, Y., et al.: Deep session interest network for click-through rate prediction (2019) 9. Goh, K.L., Singh, A.K., Lim, K.H.: Multilayer perceptrons neural network based web spam detection application. In: IEEE China Summit & International Conference on Signal & Information Processing (2013) 10. Guo, H., Chen, B., Tang, R., Li, Z., He, X.: Autodis: automatic discretization for embedding numerical features in ctr prediction (2020) 11. Guo, H., Tang, R., Ye, Y., Li, Z., He, X.: Deepfm: a factorization-machine based neural network for ctr prediction. arXiv preprint arXiv:1703.04247 (2017) 12. Hamilton, W.L., Ying, R., Leskovec, J.: Inductive representation learning on large graphs (2017) 13. Joyce, J.M.: Kullback-leibler divergence. In: International Encyclopedia Of Statistical Science, pp. 720–722. Springer (2011). https://doi.org/10.1007/978-3-64204898-2_327 14. Khawar, F., Poon, L., Zhang, N.L.: Learning the structure of auto-encoding recommenders. In: Proceedings of The Web Conference 2020, pp. 519–529 (2020) 15. Kipf, T.N., Welling, M.: Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016) 16. Lipmaa, H., Rogaway, P., Wagner, D.: Ctr-mode encryption. In: First NIST Workshop on Modes of Operation, vol. 39, Citeseer, MD (2000) 17. Okura, S., Tagami, Y., Ono, S., Tajima, A.: Embedding-based news recommendation for millions of users. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1933–1942 (2017) 18. Ouyang, W., et al.: Deep spatio-temporal neural networks for click-through rate prediction. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2078–2086 (2019) 19. Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: online learning of social representations. ACM (2014)


20. Pettie, S., Ramachandran, V.: A shortest path algorithm for real-weighted undirected graphs. SIAM J. Comput. 34(6), 1398–1431 (2005) 21. Liu, Q., Lu, J., Zhang, G., Shen, T., Zhang, Z., Huang, H.: Domain-specific metaembedding with latent semantic structures - sciencedirect. Inform. Sci. (2020) 22. Song, W., Shi, C., Xiao, Z., Duan, Z., Tang, J.: Autoint: automatic feature interaction learning via self-attentive neural networks. In: The 28th ACM International Conference (2019) 23. Soodabeh, A., Manfred, V.: A learning rate method for full-batch gradient descent. Műszaki Tudományos Közlemények 13(1), 174–177 (2020) 24. Sun, F., Liu, J., Wu, J., Pei, C., Lin, X., Ou, W., Jiang, P.: Bert4rec: sequential recommendation with bidirectional encoder representations from transformer. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, CIKM 2019, pp. 1441–1450. Association for Computing Machinery, New York (2019). https://doi.org/10.1145/3357384.3357895,https:// doi.org/10.1145/3357384.3357895 25. Tianchi: (2018). https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 26. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., Bengio, Y.: Graph attention networks. arXiv preprint arXiv:1710.10903 (2017) 27. Wang, R., Fu, B., Fu, G., Wang, M.: Deep & cross network for ad click predictions. In: Proceedings of the ADKDD 2017, pp. 1–7 (2017) 28. Wilson, J.T., Moriconi, R., Hutter, F., Deisenroth, M.P.: The reparameterization trick for acquisition functions. arXiv preprint arXiv:1712.00424 (2017) 29. Zhao, W.X., et al.: Recbole 2.0: towards a more up-to-date recommendation library. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management, pp. 4722–4726 (2022) 30. Zhou, G., et al.: Deep interest network for click-through rate prediction. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1059–1068 (2018)

TEZARNet: TEmporal Zero-Shot Activity Recognition Network

Pathirage N. Deelaka1, Devin Y. De Silva1, Sandareka Wickramanayake1(B), Dulani Meedeniya1, and Sanka Rasnayaka2

1 Department of Computer Science and Engineering, University of Moratuwa, Moratuwa, Sri Lanka {nipun.18,devin.18,sandarekaw,dulanim}@cse.mrt.ac.lk
2 School of Computing, National University of Singapore, Singapore, Singapore [email protected]

Abstract. Human Activity Recognition (HAR) using Inertial Measurement Unit (IMU) sensor data has practical applications in healthcare and assisted living environments. However, its use in real-world scenarios has been limited due to the lack of comprehensive IMU-based HAR datasets covering various activities. Zero-shot HAR (ZS-HAR) can overcome these data limitations. However, most existing ZS-HAR methods based on IMU data rely on attributes or word embeddings of class labels as auxiliary data to relate the seen and unseen classes. This approach requires expert knowledge and lacks motion-specific information. In contrast, videos depicting various human activities provide valuable information for ZS-HAR based on inertial sensor data, and they are readily available. Our proposed model, TEZARNet: TEmporal Zero-shot Activity Recognition Network, uses videos as auxiliary data and employs a Bidirectional Long Short-Term Memory IMU encoder to exploit temporal information, distinguishing it from current work. The proposed model outperforms the state-of-the-art accuracy by 4.7%, 7.8%, 3.7%, and 9.3% for the benchmark datasets PAMAP2, DaLiAc, UTD-MHAD, and MHEALTH, respectively. The code is available at https://github.com/nipdep/TEZARNet

Keywords: Human Activity Recognition · Zero-shot Learning · Inertial Measurement Unit Data

1 Introduction

Human activity recognition (HAR) has numerous applications in healthcare and ambient assisted living, such as fall detection [8], remote elder care, and fitness tracking [1]. Further, the growing usage of wearable devices has enabled Inertial Measurement Unit (IMU) sensor data-based HAR as an alternative to video-based HAR. However, owing to the large number of possible human activities, collecting sensor-based datasets containing many activities costs time and effort. The existing IMU datasets [4,10,11,16] only cover a limited number of activities, e.g., 20


activities. As a result, supervised models trained on such datasets have limited usage in the real world because they cannot identify activities not present during the training phase. Zero-Shot Human Activity Recognition (ZS-HAR) solves the problem of recognizing previously unseen activities by relating seen and unseen classes via a semantic space built using auxiliary data. Most existing ZS-HAR models based on IMU data utilize handcrafted attributes, word embeddings of the class labels, or word descriptions as the auxiliary data to build the semantic space [13,20]. However, defining attributes for human activities requires expert knowledge, while word descriptions may not correctly detail the activities. Besides, these auxiliary data lack motion-specific information such as attitude differences, orientations, velocity, acceleration, and repetitions. As an alternative, videos of human activities can be used to construct the semantic space as they are widely available and contain motion-specific information. Recently, Tong et al. [17] have proposed a ZS-HAR model that builds the semantic space based on video data. However, they do not utilize the temporal information in IMU data to construct the feature vectors for IMU data. This paper proposes TEZARNet: TEmporal Zero-shot Activity Recognition Network. TEZARNet consists of a Bi-Directional Long-Short Term Memory (BiLSTM) network [12] based IMU encoder to extract the temporal information encoded in the multivariate IMU signals to improve the Zero-shot classification accuracy. We build the semantic space using the videos of activity classes. Furthermore, we utilize a K-Nearest Neighbour (KNN) algorithm to infer the activity class for unseen data, further improving accuracy. Extensive evaluations using four widely used IMU datasets demonstrate that the TEZARNet improves Zero-shot classification accuracy. TEZARNet outperforms the state of the art accuracy by 4.7%, 7.8%, 3.7% and 9.3% for datasets PAMAP2 [16], DaLiAc [11], UTD-MHAD [4] and MHEALTH [2] respectively.

2 Related Work

Zero-Shot Human Activity Recognition (ZS-HAR) involves determining the class labels of activities without any labeled instances during training. Early work in sensor-based ZS-HAR relied heavily on expert-determined attribute maps. For example, Cheng et al. [6] proposed a Support Vector Machine-based approach to predict binary attributes and determine the class. This work was then extended using a conditional random field and the 1-nearest-neighbor (1NN) classifier in [5]. In contrast, more recent work by Matsuki et al. [13] and Wu et al. [20] has used word embeddings based on the Word2Vec model [7] to learn the semantic space. However, word embeddings lack the motion-related information necessary for distinguishing IMU features. A recent study by Tong et al. [17] introduced a novel video embedding space, achieving state-of-the-art performance. Current approaches in IMU-based ZS-HAR have primarily focused on processing IMU signals as stationary features. However, processing IMU signals as temporal signals has been shown to improve the accuracy of supervised sensor-based HAR [21,23]. Despite the inherent single-reference-point bias, almost all


Fig. 1. The overview of the training process of the proposed TEZARNet model.

existing studies [13,17,19,20] have employed 1NN-based inference for unseen prediction. Therefore, this work explores temporal modeling for IMU-based ZSL for HAR and uses the KNN algorithm to infer class labels for unseen classes.

3 Proposed TEZARNet

Let’s denote the set of seen classes as Cs and the unseen classes as Cu where Cs ∩ Cu = ∅. Suppose the training dataset, which comprises the samples from seen classes, is denoted by Ds , whereas the test dataset, which consists of samples from unseen classes, is indicated by Du . The IMU encoder is the main component of TEZARNet. The input to the IMU encoder is a sample of IMU data, which is a multivariate time-series x ∈ Rn×d , where n is the sequence length and d is the feature dimension. The IMU data comes from various sensors placed on different human body parts. For example, the PAMAP2 dataset [16], a popular resource in this field, was collected by gathering data from 9 individuals, each of whom wore three IMUs on their wrist, chest, and ankle. These IMUs use multiple sensors to measure and report physical motion and orientation. An IMU includes an accelerometer, gyroscope, magnetometer, and barometer sensors, producing multivariate time series data. The multivariate time series is first divided into a sequential set of windows along the temporal axis before being passed to the IMU encoder. In each window, the mean and standard deviation for each of the d dimensions are calculated. The resulting multivariate series is then fed into the IMU encoder. The IMU encoder consists of a stack of Bi-LSTM layers followed by a dropout layer, a ReLu layer, and a linear layer, as shown in Fig. 1. Each Bi-LSTM layer processes the input signal over the temporal dimension in both forward and backward directions utilizing auto-correlation while maintaining LSTM-gated memory to capture temporal dependencies accurately [14]. The output of the forward LSTM layer of one Bi-LSTM layer is passed to the forward LSTM layer


of the next Bi-LSTM layer, while the output of the backward LSTM layer of one Bi-LSTM layer is passed to the backward LSTM layer of the next Bi-LSTM layer. The final embedding is produced by concatenating the last hidden states from the forward and backward LSTM layers of the last Bi-LSTM layer. This embedding is then passed through a Dropout layer, a ReLU layer, and a fully connected linear block to generate a high-level feature vector z.

We derive a semantic representation for each class c ∈ Cs ∪ Cu using a set of videos of class c collected from online sources such as Pexel and YouTube. It is worth noting that these videos do not align with the IMU data but contain useful information about each activity for learning the mapping between seen and unseen classes. We use the video encoder of the I3D model [3] to derive this semantic representation. The I3D model takes a 4D tensor as input, which captures RGB values at a fixed resolution over a fixed number of time steps. The I3D model processes the input in a spatio-temporal manner, using inflated convolutional layers and a region proposal layer to engineer an optimal representative feature vector for the given video. We feed the set of videos corresponding to activity c to the I3D model and obtain a set of video feature vectors from an internal layer of the I3D model. We extract features from an inner layer rather than the logit layer because they contain more information. The average of these vectors, vc, is the semantic vector of class c.

TEZARNet learns to make predictions during training by mapping IMU samples to their corresponding class semantic vectors. A training sample of TEZARNet consists of an IMU sample and the corresponding activity label, (xs, ys) ∈ Ds where ys ∈ Cs. Given xs, the IMU encoder produces zs. Further, we use m videos per class to derive class semantic vectors using the I3D video encoder. Let the set of class semantic vectors be V = {v1, v2, ..., v|Cs|} and the class semantic vector of ys be vys. TEZARNet learns to map the IMU sample to its corresponding class by minimizing the L2 distance between zs and vys. This objective function is defined as LM:

LM = || zs − vys ||2    (1)
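To make the encoder concrete, the following is a minimal PyTorch sketch of a stacked Bi-LSTM IMU encoder in the spirit described above; the layer count and output dimension are placeholders except where Sect. 4.3 states them, and the exact head layout is our assumption:

```python
import torch
import torch.nn as nn

class IMUEncoder(nn.Module):
    def __init__(self, in_dim, hidden=128, out_dim=400, num_layers=2, dropout=0.1):
        super().__init__()
        self.bilstm = nn.LSTM(in_dim, hidden, num_layers=num_layers,
                              batch_first=True, bidirectional=True, dropout=dropout)
        self.head = nn.Sequential(nn.Dropout(dropout), nn.ReLU(),
                                  nn.Linear(2 * hidden, out_dim))

    def forward(self, x):                       # x: (batch, windows, features)
        _, (h, _) = self.bilstm(x)              # h: (2 * num_layers, batch, hidden)
        z = torch.cat([h[-2], h[-1]], dim=-1)   # last forward + backward hidden states
        return self.head(z)                     # high-level feature vector z
```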

We consider the class whose semantic vector is most similar to zs as the activity class of xs. To calculate the similarity, we first project zs onto the unit vector of each class semantic vector. Let the similarity between zs and vk ∈ V be βk, where k ∈ {1, 2, ..., |Cs|}:

βk = zs · vk / || vk ||2    (2)

Then we apply SoftMax to derive the class with the highest similarity:

p(yk | zs) = exp(βk) / Σ_{k∈|Cs|} exp(βk)    (3)

The classification objective LC is defined using the multi-class cross-entropy loss:

LC = − Σ_{k∈|Cs|} yk log(p(ys = cs | xs))    (4)

Fig. 2. Overview of the unseen class prediction process of the proposed TEZARNet model.
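A hedged PyTorch sketch of how the matching loss of Eq. (1) and the classification loss of Eqs. (2)-(4) can be computed for a training batch is shown below (our own illustration, not the released code):

```python
import torch
import torch.nn.functional as F

def tezarnet_losses(z, V, y):
    """z: (batch, d) IMU embeddings; V: (|C_s|, d) class semantic vectors; y: (batch,) labels."""
    l_m = torch.norm(z - V[y], dim=1).mean()          # Eq. (1): L2 distance to the class vector
    beta = z @ (V / V.norm(dim=1, keepdim=True)).T    # Eq. (2): projection onto unit class vectors
    l_c = F.cross_entropy(beta, y)                    # Eqs. (3)-(4): softmax + cross-entropy
    return l_m, l_c                                   # combined later as L = L_M + lambda * L_C (Eq. 5)
```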

The final objective function of TEZARNet is defined as

L = LM + λ LC    (5)

where λ is a hyper-parameter. We use the KNN algorithm to derive the activity class labels for unseen instances. Given an IMU sample xu ∈ Du and m videos for each class in Cu, TEZARNet produces zu and a set of semantic vectors V̄ = {{v̄_1^1, v̄_2^1, ..., v̄_m^1}, ..., {v̄_1^{|Cu|}, v̄_2^{|Cu|}, ..., v̄_m^{|Cu|}}}. V̄ sets the neighborhoods for each unseen class in Cu against which a new sample is checked for membership in a particular class. We predict the class yu by considering the k nearest neighbors in the fitted neighborhood using the KNN algorithm. The inference phase of TEZARNet is shown in Fig. 2.
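A minimal scikit-learn sketch of this inference step follows; the neighbor count, distance weighting, and cosine metric reflect the best ablation setting reported in Sect. 4.5 but are otherwise our assumptions:

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

def predict_unseen(z_u, video_vecs, video_labels, k=10):
    """Classify an IMU embedding z_u against the unseen classes' per-video I3D vectors.

    video_vecs:   (num_unseen_classes * m, d) semantic vectors, one per video.
    video_labels: unseen-class label of each video vector.
    """
    knn = KNeighborsClassifier(n_neighbors=k, weights="distance", metric="cosine")
    knn.fit(video_vecs, video_labels)
    return knn.predict(np.asarray(z_u).reshape(1, -1))[0]
```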

4 Experimental Study

4.1 Datasets

We conduct extensive experiments evaluating the proposed TEZARNet with four IMU datasets commonly used for benchmarking ZS-HAR, namely PAMAP2 [16], DaLiAc [11], UTD-MHAD [4], and MHEALTH [2]. All these datasets are publicly available. PAMAP2 contains IMU data from 9 subjects performing 18 daily activities, where each subject wears three IMUs on the wrist, chest, and ankle. DaLiAc comprises IMU data collected from 19 subjects performing 13 daily activities, each wearing four IMUs at different locations. UTD-MHAD is a multi-modal dataset that includes RGB videos and depth information, in addition to IMU data from 8 subjects performing 27 classes of actions.


Table 1. IMU dataset characteristics.

Dataset          Activities   Subjects   Samples   Features   Folds
PAMAP2 [16]      18           9          5169      54         5
DaLiAc [11]      13           19         21844     24         4
UTD-MHAD [4]     27           8          861       6          5
MHEALTH [2]      12           10         2774      12         4

MHEALTH contains IMU and ECG data from 10 subjects performing 12 activities. A summary of the datasets is shown in Table 1. TEZARNet uses video data to learn the relationship between seen and unseen classes. However, only the UTD-MHAD dataset contains a corresponding video dataset. Hence, for the other datasets, we collected videos from online sources such as Pexel and YouTube, and from the HMDB51 [10] dataset. We manually selected high-quality videos with one human performing the activity of interest in the foreground and from various angles. We also ensured that videos of the same class had similar primary movements. To ensure consistency, we fixed the video duration at 3 s, the frame rate at 30 fps, and the resolution at 1280 × 1024 HD. The videos used for the TEZARNet experiments are available at https://bit.ly/3pvnvXy.

4.2 Evaluation Metric

Following the current work in ZS-HAR, we use the average accuracy per class as the evaluation metric in our experiments. The average accuracy per class compensates for the class imbalance problem in most of the standard IMU-based HAR datasets. Suppose the number of correct predictions for an unseen class cu is N_cu^correct and the number of total instances for cu is N_cu^total. Then the average accuracy per class is defined as

Average Accuracy per Class (AAC) = (1 / |Cu|) Σ_{cu ∈ Cu} N_cu^correct / N_cu^total    (6)
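Equivalently, Eq. (6) is the mean per-class accuracy over the unseen classes; a small self-contained sketch with hypothetical labels:

```python
from collections import defaultdict

def average_accuracy_per_class(y_true, y_pred):
    """Average Accuracy per Class (Eq. 6): mean of per-class accuracy over unseen classes."""
    correct, total = defaultdict(int), defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += int(t == p)
    return sum(correct[c] / total[c] for c in total) / len(total)

print(average_accuracy_per_class(["walk", "walk", "run", "run"],
                                 ["walk", "run", "run", "run"]))  # (0.5 + 1.0) / 2 = 0.75
```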

4.3 Implementation

In this study, we adopt a k-fold evaluation approach [14] to split the activity classes into seen and unseen sets over k disjoint folds. For the DaLiAc and UTD-MHAD datasets, the fold separation strategy introduced in [17] is followed. For the MHEALTH dataset, three unseen classes are randomly selected in each fold. The activity classes are divided according to their activity superclass, e.g., static, dynamic, and sports. Activities are randomly selected from each superclass to create the set of unseen classes so as to ensure activity-type balance in the seen-unseen splits. The k-fold class separation ensures that each fold's seen and unseen class


sets include at least one sample from each activity superclass. The seen dataset within each fold is split into 90% for training and 10% for validation. We implement TEZARNet using the PyTorch framework [15], and it is trained on an NVIDIA Tesla T4 GPU or an NVIDIA GeForce RTX 2040 GPU. The ADAM optimizer [9] with a learning rate of 10−3 is used in training. The training is carried out for 50 epochs with a batch size of 64. We use ten videos for each class, and λ in Eq. 5 is set to 0.001 based on hyper-parameter tuning (see Table 2). The hidden size of an LSTM unit is set to 128, and each LSTM layer in the Bi-LSTM outputs an embedding of size 400. The dropout rate of the stacked Bi-LSTM layers is set to 0.1. During KNN inference, five neighbors are employed, and distance metrics are utilized to assign weights to the neighbors' contributions.

Table 2. Comparison of the λ value in the loss function on the PAMAP2 dataset.

λ-value   1-fold   2-fold   3-fold   4-fold   5-fold   Average
0.0001    77.6     73.8     68.5     21.3     25.1     55.6
0.001     82.1     77.8     67.3     21.3     25.2     58.2
0.01      76.2     66.6     65.8     21.3     25.2     54.1
0.1       55.1     69.8     60.1     21.4     25.1     48.8
1         56.0     76.4     60.9     21.2     25.1     50.7

4.4 Comparative Study

We compare TEZARNet with the following state-of-the-art IMU-based ZS-HAR models.
– MLCLM [13]: a Multi-Layer Perceptron (MLP) feature extractor on stationary IMU data for projection-based ZSL in a word embedding semantic space.
– VbZSL [17]: an MLP feature extractor on stationary IMU data for projection-based ZSL in a video embedding semantic space.
We use the accuracy values reported in [13] for the MLCLM model. In contrast, we implement the VbZSL model ourselves, as its code and the video dataset used in its experiments are not publicly available. For a fair comparison, for the PAMAP2, UTD-MHAD, and MHEALTH datasets, we train VbZSL with the video datasets used in TEZARNet training. For the DaLiAc dataset, we use the video data that comes with it for the training of both the VbZSL and TEZARNet models. The results are shown in Table 3. The table presents the AAC over k folds for the different datasets and the percentage improvement (% Improvement) compared to existing methods. We calculate the % Improvement as follows:

% Improvement = (TEZARNet AAC − Current Best AAC) / Current Best AAC    (7)

TEZARNet consistently achieves the highest AAC for all four datasets. In contrast, VbZSL falls short as it doesn't utilize the temporal information in


Table 3. Comparison of percentage Average Accuracy per Class over k folds for the different datasets.

Model           Semantic Space   PAMAP2   DaLiAc   UTD-MHAD   MHEALTH
MLCLM           Word Emb.        54.93    –        –          –
VbZSL           Video Emb.       42.20    70.60    24.84      38.80
TEZARNet        Word Emb.        47.70    60       32.40      –
TEZARNet        Video Emb.       58.27    76.10    32.60      40.40
% Improvement                    +4.86    +7.79    +3.70      +9.27

Table 4. Comparison of percentage Average Accuracy per Class when using different IMU encoders for TEZARNet.

IMU Encoder                          1-fold   2-fold   3-fold   4-fold   5-fold   Average
TEZARNet with LSTM                   68.22    74.5     53.06    30.39    25.57    52.84
TEZARNet with Transformer Encoder    70.23    80.75    45.48    10.41    25.10    49.58
TEZARNet with Attention LSTM         69.74    64.95    49.10    30.00    25.53    50.08
TEZARNet                             78.23    66.82    56.34    29.3     25.32    58.27

IMU data. Further, TEZARNet has outperformed MLCLM and VbZSL with word embeddings, demonstrating that using video data as the auxiliary data in IMU-based ZS-HAR leads to better performance. In addition, we conduct a paired t-test on the average α values achieved by TEZARNet and VbZSL for the PAMAP2 and DaLiAc datasets. In this experiment, we run each model 10 times for each dataset. The null hypothesis is that TEZARNet and VbZSL have identical average α values for the given dataset. The p-value for the PAMAP2 dataset is 1.82e−13, and for the DaLiAc dataset the p-value is 7.602e−9. Since the p-value is below 0.05 for both datasets, these results indicate that the improvement achieved by TEZARNet is statistically significant.

4.5 Ablation Study

Next, we conduct experiments to demonstrate the effectiveness of our design choices. In particular, in the following experiments, we show that employing Bi-LSTM and using KNN-based inference leads to better unseen accuracy. We use the PAMAP2 dataset for these experiments. In the first experiment, we use four widely used temporal models to extract features from IMU data, namely LSTM, Bi-LSTM, Transformer [18], and the Multi-Headed Attention LSTM Encoder [22]. We replace the IMU encoder of TEZARNet with each of these models and build a variation of TEZARNet, keeping all other configurations fixed. The results are shown in Table 4. They indicate that Bi-LSTM achieves the best overall unseen performance. On the other hand, some temporal models (e.g., the Transformer) win on some sets of unseen classes but fail badly on others.


Fig. 3. Average per Class Accuracy of different IMU encoders for sample unseen classes

Fig. 4. Performance variation of KNN inference

Figure 3 shows the Average per Class Accuracy for a set of sample unseen classes achieved by TEZARNet when different IMU encoders are used. The graph indicates that Bi-LSTM consistently achieves the best accuracy. In the next experiment, we evaluate the impact of different KNN parameters on the unseen accuracy, namely the number of neighbors, the neighbor weighting method, and the distance metric. Using a trained TEZARNet model, we infer unseen accuracy by changing one parameter at a time. Figure 4a shows the average accuracy per class of TEZARNet on the PAMAP2 dataset for different numbers of nearest neighbors. The results indicate that the accuracy decreases when the number of nearest neighbors is 3 or 5 for both uniform and inverse distances. However, the accuracy keeps increasing when the number of neighbors is assigned a value higher than 5. In our experiment, the best accuracy is achieved when the number of neighbors is 10. Additionally, using inverse distance-based


Table 5. Performance reproducibility analysis.

Model                             Mean    Standard Error   P-Value
VbZSL                             39.44   0.51             0.51
TEZARNet with 1-NN Inference      52.78   1.20             0.31
TEZARNet with 10-NN Inference     55.03   0.61             0.04

neighbor weights leads to higher confidence, and a higher neighbor count leads to lower performance variation, as demonstrated by the error bars. Figure 4b shows the effect of different distance metrics on the unseen accuracy. We observe that using cosine distance we can achieve a superior accuracy of 57.11% compared to the L1 and L2 distances. In this experiment, inverse distance weighting again provides better performance. In conclusion, we get the best performance when the number of neighbors is ten and inverse cosine distance is used.

4.6 Reproducibility of TEZARNet

Demonstrating the reproducibility of the experimental results is imperative to ensure the reliability and transparency of the experiment. To show the reproducibility of our results, we conduct a paired t-test. We ran TEZARNet ten times and obtained the results, which are randomly separated into two groups to perform the paired t-test. All model runs are conducted on identical hardware and software setups without explicitly setting the seed, in order to analyze the effect of random states. The null hypothesis is that the two sets of data come from the same distribution. We conduct this experiment with TEZARNet with 1-NN inference, TEZARNet with 10-NN inference, and the VbZSL model. Table 5 shows the results of the paired t-test along with the mean and standard error of the average per-class performance. TEZARNet with 10-NN has achieved the best mean performance and a p-value less than 0.05, indicating statistically significant reproducibility. In contrast, VbZSL has a p-value of 0.51, indicating that VbZSL may not always display the same performance. Further, TEZARNet with 1-NN has a higher p-value, corroborating our KNN-based inference approach.

5 Conclusions and Future Work

This paper presents a new zero-shot model, called TEZARNet, for IMU-databased human activity recognition using video data to learn the relationship between seen and unseen classes. In contrast to the current work, we employ an IMU encoder consisting of a stack of Bi-LSTM layers to extract the features containing temporal information from IMU data. The assumption here is that online video repositories have videos clearly representing each activity in the dataset. In the absence of such videos, the performance of TEZARNet may be affected. Nevertheless, experiments on four benchmark datasets, namely


PAMAP2, DaLiAc, UTD-MHAD, and MHEALTH, demonstrate that our proposed model outperforms the state-of-the-art IMU-based zero-shot human activity recognition models that use video data or word embeddings to build the semantic space. In future work, we can explore alternatives to video data that contain temporal information, such as skeleton movements, to build the semantic space. Further, the TEZARNet architecture can be extended to incorporate explainability into the model so that decisions made by TEZARNet are explained to the user, enhancing user trust in the model.

References 1. Ahmed, N., Rafiq, J.I., Islam, M.R.: Enhanced human activity recognition based on smartphone sensor data using hybrid feature selection model. Sensors 20(1), 317 (2020) 2. Banos, O., et al.: mhealthdroid: a novel framework for agile development of mobile health applications. In: Proceedings of the Ambient Assisted Living and Daily Activities: 6th International Work-Conference, pp. 91–98 (2014) 3. Carreira, J., Zisserman, A.: Quo vadis, action recognition? a new model and the kinetics dataset. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6299–6308 (2017) 4. Chen, C., Jafari, R., Kehtarnavaz, N.: Utd-mhad: a multimodal dataset for human action recognition utilizing a depth camera and a wearable inertial sensor. In: Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 168–172 (2015) 5. Cheng, H.T., Griss, M., Davis, P., Li, J., You, D.: Towards zero-shot learning for human activity recognition using semantic attribute sequence model. In: Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, pp. 355–358 (2013) 6. Cheng, H.T., Sun, F.T., Griss, M., Davis, P., Li, J., You, D.: Nuactiv: recognizing unseen new activities using semantic attribute-based learning. In: Proceeding of the 11th annual international conference on Mobile Systems, Applications, and Services, pp. 361–374 (2013) 7. Church, K.W.: Word2vec. Nat. Lang. Eng. 23(1), 155–162 (2017) 8. Khojasteh, S.B., Villar, J.R., Chira, C., González, V.M., De la Cal, E.: Improving fall detection using an on-wrist wearable accelerometer. Sensors 18(5), 1350 (2018) 9. Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization. In: Yann LeCun, Y.B. (ed.) Proceedings of the 3rd International Conference on Learning Representations, ICLR (2015) 10. Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: HMDB: a large video database for human motion recognition. In: Proceedings of the International Conference on Computer Vision (ICCV) (2011) 11. Leutheuser, H., Schuldhaus, D., Eskofier, B.M.: Hierarchical, multi-sensor based classification of daily life activities: comparison with state-of-the-art algorithms using a benchmark dataset. PLoS ONE 8(10), e75196 (2013) 12. Li, Y., Wang, L.: Human activity recognition based on residual network and bilstm. Sensors 22(2), 635 (2022) 13. Matsuki, M., Lago, P., Inoue, S.: Characterizing word embeddings for zero-shot sensor-based human activity recognition. Sensors 19(22), 5043 (2019)


14. Meedeniya, D.: Deep Learning: A Beginners’ Guide. CRC Press LLC (2023). https://www.routledge.com/9781032473246 15. Paszke, A., et al.: Pytorch: an imperative style, high-performance deep learning library. In: Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché-Buc, F., Fox, E., Garnett, R. (eds.) Proceedings of the Advances in Neural Information Processing Systems, pp. 8024–8035 (2019) 16. Reiss, A., Stricker, D.: Introducing a new benchmarked dataset for activity monitoring. In: Proceedings of the 16th International Symposium on Wearable Computers, pp. 108–109 (2012) 17. Tong, C., Ge, J., Lane, N.D.: Zero-shot learning for imu-based activity recognition using video embeddings. In: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, vol. 5(4), pp. 1–23 (2021) 18. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems 30 (2017) 19. Wang, Q., Chen, K.: Alternative semantic representations for zero-shot human action recognition. In: Proceedings of the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 87–102 (2017) 20. Wu, T., Chen, Y., Gu, Y., Wang, J., Zhang, S., Zhechen, Z.: Multi-layer cross loss model for zero-shot human activity recognition. In: Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 210–221 (2020) 21. Zebin, T., Sperrin, M., Peek, N., Casson, A.J.: Human activity recognition from inertial sensor time-series using batch normalized deep lstm recurrent networks. In: Proceedings of the 40th annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 1–4 (2018) 22. Zerveas, G., Jayaraman, S., Patel, D., Bhamidipaty, A., Eickhoff, C.: A transformer-based framework for multivariate time series representation learning. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 2114–2124 (2021) 23. Zhang, H., Xiao, Z., Wang, J., Li, F., Szczerbicki, E.: A novel iot-perceptive human activity recognition (har) approach using multihead convolutional attention. IEEE Internet Things J. 7(2), 1072–1080 (2019)

Tigrinya OCR: Applying CRNN for Text Recognition Aaron Afewerki Hailu(B) , Abiel Tesfamichael Hayleslassie, Danait Weldu Gebresilasie, Robel Estifanos Haile, Tesfana Tekeste Ghebremedhin, and Yemane Keleta Tedla Mainefhi College of Engineering and Technology, Mainefhi, Eritrea [email protected], [email protected]

Abstract. Tigrinya is a language predominantly spoken in Eritrea and the Tigray region of Ethiopia. It is classified as a low-resourced language when it comes to Natural Language Processing (NLP), and its documents are not widely accessible due to the lack of printed material. Although the language has a rich cultural heritage, its literature has not been exposed to large-scale automated digitization when compared to other widely spoken languages. In this paper, we design an end-to-end CRNN (Convolutional Recurrent Neural Network) to recognize machine-printed Tigrinya text from document images. This will help make Tigrinya documents more accessible and also bridge the gap with languages that are rich in NLP resources. We have included all 304 characters in Tigrinya, and the network is trained on a total of over a million text-line images constructed from different domains. The majority of the data was synthesized to augment the limited real data and help the model generalize better. We employed two external datasets (ADOCR and GLOCR) in addition to ours to train the network. Furthermore, to improve the performance of the model, extensive parameter tuning was conducted. Without the use of post-processing techniques, the model has achieved a 2.32% Character Error Rate (CER). The learning curve shows that, given more data, the model can further improve the CER. We finally obtained a lightweight model that achieves results comparable to the state of the art. This implies that augmenting low-resource data with synthetic data can significantly reduce the error rate in text recognition, and that proper hyperparameter tuning can yield lightweight models without compromising much accuracy.

Keywords: Tigrinya · OCR · Text Recognition · CRNN · Pattern Recognition

1 Introduction

1.1 OCR

Optical Character Recognition (OCR) is the electronic identification and digital encoding of characters by means of optical scanners and specialized software [1]. The aim of this technology is to digitize vast amounts of paper data records and then store them in an editable, searchable and compact format.


OCR systems provide a tremendous opportunity to handle repetitive, boring, labor-intensive and time-consuming processes for human beings. At present, the recognition of Latin-based characters from well-conditioned documents can be considered a relatively feasible technology, owing to the large amount of ongoing research and available software for those scripts. On the other hand, languages such as Tigrinya have had little to no exposure to OCR. Hence, the bulk of Tigrinya literature is not electronically available, and where it is, it is limited to scanned pictures of text. In this study, we have developed a novel system for reading Tigrinya text from scanned or photographed documents. A short introduction to the Tigrinya language follows in Sect. 1.2, and previous works in this research space are described in Sect. 2. Section 3 deals with the architecture of the proposed system and the data used to train it, while Sect. 4 explains the experimentation phase. The results of the experiments are discussed in Sect. 5, and the final conclusions of our work are presented in Sect. 6.

1.2 The Tigrinya Language

The Tigrinya language, often written as Tigrigna, is a member of the Semitic branch of the Afro-Asiatic languages that includes Arabic, Amharic, Ge'ez, Hebrew, Aramaic and Maltese [2]. Tigrinya is spoken by over seven million people. It is a widely spoken language in Eritrea and in the northern part of Ethiopia. In Eritrea, Tigrinya serves as a working language in offices along with Arabic [3]. Tigrinya originates from the Ge'ez language. Although Tigrinya differs markedly from Ge'ez, there is a strong influence of Ge'ez on Tigrinya literature, especially with terms that relate to Christian life, Biblical names, and so on [4]. The Tigrinya writing system is a slight variant of the writing system used for Ge'ez. The Ge'ez script is an abugida written left-to-right, and its symbols are organized in groups of similar symbols on the basis of both the consonant and the vowel [5]. As a result, the writing system is usually displayed as a two-dimensional matrix in which the rows contain units beginning with the same consonant and the columns contain units ending in the same vowel. The columns are traditionally known as "orders" [6]. Some of the alphabets used in Ge'ez have been lost in Eritrean Tigrinya [7]. However, these less-used series have been included in our system for the sake of comprehensive coverage of both archaic and modern Tigrinya. With respect to numbers, the general practice in Eritrea is to use Arabic numerals to express numerical values [8].

2 Related Works

Optical Character Recognition has been attracting a lot of interest from researchers lately. Geez scripts have mostly been studied from the perspective of the Amharic and Geez languages. To the best of our knowledge, there has been only one published study on Geez-script OCR from the Tigrinya perspective [9]. Moreover, it was only very recently that a synthetically generated dataset was made publicly available [10]. According to [11], standard end-to-end text recognition systems consist of the following three steps:
• Manual labelling of data to create ground truth


• Specific preprocessing operations
• Training of a recognizer using text images and their labels.
Turning to Geez-based scripts, Amharic was the first to be exposed to OCR technology, in 1997 [12]. Worku [12] implemented stage-by-stage segmentation algorithms, character thinning, and binary tree-structured classification to recognize characters. While this was the first research in Amharic OCR, it had a limited number of handcrafted features and used a single font type and size. Soon afterwards, Ermias [13] improved Worku's work by adding underline detection and removal. Another extension of Worku's work was found in Teferi [14], where he employed noise detection and removal using contour analysis. These studies all suffered from relying on a limited number of handcrafted features. Berhanu [15] also experimented with neural networks to classify Amharic characters based on Worku's stage-by-stage segmentation. This was the first use of neural networks in Amharic OCR, and it only experimented with a very small number of characters. Following this attempt, Meshesha [16], Nigussie [17], Mesay [18], Wondwossen [19] and Assabie [20] based their works on Worku's segmentation. These efforts all suffered from the limitation of handcrafted features. Worku and Fuchs [21] again came up with a new approach that used Hidden Markov Random Fields to develop an Amharic bank check recognition system. Although this was the first attempt which did not build on [12], it was not widely applicable owing to the wide variety of Amharic checks in use. Cowell and Hussain [22] tried using templates to recognize characters; however, the recognition was highly dependent on the quality of the image. Meshesha and Jawahar [23] employed PCA (Principal Component Analysis) followed by LDA (Linear Discriminant Analysis) for feature extraction and used a DAG (Directed Acyclic Graph) classifier with SVM (Support Vector Machines). Assabie and Bigun used Directional Field Tensors for feature extraction and a special tree along with a knowledge base to recognize Ethiopic characters [24]. Assabie and Bigun further improved this work by feeding the features to an ANN classifier [25] and using template matching for confusing characters [26]. Moreover, these two researchers applied this architecture to Ethiopic handwriting recognition [27]. Assabie and Bigun [28] then experimented with Hidden Markov Models to recognize Amharic handwriting. This research was the first Geez-based OCR that tried to recognize words instead of characters. Other notable contributions utilized ANN [29] and SVM [30–32]. Similarly, Fitsum [9] analyzed Histogram of Oriented Gradients (HOG) features and fed them to an SVM classifier. This study by Fitsum is the only one which specifically studied Tigrinya characters. There was no mention of the dataset used in the study, but an accuracy of 98.72% was reported. Furthermore, Fitsum only dealt with the recognition of isolated Tigrinya characters, which are not commonly found in real-world scenarios. In comparison, our approach has been to take text-lines instead of isolated characters so that the model would be suitable for real applications. All of the previous studies employed classical and statistical machine learning techniques, namely SVMs and shallow networks. It was not until Siranesh's work [33] that deep learning was adopted in Geez-based recognition systems. Siranesh trained a deep neural network to predict handwritten Geez characters, but did not apply regularization.
Belay [34] and Kesito [35] worked with a CNN (Convolutional Neural Network)


to recognize Amharic and Amharic-Latin characters, respectively. Meanwhile, Addis [36] used LSTM (Long Short-Term Memory) networks to recognize Ethiopic (Tigrinya, Amharic and Geez) text-lines. Those three researchers generated synthetic datasets on top of real-life data for training and research purposes. Belay achieved an accuracy of 92.71% and Addis a Character Error Rate (CER) of 2.12%. Belay later improved this work by using a Factored CNN [37] to decrease the number of classes from 231 to 40; this approach hence achieved a better accuracy of 94.97%. Although these deep learning approaches have increased the recognition accuracy for Geez-based scripts, they required segmentation down to the character level to work in practice. As the segmentation process is error-prone, most researchers resort to text-line or word recognition, which is much easier to implement. Therefore, with the exception of Addis's [36] and Assabie's [28] works, most of these approaches are not readily applicable in practice. In [38], Belay created the first publicly available Amharic dataset and also obtained an average CER of 4.23% using LSTMs [39] and Connectionist Temporal Classification (CTC) [40] for aligning characters with the ground truth and calculating the loss. Belay further improved the CER to 2.46% in [41] by adding CNN layers before the BLSTM (Bidirectional LSTM) and CTC architecture. Our work is the first of its kind to use neural networks for the recognition of the low-resourced Tigrinya language. We chose a CRNN architecture for our model because of the higher recognition rate obtained when using CNNs and LSTMs together. Our model utilizes a backbone similar to Belay's work [41], but we used automatic parameter tuning to keep the number of parameters in this work to a minimum. This will ease the adoption of this work on devices where resources are limited. Furthermore, we plan to make our newly constructed dataset publicly available to support future improvement of Tigrinya text recognition.

3 Methodology

3.1 Data

The unavailability of large, standard, free and publicly available datasets has hindered text recognition research for Geez-based scripts. Belay et al. prepared the first publicly available Amharic OCR dataset (known as ADOCR) in 2019 [42]. We used this dataset because Tigrinya and Amharic share a common writing system with some minor differences. ADOCR is organized from two types of documents: the first contains 40,929 printed text-lines, while the second contains 296,403 synthetic text-lines, for a total of 337,332 text-line images. We further excluded the text-lines which contained characters not included in our character set. GLOCR, a purely Tigrinya dataset by Gaim, was recently made publicly available [10]. This dataset was also employed in our research, as the use of additional data is of immense importance in enhancing the performance of OCR systems. Even though the dataset is synthetic, it is based purely on Tigrinya language text, it covers multiple fonts, and it is comprised entirely of Tigrinya characters. We used the "News text-lines", "Biblical text-lines" and "Top-150K" parts of this dataset. One limitation of this dataset is that it lacks the representation of real-world text images (which can be noisy, blurry, distorted, etc.), even after multiple stages of pre-processing. Hence, we decided to use


this dataset to augment Belay's data. Gaim uses 326 characters in his dataset, hence we had to exclude the characters that were not in our character set (such as Geez numerals) and resize everything to suit our requirement of 32 × 128 images. After we collected those two datasets, we ran a frequency analysis of the characters in them to determine whether the characters were well distributed. The results were not satisfactory, so we had to generate our own dataset to address the underrepresentation of certain characters (Fig. 1).

Fig. 1. Sample images from GLOCR (left) and ADOCR (right) datasets
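The paper does not show how this frequency analysis was carried out; purely as an illustration, a check of the kind described above could be implemented as follows. The directory layout, file format and coverage threshold here are hypothetical.

```python
from collections import Counter
from pathlib import Path

def char_frequencies(label_files, charset):
    """Count how often each character of the target charset appears
    in a collection of ground-truth text-line label files."""
    counts = Counter({c: 0 for c in charset})
    for path in label_files:
        text = Path(path).read_text(encoding="utf-8")
        counts.update(ch for ch in text if ch in charset)
    return counts

# Hypothetical usage: flag characters that fall below a chosen coverage threshold.
# labels = list(Path("adocr_labels").glob("*.txt")) + list(Path("glocr_labels").glob("*.txt"))
# freqs = char_frequencies(labels, tigrinya_charset)
# underrepresented = [c for c, n in freqs.items() if n < 100]
```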

Our Dataset. As noted by [38], it is a time-consuming task to manually annotate images of printed text. Hence, we generated synthetic Tigrinya text-lines to complement the aforementioned two datasets. We had to include some characters which were underrepresented in those datasets so that our model would work well on all of our 304 characters. We chose trdg1 and OCRopus [43] for generating the synthetic Tigrinya dataset. These synthesizers need a TrueType font (TTF) file and a UTF-8-encoded string to generate images. At the direction of the printing presses we consulted, we chose the five most commonly used Tigrinya fonts (namely, Geez Able, Geez Mahtem Unicode, Geez Wookianos, Geez Zemen and Nyala) and obtained their font files. With regard to the text fed to the synthesizer, we obtained a publicly available Tigrinya wordlist2 and filtered the words which included the underrepresented characters. We then created randomly concatenated 3-word strings from our filtered wordlist. We generated 406,414 training images and 27,679 test images in this manner. Both TRDG and OCRopus allowed us to add ample noise (Gaussian noise, transformations, skewing, jitter, blurring, degradation, etc.). The output of those synthesizers did not meet our image-width requirement of 128 pixels, so we had to pad the images with empty white space if they were narrower, or resize them if they were wider than 128 pixels. We manually discarded the images that were horizontally distorted past the point of recognition. The details of our dataset are provided in Table 1.

1 E. Belval's TextRecognitionDataGenerator, available at GitHub.
2 Compiled by Dr. Biniam Ghebremichael, available at www.cs.ru.nl/biniam/geez/crawl.php
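As a minimal sketch of the width normalization and 3-word string construction described above (assuming PIL images and an in-memory word list; the exact resampling and noise settings used by the authors are not specified):

```python
import random
from PIL import Image

TARGET_H, TARGET_W = 32, 128

def normalize_width(img: Image.Image) -> Image.Image:
    """Scale a text-line image to height 32, then pad with white space to
    width 128, or squeeze it down if it is wider than 128 pixels."""
    w, h = img.size
    new_w = max(1, round(w * TARGET_H / h))
    img = img.convert("L").resize((new_w, TARGET_H))
    if new_w >= TARGET_W:
        return img.resize((TARGET_W, TARGET_H))
    canvas = Image.new("L", (TARGET_W, TARGET_H), color=255)  # white background
    canvas.paste(img, (0, 0))
    return canvas

def random_three_word_string(filtered_wordlist):
    """Concatenate three randomly chosen words that contain rare characters."""
    return " ".join(random.sample(filtered_wordlist, 3))
```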


Table 1. Composition of our dataset

Font type           | Samples | Test-samples | Train-samples
Geez Able           | 74,290  | 5,705        | 79,995
Geez Mahtem Unicode | 73,909  | 6,091        | 80,000
Geez Wookianos      | 75,011  | 4,989        | 80,000
Geez Zemen          | 74,273  | 5,277        | 80,000
Nyala               | 80,802  | 5,617        | 86,419

3.2 Proposed Architecture

Recurrent Neural Networks (RNN) have been found to be most suited to sequence recognition problems [44], while Convolutional Neural Networks (CNN) perform best in pattern/image recognition problems [45–48]. As OCR is an image-based sequence recognition task, it would be better to combine CNN, RNN and CTC to create a CRNN [41, 49]. The proposed architecture uses the CNN for automatic feature extraction, the BLSTM for sequence learning and CTC for transcribing. Those three modules are built one after the other and are trained end-to-end with CTC loss. This network is inspired by [41], which served as the baseline for this study. The proposed architecture is shown in Fig. 3 (Fig. 2).

Fig. 2. Sample images generated using OCRopus (left) and TRDG (right)

CNN. We went with the baseline feature extraction architecture and further experimented on it by optimizing the hyperparameters and training with a larger dataset; the final hyperparameters were chosen after the tuning described in Sect. 4.2. Our network, after hyperparameter tuning, is described below. The CNN consists of seven convolutional layers with 3 × 3 kernels each, except for the one at the end of the network, which has a 2 × 2 kernel. There is also a max-pooling layer after each of the first, second, fourth and sixth convolutional layers. We use pool sizes of 2 × 2 with 2 × 2 strides in the first two pooling layers, and pool sizes of 2 × 1 with the default stride in the other two. We also apply batch normalization (Batch Norm) [50]. 'SeLU' (Scaled Exponential Linear Unit) [51] activation is used after each of the convolutional layers, and 'same' padding is used across all the convolutional layers except the last one. The number of feature maps starts at 16 and is gradually increased to 64.

BLSTM. The four-dimensional output of the CNN is then squeezed to conform to the three-dimensional input expected by the BLSTM. This network consists of two bidirectional LSTM layers with 128 units each. We varied the number of BLSTM layers, increased the number of hidden units and tried different dropout values, but the best results were obtained with the baseline BLSTM architecture.

CTC. CTC is an objective function that adopts dynamic programming algorithms to directly learn the alignment between the input and output sequences [40]. Hence, CTC allows us to forget about the alignment of each character to its location in the input image. “The CTC algorithm takes as an input probability distributions of all the characters which are outputted from SoftMax activation layer at the end of the RNN sequence modeling


unit and finds the maximum probable label sequence for a given input word image” [52]. The output of the SoftMax layer is then decoded by removing repeated characters and the CTC blank character. We have used the best-path decoding method, as proposed in [40], to find the most probable character at every step. The output of the BLSTM layers is fed into a fully connected layer with SoftMax activation over 307 nodes (corresponding to each character, a space character, an unknown character and the CTC blank symbol). The network predicts a character for each of the 32 time-steps.
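The paper does not include source code, so the following PyTorch sketch is only an approximation of the CRNN just described (seven convolutions with the pooling pattern above, two BLSTM layers of 128 units, and a 307-way output trained with CTC). The intermediate filter counts, the padding of the last convolution and the resulting number of time-steps are illustrative assumptions and will not reproduce the authors' tuned 823,603-parameter model exactly.

```python
import torch
import torch.nn as nn

def conv_block(c_in, c_out, pool=None):
    """3x3 conv -> batch norm -> SeLU, optionally followed by max-pooling."""
    layers = [nn.Conv2d(c_in, c_out, kernel_size=3, padding=1),
              nn.BatchNorm2d(c_out),
              nn.SELU()]
    if pool is not None:
        layers.append(nn.MaxPool2d(kernel_size=pool, stride=pool))
    return layers

class CRNN(nn.Module):
    """Approximate CRNN: CNN feature extractor -> 2x BLSTM(128) -> 307-way output."""
    def __init__(self, num_classes=307):
        super().__init__()
        self.cnn = nn.Sequential(
            *conv_block(1, 16, pool=(2, 2)),   # 32x128 -> 16x64
            *conv_block(16, 32, pool=(2, 2)),  # -> 8x32
            *conv_block(32, 32),
            *conv_block(32, 48, pool=(2, 1)),  # halve height only -> 4x32
            *conv_block(48, 48),
            *conv_block(48, 64, pool=(2, 1)),  # -> 2x32
            nn.Conv2d(64, 64, kernel_size=2),  # 2x2 kernel, no padding -> 1x31 (T ~= 31 here)
            nn.BatchNorm2d(64),
            nn.SELU(),
        )
        self.rnn = nn.LSTM(input_size=64, hidden_size=128, num_layers=2,
                           bidirectional=True, batch_first=True)
        self.fc = nn.Linear(2 * 128, num_classes)

    def forward(self, x):                     # x: (B, 1, 32, 128)
        f = self.cnn(x)                       # (B, 64, 1, T)
        f = f.squeeze(2).permute(0, 2, 1)     # (B, T, 64)
        out, _ = self.rnn(f)                  # (B, T, 256)
        return self.fc(out).log_softmax(-1)   # per-step log-probabilities for CTC

def greedy_decode(log_probs, blank=0):
    """Best-path decoding: argmax at each time-step, collapse repeats, drop blanks."""
    seqs = []
    for path in log_probs.argmax(-1):         # path: (T,)
        prev, out = blank, []
        for idx in path.tolist():
            if idx != blank and idx != prev:
                out.append(idx)
            prev = idx
        seqs.append(out)
    return seqs

# Training step sketch (blank index 0 is an assumption):
# logits = model(images).permute(1, 0, 2)               # (T, B, C) as nn.CTCLoss expects
# loss = nn.CTCLoss(blank=0)(logits, targets, input_lengths, target_lengths)
```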

4 Experiments

During the training phase, we fed the neural network text-line images of Tigrinya text (32 × 128 pixels) along with their annotations, and received the output as a string of at most 32 characters. Additionally, we experimented with various hyperparameter configurations in order to determine the most suitable ones for Tigrinya text recognition (hyperparameter tuning is described in more detail in Sect. 4.2). All experiments were done on Google Colab with a T4 GPU and 12.7 GB of RAM.

Fig. 3. Overall architecture of CRNN network

In all the experiments, the entire network is trained using the Adam optimizer [53]. The model was set to train for 50 epochs, with an early-stopping rule that halts training if the validation loss does not improve for 5 epochs. Under this rule, the model trained for 18 epochs with a batch size of 256 before it was early-stopped. Moreover, in order to decrease our memory footprint during training, we extracted the images from ADOCR and then loaded only a single batch into memory at a time instead of the entire training data.

4.1 Evaluation Metrics

We used the Character Error Rate (CER) metric to evaluate how the model performs on the test data. The CER is based on the minimal number of Levenshtein edit operations between


the predicted text and the ground truth transcription of the word image [52]. In addition, we use the number of parameters of a model as a measure of its size.

4.2 Hyperparameter Tuning

We performed extensive hyperparameter tuning to find the hyperparameters that provide the highest performance in our end-to-end network, using the Weights & Biases API to manage the search. The parameters we tuned include the learning rate, the optimizer (Adam, NAdam, SGD), the activation function (ReLU, ELU, SeLU), the number of units in the convolutional layers (16 up to 2048), the dropout in the BLSTM (0.2–0.5) and the batch size (32–2048). To find the optimal hyperparameters, we first performed a random search over this very large search space. Then, based on the results, we narrowed the search space and performed Bayesian optimization over it (Fig. 4).
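The text above states only that the search ran through the Weights & Biases API, first as random search and then as Bayesian optimization over a narrowed space. A hedged sketch of how such a sweep could be declared is given below; the project name, metric name, the train() entry point and the value grids (which merely mirror the ranges quoted above) are assumptions, not the authors' actual configuration.

```python
import wandb

# Illustrative search space; the authors' exact grid is not given in the paper.
sweep_config = {
    "method": "random",            # switched to "bayes" for the narrowed second stage
    "metric": {"name": "val_loss", "goal": "minimize"},
    "parameters": {
        "learning_rate": {"values": [1e-3, 5e-4, 1e-4, 5e-5]},
        "optimizer":     {"values": ["adam", "nadam", "sgd"]},
        "activation":    {"values": ["relu", "elu", "selu"]},
        "conv_units":    {"values": [16, 32, 64, 128, 256, 512, 1024, 2048]},
        "blstm_dropout": {"min": 0.2, "max": 0.5},   # uniform over the quoted range
        "batch_size":    {"values": [32, 64, 128, 256, 512, 1024, 2048]},
    },
}

# sweep_id = wandb.sweep(sweep_config, project="tigrinya-crnn")  # hypothetical project name
# wandb.agent(sweep_id, function=train, count=50)                # train() builds and fits one model
```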

Fig. 4. Loss (vertical axis) vs epochs (horizontal axis) graph. Training and validation losses are indicated by orange and blue lines respectively

5 Results

Although our model took more epochs to train than the baseline model, it has learned to predict over two distinct datasets. Overall, the model tends to recognize synthetic text better than printed text. This may be due to the presence of a larger number of synthetic images in the training dataset.

5.1 Comparison of Error Rate

We have tested our model on GLOCR and our own dataset, and the results are shown in Tables 2 and 3. As we could not find any other research benchmarked on GLOCR, we only provide our own CER in Table 2. As noted in [41], there are some misalignments between the ground truth and the images, resulting from the methods used to generate the dataset; this problem, among others, is a source of recognition errors. Our results are not


directly comparable to works using the ADOCR dataset (because we have excluded a significant portion of its images); nevertheless, we have evaluated our model's performance by training solely on this 'safe' part of ADOCR. The results obtained are depicted in Table 4 below.

Table 2. Test results on GLOCR (Character Error Rate in %)

Technique  | Tigrinya Bible | Tigrinya-150kw | Tigrinya-News
Our method | 0.21           | 0.45           | 0.19

Table 3. Test results on our dataset (Character Error Rate in %)

Technique  | Geez Able | Geez Mahtem Unicode | Geez Wookianos | Geez Zemen | Nyala
Our method | 0.28      | 2.38                | 1.44           | 1.57       | 0.78

Table 4. Comparison of state-of-the-art results on ADOCR3 (Character Error Rate in %)

Technique                      | Printed PowerGeez | Synth. PowerGeez | Synth. VisualGeez
LSTM + CTC [38]                | 8.54              | 4.24             | 2.28
CNN + LSTM + CTC [41]          | 1.56              | 3.73             | 1.05
Attention Encoder-Decoder [54] | 1.54              | 5.21             | 1.17
Blended Attention-CTC [55]     | 1.04              | 3.57             | 0.93
Our method                     | 5.99              | -a               | 0.74

a We could not test with synthetic PowerGeez due to misalignment between labels and images.

5.2 Comparison of Model Size

For the purpose of comparison, we only consider the model proposed in [41], as it is the most similar to ours. The architecture proposed in [41] uses 6,675,737 parameters while our model uses only 823,603 parameters. Hence, our architecture is substantially more efficient, decreasing the number of parameters in the model by 87.66% without trading off much accuracy.

3 Printed PowerGeez, Synthesized PowerGeez and Synthesized VisualGeez are parts of ADOCR.


This could make the deployment of this model easier on mobile devices, where storage and memory are scarce. Not only has our model decreased the number of parameters significantly, but it has also surpassed state-of-the-art results on the Synthetic VisualGeez part of ADOCR.

6 Conclusion

In this study, we have investigated a CRNN approach for Tigrinya text recognition. In general, the Tigrinya language lacks access to modern digitization techniques. This study proposes an efficient CRNN network for Tigrinya text recognition. The proposed architecture compromises a negligible amount of accuracy in order to reduce the size of the network and make it feasible for most devices. The architecture was determined by extensive tuning for the optimal hyperparameters, both in terms of network size and accuracy. We have also improved upon the state-of-the-art VisualGeez results of Belay's work [55], reducing the error rate by 0.19%. Furthermore, we have demonstrated the generalization of this model over three distinct datasets. In the future, the CER achieved in this study can be further reduced by using correctly labeled, real-life datasets. Using deep-learning-based word detection would also help to correct segmentation errors encountered prior to recognition. The error rate can be further diminished by utilizing a language model to rectify the errors of the optical model. Recently, transformer-based models have achieved better results than CNNs in solving computer vision problems. Hence, we plan to experiment with a transformer-based text-recognition model for both handwritten and printed Tigrinya text.

Acknowledgements. We would like to express our gratitude to Dr. Birhanu Hailu Belay and Fitsum Gaim for their valuable feedback and datasets. We would also like to thank Daniel Yacob for providing us with books, Dawit Habtegergish for his help in setting up our development infrastructure, and most importantly the Mainefhi College of Engineering and Technology (Eritrea) for permitting us to use their facilities.

References 1. OCR. www.yourdictionary.com/optical-character-recognition. Accessed 25 Aug 2021 2. Voigt, R.: Tigrinya, an African-semitic language. Folia Linguistica Historica 30, 391–400 (2009) 3. Tigrinya Language. www.ucl.ac.uk/atlas/tigrinya/language.html. Accessed 10 Dec 2020 4. United Bible Society: The Bible in Tigrigna (1997) 5. Ge’ez script. https://en.wikipedia.org/wiki/Ge%CA%BDez_script. Accessed 21 Dec 2020 6. Tigrigna Writing System. www.ling.upenn.edu/courses/ling202/WritingSystem.html. Accessed 8 Dec 2020 7. Tigrinya language. https://en.wikipedia.org/wiki/Tigrinya_language#Writing_system. Accessed 15 Dec 2020 8. Microsoft: Tigrigna Style Guide. Accessed 20 Jan 2021 9. Fitsum, K.T., Patel, Y.: Optical character recognition for Tigrigna Printed Documents using HOG and SVM. In: 2nd International Conference on Inventive Communication and Computational Technologies (ICICCT), pp. 1489–1494. IEEE (2018)


10. Gaim, F.: GLOCR: GeezLab OCR Dataset, Harvard Dataverse (2021) 11. Fischer, A., et al.: Automatic transcription of handwritten medieval documents, pp. 137–142 (2009) 12. Alemu, W.: The Application of OCR Techniques to the Amharic Script. Addis Ababa University, Addis Ababa, Ethiopia (1997) 13. Abebe, E.: Recognition of Formatted Amharic Text Using Optical Character Recognition (OCR) Techniques. Addis Ababa University, Addis Ababa, Ethiopia (1998) 14. Teferi, D.: Optical Character Recognition of Typewritten Amharic Text. Addis Ababa University, Addis Ababa, Ethiopia (1999) 15. Aderaw, B.: Amharic Character Recognition Using Artificial Neural Networks. Addis Ababa University, Addis Ababa, Ethiopia (1999) 16. Meshesha, M.: A Generalized Approach to Optical Character Recognition (OCR) of Amharic Texts. Addis Ababa University, Addis Ababa, Ethiopia (2000) 17. Taddesse, N.: Handwritten Amharic Text Recognition Applied to the Processing of Bank Checks. Addis Ababa University, Addis Ababa, Ethiopia (2000) 18. Hailemariam, M.: Line Fitting to Amharic OCR: The Case of Postal Address (2003) 19. Mulugeta, W.: OCR for Special Type of Handwritten Amharic Text (“Yekum Tsifet”): Neural Network Approach. Addis Ababa University, Addis Ababa, Ethiopia (2004) 20. Yaregal, A.: Optical Character Recognition of Amharic Text: An Integrated Approach. Addis Ababa University, Addis Ababa, Ethiopia (2002) 21. Alemu, W., Fuchs, S.: Handwritten Amharic bank check recognition using hidden Markov random field. In: Computer Vision and Pattern Recognition Workshops (CVPRW), Madison, Wisconsin, USA (2003) 22. Cowell, J., Hussain, F.: Amharic Character Recognition using a fast signature-based algorithm. In: Seventh International Conference on Information Visualization, London (2003) 23. Meshesha, M., Jawahar, C.: Recognition of printed Amharic documents. In: International Conference on Document Analysis and Recognition (ICDAR), Seoul, Korea (2005) 24. Yaregal, A., Bigun, J.: Ethiopic character recognition using direction field tensor. In: International Conference on Pattern Recognition (ICPR), Hong Kong (2006) 25. Yaregal, A., Bigun, J.: A neural network approach for multifont and size-independent recognition of Ethiopic characters. In: IWAPR (2007) 26. Yaregal, A., Bigun, J.: A hybrid system for robust recognition of Ethiopic script. In: International Conference on Document Analysis and Recognition, Curitiba, Brazil (2007) 27. Yaregal, A., Bigun, J.: Writer-independent offline recognition of handwritten Ethiopic characters. In: International Conference on Frontiers in Handwriting Recognition (2008) 28. Yaregal, A., Bigun, J.: HMM-based handwritten Amharic word recognition with feature concatenation. In: ICDAR, Barcelona (2009) 29. Teshager, A.: Amharic character recognition system for printed real-life documents. Addis Ababa University, Addis Ababa, Ethiopia (2010) 30. Fitsum, D., Zewidie, F.D.: Developing optical character recognition for Ethiopic scripts. Dalarna University, Dalarna, Sweden (2011) 31. Shiferaw, T.: Optical character recognition for Ge’ez scripts. University of Gonder, Ethiopia (2017) 32. Reta, B.Y., Rana, D., Bhalerao, G.V.: Amharic handwritten character recognition using combined features and support vector machine. In: ICOEI (2018) 33. Siranesh, G.: Ancient Ethiopic manuscript recognition using deep learning artificial neural networks. Addis Ababa University, Addis Ababa, Ethiopia (2016) 34. Belay, B.H., Habtegebrial, T., Stricker, D.: Amharic character image recognition. 
In: IEEE International Conference on Communication Technology, pp. 1179–1182 (2018) 35. Kesito, A.A.: Character recognition of bilingual Amharic-Latin printed documents. Addis Ababa University, Addis Ababa, Ethiopia (2018)


36. Addis, D., Liu, C., Ta, V.: Printed Ethiopic script recognition by using LSTM networks. In: ICSSE, Taipei, Taiwan (2018) 37. Belay, B.H., Habtegebrial, T., Liwicki, M., Belay, G., Stricker, D.: Factored convolutional neural network for Amharic character image recognition. In: ICIP, pp. 2906–2910 (2019) 38. Belay, B.H., Habtegebrial, T., Liwicki, M., Belay, G., Stricker, D.: Amharic text image recognition: database, algorithm, and analysis. In: International Conference on Document Analysis and Recognition (ICDAR). IEEE CPS 2019 39. Greff, K., Srivastava, R.K., Koutnik, J., Steunebrink, B.R., Schmidhuber, J.: LSTM: a search space odyssey. IEEE Trans. Neural Netw. Learn. Syst. 28, 2222–2232 (2017) 40. Graves, A., Fernandez, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In International Conference on Machine Learning (ICML), Pittsburgh, Pennsylvania, USA (2006) 41. Belay, B.H., Habtegebrial, T., Meshesha, M., Liwicki, M., Belay, G., Stricker, D.: Amharic OCR: an end-to-end learning. Appl. Sci. 10(1117), 1–13 (2020) 42. Belay, B.H., Habtegebrial, T., Liwicki, M., Belay, G., Stricker, D.: Amharic text image. In: International Conference on Document Analysis and Recognition, Sydney (2019) 43. Breuel, T. M.: The OCRopus open source OCR system. In: Proceedings of SPIE-IS&T Electronic Imaging, SPIE, Kaiserslautern, Germany, vol. 6815, pp. 68150F (2008) 44. Ghosh, R., Vamshi, C., Kumar, P.: RNN based online handwritten word recognition in Devanagari and Bengali scripts using horizontal zoning. Pattern Recogn. 92, 203–218 (2019) 45. Ciresan, D., Meier, U., Schmidhuber, J.: Multi-column deep neural networks for image classification. In Computer Vision and Pattern Recognition (CVPR), Providence, Rhode Island, USA (2012) 46. Ciresan, D., Meier, U., Masci, J., Gambardella, L. M., Schmidhuber, J.: Flexible, high performance convolutional neural networks for image classification. In: Twenty-Second International Joint Conference on Artificial Intelligence (2011) 47. Russakovsky, O., et al.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115, 211–252 (2014) 48. Lawrence, S., Giles, C.L., Tsoi, A.C., Back, A.D.: Face recognition: a convolutional neural network approach. Trans. Neural Netw. 8(1), 98–113 (1997) 49. Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. Trans. Pattern Anal. Mach. Intell. 39(11), 2298–2304 (2017) 50. Ioffe, S., Szegedy, C..: Batch normalization: accelerating deep network training by reducing internal covariate shift. CoRR (2015) 51. Klambauer, G., Unterthiner, T., Mayr, A., Hochreiter, S.: Self-normalizing neural networks. In: Advances in Neural Information Processing Systems, vol. 30 (2017) 52. Abdurahman, F., Sisay, E., Fante, K.A.: AHWR Net: offline handwritten Amharic word recognition using convolutional recurrent neural network. SN Appl. Sci. 3, 760 (2021). https:// doi.org/10.1007/s42452-021-04742-x 53. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2017) 54. Belay, B.H., Habtegebrial, T., Meshesha, M., Liwicki, M., Belay, G., Stricker, D.: Learning by injection: attention embedded recurrent neural network for Amharic text-image recognition. Preprints (2020) 55. Belay, B.H., Habtegebrial, T., Liwicki, M., Belay, G., Stricker, D.: Blended attentionCTC for Amharic text-image recognition: sequence-to-sequence-learning. 
In: International Conference on Pattern Recognition Applications and Methods (ICPRAM) (2021)

Generating Pseudo-labels for Car Damage Segmentation Using Deep Spectral Method

Nonthapaht Taspan1, Bukorree Madthing1, Panumate Chetprayoon2, Thanatwit Angsarawanee2, Kitsuchart Pasupa1(B), and Theerat Sakdejayont2

1 School of Information Technology, King Mongkut's Institute of Technology Ladkrabang, Bangkok 10520, Thailand
{63070228,63070234,kitsuchart}@it.kmitl.ac.th
2 Kasikorn Labs, Nonthaburi 11120, Thailand
{panumate.c,thanatwit.a,theerat.s}@kbtg.tech

Abstract. Car damage segmentation, an integral part of vehicle damage assessment, involves identifying and classifying various types of damages from images of vehicles, thereby enhancing the efficiency and accuracy of assessment processes. This paper introduces an efficient approach for car damage assessment by combining pseudo-labeling and deep learning techniques. The method addresses the challenge of limited labeled data in car damage segmentation by leveraging unlabeled data. Pseudo-labels are generated using a deep spectral approach and refined through merge and flip-bit operations. Two models, i.e., Mask R-CNN and SegFormer, are trained using a combination of ground truth labels and pseudo-labels. Experimental evaluation on the CarDD dataset demonstrates the superior accuracy of our method, achieving improvements of 12.9% in instance segmentation and 18.8% in semantic segmentation when utilizing a 1/2 ground truth ratio. In addition to enhanced accuracy, our approach offers several benefits, including time savings, cost reductions, and the elimination of biases associated with human judgment. By enabling more precise and reliable identification of car damages, our method enhances the overall effectiveness of the assessment process. The integration of pseudo-labeling and deep learning techniques in car damage assessment holds significant potential for improving efficiency and accuracy in real-world scenarios.

Keywords: Pseudo-Label · Car Damage Segmentation · Deep Spectral Method

1 Introduction

Car damage assessment is vital in automotive, insurance, and accident investigations. It traditionally relied on manual inspection, which is slow, costly, and subjective. AI, deep learning, and computer vision advancements now offer automated and improved car damage assessment, saving time, reducing costs, and


minimizing subjective judgments. The integration of AI into car damage assessment offers a range of advantages. Firstly, it saves time by automating image or video analysis, leading to prompt repairs and improved customer satisfaction. Secondly, AI reduces costs by minimizing manual labor and optimizing resource utilization. Additionally, AI provides consistent and objective evaluations, eliminating biases in human judgment. Car damage assessment has widespread applications across various domains, such as the car insurance industry, where it plays a pivotal role in reducing the cost of insurance claims [11], determining insurance package costs [1], and streamlining the insurance inspection process [18]. Additionally, it is utilized in the car leasing industry [10] and car crash forensic analysis [3]. However, a major challenge in developing models for car damage assessment is the scarcity of labeled data. Obtaining accurate segmentation labels for damaged regions in car images is labor-intensive and costly. Furthermore, Zhang et al. [18] highlight that the cost of annotating image segmentation is five times higher compared to object detection. Similarly, Lee et al. [8] explain that the annotation time for segmenting one image is approximately four minutes, whereas bounding box annotation takes only 38 s. These findings underscore the challenging nature of segmentation annotation tasks and the additional time and effort required compared to other annotation methods. To address this problem, pseudo-labeling [7] has emerged as a promising technique. It utilizes AI to generate pseudo-labels for unannotated data, expanding the labeled dataset and allowing the utilization of larger quantities of unlabeled data. However, traditional pseudo-labeling models often generate pseudo-labels that are not of sufficient quality for accurate car damage assessment. This can lead to suboptimal performance and reduced reliability of AI models in segmenting and assessing car damages. Therefore, this paper aims to propose a method for generating pseudo-labels specifically designed for car damage segmentation. This paper makes the following key contributions:
– Pseudo-Label Generation. We propose a novel method to generate pseudo-labels specifically tailored for car damage segmentation.
– Application to CarDD Dataset. We demonstrate the effectiveness of our approach by applying it to the CarDD dataset [14], which comprises a comprehensive collection of car images with varying degrees of damage.
– Comparative Evaluation. We conduct evaluations to compare the similarity between the generated pseudo-labels and ground truth annotations. Additionally, we compare the performance of models trained with both types of labels, providing valuable insights into our approach's effectiveness.

2 Related Work

In this section, we discuss relevant research on car AI damage assessment, which serves as our main task, and pseudo-labeling, the technique we used to address real-world car damage data scarcity. Additionally, we introduce two semantic


segmentation models that we utilized to evaluate the quality of our generated pseudo-labels.

2.1 Car AI Damage Assessment

Patil et al. [11] introduce an intuitive damage classification model that uses convolutional neural networks (CNNs) to classify various types of damage based on the affected parts, such as bumper dent, door dent, and glass shatter. Kyu and Woraratpanya [6] present a straightforward yet effective car damage classification model. The model performs an initial assessment to determine if a car is damaged or not, followed by a comprehensive analysis that includes damage severity classification into three levels (minor, moderate, severe) and damage location classification into three regions (front, rear, side). Shaikh et al. [12] propose a binary damage classification model utilized in a mobile application for car price prediction. Unlike other approaches, Zhang et al. [18] utilize videos instead of photos to interact with users, making the whole procedure as simple as possible by using a two-stage detection model.

2.2 Pseudo-Labeling

Lee et al. [7] introduce a straightforward approach for semi-supervised learning using pseudo-labeling. The method involves iteratively training deep neural networks with both labeled and unlabeled data, assigning pseudo-labels to the unlabeled samples based on model predictions. It achieves competitive results with a reduced number of labeled samples, making it a simple and efficient approach for leveraging unlabeled data in deep learning tasks. MixMatch [2] employs data augmentation, consistency regularization, and ensemble models with cross-entropy loss and mean squared error, while FixMatch [13] enhances this approach by simplifying the process, using pseudo-labeling for easy augmentations and incorporating hard augmentations for prediction. These methods are evaluated on the task of image classification. For the segmentation task, MaskContrast [4] is the leading unsupervised semantic segmentation method; it utilizes saliency detection to identify object segments and learns pixel-wise embeddings through a contrastive objective. However, it heavily depends on a saliency network initialized with a pre-trained network and assumes that all foreground pixels in an image belong to the same object category, which may not hold true for images with multiple objects, as in the car damage segmentation task. Therefore, in our paper, we apply the Deep Spectral Method [9], which is suitable for car damage segmentation tasks, and introduce original contributions to the generation of pseudo-labels. We assess the generated pseudo-labels by training and evaluating the segmentation models, which will be discussed in the following section.

2.3 Segmentation Tasks

In this section, we consider two types of segmentation tasks: instance segmentation and semantic segmentation. For the instance segmentation task, we select


Mask-RCNN as the base model because of its widespread use in the field and its use as a reference in [14]. To further validate the effectiveness of our generated pseudo-labels, we also evaluate them in a semantic segmentation task, using SegFormer as the base model.

Mask-RCNN. Mask R-CNN [5] is a framework for object instance segmentation that combines object detection and image segmentation. It extends the Faster R-CNN architecture by adding a parallel branch for predicting pixel-level segmentation masks alongside bounding box recognition. This approach efficiently detects objects in an image while generating high-quality segmentation masks for each instance. Mask R-CNN is simple to train, adds minimal overhead to Faster R-CNN, and achieves real-time performance. It has achieved top results in various challenges, surpassing existing methods in tasks like instance segmentation, object detection, and person keypoint detection. The availability of the code has made it a solid baseline and has contributed to easing future research in instance-level recognition.

SegFormer. SegFormer [16] is a baseline semantic segmentation framework that effectively combines Transformers with lightweight multilayer perceptron (MLP) decoders. It tackles two significant challenges in semantic segmentation with innovative solutions. Firstly, SegFormer introduces a hierarchically structured Transformer encoder that overcomes the need for positional encoding while generating multiscale features. This eliminates the performance degradation that occurs when the testing resolution differs from the training resolution. Secondly, the framework utilizes a straightforward MLP decoder that leverages both local and global attention mechanisms to create powerful representations. This design enables efficient segmentation with Transformers, delivering impressive results.

3 Methodology

This research consists of two modules as follows: (i) Generating pseudo-labels from bounding box annotations using the deep spectral method: The extracted bounding boxes are fed into the deep spectral method (described in Sect. 3.1) to extract relevant features. These features are then subjected to the proposed post-processing techniques (described in Sects. 3.2 and 3.3) in order to generate categorical masks. (ii) Utilizing pseudo-labels to train a car damage segmentation model: Once the pseudo-labels are obtained, they are used to train the car damage segmentation models and are evaluated in various scenarios. The overview of this work is illustrated in Fig. 1.

3.1 Deep Spectral Methods

The Deep Spectral method [9] tackles the challenges of unsupervised localization and segmentation in computer vision.


Fig. 1. System Overview: Generating Pseudo-Labels (Top) and Utilizing Pseudo-Labels for Training Car Damage Segmentation Model (Bottom)

Unlike existing deep learning-based methods, this approach draws inspiration from traditional spectral segmentation techniques. It begins by extracting dense features from an image using a neural network and constructing a semantic affinity matrix to capture relationships between image regions. By fusing this matrix with color information, the method preserves fine-grained details. The image is then decomposed into soft segments using eigenvectors derived from the Laplacian matrix of the affinity matrix. Notably, these eigenvectors already provide meaningful segments and can be utilized for various downstream tasks. For object localization, the eigenvector with the smallest nonzero eigenvalue accurately identifies the region of interest. Experimental results on complex datasets demonstrate that it outperforms existing techniques by a significant margin, delivering outstanding performance in unsupervised localization and segmentation, and it offers versatility for complex image editing tasks like background removal and compositing.
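The authors rely on the published Deep Spectral implementation; purely as an illustration of the spectral step itself (affinity matrix → graph Laplacian → smallest non-trivial eigenvectors), a stripped-down sketch might look as follows. The feature extractor and the colour-affinity term of the original method are deliberately omitted here, so this is not the authors' code.

```python
import numpy as np

def spectral_eigenvectors(features, k=3):
    """features: (num_patches, dim) dense descriptors for one bounding-box crop.
    Returns the k Laplacian eigenvectors with the smallest non-zero eigenvalues;
    each can be reshaped to the patch grid and thresholded into a binary mask."""
    feats = features / (np.linalg.norm(features, axis=1, keepdims=True) + 1e-8)
    affinity = np.clip(feats @ feats.T, 0, None)   # keep only non-negative similarities
    degree = np.diag(affinity.sum(axis=1))
    laplacian = degree - affinity
    vals, vecs = np.linalg.eigh(laplacian)         # eigenvalues in ascending order
    return vecs[:, 1:k + 1]                        # skip the trivial constant eigenvector
```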

3.2 Flip-Bit Algorithm

The flip-bit algorithm is used to refine the binary masks obtained from thresholding the selected eigenvectors. The algorithm aims to adjust the pixel values in the corners of the binary image, considering that these corners might not accurately represent the foreground (car damage) due to the nature of the pseudo-labeling generated from damage bounding boxes. This post-processing step aims to improve the accuracy and consistency of the segmentation results. The proposed algorithm is shown in Algorithm 1.



Algorithm 1. Flip-bit Algorithm
Require: I: Binary image
Require: checksum: Threshold value for flipping the pixel values
1: Initialize an empty list x
2: for each corner pixel in I do
3:   Add the pixel value to the list x
4: end for
5: sum ← sum of the values in x
6: if sum ≥ checksum then
7:   Flip all pixel values in the binary image (from 1 to 0 and from 0 to 1)
8: end if
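A direct NumPy transcription of Algorithm 1 might look like this (a sketch; we read "each corner pixel" as the four image corners):

```python
import numpy as np

def flip_bit(mask: np.ndarray, checksum: int) -> np.ndarray:
    """Algorithm 1: if enough of the four corner pixels of the binary mask are
    foreground (1), assume foreground/background were swapped and invert the mask."""
    corners = [mask[0, 0], mask[0, -1], mask[-1, 0], mask[-1, -1]]
    if sum(int(c) for c in corners) >= checksum:
        return 1 - mask
    return mask
```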

3.3 Merge Eigenvectors

In the merge eigenvectors process, we expand the information used for thresholding by considering the top three eigenvectors from the Deep Spectral algorithm. We believe that additional eigenvectors capture complementary features that further improve the segmentation results, as shown in Fig. 2. We individually threshold each eigenvector and merge the resulting binary images using the logical OR operation. This allows us to create a fused representation of the car damage regions based on the collective information from multiple eigenvectors.

Fig. 2. The top three eigenvectors extracted using the Deep Spectral method capturing complementary features
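Combining the pieces, the merge step described above could be sketched as follows. The zero threshold applied to each eigenvector is an assumption (the paper does not state the exact thresholding rule), and flip_bit refers to the sketch given after Algorithm 1.

```python
import numpy as np

def merge_eigenvector_masks(eigvecs, shape, checksum=3, num_vectors=3):
    """Threshold each of the top eigenvectors into a binary mask, apply the
    flip-bit correction, and fuse the masks with a logical OR."""
    merged = np.zeros(shape, dtype=np.uint8)
    for v in eigvecs.T[:num_vectors]:                    # iterate over eigenvector columns
        mask = (v.reshape(shape) > 0).astype(np.uint8)   # assumed threshold at zero
        mask = flip_bit(mask, checksum)
        merged |= mask
    return merged
```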

Once we obtain pseudo-labels that achieve the desired level of mean Intersection over Union (mIoU), these pseudo-labels—binary masks—are converted to category masks. This is done by replacing each bounding box annotation with its corresponding pseudo-label.

3.4 Utilizing Pseudo-Labels for Training

Once the pseudo-labels for car damage segmentation are generated using the proposed method, they can be used to train models effectively. The integration of pseudo-labels aims to overcome the limitations posed by the scarcity of annotated data


in car damage segmentation and improves the performance and reliability of the trained models. A weight penalty loss is introduced to optimize the training process and account for potential noise or inconsistencies in the pseudo-labels,

\[ L_{\text{weight}} = \sum_{j=1}^{N} w_j \cdot L_{\text{seg}}(q_j, z_j) + \sum_{i=1}^{M} L_{\text{seg}}(p_i, y_i), \quad (1) \]

where N represents the number of samples that possess pseudo-labels; M denotes the number of samples that possess ground truth labels; w_j refers to the weight assigned to the j-th pseudo-labeled sample; p_i and y_i represent the predicted and ground truth segmentation masks for the i-th ground-truth sample, respectively; q_j and z_j represent the predicted and ground truth segmentation masks for the j-th pseudo-labeled sample, respectively; and L_seg(q_j, z_j) and L_seg(p_i, y_i) represent the segmentation losses calculated for samples with pseudo-labels and for samples with ground truth labels, respectively. It can be seen that this loss assigns weights to the pseudo-labeled samples, thereby controlling their influence during model training.
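Eq. (1) could be realized per batch roughly as follows in PyTorch; seg_loss_fn stands for whatever per-sample segmentation loss the underlying model uses, and the function name and batching scheme are ours, not the authors'.

```python
import torch

def weighted_segmentation_loss(seg_loss_fn, preds, targets, is_pseudo, w=0.5):
    """Pseudo-labelled samples contribute their segmentation loss scaled by w,
    ground-truth samples contribute at full weight (cf. Eq. (1))."""
    total = preds.new_zeros(())
    for pred, target, pseudo in zip(preds, targets, is_pseudo):
        sample_loss = seg_loss_fn(pred.unsqueeze(0), target.unsqueeze(0))
        total = total + (w * sample_loss if pseudo else sample_loss)
    return total
```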

4 Experimental Framework

4.1 CarDD Dataset

CarDD [14], short for Car Damage Detection, is a significant contribution to the field of computer vision, offering the first publicly available dataset specifically designed for car damage detection and segmentation. It comprises 4,000 high-resolution car damage images with over 9,000 labeled instances, surpassing all existing datasets in terms of volume and image quality. The dataset covers six damage types: dent, scratch, crack, glass shatter, tire flat, and lamp broken, presenting a challenge in distinguishing subtle differences between them. CarDD supports multiple tasks, including classification, object detection, instance segmentation, and salient object detection, and provides annotations for damage type, location, and magnitude, making it a practical resource for advancing the development of car damage assessment algorithms.

4.2 Experiment Setting

Generated Pseudo-Labels from Deep Spectral. In this section, the focus is on generating pseudo-labels from Deep Spectral for car damage segmentation. The process begins with the extraction of car damage bounding boxes from the images. These bounding boxes are then used as input to the deep spectral method to generate binary masks corresponding to the car damage regions. To refine and improve the binary masks generated by Deep Spectral, a proposed post-processing technique is employed. This technique involves two steps:


flip-bit and merge eigenvectors. In the flip-bit step, a checksum operation is performed on the binary mask, with the checksum value varied from 1 to 4. This operation helps enhance the clarity and accuracy of the binary mask. Next, the merge eigenvectors step is executed, where the first three eigenvectors are merged using the OR operation. This merging process aims to capture and incorporate the underlying structure and patterns in the data, resulting in an improved binary mask representation. After applying the proposed post-processing technique, the refined binary mask is assigned as the pseudo-label. To evaluate the quality and performance of these pseudo-labels, they are compared against the ground truth masks using the mIoU:

\[ \text{mIoU} = \frac{1}{N} \sum_{i=1}^{N} \frac{TP_i}{TP_i + FP_i + FN_i}, \quad (2) \]

where N is the total number of classes, TP_i is the true positive count for class i, FP_i is the false positive count for class i, and FN_i is the false negative count for class i.

Training Segmentation with Pseudo-Labels. Here, we utilize the set of pseudo-labels that achieves the best performance for each class in our previous experiment. We focus on investigating the impact of varying the ratio between ground truth labels and pseudo-labels in the training set for car damage segmentation. Specifically, we set different ground truth ratios, namely 1/2, 1/4, 1/8, 1/16, and 1 (no pseudo-labels) [15,17], to understand how the availability of ground truth annotations influences the model's performance, and we evaluate the model on the test set. As described in the previous section, the weight penalty loss is employed in the training process. When a training sample is labeled with a pseudo-label, it is assigned a weight penalty (w_j) of 0.5. This weight penalty serves as a regularization mechanism to control the influence of the pseudo-labels on the optimization process. By assigning a lower weight to the pseudo-labeled samples, the model is encouraged to prioritize the ground truth annotations in its learning.

Mask-RCNN. In our experimental setup, we utilize the Mask-RCNN model with a ResNet-50 backbone for training the car damage segmentation task. The Mask-RCNN architecture is a widely used framework for instance segmentation, capable of simultaneously detecting and segmenting objects within an image. The overall loss function of the Mask-RCNN model includes the bounding box regression loss (L_bbox), the classification loss (L_cls), the mask segmentation loss for ground truth (L_mask^(y)), and the mask segmentation loss for pseudo-labels (L_mask^(z)). The total loss is calculated as the sum of these individual losses:

\[ L_{\text{Mask-RCNN}} = L_{\text{bbox}} + L_{\text{cls}} + L_{\text{mask}}^{(y)} + w \cdot L_{\text{mask}}^{(z)}. \quad (3) \]
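The paper does not say which Mask R-CNN implementation was used. With torchvision's reference implementation, the setup and the weighting of Eq. (3) could be sketched as follows; the class count (6 damage types plus background) and the training-step structure are assumptions, and torchvision also returns RPN losses, which are simply summed in unchanged here.

```python
import torch
import torchvision

# 6 damage classes + background; pretrained weights could be loaded via weights="DEFAULT".
model = torchvision.models.detection.maskrcnn_resnet50_fpn(num_classes=7)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

def training_step(images, targets, is_pseudo, w=0.5):
    """One step in the spirit of Eq. (3): down-weight the mask loss when the
    batch carries pseudo-labels instead of ground-truth masks."""
    loss_dict = model(images, targets)                  # training mode returns a loss dict
    mask_weight = w if is_pseudo else 1.0
    loss = (loss_dict["loss_box_reg"] + loss_dict["loss_classifier"]
            + mask_weight * loss_dict["loss_mask"]
            + loss_dict["loss_objectness"] + loss_dict["loss_rpn_box_reg"])
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```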


To evaluate the performance of the Mask-RCNN model, we employ the Box Average Precision (Box AP) and Mask Average Precision (Mask AP) metrics. These metrics measure the accuracy of the bounding box and mask predictions, respectively, by considering the IoU between the predictions and the ground truth at various IoU thresholds. A higher Mask AP score indicates better segmentation performance. During training, we set the hyperparameters as follows: the initial learning rate is set to 1e−4, the batch size is set to 1, and we train the model for a total of 50 epochs. We utilize the AdamW optimizer with default parameters for optimization. The optimal epoch is determined based on the criterion of achieving the lowest validation loss.

SegFormer. In our experimental setup, we utilize the SegFormer-b2 model for training the car damage segmentation task in the semantic segmentation framework. It is based on the transformer architecture for semantic segmentation, which focuses on capturing contextual information and global dependencies within an image. Here, we employ the weighted penalty loss that incorporates pseudo-labels into the training process:

\[ L_{\text{SegFormer}} = L_{\text{mask}}^{(y)} + w \cdot L_{\text{mask}}^{(z)} \quad (4) \]

To assess the performance of the SegFormer model, we utilize the mIoU metric. The hyperparameters for training the SegFormer model with the weight penalty loss are set as follows: the learning rate is initialized to 5e−5, the batch size is set to 1, and the number of training epochs is 60. We use the AdamW optimizer with default parameters for optimization. Early stopping is applied with a patience of 10 epochs, monitoring the maximum mIoU on the validation set.
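Likewise, one plausible way to set up SegFormer-b2 with the weighted loss of Eq. (4) is sketched below using HuggingFace Transformers; the checkpoint name, label count, and reuse of the model's built-in cross-entropy loss are assumptions rather than the authors' actual code.

```python
import torch
from transformers import SegformerForSemanticSegmentation

# "nvidia/mit-b2" is the ImageNet-pretrained MiT-B2 encoder; the decode head is
# initialised from scratch for the assumed 6 damage classes + background.
model = SegformerForSemanticSegmentation.from_pretrained("nvidia/mit-b2", num_labels=7)
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

def training_step(pixel_values, labels, is_pseudo, w=0.5):
    """Weighted-loss step in the spirit of Eq. (4): the model's built-in
    cross-entropy loss is simply scaled by w for pseudo-labelled samples."""
    outputs = model(pixel_values=pixel_values, labels=labels)
    loss = (w if is_pseudo else 1.0) * outputs.loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```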

5 Results and Discussion

5.1 Evaluate Generated Pseudo-Labels from Deep Spectral

In this section, we evaluate the effectiveness of the proposed method for generating pseudo-labels in car damage segmentation. We analyze the quality of the generated pseudo-labels by comparing them with the ground truth annotations. The evaluation is primarily based on the IoU metric, which measures the overlap between the pseudo-labels and the corresponding ground truth masks. The results presented in Table 1 compare the performance of the proposed method with the baseline conventional deep spectral method. The evaluation is based on the mIoU between the generated pseudo-labels and the ground truth segmentation masks for different car damage types. According to Table 1, the baseline method, utilizing the first eigenvector, achieves mIoU values ranging from 0.171 for the dent to 0.462 for the tire flat.


Table 1. Comparison of mIoU performance between bounding box pseudo-labels and ground truth annotations from the CarDD Dataset using the deep spectral baseline method and our proposed method. bold—best

Merge Eigenvector | flip-bit checksum value | dent  | scratch | crack | lamp broken | glass shatter | tire flat
Baseline          | No                      | 0.171 | 0.220   | 0.444 | 0.448       | 0.377         | 0.462
1st               | 1                       | 0.611 | 0.546   | 0.078 | 0.627       | 0.592         | 0.482
1st               | 2                       | 0.527 | 0.518   | 0.266 | 0.630       | 0.592         | 0.478
1st               | 3                       | 0.311 | 0.422   | 0.471 | 0.613       | 0.586         | 0.477
1st               | 4                       | 0.202 | 0.289   | 0.472 | 0.580       | 0.538         | 0.472
1st OR 2nd        | No                      | 0.369 | 0.442   | 0.376 | 0.567       | 0.560         | 0.510
1st OR 2nd        | 1                       | 0.655 | 0.584   | 0.214 | 0.718       | 0.757         | 0.840
1st OR 2nd        | 2                       | 0.640 | 0.585   | 0.274 | 0.760       | 0.784         | 0.843
1st OR 2nd        | 3                       | 0.537 | 0.550   | 0.367 | 0.759       | 0.775         | 0.845
1st OR 2nd        | 4                       | 0.422 | 0.493   | 0.386 | 0.735       | 0.731         | 0.837
1st OR 2nd OR 3rd | No                      | 0.486 | 0.501   | 0.351 | 0.641       | 0.674         | 0.601
1st OR 2nd OR 3rd | 1                       | 0.669 | 0.589   | 0.256 | 0.737       | 0.801         | 0.828
1st OR 2nd OR 3rd | 2                       | 0.667 | 0.597   | 0.284 | 0.779       | 0.820         | 0.834
1st OR 2nd OR 3rd | 3                       | 0.618 | 0.580   | 0.321 | 0.784       | 0.821         | 0.846
1st OR 2nd OR 3rd | 4                       | 0.536 | 0.539   | 0.351 | 0.759       | 0.790         | 0.842

When we apply the flip-bit algorithm to the first eigenvector, we observe improvements in mIoU values for all categories except for cracks. In the case of cracks, there is a significant drop in mIoU, decreasing from 0.444 to 0.078 for a flip-bit checksum of 1 and to 0.266 for a flip-bit checksum of 2. These results suggest challenges in accurately segmenting cracks using the generated pseudo-labels. By merging two eigenvectors, we noticeably improve the performance of the model in 5 out of 6 classes, except for the class with cracks. When we apply the flip-bit algorithm, we further observe improvements in mIoU values for all classes except for cracks, specifically on flip-bit checksums of 1 to 3. These results indicate that incorporating additional eigenvectors and refining the binary masks through the flip-bit algorithm contribute to enhancing the overall segmentation performance. The highest mIoU values are obtained when merging three eigenvectors and applying the flip-bit algorithm, resulting in improved performance across most categories. However, in the case of cracks, the best performance is achieved with only one eigenvector and a flip-bit checksum of 4. This finding suggests that utilizing more eigenvectors captures complementary features and generally improves the accuracy of segmentation. 5.2

Evaluate by Training the Model

We conduct a performance evaluation of two models for car damage segmentation using a semi-supervised approach. The first model, Mask R-CNN, is trained

478

N. Taspan et al.

on the CarDD dataset specifically for instance segmentation. The second model, SegFormer, is also trained on the same dataset but focuses on semantic segmentation. To harness the advantages of semi-supervised learning, we utilize the pseudo-labels generated from the previous task. We select the pseudo-label generated for each class, based on the optimal mIoU of each class obtained from the settings mentioned in Sect. 5.1, as the ground truth to train the models. Examples of pseudo-labels are shown in Fig. 3.

Fig. 3. Examples of the pseudo-labels generated by the proposed method

Mask-RCNN. According to Table 2, when we use only samples with the ground truth labels (y), the Mask AP gradually improves as the ground truth ratio increases. This indicates the importance of having a larger number of ground truth labels for training, as it provides more accurate and reliable supervision. When we incorporate samples with pseudo-labels generated from the deep spectral method (zDS ), the performance is lower compared to using only the ground truth labels. This observation suggests that the pseudo-labels may introduce some noise or inconsistencies that adversely affect the model’s ability to segment car damage instances accurately. However, when we utilize the pseudo-labels generated from the deep spectral method with merge and flip-bit operations (zDSM F ), the performance improves,

Pseudo-labels for Car Damage Segmentation

479

Table 2. Comparison of the usage of ground truth and pseudo-labels on Mask Average Precision (Mask AP) at IoU = 0.5 by varying different sizes of ground truth ratios in the training set with Mask R-CNN (number in parenthesis indicates the number of ground truth labels). The labels considered include ground truth labels (y), pseudolabels from deep spectral (zDS ), and pseudo-labels from deep spectral with merge and flip-bit operation (zDSM F ). bold—best Label

Ground Truth Ratio 1/16 (176) 1/8 (352) 1/4 (704) 1/2 (1408) 1 (2816)

y

0.335

0.422

0.464

0.487

0.584

y+zDS

0.170

0.319

0.446

0.550

-

y+zDSM F 0.510

0.589

0.550

0.621

-

especially at a ground truth ratio of 1/2 (1408 labels). This indicates that the additional processing steps applied to the pseudo-labels help refine and enhance their quality, leading to better segmentation results. Overall, the results demonstrate that incorporating pseudo-labels, especially with additional post-processing steps, can complement the limited availability of ground truth labels and improve the performance of the Mask-RCNN model in car damage instance segmentation tasks. In order to make a direct comparison with the CarDD paper, we evaluate the results obtained from training the Mask R-CNN model. The table presents the evaluation metrics, including Box AP and Mask AP, at different ground truth ratios and corresponding pseudo-label ratios, as illustrated in Table 3. When the ground truth ratio is 1/2, meaning that only half of the training data consists of ground truth annotations, the model still achieves a high Mask AP of 0.621 and a respectable Box AP of 0.651. Even in scenarios where the pseudo-label ratio is high, such as 15/16, the model maintains a Mask AP of 0.510 and a Box AP of 0.584. The results demonstrate that the incorporation of pseudo-labels does not lead to a significant degradation in the model’s performance. Regardless of the ground truth ratios, which vary from 1/16 to 1, the model consistently achieves competitive results in terms of both Box AP and Mask AP. This indicates that the pseudo-labels provide valuable information that complements the ground truth annotations, resulting in accurate and precise segmentation predictions. SegFormer. Table 4 illustrates the outcomes of the semantic segmentation task performed using SegFormer, where a combination of ground truth and pseudolabels at different ratios is utilized. The performance of SegFormer demonstrates a consistent trend that aligns with the findings from Mask-RCNN in the instance segmentation task. This similarity indicates the effectiveness of the pseudo-labels generated by our proposed algorithm. It is worth noting that employing 100% ground truth data still yields the best performance. Nonetheless, the results underscore the valuable contribution of the pseudo-labels in complementing the ground truth annotations and achieving effective semantic segmentation outcomes.

480

N. Taspan et al.

Table 3. Performance of the model trained on a combination of ground truth and pseudo-labels generated from our proposed technique, with varying proportions, evaluated with the test set. The number in parentheses indicates the number of samples used in each combination.

Ground Truth Psuedo-label Box AP@50 Mask AP@50 Box AP@75 Mask AP@75 1/16 (176)

15/16 (2640) 0.584

0.510

0.387

0.341

1/8 (352)

7/8 (2464)

0.645

0.589

0.447

0.403

1/4 (704)

3/4 (2112)

0.594

0.550

0.381

0.354

1/2 (1408)

1/2 (1408)

0.651

0.621

0.435

0.416

1 (2816)

0 (0)

0.613

0.584

0.419

0.431

Table 4. Comparison of the usage of ground truth and pseudo-labels on Mean Intersect Over Union (mIoU) by varying different sizes of ground truth ratios in the training set with SegFormer (number in parenthesis indicates the number of ground truth labels). The labels considered include ground truth labels (y), pseudo-labels from deep spectral (zDS ), and pseudo-labels from deep spectral with merge and flip-bit operation (zDSM F ). bold—best Label

Ground Truth Ratio 1/16 (176) 1/8 (352) 1/4 (704) 1/2 (1408) 1 (2816)

y

0.545

0.536

0.600

0.625

0.693

y+zDS

0.346

0.389

0.402

0.558

-

y+zDSM F 0.612

0.598

0.621

0.663

-

We further investigate the mIoU achieved by SegFormer when using a combination of ground truth and pseudo-labels at different ratios, specifically for different damage categories in the semantic segmentation task, as shown in Table 5. This analysis allows us to understand the performance of SegFormer in accurately segmenting each damage category and assess the impact of the pseudo-labels on the segmentation quality. The results show an improvement in performance, with an increase in mIoU as the ratio of ground truth data increases. Comparing the results between the ground truth ratios of 1 and 1/4, using 100% ground truth data results in mIoU of 0.693, while using only 25% ground truth data leads to a performance drop to 0.621, representing a 10.4% worsening in performance. It is important to note that this drop in performance occurs despite the reduction in the number of ground truth samples by 75.0%. In addition, the observed 10.4% drop in performance is primarily driven by a significant reduction in the crack class, which decreases from 0.578 in the case of using 100% ground truth data to 0.152 when only 25% ground truth data is used.

Pseudo-labels for Car Damage Segmentation

481

Table 5. Performance comparison of SegFormer with varying ground truth (y) ratios and the inclusion of pseudo-labels (zDSM F ) for different damage types in the semantic segmentation task. Label y

mIoU zDSM F

Total

dent

scratch crack

lamp broken

glass tire flat shatter

1/16 (176) 15/16 (2640) 0.612

0.784

0.689

0.150

0.950

0.792

0.919

1/8 (352)

7/8 (2464)

0.598

0.670

0.733

0.049

0.976

0.808

0.953

1/4 (704)

0.920

3/4 (2112)

0.621

0.712

0.721

0.152

0.953

0.889

1/2 (1408) 1/2 (1408)

0.663

0.771

0.735

0.446

0.924

0.822

0.946

1 (2816)

0.693

0.733

0.701

0.578

0.984

0.894

0.961

6

0 (0)

Conclusion

In conclusion, this paper addresses the need for efficient car damage assessment using AI technologies and highlights the significant benefits it offers, including time savings, cost reductions, and the elimination of biases associated with human judgment. Furthermore, by leveraging the power of pseudo-labeling, we successfully overcome the challenge of limited labeled data, enabling us to harness the vast potential of unlabeled data for training AI models. Our proposed method for generating pseudo-labels in car damage segmentation outperforms the baseline approach on the CarDD dataset, demonstrating its superior accuracy. With an improvement of 12.9% in the instance segmentation task and 18.8% in the semantic segmentation task when using a 1/2 ground truth ratio, our method enables more precise and reliable identification of car damages, enhancing the overall effectiveness of the assessment process. In future work, we aim to extend the application of pseudo-labeling techniques in car damage segmentation by exploring real-world unlabeled datasets, thereby harnessing the potential of unlabeled data for training AI models in practical scenarios. Additionally, we will validate and generalize our approach by exploring diverse car damage segmentation datasets beyond CarDD, considering variations in car models, damage types, image qualities, and environmental conditions.

References 1. Balasubramanian, R., Libarikian, A., McElhaney, D.: Insurance 2030–the impact of AI on the future of insurance. McKinsey & Company (2018) 2. Berthelot, D., Carlini, N., Goodfellow, I.J., Papernot, N., Oliver, A., Raffel, C.: MixMatch: a holistic approach to semi-supervised learning. In: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, pp. 5050–5060 (2019) 3. Chen, Q., et al.: A deep neural network inverse solution to recover pre-crash impact data of car collisions. Transp. Res. Part C Emerg. Technol. 126, 103009 (2021)

482

N. Taspan et al.

4. Gansbeke, W.V., Vandenhende, S., Georgoulis, S., Gool, L.V.: Unsupervised semantic segmentation by contrasting object mask proposals. In: 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, pp. 10032–10042 (2021) 5. He, K., Gkioxari, G., Dollár, P., Girshick, R.B.: Mask R-CNN. In: IEEE International Conference on Computer Vision, ICCV 2017, pp. 2980–2988 (2017) 6. Kyu, P.M., Woraratpanya, K.: Car damage detection and classification. In: The 11th International Conference on Advances in Information Technology, IAIT 2020 (2020) 7. Lee, D.H.: Pseudo-label: the simple and efficient semi-supervised learning method for deep neural networks. In: Workshop on Challenges in Representation Learning, International Conference on Machine Learning Workshop 2013, pp. 1–6 (2013) 8. Lee, J., Yi, J., Shin, C., Yoon, S.: BBAM: bounding box attribution map for weakly supervised semantic and instance segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, pp. 2643–2652 (2021) 9. Melas-Kyriazi, L., Rupprecht, C., Laina, I., Vedaldi, A.: Deep spectral methods: a surprisingly strong baseline for unsupervised semantic segmentation and localization. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, pp. 8354–8365 (2022) 10. Pal, N., Arora, P., Kohli, P., Sundararaman, D., Palakurthy, S.S.: How much is my car worth? A methodology for predicting used cars’ prices using random forest. In: The 2018 Future of Information and Communication Conference, FICC 2018, pp. 413–422 (2019) 11. Patil, K., Kulkarni, M., Sriraman, A., Karande, S.: Deep learning based car damage classification. In: 16th IEEE International Conference on Machine Learning and Applications, ICMLA 2017, pp. 50–54 (2017) 12. Shaikh, M.K., Zaki, H., Tahir, M., Khan, M.A., Siddiqui, O.A., Rahim, I.U.: The framework of car price prediction and damage detection technique. Pak. J. Eng. Technol. 5(4), 52–59 (2022) 13. Sohn, K., et al.: FixMatch: simplifying semi-supervised learning with consistency and confidence. In: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020 (2020) 14. Wang, X., Li, W., Wu, Z.: CarDD: a new dataset for vision-based car damage detection. IEEE Trans. Intell. Transp. Syst. 24(7), 7202–7214 (2023) 15. Wang, Y., et al.: Semi-supervised semantic segmentation using unreliable pseudolabels. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, pp. 4238–4247 (2022) 16. Xie, E., Wang, W., Yu, Z., Anandkumar, A., Álvarez, J.M., Luo, P.: SegFormer: simple and efficient design for semantic segmentation with transformers. In: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, pp. 12077–12090 (2021) 17. Yang, L., Qi, L., Feng, L., Zhang, W., Shi, Y.: Revisiting weak-to-strong consistency in semi-supervised semantic segmentation. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, pp. 7236–7246 (2023) 18. Zhang, W., et al.: Automatic car damage assessment system: reading and understanding videos as professional insurance inspectors. In: The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, pp. 13646–13647 (2020)

Two-Stage Graph Convolutional Networks for Relation Extraction Zhiqiang Wang1,2(B) , Yiping Yang1 , and Junjie Ma3 1

3

Institute of Automation, Chinese Academy of Sciences, Beijing, China {wangzhiqiang2019,yiping.yang}@ia.ac.cn 2 University of Chinese Academy of Sciences, Beijing, China Department of Computer Science and Technology, Tsinghua University, Beijing, China [email protected]

Abstract. The purpose of relation extraction is to extract semantic relationships between entities in sentences, which can be seen as a classification task. In recent years, the use of graph neural networks to handle relation extraction tasks has become increasingly popular. However, most existing graph-based methods have the following problems: 1) they cannot fully utilize dependency relation information; 2) there is no consistent criterion for pruning dependency trees. To address these issues, we propose a two-stage graph convolutional networks for relation extraction. In the first stage of the model, the node representation, dependency relation type representation and dependency type weight jointly generate new node representations, fully utilizing the dependency relation information. In the second stage, with the help of the adjacency matrix derived from the dependency tree, the graph convolution operation is performed. In this way, the model can automatically complete the pruning operation. We evaluated our proposed method on two public datasets, and the results show that our model outperforms previous studies in terms of F1 score and achieves the best performance. Further ablation experiments also confirm the effectiveness of each component in our proposed model.

Keywords: Relation extraction networks

1

· Knowledge graph · Graph neural

Introduction

The goal of relation extraction (RE) is to extract entities from given text and identify the semantic relationships between them, forming relation triples in the form of (subject, relation, object). It plays a significant role in many downstream tasks in the field of natural language processing (NLP), including information extraction [6], knowledge graph construction [14], user profiling [16], sentiment analysis [2], automatic question answering [10] and so on. Therefore, it has gained tremendous interest in recent years. c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1969, pp. 483–494, 2024. https://doi.org/10.1007/978-981-99-8184-7_37

484

Z. Wang et al.

In recent years, several studies have been dedicated to addressing the problem of relation triple extraction. Looking at the timeline, people have mainly gone through three stages in solving the relation extraction task. The first stage is feature-based methods [6]. In these methods, a large number of words, grammatical, and semantic features are combined with machine learning algorithms. However, feature selection becomes crucial in these methods, and different feature combinations can have a significant impact on the model’s performance. The second stage is kernel-based methods [18]. These methods specify a similarity measure between two given samples. Unfortunately, the design of the kernel function has a significant influence on the performance of these methods. More recently, neural network-based methods have achieved great success in relation extraction tasks [9], which has attracted increasing attention from researchers. Many scholars have shifted from traditional research approaches to deep learning approaches. In these methods, models such as graph convolutional networks have played an important role and have been widely developed in recent years. For example, Xu et al. [15] proposed a method called the shortest dependency path, which integrates two segmented LSTM models to handle relation extraction tasks. Guo et al. [4] directly adopt graph convolutional neural networks for relation extraction. Tian et al. [13] have further considered the information on the edges between nodes. In the process of constructing the adjacency matrix A, they have employed a multi-head attention mechanism. The purpose of this operation is to convert the 0 and 1 in the adjacency matrix into real numbers between 0 and 1. However, all these methods share some common shortcomings: 1) When constructing the adjacency matrix based on dependency relations, they only consider whether there is a dependency relationship between two words but do not consider the specific type of dependency relationship. 2) They do not consider the varying impact of different types of dependency relationships. 3) Different pruning operations are performed at different levels during the use of dependency relationships, but there is no unified standard for these pruning operations. To address the aforementioned issues, in this paper, we propose a two-stage graph convolutional neural network (TS-GCN) for relation extraction. In our approach, given an input sentence, we first perform dependency analysis to obtain the dependency tree. While constructing the adjacency matrix A, we also construct a dependency relation type matrix, denoted as D, which labels the dependency relation types between words. To track the impact of different dependency relations on the semantics of the sentence, we construct a corresponding dependency type weight matrix, denoted as W. In the first stage of the model, the node representation, dependency relation type representation and dependency type weight jointly generate new node representations, fully utilizing the dependency relation type information. In the second stage of the model, with the help of the adjacency matrix A, the graph convolution operation is performed. In this way, the model automatically completes the pruning operation by the joint action of the dependency relation type matrix D and the dependency type weight matrix W. We conduct experiments on two public datasets,

Two-Stage Graph Convolutional Networks for Relation Extraction

485

SemEval 2010 Task-8 and PubMed, and the results show that our proposed model achieves improvements of 0.71% and 10.8% on F1-scores, respectively, compared to the current state-of-the-art models. Further experiments demonstrate the effectiveness of our proposed method. The contributions of this paper are as follows: – We incorporate dependency relation types and dependency type weights, making full use of the information provided by dependency syntactic analysis. – We divide the operations of the graph neural network into two stages, enabling more comprehensive information interaction between nodes. – We conduct extensive experiments on public datasets, achieving state-of-theart results and demonstrating the effectiveness of our proposed method. This paper is structured as follows. In Sect. 2, we briefly review the background of this work. In the Sect. 3, we provide a detailed explanation of the proposed method. The experimental settings and results are described in Sect. 4. The last Sect. 5 concludes the paper.

2

Related Work

AGGCN is an earlier work that proposes graph convolutional networks for relation extraction [4]. However, it does not distinguish between the types of dependencies, that is, all dependencies are treated equally, which makes the performance of the model not high. Tian et al. [13] pay attention to the type of dependency. When calculating the adjacency matrix A, the traditional A is replaced by the result deduced by the type and two dependency words, at the same time, a generalized attention mechanism is used. In the generated dependency matrix, four different pruning strategies are verified, they are ’Local’, ’Global’, ’Full’ and ’Local+Global’. But when calculating ’Global’, The shortest path strategy is adopted, which is to find the shortest dependent path between two entities. It should be noted that when solving the shortest dependency path, whether using BFS algorithm or DFS algorithm, both are recursive strategies, which can cause significant computational cost, especially when sentence length increases, the computational efficiency will significantly decrease. Xu et al. [15] also adopt a combination of shortest path and LSTM model, they believe that when the shortest path between two entities is found, the skeleton of a sentence is obtained. As long as the backbone of the sentence is extracted, it is crucial for sentence analysis and entity relation extraction. The skeleton of a sentence is divided into two parts by the root word, and the authors use an LSTM model for each part. This is the first time using the shortest path. But similarly, for long sentence, to find the shortest path between two entities is a costly task. Especially, sometimes entities are not just individual words. Zhang et al. [19] argue that pruning the dependency tree too harshly can lead to the deletion of negative meanings in sentence and incomplete expression of information, which damages the robustness of the model. Therefore, they propose three pruning strategies. Yu et al. [17] also suggest that manual pruning strategies may lead

486

Z. Wang et al.

to the omission of useful information. For each word in an entity, other method [1] first extract two types of dependency information related to it, considering two types of dependency information: one is “intra entity” dependency, and the other is “cross entity” dependency, which is the dependency path between two entities. The shortest dependency path is also used here. Our work is based on previous research. Unlike previous work, our method considers both the words in the sentence and the types of dependency relationships between words. In order to maximize the preservation of the semantic information contained in the sentence, we did not adopt pruning operations.

3

Our Proposed Approach

In this section, we will first introduce the task definition, followed by a detailed description of the proposed TS-GCN model. The overall architecture of our model is illustrated in Fig. 1. 3.1

Task Definition

Given a pre-defined relation set R = {r1 , r2 , · · · , rm } contains m types, and a sentence s = {x1 , x2 , · · · , xn } composed of n words. Note that the entities in the sentence have been labeled. The goal of the task is to identify the semantic relationship between the two entities and form a triple (subject, relation, object). An entity consists of one or more consecutive words in the sentence. 3.2

BERT Encoder

To obtain the context-aware representation of each word in the input sentence s = {x1 , x2 , · · · , xn }. We choose BERT [3] as our sentence encoder. This process can be expressed using the following formula: {h1 , h2 , · · · , hn } = BERT({x1 , x2 , · · · , xn }

(1)

where hi ∈ Rd is the contextual representation of the ith word in sentenc s and d is the embedding size. 3.3

Two-Stage GCN

Standard Graph Convolutional Networks. The standard graph convolutional networks can be expressed by the following formula: ⎞ ⎛ n    hli = σ ⎝ (2) ai,j Wl hl−1 + bl ⎠ j j=1

where hli and hl−1 denote the contextual representations of the ith word in the i th th l and (l − 1) layers, respectively, Wl and bl are trainable matrix and bias, ai,j denote the entry at the ith row and j th column of the adjacency matrix A and σ denotes activation function, such as ReLU. Graph convolutional networks can be seen as nodes in the lth layer gather information from their neighboring nodes in the (l − 1)th layer to represent themselves.

Two-Stage Graph Convolutional Networks for Relation Extraction

487

Fig. 1. Overview of our model architecture illustrated with an example sentence (the blue and red fonts indicate two entities). (a) represents the overall framework of the model, (b) denotes the dependency analysis results of the sentence, we omitted the dependency relations here, and (c) refers to the details of the two-stage graph convolutional networks. (Color figure online)

Adjacency Matrix. Given a sentence s = {x1 , x2 , · · · , xn } with n words, we first use the existing natural language processing toolkit CoreNLP1 to perform dependency parsing, which identifies the dependency relationships between words, as shown in Fig. 1(b). Based on the generated dependency tree, we can form an adjacency matrix A = (ai,j )n×n , where the entry ai,j = aj,i = 1 if there is a relationship between words xi and xj ; otherwise, ai,j = aj,i = 0. Note that the matrix A is symmetric. Dependency Relation Type Matrix. Because the adjacency matrix A can only reflect whether there is an dependency relationship between two words but cannot capture other information such as dependency relation type, so we construct a corresponding matrix D for representing dependency relation types. In this matrix, each entry is an dependency relation type, such as ’nsubj’, ’amod’ 1

https://stanfordnlp.github.io/CoreNLP/.

488

Z. Wang et al.

and so on. In order to enable interaction between the dependency relation information and the node representations, we embed each dependency relation type to Rd , the same dimens with word embedding. Dependency Relation Weight Matrix. To explicitly capture the varying importance of each dependency relation type in representing information within a sentence, we construct a matrix W that corresponds to D, called dependency relation weight matrix. It serves as a weighting mechanism for the dependency relation types, controlling the interaction between the dependency relation information and the node representations. Similar to D, we also embed each of its elements to d-dimensional space. First-Stage GCN. The graph convolution operation in the first stage primarily focuses on the interaction between node information and dependency relation information, as shown in Fig. 1(c). Since each word in the sentence can have dependency relationships with other words, the representation of node i can be extended to an n × d matrix. Specifically, if there exists a dependency relation d between node i and node j, the j th row of node i can be computed using the following formula: ˜ l−1 = hl−1 + wi→j  di→j (3) h i,j i Similarly, for node j, its ith row can be computed using the following formula: ˜ l−1 = hl−1 + wj→i  dj→i h j,i j

(4)

l−1 denote the ith and j th node representation in the (l−1)th layer, where hl−1 i , hj respectively, wi→j , wj→i , di→j and dj→i denote the embedding of relation d and weight w, respectively,  denotes the element-wise multiplication. Through this operation, the model fully leverages both node information and dependency relation information.

Second-Stage GCN. The second stage is primarily dedicated to performing convolutional operation, as illustrated in Fig. 1(c). This process can be expressed using the following formula: ⎞ ⎛ n

 (l) ˜ (l−1) + b(l) ⎠ (5) ai,j W(l) h hi = σ ⎝ j j=1

the relevant parameters are similar to Eq. 2. 3.4

Relation Classification

Entities and Sentence Representation. To obtain the entity representation, we first obtain the span of two entities in the sentence, and then we use the

Two-Stage Graph Convolutional Networks for Relation Extraction

489

maximum pooling operation to obtain the vector representation of the entity, as shown below: (6) hek = MaxPooling ({hi | xi ∈ ek }) where ek is the k th entity and k ∈ {1, 2}. In the same way, we can obtain the representation of the sentence, as shown below: / ek }) (7) hsent = MaxPooling ({hi | xi ∈ Relation Extraction. Following [19], we concatenate the sentence representation and entity representations to get the final representation for classification. ho = he1 ⊕ hsent ⊕ he2

(8)

y = Wo ho + bo

(9)

where Wo ∈ R(3×d)×|R| and bo ∈ R|R| are trainable weight matrix and bias vector, respectively. In the relation extraction stage, we use feed-forward neural network and then use softmax operation to select the label corresponding to the value with the largest output probability as the predicted relation, as shown below: exp (yr ) rˆ = arg max |R| r r i=1 exp (y )

(10)

where |R| denotes the size of relation set and yr denotes the rth element of y, the result rˆ indicates that these two entities have the rth relation in the relation set R. 3.5

Loss Function

In the training phase, we optimize the following objective J(θ) = −

N |R| 1  r p log yir N i=1 r=1 i

(11)

where N denotes the number of training examples, |R| denotes the size of relation set, p denotes the golden distribution, y indicates the predicted distribution, pr and yr represent the rth element in p and y, respectively.

4

Experiments

In this section, we will design a series of experiments to verify the effectiveness of our proposed model.

490

4.1

Z. Wang et al.

Datasets

We evaluate the performance of our method on two public datasets: SemEval 2010 Task 8 [5]and PubMed dataset [4]. The SemEval dataset contains 10717 samples on 19 relations in total, including 8000 training samples, 2717 test samples. The PubMed contains 6087 samples and there are five relationship types. But in our experiment, like previous research, we consider two classifications, i.e. the category of ’None’ is regarded as a “neg” class, while other categories are regarded as “pos” class. Note that this dataset does not give the partition of training set and test set. Therefore, in our experiment, we use 10-fold cross validation to carry out the experiment, and take the average of the experimental results as the final result. 4.2

Evaluation Metrics

For the SemEval dataset, we use the officially given evaluation metric2 , i.e. macro-Precision, macro-Recall and macro-F1 score. For the PubMed dataset, in order to compare with previous work easily, we use standard micro-Precision, micro-Recall and micro-F1 score as our evaluation indicators. 4.3

Implementation Details

In our experiment, we use Adam [7] as our optimizer to train the model and bertbase-uncased as our encoder. For the parameters in BERT, we take values from 1e-5 to 1e-4. For other parameters, we take values from 1e-3 to 1e-4. Dropout is set to 0.1, and the total number of training epochs is set to 40. For the SemEval dataset, the batchsize is set to 16. For the PubMed dataset, the batchsize is set to 8. All experiments are conducted with an NVIDIA GeForce RTX 3090 GPU. 4.4

Main Results

For each dataset, we conduct experiments using one, two and three convolutional layers, respectively. The experimental results on both datasets are shown in Table 1. Table 1. Precision (%), Recall (%) and F1-score (%) on both the SemEval and PubMed dataset

2

Model

SemEval PubMed Precision Recall F1-score Precision Recall F1-score

1 convolutional layer

90.17

88.36

89.21

95.81

95.71

2 convolutional layers 90.65

89.17

89.87

96.42

96.38 96.4

95.75

3 convolutional layers 89.38

89.22

89.39

96.32

96.2

96.25

http://semeval2.fbk.eu/scorers/task08/SemEval2010_task8_scorer-v1.2.zip.

Two-Stage Graph Convolutional Networks for Relation Extraction

491

It can be observed that having more convolutional layers does not necessarily lead to better performance. For example, in our experiments, when the number of convolutional layers is two, we achieve the highest precision and F1-score on the SemEval dataset, while the recall is different and reaches its maximum with three convolutional layers. For the PubMed dataset, the highest precision, recall and F1-score are all obtained with two convolutional layers. Furthermore, for the SemEval dataset, from the F1-score perspective, using a model with two convolutional layers outperforms models with one layer and three layers by 0.66% and 0.48%, respectively. The performance difference between one and two convolutional layes is particularly evident in the PubMed dataset. Additionally, we observe that there is variation in model performance between the two different datasets, which is expected since binary classification is generally considered easier compared to multi-class classification. 4.5

Comparison with Other SOTA

We also compared our results with the previous state-of-the-art (SOTA) models, and the results are shown in Table 2. As we can see from the table, our model exceeds all previous methods on both the two datasets. Specifically, for the SemEval dataset, our method is 0.71% higher than the previous SOTA model A-GCN [13], and our model has greater advantages than AGGCN [4], which is 4% higher. For PubMed dataset, compared with the previous SOTA model AGGCN, our model has reached 96.4% in the same task, exceeding AGGCN by 10%. This fully demonstrates the effectiveness of our proposed model. Table 2. Comparison (%) with other baselines. Note that the experimental results of all baselines are derived from the original literatures. Models

SemEval PubMed

1

Peng et al. [8]



76.5

2

Peng et al. [8]



76.7

3

Song et al. [12]



83.6

4

Zhang et al. [19] 84.8

83.7

5

Guo et al. [4]

85.7

85.6

6

Xu et al. [15]

83.7



7

Soares et al. [11] 89.5



8

Yu et al. [17]

86.4



9

Tian et al. [13]

89.16

10 TS-GCN (Ours) 89.87

– 96.4

492

Z. Wang et al.

4.6

Ablation Study

In order to further explore the importance of each component module of our proposed model, we conduct ablation study on two datasets and the experimental results are shown in Table 3. Table 3. Results (%) of ablation study on F1-score. Model

SemEval PubMed

1 baseline

89.87

2 w/o W

88.56

96.40 94.57

3 w/o D & D 86.78

91.89

It can be seen that when we consider the dependency relation type matrix D but do not consider the dependency relation weight matrix W, the F1 score decreases by 1.31% and 1.83% on the two datasets, respectively. When we neither consider W nor D, the F1 score decreases even more, on the two datasets, it reaches 3.09% and 4.51%, respectively. This further demonstrates the effectiveness of each module in the model. 4.7

Extraction Speed

In order to explore the effect of sentence length on extraction performance, we compare our method with the method of adding the shortest dependency path [15] for extraction speed, and the results are shown in the Fig. 2.

Fig. 2. Comparison of extraction speed. Under the condition of different sentence lengths, the ratio of the average extraction time between our model and the variant with the shortest dependent path.

It can be observed from the figure that as the length of the sentence increases, the use of the shortest dependency path method becomes slower. This is because the shortest dependency path is obtained through recursive operations, which require high computational overhead on the computer. Our method does not involve recursive operations. Therefore, for long sentences, our method does not exhibit significant changes in speed.

Two-Stage Graph Convolutional Networks for Relation Extraction

5

493

Conclusion

In this paper, we propose a two-stage graph convolutional network that utilizes dependency relationship type information and dependency relationship type weights for relation triple extraction. Our approach integrates dependency information, node information, and weight information. We evaluate our proposed method on two public datasets, and the experimental results on two benchmark datasets demonstrate the effectiveness of our method and all its components.

References 1. Chen, G., Tian, Y., Song, Y., Wan, X.: Relation extraction with type-aware map memories of word dependencies. In: Zong, C., Xia, F., Li, W., Navigli, R. (eds.) Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, Online Event, 1–6 August 2021. Findings of ACL, vol. ACL/IJCNLP 2021, pp. 2501–2512. Association for Computational Linguistics (2021). https://doi.org/10. 18653/v1/2021.findings-acl.221 2. Denecke, K.: Sentiment Analysis in the Medical Domain. Springer (2023). https:// doi.org/10.1007/978-3-031-30187-2 3. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, 2–7 June 2019, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/n19-1423 4. Guo, Z., Zhang, Y., Lu, W.: Attention guided graph convolutional networks for relation extraction. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 241–251 (2019) 5. Hendrickx, I., et al.: Semeval-2010 task 8: Multi-way classification of semantic relations between pairs of nominals. In: Erk, K., Strapparava, C. (eds.) Proceedings of the 5th International Workshop on Semantic Evaluation, SemEval@ACL 2010, Uppsala University, Uppsala, Sweden, 15–16 July 2010, pp. 33–38. The Association for Computer Linguistics (2010). https://aclanthology.org/S10-1006/ 6. Kambhatla, N.: Combining lexical, syntactic, and semantic features with maximum entropy models for information extraction. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, 21– 26 July 2004 - Poster and Demonstration. ACL (2004). https://aclanthology.org/ P04-3022/ 7. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Bengio, Y., LeCun, Y. (eds.) 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, 7–9 May 2015, Conference Track Proceedings (2015) 8. Peng, N., Poon, H., Quirk, C., Toutanova, K., Yih, W.T.: Cross-sentence n-ary relation extraction with graph lstms. Trans. Assoc. Comput. Ling. 5, 101–115 (2017) 9. Pérez-Pérez, M., Ferreira, T., Igrejas, G., Fdez-Riverola, F.: A deep learning relation extraction approach to support a biomedical semi-automatic curation task: the case of the gluten bibliome. Expert Syst. Appl. 195, 116616 (2022). https:// doi.org/10.1016/j.eswa.2022.116616

494

Z. Wang et al.

10. Sheng, S., et al.: Human-adversarial visual question answering. Adv. Neural. Inf. Process. Syst. 34, 20346–20359 (2021) 11. Soares, L.B., Fitzgerald, N., Ling, J., Kwiatkowski, T.: Matching the blanks: Distributional similarity for relation learning. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2895–2905 (2019) 12. Song, L., Zhang, Y., Wang, Z., Gildea, D.: N-ary relation extraction using graphstate lstm. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2226–2235 (2018) 13. Tian, Y., Chen, G., Song, Y., Wan, X.: Dependency-driven relation extraction with attentive graph convolutional networks. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 4458–4471 (2021) 14. Wang, X., et al.: Learning intents behind interactions with knowledge graph for recommendation. In: Proceedings of the Web Conference 2021, pp. 878–887 (2021) 15. Xu, Y., Mou, L., Li, G., Chen, Y., Peng, H., Jin, Z.: Classifying relations via long short term memory networks along shortest dependency paths. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1785–1794 (2015) 16. Yan, Q., Zhang, Y., Liu, Q., Wu, S., Wang, L.: Relation-aware heterogeneous graph for user profiling. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management, pp. 3573–3577 (2021) 17. Yu, B., Mengge, X., Zhang, Z., Liu, T., Yubin, W., Wang, B.: Learning to prune dependency trees with rethinking for neural relation extraction. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 3842–3852 (2020) 18. Zelenko, D., Aone, C., Richardella, A.: Kernel methods for relation extraction. J. Mach. Learn. Res. 3, 1083–1106 (2003). http://jmlr.org/papers/v3/zelenko03a. html 19. Zhang, Y., Qi, P., Manning, C.D.: Graph convolution over pruned dependency trees improves relation extraction. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2205–2215 (2018)

Multi-vehicle Platoon Overtaking Using NoisyNet Multi-agent Deep Q-Learning Network Lv He1 , Dongbo Zhang1(B) , Tianmeng Hu2 , and Biao Luo2(B) 1

2

Xiangtan University, Xiangtan 411105, China [email protected] Central South University, Changsha 410083, China [email protected]

Abstract. With the recent advancements in Vehicle-to-Vehicle communication technology, autonomous vehicles are able to connect and collaborate in platoon, minimizing accident risks, costs, and energy consumption. The significant benefits of vehicle platooning have gained increasing attention from the automation and artificial intelligence areas. However, few studies have focused on platoon with overtaking. To address this problem, the NoisyNet multi-agent deep Q-learning algorithm is developed in this paper, which the NoisyNet is employed to improve the exploration of the environment. By considering the factors of overtake, speed, collision, time headway and following vehicles, a domain-tailored reward function is proposed to accomplish safe platoon overtaking with high speed. Finally, simulation results show that the proposed method achieves successfully overtake in various traffic density situations. Keywords: Multi-vehicle platoon · overtake reinforcement learning · mixed traffic

1

· multi-agent

Introduction

In recent years, autonomous vehicles (AVs) and their technologies have received extensive attention worldwide. Autonomous driving has stronger perception and shorter reaction time compared to human driving. There is no human driver behavior such as fatigue driving, and it is safer for long-distance driving. By using advanced Vehicle-to-Vehicle communication technologies, AVs are able to share information with each other and cooperate in dynamic driving tasks. Through sharing information about the environment [1,2], locations, and actions, it will improve the driving safety [3], reduce the traffic congestion [4], and decrease the energy consumption [5]. Multi-vehicle collaboration and overtaking are two important topics for AVs. 1.1

Multi-vehicle Collaboration

The cooperation of multiple vehicles is a promising way to improve traffic efficiency and reduce congestion, and many research works have been reported. By c The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024  B. Luo et al. (Eds.): ICONIP 2023, CCIS 1969, pp. 495–509, 2024. https://doi.org/10.1007/978-981-99-8184-7_38

496

L. He et al.

Fig. 1. Illustration of the considered platoon overtaking traffic scenario. AVs (blue) and HDVs (green) coexist in the straight lane. (Color figure online)

integrating CARLA [6] and SUMO [7], OpenCDA [8,9] was proposed, which supports both cooperative driving automation prototyping and regular autonomous driving components. Based on OpenCDA, a series of multi-vehicle collaboration works focused on various fields were studied, e.g., collaborative perception [1,2,10–12], planning [13], localization, and safety system [14]. Coordinated strategies between autonomous vehicles could improve transportation efficiency and reduce unnecessary waiting time for passengers [15]. So that people have more time to do more meaningful things. Reasonable coordination of multiple vehicles in different road environments can improve traffic safety [16], improve traffic efficiency [17], and reduce fuel consumption [18]. These optimized indexes are of great significance to social traffic operations [19]. Based on the development of autonomous multi-vehicle coordination, in order to improve the transportation capacity of multiple vehicles, and reduce fuel consumption. A lot of useful techniques have been developed, and one of the very useful techniques is platoon [20], which has been studied in great detail. Platoon driving refers to the situation where multiple vehicles coordinate and the rear vehicles follow the front vehicles at a short distance. Reinforcement learning is a powerful method for decision-making, which has been applied to address autonomous driving problems in recent years. Multiagent reinforcement learning algorithms can coordinate agents effectively and explore a large number of potential different environments quickly. It not only enables multi-agent to adapt to the dense and complex dynamic driving environment but also enables multi-agents to make effective collaborative decisions [21]. 1.2

Reinforcement Learning for Overtaking

Overtaking is an important way for AVs to improve driving efficiency, especially in mixed-traffic environments that contain AVs and human-driven vehicles (HDVs). There are some works [22–24] using reinforcement learning to handle single-vehicle overtaking problems. When making overtaking decisions, the agent needs to consider that other vehicles are in the vicinity of the agent and that different vehicles among them are traveling at different speeds. This requires

Multi-vehicle Platoon Overtaking Using NoisyNet-MADQN

497

agents to have multiple abilities to deal with overtaking problems [24]. Experienced human drivers can handle overtaking problems better, [25] used curriculum reinforcement learning to make the agent perform overtaking operations comparable to experienced human drivers. When the platoon cannot travel at a relatively high or expected speed in the traffic flow, it will lead to a reduction in the efficiency of vehicle transportation. The platoon needs to speed up to overtake the slow vehicles in front and reach the destination faster. As shown in Fig. 1, platoon overtaking requires not only close coordination among members of the platoon but also the prevention of collision with other human vehicles around the platoon during the process of overtaking. The reinforcement learning algorithm with the more effective exploration of the environment can realize and improve the platoon’s performance during overtaking. However, to the best of our knowledge, using RL, especially multi-agent RL, for the AV platoon overtaking problem has rarely been studied. It is still an open and challenging problem, which motivates our studies in this paper. Inspired by NoisyNet [26], we added the parameterized factorised Gaussian noise to the linear layer networks weights of the multi-agent deep Q-learning network, which induced stochasticity in the agent’s policy that can be used to aid efficient exploration. The factorised Gaussian noise parameters are learned by gradient descent along with the weights of the remaining networks. The longitudinal vehicle following distance in a platoon cannot be designed to be a fixed value. AVs at the end of a fixed-distance platoon may collide with nearby HDVs in overtaking passes, which will make the platoon much less safe when overtaking. Self-driving platoon faces the possibility of encountering other HDVs in the process of driving and overtaking, and need a suitable safety distance to adjust the self-driving vehicles policy to reduce the risk of collision. With the consideration of the above factors, a domain-tailored customized reward function is designed to achieve high-speed safe platoon overtaking. In order to reduce fluctuations arosen from the vehicle following reward, coefficients are added to the same lane following reward and the following distance interval reward, respectively. The total reward curve is more likely to converge after adding the coefficients, which is extremely helpful for multi-agents to learn stable policy. The contributions of the paper are summarized as follows: – The NoisyNet based multi-agent deep Q network (NoisyNet-MADQN) algorithm is developed for multi-vehicle platoon overtaking. By adding the parameterized factorised Gaussian noise to the linear layer networks weights of the multi-agent deep Q-learning network, which induced stochasticity in the agent’s policy that improves the exploration efficiency. The parameters of the factorised Gaussian noise are learned with gradient descent along with the remaining network weights. To reduce this computational overload, we select factorised Gaussian noise, which reduces the computational time for generating random numbers in the NoisyNet-MADQN algorithm. – By considering the factors of overtaking, speed, collision, time headway and vehicle following, the domain-tailored customized platoon overtaking is designed. The safety distance is designed in the vehicle-following reward to

498

L. He et al.

reduce the risk of collisions between the platoon and nearby HDVs while straight driving and overtaking. It is able to reduce collision rate of the AVs in the safety distance by the reward adaptively adjusting the distance they maintain from the AV in front. The rest of the paper is organized as follows. In Sect. 2, the preliminary works of RL and the NoisyNet are presented. Section 3 presents the reward function’s design and the platoon overtaking algorithm. Section 4 presents the experiments and results. Finally, Sect. 5 concludes our work.

2 2.1

Problem Formulation Preliminary of Reinforcement Learning

In reinforcement learning, the agent’s goal is to learn the optimal policy π ∗ that  maximizes the cumulative future rewards Rt = Tk=0 γ k rt+k , where t is the time step, rt+k is the reward at time step t + k, and γ ∈ (0, 1] is the discount factor that quantifies the relative importance of future rewards. At time step t, the agent observes the state st ∈ S ⊆ Rn , selects an action at ∈ A ⊆ Rm , and receives a reward signal rt ⊆ R. n represents the total number of agents, i ∈ n. Action Space. An agent’s action space Ai is defined as a set of high-level control decisions. Decision-making behaviors include turning left, turning right, idling, speeding up, and slowing down. State Space. The state of agent i, S i , is defined as a matrix of dimension N Ni × m, where N Ni is the number of observed vehicles and m is the number of features. Ispresent is a binary variable that indicates whether there are other observable vehicles in the vicinity of 150 m from the ego vehicle. x is the observed longitudinal position of the vehicle relative to the ego vehicle. y represents the lateral position of the observed vehicle relative to the ego vehicle. vx and vy represent the longitudinal and lateral speeds of the observed vehicle relative to the ego vehicle, respectively. In the highway simulator, we assume that the ego vehicle can only obtain information about neighboring vehicles within 150 m of the longitudinal distance of the ego vehicle. In the considered two-lane scenario (see Fig. 1), the neighboring vehicle is located in the lane and its neighboring lanes closest to the ego vehicle, with the 2th AV as the ego vehicle and its neighboring vehicles as the 1th AV, 3th AV, 6th HDV, and 7th HDV. Reward Distribution. In this paper, the NoisyNet multi-agent deep Qlearning is developed. As a multi-agent algorithm, since the vehicles in the platoon are the same type of vehicles, we assume that all the agents share the same network structure and parameters. Our algorithm aims to maximize the overall

Multi-vehicle Platoon Overtaking Using NoisyNet-MADQN

499

reward. To solve the communication overhead and credit assignment problems [27], we use the following local reward design [28]. So, the reward for the ith agent at time t is defined as: 1  ri,t = rj,t , (1) |Vi | j∈Vi

where |Vi | denotes the cardinality of a set containing the ego vehicle and its close neighbors. This reward design includes only the rewards of the agents most relevant to the success or failure of the task [29]. 2.2

Multi-agent Reinforcement Learning (MARL) and NoisyNets

This subsection will focus on the NoisyNets and MARL. In the NoisyNet, its neural network weights and biases are perturbed by a function of noise parameters. These parameters are adjusted according to gradient descent [26]. They assume that y = fθ (x) is a neural network parameterized by a vector of noise parameters θ that accepts input x and output y. In our experiments, we assume that there are n AVs in the experimental environment. Each AV represents an agent, the ith AV represents the ith agent, i ∈ n. xi represents the observed state of the ith agent, yi represents the action of the ith agent. The noise parameter θi def def is denoted as θi = μi +Σi εi , ζi = (μi , Σi ) is a set of learnable parameter vectors, εi is a zero-mean noise vector with fixed statistics, and  denotes element multiplication. The loss of the neural network is wrapped by the expectation of ¯ i ) def = E[L(θi )]. Then, the set of parameters ζi is optimized. the noise εi : L(ζ Consider the linear layers of the neural networks with p inputs and q outputs in these experiments, represented by yi = wi xi + bi ,

(2)

where xi ∈ Rp are the layers inputs, wi ∈ Rq×p the weight matrix, and bi ∈ Rq the bias. The corresponding noisy linear layers are defined as: def

wi wi bi bi bi i yi = (μw i + σi  εi )xi + μi + σi  εi ,

(3)

wi wi i and μbi i + σibi  εbi i replace correspondingly wi and bi where μw i + σ i  εi i in Eq.(2). The parameters μw ∈ Rq×p , μbi i ∈ Rq , σiwi ∈ Rq×p , σibi ∈ Rq , are i wi q×p and εbi i ∈ Rq are noise random variables. Deeplearnable whereas εi ∈ R Mind introduced two types of Gaussian noise: independent Gaussian noise and factorised Gaussian noise. The computation overhead for generating random numbers in the algorithm is particularly prohibitive in the case of single-thread agents. To reduce the computation overhead for generating random numbers in the multi-agent deep Q-learning network, we selected factorised Gaussian noise. We factorize εw j,k , use p unit Gaussian variables εj for the noise of the inputs b and q unit Gaussian variables εk for the noise of the outputs. Each εw j,k and εk can then be written as: (4) εw j,k = f (εj )f (εk ),


ε_k^b = f(ε_k),    (5)

where f is a real-valued function; in this experiment we use f(x) = sgn(x)·√|x|. Since the loss of the noisy networks, L̄_i(ζ_i) = E[L_i(θ_i)], is an expectation over the noise, its gradient can be obtained directly from

∇L̄_i(ζ_i) = ∇E[L_i(θ_i)] = E[∇_{μ_i,Σ_i} L(μ_i + Σ_i ⊙ ε_i)].    (6)

Using a Monte Carlo approximation of this gradient, with a sample ξ_i drawn at each optimization step,

∇L̄_i(ζ_i) ≈ ∇_{μ_i,Σ_i} L(μ_i + Σ_i ⊙ ξ_i).    (7)
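For concreteness, the following PyTorch-style sketch implements a linear layer with factorised Gaussian noise consistent with Eqs. (2)–(5). The class name is ours, and the initialisation constants (e.g. sigma_0 = 0.5 and the 1/√p scaling) follow the common NoisyNet convention [26] rather than values stated in this paper.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F


class NoisyLinear(nn.Module):
    """Linear layer with factorised Gaussian noise (Eqs. (3)-(5))."""

    def __init__(self, in_features, out_features, sigma_0=0.5):
        super().__init__()
        self.in_features, self.out_features = in_features, out_features
        # Learnable means and standard deviations for weights and biases.
        self.mu_w = nn.Parameter(torch.empty(out_features, in_features))
        self.sigma_w = nn.Parameter(torch.empty(out_features, in_features))
        self.mu_b = nn.Parameter(torch.empty(out_features))
        self.sigma_b = nn.Parameter(torch.empty(out_features))
        # Noise buffers are resampled, not trained.
        self.register_buffer("eps_w", torch.zeros(out_features, in_features))
        self.register_buffer("eps_b", torch.zeros(out_features))
        bound = 1.0 / math.sqrt(in_features)
        self.mu_w.data.uniform_(-bound, bound)
        self.mu_b.data.uniform_(-bound, bound)
        self.sigma_w.data.fill_(sigma_0 / math.sqrt(in_features))
        self.sigma_b.data.fill_(sigma_0 / math.sqrt(in_features))
        self.reset_noise()

    @staticmethod
    def _f(x):
        return x.sign() * x.abs().sqrt()       # f(x) = sgn(x) * sqrt(|x|)

    def reset_noise(self):
        """Factorised noise from p input and q output unit Gaussians (Eqs. (4)-(5))."""
        eps_in = self._f(torch.randn(self.in_features))
        eps_out = self._f(torch.randn(self.out_features))
        self.eps_w.copy_(torch.outer(eps_out, eps_in))
        self.eps_b.copy_(eps_out)

    def forward(self, x):
        w = self.mu_w + self.sigma_w * self.eps_w
        b = self.mu_b + self.sigma_b * self.eps_b
        return F.linear(x, w, b)
```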

In this work we no longer use ε-greedy exploration; the policy greedily optimizes the (randomised) action-value function. The fully connected layers of the value network are replaced by noisy layers, whose parameters are drawn from the noisy-network parameter distribution after each replay step. Before each action the noise is resampled, so that every action step of the algorithm can be optimized. When the linear layers are replaced by noisy layers, the parameterized action-value function Q(s_i, a_i, ε_i; ζ_i) and the target action-value function Q(s_i, a_i, ε'_i; ζ_i^-) can be regarded as random variables. The outer expectation is taken with respect to the distribution of the noise variables ε for the noisy value function Q(s_i, a_i, ε_i; ζ_i) and the noise variables ε' for the noisy target value function Q(s_i, a_i, ε'_i; ζ_i^-). The NoisyNet-MADQN loss is therefore

L̄_i(ζ_i) = E[ E_{(s,a,r,s_{t+1})∼D} [ r + γ max_{a'} Q(s_i, a', ε'_i; ζ_i^-) − Q(s_i, a_i, ε_i; ζ_i) ]^2 ].    (8)
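A minimal sketch of the per-agent loss of Eq. (8), assuming online and target Q-networks whose noisy layers expose a reset_noise() method (as in the layer sketched above); the batch layout and function name are illustrative.

```python
import torch
import torch.nn.functional as F


def madqn_loss(q_net, target_net, batch, gamma=0.99):
    """TD loss for one agent with noisy online and target networks (Eq. (8))."""
    s, a, r, s_next, done = batch              # tensors for one agent's minibatch
    q_net.reset_noise()                        # sample epsilon for the online network
    target_net.reset_noise()                   # sample epsilon' for the target network
    q_sa = q_net(s).gather(1, a.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        target = r + gamma * (1.0 - done) * target_net(s_next).max(dim=1).values
    return F.mse_loss(q_sa, target)
```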

3 NoisyNet-MADQN for Platoon Overtaking

3.1 Reward Function Design

This subsection proposes a novel reward function for reinforcement learning algorithms to implement platoon overtaking. Reward functions are crucial for reinforcement learning models: by designing the reward function, we can guide the learning of the RL agents toward our purpose.

The Overtake and Speed Evaluation. Driven by the reward, the vehicles will choose to drive at high speed, which improves efficiency and allows more vehicles to reach their destinations faster. When the speed of the front HDVs is lower than that of the AVs, the platoon leader will increase its speed and overtake the slower vehicles ahead to obtain more reward for completing the overtaking behavior. The other AVs in the platoon will also overtake the slow HDVs in front of them because of the following reward and the speed reward, and they follow the leader closely to overtake as a platoon. The speed reward can therefore also be seen as an overtaking reward.


We therefore define the overtaking and speed reward for the ego vehicle as

r_{os} = (v_t − v_{min}) / (v_{max} − v_{min}),    (9)

where v_t, v_{min} = 20 m/s, and v_{max} = 30 m/s are the current, minimum, and maximum speeds of the ego vehicle, respectively.

The Collision Penalty Design. Safety is the most critical factor in autonomous driving: if a collision occurs, the collision evaluation r_c is set to −1; if there is no collision, r_c is set to 0. The collision evaluation is defined as

r_c = { −1, collision;  0, safety }.    (10)

The Time Headway Evaluation. The time headway evaluation is defined as

r_h = log( d_{headway} / (t_h v_t) ),    (11)

where d_{headway} is the distance headway and t_h is a predefined time-headway threshold. As such, the ego vehicle is penalized when the time headway is less than t_h and rewarded only when the time headway is greater than t_h. In this paper we choose t_h = 1.2 s, as suggested in [30].

The Vehicles Following Evaluation. To keep the AVs together in the platoon, the vehicle-following evaluation is defined as

r_f = { 0.3 k_1 |C[i+1].p[0] − C[i].p[0]| / (t_h v_{max}),   if |C[i+1].p[0] − C[i].p[0]| ≤ 2 v_{max};
        0.7 k_2,                                             if C[i].p[1] = C[i+1].p[1],    (12)

where C[i] denotes the i-th AV, and p[0] and p[1] denote the longitudinal and lateral coordinates of the AV, respectively. When the longitudinal distance between an AV and the vehicle ahead of it in the platoon is kept within 60 m at time t, the reward obtained is 0.3 k_1 |C[i+1].p[0] − C[i].p[0]| / (t_h v_{max}). Based on the time headway, we define the danger distance as the range covered when a vehicle travels in a straight line at maximum speed for up to 1.2 s, and the safety distance as the range covered between 1.2 s and 2 s at maximum speed, as shown in Fig. 2. When AVs are at the safety distance they receive more reward; when the spacing falls into the danger distance, the closer two AVs are to each other, the less reward they receive, and they are additionally penalized by the time headway evaluation. The situations encountered by the self-driving platoon fall into two types: straight driving and overtaking. HDVs can affect the safety of the platoon in both.


In Fig. 2, when the 7th HDV cuts in between the 2nd AV and the 3rd AV, a fixed following distance kept by the rear AVs in order to follow the front AVs when the platoon leader accelerates would increase the risk of collision between the 3rd AV and the 7th HDV. Likewise, when the platoon leader overtakes the 6th HDV to obtain a greater overtake and speed reward, the AVs behind it follow closely; the 2nd AV and the 6th HDV are then very close to each other, and if the following distance were a constant value they would have a very high chance of colliding. For these reasons we designed the safety distance to reduce the following risk when overtaking: the AVs can automatically adjust their following distance through training, and after training the platoon vehicles choose safer and more effective actions when faced with such situations.

Fig. 2. Illustration of the considered platoon overtaking traffic scenario. AVs (blue) and HDVs (green) coexist in the straight lane. The long green bar and the long red bar represent the safety distance and the danger distance respectively (Color figure online)

When a rear AV in the platoon follows the AV ahead of it in the same lane, the reward obtained at time t is 0.7. The reward for keeping the AVs in the same lane is set larger than the reward for keeping the safety distance within the platoon, which ensures that when the platoon leader changes lanes to overtake, the following AVs also change lanes in time. In our training an episode has 100 steps, and the accumulated total reward fluctuates too much, which is not conducive to the agent obtaining a stable policy. We therefore add the weight k_1 to the reward for keeping the distance to the vehicle ahead and the weight k_2 to the reward for staying in the same lane as the vehicle ahead in the platoon; k_1 and k_2 are set to 0.25 and 0.3, respectively. The fluctuation of the total reward becomes smaller after adding the weights, making it easier for the agent to obtain a stable policy. Making k_2 slightly larger than k_1 guides the vehicles behind the platoon leader to keep up with the leader when it overtakes, while the smaller k_1 lets the AVs exploit the safety distance to avoid collisions when the platoon encounters nearby HDVs while overtaking.
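The vehicle-following evaluation of Eq. (12) could be computed along the following lines. The constants k_1 = 0.25, k_2 = 0.3, t_h = 1.2 s and v_max = 30 m/s are taken from the text; treating the distance term and the same-lane term as additive is our reading of Eq. (12), and the function name is ours.

```python
def following_reward(follower, leader, k1=0.25, k2=0.3, t_h=1.2, v_max=30.0):
    """Vehicle-following evaluation (Eq. (12)) for one follower/leader pair.

    follower and leader are (x, y) positions of consecutive AVs in the platoon;
    the 2*v_max gap corresponds to the 60 m threshold mentioned above.
    """
    gap = abs(leader[0] - follower[0])         # longitudinal spacing
    reward = 0.0
    if gap <= 2.0 * v_max:                     # within the 60 m following window
        reward += 0.3 * k1 * gap / (t_h * v_max)
    if follower[1] == leader[1]:               # same lane as the preceding AV
        reward += 0.7 * k2
    return reward
```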


Total Reward. The reward function r_i is essential for training multiple agents to behave as we desire. Since our goal is to keep the platoon together while overtaking other vehicles, the reward for the i-th agent at time step t is defined as

r_{i,t} = w_c r_c + w_{os} r_{os} + w_h r_h + w_f r_f,    (13)

where w_c, w_{os}, w_h, and w_f are positive weight scalars corresponding to the collision evaluation r_c, the overtake and speed evaluation r_{os}, the time headway evaluation r_h, and the vehicle-following evaluation r_f, respectively. Since safety is the most important criterion, w_c is made heavier than the others; w_f is second only to w_c and higher than the remaining two weights. The coefficients w_c, w_{os}, w_h, and w_f are set to 200, 1, 4, and 5.
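Putting the pieces together, a minimal sketch of the reward components of Eqs. (9)–(11) and the weighted total reward of Eq. (13), using the constants stated above; function names are illustrative.

```python
import math


def overtake_speed_reward(v, v_min=20.0, v_max=30.0):
    """Overtake and speed evaluation r_os (Eq. (9))."""
    return (v - v_min) / (v_max - v_min)


def collision_evaluation(collided):
    """Collision evaluation r_c (Eq. (10))."""
    return -1.0 if collided else 0.0


def headway_evaluation(d_headway, v, t_h=1.2):
    """Time headway evaluation r_h (Eq. (11)); assumes d_headway > 0."""
    return math.log(d_headway / (t_h * v))


def total_reward(r_c, r_os, r_h, r_f, w_c=200.0, w_os=1.0, w_h=4.0, w_f=5.0):
    """Weighted per-agent reward r_{i,t} of Eq. (13) with the weights stated above."""
    return w_c * r_c + w_os * r_os + w_h * r_h + w_f * r_f
```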

3.2 Noisy Network Multi-Agent Deep Q-Learning Network

[Figure 3: network diagram. The inputs is_present, x_i, y_i, vx_i, and vy_i each pass through an FC(64) layer, are combined in a shared MADQN FC(128) layer with noisy parameters, and output the greedy actions a_i = argmax Q_i.]

Fig. 3. The structure of the proposed NoisyNet-MADQN network, i = 1, 2, ..., N. The numbers in parentheses indicate the sizes of the layers.

The leader of the platoon needs to choose the optimal policy π* while maintaining the platoon with its followers during overtaking, which improves the safety and efficiency of driving. In the platoon overtaking scenario, we add parameterized factorised Gaussian noise to the linear-layer weights of the multi-agent deep Q-learning network; the factorised Gaussian noise parameters are learned by gradient descent together with the remaining network weights. The computational overhead of generating random numbers is especially prohibitive for single-thread agents, so we select factorised Gaussian noise, which reduces this overhead in the multi-agent deep Q-learning network. Figure 3 shows the network structure of our algorithm, in which states separated by physical units are first processed by separate 64-neuron fully connected (FC) layers; all hidden units are then combined and fed into the 128-neuron FC layer.
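An illustrative PyTorch sketch of a Q-network with this layout: one 64-unit FC branch per physical input group, a shared 128-unit FC layer, and a noisy output head built from the NoisyLinear layer sketched earlier. The branch grouping by feature column, the ReLU activations, and the default sizes are our reading of Fig. 3 rather than published implementation details.

```python
import torch
import torch.nn as nn


class PlatoonQNetwork(nn.Module):
    """Illustrative NoisyNet-MADQN value network following the layout of Fig. 3."""

    def __init__(self, n_vehicles=7, n_actions=5, n_feature_groups=5):
        super().__init__()
        # One branch per feature column (is_present, x, y, vx, vy).
        self.branches = nn.ModuleList(
            [nn.Sequential(nn.Linear(n_vehicles, 64), nn.ReLU())
             for _ in range(n_feature_groups)]
        )
        self.shared = nn.Sequential(nn.Linear(64 * n_feature_groups, 128), nn.ReLU())
        self.head = NoisyLinear(128, n_actions)   # noisy layer from the earlier sketch

    def reset_noise(self):
        self.head.reset_noise()

    def forward(self, obs):
        # obs: (batch, n_vehicles, n_feature_groups); split by feature column.
        parts = [branch(obs[:, :, i]) for i, branch in enumerate(self.branches)]
        return self.head(self.shared(torch.cat(parts, dim=1)))
```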


With this design, each agent improves its exploration of the environment. For the same level of traffic, our algorithm can obtain a larger optimal Q-value function than the original algorithm and hence a better policy; the higher platoon rewards also show that it outperforms the baseline in platoon overtaking. Algorithm 1 gives the detailed procedure of NoisyNet-MADQN.

Algorithm 1. NoisyNet-MADQN
1: Initialize replay buffers D_1, ..., D_n with capacities N_1, ..., N_n, action-value functions Q_1, ..., Q_n and target action-value functions Q̂_1, ..., Q̂_n, the set ε of noise random variables of the network, initial network parameters ζ, initial target network parameters ζ^-, training batch size N_T, and target network replacement frequency N^-
2: for episode = 1, ..., M do
3:   Initialize states s_0^(1), ..., s_0^(n) ∼ Env
4:   for t = 1, ..., T do
5:     Set s^(1), ..., s^(n) ← s_0^(1), ..., s_0^(n)
6:     Sample a noisy network ξ_1 ∼ ε_1, ..., ξ_n ∼ ε_n
7:     Select a_t^(1) = argmax_a Q_1(s, a, ξ_1; ζ_1), ..., a_t^(n) = argmax_a Q_n(s, a, ξ_n; ζ_n)
8:     Execute the joint action (a_t^(1), ..., a_t^(n)) and observe the rewards (r_t^(1), ..., r_t^(n)) and the new states (s_{t+1}^(1), ..., s_{t+1}^(n))
9:     Store the transitions (s_t^(1), a_t^(1), r_t^(1), s_{t+1}^(1)), ..., (s_t^(n), a_t^(n), r_t^(n), s_{t+1}^(n)) in D_1, ..., D_n
10:    Sample minibatches of N_T transitions (s_j^(i), a_j^(i), r_j^(i), s_{j+1}^(i)) ∼ D_i for each agent i, j = 1, ..., N_T
11:    Sample the noise variables for the online networks ξ_1 ∼ ε_1, ..., ξ_n ∼ ε_n
12:    Sample the noise variables for the target networks ξ'_1 ∼ ε_1, ..., ξ'_n ∼ ε_n
13:    for j ∈ {1, ..., N_T} do
14:      if s_j is a terminal state then
15:        Q̂ ← r_j
16:      else
17:        Q̂ ← r_j + γ max_a Q(s_{j+1}, a, ξ'; ζ^-)
18:      end if
19:      Perform a gradient descent step on (Q̂_1 − Q_1(s_j^(1), a_j^(1), ξ_1; ζ_1))^2, ..., (Q̂_n − Q_n(s_j^(n), a_j^(n), ξ_n; ζ_n))^2
20:    end for
21:    if t ≡ 0 (mod N^-) then
22:      Update the target networks: ζ_1^- = ζ_1, ..., ζ_n^- = ζ_n
23:    end if
24:  end for
25: end for
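A compact Python sketch of one environment step of Algorithm 1, written per agent for clarity. It reuses the hypothetical madqn_loss and noisy Q-networks sketched earlier; the environment interface, replay buffer methods, and optimiser handling are illustrative assumptions.

```python
def train_step(env_step, q_nets, target_nets, buffers, optimisers,
               batch_size, target_sync_every, step):
    """One step of Algorithm 1 (lines 5-23), per agent; all tensors are torch tensors."""
    states = env_step.current_states()                        # s^(1..n)
    for q in q_nets:
        q.reset_noise()                                        # resample xi before acting
    actions = [int(q(s.unsqueeze(0)).argmax(dim=1))            # greedy w.r.t. noisy Q
               for q, s in zip(q_nets, states)]
    rewards, next_states = env_step.apply(actions)             # joint action execution
    for i, buf in enumerate(buffers):
        buf.push(states[i], actions[i], rewards[i], next_states[i])
    for q, tq, buf, opt in zip(q_nets, target_nets, buffers, optimisers):
        if len(buf) < batch_size:
            continue
        loss = madqn_loss(q, tq, buf.sample(batch_size))       # Eq. (8)
        opt.zero_grad()
        loss.backward()
        opt.step()
        if step % target_sync_every == 0:
            tq.load_state_dict(q.state_dict())                 # zeta^- <- zeta
    return rewards
```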

4 Experiments and Discussion

In this section, the effectiveness of the proposed method is verified by simulation. We give some implementation details of the experiments, evaluate the training performance of the proposed MARL algorithm for overtaking in the road scenario shown in Fig. 1, and discuss the experimental results. For cost and feasibility reasons, the experiments are conducted in a simulator: we use an open-source simulator built on highway-env [31] and modify it as needed. The simulator models the driving environment and vehicle sensors; the vehicles appear at random positions on the highway with initial speeds of 20–30 m/s and take random actions.

4.1 Experimental Settings

To fully demonstrate the effectiveness of the proposed method, three traffic density levels are used for evaluation, corresponding to low, middle, and high levels of traffic congestion. We train the POMARL algorithm for 200 episodes with two different random seeds; the same random seed is shared among agents. The experiments were performed on an Ubuntu server with a 2.7 GHz Intel Core i5 processor and 16 GB of RAM. The numbers of vehicles in the different traffic modes are shown in Table 1 (an illustrative simulator configuration is sketched after the table).

Table 1. Traffic density modes.

Density  AVs  HDVs  Explanation
1        4    1–2   low level
2        4    2–3   middle level
3        4    3–4   high level
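For readers who wish to reproduce a comparable setup, the following configuration sketch instantiates a highway-env [31] scenario roughly matching the middle density level of Table 1. The key names follow the public highway-env configuration schema; the exact values and the multi-agent modifications used in the authors' customised simulator are not published here, so this is only an approximation.

```python
import gymnasium as gym
import highway_env  # registers the highway environments on import
                    # (some versions require highway_env.register_highway_envs())

# Roughly the middle density level of Table 1: 4 AVs and 2-3 surrounding HDVs
# on a two-lane highway, with the kinematic features used in the state space.
config = {
    "lanes_count": 2,
    "controlled_vehicles": 4,      # the AV platoon
    "vehicles_count": 3,           # surrounding HDVs
    "duration": 100,               # steps per episode, as described above
    "observation": {
        "type": "Kinematics",
        "features": ["presence", "x", "y", "vx", "vy"],
        "absolute": False,         # positions and speeds relative to the ego vehicle
    },
    "action": {"type": "DiscreteMetaAction"},
}

env = gym.make("highway-v0")
env.unwrapped.configure(config)
obs, info = env.reset()
```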

Figure 4 compares our algorithm with the baseline algorithm and shows that NoisyNet-MADQN performs better than MADQN. We plot the rewards obtained by the two algorithms under the low, middle, and high traffic levels for a comprehensive curve comparison. We also tabulate these rewards by dividing the 200 episodes into five intervals and reporting the average of each interval. As Table 2 shows, NoisyNet-MADQN obtains higher rewards than MADQN most of the time. Figure 5 shows snapshots of platoon overtaking at the low, middle, and high levels, respectively, demonstrating that our method can be applied to platoon overtaking. The reward comparison between NoisyNet-MADQN and the baseline indicates that the proposed algorithm obtains better results in platoon overtaking, with improvements in driving efficiency, safety, and platoon coordination.
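The interval averages reported in Table 2 can be reproduced from per-episode rewards with a few lines of NumPy; the function name is ours.

```python
import numpy as np


def interval_means(episode_rewards, n_intervals=5):
    """Split the 200 episode rewards into equal intervals and average each,
    reproducing the layout of Table 2 for one method and density level."""
    chunks = np.array_split(np.asarray(episode_rewards, dtype=float), n_intervals)
    return [float(c.mean()) for c in chunks]
```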


Fig. 4. Evaluation curves during training with different algorithms for different traffic levels.

Table 2. Reward comparison.

Density       Method            1–40     41–80   81–120  121–160  161–200
low level     NoisyNet-MADQN   −14.31    28.11    36.03    44.32    52.51
              MADQN             −5.02    23.47    28.02    27.95    33.48
middle level  NoisyNet-MADQN    −6.13    12.33    34.38    45.61    50.53
              MADQN            −15.75    28.51    19.77    17.48    29.01
high level    NoisyNet-MADQN    −4.48    19.28    33.52    51.58    52.34
              MADQN            −19.13    28.62    26.60    31.72    31.29

Fig. 5. Platoon overtaking in the simulation environment

5 Conclusions

This paper proposes the NoisyNet multi-agent deep Q-learning network (NoisyNet-MADQN). Because the computational overhead of generating random numbers is particularly prohibitive for single-thread agents, we select factorised Gaussian noise to reduce this overhead in the multi-agent deep Q-learning network. By adding parameterized factorised Gaussian noise to the linear-layer weights, the induced randomness of the agent's policy supports effective exploration, and the noise parameters are learned by gradient descent along with the remaining network weights. To demonstrate the effectiveness of the proposed algorithm, it is compared with the baseline on platoon overtaking tasks. By considering overtaking, speed, collision, time headway, and vehicle-following factors, a domain-tailored reward function is proposed to accomplish safe, high-speed platoon overtaking. The safety distance in the vehicle-following evaluation allows the vehicles in the platoon to adjust their following distance to avoid collisions when nearby HDVs cut into the platoon, and allows the rear vehicles to avoid collisions with nearby HDVs when following the overtaking vehicle. Comparisons with the existing baseline algorithm at three traffic densities show that our method performs better than the baseline and achieves reasonable results.

References 1. Xu, R., Tu, Z., Xiang, H., Shao, W., Zhou, B., Ma, J.: Cobevt: cooperative bird’s eye view semantic segmentation with sparse transformers. arXiv preprint arXiv:2207.02202 (2022) 2. Chen, W., Xu, R., Xiang, H., Liu, L., Ma, J.: Model-agnostic multi-agent perception framework. arXiv preprint arXiv:2203.13168 (2022) 3. Javed, M.A., Hamida, E.B.: On the interrelation of security, qos, and safety in cooperative its. IEEE Trans. Intell. Transp. Syst. 18(7), 1943–1957 (2017) 4. Wang, Y., Sarkar, E., Li, W., Maniatakos, M., Jabari, S.E.: Stop-and-go: exploring backdoor attacks on deep reinforcement learning-based traffic congestion control systems. IEEE Trans. Inf. Forens. Secur. 16, 4772–4787 (2021) 5. Zhang, Y., Ai, Z., Chen, J., You, T., Du, C., Deng, L.: Energy-saving optimization and control of autonomous electric vehicles with considering multiconstraints. IEEE Trans. Cybernet. 52(10), 10869–10881 (2022) 6. Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V.: Carla: an open urban driving simulator. In: Conference on Robot Learning, pp. 1–16. PMLR (2017) 7. Behrisch, M., Bieker, L., Erdmann, J., Krajzewicz, D.: Sumo-simulation of urban mobility: an overview. In: Proceedings of SIMUL 2011, the Third International Conference on Advances in System Simulation, ThinkMind (2011) 8. Xu, R., Guo, Y., Han, X., Xia, X., Xiang, H., Ma, J.: OpenCDA: an open cooperative driving automation framework integrated with co-simulation. In: 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), pp. 1155– 1162. IEEE (2021)


9. Xu, R., et al.: The opencda open-source ecosystem for cooperative driving automation research. arXiv preprint arXiv:2301.07325 (2023) 10. Xu, R., Xiang, H., Xia, X., Han, X., Li, J., Ma, J.: Opv2v: an open benchmark dataset and fusion pipeline for perception with vehicle-to-vehicle communication. In: 2022 International Conference on Robotics and Automation (ICRA), pp. 2583– 2589. IEEE (2022) 11. Xu, R., Xiang, H., Tu, Z., Xia, X., Yang, M.-H., Ma, J.: V2X-ViT: vehicle-toeverything cooperative perception with vision transformer. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, 23–27 October 2022, Proceedings, Part XXXIX, pp. 107–124. Springer, Cham (2022). https://doi.org/10.1007/978-3-03119842-7_7 12. Xu, R., Li, J., Dong, X., Yu, H., Ma, J.: Bridging the domain gap for multi-agent perception. arXiv preprint arXiv:2210.08451 (2022) 13. Han, X., et al.: Strategic and tactical decision-making for cooperative vehicle platooning with organized behavior on multi-lane highways. Transp. Res. Part C Emerg. Technol. 145, 103952 (2022) 14. Xiang, H., Xu, R., Xia, X., Zheng, Z., Zhou, B., Ma, J.: V2xp-asg: generating adversarial scenes for vehicle-to-everything perception. arXiv preprint arXiv:2209.13679 (2022) 15. Chu, K.-F., Lam, A.Y.S., Li, V.O.K.: Joint rebalancing and vehicle-to-grid coordination for autonomous vehicle public transportation system. IEEE Trans. Intell. Transp. Syst. 23(7), 7156–7169 (2022) 16. Wang, F., Chen, Y.: A novel hierarchical flocking control framework for connected and automated vehicles. IEEE Trans. Intell. Transp. Syst. 22(8), 4801–4812 (2021) 17. Goulet, N., Ayalew, B.: Distributed maneuver planning with connected and automated vehicles for boosting traffic efficiency. IEEE Trans. Intell. Transp. Syst. 23(8), 10887–10901 (2022) 18. Validi, A., Olaverri-Monreal, C.: Simulation-based impact of connected vehicles in platooning mode on travel time, emissions and fuel consumption. IEEE Intell. Veh. Symp. (IV) 2021, 1150–1155 (2021) 19. Zhang, Y., Hao, R., Zhang, T., Chang, X., Xie, Z., Zhang, Q.: A trajectory optimization-based intersection coordination framework for cooperative autonomous vehicles. IEEE Trans. Intell. Transp. Syst. 23(9), 14674–14688 (2022) 20. Sturm, T., Krupitzer, C., Segata, M., Becker, C.: A taxonomy of optimization factors for platooning. IEEE Trans. Intell. Transp. Syst. 22(10), 6097–6114 (2021) 21. Bai, Z., Hao, P., ShangGuan, W., Cai, B., Barth, M.J.: Hybrid reinforcement learning-based eco-driving strategy for connected and automated vehicles at signalized intersections. IEEE Trans. Intell. Transp. Syst. 23(9), 15850–15863 (2022) 22. Yu, Y., Lu, C., Yang, L., Li, Z., Hu, F., Gong, J.: Hierarchical reinforcement learning combined with motion primitives for automated overtaking. IEEE Intell. Veh. Symp. (IV) 2020, 1–6 (2020) 23. Kaushik, M., Prasad, V., Krishna, K.M., Ravindran, B.: Overtaking maneuvers in simulated highway driving using deep reinforcement learning. IEEE Intell. Veh. Symp. (IV) 2018, 1885–1890 (2018) 24. Ngai, D.C.K., Yung, N.H.C.: A multiple-goal reinforcement learning method for complex vehicle overtaking maneuvers. IEEE Trans. Intell. Transp. Syst. 12(2), 509–522 (2011) 25. Song, Y., Lin, H., Kaufmann, E., Dürr, P., Scaramuzza, D.: Autonomous overtaking in gran turismo sport using curriculum reinforcement learning. IEEE Int. Conf. Robot. Automat. 2021, 9403–9409 (2021)


26. Fortunato, M., et al.: Noisy networks for exploration. In: International Conference on Learning Representations (2018) 27. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press (2018) 28. Chen, D., Li, Z., Wang, Y., Jiang, L., Wang, Y.: Deep multi-agent reinforcement learning for highway on-ramp merging in mixed traffic. arXiv preprint arXiv:2105.05701 (2021) 29. ElSayed-Aly, I., Bharadwaj, S., Amato, C., Ehlers, R., Topcu, U., Feng, L.: Safe multi-agent reinforcement learning via shielding. arXiv preprint arXiv:2101.11196 (2021) 30. Ayres, T., Li, L., Schleuning, D., Young, D.: Preferred time-headway of highway drivers. In: 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No. 01TH8585) (ITSC 2001), pp. 826–829. IEEE (2001) 31. Leurent, E.: An environment for autonomous driving decision-making (2018). https://github.com/eleurent/highway-env

Correction to: Multiclass Classification and Defect Detection of Steel Tube Using Modified YOLO
Deepti Raj Gurrammagari, Prabadevi Boopathy, Thippa Reddy Gadekallu, Surbhi Bhatia Khan, and Mohammed Saraee

Correction to: Chapter 32 in: B. Luo et al. (Eds.): Neural Information Processing, CCIS 1969, https://doi.org/10.1007/978-981-99-8184-7_32

In an older version of this paper, the names of the first two authors were displayed incorrectly. The initials “G” and “B” respectively were in the wrong position. This has been corrected.

The updated version of this chapter can be found at https://doi.org/10.1007/978-981-99-8184-7_32

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
B. Luo et al. (Eds.): ICONIP 2023, CCIS 1969, p. C1, 2024. https://doi.org/10.1007/978-981-99-8184-7_39

Author Index

A Almubairik, Norah A. 16 An, Yu 125 Angsarawanee, Thanatwit 468

B Bhatia Khan, Surbhi 417 Bhattacharya, Shaubhik 204 Boopathy, Prabadevi 417

C Cai, Jingwen 41 Chetprayoon, Panumate 468

D Dai, Feifei 283 De Silva, Devin Y. 444 Deelaka, Pathirage N. 444 Deng, Qi 67 Deng, Yanfang 112 Dong, Bin 3 Dong, Ruihai 125 Du, Haiwen 125 Du, Honghui 125 Du, Yuliang 429 Duan, Jianyong 367

E El-Alfy, El-Sayed M. 16

F Fan, Chenyou 216 Fan, Cong 429 Fan, Haihui 283 Fang, Sen 41 Fang, Wenbo 340 Fukumoto, Fumiyo 84

G Gadekallu, Thippa Reddy 417 Gao, Ao 313 Gao, Bowen 41 Gao, Zhengxin 367 Gebresilasie, Danait Weldu 456 Ghebremedhin, Tesfana Tekeste 456 Gong, Aiyue 84 Gu, Xiaoyan 283 Guan, Yong 380 Gui, Mei-Jiang 28 Guo, Shanqing 152 Gurrammagari, Deepti Raj 417
H Haile, Robel Estifanos 456 Hailu, Aaron Afewerki 456 Hao, Pengyi 405 Hayleslassie, Abiel Tesfamichael 456 He, Junjiang 340 He, Li 367 He, Lv 495 Hou, Zeng-Guang 28 Hu, Chengyu 152 Hu, Liaojie 112 Hu, Qifu 67 Hu, Tianmeng 495 Hu, Ziyi 177 Huang, Aimin 216 Huang, De-Xing 28 Huang, Kaizhu 3, 140 Huang, Shanshan 54 Huang, Xiaowei 140 Hussain, Amir 3
J Ji, Junkai 254 Jia, Qiong 152 Jiang, Tao 340 Jin, Congyun 429 Ju, Zheng 125


K Kang, Wei 355 Kapoor, Pulkit 230 Kongsamlit, Sawarod 204

L Lawlor, Aonghus 125 Li, Bo 283 Li, Hangcheng 98 Li, Hao 28 Li, Jinkun 325 Li, Jiyi 84 Li, Min 325 Li, Rengang 67 Li, Ruyang 67 Li, Shaohua 165 Li, Tao 340 Li, Weimin 165 Li, Wenshan 340 Li, Yujiao 429 Li, Zhixin 112 Lian, Song 152 Lian, Zhichao 243 Liang, Guanbao 405 Liu, Dong 283 Liu, Guangqing 67 Liu, Hui 297 Liu, Jie 367 Liu, Shi-Qi 28 Luo, Biao 495 Luo, Xudong 266 Luvembe, Alex Munyole Lv, Chongyi 243

Q Qian, Zhaojie 405 Qiao, Yu 380 Qin, Piqi 429 Qin, Xiaolin 98 Qiu, Yihui 393 R Rasnayaka, Sanka

O Onoe, Naoyuki

191

230

444

S Saha, Sriparna 204, 230 Sakdejayont, Theerat 468 Saraee, Mohammed 417 Shao, Huiru 140 Shao, Zhenzhou 380 Shi, Zheyu 98 Shi, Zhiping 380 Singh, Brijraj 230 Singh, Siddharth 230 Song, Yiping 177 Sun, Pengfei 380 Sun, Tao 429 165

M Ma, Junjie 483 Ma, ShiLun 355 Madthing, Bukorree 468 Maity, Krishanu 204 Mao, Ying 98 Meedeniya, Dulani 444 Mondal, Prabir 230 Muangkammuen, Panitan 84 N Nazir, Zhumakhan

P Pan, Shuqi 112 Pang, Tianqi 216 Park, Jurn-Gyu 191 Pasupa, Kitsuchart 204, 468 Phosit, Salisa 204

T Tang, Cheng 254 Tang, Cong 340 Tang, Yajiao 254 Taspan, Nonthapaht 468 Tedla, Yemane Keleta 456 Teoh, Teik Toe 41 Tian, Yuan 380 Tian, Zhaoshuo 125 Tong, Weiqin 165 W Wan, Yan 313 Wang, Hao 367 Wang, Haowen 429 Wang, Lishun 98 Wang, Qi 254 Wang, Qiufeng 140


Wang, Shuang 405 Wang, Wei 140 Wang, Yingbo 429 Wang, Zhiqiang 483 Wen, YiMin 355 Wickramanayake, Sandareka Wu, Yangjian 41 X Xiang, Tian-Yu 28 Xiao, Di 325 Xie, Xiao-Liang 28 Xue, Yun 355 Y Yan, Yuyao 3 Yang, Binxia 266 Yang, Chaolong 3 Yang, Junan 297 Yang, Tengxiao 152 Yang, Xi 3 Yang, Yiping 483 Yang, Yuechen 367


Yao, Li 313 Yarovenko, Vladislav Ye, Jianan 3 Yu, Dongjin 84 Yu, Yaoyao 393 444 Z Zhai, Minghui 283 Zhang, Canlong 112 Zhang, Dongbo 495 Zhang, Haiyan 254 Zhang, Hayilang 367 Zhang, Hengyi 380 Zhang, Qing 367 Zhang, Xinrui 266 Zhang, Zhi 297 Zhao, Weiguang 3 Zhao, Yaqian 67 Zhong, Yong 98 Zhou, Xiao-Hu 28 Zhu, Dongjie 125 Zhu, Junlin 266

191