Pattern Recognition: ICPR International Workshops and Challenges: Virtual Event, January 10–15, 2021, Proceedings, Part IV [1 ed.] 3030687988, 9783030687984

This 8-volumes set constitutes the refereed of the 25th International Conference on Pattern Recognition Workshops, ICPR

160 4 21MB

English Pages 754 [749] Year 2021

Report DMCA / Copyright

DOWNLOAD PDF FILE

Table of contents :
Foreword by General Chairs
Preface
Challenges
ICPR Organization
Contents – Part IV
FGVRID - Fine-Grained Visual Recognition and re-Identification
Workshop on Fine-Grained Visual Recognition and Re-identification
Workshop Description
Organization
General Chairs
Program Committee
Densely Annotated Photorealistic Virtual Dataset Generation for Abnormal Event Detection
1 Introduction
2 Related Work
2.1 Related Tools for Synthetic Data Generation
2.2 Related Synthetic Data
2.3 Real-World Datasets for Abnormal Crowd Event Detection
3 Dataset Creation and Description
3.1 Dataset Creation: Interacting with the Virtual World
3.2 Dataset Description
4 Unsupervised Abnormal Event Detection Using GTA V
4.1 Cumulative Sum Method for Abnormal Event Detection
4.2 Experimental Results: Abnormal Event Detection
5 Conclusions
References
Unsupervised Domain Adaptive Re-Identification with Feature Adversarial Learning and Self-similarity Clustering
1 Introduction
2 Related Work
2.1 Person and Vehicle Re-ID
2.2 Unsupervised Domain Adaptation
2.3 Unsupervised Domain Adaptation Methods for Re-ID
3 Methodology
3.1 Problem Definition
3.2 Feature Extractor Pre-training
3.3 Domain-Invariant Feature Adversarial Learning
3.4 Self-similarity Clustering
3.5 Relabeling Algorithm
3.6 Training Procedure
4 Experiment
4.1 Datasets
4.2 Implementation Details
4.3 Ablation Study
4.4 Comparison with State of the Art
5 Conclusion
References
A Framework for Jointly Training GAN with Person Re-Identification Model
1 Introduction
2 Related Work
2.1 Generative Adversarial Networks
2.2 Person Re-Identification
2.3 GANs for Person Re-Identification
3 Proposed Joint Training Framework
3.1 Joint Training with Identification Loss
3.2 Joint Training with Triplet Loss
4 Experimental Results
4.1 Implementation Details
4.2 Ablation Study
4.3 Comparison to State of the Art
5 Conclusion
References
Interpretable Attention Guided Network for Fine-Grained Visual Classification
1 Introduction
2 Related Work
2.1 Fine-Grained Classification
2.2 Interpretable Neural Networks
3 Method
3.1 Data Augmentation Method
3.2 Interpretable Attention
3.3 Progressive Training Mechanism
4 Experiments
4.1 Implementation Details
4.2 Performance Comparison
4.3 Visualization
5 Conclusion
References
Use of Frequency Domain for Complexity Reduction of Convolutional Neural Networks
1 Introduction
2 Literature Review
3 CNN Computing Based on FFT
4 FFT Based Computation Using Splitting
4.1 Proposed Method
4.2 Computational Complexity and Memory Access Analysis
5 Experimental Results
6 Conclusion
References
From Coarse to Fine: Hierarchical Structure-Aware Video Summarization
1 Introduction
2 Related Work
3 Proposed Approach
3.1 Hierarchical Summarization Network
3.2 Reward Function
4 Experiments
4.1 Datasets
4.2 Video Summarization
5 Conclusion
References
ADNet: Temporal Anomaly Detection in Surveillance Videos
1 Introduction
2 Related Works
3 Method
3.1 Anomaly Detection Network
3.2 Temporal Sliding Window
3.3 Loss
3.4 Implementation Details
4 Experiments
4.1 Evaluation Metrics
4.2 Dataset
4.3 Results
5 Conclusion
References
Soft Pseudo-labeling Semi-Supervised Learning Applied to Fine-Grained Visual Classification
1 Introduction
2 Related Work
2.1 Fine-Grained Visual Classification
2.2 Semi-Supervised Learning
2.3 Semi-Supervised FGVC
3 Problem Formulation
3.1 Pseudo-label
3.2 Entropy Minimization
4 Experimental Results
4.1 Datasets
4.2 Comparison with SSL Methods
References
HCAU 2020 - The First International Workshop on Deep Learning for Human-Centric Activity Understanding
Workshop on Deep Learning for Human-Centric Activity Understanding
Workshop Description
Organization
Program Chairs
Program Committee
Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection
1 Introduction
2 Related Work
2.1 Action Recognition and Detection
2.2 Visual Relational Reasoning
3 Learning Context with GCN
3.1 Feature Extraction with Weak Supervision
3.2 Graph Convolutional Networks
3.3 Implementation Details
4 Experiments
4.1 Dataset and Evaluation Metric
4.2 Evaluation of Architecture Choices
4.3 Comparison with Baseline and State of the Art
4.4 Reducing Annotation to One Bounding Box
5 Analysis of Attention
5.1 Evaluation of Attention Maps
6 Conclusion
References
Social Modeling Meets Virtual Reality: An Immersive Implication
1 Introduction
2 Crowd Analysis
3 Crowd and Virtual Reality
4 Experiemental Evaluation
5 Conclusion
References
Pickpocketing Recognition in Still Images
1 Introduction
2 Background
3 Methodology
3.1 Dataset Preparation
3.2 Design
4 Experiments
5 Conclusion
References
t-EVA: Time-Efficient t-SNE Video Annotation
1 Introduction
2 Related Work
3 t-EVA for Efficient Video Annotation
3.1 How to Annotate?
4 Experiments
4.1 Datasets
4.2 Evaluation Metrics
4.3 Implementation Details
4.4 Results on ActivityNet
4.5 Generalization
5 Ablation Study
5.1 Dimensionality Reduction
5.2 t-SNE Parameters
5.3 2D-3D Comparison
6 Conclusion
References
Vision-Based Fall Detection Using Body Geometry
1 Introduction
2 Related Work
3 Proposed Method
3.1 Down-Sampling the Videos
3.2 Body and Head Manual Annotations
3.3 Feature Extraction
3.4 Classification
4 Experimental Results and Discussion
4.1 Experimental Setup
4.2 Datasets
4.3 Experiment Results
5 Conclusion and Future Directions
References
Comparative Analysis of CNN-Based Spatiotemporal Reasoning in Videos
1 Introduction
2 Related Work
3 Methodology
3.1 ST modeling Architecture
3.2 Multi-layer Perceptron (MLP) Based Techniques
3.3 Recurrent Neural Networks (RNN) Based Techniques
3.4 Fully Convolutional Network (FCN) Based Techniques
3.5 Training Details
4 Experiments
4.1 Datasets
4.2 Resource Efficiency Analysis
4.3 Results Using Jester Dataset
4.4 Results Using Something-Something Dataset
5 Conclusion
References
Generalization of Fitness Exercise Recognition from Doppler Measurements by Domain-Adaption and Few-Shot Learning
1 Introduction
2 Related Works
3 Ultrasonic Smartphone Exercises: Sensing and Database
3.1 Sensing Principle
3.2 Database
4 Methods
4.1 The Baseline Method
4.2 Our Proposed Method 1: Domain Adaptation
4.3 Our Proposed Method 2: Few-Shot Classification
5 Evaluation and Discussion
5.1 Baseline Results
5.2 Our Proposed Approaches
5.3 Discussion
6 Conclusion
References
Local Anomaly Detection in Videos Using Object-Centric Adversarial Learning
1 Introduction
2 Related Work
3 Proposed Method
3.1 Object Detection and Gradient Extraction
3.2 Gradient-Appearance Relation Discovery Network (GARDiN)
3.3 Partial Mean Squared Reconstruction Errors (PMSRE)
3.4 Adversarial Classification of the PMSRE
3.5 Abnormal Events Detection
4 Experiments
4.1 Datasets and Evaluation Procedure
4.2 Experimental Setup
4.3 Results
4.4 Ablation Study
4.5 Inference Running Time
5 Conclusion
References
A Hierarchical Framework for Motion Trajectory Forecasting Based on Modality Sampling
1 Introduction
2 Related Work
3 Methodology
3.1 Transformation Set Construction
3.2 Encoder
3.3 Decoder
4 Experimental Results
4.1 Multi-modality Evaluation on Synthetic Data
4.2 Trajectory Prediction Evaluation
5 Conclusions
References
Skeleton-Based Methods for Speaker Action Classification on Lecture Videos
1 Introduction
2 Background
2.1 Vision-Based Human Pose Estimator
2.2 Action Classification
2.3 Lecture Video Analysis
3 Methodology
3.1 Action Segment Extraction
3.2 Speaker Pose Estimation
3.3 Speaker Pose Normalization
3.4 Action Segment Representation
3.5 Speaker Action Classification
4 Datasets
4.1 AccessMath
4.2 Extended Dataset
5 Experiments
5.1 Pose Estimator Selection
5.2 Adaptations of the 2S-AGCN Model for Speaker Action Classification
5.3 Classifier: AM Pose Vs 2S-AGCN Model
6 Conclusion
References
IADS - Integrated Artificial Intelligence In Data Science
Workshop on Integrated Artificial Intelligence in Data Science (IADS)
Workshop Description
Organization
IADS Chairs
Program Committee
Fake Review Classification Using Supervised Machine Learning
1 Introduction and Background
1.1 Problem Statement
1.2 Research Questions
1.3 Aims and Objectives
1.4 Research Contributions
2 Literature Review
3 Methodology
3.1 Data Acquisition
3.2 Reducing Noise
3.3 Implementation of Supervised Machine Learning Classifier
3.4 Applying Other Machine Learning Classifiers
3.5 How the Proposed System Works?
3.6 Comparing Classifiers' Performance
4 Results
4.1 Answer to First Research Question
4.2 Answer to Second Research Question
4.3 Answer to Third Research Question
5 Conclusion and Future Work
References
Defect Detection of Stainless Steel Plates Using Deep Learning Technology
1 Introduction
2 Related Works
2.1 YOLOv3: You Only Look once V3
2.2 SSD: Single Shot MultiBox Detector
3 Stainless Steel Images and Definitions of Defects
3.1 Linear Defects
3.2 Regional Defects
4 Experiment
4.1 Image Capture and Environment Establishment
4.2 Dataset
4.3 YOLOv3 Model Training and SSD 300 Model Training
4.4 Verification of the Classification Accuracy of the YOLOv3 Model
4.5 Verification of the Classification Accuracy of the SSD 300 Model
5 Conclusion
References
Deep Neural Networks for Detecting Real Emotions Using Biofeedback and Voice
1 Introduction
1.1 Paper Goals and Organization
2 Related Works
3 Methodology
3.1 Performing the Regression
3.2 Building the Deep ANN
4 Experiment and Results
4.1 Techniques Used
4.2 Results
5 Conclusion
References
Data Augmentation for a Deep Learning Framework for Ventricular Septal Defect Ultrasound Image Classification
1 Introduction
2 Methods
2.1 Problem Definition
2.2 Data Augmentation Approaches
3 Empirical Results of Data Augmentation
3.1 Comparison of the Regression Models
3.2 Weight Values of Linear Regression Model
3.3 Comparisons of the Three DL Algorithms
4 Empirical Results of All VSD Type for a Larger Problem Size
5 Conclusions
References
A Neural Network Model for Lead Optimization of MMP12 Inhibitors
1 Introduction
2 Lead Optimization Problem
3 Lead Optimization Based on Predictive Neural Network Models
4 Results and Discussion
4.1 Evaluation Metrics
4.2 Dataset
4.3 Results of Lead Optimization Using SMPP
4.4 Results of Lead Optimization Using SPP
5 Conclusion and Future Directions
References
An Empirical Analysis of Integrating Feature Extraction to Automated Machine Learning Pipeline
1 Introduction
2 Reference Tools
2.1 Automated Feature Engineering Tool
2.2 AutoML Frameworks
3 Experimental Setup
3.1 Experiment Design
3.2 Datasets
4 Results and Discussion
5 Conclusion and Future Work
References
Input-Aware Neural Knowledge Tracing Machine
1 Introduction
2 Related Work
2.1 Factor Analysis
2.2 Knowledge Tracing Machine
2.3 DeepFM for Knowledge Tracing
3 Proposed Model
3.1 Encoding and Embedding Layer
3.2 Attention Net
3.3 Representation Refining and Sum Pooling Layer
3.4 Hidden Layers
3.5 Prediction Units
3.6 Optimization
4 Experiments
4.1 Datasets
4.2 Baselines
4.3 Student Performance Prediction
4.4 Ablation Study
5 Conclusion
References
Towards Corner Case Detection by Modeling the Uncertainty of Instance Segmentation Networks
1 Introduction
2 Related Work
3 Mask R-CNN
4 Bayesian Uncertainty Modeling
5 Our Approach for Corner Case Detection
5.1 Uncertainty Estimation
5.2 Corner Case Detection
6 Experimental Setup and Result Analysis
6.1 Dataset
6.2 Qualitative Evaluation
7 Conclusion and Future Improvements
References
Intelligent and Interactive Video Annotation for Instance Segmentation Using Siamese Neural Networks
1 Introduction
2 Related Work
3 Towards Intelligent Annotation
3.1 Base Model
3.2 Interaction Model
3.3 Weighted Binary Cross-Entropy Loss
3.4 Approaches to Incorporate Corrections
4 Evaluation Methodology
5 Evaluation
5.1 Weighted Binary Cross-Entropy Loss
5.2 Intersection over Union
6 Discussion
7 Conclusion and Future Work
References
Imputation of Rainfall Data Using Improved Neural Network Algorithm
1 Introduction
2 Study Region
3 Methodology
3.1 Phase 1: Data Preparation
3.2 Phase 2: Missing Data Imputation
4 Material and Methods
4.1 Meteorological Data
4.2 Rainfall from Nearest Neighbor Stations
4.3 Pre-processing Data Using Principal Component Analysis (PCA)
4.4 Imputation Approaches
4.5 Performance Measures
5 Results and Discussion
6 Conclusions
References
Novelty Based Driver Identification on RR Intervals from ECG Data
1 Introduction
2 Related Work
3 Data
4 Own Approach
4.1 Feature Extraction
4.2 Novelty Detection
4.3 Driver Identification
5 Experimental Evaluation
6 Conclusion and Outlook
References
Link Prediction in Social Networks by Variational Graph Autoencoder and Similarity-Based Methods: A Brief Comparative Analysis
1 Introduction
2 Link Prediction: Problem Description and Related Works
3 Methodology
3.1 Variational Graph Autoencoder
3.2 Similarity-Based Methods
4 Numerical Experiments
4.1 Data Set
4.2 Performance Metrics
4.3 Numerical Results
5 Conclusions
References
A Hybrid Wine Classification Model for Quality Prediction
1 Introduction
2 Background Knowledge
2.1 The SVM Classifier
2.2 Random Forest
3 Proposed Hybrid Wine Classification Model
4 Experimental Evaluation
4.1 Dataset Description
4.2 Performance Measure Metrics
4.3 Experimental Analysis
5 Conclusions and Future Work
References
A PSO-Based Sanitization Process with Multi-thresholds Model
1 Introduction
2 Related Work
3 Background Knowledge
4 Proposed Sanitization Algorithm
5 Experimental Results
5.1 Fitness Value
5.2 Runtime
6 Conclusions
References
Task-Specific Novel Object Characterization
1 Introduction
2 Related Work
3 Method
3.1 Detection of Common Objects
3.2 Task-Specific Taxonomy
3.3 Uncertainty and Relevancy for the Task
3.4 Characterizing Relevant Objects
4 Experiments
4.1 Demo Video on YouTube
4.2 Visualization
4.3 Finding the Relevant Objects
4.4 Ignoring the Irrelevant Objects
4.5 Summary of the Results
5 Conclusions
References
IML - International Workshop on Industrial Machine Learning
Workshop on Industrial Machine Learning (IML)
Workshop Description
Organization
IML Chairs
Program Committee
Deep Learning Based Dimple Segmentation for Quantitative Fractography
1 Introduction
2 Background and Related Work
2.1 Computer Vision in Quantitative Fractography
2.2 Fractography
2.3 Crack Growth
2.4 Microstructure
2.5 Dimple Fracture
3 Methodology
3.1 Previous Approaches
3.2 Proposed Model
4 Evaluation
4.1 Dataset
4.2 Experiments
4.3 Results
5 Conclusion
References
PaDiM: A Patch Distribution Modeling Framework for Anomaly Detection and Localization
1 Introduction
2 Related Work
3 Patch Distribution Modeling
3.1 Embedding Extraction
3.2 Learning of the Normality
3.3 Inference: Computation of the Anomaly Map
4 Experiments
4.1 Datasets and Metrics
4.2 Experimental Setups
5 Results
5.1 Ablative Studies
5.2 Comparison with the State-of-the-art
5.3 Anomaly Localization on a Non-aligned Dataset
5.4 Scalability Gain
6 Conclusion
References
Real-Time Cross-Dataset Quality Production Assessment in Industrial Laser Cutting Machines
1 Introduction
2 Problem Description
3 Related Works
4 Inference Model
4.1 Architectures
4.2 Adaptation
5 Deployment
5.1 Client Server Communication
5.2 Hardware
6 Experiments
6.1 Datasets
6.2 Training Procedure
6.3 Results
7 Discussion
8 Conclusion
References
An Online Deep Learning Based System for Defects Detection in Glass Panels
1 Introduction
2 Related Works
3 Problem Description
3.1 Deltamax Detection System
3.2 CLEVER Glass Dataset
4 Methodology
4.1 Overall System Configuration
4.2 Preprocessing
5 Results
5.1 Architecture Selection
5.2 Channel Shift Analysis
5.3 Discussion
5.4 Qualitative Results: Single-Channel, Multi-channel
6 Conclusion
References
Evaluation of Edge Platforms for Deep Learning in Computer Vision
1 Introduction
2 Related Work
2.1 Object Classification
2.2 Object Detection
2.3 Semantic Segmentation
2.4 Platform Benchmarks
3 Platform Evaluation
3.1 Model Overview
3.2 Platform Overview
3.3 Evaluation Overview
4 Experimental Results
4.1 Classification
4.2 Object Detection
4.3 Semantic Segmentation
4.4 Comparison of Tasks
4.5 Inference Analysis
5 Conclusion
References
BlendTorch: A Real-Time, Adaptive Domain Randomization Library
1 Introduction
1.1 Related Work
1.2 Contributions
2 Design Principles
3 Architecture
4 Experiments
4.1 Datasets
4.2 BlendTorch Dataset
4.3 Training
4.4 Prediction
4.5 Average Precision
4.6 Runtime
5 Conclusion
References
SAFFIRE: System for Autonomous Feature Filtering and Intelligent ROI Estimation
1 Introduction
2 Related Work
3 Materials and Methods
3.1 Training Data and Reference Image
3.2 Feature Extraction
3.3 The Training Graph
3.4 Search of Optimal Path in the Training Graph
3.5 The Anchor
3.6 Selection of the Best Feature Type
3.7 Testing
4 Evaluation Setup
4.1 Evaluation Datasets
4.2 Evaluation Metrics
5 Results
5.1 Noise Patterns and Filtering
5.2 Location Accuracy
5.3 Computation Time
6 Discussions
7 Conclusions
References
Heterogeneous Feature Fusion Based Machine Learning on Shallow-Wide and Heterogeneous-Sparse Industrial Datasets
1 Introduction
2 Background
2.1 Transfer Learning
2.2 Heterogeneous Feature Fusion
3 Method
3.1 Proposed Method
3.2 Baseline Methods
3.3 Evaluation Method
4 Results and Discussion
4.1 Toyota Dataset No. 1
4.2 Toyota Dataset No. 2
4.3 Discussion
5 Conclusion
References
3-D Deep Learning-Based Item Classification for Belt Conveyors Targeting Packaging and Logistics
1 Introduction
2 Background
2.1 Conveyor-Based Item Classification System
2.2 Machine Learning
2.3 Deep Learning
2.4 3-D Deep Learning Models
3 Experimental Setup
3.1 Hardware Setup
3.2 Software Setup
3.3 Items Used
4 Methods
4.1 Dataset Acquisition from Sensor Controller
4.2 Point Cloud Equalization
4.3 Dataset Generation
4.4 Candidate Classification Models
4.5 Model Training Strategy
5 Experimental Results
6 Discussion and Conclusion
References
Development of Fast Refinement Detectors on AI Edge Platforms
1 Introduction
2 Baseline Architecture - RefineDet
3 Combining Light-Weight Backbone CNNs
4 Evaluation Environments
5 Results and Discussions
6 Conclusions and Future Works
References
Selecting Algorithms Without Meta-features
1 Introduction
2 Background
2.1 Semantic Segmentation and Object Detection
2.2 Algorithm Selection (AS)
3 Data Set for Algorithm Selection
3.1 Ideal Algorithm Selection
4 Experiments
4.1 Experiments on VOC2012
4.2 Experiments on MSCOCO
4.3 Dynamic Ensemble Approach
5 Results and Discussion
6 Conclusion
References
A Hybrid Machine Learning Approach for Energy Consumption Prediction in Additive Manufacturing
1 Introduction
2 Literature Review
2.1 Analysis of Energy Consumption in AM Systems
2.2 Machine Learning for Predictive Modelling in AM System
3 Methodology
3.1 Multi-source Data Sensing and Collection
3.2 The Hybrid ML Approach for Energy Consumption Prediction
3.3 Validation of Prediction Model
4 Case Study
4.1 Experimental Setup
4.2 Results and Discussion
5 Conclusions
References
Bias from the Wild Industry 4.0: Are We Really Classifying the Quality or Shotgun Series?
1 Introduction
2 Related Work
3 Dataset
4 Task Definition
5 Classification Model
5.1 CNN Architectures
5.2 Loss Functions
6 Experimental Procedure
6.1 Training Settings
7 Results
7.1 Classification Performance
7.2 Bias Detection
7.3 Bias Mitigation
8 Conclusions
References
Machine Learning for Storage Location Prediction in Industrial High Bay Warehouses
1 Introduction
2 Related Work and Background
3 Methods and Results
3.1 Logistic Data Model
3.2 Retrieving ABC Classes for Training Data
3.3 Extending Feature Set
3.4 Implementation / Decision Tree
4 Evaluation and Discussion
5 Conclusion
References
A Deep Learning-Based Approach for Automatic Leather Classification in Industry 4.0
1 Introduction
2 Related Works
3 Materials and Methods
3.1 LASCC Dataset
3.2 Deep Learning Models
3.3 Performance Evaluation
4 Results and Discussions
5 Conclusions and Future Works
References
Automatic Viewpoint Estimation for Inspection Planning Purposes
1 Introduction
2 Related Work
2.1 Inspection Viewpoint Selection
2.2 Pose from RGB/RGB-D Data
3 Approach
3.1 Data Generation
3.2 Architecture
3.3 Training Process
3.4 Runtime
4 Experiments and Results
4.1 Experimental Setup
4.2 Testing Using Different Objects
4.3 Grayscale Images
4.4 Effect of Size of Bottleneck Layer
4.5 Effect of Cluster Numbers on the Accuracy and Performance
4.6 Reconstructed Images
4.7 Inference Time
4.8 Failure Cases
5 Conclusion
References
Localisation of Defects in Volumetric Computed Tomography Scans of Valuable Wood Logs
1 Introduction
2 Our Approach
2.1 2D Defect Detection
2.2 Multi-defect Tracking
3 Experimental Results
3.1 Data Acquisition and Normalization
3.2 Datasets
3.3 Experimental Setup
3.4 Analysis of the Results
4 Conclusions
References
Image Anomaly Detection by Aggregating Deep Pyramidal Representations
1 Introduction
2 Related Work
3 Proposed Model
3.1 Network Architecture
3.2 Objective and Losses
4 Experimental Results
5 Ablation Study
6 Conclusions
References
Fault Detection in Uni-Directional Tape Production Using Image Processing
1 Introduction
2 Related Work
3 Proposed Method
3.1 Tape Location
3.2 Void Detection
3.3 Void Classification
4 Evaluation and Results
4.1 Tape Location Results
4.2 Void Detection Results
4.3 Void Classification Results
5 Conclusion
References
Author Index

Pattern Recognition: ICPR International Workshops and Challenges: Virtual Event, January 10–15, 2021, Proceedings, Part IV [1 ed.]
 3030687988, 9783030687984

  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Recommend Papers