TU Darmstadt / ULB / TUprints

MultiWave-Net: An Optimized Spatiotemporal Network for Abnormal Action Recognition Using Wavelet-Based Channel Augmentation

Elmasry, Ramez M. ; Abd El Ghany, Mohamed A. ; Salem, Mohammed A.-M. ; Fahmy, Omar M. (2024)
MultiWave-Net: An Optimized Spatiotemporal Network for Abnormal Action Recognition Using Wavelet-Based Channel Augmentation.
In: AI, 2024, 5 (1)
doi: 10.26083/tuprints-00027246
Article, Secondary publication, Publisher's Version

[img] Text
ai-05-00014.pdf
Copyright Information: CC BY 4.0 International - Creative Commons, Attribution.

Download (4MB)
Item Type: Article
Type of entry: Secondary publication
Title: MultiWave-Net: An Optimized Spatiotemporal Network for Abnormal Action Recognition Using Wavelet-Based Channel Augmentation
Language: English
Date: 7 May 2024
Place of Publication: Darmstadt
Year of primary publication: 24 January 2024
Place of primary publication: Basel
Publisher: MDPI
Journal or Publication Title: AI
Volume of the journal: 5
Issue Number: 1
DOI: 10.26083/tuprints-00027246
Corresponding Links:
Origin: Secondary publication DeepGreen
Abstract:

Human behavior is regarded as one of the most complex notions present nowadays, due to the large magnitude of possibilities. These behaviors and actions can be distinguished as normal and abnormal. However, abnormal behavior is a vast spectrum, so in this work, abnormal behavior is regarded as human aggression or in another context when car accidents occur on the road. As this behavior can negatively affect the surrounding traffic participants, such as vehicles and other pedestrians, it is crucial to monitor such behavior. Given the current prevalent spread of cameras everywhere with different types, they can be used to classify and monitor such behavior. Accordingly, this work proposes a new optimized model based on a novel integrated wavelet-based channel augmentation unit for classifying human behavior in various scenes, having a total number of trainable parameters of 5.3 m with an average inference time of 0.09 s. The model has been trained and evaluated on four public datasets: Real Live Violence Situations (RLVS), Highway Incident Detection (HWID), Movie Fights, and Hockey Fights. The proposed technique achieved accuracies in the range of 92% to 99.5% across the used benchmark datasets. Comprehensive analysis and comparisons between different versions of the model and the state-of-the-art have been performed to confirm the model’s performance in terms of accuracy and efficiency. The proposed model has higher accuracy with an average of 4.97%, and higher efficiency by reducing the number of parameters by around 139.1 m compared to other models trained and tested on the same benchmark datasets.

Uncontrolled Keywords: abnormal actions, anomaly, accidents, convolutional neural network, convolutional LSTM, channel augmentation, fights, recognition, wavelet transform, violence
Status: Publisher's Version
URN: urn:nbn:de:tuda-tuprints-272460
Additional Information:

This article belongs to the Special Issue Artificial Intelligence-Based Image Processing and Computer Vision

Classification DDC: 000 Generalities, computers, information > 004 Computer science
600 Technology, medicine, applied sciences > 621.3 Electrical engineering, electronics
Divisions: 18 Department of Electrical Engineering and Information Technology > Institute of Computer Engineering > Integrated Electronic Systems (IES)
Date Deposited: 07 May 2024 12:49
Last Modified: 17 May 2024 12:47
SWORD Depositor: Deep Green
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/27246
PPN: 518205444
Export:
Actions (login required)
View Item View Item