Elmasry, Ramez M. ; Abd El Ghany, Mohamed A. ; Salem, Mohammed A.-M. ; Fahmy, Omar M. (2024)
MultiWave-Net: An Optimized Spatiotemporal Network for Abnormal Action Recognition Using Wavelet-Based Channel Augmentation.
In: AI, 2024, 5 (1)
doi: 10.26083/tuprints-00027246
Article, Secondary publication, Publisher's Version
Text
ai-05-00014.pdf Copyright Information: CC BY 4.0 International - Creative Commons, Attribution. Download (4MB) |
Item Type: | Article |
---|---|
Type of entry: | Secondary publication |
Title: | MultiWave-Net: An Optimized Spatiotemporal Network for Abnormal Action Recognition Using Wavelet-Based Channel Augmentation |
Language: | English |
Date: | 7 May 2024 |
Place of Publication: | Darmstadt |
Year of primary publication: | 24 January 2024 |
Place of primary publication: | Basel |
Publisher: | MDPI |
Journal or Publication Title: | AI |
Volume of the journal: | 5 |
Issue Number: | 1 |
DOI: | 10.26083/tuprints-00027246 |
Corresponding Links: | |
Origin: | Secondary publication DeepGreen |
Abstract: | Human behavior is regarded as one of the most complex notions present nowadays, due to the large magnitude of possibilities. These behaviors and actions can be distinguished as normal and abnormal. However, abnormal behavior is a vast spectrum, so in this work, abnormal behavior is regarded as human aggression or in another context when car accidents occur on the road. As this behavior can negatively affect the surrounding traffic participants, such as vehicles and other pedestrians, it is crucial to monitor such behavior. Given the current prevalent spread of cameras everywhere with different types, they can be used to classify and monitor such behavior. Accordingly, this work proposes a new optimized model based on a novel integrated wavelet-based channel augmentation unit for classifying human behavior in various scenes, having a total number of trainable parameters of 5.3 m with an average inference time of 0.09 s. The model has been trained and evaluated on four public datasets: Real Live Violence Situations (RLVS), Highway Incident Detection (HWID), Movie Fights, and Hockey Fights. The proposed technique achieved accuracies in the range of 92% to 99.5% across the used benchmark datasets. Comprehensive analysis and comparisons between different versions of the model and the state-of-the-art have been performed to confirm the model’s performance in terms of accuracy and efficiency. The proposed model has higher accuracy with an average of 4.97%, and higher efficiency by reducing the number of parameters by around 139.1 m compared to other models trained and tested on the same benchmark datasets. |
Uncontrolled Keywords: | abnormal actions, anomaly, accidents, convolutional neural network, convolutional LSTM, channel augmentation, fights, recognition, wavelet transform, violence |
Status: | Publisher's Version |
URN: | urn:nbn:de:tuda-tuprints-272460 |
Additional Information: | This article belongs to the Special Issue Artificial Intelligence-Based Image Processing and Computer Vision |
Classification DDC: | 000 Generalities, computers, information > 004 Computer science 600 Technology, medicine, applied sciences > 621.3 Electrical engineering, electronics |
Divisions: | 18 Department of Electrical Engineering and Information Technology > Institute of Computer Engineering > Integrated Electronic Systems (IES) |
Date Deposited: | 07 May 2024 12:49 |
Last Modified: | 17 May 2024 12:47 |
SWORD Depositor: | Deep Green |
URI: | https://tuprints.ulb.tu-darmstadt.de/id/eprint/27246 |
PPN: | 518205444 |
Export: |
View Item |