Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet

Lin, Tongyao

doi:10.15598/aeee.v19i1.3950

Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet

dc.contributor.author	Lin, Tongyao
dc.contributor.author	Ouyang, Li
dc.contributor.author	Wen, He
dc.contributor.author	Xiong, Dezhi
dc.contributor.author	Smulko, Janusz
dc.date.accessioned	2021-04-29T05:25:21Z
dc.date.available	2021-04-29T05:25:21Z
dc.date.issued	2021
dc.description.abstract	For the purpose of improving the service quality, video surveillance systems are widely used to standardize the service process in power supply business halls. If the employers check surveillance video to ensure predefined process of staff behaviours, it will be characterized as time-consuming. In recent years, great progress has been made in intelligent action recognition using Convolution Neural Networks (CNNs). However, due to the small range of staffs' motion and similar scene information of power supply business halls, the performance of using traditional CNNs to recognize service actions, e.g. bowing, standing and sitting, is general. For improving the recognition rate, this paper proposes a 3D-fused Convolutional Network (ConvNet) for service actions recognition, which focuses on detecting the actions in the typical scene of one staff person and one customer with a well-segmented video clip. The well-segmented video clips are sent as input to the 3D-fused ConvNet for action recognition. The 3D-fused ConvNet consists of two base learners, optical flow base learner and RGB base learner. Both learners use the Convolutional 3D (C3D) architecture. Specifically, the RGB learner can be used to capture the features of small staffs' motion while the optical flow base learner can be viewed as the key part to eliminate the influence of the background, especially in a similar scene. Furthermore, prediction scores of two base learners can be weighted by the softmax function according to the performance of each base learner. Finally, the prediction scores of the two base learners are fused to obtain the prediction result, namely the specific actions of the staffs in the videos. The experiment result shows that the proposed method achieves 92.41% accuracy on the service action dataset of the power supply business hall.	cs
dc.identifier.citation	Advances in electrical and electronic engineering. 2021, vol. 19, no. 1, p. 90 - 99 : ill.	cs
dc.identifier.doi	10.15598/aeee.v19i1.3950
dc.identifier.issn	1336-1376
dc.identifier.issn	1804-3119
dc.identifier.uri	http://hdl.handle.net/10084/143054
dc.language.iso	en	cs
dc.publisher	Vysoká škola báňská - Technická univerzita Ostrava	cs
dc.relation.ispartofseries	Advances in electrical and electronic engineering	cs
dc.relation.uri	https://doi.org/10.15598/aeee.v19i1.3950	cs
dc.rights	© Vysoká škola báňská - Technická univerzita Ostrava
dc.rights	Attribution-NoDerivatives 4.0 International	*
dc.rights.access	openAccess	cs
dc.rights.uri	http://creativecommons.org/licenses/by-nd/4.0/	*
dc.subject	3D convolution	cs
dc.subject	action recognition	cs
dc.subject	ensemble	cs
dc.subject	ower supply business hall	cs
dc.title	Service Action Recognition in Power Supply Business Hall with 3D-Fused ConvNet	cs
dc.type	article	cs
dc.type.status	Peer-reviewed	cs
dc.type.version	publishedVersion	cs

Files

Original bundle

Now showing 1 - 1 out of 1 results

Name:: 3950-488493464-1-PB.pdf
Size:: 8.11 MB
Format:: Adobe Portable Document Format
Description:: 3950-488493464-1-PB.pdf

Download

License bundle

Now showing 1 - 1 out of 1 results

Name:: license.txt
Size:: 718 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

AEEE. 2021, vol. 19