Article

Building a smart lecture-recording system using MK-CPN network for heterogeneous data sources

NEURAL COMPUTING & APPLICATIONS (2019)

Journal

NEURAL COMPUTING & APPLICATIONS

Volume 31, Issue 8, Pages 3759-3777

Publisher

SPRINGER LONDON LTD

DOI: 10.1007/s00521-017-3328-6

Keywords

Automatic lecture-recording system; Virtual cameraman; Virtual director; Shot selection

Category

Computer Science, Artificial Intelligence

Funding

Ministry of Science and Technology (MOST), Taiwan, R.O.C. [NSC-102-2221-E-003-013]

Abstract

Lecture-recording systems play a vital role in collecting spoken discourse for e-learning. However, as e-learning grows, the lack of content is becoming a problem. This research presents a smart lecture-recording (SLR) system that can record orations at the same level of quality as a human team, but with less human involvement. The proposed SLR system is composed of two subsystems, the virtual cameraman (VC) and the virtual director (VD). All cameraman components of the VC subsystem are automatic and can take actions that include target and event detection, tracking, and view searching. The videos taken by these three components are forwarded to the VD subsystem, in which the representative shot is chosen for recording or direct broadcasting. We refer to this function of the VD subsystem as shot selection, which is based on content analysis. The capability of shot selection is pre-trained through a machine-learning process characterized by the counter-propagation neural (CPN) network. However, the CPN network yielded poor results when the input data were heterogeneous. To increase the accuracy of shot selection, we applied multiple kernel learning (MKL) techniques to the CPN network, yielding a network called MK-CPN, to transform all the heterogeneous data from different content analysis methods into a unified space. A series of experiments on real lectures was conducted. The results showed that the proposed SLR system can provide oration records that are, to some extent, close to those taken by real human teams. We believe that the proposed system may not be limited to live speeches, provided it is configured with appropriate training materials.
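The idea of mapping heterogeneous content-analysis features into a unified space can be illustrated with a small sketch. This is not the authors' MK-CPN: it assumes one RBF kernel per feature source, fixed (not learned) kernel weights, and a winner-take-all choice over prototype units, loosely in the spirit of the competitive layer of a CPN. The feature-source names (`motion`, `audio`), the shot labels, and every parameter value are hypothetical and chosen only for illustration.

```python
import numpy as np

def rbf_kernel(x, prototypes, gamma):
    """RBF similarity between one feature vector and each prototype row."""
    x = np.asarray(x, dtype=float)
    d2 = np.sum((prototypes - x) ** 2, axis=1)
    return np.exp(-gamma * d2)

def select_shot(features, prototypes, gammas, weights, shot_labels):
    """Combine per-source kernels with fixed weights and pick the best unit.

    features, prototypes, gammas, weights are dicts keyed by source name.
    Each source may have a different dimensionality and scale; the kernels
    bring them into a common similarity range before they are summed.
    """
    combined = np.zeros(len(shot_labels))
    for name, x in features.items():
        combined += weights[name] * rbf_kernel(x, prototypes[name], gammas[name])
    winner = int(np.argmax(combined))  # winner-take-all over prototype units
    return shot_labels[winner], combined

# Hypothetical prototypes: one row per unit, one array per feature source.
prototypes = {
    "motion": np.array([[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]),
    "audio": np.array([[0.0], [10.0], [0.0]]),
}
gammas = {"motion": 0.5, "audio": 0.01}   # per-source kernel widths (assumed)
weights = {"motion": 0.5, "audio": 0.5}   # fixed kernel weights (assumed)
shot_labels = ["speaker", "audience", "slides"]

label, scores = select_shot(
    {"motion": [1.0, 1.0], "audio": [10.0]},
    prototypes, gammas, weights, shot_labels,
)
print(label, scores)
```

In a real multiple kernel learning setup the kernel weights would be learned from training data rather than fixed by hand, and the prototypes would come from training the network on annotated shots.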
