High-level semantic photographic composition analysis and understanding with deep neural networks

Min Tzu Wu, Tse Yu Pan, Wan Lun Tsai, Hsu-Chan Kuo, Min-Chun Hu

研究成果: Conference contribution

1 引文 斯高帕斯(Scopus)

摘要

In order to take better photos, it is a fundamental step for the beginners of photography to learn basic photo composition rules. However, there are no tools developed to help beginners analyze the composition rules in given photos. Thus, in this study we developed a system with the capability to identify 12 common composition rules in a photo. It should be noted that some of the 12 common composition rules have not been considered by the previous studies, and this deficit gives this study its significance and appropriateness. In particular, we utilized deep neural networks (DNN) to extract high-level semantic features for facilitating the further analysis of photo composition rules. In order to train the DNN model, our research team constructed a dataset, which is collected from some famous photo websites, such as DPChallenge, Flicker, and Unsplash. All the collected photos were later labelled with 12 composition rules by a wide range of raters recruited from Amazon Mechanical Turk (AMT). Two DNN architectures (AlexNet and GoogLeNet) were then employed to build our system based on the collected dataset. The representative features of each composition rule were further visualized in our system. The results showed the feasibility of the proposed system and revealed the possibility of using this system to assist potential users to improve their photographical skills and expertise.

原文English
主出版物標題2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017
發行者Institute of Electrical and Electronics Engineers Inc.
頁面279-284
頁數6
ISBN(電子)9781538605608
DOIs
出版狀態Published - 2017 九月 5
事件2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017 - Hong Kong, Hong Kong
持續時間: 2017 七月 102017 七月 14

出版系列

名字2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017

Other

Other2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017
國家Hong Kong
城市Hong Kong
期間17-07-1017-07-14

    指紋

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Media Technology

引用此

Wu, M. T., Pan, T. Y., Tsai, W. L., Kuo, H-C., & Hu, M-C. (2017). High-level semantic photographic composition analysis and understanding with deep neural networks. 於 2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017 (頁 279-284). [8026247] (2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICMEW.2017.8026247