Comic presentation of videos has attracted increasing attention in recent years. This work examines in depth one important component, speech balloon placement, which was previously overlooked and handled by heuristic approaches. Speech balloons of various sizes and appropriate shapes are generated according to the number of words and the emotion embedded in the subtitles, and the audio energy corresponding to the targeted frame. Locating the speech balloons in the panels on the same comic page is then formulated as an optimization problem. The objective function integrates an intra-panel cost with an inter-panel cost: the former is designed to avoid occluding important regions of frames and to direct the viewer's gaze, and the latter is designed to build the reading tempo. The experimental results show that the proposed method facilitates higher readability, higher content coverage, and better speech balloon placement.
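
To make the shape of the optimization concrete, the following is a minimal sketch, not the formulation used in this work. It assumes that each panel offers a small set of candidate balloon rectangles, that the intra-panel cost is the overlap area between a balloon and the important regions of its frame, and that the inter-panel cost is the vertical jump between consecutive balloon centers (a stand-in for reading tempo). The names `overlap_area`, `intra_cost`, `inter_cost`, `place_balloons`, and the parameter `weight` are all hypothetical, and the gaze-direction term is omitted.

```python
from itertools import product

# Rectangles are (x, y, width, height) tuples in page coordinates.

def overlap_area(a, b):
    """Area of the intersection of two rectangles (0 if disjoint)."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    w = min(ax + aw, bx + bw) - max(ax, bx)
    h = min(ay + ah, by + bh) - max(ay, by)
    return max(w, 0) * max(h, 0)

def intra_cost(balloon, important_regions):
    """Assumed intra-panel cost: total area of important regions occluded."""
    return sum(overlap_area(balloon, r) for r in important_regions)

def inter_cost(prev_balloon, next_balloon):
    """Assumed inter-panel cost: vertical distance between balloon centers."""
    prev_cy = prev_balloon[1] + prev_balloon[3] / 2
    next_cy = next_balloon[1] + next_balloon[3] / 2
    return abs(next_cy - prev_cy)

def place_balloons(candidates, regions, weight=1.0):
    """Exhaustively pick one candidate per panel minimizing the summed cost.

    candidates: one list of candidate rectangles per panel.
    regions: one list of important-region rectangles per panel.
    weight: hypothetical trade-off between intra- and inter-panel terms.
    """
    best, best_cost = None, float("inf")
    for combo in product(*candidates):
        cost = sum(intra_cost(b, r) for b, r in zip(combo, regions))
        cost += weight * sum(inter_cost(a, b) for a, b in zip(combo, combo[1:]))
        if cost < best_cost:
            best, best_cost = combo, cost
    return best, best_cost
```

Exhaustive search is used here only because it is easy to verify on a page with a few panels; its cost grows exponentially with the number of panels, so a practical solver would need a more efficient strategy.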