Abstract

Multi-view video coding has been widely applied to 3D stereoscopic displays. This paper proposes a rate control algorithm for a multi-view video codec that trades off data compression ratio against the picture quality of 3D video. The study uses the visual perception characteristics of 3D video viewers to determine the interesting regions in all views. The proposed algorithm assigns each interesting region a level according to the presence of macroblocks with high motion, objects with individual shape, or obvious texture, which are detected with the corresponding Just Noticeable Distortion function, luminance histogram bin variation, and edge strength, respectively. It then assigns suitable quantization parameters to control the bit-rate at each level of interest, so that video quality is preserved in the interesting regions and the bit-rate of the non-interesting regions is reallocated to 3D video storage and transmission for stereoscopic display. The experimental results show that the proposed method preserves high quality and achieves a low bit-rate for 3D effects.
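The abstract does not give the actual mapping from interest levels to quantization parameters, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes the three cues (high motion, distinct shape, obvious texture) have already been thresholded into booleans per macroblock, that the interest level is simply the number of cues that fire, and that each level maps to a fixed QP offset. The names `QP_OFFSET`, `interest_level`, `assign_qp`, and `qp_map`, and all offset values, are hypothetical and not taken from the paper.

```python
# Hypothetical QP offsets per interest level (0 = non-interesting).
# These values are illustrative assumptions, not taken from the paper.
QP_OFFSET = {0: 6, 1: 2, 2: 0, 3: -2}

QP_MIN, QP_MAX = 0, 51  # valid H.264/AVC quantization parameter range


def interest_level(high_motion, distinct_shape, obvious_texture):
    """Count how many of the three cues fire for one macroblock (assumed rule)."""
    return int(high_motion) + int(distinct_shape) + int(obvious_texture)


def assign_qp(base_qp, level):
    """Offset the base QP by interest level and clamp to the legal range."""
    return max(QP_MIN, min(QP_MAX, base_qp + QP_OFFSET[level]))


def qp_map(base_qp, cues):
    """Map (motion, shape, texture) cue triples to per-macroblock QPs."""
    return [assign_qp(base_qp, interest_level(*c)) for c in cues]
```

With a base QP of 30, `qp_map(30, [(False, False, False), (True, False, False), (True, True, True)])` gives `[36, 32, 28]` under these assumed offsets: non-interesting macroblocks are quantized more coarsely, and the most interesting ones more finely.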

© 2012 IEEE



