Video Parsing


Digital video needs to be properly processed before it can be inserted into a video server. These tasks include compressing, parsing and indexing a video sequence. Video parsing is the process of detecting scene changes or the boundaries between camera shots in a video stream. Digital video sequences are usually stored in a compressed format such as motion JPEG or MPEG. The most computationally expensive part of the decoding algorithm is the inverse DCT (IDCT) operation. If a video sequence can be processed before the IDCT is performed, execution time can be reduced. In this project we proposed a fast algorithm to parse MPEG1, MPEG2 or motion JPEG compressed video sequences by approximating color histograms using DC coefficients and motion vectors.

The following is a poster presented at the 1995 International Conference on Image Processing. To have a closer look at the poster, please click on the area that interests you.

More detailed information may be found in:

Notice: the copyrights to the following papers are held by the publishers. The attached PostScript and PDF files are preprints. Please treat this material in a way consistent with the "fair use'' provisions of the appropriate copyright laws.

Many of these papers include source code and images used in the work. The README files in each directory at the ftp site provide the needed details including the correct citations for each paper.

Address all comments and questions to Professor Edward J. Delp.

K.Shen and E.J. Delp, "A Fast Algorithm for Video Parsing Using MPEG Compressed Sequences," Proceedings of the IEEE International Conference on Image Processing , October 23-26, 1995, Washington, DC., pp. 252-255. The readme file , compressed postscript file, PDF file, and the ftp site.


Professor Edward J. Delp