Capacity Scaling for Graph Cuts in Vision
2007, 2007 IEEE 11th International Conference on Computer Vision
https://doi.org/10.1109/ICCV.2007.4408970Abstract
Capacity scaling is a hierarchical approach to graph representation that can improve theoretical complexity and practical efficiency of max-flow/min-cut algorithms. Introduced by Edmonds, Karp, and Dinic [7, 6] in 1972, capacity scaling is well known in the combinatorial optimization community. Surprisingly, this major performance improving technique is overlooked in computer vision where graph cut methods typically solve energy minimization problems on huge N-D grids and algorithms' efficiency is a widely studied issue [3, 12, 16, 10]. Unlike some earlier hierarchical methods addressing efficiency of graph cuts in imaging, e.g. [16], capacity scaling preserves global optimality of the solution. This is the main motivation for our work studying capacity scaling in the context of vision. We show that capacity scaling significantly reduces non-polynomial theoretical time complexity of the max-flow algorithm in [3] to weakly polynomial O(m 2 n 2 log(U)) where U is the largest edge weight. While [3] is the fastest method for many applications in vision, capacity scaling gives several folds speed-ups for problems with large number of local minima. The effect is particularly strong in 3D applications with denser neighborhoods.
References (18)
- Y. Boykov and G. Funka-Lea. Graph cuts and efficient n-d image segmentation. Int. J. Comput. Vision, 70(2):109-131, 2006.
- Y. Boykov and V. Kolmogorov. Computing geodesics and minimal surfaces via graph cuts. In IEEE International Con- ference on Computer Vision, volume 1, pages 26-33, Wash- ington, DC, USA, 2003.
- Y. Boykov and V. Kolmogorov. An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Trans. Pattern Anal. Mach. Intell., 26(9):1124- 1137, sept. 2004.
- Y. Boykov and V. Lempitsky. From photohulls to photoflux optimization. In British Machine Vision Conference, vol- ume 3, pages 1149-1158, Edinburgh, UK, Sept. 2006.
- Y. Boykov, O. Veksler, and R. Zabih. Fast approximate en- ergy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell., 23(11):1222-1239, 2001.
- E. A. Dinic. Metod porazryadnogo sokrashcheniya nevya- zok i transportnye zadachi. Issledovaniya po Diskretnoi Ma- ternatike, Science, 1973. Title translation: The Method of Scaling and Transportation Problems.
- J. Edmonds and R. M. Karp. Theoretical improvements in algorithmic efficiency for network flow problems. Journal of the Association for Computing Machinery, 19(2):248-264, 1972.
- L. R. Ford and D. R. Fulkerson. Flows in Networks. Prince- ton University Press, june 1962.
- D. S. Hochbaum. The pseudoflow algorithm: A new algo- rithm for the maximum flow problem. Long version of same titled paper, may 2004.
- O. Juan and Y. Boykov. Active graph cuts. In IEEE Con- ference on Computer Vision and Pattern Recognition, june 2006. New York.
- O. Juan and Y. Boykov. Capacity scaling for graph cuts in vi- sion. Technical report, University of Western Ontario, august 2007.
- P. Kohli and P. H. Torr. Efficiently solving dynamic markov random fields using graph cuts. In IEEE International Con- ference on Computer Vision, volume 2, pages 922-929, 2005.
- V. Kolmogorov and Y. Boykov. What metrics can be approx- imated by geo-cuts, or global optimization of length/area and flux. In IEEE Inter. Conf. on Comp. Vision, volume 1, pages 564-571, Washington, DC, USA, 2005.
- V. Kolmogorov and C. Rother. Minimizing non-submodular functions with graph cuts -a review. IEEE Trans. Pattern Anal. Mach. Intell., 2007. (to appear).
- V. Kolmogorov and R. Zabih. What energy functions can be minimized via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell., 26(2):147-159, February 2004.
- H. Lombaert, Y. Sun, L. Grady, and C. Xu. A multilevel banded graph cuts method for fast image segmentation. In IEEE International Conference on Computer Vision, vol- ume 1, pages 259-265, Washington, DC, USA, 2005.
- C. Rother, V. Kolmogorov, and A. Blake. Grabcut: interac- tive foreground extraction using iterated graph cuts. ACM Trans. Graph., 23(3):309-314, 2004.
- G. Vogiatzis, P. H. S. Torr, and R. Cipolla. Multi-view stereo via volumetric graph-cuts. In IEEE Comp. Soc. Conf. on Comp. Vision and Pattern Recog., volume 2, pages 391-398, Washington, DC, USA, 2005.