IT379 COMPUTER VISION
Course Coordinator: - Prof. Sanket Suthar
Title of the unit | Minimum number of hours | |
---|---|---|
1 |
Introduction and Foundations |
05 |
2 | Digital Image Formation and low-level processing | 04 |
3 |
Depth estimation and multi-camera views |
03 |
4 | Feature Extraction, Image Segmentation and Pattern Analysis | 10 |
5 | Shape Representation and Segmentation | 07 |
6 | Hough Transform and Object recognition | 07 |
7 | 3D Vision and Motion | 05 |
8 | Applications | 04 |
Unit No. | Topics | Teaching Hours |
---|---|---|
1 |
Introduction and Foundations Image Processing, Computer Vision and Computer Graphics, Overview of Diverse Computer Vision Applications: Document Image Analysis, Biometrics, Object Recognition, Tracking, Medical Image Analysis, Content-Based Image Retrieval, Video Data Processing, Multimedia, Virtual Reality and Augmented Reality |
05 |
2 | Digital Image Formation and low-level processing Overview and State-of-the-art, Fundamentals of Image Formation, Transformation: Orthogonal, Euclidean, Affine, Projective, etc; Fourier Transform, Convolution and Filtering, Image Enhancement, Restoration, Histogram Processing. | 04 |
3 |
Depth estimation and multi-camera views Perspective, Binocular Stereopsis: Camera and Epipolar Geometry; Homography, Rectification, DLT, RANSAC, 3-D reconstruction framework; Auto-calibration. Apparel. |
03 |
4 | Feature Extraction, Image Segmentation and Pattern Analysis Edges - Canny, LOG, DOG; Line detectors (Hough Transform), Corners - Harris and Hessian Affine, Orientation Histogram, SIFT, SURF, HOG, GLOH, Scale-Space Analysis- Image Pyramids and Gaussian derivative filters, Gabor Filters and DWT, Segmentation: Region Growing, Edge Based approaches to segmentation, Graph-Cut, Mean-Shift, MRFs, Texture Segmentation; Object detection, Pattern Analysis: Clustering: KMeans, K-Medoids, Mixture of Gaussians, Classification: Discriminant Function, Supervised, Un-supervised, Semi-supervised; Classifiers: Bayes, KNN, ANN models; Dimensionality Reduction: PCA, LDA, ICA; Non-parametric methods. | 10 |
5 | Shape Representation and Segmentation Contour based representation, Region based representation, Deformable curves and surfaces, Snakes and active contours, Level set representations, Fourier and wavelet descriptors, Medial representations, Multiresolution analysis | 07 |
6 | Hough Transform and Object recognition Line detection, Hough Transform (HT) for line detection, foot-of-normal method, line localization, line fitting, RANSAC for straight line detection, HT based circular object detection, accurate center location, speed problem, ellipse detection, Case study: Human Iris location, hole detection, generalized Hough Transform (GHT), spatial matched filtering, GHT for ellipse detection, object location, GHT for feature collation, Object Recognition: Simple object recognition methods, Shape correspondence and shape matching, Principal component analysis , Shape priors for recognition. | 07 |
7 | 3D Vision and Motion Methods for 3D vision, projection schemes, shape from shading, photometric stereo, shape from texture, shape from focus, active range finding, surface representations, point-based representation, volumetric representations, 3D object recognition, 3D reconstruction, introduction to motion, triangulation, bundle adjustment, translational alignment, parametric motion, spline-based motion, optical flow, layered motion. | 05 |
8 | Applications Photo album, Face detection, Face recognition, Eigen faces, Active appearance and 3D shape models of faces Application: Surveillance, foreground-background separation, particle filters, Chamfer matching, tracking, and occlusion, combining views from multiple cameras, human gait analysis Application: In-vehicle vision system: locating roadway, road markings, identifying road signs, locating pedestrians. | 04 |
Textbooks |
---|
Richard Szeliski, Computer Vision: Algorithms and Applications, Springer-Verlag London Limited 2011. Click Here |
Computer Vision: A Modern Approach, D. A. Forsyth, J. Ponce, Pearson Education, 2003. Click Here |