2022 Faculty Courses School of Computing Department of Computer Science Graduate major in Artificial Intelligence

3D Computer Vision

Academic unit or major: Graduate major in Artificial Intelligence
Instructor(s): Asako Kanezaki
Class Format: Lecture (HyFlex)
Media-enhanced courses: -
Day of week/Period (Classrooms): 3-4 Tue (W833) / 3-4 Fri (S223)
Class: -
Course Code: ART.T466
Number of credits: 200
Course offered: 2022
Offered quarter: 4Q
Syllabus updated: Jul 10, 2025
Language: English

Syllabus

Course overview and goals

This course teaches how to process images and 3D data so as to extract higher-level information from the data. It covers the basics of data processing, geometric transformations, and linear algebra for machine learning, as well as recent cutting-edge research on deep neural networks, with hands-on exercises using programming languages such as Python.

Course description and aims

At the end of this course, students should be able to
1) acquire the basics of image and 3D data processing, and
2) implement their own desired processing including deep learning with e.g., Python.

Keywords

Image Processing, 3D Data Processing, Geometric Transformations, Deep Learning, Neural Networks

Competencies

Specialist skills
Intercultural skills
Communication skills
Critical thinking skills
Practical and/or problem-solving skills

Class flow

This course will be taught with slides and programming exercises.

Course schedule/Objectives

	Course schedule	Objectives
Class 1	Input/Output of image and 3D data	Understand 3D sensors and how to input, output, and visualize data.
Class 2	Image and 3D data pre-processing	Learn about data filtering and geometric transformations.
Class 3	Image and 3D features	Learn about image features such as SIFT and 3D keypoints/local features.
Class 4	Image and 3D data correspondence search	Learn about k-d tree data structure and nearest neighbor search
Class 5	3D data registration	Understand RANSAC and graph matching, and the ICP algorithm
Class 6	Linear algebra as a basis for machine learning	Learn about linear algebra, the foundation of machine learning
Class 7	Data classification using machine learning	Learn data classification techniques by support vector machines, etc.
Class 8	Foundation of deep learning	Understand layers and the back propagation technique of deep learning
Class 9	Image processing with deep learning (1)	Learn methods of image classification using deep learning
Class 10	Image processing with deep learning (2)	Learn techniques such as image segmentation using deep learning
Class 11	3D data processing with deep learning (1)	Learn methods of 3D data classification using deep learning
Class 12	3D data processing with deep learning (2)	Learn techniques such as 3D data segmentation using deep learning
Class 13	3D data processing with deep learning (3)	Learn about 3D representations using implicit functions, machine learning using geometric information, etc.
Class 14	Summary and Discussion	Summary and discussion of the lecture

Study advice (preparation and review)

To enhance effective learning, students are encouraged to spend approximately 100 minutes preparing for class and another 100 minutes reviewing class content afterwards (including assignments) for each class.
They should do so by referring to textbooks and other course material.

Textbook(s)

None

Reference books, course materials, etc.

Materials translated into English from the above-mentioned Japanese reference book will be distributed.

Evaluation methods and criteria

Comprehension of lecture content will be evaluated. Grades will be based on exercises and reports.

Related courses

ART.T467 ： Fundamentals of Computer Vision
ART.T552 ： Advanced Topics in Computer Vision
ART.T551 ： Image and Video Recognition

Prerequisites

There are no prerequisites for this course.