2023 Faculty Courses School of Computing Department of Computer Science Graduate major in Artificial Intelligence

Image and Video Recognition

Academic unit or major: Graduate major in Artificial Intelligence
Instructor(s): Nakamasa Inoue
Class Format: Lecture (HyFlex)
Media-enhanced courses: -
Day of week/Period (Classrooms): 1-2 Mon (M-B45(H105)) / 1-2 Thu (M-B45(H105))
Class: -
Course Code: ART.T551
Number of credits: 200
Course offered: 2023
Offered quarter: 4Q
Syllabus updated: Jul 8, 2025
Language: English

Syllabus

Course overview and goals

This course gives an overview of the foundational ideas with some recent advances in image and video recognition. It covers deep neural networks such as convolutional neural networks, region proposal networks, fully convolutional networks and generative adversarial networks. Through lectures and assignments, students will learn the necessary skills to implement their own neural networks.

Course description and aims

At the end of this course, students should be able to
1) explain the basic concepts of image and video recognition, and
2) implement their own network by using deep learning libraries

Keywords

Deep Learning, Neural Networks, Image Recognition, Video Recognition

Competencies

Specialist skills
Intercultural skills
Communication skills
Critical thinking skills
Practical and/or problem-solving skills

Class flow

This course will be taught with slides.

Course schedule/Objectives

	Course schedule	Objectives
Class 1	Introduction	Overview of image and video recognition
Class 2	Basic Mathematics for Deep Learning	Linear algebra and optimization
Class 3	Tools for Deep Learning	Python libraries for deep learning
Class 4	Image Classification	Convolutional neural networks
Class 5	Object Detection	Region proposal networks
Class 6	Image Segmentation	Fully convolutional networks
Class 7	Action Recognition
Class 8	Data Augmentation	Data augmentation for image recognition
Class 9	Image Generation	Generative adversarial networks
Class 10	Adversarial Examples	Adversarial examples and defense methods
Class 11	Domain Adaptation	Adaptation and transfer learning methods
Class 12	Zero-Shot Learning	Zero-shot learning methods using attributes and texts
Class 13	Distributed Learning	Learning with multiple GPUs
Class 14	Theoretical Analysis of Deep Learning	Theoretical Analysis of Deep Learning

Study advice (preparation and review)

To enhance effective learning, students are encouraged to spend approximately 100 minutes preparing for class and another 100 minutes reviewing class content afterwards (including assignments) for each class.
They should do so by referring to textbooks and other course material.

Textbook(s)

Reference books, course materials, etc.

I. Goodfellow, Y. Benito, A. Courville, Deep Learning, MIT Press, 2016.
D. Foster, Generative Deep Learning, O'Reilly Media, 2019.

Evaluation methods and criteria

Assignments (100%)

Related courses

ART.T458 ： Advanced Machine Learning
XCO.T489 ： Fundamentals of artificial intelligence
XCO.T490 ： Exercises in fundamentals of artificial intelligence
XCO.T483 ： Advanced Artificial Intelligence and Data Science A
XCO.T485 ： Advanced Artificial Intelligence and Data Science C
XCO.T486 ： Advanced Artificial Intelligence and Data Science D