Machine Learning for Artificial Intelligence track

Data science

Data engineering

Artificial Intelligence

Machine Learning

Deep Learning

Signal and Image Processing

Objectives

Data science is an emerging scientific domain that aims to extract knowledge from data generated in ever-increasing quantities (Big Data). This ‘new’ science is one of the driving forces behind artificial intelligence, particularly through the study and design of machine learning methods. The MLAI track (A2IA in french) aims to train engineers and researchers specialising in this field, who will be the data managers and data scientists of tomorrow.

Open to international

Courses during the third and fourth semesters are taught in English, in order to welcome English-speaking students and prepare French-speaking students for mobility. Internships are offered every year in our partner laboratories abroad (Belgium, Denmark, Canada, etc.), as was the case for the students in our section "Abroad".

Projects and internship

Individual project and internship are an important part of the MLAI track, with:

Internship or project during the second semester : an internship in a research lab or in a company lasting at least 8 weeks or a project to be carried out independently at the University under the supervision of a member of the teaching staff.
Project in the third semester : an individual project, which can be an introduction to data science research.
Internship in the fourth semester : an end-of-study internship lasting between 4 and 6 months, to be carried out in a company or in a research laboratory.

Programme

The objective of this teaching unit is to achieve greater fluency in oral expression in English in general and professional contexts such as public speaking or during an English interview or meeting. Alone or in a team within a small group, it is a question of being able to work and present arguments orally at a B2 level of the CEFR. In particular, this involves:

Learning the methodology of public speaking.
Intensive work (specific workshops) on the rules of pronunciation of the English language (British or North American accent) to aim for authentic oral expression.
The acquisition of vocabulary and expressions specific to written and oral communication in general and professional contexts.

The objective of this course is to present the basics of machine learning (the different types of contexts/tasks/applications), to understand how the main machine learning methods work, and to give the experimental methodological principles for setting up implementing these methods. The goal is to master the different tasks of machine learning, to understand the differences between the methods and their operating principle and to know how to set up an experimental protocol to test and compare these methods on real data sets. The topics covered are:

Parametric and non-parametric, generative or discriminant classifiers (Gaussian estimation, Parzen estimator, k nearest neighbors, linear separator (perceptron, SVM))
Hierarchical classifiers (decision trees)
Neural networks (MLP)
The selection of models

The objective of this teaching unit is to introduce the issue of the Web of data both from a historical and technical point of view. These are the technologies of the Semantic Web that make it possible to implement the fundamental principles of this Web of data. This is accompanied by a stack of standards issued by the World Wide Web Consortium (W3C) that this course offers to understand both theoretically and practically. The primary objective of this course is to enable students to acquire knowledge of the operation of the Semantic Web and its architectural principles. This is essential for a good understanding of today's Web given the growing role played by Semantic Web technologies. The secondary objective is to train in the standards of the Semantic Web in order in particular to achieve a concrete and mastered understanding of the notion of ontology and their ability to implement reasoning and deductions from new data. The targeted skills are:

Understand the concept of Web of data and know the architectural principles of the Semantic Web.
Master the representation languages adapted to the publication of linked data on the Web (RDF model).
Master the main aspects of the SPARQL query language which allows the querying and modification of (linked) data through the Web.
Understand the notion of ontology and the languages used to write them (RDFS, OWL, SKOS).

The objective of this teaching unit is to learn about graphic interface design with Web technologies. It aims to train in the use of HTML / CSS technologies for the creation of Web interfaces, with an emphasis on modern techniques of responsive web design. At the end of this course, students should be able to design multi-media adaptive graphical interfaces, that is to say that are functional on all types of screens (mobile, tablet, computers, etc.). This course does not cover concepts of ergonomics or graphics, but focuses on the technical mastery of dedicated tools. The targeted skills are:

Master HTML5 for structuring web documents.
Master the key principles of CSS for the formatting and visual and functional rendering of these pages.
Master the fundamental techniques of responsive web design (flexible layout, flexible media, media queries, etc...).

The objective of this teaching unit is to present the theoretical and practical tools allowing the processing of random signals, in particular on aspects related to signal filtering. It is a question of mastering the classic approaches for such treatments. The targeted skills are:

Theoretical deepening of signal processing tools.
Putting into practice the methods seen in class.

The objective of this teaching unit is to know how to organize teamwork around a development project using AI:

Know and implement the principles of team management (cycles, agility, etc.)
Master common project management tools (diagrams, dashboards, version tracking, etc.)

The objective of this teaching unit is to present methods for finding the minimum of a function of R^n by descent with or without constraints, methods which are present in many learning algorithms. It is a question of mastering the theoretical aspects and of being able to implement these methods of descents

The objective of this teaching unit is to master classical and advanced unsupervised data analysis techniques for the purpose of data visualization or dimension reduction. It includes practical work sessions during which the methods are programmed in Python language. It is about understanding the assumptions behind the different data analysis models and implementing modern data analysis techniques

The objective of this teaching unit is to design professional documents (CV and cover letter) consistent with a professional objective and to prepare for the job interview.

The objective of this course is to achieve greater fluency in oral expression in English in general and professional contexts such as public speaking or during an English interview or meeting. Alone or in a team within a small group, it is a question of being able to work and present arguments orally at a B2 level of the CEFR. In particular, this involves:

Learning the methodology of public speaking.
Intensive work (specific workshops) on the rules of pronunciation of the English language (British or North American accent) to aim for authentic oral expression.
The acquisition of vocabulary and expressions specific to written and oral communication in general and professional contexts.

The objective of this teaching unit is to present the problem of combinatorial optimization and the reference approaches to deal with such problems. It will make it possible to master the standard algorithms for solving operational research problems (dynamic programming, mathematical programming, branching algorithms, etc.). The teaching unit provides students with a basic culture in operational research, which makes them able to model a combinatorial optimization problem and to choose the appropriate approach to solve it and to evaluate it.

The objective of this teaching unit is an introduction to multilayer neural networks and their error gradient backpropagation learning algorithm. The SGD algorithm is presented and the problem of the disappearance of the gradient in deep architectures is highlighted. The various techniques for controlling this phenomenon are presented. Convolutional networks are then discussed, and highlighted for face recognition applications. Recurrent network architectures are presented for speech recognition. The architectures of adversary networks make it possible to approach the techniques of generation of false data. The objective is to understand and master modern supervised and weakly supervised learning techniques based on neural network architectures, and deep neural networks. Understand and master the optimization algorithms specific to these architectures. Know how to implement these algorithms using dedicated computing environments in Python language, such as Tensor Flow, Keras, or PyTorch.

The objective of this teaching unit is to deepen the fundamental theoretical concepts of machine learning in general, but also of the most emblematic generalist methods. At the end of this course, students will be familiar with the theoretical foundations of machine learning, will understand the motivations behind the different existing approaches and will master the operation of the most emblematic generalist methods.

The objective of this teaching unit is to deal with the different hardware architectures in the field of massive data processing as well as the methods and tools to make the best use of these different architectures. This teaching will address the hardware aspects, in particular memory (local or distributed on several machines) and different programming paradigms and tools on distributed architecture. The objective is to provide students with the basic knowledge and skills in distributed computing which make them able to use this type of infrastructure and to port their algorithms to such infrastructures and to familiarize them with the associated technologies (frameworks, files, etc.).

The objective of this teaching unit is to apply for at least 8 weeks the theoretical lessons received during the course period, within the framework of academic or industrial projects. The projects are carried out within the URN. The internship can be carried out in the laboratory or in a company.

The aim of this unit is to prepare students to defend their project and/or internship. Students are trained to prepare an oral presentation (scientific or professional), to present a scientific contribution to a specialist or non-specialist audience and to identify, formalise and defend a professional project.

This course aims to enable students to master the rules of written and oral communication in English in general and professional contexts.

The objectives of this UE are to achieve greater fluency in oral expression in general and professional contexts such as public speaking or during an interview in English, and to master the language tools specific to professional written communication.

The targeted skills are the acquisition of vocabulary and expressions specific to written and oral communication in general and professional contexts.

This course presents the different architectures in the field of High Performance Computing (HPC): shared memory computers, distributed memory computers, GPU based accelerators... as well as the methods and tools to use these different architectures. This teaching will address the notions of computing power, profiling and optimization of computing performance, massively parallel computing and porting to GPUs. A focus will be made on the energy consumption of this type of infrastructure. The objective is to provide students with a basic culture of HPC, which will enable them to use this type of infrastructure and to improve their algorithms to adapt them to the use of such infrastructures. A second objective is to make students aware of the energy and environmental impact of using these infrastructures.