Advanced Applied Data Analytics

This course is the second, more advanced part of the diiP course “Applied Data Analytics.” The first part covers basic techniques, methodologies, and practical skills for data analysis. The second part delves into more advanced topics in Machine Learning and Deep Learning (see schedule below).

Like the first part, this course is intended for a variety of doctoral students from several faculties at Université Paris Cité. Key topics include data science, data analysis, machine learning, deep learning, and data mining.

Format of Lectures

This intensive course runs over one week in the winter, from Monday to Friday, between 10:00 AM and 1:00 PM, totalling approximately 15 hours.

Instruction includes slides, hands-on Google Colaboratory exercises, and questionnaires.

The course is conducted via Zoom.

Information

The last session took place from January 20-24, 2025. The next session will be held in January 2026.

Interested students can enrol through https://u-paris.fr/doctorat/advanced-applied-data-analytics/.

Enrolled students who need access to course materials should contact me via email.

Learning Outcomes

By the end of this course, students will:

Understand the fundamentals of data-driven learning, including key AI and ML concepts.
Gain familiarity with transformer architectures, their applications in sequence modelling, and how they power modern AI models.
Develop an in-depth understanding of Graph Neural Networks (GNNs) and their use in processing structured data
Learn about Self-Supervised Learning, including its role in training AI models without labelled data.
Explore Generative Adversarial Networks (GANs) and how they are used in data synthesis and augmentation.
Master Probabilistic Modelling and Bayesian approaches for uncertainty estimation in machine learning models.
Understand the applications of probabilistic techniques in data analytics, decision-making, and predictive modelling.

Instruction Methods

Lectures via Zoom using slides
Practical exercises with Google Colaboratory
Interactive polls and Q&A sessions

Assessment

Participation in class, discussions and polls

Course Material

The Zoom codes to view the recordings are available in the Introduction Lecture.
Students who need access to course materials should please email me.

New! An engaging way of getting a summary of the lectures. Podcasts by Notebook LM summarizing the lectures in English (below)

Lectures 1 and 2: Introduction
Zoom: Zoom recording 1, Zoom recording 2
Podcasts: Lecture 1, Lecture 2
Colaboratory Notebooks

Lecture 3: Generative Adversarial Networks, Transformers, Autoencoders
Zoom: Zoom recording 3, Zoom recording 4
Colaboratory Notebooks

Lecture 4: Graph Neural Networks
Zoom: still Zoom recording 4)
Colaboratory Notebooks

- Graph Classification using a GNN
- Paper type prediction using a GNN

Lecture 5: Probabilistic Learning, Bayesian Deep Learning, Variational Autoencoders
Zoom: Zoom recording 5
Colaboratory Notebooks

- Exploring Bayesian Regression for Uncertainty Quantification in Predictions
- Variational Autoencoder on MNIST data

Additional Material
Zoom: Zoom recording 5
Colaboratory Notebooks:

- - Exploring High-Dimensional Representations with UMAP and Inverse Transformations