This course is the second, more advanced part of the diiP course “Applied Data Analytics.” The first part covers basic techniques, methodologies, and practical skills for data analysis. The second part delves into more advanced topics in Machine Learning and Deep Learning (see schedule below).
Like the first part, this course is intended for a variety of doctoral students from several faculties at Université Paris Cité. Key topics include data science, data analysis, machine learning, deep learning, and data mining.
This intensive course runs over one week in the winter, from Monday to Friday, between 10:00 AM and 1:00 PM, totalling approximately 15 hours.
Instruction includes slides, hands-on Google Colaboratory exercises, and questionnaires.
The course is conducted via Zoom.
The last session took place from January 20-24, 2025. The next session will be held in January 2026.
Interested students can enrol through https://u-paris.fr/doctorat/advanced-applied-data-analytics/.
Enrolled students who need access to course materials should contact me via email.
By the end of this course, students will:
Understand the fundamentals of data-driven learning, including key AI and ML concepts.
Gain familiarity with transformer architectures, their applications in sequence modelling, and how they power modern AI models.
Develop an in-depth understanding of Graph Neural Networks (GNNs) and their use in processing structured data
Learn about Self-Supervised Learning, including its role in training AI models without labelled data.
Explore Generative Adversarial Networks (GANs) and how they are used in data synthesis and augmentation.
Master Probabilistic Modelling and Bayesian approaches for uncertainty estimation in machine learning models.
Understand the applications of probabilistic techniques in data analytics, decision-making, and predictive modelling.
Lectures via Zoom using slides
Practical exercises with Google Colaboratory
Interactive polls and Q&A sessions
Participation in class, discussions and polls
The Zoom codes to view the recordings are available in the Introduction Lecture.
Enrolled students who need access to course materials should contact me via email.
Lecture 1 and 2: Introduction
Zoom recording 1, Zoom recording 2
Lecture 3: Generative Adversarial Networks, Transformers, Autoencoders
Zoom recording 3, Zoom recording 4
Colaboratory Notebooks:
Enhanced Time Series Classification with CNNs and Pre-trained Embeddings
Using the DINOv2 META AI transformer-based model for a galaxy classification task
Transformer-based autoencoder with Consecutive Chunk Masking for Time Series Reconstruction
MNIST Autoencoder: Image Reconstruction & Latent Space Analysis
Lecture 4: Graph Neural Networks
(still Zoom recording 4)
Colaboratory Notebooks:
Additional Material
(still Zoom recording 5)
Colaboratory Notebooks: