PhD Defence by Emil Njor

Data Aware embedded machine learning

Principal supervisor: 
Professor Xenofon Fafoutis, DTU Compute

Co-supervisor: 
Professor Jan Madsen , DTU Compute

Examiners:
 Associate Professor Luca Pezzarossa, DTU Compute
Professor Manuel Roveri, Politecnico di Milano, Associate Professor Pinar Tözün , IT University of Copenhagen

Chairperson at defence:
 Associate professor Matthias Bo Stuart, DTU Compute

Abstract

Over the past decade, Machine Learning (ML) systems have revolutionized the world by turning vast datasets into intelligent systems. Doing so, they have enabled ground-breaking applications, such as human-like chatbots and autonomous vehicles, making a profound impact on everyday lives. Perhaps even more important, ML models have revolutionized fields like drug discovery, medical diagnostics, and crop pest detection — advancements with the potential to save thousands, if not millions, of lives.

However, as ML systems grow in capability and prevalence, their darker sides have come under increased scrutiny. Large language models, for instance, consume orders of magnitude more energy than traditional internet searches, contributing to global environmental challenges. Furthermore, the hardware powering these systems is often prohibitively expensive for smaller organizations and developing economies, exacerbating global in equalities. Moreover, their reliance on centralized processing raises significant concerns around data privacy, security, and fairness.

These challenges are not intrinsic to ML but reflect a paradigm that prioritizes performance over sustainability and accessibility. Embedded ML offers a compelling alternative, focusing on designing lean ML systems that run efficiently on low-power devices. By leveraging advanced optimization techniques — such as utilizing efficient hardware architectures, model quantizing or pruning, and automated model design using AutoML — Embedded ML aims to deliver a new generation of ML systems that are private, energy-efficient and reliable.

Despite its promise, Embedded ML adoption in industry remains slow, largely due to the deep technical expertise required to implement its optimizations effectively. This thesis addresses these barriers by presenting new insights into the development of Embedded ML systems for Predictive Maintenance (PdM) applications, a new large and high-quality dataset for advancing Embedded ML research, and Data Aware Neural Architecture Search (NAS) — a novel AutoML technique enabling unprecedented tiny ML models. Through these contributions, this thesis advances Embedded ML toward its transformative vision: a future of sustainable, accessible, and ubiquitous machine learning.

See event here