Our focus is on improving optimization algorithms in online decision-making. Using techniques from online algorithms for solving optimization problems, we can provide worst-case guarantees, but normal (averagecase) behavior may not be satisfactory. Using techniques from machine learning, we can often provide good behavior in practice, but guarantees are lacking, and in particular missing for situations not captured by the training data. We aim at combining the best features from these two areas.
Strategiske, tværgående forskningsprojekter, ledet af DIREC-forskere – og ofte i samarbejde med eksterne samarbejdspartnere – har til formål at levere værdi for både den videnskabelige verden og samfundet. Formålet med SciTech-projekterne er at opbygge forsknings- og uddannelseskapaciteten på universiteterne.
The search for nearest neighbors is an emerging and increasingly vital component in data analysis tasks, for example using vector embedding databases. Typically, the search is the bottleneck in terms of efficiency. Approximate nearest neighbor (ANN) search methods are often employed to speed up the application. However, different methods for ANN search come with different biases that can be positive or negative for the downstream application. In this project, the bias of different ANN methods and its impact on different applications will be studied.
There is an unmet need for decentralised privacy-preserving machine learning. Cloud computing has great potential, however, there is a lack of trust in the service providers and there is a risk of data breaches. A lot of data are private and stored locally for good reasons, but combining the information in a global machine learning system could lead to services that benefit all.
AI is radically changing society and the main driver behind new AI methods and systems is machine learning. Machine learning focuses on finding solutions for, or patterns in, new data by learning from relevant existing data. Thus, machine learning algorithms are often applied to large datasets and then they more or less autonomously find good solutions by finding relevant information or patterns hidden in the data.