Team awarded NSF/AFOSR grant for UAV digital twin foundations


TDAI core faculty Subhadeep Paul (Department of Statistics) and Mahdi Khali (Department of Computer Science and Engineering) are PI and co-PI respectively on a new NSF award, "Heterogeneous transfer and federated learning for digital twin in unmanned aerial vehicles," along with co-PI Debdipta Goswami (Department of Mechanical and Aerospace Engineering).
The MATH-DT program that administers the award is jointly funded by NSF Division of Mathematical Sciences (DMS) and Air Force Office of Scientific Research (AFOSR) to support foundational mathematical and statistical research on Digital Twins in applied sciences. The team was formed during brainstorming sessions organized by TDAI, which offers proposal development services and other support for interdisciplinary research teams pursuing extramural funds for projects with data science and/or computational components.


Abstract
In recent years, the ambitious concept of Digital Twin (DT), with the aspiration of creating a virtual representation of a complex asset or process, has received enormous interest from industry leaders, engineers, policymakers, and scientists. The DT promises to digitally replicate the high-dimensional, multi-modal, and dynamic physical systems and assist decision-making with reliable predictions. For example, in aerospace applications, a DT that can accurately simulate flight operational conditions based on data and feedback from the "flying twin" has great potential to improve aerial vehicles' safety, reliability, and control, especially in extreme service conditions. However, the development of DTs has been largely ad hoc, and generalizable mathematical foundations are poorly understood. The investigators aim to develop Artificial Intelligence (AI) solutions in terms of heterogeneous transfer and personalized federated learning to solve mathematical and computing challenges in developing a DT for Unmanned Aerial Vehicles (UAVs). An interdisciplinary approach is taken that combines expertise in Statistics, Aerospace Engineering, and Computer Science to develop a holistic statistical and computational framework that is useful for the development of DTs generally, even beyond aerospace applications. The team will undertake educational outreach programs in collaboration with university partners. The team will also teach and mentor middle school and high school students in a free summer camp to explore data science and analytics. In addition, the team will teach and train graduate and undergraduate students in interdisciplinary data science and engineering. Finally, to enable rapid translation of the proposed research to digital twins in academic and industry environments, the research team will create R and Python packages and libraries.
The project will develop (1) a flexible heterogeneous transfer learning approach that allows for unknown, possibly nonlinear functional relationships between the response and the features and allows for differing covariates or control conditions and (2) a personalized federated learning approach that accommodates diverse computational resources and storage capacities of local devices with applications to (3) solve challenges in developing DT in UAVs. The project will innovate in all three thrusts of this project. First, while the potential of DTs to improve the safety and reliability of their flying twins has been long recognized, there are very few principled approaches toward developing a DT with uncertainty quantification. The project contributes to understanding the mathematical challenges and building a mathematical foundation for DTs of UAVs. Second, while several methods exist for knowledge transfer between domains, most approaches assume the same set of features between a target domain and a source domain. Moreover, they do not provide rigorous uncertainty quantification. The project will develop a heterogeneous transfer learning framework with statistical guarantees. Third, while federated learning (FL) has emerged as an important method for training DTs with distributed data across multiple edge devices, the main assumption of such methods is that agents or edge devices learn a model with the same parameters. However, in DT applications, agents may have different computational and storage resources. This project responds to the challenge by developing FL algorithms where the learning tasks are personalized to the agents' resource constraints.