Projects
This page is still under construction !
MatSwarm
The rapid evolution of Industry 4.0 demands seamless collaboration among material research institutions to speed up advanced material discovery. The current platforms struggle with integrating large-scale, heterogeneous datasets, leading to data silos that hinder collaboration and innovation. The University of Science and Technology Beijing addresses these challenges through the National Material Data Management and Services (NMDMS) platform, which aggregates over 14 million material data entries from 30+ institutions, supporting high-throughput experimentation and collaborative research in materials genomic engineering. Key to this platform’s success is its advanced data normalization, distributed storage, and blockchain-based middleware, which ensures secure, cross-institutional data sharing. The MatSwarm framework further enhances this environment by introducing swarm transfer learning to boost model accuracy and generalization on non-i.i.d. data. NMDMS stands as a pioneering tool in materials research, driving innovation and fostering secure, efficient, and collaborative materials computation across institutions.
FedMDH
In the field of materials science, due to various factors such as material sources, testing equipment, and technical methods, the data distributions across different organizations are often non-identical and non-independent (non-i.i.d.) . This data heterogeneity can manifest in various forms, including 1) feature space disparity, 2) sample imbalance, and 3) label distribution variance. We define it as multi-dimensional heterogeneity (MDH). To overcome these challenges, we introduce FedMDH, a federated learning framework designed to tackle Multi-Dimensional Heterogeneity. While FedMDH is applicable to various downstream tasks, this work focuses on the widespread, complex, and underexplored regression tasks in materials science.