Japanese

Research News

Technology/Materials

AI Technology for Safer Integrated Analysis of Data Held by Multiple Organizations

image picture Image by ART STOCK CREATIVE/Shutterstock

Researchers from the University of Tsukuba have developed a more secure artificial intelligence technology, "non-readily identifiable data collaboration analysis," which integrates and analyzes personal data held by multiple companies, municipalities, hospitals, etc. by sharing only abstract data that cannot be readily identified with the original data.


Tsukuba, Japan—Collecting sufficient data without distribution bias is essential to improve the performance of artificial intelligence (AI) analysis. AI technology is required to collect data dispersed across multiple institutions and safely perform integrated analysis while keeping certain information confidential, such as personal information and know-how. Specifically, the use of data is believed to be restricted if personal information is involved and identifiable in the shared data.


This research team has developed a secure AI technology called "non-readily identifiable data collaboration analysis" that shares only abstract data that cannot be readily identified with the original data and allows the integrated analysis of personal information held by multiple parties, such as companies, local governments, hospitals, and other organizations. The research team has introduced a framework for the mathematical definitions of readily identifiable data. Thereafter, the team has proposed an integrated analysis algorithm that shares only the abstracted data that cannot be readily identified with the original data. This will enable more data to be used in analysis involving personal information, which, in turn, is expected to significantly improve the accuracy of AI analysis.


Specific applications include disease prediction via estimation of risk factors through the integrated analysis of test and medication data from multiple medical institutions and the enhancement of educational effectiveness through the integrated analysis of student data from multiple educational institutions. This technology is anticipated to facilitate the development of a new platform that gathers high-quality personal information from various institutions while protecting the original data and employing AI for comprehensive data analysis.


###
This work was supported in part by the New Energy and Industrial Technology Development Organization (NEDO), Japan, Japan Science and Technology Agency (JST) (No. JPMJPF2017), and the Japan Society for the Promotion of Science (JSPS), Japan, Grants-in-Aid for Scientific Research (Nos. JP19KK0255, JP21H03451, JP22H00895, JP22K19767).



Original Paper

Title of original paper:
Non-readily identifiable data collaboration analysis for multiple datasets including personal information
Journal:
Information Fusion
DOI:
10.1016/j.inffus.2023.101826

Correspondence

Professor SAKURAI Tetsuya
Director of Center for Artificial Intelligence Research (C-AIR), University of Tsukuba


Related Link

Center for Artificial Intelligence Research (C-AIR)