INCISIVE launches its first interoperable cancer data repository prototype

Published on 27/06/2022

The INCISIVE project has reached its first major milestone after 18 months of work by launching its first prototype of an interoperable federated data repository of thousands of clinical images of breast, lung, prostate, and colorectal cancer. The repository allows the secure and GDPR-compliant sharing of health data by hospitals and other potential data providers with the scientific community working on Artificial Intelligence (AI) related training and experimentation.

The first prototype federates 3 out of the 9 INCISIVE data providers together with all their cancer imaging and other clinical data, which is a fraction of the total data that the project plans to federate. For this purpose, the consortium has collected in a temporary central storage a total of 2.5 million cancer images for more than 7,000 de-identified patients from all the clinical partners involved in the project. The collected data is ready to be integrated into the federated data repository as soon as the remaining 6 data providers complete the set-up of their data nodes.

The project’s coordinator, Gianna Tsakou, Senior Project Manager at MAGGIOLI SpA – Research & Innovation Lab in Athens, highlights the challenges that the consortium has faced to achieve this milestone: “one of the biggest challenges that we successfully addressed was putting in place all the necessary agreements and technical work for ensuring that the massive retrospective data sharing complies with legal and ethical requirements in all 5 European countries and for all 9 data providers where data nodes are planned”. Other challenging tasks have been data collection, preparation and de-identification, establishing a common understanding of the AI services that the project will deliver, designing the platform and implementing an operational version of the federated approach in terms of data storage and federated learning.

Advancing towards the AI Toolbox

The first INCISIVE prototype already includes some of the functionalities expected for the AI Toolbox, which aims to provide decision-making support to medical professionals regarding cancer diagnosis and treatment. INCISIVE partners have already started working on almost all AI models targeted in the project, and the first prototype incorporates those more advanced, namely the models for breast density classification and lung image segmentation for several image modalities.

The first prototype AI toolbox also includes initial approaches on explainable AI, data analysis pipelines that will enable the delivery of the planned AI services, as well as initial work on the User Interface of the AI services so that medical professionals can comprehensively view and read the AI inference results in a way that is as intuitive and transparent as possible.

Use cases supported

The first prototype comprises all the main use cases and platform functionalities foreseen for the project for the potential users of the INCISIVE platform.

Firstly, for the data providers, it supports data preparation, including data de-dentification, annotation and quality checking before sharing their data in the INCISIVE repository. Secondly, for the AI researchers looking for training or validation data for their models, the first prototype supports searching and querying of the data in the federated nodes, allowing the creation of a workspace and the training of their algorithms using federated learning. Finally, for the medical professionals, it supports the delivery of AI-enabled inference services following a models-as-a-service approach, where medical professionals must only provide the image to the system and then get the AI-enabled inference results with only one click.

Next steps

The INCISIVE consortium has started working on the second prototype, which will integrate the remaining 6 data providers into the federated storage and make their data interoperable and reusable during and after the project. The project also expects to include a central data storage node in the integrated platform for those data providers who cannot or do not wish to set up their own node locally and to make available a data de-identification tool that is optimized according to the data providers’ needs, as well as a semi-automatic data annotation tool to accelerate the work of data partners.

The second prototype, which will be available early next year, will also optimize the federated learning process in terms of the usage of computational resources required and the quality of AI models produced from this process.

For more detailed information, watch this video interview with the project’s coordinator, Gianna Tsakou.

https://youtu.be/Vw0S3O4vLEs

[INCISIVE has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 952179.]

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-functional	1 year	The cookie is set by the GDPR Cookie Consent plugin to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Necessary" category .
cookielawinfo-checkbox-others	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Others".
cookielawinfo-checkbox-performance	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance".
XSRF-TOKEN	12 hours	This cookie is set by Wix and is used for security purposes.
__cf_bm	1 day	This cookie is used to distin guish between humans and bots. This is beneficial for the website, in order to make valid reports on the use of their website.

Cookie	Duration	Description
fm_cookie_025ecaa6af3245a6d205881199e31936	1 month	No description
fm_cookie_4b326503275f850eddffd31b87c7377e	1 month	No description
fm_cookie_4d9df4a544a5e6681638318cf10ced2c	1 month	No description
fm_cookie_4e813cd103bf7ad8a897682725127aa0	1 month	No description
fm_cookie_555a60cee7ad30ddd21e8c200d2ce90f	1 month	No description
fm_cookie_70ca6ddb5aa4801753856cca80ec45a4	1 month	No description
fm_cookie_8fa874516026a18fb39256e8ce046bbd	1 month	No description
fm_cookie_901b0e45a4071f78d4142039ce5dc658	1 month	No description
fm_cookie_9a0712a45758a95c9e91b8705c8a2f52	1 month	No description
fm_cookie_a37dd2b5d3c78d33b3a0b8dba6c17bef	1 month	No description
fm_cookie_b31788b694cd26189cc34e76b2911d44	1 month	No description
fm_cookie_ba407410a85eb42807dc5bf2533c7dde	1 month	No description
fm_cookie_c612000b4831eae79c12b914185c7ab2	1 month	No description
fm_cookie_d05ec5059b48f3b8f3429f6df80e6232	1 month	No description
fm_cookie_d7b994af5a899d496f489dba393da561	1 month	No description
fm_cookie_dbbd88794c9c96b9e4506e93bf228a08	1 month	No description
fm_cookie_e4417c01d59ce094836e01d8fa3e8ed0	1 month	No description
fm_cookie_f02c26cd813957432a32a453843c69ca	1 month	No description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.

Cookie	Duration	Description
dayShowInterval43212	Persistent	Unclassified
fm_cookie_af99609d952b19c0ed325b094989c980	1 month	No description
fm_cookie_d7b994af5a899d496f489dba393da561	1 month	No description
mp-priority	Persistent	Unclassified
mypopups_session	12 hours	No description
time43212	Persistent	Unclassified
timestampInterval43212	Persistent	Unclassified
_pk_id.1.d0b1	1 year 27 days	No description
_pk_ses.1.d0b1	30 minutes	No description