POLYHEDRON vs COVID (Multilanguage interaction on high-priority scientific questions)
The problem to be solved
The CORD-19 dataset is the most extensive machine-readable coronavirus literature collection available for data and text mining to date, with over 57,000 articles, more than 45,000 of which have full text: https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge This project helps the scientific community answer high-priority scientific questions related to COVID-19 and interact effectively with each other in different languages while answering these questions.
We use semantic-linguistic analysis to process large volumes of unstructured information: structuring it, establishing contextual links between the processed documents, forecasting, and supporting rational choice, with the results delivered as web-oriented information-analytical solutions.
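As a rough illustration of what "establishing contextual links between documents" can mean in practice, the sketch below links documents whose TF-IDF vectors have a high cosine similarity. This is a generic stand-in technique chosen for brevity, not the project's actual semantic-linguistic method; the function names, the whitespace tokenizer, and the 0.2 threshold are all assumptions made for this example.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive tokenizer (assumption): lowercase, split on non-alphanumerics.
    cleaned = "".join(c.lower() if c.isalnum() else " " for c in text)
    return cleaned.split()


def tfidf_vectors(docs):
    # Build one sparse TF-IDF vector (dict term -> weight) per document.
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for toks in tokenized for term in set(toks))
    vectors = []
    for toks in tokenized:
        if not toks:
            vectors.append({})
            continue
        tf = Counter(toks)
        vectors.append({
            # Smoothed IDF keeps weights positive for terms found everywhere.
            t: (c / len(toks)) * (math.log((1 + n) / (1 + df[t])) + 1.0)
            for t, c in tf.items()
        })
    return vectors


def cosine(a, b):
    # Cosine similarity between two sparse vectors.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def link_documents(docs, threshold=0.2):
    # Return (i, j, score) for every document pair scoring at or above threshold.
    vecs = tfidf_vectors(docs)
    links = []
    for i in range(len(vecs)):
        for j in range(i + 1, len(vecs)):
            score = cosine(vecs[i], vecs[j])
            if score >= threshold:
                links.append((i, j, score))
    return links


if __name__ == "__main__":
    sample = [
        "coronavirus incubation period in humans",
        "incubation period of the coronavirus",
        "marketing research on consumer goods",
    ]
    for i, j, score in link_documents(sample):
        print(f"document {i} <-> document {j}: {score:.2f}")
```

With the three sample strings, only the first two documents share vocabulary, so they are the only pair linked. A multilingual system would additionally need language-aware tokenization and some form of cross-language term mapping, which this sketch does not attempt.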
What you have done during the weekend (hackathon)
We used datasets from https://connect.medrxiv.org/relate/feed/181 and https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge to answer the questions posed at https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge/tasks?taskId=568 and created a technical prototype to solve the problem described above.
Who is doing what in the team:
Dmytro Filistieiev @Dmytro works as product manager and is responsible for the project architecture.
Maksym Nadutenko @maxkrb works on the back end in C# and programmed corpus-based text-mining techniques grounded in lexicographical theory. This includes semantic indexing and ontology-based fuzzy search.
Vitalyi Pryhodnuk @tangens91 works on most of the front end and a large part of the back end, including many text-processing submodules.
Vjacheslav Gorborukov @slavon07 works on the back end, including the MCDA (multiple-criteria decision analysis) modules.
Oleksandr Stryzhak @sae953 is the scientific advisor of the project.
The solution’s impact on the crisis
This project will enable scientists and healthcare professionals around the world to spend less time retrieving specific scientific data on COVID-19 and to interact effectively with each other in different languages (especially in Ukraine), streamlining the flow of data and knowledge on COVID-19 from around the world. As a result, a more rapid response to COVID-19 and similar crises becomes more likely.
The necessities in order to continue the project
To continue the project, we need support for hosting it in the cloud and for maintaining a team of programmers and a support team until the project is complete. The estimated cost for one year is approximately 100,000 to 1,900,000 euros.
The value of your solution(s) after the crisis
After the COVID-19 crisis, the project can be applied in any field that requires processing large amounts of information, for example research in ecology, consumption of goods and services, standardization of products and processes, and marketing research.