Public research and development (R&D) can target areas of research where breakthroughs are needed to deepen AI’s uses in science and engineering. Research goals include going beyond current models based on large datasets and high-performance computing, and to find ways to automate the large-scale creation of findable, accessible, interoperable and reusable (FAIR) data. Another target could be to advance AutoML – automating the design of machine-learning models – to help address the scarcity and high cost of AI expertise. Research challenges could be organised around AutoML for science, and research could be funded that involves applying AutoML in AI-driven science.
Support should also be given for the development of open platforms (such as OpenML and DynaBench) that track which AI models work best for a wide range of problems. Public support is needed to make such platforms easier to use across many scientific fields.
Public R&D could help foster new, interdisciplinary, blue-sky thinking. For instance, natural language processing (NLP) can help to work with the enormous growth of scientific literature. However, current performance claims are overstated. Today’s research in NLP also offers limited incentives for the sort of high-risk, speculative ideation that breakthroughs may need. Research centres, funding streams and/or publication processes could be set up to reward novel methods – even if these are at a nascent stage.
Knowledge bases organise the world’s knowledge by mapping the connections between different concepts, drawing on information from many sources. Governments should support an extensive programme to build knowledge bases essential to AI in science, a need that will not be met by the private sector. Research could work towards creating an open knowledge network to serve as a resource for the whole AI research community. Relatively small amounts of public funding could help bring together AI scientists, scientists from multiple domains and professional societies ̶ along with volunteers ̶ to build the foundations for AI to utilise and communicate professional and commonsense knowledge.
The thematic diversity of research on AI appears to be narrowing and is increasingly driven by the compute- and data-intensive approaches that dominate in large tech companies. Bolstering public R&D might make the field more diverse and help to grow the talent pool. Funders could pay special attention to projects that explore new techniques and methods separate from the dominant deep-learning paradigm. Meanwhile, policy makers could support research to examine and quantify losses of technological resilience, creativity and inclusiveness brought about by a narrowing of AI research and the possible implications of the increasing dominance of industry in AI research.
Much of AI in science involves teaming with people, but funders could also help develop specialised tools to enhance collaborative human-AI teams, and to integrate these tools into mainstream science. Combining the collective intelligence of humans and AI is important, not least because science is now carried out by ever-larger teams and international consortia. Investment in this field of research has lagged other topics in AI.
Among other fields, progress is needed in applying machine learning to medical imaging. Failures during COVID-19 were considerable. As in other uses of machine learning in science, incentives are needed to encourage research on methods with greater validation. Funding should involve more rigorous evaluation practices.