
The Role of Artificial Intelligence in Research Quality Assessment

As artificial intelligence (AI) is woven into ever more aspects of daily life, one area where it is beginning to make a significant impact is research quality assessment. Mike Thelwall, a data scientist at the University of Sheffield, UK, recently conducted a study to evaluate the ability of large language models (LLMs) to assess academic papers against the criteria of the Research Excellence Framework (REF), the United Kingdom’s national audit of research quality.

Thelwall’s experiment involved feeding 51 of his own research works into a custom version of ChatGPT, the chatbot built on OpenAI’s large language models, and asking it to evaluate their quality. Surprisingly, the chatbot produced authentic-sounding reports that could pass as human-written. However, the model faltered when faced with a paper on “squirrel surgeons,” highlighting the limitations of AI in judging research quality. Despite this setback, the study sheds light on the potential of AI to aid research evaluation.
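For readers curious what such an experiment might look like in practice, the sketch below shows one hypothetical way to ask an LLM to rate a paper against REF-style criteria through an API call. The prompt wording, the 1*–4* scale, the model name and the helper function are illustrative assumptions, not Thelwall’s actual protocol.

```python
# Minimal, hypothetical sketch of scoring a paper with an LLM, loosely in the
# spirit of the experiment described above. Prompt wording, scale and model
# name are assumptions for illustration only.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def score_paper(title: str, abstract: str) -> str:
    """Ask the model to rate a paper against REF-style quality criteria."""
    prompt = (
        "You are an assessor for the UK Research Excellence Framework. "
        "Rate the following paper's originality, significance and rigour "
        "on a scale of 1* to 4*, and justify the score in a short report.\n\n"
        f"Title: {title}\n\nAbstract: {abstract}"
    )
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content


if __name__ == "__main__":
    print(score_paper("Example title", "Example abstract ..."))
```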

The Potential Benefits of AI in Research Evaluation

The rapid rise of generative AI tools such as ChatGPT and image generators such as DALL-E has sparked discussion about how AI can contribute to research assessment. While AI has shown promise in providing feedback on manuscripts and streamlining administrative processes, there are concerns about its ability to fully evaluate research quality. Weixin Liang, a computer science PhD student at Stanford University, conducted a study on the use of the LLM GPT-4 to assess manuscripts; researchers who received the AI-generated feedback generally found it helpful, but noted that it lacked in-depth critique.

Despite these limitations, AI could assist in certain aspects of research evaluation, particularly in providing feedback on early drafts of papers. However, extensive testing and cautious implementation are needed to ensure that AI tools are used appropriately and do not compromise the integrity of the peer review process. As the research community explores new ways to assess research quality, AI could play a valuable role in enhancing efficiency and productivity.

Challenges and Considerations in AI-Assisted Research Evaluation

While the use of AI in research evaluation holds promise for improving efficiency and accuracy, there are several challenges and considerations that must be addressed. Marnie Hughes-Warrington, deputy vice-chancellor of research and enterprise at the University of South Australia, emphasizes the importance of evolving definitions of research quality and the need to assess non-traditional research outputs beyond publications and citations.

Elizabeth Gadd, head of research culture and assessment at Loughborough University, raises concerns about the potential misuse of AI in research evaluation and the risk of exacerbating existing biases and inequalities. She stresses the importance of evaluating only where necessary and maintaining human involvement in decision-making processes to ensure fair and ethical assessments. Gadd’s work on developing the SCOPE framework for responsible research evaluation highlights the need for a balanced approach to incorporating AI in research assessment.

As the research community navigates the complexities of integrating AI into research evaluation, it is essential to prioritize transparency, accountability, and ethical considerations. By carefully considering the implications of AI in research assessment and implementing robust guidelines and safeguards, researchers can harness the potential of AI to enhance the quality and impact of research outcomes.