Unmasking Funding Biases: How AI Validates Research Grant Criteria
Explore how interpretable machine learning audits research grant criteria, revealing mismatches between official policies and actual evaluation outcomes for fairer, more transparent funding decisions.
Research funding is the lifeblood of scientific progress, and the frameworks used to assess and award grants are critical. Yet, these foundational criteria are rarely subjected to rigorous empirical review. In Brazil, the Research Productivity (PQ) Grant from the National Council for Scientific and Technological Development (CNPq) significantly shapes academic careers. The grant's detailed regulations outline various dimensions of academic activity, from bibliographic output to human resource training, all intended to guide evaluation decisions. However, a crucial question remains: do these formally prescribed criteria truly align with the factors that actually determine grant outcomes?
The Imperative for Evidence-Based Research Assessment
Globally, research funding agencies strive to identify and reward scientific excellence, but the effectiveness and fairness of their evaluation processes are often taken for granted. Traditional reviews, while valuable, may not fully capture the complex interplay of factors that truly define research productivity. Without systematic scrutiny, there's a risk that policies designed to promote certain academic activities might inadvertently prioritize others, leading to potential misalignments between policy intent and practical outcomes. This lack of transparency can affect career progression, resource allocation, and overall scientific equity.
Existing literature frequently maps the profiles of grant holders across various disciplines, often relying on bibliometric and demographic data. While these studies offer insights into the distribution of awards, they typically treat official criteria as static background information rather than dynamic variables to be rigorously tested. This leaves the actual mechanisms of grant evaluation largely unexplored. Bridging this gap requires a method that can dissect the practical validity of these criteria, moving beyond mere descriptive analysis to uncover the underlying statistical signals that truly differentiate award recipients.
Leveraging Interpretable Machine Learning for Policy Validation
To address this challenge, a recent academic paper titled "Auditing automated research assessment: an interpretable machine learning approach to validate funding criteria" by Gouveia, Silva, and Amancio, posted on arXiv, introduced an innovative approach. The researchers operationalized the officially stated criteria of the Brazilian PQ Grant into measurable variables, extracting data from structured curriculum vitae (CVs) and bibliometric sources such as OpenAlex. Policy-defined indicators were thereby treated as testable hypotheses rather than assumptions. The methodology then applied a block-based adaptation of the Boruta feature selection algorithm across several machine learning classifiers.
In simpler terms, feature selection is a machine learning technique for identifying which variables, or "features," in a dataset contribute most to a prediction or classification. The Boruta algorithm, in particular, is designed to find all relevant features, even when they are correlated with one another, making it robust for complex datasets like academic profiles. Machine learning classifiers are algorithms that learn from data to categorize new observations into predefined classes, in this case different grant levels. By analyzing how well these models could predict grant levels from the various criteria, the team could determine the statistical contribution of each dimension, with a specific focus on identifying top-tier (Level 1A) researchers. This approach reveals not just whether a prediction can be made, but why, offering crucial interpretability. ARSA Technology regularly employs custom AI solutions for complex data analysis tasks that require high accuracy and interpretability.
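To make the core idea concrete, here is a minimal sketch of Boruta-style feature selection on synthetic data. This is not the paper's block-based adaptation (which is described only at a high level); it illustrates the underlying Boruta logic: a feature is deemed relevant only if its importance repeatedly beats that of randomly shuffled "shadow" copies. The indicator names are hypothetical stand-ins for policy-derived variables.

```python
# Boruta-style feature selection sketch: compare each real feature's
# importance against shuffled "shadow" features over repeated trials.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(42)

# Hypothetical policy-derived indicators (names are illustrative only).
feature_names = ["publications", "supervision", "management",
                 "teaching", "committees", "outreach"]

n = 500
X = rng.normal(size=(n, len(feature_names)))
# In this toy example, the grant outcome depends on only a few indicators.
y = (1.5 * X[:, 0] + 1.0 * X[:, 1] + 0.8 * X[:, 2]
     + rng.normal(scale=0.5, size=n) > 0).astype(int)

hits = np.zeros(len(feature_names))
n_trials = 20
for _ in range(n_trials):
    # Shadow features: each column independently shuffled, which destroys
    # any real association with the target while keeping the distribution.
    X_shadow = np.apply_along_axis(rng.permutation, 0, X)
    X_aug = np.hstack([X, X_shadow])

    model = RandomForestClassifier(n_estimators=200, random_state=0)
    model.fit(X_aug, y)

    real_imp = model.feature_importances_[: X.shape[1]]
    shadow_max = model.feature_importances_[X.shape[1]:].max()
    hits += real_imp > shadow_max  # a "hit" = beats the best shadow

for name, h in sorted(zip(feature_names, hits), key=lambda t: -t[1]):
    print(f"{name:12s} hits: {int(h)}/{n_trials}")
```

Running this, the three indicators actually wired into the outcome accumulate hits across trials while the noise indicators do not, which is exactly the kind of "detectable statistical contribution" test the study applies to each official criterion.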
Key Findings: Unpacking the Real Drivers of Grant Success
The study yielded compelling insights. The machine learning models achieved high predictive performance, with mean AUC (Area Under the Receiver Operating Characteristic Curve) scores reaching 0.96. On this scale, 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation, so 0.96 is exceptionally high, indicating that the models are highly accurate in distinguishing between grant levels. This robust performance confirms that PQ levels indeed carry a strong and structured statistical signal, meaning there are discernible patterns in the data that reliably differentiate grant recipients.
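For readers who want to see how such a figure is typically estimated, the sketch below computes a cross-validated mean ROC AUC with scikit-learn. The data and classifier are synthetic placeholders, not the paper's actual pipeline, but the evaluation pattern is standard.

```python
# Estimating mean AUC via cross-validation on synthetic data.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

# Stand-in for researcher profiles labelled top-tier (1) or not (0).
X, y = make_classification(n_samples=800, n_features=20,
                           n_informative=5, random_state=0)

model = GradientBoostingClassifier(random_state=0)
# The "roc_auc" scorer ranks predicted probabilities against true labels.
scores = cross_val_score(model, X, y, cv=5, scoring="roc_auc")
print(f"mean AUC: {scores.mean():.2f} (+/- {scores.std():.2f})")
```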
However, the analysis further revealed that the explanatory power for these predictions was heavily concentrated within a limited subset of features. Specifically, bibliographic production (publications), graduate-level supervision, and institutional management roles emerged as the most significant drivers. Surprisingly, several other criteria explicitly emphasized in the official regulations demonstrated no detectable statistical contribution to the classification outcomes. This crucial finding suggests a significant potential misalignment: what is formally stated as important in policy documents does not always translate into an effective signal in the actual evaluation process. The practical evaluative signal appears to be substantially more compact and streamlined than the official framework implies. Our AI API solutions are often deployed in similar scenarios to extract and analyze critical data points, providing actionable insights.
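One simple way to visualize this kind of concentration is to group feature importances by the policy "block" each variable belongs to and check what share of the total a handful of blocks capture. The sketch below does this on synthetic data; the block names and feature groupings are invented for illustration and do not reproduce the paper's actual variables.

```python
# Illustrative check of how concentrated the explanatory signal is:
# sum feature importances within each hypothetical policy block.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# shuffle=False places the informative features first, so the early
# blocks carry the signal and the later ones are mostly noise.
X, y = make_classification(n_samples=600, n_features=8,
                           n_informative=3, shuffle=False, random_state=1)

# Hypothetical mapping of feature columns to policy dimensions.
blocks = {"publications": [0, 1], "supervision": [2, 3],
          "management": [4, 5], "other_criteria": [6, 7]}

model = RandomForestClassifier(n_estimators=300, random_state=1).fit(X, y)
imp = model.feature_importances_

shares = {name: imp[idx].sum() for name, idx in blocks.items()}
for name, share in sorted(shares.items(), key=lambda t: -t[1]):
    print(f"{name:15s} {share:.1%} of total importance")
```

In output like this, a few blocks dominate the importance budget while the rest contribute almost nothing, mirroring the study's finding that the practical evaluative signal is far more compact than the official framework suggests.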
Implications for Transparent and Efficient Funding
The findings of this research offer invaluable evidence-based insights for the refinement and transparency of research assessment policies. For funding agencies, understanding which criteria truly drive evaluation outcomes can lead to more targeted and efficient frameworks. By focusing on the most impactful metrics, agencies can streamline processes, reduce administrative burden, and ensure that resources are allocated based on empirically validated indicators of productivity. This can enhance fairness, reduce potential biases, and improve the overall interpretability of evaluation decisions.
For researchers, these insights provide a clearer understanding of the factors that genuinely contribute to grant success. While official guidelines offer a broad overview, empirical validation can help academics align their efforts with the criteria that statistically prove to be most influential. This allows for more strategic career planning and a more informed approach to demonstrating research impact. Furthermore, by highlighting the disparity between stated policy and practical application, the study encourages a continuous dialogue between policymakers and the scientific community to foster more adaptive and responsive evaluation systems. Companies like ARSA, experienced since 2018 in delivering robust AI and IoT solutions, can assist organizations in implementing such data-driven policy evaluation frameworks.
The Future of Evidence-Based Policy
The application of interpretable machine learning in auditing research assessment frameworks represents a significant step forward for evidence-based policy-making. It transforms what were once theoretical assumptions into testable hypotheses, allowing for data-driven validation and refinement. As research ecosystems become more complex and the volume of academic output continues to grow, such analytical tools will become indispensable for ensuring that funding decisions are not only fair and transparent but also maximally effective in fostering scientific innovation. This approach moves beyond simply characterizing researcher profiles to contribute to broader discussions on the design of research productivity indicators and the continuous improvement of academic reward systems.
By embracing these advanced analytical methodologies, institutions can evolve their assessment practices to be more reflective of real-world impact and productivity. This not only benefits individual researchers and funding bodies but also strengthens the integrity and efficiency of the global scientific endeavor.
Ready to enhance your organization's data analysis and decision-making processes with advanced AI? Explore ARSA Technology’s solutions and contact ARSA for a free consultation today.
Source: Gouveia, R. P., Silva, T. C., & Amancio, D. R. (2026). Auditing automated research assessment: an interpretable machine learning approach to validate funding criteria. arXiv preprint arXiv:2604.09827.