Introduction
Attention-deficit/hyperactivity disorder (ADHD) is a prevalent neurodevelopmental disorder that can persist into adulthood and typically initially appears in childhood. It is typified by recurrent patterns of impulsivity, hyperactivity, and inattention that impede growth or functioning (Furman, 2005; Yuyang et al., 2019). Because of its varied presentation, which can change depending on age, gender, and the presence of concomitant diseases, diagnosing ADHD is still a challenging undertaking (Lee et al., 2021; Sharma & Couture, 2014; Yuyang et al., 2019). Complex datasets, including behavioral testing, medical records, and neuroimaging research, can now be analyzed thanks to recent developments in AI, especially machine learning (Lee et al., 2021; Quinn & Madhoo, 2014). These developments could enhance conventional diagnostic techniques and provide a more in-depth understanding of unique profiles (Funkhouser, 2024).
A serious issue still exists despite the expanding corpus of research: women are still underdiagnosed with ADHD (Attoe & Climie, 2023; Martin, 2024; Quinn & Madhoo, 2014). Historically, male-dominant samples have been used to establish and evaluate diagnostic definitions and instruments, which frequently fail to capture the more internalized, less disruptive symptom profiles that women and girls typically display (Funkhouser, 2024; Martin, 2024). For many women with ADHD, this gender prejudice reinforces long-term unfavorable outcomes and unmet clinical needs by contributing to delayed diagnosis, frequent misdiagnosis, and inadequate treatment (Du Randt, 2024; Hinshaw et al., 2022; Sharma & Couture, 2014). The question of whether AI technologies exacerbate or mitigate current diagnostic disparities is gaining attention as they are increasingly incorporated into clinical decision-making processes (Quinn & Madhoo, 2014). The use of AI in ADHD research has been the subject of several recent reviews, such as those by du Randt, who explored ADHD in women and discussed unique challenges and misdiagnosis issues (Du Randt, 2024). Other notable reviews include those that concentrate on explainable AI for ADHD prediction (Andrikopoulos et al., 2024; Navarro-Soria et al., 2024), on computational screening approaches using behavioral and questionnaire data (Qin et al., 2025; Slobodin et al., 2020; Trognon & Richard, 2022), on diagnosis using structural and functional neuroimaging data (Agoalikum et al., 2023; Lohani & Rana, 2023; Peng et al., 2013), and on AI applications for children with ADHD (Sun et al., 2025). These critiques, however, frequently ignore model performance across demographic subgroups, especially gender, in favor of concentrating primarily on technical performance and overall correctness. Therefore, there is still a significant knowledge gap on whether AI-based techniques might lessen gender-based diagnostic discrepancies or if they run the risk of reinforcing the same biases found in traditional evaluations.
This review closes that gap by examining the use of AI techniques in gender-focused ADHD research. Specifically, we examine studies that analyze data from behavioral surveys, neuroimaging, and electronic health records using AI methods that go beyond machine learning, including approaches to symbolic reasoning or rule-based decision systems. We investigate whether different AI techniques identify or lessen gender-based disparities in ADHD diagnosis, and we point out methodological flaws and potential research directions. This study additionally examines whether the included papers disclose model fairness metrics, do gender-specific subgroup analyses, or address representational biases in their training data, all of which are informed by recommended practices described in recent equity-focused AI frameworks (Sun et al., 2025). These factors are crucial for guaranteeing fair healthcare results as well as for evaluating AI accuracy. This review attempts to elucidate AI’s present and potential role in promoting more inclusive and equitable diagnostic methods for ADHD by combining findings from various modalities and periods.
Methods
A thorough, organized literature analysis was carried out to examine how AI technologies have been used to diagnose ADHD in women, as well as whether these approaches lessen or maintain gender-based diagnostic inequalities. Peer-reviewed research published between January 2012 and July 2025 was the focus of the search, which was conducted using PubMed and Google Scholar. The Boolean search query that was utilized was this: (“Attention Deficit” OR “ADHD”) AND (“machine learning” OR “artificial intelligence”) AND (“gender” OR “women” OR “sex differences”) AND (diagnosis OR detection OR classification). Finding papers that used AI or machine learning to diagnose ADHD and also reported sex-specific results or included gender-based analyses was the goal of the inquiry.
Only English-language full-text publications with human subjects were included. We used inclusion criteria based on two important review studies to ensure high methodological reliability and relevance: the review on AI-based ADHD detection by Garg et al. (2022) (Sun et al., 2025), which described validated AI approaches across neuroimaging, behavioral, and physiological modalities, and the systematic review of gender differences in adult ADHD by Thapar et al. (2022) (Hinshaw et al., 2022), which focused on sex-disaggregated analysis in functional impairments. These requirements must be fulfilled by eligible studies: (Furman, 2005) applying AI/ML models (such as SVM, random forests, and deep neural networks) and (Yuyang et al., 2019) employing these models to diagnose ADHD or classify its symptoms. Research that (a) only employed standard statistical techniques, (b) did not present any gender-based data or analysis, or (c) were reviews, opinion pieces, or conference abstracts lacking complete results were not included. Two reviewers independently screened abstracts and titles after duplicates were eliminated. Eligibility disagreements were settled by consensus. We looked at the gender representation (the percentage of female participants), the AI methodology (traditional ML, deep learning, ensemble models, NLP), the data modality (behavioral scales, EEG/MRI, EHR, wearable sensors), and the model performance metrics (accuracy, sensitivity, specificity, AUC, F1 score) of each eligible study. By Thapar et al. (Hinshaw et al., 2022), we also evaluated if the research documented trends in misdiagnosis, functional consequences, or gender-specific symptom patterns. The data types (behavioral, neuroimaging, EHR, multimodal), AI approach, degree of gender-specific reporting (none, partial, full), and publication years (2012–2017, 2018–2022, 2023–2025) were used to group the studies for ease of comparison. A detailed, evidence-based summary of how AI is currently addressing or ignoring gender gaps in ADHD diagnosis was made possible by this methodology.
Results
Utilizing the search criteria described in the Methods section, we found 9 papers describing AI solutions for automatic ADHD diagnosis. These studies were grouped by data type, AI type, gender, year of publication, and key findings.
The results are shown in Figure 1.
Three studies used AI classification models using MRI data as the main input (Bohland et al., 2012; Lohani & Rana, 2023; Peng et al., 2013). Without identifying the participants’ gender or doing a gender-based analysis, one study that used extreme learning machines (ELM) obtained a diagnosis accuracy of 90.18% (Peng et al., 2013). Another study included both male and female participants and used multimodal models to increase accuracy, although gender was not investigated separately (Bohland et al., 2012). Using MRI and machine learning algorithms, a more recent study reported 75% categorization accuracy; however, it did not examine outcomes by gender (Lohani & Rana, 2023). The first MRI-based investigation to specifically stratify results by gender was conducted in 2023. The study used dynamic functional network connectivity (dFNC) analysis and discovered sex-specific variations in brain connections between children and adults with ADHD (Agoalikum et al., 2023).
Data from questionnaires were used in five studies to classify ADHD (Navarro-Soria et al., 2024; Qin et al., 2025; Slobodin et al., 2020; Trognon & Richard, 2022). When gender was included as a predictor but not examined independently, a 2020 study used machine learning to reach 87% accuracy (Slobodin et al., 2020). Another study that used TRAQ10-based models in 2022 did not undertake gender-specific analysis, although it did perform well (Trognon & Richard, 2022). Although both genders were included as features in the model, a 2024 study that used questionnaire data achieved 90% accuracy without performing subgroup analysis (Navarro-Soria et al., 2024). Both SVM (AUC 0.96, accuracy ~85%) and random forest models (AUC 0.99) performed well on questionnaire data, according to a 2025 study; however, no gender-stratified findings were given (Qin et al., 2025).
Only one study, which achieved an accuracy of 81.6%, classified ADHD using physiological cues, such as cardiac and electrodermal characteristics, in combination with SVM (Andrikopoulos et al., 2024). Despite the inclusion of both male and female subjects, no analysis was done specifically for gender. This study underscores the need for larger, more varied samples while highlighting the growing significance of physiological signs as possible objective markers of ADHD.
Gender-specific analyses were generally lacking in all of the research. Only one study performed a sex-stratified analysis and discovered gender variations in brain connectivity, despite the fact that the majority comprised both male and female subjects (Agoalikum et al., 2023). Other studies either included gender as a model component without stratified evaluation (Andrikopoulos et al., 2024; Navarro-Soria et al., 2024; Slobodin et al., 2020), recorded gender without additional analysis (Bohland et al., 2012; Lohani & Rana, 2023; Trognon & Richard, 2022), or did not report gender at all (Peng et al., 2013). The dearth of gender-specific insights in existing AI-assisted ADHD research is a significant gap.
Discussion
Due in large part to the more obvious hyperactive and impulsive characteristics that are commonly linked to ADHD, male participants have historically dominated ADHD research. A distorted picture of ADHD that underrepresents female symptomatology resulted from early diagnostic criteria and treatment regimens that were centered on male symptom patterns (Du Randt, 2024). Consequently, research findings are frequently not applicable to women with ADHD, particularly when they involve neuroimaging or behavioral analysis. By concentrating mostly on men, we run the risk of ignoring the more subtle, frequently internalized symptoms that are more prevalent in women, like anxiety, daydreaming, and emotional dysregulation.
Although both male and female volunteers are increasingly frequently included in studies, our data demonstrate that gender disparities are often not thoroughly investigated. The approach of not examining gender differences explicitly ignores the distinct symptom patterns of women with ADHD, even if it offers a more balanced sample. In addition to displaying more internalized symptoms like emotional dysregulation, women frequently report a greater prevalence of comorbidities like anxiety and depression (Attoe & Climie, 2023; Furman, 2005). When AI models are created, these gender differences are routinely overlooked, which could reinforce diagnostic models that are centered on men. The way ADHD presents differently in women may be missed by AI systems educated on non-gendered data, which could reinforce preexisting biases. The growing emphasis on research that particularly looks at gender variations in ADHD diagnosis is one tactic that could help reduce gender-based diagnostic disparities. Emotional discomfort and co-occurring mental health disorders are more common in women with ADHD than in males, according to research (Attoe & Climie, 2023; Du Randt, 2024; Quinn & Madhoo, 2014), and machine learning algorithms may identify these gender-specific differences (Funkhouser, 2024). Female ADHD patients may have more accurate diagnoses if AI models take into consideration their particular symptom profiles. Nevertheless, despite the encouraging promise, some research indicates that there is a chance of reinforcing preexisting gender prejudices if AI models are not created with these gender inequalities in mind. AI systems trained exclusively on datasets of male patients may not adequately reflect the more complex symptom profiles seen in women (Sun et al., 2025).
The necessity for gender-specific strategies is highlighted by recent court cases showing that women are underdiagnosed with ADHD. There will probably be a greater push in future research toward studies that are only focused on women, as these difficulties highlight differences in diagnostic procedures. This is in line with the expanding movement in AI research to prioritize gender-inclusive data. Particularly in identifying ADHD in women, where conventional diagnostic techniques frequently fall short, improved diagnostic accuracy has been demonstrated by AI models designed to differentiate between male and female symptom presentations (Agoalikum et al., 2023; Lohani & Rana, 2023; Navarro-Soria et al., 2024). However, AI systems run the danger of making the problem of underdiagnosis or misdiagnosis worse if gender is not sufficiently taken into account in the training data, especially when trained on datasets that are primarily male (Lohani & Rana, 2023).
Future studies should look at gender-specific data from a variety of modalities, such as multimodal, behavioral, neuroimaging, and EHR databases (including behavioral, neuroimaging, EHR, and multimodal data). When it comes to treating the complexity of ADHD symptoms in women, each technique has special benefits. Future AI models can more accurately represent the variety of ADHD symptomatology and lessen diagnostic inequities by including gender considerations into various data modalities.
Conclusion
We carried out a thorough literature analysis to look into how gender differences, more especially, the underdiagnosis of women, are taken into account by AI-based approaches to ADHD diagnosis. Nine pertinent peer-reviewed studies that were published between 2012 and 2025 were found. Data modalities (MRI, questionnaires, physiological signals), AI techniques (SVM, RF, ELM), the year of publication, and the degree of gender-specific analysis were all taken into consideration while analyzing these studies. Our results show that even though AI diagnostic accuracy can reach 90%, gender-sensitive analyses are still rare. While the other studies either ignored gender reporting or only considered it as a demographic variable, only one study performed sex-stratified analysis. Ignoring gender-specific analysis reinforces preexisting diagnostic biases and ignores significant clinical differences in female ADHD. Fairness metrics, sex-disaggregated results, and balanced representation in training data must be given top priority in future studies. If these steps are not taken, underdiagnosed groups, particularly women, may continue to be overlooked, which could lead to the continuation of disparities in mental healthcare.
