Elfadil Abdalla Mohamed has received his MSc and Ph.D. in Computer Science from University of Technology Malaysia (UTM), Malaysia, in 2002. He is currently working as an Associate Professor at College of Information Technology, Ajman University, United Arab Emirates. His main areas of research interest are in data mining and database.
The recent COVID-19 pandemic has forced educational institutions worldwide to adopt e-learning. UAE higher education institutions have implemented e-learning systems and programs to cope with this unprecedented situation. This paper measured the strength of association between key aspects of e-learning systems and programs and students’ motivation to learn in Ajman University (AU). Cronbach’s coefficient alpha was used to test the internal consistency reliability of key aspects of e-learning (EL-8) and students’ motivation to learn (SML-16). Exploratory factor analysis was used to test the validity of, and coherence of patterns in, the data. Parametric and non-parametric methods were used to investigate the strength of association between key aspects of e-learning and students’ motivation to learn in AU. The results indicated that motivation variables were more strongly correlated with both e-teaching materials and e-assessments key aspects relative to others such as e-discussion, and e-grade checking and feedback
Protein complexes are groups of two or more polypeptide chains that bind to form noncovalent networks of protein interactions. Over the past decade, researchers have created a number of means of computing the ways in which protein complexes and their members can be identied through these interaction networks. Although most of the existing methods identify protein functional complexes from the protein-protein interaction networks (PPIs) at a fairly decent level, the applicability of advanced graph network methods has not yet been adequately investigated. This paper proposes various graph convolutional network (GCN) methods to improve the detection of protein complexes. We rst formulate the protein complex detection problem as a node classication problem. Then, we developed a Neural Overlapping Community Detection (NOCD) model to cluster the nodes (proteins) using a complex afliation matrix. A representation learning approach, that combines a multi-class GCN feature extractor (to obtain the nodes' features) and a mean shift clustering algorithm (to perform the clustering), is also utilized. We convert the dense-dense matrix operations into dense-sparse or sparse-sparse matrix operations to improve the efciency of the multi-class GCN network by reducing space and time complexities. The proposed solution signicantly improves the scalability of the existing GCN. Finally, we apply clustering aggregation to nd the best protein complexes. A grid search is then performed on various detected complexes obtained via three well-known protein detection methods, namely ClusterONE, CMC, and PEWCC, with the help of the Meta-Clustering Algorithm (MCLA) and the Hybrid Bipartite Graph Formulation (HBGF). We test the proposed GCN-based methods on various publicly available datasets and nd that they perform signicantly better than previous state-of-the-art methods. The code/data are available for free download from https://github.com/Analystharsh/GCN_complex_detection.
Psychophysiological and cognitive tests as well as other functional studies can detect pre-symptomatic stages of dementia. When assembled with structural data, cognitive tests diagnose NDs more reliably thus becoming a multimodal diagnostic tool. Objective. Our main goal is to improve screening for dementia by studying an association between the brain structure and its function. Hypothetically, the brain structure-function association has features specific for either disease-related cognitive deterioration or normal neurocognitive slowing while aging. Materials and methods. We studied a total number of 287 cognitively normal cases, 646 of mild cognitive impairment, and 369 of Alzheimer's disease. To work out a new marker of neurodegeneration, we created a convolutional neural network-based regression model and predicted the cognitive status of the cognitively preserved examinee from the brain MRI data. This was a model of normal aging. A big deviation from the model suggests a high risk of accelerated cognitive decline. Results. The deviation from the model of normal aging can accurately distinguish cognitively normal subjects from MCI patients (AUC D 0.9957). We also achieved creditable performance in the MCI-versus-AD classification (AUC D 0.9793). We identified a considerable difference in the MMSE test between A-positive and A-negative demented individuals according to ATN-criteria ( 6.27 ± 1.82 vs 5.32 ± 1.9; p < 0:05). Conclusion. The deviation from the model of normal aging can be potentially used as a marker of dementia and as a tool for differentiating Alzheimer's disease from non-Alzheimer's dementia. To find and justify a reliable threshold levels, further research is required.
This paper explores the importance of the electric eel foraging optimization (EEFO) algorithm in addressing feature selection (FS) problems, with the aim of ameliorating the practical benefit of FS in real-world applications. The use of EEFO to solve FS problems props our goal of providing clean and useful datasets that provide robust effectiveness for use in classification and clustering tasks. High-dimensional feature selection problems (HFSPs) are more common nowadays yet intricate where they contain a large number of features. Hence, the vast number of features in them should be carefully selected in order to determine the optimal subset of features. As the basic EEFO algorithm experiences premature convergence, there is a need to enhance its global and local search capabilities when applied in the field of FS. In order to tackle such issues, a binary augmented EEFO (BAEEFO) algorithm was developed and proposed for HFSPs. The following strategies were integrated into the mathematical model of the original EEFO algorithm to create BAEEFO: (1) resting behavior with nonlinear coefficient; (2) weight coefficient and confidence effect in the hunting process; (3) spiral search strategy; and (4) Gaussian mutation and random perturbations when the algorithm update is stagnant. Experimental findings confirm the effectiveness of the proposed BAEEFO method on 23 HFSPs gathered from the UCI repository, recording up to a 10% accuracy increment over the basic BEEFO algorithm. In most test cases, BAEEFO outperformed its competitors in classification accuracy rates and outperformed BEEFO in 90% of the datasets used. Thereby, BAEEFO has demonstrated strong competitiveness in terms of fitness scores and classification accuracy. When compared to its competitors, BAEEFO produced superior reduction rates with the fewest number of features selected. The findings in this research underscore the critical need for FS to combat the curse of dimensionality concerns and find highly useful features in data mining applications such as classification. The use of a new meta-heuristic algorithm incorporated with efficient search strategies in solving HFSPs represents a step forward in using this algorithm to solve other practical real-world problems in a variety of domains.
A fundamental obstacle to healthcare transformation continues to be the acquisition of knowledge and insightful data from complex, high dimensional, and heterogeneous biological data. As technology has improved, a wide variety of data sources, including omics data, imaging data, and health records, have been available for use in healthcare research contexts. Electronic health records (EHRs), which are digitalized versions of medical records, have given researchers a significant chance to create computational methods for analyzing healthcare data. EHR systems typically keep track of all the data relating to a patient’s medical history, including clinical notes, demographic background, and diagnosis details. EHR data can offer valuable insights and support doctors in making better decisions related to disease and diagnostic forecasts. As a result, several academics use deep learning to forecast diseases and track health trajectories in EHR. Recent advances in deep learning technology have produced innovative and practical paradigms for building end-to-end learning models. However, scholars have limited access to online HER databases, and there is an inherent need to address this issue. This research examines deep learning models, their architectures, and readily accessible EHR online databases. The goal of this paper is to examine how various architectures, models, and databases differ in terms of features and usability. It is anticipated that the outcomes of this review will lead to the development of more robust deep learning models that facilitate medical decision-making processes based on EHR data and inform efforts to support the selection of architectures, models, and databases for specific research purposes.
With the increasing dimensionality of the data, High-dimensional Feature Selection (HFS) becomes an increasingly difficult task. It is not simple to find the best subset of features due to the breadth of the search space and the intricacy of the interactions between features. Many of the Feature Selection (FS) approaches now in use for these problems perform significantly less well when faced with such intricate situations involving high-dimensional search spaces. It is demonstrated that meta-heuristic algorithms can provide sub-optimal results in an acceptable amount of time. This paper presents a new binary Boosted version of the Spider Wasp Optimizer (BSWO) called Binary Boosted SWO (BBSWO), which combines a number of successful and promising strategies, in order to deal with HFS. The shortcomings of the original BSWO, including early convergence, settling into local optimums, limited exploration and exploitation, and lack of population diversity, were addressed by the proposal of this new variant of SWO. The concept of chaos optimization is introduced in BSWO, where initialization is consistently produced by utilizing the properties of sine chaos mapping. A new convergence parameter was then incorporated into BSWO to achieve a promising balance between exploration and exploitation. Multiple exploration mechanisms were then applied in conjunction with several exploitation strategies to effectively enrich the search process of BSWO within the search space. Finally, quantum-based optimization was added to enhance the diversity of the search agents in BSWO. The proposed BBSWO not only offers the most suitable subset of features located, but it also lessens the data’s redundancy structure. BBSWO was evaluated using the k-Nearest Neighbor (k-NN) classifier on 23 HFS problems from the biomedical domain taken from the UCI repository. The results were compared with those of traditional BSWO and other well-known meta-heuristics-based FS. The findings indicate that, in comparison to other competing techniques, the proposed BBSWO can, on average, identify the least significant subsets of features with efficient classification accuracy of the k-NN classifier.
Education data mining has emerged as a powerful technique for uncovering hidden patterns in educational data, forecasting academic achievement, and increasing retention rates. In this work, the performance of nine regression algorithms has been evaluated in predicting students’ academic success. Information from 650 students enrolled in three different computing majors has been assembled into a dataset. The following input attributes were chosen: attendance rate, course grade, gender, course category, delivery mode, school type, and high school score; the grade point average was the target variable. Findings indicate that Random Forest Regressor, Light Gradient Boosting Machine, Gradient Boosting Regressor, and Extra Tree are the four most effective regression algorithms in the order given. Except for the Light Gradient Boosting Machine approach, the other three algorithms showed that course grade is the most important predictor of a student's GPA, followed by high school score. All four algorithms showed that gender is the least reliable indicator of GPA. Future work will conduct sensitivity analysis to evaluate the impact of individual attributes on predictions to gain more insight into the factors affecting students’ performance. The effect of social, economic, and demographic features will also be investigated Keywords— Educational Data Mining, Machine Learning, Regression Models, Student Success Prediction.
Background: Universities need to find strategies for improving student retention rates. Predicting student academic performance enables institutions to identify underachievers and take appropriate actions to increase student completion and lower dropout rates. Method: In this work, we proposed a model based on random forest methodology to predict students’ course performance using seven input predictors and find their relative importance in determining the course grade. Seven predictors were derived from transcripts and recorded data from 650 undergraduate computing students. Results: Our findings indicate that grade point average and high school score were the two most significant predictors of a course grade. The course category and class attendance percentage have equal importance. Course delivery mode does not have a significant effect. Conclusion: Our findings show that courses students at risk find challenging can be identified, and appropriate actions, procedures, and policies can be taken.
Banking risk measurement and management remain one of many challenges for managers and policymakers. This study contributes to the banking literature and practice in two ways by (a) proposing a risk ranking index based on the Mahalanobis Distance (MD) between a multidimensional point representing a bank’s risk measures and the corresponding critical ratios set by the banking authorities and (b) determining the relative importance of a bank’s risk ratios in affecting its financial standing using an Adaptive Neuro-Fuzzy Inference System. In this study, ten financial ratios representing five risk areas were considered, namely: Capital Adequacy, Credit, Liquidity, Earning Quality, and Operational risk. Data from 45 Gulf banks for the period 2016–2020 was used to develop the model. Our findings indicate that a bank is in a sound risk position at the 99%, 95%, and 90% confidence level if its Mahalanobis distance exceeds 4.82, 4.28, and 4.0, respectively. The maximum distance computed for the banks in this study was 9.31; only five out of the forty-five banks were below the 4.82 and one below the 4.28 and 4.0 thresholds at 3.96. Sensitivity analysis of the risks indicated that the Net Interest Margin is the most significant factor in explaining variations in a bank’s risk position, followed by Capital Adequacy Ratio, Common Equity Tier1, and Tier1 Equity in order. The remaining financial ratios: Non-Performing Loans, Equity Leverage, Cost Income Ratio, Loans to Total Assets, and Loans to Deposits have the least influence in the order given; the Provisional Loans Ratio appears to have no influence.
Predicting students’ academic performance and the factors that significantly influence it can improve students’ completion and graduation rates, as well as reduce attrition rates. In this study, we examine the factors influencing student academic achievement. A fuzzy-neural approach is adopted to build a model that predicts and explains variations in course grades among students, based on course category, student course attendance rate, gender, high-school grade, school type, grade point average (GPA), and course delivery mode as input predictors. The neuro-fuzzy system was used because of its ability to implicitly capture the functional form between the dependent variable and input predictors. Our results indicate that the most significant predictors of course grades are student GPA, followed by course category. Using sensitivity analysis, student attendance was determined to be the most significant factor explaining the variations in course grades, followed by GPA, with course delivery mode ranked third. Our findings also indicate that a hybrid course delivery mode has positively impacted course grades as opposed to online or face-to-face course delivery alone. Keywords: educational data mining; neuro-fuzzy systems; student performance; prediction; neural networks; fuzzy systems