Date of Award
12-2025
Degree Type
Dissertation
Degree Name
Doctor of Philosophy (PhD)
School
Computing Sciences and Computer Engineering
Committee Chair
Dr. Ahmed Sherif
Committee Chair School
Computing Sciences and Computer Engineering
Committee Member 2
Dr. Sarah Lee
Committee Member 2 School
Computing Sciences and Computer Engineering
Committee Member 3
Dr. Dia Ali
Committee Member 3 School
Computing Sciences and Computer Engineering
Committee Member 4
Dr. Chaoyang Zhang
Committee Member 4 School
Computing Sciences and Computer Engineering
Committee Member 5
Dr. Amer Dawoud
Committee Member 5 School
Computing Sciences and Computer Engineering
Abstract
The research work finds a solution to precision agriculture of cotton cultivation using artificial intelligence (AI) models. Two sets of model performance based on the application are selected namely a low resource and a high resource setting. This is because using drone surveys to capture images identifying the classes of stressed and unstressed cotton plantation requires limited model architecture and CPU based computation. Thus, traditional AI models were selected for low resource settings. Again, for high computation intensive models like transfer learning-convolution neural network (CNN) based architectures were grouped into high resource settings. There was another issue of class imbalance in cotton available dataset causing overfitting of model performance. We adopted a target data augmentation strategy to add synthetic data to original dataset, such that each class for classification contributes equally to overcoming the class imbalance. Traditional ML models like Naïve Bayes, K-Nearest Neighbor (KNN), Support Vector Machine (SVM) and Random Forest (RF) were trained with augmentation strategy on cotton disease dataset from Kaggle. Various DL were optimized for better performance like multilayered CNN, VGG16, ResNet50, InceptionV3, Efficient Net, and DenseNet121. Those embodied frameworks were also trained with same dataset with target augmentation to overcome the class imbalance. Out of the models RF and DenseNet121 performed best based on all evaluation metrics and were used for distributed federated learning paradigms with different client data distribution. RF showed the maximum gain in accuracy of 83.69% with data augmentation and enhancement of all other metrics of Precision, Recall, F1 Score and Area Under the Curve (AUC) with targeted augmentation of dataset. Similarly, DenseNet121 showed accuracy of 99.58% under geometric data augmentation. The learning capacity of DenseNet121 enhanced in all other metrics with data augmentation. We found the FL frameworks were scalable and robust and multiclient updates the server model with better performance ensuring data privacy on farmers’ private agricultural field information. The FL based RF shows some susceptibility to client distribution and heterogeneity of client updates to global model and convergence of performance, while the FL based DenseNet121 showed more resilient to such variance of client distribution and global model updates.
ORCID ID
0000-0002-0346-1115
Copyright
Partha P Sengupta, 2025
Recommended Citation
Sengupta, Partha P., "Privacy Preserving-Based Artificial Intelligence for Precision Agriculture" (2025). Dissertations. 2416.
https://aquila.usm.edu/dissertations/2416
Included in
Artificial Intelligence and Robotics Commons, Computer Engineering Commons, Data Science Commons, Theory and Algorithms Commons