Emerging Researchers National (ERN) Conference

nsf-logo[1]

  • About
    • About AAAS
    • About the NSF
    • About the Conference
    • Partners/Supporters
    • Project Team
  • Conference
  • Abstracts
    • Abstract Submission Process
    • Presentation Schedules
    • Abstract Submission Guidelines
    • Presentation Guidelines
    • Undergraduate Abstract Locator (2020)
    • Graduate Abstract Locator (2020)
    • Faculty Abstract Locator (2020)
  • Travel Awards
  • Resources
    • App
    • Award Winners
    • Code of Conduct-AAAS Meetings
    • Code of Conduct-ERN Conference
    • Conference Agenda
    • Conference Materials
    • Conference Program Books
    • ERN Photo Galleries
    • Events | Opportunities
    • Exhibitor Info
    • HBCU-UP/CREST PI/PD Meeting
    • In the News
    • NSF Harassment Policy
    • Plenary Session Videos
    • Professional Development
    • Science Careers Handbook
    • Additional Resources
    • Archives
  • Engage
    • Webinars
    • Video Contest
    • Video Contest Winners
    • ERN 10-Year Anniversary Videos
    • Plenary Session Videos
  • Contact Us

Facial Recognition Research Utilizing the ORL Database of Faces

Undergraduate #86
Discipline: Mathematics and Statistics
Subcategory: Mathematics and Statistics
Session: 1
Room: Tyler

Santiara Churchwell - North Carolina A&T State University
Co-Author(s): Ian Livengood, Zhilin 'Lin' Qian, and Xirui 'Siri' Zhou



In this paper, we will recognize human faces from the ORL Database of Faces by utilizing Principal Component Analysis (PCA), Support Vector Analysis (SVM), Naïve Bayes Classifier, and Random Forest. We will introduce the prominent and relatively new concept of facial recognition, comparing it to traditional methods and presenting its significance in the field of biometrics. We will establish the importance of determining the most accurate classifiers for the ORL dataset. We will also define the data source while providing a description of the database we utilized. Next, we will explain our methodology concerning processing the data, including steps such as loading the image data, adding labels, extracting features with PCA, generating eigenfaces, reconstructing images, classifying the data with various methods, and performing cross-validation. We successfully accounted for 98.58% of the variance through utilizing 250 principal components in PCA. We accomplished a mean accuracy of 96.75 with SVM and 96.25% model accuracy with Random Forest. We interpreted and compared these results in terms of accuracy and time efficiency. This paper also provides considerations of future work. If granted more time, we would perform additional classification techniques, such as K-Nearest Neighbors and Neural Networks, and we would implement cross-validation methods with Naive Bayes Classifier to ensure that our program is efficient for varying training sets defined within our dataset.

Funder Acknowledgement(s): We are appreciative of the research experience we have been granted with the support of several generous sources. As researchers working under the grant NSF HRD-1719498, we have been given the funds necessary to explore the realm of facial recognition. We would like to extend a special thanks to the following sponsors: the NSF ACE DSA Project, Mathematics Department at North Carolina A&T State University, Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information (KLIPSHDI) of Ministry of Education, School of Computer Science and Engineering at Nanjing University of Science and Technology, Dr. Guoqing Tang of NCAT, Dr. Zhong Jin of NJUST, and Mr. Jonathan Fabish of NCAT and NCSU.

Faculty Advisor: Dr. Guoqing Tang, tang@ncat.edu

Role: I performed the Random Forest Classification test on the ORL Database. I incorporated the concept of Random Forest into python code through opencv. I imported the RandomForestClassifier and make_classification packages. I also defined the training data and class labels for the training data as X and Y respectively. Then, I tuned the parameters, n_estimators, max_depth, oob_score, n_jobs, random_stae, max_features and min_samples_leaf? to acquire our desired accuracy. I then created the variable test_labels, an array of the class labels of the test set, set y_true equal to test_labels, and defined y_pred_rf to be an array of the Random Forest model?s prediction of the class labels of the test set. I calculated the model accuracy after tuning the parameters properly. I accomplished a model accuracy of 96.25% with a standard deviation of 1.275%.

ERN Conference

The 2022 ERN Conference has been postponed.

Full Notice

What’s New

  • Congratulations to Zakiya Wilson-Kennedy on her 2021 AAAS Fellowship
  • Event Vaccination and Liability Policy
  • Webinars
  • Events|Opportunities
  • AAAS CEO Comments on Social Unrest, Racism, and Inequality
  • Maintaining Accessibility in Online Teaching During COVID-19
  • In the News
  • HBCU/CREST PI/PD Meeting

Conference Photos

ERN Conference Photo Galleries

Awards

ERN Conference Award Winners

Checking In

nsf-logo[1]

This material is based upon work supported by the National Science Foundation (NSF) under Grant No. DUE-1930047. Any opinions, findings, interpretations, conclusions or recommendations expressed in this material are those of its authors and do not represent the views of the AAAS Board of Directors, the Council of AAAS, AAAS’ membership or the National Science Foundation.

AAAS

1200 New York Ave, NW Washington,DC 20005
202-326-6400
Contact Us
About Us

The World's Largest General Scientific Society

Useful Links

  • Membership
  • Careers at AAAS
  • Privacy Policy
  • Terms of Use

Focus Areas

  • Science Education
  • Science Diplomacy
  • Public Engagement
  • Careers in STEM

 

  • Shaping Science Policy
  • Advocacy for Evidence
  • R&D Budget Analysis
  • Human Rights, Ethics & Law
© 2022 American Association for the Advancement of Science