การตรวจจับและการจำแนกใบหน้าสวมหน้ากากโดยใช้ข้อมูลเข้าหลายค่ากับการเรียนรู้เชิงลึก

ภีมกานต์ รุจิราพงศ์

Please use this identifier to cite or link to this item: http://kb.psu.ac.th/psukb/handle/2016/19228

Title:	การตรวจจับและการจำแนกใบหน้าสวมหน้ากากโดยใช้ข้อมูลเข้าหลายค่ากับการเรียนรู้เชิงลึก
Other Titles:	Face-mask Detection and Classification Using Multiple Inputs Deep Learning
Authors:	วิภาดา เวทย์ประสิทธิ์ ภีมกานต์ รุจิราพงศ์ Faculty of Science (Computer Science) คณะวิทยาศาสตร์ ภาควิชาวิทยาการคอมพิวเตอร์
Keywords:	การจำแนกประเภทใบหน้า;โครงข่ายประสาทเทียมแบบคอนโวลูชัน;การสกัดคุณลักษณะ;การเรียนรู้เชิงลึก
Issue Date:	2023
Publisher:	มหาวิทยาลัยสงขลานครินทร์
Abstract:	This research presents face-mask detection and classification using multiple inputs. The face-mask detection composes of 3 classes including the with_mask class, with_out_mask class, and wear_mask_incorrect class. The differences between these classes are the nose area and mouth area which help in classification. A deep learning multiple input model that can use face images, nose images, and mouth images as inputs was developed. This experiment is tried out with 2 datasets including Face Mask Label Dataset (FMLD) and Andrewmvd Face Mask Detection Kaggle (AFMDK). There are comparison models which are created by using single input and multiple inputs. The study finds that the results are confirmed that the purposed multiple input model has accuracy, precision, recall, and F1 score has higher values than a single input model in both datasets. This research also does an experiment on image enhancement by super-resolution for small image problems. The results increase the resolution at the nose and mouth area.The experiment shows that model trained by image from BSRGANs cannot solves the small image problem but can solve the medium and large images.
Abstract(Thai):	งานวิจัยนี้นำเสนอการตรวจจับและการจำแนกใบหน้าสวมหน้ากากโดยใช้ข้อมูลเข้า หลายค่า เนื่องจากการตรวจจับใบหน้าสวมหน้ากากแบบ 3 ประเภท ได้แก่ประเภทสวมหน้ากาก ถูกต้อง ประเภทไม่สวมหน้ากาก และประเภทสวมหน้ากากไม่ถูกต้อง เมื่อสังเกตข้อแตกต่างของทั้ง 3 ประเภท จะเห็นว่าบริเวณจมูกและปากจะเป็นส่วนหลักที่ช่วยในการจำแนกประเภท จึงพัฒนาโมเดล การเรียนรู้เชิงลึกที่รับข้อมูลเข้าหลายค่า ได้แก่ ภาพใบหน้า ภาพบริเวณจมูกและภาพบริเวณปาก การ ทดลองมีการใช้ชุดข้อมูล 2 ชุด ได้แก่ ชุดข้อมูล Face Mask Label Dataset (FMLD) และ Andrewmvd Face Mask Detection Kaggle (AFMDK) มีการเปรียบเทียบการสร้างโมเดลซึ่งใช้ ข้อมูลเข้าแบบเดี่ยว พบว่าผลลัพธ์ที่ได้โมเดลข้อมูลเข้าหลายค่าที่นำเสนอมีค่าความถูกต้อง ค่าความ แม่นยำ ค่าความครบถ้วน และค่า F1 มีประสิทธิภาพสูงกว่าโมเดลข้อมูลเข้าเดี่ยวทั้งสองชุดข้อมูล นอกจากนี้งานวิจัยนี้ได้ทำการทดลองการปรับปรุงภาพ โดยใช้ Super-Resolution (SR) เพื่อ แก้ปัญหาภาพขนาดเล็ก ทำให้ภาพบริเวณจมูกและภาพส่วนปากมีความคมชัดมากขึ้น จากการ ทดลองพบว่าโมเดลใช้ภาพจาก BSRGANs ไม่สามารถช่วยแก้ปัญหาภาพขนาดเล็กได้แต่สามารถช่วย แก้ปัญหาภาพขนาดปานกลางและขนาดใหญ่ได้
Description:	วิทยาการศาสตรมหาบัณฑิต (วิทยาการคอมพิวเตอร์), 2566
URI:	http://kb.psu.ac.th/psukb/handle/2016/19228
Appears in Collections:	344 Thesis

Files in This Item:

File	Description	Size	Format
6310220004.pdf		4.16 MB	Adobe PDF	View/Open

Show full item record

This item is licensed under a Creative Commons License