การวิเคราะห์ท่าทางของมนุษย์โดยใช้ข้อมูลสีและความลึกจากหลายมุมมอง

พงศกร เจริญเนตรกุล

Please use this identifier to cite or link to this item: http://kb.psu.ac.th/psukb/handle/2016/12265

Title:	การวิเคราะห์ท่าทางของมนุษย์โดยใช้ข้อมูลสีและความลึกจากหลายมุมมอง
Other Titles:	Human action analysis from multi-view using RGB-D information
Authors:	นิคม สุวรรณวร พงศกร เจริญเนตรกุล Faculty of Engineering Computer Engineering คณะวิศวกรรมศาสตร์ ภาควิชาวิศวกรรมคอมพิวเตอร์
Keywords:	การประมวลผลภาพ;การวิเคราะห์ข้อมูลภาพ;คอมพิวเตอร์วิทัศน์
Issue Date:	2561
Abstract:	This research applied image processing and computer vision techniques to contribute three essential perspectives; modeling profile-based human action recognition, people tracking and re-identification, interesting event detection. The interesting event detection consists of falling, hand-waving for asking help, and jumping. This approach is based on overlapped area of interest using multi-view RGB-D. All functions are purposed for health care promotions and surveillance system applications. The action recognition consists of high level and feature level fusion. In high level, the results of action in single- view system are fused for making decision using empirical analysis to weight the most realizable result to be a result of multi-view system. The maximum improvement for some action is up to 97.70% and overall result increases to 16.66% (percentage point) when compared with single-view action recognition. Another in feature level fusion, we proposed a method to build Layer Feature Model that allows to fuse features of depth from multi- view. The experimental results of fusion model are 86.40% in NW-UCLA dataset, 93.00% in i3DPost dataset, and 99.31% in PSU dataset. In addition, we introduce people tracking and people re-identification by using analysis of position and color descriptor. The position and color descriptor are clearly attributes for both tracking in a single-view and matching those views. Moreover, the color descriptor is also used for supporting cursory people re-identification. The precisions of people re-identification are 92.87% in single person entering and 85.50% when 2-person simultaneously entering. In the interesting event detection, falling detection resulted in the average precision of 90.65% that derived from precision of lying. The average of hand-waving detection precision is 92.96%. The jumping detection has sensitivity rate 92.96% and specificity rate 99.31%.
Abstract(Thai):	งานวิจัยฉบับนี้ได้ประยุกต์ใช้เทคนิคด้านการประมวลผลภาพและคอมพิวเตอร์วิ ทัศน์ เพื่อใช้ในการสร้างแบบจําลองโครงสร้างมนุษย์เพื่อใช้การรู้จําท่าทางพื้นฐานของมนุษย์ รวม ไปถึงการติดตามและจดจําตัวบุคคล การตรวจจับเหตุการณ์ที่น่าสนใจ ในกรณีศึกษาของการล้ม การโบกมือเพื่อขอความช่วยเหลือ และการกระโดด จากภาพสีและความลึกหลายมุมมอง ณ บริเวณเดียวกัน เพื่อนําไปใช้ในการในการเฝ้าระวังด้านงานส่งเสริมสุขภาพและระบบรักษาความ ปลอดภัย โดยในงานส่วนของการรู้จําท่าทางจะมีการใช้การฟิวชันข้อมูลระดับสูง และระดับฟีเจอร์ ซึ่งในการฟิวชันข้อมูลระดับสูง จะนําท่าทางจากระบบการรู้จําท่าทางจากมุมมองเดี่ยวหลาย ๆ มุมมองมาเพื่อตัดสินใจ ชั่งน้ําหนัก วัดค่าความน่าเชื่อถือของคําตอบจากการทํานายคําตอบใน มุมมองเดี่ยวต่างๆ และหาท่าทางที่มีความน่าเชื่อถือที่สุดมาเป็นคําตอบของระบบโดยรวม โดยได้ เพิ่มความถูกต้องของการรู้จําจากมุมมองเดี่ยวมากที่สุดที่ 98.17% และได้เพิ่มความถูกต้องจาก การเฉลี่ยทุกมุมมองและท่าทางที่ 16.66% (แต้มเปอร์เซ็นต์) จากการเปรียบเทียบกับมุมมอง เดี่ยว ส่วนการฟิวชันข้อมูลระดับฟีเจอร์นั้น ผู้วิจัยได้นําเสนอวิธีการที่จะสร้างแบบจําลองฟีเจอร์ แบบเลเยอร์ (Layer Feature Model) ซึ่งทําให้สามารถฟิวชันฟีเจอร์ของข้อมูลความลึกจากหลาย มุมมอง โดยผลการทดลอง ได้ทดสอบในชุดข้อมูล Northwestern UCLA มีความแม่นยําเฉลี่ย 86.40% ชุดข้อมูล i3DPost ที่มีความแม่นยําเฉลี่ยที่ 93.00% และความแม่นยําเฉลี่ย 99.31% ในชุดข้อมูล PSU ในส่วนของการติดตามและจดจําตัวบุคคลนั้น จะใช้การวิเคราะห์ข้อมูลตําแหน่ง และสี ซึ่งเป็นข้อมูลเบื้องต้นที่เด่นชัดสามารถนํามาใช้ในการแยกแยะแต่ละบุคคลเพื่อติดตามทั้ง ในกล้องเดียวกันและระหว่างกล้อง รวมไปถึงใช้ในการจดจําตัวบุคคลในเบื้องต้น ซึ่งการติดตาม และจดจําตัวบุคคลมีความแม่นยําในกรณีที่เข้าไปในระบบครั้งละหนึ่งคนที่ 92.87% และกรณีที่ เข้าไปในระบบครั้งละสองคนที่ 85.50% ส่วนในการตรวจจับเหตุการณ์ที่น่าสนใจในกรณีศึกษา ของการล้ม มีความแม่นยําที่ 90.65% ตามการรู้จําท่าทางในท่านอน การโบกมือเพื่อขอความ ช่วยเหลือมีอัตราความแม่นยําการตรวจจับได้โดยเฉลี่ยทั้งหมด 92.96% และการกระโดดมีค่า ความไวในการตรวจจับโดยเฉลี่ยเท่ากับ 94.44% และค่าความจําเพาะโดยเฉลี่ยเท่ากับ 99.31%
Description:	วิทยานิพนธ์ (ปร.ด. (วิศวกรรมคอมพิวเตอร์))--มหาวิทยาลัยสงขลานครินทร์, 2561
URI:	http://kb.psu.ac.th/psukb/handle/2016/12265
Appears in Collections:	241 Thesis

Files in This Item:

File	Description	Size	Format
433162.pdf		25.27 MB	Adobe PDF	View/Open

Show full item record

This item is licensed under a Creative Commons License