Please use this identifier to cite or link to this item: https://olympias.lib.uoi.gr/jspui/handle/123456789/684
Full metadata record
DC FieldValueLanguage
dc.contributor.authorΧασάνης, Βασίλειοςel
dc.date.accessioned2015-10-15T08:24:34Z-
dc.date.available2015-10-15T08:24:34Z-
dc.identifier.urihttps://olympias.lib.uoi.gr/jspui/handle/123456789/684-
dc.identifier.urihttp://dx.doi.org/10.26268/heal.uoi.500-
dc.rightsDefault License-
dc.subjectΒίντεο, Κατάτμηση σε πλάναel
dc.subjectΑναπαράσταση πλάνουel
dc.subjectΒίντεο, Κατάτμηση σε σκηνέςel
dc.subjectΤαινίες, Κατάτμηση υψηλού επιπέδουel
dc.subjectΒίντεο, Δημιουργία περίληψης αμοντάριστουel
dc.subjectΒίντεο, Ανίχνευση γεγονότων σε ακολουθίες παρακολούθησης μέσωel
dc.titleΤεχνικές μηχανικής μάθησης για διαχείρηση γνώσης σε πολυμεσικά δεδομέναel
heal.typedoctoralThesis-
heal.type.enDoctoral thesisen
heal.type.elΔιδακτορική διατριβήel
heal.generalDescriptionΠεριέχει πίνακες και διαγράμματα
heal.classificationΠληροφορικήel
heal.identifier.secondaryhttp://thesis.ekt.gr/thesisBookReader/id/18816#page/1/mode/2up-
heal.languageen-
heal.accessfree-
heal.recordProviderΠανεπιστήμιο Ιωαννίνων Σχολή Θετικών Επιστημών Τμήμα Πληροφορικήςel
heal.publicationDate2009-
heal.bibliographicCitationΒβιβλιογραφία: σ. 131-141
heal.abstractIn this thesis we have proposed novel methods for video segmentation and representation that are based on machine learning techniques (classi cation, clustering). First, we considered support vector machines for video shot detection. Then, an improved spectral clustering algorithm was employed for video shot representation. The same algorithm in combination with a sequence alignment algorithm was employed for video scene segmentation. Movie segmentation into scenes and chapters was also implemented using temporally smoothed visual words histograms. Furthermore, the proposed techniques were also employed for video rushes summarization and event detection in video surveillance sequences. More speci cally, in order to perform video shot detection, we proposed in Chapter 2 a supervised learning methodology [11, 15]. In this way, we have avoided the use of thresholds and we were able to detect shot boundaries of videos with totally di erent 127 visual characteristics. Novel features have been de ned describing the variation between adjacent frames and the contextual information in a neighborhood of frames and became inputs to a SVM classi er which categorized transitions to normal, abrupt and gradual. In this way, all categories of video shot transitions were detected simultaneously. Numerical experiments that compare our algorithm with threshold dependent methods and another supervised learning methodology indicate that our algorithm provides superior results. In Chapter 3 a key-frame extraction algorithm [10, 14] has been presented that is based on the combination of spectral clustering approach and fast global k-means algorithm. We have also proposed a technique to estimate the number of the extracted key-frames. The extracted key-frames are unique, non-repetitive and summarize the video shot content, which is also indicated from the numerical experiments where appropriate quality measures were de ned and computed. In Chapter 4 we presented a novel video scene segmentation algorithm [9, 14] that employs the improved spectral clustering algorithm of Chapter 3 and a sequence alignment algorithm. Shots were rst clustered into groups based only on their visual similarity using the method presented in Chapter 3 and a label was assigned to each shot according to the group that it belonged to. Then, a sequence alignment algorithm was applied to detect when a change occurs to the pattern of shot labels, providing the nal scene segmentation result. Numerical experiments on TV-series and movies have shown that the proposed scene detection method accurately detects most of the scene boundaries, while preserving good tradeo between recall and precision. In Chapter 5 we presented a high-level movie segmentation algorithm [13]. In this approach, the movie shots were represented with local invariant descriptors instead of color histograms, resulting into a visual words histogram representation. Next the visual words histograms of shots were temporally smoothed (using a gaussian kernel) with respect to histograms of neighboring shots in order to preserve valuable contextual information. This semantic smoothing process at di erent time scales results in e#cient movie segmentation at di erent high-levels, such as scenes and chapters.en
heal.advisorName-
heal.committeeMemberNameΛύκας, Αριστείδηςel
heal.committeeMemberNameΓαλατσάνος, Νικόλαοςel
heal.committeeMemberNameΜπλέκας, Κωνσταντίνοςel
heal.committeeMemberNameΚόλλιας, Στέφανοςel
heal.committeeMemberNameΣταφυλοπάτης, Ανδρέαςel
heal.committeeMemberNameΛάγαρης, Ισαάκel
heal.committeeMemberNameΚόντης, Λυσίμαχοςel
heal.academicPublisherΠανεπιστήμιο Ιωαννίνων Σχολή Θετικών Επιστημών Τμήμα Πληροφορικήςel
heal.academicPublisherIDuoi-
heal.numberOfPages140 σ.-
heal.fullTextAvailabilityfalse-
Appears in Collections:Διδακτορικές Διατριβές

Files in This Item:
There are no files associated with this item.


This item is licensed under a Creative Commons License Creative Commons