
Essay: Fusion of eye gaze point and speech recognition

  • Published: 17 October 2015*


Fusion is the process of combining two or more things into a single entity.
The main considerations in a multimodal fusion process are as follows:
A. Levels of Fusion.
One of the earliest considerations is which strategy to follow when fusing multiple modalities. The most widely used strategy is to fuse the information at the feature level, which is also known as early fusion. The other approach is decision-level fusion, or late fusion [11], which fuses multiple modalities in the semantic space. A combination of the two is also practiced and is known as the hybrid fusion approach [11].
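The difference between the two levels can be illustrated with a minimal Python sketch. The function names, the feature vectors, and the use of a plain sum to combine scores are assumptions made for illustration only; the essay does not specify how either level is implemented.

```python
from typing import Dict, List


def early_fusion(gaze_features: List[float], speech_features: List[float]) -> List[float]:
    """Feature-level (early) fusion: join the per-modality feature vectors
    into one vector that a single classifier would then consume."""
    return list(gaze_features) + list(speech_features)


def late_fusion(gaze_scores: Dict[str, float], speech_scores: Dict[str, float]) -> str:
    """Decision-level (late) fusion: each modality has already scored the
    candidate labels; add the scores and return the best label.
    The plain sum is an illustrative choice, not a prescribed rule."""
    labels = set(gaze_scores) | set(speech_scores)
    combined = {
        label: gaze_scores.get(label, 0.0) + speech_scores.get(label, 0.0)
        for label in labels
    }
    # Iterate in sorted order so that ties are broken deterministically.
    return max(sorted(combined), key=combined.get)
```

In the early case the modalities are merged before any decision is made; in the late case each modality decides first and only the decisions are merged. A hybrid approach would do both.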
B. How to Fuse?
Several methods are used to fuse different modalities, and each is suited to different settings. A related question is how the fusion process exploits the feature-level and decision-level correlation among the modalities, and how contextual and confidence information influence the overall fusion process [11].
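One simple way confidence information can influence fusion is a confidence-weighted combination of per-modality scores. The sketch below is only one possible scheme under stated assumptions: the function name `weighted_fusion`, the modality names, and the linear weighting are hypothetical and are not taken from [11].

```python
from typing import Dict, Tuple


def weighted_fusion(
    scores_by_modality: Dict[str, Dict[str, float]],
    confidence: Dict[str, float],
) -> Tuple[str, float]:
    """Combine per-label scores from several modalities, weighting each
    modality by its (non-negative) confidence. Weights are normalised so
    that they sum to 1."""
    total = sum(confidence.get(m, 0.0) for m in scores_by_modality)
    if total <= 0:
        raise ValueError("at least one modality needs a positive confidence")

    fused: Dict[str, float] = {}
    for modality, scores in scores_by_modality.items():
        weight = confidence.get(modality, 0.0) / total
        for label, score in scores.items():
            fused[label] = fused.get(label, 0.0) + weight * score

    # Iterate in sorted order so that ties are broken deterministically.
    best = max(sorted(fused), key=fused.get)
    return best, fused[best]
```

With this scheme a noisy modality (for example, speech in a loud room) can be given a low confidence so that the other modality dominates the fused decision.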
C. When to Fuse?
The time at which fusion takes place is an important consideration in the multimodal fusion process. Certain characteristics of media, such as varying data capture rates and processing times, pose challenges for synchronizing the overall fusion process. Often this has been addressed by performing the multimedia analysis tasks (such as event detection) over a timeline [11]. A timeline refers to a measurable span of time with information denoted at designated points. Accomplishing a task over a timeline requires identifying the designated points at which fusion of data or information should take place. Because the streams are asynchronous and diverse, and because different analysis tasks are performed at different granularity levels in time, identifying these designated points, i.e. when the fusion should take place, is a challenging issue [11].
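For gaze and speech, one plausible designated point is the moment a spoken word is recognized. The following Python sketch pairs such a moment with the gaze sample closest to it in time. The sample layout `(timestamp, x, y)`, the function name `nearest_gaze`, and the 0.2-second tolerance are assumptions for illustration, not values from the essay.

```python
from bisect import bisect_left
from typing import List, Optional, Tuple

GazeSample = Tuple[float, int, int]  # (timestamp in seconds, x, y) - assumed layout


def nearest_gaze(
    gaze: List[GazeSample], t: float, max_gap: float = 0.2
) -> Optional[GazeSample]:
    """Return the gaze sample closest in time to t, or None if the closest
    sample is more than max_gap seconds away.
    The gaze list must be sorted by timestamp."""
    if not gaze:
        return None
    times = [sample[0] for sample in gaze]
    i = bisect_left(times, t)
    # Only the neighbours on either side of the insertion point can be closest.
    candidates = gaze[max(i - 1, 0): i + 1]
    best = min(candidates, key=lambda sample: abs(sample[0] - t))
    return best if abs(best[0] - t) <= max_gap else None
```

Rejecting samples outside the tolerance window avoids fusing a spoken command with a stale gaze point when the camera stream has stalled or lagged.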
D. What to Fuse?
The different modalities used in a fusion process may provide complementary or contradictory information, so it is necessary to understand which modalities actually contribute to accomplishing an analysis task. This is also related to finding the optimal number of media streams [11] or feature sets required to accomplish an analysis task under the specified constraints. If the most suitable subset is unavailable, can one use alternate streams without much loss of cost-effectiveness and confidence?
SYSTEM ARCHITECTURE
The system architecture consists of different modules, as shown in Fig 13. Each module performs a specific task. The Eye Tracking module uses various image processing techniques and a tracking algorithm to estimate the direction of the pupil. The Speech Recognition module recognizes the spoken words and compares them against the specified grammar.
Fig 13: Fusion of Eye Tracking and Speech Recognition.
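The way the two modules could meet at the fusion step can be sketched in Python as follows. This is a minimal illustration, not the system's actual implementation: the `GRAMMAR` word list, the `FusedCommand` type, and the `fuse` function are hypothetical names, and the gaze point is assumed to arrive as screen coordinates.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Hypothetical command grammar; the essay does not list the actual words.
GRAMMAR = {"click", "open", "close", "scroll"}


@dataclass
class FusedCommand:
    action: str  # spoken word accepted by the grammar
    x: int       # gaze point, assumed to be in screen coordinates
    y: int


def fuse(gaze_point: Optional[Tuple[int, int]], spoken: str) -> Optional[FusedCommand]:
    """Pair a recognized word with the current gaze point.
    Returns None when the word is outside the grammar or no gaze
    estimate is available."""
    word = spoken.strip().lower()
    if gaze_point is None or word not in GRAMMAR:
        return None
    x, y = gaze_point
    return FusedCommand(action=word, x=x, y=y)
```

Here the gaze supplies where to act and the speech supplies what to do, so a command is issued only when both modalities yield a usable result.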
EXPERIMENTAL RESULT
Fig 14: Main Form
Fig 15: Captured Image
CONCLUSION
In this paper, a real-time eye-gaze detection system is presented. We proposed an eye-gaze detection algorithm that uses images captured by a single web camera. Users with disabilities can use this system to operate a computer. We verified the accuracy of the system experimentally. Although the proposed algorithm may look rather simple, it is able to detect the eye gaze with a high success rate.
FUTURE WORK
Our future work will concentrate mainly on improving the accuracy of the proposed eye-gaze detection algorithm, making the system faster and more robust, and handling other eye movements such as blinking. The proposed system will be verified through further experiments with different users.


About this essay:

If you use part of this page in your own work, you need to provide a citation, as follows:

Essay Sauce, Fusion of eye gaze point and speech recognition. Available from:<https://www.essaysauce.com/computer-science-essays/essay-fusion-of-eye-gaze-point-and-speech-recognition/> [Accessed 27-03-24].


* This essay may have been previously published on Essay.uk.com at an earlier date.