Speech to text

khawar usman

Outline

Results and Discussions

Future Recommendations

2016

Abstract

Speech is a primary human need and the most convenient means of communication between people. Communication between humans and computers takes place through a human-computer interface. This project gives an overview of the major technological perspectives on speech-to-text conversion, traces its fundamental progress, and presents a complete speech-to-text conversion setup based on the Raspberry Pi. The project also covers a language translator, which is very important in daily life. A comparative study of the different techniques is carried out stage by stage. The paper discusses the techniques used in each step of the speech recognition process, attempts to analyze an approach for designing an efficient speech recognition system, and concludes with a decision on the future direction for developing human-computer interface techniques in different mother tongues. With modern processes, algorithms, and methods, speech signals can be processed easily and the text recognized. In this system, we develop an on-line speech-to-text engine. Transferring speech into written language in real time requires special techniques, however, because it must be very fast and almost 100% correct to be understandable. The objective of this paper is to summarize and compare different speech recognition systems and approaches to speech-to-text conversion based on Raspberry Pi technology, and to identify research topics and applications at the forefront of this exciting and challenging field.

Related papers

Speech Recognition System

IRJET Journal

This paper demonstrates converting audio signals into text in order to perform tasks. Speech recognition is one of the fastest-growing technologies today. In this paper, we aim to develop a speech recognition system as an assistive tool for differently abled people. The paper demonstrates the conversion of speech into English text, which is carried out by the speech recognizer. The system can be used in many settings. Around 20% of people suffer from some form of disability. Some people are blind, some cannot use their hands effectively, and some are illiterate; for them, this system could be very helpful. The system will also be helpful for enterprises where most of the work is typing. It recognizes audio signals, converts them into text, and can perform operations such as opening the calculator or opening Google Chrome; it also lets a user perform file operations such as "save", "open", and "exit" by voice input. At this initial stage, the effort is to support the basic operations discussed above; the software can be updated and enhanced further to perform more operations. This paper presents a method for designing a speech-to-text system that then performs a task accordingly, using the .NET Framework and Visual Studio.

To Develop and Implement Text-To-Speech Wireless Communication System Using Raspberry-Pi

International Journal IJRITCC

This project is based on text-to-speech conversion using the concept of IoT (Internet of Things), in which a message or text can be transmitted efficiently. We aim to achieve efficient, distortion-free communication using the internet as the medium, so that there is no restriction on distance. The system converts text data into speech from anywhere. We use the Raspberry Pi module to decode the data and convert it into a speech signal. The Raspberry Pi 2B module is a recent embedded module with a 64-bit ARM processor, which makes the operation faster. This text-to-speech conversion system makes information and data transmission easier. The transmitter can be any electronic device, such as a computer or laptop; an Android phone can also be used. The message or text is transmitted by e-mail to the Raspberry Pi module at the receiver side. This project is applicable to various organizations as well as highly restricted areas. We can implement this project using IoT with a GSM module.

Smart Reader – Text To Speech Converter Using Raspberry Pi

Aparna Dhavlikar

2018

Smart Reader allows the user to hear text that is given as input. It involves extracting text from an image and converting that text to speech. This is done with a Raspberry Pi and a camera module using the OCR (optical character recognition) technique. The system consists of a webcam interfaced with the Raspberry Pi. The Raspberry Pi has an audio port through which the output can be heard on headphones or a speaker. The target conversion time is a few milliseconds. This device can help visually impaired people hear the text in images read aloud.

Review on Speech Recognition System for Indian Languages

jinal tailor

International Journal of Computer Applications, 2015

A speech recognition system offers a natural way for humans to interact with machines. Automatic Speech Recognition is an advanced way to operate a computer through speech alone, without much effort. This paper surveys the use of Indo-Aryan languages for communicating directly with machines, covering various techniques and experimental results. Speech recognition systems have been implemented for English, French, Spanish, German, Japanese and Chinese. Only a little work has been done for Indo-Indian languages such as Gujarati, Marathi, Hindi and Tamil. Speech to Text is an emerging research area because of the complexity and varied frameworks of Indo-Aryan languages.

Implementing a Speech Recognition System Interface for Indian Languages

Mayank Dave

2013

Human-computer interaction through Natural Language Conversational Interfaces plays a very important role in improving the usage of computers for the common man. It is the need of the time to bring human-computer interaction as close to human-human interaction as possible. Two main challenges must be faced in implementing an interface that enables interaction in a way similar to human-human interaction: Speech to Text conversion, i.e. Speech Recognition, and Text To Speech (TTS) conversion. This paper presents the implementation of one of these, Speech Recognition for Indian Languages.

Design and Implementation of Text to Speech Synthesizer

Aryan Singh

International Journal of Futuristic Innovation in Engineering, Science and Technology (IJFIEST)

The current research introduces a novel, efficient, and less expensive way for users to hear, rather than read, the content of text images in real time. It combines Optical Character Recognition (OCR) ideas and a Text to Speech (TTS) synthesizer on a Raspberry Pi. This type of technology uses a visual connection to allow visually impaired people to communicate with computers successfully. Extracting text from colored images is a serious problem in computer vision. Converting text into speech is a process that scans English letters and numbers in images using optical character recognition (OCR) and converts them into spoken words.

Exploration of Speech enabled System for English

Dr. Kamlesh Sharma

This paper presents an exploration of speech-enabled operating systems, software, and applications. It begins with a description of how such systems work and the level of accuracy that can be expected. It explains the applications of speech recognition technology in different areas: education, medicine, mobile computing, railway reservation, dictation, and web browsing. It gives a brief comparison of the operating systems that support voice and of speech recognition software and tools, and a brief introduction to the potential of voice and speech recognition software. It explains the features of different speech-enabled operating systems and speech recognition software. Windows Speech Recognition has many innovative features for the Windows operating system and efficiently assists the user in controlling the computer, dictating, navigating, selecting words, sending emails, and correcting words or sentences. The paper also explains the benefits of and issues related to speech technology. In recent years, speech recognition technology has grown tremendously. A large number of companies are working in this area and developing software for people who are not able to control a system through a keyboard or mouse, such as physically impaired people and senior citizens. This paper gives a brief introduction to speech-enabled operating systems and speech recognition software.

A Technical Approach to Describe the Scenario about How a Speech Recognition System Convert Speech into Text

Bahar tutul

Automatic speech recognition was invented a few decades ago and improves day by day. It started with just 8 words, whereas today it has a huge database of almost 230 million words. With such a system we can interact with a device through voice commands and easily get the desired work done. Speech-to-text conversion is one part of this system; models and techniques such as HMM (Hidden Markov Model) and MFCC (Mel Frequency Cepstral Coefficient) are used, and the conversion follows a step-by-step working procedure. The main objective is to show the whole process of speech-to-text conversion performed by automatic speech recognition, to help people who want to understand the whole process and those who have an interest in this field.

A REVIEW ON METHODS FOR SPEECH-TO-TEXT AND TEXT-TO-SPEECH CONVERSION

IRJET Journal

The internet has evolved over time, revolutionized many fields, and impacted many lives. The internet is a boon to mankind. The main field it has revolutionized is communication, which it has made faster and easier. In this paper we aim to study the different methodologies for Speech-To-Text and Text-To-Speech conversion that will be used in a voice-based email system. This system is based on interactive voice response. The aim is to study and compare the various methods used for STT and TTS conversion and to identify the most efficient technique that can be adapted for both conversion processes. Based on the review, it is found that HMM, being a statistical model, is the most suitable for both STT and TTS conversion. Finally, a model using HMM and ANN methods for STT and HMM for TTS conversion is proposed.

A Review on: Speech Recognition System

vaishali bhimte

This paper presents a brief survey on Speech recognition and discusses major themes and advances. Automatic speech recognition uses the process and related technology for converting speech signals into a sequence of words or other linguistic units by means of an algorithm implemented as a computer program. After years of research and development the accuracy of automatic speech recognition remains one of the important research challenges. Speech understanding systems presently are capable of understanding speech input for vocabularies of thousands of words in operational environments. Speech Recognition offers greater freedom to employ the physically handicapped in several applications like manufacturing processes, medicine and telephone network. The objective of this review paper is to summarize and compare some of the well known methods used in various stages of speech recognition system.

Text to Speech Conversion

Dk Kamesh

Indian Journal of Science and Technology, 2016

The present paper introduces an innovative, efficient, cost-effective, real-time technique that enables the user to hear the contents of text images instead of reading through them. It combines the concepts of Optical Character Recognition (OCR) and a Text to Speech (TTS) synthesizer on a Raspberry Pi. This kind of system helps visually impaired people interact with computers effectively through a vocal interface. Text extraction from color images is a challenging task in computer vision. Text-to-Speech conversion is a method that scans and reads the English letters and numbers in an image using the OCR technique and converts them to voice. This paper describes the design, implementation and experimental results of the device. The device consists of two modules, an image processing module and a voice processing module. It was developed on a Raspberry Pi v2 with a 900 MHz processor.

Speech to Text Conversion using Android Platform

E Mahender

2012

For the past several decades, designers have processed speech for a wide variety of applications ranging from mobile communications to automatic reading machines. Speech recognition reduces the overhead caused by alternate communication methods. Speech has not been used much in the field of electronics and computers due to the complexity and variety of speech signals and sounds. However, with modern processes, algorithms, and methods we can process speech signals easily and recognize the text. In this project, we are going to develop an on-line speech-to-text engine. The system acquires speech at run time through a microphone and processes the sampled speech to recognize the uttered text. The recognized text can be stored in a file. We are developing this on android platform using eclipse workbench. Our speech-to-text system directly acquires and converts speech to text. It can supplement other larger systems, giving users a different choice for data entry. A speech-to-text system c...

Implementation of Efficient Speech Recognition System on Mobile Device for Hindi and English Language

Dipti Patil

International Journal of Advanced Computer Science and Applications

Speech recognition, or speech-to-text conversion, has rapidly gained a lot of interest from large organizations as a way to ease human-to-machine communication. Optimizing the speech recognition process is of utmost importance because real-time users want to perform actions based on the speech they provide as input, and these actions sometimes define the users' lifestyle; the speech-to-text conversion should therefore be carried out accurately. The plan here is to improve the accuracy of this process with the help of natural language processing and speech analysis. Some existing speech recognition software from Google, Amazon, and Microsoft tends to have an accuracy of more than 90% in real-time speech detection. This system combines the speech recognition approach used by that software with language processing to improve the overall accuracy of the process with the help of phonetic analysis. The proposed Phonetic Model supports multilingual speech recognition, and the observed accuracy of this system is 90% for Hindi and English speech-to-text recognition. The Hindi WordNet database provided by IIT Mumbai is used in this research work for Hindi speech-to-text conversion.

Voice Recognition System: Speech-To-Text

Vijay Prasad

2015

VOICE RECOGNITION SYSTEM: SPEECH-TO-TEXT is software that lets the user control computer functions and dictate text by voice. The system consists of two components: the first processes the acoustic signal captured by a microphone, and the second interprets the processed signal and maps it to words. A model for each letter will be built using a Hidden Markov Model (HMM). Feature extraction will be done using Mel Frequency Cepstral Coefficients (MFCC). Feature training on the dataset will be done using vector quantization, and feature testing on the dataset will be done using the Viterbi algorithm. Home automation will be based entirely on the voice recognition system.
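
The abstract above names the Viterbi algorithm as the decoding step over an HMM whose observations come from vector quantization. As a rough illustration only, the following Python sketch runs Viterbi decoding on a toy two-state model. The state names, the three-symbol codebook, and all probabilities are hypothetical values chosen for the example; they are not taken from the paper, and this is not the authors' implementation.

```python
import math

def viterbi(observations, states, log_start, log_trans, log_emit):
    """Return the most likely hidden-state path and its log-probability."""
    # best[t][s]: log-probability of the best path ending in state s at time t
    best = [{s: log_start[s] + log_emit[s][observations[0]] for s in states}]
    # back[t][s]: predecessor of s on that best path
    back = [{}]
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p] + log_trans[p][s]) for p in states),
                key=lambda pair: pair[1],
            )
            best[t][s] = score + log_emit[s][observations[t]]
            back[t][s] = prev
    # Trace back from the best final state to recover the full path
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path, best[-1][last]

def to_log(table):
    """Convert a dict of probabilities to log-probabilities."""
    return {key: math.log(value) for key, value in table.items()}

# Hypothetical toy model: two hidden states, observations are
# vector-quantization codebook indices 0, 1 and 2.
STATES = ["A", "B"]
LOG_START = to_log({"A": 0.6, "B": 0.4})
LOG_TRANS = {
    "A": to_log({"A": 0.7, "B": 0.3}),
    "B": to_log({"A": 0.4, "B": 0.6}),
}
LOG_EMIT = {
    "A": to_log({0: 0.5, 1: 0.4, 2: 0.1}),
    "B": to_log({0: 0.1, 1: 0.3, 2: 0.6}),
}

path, logp = viterbi([0, 1, 2], STATES, LOG_START, LOG_TRANS, LOG_EMIT)
print(path, round(math.exp(logp), 5))  # ['A', 'A', 'B'] 0.01512
```

Working in log-probabilities avoids numerical underflow on long observation sequences, which matters once the sequence is a stream of quantized MFCC frames rather than three symbols.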

Speech to Text Conversion in Real-time

Nuzhat Pooja, safaet hossain

2015

"Real time speech to text" can be defined as the accurate conversion of uttered words into text immediately after they are spoken. Speech-to-text conversion is a useful tool for integrating people with hearing impairments into oral communication settings, e.g. counseling interviews or conferences. However, the transfer of speech into written language in real time requires special techniques, as it must be very fast and correct to be understandable. Our aim is to develop software that improves the user's speech through correct pronunciation following English phonetics. This software allows users to learn, judge and recognize their potential in the English language. It also provides an add-on feature that nurtures the user's communication skills through an option for text-to-speech conversion. The paper introduces and discusses different techniques for speech-to-text conversion and describes the process alongside the options that are already in use. This paper presents a method to design a Text to Speech conversion module using Matlab. This method is simple to implement and uses much less memory.

Study of Speech Recognition Technology and its Significance in Human-Machine Interface

IJSTE - International Journal of Science Technology and Engineering

This paper gives brief information on Speech Recognition Technology, covering various speech parameters, different approaches to speech recognition, the basics of acoustic models, language models, complex algorithms and feature extraction techniques. The paper focuses on speech processing and its applicability in technologies such as the Human-Machine Interface (HMI) and various day-to-day applications.

Review of Speech Recognition System

IJIRCST I, Jyoti Madan

This paper takes a tour of the speech recognition system, including its evaluation and accuracy, and discusses the structure of an utterance and the use of the vocal tract to produce it. It covers dynamic warping with a neural approach to converting speech into text. The paper also explains the basic working of a speech recognition system and elaborates on its techniques.

A Study on Speech Recognition Technology

RAMALATHA MARIMUTHU

Computing

The paper presents a brief study of speech recognition technology, describing the various processing stages and results as well as some primary applications. Following this review, some of the vital strengths and speech processing steps are also discussed.

An Overview on Speech Recognition System and Comparative Study of its Approaches

Faizan Mehmood

Language is the most important means of communication, and speech is its main medium. In a human-to-machine interface, the speech signal is transformed into analog and digital waveforms that the machine can understand. Speech technologies are widely used and have unlimited uses. These technologies enable machines to respond correctly and reliably to human voices and to provide useful and valuable services. This paper gives an overview of the speech recognition process, its basic model, its applications and its approaches, and also presents a comparative study of the different approaches used in speech recognition systems. It also gives an overview of different speech recognition techniques, summarizing some of the well-known methods used in the various stages of a speech recognition system.

Isolated Word Recognition System for Speech to Text Conversion Using Ann

Sunanda Mendiratta

2016

Speech recognition refers to the capacity of a device or program to listen to and identify various sounds and to recognize known languages from spoken words; the Automatic Speech Recognition (ASR) system is helpful for the human-machine interface. A lot of research on ASR systems has been carried out recently, but the concerns with such systems are vast because of the improper techniques used for feature selection. In this paper, proper features are selected in order to develop a superior ASR system and convert the spoken word into the corresponding text. The proposed system comprises three phases: preprocessing, feature extraction and classification. Initially, the preprocessing phase detects the spoken word from the source and reduces the noise level. Then, in the feature extraction phase, a total of eight features are extracted, comprising five statistical features and three common ones. Then, the classifier is trained by using these features referred as ...
