Speech Compression

37 papers

58 followers

About this topic

Speech compression is a process that reduces the data rate of audio signals representing human speech, aiming to minimize bandwidth usage while preserving intelligibility and quality. It employs various algorithms and techniques to eliminate redundancy and irrelevant information in speech signals, facilitating efficient storage and transmission.


Key research themes

1. How can subspace and spectral subtraction methods improve low-bit-rate speech compression in noisy environments?

This research theme explores advanced preprocessing techniques aimed at enhancing the signal-to-noise ratio (SNR) of speech signals before compression under low-bit-rate conditions, particularly for applications like cellular communication. The focus lies on comparing and combining signal-subspace-based speech enhancement with spectral subtraction algorithms to mitigate additive noise effects and improve quality in bandwidth-constrained speech coding frameworks.

Quality improvement of low-bit-rate noisy speech using the subspace method

by Mohamed El-Mahallawy

2022, Proceedings of the Nineteenth National Radio Science Conference

Key finding: This paper demonstrates that a signal-subspace-based speech enhancement algorithm outperforms conventional spectral-subtraction-based noise reduction methods in improving the perceptual quality of speech coded by the CELP... Read more


A New Sinusoidal Speech Coding Technique with Speech Enhancer at Low Bit Rates

by eyad alqam

2022

Key finding: The study proposes a sinusoidal speech coder with a noise-resilient design that classifies frames into voiced/unvoiced segments to optimize parameter selection. The codec extracts spectral peaks in the frequency domain using... Read more


Compressed domain packet loss concealment of sinusoidally coded speech

by Christoffer Rodbro

2024, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

Key finding: This work introduces a packet loss concealment method working directly on quantized sinusoidal speech parameters at 8 kbit/s, employing time-scaling of adjacent packets to compensate lost data in VoIP scenarios. The coded... Read more


2. What are the benefits and challenges of wavelet transform-based speech compression methods?

Wavelet transform methods, particularly Discrete Wavelet Transform (DWT), have been widely studied for speech compression due to their ability to efficiently represent non-stationary signals by capturing both temporal and spectral properties. This research theme examines how DWT-based methods exploit multi-resolution analysis to achieve high compression ratios while preserving signal quality and how these approaches compare to traditional coding standards and other transforms, including practical implementation aspects and trade-offs.

Speech and Image Compression Using Discrete Wavelet Transform

by Mukhtiar unar

2025, IEEE/Sarnoff Symposium on Advances in Wired and Wireless Communication, 2005.

Key finding: This paper highlights the effectiveness of DWT in achieving promising compression ratios for speech (2.31) and images (2.67) while maintaining high signal energy retention (~99.99%) and yielding high SNR and PSNR values. It... Read more


Audio Steganography Coding Using the Discrete Wavelet Transforms

by Habib Hamam

2022

Key finding: The research integrates discrete wavelet transform to compress stego-speech signals while preserving perceptual integrity. Optimization of wavelet selection, decomposition depth, and coefficient thresholding enables a balance... Read more


Speech Compression using DWT in FPGA

by P.C. Bhaskar

2024, ijser.org

Key finding: The paper implements single-level DWT-based speech compression on FPGA using VHDL, separating high and low frequency components and retaining approximation coefficients to reduce bit rate. It addresses practical hardware... Read more


Speech Coding Techniques for VoIP Applications: A Technical Review

by Bhagwat P Patil

2024, World Applied Sciences Journal

Key finding: This review compares multiple speech coders including traditional ITU standard codecs and wavelet-based coders, analyzing compression ratio, SNR, and mean opinion scores for English and Hindi speech. It highlights that... Read more


3. Can integration of speech recognition features into low-bit-rate compression enhance recognition accuracy and system efficiency?

This theme investigates methods to reconcile low-bit-rate speech compression with speech recognition performance by incorporating recognition-relevant features (e.g., MFCC) into the compression pipeline. The aim is to minimize recognition degradation commonly caused by traditional waveform compression, enabling direct recognition from compressed representations and reducing retraining needs, facilitating distributed speech recognition, and improving playback on devices with storage constraints.

Low Bit Rate Speech Compression For Playback In Speech Recognition Systems

by Ron Hoory

2022

Key finding: The RECOVC algorithm compresses speech by encoding Mel-Frequency Cepstral Coefficients (MFCC) and pitch period, enabling lossless recognition over low-bandwidth channels without degrading recognition accuracy in large... Read more


Transformer Model Compression for End-to-End Speech Recognition on Mobile Devices

by Leïla Ben Letaifa

2023, 2022 30th European Signal Processing Conference (EUSIPCO)

Key finding: This work focuses on compressing transformer models used in end-to-end speech recognition by pruning and quantizing weights, significantly reducing model size (up to 84%) with minimal impact on accuracy. By optimizing model... Read more


VAD techniques for real-time speech transmission on the Internet

by Rahul Sah 18SCSE1140067

2023, 5th IEEE International Conference on High Speed Networks and Multimedia Communication (Cat. No.02EX612)

Key finding: Comparative evaluation of time-domain voice activity detection (VAD) algorithms for VoIP shows that efficient silence detection significantly reduces bandwidth by pruning non-speech frames without compromising toll-grade... Read more
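A time-domain VAD of the kind summarized above can be sketched with a short-time energy threshold. This is only an illustration, not one of the algorithms evaluated in the paper: the function names, the 160-sample frame (20 ms at an assumed 8 kHz sampling rate), and the threshold value are assumptions chosen for the example.

```python
import math

def frame_energies(samples, frame_len):
    """Mean energy of each non-overlapping frame of frame_len samples."""
    return [sum(s * s for s in samples[i:i + frame_len]) / frame_len
            for i in range(0, len(samples) - frame_len + 1, frame_len)]

def energy_vad(samples, frame_len=160, threshold=0.01):
    """One flag per frame: True if the frame's mean energy exceeds the threshold."""
    return [e > threshold for e in frame_energies(samples, frame_len)]

def bandwidth_saving(flags):
    """Fraction of frames that would not be transmitted (flagged as non-speech)."""
    return 1.0 - sum(flags) / len(flags)

# Hypothetical test signal: 40 ms of silence followed by 40 ms of a 440 Hz tone
silence = [0.0] * 320
tone = [0.5 * math.sin(2 * math.pi * 440 * n / 8000) for n in range(320)]
flags = energy_vad(silence + tone)
saving = bandwidth_saving(flags)
```

In this toy signal half of the frames are silent, so suppressing them halves the number of frames sent. Practical detectors usually add hangover logic and adaptive thresholds, which this sketch omits.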


All papers in Speech Compression

Method of Speech Signal Compression in Speaker Identification Systems

by Karim Konate

2025

In this paper we present a technique for improving the efficacy of a speech signal compression algorithm without losing the individual features of speech production. Compression in this case means deleting, from the digital signal, those... more


Time-Compressed Speech as an Educational Medium: Studies of Stimulus Characteristics and Individual Differences. Final Report

by Ray Edward Johnson

2024

ED035315 - Time-Compressed Speech as an Educational Medium: Studies of Stimulus Characteristics and Individual Differences. Final Report.


Speech Compression using DWT in FPGA

by P.C. Bhaskar

2024, ijser.org

The paper details speech compression using the discrete wavelet transform on an FPGA. Multimedia files are now in wide use and the storage space they require is large, and sound files have no option so ultimate... more


E9261 Linear Prediction Coding - Line Spectral Frequencies_RamAG

by Ramakrishnan Angarai Ganesan

2024, E9261:Speech Information Processing

Details of linear prediction analysis and line spectral frequencies, both used in speech coding.


Speech Coding Techniques for VoIP Applications: A Technical Review

by Bhagwat P Patil

2024, World Applied Sciences Journal

Voice over Internet Protocol (VoIP) is a revolutionary technology which is acting as a platform for the development of latest trends in modern communication world. The speech signal quality in VoIP is governed by the speech coding... more

Principle of Speech Coding Using Wavelet Transform Based Codec: The process of speech coding using a wavelet-transform-based codec is explained in Fig. 1. The speech quality requirements of the codec dictate the choice of mother wavelet function. The objective is to minimize the reconstruction error variance and maximize the signal-to-noise ratio (SNR) [26]. Wavelets work by decomposing a signal into different resolutions or frequency bands. Compression is achieved by reconstructing the signal from a small number of approximation coefficients and some detail coefficients selected by thresholding. Generally, 5-level decomposition is adequate for speech signals [27].
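The decompose, threshold, and reconstruct procedure described above can be sketched in a few lines. This is a minimal illustration, not the codec from the paper: it uses the Haar wavelet (the simplest member of the Daubechies family) in pure Python, and the function names, the `keep_fraction` parameter, and the synthetic test frame are all assumptions made for the example.

```python
import math

SQRT2 = math.sqrt(2.0)

def haar_step(x):
    """One analysis level: split an even-length list into approximation and detail."""
    approx = [(x[i] + x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / SQRT2 for i in range(0, len(x), 2)]
    return approx, detail

def haar_decompose(x, levels):
    """Return [approx_L, detail_L, ..., detail_1] for an L-level decomposition."""
    approx, details = list(x), []
    for _ in range(levels):
        approx, d = haar_step(approx)
        details.append(d)
    return [approx] + details[::-1]

def haar_reconstruct(coeffs):
    """Invert haar_decompose, working from the coarsest level back to full length."""
    approx = coeffs[0]
    for detail in coeffs[1:]:
        out = []
        for a, d in zip(approx, detail):
            out.extend([(a + d) / SQRT2, (a - d) / SQRT2])
        approx = out
    return approx

def threshold_details(coeffs, keep_fraction):
    """Keep every approximation coefficient; zero all but the largest detail coefficients."""
    mags = sorted((abs(c) for band in coeffs[1:] for c in band), reverse=True)
    n_keep = int(len(mags) * keep_fraction)
    cutoff = mags[n_keep - 1] if n_keep > 0 else float("inf")
    return [coeffs[0]] + [[c if abs(c) >= cutoff else 0.0 for c in band]
                          for band in coeffs[1:]]

def snr_db(original, reconstructed):
    """Signal-to-noise ratio of the reconstruction in decibels."""
    signal = sum(v * v for v in original)
    noise = sum((a - b) ** 2 for a, b in zip(original, reconstructed))
    return float("inf") if noise == 0 else 10.0 * math.log10(signal / noise)

# Synthetic 256-sample frame (two sinusoids) standing in for a speech segment
frame = [math.sin(2 * math.pi * 5 * n / 256) + 0.3 * math.sin(2 * math.pi * 40 * n / 256)
         for n in range(256)]
coeffs = haar_decompose(frame, levels=5)
kept = threshold_details(coeffs, keep_fraction=0.2)
rebuilt = haar_reconstruct(kept)
nonzero = sum(1 for band in kept for c in band if c != 0.0)
compression_ratio = len(frame) / nonzero
```

Here the compression ratio is simply the number of samples divided by the number of retained non-zero coefficients. A real codec would also quantize and entropy-code those coefficients, so this figure is only a rough proxy.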

near comparable results as compared to the standard codecs among other families of the wavelets. The quality of the reconstructed signal was tested by subjective analysis and found to comply with the Mean Opinion Score (MOS) requirements of the ITU standard [37][38]. The MOS of PCM is best, followed by the Daubechies-family wavelet-based codecs. Hence it can be inferred from the above results that wavelet-based codecs provide a good alternative to the ITU standard codecs employed in VoIP applications.

Table 1: Comparison of the Standard Speech Codecs

Conjugate Structure-Algebraic Code Excited Linear Prediction (CS-ACELP) (ITU G.729): CS-ACELP is the most modern hybrid coding technique which is currently deployed in almost all the latest VoIP applications. It operates at 8 kbps and provides near toll-quality performance of the voice signal. The coder is based on the code excited linear prediction model. It utilizes a conjugate structure for the 2-D joint vector quantization of the sub-frame based adaptive codebook gain and fixed codebook gain. Here the linear combination of the code book vector

For the dyadic case, $a = 2^j$ and $b = k$, where $k$ and $j$ are integers. The equation which defines the scaling function $\phi(x)$ is defined as

Table 2: Details of the Test Sentences

Comparative performance of LPC, CELP, PCM, ADPCM and the different wavelet-based codecs is presented in Fig. 2, Fig. 3, Fig. 4 and Fig. 5 in terms of compression ratio, SNR, NRMSE and MOS. It is observed from the results that wavelet-based codecs provide a greater degree of compression than the ITU standard codecs in


The efficient digital transmission of information

by Peter Cochrane

2023, Electronics and Power


A Comparative Study on Compression and Compressed Sensing of Speech Signals

by Flavita Pinto

2023, International journal of engineering research and technology

Speech processing is the fastest growing technology due to its applications in various fields such as research, forensic and aid for blind people. This paper describes speech processing techniques which involve improving the signal to... more


Network Database Security Issues and Defense

by M. Sabareesan

2023

Database security comprises the mechanisms that secure the database against deliberate or accidental threats, unauthorized users, hackers and IP snoopers. In this paper we proposed two mixed techniques to secure the database, i.e. one is... more

Fig 1 Secret communication between end user

Sabareesan M, Gobinathan N / International Journal of Engineering Research and Applications (IJERA), ISSN: 2248-9622, www.ijera.com, Vol. 3, Issue 1, January-February 2013, pp. 1748-1752

3. Mutation operation [7]: Mutation is simply an occasional random alteration of the value of a string position. In a binary code, this involves changing a 1 to 0 and vice versa.
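The mutation operation described above amounts to flipping bits with a small probability. A minimal sketch follows; the function name `mutate`, the per-bit `rate` parameter, and the example string are hypothetical and are not taken from the paper.

```python
import random

def mutate(bits, rate, rng):
    """Flip each bit independently with probability `rate` (1 becomes 0 and vice versa)."""
    return [1 - b if rng.random() < rate else b for b in bits]

# Hypothetical example: a 16-bit string mutated with a 10% per-bit rate
rng = random.Random(42)          # seeded so the run is repeatable
parent = [0, 1, 1, 0, 1, 0, 0, 1, 1, 1, 0, 0, 1, 0, 1, 0]
child = mutate(parent, rate=0.1, rng=rng)
```

A rate of 0 leaves the string unchanged and a rate of 1 inverts every bit; genetic algorithms typically use a small rate so that mutation stays occasional.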

Fig 3. Schematic diagram of inserting records into the database

All the data given in the above step are given to the encryption algorithm for encryption. These are the data that are to be encrypted before adding to the database. The encryption process has the following processes.

Fig. 4 Schematic Block Diagram for updating data

2. Interchange Characters

The receiver obtains the cipher text which is in alphanumeric format. But since the encryption was done in an octal/binary format, the cipher text is again encoded to octal/binary format. The sender and receiver use the same type of octal encoding schemes so as to maintain integrity and this is decided before the encryption and decryption of data starts. After converting the cipher text to octal format, the receiver proceeds to get back the octal encoded plain


Implementation of El-Gamal algorithm for speech signals encryption and decryption

by Ali Thaeer Hammid

2023, Procedia Computer Science

In the applications of Internet and wireless communication network, information security is one of the most challenging aspects. Cryptography is the best solution that offers the requisite protection from unintended persons. By using... more


Conversion of Image to Grayscale Using Wavelet Transformation with Daubechies Basis function Systems

by ABBA MUHAMMAD ADUA

2023

Image restoration forms the foundation of various applications in the areas of medicine, astronomy etc. Historically an image is reproduced utilizing numerous techniques of which Fourier and wavelet transform systems developed from the... more


AUDIO VOICE RECORDER AND TRANSMISSION THROUGH A WIRELESS MEDIUM BETWEEN PC AND VOICE SOURCES

by Bitrus Zirata Kamaunji

2023, Journal of Engineering and Energy Research

There is a fast-growing need to support audio voice communication over a wireless channel between a voice source and a personal computer (PC). In this paper we develop an algorithm using MATLAB software to indicate how a PC can receive audio... more

Analog Audio Voice Transmission between Voice Source and PC

Analog data such as voice and video are often digitized to be able to use digital facilities. Once analog data is converted to digital data, it can be transmitted directly using NRZ-L, encoded as a digital signal and transmitted with other techniques, or converted into an analog signal using ASK, PSK, QAM, etc. The device used for conversion is a codec (coder-decoder). Of the two digital conversion methods (PCM and DM), the most common technique for using digital signals to encode analog data is PCM. To transfer analog voice signals off a local loop to a digital end office within the phone system, one uses a codec. In today's busy world, where every microsecond is considered significant, almost all existing systems can be considered real-time systems because time has become a very important factor for the execution of any system. Present communication systems such as WiMAX and Bluetooth also provide real-time communication, but they have range problems and are not very secure. Wireless technologies have gained a lot of importance because of their speed, security and low cost. A real-time audio and video transmission system based on visible light communication has been attempted, but it has a range limitation of just three meters. In this paper, we are trying to transmit audio to a remote location. This system can be used to transmit recorded audio to a remote computer in real time.

The integer numbers have effectively been coded into zeros and ones. The ones and zeros now contain the audio information encoded in a form that could be processed by a computer.
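The step from samples to integers to ones and zeros can be illustrated with a uniform PCM quantizer. This is a sketch under stated assumptions rather than the paper's MATLAB code: samples are assumed to lie in [-1.0, 1.0], and the function names and the 8-bit default are chosen for the example.

```python
def pcm_encode(samples, bits=8):
    """Map samples in [-1.0, 1.0] to unsigned integer codes of `bits` bits each."""
    levels = 2 ** bits
    codes = []
    for s in samples:
        s = max(-1.0, min(1.0, s))                       # clip out-of-range input
        codes.append(int((s + 1.0) / 2.0 * (levels - 1) + 0.5))  # round to nearest level
    return codes

def to_bitstring(codes, bits=8):
    """Concatenate the fixed-width binary form of every code."""
    return "".join(format(c, "0{}b".format(bits)) for c in codes)

def pcm_decode(codes, bits=8):
    """Map integer codes back to approximate sample values in [-1.0, 1.0]."""
    levels = 2 ** bits
    return [c / (levels - 1) * 2.0 - 1.0 for c in codes]

# Three hypothetical samples: most negative, zero, and most positive
codes = pcm_encode([-1.0, 0.0, 1.0])
bitstream = to_bitstring(codes)
decoded = pcm_decode(codes)
```

With 8 bits per sample and 8,000 samples per second, such a stream would run at 64 kbit/s; the decoded values differ from the originals by at most half a quantization step.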

Graphical Representation of Audio Voice over Wireless Channel for 10 and 20 Seconds


Secure voice cryptography based on Diffie-Hellman algorithm

by Sura F. Yousif

2023, IOP Conference Series: Materials Science and Engineering

This article introduces a new technique for voice signals encryption & decryption to ameliorate the information security during transferring over unsecure network. The presented mechanism is based on a particular type of asymmetric key... more


Implementation of El-Gamal algorithm for speech signals encryption and decryption

by Sura F. Yousif

2023, Procedia Computer Science

Table 5. Comparison with other methods in terms of performance measurements in the decryption process

Fig. 1. Block diagram of the introduced cryptosystem

4. The performance measurements

Fig. 2. Waveforms of the (a) acquired; (b) encrypted; (c) decrypted speech signals

The decryption quality measures used in the current cryptosystem are the same as those used in the encryption process: Signal-to-Noise Ratio (SNR), Segmental Signal-to-Noise Ratio (SNRseg) and Log-Likelihood Ratio (LLR). Higher quality of the decrypted speech signal is obtained when the values of SNR and SNRseg increase while the value of LLR decreases. The results of the presented method are given in Table 2. It is clear from this table that the SNR and SNRseg values are high while the LLR value is low, which indicates high residual intelligibility. This means that the decryption quality of the proposed method is high.
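Two of the measures named above, SNR and SNRseg, can be sketched as follows. The paper's exact formulas are not reproduced in this excerpt, so the code uses the commonly cited definitions (overall energy ratio in dB, and the mean of per-frame SNR values); the function names and the 160-sample frame length are assumptions. LLR is omitted because it requires a linear-prediction analysis.

```python
import math

def snr_db(reference, test):
    """10*log10(signal energy / error energy); assumes both energies are non-zero."""
    signal = sum(r * r for r in reference)
    error = sum((r - t) ** 2 for r, t in zip(reference, test))
    return 10.0 * math.log10(signal / error)

def segmental_snr_db(reference, test, frame_len=160):
    """Mean of the per-frame SNR values over non-overlapping frames."""
    values = [snr_db(reference[i:i + frame_len], test[i:i + frame_len])
              for i in range(0, len(reference) - frame_len + 1, frame_len)]
    return sum(values) / len(values)

# Hypothetical signals: a close copy (as after decryption) and a swamped one (as after encryption)
original = [1.0] * 320
close_copy = [0.9] * 320
swamped = [11.0] * 320

snr_close = snr_db(original, close_copy)
snrseg_close = segmental_snr_db(original, close_copy)
snr_swamped = snr_db(original, swamped)
```

A close copy gives a positive SNR, while a signal whose error energy exceeds the original's energy gives a negative one, which is the behaviour expected of decrypted and encrypted speech respectively.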

Fig. 3. Quality measures under noise effect of (a) SNR; (b) SNRseg; (c) LLR vs. SNR of noise

Table 1. Residual intelligibility results for the encryption process

5.3. Effect of Noise

Table 2. Residual intelligibility results for the decryption process

Table 3. Quality measures results for decrypted speech signal at different SNR values of noise

6. Comparison of the presented system with current methods

Comparison of the suggested technique with current methods is made to measure the system security in encryption and decryption in terms of the performance measurements SNR, SNRseg and LLR. This comparison for the first test speech signal (Signal 1) in the encryption case is presented in Table 4. The values of SNR and SNRseg obtained by the introduced method are strongly negative compared to existing methods, which implies that the noise power is much greater than the speech signal power. This means that detecting the suggested cryptosystem is very hard, while the larger value of LLR in this scheme indicates better quality for the encryption operation. The comparison of the performance measurements in the case of decryption is reported in Table 5. The values of SNR and SNRseg gained by the presented approach are much larger while the value of LLR is much lower compared to other schemes, which manifests better quality of decryption. This reveals that, on most quality criteria, the suggested speech cryptosystem outperforms other current techniques.


Effects of speech codecs on a remote speaker recognition system using noval SAD

by riadh ajgou

2023

The International Conference on Electronics and Communication Systems, 2 April 2014, University of Sofia, Bulgaria


Novel Detection Algorithm of Speech Activity and the impact of Speech Codecs on Remote Speaker Recognition System

by riadh ajgou

2023

In this paper, we studied the effects of voice codecs on remote speaker recognition system, considering three types of speech codec: PCM, DPCM and ADPCM conforming to International Telecommunications Union Telecoms (ITU-T) recommendation... more


Speech Bandwidth Extension with Wavenet

by Yannis Assael

2023, 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Large-scale mobile communication systems tend to contain legacy transmission channels with narrowband bottlenecks, resulting in characteristic 'telephone-quality' audio. While higher quality codecs exist, due to the scale and... more


Hardware Implementation of Image Compression Technique using Wavelet

by Vandana Malode

2023

Today, in the age of technology, the use of digital visual systems is increasing at a tremendous rate for information, entertainment and education purposes; it has therefore become essential to reduce the cost of image transmission and storage as... more


Provably secure and efficient audio compression based on compressive sensing

by Vincent Nyangaresi

2023, International Journal of Electrical and Computer Engineering (IJECE)

The advancement of systems with the capacity to compress audio signals and simultaneously secure them is a highly attractive research subject. This is because of the need to enhance storage usage and speed up the transmission of data, as well... more

Figure 1. General diagram of compressive sensing

To find the values of n variables, it is necessary to have n or more equations. Here the number of equations is much smaller than the number of variables, so there are an infinite number of possible solutions. The true solution of vector x could be found by sensing (in a deterministic recovery way) whether A reflects some properties, then the recovery is possible [23], using non-deterministic CS. It may also be resolved by using optimization methods like the metaheuristic evolutionary method [24], linear programming (LP) based methods, or pseudo-inverse methods. Other solutions could use deterministic CS, which demands a certain recovery process to sense the signal vector and looks like an encoding-decoding technique [25]. The system proposed in this paper is based on CS technology to compress data and reduce signal size by multiplying it with an appropriate sensing matrix. This technology guarantees the retrieval of the signal with the least possible data loss, which is almost less than 0.01. It is also computationally inexpensive and uncomplicated, making the system more efficient.

Figure 2. General diagram of the proposed system (a) CS scheme and (b) reconstruction scheme

In this work, a simplified and efficient system is exploited for compressing an audio signal. The system is based on the compressive sensing principle to reduce the number of samples of the audio signal. It uses the Moore-Penrose pseudo-inverse for the reconstruction operation. The general scheme of the system is shown in Figure 2. In Figure 2(a), the steps of the CS scheme are illustrated, while Figure 2(b) shows the reconstruction scheme for the retrieved signal.

Figure 3. Speech signal with FS 48 kHz (a) original signal with length 6.8e+4 samples, (b) sensed signal with compression rate 30% with length 1.96e+4 samples, (c) sensed signal with compression rate 50% with length 2.95e+4 samples, and (d) reconstructed signal

The result is shown in Figure 3. Figure 3(a) shows an original speech signal with FS 48 kHz and length 6.8e+4 samples. Figure 3(b) shows the sensed signal with compression rate 30% and length 1.96e+4 samples, Figure 3(c) the sensed signal with compression rate 50% and length 2.95e+4 samples, and Figure 3(d) the reconstructed signal. Note how the sensed file shrinks in size while keeping the critical influencing values.

Figure 4. The original signal vs reconstructed signal with Fs of (a) 48000, (b) 11025, and (c) 44100

The reconstructed signals were the same as their original signals in both length and peaks. This indicates the accuracy of retrieval due to the reliance on the Gaussian matrix as a sensing matrix, which has the characteristics of retaining the effective values of the compressed matrix, supporting the retrieved values. Furthermore, several statistical analysis tests were performed to evaluate the reconstruction quality of the system at both compression rates.

Table 2. The size of the tested file before and after compression and time consumed by the system with 30% rate

The pseudo-inverse technique allowed the system to score excellent results in time consumption for both compression and reconstruction. Many different-sized files were compressed with the 30% and 50% compression rates. The size of the compressed file shrank steadily in the number of samples, while the storage size in bytes shrank by different amounts depending on the original file size, as illustrated in Tables 1 and 2. Tables 1 and 2 show that the time was relatively low and suitable for online systems and smart devices, and the compression rate was very convenient. Remark: if the file size is large, its compression rate would also be high.

4.2. Pearson correlation analysis

This is a significant metric for evaluating the similarity between the original and the reconstructed audio signals, which is computed:
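The formula itself did not survive in this excerpt. As an illustration only, the standard Pearson correlation coefficient (covariance divided by the product of the standard deviations) can be computed as below; whether the paper uses exactly this form is an assumption, and the function name is chosen for the example.

```python
import math

def pearson(x, y):
    """Standard Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)   # assumes neither sequence is constant

# Hypothetical check: a scaled copy correlates perfectly, an inverted copy anti-correlates
r_same = pearson([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
r_inverted = pearson([1.0, 2.0, 3.0, 4.0], [-1.0, -2.0, -3.0, -4.0])
```

A value near 1 between the original and reconstructed signals indicates that the reconstruction tracks the original closely.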

Table 3. Statistical analysis values (PSNR, MSE, SSIM, correlation) of the proposed system calculated for different-sized files and compression rates (30% and 50%)

4.5. Comparison with previous systems

4.5.1. Comparative computational complexity analysis

5. CONCLUSION

This paper presents a CS-based compression system for compressing and securing audio signals. The audio signals are segmented as frames of 8x4 small matrices. The frames are then multiplied by a sensing matrix of 3x8 or 4x8, which is generated using Gaussian random numbers. The whole system is a linear system Y=AX and could be solved to reconstruct X using the Moore-Penrose pseudoinverse to calculate $A^{+}$, which makes the system low-cost and easy to implement with less time consumption, while it provides good compression ratios with a reasonable rate of security.
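The linear system Y=AX and its pseudoinverse solution can be sketched in pure Python. This is an illustration under stated assumptions, not the authors' implementation: it uses a 4x8 Gaussian sensing matrix and a single random 8x4 frame, computes the pseudoinverse as A^T (A A^T)^-1 (valid only when A has full row rank), and all helper names are hypothetical.

```python
import random

def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def inverse(m):
    """Gauss-Jordan inverse with partial pivoting for a small square matrix."""
    n = len(m)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def pinv_full_row_rank(a):
    """Moore-Penrose pseudoinverse A^+ = A^T (A A^T)^-1 for a full-row-rank matrix."""
    at = transpose(a)
    return matmul(at, inverse(matmul(a, at)))

rng = random.Random(0)                                   # seeded for repeatability
A = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(4)]      # 4x8 sensing matrix
X = [[rng.uniform(-1.0, 1.0) for _ in range(4)] for _ in range(8)]   # one 8x4 frame
Y = matmul(A, X)                                         # 4x4 compressed frame (half the rows)
X_hat = matmul(pinv_full_row_rank(A), Y)                 # 8x4 reconstruction
residual = max(abs(u - v) for ru, rv in zip(matmul(A, X_hat), Y) for u, v in zip(ru, rv))
```

Because the system is underdetermined, the pseudoinverse returns the minimum-norm frame that reproduces the measurements Y, which in general is not identical to the original X; the residual above only checks consistency with Y.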


Provably secure and efficient audio compression based on compressive sensing

by Vincent Nyangaresi

2023, Research article


Audio encoding using Huang and Hilbert transforms

by Thierry Chonavel

2023

In this paper an audio coding scheme based on the Empirical Mode Decomposition (EMD) in association with the Hilbert transform is presented. The audio signal is decomposed adaptively into intrinsic oscillatory components by EMD called... more

Fig. 2. Decomposition of an audio frame by EMD.

Fig. 1. CEL variation of the audio frame guitar.

Fig. 3. Instantaneous phase and instantaneous amplitude of IMF3.

Fig. 4. Autocorrelation function of instantaneous amplitude of IMF3.

Fig. 5. Instantaneous phase and their extrema of the IMF3.

Table 1. Variations of the TC and the SDG over the AR order.

Table 2 shows that the improvement in TC provided by the proposed method varies from 9.96:1 to 11.3:1 relative to the TC achieved by wavelets and MP3. Even for the sing signal, we can still observe the effectiveness of the proposed method in compression. A careful examination of the results reported in Table 2 shows that the proposed approach performs remarkably better than the wavelet and MP3 methods. Furthermore, when listening to the decoded signal, the proposed method produces lower noise compared to the wavelet method and MP3. This result is shown in Table 2, where the SDG values obtained for a given TC are better than those of the other methods. The obtained results show the interest of encoding both IA and IP.

Table 2. Compression results of audio signals (guitar, violin and sing) by the proposed approach, MP3 and the wavelet.


Development of RSA with Random Permutation and Inversion Algorithm to Secure Speech in GSM Networks

by Khaled Merit

2023

Global System for Mobile Communications (GSM) is one of the most commonly used cellular technologies in the world. One of the objectives in mobile communication systems is the security of the exchanged data. GSM employs many cryptographic... more


Audio steganography coding using the discrete wavelet transforms

by siwar rekik

2022

The performance of an audio steganography compression system using the discrete wavelet transform (DWT) is investigated. Audio steganography coding is the technology of transforming stegospeech into an efficiently encoded version that can be... more


Novel Detection Algorithm of Speech Activity and the impact of Speech Codecs on Remote Speaker Recognition System

by Riadh Ajgou

2022


Enhanced Hybrid Encryption Algorithm for Security of Network

by Shabnam Parveen

2022

One of the biggest problems in cryptography is the distribution of keys. Suppose you live in the United States and want to pass information secretly to your friend in Europe. If you truly want to keep the information secret, you need to... more


Provably secure and efficient audio compression based on compressive sensing

by International Journal of Electrical and Computer Engineering (IJECE) and

2022, International Journal of Electrical and Computer Engineering (IJECE)


Investigations into the quality limitations of LPC speech

by Dale Stevenson

2022, The Journal of the Acoustical Society of America


Data Transmission Using TCP, Gzip & Tiny Algorithms

by Durga Bhavani

2022, ijera.com

Abstract: The aim of the paper is to provide security during the transmission of data. Commonly used technologies are cryptography, compression and decompression. This can be used for secure and fast sending for security purpose we... more


Secure Transceiver Based on Independent Component Analysis (ICA) Algorithm

by Isam Hameed

2022, International Journal of Intelligent Engineering and Systems

In this paper, a system for the purpose of signals encryption using the technique of independent component analysis has been proposed. The proposed system mixes the original signal with arbitrary number of random signals in order to... more


Secret Technique to Hiding Image after Compression In Cover Image

by SADDAM KAMIL

2022, uotechnology.edu.iq

This paper presents a highly secret technique for compressing an image and hiding it in a cover image, based on the wavelet transform and the wavelet packet transform. First, a two-dimensional wavelet packet transform is applied to the cover image... more


Novel Detection Algorithm of Speech Activity and the impact of Speech Codecs on Remote Speaker Recognition System

by said ghendir

2022


Distortion of voicing and vocal tract parameters after codecs

by Mohamed Hesham

2022

In this work, we present results on the effect of well-known mixed excitation linear prediction (MELP) and code-excited linear prediction (CELP) codecs (coder/decoder) on voicing and vocal tract parameters of Arabic sounds. The study... more


Covert VoIP Communication based on Audio Steganography

by Hameed R. Farhan

2022, International Journal of Computing and Digital Systems

Voice over Internet Protocol (VoIP) is a popular and important internet protocol for real-time voice calling. It is used in several software applications such as Skype, WhatsApp, and Google Talk. However, communications over the internet... more

Figure 1. General scheme of the introduced real time covert communication system

Figure 3. General block diagram for the embedding scheme of the proposed approach

Figure 2. Flowchart of the embedding process for a single secret digit

Figure 4. General block diagram for the recovery scheme of the proposed approach

Where: m is the total number of samples in the secret speech, k is the total number of bits in the secret speech, and b = 0 when $\hat{S}_i = S_i$ and b = 1 when $\hat{S}_i \neq S_i$. To measure the similarity between the original and the retrieved secret speech, Normalized Cross-Correlation (NC) and Bit Error Rate (BER) are computed according to equation 6 [17], [18] and equation 7 [19], respectively.

TABLE III. Tests of secret speech immunity against AWGN for the introduced approach

Figure 5. Simulation diagram to test the quality of the proposed system at a lossy channel with different packet loss rates

Figure 7. Effect of packet losses on the NC

Figure 6. Effect of packet losses on the SNR

[Author name illegible] received his B.Sc. and M.Sc. degrees from the Department of Electronics and Communications, Al-Nahrain University, Iraq, in 2002 and 2005, respectively. He received the Ph.D. degree from the Department of Electrical Engineering and Electronics, University of Liverpool, UK, in 2016. Currently, he is a Lecturer at the Department of Electrical Engineering and Electronics, University of Kerbala, Iraq. His current research interests include wireless power transfer and telemetry to implantable medical devices, wearable and implantable.

TABLE IV. Comparative results of the perceptual quality and hiding rate with some related approaches
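The two similarity measures named in this entry, Normalized Cross-Correlation (NC) and Bit Error Rate (BER), can be illustrated with a minimal sketch. Equations 6 and 7 of the paper are not reproduced in this listing, so the code below assumes the commonly used definitions: NC as the inner product of the two sample sequences divided by the product of their norms, and BER as the fraction of the k bits that differ (b = 1 for a mismatch, b = 0 otherwise). The function names are hypothetical and are not taken from the paper.

```python
import math


def normalized_cross_correlation(original, retrieved):
    """NC of two equal-length sample sequences (assumed common definition).

    Returns 1.0 when the retrieved signal is a positive scaled copy
    of the original.
    """
    if len(original) != len(retrieved):
        raise ValueError("sequences must have the same length")
    numerator = sum(s * r for s, r in zip(original, retrieved))
    denominator = math.sqrt(
        sum(s * s for s in original) * sum(r * r for r in retrieved)
    )
    # Guard against all-zero input, where NC is undefined.
    return numerator / denominator if denominator else 0.0


def bit_error_rate(original_bits, retrieved_bits):
    """Fraction of bit positions where the retrieved bit differs."""
    if len(original_bits) != len(retrieved_bits):
        raise ValueError("bit sequences must have the same length")
    if not original_bits:
        raise ValueError("bit sequences must not be empty")
    # b = 1 for each mismatching position, b = 0 otherwise.
    errors = sum(1 for s, r in zip(original_bits, retrieved_bits) if s != r)
    return errors / len(original_bits)
```

With these definitions, a perfectly recovered secret speech gives an NC of 1 and a BER of 0; packet loss or channel noise lowers the NC and raises the BER.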

Speech Compression Analysis Using Matlab

by Manas Arora

2022, International Journal of Research in Engineering and Technology

The growth of cellular technology and wireless networks all over the world has increased the demand for digital information manifold. This massive demand poses difficulties for handling huge amounts of data that need to be stored... more

Protecting VoIP Communications in a Multipath Environment using Modified Secret Sharing Algorithm

by Dr.K.Maheswari CS

2022

Voice over Internet Protocol (VoIP) is a technology for carrying voice calls over the internet. It is intended to replace the traditional Public Switched Telephone Network (PSTN). VoIP is a growing technology. Its functions, facilities... more

Efficient Low-Resource Compression of HIFU Data

by Pavel Zemcik

2022, Information

Large-scale numerical simulations of high-intensity focused ultrasound (HIFU), important for model-based treatment planning, generate large amounts of data. Typically, it is necessary to save hundreds of gigabytes during simulation. We... more

Comparing Performance of Kalman Filtering and DWT based Speech Enhancement Techniques

by Suneel Miriyala

2022

This paper discusses the importance of speech enhancement and analyzes the performance of DWT-based and Kalman-filter-based speech enhancement techniques. The objectives of speech enhancement vary widely: reduction of noise level,... more

An Efficient Time Domain Speech Compression Algorithm Based on LPC and Sub-Band Coding Techniques

by Palaniandavar Venkateswaran

2022, Journal of Communications

Speech compression is a mature technology with many applications. Over the past decade, huge advances have been made in the area of speech coding for reduced bit-rate transmission. With perceptual audio coding, the signal is coded... more

A New Sinusoidal Speech Coding Technique with Speech Enhancer at Low Bit Rates

by eyad alqam

2022

Speech coding deals with the problem of reducing the bit rate required for representing speech signals while preserving the quality of the speech reconstructed from that representation. In this paper, we propose a novel speech coding... more

Effects of speech codecs on a remote speaker recognition system using noval SAD

by said ghendir

2022

The International Conference on Electronics and Communication Systems, 02 April 2014, Sofia University, Bulgaria

SNR Improvement and Bandwidth Optimization Technique Using PCM-DSSS Encryption Scheme

by SAIFUL BAHRI MOHAMED

2022, International Journal on Advanced Science, Engineering and Information Technology

Cryptography, the scheme of information stashing and verification, entirely deals with protocols, algorithms and strategies to ensure the precise security facility of the signal consistently by hindering unauthorized access to the... more

Text to Speech Synthesizer for Tigrigna Linguistic using Concatenative Based approach with LSTM model

by mezgebe araya

2022, Indian Journal of Science and Technology

The purpose of this study is to describe a text-to-speech system for the Tigrigna language, using a dialog fusion architecture, and to develop a prototype text-to-speech synthesizer for the Tigrigna language. Methods: The direct observation and... more

Fig 1. Proposed work architecture. This study describes the TTS system for the Tigrigna language, using a speech analysis architecture. It uses concatenative speech synthesis, in which segments of recorded speech are concatenated to produce the desired output.

Fig 2. Long short-term memory (LSTM) structure. LSTM is one kind of RNN. The four important layers in LSTM models are point-wise operation, vector transfer, neural network layer, and concatenate and copy. The three gates are computed as follows.
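The gate equations that this caption refers to do not appear in the listing. For orientation only, the widely used textbook formulation of the three LSTM gates (forget, input, output) and the resulting state updates is sketched below. The symbols ($W$, $b$, $x_t$, $h_t$, $c_t$) follow common convention and are an assumption; the paper's own notation may differ.

```latex
\begin{aligned}
f_t &= \sigma\left(W_f\,[h_{t-1}, x_t] + b_f\right) && \text{(forget gate)}\\
i_t &= \sigma\left(W_i\,[h_{t-1}, x_t] + b_i\right) && \text{(input gate)}\\
o_t &= \sigma\left(W_o\,[h_{t-1}, x_t] + b_o\right) && \text{(output gate)}\\
\tilde{c}_t &= \tanh\left(W_c\,[h_{t-1}, x_t] + b_c\right) && \text{(candidate cell state)}\\
c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t && \text{(cell state update)}\\
h_t &= o_t \odot \tanh(c_t) && \text{(hidden state)}
\end{aligned}
```

Here $\sigma$ is the logistic sigmoid, $[h_{t-1}, x_t]$ is the concatenation of the previous hidden state with the current input, and $\odot$ is the point-wise product, which correspond to the "concatenate" and "point-wise operation" elements named in the caption.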

Fig 3. Performance of LSTM in different Epoch number

The overall intelligibility of the system, as rated by twenty listeners for the ten Tigrigna sentences, is found to be 3.27, which means the synthesizer is ‘good’ as per the scale of the MOS test. The overall naturalness of the synthesizer is found to be 3.28, which also approaches ‘good’ on the MOS scale. These values of intelligibility and naturalness are encouraging for the development of a better system. Though we tried to record the corpus data in a quiet room, some noise from the computer itself was audible during the synthesis of the data, which degrades the naturalness of the sound for the listeners.

Fig 5. Formants of the word "arba"

Fig 6. The LPC feature extraction of the word "arba"

Fig 7. The original signal of the word "arba"

Fig 8. The original signal of the word "arba"

Figure 8. Segmentation of Recorded Speech using PRAAT

Fig 9. GUI Text-to-Speech Synthesizer for Tigrigna language

Table 1. Scales used in MOS

The evaluators then provide their ratings based on the MOS scale as shown in Table 1. To evaluate the synthesizer's intelligibility and naturalness, ten sentences are prepared as test data for the synthesizer. All words used in the sentences are found in the compiled lexicon. The selected individuals then listen to the synthesized waveform and evaluate naturalness and intelligibility based on the MOS scale. The invited native Tigrigna speakers are given a questionnaire to evaluate the intelligibility and naturalness of the synthesized speech.

Table 2. Intelligibility (MOS) Scores of Tigrigna Speech Synthesizer. Here P denotes the persons invited to evaluate the intelligibility of the text-to-speech synthesizer for the Tigrigna language.

Here P denotes the persons invited to evaluate the naturalness of the text-to-speech synthesizer for the Tigrigna language.

Table 4. Average MOS Scores of Tigrigna Speech Synthesizer
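The overall MOS figures quoted for this synthesizer are averages of listener ratings over the test sentences. The sketch below shows that averaging step under stated assumptions: a five-point scale from 1 to 5 (the contents of Table 1 are not reproduced in this listing), one rating per listener per sentence, and a hypothetical function name. Any ratings passed to it in an example are illustrative, not data from the paper.

```python
from statistics import mean


def mean_opinion_score(ratings):
    """Average all listener ratings into a single MOS value.

    `ratings` is a list with one entry per listener; each entry is a list
    of that listener's scores, one per test sentence. A 1-5 scale is
    assumed here.
    """
    all_scores = [score for listener in ratings for score in listener]
    if not all_scores:
        raise ValueError("no ratings supplied")
    if any(not 1 <= score <= 5 for score in all_scores):
        raise ValueError("ratings must lie on the assumed 1-5 scale")
    return mean(all_scores)
```

Calling the function once with the intelligibility ratings and once with the naturalness ratings would give the two overall scores separately.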

Novel Detection Algorithm of Speech Activity and the impact of Speech Codecs on Remote Speaker Recognition System

by Salim Sbaa

2022

Developing secure communication systems using the tms320c50 dsp

by Fernando Sousa

2022, Texas Instruments, SPRA318

TI warrants performance of its semiconductor products and related software to the specifications applicable at the time of sale in accordance with TI's standard warranty. Testing and other quality control techniques are utilized to... more

Investigations of the Distributions of Phonemic Durations in Hindi and Dogri

by padmini rajput

2022

Speech generation is one of the most important areas of research in speech signal processing and is now gaining serious attention. Speech is a natural form of communication in all living things. Computers with the ability to... more

Effect of Singular Value Decomposition Based Processing on Speech Perception

by padmini rajput

2022, International Journal on Natural Language Computing

Speech is an important biological signal: it is the primary mode of communication among human beings and the most natural and efficient form of exchanging information among humans. Speech processing is the most important aspect in... more

A Novel Encryption System using Layered Cellular Automata

by ARYAN SINGH

2022

As technology advances rapidly, the sharing of information over the internet is experiencing explosive growth, which in turn poses new threats and exposes vulnerabilities in existing systems. The quest for more... more

Effects of speech codecs on a remote speaker recognition system using noval SAD

by Salim Sbaa

2022

The International Conference on Electronics and Communication Systems, 02 April 2014, Sofia University, Bulgaria

Image Compression Through Combination Advantages From Existing Techniques

by Habib Hamam

2022

The tremendous growth of digital data has led to a strong need for compression applications, either to reduce memory usage or to reduce transmission time. Despite the fact that many techniques already exist, there is still space and need... more
