Multimedia signal processing Research Papers

Influence of visual depth and vibration on the high-level perception of reality in 3D contents

2025

This study investigated the influence of stereoscopic visual depth and body vibration on the high-level affective perception that concerns the senses of presence and verisimilitude. The multisensory content used in our experiment... more

On Dual View Lipreading Using High Speed Camera

by Leon Rothkrantz

2025

Lipreading is receiving increasing attention from the scientific community. However, many aspects related to lipreading are still unknown or poorly understood. In the current paper we present the entire process used for engineering the data for... more

Dense disparity estimation in multiview video coding

by Wided MILED SOUID

2025, 2009 IEEE International Workshop on Multimedia Signal Processing

Multiview video coding is an emerging application where, in addition to classical temporal prediction, an efficient disparity prediction should be performed in order to achieve the best compression performance. A popular coder is the... more

A convex programming approach for color stereo matching

by Wided MILED SOUID

2025, 2008 IEEE 10th Workshop on Multimedia Signal Processing

This paper addresses the problem of dense disparity estimation from a pair of color stereo images. Based on a convex set theoretic formulation, the stereo matching problem is cast as a convex programming problem in which a color-based... more

Detection System for Grey and Colour Images Based on Extracting Features of Difference Image and Renormalized Histogram

by Mohammad Aljarf

2025

The literature has introduced many steganalysis methods intended to combat specific steganography techniques and to detect particular image formats. This paper proposes a detection system based on extracting histogram features. The features are extracted by exploiting the histogram of the difference image, which usually follows a generalised Gaussian distribution centred at 0. The histogram of the difference image and the renormalized histogram are created for clean and stego images, and the peak value and the renormalized histogram are then used as features for classification. To obtain the difference between neighbouring pixels, the difference images are computed in four directions (vertical, horizontal, diagonal, and anti-diagonal). The renormalized histogram of the difference image is created a number of times (n) for the four directions. This work implements two commonly used steganography methods, Least Significant Bit (LSB) and the F5 algorithm, to create a large database of stego images for system evaluation. Colour and grey images in different formats are chosen for training and testing the system. These formats cover both lossless and lossy compression, with all features extracted from each colour channel (RGB) separately. The size of hidden files plays an important role in detection. Therefore, to improve the proposed system's detection capacity, different sizes of hidden files have been considered. The proposed detection system was trained and tested to distinguish stego images from clean ones using the Discriminant Analysis (DA) classification method and a Multilayer Perceptron (MLP) neural network. The experimental results show that the proposed system possesses reliable detection ability and accuracy. The chosen classification methods perform differently when classifying grey and colour images. The system generalises better than previous systems by covering different types of stego images, image formats and hidden file sizes. In addition, extensive experimental results show that the proposed steganalysis system outperforms some previous detection methods.
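
As an illustration only, the four-direction difference-image features described in this abstract might be sketched in plain Python as follows. The function names, the nested-list image representation, and the choice to normalise the histogram peak by the number of differences are assumptions made here and are not taken from the paper; the renormalized histogram is not defined in the abstract, so it is left out.

```python
from collections import Counter

# Offset (row, col) to the neighbour used for each of the four directions.
# These names and offsets are illustrative assumptions.
DIRECTIONS = {
    "horizontal": (0, 1),
    "vertical": (1, 0),
    "diagonal": (1, 1),
    "anti_diagonal": (1, -1),
}


def difference_image(img, direction):
    """Return all neighbouring-pixel differences along one direction."""
    dr, dc = DIRECTIONS[direction]
    rows, cols = len(img), len(img[0])
    diffs = []
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dr, c + dc
            # Skip pixels whose neighbour falls outside the image.
            if 0 <= r2 < rows and 0 <= c2 < cols:
                diffs.append(img[r][c] - img[r2][c2])
    return diffs


def peak_feature(diffs):
    """Height of the tallest histogram bin, as a fraction of all differences."""
    if not diffs:
        return 0.0
    hist = Counter(diffs)
    return max(hist.values()) / len(diffs)


def extract_features(img):
    """One peak feature per direction for a single grey image or colour channel."""
    return {name: peak_feature(difference_image(img, name)) for name in DIRECTIONS}
```

For a colour image, `extract_features` would be called once per channel, and the resulting values could then be passed to whichever classifier is being evaluated.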

Pre-Service Teachers’ Views on the Implementation of Game-Based Learning for Academic Writing Skills

by Melor Md Yunus

2025, Sains Insani

Implementation of game-based learning has been perceived by educators as a means to enhance effective classroom learning. Aspects of games have been found to motivate learners to engage actively throughout the learning process, as it provides... more

THE DESIGN AND DEVELOPMENT OF MobiEko: A MOBILE EDUCATIONAL APP FOR MICROECONOMICS MODULE

by Melor Md Yunus

2025, Malaysian Journal of Learning and Instruction

Purpose – The purpose of this study is to present the steps taken to produce a mobile learning application framework for learning Microeconomics, named “MobiEko Apps”. A mobile learning application is utilized because the framework... more

Toward realtime side information decoding on multi-core processors

by Svetislav Momcilovic

2025, 2010 IEEE International Workshop on Multimedia Signal Processing

Most distributed source coding schemes involve the application of a channel code to the signal and transmission of the resulting syndromes. For low complexity encoding with superior compression performance, graph-based channel codes such... more

Partitioning of Posteriorgrams Using Siamese Models for Unsupervised Acoustic Modelling

by Giampiero Salvi

2025

Unsupervised methods tend to discover highly speaker-specific representations of speech. We propose a method for improving the quality of posteriorgrams generated from an unsupervised model through partitioning of the latent classes. We... more

Model-based demosaicking for acquisitions by a RGBW color filter array

by Mauro Dalla Mura

2025, arXiv (Cornell University)

Microsatellites and drones are often equipped with digital cameras whose sensing system is based on color filter arrays (CFAs), which define a pattern of color filter overlaid over the focal plane. Recent commercial cameras have started... more

Multi-scale and Multi-orientation Face Recognition using Voting based Extreme Learning Machine

by Anil Khandelwal

2025, International Journal of Computer Applications

In daily life, humans can remember many faces and recognize them irrespective of illumination, aging, obstructions, and variation in view. Many researchers have worked on the problem of face recognition to develop an automatic face... more

Measuring Learners’ Perceived Satisfaction Towards e-Learning Material and Environment

by Parilah M. Shah

2025

The use of effective teaching materials, whether paper-based or computer-based, ensures that knowledge is transferred effectively and meaningfully to students. The materials can be even more effective if they are... more

Data Hiding in H.264 Encoded Video Sequences

by Eleni Varsaki

2025, 2007 IEEE 9th Workshop on Multimedia Signal Processing

A new method for high capacity data hiding in H.264 streams is presented. The proposed method takes advantage of the different block sizes used by the H.264 encoder during the inter prediction stage in order to hide the desirable data. It... more

Recent advances in transport level error control techniques for wireless video transmission

by Ryoichi Komiya

2025, 2009 International Multimedia, Signal Processing and Communication Technologies

Transmission of compressed video over wireless channels remains a challenging task due to the noisy nature of the wireless channels: a single bit error in the compressed video bit-stream might cause the reconstructed video to be... more

Automatic Organisation, Segmentation, and Filtering of User-Generated Audio Content

by Sofia Cavaco

2025, arXiv (Cornell University)

Using solely the information retrieved by audio fingerprinting techniques, we propose methods to treat a possibly large dataset of user-generated audio content, that (1) enable the grouping of several audio files that contain a common... more

Automatic organisation, segmentation, and filtering of user-generated audio content

by Sofia Cavaco

2025, 2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Using solely the information retrieved by audio fingerprinting techniques, we propose methods to treat a possibly large dataset of user-generated audio content, that (1) enable the grouping of several audio files that contain a common... more

Flexible disk scheduling for multimedia presentation servers

by Lillykutty Jacob

2025, 2002 IEEE Workshop on Multimedia Signal Processing.

The history of video quality model validation

by Margaret Pinson

2025, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP)

This paper describes objective video quality validation efforts conducted in the past two decades. Validation efforts to be examined include a validation test performed by the T1A1 committee in the early 1990's; five rounds of validation... more

Audio-Visual Feature Extraction for Semi-Automatic Annotation of Meetings

by Marián Képesi

2025, 2006 IEEE Workshop on Multimedia Signal Processing

Emotion recognition and its application to computer agents with spontaneous interactive capabilities

by Ryohei Nakatsu

2025

In this paper, we first study the recognition of emotions involved in human speech. We propose an emotion recognition algorithm based on a neural network and also propose a method to collect a large speech database that contains emotions.... more

An Efficient Bottom-Up Image Segmentation Method Based on Region Growing, Region Competition and the Mumford Shah Functional

by Seddik Djouadi

2025, 2006 IEEE Workshop on Multimedia Signal Processing

Curve evolution implementations [3][17][18] of the Mumford-Shah functional are of broad interest in image segmentation. These implementations, however, have initialization problems. A mathematical analysis of the initialization problem... more

Efficient Implementation of the Chan-Vese Models Without Solving PDEs

by Seddik Djouadi

2025, 2006 IEEE Workshop on Multimedia Signal Processing

Efficient implementation methods are proposed for Chan-Vese models [3] [16]. The proposed methods do not require solutions of PDEs and are therefore fast. The advantages of level set methods, such as automatic handling of topological... more

Analysis of the Characteristics of an MPLS Network Simulator (Analiza karakteristika MPLS mrežnog simulatora)

by Boban Pavlovic

2025, Vojnotehnički Glasnik

Professor Dr Milojko Jevtović, graduate engineer

A robust multimedia watermarking technique using Zernike transform

by S. Shirani

2025, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564)

In this paper a new watermarking method using rotation-invariant Zernike moments is introduced. The watermark signal is embedded in the Zernike moments of the input image. The watermarked image does not show any quality degradation. Tests... more

A significant motion vector protection-based error-resilient scheme in H.264

by Kuo-chin Fan

2025

This paper proposes a significant motion vector protection (SMVP) scheme for error-resilient transmission of videos. In terms of a rate-distortion optimization model, we show how to determine the significant motion vectors (SMVs) and how... more

Biosignal and context monitoring: Distributed multimedia applications of Body Area Networks in healthcare

by Bert-Jan van Beijnum

2025, 2008 IEEE 10th Workshop on Multimedia Signal Processing

We are investigating the use of Body Area Networks (BANs), wearable sensors and wireless communications for measuring, processing, transmission, interpretation and display of biosignals. The goal is to provide telemonitoring and... more

An audio-visual saliency model for movie summarization

by Petros Maragos

2024, 2007 IEEE 9th International Workshop on Multimedia Signal Processing, MMSP 2007 - Proceedings

A saliency-based method for generating video summaries is presented, which exploits coupled audiovisual information from both media streams. Efficient and advanced speech and image processing algorithms to detect key frames that are... more

Watermarking of Color Image Using DWT-SVD

by Madhuri Thube

2024

To enhance the security of copyrighted images, we propose an imperceptible color image watermarking scheme. Nowadays, internet technology is widely used all over the world. It uses many different types of data, digital images one of... more

A Planar Microphone Array for Spatial Coherence-Based Source Separation

by Thushara Abhayapala

2024

We proposed a spatial coherence-based PSD estimation and source separation technique in [1] using a 32-channel spherical microphone array. While the proposed spherical microphone-based method exhibited a satisfactory performance in... more

A Blind Video Watermarking Algorithm for Copyright Protection based on Dual Tree Complex Wavelet Transform

by Roberto Cusani (UNIROMA1)

2024, Multimedia Signal Processing

DVDs and Blu-rays are among the most frequent victims of video content counterfeiting. In particular, illegal distribution of movies on the Internet is a growing menace to the film industry. For this reason, authentication techniques are required to... more

Classification of Human Height and Weight Based on Foot Sole Images Using the Discrete Wavelet Transform (DWT) and Multiclass Support Vector Machine (SVM-MC)

by Suci Aulia

2024

ABSTRACT Body weight is one of the parameters that describes body mass. In body weight measurements carried out manually, that is, using a weighing device (a step-on scale), it was found... more

Aerial communications using piano, clarinet, and bells

by Pedro M. Q. Aguiar

2024, 2002 IEEE Workshop on Multimedia Signal Processing.

This work explores novel mechanisms for aerial acoustic machine-machine communications. It builds on previous work by some of the authors [1], as well as others [2]. In this paper we describe aerial acoustic communication systems that... more

Factorization with missing data for 3D structure recovery

by Pedro M. Q. Aguiar

2024, 2002 IEEE Workshop on Multimedia Signal Processing.

Matrix factorization methods are now widely used to recover 3D structure from 2D projections [1]. In practice, the observation matrix to be factored out has missing data, due to the limited field of view and the occlusion that occur in... more

Hidden Markov model for automatic transcription of MIDI signals

by 茂樹嵯峨山

2024, 2002 IEEE Workshop on Multimedia Signal Processing.

This paper describes a Hidden Markov Model (HMM)-based method of automatic transcription of MIDI (Musical Instrument Digital Interface) signals of performed music. The problem is formulated as recognition of a given sequence of... more

Visual Quality Assessment Using A Contrast Gain Control Model

by Stefan Winkler

2024

Much of the work on visual quality assessment has been devoted to gray-level images; metrics taking into account color information and the temporal component are still relatively rare. This paper presents a quality metric for color video... more

Optimal resource allocation for multimedia cloud based on queuing model

by Ling Guan

2024, 2011 IEEE 13th International Workshop on Multimedia Signal Processing

Combining stereo and visual hull information for on-line reconstruction and rendering of dynamic scenes

by M. Magnor

2024, 2002 IEEE Workshop on Multimedia Signal Processing.

In this paper, we present a novel system which combines depth-from-stereo and visual hull reconstruction for acquiring dynamic real-world scenes at interactive rates. First, we use the silhouettes from multiple views to construct a... more

Depth map denoising using graph-based transform and group sparsity

by Oscar Au

2024, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP)

Depth maps, characterizing per-pixel physical distance between objects in a 3D scene and a capturing camera, can now be readily acquired using inexpensive active sensors such as Microsoft Kinect. However, the acquired depth maps are often... more

Rate Control Based on Zero-Residue Pre-Selection for Video Transcoding

by Oscar Au

2024, 2005 IEEE 7th Workshop on Multimedia Signal Processing

A common issue in video transcoding for heterogeneous network environment is to efficiently and accurately reduce the bit-rate such that the distortion is minimized under a given rate constraint. To convert the bit-rate of an encoded... more

Improved approach for full search motion estimation on GPU

by Fatma Sayadi

2024

In order to speed up video coding for standards such as H.264/AVC and H.265/HEVC, we propose in this paper a parallel approach to the full search (FS) algorithm for motion estimation on a Graphics Processing Unit (GPU). We implemented the traditional... more

Human interaction recognition based on the co-occurrence of visual words

by Nour El Houda Khadidja SLIMANI

2024, HAL (Le Centre pour la Communication Scientifique Directe)

This paper describes a novel methodology for automated recognition of high-level activities. A key aspect of our framework relies on the concept of co-occurring visual words for describing interactions between several persons. Motivated by... more

Bidirectional hierarchical anchoring of motion fields for scalable video coding

by Reji Mathew

2024, 2014 IEEE 16th International Workshop on Multimedia Signal Processing (MMSP)

The ability to predict motion fields at finer temporal scales from coarser ones is a very desirable property for temporal scalability. This is at best very difficult in current state-of-the-art video codecs (i.e., H.264, HEVC), where... more

Emotion recognition and its application to computer agents with spontaneous interactive capabilities

by Ryohei Nakatsu

2024

In this paper, we first study the recognition of emotions involved in human speech. We propose an emotion recognition algorithm based on a neural network and also propose a method to collect a large speech database that contains emotions.... more

PAPR reduction in MIMO-OFDM system using combination of OSTBC Encoder and Spreading code sequence

by Javaid A. Sheikh

2024, IMPACT-2013

Orthogonal Frequency Division Multiplexing (OFDM) is regarded as one of the most outstanding multicarrier modulation techniques in fourth generation (4G) wireless networks, which makes it possible to transfer very high bit rates despite... more

Gesture synthesis from SignWriting notation

by Yosra Bouzid

2024

Sign language synthesis has seen a large increase in applications over the past few decades, as it represents a potential solution to the communication problem for the deaf community. All that is needed is to convert a writing form (books,... more

Robust video transmission using Layered Compressed Sensing

by Nam Nguyen

2024, 2009 IEEE International Workshop on Multimedia Signal Processing

We propose a novel Layered Compressed Sensing (CS) approach for robust transmission of video signals over packet loss channels. In our proposed method, the encoder consists of a base layer and an enhancement layer. The base layer is a... more

Assessment and prediction of negative symptoms of schizophrenia from RGB+D movement signals

by Bhing Leet Tan

2024, 2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

Negative symptoms of schizophrenia significantly affect the daily functioning of patients, especially movement and expressive gestures. The diagnosis of such symptoms is often difficult and requires the expertise of a trained clinician.... more

Frame rate and viseme analysis for multimedia applications

by Aggelos Katsaggelos

2024, Proceedings of First Signal Processing Society Workshop on Multimedia Signal Processing

In the future, multimedia technology will be able to provide video frame rates equal to or better than 30 frames per second (FPS). Until that time, the hearing impaired community will be using band-limited communication systems over... more

Rate-Distortion Optimization for Internet Video Summarization and Transmission

by Aggelos Katsaggelos

2024, 2005 IEEE 7th Workshop on Multimedia Signal Processing

The goal of video summarization is to generate a shorter video sequence of a lengthy original sequence using only the key frames of the original sequence. We consider a video summarization scheme that generates a video summary that can be... more

Visual Speech Analysis, Application to Arabic Phonemes

by Fatma zohra CHELALI

2024

The aim of this work is to introduce primary research on Arabic audiovisual analysis. Each language has multiple phonemes and visemes, and each viseme can have multiple phonemes. The first part focuses on how to classify Arabic visemes... more
