Papers by Vishnu Baskaran
Dominant speaker detection using discrete Markov chain for multi-user video conferencing
2015 IEEE International Conference on Consumer Electronics - Taiwan, 2015
Dual layer video stream in HEVC through information hiding
2015 IEEE International Conference on Consumer Electronics - Taiwan, 2015
Building a real-time multiple H.264 video streaming system based on Intel IPP
2010 IEEE Asia Pacific Conference on Circuits and Systems, 2010
This paper describes the building blocks required to develop a software-based real-time multiple H.264 video streaming system. Here, we consider video capture using Microsoft® DirectShow and a video codec based on the Intel® Integrated Performance Primitives (IPP) H.264 codec library, with the Real-time Transport Protocol (RTP) used as the transport protocol for the compressed video packets. These modules are implemented
Audio mixer with Automatic Gain Controller for software based Multipoint Control Unit
2010 IEEE Asia Pacific Conference on Circuits and Systems, 2010
This paper proposes two audio mixing algorithms for a software-based Multipoint Control Unit (MCU) using an Automatic Gain Controller. The objectives of the proposed algorithms include performing selective mixing, minimizing audio clipping, assigning higher amplitude priority to the loudest speaker, and ensuring a smooth transition as the primary speaker changes from one participant to another. The proposed algorithms are able to
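The abstract above is truncated and does not give the two algorithms, so the following is only a minimal sketch of three of the stated objectives (selective mixing, priority for the loudest speaker, and clipping avoidance). Everything in it is an assumption made for illustration: the function names `rms` and `mix_frames`, the parameters `max_speakers`, `loudest_weight` and `other_weight`, and the use of a per-frame peak limiter in place of a real Automatic Gain Controller with attack and release smoothing. It does not attempt the smooth speaker transition the abstract mentions.

```python
import math
from typing import Dict, List

INT16_MAX = 32767  # largest magnitude allowed for 16-bit PCM samples


def rms(frame: List[int]) -> float:
    """Root-mean-square level of one PCM frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def mix_frames(
    frames: Dict[str, List[int]],
    max_speakers: int = 3,        # hypothetical: how many participants are mixed
    loudest_weight: float = 1.0,  # hypothetical: weight of the loudest speaker
    other_weight: float = 0.5,    # hypothetical: weight of the remaining speakers
) -> List[int]:
    """Mix the loudest participants of one frame period into a single frame."""
    # Selective mixing: keep only the loudest `max_speakers` participants.
    ranked = sorted(frames, key=lambda name: rms(frames[name]), reverse=True)
    ranked = ranked[:max_speakers]
    if not ranked:
        return []

    length = min(len(frames[name]) for name in ranked)
    mixed = [0.0] * length
    for rank, name in enumerate(ranked):
        # Amplitude priority: the loudest participant gets the larger weight.
        weight = loudest_weight if rank == 0 else other_weight
        for i in range(length):
            mixed[i] += weight * frames[name][i]

    # Clipping avoidance: scale the whole frame down if its peak overflows.
    peak = max((abs(s) for s in mixed), default=0.0)
    gain = INT16_MAX / peak if peak > INT16_MAX else 1.0
    return [int(s * gain) for s in mixed]
```

Calling `mix_frames({"a": [30000, -30000], "b": [20000, -20000]})` sums to a peak above the 16-bit range, so the frame is scaled back to fit; a quiet frame passes through unchanged.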

Software-based serverless endpoint video combiner architecture for high-definition multiparty video conferencing
Journal of Network and Computer Applications, 2013
This paper proposes an endpoint video combiner architecture in a multipoint control unit (MCU) system for high definition multiparty video conferencing. The proposed architecture addresses the current reliability, computational and quality drawbacks of a conventional centralized video combiner architecture. This is achieved by redesigning the MCU video to move the video combiner away from the bridge and into the client endpoints. Moreover, the proposed architecture represents a serverless system and is able to scale to a large number of clients at high resolutions in a multipoint video conferencing session. In order to realize this design, this paper also proposes a custom robust sustainable session management protocol which allows dynamic multi-port management between the MCU video and client endpoints. In addition, the proposed custom session management protocol includes a recommendation for a session protection structure. Experimental results suggest that the proposed architecture exhibits significant computational frame rate performance gains of up to 762.95% in comparison with the conventional centralized video combiner architecture, based on a series of four and eight high definition combined video assessments. Moreover, reliability analysis suggests that the proposed architecture is also able to consistently sustain a high frame rate performance within a long duration high definition multipoint video conferencing session.

Active participant identification and tracking using depth sensing technology for video conferencing
2013 IEEE Conference on Open Systems (ICOS), 2013
Video conferencing represents an effective method of point-to-point or multipoint real-time communication between two or more participants. However, persistent manual adjustments of the video capture device to focus on an active participant represent a challenge, especially if the conference participant moves out of the video capture window. As such, this paper proposes an active participant identification and tracking system, which continuously tracks the active conference participant and automatically adjusts the video capture device to maintain focus on that participant. The proposed system first applies a Haar cascade face detection algorithm to register and store a set of facial images of the active participant. By leveraging the depth sensing technology of Microsoft Kinect, the system captures skeletal head position images of participants within the Kinect camera's view and compares them against the stored face images using the principal component analysis face recognition algorithm. The user recognized by the system is then continuously tracked as a skeletal object via a custom designed vertical and horizontal servo controlled motorized system. The custom motorized system sits under the Kinect sensor and is able to achieve 180 degrees in horizontal panning and 22.7 degrees in vertical tilting, in line with tracking the movement of the active conference participant.

A parallel bloom filter string searching algorithm on a many-core processor
2013 IEEE Conference on Open Systems (ICOS), 2013
This paper analyzes the underlying architecture of a serial Bloom filter string searching algorithm to identify the performance impact of this algorithm for large datasets. Then, a parallel multi-core driven Bloom filter algorithm using software application threads is studied as a benchmark. Experimental results suggest that for a set of 10 million strings, this algorithm exhibits speedups of up to 3.3× against a serial Bloom filter algorithm, when using an 8-logical processor multi-core architecture. To further improve the speedup, a many-core driven parallel Bloom filter algorithm is proposed using the Compute Unified Device Architecture (CUDA) parallel computing platform. The proposed algorithm segments the string list into blocks of words and threads in generating the bit table for the string searching process, which maximizes computational performance and sustains consistent string searching results. Experimental results suggest that the proposed algorithm extends the speedup to 5.5× against a serial Bloom filter algorithm, when using a 256-core CUDA graphics processing unit architecture.
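For readers unfamiliar with the data structure, the serial baseline mentioned above can be sketched as follows. This is a generic textbook Bloom filter, not the paper's implementation: the class name `BloomFilter`, the parameters `num_bits` and `num_hashes`, and the choice of SHA-256 with double hashing are all assumptions made for illustration. Inserting each word sets a few positions in a shared bit table, which is the step the abstract describes distributing across blocks and threads.

```python
import hashlib
from typing import Iterator


class BloomFilter:
    """Serial Bloom filter: a bit table probed at `num_hashes` positions per word."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        if num_bits <= 0 or num_hashes <= 0:
            raise ValueError("num_bits and num_hashes must be positive")
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)  # bit table, all zeros

    def _positions(self, word: str) -> Iterator[int]:
        # Double hashing (an assumption of this sketch): derive every probe
        # position from two 64-bit slices of a single SHA-256 digest.
        digest = hashlib.sha256(word.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force an odd step
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, word: str) -> None:
        """Set the bits for `word` in the bit table."""
        for pos in self._positions(word):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, word: str) -> bool:
        """False means definitely absent; True means possibly present."""
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(word)
        )
```

A word that was added is always reported as possibly present, while a word that was never added may occasionally be reported as present (a false positive), so a positive answer normally needs confirming against the original string list.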

Design and implementation of parallel video combiner architecture for multi-user video conferencing at ultra-high definition resolution
Multimedia Tools and Applications, 2014
In software driven multi-user video conferencing systems, consumer progression towards ultra-high definition (i.e., 8K) resolution introduces new challenges to the video combination process in sustaining smooth combined video playback at high frame rates. As such, this paper analyzes the underlying architecture of a conventional video combiner to identify the performance impact of combined video frame rates at ultra-high definition resolutions. Then, two straightforward parallel video combination architectures using software application threads, namely PVC-1 and PVC-2, are studied as a benchmark. In PVC-1, the number of application threads tallies with the number of client videos to be combined, which improves playback performance at the expense of inconsistencies in inset client frame rates within a combined video. PVC-2 includes a synchronizer to address these inconsistencies, but exhibits marginal performance gains. To address the aforementioned problems, a balanced workload parallel video combiner architecture is proposed, namely PVC-3. In this architecture, a balanced workload management algorithm stitches client videos into an ultra-high definition combined frame, based on the number of available logical processors on a multi-core processor. This method improves frame rate performance and sustains consistent client frame rates within a combined video. Experimental results suggest that PVC-3 is superior to PVC-2 and achieves a frame rate performance gain of 27% against a conventional video combiner for a combined video of 16 clients (each at a resolution of 720p) and zero standard deviation in combined frame rate consistency.
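The abstract does not spell out the balanced workload management algorithm, so the sketch below only illustrates the general idea of sizing the work by the number of logical processors rather than by the number of clients. It assumes, purely for illustration, that the combined frame is divided into horizontal bands of rows; the function name `split_rows` and this banding scheme are hypothetical and may differ from what PVC-3 actually does.

```python
import os
from typing import List, Tuple


def split_rows(total_rows: int, workers: int) -> List[Tuple[int, int]]:
    """Divide `total_rows` into `workers` contiguous (start, stop) bands
    whose sizes differ by at most one row."""
    if workers <= 0:
        raise ValueError("workers must be positive")
    base, extra = divmod(total_rows, workers)
    bands = []
    start = 0
    for i in range(workers):
        # The first `extra` bands absorb one leftover row each.
        stop = start + base + (1 if i < extra else 0)
        bands.append((start, stop))
        start = stop
    return bands


if __name__ == "__main__":
    # os.cpu_count() reports logical processors (it may return None).
    logical_processors = os.cpu_count() or 1
    # 4320 is the row count of an 8K (7680x4320) frame.
    for band in split_rows(4320, logical_processors):
        print(band)
```

With this kind of split, every worker handles a near-equal share of the combined frame regardless of how many client videos are inset, which is one plausible way to keep per-client frame rates consistent.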
