Exploring augmented live video streams for remote participation
2007, CHI '07 extended abstracts on Human factors in computing systems - CHI '07
https://doi.org/10.1145/1240866.1240915
6 pages
1 file
Abstract
Augmented video streams display information within the context of the physical environment. In contrast to Augmented Reality, they do not require special equipment, can support many users, and are location-independent. In this paper we explore the potential of augmented video streams for remote participation. We present our design considerations for remote participation user interfaces, briefly describe their development, and explain the design of three application scenarios: watching a pervasive game, observing the quality of a production process, and exploring interactive science exhibits. The paper also discusses how to produce high-quality augmented video streams and which information and control options are required for a viable remote participation interface.
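As a minimal illustration of the basic idea, the sketch below composites annotation graphics onto frames of a live camera feed before they would be encoded and streamed to remote viewers. It is not the authors' implementation; the OpenCV capture source, anchor position and label are placeholder assumptions.

```python
# A minimal sketch (not from the paper) of producing an augmented video stream:
# overlay graphics are composited onto each captured frame before the frame
# would be encoded and streamed to remote viewers.
import cv2

def annotate_frame(frame, label, anchor_px):
    """Draw a simple marker and text label at a known image position."""
    x, y = anchor_px
    cv2.circle(frame, (x, y), 8, (0, 0, 255), 2)            # highlight the object
    cv2.putText(frame, label, (x + 12, y - 12),
                cv2.FONT_HERSHEY_SIMPLEX, 0.6, (0, 0, 255), 2)
    return frame

cap = cv2.VideoCapture(0)                                    # live camera feed (index is a placeholder)
while True:
    ok, frame = cap.read()
    if not ok:
        break
    # In a real system the anchor would come from tracking or a scene model;
    # here it is a fixed, hypothetical position.
    frame = annotate_frame(frame, "exhibit A", (320, 240))
    cv2.imshow("augmented stream", frame)                    # stand-in for an encoder/streamer
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()
```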
Related papers
1996
In the future, the computer will be thought of more as an assistant than as a tool, and users will increasingly expect machines to make decisions on their behalf. As with a human assistant, a machine’s ability to make informed choices will often depend on the extent of its knowledge of activities in the world around it. Equipping personal computers with a large number of sensors for monitoring their environment is, however, expensive and inconvenient, and a preferable solution would involve a small number of input devices with a broad scope of application. Video cameras are ideally suited to many real-world monitoring applications for this reason. In addition, recent reductions in the manufacturing costs of simple cameras will soon make their widespread deployment in the home and office economically viable. The use of video as an input device also allows the creation of new types of user interface, more suitable in some circumstances than those afforded by the conventional keyboard and mouse. This thesis examines some examples of these ‘Video-Augmented Environments’ and related work, and then describes two applications in detail. The first, a ‘software cameraman’, uses the analysis of one video stream to control the display of another. The second, ‘BrightBoard’, allows a user to control a computer by making marks on a conventional whiteboard, thus ‘augmenting’ the board with many of the facilities common to electronic documents, including the ability to fax, save, print and email the image of the board. The techniques which were found to be useful in the construction of these applications are common to many systems which monitor real-world video, and so they were combined in a toolkit called ‘Vicar’. This provides an architecture for ‘video plumbing’, which allows standard video-processing components to be connected together under the control of a scripting language. It is a single application which can be programmed to create a variety of simple Video-Augmented Environments, such as those described above, without the need for any recompilation, and so should simplify the construction of such applications in the future. Finally, opportunities for further exploration on this theme are discussed.
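The 'video plumbing' idea can be illustrated with a small sketch: processing components are chained so that frames from one source flow through each stage in turn. This is only an assumed, simplified analogue in Python/OpenCV, not the Vicar toolkit or its scripting language; the component functions are hypothetical.

```python
# A minimal sketch in the spirit of 'video plumbing': standard processing
# components are chained, so one stage's output feeds the next. Component
# names and the pipeline configuration are hypothetical.
import cv2

def grayscale(frame):
    return cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)

def threshold(frame):
    _, out = cv2.threshold(frame, 127, 255, cv2.THRESH_BINARY)
    return out

def run_pipeline(source, components):
    """Pull frames from `source` and push each one through the component chain."""
    cap = cv2.VideoCapture(source)
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        for component in components:
            frame = component(frame)
        cv2.imshow("pipeline output", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
    cap.release()
    cv2.destroyAllWindows()

run_pipeline(0, [grayscale, threshold])   # e.g. a crude whiteboard-mark detector
```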
This is a paper accompanying (and describing) a demonstration. The intention of this demonstration is to present a novel immersive telepresence system that enables remote students to participate in seminars and lectures using online streaming video and audio connections. In this system, a virtualized video view is created using a 360° panoramic video projected onto a 180° curved projection screen (immersive shell). This recreates a more natural, human-like perception of real environments, thereby stimulating the learning process. 3D audio is also collected and reproduced at the remote location, adding to the realism. To accomplish this, we use a 360° mirror situated in the classroom together with a camera to transmit a panoramic image to the remote users, who reconstruct the original image by mapping from spherical to Cartesian coordinates. To process the audio we use a small array of microphones at the classroom end. In addition, we provide various tools to allow the participants to control their position within the virtualized views, thereby creating an innovative technology and user experience. We will be demonstrating this system at the conference.
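A rough sketch of the spherical-to-Cartesian step mentioned above: the donut-shaped image captured via a 360° mirror is unwarped into a rectangular panorama by sampling along polar coordinates. This is an assumed reconstruction using OpenCV, not the demonstrated system; the mirror centre, radii and file name are hypothetical.

```python
# A minimal sketch (not the authors' implementation) of unwarping a 360° mirror
# image into a panorama: every output pixel is looked up at the corresponding
# polar position (angle, radius) around the assumed mirror centre.
import numpy as np
import cv2

def unwrap_mirror(img, center, r_inner, r_outer, out_w=1440, out_h=360):
    cx, cy = center
    thetas = np.linspace(0, 2 * np.pi, out_w, endpoint=False)   # panorama columns = angles
    radii = np.linspace(r_inner, r_outer, out_h)                # panorama rows = radii
    theta_grid, r_grid = np.meshgrid(thetas, radii)
    map_x = (cx + r_grid * np.cos(theta_grid)).astype(np.float32)
    map_y = (cy + r_grid * np.sin(theta_grid)).astype(np.float32)
    return cv2.remap(img, map_x, map_y, cv2.INTER_LINEAR)

raw = cv2.imread("mirror_frame.png")                 # hypothetical captured frame
panorama = unwrap_mirror(raw, center=(960, 540), r_inner=100, r_outer=500)
cv2.imwrite("panorama.png", panorama)
```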
In this paper, we describe the overall design of the remote collaboration apparatus t-Room and present three applications: playback of a recorded scene using a hand controller, an elevator effect at scene change, and remote golf lessons. These applications are realized by the high controllability and flexibility of the t-Room system, and they can provide the user with a novel type of spatiotemporal experience.
2008
This paper presents a novel tangible user interface for the dynamic control and manipulation of multiple video streams. Streams can be activated, triggered and managed through the physical use of markers attached to cubes on a table top, whose movement is recognized by a camera placed above the working area. Markers can be associated with a single video stream as well as with a group of streams. Through the tangible interface, video streams can be played, paused, stopped, brought to the foreground, maximized or minimized, and the audio volume of each stream can be controlled. Streams can be hosted on different servers or generated dynamically at distributed nodes in real time. We claim that such an interface is intuitive and easy to use, and can be employed in several video-streaming application domains, such as teleconferencing, video-on-demand and video mixing. Results from an experimental assessment confirm the viability of our approach.
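To make the marker-driven control concrete, the sketch below detects fiducial markers in an overhead camera view and maps each marker, together with its position on the table, to a stream command. It assumes OpenCV's ArUco module (4.7+ API) rather than the paper's own marker system; the marker-to-stream mapping and the play/pause rule are invented for illustration.

```python
# A minimal sketch: fiducial markers seen by the overhead camera are mapped to
# video-stream commands. The mapping and the left/right = play/pause rule are
# hypothetical, not taken from the paper.
import cv2

MARKER_TO_STREAM = {0: "stream_a", 1: "stream_b"}    # marker id -> video stream (assumed)

detector = cv2.aruco.ArucoDetector(
    cv2.aruco.getPredefinedDictionary(cv2.aruco.DICT_4X4_50))

cap = cv2.VideoCapture(0)                             # camera above the table top
while True:
    ok, frame = cap.read()
    if not ok:
        break
    corners, ids, _ = detector.detectMarkers(frame)
    if ids is not None:
        for marker_id, quad in zip(ids.flatten(), corners):
            stream = MARKER_TO_STREAM.get(int(marker_id))
            if stream:
                x = float(quad[0][:, 0].mean())       # horizontal position of the cube
                action = "play" if x < frame.shape[1] / 2 else "pause"
                print(f"{stream}: {action}")
    cv2.imshow("table top", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break
cap.release()
cv2.destroyAllWindows()
```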
Proceedings of the …, 2011
The increasing number of media facades in urban spaces offers great potential for new forms of interaction, especially for collaborative multi-user scenarios. In this paper, we present a way to interact with them directly through live video on mobile devices. We extend the Touch Projector interface to accommodate multiple users by showing individual content on the mobile display that would otherwise clutter the facade's canvas or distract other users. To demonstrate our concept, we built two collaborative multi-user applications: (1) painting on the facade and (2) solving a 15-puzzle. We gathered informal feedback during the ARS Electronica Festival in Linz, Austria and found that our interaction technique is (1) considered easy to learn, but (2) may leave users unaware of the actions of others.
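The core mapping behind interacting "through live video" can be sketched as a homography from the phone's camera image to the facade canvas, so a touch on the video is translated into canvas coordinates. The sketch below is an assumption about how such a mapping could be computed with OpenCV, not the Touch Projector implementation; all coordinates are made up.

```python
# A minimal sketch: a touch on the phone's live camera image is transformed into
# facade-canvas coordinates via a homography estimated from four reference points.
import numpy as np
import cv2

# Corners of the facade as they appear in the phone's camera image (pixels) ...
image_pts = np.float32([[120, 80], [520, 95], [510, 420], [130, 400]])
# ... and the same corners in the facade's own canvas coordinates (both hypothetical).
canvas_pts = np.float32([[0, 0], [1920, 0], [1920, 1080], [0, 1080]])

H, _ = cv2.findHomography(image_pts, canvas_pts)

def touch_to_canvas(touch_xy):
    """Map a touch point in the camera image to a point on the facade canvas."""
    p = np.float32([[touch_xy]])                      # shape (1, 1, 2) for perspectiveTransform
    return cv2.perspectiveTransform(p, H)[0, 0]

print(touch_to_canvas((320, 240)))                    # e.g. where a paint stroke lands
```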
2003
We present the architecture, technology and experimental applications of a real-time, multi-site, interactive and collaborative environment called Distributed Immersive Performance (DIP). The objective of DIP is to develop the technology for live, interactive musical performances in which the participants (subsets of musicians, the conductor and the audience) are in different physical locations and are interconnected by very high fidelity multichannel audio and video links. DIP is a specific realization of broader immersive technology: the creation of the complete aural and visual ambience that places a person or a group of people in a virtual space where they can experience events occurring at a remote site or communicate naturally regardless of their location. The DIP experimental system has interaction sites and servers in different locations on the USC campus and at several partners, including the New World Symphony of Miami Beach, FL. The sites have different types of equipment to test the effects of video and audio fidelity on the ease of use and functionality for different applications. Many sites have high-definition (HD) video or digital video (DV) quality images projected onto wide-screen wall displays completely integrated with an immersive audio reproduction system for a seamless, fully three-dimensional aural environment with the correct spatial sound localization for participants. The system is capable of storage and playback of the many streams of synchronized audio and video data (immersidata), and utilizes novel protocols for the low-latency, seamless, synchronized real-time delivery of immersidata over local area networks and wide-area networks such as Internet2. We discuss several recent interactive experiments using the system and many technical challenges common to the DIP scenario and a broader range of applications.
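One small, hypothetical illustration of synchronized delivery: media units from several sites can be buffered briefly and released against a shared clock, so that corresponding audio and video are rendered together. This is not the DIP protocol, just a generic timestamp-driven playout-buffer sketch with an assumed fixed delay.

```python
# A minimal, generic sketch of timestamp-driven playout (not DIP's protocol):
# incoming units are held for a fixed delay and released in presentation order.
import heapq
import time

PLAYOUT_DELAY = 0.100        # seconds of buffering to absorb network jitter (assumed)

class PlayoutBuffer:
    def __init__(self):
        self._queue = []     # (presentation_time, stream_id, payload)

    def push(self, capture_time, stream_id, payload):
        heapq.heappush(self._queue, (capture_time + PLAYOUT_DELAY, stream_id, payload))

    def pop_due(self, now):
        """Return all units whose presentation time has arrived."""
        due = []
        while self._queue and self._queue[0][0] <= now:
            due.append(heapq.heappop(self._queue))
        return due

buf = PlayoutBuffer()
t0 = time.monotonic()
buf.push(t0, "audio", b"...")          # units captured at the same instant ...
buf.push(t0, "video", b"...")
time.sleep(0.12)
print(buf.pop_due(time.monotonic()))   # ... are released together for rendering
```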
Proceedings of the eleventh ACM international conference on Multimedia - MULTIMEDIA '03, 2003
We present a system that allows remote and local participants to control devices in a meeting environment using mouse- or pen-based gestures "through" video windows. Unlike state-of-the-art device control interfaces that require interaction with text commands, buttons, or other artificial symbols, our approach allows users to interact with devices through live video of the environment. This naturally extends our video-supported pan/tilt/zoom (PTZ) camera control system, by allowing gestures in video windows to control not only PTZ cameras, but also other devices visible in video images. For example, an authorized meeting participant can show a presentation on a screen by dragging a file from a personal laptop and dropping it on the video image of the presentation screen. This paper presents the system architecture, implementation tradeoffs, and various meeting control scenarios.
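The "through video" interaction can be sketched as simple hit-testing: the point where a file is dropped on the video window is checked against known device regions in the camera image, and the matching device receives the command. The device regions, names and actions below are hypothetical, not taken from the paper.

```python
# A minimal sketch: hit-test a drop location in the video window against known
# device regions and dispatch a command to the matching device. All regions,
# device names and actions are hypothetical.
DEVICE_REGIONS = {
    "presentation_screen": (400, 50, 620, 210),   # (x1, y1, x2, y2) in video pixels
    "printer":             (40, 300, 140, 420),
}

def device_at(point):
    """Return the device whose on-screen region contains the dropped point."""
    x, y = point
    for device, (x1, y1, x2, y2) in DEVICE_REGIONS.items():
        if x1 <= x <= x2 and y1 <= y <= y2:
            return device
    return None

def handle_drop(file_path, drop_point):
    device = device_at(drop_point)
    if device == "presentation_screen":
        print(f"show {file_path} on the presentation screen")
    elif device == "printer":
        print(f"print {file_path}")
    else:
        print("drop ignored: no device at that position")

handle_drop("slides.pdf", (500, 120))
```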
Abstract presented at ELFI 2019, European Light Field Imaging Workshop, Borovets, Bulgaria, 2019
Communication is the most useful tool to impart knowledge, understand ideas, clarify thoughts and expressions, and organize, plan and manage every day-to-day activity. Although there are different modes of communication, physical barriers always affect the clarity of the message due to the absence of body language and facial expressions. These barriers are overcome by video calling, which is technically the most advanced mode of communication at present. The proposed work concentrates on the concept of video calling in a more natural and seamless way using Augmented Reality (AR). AR can be helpful in giving the users an experience of physical presence in each other’s environment. Our work provides an entirely new platform for video calling, wherein the users can enjoy the privilege of their own virtual space to interact with the individual’s environment. Moreover, there is no limitation of sharing the same screen space: any number of participants can be accommodated in a single conference without having to compromise the screen size.
2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), 2013
This paper describes an AR system for remote collaboration using a captured 3D model of the local user's scene. In the system a remote user can manipulate the scene independently of the view of the local user and add AR annotations that appear projected into the real world. Results from a pilot study and the design of a further full study are presented.
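As a rough illustration of how a remote user's 3D annotation could end up "projected into the real world", the sketch below projects a 3D point into the local camera view with a pinhole model. The intrinsics, pose and annotation position are assumed values; the paper's actual pipeline is not described here.

```python
# A minimal sketch (not the paper's pipeline): project a 3D annotation placed in
# the captured scene into the local user's camera image using a pinhole model.
import numpy as np
import cv2

K = np.array([[800.0, 0, 320], [0, 800.0, 240], [0, 0, 1]])   # assumed camera intrinsics
rvec = np.zeros(3)                                             # assumed camera rotation (world = camera)
tvec = np.zeros(3)                                             # assumed camera translation
dist = np.zeros(5)                                             # no lens distortion

annotation_world = np.float32([[0.1, -0.05, 1.0]])             # hypothetical 3D point (metres)

img_pts, _ = cv2.projectPoints(annotation_world, rvec, tvec, K, dist)
u, v = img_pts[0, 0]
print(f"draw annotation at pixel ({u:.1f}, {v:.1f})")          # overlay position in the AR view
```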
ACM International Conference on Interactive Media Experiences
Figure 1: TeleFest, our system for livestreaming events using tailored 360° content edited in real time by a producer. (Left): A real live performance streamed live to almost 2,000 people using TeleFest. Three 360° cameras were placed among the stage and crowd, and their feeds were livestreamed to YouTube with augmented 3D virtual content to enhance the remote viewing experience. (Right): The resulting livestream.
