3rd ACM International Conference on Educational Technology, Language and Technical Communication, 2021
When seated users of multimodal augmented reality (AR) systems attempt to navigate unfamiliar environments, they can become disoriented during their initial travel through a remote environment that is displayed for them via that AR display technology. Even when the multimodal displays provide mutually coherent visual, auditory, and vestibular cues to the movement of seated users through a remote environment (such as a maze), those users may make errors in judging their own orientation and position relative to their starting point, and also may have difficulty determining what moves to make in order to return themselves to their starting point. In a number of investigations using multimodal AR systems featuring realtime servocontrolled movement of seated users, the relative contribution of spatial auditory display technology was examined across a variety of spatial navigation scenarios. The results of those investigations have implications for the effective use of the auditory component of a multimodal AR system in applications supporting spatial navigation through a physical environment.
Using motion capture for real-time augmented reality scenes
Proceedings of the 13th International Conference on Humans and Computers, Dec 8, 2010
Because computer performance is continually improving, we can display complicated 3D models, and it is possible to easily add information to a computer such as a mobile telephone. Core technologies of augmented reality are position recognition, image recognition, and orientation recognition. In the past, high-performance machines were needed to process these. Using modern hardware, we can process such data on mobile phones as well as notebook computers. Recently, Sekai Camera became famous as augmented reality ...
Proc. HC2006: Ninth Int. Conf. on Human and Computer, 2006
This article describes the force display and dual steering in the Networked Driving Simulator, which can share a virtual space using the CVE (Collaborative Virtual Environments), our groupware. The Networked Driving Simulator has two types of force display, the Schaire and the Force Feed-back Wheel (ffbw). The Schaire (Share-Chair), a rotary motion platform being developed by our Spatial Media Group, is rotated with a servomotor, with the rotation angle controlled via the internet through the CVE. The ffbw displays force when a vehicle in the Networked Driving Simulator collides. The technique for feeding back the effect employs programs using C++ and DirectInput, and separates the executable file, called ForceManager, from the Networked Driving Simulator, which is implemented with Java3D. The pattern of the effect is changed by the arguments of ForceManager. Moreover, our Networked Driving Simulator can select the vehicle driven, such as a normal car or a long-style fire truck that has driver seats in front and in the rear (tiller), with independent front and rear wheels. The dual steering is useful for turning narrow corners rapidly and smoothly in emergencies.
Proceedings of 1993 2nd IEEE International Workshop on Robot and Human Communication
Augmented audio reality consists of hybrid presentations in which computer-generated sounds are overlaid on top of more directly acquired audio signals. We are exploring the alignability of binaural signals with artificially spatialized sources, synthesized by convolving monaural signals with left/right pairs of directional transfer functions. We use MAW (multidimensional audio windows), a NeXT-based system, as a binaural directional mixing console. Since the rearrangement of a dynamic map is used to dynamically select ...
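The synthesis step named in the abstract above (convolving a monaural signal with a left/right pair of directional transfer functions) can be sketched as follows. This is only an illustration in plain Python, not the MAW implementation: the function names and the toy impulse responses are hypothetical, and a real system would use measured responses and a fast convolution routine.

```python
def convolve(signal, impulse_response):
    """Direct-form linear convolution of two sample sequences."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def spatialize(mono, hrir_left, hrir_right):
    """Return (left, right) channels for one directional source.

    hrir_left and hrir_right are the time-domain impulse responses of the
    left-ear and right-ear directional transfer functions for one direction.
    """
    return convolve(mono, hrir_left), convolve(mono, hrir_right)


# Hypothetical toy responses for a source on the listener's right:
# the right ear hears it at once, the left ear later and quieter.
HRIR_RIGHT = [1.0, 0.0, 0.0]
HRIR_LEFT = [0.0, 0.0, 0.5]

mono = [1.0, 0.5, 0.25]
left, right = spatialize(mono, HRIR_LEFT, HRIR_RIGHT)
```

With these toy responses the left channel is a delayed, attenuated copy of the right channel, which is the kind of interaural difference that a pair of directional transfer functions encodes.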
Integration of laptop sudden motion sensor as accelerometric control for virtual environments
Proceedings of The 7th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and Its Applications in Industry, 2008
We have developed a gestural controller for a multimodal client suite using a sudden motion sensor (SMS) deployed with many modern laptop computers. Interpreted commands inferred from the SMS accelerometer can be used to adjust the position (orientation and location) of egocentric perspectives and exocentric avatars to control panoramic browsing and spatialized sound, adjusting the lateralization, directionalization, and spatialization of musical and audio channels.
The Fifth International Conference on Computer and Information Technology (CIT'05), 2005
This article describes research on a mobile interface that allows remote control of electric appliances and avatars rendered on a workstation or personal computer to be synchronized with a mobile phone's ringtone. The interface, implemented with DoJa, one of the profiles of J2ME (Java 2, Micro Edition), controls electric appliances via a "LANtap," an internet-connected power supply, and controls avatars by changing position parameters and attributes on a mobile phone. In a mobile phone network, delay of transmission frequently occurs because of the interference of radio waves. To synchronize appliances and avatars with a ringtone, our interface sends parameters anticipating the network delay. When a short delay occurs, the interface adjusts the time lag between the avatar and the ringtone automatically; when a long delay occurs, it can be adjusted by pushing the mobile phone's keys. We used a 2.5G (2.5th generation) mobile phone for this research. The interface was able to synchronize networked avatars with a slow tempo ringtone for a short time, but the connection speed was not fast enough for quick real-time synchronization. As a proof-of-concept, we deployed such phase-locked musical animation roomware with 3D avatars in a virtual space enhanced by entertainment effects displayed in a real space enhanced by some similar effects.
Multipresence allows each human user, in, for example, a distributed groupware system like the internet, to have source and sink presence in multiple spaces at once, by designating an arbitrary number of representatives as "self." Operations which filter information streams by blocking sources or sinks are useful for distributing and focusing attention and presence. Such "narrowcasting" operations can be modeled as an extended "media mixer": source operations apply to first-order mixing; sink operations apply to second-order mixing. The value of such a multipresent interface, articulated with narrowcasting operations, can be characterized as exponential in the number of users, the power set of the number of coteries. Audio windows are, in analogy to graphical windowing systems, an idiom for managing multiple duplex streams of information, as in a teleconference, chatspace, or virtual concert. We are developing audio windowing interfaces to groupware and VR-style applications, deployed on both workstations and mobile devices, and present a multipresence-enabled audio windows media mixer.
Rhythm of music animating virtual environment models
Proceedings of the 2012 Joint International Conference on Human-Centered Computer Environments - HCCE '12, 2012
Alice is an innovative 3D programming environment that makes it easy to create an animation. Many virtual environment (VE) models are available in the Alice 3D environment. We created VE scenes using the Alice 3D IDE (integrated development environment). We deploy a beat detector to detect the rhythm of a song, based on Pd (Pure Data, a free dataflow programming environment similar to Max/MSP). It can extract the beat of a song in realtime. Using our Alice–CVE (Collaborative Virtual Environment) Bridge and Pd–CVE ...
Presence: Teleoperators and Virtual Environments, 2000
Non-immersive perspectives in virtual environments enable flexible paradigms of perception, especially in the context of frames of reference for conferencing and musical audition. Traditional mixing idioms for enabling and disabling various audio sources employ mute and solo functions, which, along with cue, selectively disable or focus on respective channels. Exocentric interfaces which explicitly model not only sources but also sinks motivate the generalization of mute and solo (or cue) to exclude and include, manifested for sinks as deafen and attend (confide and harken). Such functions, which narrow stimuli by explicitly blocking out and/or concentrating on selected entities, can be applied not only to other users' sinks for privacy, but also to one's own sinks for selective attendance or presence. Multiple sinks are useful in groupware, where a common environment implies social inhibitions to rearranging shared sources like musical voices or conferees, as well as indivi...
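The exclude/include generalization described in the abstract above can be illustrated with a small Python sketch. It follows conventional mixing-console behavior (if anything is soloed, only soloed channels pass; muted channels never pass) and applies the same rule to sinks via deafen and attend. The function names are hypothetical, and the choice that exclusion wins over inclusion is an assumption of this sketch, not something stated in the abstract.

```python
def is_active(name, excluded, included):
    """Return True if the named source or sink takes part in the mix.

    excluded: names that are muted (sources) or deafened (sinks).
    included: names that are soloed (sources) or attended (sinks).
    Assumption of this sketch: exclusion wins over inclusion.
    """
    if name in excluded:
        return False
    # With no explicit inclusions everything not excluded is active;
    # otherwise only the explicitly included names are.
    return not included or name in included


def audible_pairs(sources, sinks, muted, soloed, deafened, attended):
    """List the (source, sink) pairs along which sound is delivered."""
    return [
        (src, snk)
        for src in sources
        if is_active(src, muted, soloed)
        for snk in sinks
        if is_active(snk, deafened, attended)
    ]


# Hypothetical session: one muted source, one attended sink.
pairs = audible_pairs(
    sources=["alice", "bob"],
    sinks=["ear1", "ear2"],
    muted={"bob"},
    soloed=set(),
    deafened=set(),
    attended={"ear2"},
)
```

Treating sources and sinks with one predicate reflects the symmetry the abstract points to: mute and solo narrow what is emitted, while deafen and attend narrow what is received.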
The Journal of the Acoustical Society of America, 1999
Shared virtual environments, especially those supporting spatial sound, require generalized control of user‐dependent media streams. Nonimmersive perspectives in virtual environments enable fluid paradigms of perception, especially in the context of frames‐of‐reference for conferencing and musical audition. Traditional mixing idioms for enabling and disabling various sources employ mute and solo functions, which, along with cue, selectively disable or focus on respective channels. Exocentric interfaces which explicitly model not ...
International Journal of Man-Machine Studies, 1991
This paper proposes an organization of presentation and control that implements a flexible audio management system we call “audio windows”. The result is a new user interface integrating an enhanced spatial sound presentation system, an audio emphasis system, and a gestural input recognition system. We have implemented these ideas in a modest prototype, also described, designed as an audio server appropriate for a teleconferencing system. Our system combines a gestural front end (currently based on a ...
Diffusion curves are a new kind of primitive in vector graphics, capable of representing smooth color transitions among boundaries. Their rendering requires solving Poisson's equation; much previous research relied on traditional solvers, which commonly require GPU acceleration to achieve real-time rasterization. This obviously restricts deployment on the Internet, for example as rich Internet applications, in which various computing environments are involved. Diffusion effects are similar to locally defined interpolation with a particular orientation and magnitude. Inspired by that observation, a mesh-based framework combined with mean value coordinates (MVC) interpolants efficiently renders diffusion curve images on a CPU. This method employs a visibility algorithm to efficiently find and sort neighboring curve nodes for each vertex. It then assigns the vertex colors according to MVC interpolation with the neighboring curve nodes. Experiments produced rendering results comparable to traditional solvers, but this method is computationally more efficient and runs much faster on a CPU.
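The MVC interpolation step mentioned in the abstract above can be illustrated with a short Python sketch that computes mean value coordinates for a point strictly inside a closed polygon and blends per-vertex values with them. This is not the renderer described in the abstract: the mesh construction and the visibility search for neighboring curve nodes are omitted, the function names are hypothetical, and the sketch assumes that the surrounding nodes are already sorted counter-clockwise around the query point.

```python
import math


def mean_value_weights(p, polygon):
    """Normalized mean value coordinates of point p with respect to polygon.

    Assumes p lies strictly inside a simple polygon whose vertices are
    listed in counter-clockwise order (an assumption of this sketch).
    """
    n = len(polygon)
    offsets = [(vx - p[0], vy - p[1]) for vx, vy in polygon]
    dists = [math.hypot(dx, dy) for dx, dy in offsets]
    # half_tans[i] = tan(alpha_i / 2), where alpha_i is the angle at p
    # between vertex i and vertex i + 1.
    half_tans = []
    for i in range(n):
        ax, ay = offsets[i]
        bx, by = offsets[(i + 1) % n]
        angle = math.atan2(ax * by - ay * bx, ax * bx + ay * by)
        half_tans.append(math.tan(angle / 2.0))
    # Weight of vertex i combines the two angles adjacent to it.
    raw = [(half_tans[i - 1] + half_tans[i]) / dists[i] for i in range(n)]
    total = sum(raw)
    return [w / total for w in raw]


def interpolate(p, polygon, values):
    """Blend one scalar value per vertex (e.g. a color channel) at p."""
    weights = mean_value_weights(p, polygon)
    return sum(w * v for w, v in zip(weights, values))


# Hypothetical unit square with one red-channel value per corner.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
red = [0.0, 1.0, 1.0, 0.0]
center_red = interpolate((0.5, 0.5), square, red)
```

At the center of the square the four weights are equal by symmetry, so the blended value is the average of the corner values. Because the weights depend only on local angles and distances, each vertex can be colored independently on a CPU, which fits the efficiency argument made in the abstract.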
Alternative non-immersive perspectives enable new paradigms of perception, especially in the context of frames-of-reference for musical audition and groupware. MAW (acronymic for multidimensional audio windows) is an application for manipulating sound sources and sinks in virtual rooms, featuring an exocentric graphical interface driving an egocentric audio backend. Listening to sound presented in such a spatial fashion is as different from conventional stereo mixes as sculpture is from painting.
ICAT 18th International Conference on Artificial Reality and Telexistence, 2008
We present a gestural interpretation interface that optically senses hand positions and motions. These postures and gestures are recognized and processed, closing the interaction loop by controlling computer media (such as Apple TV or Google Earth) as well as physical devices such as domestic appliances, including adjusting curtains, changing television channel and volume, turning lights and fans on and off, and piloting a cleaning robot. Keywords: ambient information systems, calm technology, computer vision, ...
A Case Study of VoIP Performance Across Different Networks
Proc. ICECE: 3rd Int. Conf. on Electrical & Computer Engineering (Dhaka), Dec 1, 2004
SIP, the Session Initiation Protocol, is a signaling protocol for internet conferencing, telephony, presence applications, event notification, and instant messaging. This article reports observed VoIP traffic when clients are in a LAN, a WAN, and in a hybrid network with an ADSL link. The deployed architecture consists of a SIP server (with registrar, proxy, and presence functions) and two SIP clients. Using a traffic analyzer, we monitored the SIP messages as well as the real-time quality, measured in the form of delay, packet ...
Papers by Michael Cohen