The Speech + Mobility group uses speech technologies and portable devices to enhance human communication and make digitized audio more useful as a data type. Our focus is on developing novel applications, user interfaces, and services to exploit computer speech processing for interacting with and through computers far removed from keyboards and monitors.
Research Projects
4chan and /b/: Anonymity and Ephemerality
Michael S. Bernstein, Andrés Monroy-Hernández, Drew Harry, Paul André, Katrina Panovich and Greg VargasMany of our online interactions take place in community spaces. We keep track of friends and share pictures on Facebook, chatter with friends on Twitter, and participate in discussions on online forums. But how do the design choices we make impact the kinds of social spaces that develop? To better understand this relationship, we conducted a study of a discussion forum with a particularly unusual design: 4chan.org. Perhaps best known for its role in driving Internet culture and its involvement with the "Anonymous" group, we believe 4chan's design plays a large role in its success, despite its counter-intuitiveness. In our first paper exploring this area, we quantify 4chan's ephemerality (there are no archives; most posts are deleted in a matter of minutes) and anonymity (there are no traditional user accounts, and most posts are fully anonymous) and discuss how the community adapts to these unusual design strategies.
Back Talk
Chris Schmandt and Andrea ColacoThe living room is the heart of social and communal interactions in a home. Often present in this space is a screen: the television. When in use, this communal gathering space brings together people and their interests, and their varying needs for company, devices, and content. This project focuses on using personal devices such as mobile phones with the television; the phone serves as a controller and social interface by offering a channel to convey engagement, laughter, and viewer comments, and to create remote co-presence.
ByConsens.us
Chris Schmandt and Charlie DeTarBringing deliberative process and consensus decision making to the 21st century, asynchronous case! A practical, enhanced mailing list and CC bot that allows intelligent extraction of contextual information from a discussion to support decision making and deliberation.
Flickr This
Chris Schmandt and Dori LinInspired by the fact that people are communicating more and more through technology, Flickr This explores ways for people to have emotion-rich conversations through all kinds of media provided by people and technology—a way for technology to allow remote people to have conversations more like face-to-face experiences by grounding them in shared media. Flickr This lets viewable contents provide structure for a conversation; with a grounding on the viewable contents, conversation can move between synchronous and asynchronous, and evolve into a richer collaborative conversation/media.
frontdesk
Chris Schmandt and Andrea ColacoCalling a person versus calling a place has quite distinctive affordances. With the arrival of mobile phones, the concept of calling has moved from calling a place to calling a person. Frontdesk proposes a place-based communication tool that is accessed primarily through any mobile device and features voice calls and text chat. The application uses “place” loosely to define a physical space created by a group of people that have a shared context of that place. Examples of places could be different parts of a workspace in a physical building, such as the machine shop, café, or Speech + Mobility group area at the Media Lab. When a user calls any of these places, frontdesk routes their call to all people that are “checked-in” to the place.
Going My Way
Chris Schmandt and Jaewoo ChungWhen friends give directions, they often don't describe the whole route, but instead provide landmarks along the way which with they think we'll be familiar. Friends can assume we have certain knowledge because they know our likes and dislikes. Going My Way attempts to mimic a friend by learning about where you travel, identifying the areas that are close to the desired destination from your frequent path, and picking a set of landmarks to allow you to choose a familiar one. When you select one of the provided landmarks, Going My Way will provide directions from it to the destination.
Guiding Light
Chris Schmandt, Jaewoo Chung, Ig-Jae Kim and Kuang XuGuiding Light is a navigation-based application that provides directions by projecting them onto physical spaces both indoors and outdoors. It enables a user to get relevant spatial information by using a mini projector in a cell phone. The core metaphor involved in this design is that of a flashlight which reveals objects in and information about the space it illuminates. For indoor navigation, Guiding Light uses a combination of e-compass, accelerometer, proximity sensors, and tags to place information appropriately. In contrast to existing heads-up displays that push information into the user's field of view, Guiding Light works on a pull principle, relying entirely on users' requests and control of information.
Indoor Location Sensing Using Geo-Magnetism
Chris Schmandt, Jaewoo Chung and MatthewWe present an indoor positioning system that measures location using disturbances of the Earth's magnetic field by structural steel elements in a building. The presence of these large steel members warps the geomagnetic field such that lines of magnetic force are locally not parallel. We measure the divergence of the lines of the magnetic force field using e-compass parts with slight physical offsets; these measurements are used to create local position signatures for later comparison with values in the same sensors at a location to be measured. We demonstrate accuracy within one meter 88% of the time in experiments in two buildings and across multiple floors within the buildings.
Life as a Puzzle
Chris Schmandt, Sinchan Banerjee and Drew Erikson HarryHow can one understand and visualize the lifestyle of a person on the other side of the world? Life as a Puzzle attempts to tackle this question by designing a picture puzzle that users collaboratively solve with pictures from their lifestyles.
LocoRadio
Chris Schmandt and Wu-Hsi LiLocoRadio is a mobile, augmented-reality, audio browsing system that immerses you within a soundscape as you move. To enhance the browsing experience in high-density spatialized audio environments, we introduce a UI feature, "auditory spatial scaling," which enables users to continuously adjust the spatial density of perceived sounds. The audio will come from a custom, geo-tagged audio database. The current demo uses iconic music to represent restaurants. As users move in the city, they encounter a series of music and the perception enhances their awareness of the numbers, styles, and locations of nearby restaurants.
Merry Miser
Chris Schmandt and Charlie DeTarMerry Miser is a mobile application which persuades people to spend less money, and think more about their spending. By combining users' real financial transaction information, their location, and personal assessments of spending, the application presents deeply personalized and compelling interventions at the time and place when they are near an opportunity to shop. The interventions help to reinforce choices that are in the users' better long-term self interest, against short-term impulses.
Musicpainter
Chris Schmandt, Barry Vercoe and Wu-Hsi LiMusicpainter is a networked, graphical composing environment that encourages sharing and collaboration within the composing process. It provides a social environment where users can gather and learn from each other. The approach is based on sharing and managing music creation in small and large scales. At the small scale, users are encouraged to begin composing by conceiving small musical ideas, such as melodic or rhythmic fragments, all of which are collected and made available to all users as a shared composing resource. The collection provides a dynamic source of composing material that is inspiring and reusable. At the large scale, users can access full compositions that are shared as open projects. Users can listen to and change any piece. The system generates an attribution list on the edited piece, allowing users to trace how it evolves in the environment.
OnTheRun
Matthew Joseph DonahoeOnTheRun is a location-based exercise game designed for the iPhone. The player assumes the role of a fugitive trying to gather clues to clear his name. The game is played outdoors while running, and the game creates missions that are tailored to the player's neighborhood and running ability. The game is primarily an audio experience, and gameplay involves following turn-by-turn directions, outrunning virtual enemies, and reaching destinations.
Radio-ish Media Player
Chris Schmandt, Barry Vercoe and Wu-Hsi LiHow many decisions does it take before you hear a desired piece of music on your iPod? First, you are asked to pick a genre, then an artist, then an album, and finally a song. The more songs you own, the tougher the choices are. To resolve the issues, we turn the modern music player into an old analog radio tuner, the Radio-ish Media Player. No LCDs, no favorite channels, all you have is a knob that will help you surf through channel after channel accompanied by synthesized noise. Radio-ish is our attempt to revive the lost art of channel surfing in the old analog radio tuner. Let music find you: your ears will tell you if the music is right. This project is not only a retrospective design, but also our reflection on lost simplicity in the process of digitalization. A mobile phone version is also available for demo.
ROAR
Chris Schmandt and Drew HarryThe experience of being in a crowd is visceral. We feel a sense of connection and belonging through shared experiences like watching a sporting event, speech, or performance. In online environments, though, we are often part of a crowd without feeling it. ROAR is designed to allow very large groups of distributed spectators have meaningful conversations with strangers or friends while creating a sense of presence of thousands of other spectators. ROAR is also interested in creating opportunities for collective action among spectators and providing flexible ways to share content among very large groups. These systems combine to let you feel the roar of the crowd even if you're alone in your bedroom.
SeeIt-ShareIt
Chris Schmandt, Andrea ColacoNow that mobile phones are starting to have 3D display and capture capabilities, there are opportunities to enable new applications that enhance person-person communication or person-object interaction. This project explores one such application: acquiring 3D models of objects using cell phones with stereo cameras. Such models could serve as shared objects that ground communication in virtual environments and mirrored worlds or in mobile augmented reality applications.
TagSpot
Chris Schmandt and Misha SraWe are what we eat, but we are also what we do and where we go. With this app, when you visit a spot you can tag it as healthy, sporty, artsy, or fun and the spot tags you back with the the highest rated tag for that spot. Your personal tags evolve as the spots you visit evolve. Don't like that a spot tagged you +1 Healthy? Explore newer spots to get different tags! Follow tag trails or create new ones, find new places, and learn about yourself and your city!
tailor
Chris Schmandt, Drew Harry, and Mark FayngershIn online environments, we often rely on some combination of pictures and pseudonyms to show others what kind of person we are. In more anonymous contexts, we tend to rely on non-representational images while on services like Facebook, identifiable pictures are the norm. In this project, we seek to turn the iconic profile picture into something more active and dynamic. With tailor, you can easily apply dynamic graphical effects and data-driven information to an existing profile picture on Twitter. Tailor will automatically update your profile picture based on these overlays. These overlays might include your current location, number of followers, or visualizations of your favorite team's recent record. Tailor will also support scenarios like automatically updating your picture when you upload pictures to Flickr, visually aging pictures the longer you have them, morphing between different pictures, or programmatically generating interesting abstract profile pictures.
Tin Can
Chris Schmandt, Matthew Donahoe and Drew HarryDistributed meetings present a set of interesting challenges to staying engaged and involved. Because one person speaks at a time, it is easy (particularly for remote participants) to disengage from the meeting undetected. However, non-speaking roles in a meeting can be just as important as speaking ones, and if we could give non-speaking participants ways to participate, we could help support better-run meetings of all kinds. Tin Can collects background tasks like taking notes, managing the agenda, sharing relevant content, and tracking to-dos in a distributed interface that uses meeting participants' phones and laptops as input devices, and represents current meeting activities on an iPad in the center of the table in each meeting location. By publicly representing these background processes, we provide meeting attendees with new ways to participate and be recognized for their non-verbal participation.
Tin Can Classroom
Chris Schmandt, Drew Harry and Eric Gordon (Emerson College)Classroom discussions may not seem like an environment that needs a new kind of supporting technology. But we've found that augmenting classroom discussions with an iPad-based environment to help promote discussion, keep track of current and future discussion topics, and create a shared record of class keeps students engaged and involved with discussion topics, and helps restart the discussion when conversation lags. Contrary to what you might expect, having another discussion venue doesn't seem to add to student distraction; rather it tends to focus distracted students on this backchannel discussion. For the instructor, our system offers powerful insights into the engagement and interests of students who tend to speak less in class, which in turn can empower less-active students to contribute in a venue in which they feel more comfortable.