Please participate in our Dialogue Challenge wiki
Let's Go Lab: Evaluation of Spoken Dialog Techniques with Real Users

Let's Go Lab is a research project on speech interfaces and dialog systems conducted by researchers from the Language Technologies Institute at Carnegie Mellon University's School of Computer Science. Funding for Let's Go Lab comes from the U.S. National Science Foundation.

For additional information contact Maxine Eskenazi (max(at)cs(dot)cmu(dot)edu) or Alan W Black.


People | Publications | Databases | Media Coverage | Acknowledgments

System statistics (December 1st, 2008 -January 23, 2009 )

Total Number of calls from March 2005 to March 2009:70,731
Total Number of calls in the period:2,072
Average number of user turns per dialogue:13.04
Number of complete dialogues (with database access):63%

* A significant amount of calls contain no speech directed to the system (e.g. the user hangs up before saying anything). The number of calls with 4 user turns or more is a simple estimate of the number of valid dialogues.

Project overview

Although many recent systems have achieved effective spoken interaction with a computer, they tend to target the "average" portions of the population, those whose speech and hearing fall within the norm. Systems capable of finding and presenting useful information for the average user have been developed, but these systems cannot be used by everyone. People who are, for some reason, considered to be outliers in the general population cannot yet access the information these systems provide. This project is working on providing a system that can be used by everyone, including the extremes found in the general public.

The Let's Go! project is working in the domain of bus information, providing information such as schedules and route information for the Port Authority of Allegheny County buses in and around Pittsburgh. You can interact with a version of this system right now by calling 412-268-3526 (requires some knowledge of Pittsburgh's transit system).

This system is connected to the Port Authority's main phone line during non-business hours (M-F 7pm-6am, Weekends/Holidays 6pm-8am), providing schedule information over the phone to Allegheny County bus riders when they would otherwise be unable to get it.

An example interaction with this system is also available.

People

Publications

  • Raux, A., Eskenazi, M. (2008) Optimizing Endpointing Thresholds using Dialogue Features in a Spoken Dialogue System, , SIGdial 2008, Columbus, OH, USA..( pdf )
  • Raux, A., Eskenazi, M. (2007) A Multi-Layer Architecture for Semi-Synchronous Event-Driven Dialogue Management, ASRU 2007, Kyoto, Japan.( pdf )
  • Ai, H., Raux, A., Bohus, D., Eskenazi, M., and Litman, D. (2007) Comparing Spoken Dialog Corpora Collected with Recruited Subjects versus Real Users, 8th SIGDial Workshop on Discourse and Dialogue, Antwerp, Belgium. ( pdf )
  • Bohus, D., Raux, A., Harris, T., Eskenazi, M., Rudnicky, A. (2007) Olympus: an open-source framework for conversational spoken language interface research, HLT-NAACL 2007 workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technology, Rochester, NY, USA. ( pdf )
  • Bohus, D., Langner, B., Raux, A., Black, A., Eskenazi, M., Rudnicky, A. (2006) Online Supervised Learning of Non-understanding Recovery Policies, SLT 2006, Aruba, Aruba.( pdf )
  • Raux, A., Bohus, D., Langner, B., Black A., Eskenazi, M. (2006) Doing Research on a Deployed Spoken Dialogue System: One Year of Let's Go! Experience, Interspeech 2006, Pittsburgh, USA. ( pdf )
  • Langner, B., Black, A. (2005) Using Speech in Noise to Improve Understandability for Elderly Listeners, ASRU 2005, San Juan, Puerto Rico. ( pdf )
  • Raux, A., Langner, B., Black, A., Eskenazi, M. (2005) Let's Go Public! Taking a Spoken Dialog System to the Real World, Interspeech 2005 (Eurospeech), Lisbon, Portugal. ( pdf )
  • Langner, B., Black, A. (2005) Improving the Understandability of Speech Synthesis by Modeling Speech In Noise, ICASSP 2005, Philadelphia, USA ( pdf )
  • Langner, B., Black, A. (2004) An Examination of Speech In Noise and its Effect on Understandability for Natural and Synthetic Speech, Carnegie Mellon University, Language Technologies Institute, Technical Report CMU-LTI-04-187. ( pdf )
  • Raux, A., Automated Lexical Adaptation and Speaker Clustering based on Pronunciation Habits for Non-Native Speech Recognition, INTERSPEECH (ICSLP) 2004, Jeju Island, Korea. ( pdf )
  • Raux, A. and Singh, R. (2004) Maximum Likelihood Adaptation of Semi-Continuous HMMs by Latent Variable Decomposition of State Distributions, INTERSPEECH (ICSLP) 2004, Jeju Island, Korea. ( pdf )
  • Langner, B., Black, A. (2004) Creating a Database of Speech In Noise For Unit Selection Synthesis, 5th ISCA Speech Synthesis Workshop, Pittsburgh, USA. ( pdf )
  • Raux, A. and Eskenazi, M. (2004) Non-Native Users in the Let's Go! Spoken Dialog System: Dealing With Linguistic Mismatch HLT/NAACL 2004, Boston, MA ( pdf )
  • Raux, A. and Black, A. (2003) A Unit Selection Approach to F0 Modeling and Its Application to Emphasis ASRU 2003, St Thomas, US Virgin Is. ( pdf )
  • Raux, A., Langner, B., Black, A. and Eskenazi, M. (2003) LET'S GO: Improving Spoken Dialog Systems for the Elderly and Non-natives, Eurospeech 2003, Geneva, Switzerland. ( pdf, html )

Databases

Media Coverage

Acknowledgments

This work is supported by the US National Science Foundation under grant number 0208835, "LET'S GO: improved speech interfaces for the general public". Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

We would like to thank Maureen Bertocci and Megan Huff from the Port Authority of Allegheny County for their help in this work.

LTI LET'S GO! is a project within the Language Technologies Institute at Carnegie Mellon University.
This page was last updated 1/9/06