|
Let's Go Lab is a research project on speech interfaces and dialog systems
conducted by researchers from the Language Technologies Institute at
Carnegie Mellon University's School of Computer Science. Funding for
Let's Go Lab comes from the U.S. National Science Foundation.
For additional information contact Maxine
Eskenazi (max(at)cs(dot)cmu(dot)edu) or Alan W Black.
People | Publications | Databases | Media Coverage | Acknowledgments
| Total Number of calls from March 2005 to March 2009: | | 70,731 |
| Total Number of calls in the period: | | 2,072 |
| Average number of user turns per dialogue: | | 13.04 |
| Number of complete dialogues (with database access): | | 63% |
* A significant amount of calls contain no speech directed to the system
(e.g. the user hangs up before saying anything). The number of calls
with 4 user turns or more is a simple estimate of the number of valid dialogues.
Although many recent systems have achieved effective
spoken interaction with a computer, they tend
to target the "average" portions of the population, those whose
speech and hearing fall within the norm.
Systems capable of finding and presenting useful information for the
average user have been developed, but these systems
cannot be used by everyone. People who are, for some
reason, considered to be outliers in the general population cannot
yet access the information these systems provide. This project is working
on providing a system that can be used by everyone, including the extremes
found in the general public.
The Let's Go! project is working in the domain of bus information, providing
information such as schedules and route information for the
Port Authority
of Allegheny County buses in and around Pittsburgh. You can interact with a
version of this system right now by calling 412-268-3526 (requires some knowledge
of Pittsburgh's transit system).
This system is connected to the Port Authority's main phone line during
non-business hours (M-F 7pm-6am, Weekends/Holidays 6pm-8am), providing
schedule information over the phone to Allegheny County bus riders when
they would otherwise be unable to get it.
An example interaction with this system is also available.
- Raux, A., Eskenazi, M. (2008) Optimizing Endpointing Thresholds using Dialogue Features in a Spoken Dialogue System, , SIGdial 2008, Columbus, OH, USA..( pdf )
- Raux, A., Eskenazi, M. (2007) A Multi-Layer Architecture for Semi-Synchronous Event-Driven Dialogue Management, ASRU 2007, Kyoto, Japan.( pdf )
- Ai, H., Raux, A., Bohus, D., Eskenazi, M., and Litman, D. (2007) Comparing Spoken Dialog Corpora Collected with Recruited Subjects versus Real Users, 8th SIGDial Workshop on Discourse and Dialogue, Antwerp, Belgium. ( pdf )
- Bohus, D., Raux, A., Harris, T., Eskenazi, M., Rudnicky, A. (2007) Olympus: an open-source framework for conversational spoken language interface research, HLT-NAACL 2007 workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technology, Rochester, NY, USA. ( pdf )
- Bohus, D., Langner, B., Raux, A., Black, A., Eskenazi, M., Rudnicky, A. (2006) Online Supervised Learning of Non-understanding Recovery Policies, SLT 2006, Aruba, Aruba.( pdf )
- Raux, A., Bohus, D., Langner, B., Black A., Eskenazi, M. (2006) Doing Research on a Deployed Spoken Dialogue System: One Year of Let's Go! Experience, Interspeech 2006, Pittsburgh, USA. ( pdf )
- Langner, B., Black, A. (2005) Using Speech in Noise to Improve Understandability for Elderly Listeners, ASRU 2005, San Juan, Puerto Rico. ( pdf )
- Raux, A., Langner, B., Black, A., Eskenazi, M. (2005) Let's Go Public! Taking a Spoken Dialog System to the Real World, Interspeech 2005 (Eurospeech), Lisbon, Portugal. ( pdf )
- Langner, B., Black, A. (2005) Improving the Understandability of Speech Synthesis by Modeling Speech In Noise, ICASSP 2005, Philadelphia, USA ( pdf )
- Langner, B., Black, A. (2004) An Examination of Speech In Noise and its Effect on Understandability for Natural and Synthetic Speech, Carnegie Mellon University, Language Technologies Institute, Technical Report CMU-LTI-04-187. ( pdf )
- Raux, A., Automated Lexical Adaptation and Speaker Clustering based on Pronunciation Habits for Non-Native Speech Recognition, INTERSPEECH (ICSLP) 2004, Jeju Island, Korea. ( pdf )
- Raux, A. and Singh, R. (2004) Maximum Likelihood Adaptation of Semi-Continuous HMMs by Latent Variable Decomposition of State Distributions, INTERSPEECH (ICSLP) 2004, Jeju Island, Korea. ( pdf )
- Langner, B., Black, A. (2004) Creating a Database of Speech In Noise For Unit Selection Synthesis, 5th ISCA Speech Synthesis Workshop, Pittsburgh, USA. ( pdf )
- Raux, A. and Eskenazi, M. (2004) Non-Native Users in the Let's Go! Spoken Dialog System: Dealing With Linguistic Mismatch HLT/NAACL 2004, Boston, MA ( pdf )
- Raux, A. and Black, A. (2003) A Unit Selection Approach to F0 Modeling and Its Application to Emphasis ASRU 2003, St Thomas, US Virgin Is. ( pdf )
- Raux, A., Langner, B., Black, A. and Eskenazi, M. (2003) LET'S GO: Improving Spoken Dialog Systems for the Elderly and Non-natives, Eurospeech 2003, Geneva, Switzerland. ( pdf, html )
This work is supported by the US National Science Foundation
under grant number 0208835, "LET'S GO: improved speech interfaces
for the general public". Any opinions, findings, and conclusions
or recommendations expressed in this material are those of the
authors and do not necessarily reflect the views of the National
Science Foundation.
We would like to thank Maureen Bertocci and Megan
Huff from the Port Authority of Allegheny County for their help in
this work.
|