Using Probabilistic Feature Matching to Understand Spoken Descriptions

Zukerman, Ingrid; Makalic, Enes; Niemann, Michael

doi:10.1007/978-3-540-89378-3_16

Ingrid Zukerman, Enes Makalic & Michael Niemann

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5360)

Abstract

We describe a probabilistic reference disambiguation mechanism developed for a spoken dialogue system mounted on an autonomous robotic agent. Our mechanism performs probabilistic comparisons between features specified in referring expressions (e.g. size and colour) and features of objects in the domain. The results of these comparisons are combined using a function weighted on the basis of the specified features. Our evaluation shows high reference resolution accuracy across a range of spoken referring expressions.
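
To make the idea of weighted probabilistic feature matching concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not give the comparison functions or the combining function, so everything here is an assumption for illustration: the feature weights in `WEIGHTS`, the toy matchers `colour_match` and `size_match`, the 20 cm size boundary, and the choice of a weighted geometric mean as the combining function.

```python
from math import exp, log

# Hypothetical feature weights: the abstract only says the combining
# function is weighted on the basis of the specified features.
WEIGHTS = {"colour": 2.0, "size": 1.0}


def colour_match(spoken, actual):
    """Toy colour comparison: 1.0 for identical names, a small floor otherwise."""
    return 1.0 if spoken == actual else 0.05


def size_match(spoken, actual_cm):
    """Toy size comparison with a soft boundary (assumed to sit at 20 cm)."""
    p_large = 1.0 / (1.0 + exp(-(actual_cm - 20.0) / 5.0))
    return p_large if spoken == "large" else 1.0 - p_large


MATCHERS = {"colour": colour_match, "size": size_match}


def score(expression, obj):
    """Weighted geometric mean of the per-feature match probabilities."""
    total_weight = sum(WEIGHTS[f] for f in expression)
    log_score = 0.0
    for feature, spoken_value in expression.items():
        p = MATCHERS[feature](spoken_value, obj[feature])
        # Clamp to avoid log(0) when a matcher returns an exact zero.
        log_score += WEIGHTS[feature] * log(max(p, 1e-9))
    return exp(log_score / total_weight)


def resolve(expression, objects):
    """Return (object name, normalised probability) pairs, best match first."""
    raw = {name: score(expression, obj) for name, obj in objects.items()}
    z = sum(raw.values())
    ranked = [(name, s / z) for name, s in raw.items()]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


# Made-up domain objects and a referring expression such as "the large blue one".
objects = {
    "mug_1": {"colour": "blue", "size": 9.0},
    "mug_2": {"colour": "red", "size": 9.0},
    "bucket": {"colour": "blue", "size": 35.0},
}
expression = {"colour": "blue", "size": "large"}

for name, probability in resolve(expression, objects):
    print(f"{name}: {probability:.3f}")
```

Only the features mentioned in the referring expression contribute to the score, so an expression that specifies colour alone would be resolved on colour alone. A weighted geometric mean is one of several plausible combining functions; a weighted sum would be an equally reasonable assumption.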

Author information

Authors and Affiliations

Faculty of Information Technology, Monash University, Clayton, Victoria 3800, Australia
Ingrid Zukerman, Enes Makalic & Michael Niemann

Editor information

Editors and Affiliations

School of Computer Science and Engineering, University of New South Wales, Sydney, NSW 2052, Australia
Wayne Wobcke
School of Mathematics, Statistics and Computer Science, Victoria University of Wellington, P.O. Box 600, 6140, Wellington, New Zealand
Mengjie Zhang

About this paper

Cite this paper

Zukerman, I., Makalic, E., Niemann, M. (2008). Using Probabilistic Feature Matching to Understand Spoken Descriptions. In: Wobcke, W., Zhang, M. (eds) AI 2008: Advances in Artificial Intelligence. AI 2008. Lecture Notes in Computer Science, vol 5360. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89378-3_16

DOI: https://doi.org/10.1007/978-3-540-89378-3_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89377-6
Online ISBN: 978-3-540-89378-3
eBook Packages: Computer Science, Computer Science (R0)
