DOI: 10.1145/1821748.1821820
research-article

Audio data model for multi-criteria query formulation and retrieval

Published: 14 December 2009

Abstract

The amount of available audio data is increasing rapidly as a consequence of advances in media creation, storage, and compression technologies. This rapid increase imposes new demands on audio data management and retrieval.
In this work, we propose an audio data model and a repository model to meet user requirements for retrieving audio data from large collections. The proposed audio data repository model supports multi-criteria query formulation and audio data retrieval, whereby audio can be queried by both its low-level and high-level features. We discuss a generic audio repository model that can handle general audio, as well as a sub-repository model that can manipulate speech through its constituent units. Finally, the viability of the proposed model is demonstrated by a prototype system developed for an application in the medical domain.
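
To make the idea of multi-criteria retrieval concrete, the following is a minimal sketch, not the model proposed in the paper. It assumes that low-level features can be represented as a numeric vector and high-level features as key/value annotations. The names `AudioRecord`, `distance`, and `query`, the Euclidean metric, and the sample data are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from math import sqrt
from typing import Dict, List, Optional


@dataclass
class AudioRecord:
    """One repository entry (hypothetical schema, not the paper's)."""
    audio_id: str
    low_level: List[float]  # assumed: a numeric feature vector
    high_level: Dict[str, str] = field(default_factory=dict)  # assumed: annotations


def distance(a: List[float], b: List[float]) -> float:
    """Euclidean distance between two equal-length feature vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def query(repo: List[AudioRecord],
          high_level: Optional[Dict[str, str]] = None,
          like: Optional[List[float]] = None,
          max_distance: Optional[float] = None) -> List[AudioRecord]:
    """Return the records that satisfy every supplied criterion.

    high_level: exact-match constraints on the annotations.
    like, max_distance: keep records whose feature vector lies within
    max_distance of the example vector `like`, ordered nearest first.
    """
    wanted = high_level or {}
    hits = [r for r in repo
            if all(r.high_level.get(k) == v for k, v in wanted.items())]
    if like is not None:
        if max_distance is not None:
            hits = [r for r in hits
                    if distance(r.low_level, like) <= max_distance]
        hits.sort(key=lambda r: distance(r.low_level, like))
    return hits


# Made-up sample data for illustration only.
repo = [
    AudioRecord("a1", [0.1, 0.9], {"type": "speech", "domain": "medical"}),
    AudioRecord("a2", [0.8, 0.2], {"type": "music"}),
    AudioRecord("a3", [0.2, 0.7], {"type": "speech", "domain": "medical"}),
    AudioRecord("a4", [0.9, 0.9], {"type": "speech", "domain": "medical"}),
]

# Combine a high-level criterion with a low-level similarity criterion.
result = query(repo, high_level={"type": "speech"},
               like=[0.2, 0.8], max_distance=0.3)
ids = [r.audio_id for r in result]
```

In this sketch the two kinds of criteria are combined conjunctively: the annotation filter narrows the candidate set first, and the similarity threshold then ranks what remains. A real repository would likely use indexed storage and domain-specific features instead of a linear scan.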

Published In

MoMM '09: Proceedings of the 7th International Conference on Advances in Mobile Computing and Multimedia
December 2009
663 pages
ISBN: 9781605586595
DOI: 10.1145/1821748
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Johannes Kepler University

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. audio data management for medical application (ADMMA)
  2. audio data model
  3. audio data repository model
  4. multi-criteria audio data retrieval

Qualifiers

  • Research-article

Conference

MoMM '09
