1
An Open Hardware and Software Microphone Array System for Robotic Applications Franc ¸ois Grondin, Dominic L´ etourneau, Franc ¸ois Ferland, Franc ¸ois Michaud IntRoLab – Intelligent, Interactive, Integrated Robotics Lab Interdisciplinary Institute for Technological Innovation Universit´ e de Sherbrooke, 3000, boul. de l’Universit´ e, Sherbrooke (QC) Canada J1K 2A0 Email: {francois.grondin2, dominic.letourneau, francois.ferland, francois.michaud}@USherbrooke.ca Fig. 1. ManyEars Open Framework KEYWORDS Open source – Sound source localization – Sound source separation – Mobile robotics – USB sound card – Open hardware – Microphone array ABSTRACT Autonomous robots must be able to perceive sounds from their environment in order to interact naturally with humans. ManyEars is an open framework for microphone array-based audio processing, which allows a robot to localize, track, and separate multiple sound sources, for improved speech and sound recognition in real-world settings [1]. The system runs in real-time on a personal computer and is able to reliably localize and track up to four of the loudest sound sources in reverberant and noisy environments when eight microphones are used [1]. It can also separate up to three sources in an adverse environment with a suitable signal-to-noise ratio improvement for speech recognition [2]. To facilitate maintenance and portability, ManyEars is im- plemented in C as a modular library, with no dependance on external libraries. The source code is available online [3] under the GNU GPL license. A Graphical User Interface (GUI) (also available online [4]) is a complementary tool used to display in real-time the tracked sound sources and to facilitate configuration and tuning of the parameters of the ManyEars library. ManyEars is also implemented as an open software library with Willow Garage’s Robot Operating System (ROS) [4]. ManyEars can be used with commercially available sound cards and microphones. However, these sound cards present limitations when used for embedded robotic applications: they are usually expensive, they have superfluous functionalities, and they also require significant amount of power and size. To facilitate the use of ManyEars on various robotic platforms, customized microphone boards and sound card are distributed as open hardware solutions. The proposed XMOS based sound card is low-cost, compact (12.5 cm × 7.4 cm × 1.5 cm), has eight analog inputs and one stereo output, and can be powered by either the Universal Serial Bus (USB) or an external wide range DC power supply. Audio data are transferred to the computer with the USB 2.0 High-Speed Audio Class 2.0 standard, which makes it compatible with Window, Linux and Mac OS X. The first implementation of ManyEars has been used on the robot Spartacus [5] and the new C version is currently used on the robot IRL-1 [6]. ACKNOWLEDGMENT This work was supported in part by the Natural Sciences and Engineering Research Council of Canada, the Fonds de recherche du Qu´ ebec – Nature et technologies, the Canadian Foundation for Innovation and the Canada Research Chair program. REFERENCES [1] J.-M. Valin, F. Michaud, and J. Rouat, “Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering,” Robotics and Autonomous Systems, vol. 55, no. 3, pp. 216–228, 2006. [2] S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Ko- matani, T. Ogata, and H. Okuno, “Real-time Robot Audition System that Recognizes Simultaneous Speech in the Real World,” in Proceedings of the IEEE International Conference on Intelligent Robots and Systems, 2006, pp. 5333–5338. [3] F. Grondin, J.-M. Valin, and D. L´ etourneau. (2012) The ManyEars Project: Microphone Array-Based Audition for Mobile Robots. [Online]. Available: http://manyears.sourceforge.net [4] IntRoLab. (2012) ManyEars ROS Package. [Online]. Available: http://introlab.github.com/introlab-ros-pkg/ [5] F. Michaud, C. Cˆ ot´ e, D. L´ etourneau, Y. Brosseau, J.-M. Valin, E. Beaudry, C. Raievsky, A. Ponchon, P. Moisan, P. Lepage, Y. Morin, F. Gagnon, P. Giguere, M.-A. Roux, S. Caron, P. Frenette, and F. Kabanza, “Spartacus Attending the 2005 AAAI Conference,” Autonomous Robots, vol. 22, no. 4, pp. 369 – 383, 2007. [6] F. Ferland, D. L´ etourneau, A. Aumont, J. Fr´ emy, M.-A. Legault, M. Lau- ria, and F. Michaud, “Natural interaction design of a humanoid robot,” To appear in the Journal of Human-Robot Interaction.

An Open Hardware and Software Microphone Array System for Robotic … · 2018-11-26 · library with Willow Garage’s Robot Operating System (ROS) [4]. ManyEars can be used with

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: An Open Hardware and Software Microphone Array System for Robotic … · 2018-11-26 · library with Willow Garage’s Robot Operating System (ROS) [4]. ManyEars can be used with

An Open Hardware and Software MicrophoneArray System for Robotic Applications

Francois Grondin, Dominic Letourneau, Francois Ferland, Francois MichaudIntRoLab – Intelligent, Interactive, Integrated Robotics Lab

Interdisciplinary Institute for Technological InnovationUniversite de Sherbrooke, 3000, boul. de l’Universite, Sherbrooke (QC) Canada J1K 2A0

Email: {francois.grondin2, dominic.letourneau, francois.ferland, francois.michaud}@USherbrooke.ca

Fig. 1. ManyEars Open Framework

KEYWORDS

Open source – Sound source localization – Sound sourceseparation – Mobile robotics – USB sound card – Openhardware – Microphone array

ABSTRACT

Autonomous robots must be able to perceive sounds fromtheir environment in order to interact naturally with humans.ManyEars is an open framework for microphone array-basedaudio processing, which allows a robot to localize, track, andseparate multiple sound sources, for improved speech andsound recognition in real-world settings [1]. The system runsin real-time on a personal computer and is able to reliablylocalize and track up to four of the loudest sound sources inreverberant and noisy environments when eight microphonesare used [1]. It can also separate up to three sources inan adverse environment with a suitable signal-to-noise ratioimprovement for speech recognition [2].

To facilitate maintenance and portability, ManyEars is im-plemented in C as a modular library, with no dependanceon external libraries. The source code is available online [3]under the GNU GPL license. A Graphical User Interface (GUI)(also available online [4]) is a complementary tool used todisplay in real-time the tracked sound sources and to facilitateconfiguration and tuning of the parameters of the ManyEarslibrary. ManyEars is also implemented as an open softwarelibrary with Willow Garage’s Robot Operating System (ROS)[4].

ManyEars can be used with commercially available soundcards and microphones. However, these sound cards presentlimitations when used for embedded robotic applications: theyare usually expensive, they have superfluous functionalities,and they also require significant amount of power and size. Tofacilitate the use of ManyEars on various robotic platforms,customized microphone boards and sound card are distributedas open hardware solutions. The proposed XMOS based soundcard is low-cost, compact (12.5 cm × 7.4 cm × 1.5 cm), haseight analog inputs and one stereo output, and can be poweredby either the Universal Serial Bus (USB) or an external widerange DC power supply. Audio data are transferred to thecomputer with the USB 2.0 High-Speed Audio Class 2.0standard, which makes it compatible with Window, Linux andMac OS X. The first implementation of ManyEars has beenused on the robot Spartacus [5] and the new C version iscurrently used on the robot IRL-1 [6].

ACKNOWLEDGMENT

This work was supported in part by the Natural Sciencesand Engineering Research Council of Canada, the Fonds derecherche du Quebec – Nature et technologies, the CanadianFoundation for Innovation and the Canada Research Chairprogram.

REFERENCES

[1] J.-M. Valin, F. Michaud, and J. Rouat, “Robust localization and trackingof simultaneous moving sound sources using beamforming and particlefiltering,” Robotics and Autonomous Systems, vol. 55, no. 3, pp. 216–228,2006.

[2] S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Ko-matani, T. Ogata, and H. Okuno, “Real-time Robot Audition System thatRecognizes Simultaneous Speech in the Real World,” in Proceedings ofthe IEEE International Conference on Intelligent Robots and Systems,2006, pp. 5333–5338.

[3] F. Grondin, J.-M. Valin, and D. Letourneau. (2012) The ManyEarsProject: Microphone Array-Based Audition for Mobile Robots. [Online].Available: http://manyears.sourceforge.net

[4] IntRoLab. (2012) ManyEars ROS Package. [Online]. Available:http://introlab.github.com/introlab-ros-pkg/

[5] F. Michaud, C. Cote, D. Letourneau, Y. Brosseau, J.-M. Valin, E. Beaudry,C. Raievsky, A. Ponchon, P. Moisan, P. Lepage, Y. Morin, F. Gagnon,P. Giguere, M.-A. Roux, S. Caron, P. Frenette, and F. Kabanza, “SpartacusAttending the 2005 AAAI Conference,” Autonomous Robots, vol. 22,no. 4, pp. 369 – 383, 2007.

[6] F. Ferland, D. Letourneau, A. Aumont, J. Fremy, M.-A. Legault, M. Lau-ria, and F. Michaud, “Natural interaction design of a humanoid robot,”To appear in the Journal of Human-Robot Interaction.