Skip to main content

Bando Lab

  • Demonstrating our real-time dialogue analysis system at an exhibition
  • Field test of unsupervised audio-visual sound source localization
  • Demonstration of dialogue analysis in reverberant environments
  • Hose-shaped robot operating in extreme environments

Bando Lab is a sub-team of the Social Intelligence Research Team, Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), and is also affiliated with the Intelligent Acoustics Laboratory at the University of Tsukuba through the Cooperative Graduate School Program.

We focus on intelligent systems centered on machine listening, including sound event detection (SED) and distant automatic speech recognition (DASR). Our current research emphasizes self-supervised learning and large-scale pretraining to establish frameworks robust in complex real-world conditions.

Contact #

2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Japan
AIST Waterfront Annex 8F

y.bando [at] aist.go.jp

Recent

M3L is an open-source toolkit for multi-modal machine listening, providing infrastructure and recipes to build, train, and evaluate models that combine audio with other modalities.

Aiaccel

Aiaccel is an open-source Python toolkit designed to accelerate machine learning research, especially on high-performance computing (HPC) clusters such as ABCI.

LEAD Dataset

The LEAD dataset provides strong labels for sound events, in which each clip has 20 different annotations. It allows us to investigate how annotations vary among different annotators and develop SED models that are robust to the variations.