いびきGANs.合成データを用いた自動いびき音分類の改善

/ /

日本語AIでPubMedを検索

PubMedの提供する医学論文データベースを日本語で検索できます。AI(Deep Learning)を活用した機械翻訳エンジンにより、精度高く日本語へ翻訳された論文をご参照いただけます。

IEEE J Biomed Health Inform.2020 01;24(1):300-310. doi: 10.1109/JBHI.2019.2907286.Epub 2019-04-01.

いびきGANs.合成データを用いた自動いびき音分類の改善

Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data.

Zixing Zhang
Jing Han
Kun Qian
Christoph Janott
Yanan Guo
Bjorn Schuller

PMID: 30946682 DOI: 10.1109/JBHI.2019.2907286.

抄録

いびき音自動分類(ASSC)の開発を難しくしている問題の一つに、十分な教師付き学習データがないことが挙げられる。この問題に対処するために、本研究では半教師付き条件生成逆説ネットワーク(scGAN)を用いて、ランダムなノイズ空間から元のデータ分布へのマッピング戦略を自動的に学習することを目的とした新しいデータ増強アプローチを提案する。提案するアプローチは、アノテーション処理を必要とせず、現実的な高次元データの合成が可能である。さらに、GANのモード崩壊問題を扱うために、生成データの多様性を高めるためのアンサンブル戦略を導入する。広く利用されているミュンヘン・パッソーのいびき音コーパスを用いた系統的な実験により、scGANsをベースとしたシステムが他の古典的なデータ増強システムを著しく凌駕することが実証され、また最近報告されたASSCのための他のシステムと比較しても競争力があることが示された。

One of the frontier issues that severely hamper the development of automatic snore sound classification (ASSC) associates to the lack of sufficient supervised training data. To cope with this problem, we propose a novel data augmentation approach based on semi-supervised conditional generative adversarial networks (scGANs), which aims to automatically learn a mapping strategy from a random noise space to original data distribution. The proposed approach has the capability of well synthesizing "realistic" high-dimensional data, while requiring no additional annotation process. To handle the mode collapse problem of GANs, we further introduce an ensemble strategy to enhance the diversity of the generated data. The systematic experiments conducted on a widely used Munich-Passau snore sound corpus demonstrate that the scGANs-based systems can remarkably outperform other classic data augmentation systems, and are also competitive to other recently reported systems for ASSC.