Compressively sampled speech: How good is the recovery?

Authors

Kenneth V. Domingo National Institute of Physics, University of the Philippines Diliman
Maricor N. Soriano National Institute of Physics, University of the Philippines Diliman

Abstract

Modern signal acquisition technologies are made possible by the Nyquist-Shannon sampling theorem (NST). However, this paradigm is extremely wasteful as the signal is compressed before storing it by systematically discarding imperceptible information. Compressive sensing (CS) aims to directly sense the relevant information. Current literature focus either on formulating more computationally-efficient algorithms, or methods which improve the reconstruction quality. In this paper, we quantify the reconstruction quality of compressively sampled speech with a perceptually intuitive metric–the Perceptual Evaluation of Speech Quality (PESQ)–and with the standard average segmental SNR (SNR_seg). The quality of recovery of compressively sampled speech evaluated using PESQ is dependent on the compression ratio, and independent of the number of subbands used to represent the signal in the spectrogram domain.

Downloads

Issue

2020: Proceedings of the 38th Samahang Pisika ng Pilipinas Physics Conference

Article ID

SPP-2020-4C-04

Section

Instrumentation, Imaging, and Signal Processing

Published

2020-10-19

How to Cite

[1]

KV Domingo and MN Soriano, Compressively sampled speech: How good is the recovery?, Proceedings of the Samahang Pisika ng Pilipinas 38, SPP-2020-4C-04 (2020). URL: https://proceedings.spp-online.org/article/view/SPP-2020-4C-04.