Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shoichi Koyama

Weighted Pressure and Mode Matching for Sound Field Reproduction: Theoretical and Experimental Comparisons

Mar 23, 2023

Shoichi Koyama, Keisuke Kimura, Natsuki Ueno

Abstract:Two sound field reproduction methods, weighted pressure matching and weighted mode matching, are theoretically and experimentally compared. The weighted pressure and mode matching are a generalization of conventional pressure and mode matching, respectively. Both methods are derived by introducing a weighting matrix in the pressure and mode matching. The weighting matrix in the weighted pressure matching is defined on the basis of the kernel interpolation of the sound field from pressure at a discrete set of control points. In the weighted mode matching, the weighting matrix is defined by a regional integration of spherical wavefunctions. It is theoretically shown that the weighted pressure matching is a special case of the weighted mode matching by infinite-dimensional harmonic analysis for estimating expansion coefficients from pressure observations. The difference between the two methods are discussed through experiments.

* Accepted to Journal of Audio Engineering Society, Special Issue on Spatial Audio

Via

Access Paper or Ask Questions

Kernel interpolation of acoustic transfer functions with adaptive kernel for directed and residual reverberations

Mar 07, 2023

Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari

Abstract:An interpolation method for region-to-region acoustic transfer functions (ATFs) based on kernel ridge regression with an adaptive kernel is proposed. Most current ATF interpolation methods do not incorporate the acoustic properties for which measurements are performed. Our proposed method is based on a separate adaptation of directional weighting functions to directed and residual reverberations, which are used for adapting kernel functions. Thus, the proposed method can not only impose constraints on fundamental acoustic properties, but can also adapt to the acoustic environment. Numerical experimental results indicated that our proposed method outperforms the current methods in terms of interpolation accuracy, especially at high frequencies.

* 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
* To appear in ICASSP 2023

Via

Access Paper or Ask Questions

Weighted Pressure Matching Based on Kernel Interpolation For Sound Field Reproduction

Oct 26, 2022

Shoichi Koyama, Kazuyuki Arikawa

Abstract:A sound field reproduction method called weighted pressure matching is proposed. Sound field reproduction is aimed at synthesizing the desired sound field using multiple loudspeakers inside a target region. Optimization-based methods are derived from the minimization of errors between synthesized and desired sound fields, which enable the use of an arbitrary array geometry in contrast with integral-equation-based methods. Pressure matching is widely used in the optimization-based sound field reproduction methods because of its simplicity of implementation. Its cost function is defined as the synthesis errors at multiple control points inside the target region; then, the driving signals of the loudspeakers are obtained by solving a least-squares problem. However, in pressure matching, the region between the control points is not taken into consideration. We define the cost function as the regional integration of the synthesis error over the target region. On the basis of the kernel interpolation of the sound field, this cost function is represented as the weighted square error of the synthesized pressures at the control points. Experimental results indicate that the proposed weighted pressure matching outperforms conventional pressure matching.

* Presented at 24th International Congress on Acoustics (ICA) 2022

Via

Access Paper or Ask Questions

Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning

Jul 22, 2022

Yuki Ito, Tomohiko Nakamura, Shoichi Koyama, Hiroshi Saruwatari

Figure 1 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning

Figure 2 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning

Figure 3 for Head-Related Transfer Function Interpolation from Spatially Sparse Measurements Using Autoencoder with Source Position Conditioning

Abstract:We propose a method of head-related transfer function (HRTF) interpolation from sparsely measured HRTFs using an autoencoder with source position conditioning. The proposed method is drawn from an analogy between an HRTF interpolation method based on regularized linear regression (RLR) and an autoencoder. Through this analogy, we found the key feature of the RLR-based method that HRTFs are decomposed into source-position-dependent and source-position-independent factors. On the basis of this finding, we design the encoder and decoder so that their weights and biases are generated from source positions. Furthermore, we introduce an aggregation module that reduces the dependence of latent variables on source position for obtaining a source-position-independent representation of each subject. Numerical experiments show that the proposed method can work well for unseen subjects and achieve an interpolation performance with only one-eighth measurements comparable to that of the RLR-based method.

* Accepted to International Workshop on Acoustic Signal Enhancement (IWAENC) 2022

Via

Access Paper or Ask Questions

Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation

Jul 22, 2022

Kazuhide Shigemi, Shoichi Koyama, Tomohiko Nakamura, Hiroshi Saruwatari

Figure 1 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation

Figure 2 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation

Figure 3 for Physics-informed convolutional neural network with bicubic spline interpolation for sound field estimation

Abstract:A sound field estimation method based on a physics-informed convolutional neural network (PICNN) using spline interpolation is proposed. Most of the sound field estimation methods are based on wavefunction expansion, making the estimated function satisfy the Helmholtz equation. However, these methods rely only on physical properties; thus, they suffer from a significant deterioration of accuracy when the number of measurements is small. Recent learning-based methods based on neural networks have advantages in estimating from sparse measurements when training data are available. However, since physical properties are not taken into consideration, the estimated function can be a physically infeasible solution. We propose the application of PICNN to the sound field estimation problem by using a loss function that penalizes deviation from the Helmholtz equation. Since the output of CNN is a spatially discretized pressure distribution, it is difficult to directly evaluate the Helmholtz-equation loss function. Therefore, we incorporate bicubic spline interpolation in the PICNN framework. Experimental results indicated that accurate and physically feasible estimation from sparse measurements can be achieved with the proposed method.

* Accepted to International Workshop on Acoustic Signal Enhancement (IWAENC) 2022

Via

Access Paper or Ask Questions

Region-to-region kernel interpolation of acoustic transfer function with directional weighting

May 05, 2022

Juliano G. C. Ribeiro, Shoichi Koyama, Hiroshi Saruwatari

Figure 1 for Region-to-region kernel interpolation of acoustic transfer function with directional weighting

Figure 2 for Region-to-region kernel interpolation of acoustic transfer function with directional weighting

Figure 3 for Region-to-region kernel interpolation of acoustic transfer function with directional weighting

Figure 4 for Region-to-region kernel interpolation of acoustic transfer function with directional weighting

Abstract:A method of interpolating the acoustic transfer function (ATF) between regions that takes into account both the physical properties of the ATF and the directionality of region configurations is proposed. Most spatial ATF interpolation methods are limited to estimation in the region of receivers. A kernel method for region-to-region ATF interpolation makes it possible to estimate the ATFs for both source and receiver regions from a discrete set of ATF measurements. We newly formulate the reproducing kernel Hilbert space and associated kernel function incorporating directional weight to enhance the interpolation accuracy. We also investigate hyperparameter optimization methods for this kernel function. Numerical experiments indicate that the proposed method outperforms the method without the use of directional weighting.

* ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 576-580
* To appear in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Via

Access Paper or Ask Questions

Spatial active noise control based on individual kernel interpolation of primary and secondary sound fields

Feb 10, 2022

Kazuyuki Arikawa, Shoichi Koyama, Hiroshi Saruwatari

Figure 1 for Spatial active noise control based on individual kernel interpolation of primary and secondary sound fields

Figure 2 for Spatial active noise control based on individual kernel interpolation of primary and secondary sound fields

Figure 3 for Spatial active noise control based on individual kernel interpolation of primary and secondary sound fields

Figure 4 for Spatial active noise control based on individual kernel interpolation of primary and secondary sound fields

Abstract:A spatial active noise control (ANC) method based on the individual kernel interpolation of primary and secondary sound fields is proposed. Spatial ANC is aimed at cancelling unwanted primary noise within a continuous region by using multiple secondary sources and microphones. A method based on the kernel interpolation of a sound field makes it possible to attenuate noise over the target region with flexible array geometry. Furthermore, by using the kernel function with directional weighting, prior information on primary noise source directions can be taken into consideration. However, whereas the sound field to be interpolated is a superposition of primary and secondary sound fields, the directional weight for the primary noise source was applied to the total sound field in previous work; therefore, the performance improvement was limited. We propose a method of individually interpolating the primary and secondary sound fields and formulate a normalized least-mean-square algorithm based on this interpolation method. Experimental results indicate that the proposed method outperforms the method based on total kernel interpolation.

* Accepted to International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022

Via

Access Paper or Ask Questions

Mean-square-error-based secondary source placement in sound field synthesis with prior information on desired field

Dec 10, 2021

Keisuke Kimura, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari

Figure 1 for Mean-square-error-based secondary source placement in sound field synthesis with prior information on desired field

Figure 2 for Mean-square-error-based secondary source placement in sound field synthesis with prior information on desired field

Figure 3 for Mean-square-error-based secondary source placement in sound field synthesis with prior information on desired field

Figure 4 for Mean-square-error-based secondary source placement in sound field synthesis with prior information on desired field

Abstract:A method of optimizing secondary source placement in sound field synthesis is proposed. Such an optimization method will be useful when the allowable placement region and available number of loudspeakers are limited. We formulate a mean-square-error-based cost function, incorporating the statistical properties of possible desired sound fields, for general linear-least-squares-based sound field synthesis methods, including pressure matching and (weighted) mode matching, whereas most of the current methods are applicable only to the pressure-matching method. An efficient greedy algorithm for minimizing the proposed cost function is also derived. Numerical experiments indicated that a high reproduction accuracy can be achieved by the placement optimized by the proposed method compared with the empirically used regular placement.

* Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021

Via

Access Paper or Ask Questions

Sound Field Reproduction With Weighted Mode Matching and Infinite-Dimensional Harmonic Analysis: An Experimental Evaluation

Nov 22, 2021

Shoichi Koyama, Keisuke Kimura, Natsuki Ueno

Figure 1 for Sound Field Reproduction With Weighted Mode Matching and Infinite-Dimensional Harmonic Analysis: An Experimental Evaluation

Figure 2 for Sound Field Reproduction With Weighted Mode Matching and Infinite-Dimensional Harmonic Analysis: An Experimental Evaluation

Figure 3 for Sound Field Reproduction With Weighted Mode Matching and Infinite-Dimensional Harmonic Analysis: An Experimental Evaluation

Figure 4 for Sound Field Reproduction With Weighted Mode Matching and Infinite-Dimensional Harmonic Analysis: An Experimental Evaluation

Abstract:Sound field reproduction methods based on numerical optimization, which aim to minimize the error between synthesized and desired sound fields, are useful in many practical scenarios because of their flexibility in the array geometry of loudspeakers. However, the reproduction performance of these methods in a practical environment has not been sufficiently investigated. We evaluate weighted mode matching, which is a sound field reproduction method based on the spherical wavefunction expansion of the sound field, in comparison with conventional pressure matching. We also introduce a method of infinite-dimensional harmonic analysis for estimating the expansion coefficients of the sound field from microphone measurements. Experimental results indicated that weighted mode matching using the expansion coefficients of the transfer functions estimated by the infinite-dimensional harmonic analysis outperforms conventional pressure matching, especially when the number of microphones is small.

* Accepted to International Conference on Immersive and 3D Audio (I3DA) 2021

Via

Access Paper or Ask Questions

Kernel Learning For Sound Field Estimation With L1 and L2 Regularizations

Oct 12, 2021

Ryosuke Horiuchi, Shoichi Koyama, Juliano G. C. Ribeiro, Natsuki Ueno, Hiroshi Saruwatari

Figure 1 for Kernel Learning For Sound Field Estimation With L1 and L2 Regularizations

Figure 2 for Kernel Learning For Sound Field Estimation With L1 and L2 Regularizations

Figure 3 for Kernel Learning For Sound Field Estimation With L1 and L2 Regularizations

Abstract:A method to estimate an acoustic field from discrete microphone measurements is proposed. A kernel-interpolation-based method using the kernel function formulated for sound field interpolation has been used in various applications. The kernel function with directional weighting makes it possible to incorporate prior information on source directions to improve estimation accuracy. However, in prior studies, parameters for directional weighting have been empirically determined. We propose a method to optimize these parameters using observation values, which is particularly useful when prior information on source directions is uncertain. The proposed algorithm is based on discretization of the parameters and representation of the kernel function as a weighted sum of sub-kernels. Two types of regularization for the weights, $L_1$ and $L_2$, are investigated. Experimental results indicate that the proposed method achieves higher estimation accuracy than the method without kernel learning.

* Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021

Via

Access Paper or Ask Questions