12
Audio Engineering Society Convention Paper Presented at the 140 th Convention 2016 June 4–7, Paris, France This convention paper was selected based on a submitted abstract and 750-word precis that have been peer reviewed by at least two qualified anonymous reviewers. The complete manuscript was not peer reviewed. This convention paper has been reproduced from the author’s advance manuscript without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. This paper is available in the AES E-Library (http://www.aes.org/e-lib), all rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society. Local Wave Field Synthesis by Spatial Band-limitation in the Circular/Spherical Harmonics Domain Nara Hahn, Fiete Winter, and Sascha Spors Institute of Communications Engineering, University of Rostock, Germany Correspondence should be addressed to Nara Hahn ([email protected]) ABSTRACT The achievable accuracy of sound field synthesis (SFS) techniques, such as Wave Field Synthesis (WFS), is mainly limited in practice due to the limited loudspeaker density. Above the so called spatial aliasing frequency, considerable artifacts are introduced in the synthesized sound field. In local SFS, the accuracy within a local listening area is increased at the cost of degradations outside. In this paper a new approach for local WFS is proposed. The WFS driving functions are computed based on an order-limited harmonics expansion of the target sound field. A local listening area is created around the shifted expansion center where the synthesized sound field exhibits higher accuracy. The size of the local area is controlled by the expansion order of the driving function. The derivations of 2D, 3D and 2.5D driving functions are given, and the synthesized sound fields are evaluated by numerical simulations. 1 Introduction Sound field synthesis (SFS) is a spatial sound re- production technique where a desired sound field is physically reconstructed by a loudspeaker array. The loudspeakers, termed secondary sources, are driven individually in such a way that the super- position of the reproduced sound fields approaches the target sound field. Two well established SFS methods are Wave Field Synthesis (WFS) and near-field compensated higher-order Ambisonics (NFC-HOA) [1, 2, 3, 4]. Theoretically, a continu- ous distribution of secondary sources is required for a perfect synthesis. In practice, only a finite number of secondary sources can be placed at dis- crete positions. The spatial discretization results in spatial aliasing and a prominent artifacts are in- troduced in the synthesized sound field. In general, the amount of artifacts depends on the number and the distribution of the secondary sources, as well as on the virtual source position and the listener position [5]. Instead of considering the entire listening area, local SFS techniques aim at increasing the physi- cal accuracy inside a predetermined local listening area [6, 7, 8, 9]. This is typically achieved at the cost of stronger artifacts outside the local area. In NFC-HOA, for instance, the local listening area can be shifted by translating the expansion cen- ter to a target listening position [7]. The size of 9596

Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Audio Engineering Society

Convention PaperPresented at the 140th Convention

2016 June 4–7, Paris, FranceThis convention paper was selected based on a submitted abstract and 750-word precis that have been peer reviewedby at least two qualified anonymous reviewers. The complete manuscript was not peer reviewed. This conventionpaper has been reproduced from the author’s advance manuscript without editing, corrections, or considerationby the Review Board. The AES takes no responsibility for the contents. This paper is available in the AESE-Library (http://www.aes.org/e-lib), all rights reserved. Reproduction of this paper, or any portion thereof, isnot permitted without direct permission from the Journal of the Audio Engineering Society.

Local Wave Field Synthesis by Spatial Band-limitation inthe Circular/Spherical Harmonics DomainNara Hahn, Fiete Winter, and Sascha Spors

Institute of Communications Engineering, University of Rostock, Germany

Correspondence should be addressed to Nara Hahn ([email protected])

ABSTRACT

The achievable accuracy of sound field synthesis (SFS) techniques, such as Wave Field Synthesis (WFS),is mainly limited in practice due to the limited loudspeaker density. Above the so called spatial aliasingfrequency, considerable artifacts are introduced in the synthesized sound field. In local SFS, the accuracywithin a local listening area is increased at the cost of degradations outside. In this paper a new approachfor local WFS is proposed. The WFS driving functions are computed based on an order-limited harmonicsexpansion of the target sound field. A local listening area is created around the shifted expansion centerwhere the synthesized sound field exhibits higher accuracy. The size of the local area is controlled by theexpansion order of the driving function. The derivations of 2D, 3D and 2.5D driving functions are given,and the synthesized sound fields are evaluated by numerical simulations.

1 Introduction

Sound field synthesis (SFS) is a spatial sound re-production technique where a desired sound fieldis physically reconstructed by a loudspeaker array.The loudspeakers, termed secondary sources, aredriven individually in such a way that the super-position of the reproduced sound fields approachesthe target sound field. Two well established SFSmethods are Wave Field Synthesis (WFS) andnear-field compensated higher-order Ambisonics(NFC-HOA) [1, 2, 3, 4]. Theoretically, a continu-ous distribution of secondary sources is requiredfor a perfect synthesis. In practice, only a finitenumber of secondary sources can be placed at dis-

crete positions. The spatial discretization resultsin spatial aliasing and a prominent artifacts are in-troduced in the synthesized sound field. In general,the amount of artifacts depends on the number andthe distribution of the secondary sources, as wellas on the virtual source position and the listenerposition [5].

Instead of considering the entire listening area,local SFS techniques aim at increasing the physi-cal accuracy inside a predetermined local listeningarea [6, 7, 8, 9]. This is typically achieved at thecost of stronger artifacts outside the local area. InNFC-HOA, for instance, the local listening areacan be shifted by translating the expansion cen-ter to a target listening position [7]. The size of

9596

Page 2: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

the local area is controlled by the spatial band-width (circular/spherical harmonic order) of thedriving function. Some extensions of local NFC-HOA were introduced and investigated in [10]. Forlocal WFS, two approaches are known so far. In [8],a virtual loudspeaker array is created using focusedsources. By choosing a dense distribution of virtualloudspeakers, the spatial aliasing frequency is in-creased and a better performance can be achievedin the local area. The second local WFS methodproposed in [9] employs the equivalent scatteringapproach [11]. Here, the boundary of the local lis-tening area is assumed to be a sound-soft scatterer.For a given incident field (target field), the localWFS driving functions are computed based on thesound field scattered by the virtual boundary.

In this paper, an alternative method for local WFSis presented. Unlike the above mentioned localWFS methods, no explicit boundary is consid-ered. It rather exploits the properties of a spatiallyband-limited sound field which exhibits higher ac-curacy around the expansion center. Similar to(local) NFC-HOA, the desired sound field is de-scribed as a circular/spherical harmonics expansionwith a limited order. The local WFS driving func-tions are obtained by (i) computing the directionalgradient at each secondary source position and(ii) applying a spatial window that activates onlythe secondary sources illuminated by the virtualsource [12]. Since the derivatives of the spheri-cal/circular harmonics and the Bessel functionsare known, analytic driving functions can be de-rived for typical virtual sources, e.g. plane wavesand point sources.

The derivation of WFS driving function based on aharmonics representation was also presented in [13],but in a slightly different context. It was assumedthat the sound field is captured e.g. by a sphericalmicrophone array and thus the direction of arrivalof the source is unknown. The captured soundfield is thus decomposed into plane waves so thatthe selection of active secondary sources becomesmore convenient. In this paper, it is assumedthat the position of the virtual source is known.The driving functions are thus computed directlyfrom the harmonics coefficients. The focus is onincreasing the local accuracy of the synthesizedsound field.

It is worth noting that the presented derivationsare mathematically similar to the works in [14],[15] and [16]. In the latter studies, the sourcedirectivity is represented by spherical/circular har-monics. The center of expansion is placed outsidethe listening area, and thus the sound field is ex-pressed as an exterior expansion using singularbasis functions. The expansion order depends onthe complexity of the directivity. In this paper,on the contrary, the expansion center has to belocated within the listening area, and the soundfield is described by an interior expansion usingregular basis functions. The expansion order ofthe sound field is determined by the size of thelocal area.

This paper is structured as follows. Notational andmathematical conventions are defined in Sec. 1.1.The concept of local SFS are reviewed and thelocal WFS driving functions are derived in Sec. 2.The proposed method is evaluated by numericalsimulations and the properties of the synthesizedsound field are discussed in Sec. 3.

1.1 Nomenclature and MathematicalPreliminaries

In this paper, the complex sound pressure at posi-tion x is denoted by uppercase S(x, ω) where theradial frequency ω is related to the temporal fre-quency by ω = 2πf . The complex unit i is definedas i2 = −1, and the speed of sound is denoted byc and assumed to be constant.

The cylindrical coordinates (ρ, ϕ, z) is related toits Cartesian coordinates (x, y, z) by

x = ρ cosϕ

y = ρ sinϕ

z = z

where ρ denotes the radial distance from the z-axisand ϕ the polar angle from the x-axis. The spheri-cal coordinates (r, θ, φ) is related to the Cartesiancoordinates by

x = r cosφ sin θ

y = r sinφ sin θ

z = r cosφ

AES 140th Convention, Paris, France, 2016 June 4–7Page 2 of 12

Page 3: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

where r denotes the radial distance from the origin,φ the azimuth angle from the x-axis, and θ thecolatitude angle measured from the z-axis.

Within a source-free region, a propagating soundfield can be represented as a combination of theregular basis solutions of the wave equation. A two-dimensional homogeneous sound field independentto the z-coordinate can be expanded with respectto a line (ρ = ρc, ϕ = ϕc) parallel to the z-axis [17,Eq. (4.49)]

S(ρ′, ϕ′, ω) =∞∑

µ=−∞Sµ(ω)Jµ(ωc ρ

′)eiµϕ′

(1)

where Jµ(ωc ρ′) denotes the Bessel function of the

first kind of order µ, eiµϕ′the µ-th circular har-

monic and Sµ(ω) the corresponding expansion co-efficient. The primed variables (ρ′, ϕ′) denote theradial distance and the polar angle from the lineof expansion, respectively.

A three-dimensional homogeneous sound field canbe represented as a spherical harmonics expan-sion [17, Eq. (6.140)]

S(x−xc, ω)

=

∞∑n=0

n∑m=−n

Smn (ω)jn(ωc r′)Y mn (θ′, φ′) (2)

where jn(ωc r′) denotes the spherical Bessel function

of the first kind of order n, Y mn (θ′, φ′) the sphericalharmonic, and Smn (ω) the corresponding expansioncoefficient. The spherical harmonic is defined as

Y mn (θ′, φ′) =√

2n+14π

(n−m)!(n+m)!P

mn (cos θ′)eimφ

′(3)

with Pmn (·) denoting the associated Legendre poly-nomial. The primed variables are defined asx− xc = (r′, θ′, φ′).

Inside a cylindrical (ρ′ < R) or spherical (r′ < R)region, the sound field can be approximated by afinite number of harmonics with bounded error [18].Both for (1) and (2), the required order is typicallyapproximated by N ≡ dωcRe where d·e denotesthe ceiling function. The corresponding numberof harmonics is 2N + 1 for the circular harmonicsexpansion

SN (x− xc, ω) =N∑

µ=−NSµ(ω)Jµ(ωc ρ

′)eiµϕ′

(4)

virtualsource

x0

n0

V

∂V

P (x, ω)

Vl

∂Vl

Fig. 1: Local sound field synthesis. The desiredsound field is synthesized inside the locallistening area Vl by using the secondarysources distributed on ∂V .

and (N+1)2 for the spherical harmonics expansion

SN (x− xc, ω)

=N∑n=0

n∑m=−n

Smn (ω)jn(ωc r′)Y mn (θ′, φ′). (5)

In the remainder, N is frequently referred to asthe spatial bandwidth of a sound field.

For convenience, the coordinate system is alwaystranslated by −xc so that the expansion centercoincides with the origin of the coordinate sys-tem. Thus, (ρ′, ϕ′, z′) = (ρ, ϕ, z) and (r′, θ′, φ′) =(r, θ, φ).

2 Local WFS

2.1 Problem Statement

As illustrated in Fig. 1, local SFS techniques aim atthe physical reconstruction of a desired sound fieldS(x, ω) within a local region x ∈ Vl by using mul-tiple loudspeakers distributed on x0 ∈ ∂V . Thesynthesized sound field P (x, ω) is a superpositionof the sound fields emitted by the individual sec-ondary sources. For a continuous secondary sourcedistribution, the reproduced sound field reads

P (x, ω) =

∫∂V

D(x0, ω)G(x− x0, ω)dA0 (6)

AES 140th Convention, Paris, France, 2016 June 4–7Page 3 of 12

Page 4: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

where D(x0, ω) denotes the driving function ofthe secondary source at x0 and G(x − x0, ω)the Green’s function characterizing the spatio-temporal transfer function of the secondary sources.Equation (6) is either a surface integral or a con-tour integral depending on the dimensionality ofthe problem. The goal is to find D(x0, ω) that sat-isfies P (x, ω) = S(x, ω) for x ∈ Vl. If Vl coincideswith V it becomes a conventional SFS problem.

2.2 Driving Function

WFS constitutes a high-frequency/far-field solu-tion of (6) based on the Kirchhoff-Helmholtz inte-gral equation. A detailed theoretical introductionon WFS can be found in [2]. The WFS drivingfunction is given as the directional gradient of thesound field evaluated at the secondary source posi-tion x0 ∈ ∂V ,

D(x0, ω) = −2a(x0)⟨∇S(x, ω)|x=x0

,n0⟩

(7)

where n0 denotes the normal vector on ∂V point-ing inward, and

⟨·, ·⟩the scalar product of two

vectors. The window function a(x0) activates onlythe secondary sources where the inner product ofthe propagation direction and n0 is positive [12].In the following derivation, a(x0) is omitted forconvenience. The gradient ∇ is defined as

∇Cart ≡ ex∂

∂x+ ey

∂y+ ez

∂z(8)

in Cartesian coordinates,

∇cyl ≡ eρ∂

∂ρ+ eϕ

1

ρ

∂φ+ ez

∂z(9)

in cylindrical coordinates, and

∇sph ≡ er∂

∂r+ eθ

1

r

∂θ+ eφ

1

r sin θ

∂φ(10)

in spherical coordinates where e denotes the re-spective unit vector.

2.3 Local WFS by Spatial Band-limitation

In this paper, the sound field in the local listeningarea is controlled by limiting the spatial band-width of S(x, ω) in the circular/spherical harmon-ics domain. Due to the properties of a spatiallyband-limited sound field discussed in Sec. 1.1, only

circular/spherical local listening area is considered.The virtual sound field S(x, ω) is expanded withrespect to the center of the local area Vl. If a localarea with a radius of Rl is desired, the spatial band-width is limited by N = dωcRle. The larger thespatial bandwidth, the bigger the local listeningarea.

In the following, WFS driving functions are derivedfor different secondary source distributions anddimensionalities of the virtual sound field. Thedriving functions are summarized in Table. 1.

2.4 2D Local WFS Driving Function

Two-dimensional (2D) WFS is first considered.Here a height-invariant sound field is synthesizedby using secondary line sources having a 2D dis-tribution, e.g. linear or circular array. The sec-ondary line sources are aligned perpendicular tothe horizontal plane, i.e. parallel to the z-axis.The spatio-temporal response of a line source isgiven by the 2D Green’s function [17, Eq. (8.51)]

G2D(x− x0, ω) = − i4H

(2)0 (ωc ‖x− x0‖) (11)

where H(2)0 (·) denotes the 0-th Hankel function of

the second kind.

The virtual sound field is represented by a circularharmonics expansion (4). To derive the drivingfunction by (7), the individual components of thedirectional gradient are computed [19],

∂∂ρS(x, ω) =

N∑µ=−N

Sµ(ω) ω2c[Jµ−1(ωc ρ)− Jµ+1(ωc ρ)

]eiµϕ

(12)

and

∂∂ϕS(x, ω) =

N∑µ=−N

Sµ(ω)Jµ(ωc ρ)(iµ)eiµϕ (13)

while∂∂zS(x, ω) = 0 (14)

due to the height invariance. In (12), the recur-rence relation of Jµ(·) is exploited [19, Eq. (9.1.27)].The directional gradient is computed by substitut-ing (12), (13) and (14) into (9). The 2D WFS

AES 140th Convention, Paris, France, 2016 June 4–7Page 4 of 12

Page 5: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

driving function is then obtained by substituting(9) into (7)

D2D(x0, ω) = −2N∑

µ=−NSµ(ω)eiµϕ0

×[〈er,n0〉 ω2c

{Jµ−1(ωc ρ0)− Jµ+1(ωc ρ0)

}+ 〈eϕ,n0〉iµJµ(ωc ρ0)

]. (15)

Although not considered in this paper, the largeargument approximation of Jµ(·) may be consid-ered to reduce the computational complexity [19,Eq. (9.2.1)].

2.5 3D Driving Function

In three-dimensional (3D) WFS, a virtual soundfield is synthesized by using secondary pointsources that have a 3D distribution, e.g. planar orspherical array. The spatio-temporal transfer func-tion of a point source is given as the 3D Green’sfunction [17, Eq. (8.41)],

G3D(x− x0, ω) =1

e−iωc ‖x−x0‖

‖x− x0‖. (16)

The virtual sound field exhibits three-dimensionalproperties and can be represented by its sphericalharmonics expansion. The individual componentsof the directional gradient in (10) are

∂∂rS(x, ω) =

N∑n=0

n∑m=−n

Smn (ω) (17)

× ωc

njn−1(ωc r)− (n+ 1)jn+1(ωc r)

2n+ 1Y mn (θ, φ),

∂∂θS(x, ω) =

N∑n=0

n∑m=−n

Smn (ω)jn(ωc r) (18)

× −1sin2 θ

[(n+ 1) cos θY mn (θ, φ)

−√

2n+12n+3 ((n+ 1)2 −m2)Y mn+1(θ, φ)

]and

∂∂φS(x, ω)

=N∑n=0

n∑m=−n

Smn (ω)jn(ωc r)(im)Y mn (θ, φ). (19)

In (17) the recurrence relation of jn(·) is ex-ploited [19, Eq. (10.1.20)], and in (18) the recur-rence relation of Pmn (·) is used [19, Eq. (8.5.3) and(8.5.4)]. The directional gradient is computed bysubstituting (17), (18) and (19) into (10). TheWFS driving function is computed by substituting(10) into (7). The resulting driving function (25)is listed in Table 1.

Obviously, a SFS system with a 3D secondarysource distribution is capable of synthesizing a 2Dsound field that is represented by Sµ(ω). One maydirectly use the driving functions derived in (15).If necessary, Sµ(ω) can be converted to Smn (ω) byusing the relation in [20, Eq. (14)] and plugging itinto the 3D driving function (25).

2.6 2.5D Driving Function I – Sµ(ω)

In practice, 2D WFS is infeasible since secondaryline sources are not available. A typical loud-speaker exhibits a point-source-like characteristic.Therefore a 2D distribution (e.g. linear or circulararray) of secondary point sources is often used inSFS systems. This constitutes a dimensionalitymismatch and the 2D driving functions in (15)cannot be used directly.

According to the large argument approximation ofH(2)m (·) [19, Eq. (9.2.4)], the 2D Green’s function

is related to the 3D Green’s function by

G2D(x−x0, ω) ≈

√2π‖x−x0‖

iωcG3D(x−x0, ω)

(20)for ωc ‖x−x0‖ � 1. The latter is also known as thestationary phase approximation. If (20) is pluggedinto (6), the synthesis equation reads

P (x, ω) =∫∂V

D2D(x0, ω)

√2π‖x−x0‖

iωcG3D(x− x0, ω)dA0.

(21)

The above equation can be interpreted as SFSwhere secondary point sources are driven by the2.5D driving function

D2.5D(x0, ω) =

√2π‖x− x0‖

iωcD2D(x0, ω). (22)

AES 140th Convention, Paris, France, 2016 June 4–7Page 5 of 12

Page 6: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

x / m-1 0 1

y/m

-1.5

-1

-0.5

0

0.5

1

1.5

(a) Local WFS

x / m0.4 0.6 0.8 1 1.2

y/m

-0.4

-0.2

0

0.2

0.4

(b) Local WFS (zoom)

x / m-1 0 1

y/m

-1.5

-1

-0.5

0

0.5

1

1.5

(c) Conventional WFS

x / m0.4 0.6 0.8 1 1.2

y/m

-0.4

-0.2

0

0.2

0.4

(d) Conventional WFS (zoom)

Fig. 2: Virtual point source at (1, 1.7, 0) with frequency of f = 4 kHz was synthesized by using a circularloudspeaker array (56 loudspeakers, r0 = 1.5 m). Local WFS was used in (a) and (b) whereasthe conventional WFS was used in (c) and (d). For local WFS, the local area is centered at(0.75, 0, 0) with a radius of 0.3 m which is indicated by dashed circles. The driving functionswere computed from the spherical harmonics expansion with the maximum order of N = 22.

The driving function (22) depends on x, and thesynthesized sound field is accurate only at x, Thisis a well-known property of 2.5D SFS, which isattributed to the mismatch of the sound field (2D)and the secondary source distribution (3D). Thedriving function is typically computed for a ref-erence point which is, in this case, obviously thecenter of the local listening area x = 0. The driv-ing function thus reads

D2.5D(x0, ω) =

√2πρ0iωc

D2D(x0, ω). (23)

Note that the additional term corrects the ampli-

tude decay and compensates the low-pass charac-teristic of the secondary line sources.

2.7 2.5D Driving Function II – Smn (ω)

Another type of 2.5D WFS is considered whereS(x, ω) is represented by a spherical harmonicsexpansion but the secondary sources have a 2D dis-tribution, e.g. linear or circular array. While thevirtual sound field exhibits 3D properties, only a2D sound field can be synthesized. It is thus conve-nient to convert the spherical harmonics expansioninto a circular harmonics expansion. This processreduces one dimension of the sound field, and thusthe conversion is non-invertible. One possibility

AES 140th Convention, Paris, France, 2016 June 4–7Page 6 of 12

Page 7: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

x / m-1 0 1

y/m

-1.5

-1

-0.5

0

0.5

1

1.5

(a) Local WFS

x / m0.4 0.6 0.8 1 1.2

y/m

-0.4

-0.2

0

0.2

0.4

(b) Local WFS (zoom)

x / m-1 0 1

y/m

-1.5

-1

-0.5

0

0.5

1

1.5

(c) Conventional WFS

x / m0.4 0.6 0.8 1 1.2

y/m

-0.4

-0.2

0

0.2

0.4

(d) Conventional WFS (zoom)

Fig. 3: Virtual plane waves (φPW = −π2 ) synthesized by using a circular loudspeaker array (56 loudspeak-ers, radius of r0 = 1.5 m). Local WFS was used in (a) and (b) whereas the conventional WFSwas used in (c) and (d). The local listening area is centered at (0.75, 0, 0) with radius of 0.3 mwhich is indicated by dashed circles. The driving functions were computed from the circularharmonics expansion where the expansion order depends on the frequency N (f) = d 2πfc Rle. Themaximum order was 122 for f = fs/2.

is to project the sound field onto the horizontalplane, as suggested in [13]. In this paper, onlythe horizontal slice of the virtual sound field issynthesized, i.e. S(x, ω)|z=0.

In order to convert Smn (ω) to Sµ(ω), the order ofsummation in (5) is exchanged,

∞∑m=−∞

∞∑n=|m|

Smn (ω)jn(ωc r)Ymn (θ, 0)

eimφ

(28)where Y mn (θ, φ) = Y mn (θ, 0)eimφ is exploited. Com-

paring (28) with (4) on the xy-plane (θ = π2 ) yields

∞∑n=|m|

Smn (ω)jn(ωc r)Ymn (θ, 0) = SmJm(ωc r) (29)

where ρ is substituted by r. As shown in [20,Eq. (13)], the Bessel function can be representedas a weighted sum of spherical Bessel functions,

Jm(ωc r) =∞∑

n=|m|

4πim−nY −mn (π2 , 0)jn(ωc r)Ymn (π2 , 0), (30)

AES 140th Convention, Paris, France, 2016 June 4–7Page 7 of 12

Page 8: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

Driving functions Section

D2D(x0, ω) = −2a(x0)N∑

µ=−NSµ(ω)eiµϕ0

×[〈er,n0〉 ω2c

{Jµ−1(ωc ρ0)− Jµ+1(ωc ρ0)

}+ 〈eϕ,n0〉iµJµ(ωc ρ0)

] (24) 2.4

D3D(x0, ω) = −2a(x0)N∑n=0

n∑m=−n

Smn (ω)[〈er,n0〉 1

2n+1ωc

{njn−1(ωc r0)− (n+ 1)jn+1(ωc r0)

}Y mn (θ0, φ0)

+ 〈eθ,n0〉 −1r0 sin2 θ0

jn(ωc r0){

(n+ 1) cos θ0Ymn (θ0, φ0)

−√

2n+12n+3 ((n+ 1)2 −m2)Y mn+1(θ0, φ0)

}+ 〈eφ,n0〉 im

r0 sin θ0jn(ωc r0)Y mn (θ0, φ0)

](25) 2.5

D2.5D(x0, ω) =

√2π‖x− x0‖

iωc×D2D(x0, ω) (26) 2.6

D2.5D(x0, ω) = −2a(x0)N∑

m=−N

Sm|m|(ω)eimϕ0

4πi|m|−mY −m|m| (π2 , 0)

×[〈er,n0〉 ω2c

{Jm−1(ωc ρ0)− Jm+1(ωc ρ0)

}+ 〈eϕ,n0〉imJm(ωc ρ0)

] (27) 2.7

Table 1: WFS driving functions derived from circular/spherical harmonics expansions.

and thus

Sm(ω) = (31)∑∞n=|m| S

mn (ω)jn(ωc r)Y

mn (π2 , 0)∑∞

n=|m| 4πim−nY −mn (π2 , 0)jn(ωc r)Y

mn (π2 , 0)

.

The conversion of the coefficients is only validfor a distinct radius r and θ. Thus, the soundpressures of the spherically and the cylindricallyexpanded sound fields do only coincide at thatcoordinate. Choosing the origin and applying the

rule of l’Hospital results in

Sm(ω) =Sm|m|(ω)

4πim−|m|Y −m|m| (π2 , 0). (32)

The 2.5D driving function can be computed bysubstituting (32) into (26) (see Table 1). Theresulting driving function (27) is listed in Table 1.

Note that the driving function only requires asubset of Smn (ω) where n = |m|. Not surprisingly,NFC-HOA also uses the same coefficients for 2.5Dsynthesis using a circular array [4, Eq. (20)].

AES 140th Convention, Paris, France, 2016 June 4–7Page 8 of 12

Page 9: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

Frequency / Hz10

210

310

4

Magnitude/dB

-15

-10

-5

0

5

10

15

Conventional WFS Local WFS

(a) expansion center at (0, 0, 0)

Frequency / Hz10

210

310

4

Magnitude/dB

-15

-10

-5

0

5

10

15

Conventional WFS Local WFS

(b) expansion center at (0.5, 0, 0)

Fig. 4: Frequency responses of a broadband plane wave (φPW = −π2 ) synthesized by a circular loudspeakerarray (56 loudspeakers, r0 = 1.5 m). For local WFS, the local area (Rl = 0.3 m) in (a) and (b)are centered at (0, 0, 0) and (0.5, 0, 0), respectively. The driving functions were computed fromthe circular harmonics expansion with a fixed order N = 22.

3 Evaluation

The driving functions derived in the previous sec-tion are used in numerical simulations of variousscenarios. Only 2.5D WFS is considered whereeither a circular or rectangular array is used. Thecircular array consists of 56 loudspeakers as thesystem installed in the Technical University ofBerlin. The rectangular array with 64 loudspeak-ers is identical to the system installed in the Uni-versity of Rostock [21]. The loudspeakers are as-sumed to be monopole point sources. Free-fieldsound propagation was considered and the speed ofsound was set to 343 m/s. The sampling frequencywas fs = 44.1 kHz. All simulation results wereobtained by the Sound Field Synthesis Toolboxver. 2.1.0 [22].

The advantage of local WFS compared to (non-local) WFS is demonstrated in Fig. 2. A monochro-matic sound field of a point source is synthesizedwhere the frequency (4 kHz) is well above the spa-tial aliasing frequency (≈ 1 kHz) of the system.The driving functions are computed by using (27)together with the spherical harmonics expansioncoefficients of a point source,

Smn (ω) = −iωc h(2)n (ωc rPS)Y −mn (θPS, φPS) (33)

where (rPS, θPS, φPS) is the spherical coordinatesof the source position. The reproduced sound fieldfor conventional WFS suffers from strong artifactsregardless of the listening position. In local WFS,on the other hand, the sound field within the lo-cal listening area (dashed circle) exhibits a betterspatial structure. The amplitude is more evenlydistributed and also the phase is correctly synthe-sized.

In Fig. 3, a broadband plane wave is synthesizedcarrying a Dirac shaped signal. The driving func-tion is computed for discrete frequencies and trans-formed to the time domain by inverse Fourier trans-form. In conventional WFS, shown in Fig. 3(c) and3(d), the first wavefront is perfectly synthesizedbut it is followed by multiple wavefronts arrivingfrom different directions. In local WFS, a sharpwavefront is observed in the middle of the localarea. Interestingly, the sound field in Fig. 3(a)looks very similar to the sound field synthesizedby NFC-HOA [23, Fig. 1(c)] except the spatialshift by (0.5, 0, 0). This is not surprising becauseboth in NFC-HOA and local WFS, spatial band-limitation is applied to the target sound field. Thisrepresentation of the sound field seems to have astrong influence on the synthesized sound field.

AES 140th Convention, Paris, France, 2016 June 4–7Page 9 of 12

Page 10: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

x / m-1 0 1

y/m

-1.5

-1

-0.5

0

0.5

1

1.5

(a) Local WFS

x / m0.4 0.6 0.8 1 1.2

y/m

-0.4

-0.2

0

0.2

0.4

(b) Local WFS (zoom)

x / m-1 0 1

y/m

-1.5

-1

-0.5

0

0.5

1

1.5

(c) Conventional WFS

x / m0.4 0.6 0.8 1 1.2

y/m

-0.4

-0.2

0

0.2

0.4

(d) Conventinal WFS (zoom)

Fig. 5: Virtual plane wave (φPW = −π4 , f = 4 kHz) synthesized by local WFS and conventional WFSusing a rectangular loudspeaker array (64 loudspeakers). Local WFS was used in (a) and (b)whereas the conventional WFS was used in (c) and (d). The local listening area is indicatedby dashed circles centered at (0.75, 0, 0) with a radius of 0.3 m. The driving functions werecomputed from the spherical harmonics expansion with maximum order of N = 22.

The frequency responses for different listening po-sitions are shown in Fig. 4. Two different locallistening areas centered at (0, 0, 0) and (0.5, 0, 0)are considered. The radius of the listening areais 0.3 m in both cases. Below the aliasing fre-quency (≈ 1 kHz) the magnitude responses arealmost identical, but at high frequencies the ad-vantage of local WFS is clearly visible. In con-ventional WFS, spatial aliasing adds more energyin high frequencies thereby resulting in a high-pass characteristic. Due to the strong fluctuation,only an approximate equalization is possible. Intypical WFS systems, a pre-equalization filter isapplied to the input signal [24]. Local WFS, onthe other hand, exhibits better spectral properties.

The fluctuation is in the order of ±1 dB. Both forconventional and local WFS, a low-pass character-istic is observed below 100 Hz which is attributedto the high-frequency assumption in the derivationof WFS.

WFS can be applied for arbitrary secondary sourcedistributions. In Fig. 5, a monochromatic soundfield is synthesized by using a rectangular array.The advantage of local WFS is clearly observable.

4 Conclusion

A new approach for local WFS is presented inthis paper. It is based on the representation of

AES 140th Convention, Paris, France, 2016 June 4–7Page 10 of 12

Page 11: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

the virtual sound field by a band-limited circu-lar/spherical harmonics expansion. Due to theproperty of spatial band-limitation, the sound fieldwithin a local region can be synthesized with higherprecision than conventional WFS. Local WFS driv-ing functions are derived for 2D, 2.5D and 3DWFS as summarized in Table 1. Two variations of2.5D WFS driving functions are presented wherecircular and spherical harmonics representationsare considered, respectively.

As demonstrated by numerical simulations, thelocal SFS method improves the technical perfor-mance inside a predetermined target region. There-fore, local SFS can be applied where the listener(s)can be assumed to be in a fixed region or if thelistener position can be estimated by a trackingsystem. The achievable perceptual improvement,however, is still under investigation. The compari-son between different local SFS methods is an opentopic and will be addressed in following studies.

References

[1] Berkhout, A. J., de Vries, D., and Vogel, P.,“Acoustic Control by Wave Field Synthesis,”The Journal of the Acoustical Society of Amer-ica (JASA), 93(5), pp. 2764–2778, 1993.

[2] Spors, S., Rabenstein, R., and Ahrens, J.,“The Theory of Wave Field Synthesis Revis-ited,” in Proc. of 124th Audio EngineeringSociety (AES) Convention, pp. 17–20, 2008.

[3] Daniel, J., Moreau, S., and Nicol, R., “Fur-ther Investigations of High-order Ambisonicsand Wavefield Synthesis for Holophonic SoundImaging,” in Proc. of 114th Audio EngineeringSociety (AES) Convention, 2003.

[4] Ahrens, J. and Spors, S., “An Analytical Ap-proach to Sound Field Reproduction UsingCircular and Spherical Loudspeaker Distri-butions,” Acta Acustica united with Acustica,94(6), pp. 988–999, 2008.

[5] Spors, S. and Rabenstein, R., “Spatial Alias-ing Artifacts Produced by Linear and Circu-lar Loudspeaker Arrays used for Wave FieldSynthesis,” in Proc. of 120th Audio Engineer-ing Society (AES) Convention, Paris, France,2006.

[6] Wu, Y. J. and Abhayapala, T. D., “Spa-tial Multizone Soundfield Reproduction,” inProc. of International Conference on Acous-tics, Speech and Signal Processing (ICASSP),pp. 93–96, IEEE, 2009.

[7] Ahrens, J. and Spors, S., “An Analytical Ap-proach to Sound Field Reproduction with aMovable Sweet Spot using Circular Distribu-tions of Loudspeakers,” in Proc. of Interna-tional Conference on Acoustics, Speech andSignal Processing (ICASSP), Taipei, Taiwan,2009.

[8] Spors, S. and Ahrens, J., “Local SoundField Synthesis by Virtual Secondary Sources,”in Proc. of 40th Audio Engineering Soci-ety (AES) International Conference on Spa-tial Audio, Tokyo, Japan, 2010, ISBN 978-0-937803-77-6.

[9] Spors, S., Helwani, K., and Ahrens, J., “Lo-cal Sound Field Synthesis by Virtual Acous-tic Scattering and Time-reversal,” in Proc. of131st Audio Engineering Society (AES) Con-vention, Audio Engineering Society (AES),New York, USA, 2011.

[10] Winter, F., Ahrens, J., and Spors, S., “OnAnalytic Methods for 2.5 D Local SoundField Synthesis Using Circular Distributionsof Secondary Sources,” IEEE Transactions onSpeech and Audio Processing, pp. 914–926,2016.

[11] Fazi, F. M., Sound field reproduction, Ph.D.thesis, University of Southampton, 2010.

[12] Spors, S., “Extension of an Analytic SecondarySource Selection Criterion for Wave Field Syn-thesis,” in Proc. of 123rd Audio EngineeringSociety (AES) Convention, New York, USA,2007.

[13] Ahrens, J. and Spors, S., “Wave Field Synthe-sis of a Sound Field Described by SphericalHarmonics Expansion Coefficients,” Journalof the Acoustical Society of America (JASA),131(3), pp. 2190–2199, 2012, doi:10.1121/1.3682036.

AES 140th Convention, Paris, France, 2016 June 4–7Page 11 of 12

Page 12: Audio Engineering Society Convention Paper 9596 · 2016-09-13 · Audio Engineering Society Convention Paper Presented at the 140th Convention 2016 June 4–7, Paris, ... i.e. parallel

Hahn et al. Local WFS by Spatial Band-limitation

[14] Fazi, F. M., Olivieri, F., Carpentier, T., andNoisternig, M., “Wave Field Synthesis of Vir-tual Sound Sources with Axisymmetric Ra-diation Pattern Using a Planar LoudspeakerArray,” in Proc. of 134th Audio EngineeringSociety (AES) Convention, Audio Engineer-ing Society, 2013.

[15] Ahrens, J. and Spors, S., “Implementationof Directional Sources in Wave Field Synthe-sis,” in Proc. of Workshop on Applicationsof Signal Processing to Audio and Acoustics(WASPAA), IEEE, New Paltz, USA, 2007.

[16] Corteel, E., “Synthesis of Directional Sourcesusing Wave Field Synthesis, Possibilities, andLimitations,” EURASIP Journal on Advancesin Signal Processing, 2007(1), pp. 1–18, 2007.

[17] Williams, E. G., Fourier Acoustics: Sound Ra-diation and Nearfield Acoustical Holography,Academic Press, 1999.

[18] Kennedy, R. A., Sadeghi, P., Abhayapala,T. D., and Jones, H. M., “Intrinsic Limitsof Dimensionality and Richness in RandomMultipath Fields,” IEEE Transactions on Sig-nal Processing, 55(6), pp. 2542–2556, 2007.

[19] Abramowitz, M. and Stegun, I. A., Handbookof Mathematical Functions: with Formulas,Graphs, and Mathematical Tables, 55, CourierDover Publications, 1972.

[20] Hahn, N. and Spors, S., “Sound Field Synthe-sis of Virtual Cylindrical Waves using Circularand Spherical Loudspeaker Arrays,” in Proc.of 138th Audio Engineering Society (AES)Convention, Warsaw, Poland, 2015.

[21] Erbes, V., Geier, M., Weinzierl, S., and Spors,S., “Database of Single-channel and Binau-ral Room Impulse Responses of a 64-channelLoudspeaker Array,” in Proc. of 138th Au-dio Engineering Society (AES) Convention,Warsaw, Poland, 2015.

[22] Wierstorf, H. and Spors, S., “Sound FieldSynthesis Toolbox,” in Proc. of 132nd AudioEngineering Society (AES) Convention, 2012.

[23] Ahrens, J., Spors, S., and Wierstorf, H.,“Comparison of Higher Order Ambisonics and

Wave Field Synthesis with respect to Spa-tial Discretization Artifacts in Time Domain,”in Proc. of 40th Audio Engineering Society(AES) International Conference on SpatialAudio, 2010.

[24] Spors, S. and Ahrens, J., “Analysis andImprovement of Pre-equalization in 2.5-dimensional Wave Field Synthesis,” in Proc.of 128th Audio Engineering Society (AES)Convention, London, UK, 2010, ISBN 978-0-937803-74-5.

AES 140th Convention, Paris, France, 2016 June 4–7Page 12 of 12