Let us consider the problem of determining
the structure of objects. Similar problems often arise when solving various
technical problems of object control. Of particular interest are the tasks of
medical diagnostics and non-destructive testing methods. When researching, it
is necessary to rely on one principle, such as non-invasiveness. This means
that the object under consideration must maintain integrity during the
research; it is impossible to penetrate and destroy the object under
consideration. In cases where it comes to medical diagnostics, additional
restrictions are imposed on the measurement methods used. These problems are
often solved using electrodynamic or acoustic methods by exposing the object of
measurement to a radiation source. Such approaches are well studied and are
essentially classical in acoustics and electrodynamics and are called “inverse
problems of acoustics and electrodynamics”.
In acoustics or electrodynamics, the term
“inverse problem” usually refers to problems related to the search for and
identification of inhomogeneities in objects. Two main classes of these
problems can be distinguished: time-dependent and time-independent problems.
The first class of problems is effectively solved by using finite-difference
methods. The second class is the most difficult, so it is usually solved by
reducing the boundary value problem to integral equations. In this paper, the
emphasis is on problems that are not related to time. It is worth noting that
most inverse problems are ill-posed and nonlinear, which makes the process of
solving them extremely difficult. Even small changes in the input data can
significantly affect the results. Nonlinearity, in turn, adds difficulties to
the solution.
Initial attempts to solve inverse problems
were tied to the development of simple iterative methods, which have their pros
and cons. Among the advantages is the ability to work with incomplete data,
while a crucial drawback is the necessity of finding a high-quality initial approximation.
This paper proposes a numerical method for
solving acoustic problems using neural networks, which is due to the need to
find effective methods for filtering data. The importance of developing new
methods can be demonstrated using the example of medical diagnostics, where,
despite the existing modern diagnostic equipment, the issues of detection
accuracy and procedure safety remain relevant. Effective mathematical
algorithms can help solve these problems not only in medicine, but also in flaw
detection.
Multiple attempts to solve inverse
diffraction problems on screens have been well studied in the works of domestic
and foreign researchers [1-16].
The propagation of sound waves in free
space
and their interaction with objects is a
complex problem that is actively studied in acoustics. Imagine a
two-dimensional object, such as an airplane wing or part of a building, located
in free space. A sound wave
,
approaches
this object, for example, a loudspeaker, an airplane engine, or even a person.
Our goal is to determine the complete sound field
that
occurs around and on the surface of this object, taking into account both the
incident sound wave from the source and the reflected and scattered waves from
the object itself. This problem, determining the complete field from a known
incident field, is called the forward problem of acoustics. Solving this
problem is critical in many fields. In aeroacoustics, for example,
understanding the interaction of sound waves with an airplane wing allows us to
design quieter aircraft. In architectural acoustics, modeling sound propagation
in a room helps optimize the acoustic performance of concert halls or recording
studios by minimizing echo and reverberation. In medicine, acoustic modeling is
used to design and optimize ultrasound diagnostic devices. Even in the field of
underwater sonar, understanding how sound waves interact with underwater
objects is key to detecting and identifying targets. Methods for solving direct
acoustic problems vary and depend on the complexity of the object and the
frequency range of the sound waves. For simple geometric objects such as
spheres or cylinders, analytical solutions based on diffraction and scattering
theory can be used. However, for objects of complex shape, there are no
analytical solutions, and numerical methods must be used. The choice of an
appropriate method depends on many factors, and the constant development of numerical
methods and computing technology allows us to solve increasingly complex
problems.
Fig. 1. The problem of diffraction on a body.
The behavior of the scattered field
can be determined by solving the
inhomogeneous Helmholtz equation:
|
(1)
|
The function is piecewise continuous and
is determined by the relation
.
Here
- defines the wave parameters inside the object,
.
- is the wave parameter
of free space. The right-hand side of equation (1) is given by a known function
with a compact support. We require that the
conjugation conditions be met at the interface between the two media.
|
(2)
|
where
denotes
a field jump.
To ensure
the uniqueness of the problem, we write the Sommerfeld radiation conditions:
|
(3)
|
Problem
(1)-(3) is reduced to a linear inhomogeneous Lippmann-Schwinger integral
equation using the second Green formula [3,8]:
|
(4)
|
where
are
the Hankel functions.
The
Lippmann–Schwinger integral equation is a powerful mathematical tool widely
used to solve wave scattering problems in a variety of fields of physics. Its
fundamental importance lies in the ability to reformulate the problem of wave
interaction with an obstacle (or potential) from a differential equation, often
difficult to solve analytically, into an integral equation. This integral
equation takes into account the influence of the scattering object on the
incident wave by means of an integral covering the entire interaction domain.
This approach is especially useful when the geometry of the scattering object
is complex or when the interaction potential is not a smooth function. In
acoustics, for example, the Lippmann–Schwinger equation allows one to calculate
the sound field scattered by an object of arbitrary shape under the influence
of a sound wave. Here, the integral describes the sum of elementary spherical
waves emitted by each point on the surface of the object in response to the
incident wave. The amplitude and phase of these secondary sources are
determined by both the incident field and the properties of the object's
material, which affect the reflection and transmission coefficients. The resulting
solution provides a complete description of the scattered field, including the
amplitude, phase, and direction of propagation of scattered waves. A similar
approach is applicable in electrodynamics, where the Lippmann-Schwinger
equation is used to describe the scattering of electromagnetic waves by various
objects, from microscopic particles to large antenna systems. In this case, the
integral takes into account the contribution of each element of the scattering
object to the resulting electromagnetic field. The solution allows one to
predict the characteristics of the scattered radiation, such as the effective
scattering cross-section (ESR), an important characteristic for radar and
remote sensing technologies.
The
operator form of equation (4) is obtained after introducing the following
notation
|
(5)
|
then
the equation takes the form:
|
(6)
|
where
We will search for solutions to equation
(4) using the space.
Statement 1.
The operator
s
Fredholm with zero index..
Lemma 1.
[3] The solution to problem (1)-(3) is
unique.
Statement 2.
The operator
is
continuously invertible..
In works [2-7] numerical studies of the integral equation (4) were carried out.
The use of identification approaches in
diffraction problems in medical diagnostics is possible only with the use of
non-invasive methods. Let us consider the issue of choosing observation points
when examining an object. It should be noted that a poor choice of points can
have a significant impact on the diagnostic results. Observation points should
be located close enough to the object under study and, if possible, cover the body
evenly from all sides. However, personal experience in studying the problem has
shown that the radiation source should be slightly removed from the observation
points to avoid the so-called sensor overexposure. We recommend removing the
observation points by one or two integration step lengths. We will place these
points evenly along the boundaries of the object under study at a small
distance from each other in several layers. A wave propagating from a point
source is used as incident radiation. In such a formulation of the problem, it
is possible to use a two-step algorithm for identifying inhomogeneities.
Fig. 2. Object, radiation source and observation points.
Let's divide a flat object into cells
.
Let's introduce the assumption that the
inhomogeneity parameters inside each cell do not change
.
Let's apply a two-step algorithm.
1) In the first step, using the field
values
measured at the observation points
,
we calculate the current value
by solving the following equation.
|
(7)
|
It should be noted that equation (7) is
the most complex part of the two-step method, since it is an equation of the
first kind. The system of linear algebraic equations obtained as a result of
solving the integral equation (7) is ill-conditioned, which leads to highly
noisy reconstructed data.
2) In the second step, we recalculate the
value of the inhomogeneity parameters
using
the value
|
(8)
|
The use of various regularization methods
and matrix preconditioning algorithms in some cases reduces the condition
number. These approaches work effectively until the matrix condition number
exceeds 1014. Therefore, these approaches are not universal. The paper also
uses an approach based on the use of neural networks.
Consider the noise reduction problem for
the two-step algorithm using neural networks. Noise reduction will be performed
at the stage of restoration of values
.
The
problem can be defined:
|
(9)
|
- noisy data represented
the sum of true signal
and some noise
.
The essence of the basic methods is to
approximate the
using
.
We choose a convolutional autoencoder as a
model for solving the filtering problem and introduce the function of recovery
error (loss function) from the true and processed by the model data:
|
(10)
|
Autoencoder is a neural network, which designed
to encode the input into a compressed representation and then decode it back such
that the reconstructed input is similar as possible to the original data. During
the dimensionality change, the data
is compressed into some
latent-space.
|
(11)
|
where
- is the
activation function.
The result of network computations is a
multiple application of (11) with different parameters. The first part of
transformations changing the dimensionality of the input tensor to the
dimensionality of the latent space is called encoder.
Then a transformation is applied to it
bringing it to its original dimensionality.
|
(12)
|
Noisy autoencoder is a stochastic extension
of the classical autoencoder designed to recover raw data from their noisy
variants. Such models can be combined into complex architectures, creating deep
neural networks to solve more complex problems.
Convolutional autoencoders utilize the
described idea of autoencoders by replacing transformations (11) with
convolutional layers.
The convolution operation is understood as
the following matrix transformation:
|
(13)
|
- output data (feature map),
- layer filter (trainable parameter)
- input data,
- bias.
From a
mathematical perspective, the convolution operation involves the sliding
application of a filter (kernel) to input data. At each step, the sum of the
products of the kernel elements and the corresponding values of the local
region of the input matrix is computed. The key parameters defining the nature
of this process are the kernel size and the stride. The former specifies the
processing window (e.g., 3×3 or 5×5), while the latter regulates
the distance between adjacent filter positions, influencing the size and overlap
of the extracted submatrices.
An important
complement to convolutions in neural networks is the pooling operation. Its
goal is to progressively reduce the spatial dimensionality of the data while
preserving the most significant features. Unlike convolution, pooling does not
use trainable parameters but instead aggregates information within local
regions (e.g., selecting the maximum value in max-pooling). This enhances the
model’s robustness to minor distortions in the input data.
|
(14)
|
In this case, the dimensions of the output
matrix can be found as:
|
(15)
|
|
(16)
|
- pooling submatrix
size
Convolutional autoencoders demonstrate a
significant advantage over classical autoencoders when working with images due
to their specialized architecture based on convolutional neural networks
(CNNs). Unlike the fully connected layers of traditional autoencoders,
convolutional layers operate on local regions of data, preserving the spatial
structure of the image and uncovering hierarchical patterns (e.g., edges,
textures, objects). This enables the model to efficiently capture the
topological features of the input data matrix, minimize the number of trainable
parameters through shared weights, and reduce the risk of overfitting.
Thus, convolutional autoencoders are not
only adapted for handling multidimensional structured data but also provide
more meaningful compression of information, which is critically important for
tasks such as image reconstruction and generation.
An example of such a model is shown in
Figure 3.
Fig.
3. Example of convolutional autoencoder architecture.
In formulas
are the
model parameters that are optimal in terms of minimizing the recovery error,
which can be achieved using different loss functions such as RMS error or
cross-entropy.
Thus, the model accumulates information
about the distribution of the reconstructed data
from the
pair estimates of the training sample.
.
Generating an example
from the training dataset.
Noising of the generated example
from
.
Estimation of the probability distribution
For the experiment, training and test
datasets were created, containing 6,000 and 2,000 examples, respectively. Each
data instance was generated according to the following algorithm:
Structure Generation: The number, geometric
shape, size, and physical parameters of inhomogeneities were assigned
stochastically based on the problem conditions.
Noise Addition: Artificial noise with a
uniform distribution was applied to the resulting matrix at three intensity
levels—15%, 30%, and 50% of the original signal’s amplitude.
The original and noise-modified data were
saved in a format suitable for neural network processing and used during its
training and validation stages. Visualizations of typical examples from the
dataset (including variations with different noise levels) are shown in Figure
4.
As an example, consider a problem with the
following initial data (Figure 4). We solve the forward problem and calculate
the field values at special observation points (Figure 5).
Fig.
4 Wave value function of initial object.
Fig.
5 Module of the solution of the integral equation (4).
We add 40% white noise into the
(Fig. 6) and filter
it using the autoencoder model (Fig. 7).
Fig.
6 Module of the solution of integral equation equation (4) without filtration.
Fig.
7 Module of the solution of the integral equation equation (4) after the
filtering procedure.
Next, the problem of restoring the body
structure is solved by substituting the calculated
into
formula (8). As a result, the value of the wave function
is restored.
|
|
(a)
|
(b)
|
Fig.
8. Modulus of the reconstructed values of the wave function: a - without
filtering, b - with filtering.
Figure 8 shows the solution of the inverse
problem for the considered figure (Fig. 4). From Fig. 8 we can conclude that
the recovery is significantly improved when using the neural network model to
filter the noisy data in the solution. The effectiveness of the model is
especially noticeable when it is applied to high noise levels.
In the previous experiment, the modulus of
the difference between the original and noisy data is a large value. As the
noise level decreases, the efficiency of gradient methods decreases. To solve
this problem, we will convert the input data to the frequency range using a
two-dimensional Fourier transform, where we will perform filtering. We will
introduce a small error of about 0.1% into the measured data and apply the
Fourier transform. The training process of the neural network model is similar
to that described above. We will approximate the noise level by the model after
preprocessing using a two-dimensional Fourier transform.
The direct two-dimensional Fourier
transform is the function:
|
(17)
|
Inverse two-dimensional transformation:
|
(18)
|
After applying the transformation (17), the
data values in the frequency range are not small, which makes it
possible to effectively use gradient methods to filter the data. The following
order of transformations for the input data is defined: the original vector is
divided into a real and imaginary part. For each of the parts, the model is
trained on the training sample.
Fig.
9 Fourier transform of the real part
Fig.
10 Fourier transform of the imaginary part
At the stage of using the trained model,
the inverse transformation (18) is applied. As a result of filtering, the noise
level can be reduced by one order of magnitude.
Fig.
11 Original (left), reconstructed (right) values of the real
part.
Fig.
12 Original (left), reconstructed (right) values of the imaginary
part.
Based on the Fourier transform, other
methods not related to machine learning can be used. There are various
algorithms, such as the recursive average algorithm, the exponentially weighted
average algorithm, the five-point moving average algorithm. However, these
algorithms do not provide a sufficient result for the task. Filtering methods
using the Fourier transform, based on the introduction of weight functions
(Hanna, Hamming, Kaiser, ...) work more efficiently.
An example of the restoration of a
sinusoidal signal using the Fourier transform is shown in the graphs below
(Fig. 13 – 14).
Fig.
13 Original (left), noisy (right) signal.
Fig.
14 Recovered signal.
However, these methods are not effective
for the problem under consideration. Therefore, it is proposed to use averaging
algorithms based on the impact of two types of error. The first is white noise,
the second is a system error and models the background impact on the input
data. When working with a sample of 100 values, it is possible to effectively
restore inhomogeneities at a system error level of about 1% and a white noise
error of 0.01%.
Fig.
15. Recovered values of the modulus of the solution of the
integral equation (4).
Figure 15 demonstrates good quality of
restoration of the value of the solution module of equation (4). However, a
minor deviation in the form of new non-existent inhomogeneities appears in the
restored data. These inhomogeneities are called artifacts. They are easily
removed by conducting additional measurements at other frequencies. When
developing medical diagnostic methods, the task is often to suspect a disease
at an early stage, when the tumor does not exceed a certain size. The obtained
results show that the algorithm copes with this problem.
The paper considers the problem of
restoring the structure of an object. The problem under consideration is of
great interest in medical diagnostics. The paper proposes algorithms that allow
restoring the structure of an object. The problem under consideration is
ill-posed, so it is proposed to use various data filtering methods. The
filtering algorithms are based on the use of neural networks. Convolutional
autoencoder models were used for training. Visualization of the obtained data
helps to separate important information about the structure of objects from
irrelevant information, including various artifacts. Graphical representation
of the data allows us to assert that the algorithms proposed in the paper
effectively restore the structure of an object at different levels of noise in
the data. To obtain numerical results, a set of programs was implemented in
C++. Data filtering and visualization were implemented in Python using the
Pytorch and Matplotlib libraries.
The work was carried out with the
support of the state assignment of the Ministry of Science and Higher Education
of the Russian Federation, (Reg. No. 124020200015-7)
1. Medvedik, Mikhail Y., Smirnov, Yury G. and Tsupak, Aleksei A.. "Inverse vector problem of diffraction by inhomogeneous body with a piecewise smooth permittivity" Journal of Inverse and Ill-posed Problems, vol. 32, no. 3, 2024, pp. 453-465. https://doi.org/10.1515/jiip-2022-0060
2. Medvedik, M., & Moskaleva, M. (2020). Numerical Method for Reconstruction of Inhomogeneity Parameters of a Body Placed in a Semi-Infinite Rectangular Waveguide. Lobachevskii Journal of Mathematics, 41, 1371-1376. DOI:10.1134/S1995080220070288
3. Medvedik, M., Smirnov, Y.G., & Tsupak, A.A. (2020). Non-iterative two-step method for solving scalar inverse 3D diffraction problem. Inverse Problems in Science and Engineering, 28, 1474 - 1492. DOI10.1080/17415977.2020.1727466
4. Medvedik, Mikhail, Yu. G. Smirnov and Aleksei A. Tsupak. ÒThe two-step method for determining a piecewise-continuous refractive index of a 2D scatterer by near field measurements.Ó Inverse Problems in Science and Engineering 28 (2020): 427 - 447. DOI:10.1080/17415977.2019.1597872
5. Evstigneev R.O., Medvedik M.Y. Reconstruction of Inhomogeneities in a Hemisphere from the Field Measurements // Lobachevskii Journal of Mathematics. - 2019. - Vol. 40. - N. 10. - P. 1653-1659. doi: 10.1134/S1995080219100093
6. Smirnov Y.G., Medvedik M.Y., Moskaleva M.A. Two-Step Method for Permittivity Determination of an Inhomogeneous Body Placed in a Rectangular Waveguide // Lobachevskii Journal of Mathematics. - 2018. - Vol. 39. - N. 8. - P. 1140-1147. doi: 10.1134/S1995080218080115
7. Medvedik, M.Y., Smirnov, Y.G. & Tsupak, A.A. Scalar problem of plane wave diffraction by a system of nonintersecting screens and inhomogeneous bodies. Comput. Math. and Math. Phys. 54, 1280Ð1292 (2014). https://doi.org/10.1134/S0965542514080089
8. Medvedik, M.Y. A subhierarchic method for solving the Lippmann-Schwinger integral equation on bodies of complex shapes. J. Commun. Technol. Electron. 57, 158Ð163 (2012). https://doi.org/10.1134/S1064226912010123
9. V. Cherepenin et al., ÒA 3D electrical impedance tomography (EIT) system for breast cancer detection,Ó Physiol. Meas., vol. 22, no. 1, Art. no. 1, Feb. 2001, doi: 10.1088/0967-3334/22/1/302.
10. S. Havriliak and S. Negami, ÒA complex plane representation of dielectric and mechanical relaxation processes in some polymers,Ó Polymer, vol. 8, pp. 161Ð210, Jan. 1967, doi: 10.1016/0032-3861(67)90021-3.
11. Zarafshani, T. Bach, C. R. Chatwin, S. Tang, L. Xiang, and B. Zheng, ÒConditioning Electrical Impedance Mammography System,Ó Measurement, vol. 116, pp. 38Ð48, Feb. 2018, doi: 10.1016/j.measurement.2017.10.052.
12. J. Halvorsen, I. Pivac, D. Bezmalinovi?, F. Barbir, and F. Zenith, ÒElectrochemical low-frequency impedance spectroscopy algorithm for diagnostics of PEM fuel cell degradation,Ó International Journal of Hydrogen Energy, vol. 45, no. 2, Art. no. 2, Jan. 2020, doi: 10.1016/j.ijhydene.2019.04.004.
13. Mohd. Kamal, T. Sakorikar, U. M. Pal, and H. J. Pandya, ÒEngineering Approaches for Breast Cancer Diagnosis: A Review,Ó IEEE Rev. Biomed. Eng., vol. 16, pp. 687Ð705, 2023, doi: 10.1109/RBME.2022.3181700.
14. Zarafshani, ÒFeasibility Test of Quantitative Assessment of Breast Density Based on Dielectric Impedance Spectroscopy,Ó JABB, vol. 2, no. 6, Art. no. 6, Apr. 2017, doi: 10.15406/jabb.2017.02.00047.
15. M. Safronov, A. Kuzmin, O. Bodin, V. Baranov, O. Timokhina, and O. Cheban, ÒReducing of Bioimpedance Influence on ECG by Correction Filter in Mobile Heart Monitoring System,Ó in 2020 27th Conference of Open Innovations Association (FRUCT), Trento, Italy: IEEE, Sep. 2020, pp. 200Ð206. doi: 10.23919/FRUCT49677.2020.9211026.
16. Kotelnikov Institute of Radioengineering and Electronics of RAS, Mokhovaya 11-7, Moscow 125009, Russia and A. V. Korjenevsky, ÒResearch electrical impedance tomography system suitable for making in out of factory conditions.,Ó JRE, vol. 2021, no. 9, Art. no. 9, Sep. 2021, doi: 10.30898/1684-1719.2021.9.5.