Documente Academic
Documente Profesional
Documente Cultură
net/publication/283784709
CITATIONS READS
32 809
6 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Min Wu on 04 December 2015.
made with respect to the analysis results. In addition to these two with split cores can be conveniently applied to a high voltage
main components, feature extraction and representation for PD equipment without disconnecting any part of it, whereas those
signals can play an important role in data analysis, and we con- with closed cores are generally for permanent monitoring and
sider it as an independent component. This survey will mainly need to be installed prior to the operation.
focus on the pipeline with the three components for PD data
collection, representation, and analysis as illustrated in Figure 1. Acoustic Detection
Acoustic sensors can be used to detect sound waves gener-
ated by PD in sonic and ultrasonic range [14], [15]. Generally,
Detection and Localization for PD Signals acoustic sensors are applied to detect PD in switchgears, gas-
Based on different types of signals generated by PD, we insulated systems, and transformers. Acoustic sensors have sev-
summarize the PD sensing methods into three main categories: eral advantages for detecting PD signals. First, they are easily
electrical, acoustic, and chemical detection [9]. In fact, PD also installed. For example, they can be conveniently mounted on the
generates heat, and thus, thermal detection for PD using temper- outside of the tank wall of the transformer while the transformer
ature sensors sounds practicable. However, temperature sensors stays in full service. Second, acoustic sensors are immune to
have not been used for PD detection in current literatures. Next, electromagnetic interference. Third, acoustic sensors are gener-
we briefly introduce each detection method and the sensors in- ally associated with low price, but perform well for localizing
volved in each method. In addition, we also briefly introduce PD the PD sources [16]. Still, acoustic sensors suffer from their own
localization in this section. limitations, e.g., they are less sensitive due to background acous-
tic noise.
Electrical Detection Fiber-optic acoustic sensors have been developed to increase
Electrical detection of PD is based on the electrical phenom- their sensitivity for signal detection. They are becoming high-
ena accompanied by the discharge, including electromagnetic ly attractive for PD sensing due to their advantages (e.g., high
radiation and electric current pulses. sensitivity, electrical nonconductivity, and immunity to electro-
Ultrahigh frequency (UHF) sensors or UHF antennas are able magnetic interference). In [17]–[20], various fiber-optic acoustic
to detect electromagnetic waves generated by PD (usually with sensor systems have been designed for detecting and localizing
a frequency range from 300 MHz to 3 GHz). UHF sensors are PD in transformers.
often applied to detect PD in gas-insulated systems and trans-
formers. UHF sensors offer the advantage that external noise Chemical Detection
and disturbances can be shielded effectively. Therefore, UHF Chemical detection of PD is based on the fact that the dis-
sensors for PD signals have gained increasing attention recently charge may generate new chemical components via reactions.
[10], [11]. In addition, transformers with different geometry in- Dissolved gas analysis (DGA) is the main chemical detection
side will have different impedance, and UHF sensors thus need method [21]–[23], and it focuses on analyzing dissolved gases in
to be recalibrated for different types of transformers. the transformer oil. DGA methods include gas chromatography,
High frequency current transformer (HFCT) sensors detect hydrogen on-line monitoring, and photo-acoustic spectroscopy
the high frequency current pulses generated by PD. The frequen- [23]. The distribution of these gases collected from the above
cy band of HFCT sensors is usually from hundreds of kilohertz methods can help to identify different types of PD, based on the
to dozens of megahertz, which has been proven to be capable of existing IEC standards [23], [24].
capturing PD signals. HFCT sensors consist of a magnetic core Chemical detection is usually intrusive and performed for
and screened windings, and they are clamped around the case oil-filled components, e.g., transformers. This limits its applica-
ground of the components (e.g., switchgears and cable termina- tions to other components. Furthermore, chemical detection is
tions) to measure the current pulses [12], [13]. HFCT sensors not able to provide any information about the location of PD.
Table 1. The features for various partial discharge detection methods and sensors
phase window. We can thus represent the phase-resolved pattern • Pulse decay time (td): time required to decrease from 0.9A
with a 360 × 3 = 1,080 dimensional vector. Due to the curse of to 0.1A
dimensionality, it is difficult to analyze, understand, and visual- • Pulse width (tw): the time interval between 0.5A on both
ize such high dimensional data (e.g., 1,080 dimensional data as sides
introduced above). Feature extraction techniques for dimension-
ality reduction would thus be important before we can analyze Statistical Features
the PD data. In this section, we will also introduce principal Given a set of pulses, phase-resolved pattern is a good repre-
component analysis (PCA) and t-distributed stochastic neighbor sentation for them as mentioned above. The phase-resolved pat-
embedding (t-SNE) for feature dimensionality reduction. tern is first divided into many phase windows, e.g., each phase
window has 1° and we can thus have 360 phase windows. We
Pulse Characteristics can then extract some features in each phase window, e.g., the
Figure 4 shows an example PD pulse and typical features number of pulses, maximum amplitude, and average amplitude.
to describe the pulse shape. Assuming that the maximum am- The statistical moments of these features over all the phase win-
plitude of the pulse is A, we can extract the following features dows can be extracted to further represent the given set of pulses
to describe the pulse shape from the waveform, e.g., pulse rise [32].
time, decay time, and pulse width (duration) [32]. Assume that we have N phase windows, and xi is a specific
feature value for the ith phase window. The mean of xi over these
• Pulse rise time (tr): time required to increase from 0.1A to N
N phase windows would be µ = ∑ i =1 x i / N . If we consider
0.9A that phase windows may have different importance [let p(xi) be
the importance of the ith phase window], the weighted mean is
thus computed in Equation 2. For simplicity, the other statistics,
e.g., variance (σ2), skewness (γ), and kurtosis (κ), are all defined
as follows without considering the importance for phase win-
dows [33], [34].
N
∑ x i p (x i )
Weighted mean (w ∝) : w ∝ = i =N1 (2)
∑ i =1p (x i )
N 2
2
Variance (σ ) : σ 2 ∑ (x i − µ )
= i =1 (3)
N
N 3
∑ (x i − µ )
Skewness (γ ): γ = i =1 (4)
σ 3 ×N
Type I: Features extracted directly from the waveforms or phase-resolved (PR) patterns
PR patterns [7]
1
DWT = discrete wavelet transform; WPT = wavelet packet transform; PCA = principal component analysis; t-SNE = t-
distributed stochastic neighbor embedding.
k
2 Fuzzy C-Means
arg min ∑ ∑ x j − oi (8) Fuzzy C-means (FCM) is a soft version of K-means, where
C
i =1 x j ∈C i
each object has a fuzzy degree of belonging to each cluster [56].
Here, “soft” clustering means that an object can be assigned to
All the approaches [11], [44], [45], [54], [55] used K-means multiple clusters. In FCM, the optimization function in Equation
algorithm to group signals into different clusters. Their differ- 9 is extended from Equation 8.
ence is at the features extracted for clustering. In [44], PD sig-
nals were represented by a feature vector with the statistics of k N
2
the phase-resolved patterns (i.e., skewness, kurtosis, asymmetry, arg min ∑∑pijm ⋅ x j − oi (9)
P ,C
and cross-correlation factor). In [54], the authors proposed nor- i =1 j =1
malized auto-correlation function for recorded signals, which
summarizes both time and frequency domain features. In addi- In Equation 9, C = {C1, …, Ck} is the set of k clusters and oi
tion, the similarity between two signals is measured by the Pear- is the center of the cluster Ci. P = {pij} (1 ≤ i ≤ k, 1 ≤ j ≤ N, N is
son correlation coefficient between their normalized auto-corre- the total number of objects) and pij shows the probability of the
lation functions. In [45], the authors first extracted the features object xj belonging to the cluster Ci. In addition, m is a parameter
for the pulse shape (e.g., pulse rise and decay time, maximum that controls how much the weight is given to the closest center.
amplitude, and so on) and generated clusters with K-means. Using Lagrange multipliers, FCM computes P and C by itera-
Then, they merged clusters based on the distribution range of tively updating pij and oi in Equation 10 until they are converged.
1
Average-average distance (average linkage or unweighted pair group method with average,
i.e., UPGMA) A B ΣΣdist ( a, b )
a ∈ Ab ∈B
Centroid distance (unweighted pair group method with centroid, i.e., UPGMC) dist ( ca , cb )
ΣΣdist ( a, b )
2
Ward’s criterion
a ∈ Ab ∈B
Figure 6. (A) Linear support vector machines and the maximum margin principle; (B) an example of multilayer neural network; and
(C) a decision tree for the mammal classification problem.