CR16: Signal Processing and Networks Data analysis and...

70
Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion CR16: Signal Processing and Networks Data analysis and processing for networks Part 2 - Some applications of spectral graph wavelets Pierre Borgnat CR1 CNRS – Laboratoire de Physique, ENS de Lyon, Université de Lyon Équipe SI SYPHE : Signaux, Systèmes et Physique 09/2014 p. 1

Transcript of CR16: Signal Processing and Networks Data analysis and...

Page 1: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

CR16: Signal Processing and NetworksData analysis and processing for networks

Part 2 - Some applications of spectral graphwavelets

Pierre Borgnat

CR1 CNRS – Laboratoire de Physique, ENS de Lyon, Université de Lyon

Équipe SISYPHE : Signaux, Systèmes et Physique

09/2014

p. 1

Page 2: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Graph Wavelets

• Fourier is a global analysis. Fourier modes (eigenvectors ofthe laplacian) are used in classical spectral clustering, butdo not enable a jointly local and scale dependent analysis.

• For that classical signal processing (or harmonic analysis)teach us that we need wavelets.

• Wavelets : local functions that act as well as a filter arounda chosen scale.A wavelet:

– Translated:

– Scaled• Classical wavelets

by analogy−−−−−−→ Graph wavelets

p. 2

Page 3: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

The classical waveletsEach wavelet ψs,a is derived by translating and scaling a mother

wavelet ψ:ψs,a(x) =

1sψ

(x − a

s

)

Equivalently, in the Fourier domain:

ψs,a(ω) =

∫ ∞

−∞

1sψ

(x − a

s

)exp−iωx dx

= exp−iωa∫ ∞

−∞

1sψ

(Xs

)exp−iωX dX

= exp−iωa∫ ∞

−∞ψ(X ′)

exp−iωX ′dX ′

= δa(ω) ψ(sω) where δa = δ(x − a)

One possible definition: ψs,a(x) =∫∞−∞ δa(ω)ψ(sω) expiωx dω

p. 3

Page 4: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

The classical wavelets

ψs,a(x) =∫∞−∞ δa(ω)ψ(sω) expiωx dω

• In this definition, ψ(sω) acts as a filter bank defined byscaling by a factor s a filter kernel function defined inthe Fourier space: ψ(ω)

• The filter kernel function ψ(ω) is necessarily abandpass filter with:

• ψ(0) = 0 : the mean of ψ is by definition null• lim

ω→+∞ψ(ω) = 0 : the norm of ψ is by definition finite

(Note: the actual condition is the admissibility property)

p. 4

Page 5: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Classical waveletsby analogy−−−−−−→ Graph wavelets

[Hammond et al. ACHA ’11]

Classical (continuous) world Graph world

Real domain x node a

Fourier domain ω eigenvalues λi

Filter kernel ψ(ω) g(λi)⇔ G

Filter bank ψ(sω) g(sλi)⇔ Gs

Fourier modes exp−iωx eigenvectors χi

Fourier transf. of f f (ω) =∫∞−∞ f (x) exp−iωx dx f = χ> f

The wavelet at scale s centered around node a is given by:

ψs,a(x) =

∫ ∞

−∞δa(ω)ψ(sω) expiωx dω −−→ ψs,a = χ Gsδa = χ Gsχ> δa

p. 5

Page 6: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Examples of graph waveletsA WAVELET:

TRANSLATING: SCALING:

p. 6

Page 7: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Examples of wavelets: they encode the local topology

ψs=1,a

ψs=35,a

ψs=25,a

ψs=50,a

p. 7

Page 8: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Illustration on the smoothness of graph signals

fTL1f = 0.14 fTL2f = 1.31 fTL3f = 1.81

Smoothness of Graph Signals Revisited25

Intro Signal Transforms Problem Spectral Graph Theory Generalized Operators WGFT Conclusion

Incorporation of the Underlying Graph Connectivity 5

For notions of global smoothness, the discrete p-Dirichletform of f is defined as

Sp(f) :=1

p

X

i2V

kOifkp2 =

1

p

X

i2V

24X

j2Ni

Wi,j [f(j) � f(i)]2

35

p2

.

(5)

When p = 1, S1(f) is the total variation of the signal withrespect to the graph. When p = 2, we have

S2(f) =1

2

X

i2V

X

j2Ni

Wi,j [f(j) � f(i)]2

=X

(i,j)2EWi,j [f(j) � f(i)]

2= fTLf . (6)

S2(f) is known as the graph Laplacian quadratic form [17],and the semi-norm kfkL is defined as

kfkL := kL 12 fk2 =

pfTLf =

pS2(f).

Note from (6) that the quadratic form S2(f) is equal to zeroif and only if f is constant across all vertices (which is whykfkL is only a semi-norm), and, more generally, S2(f) is smallwhen the signal f has similar values at neighboring verticesconnected by an edge with a large weight; i.e., when it issmooth.

Returning to the graph Laplacian eigenvalues and eigen-vectors, the Courant-Fischer Theorem [35, Theorem 4.2.11]tells us they can also be defined iteratively via the Rayleighquotient as

�0 = minf2RN

kfk2=1

{fTLf} , (7)

and �` = minf2RN

kfk2=1f?span{u0,...,u`�1}

{fTLf} , ` = 1, 2, . . . , N � 1, (8)

where the eigenvector u` is the minimizer of the `th prob-lem. From (6) and (7), we see again why u0 is constantfor connected graphs. Equation (8) explains why the graphLaplacian eigenvectors associated with lower eigenvalues aresmoother, and provides another interpretation for why thegraph Laplacian spectrum carries a notion of frequency.

Example 1 in the box below demonstrates the importance ofincorporating the underlying graph structure when processingsignals on graphs.

F. Other Graph Matrices

The basis {u`}`=0,1,...,N�1 of graph Laplacian eigenvectorsis just one possible basis to use in the forward and inversegraph Fourier transforms (3) and (4). A second popular optionis to normalize each weight Wi,j by a factor 1p

didj

. Doing so

leads to the normalized graph Laplacian, which is defined asL := D� 1

2 LD� 12 , or, equivalently,

(Lf)(i) =1pdi

X

j2Ni

Wi,j

"f(i)p

di

� f(j)pdj

#.

G1

λ

f λ( )

G2

λ

f λ( )

G3

λ

f λ( )

Example 1 (Importance of the underlying graph):In the figure above, we plot the same signal f onthree different unweighted graphs with the same setof vertices, but different edges. The top row shows thesignal in the vertex domains, and the bottom row showsthe signal in the respective graph spectral domains.

The smoothness and graph spectral content of thesignal both depend on the underlying graph structure.In particular, the signal f is smoothest with respectto the intrinsic structure of G1, and least smooth withrespect to the intrinsic structure of G3. This can be seen(i) visually; (ii) through the Laplacian quadratic form,as fTL1f = 0.14, fTL2f = 1.31, and fTL3f = 1.81;and (iii) through the graph spectral representations,where the signal has all of its energy in the lowfrequencies in the graph spectral plot of f on G1, andmore energy in the higher frequencies in the graphspectral plot of f on G3.

The eigenvalues {�`}`=0,1,...,N�1 of the normalized graphLaplacian of a connected graph G satisfy

0 = �0 < �1 . . . �max 2,

with �max = 2 if and only if G is bipartite; i.e., the set ofvertices V can be partitioned into two subsets V1 and V2 suchthat every edge e 2 E connects one vertex in V1 and one vertexin V2. We denote the normalized graph Laplacian eigenvectorsby {u`}`=0,1,...,N�1. As seen in Figure 3(b), the spectrum ofL also carries a notion of frequency, with the eigenvectorsassociated with higher eigenvalues generally having more zerocrossings. However, unlike u0, the normalized graph Laplacianeigenvector u0 associated with the zero eigenvalue is not aconstant vector.

The normalized and non-normalized graph Laplacians areboth examples of generalized graph Laplacians [36, Section1.6], also called discrete Schrodinger operators. A generalizedgraph Laplacian of a graph G is any symmetric matrix whosei, jth entry is negative if there is an edge connecting verticesi and j, equal to zero if i 6= j and i is not connected to j, andmay be anything if i = j.

A third popular matrix that is often used in dimensionality-reduction techniques for signals on graphs is the random walkmatrix P := D�1W. Each entry Pi,j describes the probabilityof going from vertex i to vertex j in one step of a Markovrandom walk on the graph G. For connected, aperiodic graphs,each row of Pt converges to the stationary distribution of

Recall, a signal is smooth with respect to the intrinsic structure of itsunderlying graph

Similarly, the graph spectral content also depends on the underlying graph

David Shuman Signal Processing on Graphs February 11, 2013 21 / 35

p. 8

Page 9: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Kron reduction and spasificationKron reduction produces denser and denser graphs

After each reduction we use Spielman’s sparsification algorithm to obtain an equivallent but sparser graph

Approx preserves Laplacian quadratic form Explicit control based on effective

resistance of edges

Sparsification106

Intro Signal Transforms Problem Spectral Graph Theory Generalized Operators WGFT Conclusion

Further ReadingTutorial Overviews

D. I Shuman, S. K. Narang, P. Frossard, A. Ortega, P. Vandergheynst, “Signal processing on graphs:

Extending high-dimensional data analysis to networks and other irregular domains,” Signal Process. Mag.,to appear May 2013.

R. Rubinstein, A. M. Bruckstein, and M. Elad, “Dictionaries for sparse representation modeling,” Proc.

IEEE, vol. 98, no. 6, pp. 1045–1057, Jun. 2010.

Spectral Graph Theory and Graph Laplacian Eigenvectors

F. K. Chung, Spectral Graph Theory, vol. 92 of the CBMS Reg. Conf. Ser. Math., AMS Bokstore, 1997.

T. Bıyıkoglu, J. Leydold, and P. F. Stadler, Laplacian Eigenvectors of Graphs, Springer, 2007.

D. Spielman, “Spectral graph theory” in Combinatorial Scientific Computing, Chapman and Hall, 2012.

Dictionaries for Signals on Graphs

R. R. Coifman and M. Maggioni, “Di↵usion wavelets,” Appl. Comput. Harmon. Anal., vol. 21, no. 1, pp.

53–94, Jul. 2006.

D. K. Hammond, P. Vandergheynst, and R. Gribonval, “Wavelets on graphs via spectral graph theory,”

Appl. Comput. Harmon. Anal., vol. 30, no. 2, pp. 129–150, Mar. 2011.

S. K. Narang and A. Ortega, “Perfect reconstruction two-channel wavelet filter banks for graph structured

data,” IEEE Trans. Signal Process., vol. 60, pp. 2786–2799, Jun. 2012.

D. I Shuman, B. Ricaud, and P. Vandergheynst, “A windowed graph Fourier transform,”in Proc. IEEE Stat.

Signal Process. Wrkshp., Ann Arbor, MI, Aug. 2012.

David Shuman Signal Processing on Graphs February 11, 2013 34 / 35

Spielman and Srivastava, Graph sparsification by effective resistances, SIAM J. Comp, 2011

7

to one over the number of vertices in the complete subgraph.This class of graphs highlights two of the main weaknesses

of the Kron reduction: (i) it does not always preserve regularstructural properties of the graph; and (ii) it does not alwayspreserve the sparsity of the graph. We discuss an alternativegraph reduction method that does preserve the tree structure inSection VI-D, and a sparsity enhancing modification in SectionIV-D.

5) k-Regular Bipartite Graphs: In [18], Narang and Ortegaconsider connected and unweighted k-RBGs.8 They downsam-ple by keeping one subset of the bipartition, and they constructa new graph on the downsampled vertices V1 by linkingvertices in the reduced graph with an edge whose weight isequal to the number of their common neighbors in the originalgraph. If the vertices of the original graph are rearranged sothat all the vertices in V1 have smaller indices than all thevertices in Vc

1 , the adjacency and Laplacian matrices of theoriginal graph can be represented as:

W =

0 W1

WT1 0

�and L =

"kIN

2�W1

�WT1 kIN

2

#. (11)

Then for all i, j 2 V1 (with i 6= j), the i, jth entry of theadjacency matrix of the reduced graph is given by

W kRBG�reducedij (V1) = (W1W

T

1 )ij . (12)

They also show that

LkRBG�reduced(V1) = k2IN2� W1W

T

1. (13)

Now we examine the Kron reduction of k-RBGs. The Kron-reduced Laplacian is given by:

LKron�reduced(V1) = LV1,V1� LV1,V2

L�1V2,V2

LV2,V1

= kIN2� (�W1)(kIN

2)�1(�WT

1)

= kIN2� 1

kW1W

T

1,

which is a constant factor 1k times the reduced Laplacian (13)

of [18]. So, up to a constant factor, the Kron reduction is ageneralization of the graph reduction method presented in [18]for the special case of regular bipartite graphs.

D. Graph Sparsification

As a consequence of property (K5), repeated Kron reductionoften leads to denser and denser graphs. We have already seenthis loss of sparsity in Section IV-C4, and this phenomenonis even more evident in larger, less regular graphs. In addi-tion to computational drawbacks, the loss of sparsity can beimportant, because if the reduced graphs become too dense,it may not effectively capture local connectivity informationthat is important for processing signals on the graph. There-fore, in many situations, it is advantageous to perform graphsparsification immediately after the Kron reduction as part ofthe overall graph reduction phase.

8Although [18] considers unweighted k-RBGs, the following statementsalso apply to weighted k-RBGs if we extend the definition of the reducedadjacency matrix (12) to weighted graphs.

Algorithm 1 Spectral Sparsification [30]Inputs: G = {V, E ,W}, QOutput: W0

1: Initialize W0 = 02: for q = 1, 2, . . . , Q do3: Choose a random edge e = (i, j) of E according to the

probability distribution

pe =dRG (i, j)WijP

e=(m,n)2EdRG (m, n)Wmn

4: W 0ij = W 0

ij +Wij

Qpe

5: end for

(a) (b) (c)

(d) (e) (f)

Fig. 5. Incorporation of a spectral sparsification step into the graph reduction.(a)-(c) Repeated largest eigenvector downsampling and Kron reduction of asensor network graph. (d)-(f) The same process with the spectral sparsificationof [30] used immediately after each Kron reduction.

There are numerous ways to perform graph sparsification.In this paper, we use a straightforward spectral sparsificationalgorithm of Spielman and Strivastava [30], which is describedin Algorithm 1. This sparsification method pairs nicely withthe Kron reduction, because [30] shows that for large graphsand an appropriate choice of the number of samples Q, thegraph Laplacian spectrum and resistance distances betweenvertices are approximately preserved with high probability.In Figure 5, we show an example of repeated downsamplingfollowed by Kron reduction and spectral sparsification.

E. Alternative Graph Reduction Methods

First, we mention some alternative graph reduction methods:1) In [8], Narang and Ortega define a reduced graph via

the weighted adjacency matrix by taking W(j+1) =�[W(j)]2

�V1,V1

. Here, [W(j)]2 represents the 2-hop ad-jacency matrix of the original graph. The reduced Lapla-cian can then be defined as L(j+1) = D(j+1)�W(j+1),where D(j+1) is computed from W(j+1). However,there are a number of undesirable properties of thisreduction method. First, and perhaps foremost, the re-duction method does not always preserve connectivity.Second, self-loops are introduced at every vertex in thereduced graph. Third, vertices in the selected subsetthat are connected by an edge in the original graphmay not share an edge in the reduced graph. Fourth,

p. 9

Page 10: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Graph and Signal graph coarsening

Example104

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

Coarse !Approximations

Prediction !Errors

x x x

y y y

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

−1

−0.8

−0.6

−0.4

−0.2

0

0.2

0.4

0.6

0.8

1

2642 1334 669 337

p. 10

Page 11: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Graph and Signal graph coarsening105

p. 11

Page 12: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Graph wavelets for brain fMRI dataExample16

vendredi, 17 juin 2011p. 12

Page 13: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Purpose of the last part of the lectureDevelop a scale dependent community mining tool using

concepts from graph signal processing.Why ? For joint processing of graph signals and networks.

General Ideas• Take advantage of local topological information encoded in

Graph Wavelets.Wavelet = ego-centered vision from a node

• Group together nodes whose local environments aresimilar at the description scale

• This will naturally offer a multiscale vision of communities

p. 13

Page 14: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Let us recall: objective of community detection

Three examples of community detections:

• (A) A complex sensor network (non-uniform swiss rolltopology)

• (B) A contact network in a primary school [Stehle ’11]• (C) A hierarchical graph benchmark [Sales-Pardo ’07]

A

B C

p. 14

Page 15: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

or multiscale community detection ?

p. 15

Page 16: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Example of filters for community detection

For each graph under study, we automatically find the goodfilter parameters for g by imposing:• The coarsest scale needs to be focused on the eigenvectorχ1 (Fiedler vector).

• All scales need at least to keep some information from χ1.• The finest scale needs to use the information from all

eigenvectors (i.e., Fourier modes).

0 10 200

2

4

6

8

λ

g(s

λ)

0 10 200

2

4

6

8

λ

h(s

λ)

p. 16

Page 17: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Example of wavelet filters• More precisely, we will use the following kernel:

g(x ;α, β, x1, x2) =

x−α1 xα for x < x1

p(x) for x1 ≤ x ≤ x2

xβ2 x−β for x > x2.

• The parameters will be:

smin =1λ2, x2 =

1λ2, smax =

1λ2

2, x1 = 1, β = 1/log10

(λ3

λ2

)

• This leads to: (α = 2)

0 1 20

2

4

6

8

λ

g(s

λ)

s=7

s=13

s=25

s=47

0

2

4

6

8

10

x1

x2 x

g(x

)

α=1

α=2

α=10

α=50

p. 17

Page 18: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Multiscale community structure in a graph

Classical community detection algorithm based (for instance)on modularity optimisation only finds one solution:

A

B C

Where the modularity function reads:Q = 1

2N∑

ij

[Aij − di dj

2N

]δ(ci , cj)

p. 18

Page 19: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

A new method for multiscale community detection[N. Tremblay, P. Borgnat, 2013]

The problem of community mining is considered as a problemof clustering. We then need to decide upon:

1. feature vectors for each node2. a distance to measure two given vectors’ closeness3. a clustering algorithm to separate nodes in clusters

The method uses:1. wavelets as feature vectors2. the correlation distance3. the complete linkage clustering algorithm

p. 19

Page 20: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

1) Wavelets as featuresEach node a has feature vector ψs,a.Globally, one will need Ψs, all wavelets at a given scale s, i.e.

Ψs =(ψs,1|ψs,2| . . . |ψs,N

)= χGsχ

>.

NODE

A:

NODE

B:

AT SMALL SCALE: AT LARGE SCALE:

p. 20

Page 21: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

2) Correlation distances

Ds(a,b) = 1−ψ>s,aψs,b

||ψs,a||2 ||ψs,b||2.

NODE

A:

NODE

B:CORR.COEF.:

RESULT:

-0.50

Far appart in thedendrogram

0.97

Close to each otherin the dendrogram

p. 21

Page 22: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

3) Complete linkage clustering and dendrogram

• It is a bottom to top hierarchical algorithm: it starts with asmany clusters as nodes and works its way up to fewerclusters (by linking subclusters together) until it reachesone global cluster.

• To compute the distance between two subclusters underexamination : all possible pairs of nodes, taking one fromeach cluster, are considered. The maximum possiblenode-to-node distance is declared to be thecluster-to-cluster closeness.

• Outputs a dendrogram (from Greek dendron "tree" andgramma "drawing").

p. 22

Page 23: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Example of a dendrogram at a given scale s

0

0.5

1

1.5

2

corr

ela

tion d

ista

nce

The big question: where should we cut the dendrogram?

p. 23

Page 24: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

A toy graph for introducing the methodsmallest scale (16 com.): small scale (8 com.):

medium scale (4 com.): large scale (2 com.):

p. 24

Page 25: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledgeLet us cheat by using prior knowledge on the number ofcommunities we are looking for.If we cut each dendrogram in two clusters

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

Using wavelets as featuresConclusion: the dendrograms at different scales contain thecommunity structure at various scales.

p. 25

Page 26: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledgeLet us cheat by using prior knowledge on the number ofcommunities we are looking for.If we cut each dendrogram in four clusters

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

Using wavelets as featuresConclusion: the dendrograms at different scales contain thecommunity structure at various scales.

p. 25

Page 27: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledgeLet us cheat by using prior knowledge on the number ofcommunities we are looking for.If we cut each dendrogram in eight clusters

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

Using wavelets as featuresConclusion: the dendrograms at different scales contain thecommunity structure at various scales.

p. 25

Page 28: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledgeLet us cheat by using prior knowledge on the number ofcommunities we are looking for.If we cut each dendrogram in sixteen clusters

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

Using wavelets as featuresConclusion: the dendrograms at different scales contain thecommunity structure at various scales.

p. 25

Page 29: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with prior knowledgeLet us cheat by using prior knowledge on the number ofcommunities we are looking for.The four levels of communities.

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

Using wavelets as features

Conclusion: the dendrograms at different scales contain thecommunity structure at various scales.

p. 25

Page 30: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Recall: The Adjusted Rand IndexLet:

• C and C′ be two partitions we want to compare.• a be the # of pairs of nodes that are in the same

community in C and in the same community in C′• b be the # of pairs of nodes that are in different

communities in C and in different communities in C′• c be the # of pairs of nodes that are in the same

community in C and in different communities in C′• d be the # of pairs of nodes that are in different

communities in C and in the same community in C′

a + b is the number of “agreements“ between C and C′.c + d is the number of “disagreements“ between C and C′.

p. 26

Page 31: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

The Adjusted Rand Index

The Rand index, R, is:

R =a + b

a + b + c + d=

a + b(n2

)

The Adjusted Rand index AR is the corrected-for-chanceversion of the Rand index:

AR =R − ExpectedIndex

MaxIndex − ExpectedIndex

p. 27

Page 32: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with classical modularity

Recall that the classical modularity matrix reads:B(A) = 1

2m(A + dd>

2m )

where d is the strength vector and 2m =∑

d(i)

Classical modularity is

Q = Tr(S>BS)

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

Solution not good at large scale.p. 28

Page 33: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut with filtered modularity

We define the filtered adjacency matrices at scale s:

• recall that A = D12χ(I − Λ)χ>D

12

• Ags = A + D

12χGsχ

>D−12 A

We define the filtered modularity matrices at scale s:

Bgs = B(Ag

s )

p. 29

Page 34: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity

Filtered Modu Opt. Classical Modu Opt.

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

p. 30

Page 35: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Notes about the filtered modularity

Ags = A + D

12χGsχ

>D−12 A

Consider d the vector of strengths of A and 2m the sum of thestrengths. The classical modularity reads:

B =A

2m− dd>

(2m)2

Consider d ′ the vector of strengths of Ags and 2m′ the sum of

the strengths. We can show that:

dd>

(2m)2 =d ′d ′>

(2m′)2

Moreover, if gs(1) = 0 (which is the case), the filteredmodularity reads:

Bgs =

A + D12χGsχ

>D−12 A

2m− dd>

(2m)2p. 31

Page 36: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Notes about the filtered modularity

Bgs =

A + D12χGsχ

>D−12 A

2m− dd>

(2m)2

Recall that modularity compares the actual normalised weightAij2m to the expected weight if the graph was a random Chung-Lugraph: di dj

(2m)2 .The filtered modularity does not change the expected weightbut rather changes the actual normalised weigth: it adds orretrieve value to Aij

2m . At small scale, it will increase the weightsimportant for small scale structures and decrease the weightsimportant for superstructures.

p. 32

Page 37: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Notes about the filtered modularityIt can be written:

Bgs =

12m

N∑

i=2

(1 + gs(i))(1− λi)D12χiχ

>i D

12

To compare to Schaub-Delvenne’s filtered modularity:

Bt =1

2m

N∑

i=2

(1− λi)tD

12χiχ

>i D

12

And Arenas’ version: (here for regular networks)

Bα =1

2m

N∑

i=2

(1− λi

α)D

12χiχ

>i D

12

p. 33

Page 38: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity on Sales-Pardo network

2 10 40 1600

0.5

1

# of clusters

Q/m

ax(Q

)

2 10 40 1600

0.5

1

# of clusters

Qh s/m

ax(Q

h s)

p. 34

Page 39: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity on Sales-Pardo network

0 1 2 40

0.5

1

s

nm

i

fine

interm.

coarse

p. 35

Page 40: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Maximize filtered modularity on Sales-Pardo network

0.4 0.8 1.7 3.61

10

100

s

<# o

f com

.>

0.4 0.8 1.7 3.6

10

100

1000

s

<C

om

. siz

e>

p. 36

Page 41: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Intermediate bilan

• As expected, the method works week with filteredmodularity

• Fundamental reason: it is related to Arenas orSchaub-Delvenne modified modularity to take into accounta scale

• However: the dendrogram has already in itself the goodsolutions, with no need of the step of (filtered) modularityoptimization.

• For that: look at the gaps !

p. 37

Page 42: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

To avoid the cumbersome multiscale modularity optimization,we can simply cut the dendrogram at its maximal gap.

At small scale:

0

0.5

1

1.5

2

co

rre

latio

n d

ista

nce

At large scale:

0

0.5

1

1.5

2

co

rre

latio

n d

ista

nce

p. 38

Page 43: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

To avoid the cumbersome multiscale modularity optimization,we can simply cut the dendrogram at its maximal gap.

At small scale:

0

0.5

1

1.5

2

co

rre

latio

n d

ista

nce

At large scale:

0

0.5

1

1.5

2

co

rre

latio

n d

ista

nce

p. 38

Page 44: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

Using wavelets

p. 39

Page 45: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap

scale number

no

de

s

10 20 30 40

20

40

60

80

100

120

Using wavelets

p. 40

Page 46: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal gap: non robust to outliers

0

0.5

1

1.5

co

rre

latio

n d

ista

nce

nodes

p. 41

Page 47: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal average gap

0 0.2 0.4 0.6 0.8 10

0.5

1

correlation distance

Γa

Γ =1

Nmax(corr. dist.)

a∈VΓa

At small scale

0 0.2 0.4 0.6 0.8 10

0.5

1

correlation distance

Γ

0

0.5

1

corr

ela

tion d

ista

nce

nodes

p. 42

Page 48: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal average gap

0 0.2 0.4 0.6 0.8 10

0.5

1

correlation distance

Γa

Γ =1

Nmax(corr. dist.)

a∈VΓa

At larger scale

0 0.5 1 1.50

0.5

1

correlation distance

Γ

0

0.5

1

1.5

corr

ela

tion d

ista

nce

nodes

p. 43

Page 49: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Dendrogram cut at maximal average gap

For the previous graph:

0 0.5 10

0.5

1

correlation distance

Γ

p. 44

Page 50: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Comparison maximal gap vs. filtered modularity

Maximal Gap Filtered Modu Opt. Classical Modu Opt.

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

p. 45

Page 51: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Comparison maximal gap vs. filtered modularity

Maximal Gap Filtered Modu Opt. Classical Modu Opt.

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

p. 45

Page 52: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Comparison maximal gap vs. filtered modularity

Maximal Gap Filtered Modu Opt. Classical Modu Opt.

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

0 10 20 30 40 50

0

0.2

0.4

0.6

0.8

1

scale number

AR

co

eff

icie

nt

p. 45

Page 53: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Multiscale community detection on a simple network

scale number

no

de

s10 20 30

20

40

60

80

100

120

Another toy graph Using wavelets

p. 46

Page 54: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Multiscale community detection on more elaboratenetworks

p. 47

Page 55: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

The Sales-Pardo benchmark• Three community structures nested in one another• Parameters:

• sizes of the communities (N = 640)• ρ tunes how well separated the different scales are• k is the average degree; the sparser is the graph, the

harder it is to recover the communities.

p. 48

Page 56: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Results on the Sales-Pardo benchmark

10 15.8 25.1 39.80

0.5

1

scale s

Ad

j. R

an

d in

de

x

Large Scale

Medium Scale

Small Scale

23 1

p. 49

Page 57: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Results on the Sales-Pardo benchmark

11 13 15 170.9

0.95

1

<L

. sc.

Re

ca

ll>

k

Gr. wav. correlation

Schaub

Arenas

11 13 15 170.97

0.98

0.99

1

<M

. sc.

Re

ca

ll>

k

Gr. wav. correlation

Schaub

Arenas

11 13 15 170.92

0.94

0.96

0.98

1

<S

. sc.

Re

ca

ll>

k

Gr. wav. correlation

Schaub

Arenas

p. 50

Page 58: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

The case of larger networks

• Limit of the method: computation of the N ×N matrix of thewavelets Ψs.

• Improvement: use of random features.• Let r ∈ RN be a random vector on the nodes of the graph,

composed of N independent normal random variables ofzero mean and finite variance σ2.

• Define the feature fs,a ∈ R at scale s associated to node aas

fs,a = ψ>s,ar =N∑

k=1

ψs,a(k)r(k).

p. 51

Page 59: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

The case of larger networks• Let us define the correlation between features

Cor(fs,a, fs,b)=E((fs,a − E(fs,a))(fs,b − E(fs,b)))√

Var(fs,a)Var(fs,b).

• It is easy to show that:

Cor(fs,a, fs,b) =ψ>s,aψs,b

||ψs,a||2 ||ψs,b||2.

• Therefore, the sample correlation estimator Cab,η satisfies:

limη→+∞

Cab,η =ψ>s,aψs,b

||ψs,a||2 ||ψs,b||2= 1− Ds(a,b).

• This leads to a faster algorithm.

p. 52

Page 60: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Results on the Sales-Pardo benchmark

• As a function of η, the number of random vectors used

20 40 60 80 1000

0.5

1

η

<ra

tio

s>

LS Recall

MS Recall

SS Recall

20 40 60 80 100

3.23.43.63.8

44.2

η

<co

mp

. tim

e>

(se

c)

p. 53

Page 61: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Stability of the communities• Not all partitions are relevant: only those stable enough

convey information about the network• Lambiotte’s approach to stability:

Create B resampled graphs by randomly adding ±p%(typically p = 10) to the weight of each link and computingthe corresponding B sets of partitions {Pb

s }b∈[1,B],s∈S .Then, stability:

γr (s) =2

B(B − 1)

(b,c)∈[1,B]2,b 6=c

ari(Pbs ,P

cs ), (1)

• New approach: we have a stochastic algorithm.Consider J sets of η random signals and compute theassociated sets of partitions {P j

s}j∈[1,J],s∈S . Let stability be:

γa(s) =2

J(J − 1)

(i,j)∈[1,J]2,i 6=j

ari(P is,P

js). (2)

p. 54

Page 62: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Results with stabilities on the Sales-Pardo benchmark

10 15.8 25.1 39.80

0.5

1

scale s

Adj. R

and index

Large Scale

Medium Scale

Small Scale

23 1

10 15.8 25.1 39.80

0.5

1

scale s

Insta

bilitie

s

1−γr

1−γa

123

p. 55

Page 63: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

In addition: statistical test of relevance of thecommunities

• It is possible to design a data-driven test on γa (notexplained here).

• Result: threshold for 1− γa above which the partition incommunities is irrelevant.

Sales-Pardo graph Chung-Lu graph

10 15.8 25.1 39.80

0.5

1

scale s

1−

γa

3 2 1

1.9 2.2 2.5

0

0.5

1

scale s

1−

γa

p. 56

Page 64: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Comparison on larger Sales-Pardo graphs

N = 6400 nodes

Schaub-Delvenne

0 0.1 10

0.5

1

Markov time

Ad

j. R

an

d in

de

x

LS

MS

SS

0 0.1 10

0.5

1

Markov time

Va

r. I

nfo

rma

tio

n

Wavelets

6.3 10 15.8 25.10

0.5

1

scale s

Ad

j. R

an

d in

de

x

LS

MS

SS

6.3 10 15.8 25.10

0.5

1

scale s

1−

γa

p. 57

Page 65: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

More elaborate graphs

p. 58

Page 66: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Sensor network on the swiss roll manifold• Three scale ranges of relevant community structure

10000 100000 1e+06 1e+070

0.1

0.2

0.3

0.4

scale s

1−

γa

p. 59

Page 67: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

The dynamic social network of a primary school

Collaboration with A. Barrat (CPT Marseille)

p. 60

Page 68: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Multi-scale Communities in Primary School

Collaboration with A. Barrat (CPT Marseille)

20 28 37 51 74 1030

0.5

1

scale s

1−

γa

scale s20 28 37 51 74 103

1st a

1st b

2nd a

2nd b

3rd a

3rd b4th a4th b

5th a

5th b

p. 61

Page 69: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Multi-scale Communities in Primary SchoolCollaboration with A. Barrat (CPT Marseille)

p. 62

Page 70: CR16: Signal Processing and Networks Data analysis and ...perso.ens-lyon.fr/pierre.borgnat/MASTER2/lecture_DataAnalysis_Net… · Signal Process. Wrkshp.,AnnArbor,MI,Aug.2012. David

Graph Wavelets Some illustrations Multiscale community mining Developments; Stability of communities Conclusion

Conclusion

• Wavelet ψs,a gives an ”egocentered view“ of the networkseen from node a at scale s

• Correlation between these different views gives us adistance between nodes at scale s

• This enables multi-scale clustering of nodes incommunities

• I hope that you were attracted tothe emerging field of graph signal processing for networks.

http://perso.ens-lyon.fr/pierre.borgnat

Acknowledgements: thanks to Nicolas Tremblay for borrowingmany of his figures or slides.

p. 63