1. Simply Shapes
This is a simple memo.

2. Classical shape analysis methods

Circularity: The degree to which a polygon resembles a circle, where 1 is a perfect circle and 0.492 is an isosceles triangle.
$$C = \frac{4 \pi s}{p^2}$$
s: object area; p: object perimeter

Irregularity: A measure of the irregularity of a solid, calculated from its perimeter and the perimeter of the surrounding circle. The minimum irregularity is a circle, corresponding to the value 1. A square has the maximum irregularity, with a value of 1.402.
$$I = \frac{p_c}{p}$$
p_c: perimeter of the surrounding circle

Quadrature: The degree of quadrature of a solid, where 1 is a square and 0.800 is an isosceles triangle.
$$Q = \frac{p}{4 \sqrt{s}}$$

Elongation: The degree of ellipticity of a solid, where a circle and a square are the least elliptic shapes.
$$E = \frac{D}{d}$$
D: maximum diameter within an object; d: minimum diameter perpendicular to D

3. The Workflow of Morphometric Analysis for Shape
Original Shape (Polygon) → Fourier Transform → Inverse Fourier Transform → Approximate Shape (Polygon) → Procrustes Analysis
→ Distance Matrix → Test the number of clusters → Clustering by PAM → Assign class info to each object → Visualize on geo-space

4. Fourier descriptors of closed polygons
The Fourier transform represents any periodic function as an infinite sum of trigonometric functions, whose coefficients are termed Fourier descriptors. Because the outline of a polygon can be expressed as a periodic function once it is decomposed into its X and Y coordinates, the method can be applied to polygons. The coefficients are computed separately for each axis.

X axis:
$$f(x) = \frac{a_0}{2} + \sum_{n=1}^{\infty} \left( a_n \cos\frac{2 \pi n t}{L} + b_n \sin\frac{2 \pi n t}{L} \right)$$
Y axis:
$$g(y) = \frac{a_0}{2} + \sum_{n=1}^{\infty} \left( a_n \cos\frac{2 \pi n t}{L} + b_n \sin\frac{2 \pi n t}{L} \right)$$
Here t is the position along the outline and L is the period.

[Figure: polygon org_58 with vertices (x_i, y_i); panels "X axis" and "Y axis" plot each coordinate against t(i).]

5. Original Shape

6. Simplifying with approximate Shape
Increasing the number of harmonics and the number of approximation points brings the approximate shape closer to the original shape.

7.
Inverse Fourier Transform
The original polygons can be approximately reconstructed. To reconstruct a shape, the number of points must be specified, and the points are placed at a constant angular spacing around a circle.
$$x_j = \sum_{i=2}^{H} \left( a_i \cos\frac{2 \pi i j}{L} + b_i \sin\frac{2 \pi i j}{L} \right) + c_x$$
$$y_j = \sum_{i=2}^{H} \left( c_i \cos\frac{2 \pi i j}{L} + d_i \sin\frac{2 \pi i j}{L} \right) + c_y$$

[Figure: polygon org_58 showing the Original Shape, the First Approximate Ellipse and the Approximate Shape with points (x_j, y_j); inset "Approximate with 10 points" drawn on a unit circle with axes x and y.]

8. Procrustes Analysis
The aim is to obtain a similar placement and size between two shapes by minimizing a measure of shape difference, called the Procrustes distance, between the objects. To conduct this analysis, both shapes must have the same number of control points.

Calculate the root mean square distance for uniform scaling:
$$s = \sqrt{\frac{\sum_{i=1}^{n} \left[ (x_i - \bar{x})^2 + (y_i - \bar{y})^2 \right]}{n}}$$
Translate and scale uniformly:
$$(u_i, w_i) = \left( \frac{x_i - \bar{x}}{s}, \frac{y_i - \bar{y}}{s} \right)$$
Find the optimum angle of rotation, the one that minimizes the sum of the squared distances between corresponding points. Here (u_i, w_i) are the points of the shape being rotated and (x_i, y_i) are the corresponding points of the other shape.
$$\theta = \tan^{-1} \frac{\sum_{i=1}^{n} (u_i y_i - w_i x_i)}{\sum_{i=1}^{n} (u_i x_i + w_i y_i)}$$
Then the optimum coordinates are assigned by the following formula:
$$(\eta_i, \nu_i) = (\cos\theta \, u_i - \sin\theta \, w_i, \; \sin\theta \, u_i + \cos\theta \, w_i)$$
The dissimilarity between two shapes is measured as the squared distance:
$$d = \sum_{i=1}^{n} \left[ (\eta_i - x_i)^2 + (\nu_i - y_i)^2 \right]$$

9. Procrustes Analysis
[Figure: polygons org_58 and org_2570 and their superimposition; panel "Procrustes errors" with axes Dimension 1 and Dimension 2. Sum of squares: 1.758e-06.]

10. Partitioning Around Medoids (PAM)
Partitioning Around Medoids (PAM) is a clustering algorithm that, like k-means, attempts to minimize the squared error.
In contrast to k-means, PAM chooses existing data points as centers, termed medoids, and is more robust to noise and outliers than k-means.
$$\underset{S}{\arg\min} \sum_{i=1}^{k} \sum_{x_j \in S_i} \lVert x_j - m_i \rVert$$
where m_i is the medoid of S_i.

Silhouette plot of pam(x = tokyo.dist^2, k = 5): n = 4373, 5 clusters C_j
j : n_j | ave_{i in C_j} s_i
1 : 1388 | 0.62
2 : 740 | 0.41
3 : 1070 | 0.44
4 : 693 | 0.41
5 : 482 | 0.35
Average silhouette width: 0.48
[Figure: silhouette plot, x-axis "Silhouette width s_i" from -0.2 to 1.0.]

$classinfo (output of PAM clustering)
      size   max_diss   av_diss    diameter    separation
[1,]  1388   65.80418.27153        193.8786    0.2096066
[2,]   740   239.5017   29.9133    463.227     0.1864726
[3,]  1070   200.8129   31.75182   429.5183    0.2096066
[4,]   693   737.1965   30.68781   1044.5552   0.1864726
[5,]   482   460.6608   46.2136    803.3625    0.3181256

11. Silhouette Width - Test the number of clustering -
For each datum i, first calculate the average dissimilarity within its own class K_i, which contains n(k) data:
$$a(i) = \frac{1}{n(k)} \sum_{a_j \in K_i} (a_i - a_j)^2$$
Then calculate the lowest average dissimilarity between datum i and the data b_j of any other cluster K_j:
$$b(i) = \min_{K_j \neq K_i} \frac{1}{n(k_j)} \sum_{b_j \in K_j} (a_i - b_j)^2$$
The index of clustering efficiency at datum i is the silhouette width:
$$S(i) = \frac{b(i) - a(i)}{\max\{ a(i), b(i) \}} \qquad (-1 \le S(i) \le 1)$$
The index of clustering efficiency for each cluster k is the average silhouette width:
$$S_k = \frac{1}{n(k)} \sum_{i \in K_k} S(i) \qquad (-1 \le S_k \le 1)$$
[Figure: four example clusters labelled A, B, C and D (k = 4).]

12. Average Silhouette Width
$$S(i) = \begin{cases} 1 - \dfrac{a(i)}{b(i)} & \text{if } a(i) < b(i) \\ 0 & \text{if } a(i) = b(i) \\ \dfrac{b(i)}{a(i)} - 1 & \text{if } a(i) > b(i) \end{cases}$$
[Figure: "Average Silhouette Width": silhouette width averaged with PAM for 2 to 50 clusters (N = 50); y-axis res$sil from 0.40 to 0.48, x-axis Index from 0 to 50. The highest average width is at 5 clusters.]
The average silhouette width suggests that the number of clusters is 5.

13. Clustering by PAM
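
The silhouette width from slide 11 can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the code behind the slides: the function names `silhouette_widths` and `average_width` and the toy data are hypothetical. It follows the common convention in which a(i) averages over the other members of the datum's own cluster, S(i) = (b(i) - a(i)) / max{a(i), b(i)}, and a datum that is alone in its cluster gets width 0.

```python
from typing import Dict, List, Sequence


def silhouette_widths(diss: Sequence[Sequence[float]],
                      labels: Sequence[int]) -> List[float]:
    """Return the silhouette width S(i) of every datum.

    diss   -- symmetric matrix of pairwise dissimilarities (hypothetical input)
    labels -- cluster label of each datum, in the same order as the rows of diss
    """
    # Group datum indices by cluster label.
    clusters: Dict[int, List[int]] = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette width needs at least two clusters")

    widths = []
    for i, lab in enumerate(labels):
        own = [j for j in clusters[lab] if j != i]
        if not own:
            # Assumption: a datum alone in its cluster gets width 0.
            widths.append(0.0)
            continue
        # a(i): average dissimilarity to the rest of the datum's own cluster.
        a = sum(diss[i][j] for j in own) / len(own)
        # b(i): lowest average dissimilarity to any other cluster.
        b = min(
            sum(diss[i][j] for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        widths.append((b - a) / denom if denom > 0 else 0.0)
    return widths


def average_width(widths: Sequence[float]) -> float:
    """Average silhouette width over all data."""
    return sum(widths) / len(widths)


# Toy example with made-up data: two well-separated groups on a line,
# using squared distances as the dissimilarity.
points = [0.0, 1.0, 10.0, 11.0]
diss = [[(p - q) ** 2 for q in points] for p in points]
labels = [0, 0, 1, 1]
widths = silhouette_widths(diss, labels)
print(widths, average_width(widths))
```

Repeating this for each candidate number of clusters and keeping the one with the highest average width mirrors the selection step on slide 12.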