Modified Principal Points: A Flexible and Differentiable Approach for Data Summarization
Abstract
Principal points∗are a small set of characteristic locations which minimize the average squared Euclidian distance from the data points, and should be more informative about the data’s structure than simple features such as mean and variance. However it is also non-differentiable w.r.t point collapse due to minimum operation in (2.8) and weak-sparse in defining point spread penalty. In this paper we define the generalized principal points as the Gaussian weighted mean of distances. It results in a differentiable objective and has a tuning parameter for point closeness adjustment. When the bandwidth approaches to zero, modified points tend towards classical points from a real direction and when it tends to the infinity they define the mean. Finally, the simulation studies are reported and reveal higher robustness of our proposed methods against outliers, non-normality, and small sample sizes. Empirical studies on real statistical data also confirm that lower sensitivity to extremes is better
References
J. O. Berger, Statistical Decision Theory and Bayesian Analysis, 2nd ed. New York, NY, USA: Springer Science & Business Media, 1985.
G. Casella and R. L. Berger, Statistical Inference, 2nd ed. Pacific Grove, CA, USA: Duxbury-Thomson Learning, 2002.
S. Chakraborty, K. Das, and J. Li, “On properties and estimation of principal points for univariate distributions,” Statistical Methodology, vol. 47, pp. 1–18, 2020.
B. Flury, “Principal points,” Biometrika, vol. 77, no. 1, pp. 33–41, 1990.
X. Gu and T. Tarpey, “On the computation of principal points and self-consistent points for univariate distributions,” J. Comput. Graph. Stat., vol. 20, no. 4, pp. 1020–1037, 2011.
P. J. Huber, Robust Statistics. New York, NY, USA: Wiley, 1981.
H. Jeffreys, Theory of Probability, 3rd ed. Oxford, UK: Oxford Univ. Press, 1961.
E. L. Lehmann and G. Casella, Theory of Point Estimation, 2nd ed. New York, NY, USA: Springer-Verlag, 1998.
L. Li and B. Flury, “Uniqueness of principal points for univariate distributions,” Statist. Probab. Lett., vol. 23, pp. 361–364, 1995.
S. Matsuura, H. Kurata, and T. Tarpey, “Optimal estimators of principal points for minimizing expected mean squared distance,” J. Statist. Plann. Inference, vol. 167, pp. 102–122, 2015.
W. Mendenhall, R. J. Beaver, and B. M. Beaver, Introduction to Probability and Statistics, 13th ed. Belmont, CA, USA: Cengage Learning, 2009.
D. W. Scott, Multivariate Density Estimation: Theory, Practice, and Visualization. New York, NY, USA: Wiley, 1992.
B. W. Silverman, Density Estimation for Statistics and Data Analysis. London, UK: Chapman and Hall/CRC, 1986.
E. Stampfer and E. Stadlober, “Methods for estimating principal points,” J. Statist. Plann. Inference, vol. 64, pp. 101–120, 1997.
T. Tarpey, “Two principal points of symmetric, strongly unimodal distributions,” Statist. Probab. Lett., vol. 20, pp. 253–257, 1994.
T. Tarpey and B. Flury, “Self-consistency: A fundamental concept in statistics,” Statist. Sci., vol. 11, no. 3, pp. 229–243, 1996.
T. Tarpey and I. Ivey, “Principal points of a multivariate mixture distribution,” J. Multivar. Anal., vol. 97, no. 8, pp. 1767–1783, 2006.
T. Tarpey, “A parametric k-means algorithm,” Comput. Statist., vol. 22, no. 1, pp. 71–89, 2007.
F. Yu, “Uniqueness of principal points with respect to p-order distance for a class of univariate continuous distribution,” Statist. Probab. Lett., vol. 183, p. 109341, 2022.
M. P. Wand and M. C. Jones, Kernel Smoothing. London, UK: Chapman and Hall/CRC, 1995.



