Clustering Life Trajectories a New Divisive Hierarchical Clustering Algorithm for Discrete-Valued Discrete Time Series

19 Pages Posted: 19 Feb 2011

See all articles by Stephan Dlugosz

Stephan Dlugosz

ZEW – Leibniz Centre for European Economic Research

Date Written: February 1, 2011

Abstract

A new algorithm for clustering life course trajectories is presented and tested with large register data. Life courses are represented as sequences on a monthly timescale for the working-life with an age span from 16-65. A meaningful clustering result for this kind of data provides interesting subgroups with similar life course trajectories. The high sampling rate allows precise discrimination of the different subgroups, but it produces a lot of highly correlated data for phases with low variability. The main challenge is to select the variables (points in time) that carry most of the relevant information. The new algorithm deals with this problem by simultaneously clustering and identifying critical junctures for each of the relevant subgroups. The developed divisive algorithm is able to handle large amounts of data with multiple dimensions within reasonable time. This is demonstrated on data from the Federal German pension insurance.

Keywords: Clustering, Measures of Association, Discrete Data, Time Series

JEL Classification: C33, C38, J00

Suggested Citation

Dlugosz, Stephan, Clustering Life Trajectories a New Divisive Hierarchical Clustering Algorithm for Discrete-Valued Discrete Time Series (February 1, 2011). ZEW - Centre for European Economic Research Discussion Paper No. 11-015, Available at SSRN: https://ssrn.com/abstract=1763815 or http://dx.doi.org/10.2139/ssrn.1763815

Stephan Dlugosz (Contact Author)

ZEW – Leibniz Centre for European Economic Research ( email )

P.O. Box 10 34 43
L 7,1
D-68034 Mannheim, 68034
Germany

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
49
Abstract Views
566
PlumX Metrics