In this paper we address the problem of survival models when high-dimensional panel data are available. We discuss two related issues: The first one concerns the issue of variable selection and the second one deals with the stability over time of such a selection, since presence of time dimension in survival data requires explicit treatment of evolving socio-economic context. We show how graphical models can serve two purposes. First they serve as the input for a first algorithm to to assess the temporal stability of the data: Secondly, allow the deployment of a second algorithm which partially automates the process of variable selection, while retaining the option to incorporate domain expertise in the process of empirical model-building. To put our proposed methodology to the test, we utilize a dataset comprising Italian firms funded in 2009 and we study the survival of these entities over the period of 10 years. In addition to revealing significant volatility in the set of variables explaining firm exit over the years, our novel methodology enables us to offer a more nuanced perspective than the conventional one regarding the critical roles played by traditional variables such as industrial sector, geographical location, and innovativeness in firm survival.

Time varying effects in survival analysis: a novel data-driven method for drift identification and variable selection

Guerzoni M.
;
Riso L.
2024-01-01

Abstract

In this paper we address the problem of survival models when high-dimensional panel data are available. We discuss two related issues: The first one concerns the issue of variable selection and the second one deals with the stability over time of such a selection, since presence of time dimension in survival data requires explicit treatment of evolving socio-economic context. We show how graphical models can serve two purposes. First they serve as the input for a first algorithm to to assess the temporal stability of the data: Secondly, allow the deployment of a second algorithm which partially automates the process of variable selection, while retaining the option to incorporate domain expertise in the process of empirical model-building. To put our proposed methodology to the test, we utilize a dataset comprising Italian firms funded in 2009 and we study the survival of these entities over the period of 10 years. In addition to revealing significant volatility in the set of variables explaining firm exit over the years, our novel methodology enables us to offer a more nuanced perspective than the conventional one regarding the critical roles played by traditional variables such as industrial sector, geographical location, and innovativeness in firm survival.
2024
1
34
Firm survival; Model drift; Variable selection
Babutsidze Z.; Guerzoni M.; Riso L.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1960693
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 1
social impact