Extending business failure prediction models with textual website content using deep learning

Archive ouverte : Article de revue

Borchert, Philipp | Coussement, Kristof | de Caigny, Arno | de Weerdt, Jochen

Edité par HAL CCSD ; Elsevier

International audience. Business failure prediction (BFP) is an important instrument in assessing the risk of corporate failure. While a large body of research has focused on BFP, recent research in operations research and analytics acknowledges the beneficial effect of incorporating textual data for predictive modelling. However, extant BFP research that incorporates textual company information is very scarce. Based on a dataset containing 13,571 European companies provided by the largest European data aggregator, this study investigates the added value of extending traditional BFP models with textual website content. We further benchmark various feature extraction techniques in natural language processing (i.e. the vector-space approach, neural networks-based approaches and transformers) and assess the best way of representing and integrating textual website features for BFP modelling. The results confirm that including textual website data improves BFP predictive performance, and that textual features extracted by transformers add the most value to the BFP models in this benchmark setting.

Consulter en ligne

Suggestions

Du même auteur

Uplift modeling and its implications for B2B customer churn prediction: A segmentation-based modeling approach | de Caigny, Arno

Uplift modeling and its implications for B2B customer churn prediction: A s...

Archive ouverte: Article de revue

de Caigny, Arno | 2021-11

International audience. Business-to-business (B2B) customer retention relies heavily on analytics and predictive modeling to support decision making. Given this, we introduce uplift modeling as a relevant prescripti...

Predicting student dropout in subscription-based online learning environments: The beneficial impact of the logit leaf model | Coussement, Kristof

Predicting student dropout in subscription-based online learning environmen...

Archive ouverte: Article de revue

Coussement, Kristof | 2020-08

International audience. Online learning has been adopted rapidly by educational institutions and organizations. Despite its many advantages, including 24/7 access, high flexibility, rich content, and low cost, onlin...

Does it pay off to communicate like your online community? Evaluating the effect of content and linguistic style similarity on B2B brand engagement | Meire, Matthijs

Does it pay off to communicate like your online community? Evaluating the e...

Archive ouverte: Article de revue

Meire, Matthijs | 2022-10

International audience. Business-to-business (B2B) social media efforts have largely focused on creating brand engagement through online content. We propose to analyse company social media texts (tweets) according t...

Du même sujet

ILC-Unet++ for Covid-19 Infection Segmentation | Bougourzi, Fares

ILC-Unet++ for Covid-19 Infection Segmentation

Archive ouverte: Communication dans un congrès

Bougourzi, Fares | 2022-05-23

International audience. Since the appearance of Covid-19 pandemic, in the end of 2019, Medical Imaging has been widely used to analysis this disease. In fact, CT-scans of the Lung can help to diagnosis, detect and q...

Incorporating textual information in customer churn prediction models based on a convolutional neural network | de Caigny, Arno

Incorporating textual information in customer churn prediction models based...

Archive ouverte: Article de revue

de Caigny, Arno | 2019-08-21

International audience. This study investigates the value added by incorporating textual data into customer churn prediction (CCP) models. It extends the previous literature by benchmarking convolutional neural netw...

Face Presentation Attack Detection Using Deep Background Subtraction | Benlamoudi, Azeddine

Face Presentation Attack Detection Using Deep Background Subtraction

Archive ouverte: Article de revue

Benlamoudi, Azeddine | 2022-05

International audience. Currently, face recognition technology is the most widely used method for verifying an individual’s identity. Nevertheless, it has increased in popularity, raising concerns about face present...

Signal Denoising and Detection for Uplink in LoRa Networks based on Bayesian-optimized Deep Neural Networks | Tesfay, Angesom Ataklity

Signal Denoising and Detection for Uplink in LoRa Networks based on Bayesia...

Archive ouverte: Article de revue

Tesfay, Angesom Ataklity | 2023-01

International audience. Long-range and low-power communications are suitable technologies for the Internet of things networks. The long-range implies a very low signal-to-noise ratio at the receiver. In addition, lo...

Essais de géométrie analytique / Par F. Lefrancois ... | Lefrancois, F.. Auteur

Essais de géométrie analytique / Par F. Lefrancois ...

Livre | Lefrancois, F.. Auteur | 1804 - 2. éd., revue et augmentee ...

A CNN-Based Methodology for Cow Heat Analysis from Endoscopic Images | He, Ruiwen

A CNN-Based Methodology for Cow Heat Analysis from Endoscopic Images

Archive ouverte: Article de revue

He, Ruiwen | 2022-06

International audience. In cattle farming, the artificial insemination technique is a biotechnology that brings to farmers a wide range of benefits namely health security, genetic gain and economic costs. The main c...

Chargement des enrichissements...