MSc Thesis

Optimising Crowdfunding Success: A BOHB-Driven Reward-Tier Strategy for Technology Campaigns

Gemini-Driven Automated Prompt Engineering for Mining High-Volume Sustainability Reports at ING Bank

Designing for Dollars or Disputes? Linking Earnout Complexity to Payout Failure in M&A

Optimising Crowdfunding Success: A BOHB-Driven Reward-Tier Strategy for Technology Campaigns

Nathalie Tuijp, Optimising Crowdfunding Success: A BOHB-Driven Reward-Tier Strategy for Technology Campaigns, Financial Engineering & Management, University of Twente, 2025

Supervisors: prof. dr. Joerg Osterrieder (University of Twente, Netherlands), prof. dr. Berend Roorda (University of Twente, Netherlands), Dr. Stefana Belbe (Babeș-Bolyai University, Romania)

Crowdfunding platforms like Kickstarter offer opportunities for entrepreneurs and creators to fund their projects. Most of these projects have a reward-tier structure to fund their campaigns. The reward-tier-structure largely influences the outcome of the campaign. Therefore, this thesis aims to optimise this structure by developing a BOHB (Bayesian Optimization and Hyperband) framework with the integration of LLaMA-based embeddings, identifying the most effective reward-tier strategies to enhance campaign success rates. The BOHB framework is specifically chosen, as it is particularly effective for high-dimensional, non-convex search spaces like those found in crowdfunding campaigns. Its adaptive resource allocation and multi-fidelity optimisation allow it to efficiently explore vast parameter spaces, identifying optimal reward strategies with reduced computational cost. To complement traditional numerical features such as funding goals, number of backers, and reward levels, the research integrates LLaMA embeddings which are incorporated in the model to predict the campaign success, giving a higher accuracy to the model. These embeddings capture the semantic richness and emotional tone of campaign descriptions and reward titles. By combining advanced hyperparameter optimisation with the use of LLaMA embeddings, the model identifies optimal reward configurations that enhance the probability of campaign success.

The study uses the publicly available Kickstarter database WebRobots.io, alongside additional data collected by a custom webscraper script. This thesis uses the data in the category Technology from the Kickstarter platform, followed by a data analysis in Python.

The basic RF model scored 0.7428 on accuracy, 0.7169 on precision, 0.7326 on recall, 0.7247 on the F1-Score and 0.8150 on AUC-ROC. The best performing model, the XGBoost BOHB model with all-MiniLM-L6-v2 Embeddings, scored 0.8180 on accuracy, 0.8177 on precision, 0.8182 on recall, 0.8180 on the F1-Score, and 0.9011 on the AUC-ROC. This work contributes to the academic understanding of crowdfunding dynamics and provides actionable insights for researchers interested in NLP in crowdfunding.

Gemini-Driven Automated Prompt Engineering for Mining High-Volume Sustainability Reports at ING Bank

Author Fishchuk, V.V.
First supervisor Osterrieder, J.R.O. (BMS-IEBIS)
Other supervisor(s) Corbo Ugulino, W. (EEMCS-SCS)
Graduation year and month 2025, September
Study programme Master Business Information Technology
Faculty Electrical Engineering, Mathematics and Computer Science
External organisation ING Bank, Amsterdam, Netherlands

LLMs are increasingly used for mining long, heterogeneous reports, increasing the demand for prompt engineering; yet, scalable manual prompt engineering is costly. This study evaluates automated prompt optimization for information extraction at ING Bank. The author adapts the ProTeGi automated prompt optimization framework to the information extraction use case and introduces Gradient Verification (GradV). This decision gate filters LLM feedback, facilitating faster convergence of LLM-feedback-based prompt optimization methods. The work also introduces a transparent prompt enrichment (PE) framework, which converts verified LLM feedback into modular instructions. Using Gemini Flash 2.0, the author extracts absolute emissions of Scope 1, assurance of Scope 3, and reporting period from 141 anonymized annual sustainability reports. On the test set (n=49), the improvements over the initial prompts were 27 percentage points (pp) for S1, 4pp for S3a, and 10pp for Rp; after the Holm-Bonferroni correction, only S1 remains statistically significant (one-tailed exact binomial test on discordant pairs, with stepwise significance levels of 0.0167, 0.025, 0.05).

Optimization also improved stability, reducing run-to-run response variability on some variables under identical conditions.

Furthermore, optimized prompts displayed similar accuracies to analyst-designed prompts on some targets, clearly highlighting the potential for reducing manual effort. In general, automated prompt optimization has shown potential to be a feasible and scalable alternative to manual prompt engineering for long-context and long-input information extraction in enterprise scenarios for certain target variables.

Designing for Dollars or Disputes? Linking Earnout Complexity to Payout Failure in M&A

Alexandros Vassilios Gabriel

Earnouts are increasingly common in mergers and acquisitions (M&A) as a tool to reconcile valuation gaps and align incentives. This research performs a large-scale empirical analysis of 448 M&A earnout agreements, leveraging a novel AI driven methodology to extract detailed contractual features directly from the legal text of SEC filings. The analysis reveals that high contractual complexity, driven by extensive protective covenants, is a universal market standard, independent of the deal’s initial level of information asymmetry. The study’s central finding is a powerful and paradoxical negative relationship between this contractual complexity and the probability of a successful payout, suggesting that the very mechanisms designed to ensure payment are correlated with its failure. The analysis further reveals that earnouts based on uncontrollable, market-based metrics demonstrate a significantly higher rate of failure, a finding consistent with Agency Theory. The primary contribution to the literature is moving beyond ex ante announcement studies to provide the first modern, large-scale empirical evidence linking specific ex ante contractual design choices to ex post realized payout outcomes. Practically, the findings offer a, data-driven caution to M&A practitioners against both overly complex agreements and the use of market-based metrics. By creating and analyzing this detailed dataset, the study provides a modern benchmark of earnout design and its ultimate performance.

Page updated

Report abuse