Nathalie Tuijp, Optimising Crowdfunding Success: A BOHB-Driven Reward-Tier Strategy for Technology Campaigns, Financial Engineering & Management, University of Twente, 2025
Supervisors: prof. dr. Joerg Osterrieder (University of Twente, Netherlands), prof. dr. Berend Roorda (University of Twente, Netherlands), Dr. Stefana Belbe (Babeș-Bolyai University, Romania)
Crowdfunding platforms like Kickstarter offer opportunities for entrepreneurs and creators to fund their projects. Most of these projects have a reward-tier structure to fund their campaigns. The reward-tier-structure largely influences the outcome of the campaign. Therefore, this thesis aims to optimise this structure by developing a BOHB (Bayesian Optimization and Hyperband) framework with the integration of LLaMA-based embeddings, identifying the most effective reward-tier strategies to enhance campaign success rates. The BOHB framework is specifically chosen, as it is particularly effective for high-dimensional, non-convex search spaces like those found in crowdfunding campaigns. Its adaptive resource allocation and multi-fidelity optimisation allow it to efficiently explore vast parameter spaces, identifying optimal reward strategies with reduced computational cost. To complement traditional numerical features such as funding goals, number of backers, and reward levels, the research integrates LLaMA embeddings which are incorporated in the model to predict the campaign success, giving a higher accuracy to the model. These embeddings capture the semantic richness and emotional tone of campaign descriptions and reward titles. By combining advanced hyperparameter optimisation with the use of LLaMA embeddings, the model identifies optimal reward configurations that enhance the probability of campaign success.
The study uses the publicly available Kickstarter database WebRobots.io, alongside additional data collected by a custom webscraper script. This thesis uses the data in the category Technology from the Kickstarter platform, followed by a data analysis in Python.
The basic RF model scored 0.7428 on accuracy, 0.7169 on precision, 0.7326 on recall, 0.7247 on the F1-Score and 0.8150 on AUC-ROC. The best performing model, the XGBoost BOHB model with all-MiniLM-L6-v2 Embeddings, scored 0.8180 on accuracy, 0.8177 on precision, 0.8182 on recall, 0.8180 on the F1-Score, and 0.9011 on the AUC-ROC. This work contributes to the academic understanding of crowdfunding dynamics and provides actionable insights for researchers interested in NLP in crowdfunding.
Author Fishchuk, V.V.
First supervisor Osterrieder, J.R.O. (BMS-IEBIS)
Other supervisor(s) Corbo Ugulino, W. (EEMCS-SCS)
Graduation year and month 2025, September
Study programme Master Business Information Technology
Faculty Electrical Engineering, Mathematics and Computer Science
External organisation ING Bank, Amsterdam, Netherlands
LLMs are increasingly used for mining long, heterogeneous reports, increasing the demand for prompt engineering; yet, scalable manual prompt engineering is costly. This study evaluates automated prompt optimization for information extraction at ING Bank. The author adapts the ProTeGi automated prompt optimization framework to the information extraction use case and introduces Gradient Verification (GradV). This decision gate filters LLM feedback, facilitating faster convergence of LLM-feedback-based prompt optimization methods. The work also introduces a transparent prompt enrichment (PE) framework, which converts verified LLM feedback into modular instructions. Using Gemini Flash 2.0, the author extracts absolute emissions of Scope 1, assurance of Scope 3, and reporting period from 141 anonymized annual sustainability reports. On the test set (n=49), the improvements over the initial prompts were 27 percentage points (pp) for S1, 4pp for S3a, and 10pp for Rp; after the Holm-Bonferroni correction, only S1 remains statistically significant (one-tailed exact binomial test on discordant pairs, with stepwise significance levels of 0.0167, 0.025, 0.05).
Optimization also improved stability, reducing run-to-run response variability on some variables under identical conditions.
Furthermore, optimized prompts displayed similar accuracies to analyst-designed prompts on some targets, clearly highlighting the potential for reducing manual effort. In general, automated prompt optimization has shown potential to be a feasible and scalable alternative to manual prompt engineering for long-context and long-input information extraction in enterprise scenarios for certain target variables.