Master’s theses

Research master Humanities: track  Human Language Technology
(30ec – 5 months)

  • Charlotte Pouw (2022) Cross-lingual Transfer of Correlations between Linguistic Complexity and Human Reading Behaviour (thesis)
  • Marcel Feteke (2022) Cross-lingual Transfer Using Stacked Language Adapters (thesis ♦  internship at https://www.taus.net/)  cum laude
  • Yilmaz Polat (2022) The Hallucinatory World of Automatic Text Generation (thesis)
  • Eliza Hobo (2022)  Simply accessible: Contextualized Lexical Simplification for Accessibility of Dutch Texts (thesis internship at Amsterdam Intelligence teamcum laude
  • Adrielli Lopez Rego (2022) Matching Ontologies in the Education Domain with Semantic Similarity (thesis ♦  internship at wizenoze)
  • Alessandra Polimeno (2022) Diversifying News Recommendation Systems by Detecting Fragmentation in News Story Chains (thesis )
  • Sanne Hoeken (2022) Using Language Models for Analyzing Semantic Variation between Dutch Social Communities (thesiscum laude
  • Vivian Claes (2021)  ECBERT: Applying BERT to European Central Bank Communication to Predict Market Response (thesis ♦  internship at DNB)
  • Sophie Neutel (2021) Towards automatic ontology alignment using BERT (thesis ♦  internship at TNO)
  • Søren K. Fomsgaard (2021) In the eye of the storm with style – Investigating style features in the language of QAnon on Twitter (thesis ♦  internship at TextGain)
  • Nathan van der Molen – Pater (2021)  Information Usage in Coreference Resolution ((thesis)
  • András Aponyi (2020) Estimating Translation Quality Using Distributed Representations of Words and Sentences (thesis ♦  thesis github ♦ internship at https://www.taus.net/)
  • Klaudia Bartosiak (2020)Towards Formalizing Eligibility Criteria of Clinical Trials: Biomedical Entity Linking (thesis not availablethesis github ♦ internship at https://mytomorrows.com/)
  • Suzana Bašic (2020) Color as a Discriminative Property for Establishing Object Identity in Human-Robot Communication (thesis not availablethesis github  ♦ research project: CLTL-make robots talk and think )
  • Lauren Green  (2020)  Semi-supervised Classification of Occupations using Pseudo-Labelling and Information Extraction (thesis not available ♦ internship at: https://greple.de/)
  • Ngan Nguyen (2020)  Clickbait anatomy: Identifying clickbait with machine learning  (thesis )
  • Lisa Vasileva (2020) Machine Translation Detection for Neural Machine Translation Scenario (thesis ♦  internship at https://www.taus.net/)
  • Jonathan Schaller (2020) Cross-domain evaluation of a question-answering classifier ( thesis not available )
  • Karen Goes (2019) Exploring text mining techniques to structure a digitised catalogue (thesis ♦ internship at: https://www.kb.nl/
  • Liza King (2018) Modals and Measles: Computational linguistic investigations into modal use in the vaccination debate (thesis)
  • Benedetta Torsi (2018) Detecting claims in a cross-register corpus (thesis)
  • Pia Sommerauer (2017) From old to new racism? Investigating known dangers in distributional semantic approaches to conceptual change (thesis)
  • Chantal van Son (2015) Towards a Dutch frame-semantic parser (thesis ♦ research project: CLTL-newsreader)
  • Femke Klaver (2014) Authorship attribution of forum posts  (thesis ♦ internship at: TNO

Master linguistics : track  Text Mining
(18ec – 3 months)

  • Ellemijn Galjaard (2022)  Evaluating Transfer of a Functional Level Classifier from Secondary to Primary Healthcare Notes (thesis ♦ internship at VU Medical center)
  • Lahorka Nikolov (2022) Synthetic Data for Domain Adaptation in Neural Machine Translation (thesis ♦ internship at www.taus.net/cum laude
  • Myrthe Buckens (2022)  Comparing and Evaluating Language Models for Conversational Data from the Medical Domain. (thesis ♦ internship at Autoscriber)
  • Michiel van Nederpelt (2022) Evaluating a transformer-based language model under increasingly challenging conditions for the task of offensive language detection (thesis )
  • Sharona Badloe (2022) MedRoBERTa.nl: Transfer Learning From COVID-19 to Cancer Patients (thesis ♦ internship at  VU Medical center)
  • Shuyi Shen (2022) Data to text generation with a joint entity and relation based method for a job advertisement (thesis ♦ internship at TextMetrics)
  • Tessel Wisman (2022) Domain adaptation of end-to-end ASR via n-gram language modelling. (thesis  ♦  internship at Amberscriptcum laude
  • Sylvia Pronk (2022) A detailed comparison between two coreference systems and their effect on key-sentence extraction (thesis ♦  internship at DNB)
  • Mira Reisiger (2022) Context-based entity linking of biomedical text (thesis  internship at Elsevier)
  • Jingyue Zhang (2022)  Mapping text to learning objectives: A keyword-based text classification method (thesis  internship at Edia)
  • Yan Chung Li (2022) A Challenge Set for Natural Language Inference on but-inferred propositions (thesis) cum laude
  • Konstantina Andronikou (2022)  Automatic Retrieval of Topics Using Topic Modeling Techniques from Customer Conversations in the Airline Domain  (thesis ♦ internship at  Underlined)
  • Elena Weber (2022)  Automatic Topic Classification of Customer Feedback in the Banking Domain (thesis ♦ internship at  Underlined)
  • Anouk Twilt (2022) Sustainability in action: exploring automatically extracting actions from news-articles (thesis)
  • Lois Rink (2022) Automatic Classification of Speech Acts in tax service letters (thesis ♦ internship at Belastingdienst)
  • Giorgio Malinverni (2022) Analysing the Influence of Morphological Characteristics on the Performance of Few-Shot Prompting for Natural Language Inference in Cross-Lingual Settings (thesis )
  • Eva den Uijl (2021) Detecting Discriminatory Language in Job Advertising Texts (thesis ♦ internship at TextMetrics)
  • Guido Ansem (2021)The Effect of Auxiliary Data on Low Resource Languages in Aspect Extraction (thesis)
  • Michelle Chan (2021) An Empirical Framework for Topic Modelling for Dutch Texts based on Newspaper Articles on Soil Pollution 
  • Melisha Lemain – van der Nest (2021) Named Entity Recognition: identifying NER Indicators in Dutch Police Reports (thesis ♦ internship at CBS). 
  • Dyon van der Ende (2021) Text Mining for Sustainability: Detecting Corporate Greenwashing with the Sustainable Development Goals (thesis)
  • Gabriele Catanese (2021) A Transfer Learning approach to Aspect Based Sentiment Analysis for airline customer feedbacks (thesis ♦ internship at Underlined)  cum laude                   !! nominated for the Faculty of Humanities thesis prize 2021
  • Stan Frinking (2021) Using Text Mining Techniques to Detect Fall Events in Medical Patient Notes (thesis ♦ internship at VU Medical center)
  • Jasmine van Vugt  (2021) Two Dutch fine-tuned BERT models: Named Entity Recognition and Named Entity Linking to increase findability of local geographical information. (thesis ♦ internship at  CBS)
  • Sanne Hamersma (2021) Explorative analysis of precursors of physical aggression in a health care institute: a Text Mining approach (thesis ♦ internship at : GGZ
  • Aju Shreshta (2021)  BERTje-based Automatic Anonymisation of Dutch Police Reports (thesis ♦ internship at : CBS
  • Breta Micha (2021)  Automatic Terminology Extraction in domain specific texts: a comparison between a rule-based system and a BERT-based system. (thesis)
  • Jan van Casteren (2020) Automatic Attribution Extraction From Dutch News Articles: A Beginning (thesis  ♦ thesis github research at: eScience center – inside the filter bubble)
  • Peter Caine (2020). Mind the gap: A comparison of linguistic vs deep-learning approaches to aspect extraction and aspect category detection  (thesis ♦ thesis github)
  • Luca Meima (2020) Finding potentially HIV defining conditions in medical reports  (thesis ♦ thesis github ♦ internship at https://mytomorrows.com/)
  • Eva Zegelaar (2020) An Automatic Emotion & Purpose Classifier for Dutch Tweets Written by Members of the Dutch Parliament (thesis  ♦ thesis github ♦ internship at: https://reddata.nl/)