Skip to content
View KlaraGtknst's full-sized avatar
  • Student at University of Kassel
  • Kassel

Highlights

  • Pro

Block or report KlaraGtknst

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
KlaraGtknst/README.md

Hi there, I'm Klara M. Gutekunst 👋

I'm currently a PhD Student of Computer Science at the Deep Semantic Learning group at the University of Kassel. My interests lie in machine learning, data analysis, Information Retrieval and Natural Language Processing. Feel free to explore my repositories to see the projects I've been working on.

🔬 Research and Projects

Here are some of the projects I've been involved in:

  • Master Thesis: Leveraging LLM-Generated Hard Negatives for the Impostors Approach
    Repository: master-thesis
    Description: This repository contains the written work on my Master thesis, employing LLM-generated paraphrases as hard negatives for the Impostor approach of Koppel and Winter (2014).

  • Bachelor Thesis: Identification of Key Information with Topic Analysis on Large Unstructured Text Data
    Repository: bachelor-thesis
    Description: This repository contains the written work on my Bachelor thesis, focusing on identifying key information using topic analysis techniques.

  • Discord Detection in Time Series Data
    Repository: discord_detection
    Description: A project to identify discords in time series data using the HOT SAX methodology.

  • Approaches for Finding Sample Pairs in Contrastive Learning
    Repository: master-seminar-ies
    Description: Work from my Master seminar focusing on methods to find sample pairs in contrastive learning, conducted under the Intelligent Embedded Systems chair.

  • Text topic
    Repository: text_topic
    Description: This repository implements a pipeline to store various data of files from a large unstructured dataset. These fields are used for topic modeling (wordclouds, based on low-dimensional versions of embedding vectors, Named Entity Clustering and document-topic incidences). The information is aggregated and visualised using FCA. A comprehensive overview of the system can be found in the slides presented at the conference CONCEPTS'25 (September 08th-12th, 2025, Cluj-Napoca, Romania) or in the corresponding paper.

  • Topic Analysis of Text Data
    Repository: topic-analysis-text-data
    Description: This repository provides methods and functions to find similar documents in terms of content and visual appearance, i.e. layout, from a large corpus of unstructured text data.

  • Identifying fiscal fraud with anomaly detection techniques
    Repository: identifying-fiscal-fraud
    Description: Bachelor Seminar about exploring techniques to identify anomalies and fiscal fraud.

  • Optimization of Spiking Neural Networks
    Repository: bachelor-seminar-kde
    Description: Bachelor Seminar about Spiking Neural Networks.

🚀 About Me

  • 🔭 I’m currently working on Data Mining of large unstructured (text) data and Information Retrieval research projects.
  • 🌱 I’m currently learning Argumentative search & Web search in the context of Information Retrieval.
  • 👯 I’m looking to collaborate on Natural Language Processing, Information Retrieval projects and open-source initiatives.
  • 💬 Ask me about Natural Language Processing, Information Retrieval and Data Mining.
  • 😄 Pronouns: She/Her
  • ⚡ Fun fact: I enjoy visualizing complex data through creative infographics!

📊 GitHub Stats

Klara's GitHub stats

📫 Connect with Me

Feel free to reach out if you're interested in collaborating or discussing any of my projects!

Popular repositories Loading

  1. I2OT_energy I2OT_energy Public

    I2OT Hackathon

    Jupyter Notebook 3 2

  2. bachelor-thesis bachelor-thesis Public

    This repository contains the written work on the Bachelor thesis 'Identification of key information with topic analysis on large unstructured text data'.

    TeX 2

  3. identifying-fiscal-fraud identifying-fiscal-fraud Public

    Seminar about exploring techniques to identify anomalies and fiscal fraud.

    TeX 2

  4. text_topic text_topic Public

    This repository implements a pipeline to store various data of files from a large unstructured dataset. These fields are used for topic modeling (wordclouds, based on low-dimensional versions of em…

    Python 1

  5. master-thesis master-thesis Public

    TeX 1

  6. FAT_CAT_slides FAT_CAT_slides Public

    Slides for the paper *"Conceptual Topic Aggregation"* presented at CONCEPTS 2025 (08–12 September 2025, Romania).

    TeX 1