Logo des Repositoriums
  • English
  • Deutsch
Anmelden
Keine TU-ID? Klicken Sie hier für mehr Informationen.
  1. Startseite
  2. Publikationen
  3. Publikationen der Technischen Universität Darmstadt
  4. Erstveröffentlichungen
  5. Evaluation of whole graph embedding techniques for a clustering task in the manufacturing domain
 
  • Details
2022
Erstveröffentlichung
Masterarbeit
Verlagsversion

Evaluation of whole graph embedding techniques for a clustering task in the manufacturing domain

File(s)
Download
Hauptpublikation
tuprint.pdf
CC BY 4.0 International
Format: Adobe PDF
Size: 4.23 MB
TUDa URI
tuda/9504
URN
urn:nbn:de:tuda-tuprints-223405
DOI
10.26083/tuprints-00022340
Autor:innen
Iskandar, Yusef ORCID 0000-0001-6787-3031
Kurzbeschreibung (Abstract)

Production systems in manufacturing consume and generate data. Representing the relationships between subsystems and their associated data is complex, but suitable for Knowledge Graphs (KG), which allow us to visualize the relationships between subsystems and store their measurement data. In this work, KG act as a feature engineering technique for a clustering task by converting KG into Euclidean space with so-called graph embeddings and serving as input to a clustering algorithm. The Python library Karate Club proposes 10 different technologies for embedding whole graphs, i.e., only one vector is generated for each graph. These were successfully tested on benchmark datasets that include social media platforms and chemical or biochemical structures. This work presents the potential of graph embeddings for the manufacturing domain for a clustering task by modifying and evaluating Karate Club’s techniques for a manufacturing dataset. First, an introduction to graph theory is given and the state of the art in whole graph embedding techniques is explained. Second, the Bosch production line dataset is examined with an Exploratory Data Analysis (EDA), and a graph data model for directed and undirected graphs is defined based on the results. Third, a data processing pipeline is developed to generate graph embeddings from the raw data. Finally, the graph embeddings are used as input to a clustering algorithm, and a quantitative comparison of the performance of the techniques is conducted.

Sprache
Englisch
Fachbereich/-gebiet
16 Fachbereich Maschinenbau > Institut für Produktionsmanagement, Technologie und Werkzeugmaschinen (PTW)
16 Fachbereich Maschinenbau > Institut für Produktionsmanagement, Technologie und Werkzeugmaschinen (PTW) > Management industrieller Produktion
DDC
000 Allgemeines, Informatik, Informationswissenschaft > 004 Informatik
500 Naturwissenschaften und Mathematik > 510 Mathematik
600 Technik, Medizin, angewandte Wissenschaften > 620 Ingenieurwissenschaften und Maschinenbau
Institution
Technische Universität Darmstadt
Ort
Darmstadt
Gutachter:innen
Metternich, JoachimORCID 0000-0003-0611-8723
Bretones Cassoli, BeatrizORCID 0000-0003-4478-4620
Name der Gradverleihenden Institution
Technische Universität Darmstadt
Ort der Gradverleihenden Institution
Darmstadt
PPN
503107670

  • TUprints Leitlinien
  • Cookie-Einstellungen
  • Impressum
  • Datenschutzbestimmungen
  • Webseitenanalyse
Diese Webseite wird von der Universitäts- und Landesbibliothek Darmstadt (ULB) betrieben.