This repository contains the data used for "Extraction of UML Class Diagrams from Natural Language Specification" (Yang et al. 2022)
To get the entire dataset, you must download the release containing dataset.tar.gz
.
dataset.tar.gz
: archive that contains all the filesfragments.csv
: file that lists UML fragments and their characteristicslabels.csv
: file that contains the labels received in the crowdsourcing effortmodels.csv
: file that lists UML class diagrams and their characteristicszoo/
: folder that contains all the UML data itself, such as pictures and UML encodingsUnzip the tarball first.
Open models.csv
to read the list of available models. Copy its name and search in the zoo/
folder for .png
files starting with that name. For example, the ACME model has an image in the zoo/
folder called ACME.png
.
ls zoo/ACME.png
code zoo/ACME.png # any other image visualizer
Fragment files are named in the following pattern.
Class fragments:
(ModelName)_(class)(number).png
Relationship fragments:
(ModelName)_(rel)(number).png
Similarly, you can visualize them.
code zoo/CFG_class0.png
labels.csv
and find the line that has the label of interest.fragment_id
, which can be indexed in fragments.csv
. Find the ID for the label of interest.fragments.csv
, search for the line where the column value of unique_id
equals fragment_id
from Step 2.Paper | Code | Results | Date | Stars |
---|