Parse Azure ML Data Labelling dataset convert to XML for object detection modelling

3 min readAug 15, 2020

Convert the Azure machine learning dataset from ML assist/data labelling output to XML for Object detection modelling

One of the challenge with Azure ML data set is ability to export to format which can be leveraged into existing Mask R-CNN or Fast R-CNN or resnet to train for object detection.

Azure Machine learning ml assist data set is in different format.

Please convert to your specific format as needed. This tutorial show how to parse the AML dataset.

This tutorial walks through step take the AML data set and convert to XML to be able to use in any object detection model

Pre Requisite

Azure machine learning service
Create data labelling project
https://docs.microsoft.com/en-us/azure/machine-learning/how-to-create-labeling-projects
enable ML assist
label images
Export label output as Azure ML dataset
compute instance

When saved, the dataset section will have the new dataset

Now lets convert to Xml

Go to compute instance and click jupyter lab/jupyter
create folder called xml
create a new notebook with azure ML sdk
Get workspace details like subscription id, resource group name and ML workspace name

# azureml-core of version 1.0.72 or higher is required
# azureml-dataprep[pandas] of version 1.1.34 or higher is required
# azureml-contrib-dataset of version 1.0.72 or higher is required
from azureml.core import Workspace, Dataset
import azureml.contrib.dataset
import pandas as pdsubscription_id = 'replacexxxxxxxxxxxxxxxxxxxxxxxx'
resource_group = 'replaceresourcegroupname'
workspace_name = 'replaceworkspacename'workspace = Workspace(subscription_id, resource_group, workspace_name)dataset = Dataset.get_by_name(workspace, name='yourdatasetnamehere')
dataset.to_pandas_dataframe().head()

Will display only first 5 rows to see the format of AML dataset produced
Convert the dataset to DataFrames

objdf = dataset.to_pandas_dataframe()#convert column to string objdf['image_url'] = objdf['image_url'].astype(str)objdf['imgfile'] = objdf['image_url'].str.extract('AmlDatastore://([^/]*[^/\d])\d*\.jpg', expand=False).str.strip()from xml.dom import minidom 
import xml.etree.ElementTree as EToutputfolder = "./xml"import os import numpyfor index, row in objdf[['imgfile', 'label']].iterrows():
    # print(str(row['imgfile']), row['label'])
    xmlfile = os.path.join(outputfolder, row['imgfile'])
    # create the file structure
    data = ET.Element('annotation')
    folder = ET.SubElement(data, 'folder')
    filename = ET.SubElement(data, 'filename')
    size = ET.SubElement(data, 'size')
    segmented = ET.SubElement(data, 'segmented')    
    filename.text = row['imgfile'] + ".jpg"
    
    for line in row['label']:
        # print(line['label'], line['topX'], line['topY'], line['bottomX'], line['bottomY'])
        object = ET.SubElement(data, 'object')
        name = ET.SubElement(object, 'name')
        bndbox = ET.SubElement(object, 'bndbox')
        xmin = ET.SubElement(bndbox, 'xmin')
        ymin = ET.SubElement(bndbox, 'ymin')
        xmax = ET.SubElement(bndbox, 'xmax')
        ymax = ET.SubElement(bndbox, 'ymax')
        name.text = line['label']
        xmin.text = str(numpy.float64(line['topX']))
        ymin.text = str(numpy.float64(line['topY']))
        xmax.text = str(numpy.float64(line['bottomX']))
        ymax.text = str(numpy.float64(line['bottomY']))
        
    difficult = ET.SubElement(object, 'difficult')
    
    # create a new XML file with the results
    fname = row['imgfile'] + ".xml"
    fullfname = os.path.join(outputfolder,fname)
    #print(fullfname)
    mydata = ET.tostring(data)
    #print(mydata)
    myfile = open(fullfname, "wb")
    myfile.write(mydata)

Above code first expands each row from the data frame. Then get the image name and Labels list
Loop the label list to get the label and bounding box coordinates
Format the XML
Finally write the xml to disk
sample output xml

<annotation><folder /><filename>2020-02-27T09-42-38.000000Z.jpg</filename><size /><segmented /><object><name>nostock</name><bndbox><xmin>0.19410098496303763</xmin><ymin>0.31760204081632654</ymin><xmax>0.2524044351338486</xmax><ymax>0.5690306371572066</ymax></bndbox></object><object><name>nostock</name><bndbox><xmin>0.3393817204301075</xmin><ymin>0.3241326779735332</ymin><xmax>0.43878436344926075</xmax><ymax>0.5657653185786033</ymax></bndbox></object><object><name>nostock</name><bndbox><xmin>0.7188321094216696</xmin><ymin>0.3012755102040816</ymin><xmax>0.8144116327611634</xmax><ymax>0.5478060975366709</ymax></bndbox></object><object><name>nostock</name><bndbox><xmin>0.4263590057263665</xmin><ymin>0.5951531857860332</ymin><xmax>0.5506122912699746</xmax><ymax>0.7437245271643814</ymax></bndbox></object><object><name>nostock</name><bndbox><xmin>0.09756571816429585</xmin><ymin>0.015561224489795918</ymin><xmax>0.20939368244567652</xmax><ymax>0.2278061224489796</ymax></bndbox><difficult /></object></annotation>

Originally published at https://github.com.

Parse Azure ML Data Labelling dataset convert to XML for object detection modelling

Convert the Azure machine learning dataset from ML assist/data labelling output to XML for Object detection modelling

Pre Requisite

Written by Balamurugan Balakreshnan

No responses yet