2 min read
Configure regional settings on Azure Virtual Machines
By default every Windows virtual machine that you deploy in Azure from the marketplace will be deployed using US regional settings. So the timezone,...
Hi all,
This blog post will cover how to transform data with Azure Machine Learning Workbench. One of the most important aspects in any Machine learning project is how accurate and how good your data is. If you have un-accurate or bad data the learning or simulation models will generate bad results.
The main components of Azure Machine Learning are:
Have a look here for a quick overview of Azure Machine Learning and a detailed description of the different Machine learning components.
We will create a new Azure ML Experimentation Service first, after which we will download and install Azure ML Workbench.
The Experimentation Service handles the execution of machine learning experiments. It also supports the Workbench by providing project management, Git integration, access control, roaming, and sharing.
Through easy configuration, you can execute your experiments across a range of compute environment options:
Today we will not execute any experiments. We will just show how to transform the data. Let’s create our new Azure ML learning account:
I selected the DevTest SKU
After filling in the required information and submitting the deployment you should see the following:
Now that we have our Azure ML Learning account we can install Azure ML Workbench. Azure ML Workbench is a desktop application plus command-line tools, supported on both Windows and macOS. It allows you to manage machine learning solutions through the entire data science life cycle:
You can install Azure Machine Learning Workbench on your computer running Windows 10, Windows Server 2016, or newer.
Double-click the downloaded installer AmlWorkbenchSetup.msi from File Explorer. The installer downloads all the necessary dependent components, such as Python, Miniconda, and other related libraries. The installation might take around half an hour to finish all the components.
Select Blank Project:
I downloaded a dataset from https://www.kaggle.com/datasets that I will be using here as an example. To bring data into a project using the data source wizard. Select the + button next to the search box in the data view and choose Add Data Source
In my case I used Excel as the data source
You can specify one or more sampling strategies for the dataset, and choose one as the active strategy. The default is to load the Top 100 rows. I changed it to load the full file
Once the data has been imported you will see the following screen
Now that we have our datasource we will prepare our data by creating a Data Preparation Package that we will use to transform our data.
flights
data source and click ‘Prepare’This will then create a flights.dprep
file in your project directory, and will show a page almost similar to what we’ve seen after we’ve added a data source.
We will now transform our month column from numeric values to a string with the derive column by example. Azure ML Workbench synthesizes a program based on the examples provided by you and applies the same program on remaining rows. All other rows are automatically populated based on the example you provided. Workbench also analyzes your data and tries to identify edge cases.
Select the column you want to transform, click derive column by example
A new column will appear with null values. Click in the first empty cell in the new column and provide an example. In my case I typed January to transform my numeric value of 1 (first month of the year) to January.
Select OK to accept the changes. You should now see the following:
That’s how easy it is to transform data with Azure ML Workbench. Imagine doing this with Excel?! For more examples regarding data transformation have a look here.
Thanks,
Alex
2 min read
By default every Windows virtual machine that you deploy in Azure from the marketplace will be deployed using US regional settings. So the timezone,...
3 min read
Bernie White has a Powershell Module (PSDocs) that can generate mark down files (*.md) and Stefan Stranger’s blog post shows us how to upload these...
3 min read
By default, when you deploy a new Azure WebApp, Function app or API app it will be publicly available to the internet. For the current customer I’m...