Take your machine learning models to production with new MLOps capabilities

This blog post was authored by Jordan Edwards, Senior Program Manager, Microsoft Azure.

At Microsoft Build 2019 we announced MLOps capabilities in Azure Machine Learning service. MLOps, also known as DevOps for machine learning, is the practice of collaboration and communication between data scientists and DevOps professionals to help manage the production of the machine learning (ML) lifecycle.

Azure Machine Learning service’s MLOps capabilities provide customers with asset management and orchestration services, enabling effective ML lifecycle management. With this announcement, Azure is reaffirming its commitment to help customers safely bring their machine learning models to production and solve their business’s key problems faster and more accurately than ever before.

 

Here is a quick look at some of the new features:

Azure Machine Learning Command Line Interface (CLI) 

Azure Machine Learning’s management plane has historically been via the Python SDK. With the new Azure Machine Learning CLI, you can easily perform a variety of automated tasks against the ML workspace including:

Compute target management

Experiment submission

Model registration and deployment

Management capabilities

Azure Machine Learning service introduced new capabilities to help manage the code, data, and environments used in your ML lifecycle.

Code management

Git repositories are commonly used in industry for source control management and as key assets in the software development lifecycle. We are including our first version of Git repository tracking – any time you submit code artifacts to Azure Machine Learning service, you can specify a Git repository reference. This is done automatically when you are running from a CI/CD solution such as Azure Pipelines.

Data set management

With Azure Machine Learning data sets you can version, profile, and snapshot your data to enable you to reproduce your training process by having access to the same data. You can also compare data set profiles and determine how much your data has changed or if you need to retrain your model.

Environment management

Azure Machine Learning Environments are shared across Azure Machine Learning scenarios, from data preparation to model training to inferencing. Shared environments help to simplify handoff from training to inferencing as well as the ability to reproduce a training environment locally.

Environments provide automatic Docker image management (and caching!), plus tracking to streamline reproducibility.

Simplified model debugging and deployment

Some data scientists have difficulty getting an ML model prepared to run in a production system. To alleviate this, we have introduced new capabilities to help you package and debug your ML models locally, prior to pushing them to the cloud. This should greatly reduce the inner loop time required to iterate and arrive at a satisfactory inferencing service, prior to the packaged model reaching the datacenter.

Model validation and profiling 

Another challenge that data scientists commonly face is guaranteeing that models will perform as expected once they are deployed to the cloud or the edge. With the new model validation and profiling capabilities, you can provide sample input queries to your model. We will automatically deploy and test the packaged model on a variety of inference CPU/memory configurations to determine the optimal performance profile. We also check that the inference service is responding correctly to these types of queries.

Model interpretability

Data scientists want to know why models predict in a specific manner. With the new model interpretability capabilities, we can explain why a model is behaving a certain way during both training and inferencing.

ML audit trail

Azure Machine Learning is used for managing all of the artifacts in your model training and deployment process. With new audit trail capabilities, we are enabling automatic tracking of the experiments and datasets that corresponds to your registered ML model. This helps to answer the question, “What code/data was used to create this model?”

Azure DevOps extension for machine learning

Azure DevOps provides commonly used tools data scientists leverage to manage code, work items, and CI/CD pipelines. With the Azure DevOps extension for machine learning, we are introducing new capabilities to make it easy to manage your ML CI/CD pipelines with the same tools you use for software development processes. The extension includes the abilities to trigger Azure Pipelines release on model registration, easily connect an Azure Machine Learning Workspace to an Azure DevOps project, and perform a series of tasks designed to help interaction with Azure Machine Learning as easy as possible from the existing automation tooling.

Get started today

These new MLOps features in the Azure Machine Learning service aim to enable users to bring their ML scenarios to production by supporting reproducibility, auditability, and automation of the end-to-end ML lifecycle. We’ll be publishing more blogs that go in-depth with these features in the following weeks, so follow along for the latest updates and releases.

Learn more about Azure Machine Learning service
Get started today with a free trial

Quelle: Azure

Azure SQL Data Warehouse releases new capabilities for performance and security

As the amount of data stored and queried continues to rise, it becomes increasingly important to have the most price-performant data warehouse. While we’re excited about being the industry leader in both of Gigaom’s TPC-H and TPC-DS benchmark reports, we don’t plan to stop innovating on behalf of our customers.

As Rohan Kumar mentioned in his blog on Monday, we’re excited to introduce several new features that will continue to make Azure SQL Data Warehouse the unmatched industry leader in price-performance, flexibility, and security.

To enable customers to continue improving the performance of their applications without adding any additional cost, we’re announcing preview availability of result-set caching, materialized views, and ordered clustered columnstore indexes.

In addition to price-performance enhancements, we’ve added new capabilities that enable customers to be more agile and flexible. The first is workload importance, which is a new feature that enables users to decide how workloads with conflicting needs get prioritized. Second, our new support for automatic statistics maintenance (auto-update statistics) means that manageability and maintenance of Azure SQL Data Warehouse just got easier and more effective. And finally, we’re also adding support for managing and querying JSON data. Users can now load JSON data directly into their data warehouses and mix it with other relational data, leading to faster and easier insights.

Our last announcement focuses on security and privacy. As you know, deploying data warehousing solutions in the cloud demands sophisticated and robust security. While Azure SQL Data Warehouse already enables an advanced security model to be deployed, today we’re announcing support for Dynamic Data Masking (DDM). DDM allows you to protect private data, through user-defined policies, ensuring it’s visible only to those that have permission to see it.

In the sections below, we’ll dive into these new features and the benefits that each provide.

Price-performance

Price-performance is a reoccurring theme in our releases because it ensures we provide one of the fastest analytics services at incredible value. With new functionalities announced today, we continue to demonstrate our commitment towards offering the leading price-performance platform.

Interactive dashboarding with result-set caching (preview)

Interactive dashboards come with predictable and repetitive query patterns. Result-set caching, now available in preview, helps with this scenario as it enables instant query response times while reducing time-to-insight for business analysts and reporting users.

With result-set caching enabled, Azure SQL Data Warehouse automatically caches results from repetitive queries, causing subsequent query executions to return results from the persisted cache that omits full query execution. In addition to saving compute cycles, queries satisfied by result-set cache do not use any concurrency slots and thus do not count against existing concurrency limits. For security reasons, only users with the appropriate security credentials can access the result sets in cache.

Materialized views to improve performance (preview)

Another new feature that greatly enhances query performance for a wide set of queries is materialized view support, now available in preview. A materialized view improves the performance of complex queries (typically queries with joins and aggregations) while offering simple maintenance operations.

When materialized views are created, Azure SQL Data Warehouse query optimizer transparently and automatically rewrites user queries to leverage deployed materialized views, leading to improved query performance. Best of all, as the data gets loaded into base tables, Azure SQL Data Warehouse automatically maintains and refreshes materialized views, providing a simplified view of maintenance and management. As the user queries leverage materialized views, queries run significantly faster and use less system resources. The more complex and expensive the query within the view is, the bigger potential there is for execution time savings.

Fast scans with ordered clustered columnstore indexes (preview)

Columnstore is a key enabler for storing and efficiently querying large amounts of data. For each table, it divides incoming data into row groups and each column of a row group forms a segment on a disk. When querying columnstore indexes, only the column segments that are relevant to user queries are read from the disk. Ordered clustered columnstore indexes further optimize query execution by enabling efficient segment elimination.

Due to pre-ordered data, you can drastically reduce the number of segments that are read from the disk, leading to faster query processing. Ordered clustered columnstore indexes is now available in preview, and queries containing filters and predicates can greatly benefit from this feature.

Flexibility

As business requirements evolve, the ability to change and adapt solution behavior is one of the key benefits of a modern data warehousing product. The ability to handle and manage heterogeneous data that enterprises have while offering ease of use and management is critical. To support these needs, Azure SQL Data Warehouse is introducing the following new functionalities to help you deal with ever-evolving requirements.

Prioritize workloads with workload importance (general availability)

Running mixed workloads on your analytics solution is often a necessity to effectively and quickly execute business processes. In situations where resources are constrained, the capability to decide which workloads need to be executed first is critical, as it helps with overall solution cost management. For instance, executive dashboard reports may be more important than ad-hoc queries. Workload importance now enables this scenario. Requests with higher importance are guaranteed quicker access to resources, which helps meet predefined SLAs and ensures important requests are prioritized.

Workload classification concept

To define workload priority, various requests must be classified. Azure SQL Data Warehouse supports flexible classification policies that can be set for a SQL query, a database user, database role, Azure Active Directory login, or Azure Active Directory group. Workload classification is achieved using the new CREATE WORKLOAD CLASSIFIER syntax.

The diagram below illustrates the workload classification and importance function:

Workload importance concept

Workload importance is established through classification. Importance influences a requester's access to system resources  including memory, CPU, and IO and locks. A request can be assigned one of these five levels of importance: low, below_normal, normal, above_normal, and high. If a request with above_normal importance is scheduled, it gets access to resources before a request with the default normal importance.

Manage and query JSON data (preview)

Organizations are increasingly faced with dealing with multiple data sources and heterogeneous file formats, JSON being among the top ones, aside from CSV files. To speed up time to insight and minimize unnecessary data transformation processes, Azure SQL Data Warehouse now enables support for querying JSON data. This feature is now available in preview.

Business analysts can now use the familiar T-SQL language to query and manipulate documents that are formatted as JSON data. JSON functions, such as JSON_VALUE, JSON_QUERY, JSON_MODIFY, and OPENJSON are now supported in Azure SQL Data Warehouse. Azure SQL Data Warehouse can now effectively support both relational and non-relational data, including joins between the two, while enabling users to use their traditional BI tools, such as Power BI.

Automatic statistics maintenance and update (preview)

Azure SQL Data Warehouse implements a cost-based optimizer to ensure optimal execution plans are being generated and used. For any cost-based optimizer to be effective, column level statistics are needed. When these statistics are stale, there is potential for selecting a non-optimal plan, leading to slower query performance.

Today, we’re extending that support for auto statistics creation by adding the ability to automatically refresh and maintain statistics. As data warehouse tables get loaded and updated, the system can now automatically detect and update out-of-date statistics. With the auto-update statistics capability now available in preview, Azure SQL Data Warehouse delivers full statistics management capabilities while simplifying statistics maintenance processes. You no longer need to manually maintain statistics, which leads to a simplified and more cost-effective data warehouse deployment.

Security

Azure SQL Data Warehouse provides one of the most advanced security and privacy features in the market. This is achieved through using proven SQL Server technology. SQL Server, as the core technology and component of Azure SQL Data Warehouse, has been the least vulnerable databases over the last eight years according to the NIST national vulnerabilities database. To expand existing Azure SQL Data Warehouse's security and privacy features, we’re announcing Dynamic Data Masking (DDM) support is now available in preview.

Protect sensitive data with dynamic data masking (preview)

Dynamic data masking (DDM) enables administrators and data developers to control access to their company’s data, allowing sensitive data to be safe and restricted. It prevents unauthorized access to private data by obscuring the data on-the-fly. Based on user-defined data masking policies, Azure SQL Data Warehouse can dynamically obfuscate data as the queries execute, and before results are shown to users.

Azure SQL Data Warehouse implements the DDM capability directly inside the engine. When creating tables with DDM, policies are stored in the system's metadata and then enforced by the engine as queries get executed. This centralized policy enforcement process simplifies data masking rules management as access control is not implemented and repeated at the application layer. As various users access queries tables, policies are automatically honored and applied while protecting sensitive data. DDM comes with flexible policies and you can choose to define a partial mask, which exposes some of the data in the selected columns, or a full mask that obfuscates the data completely. Azure SQL Data Warehouse also provides built-in masking functions that users can choose from.

Next steps

Get started with a free Azure SQL Data Warehouse account.
Learn more about workload management concepts and workload management scenarios.
Learn more about why analytics in Azure is simply unmatched.

Please note that the preview features mentioned in this blog are being rolled out to all regions. Check the version deployed to your instance and review the latest Azure SQL Data Warehouse release notes to learn more. For preview questions, please contact AskADWPreview@microsoft.com.
Quelle: Azure

Connecting the colossal: How to scale innovation with serverless integration

Starting the process of migrating to the cloud can be daunting. Legacy systems that are colossal in scale often overwhelm the average team tasked with the mission of digital transformation. How can they possibly untangle years of legacy code to start this new digital transformation initiative? Not only are these systems colossal in scale, but also colossal in terms of business importance. Enterprise applications like SAP and IBM, are integral to the daily rhythm of business. A seemingly simple mistake can result in catastrophic consequence.

Over the past year, Azure Integration Services has been reflecting on solutions to help with these challenges and we’re excited to announce new capabilities:

Developer focused – Improved the developer experience inside Logic Apps by allowing you to directly write code as a step inside a Logic App.
Enterprise ready – Added new migration and modernization scenarios with the general availability of our new-and-improved SAP connector.
Serverless first – Better integration between API Management and Azure Functions makes it even easier to create and manage serverless integrations and applications.

The challenges facing customers

Over the past year, we've had the opportunity to meet with and hear from customers in-person to discuss the biggest challenges facing their organizations, in terms of innovation. While the tools and technology customers use might be unique to their industry, the high-level challenges encountered are often universal.

Here’s a couple of the common high-level challenges :

Developing, onboarding, and scaling new apps and services within existing IT infrastructure. Vipps, the number one payment service in Norway, faced these challenges while making the move from a monolithic application structure to a microservices first architecture.
Migrating from on-premises legacy systems to the cloud without disrupting day-to-day operations. Alaska Airlines worked through this by adopting a hybrid approach.
Rolling out digital transformation efforts throughout your organization and ensuring the success of these initiative. Finastra tackled this problem by leaning into an API-first solution and created a partner program that unlocked many different new opportunities for them.

These challenges are rooted in integration. By moving to the cloud and creating new, smaller cloud-native services, customers are realizing that the true benefit lies is in how everything works together. Integration is no longer about transforming and moving data from point A to point B, it’s now about how systems of apps, microservices, databases, and on-premises infrastructure are composed to achieve results.

Azure Integration Services leverages serverless technology to remove the resource overhead of managing infrastructure and instead, focus on connecting and composing systems that are adaptable to changing demands.

Looking ahead, our focus is on two key areas:

Making the journey to the cloud as smooth and seamless as possible by improving the experience with our products, and how our products work together.
Creating an out-of-the-box solution library that provides step-by-step guidance on when and how to use our Azure services to solve business and IT problems around connectivity, integration, and application development.

We know that to achieve in these areas, we must be at the forefront of the changing technology landscape. Integration is the backbone that drives application innovation and development, the journey to the cloud, and long-term success with cloud-native innovation strategies.

To learn more about Azure Integration Services, watch the Azure Friday episode giving an overview of Integration Services.

Interested in talking with an expert? Schedule a call with one of our solutions experts to see how Azure Integration Services can help you.
Quelle: Azure

SAP and Microsoft bring IoT data to the core of the business applications

As a leader in the IoT cloud ecosystem, Microsoft enables a full stack of business applications, within different industries, across the intelligent edge and intelligent cloud. The continued growth of the IoT industry is going to be a transformative force across all organizations. Microsoft and SAP have collaborated for over two decades to enable enterprise SAP solution deployments and the partnership has expanded across the Industrial Internet Consortium, the OPC Foundation, and the Platform Industrie 4.0.

At Mobile World Congress in February, SAP and Microsoft announced our extended collaboration to physical assets in the space of Internet of Things (IoT). Today, we are excited to announce the general availability of SAP Leonardo IoT integration with Azure IoT Hub.

SAP Leonardo IoT integrates with Azure IoT services providing customers with the ability to contextualize and enrich their IoT data with SAP business data to drive new business outcomes. Leveraging Azure IoT Hub and Azure IoT Edge, it provides access to secure connectivity at scale, powerful device management functionality, and industry-leading edge computing support. With the ability to intelligently combine business data to provide industrial IoT capabilities and services consumed by SAP business applications, customers now have a complete view on their data from physical assets to business processes to customer relationships, and offers a full digital feedback loop.

SAP and Microsoft’s common goal is to provide a 360 view of the data from physical assets to business processes enabling customers to remove data silos and realize a full digital feedback loop across the intelligent cloud and intelligent edge. By running SAP Essential Business Functions on Microsoft Azure IoT Edge, customers will be able to extend their S4/Hana and C4/Hana business processes closest to their most valuable assets, providing the capability of business requests and governance at the edge.

Explore more on the Azure IoT and SAP Leonardo IoT Interoperability.
Quelle: Azure

Reshaping the business landscape with serverless APIs

Things are changing for the modern business. API-first development and microservices architecture is opening the door to new innovations. Many of these new approaches are possible in part due to the evolution of serverless technology, which eliminates the need for the management of infrastructure.

Fully managed infrastructure allows for allocating resources to solving a business problem, rather than managing the IT infrastructure. This results in more agility, reduced operating cost, and shorter time-to-market, which is important for organizations of any size.

Serverless is for all, no matter the size

The benefits serverless offers is independent of the size of the company. For example:

Startups need to quickly assess product-market fit and build prototypes to test their hypotheses.

With limited resources, startups can build, measure, and iterate their way to success with execution-based pricing models.
Unlocks a new generation of startups, all built on the idea that a small group of people with a limited budget can be disruptive.
As they evolve, they’ll benefit from serverless much in the same way as larger organizations do.

Enterprises need to adapt to constantly evolving customer requirements to stay competitive with agile, fast moving startups.

Serverless enables a business to grow without worrying about managing infrastructure and the planning associated with it.
Promotes move to architectural patterns that increase the flexibility and agility of software development.
Provides the ability to compete at the same level as more nimble players, while consistently growing the business.

Both benefit equally from a serverless approach, for different reasons.

Improved, stronger integration for API-first applications

Over the past year, API Management has collaborated with Azure Functions to build a stronger integration between the two services. Over the past year, API Management has collaborated with Azure Functions to build a stronger integration between the two services. Our goal is to increase developers’ productivity and provide better, more impactful experiences for creating serverless, API-first applications.

To achieve that goal, we are announcing that two new capabilities are now generally available:

Expose a Function App via API Management by linking it to a new or existing API.
Monitor, debug, and maintain applications built with Functions and API Management via distributed tracing in Application Insights.

Azure API Management simplifies publishing of APIs as well as their consumption by clients. It allows for abstraction of APIs from their implementation. APIs are governed through policies, managed from a unified plane, optimized through caching, and published for frictionless consumption through a developer portal.

API Management is the front door for your application. Azure Functions provide serverless compute and eliminate the initial friction associated with implementing new applications. Functions allow for agile assembly of prototypes and production-grade solutions.

Moving into the future

The proliferation of APIs and the API economy has given rise to new opportunities for businesses of all sizes. API-first development is now a necessary approach to ensure future success. This is why we are excited about the investments we've made this year and how we are making API architectures easier to adopt by leveraging serverless technology.

Jump right in and get started:

Learn how to build an API-first application with Azure Functions and API Management
Explore the Serverless Application learning path
Create an API Management instance 5-minute quickstart

Interested in talking with an expert? Schedule a call with one of our solution experts, for a more personalized approach to starting with serverless, API-first applications.
Quelle: Azure