Machine Learning Archives

Compiler, LLVM, Machine Learning, Software, Sparsity

A primer on Sparsity: What is it and Why should we care about it?

Udit kumar agarwal / August 27, 2023

Broadly speaking, in mathematics, sparsity generally refers to the property of having a relatively small number of non-zero elements or structures within a larger space or set. This concept can be applied to various mathematical objects, such as matrices, graphs, functions, and more. Sparsity is often exploited to optimize efficiency. For instance, in linear algebra, sparse matrices (matrices with a large number of zero elements) are handled differently from dense matrices, leveraging the abundance of zeros to save memory and accelerate computations. In graph theory, sparse graphs (graphs with relatively few edges) can be processed more efficiently with different algorithms.…

Continue Reading→

Compiler, LLVM, Machine Learning, Software, static analysis

ML Compilers Part 2: An Overview of Graph Optimizations

Udit kumar agarwal / August 12, 2023

In the previous post, we looked at different types and design objectives of high-level IRs used by ML compilers. Today, we are going to look at different optimizations that are performed using high-level IRs.These optimizations can be performed either offline or online. In online mode, the optimizations are done before performing the inference, while in offline mode, the runtime saves the optimized graph to disk. In this post, I have attempted to further extend Li et al.’s [1] classification of high-level IR optimizations into five categories to the best of my knowledge. Figure 1: Overview of Optimizations done on High-level…

Continue Reading→

Compiler, LLVM, Machine Learning, Software

ML Compilers Part 1: High-Level Intermediate Representation

Udit kumar agarwal / August 6, 2023

High-level intermediate representations (IRs) play a crucial role in machine learning (ML) compilers, enabling optimization and code generation for various hardware platforms. These IRs provide an abstract representation of the machine learning models that can be transformed and optimized before being compiled into executable code. In this blog post, I will discuss the design objectives, and the type of high-level IRs used by popular, real-world ML compilers. Workflow of ML compilers Content Design Objectives Types of High-level IR Examples of High-level IR Relay (TVM) Torch.Fx Design Objectives for High-Level IR Optimizations Optimizing the input ML model to enhance its runtime…

Continue Reading→

FuzzyWare

A collection of my thoughts, findings and experiments

Category: Machine Learning

A primer on Sparsity: What is it and Why should we care about it?

Like this:

ML Compilers Part 2: An Overview of Graph Optimizations

Like this:

ML Compilers Part 1: High-Level Intermediate Representation

Like this:

FuzzyWare

A collection of my thoughts, findings and experiments

Share this:

Like this:

Share this:

Like this:

Share this:

Like this: