News

News Highlights

Cadence Reports Q2 2025 Financial Results

New Podcast Episode Drop: Optimize the Interconnect and the Future of HDI

Zuken to Showcase Defence & Security-Focused Electronic Systems Design Solutions at DSEI 2025

More News
Books

Featured Books

Download

Download

Download
design007 Magazine

Latest Issues

Current Issue
July 2025
Showing Some Constraint

A strong design constraint strategy carefully balances a wide range of electrical and manufacturing trade-offs. This month, we explore the key requirements, common challenges, and best practices behind building an effective constraint strategy.

Flip Book PDF Download

June 2025
All About That Route

Most designers favor manual routing, but today's interactive autorouters may be changing designers' minds by allowing users more direct control. In this issue, our expert contributors discuss a variety of manual and autorouting strategies.

Flip Book PDF Download

May 2025
Creating the Ideal Data Package

Why is it so difficult to create the ideal data package? Many of these simple errors can be alleviated by paying attention to detail—and knowing what issues to look out for. So, this month, our experts weigh in on the best practices for creating the ideal design data package for your design.

Flip Book PDF Download
Articles

Article Highlights

Setting Design Constraints Effectively

From Attraction to Action: Where Marketing Ends and Sales Begins

I-Connect007 Editor’s Choice: Five Must-Reads for the Week

More Articles
Columns

Latest Columns

Connect the Dots: Sequential Lamination in HDI PCB Manufacturing

Target Condition: The 5 Ws of PCB Design Constraints

Fresh PCB Concepts: More Than Compliance—A Human-centered Sustainability Approach

See all of our columnists
Links
Media kit

Media Kit - Choose Your Primary Marketing Focus:

Faster Big-Data Analysis

October 31, 2017 | MIT

Estimated reading time: 5 minutes

We live in the age of big data, but most of that data is “sparse.” Imagine, for instance, a massive table that mapped all of Amazon’s customers against all of its products, with a “1” for each product a given customer bought and a “0” otherwise. The table would be mostly zeroes.

With sparse data, analytic algorithms end up doing a lot of addition and multiplication by zero, which is wasted computation. Programmers get around this by writing custom code to avoid zero entries, but that code is complex, and it generally applies only to a narrow range of problems.

At the Association for Computing Machinery’s Conference on Systems, Programming, Languages and Applications: Software for Humanity (SPLASH), researchers from MIT, the French Alternative Energies and Atomic Energy Commission, and Adobe Research recently presented a new system that automatically produces code optimized for sparse data.

That code offers a 100-fold speedup over existing, non-optimized software packages. And its performance is comparable to that of meticulously hand-optimized code for specific sparse-data operations, while requiring far less work on the programmer’s part.

The system is called Taco, for tensor algebra compiler. In computer-science parlance, a data structure like the Amazon table is called a “matrix,” and a tensor is just a higher-dimensional analogue of a matrix. If that Amazon table also mapped customers and products against the customers’ product ratings on the Amazon site and the words used in their product reviews, the result would be a four-dimensional tensor.

“Sparse representations have been there for more than 60 years,” says Saman Amarasinghe, an MIT professor of electrical engineering and computer science (EECS) and senior author on the new paper. “But nobody knew how to generate code for them automatically. People figured out a few very specific operations — sparse matrix-vector multiply, sparse matrix-vector multiply plus a vector, sparse matrix-matrix multiply, sparse matrix-matrix-matrix multiply. The biggest contribution we make is the ability to generate code for any tensor-algebra expression when the matrices are sparse.”

Joining Amarasinghe on the paper are first author Fredrik Kjolstad, an MIT graduate student in EECS; Stephen Chou, also a graduate student in EECS; David Lugato of the French Alternative Energies and Atomic Energy Commission; and Shoaib Kamil of Adobe Research.

Custom kernels

In recent years, the mathematical manipulation of tensors — tensor algebra — has become crucial to not only big-data analysis but machine learning, too. And it’s been a staple of scientific research since Einstein’s time.

Traditionally, to handle tensor algebra, mathematics software has decomposed tensor operations into their constituent parts. So, for instance, if a computation required two tensors to be multiplied and then added to a third, the software would run its standard tensor multiplication routine on the first two tensors, store the result, and then run its standard tensor addition routine.

In the age of big data, however, this approach is too time-consuming. For efficient operation on massive data sets, Kjolstad explains, every sequence of tensor operations requires its own “kernel,” or computational template.

“If you do it in one kernel, you can do it all at once, and you can make it go faster, instead of having to put the output in memory and then read it back in so that you can add it to something else,” Kjolstad says. “You can just do it in the same loop.”

Computer science researchers have developed kernels for some of the tensor operations most common in machine learning and big-data analytics, such as those enumerated by Amarasinghe. But the number of possible kernels is infinite: The kernel for adding together three tensors, for instance, is different from the kernel for adding together four, and the kernel for adding three three-dimensional tensors is different from the kernel for adding three four-dimensional tensors.

Many tensor operations involve multiplying an entry from one tensor with one from another. If either entry is zero, so is their product, and programs for manipulating large, sparse matrices can waste a huge amount of time adding and multiplying zeroes.

Hand-optimized code for sparse tensors identifies zero entries and streamlines operations involving them — either carrying forward the nonzero entries in additions or omitting multiplications entirely. This makes tensor manipulations much faster, but it requires the programmer to do a lot more work.

The code for multiplying two matrices — a simple type of tensor, with only two dimensions, like a table — might, for instance, take 12 lines if the matrix is full (meaning that none of the entries can be omitted). But if the matrix is sparse, the same operation can require 100 lines of code or more, to track omissions and elisions.

Enter Taco

Taco adds all that extra code automatically. The programmer simply specifies the size of a tensor, whether it’s full or sparse, and the location of the file from which it should import its values. For any given operation on two tensors, Taco builds a hierarchical map that indicates, first, which paired entries from both tensors are nonzero and, then, which entries from each tensor are paired with zeroes. All pairs of zeroes it simply discards.

Taco also uses an efficient indexing scheme to store only the nonzero values of sparse tensors. With zero entries included, a publicly released tensor from Amazon, which maps customer ID numbers against purchases and descriptive terms culled from reviews, takes up 107 exabytes of data, or roughly 10 times the estimated storage capacity of all of Google’s servers. But using the Taco compression scheme, it takes up only 13 gigabytes — small enough to fit on a smartphone.

“Many research groups over the last two decades have attempted to solve the compiler-optimization and code-generation problem for sparse-matrix computations but made little progress,” says Saday Sadayappan, a professor of computer science and engineering at Ohio State University, who was not involved in the research. “The recent developments from Fred and Saman represent a fundamental breakthrough on this long-standing open problem.”

“Their compiler now enables application developers to specify very complex sparse matrix or tensor computations in a very easy and convenient high-level notation, from which the compiler automatically generates very efficient code,” he continues. “For several sparse computations, the generated code from the compiler has been shown to be comparable or better than painstakingly developed manual implementations. This has the potential to be a real game-changer. It is one of the most exciting advances in recent times in the area of compiler optimization.”

Share on:

Testimonial

"The I-Connect007 team is outstanding—kind, responsive, and a true marketing partner. Their design team created fresh, eye-catching ads, and their editorial support polished our content to let our brand shine. Thank you all! "

Sweeney Ng - CEE PCB

Suggested Items

Brent Laufenberg Appointed CIO of the Global Electronics Association, Advancing Technology and Member Services

07/31/2025 | Global Electronics Association
The Global Electronics Association (formerly IPC International Inc.) announces the appointment of Brent Laufenberg as its new Chief Information Officer (CIO).

SES AI Accelerates Timeline for Revenue Growth and Profitability with Acquisition of UZ Energy

07/31/2025 | BUSINESS WIRE
SES AI Corporation, a global leader in the development and manufacturing of AI-enhanced high-performance Li-Metal and Li-ion batteries, today announced it has executed a definitive agreement to acquire 100% of UZ Energy, an energy storage systems (“ESS”) provider, for a purchase price of approximately $25.5 million, subject to earnout adjustment based on the achievement of specified financial targets.

Teramount Raises $50M to Address Growing Demand for AI Infrastructure Optical Connectivity

07/31/2025 | PRNewswire
Teramount, the leader in scalable fiber-to-chip interconnect solutions for AI, data centers and advanced computing, today announced it has raised $50 million in financing led by new investor Koch Disruptive Technologies (KDT). Existing investors Grove Ventures and several new strategic investors, including AMD Ventures, Hitachi Ventures, Samsung Catalyst Fund and Wistron, joined the round.

Hon Hai Technology Group (Foxconn) and TECO Announce Strategic Alliance Targeting AI Data Center Capabilities

07/31/2025 | Hon Hai Technology Group
Hon Hai Technology Group (“Foxconn”) and TECO Electric & Machinery Co Ltd (“TECO”) on Wednesday announced a share exchange, strategic alliance that will strengthen their AI infrastructure capabilities and propel the two Taiwanese tech majors into key markets in the global super-computing race.

Leveraging Chemical Data More Efficiently

07/29/2025 | Lynn L. Bergeson, Bergeson & Campbell
Some truths transcend politics, one being that chemical data holds enduring value and is becoming increasingly essential. In the United States, regardless of which party federally controls the levers of power, it’s clear that chemical manufacturers and their customers must develop and curate robust data portfolios for their chemical inventories. The commercial imperatives driving this are undeniable and gaining traction.

News Highlights

More News

Featured Books

Latest Issues

Showing Some Constraint

All About That Route

Creating the Ideal Data Package

Article Highlights

More Articles

Latest Columns

See all of our columnists

Media Kit - Choose Your Primary Marketing Focus: