Connecting the Bytes
January 30, 2017 | ORNL
Supercomputers like Oak Ridge National Laboratory’s Titan are advancing science at a frenetic pace and helping researchers make sense of data that could have easily been missed, says Ramakrishnan “Ramki” Kannan.
Kannan, a computer scientist who came to ORNL in March 2016 after earning a doctorate from Georgia Institute of Technology, points to a bygone era when scientists often only had to focus on one instrument at a time. They could log data and not miss a thing. Now, however, experiments can involve several instruments, like the sophisticated ones at the Department of Energy’s Spallation Neutron Source and Center for Nanophase Materials Sciences at ORNL. The sheer amount of data generated can be overwhelming.
Enter Kannan’s distributed machine learning tool, which collects and sorts enormous amounts of data in a fraction of the time other methods require. It helps researchers extract the most from Titan and its 18,688 nodes (20 petaflops) of computing power.
“This technology condenses the information into what’s significant, enabling us to better understand very large high-dimensional data,” said Kannan, who noted that computers can provide multiple perspectives that humans cannot. “Scientists can extract the data they want to see, but it sometimes helps them to have even more data than they originally intended.”
This innovation was made possible through a project funded by ORNL’s laboratory directed research and development program. Kannan designed what he describes as off-the-shelf data analysis algorithms with some important modifications to handle large amounts of scientific and Internet data with amazing speed and efficiency. The native of India noted that his approach provides scientists with the fastest and best program available, and the information could help guide policy-makers.
To accomplish his goal, Kannan minimized the amount of traffic among computers, pooled multiple messages and communications into bigger message sizes – similar to volume discounts and economies of scale – and sequenced operations to avoid unnecessary communication.
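The benefit of pooling messages can be illustrated with a simple latency-plus-bandwidth cost model, in which every message pays a fixed startup cost on top of a per-byte transfer cost. The sketch below is not Kannan's code: the `LinkModel` class, the `batch_sizes` helper and the latency and bandwidth figures are all hypothetical, chosen only to show why fewer, larger messages cost less than many small ones.

```python
from dataclasses import dataclass


@dataclass
class LinkModel:
    """Hypothetical cost model for one network link (illustrative values only)."""
    latency_s: float         # fixed cost paid once per message
    seconds_per_byte: float  # transfer cost paid per byte sent

    def send_time(self, num_messages: int, total_bytes: int) -> float:
        # Total time = per-message startup cost + per-byte transfer cost.
        return num_messages * self.latency_s + total_bytes * self.seconds_per_byte


def batch_sizes(message_sizes, max_batch_bytes):
    """Greedily pack consecutive messages into batches.

    A batch is closed when adding the next message would exceed
    max_batch_bytes. A single message larger than the limit still
    travels alone in its own batch.
    """
    batches, current = [], 0
    for size in message_sizes:
        if current and current + size > max_batch_bytes:
            batches.append(current)
            current = 0
        current += size
    if current:
        batches.append(current)
    return batches


# Assumed numbers: 1 microsecond startup cost, 1 nanosecond per byte.
link = LinkModel(latency_s=1e-6, seconds_per_byte=1e-9)
sizes = [100] * 1000  # one thousand 100-byte messages

unbatched = link.send_time(len(sizes), sum(sizes))
batches = batch_sizes(sizes, max_batch_bytes=10_000)
batched = link.send_time(len(batches), sum(batches))

print(f"messages sent: {len(sizes)} unbatched, {len(batches)} batched")
print(f"modeled time:  {unbatched:.6f} s unbatched, {batched:.6f} s batched")
```

The same number of bytes crosses the network in both cases. Only the count of per-message startup costs changes, which is the "volume discount" the article describes.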
In the laboratory, his technique can capture even molecular movements in stunning detail, eliminate background noise and identify precisely when a significant event occurred. In other application areas, the technique is useful for analyzing video of highways and intersections, which could aid in the design of better roads and help reduce congestion. It can also help researchers better understand trending social topics in near real time at different geographical levels, from rural to urban.
Some of Kannan’s latest work is detailed in a paper available via arxiv.org. This math-intensive research, performed with Grey Ballard of Wake Forest University and Haesun Park of Georgia Tech, explores efficient parallel algorithms for analyzing large data sets.
Before joining ORNL’s Computational Data Analytics Group, Kannan worked in research and product development for IBM, where he earned several honors. These include Master Inventor, the Authorship Award for sustained contributions to IBM’s intellectual property through publications, conference papers, patents and articles, and the IBM Fifth Plateau Award. His passion for collaborative research, however, led him to ORNL, where he has the chance to work with scientists in pursuit of discoveries that will benefit humanity.
“Every day provides me with a new set of challenges and the excitement of making contributions across a wide range of disciplines from materials to the environment and human health,” Kannan said. “It’s inspiring to think about where we’re headed with deep learning and knowledge discovery.”
Kannan is especially looking forward to working with Summit, scheduled to be commissioned at ORNL in 2018. Summit will feature more than five times the computational performance of Titan.