International Scientific Journal of Engineering and Management

An International Scholarly || Multidisciplinary || Open Access || Indexing in all major Database & Metadata
The journal follows the UGC Guidelines and is evaluated for inclusion in the Web of Science
ISSN: 2583-6129

Impact Factor: 8.072

TSR-GEMM: Tile-Selective Precision Recovery for Robust Mixed-Precision Matrix Multiplication on GPU Tensor Cores

Version
File Size 1.08 MB
Downloads 0
Files 1
Published 20 April 2026
Updated 20 April 2026

TSR-GEMM: Tile-Selective Precision Recovery for Robust Mixed-Precision Matrix Multiplication on GPU Tensor Cores

 

Authors:

Dr. Pavithra L1, Vedant Singh Chauhan2, Ananya Singh3, Vinayak Shrivastava4, Abhinav Rai5

1Department of Computational Intelligence, SRMIST

Chennai, India

{vc2685, as1178, vs, ar}@srmist.edu.in

Abstract:

Modern deep learning frameworks increasingly rely on mixed-precision general matrix multiplication (GEMM) to exploit the throughput advantages of half-precision (FP16) Tensor Cores on NVIDIA GPUs. While FP16 GEMM delivers substan-tial speedups over FP32 computation, it introduces numerical errors that are spatially non-uniform across the output matrix—concentrated in tiles whose input sub-blocks exhibit high condi-tion numbers or significant cancellation. Existing recovery mech-anisms, such as iterative refinement, operate at matrix-global granularity and therefore cannot exploit this spatial locality. We introduce TSR-GEMM (Tile-Selective Residual GEMM), a three-phase mixed-precision GEMM pipeline that (1) performs the bulk computation in FP16 using Tensor Cores while simultaneously accumulating per-tile norm statistics, (2) evaluates a lightweight instability score for each output tile based on input panel and output tile norms, and (3) selectively re-computes only flagged tiles in FP32 via cuBLAS. TSR-GEMM exposes a single tunable threshold τ that governs the precision–performance trade-off. On an NVIDIA RTX 3050 Ti GPU across matrix dimensions from 512×512 to 4096×4096, TSR-GEMM achieves FP32-comparable accuracy (5.4 × 108 relative error) at full recovery, while at 70% tile recovery it reduces error by 8× over pure FP16 with only a 12% throughput reduction relative to the no-recovery baseline. The τ sweep reveals a smooth, well-behaved Pareto frontier, confirming the instability score as a reliable predictor of per-tile numerical risk.

Index Termsmixed-precision arithmetic, GEMM, Tensor Cores, CUDA, numerical accuracy, tile-selective recovery, GPU computing

 

Download
or download free
[changelog]

Categories & Tags

Similar Downloads

No related download found!
ISJEM Journal

Author's Blog

What is the difference between a Research Paper and a Review Paper?

A research paper and a review paper are both scholarly documents, but they serve different purposes and have different characteristics....
Read More
Author's Blog

What is DOI?

A Digital Object Identifier (DOI) is a unique alphanumeric string that is used to identify and provide a persistent link...
Read More
Author's Blog

What do you need to do during production of your Research Paper?

During the production of a research paper, the following steps need to be taken: conducting research, organizing and analyzing data,...
Read More
Author's Blog

What are the advantages of publishing a research paper?

Publishing a research paper can have many advantages for researchers, including: Career advancement, professional recognition, opportunities for collaboration, increased visibility,...
Read More
Author's Blog

Ways to Support your Academic Wellbeing which preparing the Research Paper/Article

To support your academic wellbeing while publishing a research paper, it's important to set realistic goals, manage your time effectively,...
Read More
Author's Blog

How to improve your Research Paper writing Skills?

Read extensively: One of the best ways to improve your research paper skills is to read extensively in your field...
Read More
Author's Blog

Is DOI compulsory to publish a research paper in a Journal?

DOI is not strictly required to publish a research paper, but it is highly recommended. Basically, the International Scientific Journal...
Read More
Author's Blog

In what ways does research paper give weight to career development?

Publishing a research paper can give weight to a researcher's career development in several ways, such as: establishing oneself as...
Read More
Author's Blog

How to develop a Research Paper from Scratch

Developing a research paper involves several steps including: choosing a topic, conducting background research, formulating a research question or hypothesis,...
Read More
Author's Blog

How Plagiarism report plays crucial role in Research Paper Publication?

Plagiarism is a major concern in the academic and research community, as it undermines the integrity of the research and...
Read More