Show simple item record

dc.contributor.advisorSmidt, Tess E.
dc.contributor.authorKotak, Mit
dc.date.accessioned2026-02-12T17:12:37Z
dc.date.available2026-02-12T17:12:37Z
dc.date.issued2025-09
dc.date.submitted2025-09-15T16:02:47.302Z
dc.identifier.urihttps://hdl.handle.net/1721.1/164822
dc.description.abstractE(3)-equivariant neural networks have demonstrated success across a wide range of 3D modeling tasks. Until recently, they were bottlenecked due to their high memory and wall-time requirements. In this thesis we first provide an overview of recent GPU kernel efforts by both academia and industry that address this issue. These approaches tradeoff performance for engineering complexity, while still being algorithmically bottlenecked at 10 % GPU utilization. We instead trade off engineering complexity for performance. This not only lowers the barrier to GPU programming but also builds an abstraction layer to reason about future algorithmic innovations that can improve GPU utilization. Our kernel 𝐵3, based on the tiling- optimizations in just 100 lines of PyTorch-like code. We explore the performance-simplicity tradeoff with two case studies and demonstrate the practicality of our kernel workflow through downstream integration with a production model. We hope this work serves as inspiration to broaden and deepen existing equivariant kernel efforts.
dc.publisherMassachusetts Institute of Technology
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleSimplifying Equivariant GPU Kernels through Tile-based Programming
dc.typeThesis
dc.description.degreeS.M.
dc.contributor.departmentMassachusetts Institute of Technology. Center for Computational Science and Engineering
mit.thesis.degreeMaster
thesis.degree.nameMaster of Science in Computational Science and Engineering


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record