Digital ASIC Designer · Computer Architecture · PhD Candidate, University of Toronto
Hi! I'm a digital ASIC and computer architecture engineer focused on energy-efficient hardware for AI, signal processing, and edge-intelligence systems. I am a PhD candidate in the Intelligent Sensory Microsystems Lab at the University of Toronto, advised by Prof. Roman Genov.
Over five years of my PhD, I have worked across the full chip-design stack: defining system architecture, translating algorithms into microarchitecture, writing synthesizable RTL, integrating digital subsystems into mixed-signal SoCs, closing timing, validating on FPGA, and supporting silicon bring-up. Across four 65 nm CMOS tape-outs, I have contributed to streaming data paths, DSP/filtering pipelines, adaptive control, feature extraction, ML inference, packetization, and hardware/software validation.
My strengths are in RTL design, microarchitecture, SoC integration, and ML accelerator design. I enjoy working at the boundary between algorithms and hardware — training and evaluating ML models, understanding their compute and memory requirements, and mapping them into efficient digital architectures for real-time inference. Results published at IEEE ISSCC, VLSI Technology and Circuits, IEEE TBioCAS, and IEEE BioCAS. Recipient of the NSERC PGS-D Scholarship for 2023–2025.
Designed low-power digital control logic for adaptive dynamic-range operation of an energy-efficient zooming ADC front-end, featuring three incremental resolution modes for a multi-channel neural sensing SoC. Click to view the same detailed project write-up from the Projects tab.
Designed RTL for a scalable mode-switching controller for an inverter-based AFE, a band-pass digital filter, and an area-efficient ML accelerator for event-driven neural signal detection and processing. Enabled best-in-class 10 nW/channel event-driven operation.
Led RTL system design and top-level digital integration for a split-inference AI SoC, owning feature extraction, classification, ML processing, digital filtering, control logic, and SoC data-path integration. Accepted at IEEE VLSI 2026.
Designed an Edge AI accelerator and data-path blocks, including a configurable processing element, adaptive AFE mode-switching controller, DSP/filtering pipeline, and data packetization logic. Led digital integration with a custom pad-frame and optimized the data path for efficient wireless transceiver operation.
Designed low-power digital control logic for adaptive dynamic-range operation of an energy-efficient zooming ADC front-end, featuring three incremental resolution modes for a multi-channel neural sensing SoC. Led microarchitecture, RTL, simulation, synthesis, block integration, and post-silicon validation.
Designed RTL for a scalable mode-switching controller for an inverter-based AFE, a band-pass digital filter, and an area-efficient ML accelerator for event-driven neural signal detection and processing. Enabled best-in-class 10 nW/channel event-driven operation. Published at IEEE ISSCC 2025.
Led RTL system design and top-level digital integration for a split-inference AI SoC, owning feature extraction, classification, ML processing, digital filtering, control logic, and SoC data-path integration. Designed low-power accelerator/control blocks to reduce on-chip compute and off-chip communication energy while preserving real-time inference behavior. Accepted at IEEE VLSI 2026.
Designed an Edge AI accelerator and data-path blocks, including a configurable processing element, adaptive AFE mode-switching controller, DSP/filtering pipeline, and data packetization logic for edge-inference SoC operation. Led digital integration with a custom pad-frame and optimized the data path for efficient wireless transceiver operation.
Integrated a split-inference Edge AI SoC with a Zynq-based neural recording/stimulation platform, enabling real-time neural-data streaming into the chip's on-chip spike-classification engine and retrieval of inference outputs. Designed Verilog interface/data-path logic and custom PCB for modular chip-to-platform validation; validated the workflow from model pre-training to real-time on-chip classification.
Developed a Zynq-7030 FPGA-based closed-loop validation platform for neural recording and stimulation, connecting custom front-end PCBs, neural interface ASICs, embedded Linux control, and Python-based visualization. Designed RTL streaming pipelines with FIFO buffering, packet formatting, AXI/APB control, and embedded Linux data streaming for real-time acquisition and closed-loop intervention. Published at IEEE BioCAS 2025.
Integrated NVIDIA's NVDLA open-source ML accelerator core with a PULP/OpenHW Group RISC-V core. Set up FPGA emulation for the RISC-V subsystem prior to synthesis and place-and-route, and explored IP integration, memory mapping, and connectivity challenges for a streamlined accelerator-based SoC integration.
FPGA-based verification environments for pre-silicon RTL validation of ML accelerator and DSP blocks across all four tape-outs, including stimulus generation, functional coverage, interface emulation, and Python-based post-processing flows for ASIC bring-up support.
FPGA platforms used extensively for pre-silicon verification, bring-up support, and rapid prototyping across all four tape-out projects.
Notes on research, digital ASIC design, edge-AI hardware, neural interfaces, and the practical thinking behind technical work. The latest Substack posts are pulled into this section automatically and shown newest first.