[Misc] Scraps from the GEMMA on FPGA Project
Some discarded byproducts from the GEMMA on FPGA project...
Since "vibe coding" is so trendy these days, I've been trying it out here and there. After actually using it, though, I feel that AI-generated code still has a lot of shortcomings. Relying entirely on AI for all the logic is not the way to go.
This text contains the core concepts and mathematical principles of the Transformer model architecture.
These notes organize the architecture and training process from the GPT-1 paper, combining mathematical definitions with intuitive interpretations.
Cherry blossoms are starting to bloom — and so is this new site.
I finally set up a proper personal homepage. What used to be just a GitHub Profile README is now a full static site powered by Docusaurus.
A quick summary of notable updates in deep learning inference, GPU architecture, and HPC this week.
The book that helped me most when I first started learning CUDA programming.
While studying deep learning inference optimization, I explored how memory access patterns in CUDA kernels dramatically affect performance.