If Berkeley EECS taught me anything, it's that perseverance is just repeatedly ignoring your own internal segmentation faults. On a more serious note, studying at Berkeley was one of the best experiences of my life, and I wanted to summarize some of the lessons I've learned here. I hope that it can be useful for others, and that it can serve as a reference for my future self.
Freshman Year: Course Overflow
When I first set foot on Berkeley campus in the fall, I really had no idea what to expect. On one hand, I was excited to be at a place with such a rich computer science history. On the other hand, I did not have much idea of what I wanted to do. I think this lack of direction motivated me to take quite a few courses my first semester, so as to get a sense of what EECS even was. For this reason, from a course load perspective, freshman year was likely the hardest year of my college career. But courses are just one dimension of Berkeley, and many of the future years would be much harder (more on that later).
Many of my early Berkeley memories involve late nights wrestling with new concepts. One night stands out vividly: sitting in the engineering dorm's living room until 4 AM, struggling to understand page tables for an exam the next morning. These sorts of moments were common, as I frequently found myself working late until a tricky concept finally clicked. In hindsight, it was mostly my excitement to be at Berkeley that kept me going.
Unfortunately, I do not have too many interesting stories to tell from freshman year. As bland as it sounds, most of my life was spent absorbing the material in my classes. Of course, there are some very obvious downsides to this, such as not trying out many clubs or other activities. However, there is no such thing as a free lunch, and I was able to learn a lot from my classes. I would not necessarily recommend this approach to others, but it worked for me. Luckily, I had done enough exploring in my first year that I realized I had a liking for computer systems. I gravitated towards courses such as computer architecture and databases much more than the circuit analysis classes I had to take. That's not to say that I did not enjoy the circuit analysis classes, but I did not see myself dedicating the time to really excel in them. As a natural next step, I decided to take advantage of the great CS research faculty at Berkeley and get involved in research my sophomore year.
Sophomore Year: Introduction to Research
I had emailed my databases professor looking for a research opportunity. The funny thing is that I had literally no idea what computer science research even was. I did not even know if I would like it at all, but I thought I might as well check it out since there are some cool labs at Berkeley. In retrospect, picking databases as a research topic was quite random and really only because databases was one of the few upper level courses I had taken as a freshman.
Fortunately, the faculty I emailed actually forwarded me to one of his PhD students, who was in need of an undergraduate student to help with her projects. Since I was just a sophomore who was getting his first taste of research, I was given some well-defined tasks to help out with on a larger project that was already mostly complete. My contributions to the project were not particularly groundbreaking, but I did enjoy the experience quite a bit and wanted to see more. I was also fortunate to have a great mentor who was willing to guide me through the process of doing research.
After the original project was complete, I was given a bit of flexibility to pick the general area for the next project I could help with. The lab I was working in was the major systems lab at Berkeley, so many of the PhD students came from systems backgrounds. However, this was right around the time when ChatGPT was released, and LLMs were gaining popularity. So, many systems folks were starting to do machine learning + systems research, which basically meant optimizing inference/training of LLMs. Because of this, one of the directions that was proposed to me was a machine learning + systems project related to GPU memory optimizations for training. However, at this point, I was only in the middle of my neural networks and machine learning courses, and I had no experience with GPUs. But, I thought that machine learning + systems was a really cool area, and this was a great opportunity to learn about it.
The second half of my sophomore year consisted of taking more courses related to machine learning and systems, while working on this new research project. I had been given a lot more responsibility in the project, compared to my first semester. The crux of the project revolved around formulating a tractable optimization problem for searching over a space of memory allocation strategies. Since I was taking a convex optimization course that semester, I was able to use some of the tools I had learned to come up with possible solutions. However, I was still very much a beginner in machine learning, and I had a lot of catching up to do on the LLM front. Ultimately, while I was able to create some interesting optimization problems, my lack of experience led me to overlook some serious flaws. For example, to really implement the chosen optimized solution, I would need to modify the XLA compiler, which is not a simple engineering task. This led to many weeks of hacking away at the XLA codebase, with not much to show for it, given my lack of serious engineering experience. In the end, the project did not end up being as successful as I had hoped, but I did learn a lot about machine learning and systems. Though, most importantly, I learned a lot of lessons about being resilient in the face of research setbacks. These learnings would come in handy later on.
Junior Year: Focusing on Research
At the start of my junior year, the idea of doing a PhD had started to occupy my mind. I had a lot of doubts about whether or not I should pursue a PhD, for a variety of reasons. One reason was that I had not enjoyed much success in my research endeavors, and I was not sure if/when that would change. Another reason was that industry options for computer science are generally pretty good without a PhD, so I was not sure if it was worth it. But, I knew that if I wanted to at least have the option of doing a PhD, I would need to focus on research my junior year. In order to gather as much research experience as possible, I decided to also join the machine learning lab at Berkeley, in addition to the systems lab I was already working in. So, I was working on two research projects concurrently, which was quite challenging. To compensate, I decided to take a lighter course load my junior year.
My research experiences in my junior year were truly amazing, and I consider it to be the most valuable experience I gained at Berkeley. Since I had already gotten an intro to research in my sophomore year, I was able to hit the ground running in my junior year. In the machine learning lab, I was working on projects related to AI agents and efficient LLM inference. The PhD students I was working with were some of the best in the world at things like LLM quantization and HW-SW co-design, so I was able to learn a lot of new and interesting techniques. In the systems lab, I was working on projects related to AI + databases, which was particularly interesting to me since I had an initial interest in databases, before I got into machine learning. So, by working on AI + databases, I was able to combine my two main interests. In my sophomore year, when I was struggling a lot on machine learning, I would wonder if I was better off staying in pure database world. But, I am glad I stuck with it, because I think I would have missed out on some interesting work at the intersection of AI and databases otherwise. One lesson I gathered is the importance of having a multi-disciplinary approach to research. Technologies are always changing and it is important to always be learning new things, rather than getting stuck in a single area. Of course depth is very important, but so is breadth.
To save time, I will briefly elaborate on one project in the systems lab that I was involved in, which best captures my current interests. In the systems lab, I was working on a project called LOTUS, which proposed semantic operators as a way of interacting with database systems using LLMs. Just how standard relational operators like JOIN, SELECT, etc. are used in SQL, semantic operators are used in LOTUS to interact with databases using natural language to do things like filtering, projection, etc. I got quite a lot of engineering experience building the LOTUS system, and the project got some traction in the database and open source communities, which was nice to see. From a research perspective, it opened my eyes to the importance of building declarative systems for LLM apps and having optimizations that are transparent to the user. There were a lot of other works at this time with very similar philosophies (DSPy being a notable one), so I am excited to see how the community moves in these directions.
During my junior year, I was also able to take graduate courses such as advanced computer systems or hardware for machine learning. I would recommend Berkeley students to take a couple of these courses, as they may shed light on interesting problems to be thinking about. Unlike typical undergraduate courses, graduate courses have a lot more paper reading and discussion components. As an undergraduate in those classes, I was not able to contribute much to those discussions, but I still found them to be very useful since I could see how top notch PhD students and professors were thinking about problems. Speaking of that, the PhD students and advisors I had at Berkeley were the people who shaped my undergraduate experience. I think that my way of thinking about research and computer science is just distilled from the people I have been fortunate to know. I would strongly urge anyone to find mentors who are doing interesting work, and to learn as much as possible from them.
Conclusion
I won't write about my senior year, since I am just finishing it up and have not yet been able to think about it looking backwards. The way I would have written the previous sections would have been very different if I wrote them 1 or 2 years ago. This is really a testament to how much I have experienced and grown each year at Berkeley. Each year at Berkeley was a rollercoaster of ups and downs, but I am grateful for all of them. The biggest lessons came exactly when I felt stretched to my limits—debugging at 4 AM, chasing research ideas, and juggling competing priorities.