About Me


Some personal stuff


I work as a data scientist for a large chemical company. Typically, my projects involve figuring out how to apply machine learning to manufacturing use cases to drive value.

I approach data science from a pragmatic perspective, always wanting to make sure that my projects are aimed at some real-life use case (even if whimsical). There's a lot of hype in the field currently, and I'm always interested in making sure that hype turns into something tangible and real.


When I'm not coding or working on side projects, I can commonly be found spending time on other interests such as:

  • Playing board games (I have over 30 games, it's a problem)
  • Cooking
  • Making cocktails, with a special interest in Tiki drinks
  • Looking after my plants
  • Reading fiction and nonfiction
  • PC Gaming (Age of Empires 2 + League of Legends mostly)
  • Making and drinking coffee
  • Watching live sports, with a focus on rugby + American football
  • Enjoying the outdoors on bike or on foot

Feel welcome to get in touch if you'd like to discuss some of these topics further.

In case you like graphics and lists...


In case you're just looking for a pdf resume, here it is. Another way to get the bullet points would be to look at my LinkedIn Page.

Work Experience (click for more details):

This began as a part of BASF's rotational new hire program, but turned into my full time role after 8 months. My role primarily is to identify and scope use cases to bring data science and analytics for manufacturing use cases, usually for process optimization. Deliverables can range from written reports, field experiments, self-suffiencent web applications, APIs around machine learning models, or anything else that the use case requires. A typical tech stack includes Python/R, Docker, Kubernetes, and MS Azure.

In addition to the more nitty gritty coding parts of the job, I give internal trainings around data science concepts for leadership level as well as hands-on courses for engineers. To further grow BASF's data science talent pool, I also manage and mentor data science and engineering graduates in BASF's rotational programs.

This was an 8 month rotation as a part of BASF's new hire rotational program. In this role, I created a MS Access tool for salespeople to better store and manage data around customer expected purchasing volume. I also performed analysis and reported on various aspects of the industrial petrochemical market in North America. For many of these analyses, I was able to bring mathematical rigor into the role by using techniques like linear programming and hypothesis testing. Many of these analyses were automated using Excel Macros.

This was an 8 month rotation as a part of BASF's new hire rotational program. I used Lean Six Sigma methodology to analyze root causes for manufacturing defects and inefficiencies. This applied to aspects of formulatioms, process cycle times, machine malfunctions, and many other parts of the process.

In this 3 month internship, I assumed responsibility for the day-to-day operations of demineralized water, cogeneration, and steam operations for a multi-plant site. I analyzed data trends for events that could require escalation, and at times had to collect samples in order to recommend chemical additive adjustments. I also researched, ran financial scenarios, and delivered a written report for a new chemical process technology that was applicable to the process.

In this 3 month internship, I researched and reported various aspects of rail shipping to suggest new KPIs for the business to consider. Additionally, I trained myself on the new Chemlogix Sterling TMS software system and wrote an instruction manual for other users in the company. I also evaluated suppliers as a part of an RFI and bid process.

Education:

I completed UC Berkeley's Master of Information and Data Science program from the School of Information in the summer of 2020. I completed all coursework and projects while working full time, which allowed for a great experience of learning concepts one day and using them in my job the next. My final capstone project revolved around image augmentation to lessen the effects of distribution gaps that exist in datasets, which can be seen in my projects.

I graduated from UT Austin with a BS in Chemical engineering and also completed a Business Foundations Certificate, UT's version of a business minor. I was heavily involved with chemical engineering organizations like AIChE. My main organizational affiliation, however, was with UT's rugby team where I started all 4 years and held officer positions like president, vice president, and recruiting officer.

Skills:

Excuse the mess while this section is still being worked on, I have plans for a nice D3 visualization. In the meantime, I'm proficient in R and Python, and knowledgeable in SQL. I have experience with Spark, bash scripting, Docker, and Kubernetes. My main front end tools are R Shiny and Dash, although I am using this website to get more experience with HTML + Javascript and especially D3. I am knowledgeable about statistics and machine learning methods. For now, I'll leave the rest for the viz when it comes!

Projects:

This project was done with the partnership of other UC Berkeley MIDS students as a part of the program capstone. We explored using different types of data augmentation to create different datasets in order to observe if they effectively changed models' performance across datasets. This work followed up on previous work coming out of UC Berkeley, which found the existence of a distribution gap across classifiers for CIFAR-10 and Imagenet (Recht et al, 2019). A full writeup can be seen here, and the github repository is here.