My name is Sophia. I am a self-motivated data professional who believes in the power of exploratory data analysis, reproducible workflow, and striving towards automated operational decisions.
My data story starts with my first R
class at Carnegie Mellon University in 2014. I worked in a multinational company before I pursued my master degree in CMU. One of my previous job duties is to update the slides every month with the new marketing and sales numbers and present them in the management meeting. Updating the slides was painful and tedious, and it often occupied my weekends. When I was first introduced to R in my class, I was fascinated by the analysis and reproducibility power within R. From then on, I have been exploring the wonderful data world that enables quality control, replicable workflow, and data-driven decision making.
I am a big fan of Python
and R
. I am also highly proficient in advanced SQL
and Tableau
, fluent in Pandas
. I enjoy using data visualizations to explain complex logic and interpret models. I am a continuous learner. I use this blog to keep notes, review and share statistical modeling, machine learning, and programming knowledge.
Regardless of sector, I am always excited about the opportunities to make data for good.