Hey visitor, don't remember why you are here? Good, I’ll take this chance to introduce myself :-)
My name is Chengkai Yang. I have almost 9 years of work experience in big data development and back-end development in the fields of autonomous driving, finance, and banking.
I am currently pursuing a master degree in Applied Data Science at the University of Victoria.
I believe that my past engineering background and the data science I am currently studying can help me solve problems more effectively in the future.
I use Java, Scala and Python, as well as the following technology stack.
Apr 2023 - Dec 2024 Beijing,China
Used HBase, Spark and Flink analyzing the trajectory of autonomous vehicle data (TB-scale daily) to assess the performance of the vehicle.
Participated in building an image annotation platform (Using Spring Framework) to help the algorithm team manage and store datasets, to train the algorithm more efficiently.
For more details about my employment history, please go to my LinkedIn page.
The courses that I am currently studying or planning to study in MADS at the University of Victoria include:
Systems for Massive Datasets: The reason why I chose this course is that it teaches how to use frameworks such as Spark to run algorithms like ML on large datasets to solve problems.
Introduction to Parallel and Cluster Computing: I have used several parallel computing frameworks such as Spark and MapReduce. This course will specifically cover the design principles of parallel computing.
Other courses of MADS mainly include Machine Learning, which is also a topic that I am very interested in. See the link for details.
I am also a contributor to the community of the well-known real-time processing framework Apache Flink.
My contributions mainly include translating the official English documentation of Flink into Chinese, which helps thousands of Chinese developers use Flink better. Here is a main list of my contributions.
Translate “Metric Reporters” page of “Deployment” This article mainly introduces how to use monitoring tools such as Prometheus and JMX to better monitor the running status of Flink.
Translate “Extension Points” and “Full Stack Example” in “User-defined Sources & Sinks” page:This article mainly introduces how users can customize the Table API to read and write data to external storage in addition to the Table connectors provided by Flink officially (such as Kafka connector, JDBC connector).
I really enjoy doing weight training at the gym.
I believe that long-term physical exercise can improve my concentration and perseverance.
Currently, I train at the gym of the University of Victoria, and it is also a great place to meet new friends and train together.