The Archive

THE
EVOLUTION
OF A VISION

Tracing the trajectory from raw lines of code to engineered pipelines. A non-linear exploration of milestones, failures, and breakthroughs.

Personal Journey

Started Data Engineering Journey

Began learning SQL, Python, and the fundamentals of data pipelines. Completed first online data engineering course.

Stellenbosch University

Started BEng Data Engineering

Began learning engineering & applied mathematics along with probability theory and data science. Started my computer programming journey with the C programming language. Also learned electrotechniques along with engineering physics and chemistry.

C programmingR languagewolfram
Personal Journey

2nd Year Data Engineering Studies

Continuation of engineering & applied mathematics journey. Studied computer science principles and algorithms using Java object-oriented programming (OOP). Learned data engineering concepts e.g Big Data, Datastore, ACID. Computer systems, logic design and signal analysis were also covered in depth.

JavaPythonSQL
Personal Journey

3rd Year Data Engineering Studies

The year of furthering computer science and data analysis skills. A lot of projects: a github webscraper and custom API, a full-stack web application and flutter app that can play music (spotify API), generate lyrics and meaning of songs (ChatGPT+Gemini APIs), and an STM32 microcontroller smart light project.

JavaScriptDartC
Personal Journey

4th Year Data Engineering Studies

The year of intense mathematical statistics. Highlists include an entrepreneurship course and presentation of a business idea. Other areas of focus where operations research and optimisation, machine learning and deep learning. Complementary skills such as project management, environmental engineering and philsophy were also covered.

PythonExcelMS Access
Personal Journey

5th Year Data Engineering Studies

The only major highlight of this year was my final year project: A comparison of open-source diarisation tools for audio data. I built a custom audio diarisation pipeline using Python and evaluated it against existing tools like pyannote.audio. The rest of the work was more theory (some reduntant), with my main focus on obtaining my degree and graduating.

PythonAudio Processing
Unemployed

Junior Data Engineer

Built and maintained ETL pipelines using Apache Spark and Airflow. Worked with data warehouses on AWS Redshift.

Apache SparkAirflowAWS Redshift

7

Milestones

16

Skills

01

Degree

Curiosity

READY TO BUILD
THE FUTURE?