Project
In the summer of 2024, I worked at University Health Network (UHN), engineering pipelines for data processing and analysis. I built data pipelines and visualization tools in Python, containerized the resulting tool with Docker, and designed and built fully automated continuous integration (CI) pipelines with GitHub Actions. Outside of my technical work, I organized a trivia night for UHN summer interns, with over 60 interns showing up for a fun night of questions, pizza, and conversation! I was also recognized with the Arthur Axelrad award for most outstanding summer student at the Princess Margaret Cancer Centre at UHN.
The engineering work involved mining multiple bioinformatics databases for hundreds of features relating to thousands of biology and healthcare topics. Much of the problem-solving in these pipelines went into optimizing the payloads sent to these database APIs, which reduced average API latency by 48%. I also wrote and ran over 50 unit tests using the pytest framework, and used these tests to build a continuous integration pipeline with GitHub Actions for the tool.
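To give a flavour of what a payload optimization can look like, here is a minimal sketch of one common approach: grouping identifiers into batches so that fewer, larger requests are sent. It is an illustration only, not the project's actual code. It assumes the API accepts several identifiers per request, and the names `chunked`, `fetch_features`, `fetch_batch`, and the default batch size are all hypothetical.

```python
from typing import Callable, Dict, Iterable, Iterator, List


def chunked(ids: Iterable[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive batches of at most `size` identifiers."""
    if size < 1:
        raise ValueError("size must be at least 1")
    batch: List[str] = []
    for identifier in ids:
        batch.append(identifier)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly shorter, batch.
        yield batch


def fetch_features(
    ids: Iterable[str],
    fetch_batch: Callable[[List[str]], Dict[str, dict]],
    batch_size: int = 100,  # hypothetical default; real limits depend on the API
) -> Dict[str, dict]:
    """Collect features for all ids using one call per batch.

    `fetch_batch` is a stand-in for whatever function performs the real
    request; it takes a list of ids and returns a mapping of id -> features.
    """
    results: Dict[str, dict] = {}
    for batch in chunked(ids, batch_size):
        results.update(fetch_batch(batch))
    return results


def test_fetch_features_batches_requests():
    """pytest-style unit test using a fake fetcher instead of a real API."""
    calls = []

    def fake_fetch(batch):
        calls.append(list(batch))
        return {identifier: {"id": identifier} for identifier in batch}

    result = fetch_features(["a", "b", "c", "d", "e"], fake_fetch, batch_size=2)

    # Five ids with a batch size of two should produce three requests.
    assert calls == [["a", "b"], ["c", "d"], ["e"]]
    assert set(result) == {"a", "b", "c", "d", "e"}
```

Passing the fetcher in as a parameter keeps the batching logic testable without network access, which is the kind of test that can run on every push in a CI pipeline.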
Technologies
Python
GitHub Actions
Docker
pytest
Pandas