Zhen Ze Ong

Logo

View My GitHub Profile

Hello there, I’m Justin!

Data Engineering / Data Modeling / Data Analysis

I am an aspiring data scientist with entry-level professional experience in application development. I mainly work with Python, Java, and SQL in my personal and academic projects. I graduated from Southern Arkansas University with a major in Computer Science and a minor in Mathematics. I enjoy collaborating with people and creating ideas!

Find me on:

website    website    website

Summary of Skills

Programming Languages: Python • Java • SQL • C++
Frameworks: Pandas • NumPy • scikit-learn • Keras • Matplotlib
Front-end: HTML • CSS • JavaScript
Tools: MySQL • Postman • Tableau • PowerBI • Excel • PowerPoint • Word • Git • Spring Boot
IDEs: Jupyter Notebook • Visual Studio Code • PyCharm • IntelliJ IDEA
Soft Skills: Public Speaking • Technical Writing • Collaborative Communication • Project Management

Work Experience

App Dev Intern @ J.B. Hunt Transport

Internship ReportInternship SlidesPresentation Video

In this paid, in-person internship, I participated in multiple 2-week scrum sprints alongside a full-time backend team. We focused on updating and maintaining the 360 Shipper application, where I was assigned stories on EUM calls cleanup, removing and replacing legacy code with modern APIs.

The application was based on the Java Spring Boot framework, and we used Git requests to collaborate through Azure DevOps Services. We also utilized Postman API and Dynatrace to test our services.

What I did:

 

CS Research Assistant @ Southern Arkansas University

I assisted in several machine learning research projects, specifically ViT (vision transformers) object detection. Python and its libraries (Tensorflow, Keras, PyTorch, CUDA) were utilized for our projects.

What I did:

 

Peer Tutor @ Southern Arkansas University

In the Academic Enrichment Center, I worked as an in-person supplemental tutor to peers, providing tailored support to individuals who required assistance in mastering their course material. This role trained my communication and leadership skills.

What I did:

Education

Southern Arkansas University

Bachelor of Science in Computer Science: Cyber Security and Privacy Option, Minor in Mathematics, 4.00GPA

Honors College, International Students Association

Projects

1. Hobbit Face SVM Classifier

GitHub Repo

Skills: Python • NumPy • Pandas • Matplotlib • OpenCV • PyWavelets • scikit-learn • Keras • HTML • CSS • JavaScript

Tools: Jupyter Notebook • VS Code • PyCharm • Flask

Overview:

I searched for and downloaded 50 images for each of the chosen five hobbit actors (Elijah Wood, Sean Astin, Billy Boyd, Dominic Monaghan, and Martin Freeman) on Google. For the preprocessing, I used OpenCV’s Haar cascade classifiers to detect faces and eyes in those images, filtering out the unideal training images. I then stored the cropped facial regions into a separate folder before using PyWavelets to extract the facial regions from them. The combined images of both the original cropped image and Wavelet transformed image were split into train and test sets, which were finally used to train a SVM (support vector machine) model. I used GridSearchCV to determine the best model and parameters.

After exporting the model as a Pickle file, I loaded it in a Flask server that was connected to a HTML/CSS/JavaScript webpage. The webpage allows the user to drop in an image to classify which of the five hobbits the image resembles. It also displays the confidence of the model and can detect multiple faces in a single image.

I was not satisfied with the model’s accuracy, so I went back to the model-building process. Since SVM is considered outdated in modern image classification, I replaced it with a CNN (Convolutional Neural Network) approach. It skips the wavelet transforming process, since CNN can work directly with raw image pixels. It achieved an accuracy of 84.1% after 20-30 training epochs, a hefty 20% improvement from the SVM model.

 

2. Real Estate Price Prediction

GitHub Repo

Skills: Python • NumPy • Pandas • Matplotlib • scikit-learn • HTML • CSS • JavaScript

Tools: Jupyter Notebook • VS Code • PyCharm • Flask • Postman

Overview:

In this regression project, I used a U.S. real estate dataset (2.2M+ entries) on Kaggle that was extracted from Realtor.com to create a prediction model that estimates the price of a property based on house area (square feet), number of bedrooms and bathrooms, and state.

I started by preprocessing the dataset and used it to build a model with scikit-learn using linear regression. The model was then exported as a Pickle file. Next, I created a Python Flask server to run the model and receive GET and POST requests, which I tested using Postman. Lastly, I made a webpage using HTML, CSS, and JavaScript with a user-friendly UI, where the user can enter their desired inputs to get a predicted price.

The model building section covers a majority of data science concepts like data cleaning, outlier removal, feature engineering, dimensionality reduction, one hot encoding, and K-Fold cross-validation.

 

3. Sales Insights using PowerBI

GitHub Repo

Skills: SQL • DAX • Data Visualization

Tools: MySQL Workbench • Microsoft Excel • PowerBI

Overview:

As a data analysis project, I took a sales dataset and used it to create a dashboard in PowerBI. I first used MySQL Workbench and Excel to clean and ETL a sales dataset. I then used PowerBI to analyze and visualize the revenue and profit across different regions, customers, and markets. I learned to focus on the critical areas (profit rather than revenue, etc.) that a sales manager would be interested in to answer and tackle sales problems.

I used basic DAX to return specific data aggregates that could be used in visualizations. I also learned how to integrate the powerful interactive tools in PowerBI to allow my stakeholders to conveniently isolate data within specific conditions. I also took additional feedback to drastically improve my initial dashboard.

 

4. KAMI (Kitchen Assistant and Meal Innovator) - AI Recipe Generator

Project ReportProject Poster

Skills: Project Management • Python • SQL • Data Modeling • AI Prompt Modeling • HTML • CSS • JavaScript

Tools: Django • XAMPP MySQL • GPT-4 • DALLE-3

Overview:

This senior capstone project was carried out across two semesters (roughly 9 months) with two of my buddies. We met twice a week physically to discuss our individual and collaborative progress with our senior project advisor. Other forms of communication and collaboration was done through Discord, Google Docs, and Google Slides.

The main focus of the project was to make an AI-based recipe generator that functions by feeding it with available ingredients and establishing limits or constraints to the dish. We integrated this into a website that ‘invents’ new recipes by considering the user’s dietary needs and choice of cuisine.

The recipe’s ingredients, instructions, and AI-generated image are displayed to the user. The project utilized OpenAI’s GPT-4 and DALLE-3 APIs into a Django Framework that is connected to a local database.

I fully designed the webpage using HTML, CSS, and JavaScript. I also engineered the entire database that handled the many-to-many relationships between users and their available ingredients, preferences, and generated recipes. The database was hosted on a local XAMPP MySQL database. Unfortunately, we took the demo website offline due to its costs.

Skills

Data Visualization

The following is a simple interactive data visualization of the wages of computer science jobs between 2020-2023. I created this while learning and experimenting with Tableau.

Tableau Dashboard

Here’s my simple take on the classic Tableau Sample Superstore dataset:

Tableau Dashboard

As a research assistant, I had to explore different ways of visualizing data. I utilized Python libraries and Jupyter Notebook during our machine learning projects.

Jupyter Notebook Visualization

 

Technical Writing

Sample ProposalSample MemorandumSample Recommendation Letter

I believe that technical writing skills are vital in the Computer Science field, to report documents professionally. I have written some technical documents outside of my CS projects to develop these skills.

During my Technical Writing class, I wrote a sample proposal where I sent out digital surveys on a potential product and fully analyzed and researched that topic with both primary and secondary sources. Then I designed the proposal in an aesthetic yet informative document.

Proposal Sample Page

I also had the opportunity to write a sample memorandum to my professor, analyzing the technical effectiveness of a particular product I was familar with.

Memo Sample Page

Certifications

Arkansas Summer Research Institute 2023

Certificate of Completion

A virtual two-week professional development event hosted by Arkansas NSF EPSCoR, where partipants learn technical and professional skills in the data science field.

Other things about me

In my free time, I like do digital art and animations. Creative projects have always been my interest.

I had the opportunity to design the logo/mascot of my Senior capstone project, where I even created a .gif animation consisting of 16 hand-drawn frames.

Animation frames: