Khuyen Tran

Edwardsville, Illinois · +1 (618) ASK-4-IT!So · khuyentran1476@gmail.com

I am an applied mathematics student with the love for data science, machine learning, and the passion for solving challenging problems with AI


Skills

Programming Languages & Tools
Extra skills / knowledge
  • Web scraping
  • Natural Language Processing
  • Machine learning and deep learning
  • Many Python packages – NumPy, Matplotlib, Tensorflow, Scikit-learn, etc
  • SQL
  • AWS RDS and EC2
  • Tableau
  • Talend ETL

Experience

Natural Language Processing Research Assistant

Research Center in Mathematics (CIMAT A.C.)

Collaborative research with Dr. Adrián Pastor López in predicting the author’s gender and language variety by processing raw Twitter data. Achieve 80% accuracy for gender predict and 84% accuracy for language variety. Experimenting with different NLP models such as Linear Kernel SVM, Pytorch, BERT, Neural Network

Reference link.

Januanary 2019 - Present

Math Research Assistant

SIUE Department of Mathematics and Statistics

Formulated metaheuristic with Dr.Chew, a method to provide optimal solution with incomplete information within acceptable time. Especially useful for neural network training. Achieved 10-6 accuracy when tested on multidimensional functions. Skills: MATLAB, metaheuristic methods

January 2019 - December 2019

Data Science Technical Writer

Medium

Authorized more than 30 articles on topics of Natural Language Processing, web scraping, data science tools, and mathematical programming with more than 110k views a month. Simplified complex mathematical and programming concepts with insights and interactive visualizations. Skills: Communication, creativity, analysis, in-depth knowledge in programming, statistics, and data science tools

Reference link.

January 2020 - Present

Math and Business Tutor

SIUE Tutoring Resource Center

Guided more than 10 students to get from C to A grade in statistics and mathematics courses within 2 months. Effectively communicate to those who have little understanding of the subjects with confidence and clarity. Skills: Communication, listening, good understanding of mathematics and statistics.

Aug 2018 - Dec 2019

Calculus Enrichment Session Leader (ES Leader)

SIUE Department of Mathematics and Statistics

Led a class of 60+ students from different learning backgrounds to master calculus knowledge. Devised strategic plans with other leaders and professors to deliver complex concepts in comprehensible ways. Skills: Leadership, verbal communication, collaboration, knowledge in advanced mathematics

Aug 2018 - Dec 2019

Education

Research Center in Computation and Mathematics (CIMAT A.C.)

Mathematical Tools for Data Science
Program focuses on three areas of data science: statistics, mathematics, and computer science. Provides a solid understanding of data science, data structure and algorithms, data visualization, statistics, numerical optimizations, and machine learning algorithms

January 2020 - May 2020

Southern Illinois University Edwardsville

Bachelor of Mathematics
Major in Mathematics with strong focus in application

August 2016 - May 2021

Community and social activity

Advancement of Chicanos/Hispanics and Native Americans in Science (SACNAS)

Modern Math Workshop and STEM conference
Selected as a travel scholar.
Know more
10th October 2018 - 13th October 2018

datostada

Where Science and Industry Meet Data Sience
An international meeting for society, academia, government and industry, about the role of data in several high-impact topics
Know more
12th March 2020 - 14th March 2020

Projects

This section contains awesome projects that I've developed:

Scheduling App with Machine Learning and Integer Programming

Scheduling App with Machine Learning and Integer Programming

Boost Productivity by Scheduling Tasks Intelligently

Utilize integer optimization to find the most effective schedule and maximize productivity.

Integer Programming PuLP Python NLP

Predict Gender and Language Variety in Twitter

Predict Gender and Language Variety in Twitter

Identify author’s gender and language variety in Twitter.

Experiment with 3 different ML models and achieve f1-score of up to .81 for gender prediction and .84 for language variety

Python NLTK Word2Vec Tf-Idf Bag of Words GridsearchCV LinearSVC

Scraping, Text Processing, and Analysis Ghibli Movie Database

Scraping, Text Processing, and Analysis Ghibli Movie Database

An end-to-end exploratory data analysis: extracting raw data on Ghibli Movie Database and analyzing the data.

Use Beautiful Soup to extract the data, preprocessing with scikit-learn and NLTK, visualizing with Matplotlib

Python text precessing beautiful soup data mining nltk matplotlib


Awards & Certifications

  • Andrew O. Lindstrum, Jr. Memorial Scholarship for an outstanding mathematical student
  • Dean's List
  • International Undergraduate Scholarship

This Flask template was built with by Rodolfo Ferro, under a MIT License.