Stephen is passionate about building AI-driven solutions that make data more intelligent and accessible. He has designed and deployed AI/ML applications for semantic classification, data harmonization, and retrieval-augmented generation (RAG). His work includes optimizing LLM-powered chatbots, developing hybrid search solutions that combine full-text and vector search, and building scalable FastAPI interfaces for AI-driven data platforms. He focuses on creating reliable, high-performing AI systems that are efficient, scalable, and built with privacy and security in mind.
MSc Bioinformatics
Johns Hopkins University, Baltimore, MD
BSc Bioinformatics
Loyola University, Chicago, IL
AWS Certified Solutions Architect (Associate) | MongoDB Database Administrator | MongoDB Developer (Associate) |
Programming languages | Python, R/Shiny, JavaScript, SQL, Perl |
Databases | MongoDB, MySQL, Postgres, BigQuery, BigTable, Snowflake, Redshift, EMR |
Data Engineering | Kafka, Cloud Composer (Airflow), Google Dataproc (Spark), Pub/Sub, SNS/SQS |
Data sharing | FastAPI, Flask, Google Looker (Data Studio), Tableau, ggplot, matplotlib, Prometheus |
DevOps & MLOps | Git, Docker, Kubernetes, MLflow, CI/CD, Infrastructure-as-Code (Terraform, CloudFormation) |
Vaz, M., Hwang, S. Y., ... Baylin, S. B. (2017). Chronic cigarette smoke-induced epigenomic changes precede sensitization of bronchial epithelial cells to single-step transformation by KRAS mutations. Cancer Cell, 32, 360–376. doi:10.1016/j.ccell.2017.08.006 [PubMed]
Gern, J. E., Jackson, D. J., Lemanske, R. F., Seroogy, C. M., Tachinardi, U., Craven, M., Hwang, S. Y., ... Bacharier, L. B. (2019). The Children's Respiratory and Environmental Workgroup (CREW) Birth Cohort Consortium: Design, methods, and study population. Respiratory Research, 20(1), 115. doi:10.1186/s12931-019-1088-9 [PubMed][Full PDF]