Stephen is passionate about building AI-driven solutions that make data more intelligent and accessible. He has designed and deployed AI/ML applications for semantic classification, data harmonization, and retrieval-augmented generation (RAG). His work includes optimizing LLM-powered chatbots, developing hybrid search solutions that combine full-text and vector search, and building scalable FastAPI interfaces for AI-driven data platforms. He focuses on creating reliable, high-performing AI systems that are efficient, scalable, and built with privacy and security in mind.
MSc Bioinformatics
Johns Hopkins University, Baltimore, MD
BSc Bioinformatics
Loyola University, Chicago, IL
| AWS Certified Solutions Architect (Associate) | MongoDB Database Administrator | MongoDB Developer (Associate) |
| Programming languages | Python, R/Shiny, JavaScript, SQL, Perl |
| Databases | MongoDB, MySQL, Postgres, BigQuery, BigTable, Snowflake, Redshift, EMR |
| Data Engineering | Kafka, Cloud Composer (Airflow), Google Dataproc (Spark), Pub/Sub, SNS/SQS |
| Data sharing | FastAPI, Flask, Google Looker (Data Studio), Tableau, ggplot, matplotlib, Prometheus |
| DevOps & MLOps | Git, Docker, Kubernetes, MLflow, CI/CD, Infrastructure-as-Code (Terraform, CloudFormation) |
Vaz, M., Hwang, S. Y., ... Baylin, S. B. (2017). Chronic cigarette smoke-induced epigenomic changes precede sensitization of bronchial epithelial cells to single-step transformation by KRAS mutations. Cancer Cell, 32, 360–376. doi:10.1016/j.ccell.2017.08.006 [PubMed]
Gern, J. E., Jackson, D. J., Lemanske, R. F., Seroogy, C. M., Tachinardi, U., Craven, M., Hwang, S. Y., ... Bacharier, L. B. (2019). The Children's Respiratory and Environmental Workgroup (CREW) Birth Cohort Consortium: Design, methods, and study population. Respiratory Research, 20(1), 115. doi:10.1186/s12931-019-1088-9 [PubMed][Full PDF]