Data Scientist · Open to Full-Time Roles / Projects

3+ Years Turning
Data Into Decisions
That Move Business

I'm a data scientist with hands-on experience in machine learning, generative AI, and analytics in the financial industry. I build LLM-powered systems and ML models — and deliver results that show up on KPIs and the bottom line.

Download CV View My Projects
What I Bring

Focused Expertise Across
the Core Data Science Stack

From exploratory analysis to GenAI-powered production systems — I build with rigour, communicate with clarity, and always tie results back to business outcomes.

🧠
Machine Learning
End-to-end model development from feature engineering to deployment. I've shipped classification, regression, and segmentation models that generate measurable business impact in production financial environments.
Scikit-LearnXGBoost SHAPLifelines
🤖
Generative AI & RAG
I architect LLM-powered systems end-to-end — from prompt engineering and RAG pipelines to production-ready document intelligence solutions. I've delivered Text-to-SQL, OCR automation, and legal Q&A systems on AWS with Claude.
Prompt EngineeringRAG LangChainClaude / LLMs
☁️
Cloud & MLOps (AWS)
I build and deploy ML and AI pipelines in AWS — leveraging Amazon Textract, S3, and other managed services to automate document workflows and power scalable data science infrastructure in regulated financial environments.
AWSAmazon Textract PythonMicroStrategy
📊
Analytics & Customer Intelligence
I translate raw cross-entity data into clear business narratives. Customer 360 analysis, segmentation studies, and cross-sell targeting — designed so stakeholders actually understand and act on what they're seeing.
SQLPandas MicroStrategyOracle
Big Data & Data Engineering
I can build and maintain the pipelines my own models depend on — large-scale data processing with PySpark, ETL design, and data quality checks that keep data flowing cleanly at enterprise scale.
PySparkNumPy SQLPython
🔍
Explainable AI & Insight
I surface the "why" behind model predictions using SHAP analysis, making ML systems interpretable and actionable for non-technical stakeholders. I also design and deliver data literacy training to bridge business and technical teams.
SHAPMatplotlib SeabornData Storytelling

3+ Years of Focused
Financial Industry Experience

A deliberate progression from analytics and ML to GenAI system architecture — with growing scope and measurable impact at every step.

2023 – Present
PT. Asuransi Allianz Indonesia · Jakarta, ID
Data Scientist
  • Architected and delivered a Proof of Concept (POC) Text-to-SQL system leveraging Retrieval-Augmented Generation (RAG) techniques with Claude Sonnet on AWS, enabling non-technical stakeholders to query databases using natural language and reducing dependency on data teams for ad-hoc data retrieval.
  • Engineered an intelligent document processing pipeline on AWS, leveraging Claude Sonnet and Amazon Textract to automate the capture and conversion of handwritten and printed submission documents into structured data, applying prompt engineering techniques to optimize extraction accuracy and improve operational efficiency at scale.
  • Built a POC OCR pipeline using Claude Sonnet, designing structured prompts to automatically extract, structure, and reconcile invoice data from raw claim documents in an AWS ecosystem, identifying overcharged invoices and surfacing actionable cost-recovery insights for operations teams.
  • Conducted Customer 360 analysis by integrating cross-entity data to identify untapped customer segments with no existing product holdings, enabling targeted cross-selling campaigns that achieved 164% of KPI target through close cross-functional collaboration.
  • Designed and delivered a data literacy training program for cross-functional participants, bridging business and technical requirements by translating AI concepts into actionable insights — strengthening organizational analytics capabilities and driving data-informed decision-making across teams.
  • Developed a K-Nearest Neighbors (KNN) model for customer segmentation using Python, collaborating with cross-functional teams to transform customer segment data into strategic insights that contributed to ~150% of Annual Revenue Target achievement.
  • Featured Projects

    Real Problems.
    Measurable Outcomes.

    My personal projects — each with concrete, verifiable results.

    LegalTech · RAG · NLP · 2025
    RAG-Based AI Legal Document Chatbot
    Built an AI-powered legal Q&A chatbot leveraging RAG architecture — LangChain, ChromaDB, Llama 3.2, and MiniLM-L6 Embeddings — with vector similarity search to enable semantic retrieval over Indonesia's 1945 Constitution.
    RAG
    Full vector-search pipeline with semantic retrieval over legal corpus
    RAG-Based AI Legal Document Chatbot
    LegalTech · RAG · NLP · 2025
    Telecom · Classification · 2022
    Customer Telco Churn Prediction
    Developed an end-to-end supervised ML classification system using Random Forest for telecom customer churn prediction. Identifies at-risk customers early, enabling proactive retention campaigns and reducing involuntary lapse rates.
    Random Forest
    Full pipeline from EDA and feature engineering to model evaluation and insight
    Customer Telco Churn Prediction
    Telecom · Classification · 2022
    Aquaculture · Regression · ML App
    Abalone Age Prediction
    Built a machine learning-powered Streamlit web application to estimate abalone age using physical measurements, eliminating the need for complex microscopic ring counting. The system integrates feature engineering, encoding, scaling, and a pre-trained regression model to support faster and more efficient decision-making for abalone farmers.
    Random Forest
    End-to-end ML pipeline with feature engineering, scaling, and deployment via Streamlit web app
    Abalone Age Prediction
    Aquaculture · Regression · ML App
    Public Health · Dashboard · 2022
    COVID-19 Indonesia Monitoring Dashboard
    Developed an interactive COVID-19 analytics dashboard to provide public users with insights into case trends, recoveries, and mortality across Indonesian provinces from 2020 to 2022. The dashboard includes cleaned and parsed time-series data, province-level filtering, and multiple visualization components to support intuitive public health monitoring and data exploration.
    Interactive BI Dashboard
    Scorecards, geospatial bubble map, trend analysis, drill-down time series, and province-level filtering for public health insights
    COVID-19 Indonesia Monitoring Dashboard
    Public Health · Dashboard · 2022
    3+ Years of
    Experience

    A Data Scientist Who
    Delivers in Production

    I'm a data scientist with 3+ years of hands-on experience in the financial industry, specializing in ML modelling, generative AI systems, and customer analytics. I work across the full lifecycle — from exploratory analysis and feature engineering to LLM-powered pipeline architecture and business storytelling.

    My edge is the ability to connect technical rigour with business outcomes. I ask what a model actually needs to do for the business before I start building it, communicate AI concepts in plain language to non-technical stakeholders, and always close the loop between insight and action.

    Data Scientist · PT. Asuransi Allianz Indonesia · Jakarta, ID (2023–Present)
    Associate Data Scientist · DataCamp
    Advanced SQL · HackerRank
    B.Eng Mechanical Engineering · University of Brawijaya
    Python SQL PySpark XGBoost Scikit-Learn SHAP LangChain RAG Prompt Engineering AWS Amazon Textract MicroStrategy
    Certificates

    Professional Certifications

    Fresh Graduate Academy Data Science program by the Ministry of Communication and Information Technology. Focused on data analytics, machine learning, and real-world business case implementation.
    Recognized as Data Scientist Associate by DataCamp, demonstrating proficiency in data analysis, statistical modeling, and machine learning workflows.
    Advanced SQL certification from HackerRank validating expertise in complex queries, data manipulation, and database performance optimization.

    Let's Build Something
    Meaningful Together

    If you're hiring for a data science role — or want to explore whether I'm the right fit — reach out. I respond within 24 hours with a direct, no-fluff conversation about what you actually need.

    Prefer email? adamhubert.wrk@gmail.com · LinkedIn available on request · Response within 24h guaranteed.