Shachar Heyman

Shachar Heyman

Data Scientist & ML Engineer

Berlin, Germany

About

Data Scientist & Machine Learning Engineer with 8+ years of experience building production AI systems across computer vision and LLM-based platforms. Experienced in designing end-to-end ML pipelines, evaluation frameworks and adaptive learning systems that translate research into scalable products. Proven track record of leading ML initiatives from early-stage development to production deployment in both startups and enterprise environments.

Experience

Convergent-AI

Data Science & AI Consultant (Contract)

2025 - Present

Designed the ML architecture for an enterprise training platform based on LLM-driven human-interaction simulations. Built evaluation pipelines to assess conversational performance and developed benchmarking methodologies for measuring human-likeness of AI-driven conversations.

Siemens

Machine Learning Engineer / Data Scientist

2023 - 2025

Designed and deployed deep learning systems for industrial visual inspection, improving defect detection accuracy in production environments. Built end-to-end ML pipelines from data ingestion to model deployment and led generative data augmentation initiatives.

Inspekto (acquired by Siemens)

Machine Learning Engineer / Data Scientist

2017 - 2023

Built end-to-end ML pipelines from data acquisition to deployment and monitoring. Developed computer vision models for automated defect detection using CNNs. Implemented production-grade CI/CD pipelines for reliable model delivery.

Tel Aviv University

Teaching Assistant

2018 - 2021

Supported undergraduate mathematics courses including Linear Algebra, Set Theory, Combinatorics & Topology.

Education

Tel Aviv University

B.Sc. Mathematics & Philosophy

2017 - 2020

Stanford University

CS231n: Deep Learning for Computer Vision

2021

Goethe Institut & Sprachsalon Berlin

German Language (A1-B2)

2024 - 2026

Projects

Auto Document Writer

Aug 2025 - Present

SQL-based product for Time Prints (film production company), auto-filling filming-day-reports from MySQL database, saving hundreds of work hours annually.

SQL, NLP

Shift Scheduler

GitHub

Oct 2025

Shift assignment tool for a documentary film festival. Normalized free text using offline LLM, processed with Pulp for optimal scheduling across multiple locations.

Python, Pulp, LLM

Nena Song Generator

GitHub

Jul 2025 - Sep 2025

Auto-regressive generative model that creates Nena-style song lyrics from a given title. Trained on scraped lyrics.

NLP, LLM, Python

Scrape, Archive & Classify

Jul 2025

Automated tool for downloading images, extracting numbers via OCR, renaming files, and generating annotated spreadsheets. One-command solution for repetitive tasks.

Data Scraping, OCR, Python

Skills

PythonPyTorchTensorFlowScikit-learnLLM EvaluationConversational AIGenerative ModelsComputer VisionAnomaly DetectionDockerCI/CDPandasNumPyPostgreSQLLinear AlgebraProbability & Optimization