Experiences

Working Student - Data Science & Engineering

Bremer Institut für Produktion und Logistik GmbH • Feb 2023 – Ongoing • Bremen, Germany

Job Description:

  • Developed automated web-scraping solutions using Python and BeautifulSoup to streamline data collection and download processes, significantly reducing manual effort and improving efficiency.
  • Designed and implemented end-to-end data pipelines and ETL workflows with Apache Airflow, enabling automated scheduling, transformation, and integration of large datasets from diverse sources.
  • Built interactive dashboards and reports in Power BI to deliver actionable insights, track key performance indicators, and support data-driven decision-making.
  • Performed advanced data preprocessing and cleaning, including handling missing values, detecting anomalies, and applying statistical transformations to ensure data quality and readiness for research and analytics.
  • Developed, trained, and optimized machine learning and deep learning models for predictive analytics and research applications, ensuring robust performance and interpretability.

Internship - Data Science & Data Engineering

Max Delbrück Center for Molecular Medicine • May 2022 – Aug 2022 • Berlin, Germany

Job Description:

  • Worked on European child allergy cohort research project, building data pipelines with Airflow and Kafka for batch and streaming data processing.
  • Developed Python automation scripts for data cleaning, anomaly detection, missing value handling,and transformation to ensure high-quality datasets for research.
  • Designed dashboards and visualizations using Matplotlib, Seaborn, and Tableau to explore pat-terns, anomalies, and research findings.
  • Applied statistical analysis and clustering algorithms to extract insights and uncover hidden struc-tures in complex biomedical datasets.

Working Student – Data Analyst

Durstexpress GmbH • Aug 2019 – Dec 2020 • Berlin, Germany

Job Description:

  • Applied advanced statistical tools and techniques to analyze large and complex datasets, identifying trends, patterns, and key performance metrics to address critical business and research questions.
  • Interpreted analytical findings and translated them into actionable insights, directly supporting business strategy, process optimization, and evidence-based decision-making.
  • Interpreted analytical results and extracted actionable insights to guide business strategy and drive operational improvements.

Junior Data Scientist

InseightIn Technology Bangladesh • Jul 2018 – Mar 2019 • Dhaka, Bangladesh

Job Description:

  • Collaborated with a data-driven strategy team and stakeholders to deliver automation and opti-mization solutions across multiple projects.
  • Contributed to machine learning, data visualization, and performance analysis projects, conducting statistical tests such as A/B testing to inform decision making.
  • Developed web scraping scripts to collect and integrate data from various sources for further anal-ysis and insights

Research Assistant – Statistics

Jahangirnagar University • Oct 2016 – Jun 2018 • Dhaka, Bangladesh

Job Description:

  • Assisted in four research projects: Tuberculosis in Rural Women, Biomass Fuel Smoke Exposure Effects, Risk Factor Analysis for Treatment Delay in Hospitals, and Risk Factors of Diabetes in Rural- Urban Residents.
  • Played a key role in developing questionnaires, participating as a data collection team member, performing comprehensive statistical analyses, and contributing to manuscript writing.
  • Deployed statistical and machine learning models for the analysis.

Publications

[1] Patwary, F.A., & Noman, A.A. (2025). Evaluating Subword Tokenization Techniques for Bengali: A Benchmark Study with BengaliBPE. arXiv preprint arXiv:2511.05324.

[2] Al Noman, A., Zitnikov, A., Patwary, F.A., Heuermann, A., & Thoben, K.-D. (2025). Explaining Manufacturing Anomalies: Transformer-Based Detection with xAI for Imbalanced Process Data. IFAC-PapersOnLine, 59(10), 1498–1503.

[3] Patwary, F.A. (2024, August). Data-driven modeling for ETA prediction of vessels in inland natural waterways. Master’s thesis. DOI: 10.13140/RG.2.2.13255.41122

Projects

AI-Powered Website Chatbot: Building with Cutting-Edge LLMs

Skills: Large Language Models (LLaMA2, GPT), Hugging Face Transformers, LangChain, OpenAI API, RAG, Python

BengaliBPE: An Open-Source Python Library for Bengali Language Processing with Byte Pair Embedding (Ongoing)

Skills: Byte-Pair encoding, Tokenization, BengaliBERT, BNLTK, Python, Word embedding

Explaining Manufacturing Anomalies: Transformer-Based Detection with xAI for Imbalanced Data

Skills: Anomaly detection, Transformer, GenAI, Explainable AI, SHAP, LIME, PyTorch

Data-Driven Modeling for ETA Prediction of Vessels in Inland Natural Waterways

Skills: MLOps, LSTM, TensorFlow, BeautifulSoup, xAI, Model monitoring, Deep Learning, Regression

No-Stress Manufacturing: Neuro-Physiological Biometric Data Analysis Using DL

Skills: Data ingestion, Data warehouse, Analytics engineering, Batch processing, Streaming, ML, DL

European Children Medical Allergy Cohort Analysis Using ML Models

Skills: Classification, OCR Development, Statistical Analysis, Hypothesis testing, Scikit-learn, Data Science

Metaphor Detection of Bengali Language Using Large Language Model (Ongoing)

Skills: LLM classification & vector spaces, Byte-Pair Encoding, Tokenization, Fine Tuning, Transformer, BengaliBERT

Mental Health Modeling of Bangladeshi University Students

Skills: Classification, ML, Data Collection, Data Preprocessing, xAI, SHAP, LIME

Predictive Modeling of COVID-19 Cases in Germany Using Time Series Techniques

Skills: Regression, ML, COVID-19 trend analysis, Time-series forecasting, Facebook Prophet, ARIMA, LSTM