Experiences
Working Student - Data Science & Engineering
Bremer Institut für Produktion und Logistik GmbH • Feb 2023 – Ongoing • Bremen, Germany
Job Description:
- Developed automated web-scraping solutions using Python and BeautifulSoup to streamline data collection and download processes, significantly reducing manual effort and improving efficiency.
- Designed and implemented end-to-end data pipelines and ETL workflows with Apache Airflow, enabling automated scheduling, transformation, and integration of large datasets from diverse sources.
- Built interactive dashboards and reports in Power BI to deliver actionable insights, track key performance indicators, and support data-driven decision-making.
- Performed advanced data preprocessing and cleaning, including handling missing values, detecting anomalies, and applying statistical transformations to ensure data quality and readiness for research and analytics.
- Developed, trained, and optimized machine learning and deep learning models for predictive analytics and research applications, ensuring robust performance and interpretability.
Internship - Data Science & Data Engineering
Max Delbrück Center for Molecular Medicine • May 2022 – Aug 2022 • Berlin, Germany
Job Description:
- Worked on a European child allergy cohort research project, building data pipelines with Airflow and Kafka for batch and streaming data processing.
- Developed Python automation scripts for data cleaning, anomaly detection, missing value handling, and transformation to ensure high-quality datasets for research.
- Designed dashboards and visualizations using Matplotlib, Seaborn, and Tableau to explore patterns, anomalies, and research findings.
- Applied statistical analysis and clustering algorithms to extract insights and uncover hidden structures in complex biomedical datasets.
Working Student – Data Analyst
Durstexpress GmbH • Aug 2019 – Dec 2020 • Berlin, Germany
Job Description:
- Applied advanced statistical tools and techniques to analyze large and complex datasets, identifying trends, patterns, and key performance metrics to address critical business and research questions.
- Interpreted analytical findings and translated them into actionable insights, directly supporting business strategy, process optimization, operational improvements, and evidence-based decision-making.
Junior Data Scientist
InseightIn Technology Bangladesh • Jul 2018 – Mar 2019 • Dhaka, Bangladesh
Job Description:
- Collaborated with a data-driven strategy team and stakeholders to deliver automation and optimization solutions across multiple projects.
- Contributed to machine learning, data visualization, and performance analysis projects, conducting statistical tests such as A/B testing to inform decision-making.
- Developed web scraping scripts to collect and integrate data from various sources for further analysis and insights.
Research Assistant – Statistics
Jahangirnagar University • Oct 2016 – Jun 2018 • Dhaka, Bangladesh
Job Description:
- Assisted in four research projects: Tuberculosis in Rural Women, Biomass Fuel Smoke Exposure Effects, Risk Factor Analysis for Treatment Delay in Hospitals, and Risk Factors of Diabetes in Rural-Urban Residents.
- Played a key role in developing questionnaires, participating as a data collection team member, performing comprehensive statistical analyses, and contributing to manuscript writing.
- Deployed statistical and machine learning models for the analysis.
Publications
[1] Patwary, F.A., & Noman, A.A. (2025). Evaluating Subword Tokenization Techniques for Bengali: A Benchmark Study with BengaliBPE. arXiv preprint arXiv:2511.05324.
[2] Al Noman, A., Zitnikov, A., Patwary, F.A., Heuermann, A., & Thoben, K.-D. (2025). Explaining Manufacturing Anomalies: Transformer-Based Detection with xAI for Imbalanced Process Data. IFAC-PapersOnLine, 59(10), 1498–1503.
[3] Patwary, F.A. (2024, August). Data-driven modeling for ETA prediction of vessels in inland natural waterways. Master’s thesis. DOI: 10.13140/RG.2.2.13255.41122
Projects
AI-Powered Website Chatbot: Building with Cutting-Edge LLMs
Skills: Large Language Models (LLaMA2, GPT), Hugging Face Transformers, LangChain, OpenAI API, RAG, Python
BengaliBPE: An Open-Source Python Library for Bengali Language Processing with Byte Pair Encoding (Ongoing)
Skills: Byte-Pair Encoding, Tokenization, BengaliBERT, BNLTK, Python, Word embedding
Explaining Manufacturing Anomalies: Transformer-Based Detection with xAI for Imbalanced Data
Skills: Anomaly detection, Transformer, GenAI, Explainable AI, SHAP, LIME, PyTorch
Data-Driven Modeling for ETA Prediction of Vessels in Inland Natural Waterways
Skills: MLOps, LSTM, TensorFlow, BeautifulSoup, xAI, Model monitoring, Deep Learning, Regression
No-Stress Manufacturing: Neuro-Physiological Biometric Data Analysis Using DL
Skills: Data ingestion, Data warehouse, Analytics engineering, Batch processing, Streaming, ML, DL
European Children Medical Allergy Cohort Analysis Using ML Models
Skills: Classification, OCR Development, Statistical Analysis, Hypothesis testing, Scikit-learn, Data Science
Metaphor Detection of Bengali Language Using Large Language Model (Ongoing)
Skills: LLM classification & vector spaces, Byte-Pair Encoding, Tokenization, Fine Tuning, Transformer, BengaliBERT
Mental Health Modeling of Bangladeshi University Students
Skills: Classification, ML, Data Collection, Data Preprocessing, xAI, SHAP, LIME
Predictive Modeling of COVID-19 Cases in Germany Using Time Series Techniques
Skills: Regression, ML, COVID-19 trend analysis, Time-series forecasting, Facebook Prophet, ARIMA, LSTM