Research Areas
relu(activations)
Selected Works
hidden_dim = 10
Text vs. Phoneme Intermediates for Swiss German TTS
SwissText 2026 paper comparing text and phoneme intermediates for low-resource Swiss German text-to-speech. DE-TTS, CH-TTS, and PH-TTS pipelines.
Project page →
Vision-Language-Action Models for Robot Control
Fine-tuned VLA architectures for real-time robotic manipulation in low-data regimes. Data curation, augmentation, and evaluation pipelines.
Self-Refining Agentic Supervisor
Robot deployment copilot coordinating simulation, validation, failure discovery, and retraining. RL-based decision framework for deployment readiness.
Combining Behavior Cloning and RL for Space Layout Design
JCDE 2025 paper. Evaluates BC+PPO agents in SpaceLayoutGym for architectural spatial planning.
Project page →
Swiss German TTS & STT
State-of-the-art Text-to-Speech & Speech-to-Text for Swiss German dialects. WER/CER & MOS evaluation pipelines, phoneme-based TTS, open-sourced parliamentary speech corpus.
stt4sg.fhnw.ch →
High German → Swiss German Translation
Translation models across multiple Swiss German dialects. Parallel data collection, tokenization, and benchmarking with automatic metrics.
3D Reconstruction from 2D
Fine-tuned 3D reconstruction models on proprietary data. Co-authored a 1M CHF funding proposal. Deployed at identic.ai.
identic.ai →
DreamBooth · SDXL
Fine-tuned Stable Diffusion XL with LoRA. Live photo booth at Enter.ch Museum.
SpaceLayoutGym
Open-source space-layout simulator for RL training. Core of PhD research on deep RL for spatial planning.
Reimagining Space Layout Design through Deep RL
JCDE 2024 paper introducing SpaceLayoutGym, laser-wall partitioning, and PPO agents for generative space layout design.
Project page →
Experience & Education
backprop(history)
Applied Machine Learning Researcher
FHNW, Switzerland · 2024 → present
RL, LLMs, VLA, CV, speech. Leading Swiss German TTS/STT and 3D reconstruction.
PhD in AI-Aided Design
ETH Zurich · 2019 → 2024
Deep RL for Architectural Space Layout Design. Built SpaceLayoutGym.
Invited Lecturer
Constructor Learning Institute, Zurich · 2022 → present
ML, Deep Learning, NLP, Computer Vision.
Applied ML Researcher
Hochschule Luzern (HSLU) · 2019 → 2022
ML for real-world NLP, CV, time series.
Research Assistant — Decision Neuroscience
ETH Zurich · 2018 → 2019
Human decision-making via drift-diffusion models.
MSc, Electrical & Control Engineering
Ferdowsi University of Mashhad · 2009 → 2012
Adaptive RL for offering prices in electricity markets.
Selected Projects
project_list.length
- Swiss German TTS & STT FHNW · 2024 → present · Speech synthesis & recognition
- Vision-Language-Action Models for Robot Control FHNW · 2024 → present · Robotics
- Self-Refining Agentic Supervisor for Robot Deployment FHNW · 2024 → present · Robotics
- High German → Swiss German Translation FHNW · 2024 → present · NLP
- 3D Reconstruction from 2D Images FHNW · 2024 → present · Computer Vision
- NLP for Risk Management & Safety Analysis FHNW · 2024 → present · NLP
- Patent Similarity Detection for Freedom-to-Operate FHNW · 2024 → present · LLM · RAG
- DreamBooth Personalization for Stable Diffusion FHNW · Diffusion · LoRA
- SpaceLayoutGym ETH Zurich · 2019–24 · RL · Open Source
- Multi-Task Learning for Segmentation, Depth & LiDAR ETH Zurich · Course Project
- LSTM-Based Forecasting & Clustering for Seismic Data ETH Zurich · Voluntary Collaboration
- COVID-19 Case Rates Forecasting & Chest X-Ray Clustering ETH Zurich · Voluntary Collaboration
- Drift-Diffusion Model Parameter Estimation ETH Zurich · 2018–19 · Decision Neuroscience
- Medical Language Models for Semantic Similarity HSLU · 2019–22 · NLP
- Sentiment Analysis on German Product Reviews HSLU · 2019–22 · NLP
- NSFW Image Detection & Zero-Shot Classification HSLU · 2019–22 · Computer Vision
- Hidden Markov Model for Multi-Crypto Price Forecasting HSLU · 2019–22 · Time Series
- Dynamic Pricing for Hotels HSLU · 2019–22 · ML
- Anomaly Detection for Sensor Measurement Errors HSLU · 2019–22 · Time Series
- CNN Classifier for Selfie Detection HSLU · 2019–22 · Computer Vision
- Logo Similarity with VGG16 & Inception HSLU · 2019–22 · Computer Vision
- Customer Loyalty & Churn Prediction S-Bank · 2012–17 · ML
- Fuzzy Controller for 3-Degree-of-Freedom Parallel Robot FUM · 2012–17 · Control
- Adaptive MPC for Pick-and-Place Robot Arms FUM · 2012–17 · Control
Publications
∑ citations
- Kakooee, R., et al. — Text vs. Phoneme Intermediates for Low-Resource Swiss German Text-to-Speech. SwissText, 2026 Project page →
- Kakooee, R., Dillenburger, B. — Enhancing Architectural Space Layout Design By Pretraining Deep RL Agents. J. Computational Design & Engineering, 2025 Project page →
- Kakooee, R., Dillenburger, B. — Illuminating Spaces: Deep RL and Laser-Wall Partitioning. 27th Generative Art Conf., 2024
- Kakooee, R., Dillenburger, B. — Reimagining space layout design through deep RL. J. Computational Design & Engineering 11.3, 2024 Project page →
- Timmel, V., Paonessa, C., Kakooee, R., et al. — Fine-tuning Whisper on Low-Resource Languages for Real-World Applications. arXiv:2412.15726, 2024
- Kakooee, R., Dillenburger, B. — Design Process is a Reinforcement Learning Problem. Deep RL Workshop, NeurIPS 2022
- Kakooee, R., Dillenburger, B. — RLDesigner: Spatial Layout Planning as an MDP. 15th European Workshop on RL, 2022
- Bernhard, M., Kakooee, R., et al. — TopoGAN: Topology Optimization with GANs. Advances in Architectural Geometry, 2021
- Akizuki, Y., Bernhard, M., Kakooee, R., et al. — Generative Modelling with Design Constraints. CAADRIA, 2020
- Yousefi, A., Kakooee, R., et al. — Predicting Learning Dynamics via Variational Bayes. IEEE EMBC, 2017
Blogs
posts.dropna()
- What the AI Community Currently Lacks LinkedIn · Sep 2024
- AI is My Last Hope LinkedIn · Sep 2024
- Teaching is All About Knowledge Compression LinkedIn · Sep 2024
- My Journey with AI LinkedIn · Sep 2024
- AI & LLMs LinkedIn · Jul 2024
- Reinforcement Learning & Deep Learning LinkedIn · Jul 2022
- Design Process is a Reinforcement Learning Problem LinkedIn · 2022
- Deep RL for Architectural Space Layout Design — Part III Medium · 2022
- Deep RL for Architectural Space Layout Design — Part II Medium · 2022
- Deep RL for Architectural Space Layout Design — Part I Medium · 2022
- Deep Double Descent LinkedIn · Dec 2019
Stack
model.parameters()
Languages
Python · MATLAB · C++ · JavaScript · SQL · R
ML / Deep Learning
PyTorch · TensorFlow · Keras · scikit-learn · Avalanche
NLP & LLMs
HuggingFace · LangChain · TRL · Accelerate · Diffusers
Reinforcement Learning
RLlib · TorchRL · Stable-Baselines · Isaac Sim · Isaac Lab · MuJoCo · Gym
Foundations
ctrl + math
Control Engineering
- Fuzzy Control — Controller design & tuning for a 3-degree-of-freedom parallel robot
- Model Predictive Control — Adaptive MPC for pick-and-place robot arms
- Robust & Multivariable Control — H∞, LQG, and MIMO controller design for uncertain systems
- Signal Processing — Sensor fusion, filtering, and anomaly detection pipelines
Mathematics
- Optimization — Convex & non-convex, gradient methods, constrained optimization
- Dynamical Systems — Phase portraits, stability analysis, Lyapunov methods
- Probability & Statistics — Bayesian inference, stochastic processes, MCMC
- Decision Theory — Drift-diffusion models, evidence accumulation, bounded rationality
Output
softmax → contact
Open to research collaborations, speaking invitations, and industry projects.
© 2025 Reza Kakooee · loss converged