Reza Kakooee

L1

Research Areas

relu(activations)

robotics reinforcement-learning large-language-models vision-language-action computer-vision speech-synthesis architectural-ai

L2

Selected Works

hidden_dim = 10

Speech · FHNW2026

Text vs. Phoneme Intermediates for Swiss German TTS

SwissText 2026 paper comparing text and phoneme intermediates for low-resource Swiss German text-to-speech. DE-TTS, CH-TTS, and PH-TTS pipelines.

Project page →

Robotics2026

Vision-Language-Action Models for Robot Control

Fine-tuned VLA architectures for real-time robotic manipulation on low-data regimes. Data curation, augmentation, and evaluation pipelines.

Robotics2026

Self-Refining Agentic Supervisor

Robot deployment copilot coordinating simulation, validation, failure discovery, and retraining. RL-based decision framework for deployment readiness.

RL · ETH Zurich2025

Combining Behavior Cloning and RL for Space Layout Design

JCDE 2025 paper. Evaluates BC+PPO agents in SpaceLayoutGym for architectural spatial planning.

Project page →

Speech · FHNW2024

Swiss German TTS & STT

State-of-the-art Text-to-Speech & Speech-to-Text for Swiss German dialects. WER/CER & MOS evaluation pipelines, phoneme-based TTS, open-sourced parliamentary speech corpus.

stt4sg.fhnw.ch →

NLP · FHNW2024

High German → Swiss German Translation

Translation models across multiple Swiss German dialects. Parallel data collection, tokenization, and benchmarking with automatic metrics.

Computer Vision · FHNW2024

3D Reconstruction from 2D

Fine-tuned 3D reconstruction on proprietary data. Co-authored a 1M CHF funding proposal. Deployed at identic.ai.

identic.ai →

Diffusion2024

L3

Experience & Education

backprop(history)

Applied Machine Learning Researcher

FHNW, Switzerland · 2024 → present

RL, LLMs, VLA, CV, speech. Leading Swiss German TTS/STT and 3D reconstruction.

PhD in AI-Aided Design

ETH Zurich · 2019 → 2024

Deep RL for Architectural Space Layout Design. Built SpaceLayoutGym.

Invited Lecturer

Constructor Learning Institute, Zurich · 2022 → present

ML, Deep Learning, NLP, Computer Vision.

Applied ML Researcher

Hochschule Luzern (HSLU) · 2019 → 2022

ML for real-world NLP, CV, time series.

Research Assistant — Decision Neuroscience

ETH Zurich · 2018 → 2019

Human decision-making via drift-diffusion models.

MSc, Electrical & Control Engineering

Ferdowsi University of Mashhad · 2009 → 2012

Adaptive RL for offering prices in electricity markets.

L2+

Selected Projects

project_list.length

Swiss German TTS & STT FHNW · 2024 → present · Speech synthesis & recognition
Vision-Language-Action Models for Robot Control FHNW · 2024 → present · Robotics
Self-Refining Agentic Supervisor for Robot Deployment FHNW · 2024 → present · Robotics
High German → Swiss German Translation FHNW · 2024 → present · NLP
3D Reconstruction from 2D Images FHNW · 2024 → present · Computer Vision
NLP for Risk Management & Safety Analysis FHNW · 2024 → present · NLP
Patent Similarity Detection for Freedom-to-Operate FHNW · 2024 → present · LLM · RAG
DreamBooth Personalization for Stable Diffusion FHNW · Diffusion · LoRA
SpaceLayoutGym ETH Zurich · 2019–24 · RL · Open Source
Multi-Task Learning for Segmentation, Depth & LiDAR ETH Zurich · Course Project
LSTM-Based Forecasting & Clustering for Seismic Data ETH Zurich · Voluntary Collaboration
COVID-19 Case Rates Forecasting & Chest X-Ray Clustering ETH Zurich · Voluntary Collaboration
Drift-Diffusion Model Parameter Estimation ETH Zurich · 2018–19 · Decision Neuroscience
Medical Language Models for Semantic Similarity HSLU · 2019–22 · NLP
Sentiment Analysis on German Product Reviews HSLU · 2019–22 · NLP
NSFW Image Detection & Zero-Shot Classification HSLU · 2019–22 · Computer Vision
Hidden Markov Model for Multi-Crypto Price Forecasting HSLU · 2019–22 · Time Series
Dynamic Pricing for Hotels HSLU · 2019–22 · ML
Anomaly Detection for Sensor Measurement Errors HSLU · 2019–22 · Time Series
CNN Classifier for Selfie Detection HSLU · 2019–22 · Computer Vision
Logo Similarity with VGG16 & Inception HSLU · 2019–22 · Computer Vision
Customer Loyalty & Churn Prediction S-Bank · 2012–17 · ML
Fuzzy Controller for 3-Degree Parallel Robot FUM · 2012–17 · Control
Adaptive MPC for Pick-and-Place Robot Arms FUM · 2012–17 · Control

L4

Publications

∑ citations

Kakooee, R., et al. — Text vs. Phoneme Intermediates for Low-Resource Swiss German Text-to-Speech. SwissText, 2026 Project page →
Kakooee, R., Dillenburger, B. — Enhancing Architectural Space Layout Design By Pretraining Deep RL Agents. J. Computational Design & Engineering, 2025 Project page →
Kakooee, R., Dillenburger, B. — Illuminating Spaces: Deep RL and Laser-Wall Partitioning. 27th Generative Art Conf., 2024
Kakooee, R., Dillenburger, B. — Reimagining space layout design through deep RL. J. Computational Design & Engineering 11.3, 2024 Project page →
Timmel, V., Paonessa, C., Kakooee, R., et al. — Fine-tuning Whisper on Low-Resource Languages for Real-World Applications. arXiv:2412.15726, 2024
Kakooee, R., Dillenburger, B. — Design Process is a Reinforcement Learning Problem. Deep RL Workshop, NeurIPS 2022
Kakooee, R., Dillenburger, B. — RLDesigner: Spatial Layout Planning as an MDP. 15th European Workshop on RL, 2022
Bernhard, M., Kakooee, R., et al. — TopoGAN: Topology Optimization with GANs. Advances in Architectural Geometry, 2021
Akizuki, Y., Bernhard, M., Kakooee, R., et al. — Generative Modelling with Design Constraints. CAADRIA, 2020
Yousefi, A., Kakooee, R., et al. — Predicting Learning Dynamics via Variational Bayes. IEEE EMBC, 2017

L5

Blogs

posts.dropna()

What the AI Community Currently Lacks LinkedIn · Sep 2024
AI is My Last Hope LinkedIn · Sep 2024
Teaching is All About Knowledge Compression LinkedIn · Sep 2024
My Journey with AI LinkedIn · Sep 2024
AI & LLMs LinkedIn · Jul 2024
Reinforcement Learning & Deep Learning LinkedIn · Jul 2022
Design Process is a Reinforcement Learning Problem LinkedIn · 2022
Deep RL for Architectural Space Layout Design — Part III Medium · 2022
Deep RL for Architectural Space Layout Design — Part II Medium · 2022
Deep RL for Architectural Space Layout Design — Part I Medium · 2022
Deep Double Descent LinkedIn · Dec 2019

L6

Stack

model.parameters()

Languages

Python · MATLAB · C++ · JavaScript · SQL · R

ML / Deep Learning

PyTorch · TensorFlow · Keras · scikit-learn · Avalanche

NLP & LLMs

HuggingFace · LangChain · TRL · Accelerate · Diffusers

Reinforcement Learning

RLlib · TorchRL · Stable-Baselines · Isaac Sim · Isaac Lab · MuJoCo · Gym

L7

Foundations

ctrl + math

Control Engineering

Fuzzy Control — 3-degree parallel robot controller design & tuning
Model Predictive Control — Adaptive MPC for pick-and-place robot arms
Robust & Multivariate Control — H∞, LQG, and MIMO controller design for uncertain systems
Signal Processing — Sensor fusion, filtering, and anomaly detection pipelines

Mathematics

Optimization — Convex & non-convex, gradient methods, constrained optimization
Dynamical Systems — Phase portraits, stability analysis, Lyapunov methods
Probability & Statistics — Bayesian inference, stochastic processes, MCMC
Decision Theory — Drift-diffusion models, evidence accumulation, bounded rationality

L8

Output

softmax → contact

Get in touch

Open to research collaborations, speaking invitations, and industry projects.

GitHub LinkedIn X Scholar Email