Nana's Homepage

I am an Audio AI Engineer at Zoom working on expressive text-to-speech, audio deepfake detection, and real-time speech enhancement.

💡 Read my latest blog post: Nana's Audio AI Log!

I received my Ph.D. in Computer Science from Nanyang Technological University and my B.Sc. from Sichuan University.

Publications [Blog] [Google Scholar]

* indicates equal contribution

SEA-Spoof: Bridging The Gap in Multilingual Audio Deepfake Detection for South-East Asian

J Wu, N Hou, Z Pan, Q Zhang, SH Bhupendra, S Mondal

arXiv 2025 /

Aligning Generative Speech Enhancement with Human Preferences via Direct Preference Optimization

H Li, N Hou, Y Hu, J Yao, SM Siniscalchi, ES Chng

arXiv 2025 /

Dual-path style learning for end-to-end noise-robust speech recognition

Y Hu, N Hou, C Chen, ES Chng

INTERSPEECH 2023 / arXiv