Publications

Selected publications and reports

Each item includes venue, year, authors, a concise contribution summary, and available links.

Workshop paper
2025

Listening or Reading? An Empirical Study of Modality Importance Analysis Across AQA Question Types

DCASE 2025 Workshop

Zeyu Yin, Yiqiang Cai, Pingsong Deng, Xinyang Lyu, Shengchen Li

Designed the study, implemented modality-importance experiments, analyzed results across question types, and wrote the paper.

Technical report
2025

ECHOTWIN-QA: A Dual-Tower BEATSBERT System for DCASE 2025 Task 5 Audio Question Answering

DCASE 2025 Challenge (Task 5)

Zeyu Yin, Ziyang Zhou, Yiqiang Cai, Shengchen Li, Xi Shao

Built the end-to-end AQA system from scratch, ran training and evaluation pipelines, conducted ablations, and wrote the technical report.

Technical report
2025

ADAPTF-SEPNET: AudioSet-Driven Adaptive Pre-training of TF-SEPNet for Multi-device Acoustic Scene Classification

DCASE 2025 Challenge

Ziyang Zhou, Zeyu Yin, Yiqiang Cai, Shengchen Li, Xi Shao

Contributed to model development and experimental evaluation, and supported results analysis and manuscript preparation.

Workshop contribution
2025

EmoSound: A Multimodal AI Agent for Emotion-Aware Audio Accompaniment of Emoticons

BICS 2025

Jianghui Sun, Haosen Shi, Zeyu Yin, Wansu Mo, Hongyi Ding, Yiming Hu, Xi Yang, Yuyao Yan

Implemented components for the multimodal agent and evaluation pipeline, and contributed to experiments and writing.