Day 2: Thursday, 23 Oct 2025 - Overview |
08:00-08:30 |
D2-0800_IB Registration
Location: Island Ballroom |
08:30-10:00 |
D2-0830_IB Active Noise Cancellation Workshop Keynote
Location: Island Ballroom |
08:30-10:00 |
D2-0830_L1 Advanced Topics in Audio Understanding of Sound Events, Scenes, and Beyond
Location: Lotus I |
08:30-10:00 |
D2-0830_L2 Speech and Language Processing I
Location: Lotus II |
08:30-10:00 |
D2-0830_H3 Research Frontiers in Learned Visual Data Coding and Processing
Location: Hibiscus III |
08:30-10:00 |
D2-0830_P1 Multimodal AI
Location: Peony I |
08:30-10:00 |
D2-0830_P2 Biomedical Signal Processing and Systems I
Location: Peony II |
10:00-10:30 |
Break |
10:30-12:00 |
D2-1030_IB Active Noise Cancellation Panel Discussion
Location: Island Ballroom |
10:30-12:00 |
D2-1030_L1 Advanced Topics on Music Processing
Location: Lotus I |
10:30-12:00 |
D2-1030_L2 Speech and Language Processing II
Location: Lotus II |
10:30-12:00 |
D2-1030_H3 Emerging Technologies and Applications of Image Processing and Computer Vision
Location: Hibiscus III |
10:30-12:00 |
D2-1030_P1 Biomedical Signal Processing and Systems II
Location: Peony I |
10:30-12:00 |
D2-1030_P2 Machine Learning: Algorithms and Application I
Location: Peony II |
12:00-12:30 |
Lunch |
12:30-13:30 |
D2-1230_IB Women in APSIPA Forum
Location: Island Ballroom |
13:30-14:30 |
D2-1330_IB Keynote 2 by Jane Wang
Location: Island Ballroom |
14:30-16:00 |
D2-1430_IB Perspective 3: Neural Speech Assessment and Its Application
Location: Island Ballroom |
14:30-16:00 |
D2-1430_L1 Active Noise Control I
Location: Lotus I |
14:30-16:00 |
D2-1430_L2 Speech and Language Processing III
Location: Lotus II |
14:30-16:00 |
D2-1430_H3 Machine Learning: Information and Medical Applications
Location: Hibiscus III |
14:30-16:00 |
D2-1430_P1 Audio Processing
Location: Peony I |
14:30-16:00 |
D2-1430_P2 Signal & Information Processing I
Location: Peony II |
16:00-16:30 |
Break |
16:30-18:00 |
D2-1630_IB Education Forum
Location: Island Ballroom |
16:30-18:00 |
D2-1630_L1 Active Noise Control II
Location: Lotus I |
16:30-18:00 |
D2-1630_L2 Speech and Language Processing IV
Location: Lotus II |
16:30-18:00 |
D2-1630_H3 Recent Advances in Multimedia Enrichment, Security and Privacy
Location: Hibiscus III |
16:30-18:00 |
D2-1630_P2 Advances in Multimodal AI for Multimedia Applications
Location: Peony II |
19:00-21:30 |
D2-1900_IB Banquet
Location: Island Ballroom |
Day 2: Thursday, 23 Oct 2025 - With Papers |
08:00-08:30 |
D2-0800_IB Registration
Location: Island Ballroom |
08:30-10:00 |
D2-0830_IB Active Noise Cancellation Workshop Keynote
Location: Island Ballroom |
08:30-10:00 |
D2-0830_L1 Advanced Topics in Audio Understanding of Sound Events, Scenes, and Beyond
Location: Lotus I
D2-0830_L1.1 58 Evaluation of auditory and tactile perception for augmented sound-image enhancement using pre-virtual-leading hypersonic signals
Ryota Imanaka, Yuting Geng, Masato Nakayama, Takanobu Nishiura
D2-0830_L1.2 103 Improvement in Variance Estimation in Variable-Step-Size Shared-Error NLMS Algorithm for Acoustic Echo and Noise Canceller
Kenta Iwai
D2-0830_L1.3 117 Hierarchical Sparse Sound Field Reconstruction with Spherical and Linear Microphone Arrays
Shunxi Xu, Craig T. Jin
D2-0830_L1.4 157 Robust Superdirective Beamforming Using a Uniform Circular Array with Directional Microphones
Weilong Huang, Longfei Felix Yan, Emanuël A.P. Habets
D2-0830_L1.5 211 Towards Robust Stereo 3-D SELD: A Study of Perceptual Features and Data Augmentation
Jun Wei Yeow, Ee-Leng Tan, Santi Peksi, Woon-Seng Gan, Huang Qirui
D2-0830_L1.6 258 Pre-training Autoencoder for Acoustic Event Classification via Blinky
Xiaoyang Liu, Yuma Kinoshita
D2-0830_L1.7 275 Sound source enhancement using power spectral density estimation in beamspace for a dual unmanned aerial vehicle system
Mingxue Song, Jin Xuan Teh, Yusuke Hioka, Benjamin Yen, Hiroshi Saruwatari
D2-0830_L1.8 328 Three-Dimensional Gradient-Based Tracking of Multiple Sound Sources
Shaoheng Xu, Wei-Ting Lai, Yile (Angela) Zhang, Jihui (Aimee) Zhang, Amy Bastine, Prasanga Samarasinghe, Thushara Abhayapala
D2-0830_L1.9 389 Retrieval-Augmented Difference Captioning to Explain Unsupervised Anomalous Sound Detection
Ryoya Ogura, Tomoya Nishida, Yohei Kawaguchi
D2-0830_L1.10 398 An Evaluation of Supervised Virtual Microphone Estimators in Reverberant Sound Fields
Kimihiro Hattori, Wen-Chin Huang, Kazuya Takeda, Tomoki Toda
D2-0830_L1.11 459 Human-CLAP: Human-perception-based contrastive language–audio pretraining
Taisei Takano, Yuki Okamoto, Yusuke Kanamori, Yuki Saito, Ryotaro Nagase, Hiroshi Saruwatari
D2-0830_L1.12 461 Training Acoustic Scene Classification Models Robust to Asynchrony in Distributed Microphone Arrays
Takao Kawamura, Nobutaka Ono |
08:30-10:00 |
D2-0830_L2 Speech and Language Processing I
Location: Lotus II
D2-0830_L2.1 95 Neural Speech Separation with Parallel Amplitude and Phase Spectrum Estimation
Fei Liu, Yang Ai, Zhen-Hua Ling
D2-0830_L2.2 127 Single-Channel Speech Enhancement in Spherical-Mapped Short-Time Spectral Domain
Yu Morinaga, Naoto Kotake, Iori Hashimoto, Suehiro Shimauchi, Shigeaki Aoki
D2-0830_L2.3 142 Introducing Self-Supervised Learning Models for Spoken Query-Spoken Term Detection
Masato Nagase, Kazunori Kojima, Shi-wook Lee, Yoshiaki Itoh
D2-0830_L2.4 150 Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal
Ting Dang, Trini Manoj Jeyaseelan, Eliathamby Ambikairajah, Vidhyasaharan Sethu
D2-0830_L2.5 179 Segment Transformer: AI-Generated Music Detection via Music Structural Analysis
Yumin Kim, Seonghyeon Go
D2-0830_L2.6 213 Dialect Identification Using Resource-Efficient Fine-Tuning Approaches
Zirui Lin, Haris Gulzar, Monnika Roslianna Busto, Akiko Masaki, Takeharu Eda, Kazuhiro Nakadai
D2-0830_L2.7 236 A High-Quality and Low-Complexity Streamable Neural Speech Codec with Knowledge Distillation
En-Wei Zhang, Hui-Peng Du, Xiao-Hang Jiang, Yang Ai, Zhen-Hua Ling
D2-0830_L2.8 253 Effectiveness of streaming ASR for real-time laughter and screaming detection
Mizuki Kurasawa, Yoshiko Arimoto
D2-0830_L2.9 262 Mitigating Data Imbalance in Automated Speaking Assessment
Fong-Chun Tsai, Kuan-Tang Huang, Bi-Cheng Yan, Tien-Hong Lo, Berlin Chen
D2-0830_L2.10 446 An Information-Theoretic Approach to Data Selection for Generative Topic Modeling
Michael Santoso, Bhone Tay Zar Kyaw, Valentinus Roby Hananto, Victor Kryssanov
D2-0830_L2.11 571 Collective Learning-based Optimal Transport GAN with Multi-Level Fine-Grained and Global Discriminators for Voice Conversion
Sandipan Dhar, Md. Tousin Akhter, Nanda Dulal Jana, Swagatam Das, Monorama Swain, Saurav Chowdhury |
08:30-10:00 |
D2-0830_H3 Research Frontiers in Learned Visual Data Coding and Processing
Location: Hibiscus III
D2-0830_H3.1 64 Neural Implicit Representations for Object-centric Machine Vision Tasks
Yeoneui Kim, Je-Won Kang
D2-0830_H3.2 106 GoP-to-Frame Encoder Adaptation for Learned Video Compression
Xiaohan Pan, Runsen Feng, Henan Wang, Yixin Gao, Zhibo Chen
D2-0830_H3.3 161 Efficient Adversarial Attack and Training on Learned Image Compression
Jun Kurihara, Heming Sun
D2-0830_H3.4 204 Accelerating VVC Inter-Frame Coding: A Lightweight CNN for Fast QTMT Partitioning
Jui-Chen Luo, Jiann-Jone Chen, Tien-Ying Kuo, Yi-Fan Wu, Zhang Kai-Jie
D2-0830_H3.5 335 Multimodal Speech Analysis for Early Detection of Mild Cognitive Impairment: A Scalable Approach
Muhammad Bilal, Waleed Abdulla, Gary Cheung, Lynette Tippett, Reza Shahamiri
D2-0830_H3.6 383 Boundary-Enhanced Attention Network for Breast Mass Segmentation
Rong Chen, Karungaru Stephen, Kenji Terada, Linhuang Wang
D2-0830_H3.7 465 Scale and Rotation Estimation of Similarity-Transformed Images via Cross-Correlation Maximization Based on Auxiliary Function Method
Shinji Yamashita, Yuma Kinoshita, Hitoshi Kiya
D2-0830_H3.8 468 Strong Eye Closure Detection in Children with Profound Intellectual and Multiple Disabilities Using Robust Temporal Difference Features
Kaito Kosaki, Teppei Nakano, Mari Wakabayashi, Tomomi Sato, Tetsuji Ogawa
D2-0830_H3.9 532 A Rate-Quality Model for Learned Video Coding
Sang NguyenQuang, Cheng-Wei Chen, Xiem HoangVan, Wen-Hsiao Peng
D2-0830_H3.10 539 Low-Light RAW Image Enhancement with Additive Parameterization and State Space Model
Shugo Yamashita, Masaaki Ikehara
D2-0830_H3.11 621 Synthesizing and Restoring Weather-corrupted Images with Conditional Diffusion Models
Youngho Go, Sung-Hak Lee |
08:30-10:00 |
D2-0830_P1 Multimodal AI
Location: Peony I
D2-0830_P1.1 23 VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models via Voxel Representation
Alan (Gia Tuan Dao) Dao, Norapat Buppodom
D2-0830_P1.2 155 Active Multi-Object Tracking for 3D Reconstruction with Hierarchical Reinforcement Learning
Heng Li, Cheng Cai
D2-0830_P1.3 193 Multimodal Sentiment Analysis with Missing Modality: A Knowledge-Transfer Approach
Weide Liu, Huijing Zhan
D2-0830_P1.4 299 Modeling Spatiotemporal Multimodal Data With Kernel Graph Regression Models And Copulas
Jeffrey Wu, Gareth Peters
D2-0830_P1.5 510 CopeCap: A lightweight image captioning model with collaborative prompt learning
Xiwei Yu, Guoshun He, Huijing Zhan
D2-0830_P1.6 541 Lyric-Aware Karaoke Background Video Selection Using Large Language Models and Moment Retrieval
Tomoki Ariga, Jun Taniguchi, Yosuke Higuchi, Sayaka Toma, Kunihiro Abe, Rie Shigyo, Tetsuji Ogawa
D2-0830_P1.7 550 Audio-Visual Speech Recognition Based on Cross-Lingual Transfer Learning
Fumiya Kondo, Tamura Satoshi
D2-0830_P1.8 562 Exploring Machine Learning and Language Models for Multimodal Depression Detection
Javier Si Zhao Hong, Timothy Zoe Delaya, Sherwyn Chan Yin Kit, Pai Chet Ng, Xiaoxiao Miao |
08:30-10:00 |
D2-0830_P2 Biomedical Signal Processing and Systems I
Location: Peony II
D2-0830_P2.1 26 Predicting Problematic Internet Use in Children Using Feature-Rich Structured Data with Ensemble Machine Learning and Bayesian Optimisation
Niteesh K R, Pooja T S
D2-0830_P2.2 148 Phonocardiogram Signal Analysis for Myocardial Infarction Level Prediction Using Deep Learning Model
Ira Puspasari, Tati L.R. Mengko, Agung W. Setiawan, Miftah Pramudyo, Nobuo Watanabe, Trio Adiono
D2-0830_P2.3 173 Prediction of Maximum and Minimum Postprandial Blood Glucose Levels in People with Diabetes
Kotaro Nagayama, Shota Kato, Kana Eguchi, Masahide Hamaguchi, Hiroyuki Tominaga, Youji Hamaguchi, Michiaki Fukui, Manabu Kano
D2-0830_P2.4 214 Towards Telepathic Communication: A Multi-Band EEG Model for Imaginary Speech Decoding
Yifan Zhang, Yuting Ding, Fei Chen
D2-0830_P2.5 240 Tiny-VRN: A Lightweight Variational Residual Network for EEG-Based Emotion Recognition
Sivaraj Nimishan, Selvarajah Thuseethan, Shanmuganathan Vasanthapriyan, Roshan G. Ragel
D2-0830_P2.6 241 A Comparison of Solicited and Longitudinal Cough Sounds for Tuberculosis Detection
Aprianto Dwi Prasetyo, Bagus Tris Atmaja, Dhany Arifianto, Sakriani Sakti
D2-0830_P2.7 336 Detecting Defecation Premonition from the Acoustic Activity of Bowel Sounds
Shota Miyagawa, Toshitaka Yamakawa, Masayuki Tanabe, Kazushi Ikeda
D2-0830_P2.8 407 EegCNR: A Novel Feature for Attention Estimation from EEG
Asif M S, Sagila Gangadharan K, Achutavarrier Prasad Vinod
D2-0830_P2.9 413 Lower Limb Calf Muscle Segmentation from Diffusion-Weighted Magnetic Resonance Images Using Deep Learning
Eshan Pandey, Xiaomeng Wang, Julian Gan, Ying-Hwey Nai, Derek Hausenloy, Pek Lan Khong, Forest Su Lim Tan, Thiruneepan Selvakulasingam, Ryan Fraser Kirwan, Cheryl Pei Ling Lian
D2-0830_P2.10 415 Principal Component Regularization in Iterative Inversion of DBIM for Ultrasound Tomography
Nguyen Thi Thu, Tran Quang-Huy, Luong Thi Theu, Duc-Tan Tran
D2-0830_P2.11 447 Reasoning Visualization for Critical Care EEG Classification with Prototypical Part Networks
Takuma Bingo, Hajime Yano, Taichiro Ashizaki, Kazuma Koda, Masaya Togo, Riki Matsumoto, Tetsuya Takiguchi
D2-0830_P2.12 227 Plant Species-Specific Anomaly Detection Based on Electrophysiological Signals
Andy Desman Lo, Elvin Nur Furqon, Junaidul Islam, Isack Farady, Kahlil Muchtar, Ronnie Concepcion II, Chih-Yang Lin |
10:00-10:30 |
Break |
10:30-12:00 |
D2-1030_IB Active Noise Cancellation Panel Discussion
Location: Island Ballroom |
10:30-12:00 |
D2-1030_L1 Advanced Topics on Music Processing
Location: Lotus I
D2-1030_L1.1 177 Drum-to-Vocal Percussion Sound Conversion and Its Evaluation Methodology
Rinka Nobukawa, Makito Kitamura, Tomohiko Nakamura, Shinnosuke Takamichi, Hiroshi Saruwatari
D2-1030_L1.2 283 How do Deaf and Hard of Hearing people listen to Music Instruments? Subjective Evaluation and Acoustic Features
Rumi Hiraga, Yuhki Shiraishi, Keiichi Yasu
D2-1030_L1.3 298 Quality Assessment of DNN–Based Algorithms for Music Boundary Detection
Aneeka Azmat, Li Su, ChengHsin Hsu
D2-1030_L1.4 319 Note-level Nonchord-tone Identification with Graph Neural Networks
Yui Uehara, Satoshi Tojo
D2-1030_L1.5 337 Evaluation Score Prediction for Japanese Songs Based on Melody Fitness to Lyrics
Sosuke Nishimura, Eita Nakamura
D2-1030_L1.6 349 A Comparative Study of Statistical Features and Deep Learning for Orchestral Texture Classification
Zih-Syuan Lin, Jun-You Wang, Li Su
D2-1030_L1.7 356 Efficient Transformer-Based Piano Transcription With Sparse Attention Mechanisms
Weixing Wei, Kazuyoshi Yoshii
D2-1030_L1.8 424 Transformer-Based Unpaired Piano Accompaniment Style Transfer
Hsin Ai, Yi-Hsuan Yang
D2-1030_L1.9 441 Designing a Music Difficulty Measure for Controllable Automatic Piano Rearrangement
Hikari Miyaji, Keito Sawada, Wen-Chin Huang, Tomoki Toda
D2-1030_L1.10 453 Vocal onset detection and pitch segmentation in medieval choral music guided by original notational sources
Samuel Bellows, Sarabeth Mullins, Brian Katz
D2-1030_L1.11 496 MORTM: MoE-Optimized Rhythmic Transformer Model for Symbolic MIDI Generation
Takaaki Nagoshi, Tetsuro Kitahara
D2-1030_L1.12 517 TAPA-ICL: Taxonomy-Aware Prompt Augmentation for In-Context Learning in Music Understanding
Jiahao Zhao, Yunjia Li, Kazuyoshi Yoshii |
10:30-12:00 |
D2-1030_L2 Speech and Language Processing II
Location: Lotus II
D2-1030_L2.1 97 Beyond Binary Detection: Multi-Etiology Dysarthria Classification with Pre-trained Speech Models
Zihan Zhong, Qianli Wang, Satwinder Singh, Clarion Mendes, Mark Hasegawa-Johnson, Waleed Abdulla, Seyed Reza Shahamiri
D2-1030_L2.2 118 A Dual-Path Speaker-Independent Acoustic-to-Articulatory Inversion Model Based On Content and Speaker Information Disentanglement
Qiang Fang
D2-1030_L2.3 120 Dementia Prediction From Speech Signal Using Optimized Prosodic Features
Bagus Tris Atmaja, Sakriani Sakti
D2-1030_L2.4 154 Speech Emotion Recognition via Entropy-Aware Score Selection
ChenYi Chua, JunKai Wong, Chengxin Chen, Xiaoxiao Miao
D2-1030_L2.5 264 Improving Exemplar-Based Electrolaryngeal Speech Voice Conversion via Robust Content Representations
Fo-Rui Li, Hsin-Te Hwang, Ming-Chi Yen, Men-Tung Lo, Yu Tsao, Hsin-Min Wang
D2-1030_L2.6 308 An Efficient Transfer Learning Method Based on Adapter with Local Attributes for Speech Emotion Recognition
Haoyu Song, Mcloughlin Ian, Qing Gu, Nan Jiang, Yan Song
D2-1030_L2.7 331 ASRQ-VC: ASR-Guided Speech Content Quantization for High-Fidelity Voice Conversion
Songting Liu, Deheng Ye, Wei Yang, Haoyang Li, Eng Siong Chng
D2-1030_L2.8 338 PUNSER: Large-Scale Pre-trained and Unified Model for Practical Speech Emotion Recognition
Yu Hayashizaki, Takashi Nose, Sumiharu Kobayashi, Satoru Fukayama, Akinori Ito
D2-1030_L2.9 440 Investigation of the effectiveness of converted speech auditory feedback in low-latency real-time voice conversion
Kiseki Niwa, Kazuhiro Kobayashi, Tomoki Toda
D2-1030_L2.10 587 Study on Signal Processing Techniques in Protecting Voice Personae Against Speech Synthesis Systems
Nopparut Li, Candy Olivia Mawalim, Masashi Unoki
D2-1030_L2.11 603 MixedG2P-T5: G2P-free Speech Synthesis for Mixed-script texts using Speech Self-Supervised Learning and Language Model
Joonyong Park, Daisuke Saito, Nobuaki Minematsu |
10:30-12:00 |
D2-1030_H3 Emerging Technologies and Applications of Image Processing and Computer Vision
Location: Hibiscus III
D2-1030_H3.1 54 You Only Touch Once: One-Touch System for Personalized 3D Music Video Generation
Kyungjune Lee, Youngjin Shin, Jungwoo Huh, Sanghoon Lee
D2-1030_H3.2 143 Single-Image Pupil Localization via Implicit 3D Eye Reconstruction
Taejun Roh, Yejin Cho, Duong Hai Nguyen, Chul Lee
D2-1030_H3.3 171 Flow-Guided Consistent Video Depth Estimation for Cross-Dataset Generalization
Jaeseok Jang, Chang-Su Kim
D2-1030_H3.4 196 DCB: An Efficient Approach for Building Long-Range Dependencies in CNNs
Tianxiang Lan, Mingyi He, Yuchao Dai
D2-1030_H3.5 269 A User-Guided and Local Motion-Adaptive Framework for Virtual Product Placement in Video
Tianwen Zhang, Ju-Won Seo, Kang-Min Kim, Keunsoo Ko
D2-1030_H3.6 332 Shallow yet Perceptual Decoding for Neural Image Compression through Minimal Nonlinearity
JaeKyung Ryu, Nam Ik Cho
D2-1030_H3.7 445 SyncScore: A Framework for Synchronization Scoring in Group Sports via Human Pose Estimation
Khai Pin Ang, Iven Zi Yin Low, Yumun Hooi, Yuen Peng Loh
D2-1030_H3.8 518 Data Augmentation-Driven Segmentation of Ovarian Tumor Ultrasound Images using Vision Mamba
Thanh-Phuc Dao, Huyen-Trang To, Hoang-Son Bui, Thi-Lan Le
D2-1030_H3.9 548 Optimizing JPEG Decoder for Bitstream-Corrupted Image Restoration
Shumin Jiang, Hao Qin, Tianyi Liu, Yi Wang
D2-1030_H3.10 568 Semantic Scene Completion from a Single Depth Image with Coarse-Grained Segmentation
Jiun Yen Ching, Lai-Kuan Wong, Wai Lee Kung
D2-1030_H3.11 572 Pixel-weighted Domain Adaptation for Agricultural Segmentation
Shunta Kimura, Handie Shao, Shogo Matsumoto, Daiki Yamada, Toshihiro Kitajima, Hideki Nakayama |
10:30-12:00 |
D2-1030_P1 Biomedical Signal Processing and Systems II
Location: Peony I
D2-1030_P1.1 454 Freeze and Learn using KAN for Infant Cry Classification
Arth Shah, Vishnu Vardhan, Hemant Patil
D2-1030_P1.2 466 Investigation of Enhancement Strategies for Recurrent Spiking Neural Network based Brain-Machine Interface Decoding
Wilson Tansil, Nur Ahmadi, Timothy Constandinou, Dessi Puji Lestari
D2-1030_P1.3 531 Detecting Deceptive Responses Due to Psychological Bias by the Probability Density Function of EEG Content Rate Dynamics During NEO-FFI Answering
Yuto Ashikawa, Yosuke Kurihara
D2-1030_P1.4 535 A Comparative Analysis of Statistical, Regional CNN, and Sequential Transformer Approaches for Alzheimer's Disease Classification
Trí Huynh, Xuan Hoc Pham, Nhu Nguyen, Thi Thu Nguyen, Huong Ha, Lua Ngo
D2-1030_P1.5 591 Beyond Speech and More: Investigating the Emergent Ability of Speech Pre-Trained Models for Classifying Physiological Time-Series Signals
Orchid Chetia Phukan, Swarup Ranjan Behera, Girish, Mohd Mujtaba Akhtar, Arun Balaji Buduru, Rajesh Sharma
D2-1030_P1.6 630 Channel Selection Guided by Layer-wise Relevance Propagation for CNN-Based EEG Classification of Major Depressive Disorder
Woo-Seok Ahn, Seung-Hwan Lee, Han-Jeong Hwang
D2-1030_P1.7 631 Development of HRV-Based Biomarkers for Predicting Blood Glucose Levels
Ju-An Park, Jun-Seok Lee, Na-Ri Kim, Han-Jeong Hwang
D2-1030_P1.8 632 Development of 3D Textile Electrodes for Electrocardiography Measurement
Sang-Ho Lee, In-Su Park, Han-Jeong Hwang |
10:30-12:00 |
D2-1030_P2 Machine Learning: Algorithms and Application I
Location: Peony II
D2-1030_P2.1 22 Kernel Ridge Regression for Efficient Learning of High-Capacity Hopfield Networks
Akira Tamamori
D2-1030_P2.2 47 Enhanced Sliding Discrete Fourier Transform (eSDFT) with Error-Bound Control for Real-Time Parallel Processing
Jetsada Arnin, Danial Kahani, Bernard A. Conway
D2-1030_P2.3 216 Sparse-Coded Time-Delay DMD with Control for Nonlinear State-Space Modeling on Graphs
Ryuto Ito, Hiromu Kanauchi, Hiroyasu Yasuda, Masaaki Nagahara, Shogo Muramatsu
D2-1030_P2.4 229 Nonnegative Matrix Factorization Using Dirichlet-Distribution-Based Regularization
Haru Ogawa, Daichi Kitamura, Shoma Ayano
D2-1030_P2.5 274 Significance of co-occurring biomarkers in localization of epileptic seizure onset zone
Nawara Mahmood Broti, Masaki Iwasaki, Yumie Ono
D2-1030_P2.6 294 Reinforcement Learning in Portfolio Management: A Survey of Methods and Trends
Silan Hu, Yulin Huang, Arjun Agarwal, Tanya Warrier, Yuwen Wang, Haozhe Ma, Zhengding Luo
D2-1030_P2.7 313 Large Sparse Covariance Matrix Estimation via Dual Proximal Gradient Method
Fengpei Li, Ziping Zhao
D2-1030_P2.8 361 An improved method for Image Shadow Removal by Combining Deterministic and Stochastic Models
Hongjun Sheng, Lanqing Guo, Xinggan Peng, Zhiping Lin, Bihan Wen
D2-1030_P2.9 576 Knowledge-Infused Topic Model for Empathetic Dialogue Response
Po-Chuan Chen, Jen-Tzung Chien
D2-1030_P2.10 578 Cross-Patient Seizure Onset Zone Classification by Patient-Dependent Weight
Xuyang ZHAO, Hidenori Sugano, Toshihisa Tanaka
D2-1030_P2.11 609 NOCTUA: A High-Efficiency Reconfigurable NoC-based Transformer Universal Accelerator
Kun-Chih Chen, Pin-Ching Shen, Bo-Chun Chen |
12:00-12:30 |
Lunch |
12:30-13:30 |
D2-1230_IB Women in APSIPA Forum
Location: Island Ballroom |
13:30-14:30 |
D2-1330_IB Keynote 2 by Jane Wang
Location: Island Ballroom |
14:30-16:00 |
D2-1430_IB Perspective 3: Neural Speech Assessment and Its Application
Location: Island Ballroom
D2-1430_IB.1 637 Progress and Challenges in DNN-based Objective Quality Assessment of Synthesized Speech
Erica Cooper
D2-1430_IB.2 635 Advancing Speech Quality Assessment Through Scientific Challenges and Open-source Activities
Wen-Chin Huang
D2-1430_IB.3 646 Non-Intrusive Intelligibility Prediction for Hearing Aids: Recent Advances, Trends, and Challenges
Ryandhimas Zezario
D2-1430_IB.4 647 From Evaluation to Optimization: Neural Speech Assessment for Downstream Applications
Yu Tsao |
14:30-16:00 |
D2-1430_L1 Active Noise Control I
Location: Lotus I
D2-1430_L1.1 56 Design of speech leakage-suppressed audio-spot based on auditory masking area control with active masker cancellation using parametric array loudspeakers
Tomoki Hashida, Yuting Geng, Masato Nakayama, Takanobu Nishiura
D2-1430_L1.2 57 Multichannel feedforward active noise control system with optical laser microphone in reverberant environments
Maoto Mizutani, Kenta Iwai, Masato Nakayama, Takanobu Nishiura, Yoshiharu Soeta
D2-1430_L1.3 72 Frequency-domain online modeling of multiple secondary paths without auxiliary noise for active noise control
Siyuan Lian, Xiaofeng Zeng, Ruquan Sun, Jing Lu
D2-1430_L1.4 124 Applying Model-Agnostic Meta-Learning with Iterative Dichotomiser 3 for Alternating-Switching Active Noise Control Systems
Xiaoyi Shen, Dongyuan Shi, Woon-Seng Gan, Jun Yang
D2-1430_L1.5 285 A Robust Proactive Communication Strategy for Distributed Active Noise Control Systems
Junwei Ji, Dongyuan Shi, Zhengding Luo, Boxiang Wang, Ziyi Yang, Haowen Li, Woon-Seng Gan
D2-1430_L1.6 289 Directional Selective Fixed-Filter Active Noise Control Based on Convolutional Neural Network in Reverberant Environments
Boxiang Wang, Zhengding Luo, Haowen Li, Dongyuan Shi, Junwei Ji, Ziyi Yang, Woon-Seng Gan
D2-1430_L1.7 301 An Online Secondary Path Modeling Technique in a Hybrid Active Noise Control System
Harold Alexis Lao, Cheng-Yuan Chang
D2-1430_L1.8 345 A Diffusion Remote Microphone Technique for Distributed Active Noise Control
Tianyou Li, Sipei Zhao, Haowen Li, Xiaofeng Zeng, Ruquan Sun, Jing Lu
D2-1430_L1.9 511 An Integrated Active Noise Control and Crosstalk Cancellation System Designed Under a Generalized Model-Matching Framework
Michael Edy, Chih Yen Wang, Ching En Huang, You Siang Chen, Mingsian R. Bai
D2-1430_L1.10 553 Improvement of Noise Reduction in a Panel Combined with Multiple Loudspeakers Using Active Noise Control
Tatsuya Murao
D2-1430_L1.11 610 Selective Fixed Filter Sub-band Active Noise Control System Based on Reference Signal Power Estimation
Shota Toyooka, Ryo Matsuura, Kenta Iwai, Yoshinobu Kajikawa
D2-1430_L1.12 626 Performance analysis of active noise control over a spatial region
Jihui (Aimee) Zhang, Thushara Abhayapala, Naoki Murata, Prasanga Samarasinghe, Yu Maeno, Yuki Mitsufuji |
14:30-16:00 |
D2-1430_L2 Speech and Language Processing III
Location: Lotus II
D2-1430_L2.1 27 I^2TTS: Image-indicated Immersive Text-to-speech Synthesis with Spatial Perception
Jiawei Zhang, Tian-Hao Zhang, Jun Wang, Jiaran Gao, Ruijie Tao, Xinyuan Qian, Xu-Cheng Yin
D2-1430_L2.2 130 Chain-of-Thought Distillation for ASR Error Correction with Multimodal Large Language Models
Shaomeng Yang, Jiaming Luo, Jinran Wang, Rongfeng Su, Yongjie Zhou, Lan Wang, Nan Yan
D2-1430_L2.3 163 Direction-guided Spatial Attention for Multichannel Speech Enhancement
Shuai Nie, Yaran Chen, Shan Liang, Jiaming Xu, Runyu Shi
D2-1430_L2.4 168 A Study of Japanese Mixed Emotional Speech Synthesis Based on an End-to-End Emotional Speech Synthesis Model
Issei Sakata, Tetsuo Kosaka
D2-1430_L2.5 191 EFTTS: Zero-Shot Emotional Speech Synthesis via Conditional Flow Matching and Self-Supervised Representations
Haoyu Wang, Jiale Chen, Jiaxun Li, Sizhe Shan, Yuehai Wang
D2-1430_L2.6 208 Improving Speech-to-Speech Translation for Low-Resource Languages via Transfer Learning
Rui Zhou, Akinori Ito, Takashi Nose
D2-1430_L2.7 235 DialoSpeech: Dual-Speaker Dialogue Generation with LLM and Flow Matching
Hanke Xie, Dake Guo, Chengyou Wang, Yue Li, Wenjie Tian, Xinfa Zhu, Xinsheng Wang, Xiulin Li, Guanqiong Miao, Bo Liu, Lei Xie
D2-1430_L2.8 237 VICNet: FaderNet-Based Voice Impression Conversion with Affective Dimensional Representation
Takuya Takahashi, Saki Kugimoto, Toru Nakashika
D2-1430_L2.9 248 Strategic Re-weighting of U-Net Components in Diffusion Models for Enhanced Speech Enhancement without Retraining
Yuehai Zhang, Yang Li, Yuehao Zhao, Shoji Makino
D2-1430_L2.10 261 Fast and Speaker-Independent Utterance Selection for ASR-Free CALL Systems of Minority Languages
Takaki Koshikawa, Akinori Ito, Takashi Nose
D2-1430_L2.11 414 Speech-Content-Driven Highlighting of Translated Lecture Slides for Foreign Language Lecture Understanding
Naoki Muto, Chee Siang Leow, Junichi Hoshino, Takehito Utsuro, Hiromitsu Nishizaki |
14:30-16:00 |
D2-1430_H3 Machine Learning: Information and Medical Applications
Location: Hibiscus III
D2-1430_H3.1 24 Class Incremental Learning using Continual Backpropagation on Honey Botanic Origin Classification with Hyperspectral Imaging
Guyang Zhang, Iman Ardekani, Waleed Abdulla
D2-1430_H3.2 260 Multi-strategy improved electric eel foraging optimisation algorithm for UAV path planning
Zexin Zhang, Chengbiao Fu, Hongwei Guo, Anhong Tian
D2-1430_H3.3 278 A Deep Reinforcement Learning Approach to Roundabout Traffic Signal Control
Cheng-Yu Chen, Daniil Buryakov, Valentinus Roby Hananto, Victor Kryssanov
D2-1430_H3.4 300 A preliminary study on machine learning to predict circuit exchange in pediatric patients with ECMO
Tatsuya Hasegawa, Toshiyuki Nakanishi, Koichi Fujiwara
D2-1430_H3.5 303 HasRL Robot: A Heterogeneous Asynchronous Reinforcement Learning System for High-Dimensional Bipedal Control
Jingyang Mai, Zechen Guo, Zhengding Luo, Haozhe Ma
D2-1430_H3.6 324 A Psychological Strategy Annotation Method Using Multiple LLMs with a Chain of Thought Based on Deductive Reasoning
Jinran Wang, Jiaming Luo, Shaomeng Yang, Yongjie Zhou, Xuefang Zhang, Rongfeng Su, Nan Yan, Lan Wang
D2-1430_H3.7 431 Outlier Removal in MEG Data for Imagined Speech Classification
Koki Nose, Hajime Yano, Tetsuya Takiguchi, Seiji Nakagawa
D2-1430_H3.8 558 Performance Evaluation of CHIRPS and ETCCDI Indices for Extreme Rainfall Risk Mapping in Thailand Using XGBoost
Vinitar Khettar, Nuntikorn Kitratporn, Sawarin Lerk-u-suke, Jirabhorn Chaiwongsai, Phaisarn Jeefoo, Chanika Sukawattanavijit
D2-1430_H3.9 580 Riverbed Estimation Using Locally-Structured Unitary Network
Seiyu Hitomi, Hiroyasu Yasuda, Kiyoshi Hayasaka, Shogo Muramatsu
D2-1430_H3.10 594 Contrastive Learning of Temporal and Event-Based Behavioral Views for Universal User Embeddings
Yuuki Tachioka
D2-1430_H3.11 595 Market Forecasting Using LSTM-ARIMA Model with MACD Decomposition
Teng-Chih Yu, Jian-Jiun Ding |
14:30-16:00 |
D2-1430_P1 Audio Processing
Location: Peony I
D2-1430_P1.1 129 Anomalous Sound Detection Based on Derivative Features of Short-Time Holomorphic Fourier Transform
Iori Hashimoto, Yu Morinaga, Suehiro Shimauchi, Shigeaki Aoki
D2-1430_P1.2 140 Elastic Additive Angular Margin Loss Integrated with Mixup for Anomalous Sound Detection
Yihao Zhao, Yichen Yang, Xiao Zhang, Shoji Makino
D2-1430_P1.3 174 A Distilled Low-Latency Neural Vocoder with Explicit Amplitude and Phase Prediction
Hui-Peng Du, Yang Ai, Zhen-Hua Ling
D2-1430_P1.4 408 Directional Filtering of Sound Fields for Emphasizing Specific Directions of Arrival and Its Applications
Ryo Murakami, Natsuki Ueno
D2-1430_P1.5 409 Sound Field Estimation Method Robust to Microphone Position and Directivity Errors
Takumi Koga, Natsuki Ueno
D2-1430_P1.6 434 Anomalous Sound Detection Using Time-Frequency Derivative of Instantaneous Phase Features
Tran-Quang-Tuan Vo, Quoc-Huy Nguyen, Masashi Unoki
D2-1430_P1.7 492 Few-Step Diffusion-Based Voice Conversion Using Consistency Trajectory Models
Ryuichi Hatakeyama, Toru Nakashika, Takuya Takahashi
D2-1430_P1.8 583 Spatial Audio Signal Enhancement: A Multi-output MVDR Method in The Spherical Harmonic-domain
Huawei Zhang, Jihui (Aimee) Zhang, Huiyuan (June) Sun, Prasanga Samarasinghe |
14:30-16:00 |
D2-1430_P2 Signal & Information Processing I
Location: Peony II
D2-1430_P2.1 46 Generalized Student's t Sparse Kernel Learning for Robust Signal Processing
Long Pan, Libiao Peng, Xifeng Li, Dongjie Bi, Yongle Xie
D2-1430_P2.2 61 A Hierarchical Attention Model for Local and Global Feature Integration in RCS Classification
Yida Wu, Caiyun Wang, Jianing Wang, Xiaofei Li, Ying Nan
D2-1430_P2.3 66 A Sliding-Window Range–Bearing Scan STAP for Underwater Active Sonar Target Detection
Weisi Hua, Yixin Yang, Yuxuan Chen, Xianghao Hou
D2-1430_P2.4 76 TH-LDV: Transformer-based Hybrid method for Signal Detection in Laser Doppler Velocimetry
Yue Wang, Ruifeng Li, Changsong Liu, Liangrui Peng, Ning Ding, Gang Yao
D2-1430_P2.5 159 Estimating Dynamic Graph Flows with Kernel Models and Hadamard-Structured Riemannian Constraints
Duc Thien Nguyen, Konstantinos Slavakis, Dimitris Pados
D2-1430_P2.6 202 Period Estimation for Time-Varying Graph Signals and Its Application to Graph Wiener Filter
Tsutahiro Fukuhara, Junya Hara, Hiroshi Higashi, Yuichi Tanaka
D2-1430_P2.7 244 Computationally Efficient Sparse Signal Recovery by Deep Unfolded-Periodic Sketched ISTA
Tatsuki Tokumura, Ayano Nakai-Kasai, Tadashi Wadayama
D2-1430_P2.8 259 Fisher Information-based Metrics for Representation Learning
Do Nguyen Dang Thi, Le Quoc Anh, Tran Trong Duy, Le Vu Ha, Nguyen Linh Trung
D2-1430_P2.9 271 Wave Direction Estimation Based on Local Gradient Techniques from Satellite Imagery for Coastal Dynamics Monitoring
Woramet Simrum, Paweena Kanokhong, Chakapat Chokchaisiri, Somrudee Deepaisarn, Kittipisut Chansri, Chanyut Lisawat, Waranrach Viriyavit, Akkharawoot Takhom, Phutphalla Kong, Didin Agustian Permadi, Sharifah Hafizah Syed Ariffin, Surasak Boonkla, Kasorn Galajit, Jessada Karnjana
D2-1430_P2.10 318 HIQA-DB: A Benchmark Dataset for Image Quality Assessment in Hospital Surveillance
Yujin Han, Taewan Kim
D2-1430_P2.11 627 Semantic Neural View Synthesis for Key Content Preservation in Horizontal-to-Vertical Video Conversion
Dipanita Chakraborty, Minoru Okada, Kosin Chamnongthai |
16:00-16:30 |
Break |
16:30-18:00 |
D2-1630_IB Education Forum
Location: Island Ballroom |
16:30-18:00 |
D2-1630_L1 Active Noise Control II
Location: Lotus I
D2-1630_L1.1 63 Electro-acoustic component placement optimization for helicopter cabin ANC systems
Yuhang Yang, Liquan Shi, Ningyuan Liang, Guoyong Jin
D2-1630_L1.2 87 Spatial-Correlation-Based Error Weighting Method for Efficient Application of Filtered Reference Algorithm in Multichannel Active Noise Control
Meiling Hu, Jing Lu, Qingyu Ma
D2-1630_L1.3 134 An Alternating Mode Strategy for Adaptive Sound Field Control and Acoustic Path Tracking
Junqing Zhang, Jingli Xie, Dongyuan Shi, Wen Zhang, Jingdong Chen, Jacob Benesty
D2-1630_L1.4 265 DOA Estimation with Lightweight Network on LLM-Aided Simulated Acoustic Scenes
Haowen Li, Zhengding Luo, Dongyuan Shi, Boxiang Wang, Junwei Ji, Ziyi Yang, Woon-Seng Gan
D2-1630_L1.5 305 Co-forecasting of Time-varying Spatial-frequency Map for Selective Fixed-Filter Multichannel ANC based on Dynamic Factor Graph
Xiruo Su, Bin Wu
D2-1630_L1.6 310 Unsupervised Spectrogram Enhancement Algorithm Based on Bi-LSTM
Hanwen Zhang, Xiruo Su, Zhijuan Zhu, Bin Wu, Lingyun Ye
D2-1630_L1.7 330 Continual Learning-Based Selective Fixed-filter Active Noise Control
Jingsong Xiao, Qirui Huang
D2-1630_L1.8 340 Meta-Learned Regional Initialization of Control Filters for Headphone Active Noise Control
Ziyi Yang, Zhengding Luo, Dongyuan Shi, Junwei Ji, Boxiang Wang, Haowen Li, Qirui Huang, Woon-Seng Gan
D2-1630_L1.9 458 RAMDC: Room-Aware Multi-Device Clustering for Large Scale Teleconferencing
Yile Zhang, Weiting Lai, Amy Bastine, Xingyu Chen, Lachlan Birnie, Thushara Abhayapala, Prasanga Samarasinghe
D2-1630_L1.10 490 Multi-channel ANC with Adaptive Kernel Assisted On-line Secondary Path Modeling
Hucheng Wang, Tao Liu, Junqing Zhang, Wen Zhang
D2-1630_L1.11 497 A Laplace Distribution-Based Variable Step-Size FxlogLMS Algorithm for Active Impulsive Noise Control
Aoi Haneda, Yosuke Sugiura, Tetsuya Shimamura
D2-1630_L1.12 513 Research Progress on Active Control of Road Noise in Vehicles
Wangxiaoxu Chen, Jiancheng Tao, Shuping Wang, Kai Chen, Haishan Zou, Xiaojun Qiu |
16:30-18:00 |
D2-1630_L2 Speech and Language Processing IV
Location: Lotus II
D2-1630_L2.1 29 Leveraging Language Information for Target Language Extraction
Mehmet Sinan Yildirim, Ruijie Tao, Wupeng Wang, Junyi Ao, Haizhou Li
D2-1630_L2.2 116 VietLyrics: A Large-Scale Dataset and Models for Vietnamese Automatic Lyrics Transcription
Nguyen Quoc Anh, Bernard Cheng, Kelvin Soh
D2-1630_L2.3 449 Autofocus Neural Beamformer Based on Steering Vector Estimation
Reiya Marukawa, Takeshi Yamada
D2-1630_L2.4 478 Estimating User Sentiment at Sub-exchange Granularity from Exchange-level Annotations
Daichi Yukizawa, Kazunori Komatani, Ryu Takeda, Kenta Yamamoto
D2-1630_L2.5 502 DAU-KDAH Dysarthic Multi-Lingual and Multimodal Speech Corpora for Indic Languages
Arth Shah, Hiya Chaudhari, Kavya Kumar, Arushi Srivastava, Priya Damdar, Ravindrakumar Purohit, Dharmendra Vaghera, Bhavna Singh, Aparna Walanj, Abhishek Srivastava, Hemant Patil
D2-1630_L2.6 503 Gamma-VAE-VC: Voice conversion based on VAE assuming gamma distribution for both latent variables and observation
Nanako Imaichi, Takuya Takahashi, Toru Nakashika
D2-1630_L2.7 514 Zero-shot Context Biasing with Trie-based Decoding using Synthetic Multi-Pronunciation
Changsong Liu, Yizhou Peng, Eng Siong Chng
D2-1630_L2.8 551 Dimension 414 and Minimal Embedding Dimensions for Phonetic Feature Encoding in WavLM
Narthana Sivalingam, Uthayasanker Thayasivam
D2-1630_L2.9 560 Directional Hybrid Optimization of HRTFs for Low-Order Spherical Harmonics Binaural Rendering
Rui Zhang, Yuxuan Ke, Qunping Ni, Ge Yao, Xiaodong Li, Chengshi Zheng
D2-1630_L2.10 577 Speech Enhancement Network With Windowed Cross Attention Using Noise-Reference Microphone
Kota Suzuki, Yosuke Sugiura, Tetsuya Shimamura
D2-1630_L2.11 641 BAANI: A 296M-Parameter Neural Vocoder for End-to-End Punjabi Speech Synthesis
Siddharth Kumar, Nisarg Trivedi, Ravindrakumar Purohit, Hemant Patil |
16:30-18:00 |
D2-1630_H3 Recent Advances in Multimedia Enrichment, Security and Privacy
Location: Hibiscus III
D2-1630_H3.1 89 Reversible Data Hiding in EtC Images with Flexible Access Privileges
Yusaku Kato, Shoko Imaizumi
D2-1630_H3.2 109 Robust Ownership Verification of DNN Models Against JPEG Compression via Probability-Controlled Adversarial Attacks
Teruki Sano, Minoru Kuribayashi, Masao Sakai, Shuji Ishobe, Eisuke Koizumi, Zhang Zhang
D2-1630_H3.3 136 Detoxification of Poisoned Recognition Models by Fine-tuning with Out-of-Distribution Samples
Junsuke Takano, Kazuaki Nakamura
D2-1630_H3.4 192 Layer-Wise Weight Statistics for Node Classification and Defense of Federated Large Language Models
Alexander Berns, Reon Akai, Minoru Kuribayashi, Rémi Cogranne
D2-1630_H3.5 210 Robustness evaluation against fine-tuning in associative watermarking method for CNN
Keiichi Mori, Masaki Kawamura
D2-1630_H3.6 218 Lossless Image Processing for OpenEXR Images with Flexible Functions
Anna Yamaguchi, Shoko Imaizumi
D2-1630_H3.7 225 Proposal of a Random Encoding Layer Compatible with Arbitrary Message Lengths for DiffuseTrace
Ou Egami, Masaki Kawamura
D2-1630_H3.8 292 Automatic Dependent Surveillance-Broadcast Preamble Classification for Spoofing Detection
Darren Kah Hou Quek, Guang Hua, Zhiping Lin
D2-1630_H3.9 320 Model Extraction Attack and Its Countermeasure for Denoising Diffusion Implicit Models
Hayato Shoji, Kazuaki Nakamura
D2-1630_H3.10 325 Content-Aware Dominant Color Extraction and Its Application to Mltiple-key-Color Image Retrieval
Mei Hashimoto, Michiharu Niimi
D2-1630_H3.11 469 Privacy-Preserving Image Retrieval Scheme Using Combined Features in Cloud Computing
Jing Liang, Yuxuan Wang, Tingting Song, Ce Zheng, Peiya Li |
16:30-18:00 |
D2-1630_P2 Advances in Multimodal AI for Multimedia Applications
Location: Peony II
D2-1630_P2.1 59 Efficient Generative Adversarial Networks for Color Document Image Enhancement and Binarization Using Multi-scale Feature Extraction
Rui-Yang Ju, KokSheik Wong, Jen-Shiun Chiang
D2-1630_P2.2 68 Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing
Zehua Liu, Xiaolou Li, Li Guo, Lantian Li, Dong Wang
D2-1630_P2.3 96 Computationally-efficient Call Classification of New Zealand Birds using Texture-based Features
Yonghui Tao, Mathis Quere, Yusuke Hioka, Stephen Marsland
D2-1630_P2.4 245 Incorporating Semantic Visual Content into Click-Through Rate Prediction for Video Advertisements
Yoshiaki Tanabe, Shuntaro Masuda, Gakumatsu Ryu, Naoto Tanji, Hiroyuki Seshime, Ling Xiao, Toshihiko Yamasaki
D2-1630_P2.5 249 From Blurry to Brilliant Detection: YOLO-Based Aerial Object Detection with Super Resolution
Ragib Amin Nihal, Benjamin Yen, Takeshi Ashizawa, Katsutoshi Itoyama, Kazuhiro Nakadai
D2-1630_P2.6 252 ATJO: Adaptive three-dimensional joint optimization for remote sensing video super-resolution
Tian Qin, Lijing Bu, Zhengpeng Zhang, Mingjun Deng, Yin Yang, Jingxue Wang, Xinyu Lan, Wenjuan Peng, Yang Hu
D2-1630_P2.7 268 Block-level Lagrange multiplier adaptation based on distortion propagation factors
Hongwei Guo, Yipeng Liu, Lei Luo, Chengbiao Fu, Ce Zhu
D2-1630_P2.8 317 Distributed Compressed Video Sensing with Enhanced Boundary Handling Based on Extended Convolutional Sparse Representation
Ibuki Muta, Yoshimitsu Kuroki
D2-1630_P2.9 354 Joint Modeling of Big Five and HEXACO for Multimodal Apparent Personality-trait Recognition
Ryo Masumura, Shota Orihashi, Mana Ihori, Tomohiro Tanaka, Naoki Makishima, Taiga Yamane, Naotaka Kawata, Satoshi Suzuki, Taichi Katayama
D2-1630_P2.10 359 Foreground-Background Segmentation Based Surveillance Video Coding
Jiyong Yu, Luheng Jia, Yifan Zang, Zhaoyang Yu, Shuyuan Zhu, Li Song, Kebin Jia
D2-1630_P2.11 436 Rain Removal via VAE-Enhanced Transformer with Hierarchical Feature Integration
Yaya Huang, Litong Liu, KokSheik Wong |
19:00-21:30 |
D2-1900_IB Banquet
Location: Island Ballroom |