Program

 

Local Time (GMT+7)

Session

Details

DAY 1: November 24, 2022

8:30-9:00

 

Opening Ceremony

9:00-10:00

Keynote 1

Seeing to Hear Better

Professor Haizhou Li

Chinese University of Hong Kong and National University of Singapore

10:00-10:15

 

Break

10:15-11:45

Session 1:

Speech Recognition & Speech Synthesis

#603. Development of a High Quality Text to Speech System for Lao

#4193. NICT-Tib1: A Public Speech Corpus of Lhasa Dialect for Benchmarking Tibetan Language Speech Recognition Systems

#7252. The Speech Labeling and Modeling Toolkit (SLMTK) Version 1.0

#8531. Toward Automatic Generation of Transcript from Spoken Lectures: The “Dream of The Red Chamber” Series

#8910. End-to-End Named Entity Recognition for Vietnamese Speech

#9634. MNASR: a Free Speech Corpus for Mongolian Speech Recognition and Accompanied Baselines

11:45-14:00

 

Lunch Break

14:00-15:30

Session 2:

Speech Prosody

 

#896. Patterns of Vowel Production in The Speakers of Sanskrit Language

#1937. The Interaction Pattern of Focal Accent and Declarative Intonation in Mongolian

#2910. Neural Network Models for User Attribute Extraction from Dialogues

#4525. Multilingual Analysis of Intelligibility Classification using English, Korean, and Tamil Dysarthric Speech Datasets

#6681. Nasality in Zhangzhou: Distribution and Constraint

#8904. A Corpus-Based Analysis of Age-Related Changes in the Acoustic Features of Elderly to Super Elderly Speech

15:30-15:45

 

Break

15:45-17:15

Session 3:

Poster

 

#542. Towards the Development of Accent Conversion Model For (L1) Bengali Speaker Using Cycle Consistent Adversarial Network (Cyclegan)

#2053. Speaking-Rate Effect on Prosodic Grammar of Mandarin Read Speech

#2284. An Automated Speech Recognition System for Phonological Awareness of Kindergarten Students in Filipino

#2580. Experimentation of Various Preprocessing Pipelines for Sentiment Analysis on Twitter Data About New Indonesia’s Capital City using SVM and CNN

#2928. Harvard-NGSL Sentences for English Learner Speech Corpora

#3619. Acoustical Analysis of Speech of ASD Children and Typically Developing Children

#3711. Designing a Speaking Assessment Task using EI to Build a Korean Learner Corpus

#3844. Analysis on the Interference of Chinese /n/-/l/ Confusion to Japanese /n/-/r/ Discrimination by Chinese JFL Learners

#4157. Creakiness Judgments by Burmese and Vietnamese Speakers

#4258. Text to Speech System for Lambani - a Zero Resource, Tribal Language of India

#4493. Production and Perception of Intonation Features by Cantonese EFL Learners

#5944. Analysis of Layer-wise Training in Direct Speech to Speech Translation using Bi-LSTM

#8967. The Influence of Working Memory on Intonation Production of Chinese EFL Learners

DAY 2: November 25, 2022

8:30-9:45

Session 4:

Multimodal Databases

 

#2975. UIT-VLFC: Vietnamese Lipstick Feedbacks Corpus

#4801. ESAA: an EEG-Speech Auditory Attention Detection Database

#9020. A Speech Corpus for Chronic Kidney Disease

#9232. Building a Speech Corpus of Children with Cochlear Implants via an Enhanced Metadata Structure

#9830. Taiwanese Across Taiwan Corpus and Its Applications

9:45-10:00

 

Break

10:00-11:00

Keynote 2

Seeing to Hear Better

Professor Sakriani Sakti

Japan Advanced Institute of Science and Technology

11:00-11:15

 

Break

11:15-12:15

 

O-COCOSDA Steering Committee Meeting (Committee members only)

12:15-14:00

 

Lunch Break

14:00-15:30

Session 5:

Language Learning

 

#622. The Effect of Acoustic Features on Chinese EFL Learners' Perception of English Accentual Prominence

#1348. Designing a Korean French-Learners' Speech Corpus (KFLSC) for Spoken Language Assessment

#1966. Syntactic Complexity in Narrative Speech Produced by Prelingually Deaf Mandarin-Speaking Children with Cochlear Implants

#2256. voisTUTOR 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners

#4834. Spanish Stops and Their Allophones Produced by Proficient Mandarin Learners of Spanish

#5279. NASAM 2.0: Cleft-Palate Speech Assessment Application

15:30-15:45

 

Break

15:45-17:15

Country/Region Reports and Discussion

 

China, Aijun Li and Dong Wang

Hong Kong, Tan Lee

India, S.S Agrawal

Indonesia, Hammam Riza

Japan, Satoshi Nakamura

Korea, Yong-Ju Lee

Malaysia, Zuraidah Mohd Don

Myanmar, Win Pa Pa

Philippine, Nathaniel Oco

Singapore,

Taiwan, Hsin-Min Wang, Yuan-Fu Liao

Thailand,

Vietnam, Luong Chi Mai

DAY 3: November 26, 2022

8:30-9:30

Session 6:

Dialects and Accents

 

 

#490. Effects of the Syntactic Structure on the Productivity of Tone Sandhi Rules: In the Case of Xiamen Dialect

#1234. Improving Vietnamese Accent Recognition using ASR transfer learning

#4539. Transliteration of Foreign Words In Burmese: Descriptions by a Mortise-and-Tenon Notation

#9464. Korean Dialect Identification Based on an Ensemble of Prosodic and Segmental Feature Learning for Forensic Speaker Profiling

9:30-9:35

 

Break

9:35-11:20

 

Speaker Verification Challenge

11:20-11:25

 

Break

11:25-11:55

 

O-COCOSDA Closing Ceremony

 

VLSP workshop

8:30-12:00

Speech Processing

Text to Speech Challenge

Multilingual Speaker Verification Challenge

Automatic Speech Recognition Challenge

13:30-18:00

Text Processing

Constituency Parsing Challenge

Machine Translation Challenge

Multilingual Visual Question Answering Challenge

Vietnamese Abstractive Multi-document Summarization Challenge