3.8 Proceedings Paper

Addressing Code-Switching in French/Algerian Arabic Speech

出版社

ISCA-INT SPEECH COMMUNICATION ASSOC
DOI: 10.21437/Interspeech.2017-1373

关键词

Code-switching; Language Identification; Algerian Arabic; French

资金

  1. French National Agency for Research as part of the SALSA project (Speech and Language technologies for Security Applications) project [ANR-14-CE28-0021]
  2. French Investissements d'Avenir - Labex EFL program [ANR-10LABX-0083]
  3. LPP-CNRS Paris-III Sorbonne Nouvelle University
  4. LIMSI-CNRS Paris-Orsay University
  5. Agence Nationale de la Recherche (ANR) [ANR-14-CE28-0021] Funding Source: Agence Nationale de la Recherche (ANR)

向作者/读者索取更多资源

This study focuses on code-switching (CS) in French/Algerian Arabic bilingual communities and investigates how speech technologies. such as automatic data partitioning, language identification and automatic speech recognition (ASR) can serve to analyze and classify this type of bilingual speech. A preliminary study carried out using a corpus of Maghrebian broadcast data revealed a relatively high presence of CS Algerian Arabic as compared to the neighboring countries Morocco and Tunisia. Therefore this study focuses on code switching produced by bilingual Algerian speakers who can be considered native speakers of both Algerian Arabic and French. A specific corpus of four hours of speech from 8 bilingual French Algerian speakers was collected. This corpus contains read speech and conversational speech in both languages and includes stretches of code-switching. We provide a linguistic description of the code-switching stretches in terms of intra-sentential and inter-sentential switches, the speech duration in each language. We report on some initial studies to locate French, Arabic and the code-switched stretches, using ASR system word posteriors for this pair of languages.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据