Does Basque have an impossible phonological opposition?

Ander Egurtzegi1, Andrea García-Covelo2 & Iñigo Urrestarazu-Porta3

CNRS12 – IKER UMR5478123, IPS-LMU2, UPV/EHU3, UPPA23

01-02-2024

Glottal fricatives

How about a nasalized glottal approximant //?

Typology

/h/ in the world’s languages

// in the world’s languages

// in the world’s languages

/h/ vs. //: An extremely rare opposition

If // is so rare, then languages that include both /h/ and //…

  • Kwangali (kwan1273)

  • ThiMbukushu (mbuk1240)

  • Seimat (seim1238)



  • North-Eastern Basque (basq1248)

Why so rare?

Acoustic nasalization and aspiration are perceptually difficult to distinguish.

  • One resembles the other
  • The ambiguity between them can produce sound changes (rhinoglottophilia)

The oral and nasal segments are especially difficult to distinguish in aspirates due to their low amplitude and short duration.

Nasality spreads over to adjacent vowels

Blevins, Juliette & Egurtzegi, Ander. 2023. Refining explanations in Evolutionary Phonology. Linguistic Typology 27. doi: 10.1515/lingty-2021-0036



Nasal spread observed in:

  • Languages with //: Madi (jama1261), Yine (yine1238), Piro (mash1270)

  • Languages with the /h/ vs. // opposition: Basque, Seimat (seim1238) (historical evidence only)



Nasality must spread over a suprasegmental domain for the opposition to be maintained

The Basque case

Zuberoan Basque

Laryngeal approximants

  • Oral /h/, potentially present in Proto-Basque (hori ‘that’)

  • Nasalized //, from historical intervocalic n (Lat. (h)onore* > oh̃ore ‘honor’)

Geographic distribution:

  • /h/ -> Only in continental varieties

  • // -> Only in Zuberoan & Mixean Basque

Restrictions:

  • Onset only

  • First 2 syllables only

  • Only 1 aspirated segment in each word

Some examples of /h/

Example Transcription Gloss
hori /hoɾi/ ‘that (one)’
harri /hari/ ‘stone’
aho /aho/ ‘mouth’
ehun /ehun/ ‘hundred’
elhe /elhe/ ‘word’
senhar /s̺enhar/ ‘husband’

Some examples of //

Example Transcription Gloss
ahai /ah̃ai̯/ ‘ram’
ahuntz /ah̃unts̻/ ‘goat’
ihes /ih̃es̺/ ‘run away’
ahate /ah̃ate/ ‘duck’
dihaü /dih̃ay/ ‘money’
uhue /uh̃ue/ ‘honor’

Let’s listen to them

Oral

aihai ‘dinner’

Nasalized

ihitza ‘dew’

Diachrony

Historical origin

*n > // /V_V

mih̃i ‘tongue’ ~ min-gain ‘upper part of the tongue’

*ardah̃o ‘wine’ ~ ardan-degi ‘winery’

*gaztah̃a ‘cheese’ ~ gaztan-bera ‘curd’



But mihi, ardo & gazta in Standard Basque

Chronology

500-800 CE
Lenition of intervocalic /n/

/VnV/ > /Vh̃V/

1000-1300 CE
Loss of laryngeals
in post-tonic syllables
in Central-Eastern varieties

/hV.’hV.hV/ > /hV.hV.V/

Chronology

1600-1900 CE
Loss of nasality in aspirates
in most Basque varieties

/Vh̃V/ > /VhV/

Today
Nasalization of aspirates
preserved in Eastern Basque

ehi ‘finger’ vs. eh̃i ‘easy’

Romance languages

*n > ∅ /V_V in Romance


Galician-Portuguese


Latin Gal-Port. Gloss
plānum chão ‘flat’
tenebrās tẽevras ‘darkness’
lūnam lũa ‘moon’
regīnam raĩa ‘queen’
gallīnam galĩa ‘hen’

*n > ∅ /V_V in Romance


Corsican


Chisa Corsican Galeria Corsican Gloss
[ˈpanɛ] [ˈpãe] ‘bread’
[ˈpona] [ˈpe] ‘to put’
[ˈvĩnu] [ˈbĩu] ‘wine’
[ˈɔ̃ne] [ɔ̃] (suffix)

*n > ∅ /V_V in Romance


Medieval Gascon


Latin Gascon Gloss
Dominicus Domeeg (personal name)
gallīnas garias ‘hen (pl.)’
abellanētum aueraed ‘hazelnut grove’
camināre camiar ‘to walk’
Castanētum Castahied (place name)

*n > ∅ /V_V in Romance


Sardinian


Sard. Var. A Sard. Var. B Sard. Var. C
[ˈpanɛ] [ˈpãi] -
[ˈkɛna] [ˈtʃɛ̃a] -
[ˈaʒina] [ˈaʒĩa] [ˈaʒĩʔa]
[ˈluna] [ˈlũa] [ˈlũʔa]

Lack of aspiration in Romance?

Scarce evidence for an aspirate outcome in Romance.

  • [h] in Medieval Gascon?

  • [ʔ] in Sarrabus/Isili Sardinian



Why did // develop in Basque? / Why was it maintained?

  • /h/ was likely part of the language before // developed.

  • [] was likely an allophonic variant of /h/.

Phonology

Establishing the // vs. /h/ opposition


Minimal pair/triplet:


ehi /ˈehi/ ‘finger’

vs.


ehi /ˈeh̃i/ ‘easy’

vs.


ei /ˈe.i/ ~ /ei̯/ ‘ill’

/Vh̃V/ or /ṼhṼ/?

  • Zuberoan only shows contrastive nasalization in word-final stressed vowels.

  • Mixean does not show contrastive nasalization in vowels.

Nasalization spread / assimilation

All vowels surrounding nasal consonants are contextually nasalized in Basque:


ona [ˈõnã] ‘the good’

ama [ˈãmã] ‘mother’



uhue [ũˈh̃ũẽ] ‘honor’

ahate [ãˈh̃ãte] ‘duck’

Nasalization spread / assimilation

Example Transcription Gloss
janhari [jãnˈɦ̃ãi̯] ‘food’
sinhets [s̺ĩɲˈɦ̃ẽts̺] ‘to believe’
nahi [ˈnãɦ̃ĩ] ‘to want’
mehe [ˈmẽɦ̃ẽ] ‘thin’
senhar [ẽɦ̃ũn] ‘hundred’
lehen [lẽɦ̃ẽn] ‘first’

Background & Research question

Previous phonetic research




No phonetic research on the opposition in Basque…


…or in any other language

Research is pressing

Basque is an endangered language (vulnerable according to UNESCO)

(Almost?) All Basque varieties other than the standard

Particularly the Basque varieties spoken in the Northern Basque Country in France

Research questions



  1. Is the opposition between /h/ and // still present North-Eastern Basque?


Most likely varieties:

  • Mixean Basque

  • Zuberoan Basque


  1. Is it realized as a suprasegmental feature spreading over the /Vh̃V/ sequence?

1st Study

Amiküze (Mixe)

Materials

Data

behi ‘cow’

ihizin ‘hunting’

Problem: Measuring acoustic nasality

Acoustic analysis

20 explicit measurements of vocalic nasality (Styler 2017):

  • F1, F2, F3 frequency
  • F1, F2, F3 bandwidth
  • A1, A2, A3 amplitude
  • P0, P1, P2 amplitude
  • P0, P1 prominence
  • A1-P0, A1-P1, A1-P2
  • H1-H2, A3-P0, overall spectral center of gravity

Measurements taken in Praat and R (using wrassp; Winkelmann et al. 2017)

Additionally: MFCCs 2-12 (OpenSMILE)

All measurements taken at 5ms intervals

PCA transformation & data reduction

For each speaker, the 31-feature set was submitted to a principal components analysis (PCA) model

The number of PCs that cumulatively explained at least 80% of the total variance were retained for each speaker:

  • 11-13 PCs
  • Mean: 12.3, SD: 0.82

The scores for the retained PCs were used as linear predictors in speaker-specific models built to distinguish oral and nasalized vocalic contexts

Oral-nasal model training & prediction

“Unambiguously” oral and nasalized tokens were selected:

  • Nasalized: 10% in NVC, 50% in NVN, and 90% in CVN
  • Oral: 10%, 50%, and 90% in CVC

Average training data per speaker: 258 nasalized tokens, 222 oral tokens

Logistic regression models (with PC score IVs) built for each speaker

Hosmer and Lemeshow goodness of fit tests

LME with random intercepts by speaker

Pair-wise post hoc tests with Tukey alpha correction

Results

2nd Study

Larraine

Experimental design

Stimuli + SpeechRecorder + Nasalance device

Stimuli: wordlist


/h/ // Nh / NVh / hVN
behi ‘cow’ ahate ‘duck’ uɲhu ‘onion’
bihotz ‘heart’ ihaute ‘carnival’ lehen ‘first, before’
ehi ‘finger’ ehi ‘easy’ nihaur ‘me, myself’

SpeechRecorder: prompting

Nasalance device

Wooden plate

Data

Participants

5 participants


4 male : 1 female

60-70 years old

L1 Zuberoan Basque

L2 French at age 5 (school)

Recordings

aihaia ‘dinner’

desuhue ‘dishonor’

Nasalance

\[ Amplitude \]

\[ A_n \]

\[ A_o \]

\[ \frac{A_n}{A_n + A_o} \times 100 \]

Static analysis: Is the opposition between /h/ and // still present in Zuberoan Basque?

Bayesian generalized hierarchical model with brms



Dependent variable: z-scored nasalance

Population-level predictors: aspirate category, trial

Group-level predictors: by-speaker correlated varying intercept & slope adjustments and by-word intercept adjustments


Weakly informative priors


Posterior distributions

Posterior distributions

Contrast distribution of nasalized and oral posteriors

Discussion

Etymologically oral aspirates ≠ Assimilated nasal aspirates ≈ Etymologically nasalized aspirates

There is an opposition between /h/ and //!!!

Discussion

Etymologically oral aspirates ≠ Assimilated nasal aspirates ≈ Etymologically nasalized aspirates

There is an opposition between /h/ and //!!!

BUT…

Not all speakers maintain the opposition to the same extent

Discussion

Etymologically oral aspirates ≠ Assimilated nasal aspirates ≈ Etymologically nasalized aspirates

There is an opposition between /h/ and //!!!

BUT…

Not all speakers maintain the opposition to the same extent

Some lexical items have lost nasalization

Discussion

Etymologically oral aspirates ≠ Assimilated nasal aspirates ≈ Etymologically nasalized aspirates

There is an opposition between /h/ and //!!!

BUT…

Not all speakers maintain the opposition to the same extent

Some lexical items have lost nasalization

There is an opposition, but it is receding

Dynamic analysis: Is nasality realized as a suprasegmental feature spreading over the whole the /Vh̃V/ sequence?

fPCA decomposes the shape of the curve



  • Functional Principal Component Analysis summarizes numerically the main shape of variation in a functional dataset
  • In fPCA, the function of t approximates the mean of t plus the product of each PC score and its PC

  • Each given PC captures more variation than the following one

\(f(t) \approx \mu(t) + s_1 \times PC_1(t) + s_2 \times PC_2(t) + ...\)

fPCA allows to model and reconstruct the approximate shape of the curve




  • We can recreate the approximate shape of a curve by adding the product of its PCs and their respective PC scores to the mean


\(f(t) \approx \mu(t) + s_1 \times PC_1(t) + s_2 \times PC_2(t) + ...\)

In our data, the first PC accounts for most of the variation

The variation accounted for by the first PC is vertical

A Bayesian hierarchical model of the scores of PC1



Dependent variable: PC1 scores

Population-level predictor: aspirate category

Group-level predictors: by-speaker correlated varying intercept and slope adjustments, by-word intercept adjustments


Weakly informative priors

Posterior distributions

Contrast distribution of the posteriors of nasal and oral PC scores

From estimated PC1 values back to our nasalance curves

Discussion



Nasality is realized as a suprasegmental feature that


spreads over the whole [VHV] sequence

Yet, the uncertainty around the estimates is high

It’s not over!

Read the ICPhS paper

Revisit the slides

Selected references

  • Egurtzegi, Ander; García-Covelo, Andrea & Urrestarazu-Porta, Iñigo. 2023. A nasalance-based study of the /h/ vs. // opposition in Zuberoan Basque. In Proc. of the 20th ICPhS, pp. 3427-3431. https://guarant.cz/icphs2023/1047.pdf

  • Gubian, Michele; Harrington, Jonathan; Stevens, Mary; Schiel, Florian & Warren, Paul. 2019. Tracking the New Zealand English NEAR/SQUARE Merger Using Functional Principal Components Analysis. Proc. Interspeech, pp. 296-300, doi: 10.21437/Interspeech.2019-2115

  • Walker, Rachel & Pullum, Geoffrey K. 1999. Possible and Impossible Segments. Language 75(4). 764-780. doi: 10.2307/417733

Phonological literature on Basque nasalized aspirates

  • Hualde, José Ignacio. 1993. Topics in Souletin phonology. In J.I Hualde & J. Ortiz de Urbina (eds.) Generative Studies in Basque Linguistics , pp. 289-327. John Benjamins.

  • Egurtzegi, Ander. 2013. Diferentes tipos de aspiración en vasco (con análisis espectrales del dialecto suletino actual) [Different kinds of aspiration in Basque (with spectral analyses of the modern Zuberoan dialect)]. In Blasco et al. (eds.), Iberia e Sardegna . Firenze: Le Monnier. 151–169.

  • Egurtzegi, Ander. 2018. On the phonemic status of nasalized /h̃/ in Modern Zuberoan Basque. Linguistics 56. 1353-1367. doi: 10.1515/ling-2018-0024

  • Egurtzegi, Ander & Carignan, Christopher. 2020. A typological rarity: the /h/ vs /h̃/ contrast of Mixean Basque. Labphon 17.

  • Egurtzegi, Ander. 2023. /h̃/ hasperen sudurkarituaren inguran [On the nasalized aspiration /h̃/]. International Journal of Basque Linguistics and Philology 57.

Stimuli source

  • Larrasquet, Jean. 1939. Le Basque de la Basse-Soule Orientale . Paris: C. Klincksieck.

Software and R packages

  • Boersma, Paul & Weenink, David. 2022. Praat. Doing phonetics by computer . https://www.praat.org

  • Bürkner, Paul-Christian. 2017. brms: An R Package for Bayesian Multilevel Models Using Stan. Journal of Statistical Software 80(1). 1-28. doi: 10.18637/jss.v080.i01

  • Draxler, Christoph & Jänsch, Klaus. 2004. SpeechRecorder - a Universal Platform Independent Multi-Channel Audio Recording Software. In Proc. of LREC. 559-562. https://www.bas.uni-muenchen.de/Bas/software/speechrecorder/

  • R Core Team. 2022. R: A language and Environment for Statistical Computing . https://www.R-project.org

  • Wickhan, Hadley et al. 2019. Welcome to the tidyverse. Journal of Open Source Software 43(4). doi: 10.21105/joss.01686