Chroma Corpus

| | Aneta | Anna | Jan | Julie | Klara | Sara | Viktor | Total |
|---|---|---|---|---|---|---|---|---|
| Initial age (Y;MM.DD) | 2;02.08 | 1;09.30 | 1;07.05 | 1;07.05 | 2;04.22 | 1;07.06 | 2;06.23 | --- |
| Final age (Y;MM.DD) | 3;03.18 | 2;07.27 | 2;09.27 | 3;09.11 | 3;04.24 | 3;09.08 | 3;09.15 | --- |
| Initial MLU* | 1.52 | 1.47 | 1.09 | 1.11 | 1.94 | 1.01 | 3.29 | --- |
| Final MLU | 2.24 | 2.94 | 2.66 | 2.74 | 3.19 | 3.36 | 5.33 | --- |
| Number of recorded months | 14 | 11 | 15 | 27 | 13 | 27 | 16 | --- |
| Total recorded time (h:mm:ss) | 11:25:16 | 6:29:54 | 7:11:13 | 15:09:16 | 4:32:12 | 12:26:26 | 14:22:44 | 71:37:01 |
| Average number of minutes per month | 49 | 35 | 29 | 34 | 21 | 28 | 54 | 36 |
| UTTERANCES | | | | | | | | |
| CHILDREN number of all utterances | 5286 | 3127 | 4483 | 9345 | 3627 | 7750 | 8485 | 42103 |
| CHILDREN number of fully intelligible** utterances | 4747 | 2459 | 3886 | 7951 | 3079 | 7632 | 8124 | 37878 |
| CHILDREN proportion of fully intelligible utterances | 89.8 % | 78.6 % | 86.7 % | 85.1 % | 84.9 % | 98.5 % | 95.7 % | 90.0 % |
| ADULTS number of all utterances | 7671 | 5367 | 4317 | 18769 | 3197 | 12686 | 9245 | 61252 |
| ADULTS number of fully intelligible utterances | 7533 | 5115 | 4216 | 18517 | 3128 | 12674 | 9195 | 60378 |
| ADULTS proportion of fully intelligible utterances | 98.2 % | 95.3 % | 97.7 % | 98.7 % | 97.8 % | 99.9 % | 99.5 % | 98.6 % |
| TOKENS | | | | | | | | |
| CHILDREN number of all tokens | 11803 | 6166 | 6626 | 18436 | 8937 | 17646 | 29774 | 99388 |
| CHILDREN number of tokens from fully intelligible utterances | 11331 | 5567 | 6279 | 17388 | 7955 | 17454 | 28924 | 94898 |
| ADULTS number of all tokens | 31812 | 21636 | 15057 | 71059 | 10499 | 48853 | 39295 | 238211 |
| ADULTS number of tokens from fully intelligible utterances | 31699 | 20846 | 14852 | 70429 | 10355 | 48829 | 39209 | 236219 |
| Recordings available?*** | yes | yes | no | yes | yes | yes | yes | --- |
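
The derived duration rows can be re-checked from the per-child values. The following is a minimal Python sketch, not part of the corpus tooling; the variable and function names are illustrative assumptions. It sums the recorded times in whole seconds, so a total longer than 24 hours is not wrapped around as a time of day, and it recomputes the average number of minutes per month.

```python
# Per-child values copied from the table (recorded time as h:mm:ss).
recorded_time = {
    "Aneta": "11:25:16", "Anna": "6:29:54", "Jan": "7:11:13",
    "Julie": "15:09:16", "Klara": "4:32:12", "Sara": "12:26:26",
    "Viktor": "14:22:44",
}
recorded_months = {
    "Aneta": 14, "Anna": 11, "Jan": 15, "Julie": 27,
    "Klara": 13, "Sara": 27, "Viktor": 16,
}


def to_seconds(hms: str) -> int:
    """Convert an h:mm:ss string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def to_hms(total: int) -> str:
    """Format seconds as h:mm:ss without wrapping hours at 24."""
    hours, rest = divmod(total, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{hours}:{minutes:02d}:{seconds:02d}"


total_seconds = sum(to_seconds(t) for t in recorded_time.values())
total_time = to_hms(total_seconds)

# Average minutes per recorded month, rounded to whole minutes per child.
minutes_per_month = {
    child: round(to_seconds(recorded_time[child]) / 60 / recorded_months[child])
    for child in recorded_time
}

# Two possible readings of the value in the total column:
mean_of_child_averages = round(sum(minutes_per_month.values()) / len(minutes_per_month))
pooled_average = round(total_seconds / 60 / sum(recorded_months.values()))

print(total_time, minutes_per_month, mean_of_child_averages, pooled_average)
```

On these numbers the per-child averages reproduce the table, and the 36 in the total column matches the mean of the seven per-child averages; pooling all recorded minutes over all recorded months gives 35 instead.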

* MLU = mean length of utterance (in words)

** Utterances containing any unintelligible parts (coded as xxx) are excluded.

*** The recordings are not anonymized and are not intended for publication. However, it is possible to analyze them further (while maintaining the confidentiality and anonymity of the published results). If you are interested in analyzing the recordings, please contact us.

The token counts, utterance counts, and MLU values were calculated with the CLAN program on version 2023.07 of the Chroma corpus.
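
To illustrate the two definitions in the footnotes, here is a minimal Python sketch of MLU in words over fully intelligible utterances. It is not the CLAN computation, which works on CHAT transcripts with its own tokenization rules. The sketch assumes plain whitespace tokenization and assumes that MLU is taken over fully intelligible utterances only, which the notes above do not state explicitly. The function names and the sample utterances are made up for the example.

```python
from typing import Iterable


def is_fully_intelligible(utterance: str) -> bool:
    """An utterance counts as fully intelligible if no token is coded as xxx."""
    return "xxx" not in utterance.split()


def mlu_in_words(utterances: Iterable[str]) -> float:
    """Mean number of whitespace-separated words per fully intelligible utterance."""
    kept = [u.split() for u in utterances if is_fully_intelligible(u)]
    if not kept:
        return 0.0
    return sum(len(words) for words in kept) / len(kept)


# Hypothetical utterances, not taken from the corpus.
sample = ["to je auto", "xxx auto", "máma", "já chci xxx"]

fully_intelligible = [u for u in sample if is_fully_intelligible(u)]
proportion = len(fully_intelligible) / len(sample)

print(mlu_in_words(sample), proportion)
```

In this toy sample two of the four utterances contain xxx and are dropped, so the remaining utterances of three words and one word give an MLU of 2.0 and a proportion of fully intelligible utterances of 50 %.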