AO-CHILDES

Python API for retrieving American-English child-directed speech transcripts, ordered by the age of the target child.

Usage

Processed transcripts, ordered by age of target child

from aochildes.dataset import AOChildesDataSet

transcripts = AOChildesDataSet().load_transcripts()

Filter male vs. female

from aochildes.dataset import AOChildesDataSet

transcripts = AOChildesDataSet(sex='male').load_transcripts()  # excludes many transcripts not annotated with sex

List entities

Retrieve sets of entities, like fictional characters mentioned during child-language interactions (e.g. book reading):

from aochildes.persons import FICTIONAL

print(FICTIONAL)

Parameters

A variety of parameters can be set, to influence much processing should be performed on the raw transcripts. These parameters can be found in params.py and should be edited there, directly. For example, one can set a parameter determining whether or not all utterances with the unicode symbol '�', 'xxx', and 'yyy' are discarded.

Compatibility

Developed on Ubuntu 18.04 and Python 3.7.

Name		Name	Last commit message	Last commit date
Latest commit History 133 Commits
aochildes		aochildes
examples		examples
scripts		scripts
.gitignore		.gitignore
MANIFEST.in		MANIFEST.in
README.md		README.md
setup.py		setup.py
substitutable_nouns.txt		substitutable_nouns.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AO-CHILDES

Usage

Processed transcripts, ordered by age of target child

Filter male vs. female

List entities

Parameters

Compatibility

About

Releases 5

Contributors 2

Languages

UIUCLearningLanguageLab/AOCHILDES

Folders and files

Latest commit

History

Repository files navigation

AO-CHILDES

Usage

Processed transcripts, ordered by age of target child

Filter male vs. female

List entities

Parameters

Compatibility

About

Topics

Resources

Stars

Watchers

Forks

Releases 5

Contributors 2

Languages