,
Jorge Baptista
Creative Commons Attribution 4.0 International license
This study examines the intersection of sociodemographic characteristics, linguistic features, and writing placement outcomes at a community college in the United States of America. It focuses on 210 anonymized writing samples from native English speakers (L1) that were automatically classified by Accuplacer and independently assessed by two trained raters. Disparities across gender and race using 40 top-ranked linguistic features selected from Coh-Metrix, CTAP, and Developmental Education-Specific (DES) sets were analyzed. Three statistical tests were used: one-way ANOVA, Tukey’s HSD, and Chi-square. ANOVA results showed racial differences in nine linguistic features, especially those tied to syntactic complexity, discourse markers, and lexical precision. Gender differences were more limited, with only one feature reaching significance (Positive Connectives, p = 0.007). Tukey’s HSD pairwise tests showed no significant gender group variation but revealed sensitivity in DES features when comparing racial groups. Chi-square analysis indicated no significant association between gender and placement outcomes but suggested a possible link between race and human-assigned levels (χ² = 9.588, p = 0.048). These findings suggest that while automated systems assess general writing skills, human-devised linguistic features and demographic insights can support more equitable placement practices for all students entering college-level programs.
@InProceedings{dacorte_et_al:OASIcs.SLATE.2025.6,
author = {Da Corte, Miguel and Baptista, Jorge},
title = {{Beyond the Score: Exploring the Intersection Between Sociodemographics and Linguistic Features in English (L1) Writing Placement}},
booktitle = {14th Symposium on Languages, Applications and Technologies (SLATE 2025)},
pages = {6:1--6:18},
series = {Open Access Series in Informatics (OASIcs)},
ISBN = {978-3-95977-387-4},
ISSN = {2190-6807},
year = {2025},
volume = {135},
editor = {Baptista, Jorge and Barateiro, Jos\'{e}},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2025.6},
URN = {urn:nbn:de:0030-drops-236861},
doi = {10.4230/OASIcs.SLATE.2025.6},
annote = {Keywords: Developmental Education (DevEd), sociolinguistic variation, text classification, Machine Learning, placement equity}
}