-
HarfoSokhan: Bridging Persian's Formal-Colloquial Divide with 6M Parallel Pairs
Our EACL 2026 Best Resource Paper introduces HarfoSokhan (حرف و سخن), the first large-scale colloquial-to-formal Persian parallel dataset.
Our EACL 2026 Best Resource Paper introduces HarfoSokhan (حرف و سخن), the first large-scale colloquial-to-formal Persian parallel dataset.