CC-100 卡纳达语单语言数据集:来自Web爬网数据的1300万条单语言数据集
This monolingual dataset includes roughly 13 million uncleaned Kannada sentences crawled from numerous websites....NLP,Text Data,Languages Classification
3.51G
398
Darshan