https://wortschatz.uni-leipzig.de/en/download/spanish:
spa_news_2011_10K
spa_newscrawl_2015_10K
spa_newscrawl-public_2019_10K
spa_web_2016_10K
spa_wikipedia_2016_10K
spa-ar_web-public_2019_10K
spa-co_web_2015_10K
spa-mx_web_2015_10K
spa-pe_web_2016_10K
spa-ve_web_2016_10K
---
total: 100K
---
Spanish:
Spanish 100k - 10 files
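
In case someone wants to rebuild the 100K file from the ten downloads, here is a minimal sketch in Python. It is an assumption on my part, not necessarily how the file above was built: it assumes each archive has been unpacked and contains a `*-sentences.txt` file with one `<id><TAB><sentence>` entry per line, and the function name `merge_sentences` and the output filename are only illustrative.

```python
from pathlib import Path


def merge_sentences(corpus_dir: Path, out_file: Path) -> int:
    """Concatenate the sentence text of every *-sentences.txt under corpus_dir.

    Returns the number of sentences written. Assumes each input line has the
    form "<id>\t<sentence>"; lines without a tab are skipped.
    """
    count = 0
    with out_file.open("w", encoding="utf-8") as out:
        # Sorted so that the merged file is the same on every run.
        for path in sorted(corpus_dir.rglob("*-sentences.txt")):
            with path.open(encoding="utf-8") as src:
                for line in src:
                    # Drop the leading id column and keep only the sentence.
                    _, _, sentence = line.rstrip("\n").partition("\t")
                    if sentence:
                        out.write(sentence + "\n")
                        count += 1
    return count


if __name__ == "__main__":
    # Hypothetical paths: unpacked archives in ./leipzig, merged corpus as output.
    written = merge_sentences(Path("leipzig"), Path("spanish2020.txt"))
    print(f"wrote {written} sentences")
```

With ten 10K corpora the count should come out at roughly 100K, matching the total above.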
---
Conversion tool for diacritics (ñ|á|é|í|ó|ú|Ñ|Á|É|Í|Ó|Ú):
https://drive.google.com/drive/folders/ ... sp=sharing
---
Conversion to all lowercase characters:
Spanish with diacritics: ---
Spanish with converted diacritics (ñ|á|é|í|ó|ú|Ñ|Á|É|Í|Ó|Ú): ---
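
For anyone who cannot use the conversion tool, the two variants can be approximated with a few lines of Python. This is only a sketch under assumptions: the mapping of each diacritic to its unaccented base letter is hypothetical (the tool linked above may convert them differently), and the function name `convert` is illustrative.

```python
import unicodedata

# Hypothetical mapping (assumption): every listed character is replaced by
# its unaccented base letter. The actual conversion tool may map differently.
DIACRITIC_MAP = str.maketrans("ñáéíóúÑÁÉÍÓÚ", "naeiouNAEIOU")


def convert(text: str, lowercase: bool = True, convert_diacritics: bool = False) -> str:
    """Return text lowercased and/or with the listed diacritics replaced."""
    # Compose characters first so that "n" + combining tilde becomes a single "ñ".
    text = unicodedata.normalize("NFC", text)
    if lowercase:
        text = text.lower()
    if convert_diacritics:
        text = text.translate(DIACRITIC_MAP)
    return text


if __name__ == "__main__":
    sample = "El Niño Está AQUÍ"
    print(convert(sample))                           # el niño está aquí
    print(convert(sample, convert_diacritics=True))  # el nino esta aqui
```

The first call corresponds to the "with diacritics" variant, the second to the "converted diacritics" variant, within the limits of the assumed mapping.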
I uploaded the configuration files I used for the optimization so that it can be reproduced later:
Link: viewtopic.php?f=12&t=20
Code:
./opt -2 spanish2020.txt -i 20000 -K optS1V1.cfg
Code:
./opt -2 spanish2020.txt -i 20000 -K optS2V1.cfg
Code:
echo SPANISH:;./opt -2 spanish2020.txt -r bsptast.txt -K controlS1V1.cfg;