You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TBuscher137 edited this page Mar 11, 2016
·
2 revisions
Open your browser and go to localhost:5984/_utils/ IF THIS DOESN’T OPEN THE DATABASE, SOMETHING WENT WRONG BEFORE THIS POINT
Type cd
Type cd tools/
Run bash ./getcorpora.sh /home/vagrant/
Run bash ./en-sp-align_words.sh /home/vagrant/ /home/vagrant/tools/mgiza_configfile (This will take ~15 minutes)
Type head -n 99 ../corpora/src_trg.dict.A3.final.part000 > ./sample_data.txt (This shortens the processing time during development by reducing the number of records loaded into the database)
Run python ./parse_mgiza.py /home/vagrant/tools/sample_data.txt ./sample.out