
/ MASAKARI revision 5: the free console x-gram spell-checker.
Checked against corpus 'Gamera' r15 and corpus enwiki-20120403-pages-articles, removed all x-grams with 1&2&3 occurrences:
4andabove_Gamera.tar.1.sorted (comprised of 2,715,302 1-grams) 56,502,419 bytes;
4andabove_Gamera.tar.2.sorted (comprised of 35,116,064 2-grams) 889,537,624 bytes;
4andabove_Gamera.tar.3.sorted (comprised of 100,088,208 3-grams) 2,938,594,566 bytes;
4andaboveenwiki-20120403-pages-articles.1.sorted (comprised of 4,014,713 1-grams) 84,222,463 bytes;
4andaboveenwiki-20120403-pages-articles.2.sorted (comprised of 36,382,919 2-grams) 915,914,243 bytes;
4andaboveenwiki-20120403-pages-articles.3.sorted (comprised of 65,903,363 3-grams) 1,972,732,692 bytes.
Copyleft Sanmayce 2013-Jan-02/