Collects kanji usage frequency data from the Twitter Streaming API.
See POST statuses/filter for details.
    git clone https://github.com/THIS/REPO.git
    npm install
    cp tokens.json.dist tokens.json

- Fill in your tokens in tokens.json (create a new app)

    node collect-data.js

- a new stream is started right away, but data is not being saved automatically!
- Enter help to see a list of available commands
- Enter save to store data in a file (kanji.json in root directory)

{ state: 'RUNNING',
percentFull: 0,
uniqueKanji: 3666,
totalKanji: 2300350 }

- state - if 'PAUSED', data is not being collected
- percentFull - how full the stream queue is; see the description of the stall_warnings parameter
- uniqueKanji - how many distinct kanji have been seen so far
- totalKanji - how many kanji have been seen in total, including repetitions
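
To make the difference between uniqueKanji and totalKanji concrete, here is a minimal sketch of how such counters could be derived from incoming tweet texts. It is not taken from collect-data.js: the helper name countKanji and the use of the Unicode Han script property to detect kanji are assumptions made for illustration, and the actual script may detect kanji differently (for example, with explicit code point ranges).

```javascript
// Hypothetical helper, not part of this repository.
// Assumption: a "kanji" is any character in the Unicode Han script.
// Unicode property escapes require the `u` flag (Node.js 10 or later).
const HAN = /\p{Script=Han}/gu;

// Tallies every Han character found in `texts` into `counts`
// (a Map of character -> occurrences) and returns summary statistics.
function countKanji(texts, counts = new Map()) {
  for (const text of texts) {
    // match() returns null when nothing matches, hence the fallback.
    for (const ch of text.match(HAN) || []) {
      counts.set(ch, (counts.get(ch) || 0) + 1);
    }
  }
  let totalKanji = 0;
  for (const n of counts.values()) totalKanji += n;
  return { uniqueKanji: counts.size, totalKanji, counts };
}

const stats = countKanji(['日本語を勉強する', '日本の漢字']);
console.log(stats.uniqueKanji, stats.totalKanji); // prints "7 9"
```

Hiragana such as を and の are skipped, and 日 and 本 each count once toward uniqueKanji but twice toward totalKanji. Passing the same Map back in on later calls keeps the tally running across batches of tweets.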