Skip to main content

How To Find Out How Easy A Certain Japanese Text Would Be For You


Are you trying to up your Japanese reading skills? Have a set of electronic reading materials, but don't know which one to start with? One way to rate them is by how easy they would be for you to read, so you can pick the level you're comfortable with. To do that, you'll first need a text list of words you already know, and then use that as an input file to the Japanese Text Frequency Analyzer tool.

An easy way to get the list is to use the Morphman plugin in Anki (I'm assuming you're using Anki. Find out how to export the list if you use another SRS/flashcard system).


What you'll need:

Steps:
1. Ensure your database of known words is up to date by forcing a Morphman Recalc. You can do this by using the Anki tools menu or the default shortcut Ctrl+M.

If there are words you already know that are not in Anki, see this section of MorphMan Wiki. To make things easier for myself, I just imported a list of vocab (eg, JLPT N5 vocab) into Anki and marked them all as known + suspend them.

2. Next, open MorphMan Manager. Click on "Browse for DB A". By default it should show you the paths where the known.db, mature.db files etc are stored. 
Load known.db as DB A, and click on "A" to show the known morphemes (if the jargon is confusing, just think of morphemes as words).

3. Click on the pane, Ctrl+A to select all the text, and then paste it into a text file. Voila, you now have a text file containing a list of the words you know.

4. Open Japanese Text Analysis Tool. Put in the input file/directory, as well as where you want the output files to be. Enable user-based readability and use the text file created from step 3.
Basically your tool settings will look like something below. Then just click on Analyze!
Note: do not use Japanese characters in your file paths, it crashes the tool.


5. Then just open the output directory and check the reports. For more info on how to read the reports, check out the excellent tool documentation.



  

Comments

Popular posts from this blog

Use a game controller for web browsing and more

I'm not a serious gamer, and little did I know that one day I would actually use a wireless gaming controller actively. Most of the time it's not for playing games though - I'm using it as a partial replacement of the keyboard and mouse, by mapping custom commands to the buttons. Shortcuts without even touching the keyboard? Bringing keyboard ninja skills to the next level ;) The wireless controller I use - Logitech F710 It all began when I started experiencing some pain in my wrist from too much mouse use. A friend loaned me a Logitech F710 wireless game controller*, and to my delight, I found that Logitech provides software to map the buttons to keystrokes and the analog sticks to the mouse. Since I use a lot a lot of keyboard shortcuts in my everyday computing, it was a great solution. Now I usually have my hands comfortably on the controller, only leaving it occasionally to type or when I need precision mousing. I even mapped Aero Flip so I could switch between d...

Anki flow to pre-study for a Japanese game/book

I've been trying to self-study Japanese recently by immersing myself in a Japanese game/book. There are various study strategies out there, mine is learning vocabulary strategically before playing/reading. By strategic, I mean I only learn a handful of vocab that is frequently used - any other vocab I'll attempt to pick up when I see them. This should enable me to understand a significant chunk of the game/book with minimal time spent studying. There are tons of forum threads on doing this, but I couldn't find a comprehensive and easy-to-follow tutorial while I struggled with the tools, so I decided to write my own flow down for future reference. Bonus points if this helps anyone else reading this ;) What you'll need: Anki Morphman Anki Plugin cb's Japanese Text Analysis Tool AntConc  Epwing2Anki Brief steps: Find the text scripts of the game. Usually <game> 台詞集 / セリフ集 will give you results. Save the game scripts as txt. Recommended to encode in...