Saturday, July 06, 2002, 01:54 GMT
Tom wrote about using Google to search English sentences.
That is a good idea, but, as Tom wrote, it is sometimes tricky to use Google because there are writings written by non-native English speakers. If you can make your own language database, this problem can be avoided.
First, you need a good text editor having "grep" capability. With "grep", you can search words in *many* text files.
Next you go to the following site where you can download movie scripts.
http://www.movie-page.com/movie_scripts.htm
Download as many scripts as possible. You need to download only files with file name extentions txt or htm or html. Note that html files are ascii texts so you can read them with an ascii text editor, although they contain tag symbols and codes which don't have much to do with the texts' contents.
You put them in a folder. That's all. You use the text editor's grep to search words in scripts in the folder.
That is a good idea, but, as Tom wrote, it is sometimes tricky to use Google because there are writings written by non-native English speakers. If you can make your own language database, this problem can be avoided.
First, you need a good text editor having "grep" capability. With "grep", you can search words in *many* text files.
Next you go to the following site where you can download movie scripts.
http://www.movie-page.com/movie_scripts.htm
Download as many scripts as possible. You need to download only files with file name extentions txt or htm or html. Note that html files are ascii texts so you can read them with an ascii text editor, although they contain tag symbols and codes which don't have much to do with the texts' contents.
You put them in a folder. That's all. You use the text editor's grep to search words in scripts in the folder.