POS Taggers

Does anyone have an opinion on the relative merits of the various part-of-speech taggers? I’ve used (and had decent luck with) Lingpipe, which seems pretty quick and very accurate in my limited tests. I also just read a post by Matthew Jockers about the Stanford Log-linear Part-Of-Speech Tagger (which is what got me thinking about this; I admit I was largely sucked in by the discussion of Xgrid, which I’d really like to try). And I thought the Cornell NLP folks had one, too, though I now can’t find any reference to it, so I may well be wrong. Plus there’s MONK/Northwestern’s MorphAdorner (code not yet generally available, though I don’t think it would be a problem to get it), and any number of commercial options (less attractive, for many reasons).

I surely just need to test a bunch of them is some semi-systematic way, but is there any existing consensus about what works best for literary material?

2 thoughts on “POS Taggers

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s