This repository has been archived by the owner on May 6, 2018. It is now read-only.

More wikipedia language support, wiki disambiguation entity extraction, and output wiki docId in plaintext dump #55

Open: wants to merge 2 commits into base: master

gragtah commented Feb 7, 2015

  • Extended Wikipedia language support to the top 24 languages by number of articles
  • Added disambiguation patterns for each of the 24 supported languages
  • Added ExtractWikipediaDisambiguations, which extracts disambiguated terms from Wikipedia disambiguation pages in any of the 24 supported languages
  • Added the Wikipedia docId to the output of DumpWikipediaToPlainText, since it's very useful to have
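
For readers skimming the list above, here is a rough sketch of how per-language disambiguation patterns and a docId-prefixed plaintext line could look. It is only an illustration, not the code in this PR: the class `DisambiguationSketch`, its methods, the three template patterns, and the tab-separated layout are all assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical sketch; names and patterns are assumptions, not this PR's code.
final class DisambiguationSketch {
    // Language code -> pattern that flags a disambiguation template in wikitext.
    private static final Map<String, Pattern> PATTERNS = new HashMap<>();

    static {
        int flags = Pattern.CASE_INSENSITIVE | Pattern.UNICODE_CASE;
        // Only three of the 24 languages are shown; each pattern is illustrative.
        PATTERNS.put("en", Pattern.compile("\\{\\{\\s*disambig\\w*", flags));
        // \u00e4 is "a with umlaut", escaped to avoid source-encoding issues.
        PATTERNS.put("de", Pattern.compile("\\{\\{\\s*Begriffskl\u00e4rung", flags));
        PATTERNS.put("fr", Pattern.compile("\\{\\{\\s*homonymie", flags));
    }

    // True if the wikitext contains the language's disambiguation template.
    // Unsupported language codes simply return false.
    static boolean isDisambiguation(String lang, String wikitext) {
        Pattern p = PATTERNS.get(lang);
        return p != null && p.matcher(wikitext).find();
    }

    // Assumed output layout: docId, title, and text separated by tabs.
    static String formatLine(String docId, String title, String text) {
        return docId + "\t" + title + "\t" + text;
    }

    public static void main(String[] args) {
        String page = "Mercury may refer to:\n{{Disambiguation}}";
        System.out.println(isDisambiguation("en", page));
        System.out.println(formatLine("12", "Mercury", "Mercury may refer to ..."));
    }
}
```

Keeping the patterns in a map keyed by language code means supporting another language is a one-line addition, which is roughly the shape a 24-language table would take.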

gragtah commented Feb 7, 2015

@lintool let me know how this looks, thanks!
