URL → scrape → tokenize → visualize top frequent words.
Standard NLTK stopwords aren't enough for Wikipedia. Added domain-specific terms:
Elements removed before text extraction:
table.infoboxSidebar info boxessup.referenceCitation bracketsdiv.navboxNavigation templates#ReferencesReferences section