Python wrapper for Wikipedia
-
Updated
Nov 18, 2024 - Python
Python wrapper for Wikipedia
Web scraping, data parsing and automation tutorials. Suited for both beginners and intermediate/advanced programmers.
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
Java tool to get wikipedia data
A 🤖 which provides features from Wikipedia like summary, title searches, location API etc.
Graphically display the connections between different Wikipedia articles
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
SpaceX Launches 🚀 and Starlink Satellites 🛰
Collects a multimodal dataset of Wikipedia articles and their images
Just Refs - extract just the references and related topics from any page on the English Wikipedia
Music tagger with GUI that parses wikipedia for information. Can also download album art and lyrics.
This project collects Wikipedia articles from a search term entered by the user and formats the data into a .docx (Word Document) document with images related to each section of the collected article.
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
A tutorial and code samples of web scraping with PHP
Wikipedia Article Summarizer a simple Python project based on NLP techniques
Taxonomic trees (cladograms) from Wikipedia-scraped data.
Wikipedia Entities Lexicon Extractor
Extracts geodata from a wikipedia dump
Linked Data Knowledge Base Population (KBP) framework built on top of Snorkel. The default configuration uses Wikipedia as text corpus and DBpedia as target.
Add a description, image, and links to the wikipedia-scraper topic page so that developers can more easily learn about it.
To associate your repository with the wikipedia-scraper topic, visit your repo's landing page and select "manage topics."