Skip to content

Commit d5d948d

Browse files
committed
Merge remote-tracking branch 'vinta/master'
2 parents d56c87f + 692e9bf commit d5d948d

File tree

4 files changed

+13
-11
lines changed

4 files changed

+13
-11
lines changed

Makefile

Lines changed: 5 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,12 @@
1-
BASEDIR=$(CURDIR)
2-
DOCDIR=$(BASEDIR)/docs
3-
4-
install:
1+
site_install:
52
pip install mkdocs==0.16.3
63
pip install mkdocs-material==1.12.2
74

8-
link:
9-
ln -sf $(BASEDIR)/README.md $(DOCDIR)/index.md
5+
site_link:
6+
ln -sf $(CURDIR)/README.md $(CURDIR)/docs/index.md
107

11-
preview: link
8+
site_preview: site_link
129
mkdocs serve
1310

14-
deploy: link
11+
site_deploy: site_link
1512
mkdocs gh-deploy --clean

README.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -85,7 +85,7 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
8585
- [URL Manipulation](#url-manipulation)
8686
- [Video](#video)
8787
- [Web Content Extracting](#web-content-extracting)
88-
- [Web Crawling](#web-crawling)
88+
- [Web Crawling & Web Scraping](#web-crawling--web-scraping)
8989
- [Web Frameworks](#web-frameworks)
9090
- [WebSocket](#websocket)
9191
- [WSGI Servers](#wsgi-servers)
@@ -342,6 +342,7 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
342342
* [Open Mining](https://github.com/mining/mining) - Business Intelligence (BI) in Pandas interface.
343343
* [Orange](https://orange.biolab.si/) - Data mining, data visualization, analysis and machine learning through visual programming or scripts.
344344
* [Pandas](http://pandas.pydata.org/) - A library providing high-performance, easy-to-use data structures and data analysis tools.
345+
* [Optimus](https://github.com/ironmussa/Optimus) - Cleansing, pre-processing, feature engineering, exploratory data analysis and easy Machine Learning with a PySpark backend.
345346

346347
## Data Validation
347348

@@ -729,6 +730,7 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
729730

730731
* [bpython](https://github.com/bpython/bpython) - A fancy interface to the Python interpreter.
731732
* [Jupyter Notebook (IPython)](https://jupyter.org) - A rich toolkit to help you make the most out of using Python interactively.
733+
* [awesome-jupyter](https://github.com/markusschanta/awesome-jupyter)
732734
* [ptpython](https://github.com/jonathanslenders/ptpython) - Advanced Python REPL built on top of the [python-prompt-toolkit](https://github.com/jonathanslenders/python-prompt-toolkit).
733735

734736
## Internationalization
@@ -815,6 +817,7 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
815817
* [SnowNLP](https://github.com/isnowfy/snownlp) - A library for processing Chinese text.
816818
* [spaCy](https://spacy.io/) - A library for industrial-strength natural language processing in Python and Cython.
817819
* [TextBlob](https://github.com/sloria/TextBlob) - Providing a consistent API for diving into common NLP tasks.
820+
* [PyTorch-NLP](https://github.com/PetrochukM/PyTorch-NLP) - A toolkit enabling rapid deep learning NLP prototyping for research.
818821

819822
## Network Virtualization
820823

@@ -1200,9 +1203,9 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
12001203
* [textract](https://github.com/deanmalmgren/textract) - Extract text from any document, Word, PowerPoint, PDFs, etc.
12011204
* [toapi](https://github.com/gaojiuli/toapi) - Every web site provides APIs.
12021205

1203-
## Web Crawling
1206+
## Web Crawling & Web Scraping
12041207

1205-
*Libraries for scraping websites.*
1208+
*Libraries to automate data extraction from websites.*
12061209

12071210
* [cola](https://github.com/chineking/cola) - A distributed crawling framework.
12081211
* [Demiurge](https://github.com/matiasb/demiurge) - PyQuery-based scraping micro-framework.
File renamed without changes.

mkdocs.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -21,5 +21,7 @@ extra:
2121
google_analytics:
2222
- 'UA-510626-7'
2323
- 'auto'
24+
extra_css:
25+
- css/extra.css
2426
pages:
2527
- "Life is short, you need Python.": "index.md"

0 commit comments

Comments
 (0)