A command-line program that recursively searches a directory for Python scripts or Jupyter notebooks and extracts statistics from them. Python scripts and Jupyter notebooks are handled very differently.
For Jupyter notebooks, the sizes of code and markdown cells, measured in lines, are aggregated across all files. For Python scripts there are two options: aggregate the token rank-frequency distribution, or calculate the size (in lines of code) of all classes and functions in the scripts.
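To make the two Python-script statistics concrete, here is a minimal sketch using only the standard library. It is not pycodeseq's own implementation: the function names `token_rank_frequency` and `definition_sizes`, the choice of which token types to skip, and the `SOURCE` sample are all assumptions made for illustration. It needs Python 3.8 or later for `end_lineno`.

```python
import ast
import io
import tokenize
from collections import Counter

# Hypothetical sample input; any Python source string would do.
SOURCE = '''\
class Greeter:
    def greet(self, name):
        return "hello " + name

def add(a, b):
    return a + b
'''


def token_rank_frequency(source):
    """Return (rank, token, count) tuples, most frequent token first."""
    # Assumption: layout-only tokens and comments are not counted.
    skip = {
        tokenize.NEWLINE, tokenize.NL, tokenize.INDENT,
        tokenize.DEDENT, tokenize.COMMENT, tokenize.ENDMARKER,
    }
    tokens = tokenize.generate_tokens(io.StringIO(source).readline)
    counts = Counter(tok.string for tok in tokens if tok.type not in skip)
    return [
        (rank, tok, n)
        for rank, (tok, n) in enumerate(counts.most_common(), start=1)
    ]


def definition_sizes(source):
    """Return (kind, name, lines) for every class and function found in the AST."""
    sizes = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, (ast.ClassDef, ast.FunctionDef, ast.AsyncFunctionDef)):
            kind = "class" if isinstance(node, ast.ClassDef) else "function"
            # Size is the number of source lines spanned by the definition.
            sizes.append((kind, node.name, node.end_lineno - node.lineno + 1))
    return sizes


if __name__ == "__main__":
    for rank, tok, n in token_rank_frequency(SOURCE)[:5]:
        print(rank, repr(tok), n)
    for kind, name, lines in definition_sizes(SOURCE):
        print(kind, name, lines)
```

Nested definitions (such as the method inside the class) are reported separately here, so their lines are also included in the enclosing class's size; whether pycodeseq counts them the same way is not stated above.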
To install the latest version from GitHub, run:
python -m pip install git+https://github.com/sellisd/pycodeseq.git@main
To parse the ASTs of all Python files found under pyrepository, use the following:
pycodeseq --input_path pyrepository --output stats.tsv --method levels
To calculate the token rank-frequency distribution of all Python files under pyrepository, run:
pycodeseq --input_path pyrepository --output stats.tsv --method tokens
To calculate the size of the different types of cells in Jupyter notebooks under pyrepository, run:
pycodeseq --input_path pyrepository --output stats.tsv --method cells
For development, it is easier to clone the repository and install it in editable mode:
git clone git@github.com:sellisd/pycodeseq.git
cd pycodeseq/
pip install -e .
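As a rough illustration of what the notebook statistic measures, the sketch below sums the number of lines per cell type in a single notebook, using only the standard `json` module. It is an assumption about the general approach, not pycodeseq's actual code: the function name `cell_line_counts` and the `EXAMPLE` notebook are hypothetical, and the notebook is assumed to follow the version 4 JSON layout, where each cell has a `cell_type` and a `source` that is either a string or a list of strings.

```python
import json
from collections import Counter


def cell_line_counts(notebook_json):
    """Return a dict mapping cell type to the total number of source lines."""
    notebook = json.loads(notebook_json)
    totals = Counter()
    for cell in notebook.get("cells", []):
        source = cell.get("source", "")
        # The notebook format allows `source` as one string or a list of strings.
        text = "".join(source) if isinstance(source, list) else source
        totals[cell.get("cell_type", "unknown")] += len(text.splitlines())
    return dict(totals)


# Hypothetical in-memory notebook with one markdown cell and two code cells.
EXAMPLE = json.dumps({
    "nbformat": 4,
    "nbformat_minor": 5,
    "metadata": {},
    "cells": [
        {"cell_type": "markdown", "metadata": {},
         "source": ["# Title\n", "Some text"]},
        {"cell_type": "code", "metadata": {}, "outputs": [],
         "execution_count": None,
         "source": "import math\nprint(math.pi)\n"},
        {"cell_type": "code", "metadata": {}, "outputs": [],
         "execution_count": None,
         "source": ["x = 1"]},
    ],
})

if __name__ == "__main__":
    print(cell_line_counts(EXAMPLE))
```

To aggregate across a repository, the per-notebook dictionaries could be summed for every `.ipynb` file found under the input path.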