A tool for extracting statistics from python ASTs (Abstract Syntax Trees) and jupyter notebooks

A command line program that recursively processes files to find either python scripts or jupyter notebooks and processes them to extract statistics. Python scripts and jupyter notebooks are treated in a very different maner.

From jupyter notebooks the size in terms of lines of code of code or markdown cells are aggregated across all files. For Python scripts there are two different options either aggregate the token rank and frequency distribution or calculate the size (in terms of lines of code) of all classes and functions in the scripts.

Installing / Getting started

python -m pip install clone git+https://github.com/sellisd/pycodeseq.git@main

To parse the ASTs in all python files found under pyrepository use the following:

pycodeseq --input_path pyrepository --output stats.tsv --method levels

To calculate the token rank frequency distribution of all python files under pyrepository run:

pycodeseq --input_path pyrepository --output stats.tsv --method tokens

To calculate the size of different types of cells in jupyter notebooks under pyrepository run:

pycodeseq --input_path pyrepository --output stats.tsv --method cells

Developing

To develop it is easier to clone and install editable:

git clone [email protected]:sellisd/pycodeseq.git
cd pycodeseq/
pip install -e .

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
.github/workflows		.github/workflows
pycodeseq		pycodeseq
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

A tool for extracting statistics from python ASTs (Abstract Syntax Trees) and jupyter notebooks

Installing / Getting started

Developing

About

Uh oh!

Releases

Packages

Languages

sellisd/pycodeseq

Folders and files

Latest commit

History

Repository files navigation

A tool for extracting statistics from python ASTs (Abstract Syntax Trees) and jupyter notebooks

Installing / Getting started

Developing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages