Co-pilot for pandas users, AI that understands the content of data, greatly enhancing the relevance of suggestions.
Adding data context to AI code-writing assistants, usable in any Jupyter in seconds.
pip install sketch
import sketch
...
df.sketch.howto("Check for any duplicate rows, and keep the first one based on the time feature")
(Show a copilot suggestion with comment block, and its output) (Show a GPT-3 codex response)
If you add OPENAI_API_KEY
environment variable and LOCAL_LAMBDA_PROMPT=True
, then sketch will run the prompts locally, directly using your API key with openAI's endpoints.
Sketch uses efficient approximation algorithms (data sketches) to quickly summarize your data, and feed that information into language models. Right now it does this by summarizing the columns and writing these summary statistics as additional context to be used by the code-writing prompt. In the future we hope to feed these sketches directly into custom made "data + language" foundation models.