Stars
Examples and guides for using the OpenAI API
开源微信爬虫:爬取公众号所有 文章、阅读量、点赞量和评论内容。易部署。持续维护!!!
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
R package fastLink: Fast Probabilistic Record Linkage
In this project we manage data from the Observatory of Economic Complexity (OEC) and from the Global Trade Alert (GTA) to construct a measure of how well countries protect and liberalize their mark…
Data Visualization for Global Trade Alert using ggplot.
R package containing functions to work with Global Trade Alert data
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Te…
This repository provides the replication code for the analysis results in Kelly, B., Papanikolaou, D., Seru, A. and Taddy, M., 2021. American Economic Review: Insights.
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
Retrieval and Retrieval-augmented LLMs
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.
What can be learned from 1M+ college course syllabi? (OLD)
A Flexible Deep Learning Approach to Fuzzy String Matching
🆔 Examples for using the dedupe library
This 10-week training program is designed to prepare incoming pre-doctoral research fellows at the Princeton Empirical Studies of Conflict (ESOC) lab with the skills needed to support faculty resea…
Guide for Tencent Cloud GPU server configs
Match Patent Assignees with Compustat and SDC via Bing Search
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…
A mapping between SDCs M&A database and the gvkey's in Compustat
A topic-centric list of HQ open datasets.
This repository provides updates and extended data following Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017