Data Related Tools A set of tools to preprocess, generate and curate data. Running on CPU Upgrade 40 40 Argilla Space ✍ Running 419 419 Synthetic Data Generator 🧬 Build datasets using natural language Running 250 250 Infinite Dataset Hub ♾ Search and save datasets generated with a LLM in real time
Instruction Models Popular instruction models to be run with Inference Endpoints meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated Oct 24, 2024 • 1.67M • • 764 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated Oct 24, 2024 • 1.44M • • 1.05k meta-llama/Llama-3.3-70B-Instruct Text Generation • Updated Dec 21, 2024 • 519k • • 1.99k meta-llama/Llama-3.1-8B-Instruct Text Generation • Updated Sep 25, 2024 • 6.04M • • 3.64k
sdiazlor/deepseek-r1-distill-qwen-1.5-unsloth-sft-python-60-steps Text Generation • Updated 9 days ago • 6
sdiazlor/modernbert-embed-base-crossencoder-human-rights Text Classification • Updated 30 days ago • 17
sdiazlor/modernbert-embed-base-crossencoder-human-rights-1-epoch Text Classification • Updated 30 days ago • 8
sdiazlor/modernbert-embed-base-biencoder-human-rights Sentence Similarity • Updated about 1 month ago • 17