Starred repositories
SEED-Story: Multimodal Long Story Generation with Large Language Model
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.5 Sonnet API and Firecrawl. While currently configured for e…
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
liseami / screenshot-to-code
Forked from abi/screenshot-to-codeDrop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]
An AI web browsing framework focused on simplicity and extensibility.
ai-generated apps , full stack + generative UI
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Build your own second brain with supermemory. It's a ChatGPT for your bookmarks. Import tweets or save websites and content using the chrome extension.
Open-source Next.js template for building apps that are fully generated by AI. By E2B.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
OpenMMLab Pose Estimation Toolbox and Benchmark.
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
Prompt, run, edit, and deploy full-stack web applications
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
A cross-platform, customizable science fiction terminal emulator with advanced monitoring & touchscreen support.
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。