Repositories

DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
Python
emuMonitor is a tool for "palladium" and "zebu" usage information data-collection, data-analysis and data-display.
Python
Taming Stable Diffusion for Lip Sync!
Python
diffusion-models, lipsync, research, video-gen, virtual-avatars
iBOT: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
Jupyter Notebook
ibot, research, ssl
FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build AI workflow platforms faster and simpler.
TypeScript
ai, automation, coze, data-flow, diagram, flow, flowchart, graph, integration-framework, javascript, no-code, node-based-ui, react, typescript, typescript-library, visualization, workflow, workflow-automation
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
TypeScript
agent, agent-tars, browser-use, computer-use, cowork, gui-agent, gui-operator, mcp, mcp-server, multimodal, tars, ui-tars, vision, vlm
[ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"
Python
controllable-video-generation, in-context-generation, video, video-dataset, video-generation
The Family of Diffusion Protein Language Models (DPLM)
Python
research
Tarsier -- a family of large-scale video-language models designed to generate high-quality video descriptions, together with strong general video understanding capability.
Python
research
Q-Insight Family: Q-Insight, VQ-Insight and RALI (NeurIPS 2025 Spotlight, AAAI 2026 Oral, and ICLR 2026 Oral)
Python
research
Rust async runtime based on io-uring.
Rust
Awesome samples for Volcengine AgentKit Platform with VeADK.
Python
agent, ai, samples, volcengine
Python
research
Pioneering Automated GUI Interaction with Native Agents
Python
research
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skills, subagents, and a message gateway, it handles tasks of varying complexity that can take minutes to hours.
Python
agent, agentic, agentic-framework, agentic-workflow, ai, ai-agents, deep-research, harness, langchain, langgraph, langmanus, llm, multi-agent, nodejs, podcast, python, superagent, typescript