
DeerFlow (Deep Exploration and Efficient Research Flow) is ByteDance's community-driven open-source deep research framework, initially released under the MIT license in 2025 and upgraded to version 2.0 in March 2026 as a full 'SuperAgent Harness.' It deeply integrates large language models with web search, crawling, Python code execution, RAG knowledge base retrieval, and MCP tool invocation, orchestrating them through a stateful graph workflow built on LangGraph. The system decomposes high-level research tasks into parallelized sub-task pipelines, dispatching work to five specialized agent roles: Coordinator, Planner, Researcher, Coder, and Reporter. Execution occurs inside isolated Docker sandboxes for security, enabling DeerFlow to safely run code, build web apps, and produce structured research reports, PowerPoint presentations, and AI-generated podcast audio all from a single natural language prompt.
| ✕Traditional Pain Points | ✓Innovative Solutions |
|---|---|
| Traditional single-agent frameworks such as early AutoGPT struggle to break down long-horizon complex tasks and often loop or fail midway | The SuperAgent Harness architecture treats the framework as an orchestration substrate rather than a single agent, enabling stronger extensibility |
| Mainstream research products such as Perplexity and OpenAI Deep Research are largely closed black boxes, limiting customization of LLMs, toolchains, and knowledge strategies | Docker isolation plus a persistent filesystem allows secure code execution, file writing, and full web app construction |
| Many open-source research pipelines lack code sandboxes and multimodal outputs, producing only plain text | Through litellm, DeerFlow unifies access to 100+ models including GPT-4, Claude, and Qwen, improving switching flexibility and cost control |
| Some multi-agent frameworks such as CrewAI still have gaps in RAG integration and MCP protocol support, making private knowledge onboarding expensive | The Human-in-the-Loop mechanism enables live plan revision in natural language, balancing automation with operator control |
| - | It natively produces reports, PowerPoint slides, and TTS podcast output, going beyond text-only research tools |
1git clone https://github.com/bytedance/deer-flow.git && cd deer-flow && pip install -r requirements.txt1cp .env.example .env2# Edit .env, fill in OPENAI_API_KEY or ANTHROPIC_API_KEY, and TAVILY_API_KEY1uvicorn src.app:app --host 0.0.0.0 --port 8000 --reload1cd web && npm install && npm run dev1docker compose up -d| Core Scene | Target Audience | Solution | Outcome |
|---|---|---|---|
| [Competitive Intelligence Research] | Market analysts and strategy teams | Enter a competitor name, let Researcher agents gather earnings, news, and product updates, Coder agents run comparisons, and Reporter agents produce charted reports plus PPT | Compresses 2 to 3 days of manual work into roughly 30 minutes while improving coverage and freshness |
| [Academic Literature Review Generation] | University researchers and thesis writers | Enter research keywords, then the system searches papers and web sources and combines them with a private RAG base to summarize methods, findings, and gaps | Produces structured reviews faster and reduces missed-source risk |
| [Automated Content Marketing Pipeline] | Content teams and solo creators | After a topic is entered, DeerFlow automates research, writing, chart creation, and podcast plus slide generation | Enables one operator to produce multimodal content that previously needed coordinated team effort |