    Repositories list

    • crawlab

      Public
      Distributed web crawler admin platform for managing spiders regardless of language or framework.
      Go
      1.9k12k1472 Updated Aug 16, 2025
    • Documentation for Crawlab
      JavaScript
      533922 Updated Jul 16, 2025
    • crawleval

      Public archive
      Resources and tools for evaluating the performance and behavior of web crawling systems
      Python
      1700 Updated May 9, 2025
    • 🎉 A Vue.js 3.0 UI Library made by Crawlab team
      Vue
      19k2300 Updated Apr 14, 2025
    • bm25

      Public
      A Go implementation of various BM25 algorithms. It is a port of `dorianbrown/rank_bm25`.
      Go
      3000 Updated Mar 28, 2025
    • Python
      4412 Updated Mar 18, 2025
    • fizz

      Public
      🍋 Gin wrapper with OpenAPI 3 spec generation
      Go
      56000 Updated Mar 17, 2025
    • mcp-go

      Public
      A Go implementation of the Model Context Protocol (MCP), enabling seamless integration between LLM applications and external data sources and tools.
      Go
      710000 Updated Mar 12, 2025
    • JavaScript
      0000 Updated Mar 6, 2025
    • Python SDK for Crawlab
      Python
      1100 Updated Feb 14, 2025
    • Node.js SDK for Crawlab
      TypeScript
      0000 Updated Jan 6, 2025
    • Crawlab Go SDK
      Go
      91000 Updated Jan 6, 2025
    • Java SDK for Crawlab
      Java
      0000 Updated Jan 3, 2025
    • e2e-tests

      Public
      TypeScript
      0100 Updated Dec 20, 2024
    • 0000 Updated Dec 19, 2024
    • crawlab-core

      Public archive
      Backend core modules for Crawlab
      Go
      565000 Updated Jun 14, 2024
    • crawlab-grpc

      Public archive
      gRPC for Crawlab
      Shell
      12400 Updated Jun 14, 2024
    • docker-base-images

      Public archive
      Shell
      7200 Updated Jun 11, 2024
    • crawlab-demo

      Public archive
      Python
      6200 Updated Jun 4, 2024
    • SDK for Crawlab AI
      Python
      3820 Updated Jun 4, 2024
    • SDK for Crawlab, including SDK for different programming languages such as Python, Node.js and Java, and a CLI Tool written in Python.
      Python
      525625 Updated Jun 3, 2024
    • Backend file system module for Crawlab
      Go
      12400 Updated Mar 29, 2024
    • A complete Golang client for SeaweedFS
      Go
      51400 Updated Mar 29, 2024
    • scrapy-ai

      Public
      AI-powered scrapy plugin
      Python
      0100 Updated Feb 7, 2024
    • Version Control System (VCS) for Crawlab
      Go
      11401 Updated Aug 8, 2023
    • webspot

      Public archive
      An intelligent web service to automatically detect web content and extract information from it.
      Python
      128610 Updated Jul 13, 2023
    • C#
      1000 Updated May 31, 2023
    • Convert HTML to JSON. Can also intelligently convert HTML tables to JSON, using table headers (if available) as keys in the resulting JSON.
      HTML
      8400 Updated Apr 1, 2023
    • artipub

      Public
      Article publishing platform that automatically distributes your articles to various media channels
      TypeScript
      5333.1k2911 Updated Mar 5, 2023
    • Lite version of the Crawlab crawler management platform.
      Vue
      7722818 Updated Feb 9, 2023