0

/ 100

GradeA

A well-known project done right. Strong docs and solid engineering throughout.

A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.

Documentation

95

Contributing guide5pt89

Contributing guide is detailed and thorough.

Install and run instructions9pt90

README documents how to install the project.

README12pt100

README is present.

License6pt100

Licensed under MIT.

Engineering

94

CI/CD14pt85

CI is configured (.github/workflows/ci-docker.yaml).

Reproducibility6pt90

Lockfile present (Cargo.lock). Installs are reproducible.

Tests18pt100

Test files detected (crates/xberg-candle-ocr/tests).

Linting and formatting5pt100

Linter or formatter configured (.editorconfig).

Issue and PR templates6pt100

Issue or PR templates present.

Project health

86

Dependency manifest6pt55

Dependency manifest found (Cargo.toml).

Repository metadata5pt100

Repository has a description.

Activity5pt100

Actively maintained (pushed within the last month).

Housekeeping3pt100

.gitignore present.

Repository health signals

Activity, community, and responsiveness at scan time

Activity

  • -
    Commits (30d / 90d)
  • 506
    Forks
  • 205
    Releaseslatest 1y ago

Community

  • 87% - Good
    Community health
  • 1 bus factorlow
    author own >50% of commits
  • 8,565
    Watchers

Responsiveness

  • 3h
    Median issue response
  • <1h
    Median PR merge time
  • 20
    Open issues
Repository files63 root entries
  • .ai-rulez
  • .cargo
  • .github
    Good: CI is configured (.github/workflows/ci-docker.yaml).
    Good: Dependabot covers 9 ecosystems (cargo, pip, npm, bundler, composer, gomod, maven, nuget, mix). Dependencies stay current.
    Good: Issue or PR templates present.
  • .task
  • cli-proxy
  • crates
    Good: Test files detected (crates/xberg-candle-ocr/tests).
  • docker
  • docs
  • e2e
  • fixtures
  • packages
  • scripts
  • templates
  • tools
  • .clang-format
  • .dockerignore
  • .editorconfig
    Good: Linter or formatter configured (.editorconfig).
  • .gh-actions-updater.toml
  • .gitattributes
  • .gitignore
    Good: .gitignore present.
  • .gitmodules
  • .golangci.yml
  • .hadolint.yaml
  • .lychee.toml
  • .npmrc
  • .oxfmtrc.json
  • .oxlintrc.json
  • .pre-commit-config.yaml
  • .rumdl.toml
  • .sdkmanrc
  • .shellcheckrc
  • .textlintrc.json
  • .typos.toml
  • alef.toml
  • ATTRIBUTIONS.md
  • Cargo.lock
    Good: Lockfile present (Cargo.lock). Installs are reproducible.
  • Cargo.toml
    Good: Dependency manifest found (Cargo.toml).
  • CHANGELOG.md
  • CODE_OF_CONDUCT.md
    Good: Code of conduct present.
  • composer.json
  • composer.lock
  • config.m4
  • CONTRIBUTING.md
    Good: Contributing guide is detailed and thorough.
    Good: Contributing guide includes setup/install instructions.
    Issue: Contributing guide lacks a code style section (−8 pts).Fix: Describe your linting/formatting rules and how to run them.
    Issue: Contributing guide lacks a testing section (−8 pts).Fix: Show contributors how to run the test suite (e.g. npm test, pytest, cargo test).
    Good: Contributing guide describes the PR/review workflow.
    Good: Contributing guide includes code examples.
  • deny.toml
  • go.work
  • go.work.sum
  • LICENSE
    Good: Licensed under MIT.
  • package.json
  • Package.swift
  • pnpm-lock.yaml
  • pnpm-workspace.yaml
  • pyproject.toml
  • README.md
    Good: README is present.
    Good: README is well structured with multiple sections.
    Good: README includes screenshots or visuals. Great for first impressions.
    Good: README has code examples.
    Good: README links to a live demo or deployed app.
    Good: README includes status badges.
    Good: README documents how to install the project.
    Good: README documents how to run the project.
  • rust-toolchain.toml
  • rustfmt.toml
  • SECURITY.md
    Good: Security policy present.
  • server.json
  • Taskfile.yml
  • test_documents
  • THIRD_PARTY_LICENSES.md
  • tsconfig.json
  • uv.lock
  • zensical.toml