A well-known project done right. Strong docs and solid engineering throughout.
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use it via CLI, REST API, or MCP server.
Documentation
95
Contributing guide is detailed and thorough.
README documents how to install the project.
README is present.
Licensed under MIT.
Engineering
94
CI is configured (.github/workflows/ci-docker.yaml).
Lockfile present (Cargo.lock). Installs are reproducible.
Test files detected (crates/xberg-candle-ocr/tests).
Linter or formatter configured (.editorconfig).
Issue or PR templates present.
Project health
86
Dependency manifest found (Cargo.toml).
Repository has a description.
Actively maintained (pushed within the last month).
.gitignore present.
Repository health signals
Activity, community, and responsiveness at scan time
Activity
- Commits (30d / 90d): -
- Forks: 506
- Releases: 205 (latest 1y ago)
Community
- Community health: 87% (Good)
- Bus factor: 1 (low; one author owns >50% of commits)
- Watchers: 8,565
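The report does not say how the bus factor is computed. As a minimal sketch, assuming the common definition (the smallest number of authors whose commits together exceed half of all commits), it could be derived from a list of commit author names like this. The function name `bus_factor` and the 0.5 threshold are illustrative assumptions, not the scanner's actual implementation.

```python
from collections import Counter


def bus_factor(commit_authors, threshold=0.5):
    """Return the smallest number of authors whose combined commits
    exceed `threshold` of all commits (hypothetical definition)."""
    counts = Counter(commit_authors)
    total = sum(counts.values())
    if total == 0:
        return 0  # no commits, so no authors to depend on

    covered = 0
    # Walk authors from most to fewest commits, accumulating their share.
    for rank, (_author, n) in enumerate(counts.most_common(), start=1):
        covered += n
        if covered / total > threshold:
            return rank
    return len(counts)
```

Under this definition, a result of 1 means a single author accounts for more than half of the commits, which matches the "low" rating above.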
Responsiveness
- Median issue response: 3h
- Median PR merge time: <1h
- Open issues: 20
Repository files (63 root entries)
- .ai-rulez
- .cargo
- .github
  - Good: CI is configured (.github/workflows/ci-docker.yaml).
  - Good: Dependabot covers 9 ecosystems (cargo, pip, npm, bundler, composer, gomod, maven, nuget, mix). Dependencies stay current.
  - Good: Issue or PR templates present.
- .task
- cli-proxy
- crates
  - Good: Test files detected (crates/xberg-candle-ocr/tests).
- docker
- docs
- e2e
- fixtures
- packages
- scripts
- templates
- tools
- .clang-format
- .dockerignore
- .editorconfig
  - Good: Linter or formatter configured (.editorconfig).
- .gh-actions-updater.toml
- .gitattributes
- .gitignore
  - Good: .gitignore present.
- .gitmodules
- .golangci.yml
- .hadolint.yaml
- .lychee.toml
- .npmrc
- .oxfmtrc.json
- .oxlintrc.json
- .pre-commit-config.yaml
- .rumdl.toml
- .sdkmanrc
- .shellcheckrc
- .textlintrc.json
- .typos.toml
- alef.toml
- ATTRIBUTIONS.md
- Cargo.lock
  - Good: Lockfile present (Cargo.lock). Installs are reproducible.
- Cargo.toml
  - Good: Dependency manifest found (Cargo.toml).
- CHANGELOG.md
- CODE_OF_CONDUCT.md
  - Good: Code of conduct present.
- composer.json
- composer.lock
- config.m4
- CONTRIBUTING.md
  - Good: Contributing guide is detailed and thorough.
  - Good: Contributing guide includes setup/install instructions.
  - Issue: Contributing guide lacks a code style section (−8 pts). Fix: Describe your linting/formatting rules and how to run them.
  - Issue: Contributing guide lacks a testing section (−8 pts). Fix: Show contributors how to run the test suite (e.g. npm test, pytest, cargo test).
  - Good: Contributing guide describes the PR/review workflow.
  - Good: Contributing guide includes code examples.
- deny.toml
- go.work
- go.work.sum
- LICENSE
  - Good: Licensed under MIT.
- package.json
- Package.swift
- pnpm-lock.yaml
- pnpm-workspace.yaml
- pyproject.toml
- README.md
  - Good: README is present.
  - Good: README is well structured with multiple sections.
  - Good: README includes screenshots or visuals. Great for first impressions.
  - Good: README has code examples.
  - Good: README links to a live demo or deployed app.
  - Good: README includes status badges.
  - Good: README documents how to install the project.
  - Good: README documents how to run the project.
- rust-toolchain.toml
- rustfmt.toml
- SECURITY.md
  - Good: Security policy present.
- server.json
- Taskfile.yml
- test_documents
- THIRD_PARTY_LICENSES.md
- tsconfig.json
- uv.lock
- zensical.toml