Metadata-Version: 2.4
Name: corpustools
Version: 2.5.0
Summary: Manage giellalt corpora
License: GPLv3+
License-File: LICENSE.txt
Author: Børre Gaup
Author-email: albbas@gmail.com
Requires-Python: >=3.10,<4.0
Classifier: License :: Other/Proprietary License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Programming Language :: Python :: 3.14
Requires-Dist: GitPython (>=3.1.44,<4.0.0)
Requires-Dist: epub (>=0.5.2,<0.6.0)
Requires-Dist: feedparser (>=6.0.10,<7.0.0)
Requires-Dist: gitdb (>=4.0.11,<5.0.0)
Requires-Dist: html5lib (>=1.1,<2.0)
Requires-Dist: lxml-html-clean (>=0.4.2,<0.5.0)
Requires-Dist: prompt-toolkit (>=3.0.51,<4.0.0)
Requires-Dist: pytesseract (>=0.3.10,<0.4.0)
Requires-Dist: python-dateutil (>=2.8.2,<3.0.0)
Requires-Dist: python_tca2 @ git+https://github.com/divvun/python_tca2.git@main
Requires-Dist: regex (>=2024.11.6,<2025.0.0)
Requires-Dist: requests (>=2.31.0,<3.0.0)
Requires-Dist: unidecode (>=1.4.0,<2.0.0)
Project-URL: Homepage, https://github.com/giellalt/CorpusTools
Description-Content-Type: text/markdown

# Corpus Tools

Tools to manipulate a giellalt corpus in different ways.

## Install and update from Apertium nightly

These tools are a part of
[Apertium nightly packages](https://wiki.apertium.org/wiki/Installation/Developers).

For Mac users, running the `install-nightly.sh` suffices, Linux users will have
to run `<package-manager> install <name of package>`, as well (search for corpustools in
the package system).

To update the tools on Mac, run `install-nightly.sh`. On Linux, update packages
using the package manager.

## Install and update using pipx

- [Install pipx](https://pypa.github.io/pipx/installation/)
- Run `pipx install --force git+https://github.com/divvun/CorpusTools.git`

[API and manual pages](https://divvun.github.io/CorpusTools/)

