Skip to content

Releases: MantisAI/sieves

v0.13.0: Construct pipelines with +; simplify progress logging

25 Sep 08:44
Compare
Choose a tag to compare

✨ New features and improvements

  • Replace >> operator with +, += for task chaining (#159)
  • Simplify progress logging by logging progress only on pipeline level (#160)

🔴 Bug fixes

-

⚠️ Backwards incompatibilities

  • Use +, += instead of >> for task chaining (#159)

📖 Documentation and examples

-

👥 Contributors

@rmitsch


Full Changelog: v0.12.0...v0.13.0

v0.12.0: Construct pipelines with >>; PyData demo; QoL improvements

24 Sep 09:43
Compare
Choose a tag to compare

✨ New features and improvements

  • Add >> operator for task chaining (#158)
  • Make classification and handling more robust (#154)
  • Rename task OCR to Ingestion (#156, #157)

🔴 Bug fixes

-

⚠️ Backwards incompatibilities

  • Renamed task OCR to Ingestion (#156)

📖 Documentation and examples

  • PyData demo (#156)

👥 Contributors

@rmitsch


Full Changelog: v0.11.1...v0.12.0

v0.11.1: Fix engine import; upgrade to Outlines v1.x; temporarily disable vLLM

29 Jul 21:06
6414bd0
Compare
Choose a tag to compare

✨ New features and improvements

  • Migrate to support Outlines >= 1.0 (#146)

🔴 Bug fixes

  • Resolve #145: direct engine object import didn't work (#146)

⚠️ Backwards incompatibilities

  • vllm support has been temporarily disabled (#146). This is because vllm currently doesn't support outlines >= 1.0. Will be re-enabled once vllm migrates to outlines v1.

📖 Documentation and examples

-

👥 Contributors

@rmitsch


Full Changelog: v0.11.0...v0.11.1

v0.11.0: Distillation, caching, making engine dependencies optional

11 May 11:25
Compare
Choose a tag to compare

✨ New features and improvements

  • New Distillation task that allows to distill a local model using setfit or model2vec. Currently only implemented for the classification task. (#138)
  • Caching support: Pipeline objects now optionally cache documents to ensure that no document is processed twice unnecessarily. (#140)
  • Missing third-party dependencies (like vllm, instructor, outlines, ...) at import time are now allowed. This enables running sieves with the engine of your choice without having to have all engine dependencies allowed. If you still want to install all dependencies, install with pip install sieves[all]. (#137)

🔴 Bug fixes

-

⚠️ Backwards incompatibilities

  • Engine dependencies (except for outlines, which is used as a default engine) are not installed by default. If you want to install all engine dependencies, install with pip install sieves[all].

📖 Documentation and examples

-

👥 Contributors

@rmitsch


Full Changelog: v0.10.0...v0.11.0

v0.10.0: vLLM, Hugging Face datasets conversion, single-label classification

22 Apr 12:19
Compare
Choose a tag to compare

✨ New features and improvements

  • vLLM support (#134)
  • Add Doc.from_hf_dataset() to generate Doc instances from HF datasets (#130)
  • Add single-label prediction mode for Classification task (#132, #134)
  • Simplified interface for chunking tasks (#125)
  • Speed up test suite (#121)
  • Set default model for Engine (#129)

🔴 Bug fixes

-

⚠️ Backwards incompatibilities

-

📖 Documentation and examples

-

👥 Contributors

@rmitsch


Full Changelog: v0.9.0...v0.10.0

v0.9.0: OCR simplification, NER task

06 Apr 20:02
Compare
Choose a tag to compare
Pre-release

✨ New features and improvements

  • Unify tasks for doing OCR (Docling, Marker) in single OCR task (#111, #114)
  • Add task for OCR with Marker (#106)
  • Extend classification task to support label descriptions (#114)
  • Speed up test suite (#121)

🔴 Bug fixes

-

⚠️ Backwards incompatibilities

-

📖 Documentation and examples

-

👥 Contributors

@adgianv, @rmitsch


Full Changelog: v0.8.0...v0.9.0

v0.8.0: PII masking, API simplifications, doc fixes

15 Mar 13:41
Compare
Choose a tag to compare

✨ New features and improvements

  • Add task for masking personally identifiable information (#101)
  • Simplify usage of engines: no need to use specific engines like Outlines anymore, instead the new Engine class deals with this based on the passed model class (#108)
  • Simplify config and usage of GliX, Ollama, Instructor engines (#100)

🔴 Bug fixes

  • Fix errors in documentation (#93, #94)

⚠️ Backwards incompatibilities

  • PredictiveTask instances now expect an engine of type Engine instead of GliXEngine | HuggingFaceEngine | ....

📖 Documentation and examples

-

👥 Contributors

@rmitsch, @nsorros


Full Changelog: v0.7.0...v0.8.0

v0.7.0: Sentiment analysis, installation fixes

22 Feb 22:00
Compare
Choose a tag to compare

✨ New features and improvements

  • Add sentiment analysis task (#90)

🔴 Bug fixes

  • Fix bug in fetching version in Config (#90)

⚠️ Backwards incompatibilities

-

📖 Documentation and examples

-

👥 Contributors

@rmitsch


Full Changelog: v0.6.1...v0.7.0

v0.6.1: Fix PyPi package installation

22 Feb 15:46
Compare
Choose a tag to compare
Pre-release

✨ New features and improvements

-

🔴 Bug fixes

⚠️ Backwards incompatibilities

-

📖 Documentation and examples

-

👥 Contributors

@nsorros, @rmitsch


Full Changelog: v0.6.0...v0.6.1

v0.6.0: Summarization, translation, multi-question answering

09 Feb 19:37
Compare
Choose a tag to compare

✨ New features and improvements

  • Add translation task (#81)
  • Add summarization task (#82)
  • Add multi-question answering task (#83)
  • overwrite flag determining whether original text should be overwritten for tasks with a textual output (e.g. translation) (#81)

🔴 Bug fixes

-

⚠️ Backwards incompatibilities

  • Bridge moved from sieves.task.predictive.core into sieves.task.predictive.bridges

📖 Documentation and examples

-

👥 Contributors

@rmitsch


Full Changelog: v0.4.0...v0.5.0