exporters

Export Hugging Face models to Core ML and TensorFlow Lite (by huggingface)

Exporters Alternatives

Similar projects and alternatives to exporters

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better exporters alternative or higher similarity.

exporters reviews and mentions

Posts with mentions or reviews of exporters. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-07.
  • I made an app that runs Mistral 7B 0.2 LLM locally on iPhone Pros
    12 projects | news.ycombinator.com | 7 Jan 2024
    Conceptually, to the best of my understanding, nothing too serious; perhaps the inefficiency of processing a larger input than necessary?

    Practically, a few things:

    If you want to have your cake & eat it too, they recommend Enumerated Shapes[1] in their coremltools docs, where CoreML precompiles up to 128 (!) variants of input shapes, but again this is fairly limiting (1 tok, 2 tok, 3 tok... up to 128 token prompts.. maybe you enforce a minimum, say 80 tokens to account for a system prompt, so up to 200 tokens, but... still pretty short). But this is only compatible with CPU inference, so that reduces its appeal.

    It seems like its current state was designed for text embedding models, where you normalize input length by chunking (often 128 or 256 tokens) and operate on the chunks ā€” and indeed, thatā€™s the only text-based CoreML model that Apple ships today, a Bert embedding model tuned for Q&A[2], not an LLM.

    You could used a fixed input length thatā€™s fairly large; I havenā€™t experimented with it once I grasped the memory requirements, but from what I gather from HuggingFaceā€™s announcement blog post[3], it seems that is what they do with swift-transformers & their CoreML conversions, handling the details for you[4][5]. I havenā€™t carefully investigated the implementation, but Iā€™m curious to learn more!

    You can be sure that no one is more aware of all this than Apple ā€” they published "Deploying Transformers on the Apple Neural Engine" in June 2022[6]. I look forward to seeing what they cook up for developers at WWDC this year!

    ---

    [1] "Use `EnumeratedShapes` for best performance. During compilation the model can be optimized on the device for the finite set of input shapes. You can provide up to 128 different shapes." https://apple.github.io/coremltools/docs-guides/source/flexi...

    [2] BertSQUAD.mlmodel (fp16) https://developer.apple.com/machine-learning/models/#text

    [3] https://huggingface.co/blog/swift-coreml-llm#optimization

    [4] `use_fixed_shapes` "Retrieve the max sequence length from the model configuration, or use a hardcoded value (currently 128). This can be subclassed to support custom lengths." https://github.com/huggingface/exporters/pull/37/files#diff-...

    [5] `use_flexible_shapes` "When True, inputs are allowed to use sequence lengths of `1` up to `maxSequenceLength`. Unfortunately, this currently prevents the model from running on GPU or the Neural Engine. We default to `False`, but this can be overridden in custom configurations." https://github.com/huggingface/exporters/pull/37/files#diff-...

    [6] https://machinelearning.apple.com/research/neural-engine-tra...

  • [P] Deploying Transformers with Apple's Core ML
    1 project | /r/MachineLearning | 1 Sep 2022
    Give it a try and leave a ā­ļø if you find it useful šŸ‘‰: https://github.com/huggingface/exporters

Stats

Basic exporters repo stats
3
526
7.1
5 months ago

huggingface/exporters is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of exporters is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com