Our great sponsors
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Cape has recently deployed a confidential optical character recognition (OCR) service. Anyone can try it through the Cape UI after signing up through Cape’s website. You can also integrate the confidential OCR service with your application using the cape-js and pycape SDKs. In this blog post, we will use this OCR service example to demonstrate how you can benefit from a machine learning service while maintaining users’ data confidentiality.
For its OCR service, Cape uses the excellent Python docTR library. Some of the critical benefits of docTR are its ease of use, flexibility, and matching state-of-the-art performance. The OCR model consists of two steps: text detection and text recognition. Cape uses a pre-trained DB Resnet50 architecture for detection, and for recognition, it uses a MobileNetV3 Small architecture. To learn more about the level of OCR accuracy you can expect for your document, you can consult these benchmarks provided by docTR. As you will see, model performance is very competitive compared to other commercial services.
In this blog post, we focus on invoking the OCR service. However, if you want to learn how to deploy your machine learning model with Cape, you can check this example, where we deploy an image classification model with the ONNX runtime.
Once you have your access token, you can invoke the OCR service. Here is a code snippet showing you how to do it with cape-js.
In this blog post, we focus on invoking the OCR service. However, if you want to learn how to deploy your machine learning model with Cape, you can check this example, where we deploy an image classification model with the ONNX runtime.
The code examples presented in this blog post are in the Cape's function examples repository. And you can learn more from the documentation.