TextRecognitionDataGenerator
A synthetic data generator for text recognition (by Belval)
faker
Faker is a Python package that generates fake data for you. (by joke2k)
TextRecognitionDataGenerator | faker | |
---|---|---|
1 | 9 | |
3,043 | 17,101 | |
- | - | |
5.1 | 9.5 | |
3 months ago | 7 days ago | |
Python | Python | |
MIT License | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
TextRecognitionDataGenerator
Posts with mentions or reviews of TextRecognitionDataGenerator.
We have used some of these posts to build our list of alternatives
and similar projects.
-
[D] How to generate syntactically correct text examples for CRNN-CTC
[1]: https://github.com/Belval/TextRecognitionDataGenerator
faker
Posts with mentions or reviews of faker.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-01-27.
-
Leveling up your custom fake data with Faker.js
Faker was originally written in Perl and is also available as a library for Ruby, Java, and Python.
-
The Uncreative Software Engineer's Compendium to Testing
Faker: a library that generates fake data that can be useful when you need data to test for various components.
-
Exploring LLMs for Data Synthesizing & Anonymization: looking for Insights on Current & Future Solutions
Don't get me wrong, LLMs are awesome but totally unsuited for what you are describing. Classic data science tools like faker will be better for the task in pretty much every aspect. They can generate synthetic datasets and anonymize existing ones faster and far more reliable than any LLM.
-
Undercover work
The Python package, Faker, is just what you're looking for!
-
Is there a way to automate testing in python? In my case :
for datatypes like string/date and other stuff, there is a Python library called faker, which you can use. It can generate fake names, fake phone numbers, dates, and addresses. here is the link to the documentation. https://faker.readthedocs.io/ here is a link to a blog post explaining Faker. https://levelup.gitconnected.com/pythons-faker-library-your-go-to-solution-for-test-data-generation-3a070065cc04
-
Testing files in Python like a pro
Then test cases became more complex. Primary data sources were often files. We needed to test pipelines. Faker still helped a lot, but it was not convenient to copy your last-best-approach for files and reinvent the wheel over and over with each project.
-
Database automation challenges and how to solve them
For a cloud-based solution, one can write their own Terraform or CloudFormation for installation as soon as their RDS instance boots up with appropriate security and authentication details. For a local dev environment, one can rely on Faker to create mock database data for your database.
-
How to create a 1M record table with a single query
Creating realistic fake data is useful in lower environments and for load testing. Outside of SQL I like faker: https://github.com/joke2k/faker
-
DuckDB: an embedded DB for data wrangling
To test a database, first you need some data. So I created a python script and used Faker to create the following CSV files: