datefinder

Find dates inside text using Python and get back datetime objects (by akoumjian)

Datefinder Alternatives

Similar projects and alternatives to datefinder

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better datefinder alternative or higher similarity.

datefinder reviews and mentions

Posts with mentions or reviews of datefinder. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-18.
  • Sneller Regex vs Ripgrep
    3 projects | news.ycombinator.com | 18 May 2023
    That's with DFA minimization. Also, '\w' has 311 states while '(?-u)\w' has 5 states.

    I don't have a precise definition of enormous or impractical. Does it matter? I suppose one obvious one is when DFA construction time starts having a significant impact on total search times.

    > Additionally, the results are not the same: the number of matches is not equal to 7882. How could I make `\w` conform to other regex implementations in ripgrep?

    By following UTS#18: https://unicode.org/reports/tr18/#word

    Most regex engines make \w be ASCII-only by default. But most also have a way to opt into Unicode-aware mode. RE2, Go's regexp and ECMAScript are popular regex engines that have no way to change the interpretation of \w.

    > Fair question how regex compilers handle nefarious regexes. Go does not handle NFA with more than 1000 states, and, as you observed, we added some more restrictions when processing the NFA. It can be an interesting academic exercise to find monstrous regexes, but we haven't encountered useful regexes that hit these limits. But I guess you know some...

    It's definitely not academic. People use regexes for lexers. People use big regexes to recognize certain things like email addresses and dates. Here's a real regex used in real software to identify dates in unstructured text for example: https://github.com/akoumjian/datefinder/blob/5376ece0a522c44...

    Otherwise, as I hinted at above, the thing that can make regexes very large very quickly is when you mix Unicode classes with counted repetitions. It doesn't take a lot to make them "big."

  • Is there a Python library for reading human-written times?
    2 projects | /r/learnpython | 23 Sep 2021
  • Tuesday Daily Thread: Advanced questions
    2 projects | /r/Python | 27 Apr 2021
    Looking at this issue it seems a recent pull request should fix the strict mode problem. That said, the pull request is still open due to a failing test so you can either build from source with the pull request or looking at the comments in the issue, look at dateparser as is mentioned. It might suit your needs.
  • A note from our sponsor - SaaSHub
    www.saashub.com | 30 Apr 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic datefinder repo stats
3
625
0.0
about 1 year ago

akoumjian/datefinder is an open source project licensed under MIT License which is an OSI approved license.

The primary programming language of datefinder is HTML.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com