rbspy
Nokogiri
Our great sponsors
rbspy | Nokogiri | |
---|---|---|
10 | 20 | |
2,459 | 6,105 | |
0.6% | 0.2% | |
8.6 | 9.4 | |
9 days ago | 1 day ago | |
Rust | C | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
rbspy
- Flameshow: A Terminal Flamegraph Viewer
-
When counting lines in Ruby randomly failed SerpApi deployments
We used rbspy to generate the flamegraph:
-
EventMachine Performance Spikes
You could use rbspy to profile the EventMachine process as it's doing the workload, and try to see there where most of the time is being spent.
-
Rails Resque - AWS ECS task randomly stuck
Not sure what your issue is and you got some of this info from strace, but FYI rbspy can also help profile where a running Ruby process is spending its time: https://rbspy.github.io/. I think it's a little more helpful than strace in that it should bring it back to Ruby code rather than just showing the raw syscalls.
-
Optimizing your tests in 5 steps
Even the most general profiler tool will show you each statement's accumulated time. These are called statistical profilers and give you a panoramic view of what the test is doing. An example of such a profiler is rbspy:
-
Fantastic Symbols and Where to Find Them - Part 2
As a result, you need to craft a specific routine for each interpreter runtime (in some cases, each version of that runtime) to obtain symbol information. Educated eyes might have already noticed, it's not an easy undertaking considering the sheer amount of interpreted languages out there. For example, a very well known Ruby profiler, rbspy, generates code for reading internal structs of the Ruby runtime for each version.
-
How to use strace on threads managed by supervisor? i.e. i want to ´supervisorctl restart someService´ and strace someService.
For production, it depends on what the process is built with. There are possibly better tracing tools than just strace. For example Java has JMX, Go has pprof. There's also things like rbspy for Ruby.
-
Why Is JRuby Slow?
I was a bit surprised to find out that the tool used to generate the flamegraph for Ruby (rbspy [1]) is written in Rust. lol
[1] https://github.com/rbspy/rbspy
-
Profiling Rails app that uses websockets
Maybe https://github.com/rbspy/rbspy would allow you to profile the running ruby process handling web sockets?
-
How SerpApi sped up data extraction from HTML from 3s to 800ms (or How to profile and optimize Ruby code and C extension)
c function is not very helpful to find the performance problem, so we dug deeper.
Nokogiri
- Web Scraping in Python – The Complete Guide
-
Did you know Nokogiri now has opt-in HTML5 parsing?
release planning: v1.16.0 · Issue #2897 · sparklemotion/nokogiri
-
As a Go developer, I’m surprised Crystal isn’t more popular
What's holding me back from going all in with Crystal is I have a lot of pre-existing Ruby code, and porting Ruby code to Crystal can be tricky. For example, Crystal lacks an Enumerator class (aka generators) due to captured block semantics. I also wish the shards ecosystem was a little more mature; for example there's multiple HTML parsing libraries, but none have all of the features that Ruby's Nokogiri has. For new greenfield backend projects, I would totally use Crystal.
-
Two months into learning Ruby, it is the most beautiful language I ever learned
Welcome! Ruby isn't exactly "dying", but the hype/popularity is definitely fading. This is primarily because Ruby is no longer "new", most of Ruby's popularity came from Rails, and now Rails is no longer the "new hotness". However, Ruby still has lots of awesome features and lots of awesome other libraries and frameworks, such as the new fancy irb gem that uses reline, nokogiri, chunky_png, the async gems, Dragon Ruby, SciRuby, Ronin, and the new Hanami web framework.
- What should I be learning?
- Comparable maintained Kimurai alternative?
-
In "Your Name" (2016), Mitsuha and Tesshi are seen turning a tree into their makeshift café, which is why one of the trees in the town is later missing
great for hacking at xml
-
Ditch Your Version Manager
Mike has worked hard over the years to have Nokogiri come with its dependencies. It does come with libxml and all that is required.
From https://nokogiri.org
> These dependencies are met by default by Nokogiri's packaged versions of the libxml2 and libxslt source code, but a configuration option --use-system-libraries is provided to allow specification of alternative library locations.
Some authors work hard to have their tools do the right thing and consistently.
-
Web scraping with rails
If the page is rendered as html you can use Nokogiri. It has great support and is pretty easy to get started with too.
-
Nokogiri 1.12 supports HTML5 parsing (after assimilating Nokogumbo)
And even now, pulling in a Java-based HTML5 parser is still probably easier than re-implementing in FFI, which is why I created https://github.com/sparklemotion/nokogiri/issues/2227 and would love to have this the conversation there if possible.
What are some alternatives?
flamegraph - Easy flamegraphs for Rust projects and everything else, without Perl or pipes <3
Oga - Oga is an XML/HTML parser written in Ruby.
bcc - BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Ox - Ruby Optimized XML Parser
parca-agent - eBPF based always-on profiler auto-discovering targets in Kubernetes and systemd, zero code changes or restarts needed!
HTML::Pipeline - HTML processing filters and utilities
perf-map-agent - A java agent to generate method mappings to use with the linux `perf` tool
Oj - Optimized JSON
ruby-ll
ROXML - ROXML is a module for binding Ruby classes to XML. It supports custom mapping and bidirectional marshalling between Ruby and XML using annotation-style class methods, via Nokogiri or LibXML.
parca-demo - A collection of languages and frameworks profiled by Parca and Parca agent
HappyMapper - Object to XML mapping library, using Nokogiri (Fork from John Nunemaker's Happymapper)