incubation-engineering
hosts
incubation-engineering | hosts | |
---|---|---|
18 | 306 | |
- | 25,494 | |
- | - | |
- | 9.4 | |
- | 5 days ago | |
Python | ||
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
incubation-engineering
-
Why Postgres RDS didn't work for us
However if you really want to optimize data currently residing in Postgres for analytical workloads, as the original comment suggests - consider moving to a dedicated OLAP DB like ClickHouse.
See results from Gitlab benchmarking ClickHouse vs TimescaleDB: https://gitlab.com/gitlab-org/incubation-engineering/apm/apm...
Key findings:
-
Automating Your Homelab with Proxmox, Cloud-init, Terraform, and Ansible
ansible: stage: configure image: alpine rules: - if: $ANSIBLE_SETUP_VM != "" && $ANSIBLE_SETUP_HOST != "" variables: ANSIBLE_HOST_KEY_CHECKING: "False" script: - apk add curl bash openssh python3 py3-pip - pip3 install ansible paramiko - ansible-galaxy collection install -r ansible/requirements.yml - curl --silent "https://gitlab.com/gitlab-org/incubation-engineering/mobile-devops/download-secure-files/-/raw/main/installer" | bash - mkdir /root/.ssh && cp .secure_files/ansible.priv /root/.ssh/id_rsa && chmod 600 /root/.ssh/id_rsa - ansible-playbook ansible/main.yml -i ansible/inventory --extra-vars vyos_host=$ANSIBLE_SETUP_VM --limit $ANSIBLE_SETUP_HOST,$ANSIBLE_SETUP_VM ```
-
Float Compression 3: Filters
Interesting to match with the observations from the practice of using ClickHouse[1][2] for time series:
1. Reordering to SOA helps a lot - this is the whole point of column-oriented databases.
2. Specialized codecs like Gorilla[3], DoubleDelta[4], and FPC[5] lose to simply using ZSTD[6] compression in most cases, both in compression ratio and in performance.
3. Specialized time-series DBMS like InfluxDB or TimescaleDB lose to general-purpose relational OLAP DBMS like ClickHouse [7][8][9].
[1] https://clickhouse.com/blog/optimize-clickhouse-codecs-compr...
[2] https://github.com/ClickHouse/ClickHouse
[3] https://clickhouse.com/docs/en/sql-reference/statements/crea...
[4] https://clickhouse.com/docs/en/sql-reference/statements/crea...
[5] https://clickhouse.com/docs/en/sql-reference/statements/crea...
[6] https://github.com/facebook/zstd/
[7] https://arxiv.org/pdf/2204.09795.pdf "SciTS: A Benchmark for Time-Series Databases in Scientific Experiments and Industrial Internet of Things" (2022)
[8] https://gitlab.com/gitlab-org/incubation-engineering/apm/apm... https://gitlab.com/gitlab-org/incubation-engineering/apm/apm...
[9] https://www.sciencedirect.com/science/article/pii/S187705091...
- ClickHouse Cloud is now in Public Beta
-
Dokter 1.4.0 released
Documentation of rules is now available: https://gitlab.com/gitlab-org/incubation-engineering/ai-assist/dokter/-/blob/main/docs/overview.md
- Dokter: the doctor for your Dockerfiles
hosts
-
Does PiHole block porn?
Not by default but a blocklist can be found here https://github.com/StevenBlack/hosts
-
Steven Black DNS blocklist blocked gstatic.com
While it is now unblocked, the Steven Black list has been blocking a lot of innocent CDNs.
jQuery: https://github.com/StevenBlack/hosts/issues/2520
-
Open Source Ad Blocker for Mac, Windows, and Linux
How does this compare to using a hosts file with known ad servers?
like: https://github.com/StevenBlack/hosts
-
Show HN: YouTube banned adblockers so I built an extension to skip their ads
I use the Hosts file to block a ton of ads and that works really well. https://github.com/StevenBlack/hosts Something worth considering if your ad blocker isn't working well.
-
Big things are happening with RaspAP's Ad Blocking 🛑 Users will soon have more blocklist sources to choose from
The no-tracking project used by RaspAP is shutting down, so we took the opportunity to search for open source blocklist alternatives. Among the best is Steven Black's hosts list: https://github.com/StevenBlack/hosts
-
Radar Maps: $0.50 per 1K map loads
No idea, api.radar.io is on the block list since January 2020.
The commit's comment is "major update from adaway.org"
https://github.com/stevenblack/hosts/commit/4fa0470
-
Browser extensions spy on you, even if its developers don't
You can also use a declarative adblocker like uBlock Origin Lite [1], which only provides the browser with a list of elements to filter, but doesn't have any permissions to read content or perform requests. Or simply use your hosts file to apply OS-wide filtering with no browser add-ons needed: https://github.com/StevenBlack/hosts
Be aware that if you use these "passive" blocking methods, there are some sites like YouTube where you will see ads, because in these cases it's necessary to actually manipulate page content to hide them. What you can do is use a traditional adblocker but enable it only for these few sites where the declarative approach is not enough, take a look at [2] for more details.
[1] https://github.com/uBlockOrigin/uBOL-home
[2] https://seirdy.one/posts/2022/06/04/layered-content-blocking...
-
I installed Firefox + uBlock Origin like everyone suggested in my previous post, but this pop-up still appears, now with a 5 sec timer.
https://github.com/StevenBlack/hosts if you want to do it on your PC.
- “We have nothing to do with ads ” (2021)
-
[Paid Release]CCAdsBeGone - Customized Ads Blocking At Your Fingertips
When I select my custom hosts file, it basically breaks internet. However, if I choose a custom hosts file that is a copy of the dev's default, or if I just add a few lines to it, it will work. If I add too many lines, or use a different hosts file altogether (like the ones recommended by the dev), all connectivity breaks. Of course the latest official LetMeBlock is installed and mDNSResponder killed/restarted. I'm using Dopamine on A12+.
What are some alternatives?
hadolint - Dockerfile linter, validate inline bash, written in Haskell
blitz-app-adblock - Simple and quick patcher that blocks ads/trackers on the Blitz.gg desktop application.
ploomber - The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
shallalist - DISCONTINUED!!! - Unpacked ShallaList Repo
orchest - Build data pipelines, the easy way 🛠️
uBlock - uBlock Origin - An efficient blocker for Chromium and Firefox. Fast and lean.
v4
easylist - EasyList filter subscription (EasyList, EasyPrivacy, EasyList Cookie, Fanboy's Social/Annoyances/Notifications Blocking List)
ClickBench - ClickBench: a Benchmark For Analytical Databases
Pi-hole - A black hole for Internet advertisements
databooks - A CLI tool to reduce the friction between data scientists by reducing git conflicts removing notebook metadata and gracefully resolving git conflicts.
hosts-blocklists - Automatically updated, moderated and optimized lists for blocking ads, trackers, malware and other garbage