parquet-format
orbstack
| | parquet-format | orbstack |
|---|---|---|
| Mentions | 4 | 36 |
| Stars | 1,655 | 4,382 |
| Growth | 2.4% | 3.5% |
| Activity | 7.2 | 6.2 |
| Last commit | 5 days ago | 6 months ago |
| Language | Thrift | Shell |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
parquet-format
- Summing columns in remote Parquet files using DuckDB
  Right, there's all sorts of metadata and often stats included in any parquet file: https://github.com/apache/parquet-format#file-format
  The offsets of that metadata are well-defined (it lives in the footer), so for S3 / blob storage, as long as you can efficiently request a range of bytes, you can pull the metadata without having to read all the data.
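As a rough illustration of the point above, here is a minimal sketch of how a reader could locate the footer metadata from the last 8 bytes of an unencrypted Parquet file (a 4-byte little-endian metadata length followed by the `PAR1` magic). The helper names `footer_byte_range` and `http_range_header` are hypothetical, and the in-memory "file" is fabricated filler rather than real Parquet data; a real client would fetch the tail and the metadata with two range requests.

```python
import struct
from typing import Tuple

PARQUET_MAGIC = b"PAR1"
TAIL_SIZE = 8  # 4-byte little-endian metadata length + 4-byte magic


def footer_byte_range(file_size: int, tail: bytes) -> Tuple[int, int]:
    """Return (start, end) offsets, end exclusive, of the file metadata.

    `tail` must be the last 8 bytes of the file.
    """
    if len(tail) != TAIL_SIZE or tail[4:] != PARQUET_MAGIC:
        raise ValueError("not the tail of an unencrypted Parquet file")
    (metadata_len,) = struct.unpack("<I", tail[:4])
    end = file_size - TAIL_SIZE
    start = end - metadata_len
    if start < len(PARQUET_MAGIC):
        raise ValueError("metadata length does not fit in the file")
    return start, end


def http_range_header(start: int, end: int) -> str:
    """Format an HTTP Range value (inclusive end) for a half-open byte range."""
    return f"bytes={start}-{end - 1}"


if __name__ == "__main__":
    # Fabricated stand-in for a file: magic, 100 bytes of "data",
    # 20 bytes of "metadata", the metadata length, and the closing magic.
    fake = b"PAR1" + b"x" * 100 + b"m" * 20 + struct.pack("<I", 20) + b"PAR1"
    start, end = footer_byte_range(len(fake), fake[-TAIL_SIZE:])
    print(http_range_header(start, end))
```

Only the tail and the metadata range are read here; the column data in between is never touched, which is what makes remote reads cheap.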
- FLaNK Stack for 4th of July
- I have question related to Parquet files and AWS Glue
  As I read here https://github.com/apache/parquet-format/blob/master/LogicalTypes.md , they are stored as integers, and these integers represent the number of days (for Date) or the number of milliseconds, microseconds or nanoseconds (for DateTime) since 1970-01-01. This works as expected with the parquet files written by our ETL tool from an internal database --> S3: all Date/DateTime columns are integers, which means that in the Glue Job I have to convert these integers back to Date/DateTime values to do some transformations on them. But when parquet files are written by Spark, they are in Date/DateTime (or Timestamp, to be more precise) format, not integers (I checked by reading these files again in another Glue Job), and that confuses me.
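The integer encoding described above can be decoded with a few lines of standard-library Python. This is a minimal sketch, not what Glue or Spark do internally: the function names are hypothetical, and it assumes the timestamp values are UTC-adjusted. Nanosecond values are truncated because Python's `datetime` only has microsecond precision.

```python
from datetime import date, datetime, timedelta, timezone

EPOCH_DATE = date(1970, 1, 1)
EPOCH_UTC = datetime(1970, 1, 1, tzinfo=timezone.utc)


def days_to_date(days: int) -> date:
    """Decode a DATE value stored as days since 1970-01-01."""
    return EPOCH_DATE + timedelta(days=days)


def timestamp_to_datetime(value: int, unit: str) -> datetime:
    """Decode a TIMESTAMP value stored as an integer count of `unit`.

    `unit` is one of "MILLIS", "MICROS" or "NANOS".
    Assumes the value is UTC-adjusted (an assumption of this sketch).
    """
    if unit == "MILLIS":
        micros = value * 1000
    elif unit == "MICROS":
        micros = value
    elif unit == "NANOS":
        micros = value // 1000  # datetime cannot represent nanoseconds
    else:
        raise ValueError(f"unknown unit: {unit}")
    return EPOCH_UTC + timedelta(microseconds=micros)


if __name__ == "__main__":
    print(days_to_date(19723))
    print(timestamp_to_datetime(1704067200000, "MILLIS"))
```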
- Parquet: More than just “Turbo CSV”
  Date is confusing with a timezone (UTC or otherwise) and the doco makes no such suggestion.
  The Parquet datatypes documentation is pretty clear that there is a flag isAdjustedToUTC to define if the timestamp should be interpreted as having Instant semantics or Local semantics.
  https://github.com/apache/parquet-format/blob/master/Logical...
  Still no option to include a TZ offset in the data (so the same datum can be interpreted with both Local and Instant semantics) but not bad really.
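To make the Instant versus Local distinction concrete, the sketch below interprets the same stored integer both ways. The function name `interpret_timestamp_micros` is hypothetical, and mapping Instant semantics to a timezone-aware `datetime` and Local semantics to a naive one is a modelling choice made for this example, not something the specification prescribes.

```python
from datetime import datetime, timedelta, timezone


def interpret_timestamp_micros(micros: int, is_adjusted_to_utc: bool) -> datetime:
    """Interpret microseconds since the epoch according to isAdjustedToUTC."""
    delta = timedelta(microseconds=micros)
    if is_adjusted_to_utc:
        # Instant semantics: a fixed point on the UTC timeline.
        return datetime(1970, 1, 1, tzinfo=timezone.utc) + delta
    # Local semantics: a wall-clock reading with no zone attached.
    return datetime(1970, 1, 1) + delta


if __name__ == "__main__":
    raw = 1704067200000000
    print(interpret_timestamp_micros(raw, True))
    print(interpret_timestamp_micros(raw, False))
```

Both results show the same calendar fields; only the aware value can be compared across time zones, which is why the flag matters when files come from different writers.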
orbstack
- Show HN: OpenOrb, a curated search engine for Atom and RSS feeds
  For a brief moment, I thought this was related to https://orbstack.dev
- Ask HN: Tips to get started on my own server
  If you use a Mac and just want to mess around with Linux, try something like Orbstack (https://orbstack.dev/) to start up VMs and experiment. The benefit of this is you're going to break things a bunch as you get started. From there, I'd start looking at automating the deployment of the various components the 'old fashioned' way, i.e. writing shell scripts and using SSH. Once you've done that, move on to tools like Ansible or Terraform.
- Orbstack can destroy your Time Machine backups
- NoSQL Postgres: Add MongoDB compatibility to your Supabase projects with FerretDB
  FerretDB provides a Docker image allowing us to run it locally, for example via Orbstack, with a couple of simple commands.
- Ask HN: Who is hiring? (February 2024)
  OrbStack | Founding Engineer | US/Europe REMOTE | Full-time | https://orbstack.dev
  OrbStack is making Docker containers & development environments delightful. Our app replaces Docker Desktop and makes containers faster, lighter, and easier to work with. It's the tool of choice for PlanetScale, Replicate, and other hot companies.
  Containers should be a joy to use, not something you have to put up with. Let's build the future of dev envs.
  As a founding engineer, you'll mainly work on breaking high-level ideas down into tough systems problems, solving them, and taking ownership of projects. If https://cpu.land and https://docs.orbstack.dev/architecture excite you, you'll be right in place.
  Email: jobs orbstack dev
- How Virtualisation came to Apple Silicon Macs
  Before you give up, give OrbStack a try: https://orbstack.dev/
  It’s significantly faster than Docker and some users in the Discord community have been able to use it to run hand-built Linux x86 VMs on Apple Silicon.
  It’s a paid product, but you can download it for free and try it out before paying.
- Install Craft CMS v5 (alpha) with one command via DDEV
  If you haven't installed a Docker runtime, you might be happy with Orbstack. Other alternatives are listed in the DDEV docs: Docker installation.
- Windows is now an app for iPhones, iPads, Macs, and PCs
- Podman Desktop v1.5 with Compose onboarding and enhanced Kubernetes pod data
  For macOS I can really recommend https://orbstack.dev
  It integrates very nicely, has very low idle CPU usage and also lets you quickly spawn VMs with bidirectional file sharing set up.
  Since I switched I haven't looked back.
- Any idea what this icon is in the menu bar on my Mac? I've got the spinning beach ball every time I hover over it and it's been like that for weeks. What is it and what process do I need to kill?
What are some alternatives?
rapidgzip - Gzip Decompression and Random Access for Modern Multi-Core Machines
colima - Container runtimes on macOS (and Linux) with minimal setup
xgen - Salesforce open-source LLMs with 8k sequence length.
Podman Desktop - A graphical tool for developing on containers and Kubernetes
wizmap - Explore and interpret large embeddings in your browser with interactive visualization! 📍
UTM - Virtual machines for iOS and macOS
FastSAM - Fast Segment Anything
multipass - Multipass orchestrates virtual Ubuntu instances
background-removal-js - Remove backgrounds from images directly in the browser environment with ease and no additional costs or privacy concerns. Explore an interactive demo.
lima - Linux virtual machines, with a focus on running containers
graphic-walker - An open source alternative to Tableau. Embeddable visual analytic
Proxyman - Modern. Native. Delightful Web Debugging Proxy for macOS, iOS, and Android ⚡️