wayback
luau
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wayback
-
Scraping Data From Past: A Step-by-Step Tutorial
In this tutorial, we will explore how to scrape data from the past using the Wayback Machine API. We'll be using Python and the requests library to make HTTP requests and retrieve archived versions of web pages. The code provided demonstrates a basic implementation of scraping historical data from a list of URLs within a specified date range.
-
Subdomain * wildcard search
A bit of context: I do a lot of archive digging with Apple's website. For the longest time, they hosted large files through Akamai. Most URLs looked something like this: http://a2032.g.akamai.net/5/2032/51/6cafb32dc21f74/1a1a1aaa2198c627970773d80669d84574a8d80d3cb12453c02589f25382f26493036bda4ebd305fd241a71b92f365ca/appleworks62_box.eps.hqx Unfortunately, those files shifted around from subdomain to subdomain (one period of time it was under a2032.g.akamai.net, another might be a1008.g.akamai.net) so finding all copies of a specific file was a pain in the ass. I recently learned that the IA has an API for the Wayback's Server that allows way more filtering than the web UI does. So to find every *.g.akamai.net URL they have archived, I used: http://web.archive.org/cdx/search/cdx?url=*.g.akamai.net/*
-
Managed to work with the Waybackmachine-API to get a backup of a much loved site
For the following step, you will need the Wayback Machine's CDX API, the documentation is here:https://github.com/internetarchive/wayback/tree/master/wayback-cdx-serverHowever, please note that there are errors in the documentation regarding the regex filtering syntax.
-
Take More Screenshots
archive.org geocities scrapes go back to 1996, so it is plausible it could have survived:
https://web.archive.org/cdx/search/cdx?url=geocities.com&mat...
If you ever remember any of the details, the CDX API can probably help.
https://github.com/internetarchive/wayback/blob/master/wayba...
- Is there any way to go further into results for pages with more than 10,000 captures?
-
Ask HN: How do RSS readers handle items missing pubDates?
Query the Internet Archive’s CDX server for this info.
https://github.com/internetarchive/wayback/blob/master/wayba...
-
Web scraping from https://web.archive.org/ (wayback machine)
Archive.org has a cdx server you can quickly request information from: https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server
-
Possible to download a file from archive.org?
Are the contents of these WARCs available as part of the Wayback Machine itself? If so, you might be able to use the CDX server to discover and download the content.
- Wayback Machine Downloader – Download an Entire Website from the Wayback Machine
-
easy way to get images off wayback machine?
Wayback API to get a list of all versions of the page (https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server#basic-usage).
luau
-
Building a baseline JIT for Lua automatically
As far as I can tell, they aren't.
http://lua-users.org/wiki/SandBoxes
There is a lot of information there, but it doesn't handle resource exhaustion, execution time limits or give any guarantees. It does indicate that it's possible, and has a decent example of the most restrictive setup, which is a good start. But I would for example compare it with Luau's SECURITY.md.
From https://github.com/luau-lang/luau/blob/master/SECURITY.md:
> Luau provides a safe sandbox that scripts can not escape from, short of vulnerabilities in custom C functions exposed by the host. This includes the virtual machine and builtin libraries. Notably this currently does not include the work-in-progress native code generation facilities.
> Any source code can not result in memory safety errors or crashes during its compilation or execution. Violations of memory safety are considered vulnerabilities.
> Note that Luau does not provide termination guarantees - some code may exhaust CPU or RAM resources on the system during compilation or execution.
So, even luau will have trouble with untrusted code, but it specifies exactly what happens and so on. I think that's fair enough.
-
Pluto, a Modern Lua Dialect
Alternatively, Luau is a well-supported Lua variant with type checking and performance improvements, aimed more towards being a sandboxed embedded scripting environment.
https://luau-lang.org/
-
Buzz: A lightweight statically typed scripting language
If you need Lua but also type-safety, how about Luau [1] then?
[1] https://luau-lang.org/
-
Lua Criticism Is Unwarranted
I had the pleasure of working with Lua 5.1 back in the late noughties. For me it's replaced Tcl whenever I want something I can configure above a C library. At the time I used it I found it quite nice but I'll also not forget the hours I wasted tracking down nil table corruptions which could have easily been caught by a type checker.
I had some hope that Luau https://luau-lang.org or Teal https://github.com/teal-language/tl would make things better but with the following example
function foo(x: number): string
-
Ask HN: Looking for platforms, other than Roblox, that have adopted Luau
Looking at other replies here, I can see I wasn't the only one who didn't realize there is Lua and Luau. Luau is an extension of Lua: https://luau-lang.org/
> Luau is syntactically backwards-compatible with Lua 5.1 (code that is valid Lua 5.1 is also valid Luau); however, we have extended the language with a set of syntactical features that make the language more familiar and ergonomic.
-
Embeddable Common Lisp 23.9.9
Lua is usually the embedded language of choice. If you are focused on security, you could check out the Roblox fork, Luau (https://github.com/Roblox/luau) where the creators took extra care to lock down the language on what scripts could do.
-
Creating a simple sandboxed language
Luau - Lua variant by Roblox
-
The Warframe Lexicon for Updates
On a side note, I've heard that they recently switched from Lua to Roblox's own fork of Lua, Luau.
-
Lua: The Little Language That Could
https://www.youtube.com/results?search_query=luau+roblox&sp=...
Luau
https://github.com/Roblox/luau
Roblox wrote a superset of Roblox Lua which is way faster
-
Scripting Resources MegaThread
https://luau-lang.org/ - some documentation, and examples https://create.roblox.com/docs - documentation, tutorials, and examples https://www.youtube.com/user/AlvinBLOX - tutorials https://www.youtube.com/@TheDevKing/videos - tutorials https://www.lua.org/manual/5.1/ - not specific to Roblox, but Lua reference manual https://www.codecademy.com/learn/learn-lua - Lua on Codecademy
What are some alternatives?
wayback-machine-spn-scripts - Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now
lua-languages - Languages that compile to Lua
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
LuaJIT - Mirror of the LuaJIT git repository
savepagenow - A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
moonsharp - An interpreter for the Lua language, written entirely in C# for the .NET, Mono, Xamarin and Unity3D platforms, including handy remote debugger facilities.
waybackpack - Download the entire Wayback Machine archive for a given URL.
lua-language-server - A language server that offers Lua language support - programmed in Lua
warrick - Recover lost websites from the Web Infrastructure
tl - The compiler for Teal, a typed dialect of Lua
wayback-machine-downloader - Download an entire website from the Wayback Machine.
moonscript - :crescent_moon: A language that compiles to Lua