| | codesearch.ai | mwmbl |
|---|---|---|
| Mentions | 9 | 27 |
| Stars | 33 | 1,372 |
| Growth | - | 1.7% |
| Activity | 0.6 | 9.4 |
| Last commit | about 1 year ago | 11 days ago |
| Language | Go | Python |
| License | Apache License 2.0 | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
codesearch.ai
-
Show HN: Ichido, search engine that tags sites using Google and Cloudflare
https://codesearch.ai/
-
Show HN: Feep search, an independent search engine for programmers
- Brave (recently started its own index but often falls back on Google's)
Love to see projects like Marginalia and now this. These projects also make meta search engines like Searx[0] that much more powerful.
Anyways since I'm in the business of listing out relevant projects, other code-centered search engines you might wanna check out are searchcode.com[1], codesearch.ai[2], symbolhound[3], and publicwww.com[4] (some of these are often down, but might still be good to learn from)
[0] https://searx.tuxcloud.net/
[1] https://searchcode.com/
[2] https://codesearch.ai/
[3] http://symbolhound.com/
[4] https://publicwww.com/
-
[P] Semantic code search using Transformers - codesearch.ai
Hey, I'm Rok, a software engineer at Sourcegraph, and I've been working on an experimental AI-powered code search engine called codesearch.ai as a side project. It answers natural language queries with functions indexed from GitHub.com and StackOverflow.
-
Ask HN: Are there any decent GitHub Copilot Alternatives?
It's not a pure Copilot alternative, but I'd put https://codesearch.ai into the mix (disclaimer, it's my side project).
It is a semantic code search tool that can be queried using natural language. It provides decent answers to a variety of questions, and I've been finding myself using it quite often to "autocomplete" various mundane tasks. For example, plotting with matplotlib, making HTTP requests in Go, running multiple goroutines, etc. - things where I would usually reach for Google. It doesn't provide a straightforward ready-to-run answer like Copilot, but it does provide a way to help yourself. It all depends on what you prefer and how you learn. Arguably, having to read the code before you use it makes it more likely it will stick in your brain.
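As a rough illustration of the idea in the comment above (ranking indexed functions against a natural language query), here is a minimal sketch. It is not how codesearch.ai is implemented: the `embed` function is a toy bag-of-words stand-in for a learned encoder, and the `SNIPPETS` index and all function names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical index: function name -> natural language description.
SNIPPETS = {
    "plot_line": "plot a line chart with matplotlib",
    "http_get": "make an http get request in go",
    "run_goroutines": "run multiple goroutines and wait for them",
}

def embed(text):
    """Toy stand-in for a learned encoder: a bag-of-words count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def search(query, snippets, top_k=3):
    """Return up to top_k snippet names, best match first, dropping zero scores."""
    q = embed(query)
    scored = [(cosine(q, embed(desc)), name) for name, desc in snippets.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for score, name in scored[:top_k] if score > 0]

print(search("make http request", SNIPPETS))  # ['http_get']
```

A real semantic search engine would replace `embed` with a trained model so that queries and code can match even when they share no words.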
-
Contrastive Representation Learning
Great read, thanks for sharing. Would love to see the natural language + code mixed in there :)
I've been interested in contrastive learning for a while, mainly as a means to train semantic code search models. OpenAI released a great paper on this topic called Text and Code Embeddings by Contrastive Pre-Training[1] that outlines the approach. I've used it as a base to build https://codesearch.ai [2] with pretty good results.
[1] https://arxiv.org/pdf/2201.10005.pdf
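For readers unfamiliar with contrastive training, the sketch below shows a generic in-batch contrastive loss over paired text and code vectors: each text's matching code vector is the positive, and the other code vectors in the batch act as negatives. It is a simplified, one-directional illustration in plain Python, not the exact objective of the paper or of codesearch.ai; the function names and the `temperature` value are assumptions.

```python
import math

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def normalize(v):
    """Scale a vector to unit length so dot products are cosine similarities."""
    n = math.sqrt(dot(v, v))
    return [x / n for x in v]

def in_batch_contrastive_loss(text_vecs, code_vecs, temperature=0.1):
    """Mean cross-entropy where text i should pick code i out of the batch."""
    texts = [normalize(v) for v in text_vecs]
    codes = [normalize(v) for v in code_vecs]
    total = 0.0
    for i, t in enumerate(texts):
        logits = [dot(t, c) / temperature for c in codes]
        # Numerically stable log-sum-exp over all candidates in the batch.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(texts)

aligned = [[1.0, 0.0], [0.0, 1.0]]
swapped = [[0.0, 1.0], [1.0, 0.0]]
print(in_batch_contrastive_loss(aligned, aligned))  # near zero: pairs match
print(in_batch_contrastive_loss(aligned, swapped))  # large: pairs mismatched
```

Minimizing this loss pulls matching text and code embeddings together and pushes mismatched ones apart, which is what makes nearest-neighbor lookup usable as search.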
-
A semantic code search engine built using PyTorch and Hugging Face - codesearch.ai
It looks like the UI for https://codesearch.ai/ was developed in [some combination of Go and React JavaScript](https://github.com/sourcegraph/codesearch.ai/tree/main/codesearch-ai-data/cmd/web)?
mwmbl
- FLaNK Stack Weekly 19 Feb 2024
-
Text Processing Practice Expt: 27 SERP Types to SQLite (Yy084)
echo "https://mwmbl.org/?q=$x"|client 185.34.32.175
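The shell fragment above interpolates the query into the URL as-is, so terms containing spaces or reserved characters would need encoding first. A minimal Python sketch of building the same URL safely follows; only the `https://mwmbl.org/?q=` pattern comes from the snippet, and the function name is hypothetical.

```python
from urllib.parse import urlencode

def mwmbl_search_url(query):
    """Build a mwmbl search URL with the query safely encoded."""
    return "https://mwmbl.org/?" + urlencode({"q": query})

print(mwmbl_search_url("rust web framework"))
# https://mwmbl.org/?q=rust+web+framework
```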
-
How bad are search results? Compare Google, Bing, Marginalia, Kagi, and ChatGPT
Ironically I had to use a search engine to discover what "Mwmbl" was. It's apparently a search engine. But, visiting the front page, I see something akin to a git commit log?! I'm not sure I'd have guessed that this was a SE if Brave Search did not tell me it was (even then I'm not convinced yet).
https://mwmbl.org/
-
Indexing a Billion Pages
I believe this is closer to the thing you were asking about, and the simple answer appears to be "a home grown one in python" https://github.com/mwmbl/mwmbl/blob/e544d45c374c13cdc1a5048d...
- Welcome to mwmbl, the free, open-source and non-profit search engine
- Marginalia.nu API
- Show HN: Ichido, search engine that tags sites using Google and Cloudflare
- Introduction!
- Mwmbl, the free, open-source and non-profit search engine
What are some alternatives?
devdocs - API Documentation Browser
Lobsters - Computing-focused community centered around link aggregation and discussion
Goopt - 🔍 Search Engine for a Procedural Simulation of the Web with GPT-3.
whoogle-search - A self-hosted, ad-free, privacy-respecting metasearch engine
pldb - PLDB: a Programming Language Database. A computable encyclopedia about programming languages.
PiTheremin
code-search-blocklist - A list of domains hosting scraped code snippets and polluting search results to block.
ublock-origin-shitty-copies-filter - Filter for DuckDuckGo and Google to remove those spam-websites that just blatantly copy and paste content from well known websites.
ublacklist - Blocks specific sites from appearing in Google search results
bertsearch - Elasticsearch with BERT for advanced document search.
searx - Privacy-respecting metasearch engine [Moved to: https://github.com/searx/searx]
MarginaliaSearch - Internet search engine for text-oriented websites. Indexing the small, old and weird web.