open-data
sourcegraph
open-data | sourcegraph | |
---|---|---|
25 | 69 | |
2,231 | 9,764 | |
1.4% | 1.4% | |
0.0 | 10.0 | |
3 days ago | 5 days ago | |
Go | ||
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
open-data
- How to practice data analytics skills
-
[OptaJoe]2009 - Arsenal have won a Premier League game they were losing at half-time outside of London for the first time since December 2009 (2-1 at Liverpool). Temperament.
You can check statsbomb open data but you will to preprocess it from json to sql. They have great course and articles about analyzing the data. Another good reading is awasome-football . They provide list of resources to get data. But the most comprehensive and recommended resources eddwebster's guide. He worked for city football group and his repository updated frequently.
-
Enzo Fernández Progressive Passes - World Cup 2022
I tried visualising Enzo's progressive passes in each of his world cup matches. I used the data available on StatsBomb for this.
-
Football (soccer) player statistics - looking for free databases
https://www.football-data.org/coverage https://datahub.io/collections/football https://github.com/statsbomb/open-data https://www.kaggle.com/datasets/hugomathien/soccer https://www.kaggle.com/datasets/martj42/international-football-results-from-1872-to-2017 https://www.kaggle.com/datasets/secareanualin/football-events https://www.kaggle.com/datasets/adityadesai13/european-football-database-20192020 https://www.kaggle.com/datasets/vivovinco/20212022-football-player-stats https://www.kaggle.com/datasets/antoinekrajnc/soccer-players-statistics
-
Ask HN: Who is hiring? (September 2022)
StatsBomb | Multiple roles | REMOTE, or Bath (UK), or Cairo (Egypt)
StatsBomb is a sports analytics startup, covering football (both the soccer and American varieties) and soon basketball. We sell data products as well as analysis tools to sports, media and gambling organisations, with a tech pipeline that includes computer vision, machine learning, stream processing, and web-based dataviz. We count many of the biggest names in football as customers, and your work will have a direct impact on our ability to deliver insights to those customers, driving success on the field.
We're hiring software engineers of various stripes (data pipeline roles with Python and Clojure, full-stack web dev roles with JavaScript) and more besides. We're fully remote, but have offices in Bath, UK and Cairo, Egypt for those that want them. We organise regular team days and also run our own industry-leading conference each year.
- Apply at: https://statsbomb.com/careers
If you'd like to find out more about football analytics:
- Play with our open data: https://github.com/statsbomb/open-data
- Read our articles: https://statsbomb.com/articles/
- Browse our conference videos: https://www.youtube.com/channel/UCmZ2ArreL9muPvH49Gaw0Bw
-
[OC] Football Wind ⚽️💨 A wind map visualisation of a typical football game. Each particle is following a force field built from the aggregation of 882,536 passes from 890 matches played in various major leagues/cups.
The data source providing all the passes is from StatBomb
-
🏆 TAA vs the u23 world: progressive passes/90 & xA/90
If you're familiar with GitHub and JSON then https://github.com/statsbomb/open-data looks decent.
-
Looking for football (soccer) granular datasets
The company StatsBomb, which specializes in football analytics, has made a lot of their data available for public use here: https://github.com/statsbomb/open-data I’ve been playing with it recently and I’ve found it to be pretty great.
-
[OC] Lionel Messi's shots and goals with Barcelona during his record-breaking 2011/2012 season, compared to his attempts in the 2014 and 2018 World Cups with Argentina
Messi has routinely been one of the best performers in European soccer, including his record-breaking 2011-2012 season in the Spanish league (“La Liga”) with Barcelona, where he set the record for most goals in a season. Unfortunately, success with the Argentina national team has frequently eluded him, finishing as a “runner-up” in the World Cup once and in the Copa America 3 times, before finally winning the Copa America in 2021. Critics often point to his difficulties with his national team as a fatal flaw. I was interested in how his scoring opportunities during arguably his best performance at Barcelona compared to his chances made with Argentina. The data suggests that he is regularly shooting from further away from goal when playing with Argentina when compared to his best performance with Barcelona, which could be a result of a number of factors (different team tactics, difficulty getting up the field, increasing age, less familiarity with teammates, etc.). Data: 2011/2012 La Liga season and World Cup 2018 data were collected from the very nice, public datasets provided by StatsBomb at https://github.com/statsbomb/open-data. The World Cup 2014 data was a bit more difficult to find, but was scraped from the Huffington Post . The StatsBomb data has a ton of great stats to dig into, but because the Huffington Post data had less detail, I wasn't able to go into all of it with just this plot.
-
xG stats for individual shots.
I think Statsbomb has a free API you can use on Github if you request access. https://github.com/statsbomb/open-data
sourcegraph
-
Ask HN: Who is hiring? (March 2024)
Sourcegraph | REMOTE | Full-Time | Machine Learning Engineer, Developer Advocate, Enterprise Product Manager, Technical Advisor | https://sourcegraph.com
Sourcegraph is a code AI platform that makes it easy to read, write, and fix code–even in big, complex codebases.
We are building Cody, an AI coding assistant that uses code search and code intelligence to help devs quickly understand what's happening in code and generate new code that matches the best practices in your codebase. Cody supports AI-enabled autocompletion, fixing bugs, refactoring, test generation, code explanation, and answering high-level questions. You can read Steve Yegge's post on why Cody's code context engine differentiates it from the fast-moving field of AI dev tools: https://about.sourcegraph.com/blog/cheating-is-all-you-need.
Apply here: https://grnh.se/0572f98b4us
-
Architecture.md (2021)
That's pretty much what https://sourcegraph.com/ are selling, is it not?
-
Tell HN: GitHub is blocking search unless you are logged in
Despite their shitty rug-pull <https://github.com/sourcegraph/sourcegraph/pull/53345>, I do really like Sourcegraph and one doesn't (currently?!) need to be logged in to use it: https://sourcegraph.com/search and they have a handy rewrite pattern such that one can just plug the repo path into the URL for quick searching e.g. https://sourcegraph.com/github.com/JetBrains/intellij-commun...
-
My 2024 AI Predictions
- https://sourcegraph.com is pivoting and building a copilot application (named Cody). This is pretty good, since sourcegraph is great at understanding your code
-
The Curse of Docker
While a readable Dockerfile can work as documentation, there are a few caveats:
* the application needs to be designed to work outside containers (so, no hardcoded URLs, ports, or paths). Also, not directly related to containers, but it's nice if it can be easily compiled in most environments and not just on the base image.
* I still need a way to notify me of updates; if the Dockerfile just wgets a binary, this doesn't help me.
* The Dockerfiles need to be easy to find. Sourcegraph's don't seem to be referenced from the documentation, I had to look through their Github repos to find https://github.com/sourcegraph/sourcegraph/tree/main/docker-... (though most are bazel scripts instead of Dockerfiles, but serve the same purpose)
-
Building Reddit’s Design System on iOS
We use Sourcegraph, which is a tool that searches through code in repositories. We leverage this tool in order to understand the adoption curve of our components across all of Reddit. We have a dashboard for each of the platforms to compare the inclusion of RPL components over legacy components. These insights are helpful for us to make informed decisions on how we continue to drive RPL adoption. We love seeing the green line go up and the red line go down!
-
Launch HN: GitStart (YC S19) – Remote junior devs working on production PRs
SourceGraph: https://github.com/sourcegraph/sourcegraph/pulls?q=is%3Apr+a...
- Sourcegraph is no longer Open Source
What are some alternatives?
opendata - SkillCorner Open Data with 9 matches of broadcast tracking data.
opengrok - OpenGrok is a fast and usable source code search and cross reference engine, written in Java
geometry-api-java - The Esri Geometry API for Java enables developers to write custom applications for analysis of spatial data. This API is used in the Esri GIS Tools for Hadoop and other 3rd-party data processing solutions.
tree-sitter - An incremental parsing system for programming tools
sample-data - Metrica Sports sample tracking and event data
Code-Server - VS Code in the browser
football_analytics - 📊⚽ A collection of football analytics projects, data, and analysis by Edd Webster (@eddwebster), including a curated list of publicly available resources published by the football analytics community.
theia-apps - Theia applications examples - docker images, desktop apps, packagings
nba-movement-data - SportVU movement tracking data.
Vue Storefront - Alokai is a Frontend as a Service solution that simplifies composable commerce. It connects all the technologies needed to build and deploy fast & scalable ecommerce frontends. It guides merchants to deliver exceptional customer experiences quickly and easily.
geomesa - GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
Atheos - A self-hosted browser-based cloud IDE, updated from Codiad IDE