hate-speech-project
| | code-align-evals-data | hate-speech-project |
|---|---|---|
| Mentions | 2 | 1 |
| Stars | 24 | 6 |
| Growth | - | - |
| Activity | 10.0 | 10.0 |
| Last Commit | almost 3 years ago | over 1 year ago |
| Language | Python | Python |
| License | MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
code-align-evals-data
Replit's new Code LLM was trained in 1 week
> deduplication. We first split the files into words/tokens based on non-alphanumeric characters and remove files with fewer than 10 tokens. Next, we compute the MinHash with 256 permutations of all documents, and use Locality Sensitive Hashing to find clusters of duplicates. We further reduce these clusters by ensuring that each file in the original cluster is similar to at least one other file in the reduced cluster. We consider two files similar when their Jaccard similarity exceeds 0.85.
Near-duplicates are still difficult to measure, so we should expect some duplication to remain, roughly in proportion to the number of samples we have (and I'd wager the variance grows with larger duplicate clusters).
[0] https://github.com/openai/code-align-evals-data/tree/97446d9...
[1] https://arxiv.org/abs/2211.15533
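The deduplication pipeline quoted above can be sketched in a few lines. This is a toy illustration, not the paper's implementation: I use seed-salted MD5 hashes in place of true random permutations, and I skip the LSH banding step that production pipelines (e.g. via a library like `datasketch`) use to avoid all-pairs comparison.

```python
import hashlib
import re

def tokenize(text):
    # Split into words/tokens on non-alphanumeric characters, as in the quote.
    return {t for t in re.split(r"[^0-9A-Za-z]+", text) if t}

def minhash_signature(token_set, num_perm=256):
    # Toy MinHash: a seed-salted MD5 stands in for each random permutation;
    # the signature keeps the minimum hash per "permutation".
    return [
        min(int(hashlib.md5(f"{seed}:{tok}".encode()).hexdigest(), 16)
            for tok in token_set)
        for seed in range(num_perm)
    ]

def estimated_jaccard(sig_a, sig_b):
    # The fraction of matching signature slots estimates Jaccard similarity.
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)

a = tokenize("def mad(numbers): return sum(abs(x - mean) for x in numbers)")
b = tokenize("def mad(nums): return sum(abs(x - mean) for x in nums)")
sim = estimated_jaccard(minhash_signature(a), minhash_signature(b))
# Pairs whose estimated similarity exceeds the 0.85 threshold would be
# collapsed as near-duplicates.
print(round(sim, 2))
```

The two snippets differ by one token, so their true Jaccard similarity is about 0.82; with 256 permutations the estimate lands close to that, just under the paper's 0.85 cutoff.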
hate-speech-project
Replit's new Code LLM was trained in 1 week
My favorite line from the HumanEval paper
> It is important for these tasks to be hand-written, since our models are trained on a large fraction of GitHub, which already contains solutions to problems from a variety of sources.
So to answer your question, yes, the evaluation dataset is spoiled. You can find such unique and never before seen docstrings like
> For a given list of input numbers calculate the Mean Absolute Deviation around the mean of this dataset. Mean Absolute Deviation is the absolute difference between each element and a centerpoint (mean in this case)[0]
And here's a repo I found that is 8 years old[1]. But how about a more recent one that is even closer?[2] There are plenty more examples[3] (does anyone know how to actually limit the date to prior to 2021? `pushed:<2021` doesn't work, nor does using the `created` keyword. Date searching doesn't seem to work well).
[0] https://github.com/openai/code-align-evals-data/blob/97446d9...
[1] https://github.com/bertomartin/stat4701/blob/ec2b64f629cbbf6...
[2] https://github.com/danielwatson6/hate-speech-project/blob/64...
[3] https://github.com/search?q=abs%28x+-+mean%29+for+language%3...
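The quoted docstring amounts to only a few lines of Python, which is part of why near-identical solutions are so easy to find on GitHub. A minimal sketch (the function name is mine, not necessarily the benchmark's):

```python
def mean_absolute_deviation(numbers):
    # Mean Absolute Deviation: the average absolute difference between
    # each element and the mean (the "centerpoint" in the docstring).
    mean = sum(numbers) / len(numbers)
    return sum(abs(x - mean) for x in numbers) / len(numbers)

print(mean_absolute_deviation([1.0, 2.0, 3.0, 4.0]))  # → 1.0
```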
What are some alternatives?
stat4701 - Final Project
IF
ReplitLM - Inference code and configs for the ReplitLM model family
trax - Trax — Deep Learning with Clear Code and Speed
fauxpilot - FauxPilot - an open-source alternative to GitHub Copilot server
mation-spec
hn-search - Hacker News Search
text-generation-webui - A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.