[D] New to DNABERT

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

  • DNABERT

    DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

  • To get started, the instructions say pre-training is optional (so you can skip to step 3). This is where I got tripped up: "Note that the sequences are in kmer format, so you will need to convert your sequences into that." From what I understand, you need to do this so that all of the sequences are the same length? So does kmer=6 mean that every sequence has length 6? Someone suggested taking the first nucleotide in the promoter and grabbing the 3 nucleotides before and the 3 nucleotides after it (+/-3 bases), but I don't think that's how the k-mer format works. I tried replicating how I think it works down below (I got confused on the last row of the 'after' df). Please correct me if I'm wrong!
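
To make the k-mer question concrete, here is a minimal sketch of how the conversion is usually understood: a window of width k slides along the sequence one base at a time, so a sequence of length L becomes L - k + 1 overlapping k-mer "words" separated by spaces. It does not trim every sequence down to k bases. The function name `seq_to_kmers` and the example sequence are made up for illustration, and the stride of 1 is an assumption that should be checked against the example data shipped with the DNABERT repository.

```python
def seq_to_kmers(seq: str, k: int = 6) -> str:
    """Split a DNA sequence into overlapping k-mers (stride 1), space-separated.

    Hypothetical helper for illustration; not copied from the DNABERT code base.
    """
    seq = seq.upper()
    if len(seq) < k:
        # Too short to yield even one k-mer.
        return ""
    # Window i covers bases i .. i+k-1; there are len(seq) - k + 1 windows.
    return " ".join(seq[i:i + k] for i in range(len(seq) - k + 1))


example = "ATGGCTAGCTA"  # 11 bases, invented for this example
kmers = seq_to_kmers(example, k=6)
print(kmers)
# ATGGCT TGGCTA GGCTAG GCTAGC CTAGCT TAGCTA
print(len(kmers.split()))
# 6  (= 11 - 6 + 1)
```

Under this reading, kmer=6 sets the length of each token, not the length of the input: a 100-base promoter region would become 95 tokens of 6 bases each, and sequences of different lengths simply produce different numbers of tokens.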

