Algorithms for division: Using Newton's method

JetBrains - Tell us how you use coding tools. You may win a prize!

Are you a developer or a data analyst? Share your thoughts about your coding tools in our short survey and get a chance to win prizes!

surveys.jetbrains.com

featured

InfluxDB – Built for High-Performance Time Series Workloads

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

www.influxdata.com

featured

DirectXMath

1 16 1,700 7.7 C++

DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps

Good article, but note that if the hardware supports the division instruction, will be much faster than the described workarounds.
Personally, I recently did what’s written in 2 cases: FP32 division on ARMv7, and FP64 division on GPUs who don’t support that instruction.
For ARM CPUs, not only they have FRECPE, they also have FRECPS for the iteration step. An example there: https://github.com/microsoft/DirectXMath/blob/jan2021/Inc/Di...
For GPUs, Microsoft classified FP64 division as “extended double shader instruction” and the support is optional. However, GPUs are guaranteed to support FP32 division. The result of FP32 division provides an awesome starting point for Newton-Raphson refinement in FP64 precision.
JetBrains

surveys.jetbrains.com featured

Tell us how you use coding tools. You may win a prize! Are you a developer or a data analyst? Share your thoughts about your coding tools in our short survey and get a chance to win prizes!

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

A 94x speed improvement demonstrated using handwritten assembly

1 project | news.ycombinator.com | 4 Nov 2024
C++ exception performance three years later

1 project | news.ycombinator.com | 13 Dec 2024
I need help to make SSE Display Tweaks work.

1 project | /r/skyrimmods | 9 Oct 2023
Need help converting wav files to XWB without the wav file names being changed

1 project | /r/Modding | 2 Jul 2023
Flight Mechanics

1 project | /r/Starfield | 20 Jun 2023

Algorithms for division: Using Newton's method

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
hardware-buttons linkedin-bot template-engine-js
Post date: 8 Dec 2021

DirectXMath

JetBrains

Related posts

A 94x speed improvement demonstrated using handwritten assembly

C++ exception performance three years later

I need help to make SSE Display Tweaks work.

Need help converting wav files to XWB without the wav file names being changed

Flight Mechanics

Did you know that C++ is
the 7th most popular programming language
based on number of references?

Algorithms for division: Using Newton's method

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com hardware-buttons linkedin-bot template-engine-js Post date: 8 Dec 2021

DirectXMath

JetBrains

Related posts

A 94x speed improvement demonstrated using handwritten assembly

C++ exception performance three years later

I need help to make SSE Display Tweaks work.

Need help converting wav files to XWB without the wav file names being changed

Flight Mechanics

Did you know that C++ is the 7th most popular programming language based on number of references?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
hardware-buttons linkedin-bot template-engine-js
Post date: 8 Dec 2021

Did you know that C++ is
the 7th most popular programming language
based on number of references?