Multiplication of long numbers by the Karatsuba method

The other day I needed to get to grips with this algorithm, but a quick Google search turned up nothing worthwhile. On Habr, too, there was only one article, and it did not really help me. Having worked it out, I will try to share it in an accessible form:

Algorithm

The Karatsuba algorithm is a fast multiplication method with complexity O(n^{log₂ 3}), whereas the naive algorithm, column multiplication, requires n² operations. Note that when the numbers are shorter than a few dozen digits (the exact threshold is determined experimentally), ordinary multiplication is faster.
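To get a feel for the gap between the two complexities, here is a minimal sketch that counts single-digit multiplications. It assumes that n is a power of two and that every split produces either three half-size products (Karatsuba) or four (straightforward splitting). The name `digit_multiplications` is introduced here purely for illustration.

```cpp
#include <cstdint>

// Number of single-digit multiplications for two n-digit numbers when each
// split yields `branches` products of half the length. Assumes n is a power
// of two; `branches` is 3 for Karatsuba and 4 for the straightforward split.
constexpr std::uint64_t digit_multiplications(std::uint64_t n, unsigned branches) {
    return n <= 1 ? 1 : branches * digit_multiplications(n / 2, branches);
}
```

For n = 1024 this gives 59049 multiplications with three branches against 1048576 (that is, n²) with four.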
Suppose there are two numbers A and B of length n in a number system with base BASE:
A = a_{n-1} a_{n-2} ... a₀
B = b_{n-1} b_{n-2} ... b₀, where a_i and b_i are the values of the corresponding digits.
Each of them can be represented as the sum of two parts, halves of length m = n / 2 (if n is odd, one part is one digit shorter than the other):
A₀ = a_{m-1} a_{m-2} ... a₀
A₁ = a_{n-1} a_{n-2} ... a_m
A = A₀ + A₁ * BASE^m

B₀ = b_{m-1} b_{m-2} ... b₀
B₁ = b_{n-1} b_{n-2} ... b_m
B = B₀ + B₁ * BASE^m

Then:
A * B = (A₀ + A₁ * BASE^m) * (B₀ + B₁ * BASE^m)
= A₀ * B₀ + A₀ * B₁ * BASE^m + A₁ * B₀ * BASE^m + A₁ * B₁ * BASE^{2m}
= A₀ * B₀ + (A₀ * B₁ + A₁ * B₀) * BASE^m + A₁ * B₁ * BASE^{2m}
This requires 4 multiplications (the factors BASE^m and BASE^{2m} are not real multiplications; they only indicate the position, that is, the digit shift, at which the result is written). But consider the following product:
(A₀ + A₁) * (B₀ + B₁) = A₀ * B₀ + A₀ * B₁ + A₁ * B₀ + A₁ * B₁
The term A₀ * B₁ + A₁ * B₀ appears in both formulas. After a simple transformation, the number of multiplications can be reduced to 3: two multiplications are replaced by one multiplication plus several additions and subtractions, which are much cheaper (linear rather than quadratic in the number of digits):
A₀ * B₁ + A₁ * B₀ = (A₀ + A₁) * (B₀ + B₁) - A₀ * B₀ - A₁ * B₁
The final form of the expression:
A * B = A₀ * B₀ + ((A₀ + A₁) * (B₀ + B₁) - A₀ * B₀ - A₁ * B₁) * BASE^m + A₁ * B₁ * BASE^{2m}
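The final expression can be checked on ordinary machine integers. The sketch below performs a single splitting step with exactly three multiplications of the parts; `karatsuba_step` and its `split` parameter (which stands for BASE^m) are names made up for this illustration, and the product is assumed to fit into 64 bits.

```cpp
#include <cassert>
#include <cstdint>

// One Karatsuba step on built-in integers. `split` stands for BASE^m,
// for example 100 when BASE = 10 and m = 2.
std::uint64_t karatsuba_step(std::uint64_t a, std::uint64_t b, std::uint64_t split) {
    assert(split > 1);
    const std::uint64_t a0 = a % split, a1 = a / split;  // A0 and A1
    const std::uint64_t b0 = b % split, b1 = b / split;  // B0 and B1

    const std::uint64_t low = a0 * b0;                    // A0 * B0
    const std::uint64_t high = a1 * b1;                   // A1 * B1
    // Third and last multiplication of parts: gives A0 * B1 + A1 * B0.
    const std::uint64_t middle = (a0 + a1) * (b0 + b1) - low - high;

    // Multiplying by `split` only shifts the terms to their positions.
    return low + middle * split + high * split * split;
}
```

For instance, `karatsuba_step(12345, 98765, 100)` splits the operands into 123|45 and 987|65 and returns 1219253925, the same value as the direct product.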

Graphic representation:

Example

For example, let us multiply two five-digit numbers in the decimal system, 12345 and 98765:

The image clearly shows the recursive nature of the algorithm. For numbers shorter than four digits, naive multiplication was used.
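The same recursion can be sketched on built-in integers, as long as the product fits into 64 bits. This is only an illustration of the recursive structure, not the long-number implementation; the helper names `digit_count`, `power_of_ten` and `karatsuba_small` are assumptions of this sketch, and the four-digit cutoff mirrors the one used in the example.

```cpp
#include <algorithm>
#include <cstdint>

// Number of decimal digits in x (1 for x == 0).
unsigned digit_count(std::uint64_t x) {
    unsigned n = 1;
    while (x >= 10) { x /= 10; ++n; }
    return n;
}

// 10 to the power e; e stays small here, so overflow is not checked.
std::uint64_t power_of_ten(unsigned e) {
    std::uint64_t p = 1;
    while (e-- > 0) p *= 10;
    return p;
}

// Recursive Karatsuba multiplication of decimal numbers stored in uint64_t.
// Assumes the final product fits into 64 bits.
std::uint64_t karatsuba_small(std::uint64_t a, std::uint64_t b) {
    const unsigned n = std::max(digit_count(a), digit_count(b));
    if (n < 4) return a * b;  // short operands: multiply directly

    const std::uint64_t split = power_of_ten(n / 2);  // BASE^m with m = n / 2
    const std::uint64_t a0 = a % split, a1 = a / split;
    const std::uint64_t b0 = b % split, b1 = b / split;

    const std::uint64_t low = karatsuba_small(a0, b0);
    const std::uint64_t high = karatsuba_small(a1, b1);
    const std::uint64_t middle = karatsuba_small(a0 + a1, b0 + b1) - low - high;

    return low + middle * split + high * split * split;
}
```

Calling `karatsuba_small(12345, 98765)` yields 1219253925. Each call on operands of four or more digits spawns three smaller calls, which is the tree structure the image illustrates.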

C++ implementation

It is probably best to start with how long numbers are stored. It is convenient to represent a long number as an array in which each element corresponds to one digit, with the lower-order digits stored at smaller indices (that is, back to front), because this makes them easier to process. For example:
int long_value[] = { 9, 8, 7, 6, 5, 4 }; // 456789
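As a small sketch of this storage scheme, the helpers below convert between a decimal string and a digit array with the least significant digit first. They use `std::vector` instead of a raw array for brevity, and the names `to_digits` and `digits_to_string` are invented for this example.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Converts a decimal string such as "456789" into digits stored
// least significant first: {9, 8, 7, 6, 5, 4}.
std::vector<int> to_digits(const std::string& text) {
    std::vector<int> digits;
    digits.reserve(text.size());
    for (auto it = text.rbegin(); it != text.rend(); ++it) {
        assert(*it >= '0' && *it <= '9');  // only decimal digits are accepted
        digits.push_back(*it - '0');
    }
    return digits;
}

// Inverse conversion: walks the array from the most significant digit down.
std::string digits_to_string(const std::vector<int>& digits) {
    std::string text;
    for (auto it = digits.rbegin(); it != digits.rend(); ++it)
        text.push_back(static_cast<char>('0' + *it));
    return text;
}
```

With this layout the digit with index i always has weight BASE^i, so carries move toward larger indices and the array can grow at its end.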
To improve performance, it is desirable to choose the largest base of the number system that the built-in types allow. It must, however, satisfy the following conditions:

The square of the largest digit in the chosen number system must fit in the chosen built-in type. This is needed to store the product of two digits in intermediate calculations.
The chosen built-in type should preferably be signed. This makes it possible to get rid of several intermediate normalizations.
Preferably, the sum of several squares of the largest digit should also fit in a single digit. This, too, removes several intermediate normalizations.
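These three conditions can be verified at compile time. The sketch below assumes a 32-bit signed digit type and a base of 10000; both are hypothetical choices for illustration, not values taken from the article, and the names `kBase`, `kMaxDigitProduct` and `kSafeAccumulations` are likewise invented here.

```cpp
#include <cstdint>
#include <limits>

using digit = std::int32_t;     // assumed digit type (hypothetical choice)
constexpr digit kBase = 10000;  // assumed base (hypothetical choice)

// Largest possible product of two digits: (BASE - 1)^2.
constexpr std::int64_t kMaxDigitProduct =
    std::int64_t(kBase - 1) * (kBase - 1);

// How many such products can be summed in one digit without overflow.
constexpr std::int64_t kSafeAccumulations =
    std::numeric_limits<digit>::max() / kMaxDigitProduct;

static_assert(kMaxDigitProduct <= std::numeric_limits<digit>::max(),
              "the square of the largest digit must fit in the digit type");
static_assert(std::numeric_limits<digit>::is_signed,
              "a signed type lets borrows stay negative until normalization");
static_assert(kSafeAccumulations >= 2,
              "several digit products should fit before normalization");
```

With these particular values, 21 digit products can be accumulated before a normalization pass becomes necessary.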

Below is a working multiplication function with comments, together with all the necessary auxiliary declarations and functions. For better performance, change the base of the number system and the type used to store the digits, and uncomment the small code fragment in the part responsible for naive multiplication:

#include <cstring>

#define BASE 10 // base of the number system
#define MIN_LENGTH_FOR_KARATSUBA 4 // shorter numbers are multiplied by the quadratic algorithm

typedef int digit; // type used for a single digit
typedef unsigned long int size_length; // type for the length of a number

using namespace std;

struct long_value { // type for long numbers
    digit *values; // array of digits stored in reverse order (least significant first)
    size_length length; // length of the number
};
long_value sum(long_value a, long_value b) {
    /* Adds two long numbers. If the numbers differ in length, the longer one
     * must be passed as the first argument (it may be longer by at most one
     * digit). Returns a new unnormalized number.
     */
    long_value s;
    s.length = a.length + 1;
    s.values = new digit[s.length];

    s.values[a.length - 1] = a.values[a.length - 1]; // top digit of a, in case b is shorter
    s.values[a.length] = 0;                          // room for the carry
    for (size_length i = 0; i < b.length; ++i)
        s.values[i] = a.values[i] + b.values[i];
    return s;
}
long_value &sub(long_value &a, long_value b) {
    /* Subtracts one long number from another. Modifies the first number and
     * returns a reference to it. The result is not normalized.
     */
    for (size_length i = 0; i < b.length; ++i)
        a.values[i] -= b.values[i];
    return a;
}
void normalize(long_value l) {
    /* Normalizes the number: brings every digit except the most significant
     * one into the range [0, BASE). The struct is passed by value, but the
     * digit array is shared, so the caller's number is modified in place.
     */
    for (size_length i = 0; i < l.length - 1; ++i) {
        if (l.values[i] >= BASE) { // digit too large: carry into the next digit
            digit carryover = l.values[i] / BASE;
            l.values[i + 1] += carryover;
            l.values[i] -= carryover * BASE;
        } else if (l.values[i] < 0) { // digit negative: borrow from the next digit
            digit carryover = (l.values[i] + 1) / BASE - 1;
            l.values[i + 1] += carryover;
            l.values[i] -= carryover * BASE;
        }
    }
}
long_value karatsuba(long_value a, long_value b) {
    /* Multiplies two long numbers. Assumes that a and b have the same length
     * and normalized digits. The caller must delete[] the returned array.
     */
    long_value product; // resulting product
    product.length = a.length + b.length;
    product.values = new digit[product.length];

    if (a.length < MIN_LENGTH_FOR_KARATSUBA) { // short numbers: naive multiplication
        memset(product.values, 0, sizeof(digit) * product.length);
        for (size_length i = 0; i < a.length; ++i)
            for (size_length j = 0; j < b.length; ++j) {
                product.values[i + j] += a.values[i] * b.values[j];
                /* If you change MIN_LENGTH_FOR_KARATSUBA or BASE, uncomment the
                 * following lines and choose suitable values to keep the digits
                 * from overflowing. For example, in the decimal system 100 means
                 * a carry of 1 two positions up, 200 a carry of 2 two positions
                 * up, and 5000 a carry of 5 three positions up.
                 * if (product.values[i + j] >= 100) {
                 *     product.values[i + j] -= 100;
                 *     product.values[i + j + 2] += 1;
                 * }
                 */
            }
    } else { // multiplication by the Karatsuba method
        long_value a_part1; // lower half of a
        a_part1.values = a.values;
        a_part1.length = (a.length + 1) / 2;

        long_value a_part2; // upper half of a
        a_part2.values = a.values + a_part1.length;
        a_part2.length = a.length / 2;

        long_value b_part1; // lower half of b
        b_part1.values = b.values;
        b_part1.length = (b.length + 1) / 2;

        long_value b_part2; // upper half of b
        b_part2.values = b.values + b_part1.length;
        b_part2.length = b.length / 2;

        long_value sum_of_a_parts = sum(a_part1, a_part2); // sum of the halves of a
        normalize(sum_of_a_parts);
        long_value sum_of_b_parts = sum(b_part1, b_part2); // sum of the halves of b
        normalize(sum_of_b_parts);

        // product of the sums of the halves
        long_value product_of_sums_of_parts = karatsuba(sum_of_a_parts, sum_of_b_parts);
        long_value product_of_first_parts = karatsuba(a_part1, b_part1); // low-order term
        long_value product_of_second_parts = karatsuba(a_part2, b_part2); // high-order term

        // Sum of the middle terms; reuses the array of product_of_sums_of_parts.
        long_value sum_of_middle_terms = sub(sub(product_of_sums_of_parts, product_of_first_parts), product_of_second_parts);

        /*
         * Assemble the result from the three terms
         */
        memcpy(product.values, product_of_first_parts.values,
               product_of_first_parts.length * sizeof(digit));
        memcpy(product.values + product_of_first_parts.length,
               product_of_second_parts.values,
               product_of_second_parts.length * sizeof(digit));
        // The top digit of the middle term is always zero but can lie past the
        // end of product (this happens for length 5), so stop at product.length.
        for (size_length i = 0; i < sum_of_middle_terms.length
                 && a_part1.length + i < product.length; ++i)
            product.values[a_part1.length + i] += sum_of_middle_terms.values[i];

        /*
         * Cleanup
         */
        delete[] sum_of_a_parts.values;
        delete[] sum_of_b_parts.values;
        delete[] product_of_sums_of_parts.values;
        delete[] product_of_first_parts.values;
        delete[] product_of_second_parts.values;
    }

    normalize(product); // final normalization of the result
    return product;
}
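When experimenting with the function above, it helps to have an independent reference to compare results against. The following is a minimal sketch of ordinary column multiplication on the same digit layout (least significant digit first). It uses `std::vector` rather than `long_value`, and the name `schoolbook_multiply` is an assumption of this example, not part of the code above.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Column multiplication of two digit vectors stored least significant
// first. The result has a.size() + b.size() digits, possibly with a
// leading zero, and every digit is already normalized to [0, base).
std::vector<int> schoolbook_multiply(const std::vector<int>& a,
                                     const std::vector<int>& b,
                                     int base = 10) {
    assert(base > 1);
    std::vector<int> product(a.size() + b.size(), 0);
    for (std::size_t i = 0; i < a.size(); ++i) {
        int carry = 0;
        for (std::size_t j = 0; j < b.size(); ++j) {
            const int current = product[i + j] + a[i] * b[j] + carry;
            product[i + j] = current % base;  // digit that stays in place
            carry = current / base;           // part moved to the next digit
        }
        product[i + b.size()] += carry;       // this slot is still zero here
    }
    return product;
}
```

Multiplying {5, 4, 3, 2, 1} by {5, 6, 7, 8, 9} (that is, 12345 by 98765) gives {5, 2, 9, 3, 5, 2, 9, 1, 2, 1}, which reads as 1219253925 from the highest index down.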

Source: https://habr.com/ru/post/124258/
