[Tutorial] A fast and easy way to factorize integers up to 1e9

This blog focuses on the GCC compiler.

The most basic way to find the prime factorization of an integer $$$n$$$ is to test every possible divisor up to $$$\sqrt{n}$$$. This can be optimized by precomputing the primes and testing only those. What else can we do? Note that the modulo operation is rather expensive. To speed it up, we could use something like Montgomery multiplication, but that is too complicated for an "easy way". However, when the divisor is a compile-time constant, the compiler replaces the modulo operation (or division, for that matter) with cheap multiplication and bit-shift operations. Let's take advantage of that:

#include <bits/stdc++.h>
using namespace std;

typedef uint32_t u32;

// The cold attribute tells the compiler that this function is unlikely to be
// called.  Without it, compilation will take much more time because the
// compiler tries to optimize the function calls.  Attributes are a GCC
// extension.
__attribute__((cold))
void factor_helper(vector<u32> &vec, u32 &x, u32 y)
{
	do {
		vec.push_back(y);
		x /= y;
	} while (x % y == 0);
}

vector<u32> factor(u32 x) {
	vector<u32> vec;
#define F(y) if (x%y == 0) { factor_helper(vec, x, y); }
	F(2)F(3)F(5)F(7)F(11)F(13)... /* all primes up to sqrt(1e9) */
#undef F
	if (x > 1)
		vec.push_back(x);
	return vec;
}

As you can see, it's pretty simple, and the long line containing all the primes can be generated with a few lines of code. The only remaining problem is the long compile time. The code above is tweaked so that it compiles within a few seconds with GCC (the details, such as how the parameters are passed and the cold attribute, were found through trial and error), so that's no longer an issue either. Unsigned integers are used because they are faster in this case.
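For reference, here is one possible generator for the long `F(...)` line mentioned above. It is only a sketch: the helper names `primes_up_to_sqrt` and `macro_line` are made up for this illustration and are not part of the original code.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Sieve of Eratosthenes: returns all primes p with p * p <= limit.
// (Hypothetical helper, not from the original post.)
std::vector<uint32_t> primes_up_to_sqrt(uint64_t limit) {
    uint32_t bound = 0;
    while (uint64_t(bound + 1) * (bound + 1) <= limit) bound++;
    std::vector<bool> composite(bound + 1, false);
    std::vector<uint32_t> primes;
    for (uint32_t p = 2; p <= bound; p++) {
        if (composite[p]) continue;
        primes.push_back(p);
        // Mark multiples of p, starting at p * p, as composite.
        for (uint64_t j = uint64_t(p) * p; j <= bound; j += p) composite[j] = true;
    }
    return primes;
}

// Builds the text "F(2)F(3)F(5)..." to paste between #define F and #undef F.
std::string macro_line(const std::vector<uint32_t> &primes) {
    std::string line;
    for (uint32_t p : primes) line += "F(" + std::to_string(p) + ")";
    return line;
}
```

Printing `macro_line(primes_up_to_sqrt(1000000000))` to standard output gives a line that starts with `F(2)F(3)F(5)` and can be pasted into `factor`.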

But how fast is this actually? In my tests it can factorize $$$10^6$$$ integers in less than 2 seconds! Feel free to test it for yourself (be sure to use the latest version of g++ available on Codeforces, both for testing and for actually using this).
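Much of the speed comes from the constant-divisor optimization mentioned at the start. As an illustration of the kind of rewrite a compiler can apply, the sketch below tests divisibility by an odd constant with one multiplication and one comparison. This is a known technique written out by hand, not a claim about the exact instructions GCC emits, which depend on the version and target. The names `inverse_mod_2_32` and `divisible_by` are made up for this example.

```cpp
#include <cstdint>

// Multiplicative inverse of an odd d modulo 2^32, found by Newton iteration.
// Starting from inv = d gives 3 correct low bits (d * d == 1 mod 8 for odd d);
// each step doubles that, so 4 steps cover all 32 bits.
constexpr uint32_t inverse_mod_2_32(uint32_t d) {
    uint32_t inv = d;
    for (int i = 0; i < 4; i++)
        inv *= 2u - d * inv;
    return inv;
}

// For odd D: x is a multiple of D exactly when x * inverse(D) (mod 2^32)
// is at most floor((2^32 - 1) / D). No division happens at run time.
template <uint32_t D>
constexpr bool divisible_by(uint32_t x) {
    static_assert(D % 2 == 1, "this sketch only handles odd divisors");
    constexpr uint32_t inv = inverse_mod_2_32(D);
    constexpr uint32_t limit = UINT32_MAX / D;
    return x * inv <= limit;
}
```

For example, `divisible_by<7>(21)` is true and `divisible_by<7>(22)` is false. When the divisor is only known at run time, the constants `inv` and `limit` cannot be baked into the code, which is why each prime gets its own `if` in the macro version.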

vector<u32> factor(u32 x) {
	vector<u32> vec;
#pragma GCC unroll 3401
	for (int i = 0; i < 3401; i++)
		if (x % primes[i] == 0) [[unlikely]]
			factor_helper(vec, x, primes[i]);
	if (x > 1)
		vec.push_back(x);
	return vec;
}

template<int L = 0, int R = P, int M = (L + R) / 2>
constexpr void fact_p(vector<u32>& vec, u32 &x) {
    if constexpr (L + 1 < R) {
        fact_p<L, M>(vec, x);
        if (L+20<R && x < primes[M]*primes[M])
            return;
        fact_p<M, R>(vec, x);
    } else if(x % primes[L] == 0) {
        factor_helper(vec, x, primes[L]);
    }
}

Comments (16)

ftiasch

11 months ago

+42

That sounds really crazy...

aryanc403

+82

Spoiler

Psychotic_D

11 months ago

Same here.

ymmparsa

Here you go.

WARNING: very large amount of text, open at your own risk!

#include <bits/stdc++.h>
using namespace std;

typedef uint32_t u32;

// The cold attribute tells the compiler that this function is unlikely to be
// called.  Without it, compilation will take much more time because the
// compiler tries to optimize the function calls.  Attributes are a GCC
// extension.
__attribute__((cold))
void factor_helper(vector<u32> &vec, u32 &x, u32 y)
{
	do {
		vec.push_back(y);
		x /= y;
	} while (x % y == 0);
}

vector<u32> factor(u32 x) {
	vector<u32> vec;
#define F(y) if (x%y == 0) { factor_helper(vec, x, y); }
F(2)F(3)F(5)F(7)F(11)F(13)F(17)F(19)F(23)F(29)F(31)F(37)F(41)F(43)F(47)F(53)F(59)F(61)F(67)F(71)F(73)F(79)F(83)F(89)F(97)F(101)F(103)F(107)F(109)F(113)F(127)F(131)F(137)F(139)F(149)F(151)F(157)F(163) F(167)F(173)F(179)F(181)F(191)F(193)F(197)F(199)F(211)F(223)F(227)F(229)F(233)F(239)F(241)F(251)F(257)F(263)F(269)F(271)F(277)F(281)F(283)F(293)F(307)F(311)F(313)F(317)F(331)F(337)F(347)F(349)F(353) F(359)F(367)F(373)F(379)F(383)F(389)F(397)F(401)F(409)F(419)F(421)F(431)F(433)F(439)F(443)F(449)F(457)F(461)F(463)F(467)F(479)F(487)F(491)F(499)F(503)F(509)F(521)F(523)F(541)F(547)F(557)F(563)F(569) F(571)F(577)F(587)F(593)F(599)F(601)F(607)F(613)F(617)F(619)F(631)F(641)F(643)F(647)F(653)F(659)F(661)F(673)F(677)F(683)F(691)F(701)F(709)F(719)F(727)F(733)F(739)F(743)F(751)F(757)F(761)F(769)F(773) F(787)F(797)F(809)F(811)F(821)F(823)F(827)F(829)F(839)F(853)F(857)F(859)F(863)F(877)F(881)F(883)F(887)F(907)F(911)F(919)F(929)F(937)F(941)F(947)F(953)F(967)F(971)F(977)F(983)F(991)F(997)F(1009)F(1013) F(1019)F(1021)F(1031)F(1033)F(1039)F(1049)F(1051)F(1061)F(1063)F(1069)F(1087)F(1091)F(1093)F(1097)F(1103)F(1109)F(1117)F(1123)F(1129)F(1151)F(1153)F(1163)F(1171)F(1181)F(1187)F(1193)F(1201)F(1213) F(1217)F(1223)F(1229)F(1231)F(1237)F(1249)F(1259)F(1277)F(1279)F(1283)F(1289)F(1291)F(1297)F(1301)F(1303)F(1307)F(1319)F(1321)F(1327)F(1361)F(1367)F(1373)F(1381)F(1399)F(1409)F(1423)F(1427)F(1429) F(1433)F(1439)F(1447)F(1451)F(1453)F(1459)F(1471)F(1481)F(1483)F(1487)F(1489)F(1493)F(1499)F(1511)F(1523)F(1531)F(1543)F(1549)F(1553)F(1559)F(1567)F(1571)F(1579)F(1583)F(1597)F(1601)F(1607)F(1609) F(1613)F(1619)F(1621)F(1627)F(1637)F(1657)F(1663)F(1667)F(1669)F(1693)F(1697)F(1699)F(1709)F(1721)F(1723)F(1733)F(1741)F(1747)F(1753)F(1759)F(1777)F(1783)F(1787)F(1789)F(1801)F(1811)F(1823)F(1831) F(1847)F(1861)F(1867)F(1871)F(1873)F(1877)F(1879)F(1889)F(1901)F(1907)F(1913)F(1931)F(1933)F(1949)F(1951)F(1973)F(1979)F(1987)F(1993)F(1997)F(1999)F(2003)F(2011)F(2017)F(2027)F(2029)F(2039)F(2053) 
F(2063)F(2069)F(2081)F(2083)F(2087)F(2089)F(2099)F(2111)F(2113)F(2129)F(2131)F(2137)F(2141)F(2143)F(2153)F(2161)F(2179)F(2203)F(2207)F(2213)F(2221)F(2237)F(2239)F(2243)F(2251)F(2267)F(2269)F(2273) F(2281)F(2287)F(2293)F(2297)F(2309)F(2311)F(2333)F(2339)F(2341)F(2347)F(2351)F(2357)F(2371)F(2377)F(2381)F(2383)F(2389)F(2393)F(2399)F(2411)F(2417)F(2423)F(2437)F(2441)F(2447)F(2459)F(2467)F(2473) F(2477)F(2503)F(2521)F(2531)F(2539)F(2543)F(2549)F(2551)F(2557)F(2579)F(2591)F(2593)F(2609)F(2617)F(2621)F(2633)F(2647)F(2657)F(2659)F(2663)F(2671)F(2677)F(2683)F(2687)F(2689)F(2693)F(2699)F(2707) F(2711)F(2713)F(2719)F(2729)F(2731)F(2741)F(2749)F(2753)F(2767)F(2777)F(2789)F(2791)F(2797)F(2801)F(2803)F(2819)F(2833)F(2837)F(2843)F(2851)F(2857)F(2861)F(2879)F(2887)F(2897)F(2903)F(2909)F(2917) F(2927)F(2939)F(2953)F(2957)F(2963)F(2969)F(2971)F(2999)F(3001)F(3011)F(3019)F(3023)F(3037)F(3041)F(3049)F(3061)F(3067)F(3079)F(3083)F(3089)F(3109)F(3119)F(3121)F(3137)F(3163)F(3167)F(3169)F(3181) F(3187)F(3191)F(3203)F(3209)F(3217)F(3221)F(3229)F(3251)F(3253)F(3257)F(3259)F(3271)F(3299)F(3301)F(3307)F(3313)F(3319)F(3323)F(3329)F(3331)F(3343)F(3347)F(3359)F(3361)F(3371)F(3373)F(3389)F(3391) F(3407)F(3413)F(3433)F(3449)F(3457)F(3461)F(3463)F(3467)F(3469)F(3491)F(3499)F(3511)F(3517)F(3527)F(3529)F(3533)F(3539)F(3541)F(3547)F(3557)F(3559)F(3571)F(3581)F(3583)F(3593)F(3607)F(3613)F(3617) F(3623)F(3631)F(3637)F(3643)F(3659)F(3671)F(3673)F(3677)F(3691)F(3697)F(3701)F(3709)F(3719)F(3727)F(3733)F(3739)F(3761)F(3767)F(3769)F(3779)F(3793)F(3797)F(3803)F(3821)F(3823)F(3833)F(3847)F(3851) F(3853)F(3863)F(3877)F(3881)F(3889)F(3907)F(3911)F(3917)F(3919)F(3923)F(3929)F(3931)F(3943)F(3947)F(3967)F(3989)F(4001)F(4003)F(4007)F(4013)F(4019)F(4021)F(4027)F(4049)F(4051)F(4057)F(4073)F(4079) F(4091)F(4093)F(4099)F(4111)F(4127)F(4129)F(4133)F(4139)F(4153)F(4157)F(4159)F(4177)F(4201)F(4211)F(4217)F(4219)F(4229)F(4231)F(4241)F(4243)F(4253)F(4259)F(4261)F(4271)F(4273)F(4283)F(4289)F(4297) 
F(4327)F(4337)F(4339)F(4349)F(4357)F(4363)F(4373)F(4391)F(4397)F(4409)F(4421)F(4423)F(4441)F(4447)F(4451)F(4457)F(4463)F(4481)F(4483)F(4493)F(4507)F(4513)F(4517)F(4519)F(4523)F(4547)F(4549)F(4561) F(4567)F(4583)F(4591)F(4597)F(4603)F(4621)F(4637)F(4639)F(4643)F(4649)F(4651)F(4657)F(4663)F(4673)F(4679)F(4691)F(4703)F(4721)F(4723)F(4729)F(4733)F(4751)F(4759)F(4783)F(4787)F(4789)F(4793)F(4799) F(4801)F(4813)F(4817)F(4831)F(4861)F(4871)F(4877)F(4889)F(4903)F(4909)F(4919)F(4931)F(4933)F(4937)F(4943)F(4951)F(4957)F(4967)F(4969)F(4973)F(4987)F(4993)F(4999)F(5003)F(5009)F(5011)F(5021)F(5023) F(5039)F(5051)F(5059)F(5077)F(5081)F(5087)F(5099)F(5101)F(5107)F(5113)F(5119)F(5147)F(5153)F(5167)F(5171)F(5179)F(5189)F(5197)F(5209)F(5227)F(5231)F(5233)F(5237)F(5261)F(5273)F(5279)F(5281)F(5297) F(5303)F(5309)F(5323)F(5333)F(5347)F(5351)F(5381)F(5387)F(5393)F(5399)F(5407)F(5413)F(5417)F(5419)F(5431)F(5437)F(5441)F(5443)F(5449)F(5471)F(5477)F(5479)F(5483)F(5501)F(5503)F(5507)F(5519)F(5521) F(5527)F(5531)F(5557)F(5563)F(5569)F(5573)F(5581)F(5591)F(5623)F(5639)F(5641)F(5647)F(5651)F(5653)F(5657)F(5659)F(5669)F(5683)F(5689)F(5693)F(5701)F(5711)F(5717)F(5737)F(5741)F(5743)F(5749)F(5779) F(5783)F(5791)F(5801)F(5807)F(5813)F(5821)F(5827)F(5839)F(5843)F(5849)F(5851)F(5857)F(5861)F(5867)F(5869)F(5879)F(5881)F(5897)F(5903)F(5923)F(5927)F(5939)F(5953)F(5981)F(5987)F(6007)F(6011)F(6029) F(6037)F(6043)F(6047)F(6053)F(6067)F(6073)F(6079)F(6089)F(6091)F(6101)F(6113)F(6121)F(6131)F(6133)F(6143)F(6151)F(6163)F(6173)F(6197)F(6199)F(6203)F(6211)F(6217)F(6221)F(6229)F(6247)F(6257)F(6263) F(6269)F(6271)F(6277)F(6287)F(6299)F(6301)F(6311)F(6317)F(6323)F(6329)F(6337)F(6343)F(6353)F(6359)F(6361)F(6367)F(6373)F(6379)F(6389)F(6397)F(6421)F(6427)F(6449)F(6451)F(6469)F(6473)F(6481)F(6491) F(6521)F(6529)F(6547)F(6551)F(6553)F(6563)F(6569)F(6571)F(6577)F(6581)F(6599)F(6607)F(6619)F(6637)F(6653)F(6659)F(6661)F(6673)F(6679)F(6689)F(6691)F(6701)F(6703)F(6709)F(6719)F(6733)F(6737)F(6761) 
F(6763)F(6779)F(6781)F(6791)F(6793)F(6803)F(6823)F(6827)F(6829)F(6833)F(6841)F(6857)F(6863)F(6869)F(6871)F(6883)F(6899)F(6907)F(6911)F(6917)F(6947)F(6949)F(6959)F(6961)F(6967)F(6971)F(6977)F(6983) F(6991)F(6997)F(7001)F(7013)F(7019)F(7027)F(7039)F(7043)F(7057)F(7069)F(7079)F(7103)F(7109)F(7121)F(7127)F(7129)F(7151)F(7159)F(7177)F(7187)F(7193)F(7207)F(7211)F(7213)F(7219)F(7229)F(7237)F(7243) F(7247)F(7253)F(7283)F(7297)F(7307)F(7309)F(7321)F(7331)F(7333)F(7349)F(7351)F(7369)F(7393)F(7411)F(7417)F(7433)F(7451)F(7457)F(7459)F(7477)F(7481)F(7487)F(7489)F(7499)F(7507)F(7517)F(7523)F(7529) F(7537)F(7541)F(7547)F(7549)F(7559)F(7561)F(7573)F(7577)F(7583)F(7589)F(7591)F(7603)F(7607)F(7621)F(7639)F(7643)F(7649)F(7669)F(7673)F(7681)F(7687)F(7691)F(7699)F(7703)F(7717)F(7723)F(7727)F(7741) F(7753)F(7757)F(7759)F(7789)F(7793)F(7817)F(7823)F(7829)F(7841)F(7853)F(7867)F(7873)F(7877)F(7879)F(7883)F(7901)F(7907)F(7919)F(7927)F(7933)F(7937)F(7949)F(7951)F(7963)F(7993)F(8009)F(8011)F(8017) F(8039)F(8053)F(8059)F(8069)F(8081)F(8087)F(8089)F(8093)F(8101)F(8111)F(8117)F(8123)F(8147)F(8161)F(8167)F(8171)F(8179)F(8191)F(8209)F(8219)F(8221)F(8231)F(8233)F(8237)F(8243)F(8263)F(8269)F(8273) F(8287)F(8291)F(8293)F(8297)F(8311)F(8317)F(8329)F(8353)F(8363)F(8369)F(8377)F(8387)F(8389)F(8419)F(8423)F(8429)F(8431)F(8443)F(8447)F(8461)F(8467)F(8501)F(8513)F(8521)F(8527)F(8537)F(8539)F(8543) F(8563)F(8573)F(8581)F(8597)F(8599)F(8609)F(8623)F(8627)F(8629)F(8641)F(8647)F(8663)F(8669)F(8677)F(8681)F(8689)F(8693)F(8699)F(8707)F(8713)F(8719)F(8731)F(8737)F(8741)F(8747)F(8753)F(8761)F(8779) F(8783)F(8803)F(8807)F(8819)F(8821)F(8831)F(8837)F(8839)F(8849)F(8861)F(8863)F(8867)F(8887)F(8893)F(8923)F(8929)F(8933)F(8941)F(8951)F(8963)F(8969)F(8971)F(8999)F(9001)F(9007)F(9011)F(9013)F(9029) F(9041)F(9043)F(9049)F(9059)F(9067)F(9091)F(9103)F(9109)F(9127)F(9133)F(9137)F(9151)F(9157)F(9161)F(9173)F(9181)F(9187)F(9199)F(9203)F(9209)F(9221)F(9227)F(9239)F(9241)F(9257)F(9277)F(9281)F(9283) 
F(9293)F(9311)F(9319)F(9323)F(9337)F(9341)F(9343)F(9349)F(9371)F(9377)F(9391)F(9397)F(9403)F(9413)F(9419)F(9421)F(9431)F(9433)F(9437)F(9439)F(9461)F(9463)F(9467)F(9473)F(9479)F(9491)F(9497)F(9511) F(9521)F(9533)F(9539)F(9547)F(9551)F(9587)F(9601)F(9613)F(9619)F(9623)F(9629)F(9631)F(9643)F(9649)F(9661)F(9677)F(9679)F(9689)F(9697)F(9719)F(9721)F(9733)F(9739)F(9743)F(9749)F(9767)F(9769)F(9781) F(9787)F(9791)F(9803)F(9811)F(9817)F(9829)F(9833)F(9839)F(9851)F(9857)F(9859)F(9871)F(9883)F(9887)F(9901)F(9907)F(9923)F(9929)F(9931)F(9941)F(9949)F(9967)F(9973)F(10007)F(10009)F(10037)F(10039) F(10061)F(10067)F(10069)F(10079)F(10091)F(10093)F(10099)F(10103)F(10111)F(10133)F(10139)F(10141)F(10151)F(10159)F(10163)F(10169)F(10177)F(10181)F(10193)F(10211)F(10223)F(10243)F(10247)F(10253)F(10259) F(10267)F(10271)F(10273)F(10289)F(10301)F(10303)F(10313)F(10321)F(10331)F(10333)F(10337)F(10343)F(10357)F(10369)F(10391)F(10399)F(10427)F(10429)F(10433)F(10453)F(10457)F(10459)F(10463)F(10477)F(10487) F(10499)F(10501)F(10513)F(10529)F(10531)F(10559)F(10567)F(10589)F(10597)F(10601)F(10607)F(10613)F(10627)F(10631)F(10639)F(10651)F(10657)F(10663)F(10667)F(10687)F(10691)F(10709)F(10711)F(10723)F(10729) F(10733)F(10739)F(10753)F(10771)F(10781)F(10789)F(10799)F(10831)F(10837)F(10847)F(10853)F(10859)F(10861)F(10867)F(10883)F(10889)F(10891)F(10903)F(10909)F(10937)F(10939)F(10949)F(10957)F(10973)F(10979) F(10987)F(10993)F(11003)F(11027)F(11047)F(11057)F(11059)F(11069)F(11071)F(11083)F(11087)F(11093)F(11113)F(11117)F(11119)F(11131)F(11149)F(11159)F(11161)F(11171)F(11173)F(11177)F(11197)F(11213)F(11239) F(11243)F(11251)F(11257)F(11261)F(11273)F(11279)F(11287)F(11299)F(11311)F(11317)F(11321)F(11329)F(11351)F(11353)F(11369)F(11383)F(11393)F(11399)F(11411)F(11423)F(11437)F(11443)F(11447)F(11467)F(11471) F(11483)F(11489)F(11491)F(11497)F(11503)F(11519)F(11527)F(11549)F(11551)F(11579)F(11587)F(11593)F(11597)F(11617)F(11621)F(11633)F(11657)F(11677)F(11681)F(11689)F(11699)F(11701)F(11717)F(11719)F(11731) 
F(11743)F(11777)F(11779)F(11783)F(11789)F(11801)F(11807)F(11813)F(11821)F(11827)F(11831)F(11833)F(11839)F(11863)F(11867)F(11887)F(11897)F(11903)F(11909)F(11923)F(11927)F(11933)F(11939)F(11941)F(11953) F(11959)F(11969)F(11971)F(11981)F(11987)F(12007)F(12011)F(12037)F(12041)F(12043)F(12049)F(12071)F(12073)F(12097)F(12101)F(12107)F(12109)F(12113)F(12119)F(12143)F(12149)F(12157)F(12161)F(12163)F(12197) F(12203)F(12211)F(12227)F(12239)F(12241)F(12251)F(12253)F(12263)F(12269)F(12277)F(12281)F(12289)F(12301)F(12323)F(12329)F(12343)F(12347)F(12373)F(12377)F(12379)F(12391)F(12401)F(12409)F(12413)F(12421) F(12433)F(12437)F(12451)F(12457)F(12473)F(12479)F(12487)F(12491)F(12497)F(12503)F(12511)F(12517)F(12527)F(12539)F(12541)F(12547)F(12553)F(12569)F(12577)F(12583)F(12589)F(12601)F(12611)F(12613)F(12619) F(12637)F(12641)F(12647)F(12653)F(12659)F(12671)F(12689)F(12697)F(12703)F(12713)F(12721)F(12739)F(12743)F(12757)F(12763)F(12781)F(12791)F(12799)F(12809)F(12821)F(12823)F(12829)F(12841)F(12853)F(12889) F(12893)F(12899)F(12907)F(12911)F(12917)F(12919)F(12923)F(12941)F(12953)F(12959)F(12967)F(12973)F(12979)F(12983)F(13001)F(13003)F(13007)F(13009)F(13033)F(13037)F(13043)F(13049)F(13063)F(13093)F(13099) F(13103)F(13109)F(13121)F(13127)F(13147)F(13151)F(13159)F(13163)F(13171)F(13177)F(13183)F(13187)F(13217)F(13219)F(13229)F(13241)F(13249)F(13259)F(13267)F(13291)F(13297)F(13309)F(13313)F(13327)F(13331) F(13337)F(13339)F(13367)F(13381)F(13397)F(13399)F(13411)F(13417)F(13421)F(13441)F(13451)F(13457)F(13463)F(13469)F(13477)F(13487)F(13499)F(13513)F(13523)F(13537)F(13553)F(13567)F(13577)F(13591)F(13597) F(13613)F(13619)F(13627)F(13633)F(13649)F(13669)F(13679)F(13681)F(13687)F(13691)F(13693)F(13697)F(13709)F(13711)F(13721)F(13723)F(13729)F(13751)F(13757)F(13759)F(13763)F(13781)F(13789)F(13799)F(13807) 
F(13829)F(13831)F(13841)F(13859)F(13873)F(13877)F(13879)F(13883)F(13901)F(13903)F(13907)F(13913)F(13921)F(13931)F(13933)F(13963)F(13967)F(13997)F(13999)F(14009)F(14011)F(14029)F(14033)F(14051)F(14057) F(14071)F(14081)F(14083)F(14087)F(14107)F(14143)F(14149)F(14153)F(14159)F(14173)F(14177)F(14197)F(14207)F(14221)F(14243)F(14249)F(14251)F(14281)F(14293)F(14303)F(14321)F(14323)F(14327)F(14341)F(14347) F(14369)F(14387)F(14389)F(14401)F(14407)F(14411)F(14419)F(14423)F(14431)F(14437)F(14447)F(14449)F(14461)F(14479)F(14489)F(14503)F(14519)F(14533)F(14537)F(14543)F(14549)F(14551)F(14557)F(14561)F(14563) F(14591)F(14593)F(14621)F(14627)F(14629)F(14633)F(14639)F(14653)F(14657)F(14669)F(14683)F(14699)F(14713)F(14717)F(14723)F(14731)F(14737)F(14741)F(14747)F(14753)F(14759)F(14767)F(14771)F(14779)F(14783) F(14797)F(14813)F(14821)F(14827)F(14831)F(14843)F(14851)F(14867)F(14869)F(14879)F(14887)F(14891)F(14897)F(14923)F(14929)F(14939)F(14947)F(14951)F(14957)F(14969)F(14983)F(15013)F(15017)F(15031)F(15053) F(15061)F(15073)F(15077)F(15083)F(15091)F(15101)F(15107)F(15121)F(15131)F(15137)F(15139)F(15149)F(15161)F(15173)F(15187)F(15193)F(15199)F(15217)F(15227)F(15233)F(15241)F(15259)F(15263)F(15269)F(15271) F(15277)F(15287)F(15289)F(15299)F(15307)F(15313)F(15319)F(15329)F(15331)F(15349)F(15359)F(15361)F(15373)F(15377)F(15383)F(15391)F(15401)F(15413)F(15427)F(15439)F(15443)F(15451)F(15461)F(15467)F(15473) F(15493)F(15497)F(15511)F(15527)F(15541)F(15551)F(15559)F(15569)F(15581)F(15583)F(15601)F(15607)F(15619)F(15629)F(15641)F(15643)F(15647)F(15649)F(15661)F(15667)F(15671)F(15679)F(15683)F(15727)F(15731) F(15733)F(15737)F(15739)F(15749)F(15761)F(15767)F(15773)F(15787)F(15791)F(15797)F(15803)F(15809)F(15817)F(15823)F(15859)F(15877)F(15881)F(15887)F(15889)F(15901)F(15907)F(15913)F(15919)F(15923)F(15937) 
F(15959)F(15971)F(15973)F(15991)F(16001)F(16007)F(16033)F(16057)F(16061)F(16063)F(16067)F(16069)F(16073)F(16087)F(16091)F(16097)F(16103)F(16111)F(16127)F(16139)F(16141)F(16183)F(16187)F(16189)F(16193) F(16217)F(16223)F(16229)F(16231)F(16249)F(16253)F(16267)F(16273)F(16301)F(16319)F(16333)F(16339)F(16349)F(16361)F(16363)F(16369)F(16381)F(16411)F(16417)F(16421)F(16427)F(16433)F(16447)F(16451)F(16453) F(16477)F(16481)F(16487)F(16493)F(16519)F(16529)F(16547)F(16553)F(16561)F(16567)F(16573)F(16603)F(16607)F(16619)F(16631)F(16633)F(16649)F(16651)F(16657)F(16661)F(16673)F(16691)F(16693)F(16699)F(16703) F(16729)F(16741)F(16747)F(16759)F(16763)F(16787)F(16811)F(16823)F(16829)F(16831)F(16843)F(16871)F(16879)F(16883)F(16889)F(16901)F(16903)F(16921)F(16927)F(16931)F(16937)F(16943)F(16963)F(16979)F(16981) F(16987)F(16993)F(17011)F(17021)F(17027)F(17029)F(17033)F(17041)F(17047)F(17053)F(17077)F(17093)F(17099)F(17107)F(17117)F(17123)F(17137)F(17159)F(17167)F(17183)F(17189)F(17191)F(17203)F(17207)F(17209) F(17231)F(17239)F(17257)F(17291)F(17293)F(17299)F(17317)F(17321)F(17327)F(17333)F(17341)F(17351)F(17359)F(17377)F(17383)F(17387)F(17389)F(17393)F(17401)F(17417)F(17419)F(17431)F(17443)F(17449)F(17467) F(17471)F(17477)F(17483)F(17489)F(17491)F(17497)F(17509)F(17519)F(17539)F(17551)F(17569)F(17573)F(17579)F(17581)F(17597)F(17599)F(17609)F(17623)F(17627)F(17657)F(17659)F(17669)F(17681)F(17683)F(17707) F(17713)F(17729)F(17737)F(17747)F(17749)F(17761)F(17783)F(17789)F(17791)F(17807)F(17827)F(17837)F(17839)F(17851)F(17863)F(17881)F(17891)F(17903)F(17909)F(17911)F(17921)F(17923)F(17929)F(17939)F(17957) F(17959)F(17971)F(17977)F(17981)F(17987)F(17989)F(18013)F(18041)F(18043)F(18047)F(18049)F(18059)F(18061)F(18077)F(18089)F(18097)F(18119)F(18121)F(18127)F(18131)F(18133)F(18143)F(18149)F(18169)F(18181) 
F(18191)F(18199)F(18211)F(18217)F(18223)F(18229)F(18233)F(18251)F(18253)F(18257)F(18269)F(18287)F(18289)F(18301)F(18307)F(18311)F(18313)F(18329)F(18341)F(18353)F(18367)F(18371)F(18379)F(18397)F(18401) F(18413)F(18427)F(18433)F(18439)F(18443)F(18451)F(18457)F(18461)F(18481)F(18493)F(18503)F(18517)F(18521)F(18523)F(18539)F(18541)F(18553)F(18583)F(18587)F(18593)F(18617)F(18637)F(18661)F(18671)F(18679) F(18691)F(18701)F(18713)F(18719)F(18731)F(18743)F(18749)F(18757)F(18773)F(18787)F(18793)F(18797)F(18803)F(18839)F(18859)F(18869)F(18899)F(18911)F(18913)F(18917)F(18919)F(18947)F(18959)F(18973)F(18979) F(19001)F(19009)F(19013)F(19031)F(19037)F(19051)F(19069)F(19073)F(19079)F(19081)F(19087)F(19121)F(19139)F(19141)F(19157)F(19163)F(19181)F(19183)F(19207)F(19211)F(19213)F(19219)F(19231)F(19237)F(19249) F(19259)F(19267)F(19273)F(19289)F(19301)F(19309)F(19319)F(19333)F(19373)F(19379)F(19381)F(19387)F(19391)F(19403)F(19417)F(19421)F(19423)F(19427)F(19429)F(19433)F(19441)F(19447)F(19457)F(19463)F(19469) F(19471)F(19477)F(19483)F(19489)F(19501)F(19507)F(19531)F(19541)F(19543)F(19553)F(19559)F(19571)F(19577)F(19583)F(19597)F(19603)F(19609)F(19661)F(19681)F(19687)F(19697)F(19699)F(19709)F(19717)F(19727) F(19739)F(19751)F(19753)F(19759)F(19763)F(19777)F(19793)F(19801)F(19813)F(19819)F(19841)F(19843)F(19853)F(19861)F(19867)F(19889)F(19891)F(19913)F(19919)F(19927)F(19937)F(19949)F(19961)F(19963)F(19973) F(19979)F(19991)F(19993)F(19997)F(20011)F(20021)F(20023)F(20029)F(20047)F(20051)F(20063)F(20071)F(20089)F(20101)F(20107)F(20113)F(20117)F(20123)F(20129)F(20143)F(20147)F(20149)F(20161)F(20173)F(20177) F(20183)F(20201)F(20219)F(20231)F(20233)F(20249)F(20261)F(20269)F(20287)F(20297)F(20323)F(20327)F(20333)F(20341)F(20347)F(20353)F(20357)F(20359)F(20369)F(20389)F(20393)F(20399)F(20407)F(20411)F(20431) 
F(20441)F(20443)F(20477)F(20479)F(20483)F(20507)F(20509)F(20521)F(20533)F(20543)F(20549)F(20551)F(20563)F(20593)F(20599)F(20611)F(20627)F(20639)F(20641)F(20663)F(20681)F(20693)F(20707)F(20717)F(20719) F(20731)F(20743)F(20747)F(20749)F(20753)F(20759)F(20771)F(20773)F(20789)F(20807)F(20809)F(20849)F(20857)F(20873)F(20879)F(20887)F(20897)F(20899)F(20903)F(20921)F(20929)F(20939)F(20947)F(20959)F(20963) F(20981)F(20983)F(21001)F(21011)F(21013)F(21017)F(21019)F(21023)F(21031)F(21059)F(21061)F(21067)F(21089)F(21101)F(21107)F(21121)F(21139)F(21143)F(21149)F(21157)F(21163)F(21169)F(21179)F(21187)F(21191) F(21193)F(21211)F(21221)F(21227)F(21247)F(21269)F(21277)F(21283)F(21313)F(21317)F(21319)F(21323)F(21341)F(21347)F(21377)F(21379)F(21383)F(21391)F(21397)F(21401)F(21407)F(21419)F(21433)F(21467)F(21481) F(21487)F(21491)F(21493)F(21499)F(21503)F(21517)F(21521)F(21523)F(21529)F(21557)F(21559)F(21563)F(21569)F(21577)F(21587)F(21589)F(21599)F(21601)F(21611)F(21613)F(21617)F(21647)F(21649)F(21661)F(21673) F(21683)F(21701)F(21713)F(21727)F(21737)F(21739)F(21751)F(21757)F(21767)F(21773)F(21787)F(21799)F(21803)F(21817)F(21821)F(21839)F(21841)F(21851)F(21859)F(21863)F(21871)F(21881)F(21893)F(21911)F(21929) F(21937)F(21943)F(21961)F(21977)F(21991)F(21997)F(22003)F(22013)F(22027)F(22031)F(22037)F(22039)F(22051)F(22063)F(22067)F(22073)F(22079)F(22091)F(22093)F(22109)F(22111)F(22123)F(22129)F(22133)F(22147) F(22153)F(22157)F(22159)F(22171)F(22189)F(22193)F(22229)F(22247)F(22259)F(22271)F(22273)F(22277)F(22279)F(22283)F(22291)F(22303)F(22307)F(22343)F(22349)F(22367)F(22369)F(22381)F(22391)F(22397)F(22409) F(22433)F(22441)F(22447)F(22453)F(22469)F(22481)F(22483)F(22501)F(22511)F(22531)F(22541)F(22543)F(22549)F(22567)F(22571)F(22573)F(22613)F(22619)F(22621)F(22637)F(22639)F(22643)F(22651)F(22669)F(22679) 
F(22691)F(22697)F(22699)F(22709)F(22717)F(22721)F(22727)F(22739)F(22741)F(22751)F(22769)F(22777)F(22783)F(22787)F(22807)F(22811)F(22817)F(22853)F(22859)F(22861)F(22871)F(22877)F(22901)F(22907)F(22921) F(22937)F(22943)F(22961)F(22963)F(22973)F(22993)F(23003)F(23011)F(23017)F(23021)F(23027)F(23029)F(23039)F(23041)F(23053)F(23057)F(23059)F(23063)F(23071)F(23081)F(23087)F(23099)F(23117)F(23131)F(23143) F(23159)F(23167)F(23173)F(23189)F(23197)F(23201)F(23203)F(23209)F(23227)F(23251)F(23269)F(23279)F(23291)F(23293)F(23297)F(23311)F(23321)F(23327)F(23333)F(23339)F(23357)F(23369)F(23371)F(23399)F(23417) F(23431)F(23447)F(23459)F(23473)F(23497)F(23509)F(23531)F(23537)F(23539)F(23549)F(23557)F(23561)F(23563)F(23567)F(23581)F(23593)F(23599)F(23603)F(23609)F(23623)F(23627)F(23629)F(23633)F(23663)F(23669) F(23671)F(23677)F(23687)F(23689)F(23719)F(23741)F(23743)F(23747)F(23753)F(23761)F(23767)F(23773)F(23789)F(23801)F(23813)F(23819)F(23827)F(23831)F(23833)F(23857)F(23869)F(23873)F(23879)F(23887)F(23893) F(23899)F(23909)F(23911)F(23917)F(23929)F(23957)F(23971)F(23977)F(23981)F(23993)F(24001)F(24007)F(24019)F(24023)F(24029)F(24043)F(24049)F(24061)F(24071)F(24077)F(24083)F(24091)F(24097)F(24103)F(24107) F(24109)F(24113)F(24121)F(24133)F(24137)F(24151)F(24169)F(24179)F(24181)F(24197)F(24203)F(24223)F(24229)F(24239)F(24247)F(24251)F(24281)F(24317)F(24329)F(24337)F(24359)F(24371)F(24373)F(24379)F(24391) F(24407)F(24413)F(24419)F(24421)F(24439)F(24443)F(24469)F(24473)F(24481)F(24499)F(24509)F(24517)F(24527)F(24533)F(24547)F(24551)F(24571)F(24593)F(24611)F(24623)F(24631)F(24659)F(24671)F(24677)F(24683) F(24691)F(24697)F(24709)F(24733)F(24749)F(24763)F(24767)F(24781)F(24793)F(24799)F(24809)F(24821)F(24841)F(24847)F(24851)F(24859)F(24877)F(24889)F(24907)F(24917)F(24919)F(24923)F(24943)F(24953)F(24967) 
F(24971)F(24977)F(24979)F(24989)F(25013)F(25031)F(25033)F(25037)F(25057)F(25073)F(25087)F(25097)F(25111)F(25117)F(25121)F(25127)F(25147)F(25153)F(25163)F(25169)F(25171)F(25183)F(25189)F(25219)F(25229) F(25237)F(25243)F(25247)F(25253)F(25261)F(25301)F(25303)F(25307)F(25309)F(25321)F(25339)F(25343)F(25349)F(25357)F(25367)F(25373)F(25391)F(25409)F(25411)F(25423)F(25439)F(25447)F(25453)F(25457)F(25463) F(25469)F(25471)F(25523)F(25537)F(25541)F(25561)F(25577)F(25579)F(25583)F(25589)F(25601)F(25603)F(25609)F(25621)F(25633)F(25639)F(25643)F(25657)F(25667)F(25673)F(25679)F(25693)F(25703)F(25717)F(25733) F(25741)F(25747)F(25759)F(25763)F(25771)F(25793)F(25799)F(25801)F(25819)F(25841)F(25847)F(25849)F(25867)F(25873)F(25889)F(25903)F(25913)F(25919)F(25931)F(25933)F(25939)F(25943)F(25951)F(25969)F(25981) F(25997)F(25999)F(26003)F(26017)F(26021)F(26029)F(26041)F(26053)F(26083)F(26099)F(26107)F(26111)F(26113)F(26119)F(26141)F(26153)F(26161)F(26171)F(26177)F(26183)F(26189)F(26203)F(26209)F(26227)F(26237) F(26249)F(26251)F(26261)F(26263)F(26267)F(26293)F(26297)F(26309)F(26317)F(26321)F(26339)F(26347)F(26357)F(26371)F(26387)F(26393)F(26399)F(26407)F(26417)F(26423)F(26431)F(26437)F(26449)F(26459)F(26479) F(26489)F(26497)F(26501)F(26513)F(26539)F(26557)F(26561)F(26573)F(26591)F(26597)F(26627)F(26633)F(26641)F(26647)F(26669)F(26681)F(26683)F(26687)F(26693)F(26699)F(26701)F(26711)F(26713)F(26717)F(26723) F(26729)F(26731)F(26737)F(26759)F(26777)F(26783)F(26801)F(26813)F(26821)F(26833)F(26839)F(26849)F(26861)F(26863)F(26879)F(26881)F(26891)F(26893)F(26903)F(26921)F(26927)F(26947)F(26951)F(26953)F(26959) F(26981)F(26987)F(26993)F(27011)F(27017)F(27031)F(27043)F(27059)F(27061)F(27067)F(27073)F(27077)F(27091)F(27103)F(27107)F(27109)F(27127)F(27143)F(27179)F(27191)F(27197)F(27211)F(27239)F(27241)F(27253) 
F(27259)F(27271)F(27277)F(27281)F(27283)F(27299)F(27329)F(27337)F(27361)F(27367)F(27397)F(27407)F(27409)F(27427)F(27431)F(27437)F(27449)F(27457)F(27479)F(27481)F(27487)F(27509)F(27527)F(27529)F(27539) F(27541)F(27551)F(27581)F(27583)F(27611)F(27617)F(27631)F(27647)F(27653)F(27673)F(27689)F(27691)F(27697)F(27701)F(27733)F(27737)F(27739)F(27743)F(27749)F(27751)F(27763)F(27767)F(27773)F(27779)F(27791) F(27793)F(27799)F(27803)F(27809)F(27817)F(27823)F(27827)F(27847)F(27851)F(27883)F(27893)F(27901)F(27917)F(27919)F(27941)F(27943)F(27947)F(27953)F(27961)F(27967)F(27983)F(27997)F(28001)F(28019)F(28027) F(28031)F(28051)F(28057)F(28069)F(28081)F(28087)F(28097)F(28099)F(28109)F(28111)F(28123)F(28151)F(28163)F(28181)F(28183)F(28201)F(28211)F(28219)F(28229)F(28277)F(28279)F(28283)F(28289)F(28297)F(28307) F(28309)F(28319)F(28349)F(28351)F(28387)F(28393)F(28403)F(28409)F(28411)F(28429)F(28433)F(28439)F(28447)F(28463)F(28477)F(28493)F(28499)F(28513)F(28517)F(28537)F(28541)F(28547)F(28549)F(28559)F(28571) F(28573)F(28579)F(28591)F(28597)F(28603)F(28607)F(28619)F(28621)F(28627)F(28631)F(28643)F(28649)F(28657)F(28661)F(28663)F(28669)F(28687)F(28697)F(28703)F(28711)F(28723)F(28729)F(28751)F(28753)F(28759) F(28771)F(28789)F(28793)F(28807)F(28813)F(28817)F(28837)F(28843)F(28859)F(28867)F(28871)F(28879)F(28901)F(28909)F(28921)F(28927)F(28933)F(28949)F(28961)F(28979)F(29009)F(29017)F(29021)F(29023)F(29027) F(29033)F(29059)F(29063)F(29077)F(29101)F(29123)F(29129)F(29131)F(29137)F(29147)F(29153)F(29167)F(29173)F(29179)F(29191)F(29201)F(29207)F(29209)F(29221)F(29231)F(29243)F(29251)F(29269)F(29287)F(29297) F(29303)F(29311)F(29327)F(29333)F(29339)F(29347)F(29363)F(29383)F(29387)F(29389)F(29399)F(29401)F(29411)F(29423)F(29429)F(29437)F(29443)F(29453)F(29473)F(29483)F(29501)F(29527)F(29531)F(29537)F(29567) 
F(29569)F(29573)F(29581)F(29587)F(29599)F(29611)F(29629)F(29633)F(29641)F(29663)F(29669)F(29671)F(29683)F(29717)F(29723)F(29741)F(29753)F(29759)F(29761)F(29789)F(29803)F(29819)F(29833)F(29837)F(29851) F(29863)F(29867)F(29873)F(29879)F(29881)F(29917)F(29921)F(29927)F(29947)F(29959)F(29983)F(29989)F(30011)F(30013)F(30029)F(30047)F(30059)F(30071)F(30089)F(30091)F(30097)F(30103)F(30109)F(30113)F(30119) F(30133)F(30137)F(30139)F(30161)F(30169)F(30181)F(30187)F(30197)F(30203)F(30211)F(30223)F(30241)F(30253)F(30259)F(30269)F(30271)F(30293)F(30307)F(30313)F(30319)F(30323)F(30341)F(30347)F(30367)F(30389) F(30391)F(30403)F(30427)F(30431)F(30449)F(30467)F(30469)F(30491)F(30493)F(30497)F(30509)F(30517)F(30529)F(30539)F(30553)F(30557)F(30559)F(30577)F(30593)F(30631)F(30637)F(30643)F(30649)F(30661)F(30671) F(30677)F(30689)F(30697)F(30703)F(30707)F(30713)F(30727)F(30757)F(30763)F(30773)F(30781)F(30803)F(30809)F(30817)F(30829)F(30839)F(30841)F(30851)F(30853)F(30859)F(30869)F(30871)F(30881)F(30893)F(30911) F(30931)F(30937)F(30941)F(30949)F(30971)F(30977)F(30983)F(31013)F(31019)F(31033)F(31039)F(31051)F(31063)F(31069)F(31079)F(31081)F(31091)F(31121)F(31123)F(31139)F(31147)F(31151)F(31153)F(31159)F(31177) F(31181)F(31183)F(31189)F(31193)F(31219)F(31223)F(31231)F(31237)F(31247)F(31249)F(31253)F(31259)F(31267)F(31271)F(31277)F(31307)F(31319)F(31321)F(31327)F(31333)F(31337)F(31357)F(31379)F(31387)F(31391) F(31393)F(31397)F(31469)F(31477)F(31481)F(31489)F(31511)F(31513)F(31517)F(31531)F(31541)F(31543)F(31547)F(31567)F(31573)F(31583)F(31601)F(31607)
#undef F
	if (x > 1)
		vec.push_back(x);
	return vec;
}

adamant

+40

You can do it without pasting an enormous sequence of ints...

code

#include <bits/stdc++.h>
using namespace std;

typedef uint32_t u32;

// The cold attribute tells the compiler that this function is unlikely to be
// called.  Without it, compilation will take much more time because the
// compiler tries to optimize the function calls.  Attributes are a GCC
// extension.
__attribute__((cold))
void factor_helper(vector<u32> &vec, u32 &x, u32 y)
{
	do {
		vec.push_back(y);
		x /= y;
	} while (x % y == 0);
}

const int maxp = 45000;
const int P = 4675; // number of primes below maxp

constexpr auto primes = []() constexpr {
    int idx = 0;
    array<int, P> res{};
    array<int, maxp> comp{};
    for(int p = 2; p < maxp; p++) {
        if(!comp[p]) {
            res[idx++] = p;
            for(int j = p; j < maxp; j += p) {
                comp[j] = 1;
            }
        }
    }
    return res;
}();

template<int L, int R>
constexpr void fact_p(auto& vec, u32 &x) {
    if(x % primes[L] == 0) {
        factor_helper(vec, x, primes[L]);
    }
    if constexpr(L + 1 < R) {
        fact_p<L+1, R>(vec, x);   
    }
}

vector<u32> factor(u32 x) {
	vector<u32> vec;
	fact_p<0, 800>(vec, x);
	fact_p<800, 1600>(vec, x);
	fact_p<1600, 2400>(vec, x);
	fact_p<2400, 3200>(vec, x);
	fact_p<3200, 4000>(vec, x);
	fact_p<4000, 4675>(vec, x);
	if (x > 1)
		vec.push_back(x);
	return vec;
}

int main() {
    for(int i = 1e9 - 1e6; i <= 1e9; i++) {
        factor(i);
    }
}

Runs in 2199 ms in custom run... I had to split it into chunks of 800, because CF has a limit of 900 on template recursion depth.

Wow that's nice! Guess I should learn more about constexpr and templates. Although the original version runs in 1341ms...

+32

I think the difference in execution time might be due to different bounds. I tried out the suggestion by ToxicPie9 below and adjusted the bounds:

#include <bits/stdc++.h>

using namespace std;

typedef uint32_t u32;

__attribute__((cold))
void factor_helper(auto &vec, u32 &x, u32 y) {
	do {
		vec.push_back(y);
		x /= y;
	} while (x % y == 0);
}

const int maxp = 31623;
const int P = 3401; // number of primes below maxp

constexpr auto primes = []() constexpr {
    int idx = 0;
    array<u32, P> res{};
    array<bool, maxp> comp{};
    for(int p = 2; p < maxp; p++) {
        if(!comp[p]) {
            res[idx++] = p;
            for(int j = p; j < maxp; j += p) {
                comp[j] = 1;
            }
        }
    }
    assert(idx == P);
    return res;
}();

template<int L = 0, int R = P, int M = (L + R) / 2>
constexpr void fact_p(auto &vec, u32 &x) {
    if constexpr (L + 1 < R) {
        fact_p<L, M>(vec, x);
        fact_p<M, R>(vec, x);
    } else if(x % primes[L] == 0) {
        factor_helper(vec, x, primes[L]);
    }
}

vector<u32> factor(u32 x) {
	vector<u32> vec;
	fact_p(vec, x);
	if (x > 1)
		vec.push_back(x);
	return vec;
}

int main() {
    for(int i = 1e9 - 1e6; i <= 1e9; i++) {
        factor(i);
    }
}

Runs in 1419ms now.

ToxicPie9

You can replace fact_p<L+1, R> with fact_p<L, (L+R)/2> and fact_p<(L+R)/2, R> to get $$$O(\log P)$$$ recursion depth, so it fits into the limit of 900.

i don't actually know how to write binary search. this is probably incorrect

template <int L, int R, int M = (L + R) / 2>
constexpr void fact_p(auto &vec, u32 &x) {
    if constexpr (L + 1 < R) {
        fact_p<L, M>(vec, x);
        fact_p<M, R>(vec, x);
    } else {
        if (x % primes[L] == 0) {
            factor_helper(vec, x, primes[L]);
        }
    }
}
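
To see why the binary split stays under the depth limit, the nesting depth can be computed at compile time. This is a minimal sketch: split_depth is a hypothetical helper that is not part of the code above, and it only mirrors the `L + 1 < R` / `M = (L + R) / 2` recursion; the exact instantiation depth the compiler counts may differ slightly.

```cpp
// Hypothetical helper: nesting depth of the fact_p<L, R> recursion above,
// which splits [L, R) at M = (L + R) / 2 until the range holds one index.
constexpr int split_depth(int L, int R) {
    if (L + 1 >= R) return 1;                  // leaf: a single prime index
    int M = (L + R) / 2;
    int left = split_depth(L, M);
    int right = split_depth(M, R);
    return 1 + (left > right ? left : right);  // one level plus the deeper half
}

// With P = 3401 primes (the value used in this thread) the nesting is
// only 13 levels deep, far below a limit of 900.
static_assert(split_depth(0, 3401) == 13, "depth grows like log2(P)");
```

By contrast, the linear fact_p<L+1, R> recursion nests once per prime, which is why it had to be cut into chunks.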

oToToT

constexpr + template are all you need

Here is my final version using C++ templates and constexpr (based on adamant's code here), so you don't have to paste 4000 prime numbers:

#include <bits/stdc++.h>
using namespace std;

using u32 = uint32_t;

__attribute__((noinline)) void factor_helper(vector<u32> &vec, u32 &x, u32 y) {
    do {
        vec.push_back(y);
        x /= y;
    } while (x % y == 0);
}

constexpr int maxp = 31623; // sqrt(1e9), you can adjust this
constexpr int P = 3401; // number of primes below maxp

constexpr auto primes = []() constexpr {
    int idx = 0;
    array<int, P> res{};
    array<int, maxp> comp{};
    for (int p = 2; p < maxp; p++) {
        if (!comp[p]) {
            res[idx++] = p;
            for (int j = p; j < maxp; j += p) {
                comp[j] = 1;
            }
        }
    }
    return res;
}();

template <int L, int R, int M = (L + R) / 2>
__attribute__((always_inline)) constexpr void fact_p(auto &vec, u32 &x) {
    if constexpr (L + 1 < R) {
        fact_p<L, M>(vec, x);
        fact_p<M, R>(vec, x);
    } else {
        if (x % primes[L] == 0) [[unlikely]] {
            factor_helper(vec, x, primes[L]);
        }
    }
}

vector<u32> factor(u32 x) {
    vector<u32> vec;
    fact_p<0, P>(vec, x);
    if (x > 1)
        vec.push_back(x);
    return vec;
}

// benchmark
int main() {
    for (int i = 1e9 - 1e6; i <= 1e9; i++) {
        factor(i);
    }
}

With standard optimizations (-O2), its performance matches the code in the blog. It uses templates and the always_inline attribute to generate code like a macro.

It took a lot of modifications to get a working one, and I needed some time to analyze why some versions are a lot slower than others. Below are some interesting technical facts; reader discretion is advised.

The key here is the noinline attribute (cold might also work), which prevents factor_helper from being inlined. Without it, not only does the compile time increase a lot, but the runtime also increases by about 6 times!

When factor_helper is not inlined, the compiled code has a structure like this:

void factor_helper(...) { ... }

vector<u32> factor(u32 x) {
    vector<u32> vec;
    if (x % 2 == 0) {
        factor_helper(vec, x, 2);
    }
    if (x % 3 == 0) {
        factor_helper(vec, x, 3);
    }
    if (x % 5 == 0) {
        factor_helper(vec, x, 5);
    }
    // ...
}

Here only the x % p == 0 checks are compiled to optimized operations, but that's OK because the divisions in factor_helper only happen $$$O(\log(x))$$$ times.
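
For context on why x % p == 0 is cheap when p is a constant: for an odd constant divisor, a divisibility test can be done with one multiplication and one comparison. The sketch below shows the idea for p = 3 on 32-bit unsigned values. The name divisible_by_3 is illustrative, and the exact instructions GCC emits may differ from this.

```cpp
#include <cstdint>

// Illustrative only: a division-free test for x % 3 == 0 on 32-bit values.
// 0xAAAAAAAB is the multiplicative inverse of 3 modulo 2^32
// (3 * 0xAAAAAAAB = 0x200000001), and 0x55555555 = (2^32 - 1) / 3.
// Multiplying by the inverse is a bijection modulo 2^32 that maps the
// multiples of 3 exactly onto 0..0x55555555, so one multiply and one
// compare are enough.
constexpr bool divisible_by_3(std::uint32_t x) {
    return x * UINT32_C(0xAAAAAAAB) <= UINT32_C(0x55555555);
}

static_assert(divisible_by_3(0) && divisible_by_3(999999999), "multiples of 3");
static_assert(!divisible_by_3(1000000000), "1e9 leaves remainder 1");
```

With a runtime divisor the compiler cannot precompute such constants, which is why the divisions inside factor_helper stay as ordinary division instructions.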

On the other hand, when factor_helper is inlined, the compiled code has a structure like this:

vector<u32> factor(u32 x) {
    vector<u32> vec;
    while (x % 2 == 0) {
        vec.push_back(2);
        x /= 2;
    }
    while (x % 3 == 0) {
        vec.push_back(3);
        x /= 3;
    }
    while (x % 5 == 0) {
        vec.push_back(5);
        x /= 5;
    }
    // ...
}

When push_back is also inlined, this results in an enormous (up to 470 kB) factor function, which is so large that cache misses start to have a significant effect. In adamant's case, where he manually inlined factor_helper, cache and branch misses caused the same number of instructions to take 5.5x as long to run. Took us a while to figure out what was going on :|

platelet

#pragma GCC unroll is faster to compile than C++ templates.

Wow I didn't know unroll can be used like this, thank you for the info. It also makes the code a lot shorter and cleaner.

Have you checked it in CF custom test?

I get compilation time out on G++20 and G++17 (64 bit), and on G++17 the runtime is 7878ms...

I didn't get a compile timeout in the custom test, even though it took a long time to compile.

OK, I found the problem. __attribute__((noinline)) must be placed on factor_helper for this to compile within the time limit:

#include<bits/stdc++.h>

using namespace std;

typedef uint32_t u32;

__attribute__((noinline))
void factor_helper(auto &vec, u32 &x, u32 y) {
    do {
        vec.push_back(y);
        x /= y;
    } while (x % y == 0);
}

const int maxp = 31623;
const int P = 3401; // number of primes below maxp

constexpr auto primes = []() constexpr {
    int idx = 0;
    array<u32, P> res{};
    array<bool, maxp> comp{};
    for(int p = 2; p < maxp; p++) {
        if(!comp[p]) {
            res[idx++] = p;
            for(int j = p; j < maxp; j += p) {
                comp[j] = 1;
            }
        }
    }
    assert(idx == P);
    return res;
}();

vector<u32> factor(u32 x) {
    vector<u32> vec;
    #pragma GCC unroll P
    for (int i = 0; i < P; i++)
        if (x % primes[i] == 0) [[unlikely]]
            factor_helper(vec, x, primes[i]);
    if (x > 1)
        vec.push_back(x);
    return vec;
}

int main() {
    for(int i = 1e9 - 1e6; i <= 1e9; i++) {
        factor(i);
    }
}

This runs in 1357ms.

jay_jayjay

Actually, you can do this even faster, because you don't need to check primes with $$$p \cdot p > x$$$. This is ~3-5 times faster than your solution (using recursive templates).
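
The early-exit idea can be illustrated with an ordinary loop. This is only a sketch of the stopping condition, not the recursive-template code mentioned above: it uses plain runtime trial division without the compile-time-constant trick, and factor_early_exit is a hypothetical name.

```cpp
#include <cstdint>
#include <vector>

using u32 = std::uint32_t;

// Hypothetical sketch: trial division that stops once p * p > x.
// When no divisor p with p * p <= x remains, x is either 1 or prime.
std::vector<u32> factor_early_exit(u32 x) {
    std::vector<u32> vec;
    // Try 2, then odd candidates only; the 64-bit product avoids overflow.
    for (u32 p = 2; static_cast<std::uint64_t>(p) * p <= x; p += (p == 2 ? 1 : 2)) {
        while (x % p == 0) {
            vec.push_back(p);
            x /= p;  // x shrinks, so the bound p * p <= x tightens as well
        }
    }
    if (x > 1)
        vec.push_back(x);  // leftover prime factor
    return vec;
}
```

For example, factor_early_exit(12) yields 2, 2, 3: after dividing out both factors of 2 the remaining x is 3, so the loop stops before testing p = 3 and the leftover 3 is appended as a prime.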
