Playground/1brc - 1brc - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Dr Ian Preston	0bd1675571	Down to 14s locally (#583 ) Use flat array for stats. Use simd for line termination Co-authored-by: Ian Preston <ianopolous@protonmail.com>	2024-01-25 23:03:05 +01:00
Arman Sharif	d5cedd6a35	armandino: minimise hash collisions + other improvements (#585 )	2024-01-25 22:59:18 +01:00
Van Phu DO	271bdfb032	Simplify Node class with less field, improve hash mix speed (#584 ) * Simplify Node class with less field, improve hash mix speed * remove some ops, a bit faster * more inline, little bit faster but not sure	2024-01-25 22:57:04 +01:00
gabrielfoo	ce9455a584	gabrielfoo's first attempt (#556 ) * first attempt * formatting fix --------- Co-authored-by: Gabriel <gabriel@gabriel>	2024-01-25 22:46:40 +01:00
Vemana	80cd738b4b	C style code. Should be ~4secs or lower based on local testing. (#559 ) 1. Use Unsafe 2. Fit hashtable in L2 cache. 3. If we can find a good hash function, it can fit in L1 cache even. 4. Improve temperature parsing by using a lookup table	2024-01-25 22:40:14 +01:00
Gunnar Morling	561717a986	Update README.md	2024-01-24 20:37:10 +01:00
Gunnar Morling	0b762ee0b0	Fixing Thomas' JDK version	2024-01-24 12:18:01 +01:00
Gunnar Morling	7a9b88f328	Leaderboard	2024-01-24 10:57:28 +01:00
Gunnar Morling	055916e997	Leaderboard update	2024-01-24 10:56:03 +01:00
Alexander Yastrebov	4daeb94b04	Go implementation by AlexanderYastrebov (#298 ) * Go implementation by AlexanderYastrebov This is a proof-of-concept to demonstrate non-java submission. It requires Docker with BuildKit plugin to build and export binary. Updates * #67 * #253 * Use collision-free id lookup * Use number of buckets greater than max number of keys	2024-01-23 20:49:28 +01:00
Laake Scates-Gervasi	464ba6209b	Laake Scates-Gervasi first submission (justplainlaake) [2.5s execution, locally similar time to top 5] (#431 ) * Init Push * organize imports * Add OpenMap * Best outcome yet * Create prepare script and calculate script for native image, also add comments on calculation * Remove extra hashing, and need for the set array * Commit formatting changes from build * Remove unneeded device information * Make shell scripts executable, add hash collision double check for equality * Add hash collision double check for equality * Skip multithreading for small files to improve small file performance	2024-01-23 20:44:57 +01:00
Roy van Rijn	7ced63f460	Rewrote to always read 16 bytes, this has less instructions on perf. (#562 ) It doesn't make a lot of sense since quite some code can be written shorter, but this is what gives the best numbers.	2024-01-23 20:37:09 +01:00
3j5a	a9a05599cd	CalculateAverage_3j5a off-the-shelf Java components + ArraysSupport (#566 ) * off the shell Java components, curious about official runtime results. thnx my laptop results are around 12 seconds, e.g: 87.66user 1.32system 0:12.11elapsed 734%CPU (0avgtext+0avgdata 13980924maxresident)k Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Address sizes: 39 bits physical, 48 bits virtual Byte Order: Little Endian CPU(s): 8 On-line CPU(s) list: 0-7 Vendor ID: GenuineIntel Model name: Intel(R) Core(TM) i5-8400H CPU @ 2.50GHz * off-the-shelf Java components... curious about official runtime results. thnx laptop results are around 11 seconds, e.g: ./calculate_average_3j5a.sh 81.46s user 1.36s system 758% cpu 10.917 total Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Address sizes: 39 bits physical, 48 bits virtual Byte Order: Little Endian CPU(s): 8 On-line CPU(s) list: 0-7 Vendor ID: GenuineIntel Model name: Intel(R) Core(TM) i5-8400H CPU @ 2.50GHz * off-the-shelf Java components + ArraysSupport.. laptop results are around 10.2 seconds, e.g: ./calculate_average_3j5a.sh 75.02s user 1.31s system 750% cpu 10.175 total Architecture: x86_64 CPU op-mode(s): 32-bit, 64-bit Address sizes: 39 bits physical, 48 bits virtual Byte Order: Little Endian CPU(s): 8 On-line CPU(s) list: 0-7 Vendor ID: GenuineIntel Model name: Intel(R) Core(TM) i5-8400H CPU @ 2.50GHz * method handle... * full buffer read attempt * MH * MH cleanup	2024-01-23 20:31:45 +01:00
Mathias Bjerke	337642d1ec	1brc contribution from mattiz (first attempt) (#567 ) * Contribution from mattiz * Formatted code	2024-01-23 20:28:58 +01:00
karthikeyan97	292edc629f	fine tuning performance further (#526 ) * final comit changing using mappedbytebuffer changes before using unsafe address using unsafe * using graalvm,correct unsafe mem implementation --------- Co-authored-by: Karthikeyans <karthikeyan.sn@zohocorp.com>	2024-01-23 20:21:52 +01:00
Roman Musin	6c0949969a	Native image + a few smaller optimisations (#564 ) * Inline parsing name and station to avoid constantly updating the offset field (-100ms) * Remove Worker class, inline the logic into lambda * Accumulate results in an int matrix instead of using result row (-50ms) * Use native image	2024-01-23 20:19:07 +01:00
yourwass	ba793e88cd	Add Yourwass take on the challenge (#532 ) * Uses vector api for city name parsing and for hash index collision resolution * Uses lookup tables for temperature parsing	2024-01-23 20:04:55 +01:00
Yann Moisan	b3420d9348	improvements (#521 ) - inline computeIfAbsent - replace arraycopy by copyOfRange Co-authored-by: Yann Moisan <yann@zen.ly>	2024-01-23 19:58:10 +01:00
Parker Timmins	c886aaba34	Deploy v2 for parkertimmins (#524 ) * Deploy v2 for parkertimmins Main changes: - fix hash which masked incorrectly - do station equality check in simd - make station array length multiple of 32 - search for newline rather than semicolon * Fix bug - entries were being skipped between batches At the boundary between two batches, the first batch would stop after crossing a limit with a padding of 200 characters applied. The next batch should then start looking for the first full entry after the padding. This padding logic had been removed when starting a batch. For this reason, entries starting in the 200 character padding between batches were skipped.	2024-01-23 19:43:34 +01:00
Artsiom Korzun	d858959097	parse value before going to map (#548 ) parse value before going to map	2024-01-23 19:38:32 +01:00
Gaurav Anantrao Deshmukh	46d2058bd4	First optimal solution attempt (#539 ) * First optimal attempt * Removing debug lines * Using default string equals method --------- Co-authored-by: Gaurav Deshmukh <deshmgau@amazon.com>	2024-01-23 19:32:03 +01:00
Gerd Aschemann	c232346e87	#540 Cache SDKman (#554 ) IMPORTANT: Only use SDKman provided Java - System JDK no longer installed!	2024-01-23 19:13:13 +01:00
Jeevjyot Singh Chhabda	415b3eb5c5	b1rc challenge by @jeevjyot (#551 ) * b1rc challenge * fixed a rounding error * added the file back * fixed file * removed a file --------- Co-authored-by: Jeevjyot Singh Chhabda <jeevjyotsinghchhabda@Jeevjyots-MBP.hsd1.ca.comcast.net>	2024-01-23 18:43:58 +01:00
Jaromir Hamala	2c432abb96	jerrinot's improvement - fast-path for short keys (#545 ) * fast-path for keys<16 bytes * fix off by one error the mask is wrong for he 2nd word when len == 16 * less chunks per thread seems like compact code wins. on my test box anyway.	2024-01-23 18:29:22 +01:00
Marek Kohn	2be84e09ee	Add 1brc solution by @makohn (#544 )	2024-01-23 18:26:53 +01:00
Rene Schwietzke	6cf6da0c17	Added environment docs (#555 ) * Create ENVIRONMENT.md * More intel added	2024-01-23 18:21:36 +01:00
Gunnar Morling	95c0fb06b0	Leaderboard update	2024-01-23 18:16:12 +01:00
Van Phu DO	98a8279669	use thomaswue trick, use parallelism, slightly faster (#560 )	2024-01-23 16:41:25 +01:00
Dr Ian Preston	8bae1b8781	Use simd for name comparison (#568 ) Co-authored-by: Ian Preston <ianopolous@protonmail.com>	2024-01-23 16:37:33 +01:00
Antonio Muñoz	f7febea2f6	tonivade implementation (try 2) (#541 ) * tonivade implementation * synchronized block performs better than ReentrantLock * remove ConcurrentHashMap * refactor * use HashMap.newHashMap * change double to int * minor refactor * fix	2024-01-23 16:35:05 +01:00
Elliot Barlas	df891354d3	Inline and optimize value parsing code for each of the four semicolon position processing branches. This provides a small but noticeable speed-up. It also expands and obfuscates the code, unfortunately. (#563 )	2024-01-23 16:31:04 +01:00
Sarkie	e16ad7660d	Update README.md to add the _baseline (#552 )	2024-01-22 11:00:01 +01:00
Jason Nochlin	b374643a1f	evaluate.sh: Add note for "using Unsafe" (#547 ) * fix typo * automatically label entries using Unsafe * fix for entry in src/main/java-22/ * backfill leaderboard --------- Co-authored-by: Jason Nochlin <hundredwatt@users.noreply.github.com>	2024-01-22 09:20:26 +01:00
Li Lin	d2639b7ce1	Add linl33's implementation (#503 ) * Add linl33's implementation * Update evaluate.sh --------- Co-authored-by: Gunnar Morling <gunnar.morling@googlemail.com>	2024-01-21 21:14:05 +01:00
Gunnar Morling	81963f963c	Leaderboard update	2024-01-21 20:39:20 +01:00
Artsiom Korzun	ac4805ee45	subprocess spawner (#542 )	2024-01-21 20:23:48 +01:00
Roy van Rijn	d8b071c878	Reverting ByteBuffer idea, using Thomas's trick instead. (#538 )	2024-01-21 20:15:07 +01:00
Thomas Wuerthinger	d0a28599c2	Tuning and subprocess spawn for thomaswue (#533 ) * Some clean up, small-scale tuning, and reduce complexity when handling longer names. * Do actual work in worker subprocess. Main process returns immediately and OS clean up of the mmap continues in the subprocess. * Update minor Graal version after CPU release. * Turn GC back to epsilon GC (although it does not seem to make a difference). * Minor tuning for another +1%.	2024-01-21 20:13:48 +01:00
Artsiom Korzun	6b95ac6113	optimize branches (#534 )	2024-01-21 19:54:43 +01:00
Elliot Barlas	47269cf30b	Adjust rolling hash function to operate at int-scale rather than byte-scale. Ensure 8-byte alignment in key buffer for faster comparisons. (#523 )	2024-01-21 19:38:32 +01:00
Gunnar Morling	d135bd9521	Leaderboard update	2024-01-21 18:01:52 +01:00
Roman Musin	7bfc7eaec6	Reduce allocations and heap size (#525 ) * Reduce allocations * Shrink the heap size * Calculate hash when reading name (50-100ms difference) * no need to reverse bytes * bump heap size	2024-01-21 18:01:23 +01:00
Gunnar Morling	3e1951379a	#104 Running tests for PRs	2024-01-21 17:44:35 +01:00
Gunnar Morling	38c3e0f1ee	Leaderboard update	2024-01-21 13:43:40 +01:00
Van Phu DO	410425c833	reorganize code, little bit faster (#509 )	2024-01-21 13:25:18 +01:00
kumarsaurav123	8ba67cbc6d	Use Array to store results instead of grouping by and custom class (#522 )	2024-01-21 12:50:36 +01:00
adri	0a7726cc64	Improving first iteration by avoiding string creation as much as possible (#516 ) - It avoids creating unnecessary Strings objects and handles with the station names with its djb2 hashes instead - Initializes hashmaps with capacity and load factor - Adds -XX:+AlwaysPreTouch	2024-01-20 21:27:34 +01:00
Gunnar Morling	36ffed1315	Leaderboard update	2024-01-20 21:02:06 +01:00
giovannicuccu	2c1264def9	Solution without unsafe (#507 ) Co-authored-by: Giovanni Cuccu <gcuccu@imolainformatica.it>	2024-01-20 21:01:43 +01:00
Shivam Agarwal	f06de5faab	Add 0xshivamagarwal Implementation (#508 ) * 0xshivamagarwal implementation * . --------- Co-authored-by: Shivam Agarwal <>	2024-01-20 20:54:04 +01:00

... 3 4 5 6 7 ...

704 Commits