* Read the file with multiple virtual threads and process chunks of file data in parallel.
* Updated logic to bucket every chunk of aggregates into a vector and merge them into a TreeMap for printing (see the sketch below).
* Virtual Thread / File Channels Impl.
* Renamed files with GHUsername.
* Added statement to get values before updating.
* Added executable permission to the files.
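The two items above describe the overall shape of this entry: split the file into chunks, aggregate each chunk on its own virtual thread, bucket the partial results, and merge them into a TreeMap so stations print in sorted order. A minimal sketch of that structure, assuming Java 21; the class and method names are illustrative, and `aggregateChunk` is left as a stub rather than the submission's actual parsing code:

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ChunkedAverages {

    // Per-station aggregate; merged across chunks at the end.
    record Agg(double min, double max, double sum, long count) {
        Agg merge(Agg o) {
            return new Agg(Math.min(min, o.min), Math.max(max, o.max),
                    sum + o.sum, count + o.count);
        }
    }

    public static void main(String[] args) throws Exception {
        Path file = Path.of("measurements.txt");
        int chunks = Runtime.getRuntime().availableProcessors();

        try (FileChannel ch = FileChannel.open(file, StandardOpenOption.READ);
             ExecutorService pool = Executors.newVirtualThreadPerTaskExecutor()) {
            long size = ch.size();
            long chunkSize = size / chunks;
            // "Bucket every chunk of aggs into a vector": one future per chunk.
            List<Future<Map<String, Agg>>> parts = new ArrayList<>();
            for (int i = 0; i < chunks; i++) {
                long start = i * chunkSize;
                long end = (i == chunks - 1) ? size : start + chunkSize;
                // One virtual thread per chunk; each produces its own partial map.
                parts.add(pool.submit(() -> aggregateChunk(ch, start, end)));
            }
            // Merge the partial maps into a TreeMap so stations come out
            // in sorted order for printing.
            Map<String, Agg> merged = new TreeMap<>();
            for (Future<Map<String, Agg>> part : parts) {
                part.get().forEach((k, v) -> merged.merge(k, v, Agg::merge));
            }
            System.out.println(merged);
        }
    }

    // Placeholder: read [start, end) via ch.read(buffer, position), realign to
    // line boundaries, and accumulate one Agg per station name.
    static Map<String, Agg> aggregateChunk(FileChannel ch, long start, long end) throws IOException {
        return Map.of();
    }
}
```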
* Latest snapshot (#1)
preparing initial version
* Improved performance to 20 seconds (-9 seconds from the previous version) (#2)
improved performance a bit
* Improved performance to 14 seconds (-6 seconds) (#3)
improved performance to 14 seconds
* sync branches (#4)
* initial commit
* some refactoring of methods
* some fixes for partitioning
* fixed hacky getcode for utf8 bytes
* simplified getcode for partitioning
* temp solution with syncing
* new stream processing
* some improvements
* cleaned stuff
* run configuration
* round buffer for the stream to pages
* not using compute since it's slower than straightforward get/put. using own byte array equals. (see the sketch after this list)
* using parallel gc
* avoid copying bytes when creating a station object
* formatting
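The "not using compute" note above is a concrete micro-optimization: replace `map.compute(...)` with a plain get-then-put against a mutable accumulator, and give the byte-array key its own `equals`. A hedged illustration of that pattern; the `Station` and `Acc` names are invented for the sketch, not taken from the submission:

```java
import java.util.HashMap;
import java.util.Map;

class GetPutExample {

    // Mutable accumulator, so a map hit updates in place with no allocation.
    static final class Acc {
        double min = Double.POSITIVE_INFINITY, max = Double.NEGATIVE_INFINITY, sum;
        long count;
    }

    // Key over raw name bytes with its own equals, per the commit note.
    static final class Station {
        final byte[] name;
        final int hash;
        Station(byte[] name, int hash) { this.name = name; this.hash = hash; }

        @Override public int hashCode() { return hash; }

        @Override public boolean equals(Object o) {
            if (!(o instanceof Station s) || s.name.length != name.length) return false;
            // "own byte array equals": a plain loop over the bytes
            for (int i = 0; i < name.length; i++) {
                if (name[i] != s.name[i]) return false;
            }
            return true;
        }
    }

    final Map<Station, Acc> map = new HashMap<>();

    void add(Station key, double value) {
        // Plain get-then-put, which the commit found faster than map.compute(...).
        Acc acc = map.get(key);
        if (acc == null) {
            acc = new Acc();
            map.put(key, acc);
        }
        acc.min = Math.min(acc.min, value);
        acc.max = Math.max(acc.max, value);
        acc.sum += value;
        acc.count++;
    }
}
```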
* Copy less arrays. Improved performance to 12.7 seconds (-2 seconds) (#5)
* some tuning to increase performance
* avoid copying data; fast hashCode with slightly more collisions
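"avoid copying data; fast hashCode with slightly more collisions" suggests hashing the station name in place in the input buffer rather than copying it into a `String` first. A sketch of that trade-off, assuming names are byte ranges inside a read buffer:

```java
final class FastHash {
    /**
     * Hash a station name in place, without copying it out of the input buffer.
     * A cheap multiply-add over the raw bytes is faster than String.hashCode
     * on a copied string, at the cost of slightly more collisions, which the
     * map's equality check then has to resolve.
     */
    static int hash(byte[] buf, int start, int end) {
        int h = end - start; // seed with the length, as a later entry also does
        for (int i = start; i < end; i++) {
            h = h * 31 + buf[i];
        }
        return h;
    }
}
```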
* cleanup (#6)
* tidy up
* Initial submission for jonathan_aotearoa
* Fixing typos
* Adding hyphens to prepare and calculate shell scripts so that they're aligned with my GitHub username.
* Making chunk processing more robust in an attempt to fix the cause of the build error.
* Fixing typo.
* Fixed the handling of files less than 8 bytes in length.
* Additional assertion, comment improvements.
* Refactoring to improve testability. Additional assertion and comments.
* Updating collision checking to include checking whether the station name is equal (see the sketch below).
* Minor refactoring to make param ordering consistent.
* Adding a custom toString method for the results map.
* Fixing collision checking bug
* Fixing rounding bug.
* Fixing collision bug.
---------
Co-authored-by: jonathan <jonathan@example.com>
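Several of the fixes above ("Updating collision checking...", "Fixing collision checking bug", "Fixing collision bug") circle the same pitfall: in a custom hash table, a matching hash alone does not identify a station. A minimal sketch of a corrected probe loop, assuming an open-addressing table whose entries keep the name bytes; names and layout are illustrative:

```java
import java.util.Arrays;

class ProbeExample {
    static final class Entry {
        final byte[] name;
        final int hash;
        Entry(byte[] name, int hash) { this.name = name; this.hash = hash; }
    }

    final Entry[] table = new Entry[1 << 16]; // power of two for cheap masking

    Entry find(byte[] name, int hash) {
        int mask = table.length - 1;
        int i = hash & mask;
        while (true) {
            Entry e = table[i];
            if (e == null) {
                return null; // empty slot: not present (insertion would go here)
            }
            // The bug class being fixed: checking only e.hash == hash. Two
            // different names can share a hash, so the bytes must match too.
            if (e.hash == hash && Arrays.equals(e.name, name)) {
                return e;
            }
            i = (i + 1) & mask; // probe the next slot
        }
    }
}
```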
* CalculateAverage_pdrakatos
* Rename to be valid with rules
* Changes to script execution
* Fixing bugs causing scripts not to be executed
* Changes to the prepare script to make it compatible
* Fixes so that all tests pass
* Increase direct memory allocation buffer
* Fixing memory problem causing heap space exception
* Initial impl
* Fix bad file descriptor error in the `calculate_average_serkan-ozal.sh`
* Disable Epsilon GC and rely on the default GC, because JIT and Epsilon GC apparently don't play well together on the eval machine for short-lived Vector API `ByteVector` objects
* Take care of byte order before processing key length with bit shift operators
* Fix key equality check for long keys
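The last two commits concern word-at-a-time key handling: fix the byte order at the load before deriving the key length with shifts, and mask the tail word when comparing keys longer than 8 bytes. A sketch of both ideas, assuming little-endian processing and the FFM `MemorySegment` API; not the submission's actual code:

```java
import java.lang.foreign.MemorySegment;
import java.lang.foreign.ValueLayout;
import java.nio.ByteOrder;

class LongKeyExample {
    static final ValueLayout.OfLong LE_LONG =
            ValueLayout.JAVA_LONG_UNALIGNED.withOrder(ByteOrder.LITTLE_ENDIAN);

    // Fix the byte order at the load, so the shift/mask logic that derives
    // the key length sees the same layout on every platform.
    static long readWord(MemorySegment seg, long offset) {
        return seg.get(LE_LONG, offset);
    }

    // Equality for the key's final word: only the low `len` bytes belong to
    // the key, so mask the rest away before comparing. Comparing full words
    // here is the long-key equality bug being fixed.
    static boolean tailEquals(long a, long b, int len) {
        long mask = (len >= 8) ? -1L : (1L << (len * 8)) - 1;
        return (a & mask) == (b & mask);
    }
}
```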
/**
* Solution based on thomaswue's solution, commit:
* commit d0a28599c2
* Author: Thomas Wuerthinger <thomas.wuerthinger@oracle.com>
* Date: Sun Jan 21 20:13:48 2024 +0100
*
* Changes:
* 1) Use LinkedBlockingQueue to store partial results, which
will then be merged into the final map later.
As different chunks finish at different times, this allows
processing them as they finish, instead of joining the
threads sequentially.
This change seems more useful for the 10k dataset, as the
runtime difference between chunks is greater.
* 2) Use only 4 threads if the file is >= 14GB.
This showed much better results in my local test, but I only
ran with 200 million rows (because of limited RAM), and I have
no idea how it will perform on the 1brc HW.
*/
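Change 1 in the header above replaces sequential joins with a queue, so partial results are merged in completion order. A condensed sketch of that pattern; the types are simplified (aggregate as a `double[]` of {min, max, sum, count}), not the submission verbatim:

```java
import java.util.Map;
import java.util.TreeMap;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

class MergeAsTheyFinish {
    // Each worker put()s its partial map here the moment its chunk is done.
    static final BlockingQueue<Map<String, double[]>> RESULTS = new LinkedBlockingQueue<>();

    // take() consumes partial results in completion order, rather than
    // joining threads one by one in launch order.
    static Map<String, double[]> merge(int chunkCount) throws InterruptedException {
        Map<String, double[]> merged = new TreeMap<>();
        for (int i = 0; i < chunkCount; i++) {
            Map<String, double[]> partial = RESULTS.take();
            partial.forEach((station, a) -> merged.merge(station, a, (x, y) ->
                    new double[]{ Math.min(x[0], y[0]), Math.max(x[1], y[1]),
                                  x[2] + y[2], x[3] + y[3] }));
        }
        return merged;
    }
}
```

The benefit noted in the header applies when chunk runtimes vary widely: the merge thread never blocks on a slow chunk while faster ones sit finished.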
* fix test rounding, pass 10K station names
* improved integer conversion, delayed string creation.
* new hash algorithm, use ConcurrentHashMap
* fix rounding test
* added the length of the string in the hash initialization.
* fix hash code collisions
* cleanup prepare script
* native image options
* fix quadratic probing (no change to perf)
* mask to get the last chunk of the name (see the sketch below)
* extract hash functions
* tweak the probing loop (-100ms)
* fiddle with native image options
* Reorder conditions in hope it makes branch predictor happier
* extracted constant
* Improve hash function
* remove limit on number of cores
* fix calculation of boundaries between chunks
* fix IOOBE
---------
Co-authored-by: Jason Nochlin <hundredwatt@users.noreply.github.com>
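The masking commit above ("mask to get the last chunk of the name") is about word-at-a-time name handling: when the final 8-byte word of a name is only partly occupied, the bytes at and past the `;` must be zeroed before hashing or comparing. A sketch assuming little-endian words; the mixing constant is illustrative, and the length-seeded hash mirrors "added the length of the string in the hash initialization":

```java
class NameMaskExample {
    // Keep only the low `len` bytes of the name's last word, so hashing and
    // equality see only name bytes, never what follows the separator.
    static long maskLastChunk(long word, int len) {
        if (len <= 0) return 0L;
        if (len >= 8) return word;
        return word & ((1L << (len * 8)) - 1);
    }

    // Length-seeded hash over the name's words.
    static int nameHash(long[] words, int lenBytes) {
        long h = lenBytes;
        for (long w : words) {
            h = h * 0x9E3779B97F4A7C15L + w; // cheap mix; exact constant is a placeholder
        }
        return (int) (h ^ (h >>> 32));
    }
}
```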
* Contribution by albertoventurini
* Shave off a couple hundred milliseconds by making an assumption about temperature readings
* Parse reading without a loop, inspired by other solutions (see the sketch below)
* Use all cores
* Small improvements, only allocate 247 positions instead of 256
---------
Co-authored-by: Alberto Venturini <alberto.venturini@accso.de>
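"Parse reading without a loop" exploits the challenge's input format: a temperature always matches `-?\d{1,2}\.\d`, so the digits can be picked out positionally instead of looped over. A sketch of the general trick over a byte array; not necessarily the contributor's exact code:

```java
class ReadingParseExample {
    /**
     * Parse a temperature like "-12.3" or "7.4" into tenths of a degree
     * (e.g. -123, 74) without a digit loop: the format guarantees exactly
     * one fractional digit and at most two integer digits.
     */
    static int parseTenths(byte[] buf, int start, int end /* exclusive, at line end */) {
        int sign = 1;
        if (buf[start] == '-') { sign = -1; start++; }
        // end - 1 is the fractional digit; end - 2 is the '.'
        int value = buf[end - 1] - '0';        // tenths digit
        value += (buf[end - 3] - '0') * 10;    // units digit
        if (end - 4 >= start) {                // optional tens digit
            value += (buf[end - 4] - '0') * 100;
        }
        return sign * value;
    }
}
```

Working in integer tenths also sidesteps floating-point accumulation during aggregation; division by 10 happens only once, at output time.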
* Update with Rounding Bugfix
* Simplification of Merging Results
* More Plain Java Code for Value Storage
* Improve Performance by Stupid Hash
Drop around 3 seconds on my machine by
simplifying the hash to be ridiculously stupid,
but faster (see the sketch below).
* Fix outdated comment
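As a hedged guess at what a "ridiculously stupid, but faster" hash can look like in this challenge: a couple of cheap operations over the first word of the name, accepting extra collisions in exchange for far less per-byte work. Purely illustrative, not the contributor's function:

```java
class StupidHashExample {
    // Deliberately simplistic: mix the name's first 8 bytes with its length
    // and call it done. More collisions, far fewer instructions per key.
    static int stupidHash(long firstWord, int len) {
        long h = firstWord ^ (firstWord >>> 29) ^ len;
        return (int) (h ^ (h >>> 32));
    }
}
```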
* Dmitry challenge
* Dmitry submit 2.
Use a MemorySegment from FileChannel and Unsafe
to read bytes from disk. 4-second speedup in local test,
from 20s to 16s.
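The entry above maps the file through FileChannel into a MemorySegment and then reads bytes via Unsafe. A sketch of that setup, assuming the FFM API (final in Java 22, preview in Java 21) and reflective access to `sun.misc.Unsafe`; shown as the commit describes it, not as a recommendation:

```java
import java.lang.foreign.Arena;
import java.lang.foreign.MemorySegment;
import java.lang.reflect.Field;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

class MappedUnsafeRead {
    public static void main(String[] args) throws Exception {
        // Map the whole file; the segment's address stays valid for the arena's lifetime.
        try (FileChannel ch = FileChannel.open(Path.of("measurements.txt"), StandardOpenOption.READ);
             Arena arena = Arena.ofConfined()) {
            MemorySegment seg = ch.map(FileChannel.MapMode.READ_ONLY, 0, ch.size(), arena);

            // Grab sun.misc.Unsafe reflectively (it has no public constructor).
            Field f = sun.misc.Unsafe.class.getDeclaredField("theUnsafe");
            f.setAccessible(true);
            sun.misc.Unsafe unsafe = (sun.misc.Unsafe) f.get(null);

            // Read raw bytes straight from the mapped address, skipping the
            // bounds checks that the segment accessors would perform.
            long base = seg.address();
            byte first = unsafe.getByte(base);
            System.out.println("first byte: " + (char) first);
        }
    }
}
```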