https://github.com/apache/spark

Revision Message Commit Date
f707856 Removed java-opts.template 24 May 2011, 22:59:01 UTC
cec427e Fixed a bug with preferred locations having changed meaning in new RDDs 23 May 2011, 00:12:29 UTC
4c888b2 Fix queue type for executor 22 May 2011, 23:42:05 UTC
bea3a33 doc tweak 22 May 2011, 23:03:41 UTC
9bde5a5 class loader fix 22 May 2011, 23:00:41 UTC
91c07a3 Various fixes to serialization 22 May 2011, 05:50:08 UTC
f61b61c Merge branch 'master' into new-rdds 22 May 2011, 04:25:58 UTC
24a1e7f Scheduler can now recover from lost map outputs 20 May 2011, 07:19:53 UTC
82329b0 Updated scheduler to support running on just some partitions of final RDD 19 May 2011, 19:47:09 UTC
328e51b Various minor fixes 19 May 2011, 18:19:25 UTC
fd1d255 Stop objectifying various trackers, caches, etc. 17 May 2011, 19:41:13 UTC
4db50e2 Fixed unit tests by making them clean up the SparkContext after use and thus clean up the various singletons (RDDCache, MapOutputTracker, etc). This isn't perfect yet (ideally we shouldn't use singleton objects at all) but we can fix that later. 13 May 2011, 19:03:58 UTC
aca8150 Ensure that AddedToCache messages make it home before tasks finish 13 May 2011, 18:43:52 UTC
16c886a Optimization for count() 13 May 2011, 17:41:34 UTC
4b1f0f1 Merge pull request #48 from ankurdave/bagel-new Bagel: Large-scale graph processing on Spark 13 May 2011, 04:34:38 UTC
f40a089 Rename bagel to spark.bagel and Pregel to Bagel 09 May 2011, 22:23:21 UTC
7e20648 Upgraded to SBT 0.7.5 09 May 2011, 21:48:39 UTC
4bedf5b Merge pull request #47 from ankurdave/cache-to-disk Merging in Ankur's code for a cache that spills to disk 09 May 2011, 21:22:56 UTC
c110405 Move shortest path and PageRank to bagel.examples 04 May 2011, 01:53:58 UTC
563c5e7 Refactor and add aggregator support Refactored out the agg() and comp() methods from Pregel.run. Defined an implicit conversion to allow applications that don't use aggregators to avoid including a null argument for the result of the aggregator in the compute function. 03 May 2011, 22:40:45 UTC
c18fa3e Package combiner functions into a trait 03 May 2011, 22:40:41 UTC
1c8ca0e Add Bagel test suite Note: This test suite currently fails for the same reason that the Spark Core test suite fails: Spark currently seems to have a bug where any test after the first one fails. 03 May 2011, 22:40:31 UTC
c5b3ea7 Clean up Bagel source and interface 03 May 2011, 22:40:01 UTC
19122af Update ShortestPath to work with controllable partitioning 03 May 2011, 22:39:39 UTC
45ec9db Add Bagel classpath to run script 03 May 2011, 22:39:21 UTC
62ef620 Clean up Pregel.run, add logging 03 May 2011, 22:38:01 UTC
c0736f6 Add Bagel, an implementation of Pregel on Spark 03 May 2011, 22:37:08 UTC
a4c04f3 Error handling for disk I/O in DiskSpillingCache Also renamed the property spark.DiskSpillingCache.cacheDir to spark.diskSpillingCache.cacheDir in order to follow conventions. 28 April 2011, 06:23:29 UTC
12ff0d2 Bring an entry back into memory after fetching it from disk 28 April 2011, 05:59:05 UTC
e30313a Added DiskSpillingCache DiskSpillingCache is a BoundedMemoryCache that spills entries to disk when it runs out of space. Currently the implementation is very simple. In particular, it's missing the following features: - Error handling for disk I/O, including checking of disk space levels - Bringing an entry back into memory after fetching it from disk In addition, here are some features that aren't critical but should be implemented soon: - Spilling based on a user-set priority in addition to LRU - Caching into a subdirectory of spark.DiskSpillingCache.cacheDir rather than the root directory 28 April 2011, 05:32:35 UTC
9d2d533 Temporary fix for issue #42. 22 April 2011, 00:40:26 UTC
5c95352 fixed small bug when classpath has some strange formatting 19 April 2011, 00:12:29 UTC
94ba95b Added flatMapValues 13 April 2011, 02:51:58 UTC
d840fa8 Merge remote branch 'origin/custom-serialization' into new-rdds 09 March 2011, 08:40:07 UTC
ff5b137 Some tweaks to make Kryo cache work better 09 March 2011, 08:31:50 UTC
7febdfb Better reuse of buffers in Kryo serialization 08 March 2011, 20:36:36 UTC
8ee3ec2 Merge remote branch 'origin/custom-serialization' into new-rdds 08 March 2011, 19:58:19 UTC
7408230 Updated modified Kryo to use objenesis 08 March 2011, 19:58:08 UTC
ab1216c Register None and Nil properly 08 March 2011, 19:52:58 UTC
d39f5dd Merge remote branch 'origin/custom-serialization' into new-rdds 08 March 2011, 18:28:50 UTC
4f0d0a7 stuff 08 March 2011, 18:28:26 UTC
8b6f3db Merge remote branch 'origin/custom-serialization' into new-rdds 08 March 2011, 03:20:28 UTC
38f6bce Added SerializingCache 08 March 2011, 03:16:24 UTC
6316c79 Remove some logging 08 March 2011, 02:56:36 UTC
e7b4b04 Added pluggable serializers and Kryo serialization 08 March 2011, 02:41:53 UTC
467f056 Remove commented code 07 March 2011, 07:38:41 UTC
bce95b8 Finished cogroup stuff 07 March 2011, 07:38:16 UTC
04c2d6a stuff 07 March 2011, 03:27:03 UTC
0fb691d Various fixes to get MesosScheduler working with new RDDs 07 March 2011, 00:16:38 UTC
1df5a65 Pass cache locations correctly to DAGScheduler. 06 March 2011, 20:16:38 UTC
e1436f1 Merge remote branch 'origin/master' into new-rdds 06 March 2011, 19:11:47 UTC
370b958 Added sampling for large arrays in SizeEstimator 06 March 2011, 19:11:20 UTC
a789e9a Merge remote branch 'origin/master' into new-rdds 01 March 2011, 18:33:37 UTC
021c50a Remove unnecessary lock which was there to work around a bug in Configuration in Hadoop 0.20.0 01 March 2011, 18:28:38 UTC
adaba4d Removed old slf4j jars that came with Hadoop 01 March 2011, 18:28:21 UTC
447debb Updated Hadoop to 0.20.2 to include some bug fixes 01 March 2011, 18:27:48 UTC
9e59afd More work on new RDD design 28 February 2011, 03:15:52 UTC
f38f86d More stuff 27 February 2011, 22:27:12 UTC
2e6023f stuff 27 February 2011, 07:41:44 UTC
309367c Initial work towards new RDD design 27 February 2011, 07:15:33 UTC
dc24aec Close record readers in HadoopFile after finishing a split 10 February 2011, 20:07:48 UTC
62f1c6f Remove build.properties from version control 09 February 2011, 19:52:56 UTC
d3df963 Brought in some reorganization of build file from Hive branch 09 February 2011, 05:27:36 UTC
e8df4bb Added more SBT stuff to gitignore 09 February 2011, 01:06:07 UTC
26b77ae Increased SBT mem to 700 MB so that unit tests run more nicely 09 February 2011, 01:03:28 UTC
99f3f23 Changed default shuffle to LocalFileShuffle because it's way faster for small files 09 February 2011, 01:03:03 UTC
f4f7aa2 formatting 09 February 2011, 00:39:17 UTC
ee60aaa Added a pointer to wiki in readme 09 February 2011, 00:38:10 UTC
c1c766a Updated readme 03 February 2011, 03:21:49 UTC
50df43b Added SBT target for building a single JAR with Spark Core and its dependencies 03 February 2011, 03:08:14 UTC
a11fe23 Moved examples to spark.examples package 03 February 2011, 00:30:27 UTC
8217060 Added IntelliJ's build directory to gitignore 02 February 2011, 08:30:29 UTC
ec28b60 Merge branch 'master' into sbt Conflicts: Makefile core/src/main/java/spark/compress/lzf/LZF.java core/src/main/java/spark/compress/lzf/LZFInputStream.java core/src/main/java/spark/compress/lzf/LZFOutputStream.java core/src/main/native/spark_compress_lzf_LZF.c run 02 February 2011, 08:25:54 UTC
7f74ee9 Added support for IntelliJ IDEA 02 February 2011, 08:08:13 UTC
e5c4cd8 Made examples and core subprojects 01 February 2011, 23:11:08 UTC
817e722 Merge branch 'master' of github.com:mesos/spark 16 December 2010, 03:40:35 UTC
14c29c1 Fixed import 16 December 2010, 03:40:27 UTC
5cafdd7 Removed some unused imports from Broadcast.scala 16 December 2010, 03:11:23 UTC
be0ce57 - Fixed a compilation error due to wrong 'import' of legacy lzf libraries in DfsBroadcast.scala - Updated to use ning libraries. - Passes all unit tests 16 December 2010, 02:34:27 UTC
5c222db Merge branch 'master' into mos-bt Conflicts: src/scala/spark/Broadcast.scala 15 December 2010, 18:57:39 UTC
0a5c24a - Default broadcast mechanism is set to DfsBroadcast - Configuration parameters are renamed to follow our convention - Master now automatically supplies its hostAddress instead of reading from config file - sendBroadcast has been removed from the Broadcast trait 13 December 2010, 22:36:39 UTC
3439573 Someone forgot to pass the parameters: fixes SPARK_MEM set from main script but not passed to executor. 12 December 2010, 21:30:49 UTC
0d895ba Added BSD license 07 December 2010, 18:32:17 UTC
06dc4a5 - Removed config files from git's control. - Changed DfsShuffle to default in RDD.scala. 07 December 2010, 18:17:47 UTC
799c1b1 Adding license file for compress-lzf 07 December 2010, 16:30:29 UTC
2fb8495 Replacing the native lzf compression code with the ning open-source compress-lzf library. (Apache 2.0 license) 06 December 2010, 05:20:15 UTC
1013a01 Made the root Broadcast object initialization method 'synchronized'. 02 December 2010, 10:36:21 UTC
a69638d Removed deprecated repl file ClassServer.scala 01 December 2010, 02:45:04 UTC
b7dda4c Merge branch 'multi-tracker' into mos-bt Conflicts: conf/java-opts src/scala/spark/Broadcast.scala src/scala/spark/DfsBroadcast.scala src/scala/spark/SparkContext.scala 01 December 2010, 02:29:38 UTC
815ecd3 Made Broadcast Pluggable. Finally! 01 December 2010, 02:08:49 UTC
ea28a30 Makes transfer faster for small swarms (with size < MaxRxPeers) 01 December 2010, 00:13:56 UTC
191af65 - Using the new Cache implementation. 30 November 2010, 04:35:47 UTC
c9cad03 - Using the new Cache implementation. - Removed unused code related to dualMode (deprecated). 30 November 2010, 04:30:38 UTC
73714da Made LocalFileShuffle to be the default. 30 November 2010, 03:19:21 UTC
c62f2fe Made LocalFileShuffle to be the default. 30 November 2010, 03:17:01 UTC
15f1f68 Variable renaming. 29 November 2010, 22:15:36 UTC
bb3e7fb Variable renaming. 29 November 2010, 22:13:35 UTC
1157380 Removed unused third party libraries. 29 November 2010, 20:21:46 UTC
1f7fcb4 Merge branch 'master' into mos-bt Conflicts: Makefile run src/scala/spark/Broadcast.scala src/scala/spark/Executor.scala src/scala/spark/HdfsFile.scala src/scala/spark/MesosScheduler.scala src/scala/spark/RDD.scala src/scala/spark/SparkContext.scala src/scala/spark/Split.scala src/scala/spark/Utils.scala src/scala/spark/repl/SparkInterpreter.scala third_party/mesos.jar 29 November 2010, 20:10:06 UTC
9e93f8d Moved broadcast implementations to separate files. Ready to merge with 'master' 29 November 2010, 19:43:20 UTC