9f20b6b | Matei Zaharia | 04 October 2010, 03:28:20 UTC | Added reduceByKey operation for RDDs containing pairs | 04 October 2010, 03:28:20 UTC |
34ecced | root | 03 October 2010, 05:06:06 UTC | Fixed a rather bad bug in HDFS files that has been in for a while: caching was not working because Split objects did not have a consistent toString value | 03 October 2010, 05:06:06 UTC |
b6debf5 | Matei Zaharia | 29 September 2010, 17:59:01 UTC | Merge branch 'matei-logging' | 29 September 2010, 17:59:01 UTC |
f50b23b | Matei Zaharia | 29 September 2010, 17:04:00 UTC | Increase default locality wait to 3s. Fixes #20. | 29 September 2010, 17:04:00 UTC |
a7c0e2a | Matei Zaharia | 29 September 2010, 07:22:11 UTC | Made task-finished log messages slightly nicer | 29 September 2010, 07:22:11 UTC |
40f6914 | Matei Zaharia | 29 September 2010, 07:22:09 UTC | Made spark-executor output slightly nicer | 29 September 2010, 07:22:09 UTC |
0d28bdc | Matei Zaharia | 29 September 2010, 07:10:46 UTC | A couple of minor fixes: - Don't include trailing $'s in class names of Scala objects - Report errors using logError instead of printStackTrace | 29 September 2010, 07:10:46 UTC |
0fa70a6 | Matei Zaharia | 29 September 2010, 06:58:19 UTC | Updated log4j.properties to ignore jetty messages below WARN level | 29 September 2010, 06:58:19 UTC |
7090dea | Matei Zaharia | 29 September 2010, 06:22:07 UTC | Changed printlns to log statements and fixed a bug in run that was causing it to fail on a Mesos cluster | 29 September 2010, 06:54:29 UTC |
516248a | Matei Zaharia | 29 September 2010, 06:20:23 UTC | Added log4j.properties | 29 September 2010, 06:22:39 UTC |
332c8b8 | Matei Zaharia | 29 September 2010, 06:16:28 UTC | Removed Hadoop's SLF4J jars | 29 September 2010, 06:16:28 UTC |
db623de | Matei Zaharia | 29 September 2010, 06:12:23 UTC | Added Logging trait | 29 September 2010, 06:12:23 UTC |
c7d233b | Matei Zaharia | 29 September 2010, 06:08:01 UTC | Added log4j jars and paths | 29 September 2010, 06:08:01 UTC |
e5e9ede | Matei Zaharia | 29 September 2010, 05:43:04 UTC | Merge branch 'http-repl-class-serving' | 29 September 2010, 05:43:04 UTC |
e068f21 | Matei Zaharia | 29 September 2010, 05:32:38 UTC | More work on HTTP class loading | 29 September 2010, 05:32:38 UTC |
7ef3a20 | Matei Zaharia | 29 September 2010, 00:55:11 UTC | Modified the interpreter to serve classes to the executors using a Jetty HTTP server instead of a shared (NFS) file system. | 29 September 2010, 00:55:11 UTC |
b749f0e | Justin Ma | 29 September 2010, 00:28:54 UTC | fixed typo in printing which task is already finished | 29 September 2010, 00:28:54 UTC |
366c09c | Justin Ma | 13 September 2010, 22:30:22 UTC | Let's use future instead of actors | 13 September 2010, 22:30:22 UTC |
0896fd6 | Justin Ma | 12 September 2010, 16:01:44 UTC | Added fork()/join() operations for SparkContext, as well as corresponding changes to MesosScheduler to support multiple ParallelOperations. | 12 September 2010, 16:01:44 UTC |
6f0d2c1 | Justin Ma | 07 September 2010, 21:03:59 UTC | round robin scheduling of tasks has been added | 07 September 2010, 21:03:59 UTC |
e9ffe6c | Justin Ma | 01 September 2010, 20:31:06 UTC | now adding the Split object. | 01 September 2010, 20:31:06 UTC |
7a9ff1c | Justin Ma | 31 August 2010, 19:08:09 UTC | - Got rid of 'Split' type parameter in RDD - Added SampledRDD, SplitRDD and CartesianRDD - Made Split a class rather than a type parameter - Added numCores() to Scheduler to help set default level of parallelism | 31 August 2010, 19:08:09 UTC |
ea8c278 | Justin Ma | 18 August 2010, 22:59:35 UTC | now we have sampling with replacement (at least on a per-split basis) | 18 August 2010, 22:59:35 UTC |
156bccb | Justin Ma | 18 August 2010, 22:25:57 UTC | HdfsFile.scala: added a try/catch block to exit gracefully for correupted gzip files MesosScheduler.scala: formatted the slaveOffer() output to include the serialized task size RDD.scala: added support for aggregating RDDs on a per-split basis (aggregateSplit()) as well as for sampling without replacement (sample()) | 18 August 2010, 22:25:57 UTC |
75b2ca1 | Matei Zaharia | 17 August 2010, 06:16:35 UTC | Removed HOD from included Hadoop because it was making the project count as Python on GitHub :|. | 17 August 2010, 06:16:35 UTC |
1cbffaa | Matei Zaharia | 16 August 2010, 01:33:27 UTC | Modified Scala interpreter to have it avoid computing string versions of all results when :silent is enabled, so that it is easier to work with large arrays in Spark. (The string version of an array of numbers might not fit in memory even though the array itself does.) | 16 August 2010, 01:33:27 UTC |
1600c31 | Matei Zaharia | 14 August 2010, 02:03:46 UTC | Added latest mesos.jar | 14 August 2010, 02:03:46 UTC |
0b19592 | Matei Zaharia | 14 August 2010, 01:54:32 UTC | Improved README and added blank templates for config files. | 14 August 2010, 01:54:32 UTC |
3d8d7fd | Matei Zaharia | 13 August 2010, 18:29:19 UTC | Bug fix from Justin | 13 August 2010, 18:29:19 UTC |
a9481c3 | root | 13 August 2010, 07:39:36 UTC | Update to work with latest Mesos API changes | 13 August 2010, 07:39:36 UTC |
4488b3b | Matei Zaharia | 09 August 2010, 23:46:14 UTC | Fixed a bug where we would incorrectly decide we've finished a parallel operation if Mesos tells us a task is finished twice | 09 August 2010, 23:46:14 UTC |
f415b07 | Matei Zaharia | 06 August 2010, 19:07:18 UTC | Change shell framework's name to "Spark shell" | 06 August 2010, 19:07:26 UTC |
0e6e577 | Matei Zaharia | 26 July 2010, 03:54:56 UTC | Add Mesos native library to .gitignore | 26 July 2010, 03:54:56 UTC |
b56ed67 | Matei Zaharia | 26 July 2010, 03:53:46 UTC | Updated code to work with Nexus->Mesos name change | 26 July 2010, 03:53:46 UTC |
4239f76 | Matei Zaharia | 26 July 2010, 03:46:44 UTC | Removed Matei's old start on broadcast code | 26 July 2010, 03:46:44 UTC |
e240e38 | Matei Zaharia | 26 July 2010, 01:10:03 UTC | Updated a bunch of libraries, and increased the default memory in run so that unit tests can run successfully. | 26 July 2010, 01:10:03 UTC |
0435de9 | Matei Zaharia | 20 July 2010, 01:00:30 UTC | Made it possible to set various Spark options and environment variables in general through a conf/spark-env.sh script. | 20 July 2010, 01:00:30 UTC |
edad598 | Justin Ma | 19 July 2010, 22:03:49 UTC | Updated Spark to run with latest Mesos build and Scala-2.8.0.final. | 19 July 2010, 22:03:49 UTC |
7d0eae1 | Matei Zaharia | 27 June 2010, 22:21:54 UTC | Merge branch 'dev' Conflicts: src/scala/spark/HdfsFile.scala src/scala/spark/NexusScheduler.scala src/test/spark/repl/ReplSuite.scala | 27 June 2010, 22:21:54 UTC |
6aacaa6 | root | 18 June 2010, 23:24:18 UTC | Made Spark shell class directory configurable. | 18 June 2010, 23:24:18 UTC |
323571a | Matei Zaharia | 18 June 2010, 19:54:33 UTC | Initial work on union operation. | 18 June 2010, 19:54:33 UTC |
b541988 | Matei Zaharia | 17 June 2010, 20:19:02 UTC | Added appropriate hashCode, equals and toString to ParallelArraySplit. | 17 June 2010, 20:19:02 UTC |
cd247b7 | Matei Zaharia | 17 June 2010, 19:49:42 UTC | Created common RDD superclass for distributed files and parallel arrays. This also means that parallel arrays now get all the functionality files used to have (filter, map, reduce, cache, etc). | 17 June 2010, 19:49:42 UTC |
77103ea | Matei Zaharia | 11 June 2010, 21:55:23 UTC | Fixed README | 11 June 2010, 21:55:23 UTC |
0d9c51d | Matei Zaharia | 11 June 2010, 17:03:01 UTC | Added back REPL tests | 11 June 2010, 17:03:01 UTC |
e58fba2 | Matei Zaharia | 11 June 2010, 08:18:43 UTC | Fix junk stripper | 11 June 2010, 08:18:43 UTC |
396f48e | Matei Zaharia | 11 June 2010, 08:10:03 UTC | New interpreter port for Scala 2.8 interpreter | 11 June 2010, 08:10:03 UTC |
4eb39e0 | Matei Zaharia | 11 June 2010, 05:41:23 UTC | New nexus.jar | 11 June 2010, 05:41:23 UTC |
1473987 | Matei Zaharia | 11 June 2010, 05:36:45 UTC | Fixed classpath for tests | 11 June 2010, 05:36:45 UTC |
359e84c | Matei Zaharia | 11 June 2010, 05:09:13 UTC | Use new Nexus API | 11 June 2010, 05:09:13 UTC |
92246c8 | Matei Zaharia | 11 June 2010, 04:50:55 UTC | Initial work on 2.8 port | 11 June 2010, 04:50:55 UTC |
c177a54 | Matei Zaharia | 11 June 2010, 01:08:59 UTC | Ignore .DS_Store | 11 June 2010, 01:08:59 UTC |
1c90a32 | root | 30 April 2010, 22:41:21 UTC | Fix native build to use build directory | 30 April 2010, 22:41:21 UTC |
06aac8a | Matei Zaharia | 04 April 2010, 06:44:55 UTC | Imported changes from old repository (mostly Mosharaf's work, plus some fault tolerance code). | 04 April 2010, 06:44:55 UTC |
df29d0e | Matei Zaharia | 29 March 2010, 23:17:55 UTC | Initial commit | 29 March 2010, 23:17:55 UTC |