https://github.com/mongodb/mongo-hadoop

20208a0 Merge pull request #160 from mongodb/DRIVERS-2036 28 January 2022, 19:28:02 UTC
2d59cfa DRIVERS-2036: EOL Notice Adding EOL notice before archiving repo. 28 January 2022, 19:25:49 UTC
cdcd0f1 Beautifying markdown headers 04 April 2017, 12:28:21 UTC
d16e574 Support continuous integration in Evergreen 30 March 2017, 17:49:30 UTC
09b824b BUMP 2.0.2 27 January 2017, 17:12:19 UTC
ddb01ff Update History.md for 2.0.2 release. 27 January 2017, 17:10:03 UTC
acc6270 Set split key min and max on MongoInputSplits created with createSplitFromBounds. This reverts some of the changes made by a2f662fe7d66bc5c5a4b8fc7219eb64e76100c39. 27 January 2017, 06:58:36 UTC
977c9a0 HADOOP-303 - Use the correct projection when selecting a column that maps to an embedded field within a document. (#143) Normalize nested field names to lower case. 27 January 2017, 01:12:54 UTC
a2f662f HADOOP-304 - Add a test case to cover constructing MongoInputSplit. Allow min/max split keys to be set from the Configuration in MIS. 27 January 2017, 01:10:34 UTC
b618dc9 Support MongoDB skip when creating MongoInputSplit 27 January 2017, 00:28:48 UTC
a6b9cb7 BUMP 2.0.1 -> 2.0.2.dev 24 January 2017, 22:17:06 UTC
4507e57 BUMP 2.0.1 30 August 2016, 19:50:24 UTC
fd8529d Update History.md for 2.0.1 release. 30 August 2016, 19:50:24 UTC
df080df HADOOP-295 - MongoPaginatingSplitter should set the noTimeout option on its cursor. 30 August 2016, 17:25:02 UTC
f6f74f9 BUMP 2.0.0 15 August 2016, 17:59:28 UTC
c3c6585 BUMP 2.0.0-rc0 28 July 2016, 22:33:16 UTC
d6ebf36 Merge branch '2.0-dev' Conflicts: History.md README.md build.gradle core/src/test/java/com/mongodb/hadoop/testutils/BaseHadoopTest.java 28 July 2016, 22:32:14 UTC
9c354a9 Update History.md for 2.0.0-rc0. 28 July 2016, 22:00:54 UTC
6b16268 Remove temporary directories, in addition to temporary files (HADOOP-292). 28 July 2016, 22:00:34 UTC
fb83e03 Sleep to allow more time for jobtracker to start. 21 July 2016, 20:34:23 UTC
b8d77de Try to use a local mongos host/port on InputSplits produced by ShardChunkMongoSplitter (HADOOP-202). 12 July 2016, 20:38:20 UTC
feced9f Fix MongoSplitterFactoryTest run against a sharded cluster. 20 June 2016, 22:29:05 UTC
4868153 Appease checkstyle. 20 June 2016, 21:08:22 UTC
0ab0d10 Add Powerrr to the list of CONTRIBUTORS. 20 June 2016, 21:08:22 UTC
667f514 use query projection in MongoPaginatingSplitter 20 June 2016, 20:36:57 UTC
d8d63ac Merge pull request #141 from Powerrr/paginating-splitter-projection Use query projection in MongoPaginatingSplitter 20 June 2016, 18:56:40 UTC
33d1560 use query projection in MongoPaginatingSplitter 16 June 2016, 11:16:34 UTC
5b649a5 Add SampleSplitter (HADOOP-283). SampleSplitter creates InputSplits based on the output of the $sample aggregation operator. This is a very inexpensive way to create splits on unsharded MongoDB collections without requiring special privileges as the 'splitVector' command does. SampleSplitter requires MongoDB 3.2+. 15 June 2016, 16:15:05 UTC
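The sample-based strategy described above can be sketched in a few lines of Python: the randomly sampled keys become sorted boundaries, and adjacent boundaries become split ranges. The helper name is hypothetical; the actual SampleSplitter is Java and issues a real `$sample` aggregation.

```python
def splits_from_sampled_keys(sampled_keys):
    """Turn a random sample of _id values into (min, max) split bounds.

    Mirrors the idea behind SampleSplitter: the sorted sample keys become
    split boundaries, with open-ended first/last ranges (None = unbounded).
    Hypothetical sketch, not the actual Java implementation.
    """
    bounds = sorted(sampled_keys)
    edges = [None] + bounds + [None]
    return list(zip(edges[:-1], edges[1:]))

print(splits_from_sampled_keys([40, 10, 30]))
# → [(None, 10), (10, 30), (30, 40), (40, None)]
```

Because `$sample` only reads a handful of documents, this is far cheaper than scanning the collection, which is why the commit notes it needs no special privileges.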
08c45fc HADOOP-236 - Add two new classes to support making updates from Hadoop streaming jobs: - MongoUpdateInputWriter - MongoUpdateOutputReader To use these classes (and thus specify that a job is for making updates), set `-io mongoUpdate` when launching the Hadoop streaming job. 13 June 2016, 16:28:47 UTC
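A streaming job using these classes might be launched roughly as follows. Only the `-io mongoUpdate` flag comes from the commit above; the jar names, script names, and other flags are placeholders that vary by installation.

```shell
# Hypothetical invocation — adjust jar paths and mapper/reducer scripts.
hadoop jar $HADOOP_PREFIX/share/hadoop/tools/lib/hadoop-streaming-*.jar \
    -libjars mongo-hadoop-streaming.jar \
    -io mongoUpdate \
    -mapper mapper.py \
    -reducer reducer.py \
    -inputformat com.mongodb.hadoop.mapred.MongoInputFormat \
    -outputformat com.mongodb.hadoop.mapred.MongoOutputFormat
```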
ae06f2e Make one GridFSInputFormatTest Hadoop-1.2 compatible. 10 June 2016, 23:33:26 UTC
eabb01d Restore support for Hadoop 1.2.X (HADOOP-246). 10 June 2016, 23:17:27 UTC
72626ab Support for reading from GridFS via GridFSInputFormat and GridFSSplit (HADOOP-272). 03 June 2016, 22:52:20 UTC
52f671e Add CONTRIBUTORS.md 06 May 2016, 17:09:02 UTC
b407177 Merge pull request #138 from emanresusername/2.0-dev unorderedBulkOperation support (HADOOP-279) 06 May 2016, 16:58:19 UTC
a4bdaf8 unorderedBulkOperation support 06 May 2016, 02:45:51 UTC
6a662c5 Support document replacement (HADOOP-263). 04 May 2016, 17:42:25 UTC
e99542e Update History.md 13 April 2016, 00:38:38 UTC
469f6e4 Ensure that BSONPickler and custom constructors are registered on every Spark node (HADOOP-273). 13 April 2016, 00:38:07 UTC
a5f869c Fix Spark version in README (HADOOP-275). 13 April 2016, 00:38:07 UTC
dc50df8 Clarify installation instructions and version compatibility in the README (HADOOP-275). 13 April 2016, 00:38:07 UTC
045ed4c Allow datetime.datetime objects to be read and written from Spark (HADOOP-274). 13 April 2016, 00:38:07 UTC
130ba1b BUMP 1.5.2 28 March 2016, 19:39:42 UTC
9764990 Update History.md 28 March 2016, 19:39:37 UTC
22cd303 Ensure that BSONPickler and custom constructors are registered on every Spark node (HADOOP-273). 28 March 2016, 18:36:46 UTC
aecd367 Fix Spark version in README (HADOOP-275). 24 March 2016, 20:58:38 UTC
75038b9 Clarify installation instructions and version compatibility in the README (HADOOP-275). 24 March 2016, 18:23:01 UTC
883b3e0 Allow datetime.datetime objects to be read and written from Spark (HADOOP-274). 18 March 2016, 20:06:44 UTC
2a43478 BUMP 1.5.2-SNAPSHOT 18 March 2016, 20:05:57 UTC
54b5ace BUMP 2.0.0-SNAPSHOT 09 March 2016, 18:40:21 UTC
8f2699a BUMP 1.5.1 09 March 2016, 18:09:22 UTC
d6f84ad Update History.md 09 March 2016, 18:07:09 UTC
f241321 Close MongoClients in MongoRecordWriter and MongoOutputCommitter (HADOOP-265). 03 March 2016, 19:48:42 UTC
0c06361 Don't allow null templates to be passed to JSONPigReplace.replaceAll() (HADOOP-266). 03 March 2016, 19:32:19 UTC
3f3b09c Allow users to set the limit on MongoInputSplits (HADOOP-267). 29 February 2016, 18:43:26 UTC
3f57880 BUMP 2.0.0-SNAPSHOT 23 February 2016, 18:13:33 UTC
c10a614 BUMP 1.5.0 23 February 2016, 18:03:34 UTC
750e52a Fix parameter name in JavaDoc. 23 February 2016, 18:03:34 UTC
02a043a Return null early in getTypeForBSON if input is null (HADOOP-255). 17 February 2016, 20:42:38 UTC
c0e49c5 BUMP 1.5.0-rc1-SNAPSHOT. 01 February 2016, 21:43:24 UTC
9d693eb BUMP 1.5.0-rc0 01 February 2016, 21:10:55 UTC
07fddd6 Update History.md 01 February 2016, 21:10:51 UTC
4b841ea Fix a test that uses a cursor after it is closed. 29 January 2016, 18:57:37 UTC
9dd9fa5 Add UDFs that permit storing BSON from Pig and extracting timestamp information from ObjectIds (HADOOP-76). 28 January 2016, 21:00:21 UTC
352ad53 Fix some tests that were broken with MongoDB < 2.6. 28 January 2016, 20:53:28 UTC
37e9917 Update the project to use the latest versions of Hive, Hadoop, Pig, Spark, and the MongoDB Java Driver (HADOOP-250). 27 January 2016, 22:52:03 UTC
9a46b62 Be able to infer FileSystem implementation from URI (HADOOP-253). 27 January 2016, 19:16:21 UTC
92a923f Close only thread-local clients with MongoConfigUtil.close() (HADOOP-243). 26 January 2016, 23:05:47 UTC
7758903 Create option 'mongo.input.splits.combine' for combining splits. Add MongoPaginatingSplitter (HADOOP-83). 22 January 2016, 19:15:59 UTC
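The pagination idea behind MongoPaginatingSplitter can be sketched as: walk the keys in query order and cut a new split every N documents. This is a hypothetical Python illustration of the bound calculation, not the Java implementation.

```python
def paginate_bounds(sorted_keys, docs_per_split):
    """Emit (lower, upper) split bounds every `docs_per_split` documents.

    None means unbounded on that side, so the first split has no lower
    bound and the last split has no upper bound. Sketch only.
    """
    bounds = []
    lower = None
    for i in range(docs_per_split, len(sorted_keys), docs_per_split):
        upper = sorted_keys[i]
        bounds.append((lower, upper))
        lower = upper
    bounds.append((lower, None))
    return bounds

print(paginate_bounds(list(range(10)), 4))
# keys 0..9, 4 docs per split → [(None, 4), (4, 8), (8, None)]
```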
4271291 Update HADOOP_HOME to HADOOP_PREFIX in README. 08 January 2016, 18:07:17 UTC
96ac8e5 Amend a few minor details about the Enron emails Spark example: - Include output from "core" project in example fat jar. - Use accessor methods for getting tuple data in Java. - Some minor cleanup of comments/style to appease checkstyle. 07 December 2015, 22:18:25 UTC
53467ad Formatting corrections for compliance with style guide 03 December 2015, 22:05:56 UTC
9ef021f New Spark examples, including Dataframes & SparkSQL, using Enron email dataset. 03 December 2015, 21:26:33 UTC
9905150 Add Mariano Semelman to the list of contributors in README.md. 16 November 2015, 22:05:26 UTC
1ff5085 Run the 'splitVector' command on the same database from which we want the splits (HADOOP-238). 16 November 2015, 22:05:18 UTC
9d38854 HADOOP-242 - Support compression in mapred package. 09 November 2015, 18:42:08 UTC
3aa106d Use full path to 'hdfs' binary in enronEmails task. 04 November 2015, 21:18:17 UTC
2b7a0d7 Update pymongo-spark's README to reflect that mongo-hadoop-spark.jar has not yet been released. 04 November 2015, 16:31:04 UTC
103574a Don't use $eq operator in tests, since it's not available in older server versions. 30 October 2015, 17:56:14 UTC
f5d51eb Allow projections to be pushed down to MongoDB from Pig (HADOOP-167). 22 October 2015, 17:18:38 UTC
0dcf8ad Don't need to build a tests jar for the spark subproject in order to run the tests. 22 October 2015, 17:04:38 UTC
9d7516a Throw a RuntimeException if MongoRecordWriter cannot open an OutputStream (HADOOP-235). 16 October 2015, 17:56:27 UTC
6cdb43f Fix a broken link in the README. 07 October 2015, 22:32:38 UTC
affad1b Add support for PySpark (HADOOP-187). This adds a "spark" module to the project, which compiles into "mongo-hadoop-spark.jar". This jar is currently only necessary if you want to use PySpark with mongo-hadoop. Additionally, this adds the pymongo_spark module, which provides the necessary objects and methods to use mongo-hadoop and PySpark together on the Python-side. 07 October 2015, 22:30:13 UTC
5642d65 Support compressed BSON files in BSONFileInputFormat (HADOOP-71). BSONFileInputFormat can now read files compressed with any of the codecs included with Hadoop. Additionally, BSONSplitter can be run as an executable program to split, compress, and upload BSON files to HDFS, or any other file system supported by Hadoop. 07 October 2015, 20:13:50 UTC
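What makes splitting raw .bson files feasible is that every BSON document begins with a little-endian int32 giving its own total length (the prefix and the trailing 0x00 byte included), so a splitter can walk document boundaries without decoding any fields. A minimal Python sketch of that scan, not the actual BSONSplitter code:

```python
import struct

def iter_bson_documents(data: bytes):
    """Yield raw BSON documents from a concatenated .bson byte stream.

    Relies only on the int32 length prefix at the start of each document;
    no field decoding is needed. Sketch only.
    """
    offset = 0
    while offset < len(data):
        (length,) = struct.unpack_from("<i", data, offset)
        yield data[offset:offset + length]
        offset += length

empty_doc = b"\x05\x00\x00\x00\x00"                          # {}: length prefix + terminator
int_doc = b"\x0c\x00\x00\x00\x10a\x00\x01\x00\x00\x00\x00"   # {"a": 1}
assert list(iter_bson_documents(empty_doc + int_doc)) == [empty_doc, int_doc]
```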
75c6e53 Push down some projections and queries from Hive to MongoDB (HADOOP-90). Any binary operator supported by IndexPredicateAnalyzer can be part of a pushdown predicate to MongoDB. Other operators are currently unsupported, so more advanced filtering is done Hadoop-side. 07 October 2015, 20:03:31 UTC
fdc37e5 Specify options in a "properties" file which is read by MongoStorageHandler (HADOOP-216). Set the path to the properties file with the "mongo.properties.path" table property in Hive. 07 October 2015, 18:25:03 UTC
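Based on the commit above, usage would look roughly like this; the storage-handler class and `mongo.properties.path` come from the project, while the example table, file path, and the properties-file contents are illustrative.

```sql
-- Hive table whose MongoDB options live in an external properties file
-- (e.g. /etc/hive/mongo.properties containing a line such as
--  mongo.uri=mongodb://localhost:27017/test.users — illustrative).
CREATE TABLE users (name STRING)
STORED BY 'com.mongodb.hadoop.hive.MongoStorageHandler'
TBLPROPERTIES ('mongo.properties.path' = '/etc/hive/mongo.properties');
```

Keeping options in a file avoids repeating connection details, including credentials, in every DDL statement.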
e4df9eb BUMP 1.5.0-SNAPSHOT 30 September 2015, 00:38:40 UTC
2214f05 BUMP 1.4.1 30 September 2015, 00:20:59 UTC
9000ba0 Update documentation links in build.gradle. 30 September 2015, 00:20:59 UTC
ec1ff3d Update History.md. 30 September 2015, 00:18:19 UTC
9c84080 Fix test that parses Strings into Timestamps (HADOOP-226). 29 September 2015, 22:44:41 UTC
766922b Use zero-args constructors for MongoOutputCommitter, so that they can be provided to -D mapred.output.committer.class=XXX (HADOOP-231) 29 September 2015, 18:57:03 UTC
3dafe6e Appease checkstyle. 15 September 2015, 00:27:40 UTC
f9724fa Fix some merge artifacts related to merge of PR #131 (add min/max parameters to splitVector command). 15 September 2015, 00:22:18 UTC
b377dbd Clean up changes and simplify checks 15 September 2015, 00:22:18 UTC
da2bfb9 Fixed confusion with expected BSONObject 15 September 2015, 00:22:18 UTC
7be32b3 support for min/max parameters for splitter. 15 September 2015, 00:22:18 UTC
0537b44 remove old, non-existent repository checkstyle fixes 09 September 2015, 17:23:31 UTC
bd167a7 Convert Strings that represent Hive Timestamps to Timestamp automatically, if the schema requires (HADOOP-226). 31 August 2015, 17:03:48 UTC
5338040 Do not log the full MongoDB connection string, so that credentials cannot show up in Hadoop logs (HADOOP-219). 14 August 2015, 16:41:26 UTC