Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
172 commits
Select commit Hold shift + click to select a range
55b1b32
[GraphX] Improve LiveJournalPageRank example
Mar 8, 2015
f7c7992
[EC2] [SPARK-6188] Instance types can be mislabeled when re-starting …
thvasilo Mar 9, 2015
70f8814
[Docs] Replace references to SchemaRDD with DataFrame
rxin Mar 9, 2015
3cac199
[SPARK-5310][Doc] Update SQL Programming Guide to include DataFrames.
rxin Mar 9, 2015
8767565
[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
Mar 9, 2015
9a0272f
[SPARK-6177][MLlib]Add note in LDA example to remind possible coalesce
hhbyyh Mar 10, 2015
c4c4b07
[SPARK-6087][CORE] Provide actionable exception if Kryo buffer is not…
levkhomich Mar 10, 2015
d14df06
[SPARK-6191] [EC2] Generalize ability to download libs
nchammas Mar 10, 2015
7c7d2d5
[SPARK-6186] [EC2] Make Tachyon version configurable in EC2 deploymen…
Mar 10, 2015
74fb433
Minor doc: Remove the extra blank line in data types javadoc.
rxin Mar 11, 2015
2672374
[SPARK-5183][SQL] Update SQL Docs with JDBC and Migration Guide
marmbrus Mar 11, 2015
2d4e00e
[SPARK-5986][MLLib] Add save/load for k-means
yinxusen Mar 11, 2015
517975d
[SPARK-4924] Add a library for launching Spark jobs programmatically.
Mar 11, 2015
35b2564
[MINOR] [DOCS] Fix map -> mapToPair in Streaming Java example
srowen Mar 11, 2015
40f4979
[SQL][Minor] fix typo in comments
liuhb86 Mar 11, 2015
ec30c17
[SPARK-6279][Streaming]In KafkaRDD.scala, Miss expressions flag "s" a…
zzcclp Mar 11, 2015
6e94c4e
SPARK-6225 [CORE] [SQL] [STREAMING] Resolve most build warnings, 1.3.…
srowen Mar 11, 2015
5b335bd
[SPARK-6228] [network] Move SASL classes from network/shuffle to netw…
Mar 11, 2015
548643a
[SPARK-4423] Improve foreach() documentation to avoid confusion betwe…
Mar 11, 2015
2d87a41
SPARK-3642. Document the nuances of shared variables.
sryza Mar 11, 2015
55c4831
SPARK-6245 [SQL] jsonRDD() of empty RDD results in exception
srowen Mar 11, 2015
51a79a7
[SPARK-6274][Streaming][Examples] Added examples streaming + sql exam…
tdas Mar 11, 2015
cd3b68d
[SPARK-6128][Streaming][Documentation] Updates to Spark Streaming Pro…
tdas Mar 12, 2015
e921a66
BUILD: Adding more known contributor names
pwendell Mar 12, 2015
25b71d8
[SPARK-6296] [SQL] Added equals to Column
vlyubin Mar 12, 2015
712679a
[SPARK-6294] fix hang when call take() in JVM on PythonRDD
Mar 12, 2015
0cba802
[SPARK-5814][MLLIB][GRAPHX] Remove JBLAS from runtime
mengxr Mar 12, 2015
4e47d54
[docs] [SPARK-6306] Readme points to dead link
Mar 12, 2015
304366c
[SPARK-6275][Documentation]Miss toDF() function in docs/sql-programmi…
zzcclp Mar 12, 2015
8f1bc79
[build] [hotfix] Fix make-distribution.sh for Scala 2.11.
Mar 12, 2015
fb4787c
[SPARK-6268][MLlib] KMeans parameter getter methods
hhbyyh Mar 12, 2015
a4b2716
[SPARK-4588] ML Attributes
mengxr Mar 12, 2015
17c309c
[mllib] [python] Add LassoModel to __all__ in regression.py
jkbradley Mar 12, 2015
3980ebd
HOTFIX: Changes to release script.
pwendell Mar 13, 2015
0af9ea7
[SPARK-5845][Shuffle] Time to cleanup spilled shuffle files not inclu…
Mar 13, 2015
69ff8e8
[SPARK-5310] [SQL] [DOC] Parquet section for the SQL programming guide
liancheng Mar 13, 2015
9048e81
[SPARK-6197][CORE] handle json exception when hisotry file not finish…
liyezhang556520 Mar 13, 2015
ea3d2ee
[CORE][minor] remove unnecessary ClassTag in `DAGScheduler`
cloud-fan Mar 13, 2015
dc4abd4
[SPARK-6252] [mllib] Added getLambda to Scala NaiveBayes
jkbradley Mar 13, 2015
7f13434
[SPARK-6278][MLLIB] Mention the change of objective in linear regression
mengxr Mar 13, 2015
b943f5d
[SPARK-4600][GraphX]: org.apache.spark.graphx.VertexRDD.diff does not…
Mar 13, 2015
cdc34ed
[SPARK-6285] [SQL] Removes unused ParquetTestData and duplicated Test…
liancheng Mar 13, 2015
e360d5e
[SPARK-6317][SQL]Fixed HIVE console startup issue
vinodkc Mar 13, 2015
b38e073
[SPARK-6210] [SQL] use prettyString as column name in agg()
Mar 14, 2015
ee15404
[SQL]Delete some dupliate code in HiveThriftServer2
ArcherShao Mar 14, 2015
5be6b0e
[SPARK-6195] [SQL] Adds in-memory column type for fixed-precision dec…
liancheng Mar 14, 2015
127268b
[SPARK-6329][Docs]: Minor doc changes for Mesos and TOC
Mar 14, 2015
c49d156
[SPARK-5790][GraphX]: VertexRDD's won't zip properly for `diff` capab…
Mar 14, 2015
62ede53
[SPARK-6285][SQL]Remove ParquetTestData in SparkBuild.scala and in RE…
OopsOutOfMemory Mar 15, 2015
aa6536f
[SPARK-3619] Part 2. Upgrade to Mesos 0.21 to work around MESOS-1688
jongyoul Mar 15, 2015
45f4c66
[SPARK-5922][GraphX]: Add diff(other: RDD[VertexId, VD]) in VertexRDD
Mar 16, 2015
00e730b
[SPARK-6300][Spark Core] sc.addFile(path) does not support the relati…
pzzs Mar 16, 2015
12a345a
[SPARK-2087] [SQL] Multiple thriftserver sessions with single HiveCon…
chenghao-intel Mar 16, 2015
d19efed
[SPARK-6330] Fix filesystem bug in newParquet relation
vlyubin Mar 16, 2015
f149b8b
[SPARK-6077] Remove streaming tab while stopping StreamingContext
zhichao-li Mar 16, 2015
e3f315a
[SPARK-6327] [PySpark] fix launch spark-submit from python
Mar 16, 2015
9667b9f
[SPARK-5712] [SQL] fix comment with semicolon at end
adrian-wang Mar 17, 2015
f0edeae
[SPARK-6299][CORE] ClassNotFoundException in standalone mode when run…
swkimme Mar 17, 2015
6870722
[SQL][docs][minor] Fixed sample code in SQLContext scaladoc
lomigmegard Mar 17, 2015
b3e6eca
[SPARK-6357][GraphX] Add unapply in EdgeContext
maropu Mar 17, 2015
b2d8c02
SPARK-6044 [CORE] RDD.aggregate() should not use the closure serializ…
srowen Mar 17, 2015
25f3580
[SPARK-4011] tighten the visibility of the members in Master/Worker c…
CodingCat Mar 17, 2015
005d1c5
[SPARK-6269] [CORE] Use ScalaRunTime's array methods instead of java.…
mccheah Mar 17, 2015
e26db9b
[docs] [SPARK-4820] Spark build encounters "File name too long" on so…
Mar 17, 2015
c928796
[SPARK-6331] Load new master URL if present when recovering streaming…
tdas Mar 17, 2015
e9f22c6
[SPARK-6365] jetty-security also needed for SPARK_PREPEND_CLASSES to …
squito Mar 17, 2015
0f673c2
[SPARK-3266] Use intermediate abstract classes to fix type erasure is…
JoshRosen Mar 17, 2015
4cca391
[SPARK-6313] Add config option to disable file locks/fetchFile cache …
nemccarthy Mar 17, 2015
d9f3e01
[SPARK-6336] LBFGS should document what convergenceTol means
Lewuathe Mar 17, 2015
c94d062
[SPARK-6226][MLLIB] add save/load in PySpark's KMeansModel
mengxr Mar 17, 2015
4633a87
[SPARK-6330] [SQL] Add a test case for SPARK-6330
Mar 18, 2015
dc9c919
[SPARK-6366][SQL] In Python API, the default save mode for save and s…
yhuai Mar 18, 2015
a012e08
[SPARK-6383][SQL]Fixed compiler and errors in Dataframe examples
Mar 18, 2015
5c80643
[SPARK-5908][SQL] Resolve UdtfsAlias when only single Alias is used
viirya Mar 18, 2015
78cb08a
[SPARK-5404] [SQL] Update the default statistic number
chenghao-intel Mar 18, 2015
a6ee2f7
[SPARK-5651][SQL] Add input64 in blacklist and add test suit for crea…
watermen Mar 18, 2015
3579003
[SPARK-6247][SQL] Fix resolution of ambiguous joins caused by new ali…
marmbrus Mar 18, 2015
6205a25
[SPARK-6372] [core] Propagate --conf to child processes.
Mar 18, 2015
e09c852
SPARK-6389 YARN app diagnostics report doesn't report NPEs
steveloughran Mar 18, 2015
9d112a9
[SPARK-6286][minor] Handle missing Mesos case TASK_ERROR.
dragos Mar 18, 2015
981fbaf
[SPARK-6325] [core,yarn] Do not change target executor count when kil…
Mar 18, 2015
a95ee24
[SPARK-6374] [MLlib] add get for GeneralizedLinearAlgo
hhbyyh Mar 18, 2015
3db1387
SPARK-6085 Part. 2 Increase default value for memory overhead
jongyoul Mar 19, 2015
540b2a4
[SPARK-6394][Core] cleanup BlockManager companion object and improve …
cloud-fan Mar 19, 2015
645cf3f
[SPARK-6222][Streaming] Dont delete checkpoint data when doing pre-ba…
tdas Mar 19, 2015
2c3f83c
[SPARK-4012] stop SparkContext when the exception is thrown from an i…
CodingCat Mar 19, 2015
797f8a0
[SPARK-6402][DOC] - Remove some refererences to shark in docs and ec2
Mar 19, 2015
3c4e486
[SPARK-5843] [API] Allowing map-side combine to be specified in Java.
mccheah Mar 19, 2015
dda4ded
[SPARK-6291] [MLLIB] GLM toString & toDebugString
yanboliang Mar 19, 2015
8cb23a1
[SPARK-5313][Project Infra]: Create simple framework for highlighting…
Mar 19, 2015
3b5aaa6
[Core][minor] remove unused `visitedStages` in `DAGScheduler.stageDep…
cloud-fan Mar 19, 2015
f17d43b
[SPARK-6219] [Build] Check that Python code compiles
nchammas Mar 19, 2015
0745a30
Tighten up field/method visibility in Executor and made some code mor…
rxin Mar 20, 2015
116c553
[SPARK-6286][Mesos][minor] Handle missing Mesos case TASK_ERROR
jongyoul Mar 20, 2015
d08e3eb
SPARK-5134 [BUILD] Bump default Hadoop version to 2+
srowen Mar 20, 2015
6f80c3e
SPARK-6338 [CORE] Use standard temp dir mechanisms in tests to avoid…
srowen Mar 20, 2015
db4d317
[SPARK-6428][MLlib] Added explicit type for public methods and implem…
rxin Mar 20, 2015
28bcb9e
[SPARK-6370][core] Documentation: Improve all 3 docs for RDD.sample
mbonaci Mar 20, 2015
385b2ff
[SPARK-6426][Doc]User could also point the yarn cluster config direct…
WangTaoTheTonic Mar 20, 2015
a745645
[SPARK-6371] [build] Update version to 1.4.0-SNAPSHOT.
Mar 20, 2015
48866f7
[SPARK-6095] [MLLIB] Support model save/load in Python's linear models
yanboliang Mar 20, 2015
5e6ad24
[MLlib] SPARK-5954: Top by key
coderxiang Mar 20, 2015
25636d9
[Spark 6096][MLlib] Add Naive Bayes load save methods in Python
yinxusen Mar 20, 2015
6b36470
[SPARK-5955][MLLIB] add checkpointInterval to ALS
mengxr Mar 20, 2015
49a01c7
[SPARK-6423][Mesos] MemoryUtils should use memoryOverhead if it's set
jongyoul Mar 20, 2015
11e0259
[SPARK-6309] [SQL] [MLlib] Implement MatrixUDT
MechCoder Mar 20, 2015
257cde7
[SPARK-6421][MLLIB] _regression_train_wrapper does not test initialWe…
Lewuathe Mar 20, 2015
a95043b
[SPARK-6428][SQL] Added explicit type for all public methods in sql/core
rxin Mar 20, 2015
25e271d
[SPARK-6025] [MLlib] Add helper method evaluateEachIteration to extra…
MechCoder Mar 21, 2015
bc37c97
[SPARK-5821] [SQL] ParquetRelation2 CTAS should check if delete is su…
yanboliang Mar 21, 2015
937c1e5
[SPARK-6315] [SQL] Also tries the case class string parser while read…
liancheng Mar 21, 2015
e5d2c37
[SPARK-5821] [SQL] JSON CTAS command should throw error message when …
yanboliang Mar 21, 2015
52dd4b2
[SPARK-5320][SQL]Add statistics method at NoRelation (override super).
x1- Mar 21, 2015
ee569a0
[SPARK-5680][SQL] Sum function on all null values, should return zero
gvramana Mar 21, 2015
94a102a
[SPARK-6250][SPARK-6146][SPARK-5911][SQL] Types are now reserved word…
yhuai Mar 21, 2015
b6090f9
[SPARK-6428][SQL] Added explicit type for all public methods for Hive…
rxin Mar 21, 2015
9b1e1f2
[SPARK-6408] [SQL] Fix JDBCRDD filtering string literals
ypcat Mar 22, 2015
b9fe504
[SPARK-6448] Make history server log parse exceptions
ryan-williams Mar 22, 2015
ab4f516
[SPARK-6455] [docs] Correct some mistakes and typos
Mar 22, 2015
adb2ff7
[SPARK-6453][Mesos] Some Mesos*Suite have a different package with th…
jongyoul Mar 22, 2015
6ef4863
SPARK-6454 [DOCS] Fix links to pyspark api
kamilsmuga Mar 22, 2015
a41b9c6
[SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1.
calvinjia Mar 22, 2015
7a0da47
[HOTFIX] Build break due to https://github.com/apache/spark/pull/5128
rxin Mar 22, 2015
2bf40c5
[SPARK-6337][Documentation, SQL]Spark 1.3 doc fixes
vinodkc Mar 22, 2015
4659468
[SPARK-4985] [SQL] parquet support for date type
adrian-wang Mar 23, 2015
e566fe5
[SPARK-6397][SQL] Check the missingInput simply
watermen Mar 23, 2015
bf044de
Revert "[SPARK-6397][SQL] Check the missingInput simply"
liancheng Mar 23, 2015
9f3273b
[SPARK-6397][SQL] Check the missingInput simply
watermen Mar 23, 2015
474d132
[SPARK-6308] [MLlib] [Sql] Override TypeName in VectorUDT and MatrixUDT
MechCoder Mar 23, 2015
6cd7058
Revert "[SPARK-6122][Core] Upgrade Tachyon client version to 0.6.1."
pwendell Mar 23, 2015
bfd3ee9
[SPARK-6124] Support jdbc connection properties in OPTIONS part of th…
vlyubin Mar 24, 2015
4ce2782
[SPARK-6428] Added explicit types for all public methods in core.
rxin Mar 24, 2015
1afcf77
[SPARK-6452] [SQL] Checks for missing attributes and unresolved opera…
liancheng Mar 24, 2015
37fac1d
[SPARK-6477][Build]: Run MIMA tests before the Spark test suite
Mar 24, 2015
c12312f
Update the command to use IPython notebook
yuecong Mar 24, 2015
b293afc
[SPARK-6473] [core] Do not try to figure out Scala version if not nee…
Mar 24, 2015
85cf063
[SPARK-5559] [Streaming] [Test] Remove oppotunity we met flakiness wh…
sarutak Mar 24, 2015
08d4528
[ML][docs][minor] Define LabeledDocument/Document classes in CV example
petro-rudenko Mar 24, 2015
a1d1529
[SPARK-6475][SQL] recognize array types when infer data types from Ja…
mengxr Mar 24, 2015
6bdddb6
[SPARK-6361][SQL] support adding a column with metadata in DF
mengxr Mar 24, 2015
32efadd
[SPARK-6459][SQL] Warn when constructing trivially true equals predicate
marmbrus Mar 24, 2015
26c6ce3
[SPARK-6437][SQL] Use completion iterator to close external sorter
marmbrus Mar 24, 2015
3fa3d12
[SPARK-6054][SQL] Fix transformations of TreeNodes that hold StructTypes
marmbrus Mar 24, 2015
046c1e2
[SPARK-6375][SQL] Fix formatting of error messages.
marmbrus Mar 24, 2015
cbeaf9e
[SPARK-6376][SQL] Avoid eliminating subqueries until optimization
marmbrus Mar 24, 2015
a8f51b8
[SPARK-6458][SQL] Better error messages for invalid data sources
marmbrus Mar 24, 2015
7215aa7
[SPARK-6209] Clean up connections in ExecutorClassLoader after failin…
JoshRosen Mar 24, 2015
7334801
[SPARK-6428][SQL] Added explicit types for all public methods in cata…
rxin Mar 24, 2015
6948ab6
[SPARK-6088] Correct how tasks that get remote results are shown in UI.
kayousterhout Mar 24, 2015
d8ccf65
[SPARK-3570] Include time to open files in shuffle write time.
kayousterhout Mar 24, 2015
f7c3668
Revert "[SPARK-5771][UI][hotfix] Change Requested Cores into * if def…
Mar 24, 2015
dd907d1
Revert "[SPARK-5771] Number of Cores in Completed Applications of Sta…
Mar 24, 2015
05c2214
[SPARK-6469] Improving documentation on YARN local directories usage
Mar 25, 2015
6930e96
[SPARK-6512] add contains to OpenHashMap
mengxr Mar 25, 2015
9459865
[SPARK-6428][Streaming] Added explicit types for all public methods.
rxin Mar 25, 2015
c14ddd9
[SPARK-6515] update OpenHashSet impl
mengxr Mar 25, 2015
c5cc414
[DOCUMENTATION]Fixed Missing Type Import in Documentation
bllchmbrs Mar 25, 2015
64262ed
[SPARK-6483][SQL]Improve ScalaUdf called performance.
zzcclp Mar 25, 2015
10c7860
[SPARK-6496] [MLLIB] GeneralizedLinearAlgorithm.run(input, initialWei…
yanboliang Mar 25, 2015
982952f
[ML][FEATURE] SPARK-5566: RegEx Tokenizer
Mar 25, 2015
968408b
[SPARK-6409][SQL] It is not necessary that avoid old inteface of hive…
pzzs Mar 25, 2015
883b7e9
[SPARK-6076][Block Manager] Fix a potential OOM issue when StorageLev…
zsxwing Mar 25, 2015
acef51d
[SPARK-6537] UIWorkloadGenerator: The main thread should not stop Spa…
sarutak Mar 25, 2015
c1b74df
[SPARK-5771] Master UI inconsistently displays application cores
Mar 25, 2015
4353373
[SPARK-6256] [MLlib] MLlib Python API parity check for regression
yanboliang Mar 25, 2015
4fc4d03
[SPARK-5987] [MLlib] Save/load for GaussianMixtureModels
MechCoder Mar 25, 2015
d44a336
[SPARK-6079] Use index to speed up StatusTracker.getJobIdsForGroup()
JoshRosen Mar 26, 2015
8c3b005
[SPARK-6450] [SQL] Fixes metastore Parquet table conversion
liancheng Mar 26, 2015
e6d1406
[SPARK-5498][SQL]fix query exception when partition schema does not m…
jeanlyn Mar 26, 2015
73d5775
[SPARK-6326][SQL] Improve castStruct to be faster
viirya Mar 26, 2015
328daf6
[SPARK-6271][SQL] Sort these tokens in alphabetic order to avoid furt…
pzzs Mar 26, 2015
5ab6e9f
[SPARK-6202] [SQL] enable variable substitution on test framework
adrian-wang Mar 26, 2015
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,7 @@
*.iml
*.iws
*.pyc
*.pyo
.idea/
.idea_modules/
build/*.jar
Expand Down
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -85,7 +85,7 @@ storage systems. Because the protocols have changed in different versions of
Hadoop, you must build Spark against the same version that your cluster runs.

Please refer to the build documentation at
["Specifying the Hadoop Version"](http://spark.apache.org/docs/latest/building-with-maven.html#specifying-the-hadoop-version)
["Specifying the Hadoop Version"](http://spark.apache.org/docs/latest/building-spark.html#specifying-the-hadoop-version)
for detailed guidance on building for a particular distribution of Hadoop, including
building for particular Hive and Hive Thriftserver distributions. See also
["Third Party Hadoop Distributions"](http://spark.apache.org/docs/latest/hadoop-third-party-distributions.html)
Expand Down
12 changes: 1 addition & 11 deletions assembly/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@
<parent>
<groupId>org.apache.spark</groupId>
<artifactId>spark-parent_2.10</artifactId>
<version>1.3.0-SNAPSHOT</version>
<version>1.4.0-SNAPSHOT</version>
<relativePath>../pom.xml</relativePath>
</parent>

Expand Down Expand Up @@ -114,16 +114,6 @@
<exclude>META-INF/*.RSA</exclude>
</excludes>
</filter>
<filter>
<!-- Exclude libgfortran, libgcc for license issues -->
<artifact>org.jblas:jblas</artifact>
<excludes>
<!-- Linux amd64 is OK; not statically linked -->
<exclude>lib/static/Linux/i386/**</exclude>
<exclude>lib/static/Mac OS X/**</exclude>
<exclude>lib/static/Windows/**</exclude>
</excludes>
</filter>
</filters>
</configuration>
<executions>
Expand Down
2 changes: 1 addition & 1 deletion bagel/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@
<parent>
<groupId>org.apache.spark</groupId>
<artifactId>spark-parent_2.10</artifactId>
<version>1.3.0-SNAPSHOT</version>
<version>1.4.0-SNAPSHOT</version>
<relativePath>../pom.xml</relativePath>
</parent>

Expand Down
124 changes: 0 additions & 124 deletions bin/compute-classpath.cmd

This file was deleted.

161 changes: 0 additions & 161 deletions bin/compute-classpath.sh

This file was deleted.

8 changes: 4 additions & 4 deletions bin/load-spark-env.sh
Original file line number Diff line number Diff line change
Expand Up @@ -41,9 +41,9 @@ fi

if [ -z "$SPARK_SCALA_VERSION" ]; then

ASSEMBLY_DIR2="$FWDIR/assembly/target/scala-2.11"
ASSEMBLY_DIR1="$FWDIR/assembly/target/scala-2.10"
ASSEMBLY_DIR2="$SPARK_HOME/assembly/target/scala-2.11"
ASSEMBLY_DIR1="$SPARK_HOME/assembly/target/scala-2.10"

if [[ -d "$ASSEMBLY_DIR2" && -d "$ASSEMBLY_DIR1" ]]; then
echo -e "Presence of build for both scala versions(SCALA 2.10 and SCALA 2.11) detected." 1>&2
echo -e 'Either clean one of them or, export SPARK_SCALA_VERSION=2.11 in spark-env.sh.' 1>&2
Expand All @@ -54,5 +54,5 @@ if [ -z "$SPARK_SCALA_VERSION" ]; then
export SPARK_SCALA_VERSION="2.11"
else
export SPARK_SCALA_VERSION="2.10"
fi
fi
fi
Loading