Commit a2a0cf9d, authored 11 years ago by Patrick Wendell

Docs describing Spark monitoring and instrumentation

Parent: c592a3c9
Showing 2 changed files, with 50 additions and 0 deletions:

* docs/_layouts/global.html (+1, −0)
* docs/monitoring.md (+49, −0)
docs/_layouts/global.html (+1, −0)
@@ -97,6 +97,7 @@
     <a href="api.html" class="dropdown-toggle" data-toggle="dropdown">More<b class="caret"></b></a>
     <ul class="dropdown-menu">
         <li><a href="configuration.html">Configuration</a></li>
+        <li><a href="monitoring.html">Monitoring</a></li>
         <li><a href="tuning.html">Tuning Guide</a></li>
         <li><a href="hardware-provisioning.html">Hardware Provisioning</a></li>
         <li><a href="building-with-maven.html">Building Spark with Maven</a></li>
docs/monitoring.md (new file, mode 100644, +49, −0)
---
layout: global
title: Monitoring and Instrumentation
---
There are several ways to monitor the progress of Spark jobs.
# Web Interfaces

When a SparkContext is initialized, it launches a web server (by default at port 3030) which
displays useful information. This includes a list of active and completed scheduler stages,
a summary of RDD blocks and partitions, and environmental information. If multiple SparkContexts
are running on the same host, they will bind to successive ports beginning with 3030 (3031, 3032,
etc.).
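For concreteness, here is a minimal sketch of an application whose UI could be inspected this way. The `spark.SparkContext` package name and the two-argument (master URL, application name) constructor are assumptions based on Spark builds of this era, not something this page specifies:

```scala
import spark.SparkContext  // package name assumed for Spark builds of this era

object UiExample {
  def main(args: Array[String]) {
    // Initializing the SparkContext starts the web UI, by default on port 3030.
    val sc = new SparkContext("local", "UI Example")

    // Run a small job so the stages page has something to display, then
    // browse http://localhost:3030 while the context is still alive.
    sc.parallelize(1 to 1000).map(_ * 2).count()

    sc.stop()
  }
}
```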
Spark's Standalone Mode scheduler also has its own
[web interface](spark-standalone.html#monitoring-and-logging).
# Spark Metrics

Spark has a configurable metrics system based on the
[Coda Hale Metrics Library](http://metrics.codahale.com/).
This allows users to report Spark metrics to a variety of sinks including HTTP, JMX, and CSV
files. The metrics system is configured via a configuration file that Spark expects to be present
at `$SPARK_HOME/conf/metrics.conf`. A custom file location can be specified via the
`spark.metrics.conf` Java system property. Spark's metrics are decoupled into different
_instances_ corresponding to Spark components. Within each instance, you can configure a
set of sinks to which metrics are reported. The following instances are currently supported:
* `master`: The Spark standalone master process.
* `applications`: A component within the master which reports on various applications.
* `worker`: A Spark standalone worker process.
* `executor`: A Spark executor.
* `driver`: The Spark driver process (the process in which your SparkContext is created).
The syntax of the metrics configuration file is defined in an example configuration file,
`$SPARK_HOME/conf/metrics.conf.template`.
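As a hedged sketch of the custom-location mechanism described above: the `spark.metrics.conf` system property has to be set before the metrics system starts, and the file path below is purely illustrative:

```scala
object CustomMetricsConf {
  def main(args: Array[String]) {
    // Illustrative path; by default Spark reads $SPARK_HOME/conf/metrics.conf.
    // Set the property before the SparkContext (and thus the metrics system)
    // starts; the file's property syntax follows conf/metrics.conf.template.
    System.setProperty("spark.metrics.conf", "/etc/spark/my-metrics.conf")

    // ... create the SparkContext and run jobs as usual ...
  }
}
```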
# Advanced Instrumentation
Several external tools can be used to help profile the performance of Spark jobs:
* Cluster-wide monitoring tools, such as [Ganglia](http://ganglia.sourceforge.net/), can provide
insight into overall cluster utilization and resource bottlenecks. For instance, a Ganglia
dashboard can quickly reveal whether a particular workload is disk bound, network bound, or
CPU bound.
* OS profiling tools such as [dstat](http://dag.wieers.com/home-made/dstat/),
[iostat](http://linux.die.net/man/1/iostat), and [iotop](http://linux.die.net/man/1/iotop)
can provide fine-grained profiling on individual nodes.
* JVM utilities such as `jstack` for providing stack traces, `jmap` for creating heap-dumps,
`jstat` for reporting time-series statistics and `jconsole` for visually exploring various JVM
properties are useful for those comfortable with JVM internals.