You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
`ShuffleReadMetricsReporter` is an abstraction of [reporters](#implementations) that allow tracking the following **Shuffle Read Metrics** (for each shuffle):
4
+
5
+
Shuffle Read Metric | When used
6
+
-|-
7
+
Corrupt Merged Block Chunks | FIXME
8
+
Fetch Wait Time | FIXME
9
+
Local Blocks Fetched | FIXME
10
+
Local Bytes Read | FIXME
11
+
Merged Local Blocks Fetched | FIXME
12
+
Merged Local Bytes Read | FIXME
13
+
Merged Local Chunks Fetched | FIXME
14
+
Merged Fetch Fallback Count | FIXME
15
+
Merged Remote Blocks Fetched | FIXME
16
+
Merged Remote Chunks Fetched | FIXME
17
+
Merged Remote Bytes Read | FIXME
18
+
Merged Remote Requests Duration | FIXME
19
+
Remote Blocks Fetched | [ShuffleBlockFetcherIterator](../storage/ShuffleBlockFetcherIterator.md#shuffleRemoteMetricsUpdate) for [ShuffleBlockChunkId](../storage/BlockId.md#ShuffleBlockChunkId)s
20
+
Remote Bytes Read | FIXME
21
+
Remote Bytes Read To Disk | FIXME
22
+
Remote Requests Duration | FIXME
23
+
Total Records Read | FIXME
24
+
25
+
`ShuffleReadMetricsReporter` is used to create the following:
Copy file name to clipboardExpand all lines: docs/storage/BlockId.md
+73-14Lines changed: 73 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@
4
4
5
5
## Contract
6
6
7
-
### <spanid="name"><spanid="toString"> Name
7
+
###<spanid="toString"> Name { #name }
8
8
9
9
```scala
10
10
name:String
@@ -24,7 +24,9 @@ Used when:
24
24
??? note "Sealed Abstract Class"
25
25
`BlockId` is a Scala **sealed abstract class** which means that all of the implementations are in the same compilation unit (a single file).
26
26
27
-
### <spanid="BroadcastBlockId"> BroadcastBlockId
27
+
Learn more in the [Scala Language Specification]({{ scala.spec }}/05-classes-and-objects.html#sealed).
28
+
29
+
### BroadcastBlockId { #BroadcastBlockId }
28
30
29
31
`BlockId` for [broadcast variable](../broadcast-variables/index.md) blocks:
30
32
@@ -40,7 +42,15 @@ Used when:
40
42
*`SerializerManager` is requested to [shouldCompress](../serializer/SerializerManager.md#shouldCompress)
41
43
*`AppStatusListener` is requested to [onBlockUpdated](../status/AppStatusListener.md#onBlockUpdated)
42
44
43
-
### <spanid="RDDBlockId"> RDDBlockId
45
+
### CacheId { #CacheId }
46
+
47
+
`BlockId` for...FIXME
48
+
49
+
### PythonStreamBlockId { #PythonStreamBlockId }
50
+
51
+
`BlockId` for...FIXME
52
+
53
+
### RDDBlockId { #RDDBlockId }
44
54
45
55
`BlockId` for [RDD](../rdd/RDD.md) partitions:
46
56
@@ -60,9 +70,22 @@ Used when:
60
70
61
71
[Compressed](../serializer/SerializerManager.md#shouldCompress) when [spark.rdd.compress](../configuration-properties.md#spark.rdd.compress) configuration property is enabled
`BlockId` for shuffle block chunks in [Push-Based Shuffle](../push-based-shuffle.md):
80
+
81
+
*`shuffleId` identifier
82
+
*`shuffleMergeId` identifier
83
+
*`reduceId` identifier
84
+
*`chunkId` identifier
85
+
86
+
Uses `shuffleChunk_[shuffleId]_[shuffleMergeId]_[reduceId]_[chunkId]` pattern for the [name](#name)
87
+
88
+
### ShuffleBlockId { #ShuffleBlockId }
66
89
67
90
`BlockId` for shuffle blocks:
68
91
@@ -83,13 +106,41 @@ Used when:
83
106
84
107
[Compressed](../serializer/SerializerManager.md#shouldCompress) when [spark.shuffle.compress](../configuration-properties.md#spark.shuffle.compress) configuration property is enabled
0 commit comments