Summarize Nearby

Note:

This functionality is currently only supported in Map Viewer Classic (formerly known as Map Viewer). It will be available in a future release of the new Map Viewer.

Summarize Nearby The Summarize Nearby tool finds features that are within a specified distance of features in the input layer using a line distance or travel mode. Statistics are then calculated for the nearby features.

Note:

Summarize Nearby requires the following directions and routing services to run with travel modes:

  • Service Area (Asynchronous)
  • Routing Utilities

For more information, see Configure utility services.

Workflow diagram

Summarize Nearby workflow diagram

Examples

A school board is building a new elementary school and needs to decide between several potential locations. They decide that the school should be in the area with the most children aged 5 to 12. The population of children aged 5 to 12 can be calculated with the Summarize Nearby tool using a one-mile walking distance around each potential location and census blocks that include population data.

Terminology

TermDescription

Geodesic

Refers to a line drawn on a sphere. A geodesic line drawn on the globe represents the curvature of the earth's geoid.

Euclidean distance

A straight-line distance as measured on a flat surface (that is, a Cartesian plane).

Usage notes

The Summarize Nearby tool requires two input layers: a layer from which distances will be measured and a layer to summarize. Both layers may be points, lines, or areas. The first input layer can contain up to 1,000 features.

The nearby distance can be based on a line distance or several different travel modes using time or distance as the measure. Travel modes are only enabled when the first input layer contains point features. You can create multiple output areas by inputting each time or distance separated by a space.

Some travel modes have an option for using traffic conditions. When traffic conditions are used, they may be based on live conditions or typical conditions for a specified day of the week and time. Live traffic conditions can be offset up to 4 hours from the current time. Times for typical conditions can be set by 15-minute intervals for the entire day and night.

Travel modes can be configured by the administrator of your organization. The administrator can also add new travel modes or remove travel modes that are not necessary for your organization. The default distance measure will be in Miles or Kilometers, depending on the Units setting in your profile.

Travel modesDescriptionDefaultsOptions

Walking Time Walking Time

Follows paths and roads that allow pedestrian traffic and finds solutions that optimize travel time. The walking speed is set to 5 kilometers per hour.

Walking speed is set at 5 kilometers (3.1 miles) per hour. The walking speed can be configured by the administrator of your organization.

The default Walking Time is 5 Minutes.

Durations can be in Seconds, Minutes, or Hours. A duration of up to 9 hours can be used as input.

Rural Driving Distance Rural Driving Distance

Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel distance. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars but does not discourage travel on unpaved roads.

The default Rural Driving Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting).

Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 300 miles (482.80 kilometers) can be used as input.

Driving Time Driving Time

Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel time. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars. When you specify a start time, dynamic travel speeds based on traffic are used where traffic data is available.

Verify whether traffic data is available in your region by clicking the See availability link in the tool pane.

The default Driving Time is 5 Minutes.

Use traffic is unchecked by default. The driving speed will be based on historical and live traffic data.

Durations can be in Seconds, Minutes, or Hours. A duration of up to 5 hours can be used as input.

Traffic can be based on live conditions or typical conditions for a specified day of the week and time.

Driving Distance Driving Distance

Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel distance. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars.

The default Driving Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting).

Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 300 miles (482.80 kilometers) can be used as input.

Walking Distance Walking Distance

Follows paths and roads that allow pedestrian traffic and finds solutions that optimize travel distance.

The default Walking Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting).

Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 27 miles (43.45 kilometers) can be used as input.

Rural Driving Time Rural Driving Time

Models the movement of cars and other similar small automobiles, such as pickup trucks, and finds solutions that optimize travel time. Travel obeys one-way roads, avoids illegal turns, and follows other rules that are specific to cars but does not discourage travel on unpaved roads. When you specify a start time, dynamic travel speeds based on traffic are used where it is available.

Verify whether traffic data is available in your region by clicking the See availability link in the tool pane.

The default Rural Driving Time is 5 Minutes.

Use traffic is unchecked by default. The rural driving speed will be based on historical and live traffic data.

Durations can be in Seconds, Minutes, or Hours. A duration of up to 5 hours can be used as input.

Traffic can be based on live conditions or typical conditions for a specified day of the week and time.

Trucking Time Trucking Time

Models basic truck travel by preferring designated truck routes and finds solutions that optimize travel time. Routes must obey one-way roads, avoid illegal turns, and so on.

The trucking speed will be based on either historical average speeds for automobiles or the posted speed limits for trucks, whichever is smaller.

Follows rules applicable to heavy trucks.

The default Trucking Time is 5 Minutes.

Use traffic is unchecked by default. The trucking speed will be based on historical and live traffic data but will not exceed the posted trucking speed.

Durations can be in Seconds, Minutes, or Hours. A duration of up to 5 hours can be used as input.

Traffic can be based on live conditions or typical conditions for a specified day of the week and time.

Trucking Distance Trucking Distance

Models basic truck travel by preferring designated truck routes and finds solutions that optimize travel distance. Routes must obey one-way roads, avoid illegal turns, and so on.

Follows rules applicable to heavy trucks.

The default Trucking Distance is 5 Miles (US Standard setting) or 5 Kilometers (Metric setting).

Distances can be in Miles, Yards, Feet, Kilometers, or Meters. A distance of up to 300 miles (482.80 kilometers) can be used as input.

The Return bounding areas box is checked by default. The result layer includes the area or areas created using the specified distance measurement around the input layer as well as all of the requested summarized data. When unchecked, the result layer is the same as the input layer but includes the summarized data.

A Count of Points, Total Length, or Total Area box appears depending on the type of features to summarize in your layer. The box is checked by default and can only be unchecked if statistics are being calculated.

CalculationInput featuresDefaultOptions

Count of Points

Points

None

None

Total Length

Lines

Miles (US Standard setting) or Kilometers (Metric setting)

  • Miles
  • Feet
  • Kilometers
  • Meters
  • Yards

Total Area

Areas

Square Miles (US Standard setting) or Square Kilometers (Metric setting)

  • Square Miles
  • Square Kilometers
  • Square Meters
  • Hectares
  • Acres

There are five options for statistics that you can calculate on numeric fields in the layer to be summarized: sum, minimum, maximum, average, and standard deviation. Each time a Field and Statistic are entered, a new row is added to the tool pane so that more than one statistic can be calculated at once. You can view the summarized data in the result layer's table or pop-ups.

Optionally, you can select a group by field so that statistics are calculated separately for each unique attribute value. When a group by field is selected, the pop-up for each feature in the output layer contains charts showing each summary count or total and statistic by field value. A summary table listing each feature and statistic by group by field value is also created.

The Add minority, majority and Add percentages check boxes are enabled when a group by field is entered. The minority and majority values are the least and most dominant values, respectively, from the group by field, where dominance is determined using the count of points, total length, or total area of each value. When Add minority, majority is checked, two new fields are added to the result layer. The fields list the values from the group by field that are the minority and majority for each result feature. When Add percentages is checked, a new field is added to the result table listing the percentage of the count of points, total length, or total area within each feature that belongs to each value from the group by field. If Add minority, majority is also checked, two additional fields are added to the result layer listing the percentage of the count of points, total length, or total area that belongs to the minority and majority values for each feature.

If Use current map extent is checked, only the features in the input layer and the layer to be summarized that are visible within the current map extent will be analyzed. If unchecked, all features in both the input layer and the layer to be summarized will be analyzed, even if they are outside the current map extent.

Limitations

  • The first input layer from which distances are measured has a limit of 1,000 features.
  • Travel modes are only enabled when point features are used as the input layer from which distances are measured.
  • Travel times cannot exceed 9 hours (540 minutes) when walking or 5 hours (300 minutes) for all other travel times.
  • Travel distances cannot exceed 27 miles (43.45 kilometers) when walking or 300 miles (482.8 kilometers) for all other travel distances.
  • Lines and areas are summarized using proportions; it is best to summarize absolute data (such as population) rather than relative data (such as average income) when lines or areas are being summarized.
  • An error occurs if the tool takes more than 60 minutes to execute when using travel modes. If this error occurs, try rerunning the analysis with fewer input features.

How Summarize Nearby works

The sections below describe the functionality of the Summarize Nearby tool.

Equations

Average and Std Deviation are calculated using weighted mean and weighted standard deviation for line and area features. None of the statistics for point features are weighted. The following table shows the equations used to calculate standard deviation, weighted mean, and weighted standard deviation:

StatisticEquationVariablesFeatures

Standard Deviation

Standard deviation equation

where:

  • N = Number of observations
  • xi = Observations
  • = Mean

Points

Weighted Mean

Weighted mean equation

where:

  • N = Number of observations
  • xi = Observations
  • wi = Weights

Lines and Areas

Weighted Standard Deviation

Weighted standard deviation equation

where:

  • N = Number of observations
  • xi = Observations
  • wi = Weights
  • w = Weighted mean
  • N' = Number of non-zero weights

Lines and Areas

Note:

Null values are excluded from all statistical calculations. For example, the mean of 10, 5, and a null value is 7.5 ((10+5)/2).

Points

Point layers are summarized using only the point features within the boundary drawn by the Summarize Nearby tool. If Return bounding areas is checked, the results layer is displayed using graduated symbols.

The figure and table below show the statistical calculations of a point layer. In the hypothetical analysis in the figure, a point layer is being summarized using a line distance from an input point. The Classes field was used to calculate the statistics (Sum, Minimum, Maximum, Average, and Std Deviation) for the layer.

Summarizing a point layer
Point layers are summarized using only points located within the specified distance (line or travel mode). In this example, the summarized points include two yellow, one green, and four red points.

StatisticResult

Sum

1 + 1 + 2 + 3 + 3 + 3 + 3 = 16

Minimum

Minimum of :

[1, 1, 2, 3, 3, 3, 3] = 1

Maximum

Maximum of :

[1, 1, 2, 3, 3, 3, 3] = 3

Average

16/7 = 2.29

Std Deviation

Standard deviation of points
= 0.95

A real-life scenario in which you can use this analysis is determining the availability of yoga classes within a two-mile radius of your house. Each point represents a gym or studio that offers yoga classes. The Classes field contains the number of unique classes offered at each point. From the results, you can see that there are seven facilities offering classes within two miles of your house and that the maximum number of unique classes at a single facility is three.

Lines

Line layers are summarized using only the proportions of the line features that are within the boundary drawn by the Summarize Nearby tool. When summarizing lines, use fields with absolute numbers so proportional calculations make logical sense in your analysis. If Return bounding areas is checked, the results layer is displayed using graduated symbols.

The figure and table below show the statistical calculations of a line layer. In the hypothetical analysis in the figure, a line layer is being summarized using a line distance from an input point. The Houses field was used to calculate the statistics (Sum, Minimum, Maximum, Average, and Std Deviation) for the layer. The statistics are calculated using only the proportion of the line that is within the specified distance of the input point.

Summarizing a line layer
Line layers are summarized using only proportions of the lines located within the specified distance (line or travel mode).

StatisticResult

Sum

2 + 1 + 2 + 1 + 1 = 7

Minimum

Minimum of :

[2, 1, 2, 1, 1] = 1

Maximum

Maximum of :

[2, 1, 2, 1, 1] = 2

Average

Weighted mean of lines
= 1.68

Std Deviation

Weighted standard deviation of lines
= 0.52

A real-life scenario in which you can use this analysis is determining how many houses are within five miles of a remote forest fire. Each line segment represents a street or lane with houses built along it. The Houses field contains the number of houses on each segment. From the results, you can see that there are seven houses within five miles of the fire.

Areas

Area layers are summarized using only the proportions of the area features that are within the boundary drawn by the Summarize Nearby tool. When summarizing areas, use fields with absolute numbers so proportional calculations make logical sense in your analysis. If Return bounding areas is checked, the results layer is displayed using graduated colors.

The figure and table below show the statistical calculations of an area layer. In the hypothetical analysis in the figure, an area layer is being summarized using a line distance from an input point. The Tables field was used to calculate the statistics (Sum, Minimum, Maximum, Average, and Std Deviation) for the layer. The statistics are calculated using only the proportion of the area that is within the specified distance of the input point.

Summarizing an area layer
Area layers are summarized using only proportions of the areas located within the specified distance (line or travel mode).

StatisticResult

Sum

1 + 1 + 2 + 3 = 7

Minimum

Minimum of:

[1, 1, 2, 3] = 1

Maximum

Maximum of:

[1, 1, 2, 3] = 3

Average

Weighted mean of areas
= 1.55

Std Deviation

Weighted standard deviation of areas
= 0.89

A real-life scenario in which you can use this analysis is determining how many picnic tables there are within a 100-foot radius of different campsites. Each area represents a picnic park and the Tables field contains the number of picnic tables in each park. From the results, you can see that there are seven tables near this specific campsite and the most picnic tables at a single picnic park is three.

Line distance

The Summarize Nearby tool uses a geodesic method when finding features with a straight-line distance, rather than a Euclidean method. Geodesic lines account for the actual shape of the earth (an ellipsoid, or more properly, a geoid). Distances are calculated between two points on a curved surface (the geoid) as opposed to two points on a flat surface (the Cartesian plane).

Travel modes

Service Areas use Dijkstra's algorithm to traverse a network. The service returns a subset of connected edge features that are within a specified travel distance or travel time. Travel areas are created by inputting the edges into a triangulated irregular network (TIN) data structure. The network distance along the lines serves as the height of the locations inside the TIN. Locations not traversed by the service area are put in with a much larger height value. An area-generation routine is used with this TIN to carve out regions encompassing areas in between the specified break values (travel time or travel distance).

The Live traffic option uses the current time as the departure. The traffic speed is predicted for the trip using live speeds, historical speeds, and current events, such as weather.

When typical conditions for a day of the week and time are being used, the travel speeds are based on historical speeds averaged across 5-minute intervals for the entire week. The selected time corresponds to local time in the time zone that your data is in.

When either traffic condition is used, the Summarize Nearby tool takes into consideration the changing traffic conditions based on elapsed time from departure.

Similar tools

Use Summarize Nearby to calculate statistics on point, line, or area features within close proximity of another input layer. Other tools may be useful in solving similar but slightly different problems.

Map Viewer Classic analysis tools

If you have predefined boundaries in which you want to summarize a point, line, or area layer, use the Summarize Within tool.

If you have predefined boundaries in which you want to summarize a point layer and you want to keep only the areas containing one or more points, use the Aggregate Points tool.

ArcGIS Pro analysis tools