ful_SC-21036_size_processor_to_identify_large_logs #660

Merged
merged 6 commits into from
Jan 13, 2025
Changes from 4 commits
Binary file added docs/images/logs/pipelines/size_processor.png
32 changes: 32 additions & 0 deletions docs/logs/identify-log-volume-spikes.md
@@ -0,0 +1,32 @@
title: Identify Large Logs to Cut Logging Costs
description: Use Logs Pipeline's size processor to identify large logs and extract vital information from long text fields to reduce your costs.

By leveraging the **Size Processor**, you can identify large logs, analyze their content, and optimize your log storage. By removing or optimizing unnecessary fields, the overall **log volume decreases**, leading to **lower costs**.

> **Note:** Be sure to check out [Plan Recommendations](../logs/plan-recommendations) to ensure you’re using the most cost-effective plan for your logs in Sematext.

Go to the [Logs Pipelines](https://sematext.com/docs/logs/pipelines/) screen, add a **Size Processor**, and configure it to extract the size of each log message and store it in a numeric field (e.g., `doc_size`).

![Logs Pipelines Size Processor](../images/logs/pipelines/size_processor.png)
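Conceptually, the Size Processor measures the serialized size of each incoming log event and attaches it as a numeric field. Here is a minimal Python sketch of that idea (illustrative only, not Sematext's actual implementation; the exact sizing rules the processor applies may differ):

```python
import json

def add_doc_size(event: dict, target_field: str = "doc_size") -> dict:
    """Attach the serialized size (in bytes) of a log event to the event itself.

    This mirrors what a size processor does conceptually: serialize the
    event and record the byte count in a numeric field.
    """
    size = len(json.dumps(event).encode("utf-8"))
    return {**event, target_field: size}

event = {"message": "query took 2300ms", "severity": "info"}
enriched = add_doc_size(event)
print(enriched["doc_size"])
```

Once each event carries a numeric size field like this, you can chart, aggregate, and filter on it like any other field.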

Within a few minutes, new logs coming into your App will contain the `doc_size` field. Click on this field and select the [Quick Chart](https://sematext.com/docs/logs/logs-table-quick-actions/#quick-chart) action.

![Logs Pipelines Size Processor](../images/logs/pipelines/size_processor_quick_chart.png)

Create a time series chart and use the Max aggregation to show the largest log size in each time interval. Analyze the chart for spikes, which indicate unusually large logs.

![Logs Pipelines Size Processor](../images/logs/pipelines/size_processor_spike.png)
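The Max aggregation keeps only the largest `doc_size` seen in each time bucket, which is what makes spikes stand out. The bucketing can be sketched in Python (a conceptual illustration, not the charting backend):

```python
from collections import defaultdict

def max_per_interval(logs, interval_seconds=60):
    """Group log events into fixed time buckets and keep the max doc_size per bucket."""
    buckets = defaultdict(int)
    for log in logs:
        # floor the timestamp to the start of its interval
        bucket = log["timestamp"] // interval_seconds * interval_seconds
        buckets[bucket] = max(buckets[bucket], log["doc_size"])
    return dict(buckets)

logs = [
    {"timestamp": 0,  "doc_size": 512},
    {"timestamp": 30, "doc_size": 4096},  # unusually large log in the first minute
    {"timestamp": 90, "doc_size": 700},
]
print(max_per_interval(logs))  # {0: 4096, 60: 700}
```

A single oversized event dominates its bucket, so spikes in the Max series point directly at the intervals worth investigating.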

When you see a spike, open [Fields & Filters](https://sematext.com/docs/logs/fields/#fields-structure), find the `doc_size` field, and check the values with the highest log volume.

![Logs Pipelines Size Processor](../images/logs/pipelines/size_processor_f&f.png)

Filter for the logs with large sizes and identify the field that contains long text or a large number of fields.

![Logs Pipelines Size Processor](../images/logs/pipelines/size_processor_query_large_docs.png)

Use the [Field Extractor](https://sematext.com/docs/logs/field-extractor-processor/) in [Logs Pipelines](https://sematext.com/docs/logs/pipelines/) to extract the important part of the field's value into a new field, and then use the **Drop Field Processor** to completely remove the original field from your logs.
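The extract-then-drop idea can be sketched in Python (a hypothetical illustration of the concept; the actual Field Extractor and Drop Field processors are configured in the Logs Pipelines UI, and the field names and pattern below are made up):

```python
import re

def extract_and_drop(event, source_field, pattern, new_field):
    """Extract a value from a long field into a new field, then drop the original."""
    match = re.search(pattern, event.get(source_field, ""))
    if match:
        event[new_field] = match.group(1)   # Field Extractor step
        del event[source_field]             # Drop Field Processor step
    return event

event = {"message": "SELECT * FROM orders WHERE status = 'open' took 2300ms", "host": "db-1"}
result = extract_and_drop(event, "message", r"took (\d+)ms", "query_time_ms")
print(result)  # {'host': 'db-1', 'query_time_ms': '2300'}
```

The event keeps the one piece of information that matters while shedding the bulky text that was driving up its size.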

Alternatively, if the large log volume is caused by a large number of unnecessary fields, use the **Drop Field Processor** to remove those fields entirely from your logs.
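When many small, unneeded fields are the culprit, the drop step alone is enough. A minimal sketch of the idea (field names are hypothetical):

```python
def drop_fields(event, fields_to_drop):
    """Remove unnecessary fields from a log event to shrink its stored size."""
    drop = set(fields_to_drop)
    return {k: v for k, v in event.items() if k not in drop}

event = {"message": "ok", "debug_trace": "...", "internal_id": "x1", "severity": "info"}
slimmed = drop_fields(event, ["debug_trace", "internal_id"])
print(slimmed)  # {'message': 'ok', 'severity': 'info'}
```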

See [How Logs Pipelines Can Reduce Your Log Monitoring Costs](https://sematext.com/docs/logs/reduce-costs-with-pipelines/) to learn more about using the Field Extractor and Drop Field Processor to cut costs effectively.
2 changes: 2 additions & 0 deletions docs/logs/reduce-costs-with-pipelines.md
Expand Up @@ -7,6 +7,8 @@ Log [pipelines](../logs/pipelines) offer a range of [processors](../logs/process

Let's consider a scenario where we have a Logs App that is receiving query response logs from a database, resulting in a lengthy message field.

> **Note:** Check out [Identify Large Logs to Cut Logging Costs](../logs/identify-log-volume-spikes) to learn how to use the **Size Processor** to identify large logs.

![Lengthy Message Field Log](../images/logs/pipelines/log-with-lenghty-message-field.png)

Let's zoom in on that chunky message field:
Expand Down
1 change: 1 addition & 0 deletions mkdocs.yml
Expand Up @@ -71,6 +71,7 @@ pages:
- Sampling Processor: logs/sampling-processor.md
- Handling Sensitive Data: logs/handle-sensitive-data-with-pipelines.md
- Too many fields in the index: logs/too-many-fields-in-index.md
- Identify Large Logs to Cut Logging Costs: logs/identify-log-volume-spikes.md
- Reducing Log Monitoring Costs: logs/reduce-costs-with-pipelines.md
- Plan Recommendations: logs/plan-recommendations.md
- Logs Usage Screen: logs/logs-usage-screen.md
Expand Down