v6.3 EE Release Notes
This document was translated by GPT-4
# 1. Universal Service Map, Application, Networks, Infrastructure, Events
- AutoMetrics
- Supports parsing of FastCGI Protocol
- Supports parsing of HengSheng T3 Protocol
- Supports parsing of event and serialization_id fields for Dubbo Protocol
- Recognizes RST disconnection in SLB detection as normal behavior
- Adds an endpoint field to the Application Aggregate Metrics table
- Supports collection of statement-ids for MySQL calls to associate COM_STMT_EXECUTE with COM_STMT_QUERY and trace SQL statements
- AutoTracing
- Supports parsing of Tingyun's Tracing field
X-Tingyun
- Supports distributed tracing of calls before and after ALB/SLB services
- Supports parsing of TraceID in MySQL statements
- Adds allow_multiple_trace_ids_in_tracing_result configuration item to allow multiple TraceIDs in the distributed tracing result
- Supports enhancing distributed tracing data by calling APM's Trace API
- Supports parsing of Tingyun's Tracing field
- AutoTagging
- Supports automatic association of K8s Annotation and Env tags
- Adds the ability to associate K8s containers automatically by PID to resolve the tagging issue of container resource information in HostNetwork Pod in eBPF data
- SQL
- Adds count operator to metrics to calculate the number of rows in the original data
- Universal Service Map
- Optimizes the service list
- Only shows performance metrics when the service is called
- Supports switching the signal source through signal_source
- Adjusts the default display metrics and shows a bar graph representing the relative size of the metrics
- Adds service topology
- Supports defining an integrated business access topology for both cloud and non-cloud environments
- Optimizes the service list
- Application
- Distributed Tracing
- When adjusting the split ratio of the page by sliding right, do not scale the flame graph
- Supports distributed tracing from the network Span as a starting point
- Supports showing the collection network card information of the network Span in the flame graph
- The quick filter box on the left supports quick filtering and switching of the signal source
- Continuous Profiling
- Supports the linked display of tables and flame graphs
- Optimizes the merging logic of Function Stack in the flame graph
- Compresses the storage of Function Stack in ClickHouse
- Support for eBPF collection of OnCPU Profile data for compiled (Golang/Rust, etc.) and interpreted (Java, etc.) languages
- Distributed Tracing
- Infrastructure
- Adds Host and Container pages, displayed based on Prometheus metrics
- Events
- Splits the Event page into Resource Change, File Read/Write, and Alert events pages
- GUI
- Enhances snapshot search capabilities: supports sorting, condition copying, and other operations
- Search bar supports pasting
Key: value
search conditions - Search bar supports modifying the operator of existing conditions
- Simplifies the search conditions of the page and the conditions carried by the right-sliding page, improving usability
- Comprehensive optimization of page UI
- Optimizes the Quick Filter box on the left: supports search filtering, displays matched data quantity, supports filtering of metric value range, supports switching query region, supports switching data table
- Supports opening multiple right-sliding pages in succession and switching between different right-sliding pages
- The event data displayed in the right-sliding page can switch to view the events of the client or server separately
- The pop-up page for viewing the database field supports displaying the table name
- Supports viewing the search conditions of the current subview
# 2. Views, Metrics, Alerts, Reports
- Views
- Supports dragging to modify the table size
- Optimizes the display details of the bar chart
- Adds a new subview type: Overview Diagram
- Refactoring of the subview editing page
- Optimizes the layout of subview buttons
- Merges Line Chart with the Top Line Chart
- Optimizes the display when template variable names are too long
- Supports dragging to order the template variable list
- Metrics
- Supports inputting PromQL to query data
- Alerts
- Supports creating alert policies directly (no need to create views)
- HTTP Push Endpoint supports using tags to render push content
- Email Push title supports using variables
- Optimizes the display of system alert events
- GUI
- Unifies search condition input boxes of Subview Editing Page, Metric Search Page, and Alert Policy Editing Page
# 3. Resources, Systems
- Resources
- Optimizes display of POD list, VPC list, availability zone list, region list, adds container node quantity, collector status column
- Supports sync of
OpenKruise
CloneSet and Advanced StatefulSet workload types - Support independent configuration of sync intervals for different cloud platforms
- Supports sync of IP address on loopback interface (usually VIP)
- Integration
- Prometheus Integration
- PromQL supports topk, bottomk functions
- PromQL API supports RFC3339 time format
- Supports obtaining HTTP Header in RemoteWrite as an additional Label
- Optimizes the storage performance of RemoteWrite, and optimizes query performance of RemoteRead and PromQL
- OpenTelemetry Integration
- Supports operation without ClickHouse
- Prometheus Integration
- Agent
- Plugin
- New "so plugin" support capability, providing C SDK
- Wasm Demo: Parses error code in HTTP payload, and reassigns response_code and response_exception
- Wasm Demo: Parses Protobuf message in Payload
- The cycle reporting interval of long-lived streams is changed from absolute 0 seconds (at the beginning of every minute) to relative 0 seconds (in relation to the start of the stream every 60 minutes)
- Advantage: Reduces pressure of sending stream logs at absolute 0 seconds, avoid splitting streams that last less than 60 seconds into two stream logs
- Configuration
- Supports configuration of CPU affinity and priority
- Adds kprobe-blacklist configuration item, sets the port number blacklist for eBPF data collection, avoiding collection loops
- Adds l4_log_ignore_tap_sides and l7_log_ignore_tap_sides for setting positions to ignore in flow logs and invocation logs, respectively, to reduce the amount of data collected
- Adaptation
- Supports operation in Tencent TCE's DPDK host machine
- Removes HostNetwork requirement for container collector
- Supports environments where the number of matched network card (tap_interface_regex) results exceeds 255
- Supports running as a Sidecar within a business Pod
- Supports deployment as a blue whale plugin
- Plugin
- Server
- Supports configuration of system alert sending mailbox on the page
- The default value for the auto-deletion time of the anomaly controller and data nodes is set to 30 days
- Unifies storage of alert events in ClickHouse
- Detects changes in resource information, and pushes updates to the Agent in real time
- Supports disabling of K8s cluster auto-discovery function, so it can be used as a synced K8s cluster for public cloud
- Supports specifying (fixing) the Agent that syncs K8s resource information
- Adds deepflow identifier to all deployed dependent HostPath paths
- CLI
- Releases deepflow-ctl for MacOS