Releases · pganalyze/collector
v0.45.0
- Log Insights: Filter out `log_statement=all` and `log_duration=on` log lines
  - This replaces the previous behaviour that prevented all log collection for
    servers that had either `log_statement=all` or `log_duration=on` enabled.
  - With the new logic, we continue ignoring these high-frequency events
    (which would cause downstream problems), but accept all other log events,
    including threshold-based auto_explain events.
- Track extensions that are installed on each database
  - This is helpful to ensure that the necessary schema definitions are
    loaded by pganalyze, e.g. for use by the Index Advisor.
  - Ignore objects that are provided by extensions, as determined by `pg_depend`
    (e.g. function definitions, etc)
- Add support for Google AlloyDB for PostgreSQL
  - This adds new options to specify the AlloyDB cluster ID and instance ID
  - Special cases the log parsing to support AlloyDB's `[filename:line]` prefix
  - Supports AlloyDB's modified autovacuum log output
- Add explicit support for Aiven Postgres databases
  - Support was previously available via the self-managed instructions, but
    this adds explicit support and improved setup instructions
  - Existing Aiven servers that were detected as self-managed will be
    automatically updated to be recognized as Aiven servers
- Self-managed servers
  - Support disk statistics for software RAID devices
    - These statistics are summarized across all component disk devices and
      then tracked for the parent software RAID device as one. Note that this
      is only done in case these statistics are not yet set (which is the case
      for the typical Linux software RAID setup).
  - Allow using `pg_read_file` to read log files (instead of log tail / syslog)
    - This relies on the built-in Postgres function `pg_read_file` to read log
      files and return the log data over the Postgres connection.
    - This requires superuser (either directly or through a helper) and thus
      does not work on managed database providers, with the exception of
      Crunchy Bridge, for which this is already the mechanism to fetch logs.
    - Additionally, this carries higher overhead than directly tailing log
      files, or using syslog, and thus should only be used when necessary.
    - Set `db_log_pg_read_file = 1` / `LOG_PG_READ_FILE=1` to enable the logic
      (see the sketch below)
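  For a self-managed server, enabling this could look roughly like the following in the collector's INI-style configuration file (a minimal sketch: the section name and connection settings are illustrative; the `LOG_PG_READ_FILE=1` form is the environment-variable equivalent, e.g. for Docker-based setups):

  ```ini
  # /etc/pganalyze-collector.conf (section name and connection details illustrative)
  [my_server]
  db_host = 127.0.0.1
  db_name = mydb
  db_username = pganalyze_monitoring
  # Fetch logs via the built-in pg_read_file function instead of tailing files
  # (requires superuser, directly or through a helper)
  db_log_pg_read_file = 1
  ```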
- Crunchy Bridge
  - Fix collection of system metrics
- Heroku Postgres
  - Fix blank log line parsing
- Add `--test-section` parameter to set a specific config section to test
  (see the usage sketch below)
- Fully qualify constraint definitions, to support non-standard schemas
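  A minimal usage sketch for the new `--test-section` parameter (the config section name `my_server` is illustrative):

  ```sh
  # Run a collector test against a single config section
  pganalyze-collector --test --test-section=my_server
  ```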
- Add support for log_line_prefix `%m [%p] %q%u@%d` and `%t [%p] %q%u@%d %h`
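  For reference, one of the newly supported prefixes would be set in `postgresql.conf` like this (a sketch; check whether your prefix needs a trailing space to match your existing log output):

  ```conf
  # postgresql.conf
  log_line_prefix = '%m [%p] %q%u@%d '
  ```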
v0.44.0
- Add optional normalization of sensitive fields in EXPLAIN plans
  - Introduces new "filter_query_sample = normalize" setting that normalizes
    expression fields in the EXPLAIN plan ("Filter", "Index Cond", etc) using
    the pg_query normalization logic. Unknown EXPLAIN fields are discarded when
    this option is active.
  - Turning on this setting will also cause all query samples (whether they
    have an EXPLAIN attached or not) to have their query text normalized
    and their parameters marked as `<removed>`.
  - This setting is recommended when EXPLAIN plans may contain sensitive
    data that should not be stored. Please verify that the logic works
    as expected with your workload and log output.
  - In order to mask EXPLAIN output in the actual log stream as well (not just
    the query samples / EXPLAIN plans), make sure to use a `filter_log_secret`
    setting that includes the `statement_text` value (see the sketch below)
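  A sketch of how the two settings might be combined in a server section of the collector configuration (the section name is illustrative, and the exact `filter_log_secret` value list should be adapted from the defaults documented for your collector version):

  ```ini
  [my_server]
  # Normalize expression fields in EXPLAIN plans and query samples
  filter_query_sample = normalize
  # Also mask statement text (and other secrets) in the log stream itself
  filter_log_secret = credential, parsing_error, statement_text, unidentified
  ```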
- Be more accepting with outdated pg_stat_statements versions
  - With this change, it's no longer required to run
    "ALTER EXTENSION pg_stat_statements UPDATE" in order to use
    the collector after a Postgres upgrade
  - The collector will output an info message in case an outdated
    pg_stat_statements version is in use
- Allow pg_stat_statements to be installed in schemas other than "public"
  - This is automatically detected based on information in `pg_extension`
    and does not require any extra configuration when using a special schema
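  For example, an installation like the following is now picked up automatically (the schema name `monitoring` is illustrative, and the schema must already exist):

  ```sql
  -- Install pg_stat_statements into a non-default schema
  CREATE EXTENSION pg_stat_statements SCHEMA monitoring;
  ```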
- Log Insights
  - Remove unnecessary "duration_ms" and "unparsed_explain_text" metadata
    fields; they are already contained within the query sample data
  - Always mark STATEMENT/QUERY log lines as "statement_text" log secret,
    instead of "unidentified" log secret in some cases
- Amazon RDS / Amazon Aurora
  - Fix rare bug with duplicate pg_settings values on Aurora Postgres
  - Add RDS instance role hint when NoCredentialProviders error is hit
- Heroku Postgres
  - Add support for new log_line_prefix
  - Log processing: Avoid repeating the same line over and over again
  - Fix log handling when consuming logs for multiple databases
- Google Cloud SQL
  - Re-enable log stitching for messages - whilst the GCP release notes mention
    that this is no longer a problem as of Sept 2021, log events can still be
    split up into multiple messages if they exceed a threshold around 1000-2000
    lines, or ~100kb
- Custom types: Correctly track custom type reference for array types
- Improve the "too many tables" error message to clarify possible solutions
- Fix bug related to new structured JSON logs feature (see prior release)
- Update pg_query_go to v2.1.2
  - Fixes memory leak in pg_query_fingerprint error handling
  - Fixes parsing of some operators with the ? character (ltree / promscale extensions)
v0.43.1
- Add option for emitting collector logs as structured JSON logs (@jschaf)
  - Example output:
    `{"severity":"INFO","message":"Running collector test with pganalyze-collector ...","time":"2022-04-19T12:31:05.100489-07:00"}`
  - Enable this option by passing the "--json-logs" option to the collector binary
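  A minimal usage sketch (shown here combined with a collector test run, matching the example output above):

  ```sh
  # Emit collector logs as structured JSON during a test run
  pganalyze-collector --test --json-logs
  ```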
- Log Insights: Add support for Postgres 14 autovacuum and autoanalyze log events
- Column stats helper: Indicate which database is missing the helper in error message
- Azure Database for PostgreSQL
  - Add log monitoring support for Flexible Server deployment option
- Heroku Postgres
  - Fix environment parsing to support parsing of equals signs in variables
  - Log test: Don't count Heroku Postgres free tier as hard failure (emit warning instead)
v0.43.0
- Add integration for Crunchy Bridge provider
- Check citus.shard_replication_factor before querying citus_table_size
  - This fixes support for citus.shard_replication_factor > 1
- Filter out vacuum records we cannot match to a table name
  - This can occur when a manual vacuum is run in a database other than the
    primary database that is being monitored, previously leading to
    processing errors in the backend
- Docker image: Add tzdata package
  - This is required to allow timezone parsing during log line handling
v0.42.2
- Fix cleanup of temporary files used when processing logs
  - Previous collectors may have left temp files in your system's temp directory
  - To manually clean up stray temp files (see the sketch below):
    1. Shut down the collector
    2. Install the new package
    3. Delete any files owned by the user running the collector (pganalyze by default) in the temp directory
    4. Start the collector
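  A sketch of these steps on a typical Linux install (assumes the `pganalyze-collector` systemd service and the default `pganalyze` user; adjust the temp directory and user name for your system):

  ```sh
  sudo systemctl stop pganalyze-collector
  # ... install the new collector package here ...
  # Remove stray temp files owned by the collector user
  sudo find /tmp -maxdepth 1 -user pganalyze -delete
  sudo systemctl start pganalyze-collector
  ```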
v0.42.1
- Log Insights
  - Handle non-UTC/non-local log_timezone values correctly
  - Use consistent 10s interval for streamed logs (instead of shorter intervals)
- Log streams: Support processing primary and secondary lines out of order
  - This resolves issues on GCP when log lines are received out of order
- C22 Auth failed event: Detect additional DETAIL information
- Add regexp match for "permission denied for table" event
- Normalization: Attempt auto-fixing truncated queries
- Heroku: Do not count free memory in total memory
- Config file handling: Handle boolean values more consistently
  - Treat case-insensitive false, off, no, 'f', and 'n' as false, in addition
    to zero (see the sketch below)
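  A minimal Go sketch of the described false-value handling (function name illustrative; not the collector's actual implementation):

  ```go
  package config

  import "strings"

  // isFalseValue reports whether a config value should be treated as false:
  // zero, plus the case-insensitive spellings "false", "off", "no", "f", "n".
  func isFalseValue(value string) bool {
  	switch strings.ToLower(strings.TrimSpace(value)) {
  	case "0", "false", "off", "no", "f", "n":
  		return true
  	default:
  		return false
  	}
  }
  ```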
v0.42.0
- Provide both x86/amd64 and ARM64 packages and Docker image
  - This means you can now run the collector more easily on modern
    ARM-based platforms such as Graviton-based AWS instances
- Bugfix: Write state file before reloading collector
  - This avoids lost statistics when the collector is reloaded mid-cycle
    between full snapshot runs
- Reduce Docker image build time and use slim image for 18x size reduction
  - With thanks to Chris at Kandji for this contribution
v0.41.3
- Log Insights: Add "invalid input syntax for type json" log event
  - This is a variant of the existing invalid input syntax log event,
    with a few additional details.
- Log Insights: Improve handling of "malformed array literal" log event
  - Add support for a double quote inside the array content
  - Mark the content as a table data log secret
  - Add the known DETAIL line "Unexpected array element"
- Fix incorrect index recorded for unknown parent or foreign key tables
  - Previously we would sometimes use 0 in these situations, which could
    cause errors in snapshot processing
- Heroku: Drop Log Insights instructions after log test
  - This was intended to ease onboarding, but it contradicts our in-app
    instructions to wait until real snapshot data comes in to proceed
    with Log Insights setup
- AWS: Cache RDS server IDs and errors to reduce API requests
  - This can help avoid hitting rate limits when monitoring a large number
    of servers
- Fix issue with domains with no constraints
v0.41.2
- Add two additional log_line_prefix settings:
  - `%p-%s-%c-%l-%h-%u-%d-%m`
  - `%m [%p][%b][%v][%x] %q[user=%u,db=%d,app=%a]`
- Change schema collection message to warning during test run
  - This helps discover schema collection issues, e.g. due
    to connection restrictions or other permission problems
- Fix issue with multiple domain constraints
- Upgrade gopsutil to v3.21.10
  - This adds support for M1 Macs, amongst other improvements
    for OS metrics collection
v0.41.1
- Fix schema stats for databases with some custom data types
- Fix tracking of index scans over time