Cassandra

This Elastic integration collects logs and metrics from cassandra.


Version	1.11.0 (View all)
Compatible Kibana version(s)	8.8.0 or higher
Supported Serverless project types What's this?	Security Observability
Subscription level What's this?	Basic
Level of support What's this?	Elastic

This integration periodically fetches metrics from Cassandra using jolokia agent. It can parse System logs.

Compatibility

This integration has been tested against Cassandra version 3.11.11.

Troubleshooting

If log.flags is shown conflicted under the logs-* data view, then this issue can be solved by reindexing the Logs data stream's indices.

Note:

This document provides details about reindexing.

Logs

Cassandra system logs from cassandra.log files.

An example event for log looks as following:

{
    "@timestamp": "2022-08-01T07:33:01.952Z",
    "agent": {
        "ephemeral_id": "d6102ad8-04fe-46fa-bf67-cc98e3665348",
        "hostname": "docker-fleet-agent",
        "id": "d1a9277c-e5a2-4ee3-a973-18f2b62e3ad8",
        "name": "docker-fleet-agent",
        "type": "filebeat",
        "version": "7.15.0"
    },
    "data_stream": {
        "dataset": "cassandra.log",
        "namespace": "ep",
        "type": "logs"
    },
    "ecs": {
        "version": "8.5.1"
    },
    "elastic_agent": {
        "id": "d1a9277c-e5a2-4ee3-a973-18f2b62e3ad8",
        "snapshot": false,
        "version": "7.15.0"
    },
    "event": {
        "agent_id_status": "verified",
        "category": [
            "database"
        ],
        "dataset": "cassandra.log",
        "ingested": "2022-08-01T07:33:17Z",
        "kind": "event",
        "module": "cassandra",
        "original": "INFO  [main] 2022-08-01 07:33:01,952 YamlConfigurationLoader.java:92 - Configuration location: file:/etc/cassandra/cassandra.yaml",
        "type": "info"
    },
    "input": {
        "type": "log"
    },
    "log": {
        "file": {
            "path": "/tmp/service_logs/cassandra/system.log"
        },
        "level": "INFO",
        "offset": 0,
        "origin": {
            "file": {
                "line": 92,
                "name": "YamlConfigurationLoader.java"
            }
        }
    },
    "message": "Configuration location: file:/etc/cassandra/cassandra.yaml",
    "process": {
        "thread": {
            "name": "main"
        }
    },
    "tags": [
        "forwarded",
        "cassandra-systemlogs"
    ]
}

Exported fields

Field	Description	Type
@timestamp	Event timestamp.	date
cassandra.log.meta	Log meta infos like java stack_trace.	keyword
data_stream.dataset	Data stream dataset.	constant_keyword
data_stream.namespace	Data stream namespace.	constant_keyword
data_stream.type	Data stream type.	constant_keyword
ecs.version	ECS version this event conforms to. `ecs.version` is a required field and must exist in all events. When querying across multiple indices -- which may conform to slightly different ECS versions -- this field lets integrations adjust to the schema version of the events.	keyword
error.message	Error message.	match_only_text
event.category	This is one of four ECS Categorization Fields, and indicates the second level in the ECS category hierarchy. `event.category` represents the "big buckets" of ECS categories. For example, filtering on `event.category:process` yields all events relating to process activity. This field is closely related to `event.type`, which is used as a subcategory. This field is an array. This will allow proper categorization of some events that fall in multiple categories.	keyword
event.ingested	Timestamp when an event arrived in the central data store. This is different from `@timestamp`, which is when the event originally occurred. It's also different from `event.created`, which is meant to capture the first time an agent saw the event. In normal conditions, assuming no tampering, the timestamps should chronologically look like this: `@timestamp` < `event.created` < `event.ingested`.	date
event.kind	This is one of four ECS Categorization Fields, and indicates the highest level in the ECS category hierarchy. `event.kind` gives high-level information about what type of information the event contains, without being specific to the contents of the event. For example, values of this field distinguish alert events from metric events. The value of this field can be used to inform how these kinds of events should be handled. They may warrant different retention, different access control, it may also help understand whether the data coming in at a regular interval or not.	keyword
input.type	Type of Filebeat input.	keyword
log.file.path	Full path to the log file this event came from, including the file name. It should include the drive letter, when appropriate. If the event wasn't read from a log file, do not populate this field.	keyword
log.flags	Flags for the log file.	keyword
log.level	Original log level of the log event. If the source of the event provides a log level or textual severity, this is the one that goes in `log.level`. If your source doesn't specify one, you may put your event transport's severity here (e.g. Syslog severity). Some examples are `warn`, `err`, `i`, `informational`.	keyword
log.offset	Offset of the entry in the log file.	long
log.origin.file.line	The line number of the file containing the source code which originated the log event.	long
log.origin.file.name	The name of the file containing the source code which originated the log event. Note that this field is not meant to capture the log file. The correct field to capture the log file is `log.file.path`.	keyword
message	For log events the message field contains the log message, optimized for viewing in a log viewer. For structured logs without an original message field, other fields can be concatenated to form a human-readable summary of the event. If multiple messages exist, they can be combined into one message.	match_only_text
process.thread.name	Thread name.	keyword
tags	List of keywords used to tag each event.	keyword

Metrics

Cassandra metrics using jolokia agent installed on cassandra.

An example event for metrics looks as following:

{
    "@timestamp": "2022-08-02T07:46:20.906Z",
    "agent": {
        "ephemeral_id": "dd01aaac-f888-4fdb-832d-d05840060d78",
        "hostname": "docker-fleet-agent",
        "id": "f8436de1-7850-497f-905d-b6c9ca3116ca",
        "name": "docker-fleet-agent",
        "type": "metricbeat",
        "version": "7.15.0"
    },
    "cassandra": {
        "metrics": {
            "cache": {
                "key_cache": {
                    "capacity": 104857600,
                    "one_minute_hit_rate": 0.7055988630359871,
                    "requests": {
                        "one_minute_rate": 10.000444146293233
                    }
                },
                "row_cache": {
                    "capacity": 0,
                    "requests": {
                        "one_minute_rate": 0
                    }
                }
            },
            "client": {
                "connected_native_clients": 0
            },
            "client_request": {
                "casread": {
                    "one_minute_rate": 0
                },
                "caswrite": {
                    "one_minute_rate": 0
                },
                "range_slice": {
                    "one_minute_rate": 0,
                    "total_latency": 0
                },
                "read": {
                    "count": 0,
                    "one_minute_rate": 0,
                    "timeouts": 0,
                    "timeoutsms": 0,
                    "total_latency": 0,
                    "unavailables": 0,
                    "unavailablesms": 0
                },
                "write": {
                    "count": 0,
                    "one_minute_rate": 0,
                    "timeouts": 0,
                    "timeoutsms": 0,
                    "total_latency": 0,
                    "unavailables": 0,
                    "unavailablesms": 0
                }
            },
            "column_family": {
                "total_disk_space_used": 72611
            },
            "compaction": {
                "completed": 45,
                "pending": 0
            },
            "dropped_message": {
                "batch_remove": 0,
                "batch_store": 0,
                "counter_mutation": 0,
                "hint": 0,
                "mutation": 0,
                "paged_range": 0,
                "range_slice": 0,
                "read": 0,
                "read_repair": 0,
                "request_response": 0,
                "trace": 0
            },
            "gc": {
                "concurrent_mark_sweep": {
                    "collection_count": 1,
                    "collection_time": 27
                },
                "par_new": {
                    "collection_count": 1,
                    "collection_time": 24
                }
            },
            "memory": {
                "heap_usage": {
                    "committed": 4054777856,
                    "init": 4158652416,
                    "max": 4054777856,
                    "used": 478032264
                },
                "other_usage": {
                    "committed": 62853120,
                    "init": 2555904,
                    "max": -1,
                    "used": 61234528
                }
            },
            "storage": {
                "exceptions": 0,
                "load": 72611,
                "total_hint_in_progress": 0,
                "total_hints": 0
            },
            "system": {
                "cluster": "Test Cluster",
                "data_center": "datacenter1",
                "live_nodes": [
                    "192.168.224.2"
                ],
                "rack": "rack1",
                "version": "3.11.11"
            },
            "table": {
                "all_memtables_heap_size": 4569,
                "all_memtables_off_heap_size": 0,
                "live_disk_space_used": 72611,
                "live_ss_table_count": 11
            },
            "task": {
                "complete": 55,
                "pending": 0,
                "total_commitlog_size": 67108864
            },
            "thread_pools": {
                "counter_mutation_stage": {
                    "request": {
                        "active": 0,
                        "pending": 0
                    }
                },
                "mutation_stage": {
                    "request": {
                        "active": 0,
                        "pending": 0
                    }
                },
                "read_repair_stage": {
                    "request": {
                        "active": 0,
                        "pending": 0
                    }
                },
                "read_stage": {
                    "request": {
                        "active": 0,
                        "pending": 0
                    }
                },
                "request_response_stage": {
                    "request": {
                        "active": 0,
                        "pending": 0
                    }
                }
            }
        }
    },
    "data_stream": {
        "dataset": "cassandra.metrics",
        "namespace": "ep",
        "type": "metrics"
    },
    "ecs": {
        "version": "8.5.1"
    },
    "elastic_agent": {
        "id": "f8436de1-7850-497f-905d-b6c9ca3116ca",
        "snapshot": false,
        "version": "7.15.0"
    },
    "event": {
        "agent_id_status": "verified",
        "category": [
            "database"
        ],
        "created": "2022-08-02T07:46:20.906Z",
        "dataset": "cassandra.metrics",
        "duration": 13448617,
        "ingested": "2022-08-02T07:46:24Z",
        "kind": "event",
        "module": "cassandra",
        "type": [
            "info"
        ]
    },
    "host": {
        "architecture": "x86_64",
        "containerized": true,
        "hostname": "docker-fleet-agent",
        "id": "2cbd07697ac16c7d26f103cb3d40e3aa",
        "ip": [
            "192.168.192.7"
        ],
        "mac": [
            "02:42:c0:a8:c0:07"
        ],
        "name": "docker-fleet-agent",
        "os": {
            "codename": "Core",
            "family": "redhat",
            "kernel": "3.10.0-1160.71.1.el7.x86_64",
            "name": "CentOS Linux",
            "platform": "centos",
            "type": "linux",
            "version": "7 (Core)"
        }
    },
    "metricset": {
        "name": "jmx",
        "period": 10000
    },
    "service": {
        "address": "http://elastic-package-service_cassandra_1:8778/jolokia/%3FignoreErrors=true\u0026canonicalNaming=false",
        "type": "jolokia"
    }
}

Exported fields

Field	Description	Type	Metric Type
@timestamp	Event timestamp.	date
agent.id	Unique identifier of this agent (if one exists). Example: For Beats this would be beat.id.	keyword
cassandra.metrics.cache.key_cache.capacity		long	gauge
cassandra.metrics.cache.key_cache.one_minute_hit_rate		long	gauge
cassandra.metrics.cache.key_cache.requests.one_minute_rate		long	gauge
cassandra.metrics.cache.row_cache.capacity		long	gauge
cassandra.metrics.cache.row_cache.one_minute_hit_rate		long	gauge
cassandra.metrics.cache.row_cache.requests.one_minute_rate		long	gauge
cassandra.metrics.client.connected_native_clients		long	gauge
cassandra.metrics.client_request.casread.one_minute_rate		double	gauge
cassandra.metrics.client_request.caswrite.one_minute_rate		double	gauge
cassandra.metrics.client_request.range_slice.one_minute_rate		double	gauge
cassandra.metrics.client_request.range_slice.total_latency		double	counter
cassandra.metrics.client_request.read.count		long	counter
cassandra.metrics.client_request.read.one_minute_rate		double	gauge
cassandra.metrics.client_request.read.timeouts	Number of read timeouts encountered.	double	counter
cassandra.metrics.client_request.read.timeoutsms		double	gauge
cassandra.metrics.client_request.read.total_latency		double	counter
cassandra.metrics.client_request.read.unavailables	Number of read unavailables encountered.	double	counter
cassandra.metrics.client_request.read.unavailablesms		double	gauge
cassandra.metrics.client_request.write.count		long	counter
cassandra.metrics.client_request.write.one_minute_rate		double	gauge
cassandra.metrics.client_request.write.timeouts		double	counter
cassandra.metrics.client_request.write.timeoutsms		double	gauge
cassandra.metrics.client_request.write.total_latency		double	counter
cassandra.metrics.client_request.write.unavailables		double	counter
cassandra.metrics.client_request.write.unavailablesms		double	gauge
cassandra.metrics.column_family.total_disk_space_used		long	gauge
cassandra.metrics.compaction.completed	compaction completed tasks.	long	gauge
cassandra.metrics.compaction.pending	compaction pending tasks.	long	gauge
cassandra.metrics.dropped_message.batch_remove		long	counter
cassandra.metrics.dropped_message.batch_store		long	counter
cassandra.metrics.dropped_message.counter_mutation		long	counter
cassandra.metrics.dropped_message.hint		long	counter
cassandra.metrics.dropped_message.mutation		long	counter
cassandra.metrics.dropped_message.paged_range		long	counter
cassandra.metrics.dropped_message.range_slice		long	counter
cassandra.metrics.dropped_message.read		long	counter
cassandra.metrics.dropped_message.read_repair		long	counter
cassandra.metrics.dropped_message.request_response		long	counter
cassandra.metrics.dropped_message.trace		long	counter
cassandra.metrics.gc.concurrent_mark_sweep.collection_count	Total number of CMS collections that have occurred.	long	gauge
cassandra.metrics.gc.concurrent_mark_sweep.collection_time	Approximate accumulated CMS collection elapsed time in milliseconds.	long	gauge
cassandra.metrics.gc.par_new.collection_count	Total number of ParNew collections that have occurred.	long	gauge
cassandra.metrics.gc.par_new.collection_time	Approximate accumulated ParNew collection elapsed time in milliseconds.	long	gauge
cassandra.metrics.memory.heap_usage.committed	Committed heap memory usage.	long	gauge
cassandra.metrics.memory.heap_usage.init	Initial heap memory usage.	long	gauge
cassandra.metrics.memory.heap_usage.max	Max heap memory usage.	long	gauge
cassandra.metrics.memory.heap_usage.used	Used heap memory usage.	long	gauge
cassandra.metrics.memory.other_usage.committed	Committed non-heap memory usage.	long	gauge
cassandra.metrics.memory.other_usage.init	Initial non-heap memory usage.	long	gauge
cassandra.metrics.memory.other_usage.max	Max non-heap memory usage.	long	gauge
cassandra.metrics.memory.other_usage.used	Used non-heap memory usage.	long	gauge
cassandra.metrics.storage.exceptions	The number of the total exceptions.	long	counter
cassandra.metrics.storage.load	Storage used for Cassandra data in bytes.	long	counter
cassandra.metrics.storage.total_hint_in_progress	The number of the total hits in progress.	long	counter
cassandra.metrics.storage.total_hints	The number of the total hits.	long	counter
cassandra.metrics.system.cluster		keyword
cassandra.metrics.system.data_center		keyword
cassandra.metrics.system.joining_nodes		keyword
cassandra.metrics.system.leaving_nodes		keyword
cassandra.metrics.system.live_nodes		keyword
cassandra.metrics.system.moving_nodes		keyword
cassandra.metrics.system.rack		keyword
cassandra.metrics.system.unreachable_nodes		keyword
cassandra.metrics.system.version		keyword
cassandra.metrics.table.all_memtables_heap_size		long	gauge
cassandra.metrics.table.all_memtables_off_heap_size		long	gauge
cassandra.metrics.table.live_disk_space_used		long	counter
cassandra.metrics.table.live_ss_table_count		long	gauge
cassandra.metrics.task.complete	completed tasks.	long	gauge
cassandra.metrics.task.pending	pending tasks.	long	gauge
cassandra.metrics.task.total_commitlog_size	total commitlog size of tasks.	long	gauge
cassandra.metrics.thread_pools.counter_mutation_stage.request.active		long	gauge
cassandra.metrics.thread_pools.counter_mutation_stage.request.pending		long	gauge
cassandra.metrics.thread_pools.mutation_stage.request.active		long	gauge
cassandra.metrics.thread_pools.mutation_stage.request.pending		long	gauge
cassandra.metrics.thread_pools.read_repair_stage.request.active		long	gauge
cassandra.metrics.thread_pools.read_repair_stage.request.pending		long	gauge
cassandra.metrics.thread_pools.read_stage.request.active		long	gauge
cassandra.metrics.thread_pools.read_stage.request.pending		long	gauge
cassandra.metrics.thread_pools.request_response_stage.request.active		long	gauge
cassandra.metrics.thread_pools.request_response_stage.request.pending		long	gauge
cloud.account.id	The cloud account or organization id used to identify different entities in a multi-tenant environment. Examples: AWS account id, Google Cloud ORG Id, or other unique identifier.	keyword
cloud.availability_zone	Availability zone in which this host, resource, or service is located.	keyword
cloud.instance.id	Instance ID of the host machine.	keyword
cloud.project.id	The cloud project identifier. Examples: Google Cloud Project id, Azure Project id.	keyword
cloud.provider	Name of the cloud provider. Example values are aws, azure, gcp, or digitalocean.	keyword
cloud.region	Region in which this host, resource, or service is located.	keyword
container.id	Unique container id.	keyword
data_stream.dataset	Data stream dataset.	constant_keyword
data_stream.namespace	Data stream namespace.	constant_keyword
data_stream.type	Data stream type.	constant_keyword
ecs.version	ECS version this event conforms to. `ecs.version` is a required field and must exist in all events. When querying across multiple indices -- which may conform to slightly different ECS versions -- this field lets integrations adjust to the schema version of the events.	keyword
error.message	Error message.	match_only_text
event.category	This is one of four ECS Categorization Fields, and indicates the second level in the ECS category hierarchy. `event.category` represents the "big buckets" of ECS categories. For example, filtering on `event.category:process` yields all events relating to process activity. This field is closely related to `event.type`, which is used as a subcategory. This field is an array. This will allow proper categorization of some events that fall in multiple categories.	keyword
event.created	event.created contains the date/time when the event was first read by an agent, or by your pipeline. This field is distinct from @timestamp in that @timestamp typically contain the time extracted from the original event. In most situations, these two timestamps will be slightly different. The difference can be used to calculate the delay between your source generating an event, and the time when your agent first processed it. This can be used to monitor your agent's or pipeline's ability to keep up with your event source. In case the two timestamps are identical, @timestamp should be used.	date
event.dataset	Name of the dataset. If an event source publishes more than one type of log or events (e.g. access log, error log), the dataset is used to specify which one the event comes from. It's recommended but not required to start the dataset name with the module name, followed by a dot, then the dataset name.	keyword
event.kind	This is one of four ECS Categorization Fields, and indicates the highest level in the ECS category hierarchy. `event.kind` gives high-level information about what type of information the event contains, without being specific to the contents of the event. For example, values of this field distinguish alert events from metric events. The value of this field can be used to inform how these kinds of events should be handled. They may warrant different retention, different access control, it may also help understand whether the data coming in at a regular interval or not.	keyword
event.module	Name of the module this data is coming from. If your monitoring agent supports the concept of modules or plugins to process events of a given source (e.g. Apache logs), `event.module` should contain the name of this module.	keyword
event.type	This is one of four ECS Categorization Fields, and indicates the third level in the ECS category hierarchy. `event.type` represents a categorization "sub-bucket" that, when used along with the `event.category` field values, enables filtering events down to a level appropriate for single visualization. This field is an array. This will allow proper categorization of some events that fall in multiple event types.	keyword
host.name	Name of the host. It can contain what `hostname` returns on Unix systems, the fully qualified domain name, or a name specified by the user. The sender decides which value to use.	keyword
service.address	Address where data about this service was collected from. This should be a URI, network address (ipv4:port or [ipv6]:port) or a resource path (sockets).	keyword
service.type	The type of the service data is collected from. The type can be used to group and correlate logs and metrics from one service type. Example: If logs or metrics are collected from Elasticsearch, `service.type` would be `elasticsearch`.	keyword

Changelog

Version	Details	Kibana version(s)
1.11.0	Enhancement View pull request Enable 'secret' for the sensitive fields, supported from 8.12.	8.8.0 or higher
1.10.1	Bug fix View pull request Update the link to the correct reindexing procedure.	8.8.0 or higher
1.10.0	Enhancement View pull request Update the package format_version to 3.0.0.	8.8.0 or higher
1.9.2	Bug fix View pull request Fix the type for `log.flags` field.	8.8.0 or higher
1.9.1	Bug fix View pull request Add null check and ignore_missing check to the rename processor	8.8.0 or higher
1.9.0	Enhancement View pull request Enable time series data streams for the metrics datasets. This dramatically reduces storage for metrics and is expected to progressively improve query performance. For more details, see https://www.elastic.co/guide/en/elasticsearch/reference/current/tsds.html.	8.8.0 or higher
1.8.1	Enhancement View pull request Add metric_type mapping for the fields of `metrics` datastream.	8.3.0 or higher
1.8.0	Enhancement View pull request Add dimension fields for metrics datastream for TSDB enablement.	8.3.0 or higher
1.7.0	Enhancement View pull request Rename ownership from obs-service-integrations to obs-infraobs-integrations	8.3.0 or higher
1.6.0	Enhancement View pull request Migrate `System Logs` dashboard visualizations to lens.	8.3.0 or higher
1.5.0	Enhancement View pull request Migrate `Overview` dashboard visualizations to lens.	8.3.0 or higher
1.4.1	Enhancement View pull request Added categories and/or subcategories.	7.15.0 or higher 8.0.0 or higher
1.4.0	Enhancement View pull request Update ECS version to 8.5.1	7.15.0 or higher 8.0.0 or higher
1.3.0	Enhancement View pull request Update cassandra package as per best practices.	7.15.0 or higher 8.0.0 or higher
1.2.3	Bug fix View pull request Fix dashboard issues.	7.15.0 or higher 8.0.0 or higher
1.2.2	Bug fix View pull request Fix typo in config template for ignoring host enrichment	—
1.2.1	Enhancement View pull request Add documentation for multi-fields	—
1.2.0	Enhancement View pull request Update to ECS 8.0	—
1.1.0	Enhancement View pull request Release cassandra package for v8.0.0	7.15.0 or higher 8.0.0 or higher
1.0.0	Enhancement View pull request GA Release	—
0.0.1	Enhancement View pull request Initial draft of the package	—

On this page

Article navigation

Compatibility

Troubleshooting

Logs

Metrics

Changelog