-
Notifications
You must be signed in to change notification settings - Fork 751
Metrics for Gobblin ETL
Issac Buenrostro edited this page Aug 21, 2015
·
7 revisions
Gobblin ETL comes equipped with instrumentation using [Gobblin Metrics](Gobblin Metrics), as well as end points to easily extend this instrumentation.
Each construct in a Gobblin ETL run computes metrics regarding it's performance / progress. Each metric is tagged by default with the following tags:
- jobName: Gobblin generated name for the job.
- jobId: Gobblin generated id for the job.
- clusterIdentifier: string identifier the cluster / host where the job was run. Obtained from resource manager, job tracker, or the name of the host.
- taskId: Gobblin generated id for the task that generated the metric.
- construct: construct type that generated the metric (e.g. extractor, converter, etc.)
- class: specific class of the construct that generated the metric.
This is the list of operational metrics implemented by default, grouped by construct.
- gobblin.extractor.records.read: meter for records read.
- gobblin.extractor.records.failed: meter for records failed to read.
- gobblin.extractor.extract.time: timer for reading of records.
- gobblin.converter.records.in: meter for records going into the converter.
- gobblin.converter.records.out: meter for records outputted by the converter.
- gobblin.converter.records.failed: meter for records that failed to be converted.
- gobblin.converter.convert.time: timer for conversion time of each record.
- gobblin.fork.operator.records.in: meter for records going into the fork operator.
- gobblin.fork.operator.forks.out: meter for records going out of the fork operator (each record is counted once for each fork it is emitted to).
- gobblin.fork.operator.fork.time: timer for forking of each record.
- gobblin.qualitychecker.records.in: meter for records going into the row level policy.
- gobblin.qualitychecker.records.passed: meter for records passing the row level policy check.
- gobblin.qualitychecker.records.failed: meter for records failing the row level policy check.
- gobblin.qualitychecker.check.time: timer for row level policy checking of each record.
- gobblin.writer.records.in: meter for records requested to be written.
- gobblin.writer.records.written: meter for records actually written.
- gobblin.writer.records.failed: meter for records failed to be written.
- gobblin.writer.write.time: timer for writing each record.
- Home
- [Getting Started](Getting Started)
- Architecture
- User Guide
- Working with Job Configuration Files
- [Deployment](Gobblin Deployment)
- Gobblin on Yarn
- Compaction
- [State Management and Watermarks] (State-Management-and-Watermarks)
- Working with the ForkOperator
- [Configuration Glossary](Configuration Properties Glossary)
- [Partitioned Writers](Partitioned Writers)
- Monitoring
- Schedulers
- [Job Execution History Store](Job Execution History Store)
- Gobblin Build Options
- Troubleshooting
- [FAQs] (FAQs)
- Case Studies
- Gobblin Metrics
- [Quick Start](Gobblin Metrics)
- [Existing Reporters](Existing Reporters)
- [Metrics for Gobblin ETL](Metrics for Gobblin ETL)
- [Gobblin Metrics Architecture](Gobblin Metrics Architecture)
- [Implementing New Reporters](Implementing New Reporters)
- [Gobblin Metrics Performance](Gobblin Metrics Performance)
- Developer Guide
- [Customization: New Source](Customization for New Source)
- [Customization: Converter/Operator](Customization for Converter and Operator)
- Code Style Guide
- IDE setup
- Monitoring Design
- Project
- [Feature List](Feature List)
- Contributors/Team
- [Talks/Tech Blogs](Talks and Tech Blogs)
- News/Roadmap
- Posts
- Miscellaneous