in Tech

The Logging Infrastructure of WePay

1.6k Views

**Log Aggregation: WePay’s Logging Infrastructure**
Without logs, navigating through issues and errors would be like stumbling in the dark. In this article, we will discuss how WePay has set up its logging infrastructure and the components they use for log aggregation, processing, enrichment, buffering, ingestion, storage, and searching.

**Log Aggregation**
WePay utilizes Filebeat to collect and ship logs from virtual machines (VMs) and microservices. Filebeat ensures reliable delivery of events with no data loss by storing the delivery state of each event in a registry file. If the defined output is blocked, Filebeat will continue trying to send events until it receives confirmation from the output.

**Log Processing and Enrichment**
WePay utilizes Logstash for advanced log processing and enrichment. Logstash filters parse each log event, identify named fields, and transform them into a common format for analysis. Logstash’s features include:
– Deriving structure from unstructured data using grok and mutate filters
– Deciphering geographic coordinates from IP addresses
– Anonymizing personally identifiable information (PII) data and excluding sensitive fields from logs
– Simplifying processing, regardless of data source, format, or schema
– Converting JSON log messages into Avro format matching an Avro schema for storage in Kafka

**Log Buffering and Ingestion**
To handle sudden log surges and protect Elasticsearch, WePay uses Apache Kafka for log buffering. In WePay’s logging pipeline, Logstash forwards logs to Kafka in Avro format. WePay also utilizes lightweight Confluent Kafka Connectors for ingesting logs into Elastic, Google BigQuery, and Google Cloud Storage Buckets.

**Storing Logs**
WePay, as a payments company, follows PCI DSS audits requirements, which state that logs must be retained for a minimum of one year, with 90 days of logs available for immediate analysis. WePay stores logs in three places:
1. Elasticsearch: Logs are retained for 90 days for immediate analysis.
2. Google BigQuery: Logs are stored for long-term retention and compliance requirements.
3. Google Cloud Storage: Logs are stored for long-term retention and to support data backfilling in case of BigQuery connector issues.

WePay employs separate Elastic clusters for different environments: development, testing, staging, and production. This separation offers advantages such as isolating log surges caused by bugs or incorrect logging formats and minimizing risks during upgrades.

**Hot Warm Cold Architecture**
To manage data retention and index lifecycle, WePay has set up three types of elastic data nodes: hot, warm, and cold. Hot nodes handle new indexes with extensive reads and writes, while warm and cold nodes host older indexes with reads and no writes. Elastic’s Index Lifecycle Management (ILM) triggers actions based on conditions. WePay’s ILM policy includes transitioning new indexes to warm nodes after one week and freezing indexes on cold nodes after one month. Indexes are deleted after three months.

**Searching with Kibana**
WePay uses two elastic clusters but wanted to have a single Kibana instance for searching logs. They achieved this using cross-cluster search. By creating a third Elasticsearch cluster, requests can be sent to both the development and production clusters. This setup has several benefits, including centralized security roles and permissions, dedicated machine learning nodes, search thread throttling, and increased overall security.

**Future Work**
WePay has plans to integrate application performance monitoring (APM) data into Elasticsearch for better correlation with logging events. They also aim to implement Cross Cluster Replication for improved fault tolerance and utilize the Frozen Tier feature in Elastic to search for logs beyond the 90-day retention period.

**In Conclusion**
WePay’s logging infrastructure, with its log aggregation, processing, enrichment, buffering, ingestion, storage, and searching capabilities, has been designed to handle high volumes of logs while ensuring data integrity, security, and compliance. The separation of environments and the utilization of different Elastic clusters have proven effective in managing and protecting the logging pipeline. With continuous improvements and future developments, WePay continues to enhance its logging capabilities for efficient troubleshooting and analysis.

The Logging Infrastructure of WePay

Ezoic Earnings: Report on Income from Niche Sites in May 2024

Attract Free Traffic to Your Links, Website, and Affiliate Marketing in 2024

Starting a Profitable Affiliate Marketing Business in 7 Days Using A.I.

Introduction to Affiliate Marketing Trends: Part 1

Creating a Free Affiliate Marketing Website with AI

iRobot’s Revolutionary Roomba j7+ with Poop Detection Available at Unbeatable Price!

Examining the mechanisms of server-side rendering and hydration in Gatsby and Next

Samsung launches the highly anticipated One UI 6 beta program, welcoming users to immerse themselves in the cutting-edge interface

Detecting new fraudulent behaviors through unsupervised graph anomaly detection

Xsolla Unveils Exciting Collaborations to Empower Game Developers and Unveils Tokyo Expansion

Enhanced Streaming Experience for Sidekick Users

Leave a ReplyCancel reply

Tour of Pearl Garden in Om Nagar, Vasai West

Watch the detailed tutorial on investing in UAP Old Mutual Unit Trust Fund now!

GenAfrica Asset Managers: Our Portfolio

Assessing Vulnerabilities of 5G Networks: An In-depth Field Campaign | MIT News

Gabriel Davidescu, UTI Construction and Facility Management, unveils all about Brașov Airport

iRobot’s Revolutionary Roomba j7+ with Poop Detection Available at Unbeatable Price!

Ezoic Earnings: Report on Income from Niche Sites in May 2024

Attract Free Traffic to Your Links, Website, and Affiliate Marketing in 2024

Starting a Profitable Affiliate Marketing Business in 7 Days Using A.I.

Introduction to Affiliate Marketing Trends: Part 1

Creating a Free Affiliate Marketing Website with AI

Traffic source that is free for affiliate marketing and websites in 2024 by Anup Gutta.

Download the free book on GetBigCommissions.Com. For high-quality lead magnets.

Demo of the UpTik Affiliate Outreach Bot for TikTok Shop Live with a Comprehensive Update Overview and a 2-Day Trial Offer

Building a Profitable Affiliate Marketing Funnel on Pinterest

Ezoic Earnings: Report on Income from Niche Sites in May 2024

Attract Free Traffic to Your Links, Website, and Affiliate Marketing in 2024

Starting a Profitable Affiliate Marketing Business in 7 Days Using A.I.

Introduction to Affiliate Marketing Trends: Part 1

Creating a Free Affiliate Marketing Website with AI

Traffic source that is free for affiliate marketing and websites in 2024 by Anup Gutta.

Download the free book on GetBigCommissions.Com. For high-quality lead magnets.

[Milestone Achievement] GPT-4 Demonstrates Unprecedented Accuracy in MIT Exam, Accompanied by OpenLLaMA 13B Debut

Leave a ReplyCancel reply

Log In

Sign In

Forgot password?

Your password reset link appears to be invalid or expired.

Log in

Privacy Policy

Add to Collection

No Collections

Hold on! Before you go away...