Introduction
Apache Kafka is a distributed streaming platform designed to handle high-throughput, fault-tolerant, real-time data feeds. Originally developed by LinkedIn, Kafka has become the de facto standard for building event-driven architectures, real-time data pipelines, and streaming applications.
This guide covers:
- Kafka Fundamentals: Core concepts, architecture, and components
- Use Cases: Real-world applications and patterns
- Deployment: Step-by-step setup and configuration
- Best Practices: Performance, reliability, and scalability
- Practical Examples: Code samples and deployment scripts
What is Apache Kafka?
Apache Kafka is a distributed streaming platform that offers:
- High Throughput: Handle millions of messages per second
- Scalability: Horizontally scalable architecture
- Durability: Persistent message storage
- Fault Tolerance: Replication and distributed design
- Real-Time Processing: Low-latency message delivery
- Decoupling: Decouple producers and consumers
Key Concepts
Topic: A category or feed name to which messages are published. Topics are partitioned and replicated across brokers.
Partition: Topics are divided into partitions for parallelism and scalability. Each partition is an ordered, immutable sequence of messages.
Producer: Applications that publish messages to topics.
Consumer: Applications that read messages from topics.
Consumer Group: A group of consumers that work together to consume messages from a topic. Each message is delivered to only one consumer in the group.
Broker: A Kafka server that stores data and serves clients.
Cluster: A collection of brokers working together.
Offset: A unique identifier for each message within a partition.
Replication: Copies of partitions stored on multiple brokers for fault tolerance.
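These concepts are easy to inspect from a client. The sketch below is a minimal illustration, assuming a local broker on localhost:9092 and an existing topic named my-topic; it uses kafka-python to list the topic's partitions and print the earliest and latest offset in each one (their difference is the number of messages currently retained in that partition).
from kafka import KafkaConsumer, TopicPartition

# Connect without subscribing; we only need metadata and offsets
consumer = KafkaConsumer(bootstrap_servers=['localhost:9092'])

topic = 'my-topic'  # assumed existing topic
partitions = consumer.partitions_for_topic(topic) or set()
tps = [TopicPartition(topic, p) for p in sorted(partitions)]

earliest = consumer.beginning_offsets(tps)  # first retained offset per partition
latest = consumer.end_offsets(tps)          # next offset to be written per partition

for tp in tps:
    print(f"partition={tp.partition} "
          f"earliest={earliest[tp]} latest={latest[tp]} "
          f"retained={latest[tp] - earliest[tp]}")

consumer.close()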
Architecture
High-Level Architecture
┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Producer │────▶│ Producer │────▶│ Producer │
│ A │ │ B │ │ C │
└──────┬──────┘ └──────┬──────┘ └──────┬──────┘
│ │ │
└────────────────────┴────────────────────┘
│
▼
┌─────────────────────────┐
│ Kafka Cluster │
│ │
│ ┌──────────┐ │
│ │ Broker │ │
│ │ Cluster │ │
│ └────┬─────┘ │
│ │ │
│ ┌────┴─────┐ │
│ │ Topics │ │
│ │(Partitions) │
│ └──────────┘ │
└──────┬──────────────────┘
│
┌─────────────┴─────────────┐
│ │
┌──────▼──────┐ ┌───────▼──────┐
│ Consumer │ │ Consumer │
│ Group 1 │ │ Group 2 │
└─────────────┘ └─────────────┘
Explanation:
- Producers: Applications that publish messages to Kafka topics (e.g., web servers, microservices, data pipelines).
- Kafka Cluster: A collection of Kafka brokers that store and serve messages. Brokers handle read/write requests and replication.
- Broker Cluster: Multiple brokers working together to provide high availability and scalability.
- Topics (Partitions): Logical categories for messages. Topics are divided into partitions for parallel processing and scalability.
- Consumer Groups: Groups of consumers that work together to consume messages from topics. Each message is delivered to only one consumer in a group.
Core Architecture
┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Producer │────▶│ Broker │◀────│ Consumer │
│ │ │ Cluster │ │ Group │
└─────────────┘ └─────────────┘ └─────────────┘
│
┌──────┴──────┐
│ │
┌─────▼───┐ ┌─────▼───┐
│ Broker │ │ Broker │
│ 1 │ │ 2 │
└─────────┘ └─────────┘
│ │
┌─────▼─────────────▼───┐
│ Topic │
│ ┌─────────────────┐ │
│ │ Partition 0 │ │
│ │ Partition 1 │ │
│ │ Partition 2 │ │
│ └─────────────────┘ │
└───────────────────────┘
Common Use Cases
1. Event Streaming and Event-Driven Architecture
Build event-driven systems that react to events in real-time.
Use Cases:
- Microservices communication
- Event sourcing
- CQRS (Command Query Responsibility Segregation)
- Real-time analytics
Example:
from kafka import KafkaProducer, KafkaConsumer
from datetime import datetime
import json
# Producer: Publish events
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
value_serializer=lambda v: json.dumps(v).encode('utf-8')
)
def publish_event(event_type, event_data):
"""Publish event to Kafka topic"""
event = {
'event_type': event_type,
'event_data': event_data,
'timestamp': str(datetime.now())
}
producer.send('events', value=event)
producer.flush()
# Consumer: Process events
consumer = KafkaConsumer(
'events',
bootstrap_servers=['localhost:9092'],
group_id='event-processors',
value_deserializer=lambda m: json.loads(m.decode('utf-8')),
auto_offset_reset='earliest',
enable_auto_commit=True
)
def process_events():
"""Process events from Kafka"""
for message in consumer:
event = message.value
event_type = event['event_type']
event_data = event['event_data']
# Route to appropriate handler
if event_type == 'user.created':
handle_user_created(event_data)
elif event_type == 'order.placed':
handle_order_placed(event_data)
elif event_type == 'payment.processed':
handle_payment_processed(event_data)
2. Real-Time Data Pipelines
Build real-time data pipelines for ETL and data integration.
Use Cases:
- Data ingestion
- ETL pipelines
- Data synchronization
- Log aggregation
Example:
from kafka import KafkaProducer, KafkaConsumer
import json
# Producer: Ingest data from source
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
value_serializer=lambda v: json.dumps(v).encode('utf-8')
)
def ingest_data(source_data):
"""Ingest data into Kafka"""
for record in source_data:
producer.send('raw-data', value=record)
producer.flush()
# Consumer: Transform and load data
consumer = KafkaConsumer(
    'raw-data',
    bootstrap_servers=['localhost:9092'],
    group_id='etl-workers',
    value_deserializer=lambda m: json.loads(m.decode('utf-8')),
    enable_auto_commit=False  # offsets are committed manually after a successful load
)
def transform_and_load():
"""Transform data and load to destination"""
for message in consumer:
raw_data = message.value
# Transform data
transformed_data = transform(raw_data)
# Load to destination (database, data warehouse, etc.)
load_to_destination(transformed_data)
# Commit offset
consumer.commit()
3. Log Aggregation
Centralize logs from multiple sources for analysis and monitoring.
Use Cases:
- Application log aggregation
- System monitoring
- Security auditing
- Debugging and troubleshooting
Example:
from kafka import KafkaProducer
import logging
import json
# Configure Kafka handler for Python logging
class KafkaHandler(logging.Handler):
def __init__(self, kafka_producer, topic):
logging.Handler.__init__(self)
self.producer = kafka_producer
self.topic = topic
def emit(self, record):
try:
log_entry = {
'level': record.levelname,
'message': record.getMessage(),
'timestamp': str(record.created),
'module': record.module,
'function': record.funcName
}
self.producer.send(self.topic, value=log_entry)
except Exception:
self.handleError(record)
# Setup logging with Kafka
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
value_serializer=lambda v: json.dumps(v).encode('utf-8')
)
kafka_handler = KafkaHandler(producer, 'application-logs')
logger = logging.getLogger('myapp')
logger.addHandler(kafka_handler)
logger.setLevel(logging.INFO)
# Use logger
logger.info("Application started")
logger.error("Error occurred", exc_info=True)
4. Stream Processing
Process and analyze data streams in real-time.
Use Cases:
- Real-time analytics
- Fraud detection
- Recommendation systems
- Monitoring and alerting
Example with Kafka Streams (Java):
import org.apache.kafka.streams.*;
import org.apache.kafka.streams.kstream.*;
import org.apache.kafka.common.serialization.Serdes;
import java.util.Properties;
public class StreamProcessor {
public static void main(String[] args) {
Properties props = new Properties();
props.put(StreamsConfig.APPLICATION_ID_CONFIG, "stream-processor");
props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
props.put(StreamsConfig.DEFAULT_KEY_SERDE_CLASS_CONFIG, Serdes.String().getClass());
props.put(StreamsConfig.DEFAULT_VALUE_SERDE_CLASS_CONFIG, Serdes.String().getClass());
StreamsBuilder builder = new StreamsBuilder();
// Read from input topic
KStream<String, String> source = builder.stream("input-topic");
// Process stream
KStream<String, String> processed = source
.filter((key, value) -> value != null)
.mapValues(value -> value.toUpperCase())
.peek((key, value) -> System.out.println("Processed: " + value));
// Write to output topic
processed.to("output-topic");
KafkaStreams streams = new KafkaStreams(builder.build(), props);
streams.start();
}
}
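Kafka Streams itself is JVM-only. Off the JVM, the same filter/transform/forward pipeline can be approximated with an ordinary consume-and-produce loop. The sketch below uses kafka-python and the topic names from the Java example; it does not provide Streams features such as state stores or exactly-once processing.
from kafka import KafkaConsumer, KafkaProducer

consumer = KafkaConsumer(
    'input-topic',
    bootstrap_servers=['localhost:9092'],
    group_id='stream-processor',
    value_deserializer=lambda m: m.decode('utf-8') if m is not None else None
)
producer = KafkaProducer(
    bootstrap_servers=['localhost:9092'],
    value_serializer=lambda v: v.encode('utf-8')
)

# filter -> mapValues -> to, expressed as a plain consume/produce loop
for message in consumer:
    value = message.value
    if value is None:
        continue
    processed = value.upper()
    print(f"Processed: {processed}")
    producer.send('output-topic', value=processed)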
5. Activity Tracking
Track user activities and events for analytics.
Use Cases:
- User behavior tracking
- Clickstream analysis
- A/B testing
- Personalization
Example:
from kafka import KafkaProducer
import json
from datetime import datetime
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
value_serializer=lambda v: json.dumps(v).encode('utf-8')
)
def track_user_activity(user_id, activity_type, activity_data):
"""Track user activity"""
activity = {
'user_id': user_id,
'activity_type': activity_type,
'activity_data': activity_data,
'timestamp': datetime.now().isoformat(),
'session_id': get_session_id(user_id)
}
# Send to topic partitioned by user_id for ordering
producer.send(
'user-activities',
key=user_id.encode('utf-8'),
value=activity
)
# Track various activities
track_user_activity('user123', 'page_view', {'page': '/home'})
track_user_activity('user123', 'click', {'element': 'button', 'id': 'signup'})
track_user_activity('user123', 'purchase', {'product_id': 'prod456', 'amount': 99.99})
6. Messaging System
Use Kafka as a traditional message queue for microservices.
Use Cases:
- Service-to-service communication
- Task queues
- Notification systems
- Workflow orchestration
Example:
from kafka import KafkaProducer, KafkaConsumer
import json
# Producer: Send messages
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
value_serializer=lambda v: json.dumps(v).encode('utf-8')
)
def send_message(topic, message):
"""Send message to topic"""
producer.send(topic, value=message)
producer.flush()
# Consumer: Receive messages
consumer = KafkaConsumer(
'notifications',
bootstrap_servers=['localhost:9092'],
group_id='notification-service',
value_deserializer=lambda m: json.loads(m.decode('utf-8'))
)
def process_notifications():
"""Process notifications"""
for message in consumer:
notification = message.value
send_notification(notification)
Deployment Guide
Prerequisites
- Java: Kafka requires Java 8 or higher (Java 11 or 17 is recommended for Kafka 3.x)
- ZooKeeper: Required for ZooKeeper-based deployments; Kafka 3.3+ can instead run in KRaft mode, which removes the ZooKeeper dependency
- System Resources: Minimum 4GB RAM, 2 CPU cores
- Disk Space: Sufficient storage for message retention
Step 1: Install Java
Linux (Ubuntu/Debian):
sudo apt update
sudo apt install openjdk-11-jdk
java -version
macOS:
brew install openjdk@11
Windows: Download and install from Oracle or Eclipse Temurin (formerly AdoptOpenJDK)
Step 2: Download Kafka
# Download Kafka
wget https://downloads.apache.org/kafka/3.6.0/kafka_2.13-3.6.0.tgz
# Extract
tar -xzf kafka_2.13-3.6.0.tgz
cd kafka_2.13-3.6.0
Step 3: Start ZooKeeper
Kafka uses ZooKeeper for cluster coordination (or KRaft mode in Kafka 3.3+).
Start ZooKeeper:
# Start ZooKeeper
bin/zookeeper-server-start.sh config/zookeeper.properties
ZooKeeper Configuration (config/zookeeper.properties):
dataDir=/tmp/zookeeper
clientPort=2181
maxClientCnxns=0
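Alternative: KRaft mode (no ZooKeeper). Newer Kafka releases can run without ZooKeeper; the commands below follow the upstream KRaft quickstart (the exact config path may differ slightly between Kafka versions):
# Generate a cluster ID and format the log directories for KRaft
KAFKA_CLUSTER_ID="$(bin/kafka-storage.sh random-uuid)"
bin/kafka-storage.sh format -t "$KAFKA_CLUSTER_ID" -c config/kraft/server.properties
# Start the broker in KRaft mode
bin/kafka-server-start.sh config/kraft/server.properties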
Step 4: Start Kafka Broker
Start Kafka:
# Start Kafka broker
bin/kafka-server-start.sh config/server.properties
Kafka Configuration (config/server.properties):
# Broker ID
broker.id=0
# Listeners
listeners=PLAINTEXT://localhost:9092
# Log directories
log.dirs=/tmp/kafka-logs
# ZooKeeper connection
zookeeper.connect=localhost:2181
# Replication
default.replication.factor=1
min.insync.replicas=1
# Partitions
num.partitions=3
# Log retention (7 days)
log.retention.hours=168
# Log segment size (1GB)
log.segment.bytes=1073741824
Step 5: Create Topics
Using Kafka CLI:
# Create topic
bin/kafka-topics.sh --create \
--topic my-topic \
--bootstrap-server localhost:9092 \
--partitions 3 \
--replication-factor 1
# List topics
bin/kafka-topics.sh --list --bootstrap-server localhost:9092
# Describe topic
bin/kafka-topics.sh --describe \
--topic my-topic \
--bootstrap-server localhost:9092
Using Python:
from kafka.admin import KafkaAdminClient, NewTopic
admin_client = KafkaAdminClient(
bootstrap_servers=['localhost:9092']
)
# Create topic
topic = NewTopic(
name='my-topic',
num_partitions=3,
replication_factor=1,
topic_configs={
'retention.ms': '604800000' # 7 days
}
)
admin_client.create_topics([topic])
Step 6: Install Python Client
Install kafka-python:
pip install kafka-python
Install confluent-kafka (recommended):
pip install confluent-kafka
Step 7: Configure Producer
Basic Producer:
from kafka import KafkaProducer
import json
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
value_serializer=lambda v: json.dumps(v).encode('utf-8'),
key_serializer=lambda k: k.encode('utf-8') if k else None,
acks='all', # Wait for all replicas
retries=3,
max_in_flight_requests_per_connection=1,
compression_type='snappy'  # snappy compression requires the python-snappy package
)
# Send message
producer.send('my-topic', key='key1', value={'message': 'Hello Kafka'})
producer.flush()
Advanced Producer Configuration:
from kafka import KafkaProducer
from kafka.errors import KafkaError
import json
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
value_serializer=lambda v: json.dumps(v).encode('utf-8'),
acks='all',
retries=3,
batch_size=16384, # 16KB
linger_ms=10, # Wait 10ms for batching
buffer_memory=33554432, # 32MB
compression_type='snappy',
max_in_flight_requests_per_connection=5  # values >1 can reorder retried batches unless idempotence is enabled
)
def send_with_callback(topic, key, value):
"""Send message with callback"""
future = producer.send(topic, key=key, value=value)
def on_send_success(record_metadata):
print(f"Message sent to {record_metadata.topic} "
f"partition {record_metadata.partition} "
f"offset {record_metadata.offset}")
def on_send_error(exception):
print(f"Error sending message: {exception}")
future.add_callback(on_send_success)
future.add_errback(on_send_error)
Step 8: Configure Consumer
Basic Consumer:
from kafka import KafkaConsumer
import json
consumer = KafkaConsumer(
'my-topic',
bootstrap_servers=['localhost:9092'],
group_id='my-consumer-group',
value_deserializer=lambda m: json.loads(m.decode('utf-8')),
auto_offset_reset='earliest', # 'earliest' or 'latest'
enable_auto_commit=True,
auto_commit_interval_ms=1000
)
# Consume messages
for message in consumer:
print(f"Topic: {message.topic}, "
f"Partition: {message.partition}, "
f"Offset: {message.offset}, "
f"Value: {message.value}")
Advanced Consumer Configuration:
from kafka import KafkaConsumer
from kafka.errors import KafkaError
import json
consumer = KafkaConsumer(
'my-topic',
bootstrap_servers=['localhost:9092'],
group_id='my-consumer-group',
value_deserializer=lambda m: json.loads(m.decode('utf-8')),
auto_offset_reset='earliest',
enable_auto_commit=False, # Manual commit
max_poll_records=100, # Process up to 100 messages per poll
session_timeout_ms=30000, # 30 seconds
heartbeat_interval_ms=3000, # 3 seconds
fetch_min_bytes=1024, # Wait for at least 1KB
fetch_max_wait_ms=500 # Wait up to 500ms
)
def process_messages():
"""Process messages with manual commit"""
try:
for message in consumer:
try:
# Process message
process_message(message.value)
# Commit offset after successful processing
consumer.commit()
except Exception as e:
print(f"Error processing message: {e}")
# Don't commit, message will be retried
except KafkaError as e:
print(f"Kafka error: {e}")
finally:
consumer.close()
Step 9: Multi-Broker Cluster Setup
Broker 1 (config/server-1.properties):
broker.id=1
listeners=PLAINTEXT://localhost:9092
log.dirs=/tmp/kafka-logs-1
zookeeper.connect=localhost:2181
Broker 2 (config/server-2.properties):
broker.id=2
listeners=PLAINTEXT://localhost:9093
log.dirs=/tmp/kafka-logs-2
zookeeper.connect=localhost:2181
Broker 3 (config/server-3.properties):
broker.id=3
listeners=PLAINTEXT://localhost:9094
log.dirs=/tmp/kafka-logs-3
zookeeper.connect=localhost:2181
Start brokers:
bin/kafka-server-start.sh config/server-1.properties &
bin/kafka-server-start.sh config/server-2.properties &
bin/kafka-server-start.sh config/server-3.properties &
Create topic with replication:
bin/kafka-topics.sh --create \
--topic replicated-topic \
--bootstrap-server localhost:9092 \
--partitions 3 \
--replication-factor 3
Best Practices
1. Partitioning Strategy
Choose partition count based on:
- Throughput requirements
- Consumer parallelism needs
- Retention requirements
Guidelines:
- More partitions = more parallelism
- Too many partitions = more open file handles, replication traffic, and slower leader elections
- A common starting point is a handful of partitions per topic (e.g., 3-6), then grow based on measured throughput (see the sizing sketch after the example below)
# Create topic with appropriate partitions
admin_client.create_topics([
NewTopic(
name='high-throughput-topic',
num_partitions=6, # 6 partitions for parallelism
replication_factor=3
)
])
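A rough way to size the partition count is to divide the target throughput by what a single partition can sustain on your hardware, for both the produce and the consume side, and take the larger of the two. The per-partition figures below are placeholders you would replace with your own measurements.
import math

def estimate_partitions(target_mb_per_sec,
                        per_partition_produce_mb=10,   # measured producer throughput per partition (assumption)
                        per_partition_consume_mb=20):  # measured consumer throughput per partition (assumption)
    """Rule-of-thumb partition count: max of produce-side and consume-side needs."""
    produce_side = math.ceil(target_mb_per_sec / per_partition_produce_mb)
    consume_side = math.ceil(target_mb_per_sec / per_partition_consume_mb)
    return max(produce_side, consume_side, 1)

# Example: a topic that must absorb ~50 MB/s
print(estimate_partitions(50))  # -> 5 with the placeholder figures above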
2. Replication Factor
Set replication factor for fault tolerance:
- Production: replication_factor >= 3
- Development: replication_factor = 1
bin/kafka-topics.sh --create \
--topic production-topic \
--bootstrap-server localhost:9092 \
--partitions 6 \
--replication-factor 3 # 3 replicas for fault tolerance
3. Message Keys
Use keys for partitioning:
- Same key → same partition
- Enables ordering within partition
- Useful for related messages
# Send with key for partitioning
producer.send(
'user-events',
key=user_id.encode('utf-8'), # Same user → same partition
value=event_data
)
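The reason the same key lands on the same partition is that the default partitioner hashes the key and takes it modulo the partition count. A simplified illustration (the real Java and Python clients use a murmur2 hash, so actual partition numbers will differ):
def naive_partition_for(key: bytes, num_partitions: int) -> int:
    """Simplified stand-in for the default key-based partitioner."""
    # Real clients hash the key with murmur2 and take it modulo the
    # partition count; any deterministic hash shows the same property.
    return sum(key) % num_partitions

print(naive_partition_for(b'user123', 6))  # same key ...
print(naive_partition_for(b'user123', 6))  # ... always maps to the same partition
Note that increasing the partition count changes this mapping, so per-key ordering is only guaranteed while the partition count stays fixed.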
4. Consumer Groups
Use consumer groups for parallel processing:
- Multiple consumers in group
- Each partition consumed by one consumer
- Automatic load balancing
# Multiple consumers in same group
consumer1 = KafkaConsumer('topic', group_id='my-group')
consumer2 = KafkaConsumer('topic', group_id='my-group')
consumer3 = KafkaConsumer('topic', group_id='my-group')
5. Idempotent Producers
Enable idempotence so the broker deduplicates retried writes. Note that idempotence alone guarantees exactly-once delivery only per partition within a single producer session; end-to-end exactly-once processing additionally requires transactions. With kafka-python (support for this setting depends on the client version; a confluent-kafka variant follows below):
producer = KafkaProducer(
    bootstrap_servers=['localhost:9092'],
    enable_idempotence=True,  # requires acks='all'
    acks='all',
    retries=3
)
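With the confluent-kafka client, which is configured with a plain dict of librdkafka settings, the equivalent sketch looks like this; enable.idempotence implies acks=all and bounded retries:
from confluent_kafka import Producer

producer = Producer({
    'bootstrap.servers': 'localhost:9092',
    'enable.idempotence': True  # broker-side deduplication of retried writes
})

def delivery_report(err, msg):
    if err is not None:
        print(f"Delivery failed: {err}")
    else:
        print(f"Delivered to {msg.topic()} [{msg.partition()}] @ {msg.offset()}")

producer.produce('my-topic', key=b'key1', value=b'hello', callback=delivery_report)
producer.flush()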
6. Compression
Enable compression to reduce network and storage:
producer = KafkaProducer(
bootstrap_servers=['localhost:9092'],
compression_type='snappy' # or 'gzip', 'lz4'
)
7. Monitoring
Monitor Kafka metrics:
- Message rate (producer/consumer)
- Lag (consumer lag)
- Disk usage
- Network I/O
Using kafka-consumer-groups:
# Check consumer lag
bin/kafka-consumer-groups.sh \
--bootstrap-server localhost:9092 \
--group my-consumer-group \
--describe
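The same lag figure can be computed programmatically by comparing each partition's latest offset with the group's committed offset. A hedged sketch with kafka-python, using the group name from the command above:
from kafka import KafkaAdminClient, KafkaConsumer

GROUP_ID = 'my-consumer-group'

admin = KafkaAdminClient(bootstrap_servers=['localhost:9092'])
committed = admin.list_consumer_group_offsets(GROUP_ID)  # {TopicPartition: OffsetAndMetadata}

# A throwaway consumer (no group) just to look up log-end offsets
probe = KafkaConsumer(bootstrap_servers=['localhost:9092'])
end_offsets = probe.end_offsets(list(committed.keys()))

for tp, meta in committed.items():
    lag = end_offsets[tp] - meta.offset
    print(f"{tp.topic}[{tp.partition}] committed={meta.offset} end={end_offsets[tp]} lag={lag}")

probe.close()
admin.close()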
Common Patterns
1. Producer-Consumer Pattern
# Producer
producer = KafkaProducer(bootstrap_servers=['localhost:9092'])
producer.send('topic', value=b'message')  # without a value_serializer, values must be bytes
# Consumer
consumer = KafkaConsumer('topic', group_id='my-group')
for message in consumer:
process(message.value)
2. Fan-Out Pattern
# Multiple consumer groups consume same topic
consumer_group_1 = KafkaConsumer('events', group_id='analytics')
consumer_group_2 = KafkaConsumer('events', group_id='notifications')
consumer_group_3 = KafkaConsumer('events', group_id='audit')
3. Request-Reply Pattern
# Producer sends request (reusing a JSON-serializing producer as configured earlier)
import uuid
request_id = str(uuid.uuid4())
producer.send('requests', key=request_id.encode('utf-8'), value=request_data)
# Consumer processes and sends reply
consumer = KafkaConsumer('requests', group_id='processors')
for message in consumer:
result = process_request(message.value)
producer.send('replies', key=message.key, value=result)
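The requesting side typically consumes the replies topic and correlates replies to requests by the key it used when sending. A minimal sketch, assuming JSON-serialized replies and the request_id from the snippet above (the group name 'requesters' is illustrative):
from kafka import KafkaConsumer
import json

reply_consumer = KafkaConsumer(
    'replies',
    bootstrap_servers=['localhost:9092'],
    group_id='requesters',
    value_deserializer=lambda m: json.loads(m.decode('utf-8'))
)

def wait_for_reply(request_id):
    """Block until a reply whose key matches our request_id arrives."""
    for message in reply_consumer:
        if message.key == request_id.encode('utf-8'):
            return message.value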
4. Change Data Capture (CDC)
# Capture database changes and publish to Kafka
def capture_changes():
for change in database_changes:
producer.send('database-changes', value=change)
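In practice database_changes would come from the database's changelog, often via Kafka Connect and a CDC connector such as Debezium. A simplified polling variant looks roughly like the sketch below; it assumes a hypothetical events table with monotonically increasing id and a data column, and uses the standard-library sqlite3 module only for illustration.
import sqlite3
import json
import time
from kafka import KafkaProducer

producer = KafkaProducer(
    bootstrap_servers=['localhost:9092'],
    value_serializer=lambda v: json.dumps(v).encode('utf-8')
)

def poll_changes(db_path='app.db', poll_interval=5):
    """Poll for new rows and publish them; last_seen_id acts as the watermark."""
    last_seen_id = 0
    conn = sqlite3.connect(db_path)
    while True:
        rows = conn.execute(
            "SELECT id, data FROM events WHERE id > ? ORDER BY id", (last_seen_id,)
        ).fetchall()
        for row_id, data in rows:
            producer.send('database-changes', value={'id': row_id, 'data': data})
            last_seen_id = row_id
        producer.flush()
        time.sleep(poll_interval)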
Troubleshooting
Common Issues
1. Consumer Lag
- Increase consumer instances
- Optimize processing logic
- Check network latency
- Monitor consumer performance
2. Message Loss
- Set acks='all' in the producer
- Increase replication factor
- Check disk space
- Monitor broker health
3. High Latency
- Optimize batch size
- Adjust compression
- Check network bandwidth
- Monitor broker load
4. Partition Imbalance
- Use consistent keys
- Monitor partition distribution
- Rebalance partitions if needed
Managed Kafka Services
Amazon MSK (Managed Streaming for Kafka)
Benefits:
- Fully managed
- Automatic scaling
- High availability
- Security and compliance
Setup:
# Create MSK cluster using AWS CLI
aws kafka create-cluster \
--cluster-name my-kafka-cluster \
--broker-node-group-info file://broker-info.json \
--kafka-version 3.6.0 \
--number-of-broker-nodes 3
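The referenced broker-info.json describes the broker nodes. A minimal example is sketched below; the field names follow the MSK BrokerNodeGroupInfo shape, the subnet and security-group IDs are placeholders, and you should check the current MSK documentation for required fields in your region:
{
  "InstanceType": "kafka.m5.large",
  "ClientSubnets": ["subnet-0123456789abcdef0", "subnet-0123456789abcdef1", "subnet-0123456789abcdef2"],
  "SecurityGroups": ["sg-0123456789abcdef0"],
  "StorageInfo": {
    "EbsStorageInfo": {
      "VolumeSize": 100
    }
  }
}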
Confluent Cloud
Benefits:
- Fully managed
- Global availability
- Schema registry included
- Connectors ecosystem
What Interviewers Look For
Kafka Knowledge & Application
- Core Concepts Understanding
- Topics, partitions, consumers
- Consumer groups
- Replication
- Red Flags: Vague understanding, wrong concepts, can’t explain
- Partitioning Strategy
- Appropriate partitioning
- Key-based partitioning
- Parallelism
- Red Flags: No partitioning strategy, poor distribution, bottlenecks
- Consumer Group Design
- Parallel processing
- Load balancing
- Red Flags: No consumer groups, poor parallelism, bottlenecks
System Design Skills
- When to Use Kafka
- Event-driven architecture
- High-throughput streaming
- Real-time processing
- Red Flags: Wrong use case, over-engineering, can’t justify
- Scalability Design
- Horizontal scaling
- Partition strategy
- Red Flags: Vertical scaling, no scaling, bottlenecks
- Reliability Design
- Replication factor
- Exactly-once semantics
- Durability
- Red Flags: No replication, message loss, no durability
Problem-Solving Approach
- Trade-off Analysis
- Throughput vs latency
- Consistency vs availability
- Red Flags: No trade-offs, dogmatic choices
- Edge Cases
- Consumer lag
- Partition failures
- Message ordering
- Red Flags: Ignoring edge cases, no handling
- Performance Optimization
- Compression
- Batching
- Red Flags: No optimization, poor performance
Communication Skills
- Kafka Explanation
- Can explain Kafka architecture
- Understands concepts
- Red Flags: No understanding, vague explanations
- Decision Justification
- Explains why Kafka
- Discusses alternatives
- Red Flags: No justification, no alternatives
Meta-Specific Focus
- Event-Driven Systems Expertise
- Kafka knowledge
- Streaming architecture
- Key: Show event-driven systems expertise
- High-Throughput Design
- Scalability focus
- Performance optimization
- Key: Demonstrate high-throughput expertise
Conclusion
Apache Kafka is a powerful distributed streaming platform for building real-time data pipelines and event-driven applications. Key takeaways:
- Choose appropriate partitions for parallelism
- Set replication factor for fault tolerance
- Use consumer groups for parallel processing
- Enable idempotence for exactly-once semantics
- Monitor consumer lag and performance
- Use compression to optimize network and storage
Whether you’re building event-driven architectures, real-time data pipelines, or streaming applications, Kafka provides the scalability, durability, and performance you need.