Back to Projects
    Transformers
    NLP
    Topic Modeling

    NLP Pipeline for Support Intelligence

    Transformer-based topic modeling and intent classification to surface insights from 2M+ tickets.

    Overview

    Developed a comprehensive NLP pipeline to analyze customer support tickets and extract actionable insights. The system automatically categorizes tickets, identifies emerging issues, and provides sentiment analysis to help prioritize customer success efforts. Built with modern transformer models and deployed as a scalable microservice.

    Code Highlight

    Intent Classification with BERT
    from transformers import AutoTokenizer, AutoModelForSequenceClassification
    import torch
    from sklearn.preprocessing import LabelEncoder
    class SupportTicketClassifier:
    def __init__(self, model_name="bert-base-uncased"):
    self.tokenizer = AutoTokenizer.from_pretrained(model_name)
    self.model = AutoModelForSequenceClassification.from_pretrained(
    model_name,
    num_labels=12
    )
    self.label_encoder = LabelEncoder()
    def preprocess_text(self, text):
    """Clean and tokenize support ticket text"""
    # Remove PII and standardize formatting
    cleaned_text = self.remove_sensitive_data(text)
    return self.tokenizer(
    cleaned_text,
    truncation=True,
    padding='max_length',
    max_length=512,
    return_tensors='pt'
    )
    def classify_intent(self, ticket_text):
    inputs = self.preprocess_text(ticket_text)
    with torch.no_grad():
    outputs = self.model(**inputs)
    predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)
    intent_id = torch.argmax(predictions, dim=-1).item()
    confidence = torch.max(predictions).item()
    return {
    'intent': self.label_encoder.inverse_transform([intent_id])[0],
    'confidence': confidence,
    'urgency_score': self.calculate_urgency(predictions)
    }

    Key Results

    Processed 2M+ support tickets
    85% accuracy in intent classification
    Reduced manual categorization by 90%
    Identified 15 new product issues early

    Technologies Used

    Python
    Transformers
    BERT
    Scikit-learn
    PostgreSQL
    Docker

    Project Category

    ai automation

    Repository

    View on GitHub