Back to Projects
Transformers
NLP
Topic Modeling
NLP Pipeline for Support Intelligence
Transformer-based topic modeling and intent classification to surface insights from 2M+ tickets.
Overview
Developed a comprehensive NLP pipeline to analyze customer support tickets and extract actionable insights. The system automatically categorizes tickets, identifies emerging issues, and provides sentiment analysis to help prioritize customer success efforts. Built with modern transformer models and deployed as a scalable microservice.
Code Highlight
Intent Classification with BERT
from transformers import AutoTokenizer, AutoModelForSequenceClassificationimport torchfrom sklearn.preprocessing import LabelEncoderclass SupportTicketClassifier:def __init__(self, model_name="bert-base-uncased"):self.tokenizer = AutoTokenizer.from_pretrained(model_name)self.model = AutoModelForSequenceClassification.from_pretrained(model_name,num_labels=12)self.label_encoder = LabelEncoder()def preprocess_text(self, text):"""Clean and tokenize support ticket text"""# Remove PII and standardize formattingcleaned_text = self.remove_sensitive_data(text)return self.tokenizer(cleaned_text,truncation=True,padding='max_length',max_length=512,return_tensors='pt')def classify_intent(self, ticket_text):inputs = self.preprocess_text(ticket_text)with torch.no_grad():outputs = self.model(**inputs)predictions = torch.nn.functional.softmax(outputs.logits, dim=-1)intent_id = torch.argmax(predictions, dim=-1).item()confidence = torch.max(predictions).item()return {'intent': self.label_encoder.inverse_transform([intent_id])[0],'confidence': confidence,'urgency_score': self.calculate_urgency(predictions)}
Key Results
Processed 2M+ support tickets
85% accuracy in intent classification
Reduced manual categorization by 90%
Identified 15 new product issues early
Technologies Used
Python
Transformers
BERT
Scikit-learn
PostgreSQL
Docker
Project Category
ai automation