Chat History Service (Port 8014)¶
Service Path: machineagents-be/chathistory-service/
Port: 8014
Total Lines: 1,265
Purpose: Comprehensive chat history management including intelligent classification, token tracking with cost calculation, access sharing with email notifications, lead form configuration, and TTS/lip-sync generation for form messages.
Table of Contents¶
- Service Overview
- Architecture & Dependencies
- Database Collections
- Core Features
- API Endpoints Summary
- Chat Classification System
- Chat History Management
- Token Counting & Cost Calculation
- Access Sharing System
- Lead Form Management
- TTS & Lip-Sync Integration
- Security Analysis
- Integration Points
Service Overview¶
Primary Responsibilities¶
- Intelligent Chat Classification:
  - GPT-3.5 powered categorization (Leads, Job Enquiry, Customer Service, General)
  - Per-message classification with session context
  - Smart session-level categorization with priority rules
  - Context-aware analysis (last 3 messages)
- Chat History Management:
  - Save chat conversations with metadata
  - Retrieve by session, project, or user
  - IST timezone conversion (Asia/Kolkata)
  - Category tracking at message and session level
- Token Tracking & Cost Calculation:
  - Per-message token counting (tiktoken cl100k_base)
  - Session-level token aggregation
  - Project-level cost calculation (₹0.144 per token)
  - User-level cost aggregation across all projects
- Access Sharing:
  - Share chatbot access via email
  - 3 permission levels (viewer, editor, analytics)
  - Azure Communication Services email integration
  - Duplicate sharing prevention
- Lead Form Configuration:
  - Dynamic form field configuration
  - Prompt-based form triggering
  - TTS/Lip-sync for form messages
  - Lead data collection endpoint
Architecture & Dependencies¶
Technology Stack¶
Framework:
- FastAPI (web framework)
- Uvicorn (ASGI server)
AI/ML:
- Azure OpenAI (GPT-3.5-Turbo-16k) - Chat classification
- Azure TTS - Form message audio
- tiktoken (cl100k_base) - Token counting
Database:
- MongoDB (CosmosDB) - 4 collections
Communication:
- Azure Communication Services - Email sending
Audio Processing:
- Rhubarb Lip-Sync - Phonetic analysis
- FFmpeg - Audio conversion
- Azure Cognitive Services Speech SDK
Key Imports¶
import pytz # IST timezone
import tiktoken # Token counting
from bson import ObjectId
from pymongo import MongoClient
from fastapi import FastAPI, HTTPException, Query, Form
from azure.communication.email import EmailClient
import azure.cognitiveservices.speech as speechsdk
from openai import AzureOpenAI
Environment Variables¶
MONGO_URI=mongodb://...
MONGO_DB_NAME=Machine_agent_dev
ENDPOINT_URL=https://machineagentopenai.openai.azure.com/...
DEPLOYMENT_NAME=gpt-35-turbo-16k-0613
AZURE_OPENAI_API_KEY=AZxDVMYB... # ⚠️ HARDCODED FALLBACK
AZURE_SPEECH_KEY=9N41NOfDyVDoduiD4EjlzmZU9CbUX3pPqWfLCORpl7cBf0l2lzVQJQQJ99BCACGhslBXJ3w3AAAYACOG2329 # ⚠️ HARDCODED
AZURE_SPEECH_REGION=centralindia
Configuration Constants¶
# Token cost (Indian Rupees)
TOKEN_COST_RS = 0.144 # Per token
# Email configuration
EMAIL_ENDPOINT = "https://mailing-sevice.india.communication.azure.com/"
EMAIL_ACCESS_KEY = "CgdEWi6fBCJv4c0EHOkq6ZUSML0VQSG49qXgEfrtlLmXY76HV7DeJQQJ99BBACULyCplbknJAAAAAZCSbc3T" # ⚠️ HARDCODED
SENDER_EMAIL = "DoNotReply@machineagents.ai"
# Tokenizer
tokenizer = tiktoken.get_encoding("cl100k_base")
Database Collections¶
4 MongoDB Collections¶
history_collection = db["chatbot_history"] # Chat conversations
lead_collection = db["LEAD_COLLECTION"] # Lead form configurations & data
share_collection = db["Share_Access"] # Access sharing records
users_collection = db["users_multichatbot_v2"] # User accounts (for email validation)
chatbot_history Schema¶
Session Document:
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"session_id": "session-abc123",
"originated_url": "https://example.com/pricing",
"datetime": "2024-01-15 14:30:45", // IST
"session_total_tokens": 1250,
"session_category": "Leads", // Smart categorization
"utm_data": { // If available
"utm_source": "google",
"utm_medium": "cpc"
},
"chat_data": [
{
"input_prompt": "What are your pricing plans?",
"output_response": "We offer three plans...",
"timestamp": "2024-01-15 14:30:45",
"input_tokens": 6,
"output_tokens": 45,
"total_tokens": 51,
"category": "Leads" // Per-message classification
},
{
"input_prompt": "Can I schedule a demo?",
"output_response": "Absolutely! I can help...",
"timestamp": "2024-01-15 14:32:10",
"input_tokens": 7,
"output_tokens": 38,
"total_tokens": 45,
"category": "Leads"
}
]
}
Share_Access Schema¶
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"shared_with_email": "colleague@example.com",
"share_access": "chatbot_viewer", // or "chatbot_editor" or "chatbot_analytics"
"shared_at": ISODate("2024-01-15T09:00:00.000Z")
}
LEAD_COLLECTION Schema¶
Form Configuration:
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"prompt_number": 5, // Trigger after 5th message
"message_display": "Need help? Fill out the form...",
"trigger": true,
"bfName": true, // Show name field
"bfNumber": true, // Show phone field
"bfemail": true, // Show email field
"bfwebsite": false,
"bfcompany": false,
"reqName": true, // Name required
"reqNumber": false,
"reqemail": true, // Email required
"reqwebsite": false,
"reqcompany": false,
"fsubemail": "sales@example.com", // Form submission recipient
"voice": "Female_1",
"tts_audio": "UklGRiQAAABXQVZF...", // Base64
"lipsync_data": { /* Rhubarb JSON */ }
}
Lead Submission Data:
{
"name": "John Doe",
"email": "john@example.com",
"phone": "+1234567890",
"session_id": "session-abc123",
"user_id": "User-123",
"project_id": "User-123_Project_1",
"company": "Acme Inc",
"website": "https://acme.com",
"submitted_at": ISODate("2024-01-15T09:30:00.000Z")
}
Core Features¶
1. Dual Chat Classification System¶
Two Save Endpoints:
| Endpoint | Classification Strategy | Use Case |
|---|---|---|
| `/v2/save-chat-history-smart` | Per-message GPT classification + rule-based session category (no extra GPT call for the session) | Production (cost-effective) |
| `/v2/save-chat-history` | Per-message GPT classification + session category = latest message category | Analytics (expensive) |
Classification Categories:
- Leads - Buying intent, pricing, meetings, partnerships
- Job Enquiry - Career opportunities, applications, interviews
- Customer Service - Technical issues, support, troubleshooting
- General - Greetings, basic info, casual chat
2. Token-Based Cost Tracking¶
Cost Model:
- Rate: ₹0.144 per token
- Model: GPT-3.5-Turbo-16k equivalent
- Tracking: Per-message → Per-session → Per-project → Per-user
Calculation:
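A minimal sketch of how the totals roll up from message to session to project, using the ₹0.144/token rate and the example figures quoted elsewhere in this document:

```python
# Token cost roll-up: message -> session -> project
TOKEN_COST_RS = 0.144  # rupees per token (configuration constant above)

# Per-message: input + output tokens, as stored in each chat_data entry
message = {"input_tokens": 6, "output_tokens": 45}
message_total = message["input_tokens"] + message["output_tokens"]

# Per-session: sum of message totals (maintained via $inc on session_total_tokens)
session_total = sum(m["total_tokens"] for m in [
    {"total_tokens": 51}, {"total_tokens": 45},
])

# Per-project: sum of session totals, converted to rupees
project_tokens = 12_500
project_cost_rs = project_tokens * TOKEN_COST_RS

print(message_total, session_total, project_cost_rs)  # 51 96 1800.0
```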
3. Email-Based Access Sharing¶
Workflow:
- Owner shares project with email address
- System validates email exists in `users_collection`
- Sends email notification via Azure Communication Services
- Creates `Share_Access` record
- Recipient sees shared chatbot in their dashboard
Permission Levels:
- `chatbot_viewer` - View chat history only
- `chatbot_editor` - Modify chatbot settings
- `chatbot_analytics` - Access analytics data
4. Dynamic Form Management¶
Features:
- Trigger form after N messages (configurable)
- 5 field types (name, phone, email, website, company)
- Required/optional field configuration
- TTS + Lip-sync for form message
- Email notification on form submission
API Endpoints Summary¶
Total: 15 Endpoints¶
| Endpoint | Method | Purpose |
|---|---|---|
| `/v2/save-chat-history-smart` | POST | Save chat with smart classification |
| `/v2/save-chat-history` | POST | Save chat with GPT classification |
| `/v2/get-chat-history` | GET | Get single session by session_id |
| `/v2/get-chat-history/{project_id}` | GET | Get all sessions for project |
| `/v2/calculate-total-tokens/{project_id}` | GET | Calculate project token cost |
| `/v2/calculate-user-total-tokens` | GET | Calculate user total cost |
| `/delete-chatbot-history/{project_id}` | DELETE | Delete all chat for project |
| `/send-access/` | POST | Share chatbot access via email |
| `/get-shared-access` | GET | Get all shared accesses for user |
| `/get-shared-access-by-project` | GET | Get shared access for project |
| `/remove-shared-access` | DELETE | Remove shared access |
| `/v2/save-data-f` | POST | Save/update form configuration |
| `/v2/get-data-for-lead` | GET | Get form configuration |
| `/v2/add-lead` | POST | Submit lead form data |
| `/v2/fetch-lead-data` | GET | Fetch submitted lead data |
Chat Classification System¶
GPT-3.5 Classification (Lines 89-174)¶
Function: classify_prompt(input_prompt, output_response, session_context)
System Prompt:
system_message = {
"role": "system",
"content": (
"You are an expert chat classifier. Analyze the conversation and classify it into one of these categories:\n\n"
"1. **Leads**: Conversations where the user:\n"
" - Expresses interest in purchasing products/services\n"
" - Asks for pricing, quotes, or proposals\n"
" - Wants to schedule a meeting or consultation\n"
" - Requests contact from sales team\n"
" - Shows buying intent or decision-making behavior\n"
" - Asks about company capabilities for potential business\n"
" - Inquires about partnerships or collaborations\n\n"
"2. **Job Enquiry**: Conversations where the user:\n"
" - Asks about job openings or career opportunities\n"
" - Inquires about internships or training programs\n"
" - Wants to know about company culture or work environment\n"
" - Asks about application processes or requirements\n"
" - Discusses resume submission or interviews\n"
" - Seeks information about specific roles or positions\n\n"
"3. **Customer Service**: Conversations where the user:\n"
" - Reports technical issues or problems\n"
" - Asks for help with existing products/services\n"
" - Needs troubleshooting assistance\n"
" - Requests support for account-related issues\n"
" - Complains about service or product quality\n"
" - Seeks help with installation, setup, or usage\n"
" - Asks for refunds, returns, or exchanges\n\n"
"4. **General**: Conversations that:\n"
" - Are basic greetings without specific intent\n"
" - Ask general questions about the company\n"
" - Seek basic information without clear purpose\n"
" - Are casual interactions or small talk\n"
" - Don't fit into the above categories\n\n"
"Instructions:\n"
"- Focus primarily on the USER'S intent and questions\n"
"- Consider the overall conversation flow and context\n"
"- Look for keywords and phrases that indicate specific needs\n"
"- If multiple categories apply, choose the most dominant one\n"
"- Respond with ONLY the category name: 'Leads', 'Job Enquiry', 'Customer Service', or 'General'\n"
)
}
Context Building:
# Get last 3 messages for context
if existing_session and existing_session.get("chat_data"):
recent_chats = existing_session["chat_data"][-3:]
session_context = " ".join([
f"User: {chat.get('input_prompt', '')[:100]} Bot: {chat.get('output_response', '')[:100]}"
for chat in recent_chats
])
conversation_text = f"Previous Context: {session_context}\n\nUser Input: {input_prompt}\n\nBot Response: {output_response}"
API Call:
response = client_gpt35.chat.completions.create(
model="gpt-35-turbo-16k-0613",
messages=[system_message, user_message],
temperature=0.1, # Low temperature for consistent classification
max_tokens=20,
)
category = response.choices[0].message.content.strip()
Sanitization:
valid_categories = ["Leads", "Job Enquiry", "Customer Service", "General"]
if category not in valid_categories:
# Fuzzy matching
category_lower = category.lower()
if "lead" in category_lower:
category = "Leads"
elif "job" in category_lower or "enquiry" in category_lower:
category = "Job Enquiry"
elif "customer" in category_lower or "service" in category_lower:
category = "Customer Service"
else:
category = "General"
Cost: ~20 output tokens per classification (≈ ₹2.88 at the ₹0.144/token rate), not counting the prompt tokens consumed by the system message and context
Smart Classification (Lines 177-206)¶
Function: determine_session_category_smart(chat_data)
Priority Rules:
def determine_session_category_smart(chat_data: List[Dict]) -> str:
message_categories = [msg.get("category", "General") for msg in chat_data]
# Rule 1: If last message is Leads, prioritize it (recency)
if message_categories[-1] == "Leads":
return "Leads"
# Rule 2: If Leads appears in latter half of conversation
leads_count = message_categories.count("Leads")
if leads_count > 0 and len(message_categories) > 1:
latter_half = message_categories[len(message_categories)//2:]
if "Leads" in latter_half:
return "Leads"
# Rule 3: Standard priority order
priority_order = ["Leads", "Customer Service", "Job Enquiry", "General"]
for category in priority_order:
if category in message_categories:
return category
return "General"
Why This Works:
- Recency bias (last message matters most)
- Intent escalation (journey from General → Leads)
- Priority-based fallback
Cost: FREE (no LLM calls)
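The priority rules above can be exercised directly. The function is reproduced here for self-containment, with one added guard for empty input (the version shown above would raise `IndexError` on an empty session):

```python
from typing import Dict, List

def determine_session_category_smart(chat_data: List[Dict]) -> str:
    # Guard added for this example: empty sessions default to "General"
    if not chat_data:
        return "General"
    message_categories = [msg.get("category", "General") for msg in chat_data]
    # Rule 1: recency - a final "Leads" message wins outright
    if message_categories[-1] == "Leads":
        return "Leads"
    # Rule 2: "Leads" anywhere in the latter half of the conversation
    if "Leads" in message_categories and len(message_categories) > 1:
        if "Leads" in message_categories[len(message_categories) // 2:]:
            return "Leads"
    # Rule 3: fixed priority order fallback
    for category in ["Leads", "Customer Service", "Job Enquiry", "General"]:
        if category in message_categories:
            return category
    return "General"

# A visitor who warms up from small talk to buying intent is a lead,
# even though the session ends on a support question (Rule 2).
session = [{"category": c} for c in
           ["General", "General", "Leads", "Customer Service"]]
print(determine_session_category_smart(session))  # Leads

# A pure support session falls through to the priority order (Rule 3).
support = [{"category": c} for c in ["Customer Service", "General"]]
print(determine_session_category_smart(support))  # Customer Service
```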
Chat History Management¶
POST /v2/save-chat-history-smart¶
Purpose: Save chat with smart classification (production-optimized)
Request:
POST /v2/save-chat-history-smart
Content-Type: application/x-www-form-urlencoded
user_id=User-123
&project_id=User-123_Project_1
&session_id=session-abc123
&input_prompt=Can I schedule a demo?
&output_response=Absolutely! I can help you schedule...
&originated_url=https://example.com/pricing
Flow:
- Timezone Conversion:
local_tz = pytz.timezone("Asia/Kolkata")
current_time = datetime.utcnow().replace(tzinfo=pytz.utc).astimezone(local_tz)
- Token Counting:
input_tokens = count_tokens(input_prompt)
output_tokens = count_tokens(output_response)
total_tokens = input_tokens + output_tokens
- Build Session Context (Last 3 Messages):
existing_session = history_collection.find_one({
"user_id": user_id,
"project_id": project_id,
"session_id": session_id
})
if existing_session:
recent_chats = existing_session["chat_data"][-3:]
session_context = " ".join([
f"User: {chat['input_prompt'][:100]} Bot: {chat['output_response'][:100]}"
for chat in recent_chats
])
- Classify Message (GPT-3.5): calls `classify_prompt(input_prompt, output_response, session_context)` (see Chat Classification System)
- Create Chat Entry:
chat_entry = {
"input_prompt": input_prompt,
"output_response": output_response,
"timestamp": current_time.strftime("%Y-%m-%d %H:%M:%S"),
"input_tokens": input_tokens,
"output_tokens": output_tokens,
"total_tokens": total_tokens,
"category": message_category
}
- Update or Create Session:
if existing_session:
history_collection.update_one(
{"user_id": user_id, "project_id": project_id, "session_id": session_id},
{
"$push": {"chat_data": chat_entry},
"$set": {"datetime": current_time},
"$inc": {"session_total_tokens": total_tokens}
}
)
else:
new_session = {
"user_id": user_id,
"project_id": project_id,
"session_id": session_id,
"originated_url": originated_url,
"datetime": current_time,
"session_total_tokens": total_tokens,
"chat_data": [chat_entry],
}
history_collection.insert_one(new_session)
- Smart Session Categorization:
updated_session = history_collection.find_one(...)
all_messages = updated_session.get("chat_data", [])
session_category = determine_session_category_smart(all_messages)
history_collection.update_one(
{...},
{"$set": {"session_category": session_category}}
)
Response:
{
"message": "Chat history saved successfully",
"message_category": "Leads",
"session_category": "Leads"
}
Key Difference from Standard Endpoint: Uses smart priority rules for session category instead of relying solely on GPT classification
POST /v2/save-chat-history¶
Purpose: Save chat with full GPT classification (analytics-optimized)
Difference from Smart:
- Classification: Per-message GPT + session category = latest message category
- No smart priority rules
- Higher cost (GPT call per message)
Session Category Logic:
# Simple: Session category = latest message category
history_collection.update_one(
{...},
{"$set": {"session_category": category}} # Just use latest
)
GET /v2/get-chat-history¶
Purpose: Get single session chat history
Request:
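A representative request (query-parameter names inferred from the response fields; treat as illustrative):

```
GET /v2/get-chat-history?user_id=User-123&project_id=User-123_Project_1&session_id=session-abc123
```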
Response:
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"session_id": "session-abc123",
"originated_url": "https://example.com/pricing",
"categories": ["General", "Leads"], // Unique categories from all messages
"utm_data": {
"utm_source": "google",
"utm_medium": "cpc"
},
"chat_data": [
{
"input_prompt": "What are your pricing plans?",
"output_response": "We offer three plans...",
"timestamp": "2024-01-15 14:30:45",
"input_tokens": 6,
"output_tokens": 45,
"total_tokens": 51,
"category": "Leads"
}
]
}
Category Extraction:
chat_data = session.get("chat_data", [])
categories = list({entry.get("category", "General") for entry in chat_data})
Fallback Logic:
- First try: Parse categories
- Fallback: Return raw session data
GET /v2/get-chat-history/{project_id}¶
Purpose: Get all chat sessions for a project
Request:
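A representative request, matching the fetch call shown under Frontend Dashboard Integration:

```
GET /v2/get-chat-history/User-123_Project_1?user_id=User-123
```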
Response:
{
"chat_history": [
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"session_id": "session-abc123",
"originated_url": "https://example.com/pricing",
"datetime": "2024-01-15 14:30:45",
"session_total_tokens": 1250,
"session_category": "Leads",
"categories": ["General", "Leads"],
"month": "January", // Extracted from timestamp
"chat_data": [...]
},
{
"session_id": "session-xyz789",
"session_category": "Customer Service",
"categories": ["Customer Service"],
"month": "January",
"chat_data": [...]
}
]
}
Month Extraction Logic:
timestamp = chat.get("timestamp") or chat.get("created_at")
# Fallback to first message timestamp
if not timestamp and chat_data:
timestamp = chat_data[0].get("timestamp")
# Parse various timestamp formats
if isinstance(timestamp, str):
timestamp = datetime.strptime(timestamp, "%Y-%m-%d %H:%M:%S")
elif isinstance(timestamp, ObjectId):
timestamp = timestamp.generation_time
elif not isinstance(timestamp, datetime):
timestamp = datetime.fromtimestamp(timestamp)
chat["month"] = timestamp.strftime("%B") # "January"
DELETE /delete-chatbot-history/{project_id}¶
Purpose: Delete all chat history for a project
Request:
Response:
Implementation:
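A representative request (the `user_id` query parameter is assumed, following the pattern of the other project-scoped endpoints):

```
DELETE /delete-chatbot-history/User-123_Project_1?user_id=User-123
```

The implementation is presumably a single `history_collection.delete_many({"project_id": project_id, "user_id": user_id})`, returning the deleted-session count in the response.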
Token Counting & Cost Calculation¶
count_tokens() Helper Function¶
tokenizer = tiktoken.get_encoding("cl100k_base")
def count_tokens(text):
return len(tokenizer.encode(text))
Encoding: cl100k_base (the tokenizer used by GPT-3.5-Turbo and GPT-4)
Example:
count_tokens("What are your pricing plans?") # → 6 tokens
count_tokens("We offer three pricing tiers: Basic ($10/month), Pro ($25/month), and Enterprise (custom pricing).") # → 23 tokens
GET /v2/calculate-total-tokens/{project_id}¶
Purpose: Calculate total tokens and cost for a project
Request:
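A representative request, matching the fetch call shown under Frontend Dashboard Integration:

```
GET /v2/calculate-total-tokens/User-123_Project_1?user_id=User-123
```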
Implementation:
TOKEN_COST_RS = 0.144 # Per token in Indian Rupees
query = {"project_id": project_id, "user_id": user_id}
chat_sessions = list(history_collection.find(query, {"_id": 0, "session_total_tokens": 1}))
total_tokens = sum(session.get("session_total_tokens", 0) for session in chat_sessions)
total_cost_rs = total_tokens * TOKEN_COST_RS
Response:
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"total_tokens": 12500,
"total_cost_inr": 1800.0
}
Usage Example:
- 12,500 tokens
- Cost: 12,500 × ₹0.144 = ₹1,800.00
GET /v2/calculate-user-total-tokens¶
Purpose: Calculate total tokens and cost across ALL user projects
Request:
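A representative request (the `user_id` query parameter is assumed from the aggregation match stage below):

```
GET /v2/calculate-user-total-tokens?user_id=User-123
```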
MongoDB Aggregation Pipeline:
pipeline = [
{"$match": {"user_id": user_id}},
{"$group": {
"_id": "$project_id",
"total_tokens": {"$sum": "$session_total_tokens"}
}}
]
results = list(history_collection.aggregate(pipeline))
Response:
{
"user_id": "User-123",
"project_summary": [
{
"project_id": "User-123_Project_1",
"total_tokens": 12500,
"total_cost_inr": 1800.0
},
{
"project_id": "User-123_Project_2",
"total_tokens": 8300,
"total_cost_inr": 1195.2
}
],
"total_cost_inr": 2995.2
}
Calculation:
for record in results:
tokens = record["total_tokens"]
cost = round(tokens * TOKEN_COST_RS, 4)
total_amount += cost
Access Sharing System¶
POST /send-access/¶
Purpose: Share chatbot access with another user via email
Request:
POST /send-access/
Content-Type: application/x-www-form-urlencoded
user_id=User-123
&project_id=User-123_Project_1
&email_id=colleague@example.com
&content=I'd%20like%20to%20share%20my%20chatbot%20with%20you
&share_access=chatbot_viewer
Parameters:
- `user_id` (Form) - Owner user ID
- `project_id` (Form) - Project to share
- `email_id` (Form) - Recipient email
- `content` (Form) - Email body message
- `share_access` (Form) - Permission level: `chatbot_viewer` | `chatbot_editor` | `chatbot_analytics`
Flow:
1. Validate Email Exists¶
user_record = users_collection.find_one({"email": email_id})
if not user_record:
return JSONResponse(
content={"message": "User needs to create Machine Agents account."},
status_code=404
)
Why? Only registered users can receive access
2. Check Duplicate Sharing¶
duplicate_query = {
"user_id": user_id,
"project_id": project_id,
"shared_with_email": email_id,
"share_access": share_access
}
existing_access = share_collection.find_one(duplicate_query)
if existing_access:
return JSONResponse(
content={"message": "Access already shared with this email and permission."},
status_code=409
)
3. Send Email via Azure Communication Services¶
EMAIL_ENDPOINT = "https://mailing-sevice.india.communication.azure.com/"
EMAIL_ACCESS_KEY = "CgdEWi6fBCJv4c0EHOkq6ZUSML0VQSG49qXgEfrtlLmXY76HV7DeJQQJ99BBACULyCplbknJAAAAAZCSbc3T"
SENDER_EMAIL = "DoNotReply@machineagents.ai"
email_client = EmailClient(endpoint=EMAIL_ENDPOINT, credential=EMAIL_ACCESS_KEY)
message = {
"senderAddress": SENDER_EMAIL,
"recipients": {"to": [{"address": email_id}]},
"content": {
"subject": "Message from MachineAgents",
"plainText": content
}
}
email_client.begin_send(message)
4. Save Access Record¶
access_data = {
"user_id": user_id,
"project_id": project_id,
"shared_with_email": email_id,
"share_access": share_access,
"shared_at": datetime.utcnow()
}
share_collection.insert_one(access_data)
Response:
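A representative success payload (the exact message text is an assumption):

```
{
  "message": "Access shared and email sent successfully"
}
```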
GET /get-shared-access¶
Purpose: Get list of all shared accesses for a user
Request:
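A representative request (the `user_id` query parameter is assumed from the response shape):

```
GET /get-shared-access?user_id=User-123
```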
Response:
{
"user_id": "User-123",
"shared_access": [
{
"project_id": "User-123_Project_1",
"shared_with_email": "colleague@example.com",
"share_access": "chatbot_viewer",
"shared_at": "2024-01-15T09:00:00.000Z"
},
{
"project_id": "User-123_Project_2",
"shared_with_email": "analyst@example.com",
"share_access": "chatbot_analytics",
"shared_at": "2024-01-16T10:30:00.000Z"
}
]
}
GET /get-shared-access-by-project¶
Purpose: Get shared access for specific project
Request:
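A representative request (query-parameter names assumed from the response shape):

```
GET /get-shared-access-by-project?user_id=User-123&project_id=User-123_Project_1
```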
Response:
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"shared_access": [
{
"project_id": "User-123_Project_1",
"shared_with_email": "colleague@example.com",
"share_access": "chatbot_viewer",
"shared_at": "2024-01-15T09:00:00.000Z"
}
]
}
DELETE /remove-shared-access¶
Purpose: Revoke shared access
Request:
DELETE /remove-shared-access?user_id=User-123&project_id=User-123_Project_1&shared_with_email=colleague@example.com
Implementation:
query = {
"user_id": user_id,
"project_id": project_id,
"shared_with_email": shared_with_email
}
result = share_collection.delete_one(query)
Response:
{
"message": "Shared access removed successfully",
"user_id": "User-123",
"project_id": "User-123_Project_1",
"shared_with_email": "colleague@example.com"
}
Lead Form Management¶
POST /v2/save-data-f¶
Purpose: Save/update form configuration with TTS + Lip-sync generation
Request:
POST /v2/save-data-f
Content-Type: application/json
{
"user_id": "User-123",
"project_id": "User-123_Project_1",
"prompt_number": 5,
"message_display": "Need help? Fill out the form!",
"trigger": true,
"bfName": true,
"bfNumber": true,
"bfemail": true,
"bfwebsite": false,
"bfcompany": false,
"reqName": true,
"reqNumber": false,
"reqemail": true,
"reqwebsite": false,
"reqcompany": false,
"fsubemail": "sales@example.com",
"voice": "Female_1"
}
Pydantic Model:
class SaveDataRequestf(BaseModel):
user_id: str
project_id: str
prompt_number: int
message_display: str = "Need a more tailored answer? Fill out the form..."
trigger: bool
bfName: bool # Show name field
bfNumber: bool # Show phone field
bfemail: bool # Show email field
bfwebsite: bool
bfcompany: bool
reqName: bool = False # Name required
reqNumber: bool = False
reqemail: bool = False
reqwebsite: bool = False
reqcompany: bool = False
fsubemail: str # Form submission recipient
voice: str # Voice for TTS
Flow:
1. Upsert Configuration¶
query = {
"user_id": data.user_id.strip(),
"project_id": data.project_id.strip()
}
new_save_data = {"$set": data.model_dump()}
updated_doc = lead_collection.find_one_and_update(
query,
new_save_data,
upsert=True,
return_document=ReturnDocument.AFTER
)
inserted_id_str = str(updated_doc["_id"])
2. Generate TTS¶
message_to_speak = data.message_display
selected_voice_azure_name = SUPPORTED_VOICES.get(data.voice)
# Default fallback
if not selected_voice_azure_name:
selected_voice_azure_name = "en-US-JennyNeural"
tts_wav_file = await text_to_speech_azure(message_to_speak, selected_voice_azure_name, inserted_id_str)
3. Convert to PCM¶
pcm_wav_file = os.path.join(OUTPUT_DIR, f"{inserted_id_str}_pcm.wav")
converted_file = await convert_wav_to_pcm_async(tts_wav_file, pcm_wav_file)
4. Generate Lip-Sync¶
json_file = await generate_lip_sync_async(converted_file, inserted_id_str)
lipsync_data = parse_lip_sync(json_file, tts_wav_file)
5. Encode Audio to Base64¶
with open(tts_wav_file, "rb") as audio_file:
audio_base64 = base64.b64encode(audio_file.read()).decode("utf-8")
6. Cleanup Temp Files¶
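The cleanup step presumably removes the intermediate WAV/JSON files once the audio has been base64-encoded. A best-effort sketch, assuming the file names follow the `{inserted_id_str}` pattern used in the steps above:

```python
import os
import tempfile

def cleanup_temp_files(output_dir: str, doc_id: str) -> None:
    """Best-effort removal of TTS/lip-sync intermediates for one document."""
    for name in (f"{doc_id}.wav", f"{doc_id}_pcm.wav", f"{doc_id}.json"):
        path = os.path.join(output_dir, name)
        try:
            os.remove(path)
        except FileNotFoundError:
            pass  # already gone; nothing to do

# Example: create dummy intermediates, then clean them up
with tempfile.TemporaryDirectory() as tmp:
    for name in ("abc.wav", "abc_pcm.wav", "abc.json"):
        open(os.path.join(tmp, name), "wb").close()
    cleanup_temp_files(tmp, "abc")
    print(os.listdir(tmp))  # []
```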
Response:
{
"$set": {
"user_id": "User-123",
"project_id": "User-123_Project_1",
"prompt_number": 5,
...
},
"id": "507f1f77bcf86cd799439011",
"generated_audio_base64": "UklGRiQAAABXQVZF...",
"generated_lipsync": {
"metadata": {
"soundFile": "507f1f77bcf86cd799439011.wav",
"duration": 2.5
},
"mouthCues": [...]
}
}
GET /v2/get-data-for-lead¶
Purpose: Retrieve form configuration (alias for /v2/get-saved-data-f)
Request:
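A representative request, matching the fetch call shown under Lead Form Flow:

```
GET /v2/get-data-for-lead?user_id=User-123&project_id=User-123_Project_1
```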
Response: Returns saved form configuration with TTS/lip-sync data
POST /v2/add-lead¶
Purpose: Submit lead form data from end-user
Request:
POST /v2/add-lead
Content-Type: application/x-www-form-urlencoded
name=John Doe
&email=john@example.com
&phone=+1234567890
&session_id=session-abc123
&user_id=User-123
&project_id=User-123_Project_1
&company=Acme Inc
&website=https://acme.com
All Fields Optional (submitted fields determined by form configuration)
Insert Logic:
new_lead = {
"name": name,
"email": email,
"phone": phone,
"session_id": session_id,
"user_id": user_id,
"project_id": project_id,
"company": company,
"website": website,
"created_at": datetime.utcnow().isoformat()
}
# Remove None values
new_lead = {k: v for k, v in new_lead.items() if v is not None}
result = lead_collection.insert_one(new_lead)
Response:
{
"id": "507f1f77bcf86cd799439012",
"name": "John Doe",
"email": "john@example.com",
"phone": "+1234567890",
"session_id": "session-abc123",
"user_id": "User-123",
"project_id": "User-123_Project_1",
"company": "Acme Inc",
"website": "https://acme.com",
"created_at": "2024-01-15T09:30:00.000Z"
}
GET /v2/fetch-lead-data¶
Purpose: Fetch submitted lead data (with optional session filter)
Request:
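A representative request, following the query-building logic below (`session_id` is optional):

```
GET /v2/fetch-lead-data?user_id=User-123&project_id=User-123_Project_1&session_id=session-abc123
```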
Query Building:
query = {"user_id": user_id, "project_id": project_id}
if session_id:
query["session_id"] = session_id # Optional filter
records = lead_collection.find(query)
Response:
[
{
"id": "507f1f77bcf86cd799439012",
"name": "John Doe",
"email": "john@example.com",
"phone": "+1234567890",
"company": "Acme Inc",
"website": "https://acme.com",
"created_at": "2024-01-15T09:30:00.000Z"
},
{
"id": "507f1f77bcf86cd799439013",
"name": "Jane Smith",
"email": "jane@example.com",
"phone": "+9876543210",
"company": "Tech Co",
"website": "",
"created_at": "2024-01-16T11:45:00.000Z"
}
]
TTS & Lip-Sync Integration¶
SUPPORTED_VOICES Dictionary¶
SUPPORTED_VOICES = {
"Male_1": "en-US-EricNeural",
"Male_2": "en-US-GuyNeural",
"Male_3": "en-CA-LiamNeural",
"Male_IND": "en-IN-PrabhatNeural",
"Female_1": "en-US-AvaMultilingualNeural",
"Female_2": "en-US-JennyNeural",
"Female_3": "en-US-EmmaMultilingualNeural",
"Female_4": "en-AU-NatashaNeural",
"Female_IND": "en-IN-NeerjaExpressiveNeural",
"Female_IND2": "en-IN-NeerjaNeural",
}
10 Azure Neural voices (same as Response 3D/Voice services)
text_to_speech_azure() Function¶
Purpose: Convert form message to speech
async def text_to_speech_azure(text: str, voice_name: str, session_id: str) -> str:
wav_file = os.path.join(OUTPUT_DIR, f"{session_id}.wav")
speech_config = speechsdk.SpeechConfig(
subscription=AZURE_SPEECH_SUBSCRIPTION_KEY,
region=AZURE_SPEECH_REGION
)
speech_config.speech_synthesis_voice_name = voice_name
audio_config = speechsdk.audio.AudioOutputConfig(filename=wav_file)
synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=audio_config)
# Async wrapper for blocking call
result = await asyncio.to_thread(synthesizer.speak_text_async(text).get)
if result.reason != speechsdk.ResultReason.SynthesizingAudioCompleted:
raise Exception(f"Speech synthesis failed: {result.cancellation_details.reason}")
return wav_file
Error Handling: Creates dummy WAV file if TTS fails to prevent downstream errors
convert_wav_to_pcm_async() Function¶
FFmpeg Command:
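Given the parameters listed below, the invocation is presumably of this shape (`-y` overwrite flag and placeholder names are assumptions):

```
ffmpeg -y -i <tts_wav_file> -acodec pcm_s16le -ar 44100 -ac 1 <pcm_wav_file>
```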
Parameters:
-acodec pcm_s16le- PCM signed 16-bit little-endian-ar 44100- Sample rate: 44.1kHz-ac 1- Audio channels: 1 (mono)
generate_lip_sync_async() Function¶
Rhubarb Command:
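The invocation is presumably of this shape (placeholder names are illustrative; `-f json` selects Rhubarb's JSON export format):

```
rhubarb -f json -o <output_json> <pcm_wav_file>
```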
Cross-Platform Executable Detection:
system = platform.system().lower()
if system == "windows":
rhubarbExePath = "Rhubarb-Lip-Sync-1.13.0-Windows/rhubarb.exe"
elif system == "darwin":
rhubarbExePath = "Rhubarb-Lip-Sync-1.13.0-macOS/rhubarb"
elif system == "linux":
rhubarbExePath = "Rhubarb/rhubarb"
Validation:
if not rhubarbExePath or not os.path.exists(rhubarbExePath):
logger.warning(f"Rhubarb executable not found for {system}")
rhubarbExePath = None # Disable lip-sync
parse_lip_sync() Function¶
Purpose: Add sound file reference to Rhubarb JSON
def parse_lip_sync(json_file: str, sound_file: str) -> Dict:
with open(json_file, "r") as file:
data = json.load(file)
if "metadata" not in data:
data["metadata"] = {}
data["metadata"]["soundFile"] = os.path.basename(sound_file)
return data
Security Analysis¶
🔴 CRITICAL: Multiple Hardcoded API Keys¶
1. Azure OpenAI Key (Line 80):
subscription_key = os.getenv("AZURE_OPENAI_API_KEY", "AZxDVMYB08AaUip0i5ed1sy73ZpUsqencYYxKDbm6nfWfG1AqPZ3JQQJ99BCACYeBjFXJ3w3AAABACOGVUo7")
2. Azure TTS Key (Line 841):
AZURE_SPEECH_SUBSCRIPTION_KEY = os.getenv("AZURE_SPEECH_SUBSCRIPTION_KEY", "9N41NOfDyVDoduiD4EjlzmZU9CbUX3pPqWfLCORpl7cBf0l2lzVQJQQJ99BCACGhslBXJ3w3AAAYACOG2329")
3. Azure Email Service Key (Line 576):
EMAIL_ACCESS_KEY = "CgdEWi6fBCJv4c0EHOkq6ZUSML0VQSG49qXgEfrtlLmXY76HV7DeJQQJ99BBACULyCplbknJAAAAAZCSbc3T"
Risk: Full access to Azure OpenAI, Azure TTS, and Azure Communication Services
Impact:
- Unauthorized LLM usage (expensive)
- Email spoofing from official domain
- TTS generation abuse
- High billing risk
Fix:
subscription_key = os.getenv("AZURE_OPENAI_API_KEY")
if not subscription_key:
raise RuntimeError("AZURE_OPENAI_API_KEY environment variable not set")
🟠 SECURITY: Overly Permissive CORS¶
Lines 37-43:
app.add_middleware(
CORSMiddleware,
allow_origins=["*"],
allow_credentials=True,
allow_methods=["*"],
allow_headers=["*"],
)
Risk: Any website can call this API
Fix:
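A sketch of a tightened configuration; the origin list is illustrative and should be replaced with the real dashboard and widget domains:

```python
app.add_middleware(
    CORSMiddleware,
    allow_origins=[
        "https://app.machineagents.ai",  # illustrative - use the actual frontend origin(s)
    ],
    allow_credentials=True,
    allow_methods=["GET", "POST", "DELETE"],
    allow_headers=["Authorization", "Content-Type"],
)
```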
🟡 DATA PRIVACY: No Email Validation¶
Lines 640-647:
user_record = users_collection.find_one({"email": email_id})
if not user_record:
return JSONResponse(
content={"message": "User needs to create Machine Agents account."},
status_code=404
)
Issue: Reveals whether email exists in system (information disclosure)
Better Response:
return JSONResponse(
content={"message": "If this email is registered, they will receive an invitation."},
status_code=200
)
🟡 COST CONTROL: No Rate Limiting on Classification¶
Line 148: Every chat message triggers GPT-3.5 call
Cost Per Month (Example):
- 10,000 messages/day
- ~20 tokens per classification
- 10,000 × 20 = 200,000 tokens/day
- 200,000 × 30 = 6 million tokens/month
- 6M × ₹0.144 = ₹864,000/month just for classification
Solution: Use smart classification endpoint by default
🟢 GOOD PRACTICE: TTS Error Fallback¶
Lines 891-896:
# Create dummy WAV file if TTS fails
try:
with open(wav_file, 'wb') as f:
f.write(b'RIFF...WAVEfmt ...') # Minimal valid WAV header
logger.info(f"Created dummy WAV file due to TTS error")
except Exception as dummy_e:
logger.error(f"Failed to create dummy WAV file")
Benefit: Prevents downstream errors in lip-sync pipeline
🟢 GOOD PRACTICE: Async for Blocking Calls¶
Lines 882, 904, 928:
result = await asyncio.to_thread(synthesizer.speak_text_async(text).get)
result = await asyncio.to_thread(subprocess.run, [...])
Benefit: Prevents blocking event loop during TTS/FFmpeg/Rhubarb execution
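The pattern generalizes to any blocking call. A self-contained sketch with a stand-in for the TTS/FFmpeg work (the function name is hypothetical):

```python
import asyncio
import time


def blocking_synthesis(text: str) -> str:
    # Stand-in for a blocking TTS or FFmpeg invocation.
    time.sleep(0.1)
    return f"audio for: {text}"


async def handler() -> str:
    # asyncio.to_thread runs the blocking call in a worker thread,
    # so the event loop keeps serving other requests meanwhile.
    return await asyncio.to_thread(blocking_synthesis, "hello")


result = asyncio.run(handler())
```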
Integration Points¶
1. Response Chatbot Services Integration¶
Chat History Saving:
Response services (3D/Text/Voice) call this service after each message:
# In Response 3D Chatbot Service
response = requests.post(
f"{CHAT_HISTORY_SERVICE_URL}/v2/save-chat-history-smart",
data={
"user_id": user_id,
"project_id": project_id,
"session_id": session_id,
"input_prompt": question,
"output_response": answer,
"originated_url": originating_url
}
)
Classification Used For:
- Analytics dashboard (Leads vs General)
- Form triggering logic (if session_category == "Leads")
- User behavior insights
2. Frontend Dashboard Integration¶
Chat History Display:
// Fetch all sessions for project
const response = await fetch(
`/v2/get-chat-history/${projectId}?user_id=${userId}`
);
const { chat_history } = await response.json();
// Group by month
const groupedByMonth = chat_history.reduce((acc, session) => {
const month = session.month;
if (!acc[month]) acc[month] = [];
acc[month].push(session);
return acc;
}, {});
Token Usage Display:
// Get token cost
const response = await fetch(
`/v2/calculate-total-tokens/${projectId}?user_id=${userId}`
);
const { total_tokens, total_cost_inr } = await response.json();
// Display: "You've used 12,500 tokens (₹1,800.00)"
3. Lead Form Flow¶
Frontend Integration:
// 1. Get form configuration
const formConfig = await fetch(
`/v2/get-data-for-lead?user_id=${userId}&project_id=${projectId}`
).then((r) => r.json());
// 2. Check if form should trigger
if (messageCount >= formConfig.prompt_number && formConfig.trigger) {
// 3. Display form with TTS message
const audio = new Audio(
`data:audio/wav;base64,${formConfig.generated_audio_base64}`
);
audio.play();
// 4. Show lip-sync animation
animateLipSync(formConfig.generated_lipsync);
// 5. Show form fields based on configuration
if (formConfig.bfName) showField("name", formConfig.reqName);
if (formConfig.bfemail) showField("email", formConfig.reqemail);
// ...
}
// 6. Submit lead data
await fetch("/v2/add-lead", {
method: "POST",
body: new FormData(form),
});
4. Access Sharing Flow¶
Workflow:
- Owner shares via /send-access/
- Recipient receives email
- Recipient logs in to dashboard
- Frontend fetches shared chatbots
- Recipient sees shared projects based on share_access permission level
5. Analytics Service Integration¶
Category-Based Analytics:
# Get all chat sessions
sessions = requests.get(f"{CHAT_HISTORY_SERVICE_URL}/v2/get-chat-history/{project_id}?user_id={user_id}").json()
# Aggregate by category
leads_count = sum(1 for s in sessions if s['session_category'] == 'Leads')
service_count = sum(1 for s in sessions if s['session_category'] == 'Customer Service')
# Calculate conversion rate
conversion_rate = leads_count / len(sessions) * 100
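The division above raises ZeroDivisionError for a project with no sessions. A defensive version of the aggregation, assuming the `session_category` field name from the snippet:

```python
from collections import Counter


def category_stats(sessions: list[dict]) -> dict:
    """Aggregate session categories and compute a safe conversion rate."""
    counts = Counter(s.get("session_category", "General") for s in sessions)
    total = len(sessions)
    leads = counts.get("Leads", 0)
    return {
        "leads": leads,
        "customer_service": counts.get("Customer Service", 0),
        # Guard against empty projects instead of dividing by zero.
        "conversion_rate": (leads / total * 100) if total else 0.0,
    }
```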
Summary¶
Service Statistics¶
- Total Lines: 1,265
- Total Endpoints: 15
- Total Collections: 4 (MongoDB)
- LLM Integration: Azure OpenAI GPT-3.5-Turbo-16k
- Email Integration: Azure Communication Services
- TTS Integration: Azure Cognitive Services Speech
- Security Issues: 3 critical (hardcoded keys)
Key Capabilities¶
- ✅ Dual Classification System - GPT-3.5 (expensive) vs Smart Rules (free)
- ✅ Token-Based Cost Tracking - Per-message → Per-project → Per-user
- ✅ Email-Based Access Sharing - 3 permission levels with Azure emails
- ✅ Dynamic Lead Forms - TTS + Lip-sync for form messages
- ✅ IST Timezone Support - All timestamps in Asia/Kolkata
Critical Fixes Needed¶
- 🔴 Externalize ALL API keys (OpenAI, TTS, Email)
- 🟠 Restrict CORS to known origins
- 🟡 Add rate limiting for classification endpoint
- 🟡 Fix email validation information disclosure
- 🟡 Default to smart classification (cost optimization)
Deployment Notes¶
Docker Compose (Port 8014):
chathistory-service:
build: ./chathistory-service
container_name: chathistory-service
ports:
- "8014:8014"
environment:
- MONGO_URI=...
- AZURE_OPENAI_API_KEY=***
- AZURE_SPEECH_SUBSCRIPTION_KEY=***
Dependencies:
- Rhubarb Lip-Sync executables (platform-specific)
- FFmpeg installed on system PATH
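Since FFmpeg and the Rhubarb binaries are system-level dependencies, they must be baked into the image. A Dockerfile sketch only; the base image, paths, and `main:app` entrypoint are assumptions, not the service's actual Dockerfile:

```dockerfile
# Illustrative sketch -- base image, paths, and entrypoint are assumed.
FROM python:3.11-slim

# FFmpeg must be on PATH for the TTS/lip-sync pipeline.
RUN apt-get update && apt-get install -y --no-install-recommends ffmpeg \
    && rm -rf /var/lib/apt/lists/*

WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Rhubarb Lip-Sync ships platform-specific executables; copy the
# appropriate release into the image alongside the application code.
COPY . .

EXPOSE 8014
CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "8014"]
```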
Documentation Complete: Chat History Service (Port 8014)
Status: COMPREHENSIVE, DEVELOPER-GRADE, INVESTOR-GRADE, AUDIT-READY ✅