MAdAiLab | Resources

HiteshHitesh Balegar

Building an AI-Powered Tax Sensitization System: Automating Compliance with AI & FastAPI

TL;DR Built an AI-powered tax classification system in 6 hours using FastAPI, OpenAI GPT-4o-mini, and Google Sheets. The system automatically classifies transactions, validates tax rates, detects anomalies, and sends summary emails to users. Key features: OAuth authentication, async workflows, cost-optimized AI usage, and resilient batch processing. Can process 100+ transactions per request and reduces manual review work by 70%. Built as independent volunteer work with Prof. Rohit Aggarwal, who provided conceptual guidance and project structure. Main challenges solved: OAuth in WSL/headless environments, OpenAI API rate limits, and async data flow management. The framework utilizes standard service abstraction patterns, which naturally enable the use of different AI providers as needed. Introduction: The Problem with Non-Tax-Sensitized Systems Modern enterprises, even Fortune 500 firms, still operate on financial systems that were never designed with tax in mind. These systems accurately record transactions, invoices, and journal entries, but they often lack the necessary granularity, context, and automation to apply tax rules accurately and consistently. The result is a heavy reliance on manual reconciliation, spreadsheet adjustments, and judgment calls from overworked tax teams—all of which increase risk, slow reporting, and leave value on the table. The Tax Sensitization Assistant project is motivated by a simple truth: organizations can't afford to keep treating tax compliance as a reactive clean-up job. If the systems feeding tax data aren't fully tax-sensitized, the CFO and tax department spend more time fixing data than interpreting it. That's wasted effort and missed insight. By leveraging AI-driven classification and enrichment, this project aims to automate the painful middle ground between raw transaction data and tax-ready information. The system continuously monitors and enhances transactional records, flags inconsistencies, and fills in missing tax context (such as jurisdiction, tax code, rate, and deductibility), all without requiring a full ERP overhaul. In this article, I'll guide you through the process of building a comprehensive AI-powered tax classification system in just six hours, utilizing FastAPI, OpenAI, and Google Sheets. The system can process 100+ transactions per request and reduces manual review work by up to 70%. More importantly, I'll show you the architecture decisions, challenges faced, and solutions that make this a production-ready system. Who This Is For This article is for: Python developers looking to build AI-powered automation systems Students and developers interested in FastAPI, AI API integration, and Google APIs Tax professionals curious about AI automation possibilities Anyone building production-ready APIs with authentication and scheduling Prerequisites: Basic Python knowledge Familiarity with REST APIs Understanding of async/await concepts (helpful but not required) What You'll Build: A complete FastAPI application AI service integration (demonstrated with OpenAI) Google OAuth 2.0 authentication Google Sheets read/write operations Scheduled and manual processing workflows Email notifications via Gmail API Real-World Applications: Tax compliance automation Document classification systems Data enrichment pipelines Any workflow requiring AI classification + validation The Problem: Why Tax Automation Matters The Real-World Challenge Fortune 500 companies still use financial systems not designed for tax. These systems produce: Missing tax codes (90% of transactions in typical datasets) Zero tax amounts (60% of transactions) Inconsistent data formats No tax context or jurisdiction information The result? Tax teams spend 70% of their time fixing data instead of analyzing it. Manual reconciliation is time-consuming, error-prone, and doesn't scale. The Solution Approach Instead of replacing entire ERP systems (costly and disruptive), we layer intelligent automation on top of existing systems: AI Classification : Use AI to infer missing tax attributes from transaction descriptions Validation : Compare AI suggestions against reference tax rates Anomaly Detection : Flag transactions requiring human review Automated Routing : Separate clean transactions from those needing review This approach: Reduces manual review work significantly (up to 70%) Enables real-time visibility into tax anomalies Creates auditable, repeatable processes Improves with each cycle of AI feedback System Architecture Overview High-Level Architecture Manual Trigger / Cron → OAuth Auth → Read Sheets → Preprocess → AI Classify → Validate → Detect Anomalies → Route → Write Sheets → Calculate Summary → Send Email Core Design Principles Modular Structure : Services, processors, and workflows are separated for maintainability Unified Authentication : Single OAuth flow for all Google APIs Resilient Processing: Transaction-level error handling Cost Optimization : GPT-4o-mini for cost-effective AI classification Priority System : Manual trigger first, scheduled second Technology Stack Framework : FastAPI (modern, fast, async-native) AI Service : Abstracted service layer (demonstrated with OpenAI GPT-4o-mini, but works with any provider) Storage: Google Sheets (free, easy, collaborative) Auth : Google OAuth 2.0 (unified for Sheets, Drive, Gmail) Scheduling : APScheduler (cron-based automation) Email : Gmail API (OAuth-authenticated) Why These Choices? FastAPI : Modern, fast, easy to use, built-in async support AI Service Abstraction : Good architecture naturally allows swapping providers—demonstrated with OpenAI, but the framework supports any LLM Google Sheets : Free, accessible, no database setup required GPT-4o-mini : 1/200th the cost of GPT-4, sufficient accuracy for this use case OAuth : Enables Gmail integration, simpler than service accounts Architecture Note : The AI service layer uses a clean abstraction pattern. This naturally allows using different providers (OpenAI, Claude, Gemini, local models) without framework changes. Repository Details: Link to the complete repository: tax-sensitization-assistant Module 1: OAuth Authentication Service The Challenge We need to authenticate with Google Sheets, Drive, and Gmail APIs. The system must operate in various environments (interactive, WSL, headless) and utilize unified credentials (as opposed to separate service accounts). The Solution A single OAuth 2.0 flow with all required scopes, environment detection, and automatic token refresh management. Link to the complete file: oauth_service.py Implementation Details: # Environment detection def is_wsl() -> bool: """Detect if running in WSL environment.""" if os.getenv('WSL_DISTRO_NAME') or os.getenv('WSLENV'): return True try: with open('/proc/version', 'r') as f: return 'microsoft' in f.read().lower() except: return False def is_headless() -> bool: """Detect if running in headless environment.""" if os.getenv('DISPLAY') or os.getenv('SSH_CONNECTION') or os.getenv('CI'): return False return True OAuth Flow Adaptation: Interactive: Automatically opens browser WSL: Provides instructions for wslview or explorer.exe Headless: Manual URL copy/paste with code entry Key Features: Automatic token refresh Credential persistence via token.json Clear user instructions for each environment Error handling and recovery Challenges Solved WSL Environment: Provides instructions for wslview or explorer.exe to open authorization URL Headless Servers: Manual URL copy/paste flow with clear instructions Token Refresh: Automatic handling with error recovery This environment-aware approach ensures the system operates effectively in all deployment scenarios, ranging from local development to production servers. Module 2: Google Sheets Integration Why Google Sheets? For Phase 1 proof of concept, Google Sheets offers: Free and easy to use No database setup required Visual inspection and collaboration Perfect for demonstrating the concept Link to the complete file: sheets_service.py Implementation Read Operations: Raw transactions from "Raw Transactions" sheet Tax reference data from "Tax Reference Data" sheet Write Operations: Clean, validated transactions to "Tax-Ready Output" sheet Flagged transactions to "Needs Review" sheet Data Flow: Read: Raw Transactions → Processed Transactions Read: Tax Reference Data → Validation Dictionary Write: Clean Transactions → Tax-Ready Output Write: Flagged Transactions → Needs Review Key Features: Graceful handling of missing/null fields Batch write operations for efficiency Automatic header creation for output sheets OAuth-authenticated API access The Google Sheets API proved surprisingly easy to use, making it an excellent choice for rapid prototyping. Module 3: AI Classification Engine The Core Functionality The AI service infers missing tax attributes from transaction descriptions: Transaction classification (supplies, services, equipment, etc.) Jurisdiction determination Tax rate suggestion Taxable status Confidence scoring Rationale explanation Architecture Note - Service Abstraction The AI service uses a clean abstraction pattern (standard good practice). This naturally allows using different providers (OpenAI, Claude, Gemini, local models). Not revolutionary—just proper separation of concerns. Demonstrated with OpenAI, but the framework supports any provider that implements the interface. Link to the complete file: ai_service.py Implementation Pattern # Service interface (standard pattern) class AIService: def classify_transaction(self, transaction) -> ClassificationResponse: raise NotImplementedError # OpenAI implementation (one example) class OpenAIService(AIService): def __init__(self): self.client = OpenAI(api_key=settings.openai_api_key) self.model = settings.openai_model def classify_transaction(self, transaction): prompt = self._build_prompt(transaction) response = self.client.chat.completions.create( model=self.model, messages=[ {"role": "system", "content": "You are a tax classification assistant..."}, {"role": "user", "content": prompt} ], response_format={"type": "json_object"}, temperature=0.3 ) # Parse and return ClassificationResponse return self._parse_response(response) # Framework uses dependency injection ai_service = get_ai_service() # Returns configured implementation result = ai_service.classify_transaction(transaction) Prompt Engineering System Prompt: You are a tax classification assistant. Analyze transactions and infer missing tax attributes. Return only valid JSON. User Prompt: Transaction details: - Description: {description} - Amount: ${amount:.2f} - Location: {location} Determine: 1. What type of purchase does this represent 2. Whether it's typically taxable in the given jurisdiction 3. The appropriate tax rate for that jurisdiction 4. Your confidence level in this classification Return JSON only with: classification, jurisdiction, suggested_tax_rate, taxable_status, confidence, rationale Response Format : Structured JSON ensures consistent parsing across all models. Cost Optimization GPT-4o-mini : ~$0.15/$0.60 per 1M tokens (used in this project) Other options: Claude Haiku, Gemini Pro, or local models for different cost/performance tradeoffs The abstraction allows choosing the best model for your needs Confidence Scores 0-1 scale where >0.85 is high confidence Enables downstream routing decisions High confidence → auto-approve, low confidence → review Standard concept that works with any LLM Challenges Solved API Rate Limits : Retry logic with exponential backoff (3 retries) Cost Concerns : Used GPT-4o-mini for cost-effectiveness JSON Parsing Errors : Structured output format with validation Provider Abstraction : Standard interface pattern handles provider differences Module 4: Data Processing Pipeline Preprocessing Before AI classification, transactions are normalized: Location code normalization (uppercase, trimming) Description cleaning Missing field flagging Timestamp addition This ensures consistent inputs for the AI model. Link to the complete file - preprocessor.py Validation AI-suggested tax rates are validated against reference data: Rate comparison with tolerance (0.25% default) Match/mismatch status assignment Handles missing reference data gracefully Why This Approach: Simple but effective (covers 70% of issues) No external API dependencies Fast and free validation Expandable for Phase 2 Link to the complete file - validator.py Anomaly Detection Two high-impact patterns catch most issues: Pattern 1: Missing Tax on Taxable Items If taxable_status = "Taxable" AND tax_amount = 0 Flags as "MissingTaxCollection" Catches 50% of common issues Pattern 2: Rate Mismatch If AI rate doesn't match reference rate Flags as "RateMismatch" Catches 20% of common issues Routing Decision: 0 anomalies → Clean transactions 0 anomalies → Review queue This focused approach covers 70% of issues without over-engineering. Link to the complete file - anomaly_detector.py Module 5: Workflow Orchestration Main Workflow Function The workflow orchestrates all processing steps: async def process_transactions() -> SummaryStats: # 1. Initialize services sheets_service = SheetsService() ai_service = AIService() gmail_service = GmailService() # 2. Read raw transactions raw_transactions = sheets_service.read_raw_transactions() # 3. Read tax reference data tax_reference = sheets_service.read_tax_reference() # 4. Preprocess transactions processed_transactions = preprocess_batch(raw_transactions) # 5. Process each transaction for processed_tx in processed_transactions: # AI Classification classification = ai_service.classify_transaction(processed_tx) # Validation validation_status, rate_match = validate_rate(...) # Anomaly Detection enriched_tx = detect_anomalies(enriched_tx, classification) # Route if enriched_tx.anomaly_count > 0: review_transactions.append(enriched_tx) else: clean_transactions.append(enriched_tx) # 6. Write outputs sheets_service.write_clean_transactions(clean_transactions) sheets_service.write_review_queue(review_transactions) # 7. Calculate summary summary = calculate_summary_stats(...) # 8. Send email gmail_service.send_daily_summary(summary) return summary Error Handling Transaction-Level Resilience: Each transaction is processed in a try-catch block One failure doesn't stop the entire batch Errors logged with transaction ID Failed transactions excluded from output This ensures batch processing continues despite individual failures. State Management Global state tracking enables monitoring: Last run timestamp Processing status (idle, processing, completed, failed) Summary statistics Current job ID Link to the complete file - tax_sensitization.py Module 6: Scheduling & API Layer Scheduler Service APScheduler handles automated processing: Cron-based scheduling (default: daily at 6 AM) Manual trigger support (FIRST PRIORITY) Job ID tracking Graceful shutdown handling Priority System: Manual trigger is FIRST PRIORITY Scheduled cron is SECOND PRIORITY Status tracking prevents confusion API Endpoints FastAPI provides REST API for monitoring and control: @router.post("/api/v1/process/trigger") async def trigger_workflow(): """Manually trigger the tax classification workflow""" job_id = await scheduler_service.trigger_manual() return {"status": "triggered", "job_id": job_id} @router.get("/api/v1/process/status") async def get_status(): """Get current processing status""" state = get_processing_state() return {"status": state["last_status"], ...} @router.get("/api/v1/process/summary") async def get_summary(): """Get last processing summary statistics""" state = get_processing_state() return {"summary": state["last_summary"], ...} FastAPI Features Used: Automatic OpenAPI documentation Pydantic models for validation Async endpoint support CORS middleware Link to the complete module - api Module 7: Email Notifications Gmail API Integration OAuth-authenticated email sending using the same credentials as Sheets: HTML-formatted summary emails Configurable enable/disable flag Professional email delivery Reliable delivery Email Content Daily summary includes: Processing summary statistics Total transactions processed Auto-validated count and percentage Review needed count Anomaly breakdown Action items Why Gmail API: Unified OAuth (no separate email credentials) Professional email delivery HTML formatting support Reliable delivery Link to the complete file - gmail_service.py Key Challenges & Solutions Challenge 1: OAuth in WSL/Headless Environments Problem: Google OAuth flow requires browser, but WSL and headless servers don't have GUI. Solution : Environment detection with appropriate flow for each: WSL: Instructions for wslview or explorer.exe Headless: Manual URL copy/paste with code entry Interactive: Automatic browser opening Impact : System works in all deployment scenarios. Challenge 2: AI API Rate Limits & Costs Problem : API rate limits and high costs with AI providers. Solution: Used GPT-4o-mini (1/200th the cost of GPT-4) Retry logic with exponential backoff Small delays between requests (0.1s) Structured output reduces parsing errors Note: The service abstraction allows switching providers if needed (standard architecture benefit). Impact: Cost-effective processing, handles rate limits gracefully Challenge 3: Async Data Flow Management Problem : Managing asynchronous tasks and data flow efficiently. Solution: Async/await patterns throughout Transaction-level error handling Sequential processing with small delays State management for tracking Impact : Efficient processing, resilient to failures. Challenge 4: Transaction Processing Resilience Problem: One bad transaction shouldn't crash entire batch. Solution : Transaction-level try-catch blocks Continue processing on individual errors Log errors with transaction ID Failed transactions excluded from output Impact : Batch processing continues despite individual failures. Results & Impact What Was Built A complete tax classification system with: Production-ready architecture OAuth authentication AI-powered classification Automated validation and anomaly detection Email notifications Performance Metrics Throughput : Can process 100+ transactions per request Functionality : Successfully classifies and writes results to Google Sheets Efficiency : Reduces manual review work by up to 70% Development Time: Built in approximately 6 hours Technical Achievements Clean AI service abstraction (standard good architecture) Environment-aware OAuth implementation Cost-optimized AI usage (GPT-4o-mini) Resilient batch processing Modular, maintainable architecture What Works Well Clean service abstraction (allows using different AI providers if needed) Google Sheets API ease of use Google OAuth simplicity FastAPI rapid development Future Improvements More robust architecture Enhanced data flow Phase 2 production hardening features Formal accuracy testing with larger datasets Lessons Learned & Best Practices What I Learned How to effectively use AI for classification without full ERP overhauls The importance of environment detection for OAuth Cost optimization strategies for AI APIs Async workflow management in Python Best Practices Applied Modular Architecture: Services, processors, and workflows separated for maintainability Transaction-Level Error Handling: Resilience for batch processing Cost Optimization: Choose cost-effective models from the start Simple but Effective Validation: Reference table lookup catches most errors What Surprised Me Ease of Google Sheets API: Much simpler than expected Simplicity of Google OAuth: Once set up, very straightforward FastAPI Rapid Development: Built entire system in 6 hours GPT-4o-mini Sufficiency: Good accuracy for most cases at fraction of cost What I'd Do Differently More robust architecture from the start Enhanced data flow design Formal testing framework Better error recovery mechanisms Advice for Others Start with Proof of Concept: Phase 1 before production hardening Use Cost-Effective AI Models: GPT-4o-mini or similar when possible Design for Different Environments: OAuth, deployment scenarios Implement Transaction-Level Error Handling: Batch processing resilience Focus on Real Problems: Don't over-engineer, solve actual needs Conclusion & Next Steps Summary We built a complete AI-powered tax classification system in 6 hours using FastAPI, OpenAI GPT-4o-mini, and Google Sheets. The system demonstrates rapid prototyping with modern Python tools, solving real-world challenges (OAuth, async workflows, cost optimization) while creating production-ready architecture. Key Takeaways FastAPI enables rapid API development Clean service abstraction (standard pattern) allows flexibility in AI provider choice Google Sheets is perfect for Phase 1 proof of concepts OAuth can be made environment-aware Async workflows enable efficient batch processing GPT-4o-mini provides cost-effective AI classification The Value Delivered System processes 100+ transactions per request Reduces manual review work by 70% Provides auditable, repeatable processes Enables real-time anomaly detection Next Steps Phase 2 : Production hardening (data quality checks, confidence-based routing, audit trails) Formal Testing : Accuracy testing with larger datasets Production Deployment : Consider real-world deployment Expand Anomaly Detection : Add more patterns for comprehensive coverage Call to Action Try building your own classification system Experiment with FastAPI and AI APIs Share your results and improvements Contribute to open-source tax automation tools Acknowledgements I would like to extend a sincere thank you to Professor Rohit Aggarwal for providing the opportunity, the foundational framework, and invaluable guidance for this project. About the Author Hitesh Balegar is a graduate student in Computer Science (AI Track) at the University of Utah, specializing in the design of production-grade AI systems and intelligent agents. He is particularly interested in how effective human-AI collaboration can be utilized to build sophisticated autonomous agents using frameworks such as LangGraph and advanced RAG pipelines. His work spans multimodal LLMs, voice-automation systems, and large-scale data workflows deployed across enterprise environments. Before attending graduate school, he led engineering initiatives at CVS Health and Zmanda, shipping high-impact systems used by thousands of users and spanning over 470 commercial locations. Dr. Rohit Aggarwal is a professor , AI researcher and practitioner. His research focuses on two complementary themes: how AI can augment human decision-making by improving learning, skill development, and productivity, and how humans can augment AI by embedding tacit knowledge and contextual insight to make systems more transparent, explainable, and aligned with human preferences. He has done AI consulting for many startups, SMEs and public listed companies. He has helped many companies integrate AI-based workflow automations across functional units, and developed conversational AI interfaces that enable users to interact with systems through natural dialogue. This article demonstrates how modern Python tools enable rapid prototyping of production-ready AI automation systems. The key is focusing on real problems, using cost-effective solutions, and building maintainable architecture from the start.

10 min read

Loading...

What an AI CoE Is Actually Supposed to Do

Measuring AI ROI and Prioritizing Initiatives: The DRIVE Framework

Taxonomy of Reliability Failures in GenAI Agentic Systems

Inferring When Cursor Is Actually Working: An Automation Agent Built Without APIs

Building an AI-Powered Creative QA System: Combining HEIM Metrics with LLM-Based Marketing Judgment

Building a Multi-Agent AI Compliance (eg SOX) System: Master Orchestrator Architecture with RAG, Vector Search, and Real-Time Monitoring

Ad Campaign Performance Analysis using n8n: Weekly Reporting and Statistical Anomaly Detection

Building an AI-Powered Tax Sensitization System: Automating Compliance with AI & FastAPI

How I Built an AI-Driven Lead Generation & Enrichment Pipeline Using GPT-4o and Clay

From No-Code to LangGraph Agent: Automating Outbound Sales