Product Systems · Data Science · Creative Front-End Prototypes

Hi, I'm Saisha Ketkar.

I'm a Data Science and Economics student at Northeastern University. My work spans AI-inspired product prototypes, data systems, dashboards, automation workflows, and interactive front-end projects. I like building tools that are practical enough to use, clear enough to understand, and polished enough to feel intentional.

01

Creative prototypes

Interactive web projects that explore journaling, focus, self-expression, and playful everyday tools.

02

Data and systems

Database, analytics, and storage projects that show how I structure information and evaluate technical tradeoffs.

03

Workflow design

Automation, documentation, and operational tools built to reduce friction and help teams move faster.

How I work

Frame the problem → map the flow → build the prototype → test the details → document the decisions.

I like projects where the interface and the system support each other: the product should feel intuitive on the surface, while the underlying structure handles data, state, content, and future expansion thoughtfully.

Selected work

Projects

Twitter Storage & Retrieval System

Python · MySQL · Redis · SQL

A Twitter-style backend system built to support tweet posting and home timeline retrieval at scale. I designed relational schemas for users, tweets, and follower relationships, implemented API-style functions like postTweet and getHomeTimeline, and benchmarked relational versus in-memory storage tradeoffs.

Technical focus

  • Benchmarked 1M tweet insertions and 50K timeline requests.
  • Improved insertion throughput from approximately 3,906 to 5,574 calls per second (roughly 43% higher) using Redis.
  • Evaluated read/write performance tradeoffs for high-volume social media workloads.
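
For illustration only, here is a minimal sketch of how the two API-style functions could be shaped on top of an in-memory store. It is not the project's code: plain Python dictionaries stand in for Redis, the class name InMemoryTweetStore and the follow helper are hypothetical, the functions are written in snake_case (post_tweet, get_home_timeline), and copying each tweet into followers' timelines at write time is an assumed strategy chosen for the example.

```python
from collections import defaultdict
from itertools import count


class InMemoryTweetStore:
    """Hypothetical stand-in for a key-value store such as Redis."""

    def __init__(self):
        self._ids = count(1)                 # monotonically increasing tweet ids
        self.tweets = {}                     # tweet_id -> (user_id, text)
        self.followers = defaultdict(set)    # user_id -> ids of users following them
        self.timelines = defaultdict(list)   # user_id -> tweet ids, newest first

    def follow(self, follower_id, followee_id):
        self.followers[followee_id].add(follower_id)

    def post_tweet(self, user_id, text):
        """Store the tweet, then push its id onto every follower's timeline."""
        tweet_id = next(self._ids)
        self.tweets[tweet_id] = (user_id, text)
        for follower_id in self.followers[user_id]:
            self.timelines[follower_id].insert(0, tweet_id)
        return tweet_id

    def get_home_timeline(self, user_id, limit=10):
        """Return the newest tweets from accounts the user follows."""
        return [self.tweets[t] for t in self.timelines[user_id][:limit]]
```

In this sketch, posting does more work so that reading a timeline is a single list slice. That is one side of the read/write tradeoff a benchmark like the one above would measure.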

MongoDB News Classification Data Pipeline

Python · MongoDB · JSON · Aggregation Pipelines

A document-store data pipeline for transforming raw sarcasm headline data into structured MongoDB-ready documents. I cleaned malformed URLs, extracted source domains, created nested headline records, and wrote aggregation pipelines to analyze sarcasm rates and keyword trends.

Data modeling

  • Designed a flexible document schema with nested records and derived text features.
  • Used mongoimport, indexing, and aggregation stages including $unwind, $group, and $project.
  • Compared modeling tradeoffs for nested retrieval, analytics queries, and scalable ingestion.
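
As a rough sketch of the steps described above (not the original pipeline), the cleaning and aggregation could look like the following. The raw field names article_link, headline, and is_sarcastic, the nested document layout, and the helper names are all assumptions made for the example. The pipeline is written as plain Python data in the shape a MongoDB driver's aggregate call accepts, so nothing here needs a running database.

```python
from urllib.parse import urlparse


def extract_domain(url):
    """Return the host of a URL, tolerating stray whitespace or a missing scheme."""
    url = url.strip()
    if "://" not in url:
        url = "https://" + url
    host = urlparse(url).netloc.lower()
    return host[4:] if host.startswith("www.") else host


def to_document(raw):
    """Reshape one raw record (assumed field names) into a nested document."""
    words = raw["headline"].lower().split()
    return {
        "source": {"domain": extract_domain(raw["article_link"])},
        "headline": {
            "text": raw["headline"],
            "is_sarcastic": bool(raw["is_sarcastic"]),
            "words": words,  # derived text feature used by the pipeline below
        },
    }


# One document per word, then counts and sarcasm rate per keyword.
SARCASM_BY_KEYWORD = [
    {"$unwind": "$headline.words"},
    {"$group": {
        "_id": "$headline.words",
        "count": {"$sum": 1},
        "sarcastic": {"$sum": {"$cond": ["$headline.is_sarcastic", 1, 0]}},
    }},
    {"$project": {
        "_id": 0,
        "keyword": "$_id",
        "count": 1,
        "sarcasm_rate": {"$divide": ["$sarcastic", "$count"]},
    }},
]
```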

Aircraft Incident Data Quality & Analytics Database

MySQL · R · SQL · ETL

A normalized analytics database for aircraft incident data. I designed a 3NF MySQL schema, built an ETL pipeline in R, deployed a cloud-hosted MySQL database secured with SSL, and produced SQL-driven analytics on trends by month, airline, and year.

Analytics system

  • Applied statistical analysis and functional dependencies to structure the schema.
  • Validated data integrity with quality-control checks before downstream analytics.
  • Created modular SQL query patterns for reusable analytics access.
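
One way a modular query pattern like this could be organized is sketched below, under stated assumptions. It is not the project's schema or code: SQLite (from Python's standard library) stands in for MySQL so the example runs anywhere, the airlines and incidents tables and their columns are hypothetical, and the rows are made up for the demo. In MySQL the grouping expressions would use functions such as YEAR() and MONTH() instead of strftime.

```python
import sqlite3

# Allow-list of grouping expressions: one reusable query, three views of the data.
GROUPINGS = {
    "month": "strftime('%m', i.incident_date)",
    "year": "strftime('%Y', i.incident_date)",
    "airline": "a.name",
}


def incidents_by(conn, dimension):
    """Count incidents grouped by 'month', 'year', or 'airline'."""
    expr = GROUPINGS[dimension]  # unknown dimensions raise KeyError, never reach SQL
    sql = f"""
        SELECT {expr} AS bucket, COUNT(*) AS incidents
        FROM incidents AS i
        JOIN airlines AS a ON a.airline_id = i.airline_id
        GROUP BY bucket
        ORDER BY bucket
    """
    return conn.execute(sql).fetchall()


def demo_connection():
    """Build a tiny in-memory database with made-up rows for the example."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE airlines (airline_id INTEGER PRIMARY KEY, name TEXT NOT NULL);
        CREATE TABLE incidents (
            incident_id INTEGER PRIMARY KEY,
            airline_id INTEGER NOT NULL REFERENCES airlines(airline_id),
            incident_date TEXT NOT NULL
        );
        INSERT INTO airlines VALUES (1, 'Airline A'), (2, 'Airline B');
        INSERT INTO incidents VALUES
            (1, 1, '2001-03-14'), (2, 1, '2001-07-02'), (3, 2, '2002-03-30');
    """)
    return conn
```

Calling incidents_by(demo_connection(), "year") returns one (bucket, incidents) row per year. Supporting a new grouping only requires adding an entry to GROUPINGS.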

Notion + Slack Operations Automation

Notion · Slack · Workflow Design · Operations

An operations and finance workflow system built for TAMID at Northeastern to coordinate reimbursements, budgeting, event planning, and member processes across a 100+ member organization. The system connected Notion workflows with Slack notifications to reduce manual follow-up and improve operational visibility.

Workflow impact

  • Designed a centralized operations system for finance requests, approvals, and leadership visibility.
  • Integrated Slack notifications to automate reimbursement updates and reduce manual coordination.
  • Built the system around real organizational needs, recurring tasks, and cross-functional communication.
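
To make the notification step concrete, here is a small hypothetical sketch of building a reimbursement update message. It assumes a Slack incoming webhook, which accepts a JSON body with a "text" field. The actual system may have used a different integration, and the function name and message fields are invented for the example. Sending the request is left out so the sketch stays self-contained.

```python
import json


def reimbursement_update(request_id, requester, amount, status):
    """Build the JSON body for a status-change notification (hypothetical fields)."""
    text = (
        f"Reimbursement {request_id} for {requester} "
        f"(${amount:,.2f}) is now {status}."
    )
    return json.dumps({"text": text})
```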