
Hi, I'm Sumit Nautiyal
Software Engineer
I design and ship data-intensive systems across cloud platforms and AI workloads.
Profile
I am a Software Engineer and graduate student at the State University of New York at Binghamton, focusing on Artificial Intelligence and data-intensive systems.
I work across the data lifecycle: ingestion, processing, modeling, and deployment of machine learning workloads on cloud infrastructure.
Outside class and work, I read research papers and play games that stress-test hardware and reaction time.
Data Engineering Expertise
AI/ML Skills
Languages & Databases
Business Intelligence
Cloud Services
CI/CD & DevOps
Security & Governance
Tools & Platforms
Timeline & Roles
Chronological view of education and roles in computer science and data engineering.
Master's in Computer Science (Artificial Intelligence Track)
Lead Application Consultant
- Led development team to design and deploy real-time, event driven, multi-cloud data ingestion pipelines by integrating with MDM, Cloud data warehouse (Snowflake), AWS S3, Kafka and target state thereby reducing data latency by 95% and improving data integrity.
- Implemented large data platforms using microservice architecture on Kubernetes for streaming applications which led to cutting infrastructure costs and improving scalability for data-intensive applications.
- Implemented data cataloging using MDM and Snowflake, establishing data lineage for 20+ country inventory catalog datasets offering for B2B customers.
- Consulted with cross-functional stakeholders to understand business requirements challenges and address remediations through data architecture and data engineering solutions, significantly improving operational efficiency and business intelligence capabilities.
- Identified data anomalies at various stages of data pipeline, performed impact analysis and collaborated with business teams on issue resolution and root cause analysis.
- Developed CI/CD pipeline automation using Airflow, GitHub Actions, Docker, and Terraform, improving deployment frequency for monolithic application.
Senior Data Engineer & Team Lead
- Managed data integration projects, synchronizing NetSuite ERP and SAP with real-time ETL services, enhancing data accuracy and system interoperability.
- Deployed scalable data solutions on AWS ECS and GCP GKE, leading to optimized performance for organization's web applications.
- Performed data cleansing, transformation, ingestion for real-time data to CRM and SAP endpoints, improving sales for potential client and implemented data analytics to create custom dashboards for senior management.
- Engaged closely with business stakeholders to align technology solutions with business objectives, presenting data-driven insights using Power BI.
Software Engineer
- Developed scalable REST & GraphQL APIs, enabling headless commerce solutions, improving data retrieval efficiency, and driving an increase in mobile conversions through performance optimization.
- Developed data-driven optimization for payment and logistics modules, cutting user friction and significantly reducing support tickets through enhanced data validation and streamlined transactional flows.
- Collaborated directly with product stakeholders to define and deliver data-centric solutions, aligning technical implementations with business KPIs and customer success metrics.
Freelance Web Consultant
- Delivered 30+ data-intensive web platforms, optimized databases, and improved client SEO analytics through robust data engineering practices, increasing organic traffic by up to 60%.
- Built Python-based inventory synchronization tool leveraging Amazon MWS & eBay APIs, integrating real-time inventory data across multiple platforms, reducing oversell incidents by 20%.
- Consulted with clients globally, gathering and translating complex data and analytics requirements into actionable insights, enhancing their operational and business intelligence capabilities.
Technology Support Engineer (Contractual)
- Provided technical analysis and data-driven troubleshooting for enterprise clients, significantly reducing downtime by utilizing SQL-based data extraction, log analysis, and root-cause identification.
- Managed technical incident data, creating detailed documentation and analytics reports, improving issue resolution rates by 30% through process optimization and data insights.
- Consulted cross-functionally with IT teams and stakeholders, effectively communicating data-driven recommendations to enhance customer system performance and satisfaction.
Junior Software Developer
- Developed and maintained web applications using Java,Spring, Hibernate, XML, HTML, CSS, and backend technologies. Collaborated with senior developers teams to deliver high-quality software solutions.
- Supported development of data-driven web applications, integrating backend services using SQL and REST APIs, ensuring accurate and real-time data delivery across web platforms.
- Assisted senior developers in database schema design and optimization, increasing query performance effective indexing and normalization strategies.
- Collaborated with business analysts to gather requirements, converting business needs into data-centric development tasks, enhancing clarity, and efficiency of project deliverables.
Bachelor's in Electrical and Electronics Engineering
Achievements & Certifications
Selected certifications, awards, and formal assessments.
Generative AI for Everyone DeepLearning.AI
Survey course on generative AI capabilities, constraints, project lifecycle, and risk surface.
View CredentialPower Skills - Stakeholder Management
Training on stakeholder identification, communication strategies, and aligning technical work with constraints.
View CredentialLinkedIn Introduction to Career Skills in Software Development
Overview of core practices expected from professional software developers.
View CredentialBest Data Engineering Project
Awarded for a real-time data ingestion pipeline that reduced latency by 95% and improved deployment frequency.
Top Performer - Data Integration
Received highest performance rating for managing large-scale data integration projects.
Academic Coursework
Core courses in AI, machine learning, security, algorithms, and systems.
CS 581B Robot Perception
State University of New York at Binghamton • Fall 2025
Study of perception algorithms for robotics applications including computer vision, sensor fusion, and 3D mapping. Hands-on projects using Python and relevant libraries. Topics include image processing, feature extraction, object recognition, SLAM, and deep learning for perception.
CS 517 Human Computer Interaction
State University of New York at Binghamton • Fall 2025
Study of principles and techniques for undamentals of HCI, basic techniques of data analysis, Mobile and Wearable Computing, Ubiquitous Computing (Internet of Things), VR/AR, Brain-Computer Interaction (BCI), Accessibility, and Smart Health. Importance of the human-computer interfaces in the design and development of things people use daily, especially those smart electronic gadgets and their profounds in the human-centered studies, such as smart watches/wristbands, mobile/wearable devices, smart speakers, touch screen, eye/hand/limb tracking, body gesture, voice assistance, VR/AR/MR, and humanoid robots.
CS 515 Social Media and Data Science Pipelines
State University of New York at Binghamton • Fall 2025
Study of data science techniques applied to social media data including data collection, processing, analysis, and visualization using Python and relevant libraries. Hands-on projects involving sentiment analysis, trend analysis, and social network analysis.
CS 565 Introduction to Artificial Intelligence
State University of New York at Binghamton • Spring 2025
Introduction to AI concepts including search algorithms, knowledge representation, and reasoning. Hands-on projects using Python.
CS 551 Systems Programming
State University of New York at Binghamton • Spring 2025
Study of system-level programming including operating systems, file systems, and concurrency in Rust. Hands-on projects building rainbow tables and implementing system utilities.
CS 536 Introduction to Machine Learning
State University of New York at Binghamton • Spring 2025
Introduction to fundamental concepts and techniques in machine learning including supervised and unsupervised learning, model evaluation, and feature selection. Hands-on projects using Python and popular ML libraries.
CS 527 Mobile Systems Security
State University of New York at Binghamton • Spring 2025
Study of security challenges and solutions in mobile computing environments. Topics include mobile OS security, app security, network security, and emerging threats. Hands-on projects and case studies.
CS 558 Introduction to Computer Security
State University of New York at Binghamton • Fall 2024
Study of fundamental concepts in computer security including cryptography, network security, and system vulnerabilities.
CS 575 Design & Analysis Comp Algorithms
State University of New York at Binghamton • Fall 2024
Study of algorithm design techniques, complexity analysis, and advanced data structures.
CS 571 Programming Languages
State University of New York at Binghamton • Fall 2024
Study of various programming paradigms including functional, logic, and object-oriented programming.
CS 590X CS Professional Development
State University of New York at Binghamton • Fall 2024
Focus on professional skills for computer science careers including teamwork, communication, and ethics.
Selected Research Papers
Papers I reference when working on transformers, SLAM, and related systems.
Attention Is All You Need
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin
Computation and Language (NeurIPS) • 2017
Introduces the Transformer architecture and attention-only sequence modeling used in modern language models.
CausalStock: Deep End-to-end Causal Discovery for News-driven Stock Movement Prediction
Shuqi Li, Yuebo Sun, Yuxin Lin, Xin Gao, Shuo Shang, Rui Yan
Advances in Neural Information Processing Systems • 2024
Combines causal discovery and deep models to predict stock movements directly from news streams.
Scalable Multi-Session Visual SLAM in Large-Scale Scenes with Subgraph Optimization
Pan, Xiaokun & Li, Zhenzhe & Fan, Tianxing & Zhai, Hongjia & Bao, Hujun & Zhang, Guofeng
IEEE Transactions on Pattern Analysis and Machine Intelligence • 2025
Presents a multi-session visual SLAM system using subgraph optimization for large scenes.
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
Carlos Campos, Richard Elvira, Juan J. Gómez Rodríguez, José M. M. Montiel, Juan D. Tardós
IEEE Transactions on Robotics • 2021
Describes ORB-SLAM3, a visual and visual‑inertial SLAM library with multi‑map support.
Technical Books
Books I revisit for statistics, systems, HCI, and e‑commerce implementation details.
Publication
Peer‑reviewed work in electrical engineering.
Estimating torque-speed characteristic of three-phase induction motor operating under unbalance supply
Authors: A. S. Nautiyal, B. P. Thakur, C. A. Chauhan and D. K. Govind
Conference: 2013 Nirma University International Conference on Engineering (NUiCONE), Ahmedabad, 2013
Pages: 1-6 | DOI: 10.1109/NUiCONE.2013.6780144
The paper analyzes torque‑speed characteristics of three‑phase induction motors under unbalanced supply conditions and quantifies performance degradation.
Current Projects in Development
Active projects and experiments.
Social Media Real-Time Data Streaming Platform using Apache Kafka and Faktory
Real-time data processing platform using Apache Kafka and Faktory for low-latency ingestion from social media sources, with streaming analytics and dashboards.
Panoptic Segmentation for Mobile Robot Navigation
Panoptic segmentation model for mobile robot navigation in dynamic environments, integrating perception and path-planning components.
SignRing: Continuous American Sign Language Recognition Using IMU Rings and Virtual IMU Data
System for continuous American Sign Language recognition using IMU rings and virtual IMU data, built on wearable sensing and sequence models.
Projects
Recent work in software, data, and AI engineering.
Network Password Cracking with Hashassin Rainbow Tables
Hashassin generates, stores, and uses rainbow tables in a distributed architecture coordinated by a central server.
Reinforcement Learning Environment Design & Q-Learning Implementation
Custom reinforcement learning environment with a Q‑learning agent and evaluation harness.
Convolutional Neural Network Implementation and Evaluation for MNIST Handwritten Digit Classification
CNN for MNIST handwritten digit classification implemented from scratch in Python and NumPy.
AI-Powered Multimodal Misinformation Detection System
Multimodal misinformation detector over social media posts, combining NLP models for text with CNNs for images.
Cellular Network Analysis and Kotlin App Development
Signal strength, tower switching, and network stability study around Binghamton using Android data and Power BI.
Cellular Network Packet Analysis using VPN and PII detection using Wireshark
Packet‑capture analysis of VPN traffic with Wireshark to assess encryption and identify plaintext/PII leaks.
Gaming
Long‑running interest in systems-heavy and mechanics-driven games.
Currently Playing
Metro Exodus Enhanced Edition
PCAtmospheric FPS that balances linear storytelling with semi-open hub exploration.
Elden Ring
PCA masterclass in open-world traversal and boss mechanical complexity.
Gaming Stats
Recently Completed
Divinity: Original Sin 2
The gold standard for systems-driven CRPGs. Completed a Tactician run.
Divinity: Original Sin Enhanced Edition
Predecessor to DOS2 with similar tactical systems. Completed main story and major side content.
Devil May Cry 5
Action game focused on combo-heavy combat. Completed main story and side missions; targeting 100% completion.
Hades
Roguelike with fast combat and run-based progression. Multiple successful clears.
Age of Empires II: Definitive Edition
RTS with historical campaigns and long-term multiplayer play.
Figment
Action‑adventure with puzzle sequences and stylized art direction.
The last campfire
Puzzle‑driven adventure with short, contained sessions.
100% Completion
Outward
(July 2022)Survival‑oriented open‑world RPG with co‑op support. Completed main story and side content.
Vampire Survivors
(July 2022 - October 2025)The definition of 'gameplay loop' optimization. Pure dopamine efficiency.
Remnant: From the Ashes
(May 2021)Third‑person shooter with Souls‑like structure and co‑op play.
Remnant II
(March 2025)Follow‑up to Remnant with expanded build variety and co‑op progression.
Raji: An Ancient Epic
(December 2024)Action‑adventure set in ancient India with fixed-length campaign.
Resume
Download a PDF resume with condensed employment history and skills.
Sumit Nautiyal
Data Engineer & Software Developer
By downloading, you agree to use this information for professional purposes only.
Contact
Open to roles, research collaborations, and data‑intensive side projects.