Nikhil VVS
About
My name is Nikhil VVS
Data Engineer specializing in Big Data, Cloud Computing, and ETL pipelines. Experienced in real-time data streaming, cloud data warehousing, and AI-driven analytics using AWS, Apache Spark, and SQL. Passionate about designing scalable data solutions and optimizing workflows to drive business insights.

Experience
Accenture
Custom Software Engineering Associate
Dec 2022 – Jul 2023
Exposys Data Labs
Full-Stack Developer Intern
Jun 2021 – Nov 2021
Verzeo
Data Science & Machine Learning Intern
May 2020 – Aug 2020
Built scalable data solutions and automated ETL workflows.
-
Developed high-performance ETL pipelines using Apache Spark & AWS Glue, reducing data processing time by 40%.
-
Implemented real-time data streaming with Apache Kafka, improving event-driven data ingestion.
-
Optimized data storage & query performance in AWS Redshift, increasing efficiency in data retrieval.
-
Designed and deployed CI/CD pipelines with Docker, Kubernetes, and Terraform for seamless cloud deployment.
-
Technologies: AWS Glue, Spark, Kafka, Redshift, SQL, Python, Terraform, Docker, Kubernetes
Designed and optimized web applications for seamless user experience.
-
Developed a real-time chat application using React.js & Node.js, enhancing team collaboration.
-
Optimized MySQL database queries, reducing response times by 25%. Implemented cloud-hosted solutions using AWS Lambda & S3 to improve system scalability.
-
Technologies: React.js, Node.js, MySQL, AWS Lambda, S3
Developed machine learning models for classification tasks using Python, Scikit-learn, and TensorFlow.
-
Conducted data preprocessing & feature engineering to enhance model accuracy.
-
Created data visualizations using Matplotlib & Seaborn to present insights effectively.
-
Deployed a prototype of a fraud detection system using classification algorithms.
-
Technologies: Python, Scikit-learn, TensorFlow, Pandas, Matplotlib, Seaborn
Big Data & Cloud Projects
A showcase of my work in Big Data, Cloud Computing, and Data Engineering, where I build scalable ETL pipelines, real-time streaming solutions, and cloud-based architectures. These projects leverage technologies like Apache Spark, AWS, Kafka, and SQL to optimize data processing and analytics for real-world applications. Explore how I transform raw data into actionable insights through automation and engineering excellence.
Education
University of Texas at Arlington
Master of Science in Computer Science
Aug 2023 - May 2025
-
Relevant Coursework: Cloud Computing & Big Data, Distributed Systems, Machine Learning, Database Systems, Operating Systems, Computer Networks.
-
Graduate Teaching Assistant (GTA) - Assisted in coursework and labs for database systems, guiding students in SQL, database design, and query optimization.
-
AWS Data Engineering Capstone - Built a scalable ETL pipeline using AWS Glue, Redshift, and Lambda, optimizing cloud data processing efficiency.
-
HackUTA Participant - Engaged in UTA's premier hackathon, developing innovative solutions in a fast-paced environment
Chaitanya Bharathi Institute of Technology
Bachelor of Engineering in Computer Science
Aug 2018 - Jun 2022
-
Relevant Coursework: Data Structures, Artificial Intelligence, Machine Learning, Database Management Systems, Design and Analysis of Algorithms, Operating Systems, Computer Networks, Compiler Design, Object Oriented Programming, Probability and Statistics, Computer Architecture and Microprocessor, Decision Theory, Web and Internet Technologies, Soft Computing, Human Computer Interaction.
-
Activities & Achievements:
-
Smart India Hackathon Semifinalist - Participated in India's largest innovation competition, developing solutions for real-world industry problems.
-
CBIT MUN - Delegate Relations - Facilitated and organized Model United Nations events, coordinating logistics and outreach.
-
CBIT Robovanza - Organizing Committee - Led event planning and execution for technical robotics competitions at CBIT.
Skills and Technologies
Programming Languages:
Python, Java, JavaScript (React, Node.js), C
Web Development:
Django, React.js, Node.js, Express.js, Bootstrap, REST APIs, HTML5, CSS3, jQuery, Heroku
Databases & Storage:
AWS DynamoDB, AWS S3, MySQL, PostgreSQL, MongoDB, Firebase RealtimeDB, Oracle SQL
Big Data & Data Processing:
Apache Spark, Hadoop, AWS Glue, Apache Kafka
AI & Machine Learning:
PyTorch, TensorFlow, Keras, Pandas, NumPy, Scikit-learn, OpenCV
DevOps & Infrastructure:
Kubernetes, Docker, Jenkins, AWS (Lambda, Redshift, EC2, Glue, Athena), Bash Scripting, Linux (Development & Deployment)
Tools & Miscellaneous:
Git, Jupyter Notebook, Postman (API Testing), Grafana, LaTeX, VS Code, GDB