Gaurav Raheja

Gaurav Raheja

Pune, India Hindi · English · Russian
Core Expertise
Threat Intelligence SystemsDistributed Systems DesignData Pipeline EngineeringCloud Infrastructure · GCPMonitoring & Agentic DebuggingContainer OrchestrationFull-Stack DevelopmentWeb Scraping & Crawling

About Me

Software Engineer with 7+ years specialising in threat intelligence, data pipelines, and cloud-native systems. At Google, I've contributed to darknet intelligence collection from the ground up — building core systems, growing target coverage, and progressively automating high-friction engineering workflows using LLM agents. Most recently, I built an agentic observability system using Looker and BigQuery agents that brings structured debugging to collection pipelines. I work best when there's ambiguity to resolve and cross-functional coordination to drive.

Experience

Software Engineer
Google
Joined via Mandiant acquisition, Nov. 2022
Aug. 2022 – Present Pune, IN
  • Ensured zero-downtime operational continuity during the Mandiant-Google acquisition by scaling the threat intelligence collection platform with new data sources and targeted reliability improvements.
  • Eliminated a high-friction manual workflow by architecting and deploying an LLM-agent-driven code generation system, reducing related engineering efforts by 70%.
  • Reduced code duplication by nearly 30% across the system by designing and implementing a centralized, custom library.
  • Achieved real-time issue detection and triage across collection pipelines by architecting a proactive observability system powered by Looker and BigQuery agents.
  • Slashed manual monitoring and debugging overhead by 75% by designing and implementing an autonomous agentic pipeline.
Software Developer
One Convergence Pvt. Ltd
Feb. 2020 – Jul. 2022 Hyderabad, IN
  • Built a CLI automation tool in Python, Bash, and Terraform to streamline VM provisioning across multi-cloud environments, integrated with a React dashboard for centralized infrastructure management.
  • Stepped up as interim engineering lead — owned code reviews, coordinated releases, and unblocked UI feature deliveries during a critical project phase.
  • Led a zero-downtime migration from NoSQL to SQL for the core product database, improving query performance and data consistency for production workloads.
  • Designed and shipped high-performance Go APIs enabling cross-team collaboration between UI and ML departments, reducing integration lag across teams.
  • Guided multiple clients through Kubernetes and Helm-based cloud-native migrations, providing hands-on technical leadership on container orchestration and workload transition.
Software Engineer
De-Haze Labs
Jul. 2018 – Aug. 2019 Remote, US
  • Engineered high-throughput web crawlers in Python/Scrapy to automate large-scale data collection with optimized resource utilization.
  • Integrated Dialogflow-based conversational interfaces to bridge automated data pipelines with customer-facing workflows.
  • Drove R&D efforts to prototype new product features, several of which were adopted into the product roadmap.

Open Source

TimeScaleDb

Enhanced Helm Chart flexibility for single-node installations, enabling support for custom secret management. Pull Request #327.

HelmKubernetesSecrets
View on GitHub
Chevotrain

Improved API usability by auditing and correcting critical function documentation. Pull Request #1734.

DocumentationAPI
View on GitHub
HttPie

Systemically resolved issues with Snapcraft autocompletion to improve developer productivity. Pull Request #1189.

PythonSnapcraftAutocompletion
View on GitHub
JenkinsCI Kubernetes Operator

Maintained repository integrity by resolving broken documentation links and cross-references. Pull Request #287, #288.

JenkinsK8sOperator
View on GitHub
TwitterScraper

Optimized Docker build workflows to resolve environment-specific compilation issues. Pull Request #253.

DockerPythonWeb Scraping
View on GitHub

Projects

Academic

Dynamo Scrapper 3 Months

Architected a no-code web scraping platform featuring a GUI for crawler creation and mobile-based deployment, simplifying data extraction for non-technical users.

PythonDjangoScrapyTwistedNativeScript
MedBot 1 Month

Developed a medical prescription analysis tool that identifies drugs and cross-references them with online alternatives via 3rd-party APIs to improve medication accessibility.

C#HTMLAgility PackRestPlus
SCAP 1 Month

Engineered an IoT-based vehicular safety system using Arduino and ultrasonic sensors for proximity detection, automatic speed control, and SOS emergency messaging.

ArduinoUltrasonic sensorNode RedEmbeddedTwillio
Training Placement Portal 5 Months

Designed and implemented a centralized university portal to streamline the management and communication of academic events, training sessions, and placement drives.

PHPHTMLJSCSSBootstrap3

Skills

Languages
PythonGoTypeScriptJavaScriptBash
Backend & Data
FlaskFastAPIScrapyElasticSearchBigQueryApache DataflowApache NiFi
Cloud & Infrastructure
GCPDockerKubernetesHelmTerraformGitHub Actions
Databases
PostgreSQLMongoDB
Frontend
ReactAstro
AI & LLM
LiteLLMPrompt EngineeringLooker

Certifications

  • Mandiant Cyber Threat Intelligence Analyst
    Mandiant

    Demonstrated proficiency in threat intelligence frameworks, including STIX and the Diamond Model, to streamline workflows and improve structural analysis of cyber threats.

    2024

Education

Centre for Development of Advance Computing (CDAC)
Diploma in Advance Secure Software Development
2019 – 2020
Guru Jambheshwar University of Science & Technology
Bachelor of Computer Science & Tech.
2015 – 2019
Nav Bharat Vidya Mandir Sr. Sec. School
Higher Secondary School Certification
2012 – 2014
K.L. Arya D.A.V. Public School
Secondary School Certification
2010 – 2012
hire.gaurav@raheja.dev
+91 90179-20586
linkedin.com/in/this-is-r-gaurav
github.com/this-is-r-gaurav
Pune, India

Gaurav

Profile Image

Software Engineer with 7+ years specialising in threat intelligence, data pipelines, and cloud-native systems. At Google, I've contributed to darknet intelligence collection from the ground up — building core systems, growing target coverage, and progressively automating high-friction engineering workflows using LLM agents. Most recently, I built an agentic observability system using Looker and BigQuery agents that brings structured debugging to collection pipelines. I work best when there's ambiguity to resolve and cross-functional coordination to drive.

Work Experience

Google Pune, IN
Software Engineer Aug. 2022 – Present
  • Ensured zero-downtime operational continuity during the Mandiant-Google acquisition by scaling the threat intelligence collection platform with new data sources and targeted reliability improvements.
  • Eliminated a high-friction manual workflow by architecting and deploying an LLM-agent-driven code generation system, reducing related engineering efforts by 70%.
  • Reduced code duplication by nearly 30% across the system by designing and implementing a centralized, custom library.
  • Achieved real-time issue detection and triage across collection pipelines by architecting a proactive observability system powered by Looker and BigQuery agents.
  • Slashed manual monitoring and debugging overhead by 75% by designing and implementing an autonomous agentic pipeline.
One Convergence Pvt. Ltd Hyderabad, IN
Software Developer Feb. 2020 – Jul. 2022
  • Built a CLI automation tool in Python, Bash, and Terraform to streamline VM provisioning across multi-cloud environments, integrated with a React dashboard for centralized infrastructure management.
  • Stepped up as interim engineering lead — owned code reviews, coordinated releases, and unblocked UI feature deliveries during a critical project phase.
  • Led a zero-downtime migration from NoSQL to SQL for the core product database, improving query performance and data consistency for production workloads.
  • Designed and shipped high-performance Go APIs enabling cross-team collaboration between UI and ML departments, reducing integration lag across teams.
  • Guided multiple clients through Kubernetes and Helm-based cloud-native migrations, providing hands-on technical leadership on container orchestration and workload transition.
De-Haze Labs Remote, US
Software Engineer Jul. 2018 – Aug. 2019
  • Engineered high-throughput web crawlers in Python/Scrapy to automate large-scale data collection with optimized resource utilization.
  • Integrated Dialogflow-based conversational interfaces to bridge automated data pipelines with customer-facing workflows.
  • Drove R&D efforts to prototype new product features, several of which were adopted into the product roadmap.

Open Source Contributions

TimeScaleDb
Helm, Kubernetes, Secrets
  • Enhanced Helm Chart flexibility for single-node installations, enabling support for custom secret management. Pull Request #327.
  • https://github.com/timescale/helm-charts/pull/327
Chevotrain
Documentation, API
  • Improved API usability by auditing and correcting critical function documentation. Pull Request #1734.
  • https://github.com/Chevotrain/chevotrain/pull/1734
HttPie
Python, Snapcraft, Autocompletion
  • Systemically resolved issues with Snapcraft autocompletion to improve developer productivity. Pull Request #1189.
  • https://github.com/httpie/httpie/pull/1189
JenkinsCI Kubernetes Operator
Jenkins, K8s, Operator
  • Maintained repository integrity by resolving broken documentation links and cross-references. Pull Request #287, #288.
  • https://github.com/jenkinsci/kubernetes-operator
TwitterScraper
Docker, Python, Web Scraping
  • Optimized Docker build workflows to resolve environment-specific compilation issues. Pull Request #253.
  • https://github.com/taspinar/twitterscraper/pull/253

Projects

Academic
Dynamo Scrapper 3 Months
Btech. GJUS&T
Tech Stack: Python, Django, Scrapy, Twisted, NativeScript

Architected a no-code web scraping platform featuring a GUI for crawler creation and mobile-based deployment, simplifying data extraction for non-technical users.

MedBot 1 Month
Btech. GJUS&T
Tech Stack: C#, HTML, Agility Pack, RestPlus

Developed a medical prescription analysis tool that identifies drugs and cross-references them with online alternatives via 3rd-party APIs to improve medication accessibility.

SCAP 1 Month
Btech. GJUS&T
Tech Stack: Arduino, Ultrasonic sensor, Node Red, Embedded, Twillio

Engineered an IoT-based vehicular safety system using Arduino and ultrasonic sensors for proximity detection, automatic speed control, and SOS emergency messaging.

Training Placement Portal 5 Months
Btech. GJUS&T
Tech Stack: PHP, HTML, JS, CSS, Bootstrap3

Designed and implemented a centralized university portal to streamline the management and communication of academic events, training sessions, and placement drives.

Certification

Mandiant Cyber Threat Intelligence Analyst

Demonstrated proficiency in threat intelligence frameworks, including STIX and the Diamond Model, to streamline workflows and improve structural analysis of cyber threats.

Skills

Languages
PythonGoTypeScriptJavaScriptBash
Backend & Data
FlaskFastAPIScrapyElasticSearchBigQueryApache DataflowApache NiFi
Cloud & Infrastructure
GCPDockerKubernetesHelmTerraformGitHub Actions
Databases
PostgreSQLMongoDB
Frontend
ReactAstro
AI & LLM
LiteLLMPrompt EngineeringLooker

Education

Centre for Development of Advance Computing (CDAC) 2019 – 2020
Diploma in Advance Secure Software Development
(Hyderabad, IN)
Guru Jambheshwar University of Science & Technology 2015 – 2019
Bachelor of Computer Science & Tech.
(Hisar, IN)
Nav Bharat Vidya Mandir Sr. Sec. School 2012 – 2014
Higher Secondary School Certification
(Hisar, IN)
K.L. Arya D.A.V. Public School 2010 – 2012
Secondary School Certification
(Hisar, IN)

Languages

Hindi Intermediate
English Intermediate
Russian Beginner
Generating PDF…