Agus Raju Thaliyan — Data Scientist and ML Engineer
Google Certified Data Analyst

Hi, I'm Agus Raju Thaliyan. Data Scientist. IEEE Published Author.

Bridging the gap between raw data and autonomous execution. I build production-grade ML pipelines, RAG architectures, and multi-agent systems for enterprise scale.

Trusted & Certified by

PROFESSIONAL
Experience

Current

DEEPMOST AI

Data Science Intern // '26 - Present

Synthetic_Records

10K+

Velocity_Mult

3.0x

RAG_Accuracy

+15%

Architecting Agentic AI workflows and high-fidelity RAG pipelines to simulate autonomous business environments.

Engineered a Multi-Agent System designed to bridge the cold-start data gap. By deploying LangChain and Gemini models, the architecture synthesizes robust B2B training datasets, reducing system latency by ~20% while exponentially outperforming manual curation velocity.

LangChain LLM's Agentic AI Vector DB Firecrawl
Internship JUL '23 - OCT '23

CPPR

Business Development Intern

"Led complex CRM data transitions. Developed Excel ETL pipelines to sanitize 1,200+ duplicate records across 5 high-density categories."

Reliability

40%

Sanitized

1.2K

ACADEMIC
BACKGROUND

2024 — 2026 Graduated

M.Sc. Computer Science

Data Analytics Specialization

Rajagiri College of Social Sciences (RCSS). Advanced focus on scalable data architectures, predictive modeling, and agentic AI systems.

7.5 CGPA
Machine Learning Data Engineering Predictive Analytics
Delta +1.27 ▲
2020 — 2024

B.Sc. Statistics

Mathematics & Probability

Mar Athanasius College of Arts and Science. Built the core mathematical and probabilistic foundation essential for advanced data science.

6.23 CGPA

Selected Projects

Click any card to view source code.

Agentic AI Gemini 2.5 Multi-Agent LangChain

Agentic AI Sales SDR

3-agent B2B simulation engine — Seller, Buyer & Judge agents generating 10,000+ synthetic training samples in under 48 hours with 9+ features per simulation.

View source →
Data Engineering YouTube API v3 Power BI

YouTube News Analytics

End-to-end pipeline analysing 9,000+ videos from 8 Indian news channels. 12-KPI Power BI dashboard with viral outlier detection.

View source →
PythonScikit-LearnXGBoost

Stock Predictor

Random Forest & XGBoost models for real-time market forecasting.

View source →
StatisticsRScikit-Learn

Student Elective Analysis

Chi-square & Spearman correlation across 400+ student records. 82% classification accuracy with Logistic Regression.

View source →

Certified Expertise

01

Research &
Publications

Peer-reviewed literature and conference proceedings.

Featured Scopus-Indexed ICAUC 2026 // Bangkok

"Integrating Nighttime Satellite Radiance and Mobile Network Infrastructure for Fine-Grained Socioeconomic Mapping"

Presented at the International Conference on AI-Driven Smart Systems & Ubiquitous Computing. Explores novel methodologies combining satellite intelligence with telecommunications data.

Publisher

IEEE Xplore

ISBN

979-8-3315-5851-2

Domain

Satellite Intelligence • Socioeconomic AI

02

Industry
Recognition

EY-Recognised Power BI Dashboard

2nd Best Overall

Ernst & Young

Dashboard officially recognised by EY Directors for demonstrating production-grade data visualisation and KPI storytelling skills evaluated against strict industry standards.

Indo-Pacific Health Case Competition

International

Univ. of Melbourne & Rajagiri

Represented Rajagiri in a cross-departmental team against 30+ international teams. Developed an evidence-based intervention strategy for Asthma Management in the Philippines.

03

System
Credentials

Google

Advanced Data Analytics Professional Certificate

Prompt Design in Vertex AI • Build AI Apps with Gemini

Microsoft

Data & ML Certifications

Power BI (3 courses) • Machine Learning with Supervised Learning

IBM

Databases & SQL

Relational database fundamentals and advanced SQL for data science.

HackerRank

SQL Level 5

Complex joins, subqueries, and window functions.

04

Events &
Bootcamps

SAP State HUB Level Hackathon

Chief Coordinator

SAP • Rajagiri College

Led seamless execution, managed technical mentoring, and facilitated a large-scale student development event.

Gen AI Exchange Bootcamp

Invited

Hack2Skill & Google Cloud

Intensive invite-only bootcamp covering advanced generative AI infrastructure and Large Language Model (LLM) solutions.