Blockchain ETL Pipeline
Project Overview:
I developed an ETL (Extract, Transform, Load) pipeline that extracts real-time blockchain trading data through the Bitquery GraphQL API, transforms the JSON responses into structured tables, and loads the results into a PostgreSQL database for further analysis. The pipeline automates data collection across multiple blockchain networks, such as Ethereum and Binance Smart Chain.
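To illustrate the transform step, here is a minimal sketch of how a nested GraphQL response might be flattened into tabular rows. The response shape, the field names (`dexTrades`, `tradeAmount`, and so on), and the helper names `flatten` and `to_rows` are assumptions made for this example; the actual structure depends on the query sent to Bitquery.

```python
from typing import Any

# Hypothetical response, loosely shaped like a GraphQL trade query result.
# Real field names depend on the query and on the Bitquery schema.
SAMPLE_RESPONSE = {
    "data": {
        "ethereum": {
            "dexTrades": [
                {
                    "block": {"height": 100, "timestamp": {"time": "2024-01-01 00:00:00"}},
                    "transaction": {"hash": "0xabc"},
                    "tradeAmount": 12.5,
                },
                {
                    "block": {"height": 101, "timestamp": {"time": "2024-01-01 00:00:12"}},
                    "transaction": {"hash": "0xdef"},
                    "tradeAmount": 3.0,
                },
            ]
        }
    }
}


def flatten(record: dict[str, Any], parent: str = "", sep: str = "_") -> dict[str, Any]:
    """Collapse nested dicts into one level, joining keys with `sep`."""
    flat: dict[str, Any] = {}
    for key, value in record.items():
        name = f"{parent}{sep}{key}" if parent else key
        if isinstance(value, dict):
            flat.update(flatten(value, name, sep))
        else:
            flat[name] = value
    return flat


def to_rows(response: dict[str, Any], network: str, dataset: str) -> list[dict[str, Any]]:
    """Pull the record list out of the response and tag each row with its network."""
    data = response.get("data") or {}          # GraphQL errors can leave "data" null
    records = (data.get(network) or {}).get(dataset) or []
    return [{"network": network, **flatten(record)} for record in records]


rows = to_rows(SAMPLE_RESPONSE, "ethereum", "dexTrades")
```

Each element of `rows` is a flat dictionary, so the list can be passed directly to `pandas.DataFrame(rows)`. `pandas.json_normalize` with `sep="_"` performs a similar flattening without a custom helper.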
Technologies Used:
- Python
- Bitquery GraphQL API
- PostgreSQL
- Pandas
- SQLAlchemy
- Requests
Key Features:
- Fetch transaction data automatically using API calls
- Transform JSON responses into clean structured datasets
- Store blockchain transactions in PostgreSQL tables
- Handle large data volumes efficiently with batching
- Support multi-network extraction (ETH, BSC, etc.)
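The batching and multi-network features can be sketched as follows. This is an illustration under stated assumptions, not the project's actual code: an in-memory SQLite database stands in for PostgreSQL so the example runs without a server, and the table layout, the network identifiers, and the names `batched`, `load`, `run`, and `fake_fetch` are all hypothetical.

```python
import sqlite3
from itertools import islice
from typing import Any, Callable, Iterable, Iterator

NETWORKS = ["ethereum", "bsc"]  # assumed network identifiers


def batched(rows: Iterable[Any], size: int) -> Iterator[list[Any]]:
    """Yield successive lists of at most `size` items."""
    iterator = iter(rows)
    while True:
        chunk = list(islice(iterator, size))
        if not chunk:
            return
        yield chunk


def load(conn: sqlite3.Connection, rows: Iterable[dict], batch_size: int = 500) -> int:
    """Insert rows in batches, one transaction per batch; return rows submitted."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS trades ("
        "network TEXT, tx_hash TEXT, amount REAL, "
        "PRIMARY KEY (network, tx_hash))"
    )
    submitted = 0
    for chunk in batched(rows, batch_size):
        with conn:  # commits on success, rolls back if the batch fails
            conn.executemany(
                "INSERT OR IGNORE INTO trades VALUES (:network, :tx_hash, :amount)",
                chunk,
            )
        submitted += len(chunk)
    return submitted


def run(
    conn: sqlite3.Connection,
    fetch: Callable[[str], Iterable[dict]],
    networks: Iterable[str] = NETWORKS,
    batch_size: int = 500,
) -> dict[str, int]:
    """Fetch and load each network in turn; `fetch` is supplied by the caller."""
    return {network: load(conn, fetch(network), batch_size) for network in networks}


def fake_fetch(network: str) -> list[dict]:
    """Stand-in for the API call; returns five made-up rows per network."""
    return [{"network": network, "tx_hash": f"0x{i}", "amount": float(i)} for i in range(5)]


conn = sqlite3.connect(":memory:")
counts = run(conn, fake_fetch, batch_size=2)
total = conn.execute("SELECT COUNT(*) FROM trades").fetchone()[0]
```

Committing once per batch keeps memory use bounded and limits how much work is lost if an insert fails partway through. With Pandas and SQLAlchemy against PostgreSQL, a comparable effect is available through the `chunksize` argument of `DataFrame.to_sql`.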