Architecture

Capability

This diagram provides a structured view of the capabilities and sub-capabilities involved in a cryptocurrency prediction application, or at least how we imagine it!

Info

All sub-capabilities outlined in red are not implemented in this Proof of Concept (POC).

Infrastructure - Overview

Data Collection

This capability involves gathering data from various sources, which is essential for making accurate predictions.

Sub-capabilities

  • Real-time Market Data Retrieval: Continuously collecting current price, volume, and other relevant market data from exchanges and financial platforms.

  • Historical Data Access: Accessing past market data to identify patterns and trends over time.

  • API Integration for Data Sources: Connecting with external APIs to fetch data from different exchanges, news platforms, social media, and more.

Data Processing

Once data is collected, it needs to be processed to make it suitable for analysis and model training.

Sub-capabilities

  • Data Cleaning and Normalization: Removing noise and inconsistencies in the data, and normalizing it to a standard format.

  • Feature Extraction and Selection: Identifying and selecting the most relevant features that will be used in model training (see the sketch after this list).

  • Data Transformation and Aggregation: Transforming data into a structured format and aggregating it to make it suitable for analysis.
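
To make the feature-extraction step more concrete, here is a minimal sketch in pandas; the indicators (daily return, moving average, rolling volatility) and column names are illustrative only, not necessarily the ones used in the POC.

import pandas as pd

def build_features(ohlc: pd.DataFrame) -> pd.DataFrame:
    """Derive a few illustrative features from an OHLC history (columns: open, high, low, close, volume)."""
    feats = pd.DataFrame(index=ohlc.index)
    feats["close"] = ohlc["close"]
    feats["return_1d"] = ohlc["close"].pct_change()              # daily return
    feats["ma_7"] = ohlc["close"].rolling(7).mean()              # 7-day moving average
    feats["volatility_7"] = feats["return_1d"].rolling(7).std()  # 7-day rolling volatility
    feats["volume"] = ohlc["volume"]
    return feats.dropna()                                        # drop rows with incomplete windows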

Model Training

Training predictive models using the processed data to forecast future cryptocurrency prices and trends.

Sub-capabilities

  • Selection of Machine Learning Algorithms: Choosing the appropriate machine learning algorithms for prediction (e.g., LSTM, Random Forest, SVM).

  • Training and Validation of Models: Training the models using historical data and validating their performance using a validation set.

  • Hyperparameter Tuning: Adjusting the model parameters to optimize performance (see the sketch below).
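
As a hedged illustration of model selection, validation and hyperparameter tuning, the sketch below runs a small grid search over a Random Forest regressor on placeholder data; the algorithm, parameter grid and split are examples only, not the exact setup used in the POC.

import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, train_test_split

# Placeholder features and target standing in for the processed market data
rng = np.random.default_rng(42)
X = rng.normal(size=(500, 5))
y = X @ rng.normal(size=5) + rng.normal(scale=0.1, size=500)

# Keep the chronological order when splitting time-series data
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, shuffle=False)

# Hyperparameter tuning: small grid search over a Random Forest regressor
search = GridSearchCV(
    RandomForestRegressor(random_state=42),
    param_grid={"n_estimators": [100, 300], "max_depth": [5, 10, None]},
    cv=3,
    scoring="neg_mean_squared_error",
)
search.fit(X_train, y_train)

print("best params:", search.best_params_)
print("validation R²:", search.best_estimator_.score(X_val, y_val))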

Prediction Generation

Generating predictions based on the trained models.

Sub-capabilities

  • Real-time Prediction Generation: Producing predictions in real-time based on current market data.

  • Batch Prediction Processing: Running predictions in batches for historical analysis or reporting.

  • Integration with Prediction APIs: Exposing prediction functionalities through APIs for other applications to consume.

User Interface

Providing an interface for users to interact with the application, view predictions, and customize their experience.

Sub-capabilities

  • Dashboard for Visualizing Predictions: Creating interactive dashboards to display predictions, trends, and insights.

  • Alerts and Notifications: Sending real-time alerts and notifications for significant market movements.

  • User Customization Options: Allowing users to customize their dashboards and set preferences for alerts and notifications.

Security

Ensuring that the application and data are secure from unauthorized access and breaches.

Sub-capabilities

  • Data Encryption and Protection: Encrypting data at rest and in transit to protect it from unauthorized access.

  • Secure API Access: Implementing security measures like API keys and authentication to secure API endpoints.

  • Anomaly Detection and Response: Detecting and responding to anomalies in data access and usage.

Scalability

Ensuring that the application can handle increased loads and grow as needed.

Sub-capabilities

  • Load Balancing and Resource Management: Distributing workload across multiple servers to ensure optimal performance.

  • Horizontal Scaling of Services: Adding more instances of services to handle increased demand.

  • Efficient Data Storage Solutions: Implementing scalable and efficient storage solutions to manage large volumes of data.

Monitoring and Maintenance

Regularly monitoring and maintaining the application to ensure it runs smoothly.

Sub-capabilities

  • Performance Monitoring: Continuously monitoring the performance of the application and its components.

  • Log Management and Analysis: Collecting and analyzing logs to identify and troubleshoot issues.

  • Regular Model Retraining and Updates: Regularly updating the models with new data to ensure their accuracy and relevance.

Our standard methodology has involved several key steps:

  • data collection
  • data loading and preprocessing (normalization, transformation)
  • model creation
  • model training
  • model prediction and evaluation
  • model deployment
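
The sketch below walks through these steps on a synthetic price series (MinMaxScaler normalization, a simple sliding-window transformation, training, then MSE and R² evaluation); it is illustrative only and uses a plain linear model rather than the models trained in the POC.

import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler

# 1. Data collection / loading: a synthetic series standing in for historical prices
rng = np.random.default_rng(0)
prices = np.cumsum(rng.normal(size=300)) + 100

# 2. Preprocessing: normalization and a simple sliding-window transformation
scaled = MinMaxScaler().fit_transform(prices.reshape(-1, 1)).ravel()
window = 10
X = np.array([scaled[i:i + window] for i in range(len(scaled) - window)])
y = scaled[window:]

# 3-5. Model creation, training, prediction and evaluation
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, shuffle=False)
model = LinearRegression().fit(X_train, y_train)
print("MSE (test):", mean_squared_error(y_test, model.predict(X_test)))
print("R² (test): ", r2_score(y_test, model.predict(X_test)))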

Architecture layers

Business (user process)

The user journey refers to the sequence of steps a user takes to interact with our application. In the context of a cryptocurrency prediction application with a FastAPI front end, the user journey can be broken down into several stages.

User Journey

Applicative

This diagram illustrates all the apps used in our application.

User Journey

Technology

When choosing infrastructure containers for a cryptocurrency prediction project, several factors come into play to ensure scalability, reliability, and efficiency.

This project can be viewed as a POC: we did not use orchestration with Kubernetes, but Docker and Docker Compose on a single VM.

This schema illustrates the containerized structure of the application.

Infrastructure - Overview

Streamlit

Streamlit is our frontend application. It is the only service attached solely to the public network, which it shares with the gateway API used to access the private services securely. It is exposed on port 8501.

Structure
RepoCrypto/frontend/
├── Dockerfile              # Streamlit container configuration
├── app.py                  # Streamlit entry point
├── requirements.txt        # Python dependencies
├── utils/
│   ├── __init__.py
│   ├── api_client.py      # API client functions
│   └── auth.py            # authentication functions
└── pages/
    ├── __init__.py
    ├── account.py         # manage the user's own account
    ├── administration.py  # manage user accounts
    ├── create_user.py     # create a new user
    ├── data_analysis.py   # data analysis of historical data
    ├── home.py            # home page
    ├── model.py           # model management
    └── predictions.py     # predictions visualization

Some pages, such as administration.py and create_user.py, are only accessible to admin users. The other pages are accessible to all users.

The frontend uses the user's role and token to manage access to the pages.

The only service the frontend communicates with is the gateway API. It is through this API that the frontend accesses the authentication / authorization services and the backend features.

In Streamlit we used Plotly to create the charts and display the data.
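
As a rough sketch of how a page could fetch data through the gateway and chart it with Plotly (the gateway URL, token handling and response columns here are assumptions for illustration, not the exact api_client.py implementation):

import pandas as pd
import plotly.express as px
import requests
import streamlit as st

GATEWAY_URL = "http://gateway:8000"  # hypothetical gateway address used for this sketch

def get_history(asset: str, token: str) -> pd.DataFrame:
    """Fetch an asset's OHLC history through the gateway API, passing the user's token."""
    resp = requests.get(
        f"{GATEWAY_URL}/crypto/asset_history/{asset}",
        headers={"Authorization": f"Bearer {token}"},
        timeout=10,
    )
    resp.raise_for_status()
    return pd.DataFrame(resp.json())

token = st.session_state.get("token", "")          # token stored at login time
history = get_history("BTC", token)
st.plotly_chart(px.line(history, x="date", y="close", title="BTC close price"))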

Below is the scheme of the Streamlit features exposed through the API Gateway endpoints.

Streamlit scheme

API management

In our project, we decided to have two FastAPI applications: one is only on a private network for security reasons, and the other is on both the private and public networks, acting as a bridge between the frontend (Streamlit), the private API and the other services.

Please see below an illustration of our API architecture

Infrastructure - Overview

  • Private API: The goal of our private API is to communicate with Airflow to trigger training and prediction tasks.
Prediction API
PredictionAPI/
├── app/
│   ├── __init__.py
│   ├── main.py          # FastAPI application entry point
│   ├── registry.py      # Prometheus metrics configuration
│   └── prediction/
│       ├── __init__.py
│       └── router.py    # Prediction endpoints and logic
├── Dockerfile          # Container configuration
├── gunicorn_conf.py   # Gunicorn server settings
├── start-reload.sh 
├── start.sh 
└── requirements.txt   # Python dependencies

The Dockerfile configures the containerized environment for the Prediction API.

  • It is used by the docker-compose file to start this service.
  • It exposes port 3001, uses gunicorn_conf.py to configure the Gunicorn server and requirements.txt to install the dependencies.

The Dockerfile is configured to use the start-reload.sh script to start the service in development mode (enabling hot reloading) and the start.sh script to start it in production mode.

gunicorn_conf.py configures the Gunicorn server that runs the FastAPI application (typically through Uvicorn worker processes). It manages the number of workers, the timeout, and other settings.
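
A minimal example of what such a configuration can look like (the values are illustrative, not the project's exact settings):

# gunicorn_conf.py - illustrative values only
import multiprocessing

bind = "0.0.0.0:3001"                           # port exposed by the Dockerfile
workers = multiprocessing.cpu_count() * 2 + 1   # number of worker processes
worker_class = "uvicorn.workers.UvicornWorker"  # ASGI workers so Gunicorn can serve FastAPI
timeout = 120                                   # seconds before an unresponsive worker is restarted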

main.py is the entry point of the FastAPI application. It initializes the application and sets up the necessary configuration: CORS middleware, rate limiting, the Prometheus metrics middleware, and the routers.
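
A stripped-down sketch of such an entry point, assuming the layout shown above (the CORS origins are placeholders; rate limiting and the metrics middleware are omitted here and discussed with registry.py below):

from fastapi import FastAPI
from fastapi.middleware.cors import CORSMiddleware

from app.prediction.router import router as prediction_router

app = FastAPI(title="Prediction API")

# CORS: placeholder origins only; restrict to the services that really need access
app.add_middleware(
    CORSMiddleware,
    allow_origins=["http://localhost:8501"],
    allow_methods=["*"],
    allow_headers=["*"],
)

# Register the prediction routes under /predict
app.include_router(prediction_router, prefix="/predict", tags=["prediction"])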

As FastAPI runs with multiple Gunicorn workers, requests are load balanced across the workers. This also means the registry is crucial for Prometheus metrics to be scraped properly.

registry.py configures the Prometheus metrics used in main.py and router.py. router.py defines the routes and the logic behind them.
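
A hedged sketch of a multiprocess-safe registry with prometheus_client (the label sets and the multiprocess directory are assumptions; the metric names match the ones listed under the /metrics endpoint below):

# registry.py - requires the PROMETHEUS_MULTIPROC_DIR environment variable to point
# to a writable directory shared by all Gunicorn workers
from prometheus_client import CollectorRegistry, Counter, Histogram, multiprocess

registry = CollectorRegistry()
multiprocess.MultiProcessCollector(registry)   # aggregates the metrics written by every worker

REQUEST_COUNT = Counter(
    "prediction_api_request_count", "Total requests", ["method", "endpoint", "status"]
)
REQUEST_LATENCY = Histogram(
    "prediction_api_request_latency_seconds", "Request timing", ["endpoint"]
)

# main.py can then expose the aggregated registry, e.g.:
#   from prometheus_client import CONTENT_TYPE_LATEST, generate_latest
#   @app.get("/metrics")
#   def metrics():
#       return Response(generate_latest(registry), media_type=CONTENT_TYPE_LATEST)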

Endpoints

How requests are handled:

graph LR
    A[Client Request] --> B[Docker Container]
    B --> C[Gunicorn]
    C --> D[FastAPI App]
    D --> E[Route Handlers]

We have several endpoints:

  • GET /metrics

    • Response: Prometheus metrics in text format
    • This endpoint provides monitoring metrics including:
      • prediction_api_request_count: Total requests
      • prediction_api_request_latency_seconds: Request timing
      • prediction_api_exception_count: Error tracking
      • prediction_api_prediction_count: Prediction usage
      • prediction_api_model_score: Model performance
  • GET /predict/latest-prediction

    • Response: JSON object containing the latest prediction
    • It uses the prediction saved in the database to avoid calling the model unnecessarily.
  • GET /predict/model-evaluation

    • Response: JSON object containing the model evaluation (MSE (train/test) and R² score (train/test))
    • It uses the evaluation saved in the database to avoid calling the model unnecessarily.
  • GET /predict/models

    • Response: JSON object containing the list of available models
    • It uses MLflow client to get the list of models.
  • GET /predict/best-model

    • Response: JSON object containing the best model
    • It uses the best models in the database to return the best model based on the MSE.
  • POST /predict/train
  • POST /predict/score
  • POST /predict/predict
    • These endpoints trigger the training, scoring and prediction tasks in Airflow (see the sketch below).

The endpoints work with:

  • PostgreSQL database
  • MLflow
  • Airflow
  • Prometheus
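
The Airflow-facing endpoints above can be implemented by calling Airflow's stable REST API; a hedged sketch follows (the Airflow host, credentials and DAG id are placeholders, not the project's actual values):

import requests

AIRFLOW_URL = "http://airflow-webserver:8080"   # hypothetical service name on the private network

def trigger_dag(dag_id: str, conf: dict | None = None) -> dict:
    """Trigger a DAG run through Airflow's stable REST API (POST /api/v1/dags/{dag_id}/dagRuns)."""
    resp = requests.post(
        f"{AIRFLOW_URL}/api/v1/dags/{dag_id}/dagRuns",
        json={"conf": conf or {}},
        auth=("airflow", "airflow"),            # placeholder credentials
        timeout=10,
    )
    resp.raise_for_status()
    return resp.json()

# e.g. the POST /predict/train handler could call trigger_dag("train_model")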

Gateway API

The goal of our gateway API is to act as a bridge between the frontend, the private API and other services, for security reasons.

Structure
PredictionAPI/
├── app/
│   ├── __init__.py
│   ├── main.py          # FastAPI application entry point
│   ├── database.py      # Database configuration
│   ├── authentication/
│   │   ├── __init__.py
│   │   ├── security.py  # Token generation and verification functions
│   │   ├── utils.py     # Password hashing and verification functions
│   │   └── router.py    # Authentication endpoints and logic
│   ├── crypto/
│   │   ├── __init__.py
│   │   └── router.py    # Crypto endpoints and logic
│   └── prediction/
│       ├── __init__.py
│       └── router.py    # Prediction endpoints and logic
├── Dockerfile          # Container configuration
├── gunicorn_conf.py   # Gunicorn server settings
├── start-reload.sh
├── start.sh
└── requirements.txt   # Python dependencies

The architecture is quite similar to the private API's, but with some differences: it includes authentication and authorization mechanisms based on user roles, passwords and tokens. These enable us to protect sensitive and critical endpoints.
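
As an illustration of how such role- and token-based protection can be wired in FastAPI (this sketch uses PyJWT and placeholder claims; the actual security.py may differ in its details):

import jwt                                  # PyJWT; the project may use another JWT library
from fastapi import Depends, HTTPException, status
from fastapi.security import OAuth2PasswordBearer

SECRET_KEY = "change-me"                    # placeholder secret for this sketch
oauth2_scheme = OAuth2PasswordBearer(tokenUrl="/auth/login")

def get_current_user(token: str = Depends(oauth2_scheme)) -> dict:
    """Decode the bearer token and return its claims (username, role, ...)."""
    try:
        return jwt.decode(token, SECRET_KEY, algorithms=["HS256"])
    except jwt.PyJWTError:
        raise HTTPException(status_code=status.HTTP_401_UNAUTHORIZED, detail="Invalid token")

def require_admin(user: dict = Depends(get_current_user)) -> dict:
    """Allow the request only if the token carries the admin role."""
    if user.get("role") != "admin":
        raise HTTPException(status_code=status.HTTP_403_FORBIDDEN, detail="Admin role required")
    return user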

The crypto folder includes the endpoints to get the list of available cryptocurrencies, add new cryptocurrencies based on what is available on Kraken (our data provider), delete cryptocurrencies, and get current prices or historical data. The prediction folder contains endpoints that query the private API to get predictions and models, and to trigger training, scoring and prediction tasks.
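
For the Kraken-facing endpoints, the gateway can rely on Kraken's public REST API; a minimal sketch follows (the helper names are ours, and the real router may filter or reshape the responses differently):

import requests

KRAKEN_URL = "https://api.kraken.com/0/public"

def kraken_assets() -> list[str]:
    """Return the asset codes exposed by Kraken (backs an endpoint like GET /crypto/kraken_assets)."""
    resp = requests.get(f"{KRAKEN_URL}/Assets", timeout=10)
    resp.raise_for_status()
    return list(resp.json()["result"].keys())

def kraken_daily_ohlc(pair: str = "XBTUSD") -> list[list]:
    """Fetch daily OHLC candles for a trading pair from Kraken's public API."""
    resp = requests.get(f"{KRAKEN_URL}/OHLC", params={"pair": pair, "interval": 1440}, timeout=10)
    resp.raise_for_status()
    result = resp.json()["result"]
    # result maps the resolved pair name to [time, open, high, low, close, vwap, volume, count] rows
    return next(v for k, v in result.items() if k != "last")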

Endpoints

The list of all endpoints:

  • POST /auth/signup : Register a new user
  • POST /auth/login (not protected) : User login
  • GET /auth/users/me : Check the current user's account
  • PUT /auth/user/me : Modify the current user's profile information
  • DELETE /auth/users/{username} : Delete a user (admin only)
  • PUT /auth/users/{username}/role : Modify a user's role (admin only)
  • GET /auth/users : Get the list of users (admin only)
  • GET /crypto/assets : Get all assets in the database
  • POST /crypto/assets : Add a new asset
  • GET /crypto/asset_history/{asset} : Get an asset's OHLC value history to date
  • DELETE /crypto/assets/{asset_id} : Delete an asset
  • GET /crypto/kraken_assets : Get the list of assets from the provider
  • GET /crypto/asset_latest/{asset} : Get the latest OHLC value from the database
  • GET /prediction/latest-prediction : Get the last prediction for an asset (BTC)
  • GET /prediction/model-evaluation : Get the last model evaluation for an asset (BTC)
  • GET /prediction/best-model : Get the prediction of the best model for an asset (BTC)
  • GET /prediction/models : List all model experiments
  • POST /prediction/train : Trigger the Airflow training DAG
  • POST /prediction/score : Trigger the Airflow scoring DAG
  • POST /prediction/predict : Trigger the Airflow prediction DAG

The endpoints work with:

  • Private API
  • PostgreSQL database
  • Airflow