Data Workshop

🛠️ Data Workshop Overview

The Data Workshop is a comprehensive platform designed for creating data-driven projects to solve specific organizational challenges, such as Machine Learning models, custom APIs, ETL processes, and more. This workspace provides versatile tools, empowering data teams to innovate and deploy solutions efficiently. ARPIA’s intuitive interface, along with its vast capabilities, enables organizations to harness the full potential of their data assets, addressing diverse business needs.


ARPIA Machine Learning Development Environment


🧩 Key Objects in the Data Workshop

  1. Extract and Load:
    Extract data from sources like databases and load it into destinations, such as data warehouses or applications.

  2. Repository Table:
    Store metadata or manage information about other repository objects.

  3. Transform and Prepare:
    Clean, structure, and transform data for further analysis, filtering, format changes, or merging data from multiple sources.

  4. AI and Machine Learning:
    Build models for classification, prediction, clustering, and pattern detection using AI and machine learning techniques.

  5. Visual Objects:
    Leverage ARPIA’s visualization engine to create insightful data visualizations and analytics.

  6. Data Models (Kubes):
    Logical representations of data structure, including tables, relationships, and attributes.

  7. DataApps:
    Build interactive applications to consume processed data, enhancing user interaction and data utilization.

  8. App Droplets Object:
    Add modules or components to applications that offer specific functionalities like visualizations or data processing.


💻 Development Environments

The ARPIA Data Workshop supports multiple development environments tailored to project needs:

  • PHP:
    Develop applications and APIs, particularly suited for web applications and ETL processes.
  • Python:
    Ideal for building Machine Learning models and performing advanced data analysis tasks.
  • WAC (Web Application Container):
    Deploy and manage web applications persistently, running them as servers.

🚀 Serverless Objects (Containers)

Create and deploy serverless applications that run continuously, providing custom solutions or APIs accessible to end-users across the organization.


⚙️ Computing Resources

ARPIA utilizes Docker technology for efficient processing and application containers:

  • Shared Container Resources:
    Cost-effective computing resources shared among users, ideal for data-driven projects.
  • Dedicated Container Resources:
    Exclusive resources allocated to an organization, ensuring performance and scalability tailored to enterprise needs.

🎛️ Development Interface

Each object in the Data Workshop offers a powerful development interface:

  • Global Files:
    Share libraries and functions across project objects for enhanced development efficiency and code reuse.

  • Connecting to Data Repository:
    Manage tables and datasets essential for project execution through direct access to the project's data repository.

  • Running and Scheduling Projects:
    Execute projects manually or schedule them to run automatically based on predefined conditions.

  • Cloning Projects and Objects:
    Clone projects and objects to reuse code and adapt projects to different data repositories.

  • Dynamic Project Execution:
    Configure project parameters dynamically to optimize workload and performance based on organizational needs.


By utilizing the Data Workshop, organizations can effectively develop, deploy, and manage specialized solutions, unlocking the full potential of their data assets.