This repository provides the base service functionality for Assemblyline 4 services.
An Assemblyline service has the following file structure:
assemblyline-service-<service name>
│
├── Dockerfile
├── <service name>.py
└── service_manifest.yml
This is an overview of what each of these does:
Dockerfile ─ Build file for the service container, see Dockerfile section below for more details
<service name>.py ─ Contains main service code
service_manifest.yml ─ Service definition file, see Service manifest section below for more details
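As a quick sanity check, the layout above can be verified with a small script. This is a minimal sketch, not part of the Assemblyline tooling: the helper name `missing_service_files` is hypothetical, and it assumes the main service code is any `.py` file at the top level of the service directory.

```python
from pathlib import Path

# File names taken from the layout described above.
REQUIRED_FILES = ("Dockerfile", "service_manifest.yml")


def missing_service_files(service_dir):
    """Return the expected entries that are absent from service_dir.

    Hypothetical helper for illustration; not part of assemblyline-v4-service.
    """
    root = Path(service_dir)
    missing = [name for name in REQUIRED_FILES if not (root / name).is_file()]
    # Assumption: the main service code is a top-level *.py file.
    if not any(root.glob("*.py")):
        missing.append("<service name>.py")
    return missing
```

An empty list means all three entries were found.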
Every service must have a service_manifest.yml file in its root directory. The manifest file presents essential information about the service to the Assemblyline core system, information the system must have before it can run the service.
The example below shows all the elements that the manifest file can contain, with a brief description of each.
# Name of the service
name: ResultSample
# Version of the service
version: 1
# Description of the service
description: >
  ALv4 Result example service
  This service provides examples of how to:
    - define your service manifest
    - use the different section types
    - use tags
    - use heuristics to score sections
    - use the att&ck matrix
    - use the updater framework
    - define submission parameters
    - define service configuration parameters
# Regex defining the types of files the service accepts and rejects
accepts: .*
rejects: empty|metadata/.*
# At which stage the service should run (one of: FILTER, EXTRACT, CORE, SECONDARY, POST)
# NOTE: Stages are executed in the order defined in the list
stage: CORE
# Which category the service is part of (one of: Antivirus, Dynamic Analysis, External, Extraction, Filtering, Networking, Static Analysis)
category: Static Analysis
# Does the service require access to the file to perform its task
# If set to false, the service will only have access to the file metadata (e.g. Hashes, size, type, ...)
file_required: true
# Maximum execution time the service has before it's considered to be timed out
timeout: 60
# Does the service force the caching of results to be disabled
# (only use for services that will always provide different results on each run)
disable_cache: false
# Is the service enabled by default
enabled: true
# Does the service make API calls to other products that are not part of the Assemblyline infrastructure (e.g. VirusTotal, ...)
is_external: false
# Number of concurrent services allowed to run at the same time
licence_count: 0
# Service configuration block (dictionary of config variables)
# NOTE: The key names can be anything and the values can be of any type
config:
  str_config: value1
  int_config: 1
  list_config: [1, 2, 3, 4]
  bool_config: false
# Submission params block: a list of submission param objects that define parameters
# that the user can change about the service for each of its scans
# supported types: bool, int, str, list
submission_params:
  - default: ""
    name: password
    type: str
    value: ""
  - default: false
    name: extra_work
    type: bool
    value: false
# Service heuristics block: a list of heuristic objects that define the heuristics used by the service
heuristics:
  - description: This is the first Heuristic for ResultSample service.
    filetype: pdf
    heur_id: AL_RESULTSAMPLE_1
    name: Masks has PDF
    score: 100
    attack_id: T1001
  - description: This is the second Heuristic for ResultSample service.
    filetype: exe
    heur_id: AL_RESULTSAMPLE_2
    name: Drops an exe
    score: 1000
  - description: This is the third Heuristic for ResultSample service.
    filetype: exe
    heur_id: AL_RESULTSAMPLE_3
    name: Extraction information
    score: 0
# Docker configuration block which defines:
# - the name of the docker container that will be created
# - cpu and ram allocation by the container
docker_config:
  image: cccs/assemblyline-service-resultsample:latest
  cpu_cores: 1.0
  ram_mb_min: 128
  ram_mb: 256
# Update configuration block
update_config:
  # List of source objects from which to fetch files for the update, and the name each file will have on disk
  sources:
    - uri: https://file-examples.com/wp-content/uploads/2017/02/zip_2MB.zip
      name: sample_2mb_file
    - uri: https://file-examples.com/wp-content/uploads/2017/02/zip_5MB.zip
      name: sample_5mb_file
  # Interval in seconds at which the updater runs
  update_interval_seconds: 300
  # Should the downloaded files be used to create signatures in the system
  generates_signatures: false
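To illustrate how the `accepts` and `rejects` patterns from the manifest interact, here is a minimal sketch. It assumes that a file type is handled when it matches `accepts` and does not match `rejects`, and that both patterns are applied to the whole type string; the matching logic in the core system may differ. The function name `service_handles` and the sample file types are illustrative only.

```python
import re


def service_handles(file_type, accepts=".*", rejects="empty|metadata/.*"):
    """Return True if file_type matches accepts and does not match rejects."""
    if not re.fullmatch(accepts, file_type):
        return False
    # An empty or missing rejects pattern rejects nothing.
    if rejects and re.fullmatch(rejects, file_type):
        return False
    return True


print(service_handles("document/pdf"))     # True
print(service_handles("empty"))            # False
print(service_handles("metadata/sysmon"))  # False
```

With the defaults taken from the manifest above, every type is accepted except `empty` and anything under `metadata/`.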
A Dockerfile is required to build the service container that will be executed in the system.
The following items must be set for all services:
- All services must be based on the cccs/assemblyline-v4-service-base:latest image
- An environment variable must be set for the service path
- Install any service requirements
- Copy the service code into /opt/al_service/
FROM cccs/assemblyline-v4-service-base:latest
# Set the service path
ENV SERVICE_PATH result_sample.ResultSample
# By default, the base service container runs as the assemblyline user;
# switch to root to install dependencies
USER root
# Note that all of these operations are run in one line to reduce
# the number of container layers and the size of the container
RUN apt-get update && apt-get install -y my_debian_apt_dependency_package && rm -rf /var/lib/apt/lists/*
RUN pip install --no-cache-dir --user my_pip_dependency && rm -rf ~/.cache/pip
# Change to the assemblyline user to make sure your service does not run as root
USER assemblyline
# Copy the service code in the service directory
WORKDIR /opt/al_service
COPY assemblyline_result_sample_service .
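The `SERVICE_PATH` value is a dotted path: the last component names the service class and the prefix names the module that contains it. The following sketch shows how such a path can be resolved with the standard library. It illustrates the convention only and is not the loader used by the base image; `load_class` is a hypothetical name.

```python
import importlib


def load_class(dotted_path):
    """Split 'package.module.ClassName' and return the named attribute."""
    module_name, _, class_name = dotted_path.rpartition(".")
    if not module_name:
        raise ValueError(f"expected 'module.ClassName', got {dotted_path!r}")
    module = importlib.import_module(module_name)
    return getattr(module, class_name)


# Demonstrated with a standard-library class, since result_sample is only
# importable when the service code is on the Python path.
ordered_dict_cls = load_class("collections.OrderedDict")
```

Under this convention, `result_sample.ResultSample` would refer to the `ResultSample` class in a module named `result_sample`.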
To test an Assemblyline service in standalone mode, use the run_service_once.py script to run a single task through the service. The script does not require a working Assemblyline installation; all you need are the Assemblyline Python libraries.
NOTE: The following environment setup has only been tested on Ubuntu 20.04.
- Install required packages
  sudo apt-get install build-essential libffi-dev python3.7 python3.7-dev python3-pip automake autoconf libtool
- Install Assemblyline v4 service package
  pip install --no-cache-dir --user assemblyline-v4-service
- Add your service development directory path (e.g. /home/ubuntu/assemblyline-v4-service) to the PYTHONPATH environment variable
- Ensure the current working directory is the root of the service directory of the service to be run
  cd assemblyline-service-<service name>
- From a terminal, run the run_service_once script, where <service path> is the path to the service module and <file path> is the path of the file to be processed
  python3.11 -m assemblyline_v4_service.dev.run_service_once <service path> <file path>
- The output of the service (result.json and extracted/supplementary files) will be written to a directory alongside the input file
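The steps above can also be scripted. The sketch below only builds the command line and environment for `run_service_once`; it does not run anything itself. The helper name `build_run_command` is hypothetical, and using `sys.executable` instead of a specific interpreter version is an assumption.

```python
import os
import sys


def build_run_command(service_path, file_path, dev_dir, base_env=None):
    """Return (argv, env) for invoking run_service_once on one file."""
    env = dict(os.environ if base_env is None else base_env)
    # Prepend the service development directory to PYTHONPATH.
    existing = env.get("PYTHONPATH")
    env["PYTHONPATH"] = dev_dir if not existing else dev_dir + os.pathsep + existing
    argv = [
        sys.executable,
        "-m",
        "assemblyline_v4_service.dev.run_service_once",
        service_path,
        file_path,
    ]
    return argv, env
```

The returned values can be passed to `subprocess.run(argv, env=env, cwd=...)`, with `cwd` set to the root of the service directory.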
Change working directory to root of the service:
cd assemblyline_result_sample_service
-
From a terminal, run the
run_service_once
scriptpython3.11 -m assemblyline_v4_service.dev.run_service_once assemblyline_result_sample_service.result_sample.ResultSample /home/ubuntu/testfile.doc
-
The
results.json
and any extracted/supplementary files will be outputted to/home/ubuntu/testfile_resultsample
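Judging from the single example above, the output directory appears to be named after the input file (without its extension), followed by an underscore and the lowercased service name. The sketch below encodes that inferred convention and loads the result file. The naming rule, the `result.json` file name (taken from the earlier step) and both helper names are assumptions drawn from this example, not documented behaviour.

```python
import json
from pathlib import Path


def expected_output_dir(file_path, service_name):
    """Guess the output directory for a run, based on the example above."""
    source = Path(file_path)
    return source.with_name(f"{source.stem}_{service_name.lower()}")


def load_result(file_path, service_name):
    """Read result.json from the guessed output directory."""
    result_file = expected_output_dir(file_path, service_name) / "result.json"
    with result_file.open() as handle:
        return json.load(handle)


print(expected_output_dir("/home/ubuntu/testfile.doc", "ResultSample").as_posix())
# /home/ubuntu/testfile_resultsample
```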