Triton Perf

Setup Instructions

Start Triton server
Update Triton GRPC host and port in config.env
Set the preferred concurrency number in config.env
Run python preprocessing.py
Run make app
To benchmark run curl -v localhost:8000/benchmark -d '{}'
Update the input_file variable (on line 87) in postprocessing.py to the output file path from step 4
Run python postprocessing.py to generate the cleaned_output.csv file that has accuracy metrics