last update: 22-07-2025
This text is largely similar to the R AutoTests documentation, but adapted specifically for Python. The main difference is calling the tests in Python instead of R. It also shows the helper functions that are defined so that the Python tests run more cleanly.
AutoTests are used to automatically grade assignments that students submit to CodeGrade. Each task can have a separate test that is linked to a specific rubric item in the AutoTest configuration, so that partial grading is possible. This documentation gives a brief overview of the current configuration of AutoTests for exercises and assignments of the Geoscripting course at Wageningen University.
In each exercise and assignment there is a separate test folder that contains the test scripts, which will be used by CodeGrade to test the student submissions.
The general idea of AutoTests is to run test scripts that execute the student submissions and check the results. If scripts fail or the results do not match the solutions, only partial or no points will be given in the rubric.
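This loop can be sketched in Python. The sketch below is a hypothetical illustration of the idea only (`run_test` is an invented helper, not part of CodeGrade); what CodeGrade actually inspects is the exit code of each test script:

```python
import subprocess
import sys

def run_test(command):
    """Run one test script; exit code 0 means the linked rubric item passes."""
    result = subprocess.run(command, capture_output=True, text=True)
    return result.returncode == 0

# stand-ins for real test scripts: one passing and one failing check
passing = run_test([sys.executable, "-c", "assert 1 + 1 == 2"])
failing = run_test([sys.executable, "-c", "assert 1 + 1 == 3"])
print(passing, failing)  # True False
```

A failed assertion makes the script exit non-zero, which is all the grader needs to see.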
- CodeGrade configurations are accessible via the exercises on Brightspace, which link to the CodeGrade page. The AutoTest can be implemented or found via the `AutoTest` tab: `Brightspace Exercise 7 > CodeGrade Exercise 07 > AutoTest`.
- Create a new AutoTest v2 from scratch or copy an existing configuration from previous assignments.
- Drag & drop the different cells into the window to build the test. Keep it simple and use `Upload Files`, `Allow Internet`, `Connect Rubric` and `Script`. The actual tests are bash, R or Python scripts that can be run as bash commands via `Script`.
CodeGrade will build a new testing Ubuntu environment every time the AutoTest Setup configuration changes. This means that CodeGrade checks if the code in each cell has changed since the last snapshot and runs the code again if necessary. Once the AutoTest configuration including the setup works, the snapshot is used when students submit assignments.
- Create a `.sh` install requirements file and a `.yaml` environment file containing all necessary Python dependencies for the current exercise. These files will be used in the next step to set up the CodeGrade Ubuntu environment for Python. They should be uploaded into the `test` folder of each solution repository.

`install_requirements.sh`:

```bash
sudo apt-get update
sudo apt-get install -y curl
# install micromamba
curl -Ls https://micro.mamba.pm/api/micromamba/linux-64/latest | tar -xvj bin/micromamba
sudo mv bin/micromamba /usr/local/bin/
sudo chmod +x /usr/local/bin/micromamba
eval "$(micromamba shell hook --shell bash)"
# initialize micromamba shell
micromamba shell init -s bash -r ~/micromamba
# create and activate environment
micromamba env create -f $UPLOADED_FILES/test/environment.yaml -y
micromamba activate environment
```

`environment.yaml`:

```yaml
name: environment
channels:
  - conda-forge
dependencies:
  - python=3.12
  - spyder
```

**Important:** As with R, keep the build time in mind when setting up the Python environment on the CodeGrade VM. The preferred package manager is therefore micromamba, as it is considerably faster than mamba/conda, and the absolute bare minimum of dependencies is recommended.
- Next, use `Script` to set up the environment on the Ubuntu VM, setting two environment variables: `GITLAB_USER_NAME` and `GITLAB_USER_PAT`. These are needed in the next step for cloning repositories via HTTPS using a personal access token. The cloned repository contains the assignment solution and all tests within a `test` folder.

```bash
export GITLAB_USER_NAME=geoscripting-<year>
export GITLAB_USER_PAT=<personal_access_token>
```

Note: files are uploaded into the directory `/home/codegrade/fixtures`, which is referenced by the CodeGrade environment variable `$UPLOADED_FILES`. In earlier CodeGrade versions this is the same as `$FIXTURES`.
- The next step is to install git and clone the solution repository.
- Install the requirements defined by `install_requirements.sh`.

CodeGrade `Script` cell:

```bash
# load setup vars
export GITLAB_USER_NAME=geoscripting-2024
export GITLAB_USER_PAT=example
# set up git and clone solution repo with tests
sudo apt update && sudo apt install -y git
YEAR=$(date +%Y)
NR='07'
git clone https://$GITLAB_USER_NAME:$GITLAB_USER_PAT@git.wur.nl/geoscripting-$YEAR/staff/exercise-$NR/exercise-$NR-solution.git $UPLOADED_FILES/tmp
mv $UPLOADED_FILES/tmp/test $UPLOADED_FILES/ && rm -rf $UPLOADED_FILES/tmp
# install requirements
chmod +x $UPLOADED_FILES/test/install_requirements.sh
bash $UPLOADED_FILES/test/install_requirements.sh
# dummy echo command to force CodeGrade to rerun cell
echo "Done."
```

This is where the actual tests are implemented. This section is the main difference with how tests are run in R.
The steps are:

- Drag & drop a `Rubric` cell into the window so that points are assigned to a specific rubric item.
- Make sure to `Allow Internet` if necessary. This could also be outside the `Rubric` cell or in any other order.
- Create a bash script with the code that runs the tests corresponding to the rubric item.

The next section will explain this in more detail.
Some tests are the same for all exercises, for example checking the git usage. This is done using a bash script, which can be called with:

```bash
# use bash test script
bash $UPLOADED_FILES/test/git_use.sh
```

In Geoscripting we make use of CodeGrade to do some testing before we even start grading.
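The contents of `git_use.sh` live in the solution repository and are not reproduced here. Purely as an illustration of what such a check might do, here is a hypothetical commit-count sketch in Python (the student path and the threshold of 2 commits are invented for this example):

```python
import subprocess

def count_commits(repo_path="."):
    """Count commits in a repository via `git rev-list`; 0 if there is no usable history."""
    try:
        result = subprocess.run(
            ["git", "rev-list", "--count", "HEAD"],
            cwd=repo_path, capture_output=True, text=True,
        )
    except FileNotFoundError:  # git not installed, or repo_path does not exist
        return 0
    if result.returncode != 0:  # not a git repository, or no commits yet
        return 0
    return int(result.stdout.strip())

if __name__ == "__main__":
    # hypothetical requirement: students should commit in more than one step;
    # a real check would also exit non-zero here so CodeGrade fails the rubric item
    if count_commits("/home/codegrade/student") < 2:
        print("Please commit your work in several steps, not all at once.")
```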
In `./test_exc08.py` you will find an example of how to set up tests for exercise 8: https://git.wur.nl/geoscripting-2025/staff/exercise-8/exercise-8-solution.
We will go through the code in this markdown for your understanding.
In the following code we set up the test so it can easily be called from the command line in CodeGrade. We need to do this because in CodeGrade we have to point at the correct student answer directory, making sure that the code being run is not the model answer but the student's actual answer.
```python
# append student path
import sys
import argparse

parser = argparse.ArgumentParser()
parser.add_argument('--student_path', type=str, default='/home/codegrade/student',
                    help='Path to student answer directory')
parser.add_argument('--test', type=int, choices=[1, 2],
                    help='Select test to run (1: geolocator, 2: calc_distances)')
args = parser.parse_args()
sys.path.append(args.student_path)
```

For this we use `argparse`, a standard-library module for defining command-line arguments. Here it makes the Python file callable with two command-line arguments, `--student_path` and `--test`, which makes it easy to configure the test as we want. After setting it up like this, we can call the test file as follows:

```bash
python test_exc08.py --student_path /home/codegrade/student --test 1
```

The first test is run on the answer saved in `/home/codegrade/student`, the path where the submission is stored in CodeGrade. Selecting the correct test matters because it lets us run only part of the tests, specifically the test that corresponds to the rubric item.
For testing locally while developing, you might want to create a local folder with some answers in it. By changing the answers to something wrong, you can check whether the test catches it.
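One way to do that is to generate a deliberately wrong submission and point the test at it. This is a hypothetical local setup: the `fake_student` folder and the broken `geolocator` are invented for illustration, and the script assumes `test_exc08.py` is in the current directory:

```python
import pathlib
import subprocess
import sys

# build a fake student package with a deliberately wrong answer
pkg = pathlib.Path("fake_student") / "distanceCalculator"
pkg.mkdir(parents=True, exist_ok=True)
(pkg / "__init__.py").write_text(
    "def geolocator(cityname):\n"
    "    return (0.0, 0.0)  # deliberately wrong coordinates\n"
)

# run the test against the fake submission; it should exit non-zero
result = subprocess.run(
    [sys.executable, "test_exc08.py", "--student_path", "fake_student", "--test", "1"]
)
print("exit code:", result.returncode)
```

A non-zero exit code here confirms the test catches the wrong answer.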
The `try_function` helper defined below is a simple way to run a test function and catch any assertion errors that occur:

```python
def try_function(fun):
    try:
        fun()
    except AssertionError as E:
        print(E)
        sys.exit(1)
```

This serves several purposes. We don't want to show the student an ugly stack trace; instead we show a clear error message that we define ourselves. This way we also make sure we don't give away the answer, and in the future we might use it to give hints for specific mistakes.
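The hint idea could be sketched by extending `try_function` with an optional `hint` argument. This extension is hypothetical and not part of the current test suite:

```python
import sys

def try_function(fun, hint=None):
    """Run a test function; on failure, print the assertion message and an optional hint, then exit 1."""
    try:
        fun()
    except AssertionError as error:
        print(error)
        if hint is not None:
            print("Hint:", hint)
        sys.exit(1)

def failing_test():
    assert 1 + 1 == 3, "The sum is not correct."

# try_function(failing_test, hint="Check the '+' operator in your function.")
# would print both messages and exit with status 1
```

Because the hint is passed per test, specific mistakes can get specific guidance without changing the test functions themselves.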
The following code imports the student's answers:

```python
# import student module
import distanceCalculator
```

And now, finally, the actual testing. Below are two tests that check some functionality.
```python
def test_geolocator():
    assert distanceCalculator.geolocator(cityname="Wageningen") == (51.9685817, 5.668298061827958), \
        "The location Wageningen was not geocoded correctly, there is an issue somewhere."

def test_calc_distances():
    # round to the nearest 10 km
    distance = round(distanceCalculator.calc_distances(cities=['Wageningen', 'Brussels', 'London']), -1)
    assert distance == 7100, \
        "The distance from Wageningen to Brussels to London is not correct, there is an issue somewhere."
```

The first function tests whether the geolocator function returns the correct coordinates for Wageningen. Note that we define the coordinates manually instead of running the solution; we can do this because we know Wageningen will not move any time soon.
The second function tests whether the distance calculation is correct for the route Wageningen - Brussels - London. The distance is rounded to the nearest 10 km, since different geocoding services might return slightly different distances.
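For reference, rounding to the nearest 10 is done by passing a negative second argument to Python's built-in `round`:

```python
# round(x, -1) rounds to the nearest multiple of 10, absorbing
# small differences between geocoding services
print(round(7098.4, -1))  # 7100.0
print(round(7103.9, -1))  # 7100.0
print(round(7106.2, -1))  # 7110.0
```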
Finally we run the tests, wrapping the test functions in `try_function` so we can catch assertion errors and print a clear error message, as explained above:

```python
if args.test is None or args.test == 1:
    try_function(test_geolocator)
if args.test is None or args.test == 2:
    try_function(test_calc_distances)
```

I hope all is clear! If not, reach out!
Notes:

- CodeGrade will not fetch updates from a Git repository if the code within a cell has not changed since the previous snapshot. Simply add an `echo` command to force CodeGrade to clone an updated repository, since this triggers the cell to re-run.
- Commands that require `sudo` privileges only work in cells of the `Setup` tab.
- It is possible to upload e.g. a deliberately wrong test submission to check whether the tests work. Upload a `.zip` file containing scripts and subfolders via `General > Upload submission > Test submission > <upload_zip_file>`. Note that the results of different test submissions will be displayed below each other in the same snapshot.
- Student or test submissions are uploaded into the directory `/home/codegrade/student`.
- The final directory should look something like this:
```
/home/codegrade/
|-- fixtures                        # $UPLOADED_FILES
|   |-- test                        # test folder from solution
|       |-- git_use.sh              # test - git use
|       |-- helper.R                # test - helper functions
|       |-- install_requirements.R  # R requirements
|       |-- install_requirements.sh # general requirements (e.g. GDAL)
|       |-- test_exc08.py           # tests
|-- student                         # student or test submission
    |-- output
    |   |-- output_image.png
    |-- distanceCalculator
    |   |-- __init__.py             # the student's Python answers
    |-- LICENSE
    |-- README.md
```
- To check the final AutoTest configuration, just click on `Build Snapshot`. This will set up the Ubuntu VM as configured with `Setup` and run all tests specified in `Tests`. Remember that `Setup` will be skipped if there are no changes relative to a previous snapshot. Snapshots can be found and inspected in the `Snapshots` tab.
This is where all compiled AutoTest configurations can be found. Each snapshot shows the configuration and the output of the tests. If everything worked and all tests passed, the snapshot can be published to the students by clicking on the specific snapshot and confirming `Publish to students`. Students will then be able to see test results when submitting assignments.