-
Notifications
You must be signed in to change notification settings - Fork 461
VboxApps
BOINC supports applications that run in VirtualBox virtual machines. This provides two benefits:
- You don't need to build app versions for different platforms. You can develop your app in your environment of choice (say, Debian Linux), and then bundle the resulting executable together with a virtual machine image containing an appropriate runtime environment. The application can then be run on all platforms (Windows, Mac OS X, all versions of Linux) with no additional work on your part.
- Virtual machines provide the strongest available security sandbox; a program running in a virtual machine cannot access or modify the host system. This makes it feasible to deploy untrusted applications.
BOINC's support for VM apps is based on a program called vboxwrapper that interfaces between the BOINC client and the VirtualBox system.
- Commercial operating systems like Windows and Mac OS X have a license with a pay-per-user clause, so in general you can't use them in the VM image. Similarly, you can't include pay-per-user software such as Matlab in the VM image.
- VirtualBox runs only on Intel-compatible processors. If you want to support other processors (such as ARM, SPARC, etc.), you'll need to use non-VM-based app versions.
- Currently you can't run GPU applications in VirtualBox VMs. This may change in the future.
A VM image is called 32- or 64-bit depending on the operating system it contains. The BOINC host population includes 32-bit and 64-bit hosts. 64-bit hosts can run 32-bit VMs, but not conversely. You can choose to provide 32- or 64-bit VM images, or both.
Possible reasons for using 64-bit VM images:
- The 64-bit version of your app runs significantly faster than the 32-bit version.
- Your app uses more than 3 GB of virtual address space.
If you provide only 32-bit VM images, you must still create separate 32- and 64-bit app versions, using the same VM image but different vboxwrapper executables. (vboxwrapper is a C++ program and has different executables for each platform, include 32- and 64-bit; these are available below. The 32-bit vboxwrapper will generally not work on a 64-bit machine).
There are two ways to package VM apps. NOTE: in the following, we use application_' and '_application version with their BOINC-specific meanings; we'll use executable to refer to the program that runs within the VM.
- Single-purpose app: Include the executable with the application version. Create a separate application for each executable you want to run.
- Multi-purpose app: Include the executable in each workunit. This allows you to use a single application for as many executables as you like. In this case, consider making the executable file sticky; that way, clients will download it only once.
You must create app versions for each platform you want to support; the app versions differ in which vboxwrapper executable they use. If you use both 32- and 64-bit VMs, the versions will also differ in the VM image and the application executable.
The application versions for a given platform are of plan class "vbox32" (for 32-bit machines) or "vbox64" (for 64-bit machines).
To enable multiple cores use the plan classes "vbox32_mt" and "vbox64_mt". By default it will cause the server to assign 2 threads(virtual cores) per VM task. This behavior can be modified in sched_customize.cpp.
For single-purpose apps, an app version includes the following files:
- The VM image, in VirtualBox format.
- Must have logical name "vm_image.vdi".
- Must have the copy_file attribute.
- Should have the gzip attribute for faster download to 7.0+ clients.
- The application executable to be run in the VM image.
- This may be a shell script or a binary program.
- The logical name must be shared/boinc_app.
- Other files needed by the application, all with logical names starting with shared/.
- An XML job description file_' with logical name '_vbox_job.xml (see below)
- vboxwrapper, compiled for the platform (executables are available below).
- All scripts and executables must have the execute permission set.
For multi-purpose apps, any of these files except vboxwrapper may be included in the workunit instead of the app version.
Include <dont_throttle/> in the version.xml file; VirtualBox does its own CPU throttling.
Typically you can use the same VM image for multiple applications. This reduces network traffic and client disk usage.
The job description file has logical name vbox_job.xml (its physical name should include a version number and other info). It has following structure:
<vbox_job>
<memory_size_mb>N</memory_size_mb>
<os_name>name</os_name>
[ <completion_trigger_file>filename</completion_trigger_file> ]
[ <copy_to_shared>filename</copy_to_shared> ]
[ <enable_cache_disk>0|1</enable_cache_disk> ]
[ <enable_cern_dataformat>0|1</enable_cern_dataformat> ]
[ <enable_floppyio>0|1</enable_floppyio> ]
[ <enable_isocontextualization>0|1</enable_isocontextualization> ]
[ <enable_network/> ]
[ <enable_remotedesktop>0|1</enable_remotedesktop> ]
[ <enable_shared_directory/> ]
[ <enable_graphics_support/> ]
[ <fraction_done_filename>filename</fraction_done_filename> ]
[ <job_duration>X</job_duration> ]
[ <minimum_checkpoint_interval>N</minimum_checkpoint_interval> ]
[ <network_bridged_mode/> ]
[ <pf_guest_port>N</pf_guest_port> ]
[ <port_forward>
<host_port>H</host_port>
<guest_port>G</guest_port>
[<is_remote>0|1</is_remote>]
[<nports>N</nports]
</port_forward> ]
[ <trickle_trigger_file>filename</trickle_trigger_file> ]
[ <vm_disk_controller_model>LSILogic|LSILogicSAS|BusLogic|IntelAHCI|PIIX3|PIIX4|ICH6|I82078</vm_disk_controller_model> ]
[ <vm_disk_controller_type>ide|sata|scsi|floppy|sas</vm_disk_controller_type> ]
</vbox_job>
Required elements:
memory_size_mb:: the amount of physical memory allocated to the VM, in megabytes. os_name:: the name of the guest OS as defined by VirtualBox, e.g. "Linux26", "Linux26_64", "Linux24", etc. To see a list of all available OS names, install VirtualBox on a system, and type "vboxmanage list ostypes".
Optional elements: completion_trigger_file:: This provides a more bulletproof way for VM apps to exit; sometimes VMs fail to shut down, and the task hangs indefinitely. When the VM app is done, it writes a file of this name in the shared directory; the file can optionally contain an integer exit code (first line) a bool value for whether it should bubble up to the volunteer (second line) and stderr text (subsequent lines). If vboxwrapper finds this file, it cleans up the VM and exits with the given code (default 0). copy_to_shared:: copy the given file from the slot directory the shared directory before launch. For example, you can use to copy init_data.xml into the VM. This directive can be used more than once. enable_cache_disk:: Mount a virtual disk in the VM. The virtual disk is described by a VDI file in the app version with logical name vm_cache.vdi_' and the '_copy_file attribute. enable_cern_dataformat_':: if '_enable_floppyio is used (see below) initialize the floppy disk image to contain user and host ID and credit as name=value pairs. enable_isocontextualization_':: the VM image is an ISO file named '_vm_isocontext.iso, rather than a VDI file. Also, vboxwrapper will mount the host's guest additions ISO (VBoxGuestAdditions.iso) as a DVD in the VM. enable_floppyio:: create a floppy disk image in the VM, containing the contents of init_data.xml. enable_network:: if present, allow the VM to do network access. enable_remotedesktop:: If the Oracle VirtualBox Extension are installed, it'll enable the use of a remote desktop client to view the console of the VM. enable_shared_directory:: if present, create a directory that is shared between the host OS and the guest OS. Must be set if your application has input or output files. enable_graphics_support:: if present, creates and updates a graphics status file. This is used by HTMLGfx. (v26155+) fraction_done_filename:: the name of a file to which the app will periodically write its fraction done (0 to 1). This is used by the wrapper to report overall fraction done. job_duration:: this specifies the maximum elapsed time of the job, after which vboxwrapper will kill the VM and exit normally. minimum_checkpoint_interval:: minimum number of seconds before a checkpoint/snapshot can be created. Defaults to 10 minutes. (v26086+) network_bridged_mode_':: if '_enable_network is set, use bridged mode; default is NAT mode. pf_guest_port:: enable port forwarding to port N within the VM. This is assumed to be a web server providing application graphics. port_forward:: defines a port forwarding between the given host and guest ports. If nports is specified, N ports will be forwarded: H to G, H+1 to G+1, ... H+N-1 to G+N-1. If is_remote is set, the host ports can be accessed from other computers; otherwise they can be accessed only from processes on this computer. There may be more than one of these elements. trickle_trigger_file:: provides a mechanism for the VM to send trickle-up messages. If a file of the given name appears in the shared directory, vboxwrapper sends a trickle-up message whose variety is the filename and whose contents is the contents of the file, then deletes the file. vm_disk_controller_model:: which disk controller model to emulate. vm_disk_controller_type:: which disk controller type to emulate.
The state file has the name of vbox_webapi.xml.
It has following structure:
<webapi>
<host_port>X</host_port>
</webapi>
Required elements: host_port:: The port, if configured for it, vboxwrapper has assigned to the task for WebAPI requests. (HTTP, XML-RPC, JSON)
The state file has the name of vbox_remote_desktop.xml.
It has following structure:
<remote_desktop>
<host_port>X</host_port>
</remote_desktop>
Required elements: host_port:: The port, if configured for it, vboxwrapper has assigned to the task for Remote Desktop requests. (RDP)
--trickle X :: Send a trickle message reporting elapsed time every X seconds. Use might this for incremental credit granting, or as a "heartbeat" mechanism. -- nthreads N :: Create a virtual machine that will use N cores. --vmimage N:: Use vm_image_N.vdi_' as the VM image, rather than '_vm_image.vdi. This lets you create an app version with several images, and the app_plan function can decide which one to use for the particular host. -- register_only :: Register the VM but don't run it. For debugging; see below.
The input and output files of a VM app must
- Have logical names starting with shared/.
- Have the copy_file attribute.
This causes the BOINC client to copy them to and from the slot/x/shared/ directory.
To debug a VM within the BOINC/VboxWrapper framework:
- Launch BOINC with --exit_before_start
- When BOINC exits, launch vboxwrapper with the --register_only option.
- Set the VBOX_USER_HOME environment variable to the vbox directory under the slot directory. This changes where the VirtualBox applications look for the root VirtualBox configuration files. It may or may not apply to your installation of VirtualBox. It depends on where your copy of VirtualBox came from and what type of system it is installed on.
- Now Launch the VM using the VirtualBox UI. You should now be able to interact with your VM.
To debug what is going on within the guest VM you can use vboxmonitor which writes whatever is received in stdin to the VM guest log (VBox.log). This in turn is read and rewritten to stderr.txt in the slot directory by vboxwrapper.
To use you must either use the premade vboxmonitor or build your own. Once on the guest VM you must execute setuid on it so that it runs with root permissions.
Usage:
[vboxmonitor](root@localhost)# echo this is a test | ./vboxmonitor
this is a test
Log Output:
01:08:57.938896 Guest Log: this is a test
Usage:
[vboxmonitor](root@localhost)# ls -la ../vboxwrapper | ./vboxmonitor
total 37052
drwxrwxr-x 5 boincadm boincadm 4096 Jun 2 12:36 .
drwxrwxr-x 18 boincadm boincadm 4096 Oct 3 2013 ..
-rw-rw-r-- 1 boincadm boincadm 3831 Apr 24 2013 BuildMacVboxWrapper.sh
drwxrwxr-x 2 boincadm boincadm 4096 Apr 24 2013 cernvm
drwxrwxr-x 3 boincadm boincadm 4096 Apr 24 2013 deprecated
-rw-rw-r-- 1 boincadm boincadm 11585 Apr 24 2013 floppyio.cpp
-rw-rw-r-- 1 boincadm boincadm 5910 Apr 24 2013 floppyio.h
-rw-rw-r-- 1 boincadm boincadm 71444 May 21 20:01 floppyio.o
lrwxrwxrwx 1 boincadm boincadm 48 May 21 20:01 libstdc++.a -> /usr/lib/gcc/i386-redhat-linux/4.1.2/libstdc++.a
-rw-rw-r-- 1 boincadm boincadm 927 May 29 2013 Makefile
-rw-rw-r-- 1 boincadm boincadm 1135 Apr 24 2013 Makefile_mac
-rw-rw-r-- 1 boincadm boincadm 90494 Jun 2 12:33 vbox.cpp
-rw-rw-r-- 1 boincadm boincadm 7573 Jun 2 12:33 vbox.h
-rw-rw-r-- 1 boincadm boincadm 249892 Jun 2 12:35 vbox.o
-rw-rw-r-- 1 boincadm boincadm 43848 Jun 2 12:33 vboxwrapper.cpp
-rw-rw-r-- 1 boincadm boincadm 1458 Dec 9 19:21 vboxwrapper.h
-rw-rw-r-- 1 boincadm boincadm 178548 Jun 2 12:35 vboxwrapper.o
-rw-rw-r-- 1 boincadm boincadm 438 May 21 19:56 vboxwrapper_win.h
-rw-rw-r-- 1 boincadm boincadm 2127 May 21 19:56 vboxwrapper_win.rc
drwxrwxr-x 2 boincadm boincadm 4096 Apr 24 2013 vboxwrapper.xcodeproj
Log Output:
01:09:25.148427 Guest Log: total 37052
01:09:25.148762 Guest Log: drwxrwxr-x 5 boincadm boincadm 4096 Jun 2 12:36 .
01:09:25.149071 Guest Log: drwxrwxr-x 18 boincadm boincadm 4096 Oct 3 2013 ..
01:09:25.149445 Guest Log: -rw-rw-r-- 1 boincadm boincadm 3831 Apr 24 2013 BuildMacVboxWrapper.sh
01:09:25.149729 Guest Log: drwxrwxr-x 2 boincadm boincadm 4096 Apr 24 2013 cernvm
01:09:25.150070 Guest Log: drwxrwxr-x 3 boincadm boincadm 4096 Apr 24 2013 deprecated
01:09:25.150374 Guest Log: -rw-rw-r-- 1 boincadm boincadm 11585 Apr 24 2013 floppyio.cpp
01:09:25.150700 Guest Log: -rw-rw-r-- 1 boincadm boincadm 5910 Apr 24 2013 floppyio.h
01:09:25.151029 Guest Log: -rw-rw-r-- 1 boincadm boincadm 71444 May 21 20:01 floppyio.o
01:09:25.151573 Guest Log: lrwxrwxrwx 1 boincadm boincadm 48 May 21 20:01 libstdc++.a -> /usr/lib/gcc/i386-redhat-linux/4.1.2/libstdc++.a
01:09:25.151864 Guest Log: -rw-rw-r-- 1 boincadm boincadm 927 May 29 2013 Makefile
01:09:25.152190 Guest Log: -rw-rw-r-- 1 boincadm boincadm 1135 Apr 24 2013 Makefile_mac
01:09:25.152476 Guest Log: -rw-rw-r-- 1 boincadm boincadm 90494 Jun 2 12:33 vbox.cpp
01:09:25.152757 Guest Log: -rw-rw-r-- 1 boincadm boincadm 7573 Jun 2 12:33 vbox.h
01:09:25.153045 Guest Log: -rw-rw-r-- 1 boincadm boincadm 249892 Jun 2 12:35 vbox.o
01:09:25.199092 Guest Log: -rw-rw-r-- 1 boincadm boincadm 43848 Jun 2 12:33 vboxwrapper.cpp
01:09:25.199479 Guest Log: -rw-rw-r-- 1 boincadm boincadm 1458 Dec 9 19:21 vboxwrapper.h
01:09:25.199806 Guest Log: -rw-rw-r-- 1 boincadm boincadm 178548 Jun 2 12:35 vboxwrapper.o
01:09:25.200147 Guest Log: -rw-rw-r-- 1 boincadm boincadm 438 May 21 19:56 vboxwrapper_win.h
01:09:25.200490 Guest Log: -rw-rw-r-- 1 boincadm boincadm 2127 May 21 19:56 vboxwrapper_win.rc
01:09:25.201024 Guest Log: drwxrwxr-x 2 boincadm boincadm 4096 Apr 24 2013 vboxwrapper.xcodeproj
Windows:
x86: vboxwrapper_26155_windows_intelx86.zip
x64: vboxwrapper_26155_windows_x86_64.zip
Mac OS X:
x86: vboxwrapper_26105_i686-apple-darwin.zip
x64: vboxwrapper_26105_x86_64-apple-darwin.zip
Linux:
x86: vboxwrapper_26105_i686-pc-linux-gnu.zip
x64: vboxwrapper_26105_x86_64-pc-linux-gnu.zip
Linux:
x86: vboxmonitor_26086_i686-pc-linux-gnu.zip
x64: vboxmonitor_26086_x86_64-pc-linux-gnu.zip
These VM images were built using the above instructions for creating VM images. They contain Debian 4.0, without GCC or any build tools installed. They contain the example startup script.
x86: vmimage_x86.zip
x64: vmimage_x64.zip
In most cases, you can use these VM images with no modifications. If your application uses libraries not on the VM images, you can add them as follows:
-
Run VirtualBox, and open the VM image
-
Hit CTRL-C when see "BOINC VM starting" in the console window
-
Install whatever you want (can use apt-get install to install Debian packages).
-
when you're done, type
shutdown -hP 0
The VM image now has the additional libraries. Rename it to avoid confusion with the original version.
The VM, when booted, must do the following:
-
If the applications has input or output files, mount the shared directory using
mount -t vboxsf shared /root/shared
where /root/shared
is the path where the shared directory is to be mounted.
In this case the VM must contain the VirtualBox "guest additions".
Guest additions are required for shared folders to work.
-
Run the application.
-
When the application is finished, shut down the VM (e.g., by running shutdown on Linux).
These steps are typically done by a startup script in the VM image. An example startup script is given below. This script runs the application by doing the following:
- cd into the shared directory
- execute boinc_app, and wait for it to exit.
Using this script, your application executable must have logical name share/boinc_app.
Doing things this way, the VM image is independent of the application. You can use the a single VM image for many applications.
Attention:_' If your '_boinc_app is a bash or perl script you may get problems when the VM is restored from a snapshot. To circumvent this you have to change your startup script to copy the contents of the shared/ directory to another directory 'inside' the VM and execute it there. For example:
echo -- Launching boinc_app
if [ -f /root/shared/boinc_app ]; then
mkdir /root/worker
cp -r /root/shared/* /root/worker/
cd /root/worker/
./boinc_app
cd /root/
rm -rf /root/worker
shutdown -hP 0
else
This way you can still reuse the VM for other applications but have to make sure that your boinc_app control script is copying the output files of the application to ´/root/shared/´ before exiting. This may take some time, so you should do something like:
cp outfile1.zip /root/shared/out1.zip.tmp
cp outfile2.zip /root/shared/out2.zip.tmp
{...}
mv /root/shared/out1.zip.tmp /root/shared/out1.zip
mv /root/shared/out2.zip.tmp /root/shared/out2.zip
The example startup script follows. You can deploy it by appending to /root/.bashrc in the VM image.
echo --- BOINC VM starting
sleep 5
The "sleep 5" gives you time to break into a console session via CTRL-C if you need to make changes to the VM in the future.
echo --- Mounting shared directory
mount -t vboxsf shared /root/shared
if [ $? -ne 0 ]; then
echo --- Failed to mount shared directory
sleep 5
shutdown -hP 0
fi
echo -- Launching boinc_app
if [ -f /root/shared/boinc_app ]; then
cd /root/shared
./boinc_app
shutdown -hP 0
else
echo --- Failed to launch script
sleep 5
fi
shutdown -hP 0
Using the example startup script, the steps in running a vboxwrapper app are:
- BOINC client
- Create slot directory, say
slot/0
- Create slot/0/shared, and copy input files there
- Execute vboxwrapper in the slot directory
- vboxwrapper
- Create and run virtual machine
- Virtual machine
- Startup script
- mounts shared directory
- cd into shared directory
- execute boinc_app
- when boinc_app exits, shut down virtual machine
- vboxwrapper
- delete virtual machine
- call boinc_finish()
- BOINC client
- copy output files from
slot/0/shared
to project directory
The VM image that you distribute need contain only the runtime environment for your applications. In particular, it need not contain:
- Development tools such as gcc
- GUI software such as X11, gtk etc.
Reducing the VM image size reduces the network load on your server and on volunteer hosts, and the disk usage on volunteer hosts.
The easiest way to make a "small" Linux VM is to install the network install of Debian within the VM. You can find the netinst images here. Such VMs have VirtualBox and guest additions installed by default. They have the runtime libraries needed to run C and C++ applications.
During install you'll be asked what role should this Linux machine be configured for. Make sure all roles are unselected before continuing.
First step is to remove non-critical packages that are essential (source). For Debian Wheezy 7.6 that are:
acpi acpid busybox debconf-i18n eject groff-base iamerican ibritish info ispell laptop-detect logrotate installation-report manpages man-db net-tools os-prober rsyslog tasksel tasksel-data traceroute usbutils wamerican
Use aptitude to also remove linux-headers-*
packages but don't remove dkms
or virtualbox-guest-dkms
! Run apt-get autoremove
after this to get rid of now unused packages. apt-get autoclean
and apt-get clean
should remove downloaded archives to free up space.
In order to really purge all files from previously removed packages this command is helpful as it purges all configuration files from uninstalled packages:
dpkg --purge `dpkg --get-selections | grep deinstall | cut -f1`
Now, we can remove the contents of the following folders:
- /usr/share/locale/ - since we don't need all locales.
- /usr/share/doc/ - since we don't need the documentation.
- /usr/share/man/ - since we don't need the manuals and we've removed the man program.
- /var/log/ - since we won't be needing all that logging.
- /var/cache/debconf/ - since that cache is disposable.
- /var/lib/apt/lists/ - since those lists are huge and can quickly be recreated with apt-get update.
If you want to speed up the boot process,
change the default timeout for grub by modifying /etc/default/grub
:
GRUB_TIMEOUT = 0
After saving the update run:
update-grub
To configure Linux for automatic login you'll need to install a different terminal handler. mingetty works well for our purposes.
To install mingetty:
root@boinc-vm-image:/etc/default# apt-get install mingetty
Next you'll need to change the terminal handler assigned to the first virtual terminal by modifying /etc/inittab
.
Change line:
1:2345:respawn:/sbin/getty 38400 tty1
To:
1:2345:respawn:/sbin/mingetty --autologin root --noclear tty1
This will disable the periodic fsck run that occurs every 30 times the VM is turned on: tune2fs -c -1 /dev/sda1
.
To get a better compression ratio you may install and run the zerofree tool to overwrite all empty space on the VDI with zeros.
apt-get install zerofree
telinit 1
{enter root password}
mount -o remount,ro /dev/sda1
zerofree -v /dev/sda1
shutdown -hP 0
Now open a terminal on the host System and compact the VDI file:
vboxmanage modifyhd FILENAME.vdi --compact