Hi. We were running UrBackup Server (1.4.x series) on Centos 6.6 (ext4 for / and xfs file system for storage of backups) for well over a year (Dell PowerEdge 730xd). Occasionally we would see the server process die with the warning that the UrBackup Server died but the PID remained. We would then simply re-start the UrBackup Server process. It would happen only occasionally (maybe once every 2 months).
Since we’ve upgraded to UrBackup Server (2.0.33) (no issues upgrading) we’re now seeing daily crashes. In looking at the UrBackup server logs we see lots of the following errors:
2016-09-12 04:37:04: WARNING: SQLite: disk I/O error errorcode: 522
2016-09-12 04:37:04: ERROR: Error preparing Query [PRAGMA cache_size = -2048]: disk I/O error
2016-09-12 04:37:04: WARNING: SQLite: statement aborts at 4: [ATTACH DATABASE ‘urbackup/backup_server_settings.db’ AS settings_db] disk I/O error errorcode: 522
2016-09-12 04:37:04: ERROR: Error in CQuery::Execute - disk I/O error Stmt: [ATTACH DATABASE ‘urbackup/backup_server_settings.db’ AS settings_db]
2016-09-12 04:37:04: ERROR: SQL: disk I/O error Stmt: [ATTACH DATABASE ‘urbackup/backup_server_settings.db’ AS settings_db]
2016-09-12 04:37:04: Starting incremental file backup…
2016-09-12 04:37:04: WARNING: SQLite: disk I/O error errorcode: 522
2016-09-12 04:37:04: ERROR: Error preparing Query [SELECT incremental, path, resumed, complete, id FROM backups WHERE clientid=? AND tgroup=? AND done=1 ORDER BY backuptime DESC LIMIT 1]: disk I/O error. Retrying in 1s…
2016-09-12 04:37:05: WARNING: SQLite: disk I/O error errorcode: 522
2016-09-12 04:37:05: ERROR: Error preparing Query [SELECT incremental, path, resumed, complete, id FROM backups WHERE clientid=? AND tgroup=? AND done=1 ORDER BY backuptime DESC LIMIT 1]: disk I/O error. Retrying in 1s…
2016-09-12 04:37:06: WARNING: SQLite: disk I/O error errorcode: 522
2016-09-12 04:37:06: ERROR: Error preparing Query [SELECT incremental, path, resumed, complete, id FROM backups WHERE clientid=? AND tgroup=? AND done=1 ORDER BY backuptime DESC LIMIT 1]: disk I/O error. Retrying in 1s…
2016-09-12 04:37:07: WARNING: SQLite: disk I/O error errorcode: 522
2016-09-12 04:37:07: ERROR: Error preparing Query [SELECT incremental, path, resumed, complete, id FROM backups WHERE clientid=? AND tgroup=? AND done=1 ORDER BY backuptime DESC LIMIT 1]: disk I/O error. Retrying in 1s…
2016-09-12 04:37:08: WARNING: SQLite: disk I/O error errorcode: 522
2016-09-12 04:37:08: ERROR: Error preparing Query [SELECT incremental, path, resumed, complete, id FROM backups WHERE clientid=? AND tgroup=? AND done=1 ORDER BY backuptime DESC LIMIT 1]: disk I/O error. Retrying in 1s…
2016-09-12 04:37:09: WARNING: SQLite: disk I/O error errorcode: 522
2016-09-12 04:37:09: ERROR: Error preparing Query [SELECT incremental, path, resumed, complete, id FROM backups WHERE clientid=? AND tgroup=? AND done=1 ORDER BY backuptime DESC LIMIT 1]: disk I/O error
I ran the urbackupsrv repair-database and it found a few issues which it fixed. Ran for a day without issue (no crashes). The we saw the crashes daily again. Same errors. There’s plenty of disk space, no CPU issues or memory issues and the load on the server is not high when this occurs. Disks seem to be OK, but perhaps a server reboot and diskcheck would be warranted.
Upon starting the server I see:
2016-09-12 05:05:32: Starting HTTP-Server on port 55414
2016-09-12 05:05:32: HTTP: Server started up successfully!
2016-09-12 05:05:32: SQLite: recovered 12772 frames from WAL file /usr/local/var/urbackup/backup_server_files.db-wal code: 283
2016-09-12 05:05:33: SQLite: recovered 4602 frames from WAL file /usr/local/var/urbackup/backup_server_link_journal.db-wal code: 283
2016-09-12 05:05:34: SQLite: recovered 61009 frames from WAL file /usr/local/var/urbackup/backup_server_links.db-wal code: 283
2016-09-12 05:05:34: SQLite: recovered 1 frames from WAL file /usr/local/var/urbackup/backup_server_settings.db-wal code: 283
2016-09-12 05:05:35: Started UrBackup…
2016-09-12 05:05:35: Removing temporary files…
2016-09-12 05:05:35: Recreating temporary folder…
2016-09-12 05:05:35: Backup destination cannot handle subvolumes and snapshots. Snapshots disabled.
2016-09-12 05:05:35: Broadcasting on interface IP 172.20.161.26
2016-09-12 05:05:35: Broadcasting on interface IP 172.20.161.233
2016-09-12 05:05:35: Broadcasting on interface IP 169.254.0.2
2016-09-12 05:05:35: InternetService: Server started up successfully!
2016-09-12 05:05:35: UrBackup Server start up complete.
and off we go, backups start and we wait for the next crash.
Any advice appreciated on how to proceed or diagnose. Thanks.
Pete