Difference between revisions of "Upgrade from 3.x to 4.0"

From Proxmox VE
Jump to navigation Jump to search
 
(40 intermediate revisions by 13 users not shown)
Line 1: Line 1:
 
== Introduction ==
 
== Introduction ==
  
Proxmox VE 4.0 introduces major new features, therefore the upgrade must be carefully planned and tested. Depending on your existing configuration, several manual steps are required, including some downtime. NEVER start the upgrade process without a valid backup and without testing the same in a test lab setup.
+
Proxmox VE 4.x introduces major new features, therefore the upgrade must be carefully planned and tested. Depending on your existing configuration, several manual steps are required, including some downtime. NEVER start the upgrade process without a valid backup and without testing the same in a test lab setup.
  
Major upgrades for V4.0:
+
Major upgrades for V4.x:
 
*OpenVZ is removed, a conversion via backup/restore to LXC is needed  
 
*OpenVZ is removed, a conversion via backup/restore to LXC is needed  
 
*New corosync version, therefore clusters has to be re-established
 
*New corosync version, therefore clusters has to be re-established
Line 10: Line 10:
 
If you run a customized installation and/or you installed additional packages, for example for distributed storage like Ceph or sheepdog, DRBD or any other third party packages, you need to make sure that you also upgrade these package to Debian Jessie.  
 
If you run a customized installation and/or you installed additional packages, for example for distributed storage like Ceph or sheepdog, DRBD or any other third party packages, you need to make sure that you also upgrade these package to Debian Jessie.  
  
V4.0 supports only the new '''DRBD9 which is not backwards compatible with the 8.x version''' and is considered only a technology preview.
+
V4.x supports only the new '''DRBD9 which is not backwards compatible with the 8.x version''' and is considered only a technology preview.
  
Generally speaking there are two possibilities to move from 3.x to 4.0
+
Generally speaking there are two possibilities to move from 3.x to 4.x
  
 +
*New installation on new hardware (and restore VM´s from backup) - safest way, recommended!
 
*In-place upgrade via apt, step by step   
 
*In-place upgrade via apt, step by step   
*New installation on new hardware (and restore VM´s from backup) - safest way.
 
  
 
In both cases you'd better empty the browser's cache after upgrade and reload the GUI page or there is the possibility that you see a lot of glitches.
 
In both cases you'd better empty the browser's cache after upgrade and reload the GUI page or there is the possibility that you see a lot of glitches.
 +
 +
== New installation ==
 +
 +
* Backup all VMs and containers to external media (see [[Backup and Restore]])
 +
* Backup all files in /etc You will need various files in /etc/pve, as well as /etc/passwd, /etc/network/interfaces, /etc/resolv.conf and others depending on what has been configured from the defaults.
 +
* Install Proxmox VE from ISO (this will wipe all data on the existing host)
 +
* Rebuild the cluster if you had any
 +
* Restore the file /etc/pve/storage.cfg (this will re-map and make available any external media you used for backup)
 +
* Restore firewall configs /etc/pve/firewall/ and /etc/pve/nodes/<node>/host.fw (if relevant)
 +
* Restore full VMs from Backups (see [[Backup and Restore]])
 +
* Restore/Convert containers (see [[Convert OpenVZ to LXC]])
 +
 +
=== Bypassing Backup and Restore ===
 +
 +
The following is only for advanced users which have knowledge about Proxmox configuration files!
 +
 +
Since Backup and Restore can be a time-consuming process in the following a more rapid method is described - possible only
 +
 +
* for KVM (i.e. not for containers)
 +
 +
* if the (virtual) disk(s) for the VM(s) is (are) located at a storage which is not touched by the installation process (e.g. NFS at an external server)
 +
 +
The steps
 +
 +
* Backup all VMs
 +
* Restore full VMs from Backups
 +
 +
will be replaced by
 +
 +
* Backup <vmid>.conf file(s) for the respective machine(s), they are located under /etc/pve/nodes/<nodename>/lxc/ and /etc/pve/nodes/<nodename>/qemu-server/ respectively
 +
* Backup those storages from the storage configuration ( /etc/pve/storage.cfg ) which are shared and untouched, simply copy the respective lines and append them once to the newly build clusters /etc/pve/storage.cfg
 +
* Restore <vmid>.conf file(s) for the respective machine(s)
 +
 +
'''Note:''' /etc/pve/lxc/ and /etc/pve/qemu-server/ are virtual symlinks for the current nodes lxc and qemu directory.
 +
 +
After you restored the VM configs and restored the external shared Storage configuration - so that it is accessible under the same name in the new cluster again - you should be able to start the VMs again.
 +
No additional reboot is required.
  
 
== In-place upgrade ==
 
== In-place upgrade ==
Line 28: Line 65:
 
* reliable access to all configured storages
 
* reliable access to all configured storages
 
* healthy cluster
 
* healthy cluster
* no VM or CT running (note: VM live migration from 3.4 to 4.0 node or vice versa NOT possible)
+
* no VM or CT running (note: VM live migration from 3.4 to 4.x node or vice versa NOT possible)
 
* valid backup of all OpenVZ containers (needed for the conversion to LXC)
 
* valid backup of all OpenVZ containers (needed for the conversion to LXC)
 
* valid backup of all VM (only needed if something goes wrong)
 
* valid backup of all VM (only needed if something goes wrong)
Line 37: Line 74:
  
 
All has to be done on each Proxmox node's command line (via console or ssh; preferably via console in order to exclude interrupted ssh connections) , some of the steps are optional. If a whole cluster should be upgraded, keep in mind the cluster name and HA configuration like failoverdomains, fencing etc since these have to be restored after upgrade by the new WEB GUI. Again, make sure that you have a valid backup of all CT and VM before you start.
 
All has to be done on each Proxmox node's command line (via console or ssh; preferably via console in order to exclude interrupted ssh connections) , some of the steps are optional. If a whole cluster should be upgraded, keep in mind the cluster name and HA configuration like failoverdomains, fencing etc since these have to be restored after upgrade by the new WEB GUI. Again, make sure that you have a valid backup of all CT and VM before you start.
 +
 +
'''Tip''': ''It is advisable to perform a dry-run of the upgrade first. Install the PVE 3.4 ISO on testing hardware, then upgrade this installation to the latest minor version of PVE 3.4 using the test repo (see [[Package repositories]]) then copy/create relevant configurations to the test machine to replicate your production setup as closely as possible.''
  
 
==== Remove Proxmox VE 3.x packages in order to avoid dependency errors ====
 
==== Remove Proxmox VE 3.x packages in order to avoid dependency errors ====
Line 64: Line 103:
  
 
==== Install the new kernel ====
 
==== Install the new kernel ====
 +
 +
Add repository proxmox jessie location
 +
 +
deb https://enterprise.proxmox.com/debian jessie pve-enterprise
 +
deb http://download.proxmox.com/debian jessie pve-no-subscription
 +
 +
apt-get update
  
 
Check first what the current new kernel's version is  
 
Check first what the current new kernel's version is  
  
  apt-cache search pve-kernel | sort
+
  apt-cache search pve-kernel | sort -V
  
- at the moment (October, 6th 2015) it is 4.2.6-1 - and install it:
+
- at the moment (September 2016) it is 4.4.19-1 - and install it:
  
  apt-get install pve-kernel-4.2.6-1-pve pve-firmware
+
  apt-get install pve-kernel-4.4.19-1-pve pve-firmware
  
 
==== Upgrade the basic system to Debian Jessie ====
 
==== Upgrade the basic system to Debian Jessie ====
  
 
This action will consume some time - depending on the systems performance, this can take up to 60 min or even more. If you run on SSD, the dist-upgrade can be finished in 5 minutes.
 
This action will consume some time - depending on the systems performance, this can take up to 60 min or even more. If you run on SSD, the dist-upgrade can be finished in 5 minutes.
 +
 +
Start with this step to get the initial set of upgraded packages.
 +
 +
apt-get upgrade
 +
 +
Once that's done, move on to the remaining packages to upgrade, with:
  
 
  apt-get dist-upgrade
 
  apt-get dist-upgrade
 +
 +
During either of the above, you may be asked to approve of some new packages replacing configuration files. Do with them as you see fit, but they are not relevant to the Proxmox upgrade.
  
 
Reboot the system in order to activate the new kernel.
 
Reboot the system in order to activate the new kernel.
  
==== Install Proxmox VE 4.0 ====
+
==== Install Proxmox VE 4.x ====
Finally, install the new Proxmox VE 4.0 packages with one single command:
+
Finally, install the new Proxmox VE 4.x packages with one single command:
 
  apt-get install proxmox-ve
 
  apt-get install proxmox-ve
  
Then you should purge configuration files from packages which are no longer needed (assuming you already saved your OpenVZ containers)
+
Then you should purge configuration files from packages which are no longer needed ('''''Note:''' purging vzctl will delete all files in /var/lib/vz/private and /var/lib/vz/root , only run this if you have backed up your openvz containers.''):
  
 
  dpkg --purge vzctl
 
  dpkg --purge vzctl
 
  dpkg --purge redhat-cluster-pve
 
  dpkg --purge redhat-cluster-pve
  
'''Remove the old kernel''' (not a must, but recommended), e.g. (the kernel version has to be adapted to the currently installed one - there can be more old kernels too):
+
'''Remove the old kernel''' (not a must, but recommended), e.g. (the kernel version has to be adapted to the currently installed one - there can be more old kernels too. Use dpkg --list | grep pve-kernel to find any 2.6.* kernels to remove):
 +
 
 +
apt-get remove pve-kernel-2.6.*
 +
 
 +
After these commands, proxmox gives some error message and ignore them.
 +
 
 +
Reboot system,
  
  apt-get remove pve-kernel-2.6.32-41-pve
+
Again,
 +
  apt-get install proxmox-ve
  
 
Finally, reboot and test if all is working as expected.
 
Finally, reboot and test if all is working as expected.
Line 106: Line 167:
  
 
==== Cluster upgrade ====
 
==== Cluster upgrade ====
'''It is not possible to mix Proxmox VE 3.x and earlier with Proxmox VE 4.0 cluster '''
+
'''It is not possible to mix Proxmox VE 3.x and earlier with Proxmox VE 4.x cluster '''
  
 
Due to the new corosync 2.x, the cluster has to be re-established again. Please use the same clustername.
 
Due to the new corosync 2.x, the cluster has to be re-established again. Please use the same clustername.
Line 140: Line 201:
 
  apt-get -fy install
 
  apt-get -fy install
  
=== External links ===
+
* Unable to boot due to grub failure
  
*[https://www.debian.org/releases/jessie/amd64/release-notes/ Release Notes for Debian 8.0 (jessie), 64-bit PC]
+
See [[Recover_From_Grub_Failure]]
  
== New installation ==
+
'''Info: As long as you do not delete the files on the link below, Proxmox starts up with the same virtual machines.'''
  
* Backup all VMs and containers
+
'''/var/lib/vz/images/*'''  --VM's data files are here.
* Save all files from /etc/pve/... on a save place
 
* Install Proxmox VE from ISO
 
* Rebuild the cluster if you had any
 
* Restore /etc/pve/storage.cfg
 
* Restore firewall configs /etc/pve/firewall/ and /etc/pve/nodes/<node>/host.fw
 
* Create empty VMs for the following restore action
 
* Restore VMs from Backups
 
  
* '''For experts only''': Alternatively - if the virtual HD(s) is (are) on an external storage - to the two steps mentioned above you can do the following (Be careful what you do in that case, wrong handling can lead to inconsistent data and cause severe problems in the system):
+
'''/etc/pve/qemu-server/*.conf''' --VM's conf files are here.
 +
=== External links ===
  
** copy the previously saved .conf - file(s) to the proper place in the node where it should run as it is /etc/pve/qemu-server/<vm-id>.conf
+
*[https://www.debian.org/releases/jessie/amd64/release-notes/ Release Notes for Debian 8.0 (jessie), 64-bit PC]
  
* Restore/Convert containers, see [[Convert OpenVZ to LXC]]
 
  
[[Category:HOWTO]] [[Category:Installation]]
+
[[Category:HOWTO]] [[Category:Installation]][[Category: Upgrade]]

Latest revision as of 08:15, 21 January 2022

Introduction

Proxmox VE 4.x introduces major new features, therefore the upgrade must be carefully planned and tested. Depending on your existing configuration, several manual steps are required, including some downtime. NEVER start the upgrade process without a valid backup and without testing the same in a test lab setup.

Major upgrades for V4.x:

  • OpenVZ is removed, a conversion via backup/restore to LXC is needed
  • New corosync version, therefore clusters has to be re-established
  • New HA manager (replacing RGmanager, involving a complete HA re-configuration)

If you run a customized installation and/or you installed additional packages, for example for distributed storage like Ceph or sheepdog, DRBD or any other third party packages, you need to make sure that you also upgrade these package to Debian Jessie.

V4.x supports only the new DRBD9 which is not backwards compatible with the 8.x version and is considered only a technology preview.

Generally speaking there are two possibilities to move from 3.x to 4.x

  • New installation on new hardware (and restore VM´s from backup) - safest way, recommended!
  • In-place upgrade via apt, step by step

In both cases you'd better empty the browser's cache after upgrade and reload the GUI page or there is the possibility that you see a lot of glitches.

New installation

  • Backup all VMs and containers to external media (see Backup and Restore)
  • Backup all files in /etc You will need various files in /etc/pve, as well as /etc/passwd, /etc/network/interfaces, /etc/resolv.conf and others depending on what has been configured from the defaults.
  • Install Proxmox VE from ISO (this will wipe all data on the existing host)
  • Rebuild the cluster if you had any
  • Restore the file /etc/pve/storage.cfg (this will re-map and make available any external media you used for backup)
  • Restore firewall configs /etc/pve/firewall/ and /etc/pve/nodes/<node>/host.fw (if relevant)
  • Restore full VMs from Backups (see Backup and Restore)
  • Restore/Convert containers (see Convert OpenVZ to LXC)

Bypassing Backup and Restore

The following is only for advanced users which have knowledge about Proxmox configuration files!

Since Backup and Restore can be a time-consuming process in the following a more rapid method is described - possible only

  • for KVM (i.e. not for containers)
  • if the (virtual) disk(s) for the VM(s) is (are) located at a storage which is not touched by the installation process (e.g. NFS at an external server)

The steps

  • Backup all VMs
  • Restore full VMs from Backups

will be replaced by

  • Backup <vmid>.conf file(s) for the respective machine(s), they are located under /etc/pve/nodes/<nodename>/lxc/ and /etc/pve/nodes/<nodename>/qemu-server/ respectively
  • Backup those storages from the storage configuration ( /etc/pve/storage.cfg ) which are shared and untouched, simply copy the respective lines and append them once to the newly build clusters /etc/pve/storage.cfg
  • Restore <vmid>.conf file(s) for the respective machine(s)

Note: /etc/pve/lxc/ and /etc/pve/qemu-server/ are virtual symlinks for the current nodes lxc and qemu directory.

After you restored the VM configs and restored the external shared Storage configuration - so that it is accessible under the same name in the new cluster again - you should be able to start the VMs again. No additional reboot is required.

In-place upgrade

In-place upgrades are done with apt, so make sure that you are familiar with apt before you start here.

Preconditions

  • upgraded to latest V3.4 version
  • reliable access to all configured storages
  • healthy cluster
  • no VM or CT running (note: VM live migration from 3.4 to 4.x node or vice versa NOT possible)
  • valid backup of all OpenVZ containers (needed for the conversion to LXC)
  • valid backup of all VM (only needed if something goes wrong)
  • Correct repository configuration (accessible both wheezy and jessie)
  • at least 1GB free disk space at root mount point

Actions Step by Step

All has to be done on each Proxmox node's command line (via console or ssh; preferably via console in order to exclude interrupted ssh connections) , some of the steps are optional. If a whole cluster should be upgraded, keep in mind the cluster name and HA configuration like failoverdomains, fencing etc since these have to be restored after upgrade by the new WEB GUI. Again, make sure that you have a valid backup of all CT and VM before you start.

Tip: It is advisable to perform a dry-run of the upgrade first. Install the PVE 3.4 ISO on testing hardware, then upgrade this installation to the latest minor version of PVE 3.4 using the test repo (see Package repositories) then copy/create relevant configurations to the test machine to replicate your production setup as closely as possible.

Remove Proxmox VE 3.x packages in order to avoid dependency errors

First make sure that your actual installation is "clean", tentatively run

apt-get update && apt-get dist-upgrade

Then start the removal:

apt-get remove proxmox-ve-2.6.32 pve-manager corosync-pve openais-pve redhat-cluster-pve pve-cluster pve-firmware 

Adapt repository locations and update the apt database, point all to jessie, e.g.:

sed -i 's/wheezy/jessie/g' /etc/apt/sources.list
sed -i 's/wheezy/jessie/g' /etc/apt/sources.list.d/pve-enterprise.list
apt-get update

If there is a backports line then remove it. Currently, pve-manager and ceph-common have unmet dependencies with regards to package versions in the jessie backports repo.

In case Ceph server is used: Ceph repositories for jessie can be found at http://download.ceph.com, therefore etc/apt/sources.list.d/ceph.list will contain e.g.:

 deb http://download.ceph.com/debian-hammer jessie main


You also need to install the Ceph repository key to apt, for details, check the wiki on ceph.com.

Install the new kernel

Add repository proxmox jessie location

deb https://enterprise.proxmox.com/debian jessie pve-enterprise
deb http://download.proxmox.com/debian jessie pve-no-subscription
apt-get update

Check first what the current new kernel's version is

apt-cache search pve-kernel | sort -V

- at the moment (September 2016) it is 4.4.19-1 - and install it:

apt-get install pve-kernel-4.4.19-1-pve pve-firmware

Upgrade the basic system to Debian Jessie

This action will consume some time - depending on the systems performance, this can take up to 60 min or even more. If you run on SSD, the dist-upgrade can be finished in 5 minutes.

Start with this step to get the initial set of upgraded packages.

apt-get upgrade

Once that's done, move on to the remaining packages to upgrade, with:

apt-get dist-upgrade

During either of the above, you may be asked to approve of some new packages replacing configuration files. Do with them as you see fit, but they are not relevant to the Proxmox upgrade.

Reboot the system in order to activate the new kernel.

Install Proxmox VE 4.x

Finally, install the new Proxmox VE 4.x packages with one single command:

apt-get install proxmox-ve

Then you should purge configuration files from packages which are no longer needed (Note: purging vzctl will delete all files in /var/lib/vz/private and /var/lib/vz/root , only run this if you have backed up your openvz containers.):

dpkg --purge vzctl
dpkg --purge redhat-cluster-pve

Remove the old kernel (not a must, but recommended), e.g. (the kernel version has to be adapted to the currently installed one - there can be more old kernels too. Use dpkg --list | grep pve-kernel to find any 2.6.* kernels to remove):

apt-get remove pve-kernel-2.6.*

After these commands, proxmox gives some error message and ignore them.

Reboot system,

Again,
apt-get install proxmox-ve

Finally, reboot and test if all is working as expected.

Optional: OpenVZ conversion

Convert the previously backed up containers to LXC, following the HowTo on Convert OpenVZ to LXC

You can also remove the obsolete OpenVZ container data from your local storage.

rm -f /etc/pve/openvz/<ct-id>.conf
rm -R <storage-path>/private/*

Cluster upgrade

It is not possible to mix Proxmox VE 3.x and earlier with Proxmox VE 4.x cluster

Due to the new corosync 2.x, the cluster has to be re-established again. Please use the same clustername.

  • at the first node
pvecm create <clustername>
  • at all other nodes:
pvecm add <first-node´s-IP> -force

The HA configuration (fail-over, fencing etc.) has to be re-configured manually, now supported from WEB GUI, see High Availability Cluster 4.x

After upgrading the last node remove the V3.x cluster data:

rm /etc/pve/cluster.conf

Troubleshooting

  • Failing upgrade to latest Proxmox VE 3.x or removal of old packages:

Make sure that the original repository configuration (for wheezy) is correct. The change to "jessie" repositories has to be done after the removal of old Proxmox VE.

In case of Ceph is used: note that recently the repository url has changed to http://download.ceph.com/

  • Failing upgrade to "jessie"

Make the sure that the repository configuration for jessie is correct.

If there was a network failure and the upgrade has been made partially try to repair the situation with

apt-get -fy install
  • Unable to boot due to grub failure

See Recover_From_Grub_Failure

Info: As long as you do not delete the files on the link below, Proxmox starts up with the same virtual machines.

/var/lib/vz/images/* --VM's data files are here.

/etc/pve/qemu-server/*.conf --VM's conf files are here.

External links