It’s not a hardware problem: it runs on a Proxmox host with 18 other VMs, all of which work fine! I thought it might be memory, so I increased the VM to 8 GB and 4 vCores. UCS doesn’t even use a quarter of that today, and the RPC problems still occur!
I’ve looked through practically all the logs and searched everywhere without finding the cause. I’ve tried restarting all kinds of services (samba, winbind, kerberos, etc.), but it only comes back when I restart UCS as a whole.
Then after some time (usually a few days, but it can be hours), every type of domain login stops working, and when I open GPMC.MSC it reports an RPC error while loading the forest and does not open.
The customer is already stressed by this problem and I don’t know what else to do, so I’ll upgrade from 4.4 to 5.0. If that doesn’t solve it, I’m already looking at a plan B: moving the users, information and groups to an AD 2019.