CDSW

From the perspective of a layman, try to understand how to use the framework of ML and DL in combination with CDSW, taking XGBoost as an example.

I don't understand the basic mechanism of machine learning and deep learning at all, so I took this opportunity to try to search for relevant information. This article attempts to analyze how the XGBoost demo is implemented in CDSW from the perspective of CDSW users without ML/DL knowledge. While searching for related information on Cloudera, I also discovered how ML/DL workloads are handled on Google Cloud DataProc. I will also try to make a comparison with CDSW.

How to access CDSW database from outside of CDSW/Kubernetes cluster and change user password

This article attempts to introduce how to forward the Kubernetes Service to the host node, and how to connect to the Cloudera CDSW database and modify the user's password.

CDSW Session Couldn't Start Due To Node Taints - node.kubernetes.io/disk-pressure:NoSchedule

This article records a troubleshooting about CDSW unable to start Session. The reason for the problem involved is that the remaining capacity of nodefs and imagefs used by Kubelet has reached the threshold of evicting the node, resulting in the node being labeled as NoSchedule taint by Kubernetes.

CDSW Diagnostic Bundle Anatomy

cdsw logs output If we run cdsw logs on the CDSW Master node, we’ll see output as below: Generating Cloudera Data Science Workbench diagnostic bundle... Collecting basic system info... Collecting kernel parameters.