In a Linux environment, I run
categoriMap = train.map(lambda x: x[3]).distinct().zipWithIndex().collectAsMap()
PyCharm is
