Jump to content

Analytics/Archive/Hue

From Wikitech
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.
This page contains historical information. It may be outdated or unreliable.

Hue is a web interface for the Hadoop cluster, most notably for running Hive queries. Our Hue instance is at https://hue.wikimedia.org.

General documentation for Hue can be found on our instance at https://hue.wikimedia.org/help/.

Access

To access Hue, you will just need wmf or nda LDAP access. For more details, see Analytics/Data access#LDAP access.[1]

If you want to use Hue to query private data, you will also need analytics-privatedata-users shell access.

Hive query errors with Kerberos

In T242306 it was reported the following error while using the Hive query editor:

Error while compiling statement: FAILED: SemanticException java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.ql.metadata.SessionHiveMetaStoreClient

The error is related to Kerberos authentication, and can be fixed forcing the renewal of the Hive session via the following steps:

  • In the Hue Hive editor, locate the 3-dots button at the top-right corner of the screen and hit it.
  • Then hit "Session" and the "Recreate" button.


Package and release a new version of Hue

We package Hue from the cloudera upstream repository, https://github.com/cloudera/hue/

https://gerrit.wikimedia.org/r/plugins/gitiles/operations/debs/hue/+/refs/heads/master/debian/README.debian

Notes

  1. ↑ In the past, you also needed someone from Analytics Engineering to manually sync your LDAP account to Hue. This is no longer necessary.