Rania ZYANE

Looking for feedback: Is an open-source audit logs dashboard for Hadoop something you'd use?

Hi Hunters 👋,

I'm working on an open-source project and would really appreciate your feedback before I take it further.

I’ve been building a lightweight tool that lets you search and visualize audit logs from services in the Hadoop ecosystem, like Apache Ranger, Hive, HDFS, Impala, and YARN, without needing to set up an external logging stack (like ELK or Splunk).

The idea is to make it easier for data engineers, governance teams, or platform admins to answer questions like:

  • Who accessed this dataset last week?

  • Which users are getting permission denied errors?

  • What actions were blocked by Ranger policies?

The tool runs SQL queries (via Hive or Impala) on logs stored in HDFS, and displays the results in a web dashboard (built with Streamlit). It’s fast to deploy, open source, and designed for on-prem or hybrid clusters.

What I’d love your input on:

  • Do you think a tool like this would be useful in your environment?

  • Have you encountered similar pain points with auditing or log access in Hadoop?

  • Would you prefer it to be part of a broader observability stack, or keep it focused?

I practically finished the build and looking for honest feedback from people who know this space. If it sounds useful, I’d love to share a demo or discuss ideas.

Thanks in advance 🙌

17 views

Add a comment

Replies

Be the first to comment