Saturday, April 8, 2023

AI/ML What is Apache Zeppelin

Web-based notebook that enables data-driven,

interactive data analytics and collaborative documents with SQL, Scala, Python, R and more.


Zeppelin SDK

Not only you can use Zeppelin as interactive notebook, you can also use it as JobServer via Zeppelin SDK (client api & session api)


Spark Interpreter Improved

Spark interpreter provides comparable Python & R user experience like Jupyter Notebook.



Flink Interpreter Improved

Flink interpreter is refactored, supports Scala, Python & SQL. Flink 1.10 and afterwards (Scala 2.11 & 2.12) are all supported.


Yarn Interpreter Mode

You can run interpreter in yarn cluster, e.g. you can run Python interpreter in yarn and R interpreter in yarn.


Inline Configuration

Generic ConfInterpreter provide a way configure interpreter inside each note.


Interpreter Lifecycle Management

Interpreter lifecycle manager automatically terminate interpreter process on idle timeout. So resources are released when they're not in use. See here for more details.

Multi-purpose Notebook

The Notebook is the place for all your needs


 Data Ingestion

 Data Discovery

 Data Analytics

 Data Visualization & Collaboration


Multiple Language Backend

Apache Zeppelin interpreter concept allows any language/data-processing-backend to be plugged into Zeppelin. Currently Apache Zeppelin supports many interpreters such as Apache Spark, Apache Flink, Python, R, JDBC, Markdown and Shell.


References

https://zeppelin.apache.org/



No comments:

Post a Comment