Apache Hive is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis.
HiveQL, which is an SQL-like language provided by Hive, provides a mechanism to project structure onto the data and query the data. Also this language allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL.
The following session explains in detail about:
- What is Hive and its need
- Hive Components
- Hive Metastore
- How to run Hive?
- Hive DDL, DML and SQL operations
- StratAppps Hive Developer