The interaction between DDL, DML and SQL is: DDL defines a meta-structure for tables, DML helps to manage/load the data into the tables, and SQL queries the tables to get insightful analysis.
Part 1. DDL Operation
- Create/Drop/Alter Database
- Create/Drop/Truncate Table
- Alter Table/Partition/Column
- Create/Drop/Alter View
- Create/Drop/Alter Index
- Create/Drop Function
- Create/Drop/Grant/Revoke Roles and Privileges
- Show
- Describe
Part 2. DML Operation
- Loading files into tables
- Inserting data into Hive Tables from queries
- Writing data into the filesystem from queries
There are two primary ways of modifying data in Hive:
- LOAD
- INSERT
Part 3. SQL Operation
This is quite similar as SQL language, basically it is based on the Select Syntax
- WHERE Clause
- ALL and DISTINCT Clauses
- Partition Based Queries
- HAVING Clause
- LIMIT Clause
- REGEX Column Specification
More about Select Syntax are:
- GROUP BY
- SORT BY, ORDER BY, CLUSTER BY, DISTRIBUTE BY
- JOIN
- UNION ALL
- TABLESAMPLE
- Subqueries
- Virtual Columns
- Operators and UDFs
- LATERAL VIEW
- Windowing, OVER, and Analytics
- Common Table Expressions
Nice blog..thanks for sharing the info about hive..
ReplyDeleteBig data training in chennai