Apache Hive is a data warehouse infrastructure built on top of Hadoop for querying and analyzing large datasets stored in Hadoop Distributed File System (HDFS). It provides a SQL-like query language called HiveQL for expressing data processing tasks, which are translated into MapReduce jobs and executed on the Hadoop cluster. Hive is commonly used for batch processing and data warehousing in big data environments.