Hadoop is an open-source, Java-based framework for storing data and running applications on clusters of commodity hardware. It is a collection of open-source software that includes HDFS (Hadoop Distributed File System) as its storage layer and is capable of handling large amounts of data. Hadoop's goal is to store, manage, and provide data for analytical purposes.
Key differences between Hadoop and Oracle:
- Hadoop uses sharding to spread data across the nodes of a cluster. Oracle uses horizontal partitioning, which splits a table's rows into separate partitions.
- Hadoop is open-source software, and Oracle is commercial software.
- Hadoop offers an API for user-defined Map/Reduce methods, whereas Oracle does not.
- Hive, the SQL layer that runs on top of Hadoop, is accessed through JDBC, ODBC, and Thrift, whereas Oracle is accessed through JDBC, ODBC, ODP.NET, and OCI.
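
To make the "user-defined Map/Reduce methods" point more concrete, here is a minimal sketch of the programming model in plain Python. It does not use the Hadoop API: the names `map_words`, `reduce_counts`, and `run_map_reduce` are hypothetical, and the driver runs in a single process, whereas Hadoop would distribute the same steps across the nodes of a cluster.

```python
from collections import defaultdict
from typing import Callable, Iterable, Iterator


def map_words(line: str) -> Iterator[tuple[str, int]]:
    """User-defined map step (hypothetical): emit (word, 1) for each word."""
    for word in line.lower().split():
        yield word, 1


def reduce_counts(word: str, counts: Iterable[int]) -> tuple[str, int]:
    """User-defined reduce step (hypothetical): sum the counts for one word."""
    return word, sum(counts)


def run_map_reduce(
    records: Iterable[str],
    mapper: Callable[[str], Iterable[tuple[str, int]]],
    reducer: Callable[[str, Iterable[int]], tuple[str, int]],
) -> dict[str, int]:
    """Toy single-process driver: map, group by key (shuffle), then reduce.

    This only imitates the model; it is not how Hadoop schedules work.
    """
    grouped: dict[str, list[int]] = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            grouped[key].append(value)  # shuffle: collect values per key
    return dict(reducer(key, values) for key, values in grouped.items())


lines = ["hadoop stores data", "hadoop processes data"]
word_counts = run_map_reduce(lines, map_words, reduce_counts)
print(word_counts)
# {'hadoop': 2, 'stores': 1, 'data': 2, 'processes': 1}
```

The user supplies only the map and reduce functions; grouping values by key between the two steps is the framework's job. In this sketch that job is done by the toy `run_map_reduce` driver.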